Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGAAACTGAAAAATTGGAGGAGAAACAACCAGAAGGACGACTCGAAGTTTTACAAATGCCAGAAACGAGAAAATTTGTGAGTATTCATTCTTCTTACTCCTTTACTTTCTGCTCCTCAATTTCGGAATGGGTATAGAATCGTTAACGTCTTCGTAATTTCCGTAGAAATTATGTTCATGAACGCTTGATATCGTCTTCAGATTCATTCTGATTCTGCCATGATTTTTTCTGGGATTTCTTATTCCCTTCGTCAACAGCTTTCTTTCTGTACAAATTTATTTGTTCACAAGGTGAATTCACGATTCATATAAGCAGTTTTCAAAGAGTGGATATCTGTAATGCGACTTTCCAGAGTTCTGGGAATAATTTCCTTTTTCATAATTTTTTTTTATGATTGATGACTTGAATTGGAAAATGTTTTGAGTACTTTTTCAAGAGTGCATATTGGAAATTCTGTTACATTGTACTTGGGTGAAGTGGTGTAACTTAGATGCATTTGATCAGTTGAATTTTAGCATTTGGTGTTAAGTTTGAAAGCTTTTTATCTGTTAATGGCTAGGTTATATCTCGGGTATTGTATTAGTTGCATTTTGTATGATTTATTGGTTGGTTTATGTGCAATTTTCATATTAAACCTTGCAGTTATATTAATTGCAAAATATTATTTGCCAAAATGAGCCTAGCTCAATTCGAATGCCCCACCCCAGATATTGTTTAACTCAAAAAATTGCAAACAGGTTGTAAGGTATCCAGTCGAAAATCACCTGTTAACAGAACCACACAACACAAACACAAGAAAGAACAAGGGAGATACTGTTGTATTGATAATTCCTCCCAGAAAAAAGGAATACCTCTGGGTTCAAACCTTCGTTAAGCAAGATACCAAAACAAACAAGATAGCAAAACACTCAGCTGTTAGAGAGAGCTGAGAACCTCTCCTGACCTATCCTCCTTCTACCAAATTCTGCCTAAAAAACAAAGTAAAGGACTCCCCCCCATAAATAAAAAAACAGCAGCCATACAAGGGACGTACAACCATACCCACGATCCTAAAAGTGGTCCCCAAGGAACCGTTATTCAACATTCTCCCTTTTCTCTCTTTTACCCTTCCTACTATAAGTTTTCGTGATAGGAGGCCTAACAATACCCCCCGGGGCCAGCAGCACCTTGTCCTCAAGGTGAAAATCTGGGAACAACTGTTGGAAGGCTTTAACTGATTCCCACGTCGCGTCACATTCAGGCTCTCCTTGCCACTTAACGAGCCATTCCTCCTCTCCCAGTTCTGAATTCCATCGAATCCCAAGCACCTCCTGTGGGTTTGCCTTCCACTCATAGCAGTCAGTCAGATCAGGGACTTGTGGATGCACTGTTAGGCCATCGCCAATCATTTTTTTGAGGAGAGAAACATGAAAAACATCATGTACCAAGGCGTCCTTGGGAAGCTCCAGCCGGTAAGCCACTGGACCGATTCTTTCCAGTATTTTAAAGGGGCCGAAATACTTGGGTGACAGCTTCTCACAACGTTTTCTTGCCAGAGACTTAAGACGGTACGGTCGGATTTTCAAAAATACAAACTCCCCTTCATTGAACTCGAGTTCCTCCTTTTCTTATCGGCCTGTTTCTTCATTCGTTCTTGTGCTAAACGCAAATGTTCCTTCAGGGCTACCAAGGCCAGATCTCGGTCTTTCAGCTGTTGTTCAAGAGAGTCGTTCGTTGTCCGAGAAGTGCCATAAGAGACAATCGGCGGCGGAGGGCGACCGTAAACCACATGGAAAGGGGTGGTGTTGATGGAGACGTGGTATGTAGTATTATACCAAAATTCAGCCCAAGGAATCCATTGTTCCCACTTTCTTGGATATTCCCCACGAAACAACGTAAGTATGTTTCTAAGCATCGTTAACTCTTCGTTCCGCTTTGGCCATCTGTTTGTGGATGGAAAGCGGTGCTCCTTTTCAATTGCGTTCCCTGCAACTTGAACATCTCATTCCAGAAATGACTTATAAAAATCTTATCTCGGTCCGAGACGATTGACTTTGGGAAACCATGCAACCGAACAATCTCTTGGATGAACAAGGCAATTTATATTCTTTCTTTGAGAAGGGATGCTTGACCTTCAGAAATGGTCAGATTTACTTAATCGATCCACCACCACTAGTATGGAGTCAAATCCCATAGATTTCGGTAGCCCTCAATGAAATCCATCGATATATCTTCCCATATCAAGTTCGAATTGGCAAGAAGTCAGAATAACCCCGCAGGGCCACTGCTTCCACCTTTATTACGCCGGCATGTCGAGCATCTTTCCACATACCGCTTAACCTCTGTCTTCATGTTCTTCCAAAACAGTTCAACAGTTAATCGTTTGTGGGTCCTCAAGAACCCCGAGTGGCCCCAAGAATCGAGTCATGGTACATATGCAATATGGAGGAATTAGAGTCGACTTATTGGATAGCACCAACCGCCCTTTATAGAGTAACCTTCCCTGCTGCAGGGTGAATTTGGGGTGTGACAACGGATCTCTCTCTAAATCTTCCCAGATCTTTCGTAACTCAGGATCTTTCTCCACTTCGTTGTTAACTGTGGTTATGTCCAGGACATGTGGGACTGAGATAGATCGGAGTTCTCCTTCATACTCCACTCTTGATAGTGCGTCGGCAGCCTTGTTCAACTGTCCCGGATTGTACTTTATCTCAAAGTCGTATCCCAACAATTTTGTGAGCCAACGTTGGTACTCCGGTTGTACTTCCCGTTGTTCGAGTAGGCTGCCTAAAGCTCTCTGATCCGTCAGCACAGTGAACCTCTGCCCCAATAAGTAATGCCTCCACTTCTGCACTGCCAGAACAATAGCCATTAACTCCCTCTCGTAAATTGGCTTTAACCGCGACCTTTGAGATAGTGACTGACTAAAGAAGGCTATCGGTCTTTGCTTCTGAGACAACACTGCCCCAACCCCTGTTCCTGATGCGTCCGTTTCGATCACAAATGGTTGGGAGAAATCGGGCAAGGCCAACACTGGAATGGTAACCATGGCGTGCCTCAGGGTTTCAAAAGCTGCGGTGGCTGCATTCGACCATTGGAAGGCGTCCTTTCTTAGTAGTTGGGTTAATGGCTCTGGCAAATGCCGCCATATCCCATCACAAACTCGCGTAATACCTCCACAATCCGAGGAAACCTCGAAGTTCCTTAATGGTTTTAGGCATTGGCCAATTTAACATGGTTTTAATTTTTTCTCCGTCGGCTTCAACCCCCTTCTCTGATATTATATGGCCTAGATAGGCTATTCGAGATTGCCCAAAAACACACTTATTTCTGTTTGCAAACAGGGCATGTTCCCTCAAAGCTTCCATGGTGATTTCCAGATGTTTGAGATGTGTTGAATAGTCTGAACTATAAATAAGTATGTCATCAAAAAATACAAGAATGAATCTTCGAAGGAAAGGACGAAAAACCTGGTTCATAAGCGACTGAAACGTTGCTGGGGCGTTTGTAAGTCCGAAGGGCATTACCATGAATTCATAGTGGCCTTCGTGAGTACGAAAGGCTGTTTTCGGTATGTCTTCTTCCTTCATTCGAATTTGGTGGTAACCTGATTTCAGGTCCAGCTTGGAAAAAACTTTAGAGCCATGGAGCTCATCCAGTAGTTCCTCGATCACCGGTATTGGAAATTTATCGGGGATAGTCACCCTGTTCAGCGCACGATAATCAATACAGAAATGCCAACCCCCATCTTTTTTCTTCACCAACAGGACAGGGCTAGAGAAAGGAGAGTTACTGGGTCGAATGATCCCTGAAGCCAACATTTCTTTTATCAAGCGCTCAATCTCCGATTTCTGAGCGTGGGCATATCTATAGGGTCTAACATTCACTGCCTGTTGGTCCCCCTGGAGTAAAATTCGATGGTCAACTGCTCGTCTCGGTGGTAAACCATGAGGGGCATCAAAAACAGAGGAGAACTTAGTCAGAAGATTTTGTATTTCAATTCCCATCGGTGTAACTTCCGCTAAGCTCCTTCATACTCACTCTCGCTTTGGTCTCCTAAATTCAACCCCTCGCTCCCACCTGTAGCCTTGGTGGTCGTCTTCTACCCAGGTCTTGGCTAACATTTTTAACGACACTTCTGCCCGAGTCAACGAGGGATCCCCATGGAGAACTACTTGTTGTCCCTTTAAATCGAATGTCATTCTTAGAGCTGCCCAGTCAACGTTCATTTCTCCCATCGTCCGAAGCCATTGCATTCCCAATATCACTTCTACATTTCCCATTTCTAATGGTAAGAAGTCTTCACACACTGTAAGCTCCGGTAAAGACACCACTAGCGATTTACAAACACCCTTACCCTTCGTGGCTATACCGTTGCCCATTACAATTCCATAGTTGGAGGTGTCTACCGTCGGTATCTGTAATTCTTCGACCAACTTCTGTGAAATAAAGTTGTGGGTAGCCCCACAGTCGATCAACACAATCACTTCCTTCTCACCTATCAACCCTTTTAACTTCATTGTTCCCGGTGTGGAGAAACCAATTACTGATTTTAAATCCAACTCAACCAGATCGCCCACCAGCTTGGTCGGAGTTTCTACTTCCTCTGGTCCCCAGTCTCTCTGGATGAGTTCGATCTCGTCAACTCCGTTCACCAACAGCACTCGTAGCTCTCTGACTTCTCGATTCTTACAGATATGGTCCTTCGAGTACTTTTCATCACACCGAAAACATAAGCCTCGAGCTCTCTTTGCTTGGAATTCGGCGTCCGATAACCGTTTCGGAGGAAACTCTCTCTTTCCCAACGTTCCTTTTTCGGGTAGGGTTACAGTGCGCGTACCTGAGGGATCGCCTGTTTTTGGGTTTGTCTTCATAATTGGGCCTGGGCTCCCCTTACTATTCCTCGCCGCTCGAGACCCATGCGAGGACTTGGGCTTAGCCCACTTTCTTCCTCGATTGCCAGACGAAGGGCCTCTTCTCTATCTTCGATTAATTGGGCCTGTCTAATCATGTCTTCAAGCCCGCTTGGCTGTCTACTCTTCACTGAAGCCCTGATGGTAGGTTTGAGCCCGTTAAGGAACGTATTCTCGAGGACGTCCTCCGGCAGCCCAGGTAGCGGCGATGAATAAATCTCGAACAGTTTCCGATATTCTGCCACCGTACCTTCTTGTTTCACGGCGAGGAAGCGGGCACAGAAACTTCCTTCTTGTGACGGTCGAAAACGCTCGAATATCCGCGCCTTCAAATCATTCCATCCTTGTATCTTCTTCCTATTCTCCGTCCATCGGAACCACTCCACCACGTCCGGCTCGAAACTAATAATCGATACCATAATTTTCTCCTTATCCGTCAACTGATAAATGTCGAAATAATGTTCAGCCCTGAAGACCCATGAGTCTGGCTTCTCTCCGGTAAATATCGGCATTTCCAGACGTTTCATTTTTCCTCTTTCGCCTCCAGGTTCTTTTCTTCCTAAGGAGGACTCTCCTTCCTCCGTTTCCTTCTGTTCCGACACTTCCCCTCGATTCTCATCCTCTGGGTACTTTTGGCCAGAAGCCTCCGTGAGCACGGATTTCCCTTTGGACAACGTGAGCTCACTGATCACATAGGTTAGTTTCTCGAGAGCCTTCTCCATGTTCTGGACCGAAACCTTCACCTCCCCGATGTCCTTCTCACAACCCGACACTCTTTCCTCTATCTCTTGCTTCGCCATTCTTCTTGTCTTACCCAGGATGACGTGCTCTGATACCAATTTGTAAGGTATCCAGTCGAAAATCACCTGTTAACAGAACCACACAACACAAACACAAGAAAGAACAAGGGAGATACTGTTGTATTGATAATTCCTCCCAGAAAAAAGGAATACCTCTGGGTTCAAACCTTCGTTAAGCAAGATACCAAAACAAACAAGATAGCAAAACACTCAGCTGTTAGAGAGAGCTGAGAACCTCTCCTGACCTATCCTCCTTCTACCAAATTCTGCCTAAAAAACAAAGTAAAGGACTCCCCCCCATAAATAAAAAAACAGCAGCCATACAAGGGACGTACAACCATACCCACGATCCTAAAAGTGGTCCCCAAGGAACCGTTATTCAACATTCTCCCTTTTCTCTCTTTTACCCTTCCTACTATAAGTTTTCGTGATAGGAGGCCTAACACAGTTATTTAAGTTTAAAGTTTAAATGGAGGAGATTAGAAGTTTTTGGTCAAAATCAACTAGATTGAACTTTTTGTTCTTTTTTTTTTCTTACTTATCCATTGCACCCTTATGATCTTGTACAGACACCGTTGTTTAGGAAATTTAAAATTGACACAACAAAGATACGCCAACTAGTATTGTTCGTAATATAACAAAGTATTTGTTTCAACAAACAATTGGTGAGATATCCAAGATTCCTCACATGGTGATTGAACCTTCATACCATGAGAATATGAAAGGTCTGGAATTTTATTATCGATTACCTGATCAGGTGAATTATTTGACTCATAGTTTCGATTTCTCTCTTTTGAACTTGGCAAGTTTAGATTCTTCAAGTAAAAAACGACTTAGGAGAATGGATTATCCATCAAAAATATTAGTTTTTGCTGGTTGCAGATATATAGACACCCTATACTTAAAAAGTTAAAAATGTTGTGTTATCAGACAATTCTGCATTTAATAGGATTCTGAGCTTTTGATGAACAAACTATCAGATTGATGCACATTGTTTTCATCAGTTGCTATATGAGTTGAGTTTATTCTGAAGCTTGACTAGTTTTCCTGTAATTTTTCAGACAAGATGGAAGTTGACGAGTTATATCTTGATCTCCTAGCACTGAGGGAATTATACATCCTTCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTGTAAGCCTCTGCGATATTAATCGTCGTTTTCATTCATGTTGTTTTTTTACTTTATTAAACAATTAATTCCCTCTTTTGGGGGTTCTGCTAATTTTTTTAAAACTCTTTTTCTTTTATTTCTAGACTTCTAGTGTTTTGAATTTTACATCATTTTAAATTTGTTTTCACTTTAAACAATATTTCATTCAAATTTCAATTCCAGAGTAATTTATAAATGCTATCATGAGAATTAGAGCTGTTCAATCATTATTCTCATAGTCAGATTGTCTGAGTAACTGTGTATATATGATGATGTGTTTTCTCCTTTATCTTATCAATCATAAAACTATGGTATCAAACTTGCAGATTAGTAGATTTTGGCTGCCCTGTGATTTTTATGTGAATTTAGCATAATAGAAATTGTGAGATAAGACTGTCCGAGTGAGCTTTCTACTTCAATATTTCACTATAGTTTGGTTTTATGGTCAAAGCAGCTGGATGAAAGGGCACAGATTTTATTGAAGCATTTGCTCGATGATGCATCTGCTGGAGTTCTCGAGTTCCAATCAAAGGTTTGTATTTCATTCTGCATGAAAACTTTTTTTAAACAAAAGAACAATTGGACACAGATACATATTCTAATTCAGAAACACTTCATGGATGTCTAGTAACATACATTCCCATTTGAGTTTGTACAGAACTTGGCAACAAACTCAGGCGTTTTTTACAACTTTCTACACAAAGATGATAAACACACAAACCCACTGGACGAGAAAGTTGCTGAATGGATGGAACGCAATCAAACTGCAAGAAAGATGGTAAATCCAGAGAAGATTGAACACAATCCCAAAAGAGACAGAGCTTCAGCTACAAATGTTGCCGCTAATGACTTATTAAATGGCATCAGTTCAGCAATCAGAAGAATTGAACTCCACATTTTATCCCTACAACGTTACACAAATCAAAGTAGGAACACAAGAAGCCATATCAATGAAACTAAATTTGCTTACTGCGGACAGTCTGTTCTTCAAGGGAATGAGACATTAAACCAGCAGAAAGATCAGTCAAGGACAGATCACTCAGCTTTAAGGACCAGATTTGCTGAGTCGATTAAAGGCCATAACTTGAGTAGTCAGTTAAGAAGTCATCTTGTCGGTGGACAGAAAATTGAGCCGATAGTGACCAGCCATTGTTCTGAGTTTGTTCATGGATTCAGAATACCTCTGAGGCAAGACAATGATGAGGCCATCAAACCTCCAACAGTTGAAACTTGCATATCTAAACAACACAAACTTGTAAATCCAATGACTCTGATAGATAAATCTGGATATTCAGTAGAGTCCAAGGCAACCGTCAGGTCCGGAATGAAGCTGAATCAAACTATACAAGAAAAGAGGAGCCATAATTCATATGGTCGTATGGTAATGAGGCCAACTTTGCTGGATCACCCCTCTCGAGAAGTAAGAAAGGAACAAACTCATAACAAGACCCATTTGGCCACTGAACAAGAATCAGAATTCACGAACTCAGAATCAGAATCAGCTTCTTCTTCAAGTTGGGCAACTCAACAAACCAGTGAAAGTGAAACCACTGATGATTCTTCTTCTCCACATCACCAAGACAGTCCACTGGCAACTGGTTCAGAGGCAAGTAGCCGGTACAGAAGCAGCAGTAGCAGCATTTCAACAAAAGCATTCAAATTCAACCATGGGAAAAAAGAGTCCAAAAGAGCAATAGGACGGGCCAAGAGACTCAAAAACAAACTAAGACTTATCTTCCACCACCACCATCATCATCACCACCACCATAACGGCCATAACTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATTGCACAGATAACAAAAAACTAGCAGGTAAAGAAGAAAGATATGGGAAGCTAAAGAAAACAGCAATCAGAAGTGTGCCTTGCAAGAACCAAGTTGGGAAGTTTCAGGCACTTGCTGAAGGGATTCGAAGCCATGTATGGAGATCAAAAGCCATGAAGAAGAAAGAGCTTAGGGTGCTTAATTGTGGGAAGAAGAAGGGTGTAAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGCCACCGTGGAGTGAAGTTGCCCAATAAAGGGCATGTGAAAATAGGATATGTAAATAGAAAAACACAGCTTAAAATGGTTTAGTTTAGTGGGACAATTTTGAAGTTTCTGCAAGCTTCCTTTCATGAGAGTTAGAACATTTTTATCCAACAGTTATCAAATGGAGTCTATGTAGTCATATGCATATACCAAAAATCAAATTTTCTCAATCGTTCACTTTCCTACACAAACTAATAACTTCTCTTTCCATCTCTCCTATAAAATCAGGAGCCAAGAAAACTTCTGTTTCTTCAAAAACTCGATTAAAATTA
mRNA sequence
GAGAAACTGAAAAATTGGAGGAGAAACAACCAGAAGGACGACTCGAAGTTTTACAAATGCCAGAAACGAGAAAATTTACAAGATGGAAGTTGACGAGTTATATCTTGATCTCCTAGCACTGAGGGAATTATACATCCTTCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGCACAGATTTTATTGAAGCATTTGCTCGATGATGCATCTGCTGGAGTTCTCGAGTTCCAATCAAAGAACTTGGCAACAAACTCAGGCGTTTTTTACAACTTTCTACACAAAGATGATAAACACACAAACCCACTGGACGAGAAAGTTGCTGAATGGATGGAACGCAATCAAACTGCAAGAAAGATGGTAAATCCAGAGAAGATTGAACACAATCCCAAAAGAGACAGAGCTTCAGCTACAAATGTTGCCGCTAATGACTTATTAAATGGCATCAGTTCAGCAATCAGAAGAATTGAACTCCACATTTTATCCCTACAACGTTACACAAATCAAAGTAGGAACACAAGAAGCCATATCAATGAAACTAAATTTGCTTACTGCGGACAGTCTGTTCTTCAAGGGAATGAGACATTAAACCAGCAGAAAGATCAGTCAAGGACAGATCACTCAGCTTTAAGGACCAGATTTGCTGAGTCGATTAAAGGCCATAACTTGAGTAGTCAGTTAAGAAGTCATCTTGTCGGTGGACAGAAAATTGAGCCGATAGTGACCAGCCATTGTTCTGAGTTTGTTCATGGATTCAGAATACCTCTGAGGCAAGACAATGATGAGGCCATCAAACCTCCAACAGTTGAAACTTGCATATCTAAACAACACAAACTTGTAAATCCAATGACTCTGATAGATAAATCTGGATATTCAGTAGAGTCCAAGGCAACCGTCAGGTCCGGAATGAAGCTGAATCAAACTATACAAGAAAAGAGGAGCCATAATTCATATGGTCGTATGGTAATGAGGCCAACTTTGCTGGATCACCCCTCTCGAGAAGTAAGAAAGGAACAAACTCATAACAAGACCCATTTGGCCACTGAACAAGAATCAGAATTCACGAACTCAGAATCAGAATCAGCTTCTTCTTCAAGTTGGGCAACTCAACAAACCAGTGAAAGTGAAACCACTGATGATTCTTCTTCTCCACATCACCAAGACAGTCCACTGGCAACTGGTTCAGAGGCAAGTAGCCGGTACAGAAGCAGCAGTAGCAGCATTTCAACAAAAGCATTCAAATTCAACCATGGGAAAAAAGAGTCCAAAAGAGCAATAGGACGGGCCAAGAGACTCAAAAACAAACTAAGACTTATCTTCCACCACCACCATCATCATCACCACCACCATAACGGCCATAACTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATTGCACAGATAACAAAAAACTAGCAGGTAAAGAAGAAAGATATGGGAAGCTAAAGAAAACAGCAATCAGAAGTGTGCCTTGCAAGAACCAAGTTGGGAAGTTTCAGGCACTTGCTGAAGGGATTCGAAGCCATGTATGGAGATCAAAAGCCATGAAGAAGAAAGAGCTTAGGGTGCTTAATTGTGGGAAGAAGAAGGGTGTAAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGCCACCGTGGAGTGAAGTTGCCCAATAAAGGGCATGTGAAAATAGGATATGTAAATAGAAAAACACAGCTTAAAATGGTTTAGTTTAGTGGGACAATTTTGAAGTTTCTGCAAGCTTCCTTTCATGAGAGTTAGAACATTTTTATCCAACAGTTATCAAATGGAGTCTATGTAGTCATATGCATATACCAAAAATCAAATTTTCTCAATCGTTCACTTTCCTACACAAACTAATAACTTCTCTTTCCATCTCTCCTATAAAATCAGGAGCCAAGAAAACTTCTGTTTCTTCAAAAACTCGATTAAAATTA
Coding sequence (CDS)
ATGGAAGTTGACGAGTTATATCTTGATCTCCTAGCACTGAGGGAATTATACATCCTTCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGCACAGATTTTATTGAAGCATTTGCTCGATGATGCATCTGCTGGAGTTCTCGAGTTCCAATCAAAGAACTTGGCAACAAACTCAGGCGTTTTTTACAACTTTCTACACAAAGATGATAAACACACAAACCCACTGGACGAGAAAGTTGCTGAATGGATGGAACGCAATCAAACTGCAAGAAAGATGGTAAATCCAGAGAAGATTGAACACAATCCCAAAAGAGACAGAGCTTCAGCTACAAATGTTGCCGCTAATGACTTATTAAATGGCATCAGTTCAGCAATCAGAAGAATTGAACTCCACATTTTATCCCTACAACGTTACACAAATCAAAGTAGGAACACAAGAAGCCATATCAATGAAACTAAATTTGCTTACTGCGGACAGTCTGTTCTTCAAGGGAATGAGACATTAAACCAGCAGAAAGATCAGTCAAGGACAGATCACTCAGCTTTAAGGACCAGATTTGCTGAGTCGATTAAAGGCCATAACTTGAGTAGTCAGTTAAGAAGTCATCTTGTCGGTGGACAGAAAATTGAGCCGATAGTGACCAGCCATTGTTCTGAGTTTGTTCATGGATTCAGAATACCTCTGAGGCAAGACAATGATGAGGCCATCAAACCTCCAACAGTTGAAACTTGCATATCTAAACAACACAAACTTGTAAATCCAATGACTCTGATAGATAAATCTGGATATTCAGTAGAGTCCAAGGCAACCGTCAGGTCCGGAATGAAGCTGAATCAAACTATACAAGAAAAGAGGAGCCATAATTCATATGGTCGTATGGTAATGAGGCCAACTTTGCTGGATCACCCCTCTCGAGAAGTAAGAAAGGAACAAACTCATAACAAGACCCATTTGGCCACTGAACAAGAATCAGAATTCACGAACTCAGAATCAGAATCAGCTTCTTCTTCAAGTTGGGCAACTCAACAAACCAGTGAAAGTGAAACCACTGATGATTCTTCTTCTCCACATCACCAAGACAGTCCACTGGCAACTGGTTCAGAGGCAAGTAGCCGGTACAGAAGCAGCAGTAGCAGCATTTCAACAAAAGCATTCAAATTCAACCATGGGAAAAAAGAGTCCAAAAGAGCAATAGGACGGGCCAAGAGACTCAAAAACAAACTAAGACTTATCTTCCACCACCACCATCATCATCACCACCACCATAACGGCCATAACTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATTGCACAGATAACAAAAAACTAGCAGGTAAAGAAGAAAGATATGGGAAGCTAAAGAAAACAGCAATCAGAAGTGTGCCTTGCAAGAACCAAGTTGGGAAGTTTCAGGCACTTGCTGAAGGGATTCGAAGCCATGTATGGAGATCAAAAGCCATGAAGAAGAAAGAGCTTAGGGTGCTTAATTGTGGGAAGAAGAAGGGTGTAAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGCCACCGTGGAGTGAAGTTGCCCAATAAAGGGCATGTGAAAATAGGATATGTAAATAGAAAAACACAGCTTAAAATGGTTTAG
Protein sequence
MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNLATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVAANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDEAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQTIQEKRSHNSYGRMVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLIFHHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYVNRKTQLKMV
Homology
BLAST of Tan0019749 vs. ExPASy Swiss-Prot
Match:
Q9FFP2 (Protein KOKOPELLI OS=Arabidopsis thaliana OX=3702 GN=KPL PE=2 SV=1)
HSP 1 Score: 89.4 bits (220), Expect = 1.4e-16
Identity = 81/254 (31.89%), Postives = 124/254 (48.82%), Query Frame = 0
Query: 300 VMRPTLLDH-------PSREVRKEQTHNKTHLATEQE----SEFTNSESESASSSSWATQ 359
+M+PTL+D S E +QT + T +E E S+ + E+ S+S S W TQ
Sbjct: 248 IMKPTLMDQETETFDDDSSETEADQTPSATGSESEDEEVSTSQEYSGETGSSSGSEWETQ 307
Query: 360 QTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAK 419
+++E+ +SS P D ++ S+S T K+ + +GR K
Sbjct: 308 AENDTESKSESSYPPQNDDSVS---------EVSTSPPHTDRDTSREPGKQRRNVMGRFK 367
Query: 420 RLKNKLRLIFHHHHHHHHHHNGHN----FMWKQLRKIFHCTDNKKLAGKEERYGKLKKTA 479
R+KNK+ IFHHHHHHHHHH+ H+ W +L+ FH +K KE + +
Sbjct: 368 RIKNKIGQIFHHHHHHHHHHHHHDKEKPSAWNKLQSKFHHKHQEK--SKERKRPMSESKG 427
Query: 480 IRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHR- 537
+ + ++Q G F AL EG+ H SK K + K KK WWK+ ++ +
Sbjct: 428 LTTHKQQHQGGHFHALVEGLVRHRKHSKKQKHQ--------LKSDAKKTEWWKLLKKRQG 482
BLAST of Tan0019749 vs. NCBI nr
Match:
XP_022996027.1 (uncharacterized protein LOC111491355 isoform X2 [Cucurbita maxima])
HSP 1 Score: 627.5 bits (1617), Expect = 1.1e-175
Identity = 366/554 (66.06%), Postives = 412/554 (74.37%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY LLK CLRDANSEL + RA+ILLKHLLDDA+ G+LEF SK
Sbjct: 1 MEADELYLDLLALRQLYFFLLKCCLRDANSELVVGARAKILLKHLLDDATTGLLEFHSKT 60
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
LA FYNFL KDDK T PLDEKVAEWME NQTAR+M NPEKIEH P+RDRASA+NVA
Sbjct: 61 LA-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARRMANPEKIEHKPRRDRASASNVA 120
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GI+SA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+ NQ
Sbjct: 121 ANDLSSGINSALRRIELHILSLQRY------TRSHISETKLAYYGQSVNQGNESFNQ--- 180
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FV+GFRIPL QD D
Sbjct: 181 ---------------------------------QKVKPMVANHCSKFVNGFRIPLTQDKD 240
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+LV P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 241 EAM----------KQHELVLPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 300
Query: 301 MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTD 360
+VM+PTL HPSREVRKEQT HN+ HLA +QESEFTN SESAS SS AT QTSESETTD
Sbjct: 301 IVMKPTLWHHPSREVRKEQTHHNRRHLAAQQESEFTN--SESASCSSPATLQTSESETTD 360
Query: 361 DSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLI 420
DSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGKKES A+GR K L+NKL LI
Sbjct: 361 DSSSPDNQSSPTATGSEASSQYGNSSSNITRKAFKFSHGKKESNGAVGRFKSLRNKLGLI 420
Query: 421 FHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQ 480
FHHH HHHHHHH+GHN MWKQ+R +FH TD K+L KEE+ GKL+KT IRSV NQ
Sbjct: 421 FHHHQHHQHHHHHHHHGHNSMWKQVRTVFHRTDKKELTSKEEKTGKLRKTTIRSVSRNNQ 480
Query: 481 VGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV 540
VGKFQAL EG+RSHVW+SKAMKKKE R LNCG KKLHWWKM RR RGVK PNKG V
Sbjct: 481 VGKFQALPEGLRSHVWKSKAMKKKEQRGLNCG-----KKLHWWKMIRRRRGVKFPNKGRV 490
Query: 541 KIGYVNRKTQLKMV 548
KIGYVNRK +K++
Sbjct: 541 KIGYVNRKPDVKLI 490
BLAST of Tan0019749 vs. NCBI nr
Match:
XP_022996025.1 (uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima] >XP_022996026.1 uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima])
HSP 1 Score: 627.5 bits (1617), Expect = 1.1e-175
Identity = 366/554 (66.06%), Postives = 412/554 (74.37%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY LLK CLRDANSEL + RA+ILLKHLLDDA+ G+LEF SK
Sbjct: 45 MEADELYLDLLALRQLYFFLLKCCLRDANSELVVGARAKILLKHLLDDATTGLLEFHSKT 104
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
LA FYNFL KDDK T PLDEKVAEWME NQTAR+M NPEKIEH P+RDRASA+NVA
Sbjct: 105 LA-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARRMANPEKIEHKPRRDRASASNVA 164
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GI+SA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+ NQ
Sbjct: 165 ANDLSSGINSALRRIELHILSLQRY------TRSHISETKLAYYGQSVNQGNESFNQ--- 224
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FV+GFRIPL QD D
Sbjct: 225 ---------------------------------QKVKPMVANHCSKFVNGFRIPLTQDKD 284
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+LV P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 285 EAM----------KQHELVLPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 344
Query: 301 MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTD 360
+VM+PTL HPSREVRKEQT HN+ HLA +QESEFTN SESAS SS AT QTSESETTD
Sbjct: 345 IVMKPTLWHHPSREVRKEQTHHNRRHLAAQQESEFTN--SESASCSSPATLQTSESETTD 404
Query: 361 DSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLI 420
DSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGKKES A+GR K L+NKL LI
Sbjct: 405 DSSSPDNQSSPTATGSEASSQYGNSSSNITRKAFKFSHGKKESNGAVGRFKSLRNKLGLI 464
Query: 421 FHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQ 480
FHHH HHHHHHH+GHN MWKQ+R +FH TD K+L KEE+ GKL+KT IRSV NQ
Sbjct: 465 FHHHQHHQHHHHHHHHGHNSMWKQVRTVFHRTDKKELTSKEEKTGKLRKTTIRSVSRNNQ 524
Query: 481 VGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV 540
VGKFQAL EG+RSHVW+SKAMKKKE R LNCG KKLHWWKM RR RGVK PNKG V
Sbjct: 525 VGKFQALPEGLRSHVWKSKAMKKKEQRGLNCG-----KKLHWWKMIRRRRGVKFPNKGRV 534
Query: 541 KIGYVNRKTQLKMV 548
KIGYVNRK +K++
Sbjct: 585 KIGYVNRKPDVKLI 534
BLAST of Tan0019749 vs. NCBI nr
Match:
XP_038877121.1 (protein KOKOPELLI-like isoform X1 [Benincasa hispida])
HSP 1 Score: 613.6 bits (1581), Expect = 1.6e-171
Identity = 367/560 (65.54%), Postives = 422/560 (75.36%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNL 60
M+VD+LYLDLLALRELYILLLKSCL DANSELLDERAQILLKHLLDDA+AGVLEF S +L
Sbjct: 31 MDVDKLYLDLLALRELYILLLKSCLGDANSELLDERAQILLKHLLDDATAGVLEFLSNDL 90
Query: 61 ATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVAA 120
ATNS +F NFLHKDDK PL +KV EWM+ NQT RKM NPE RDRASA+NVA
Sbjct: 91 ATNSNIFDNFLHKDDKQVKPLADKVPEWMKHNQTRRKMGNPE------IRDRASASNVAI 150
Query: 121 NDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQ 180
N+L + ISSA+RRIELHILSLQ T+Q R TR H QSVLQ NE+LNQQ
Sbjct: 151 NNLSHSISSALRRIELHILSLQHCTSQRRKTRCH---------WQSVLQWNESLNQQNVH 210
Query: 181 SRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQ-KIEPIVTSHCSEFVHGFRIPLRQDND 240
RT S LR+RF + IKG R H VG Q K++P +HCSE+VHGFRIPL Q ND
Sbjct: 211 PRTGPSTLRSRFTKPIKG-------RGHFVGEQKKVKPKTANHCSEYVHGFRIPLSQTND 270
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGY-SVESKATVRSGMKLNQTI--QEKRSHNSY 300
EA+KP T+ET I+KQHK+VNPMTLIDKSGY SV SKAT R MKLNQT Q KR+ NSY
Sbjct: 271 EAMKPLTIETHITKQHKVVNPMTLIDKSGYTSVGSKATFRPAMKLNQTSKQQAKRNQNSY 330
Query: 301 GRMVMRPTLLD-HPSREVRKEQTHNKTHL-ATEQESEFTNSE--SESASSSSWATQQTSE 360
G+MVM PTLLD HPS+E R E+ ++KTHL AT+QESEFT+SE S S+SSSSW TQ+TS
Sbjct: 331 GQMVMGPTLLDHHPSKETRNERINSKTHLAATQQESEFTSSEFQSASSSSSSWTTQETSV 390
Query: 361 SETT-----DDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRA 420
SET + SSP HQD PL+T S++SS TK F GK ESK+ +GR
Sbjct: 391 SETVANDGDSNPSSPSHQDDPLSTDSKSSS---------LTKTFYIKQGKTESKKVLGRF 450
Query: 421 KRLKNKLRLIF-HHHHHHHHHHNGHNFMWK-QLRKIFHCTDNKK-LAGKEERYGKLKKTA 480
KRLKNKL ++F HHHHHHHHHHN +NFMWK QLRKIFH DNK+ L KE+ K+KK A
Sbjct: 451 KRLKNKLGVVFHHHHHHHHHHHNSNNFMWKQQLRKIFHSRDNKRLLVSKEDGNEKVKKRA 510
Query: 481 IRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRG 540
IR+V KNQVGKFQALAEG+RSHVWRSKAMK+K ++ + CG KKGVKKLHWWKMFR RG
Sbjct: 511 IRNVCYKNQVGKFQALAEGLRSHVWRSKAMKRKGVKGMKCG-KKGVKKLHWWKMFRNRRG 558
Query: 541 VKLPNKGHVKIGYVNRKTQL 545
V+LPNKGH+KIGYVN+K +L
Sbjct: 571 VRLPNKGHMKIGYVNKKAKL 558
BLAST of Tan0019749 vs. NCBI nr
Match:
XP_038877123.1 (protein KOKOPELLI-like isoform X3 [Benincasa hispida] >XP_038877124.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida])
HSP 1 Score: 613.6 bits (1581), Expect = 1.6e-171
Identity = 367/560 (65.54%), Postives = 422/560 (75.36%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNL 60
M+VD+LYLDLLALRELYILLLKSCL DANSELLDERAQILLKHLLDDA+AGVLEF S +L
Sbjct: 1 MDVDKLYLDLLALRELYILLLKSCLGDANSELLDERAQILLKHLLDDATAGVLEFLSNDL 60
Query: 61 ATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVAA 120
ATNS +F NFLHKDDK PL +KV EWM+ NQT RKM NPE RDRASA+NVA
Sbjct: 61 ATNSNIFDNFLHKDDKQVKPLADKVPEWMKHNQTRRKMGNPE------IRDRASASNVAI 120
Query: 121 NDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQ 180
N+L + ISSA+RRIELHILSLQ T+Q R TR H QSVLQ NE+LNQQ
Sbjct: 121 NNLSHSISSALRRIELHILSLQHCTSQRRKTRCH---------WQSVLQWNESLNQQNVH 180
Query: 181 SRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQ-KIEPIVTSHCSEFVHGFRIPLRQDND 240
RT S LR+RF + IKG R H VG Q K++P +HCSE+VHGFRIPL Q ND
Sbjct: 181 PRTGPSTLRSRFTKPIKG-------RGHFVGEQKKVKPKTANHCSEYVHGFRIPLSQTND 240
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGY-SVESKATVRSGMKLNQTI--QEKRSHNSY 300
EA+KP T+ET I+KQHK+VNPMTLIDKSGY SV SKAT R MKLNQT Q KR+ NSY
Sbjct: 241 EAMKPLTIETHITKQHKVVNPMTLIDKSGYTSVGSKATFRPAMKLNQTSKQQAKRNQNSY 300
Query: 301 GRMVMRPTLLD-HPSREVRKEQTHNKTHL-ATEQESEFTNSE--SESASSSSWATQQTSE 360
G+MVM PTLLD HPS+E R E+ ++KTHL AT+QESEFT+SE S S+SSSSW TQ+TS
Sbjct: 301 GQMVMGPTLLDHHPSKETRNERINSKTHLAATQQESEFTSSEFQSASSSSSSWTTQETSV 360
Query: 361 SETT-----DDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRA 420
SET + SSP HQD PL+T S++SS TK F GK ESK+ +GR
Sbjct: 361 SETVANDGDSNPSSPSHQDDPLSTDSKSSS---------LTKTFYIKQGKTESKKVLGRF 420
Query: 421 KRLKNKLRLIF-HHHHHHHHHHNGHNFMWK-QLRKIFHCTDNKK-LAGKEERYGKLKKTA 480
KRLKNKL ++F HHHHHHHHHHN +NFMWK QLRKIFH DNK+ L KE+ K+KK A
Sbjct: 421 KRLKNKLGVVFHHHHHHHHHHHNSNNFMWKQQLRKIFHSRDNKRLLVSKEDGNEKVKKRA 480
Query: 481 IRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRG 540
IR+V KNQVGKFQALAEG+RSHVWRSKAMK+K ++ + CG KKGVKKLHWWKMFR RG
Sbjct: 481 IRNVCYKNQVGKFQALAEGLRSHVWRSKAMKRKGVKGMKCG-KKGVKKLHWWKMFRNRRG 528
Query: 541 VKLPNKGHVKIGYVNRKTQL 545
V+LPNKGH+KIGYVN+K +L
Sbjct: 541 VRLPNKGHMKIGYVNKKAKL 528
BLAST of Tan0019749 vs. NCBI nr
Match:
XP_022958322.1 (uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata] >XP_022958323.1 uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata])
HSP 1 Score: 605.5 bits (1560), Expect = 4.5e-169
Identity = 355/549 (64.66%), Postives = 401/549 (73.04%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY+ LLK CLRDANSEL + RA+IL KHLLDDA+ G+LEF SK
Sbjct: 1 MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKT 60
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
L FYNFL KDDK T PLDEKVAEWME NQTAR M NPEKIEH P RDRASA+NVA
Sbjct: 61 LP-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKIEHKPGRDRASASNVA 120
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GISSA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+LN
Sbjct: 121 ANDLSSGISSALRRIELHILSLQRY------TRSHISETKLAYYGQSVHQGNESLNH--- 180
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FVHGFRIPL QD +
Sbjct: 181 ---------------------------------QKVKPMVANHCSKFVHGFRIPLTQDKN 240
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+L P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 241 EAM----------KQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 300
Query: 301 MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDD 360
+VMRPTL HNKTHLA +QESE+TNSESESA SSS AT+QTSESETT D
Sbjct: 301 IVMRPTL------------WHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTAD 360
Query: 361 SSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLIF 420
SSSP Q SP ATGSEASS+ +SSS+IS +AFKF+HGKKESK+A+GR K L+NKL LIF
Sbjct: 361 SSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIF 420
Query: 421 HHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQ 480
HHHHHH+HNGHN MWKQ+R++FH T K+L KEE+ G L+KT IRSV NQVGKFQ
Sbjct: 421 --HHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQ 478
Query: 481 ALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYV 540
ALAEG+RSHVW+SKAMKKKE R LNCGK G KKLHWWKM RR RGVKLPNKG VKIGYV
Sbjct: 481 ALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYV 478
Query: 541 NRKTQLKMV 548
N+K +K++
Sbjct: 541 NKKPHVKII 478
BLAST of Tan0019749 vs. ExPASy TrEMBL
Match:
A0A6J1K5J4 (uncharacterized protein LOC111491355 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491355 PE=4 SV=1)
HSP 1 Score: 627.5 bits (1617), Expect = 5.3e-176
Identity = 366/554 (66.06%), Postives = 412/554 (74.37%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY LLK CLRDANSEL + RA+ILLKHLLDDA+ G+LEF SK
Sbjct: 45 MEADELYLDLLALRQLYFFLLKCCLRDANSELVVGARAKILLKHLLDDATTGLLEFHSKT 104
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
LA FYNFL KDDK T PLDEKVAEWME NQTAR+M NPEKIEH P+RDRASA+NVA
Sbjct: 105 LA-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARRMANPEKIEHKPRRDRASASNVA 164
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GI+SA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+ NQ
Sbjct: 165 ANDLSSGINSALRRIELHILSLQRY------TRSHISETKLAYYGQSVNQGNESFNQ--- 224
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FV+GFRIPL QD D
Sbjct: 225 ---------------------------------QKVKPMVANHCSKFVNGFRIPLTQDKD 284
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+LV P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 285 EAM----------KQHELVLPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 344
Query: 301 MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTD 360
+VM+PTL HPSREVRKEQT HN+ HLA +QESEFTN SESAS SS AT QTSESETTD
Sbjct: 345 IVMKPTLWHHPSREVRKEQTHHNRRHLAAQQESEFTN--SESASCSSPATLQTSESETTD 404
Query: 361 DSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLI 420
DSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGKKES A+GR K L+NKL LI
Sbjct: 405 DSSSPDNQSSPTATGSEASSQYGNSSSNITRKAFKFSHGKKESNGAVGRFKSLRNKLGLI 464
Query: 421 FHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQ 480
FHHH HHHHHHH+GHN MWKQ+R +FH TD K+L KEE+ GKL+KT IRSV NQ
Sbjct: 465 FHHHQHHQHHHHHHHHGHNSMWKQVRTVFHRTDKKELTSKEEKTGKLRKTTIRSVSRNNQ 524
Query: 481 VGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV 540
VGKFQAL EG+RSHVW+SKAMKKKE R LNCG KKLHWWKM RR RGVK PNKG V
Sbjct: 525 VGKFQALPEGLRSHVWKSKAMKKKEQRGLNCG-----KKLHWWKMIRRRRGVKFPNKGRV 534
Query: 541 KIGYVNRKTQLKMV 548
KIGYVNRK +K++
Sbjct: 585 KIGYVNRKPDVKLI 534
BLAST of Tan0019749 vs. ExPASy TrEMBL
Match:
A0A6J1K0S1 (uncharacterized protein LOC111491355 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111491355 PE=4 SV=1)
HSP 1 Score: 627.5 bits (1617), Expect = 5.3e-176
Identity = 366/554 (66.06%), Postives = 412/554 (74.37%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY LLK CLRDANSEL + RA+ILLKHLLDDA+ G+LEF SK
Sbjct: 1 MEADELYLDLLALRQLYFFLLKCCLRDANSELVVGARAKILLKHLLDDATTGLLEFHSKT 60
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
LA FYNFL KDDK T PLDEKVAEWME NQTAR+M NPEKIEH P+RDRASA+NVA
Sbjct: 61 LA-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARRMANPEKIEHKPRRDRASASNVA 120
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GI+SA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+ NQ
Sbjct: 121 ANDLSSGINSALRRIELHILSLQRY------TRSHISETKLAYYGQSVNQGNESFNQ--- 180
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FV+GFRIPL QD D
Sbjct: 181 ---------------------------------QKVKPMVANHCSKFVNGFRIPLTQDKD 240
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+LV P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 241 EAM----------KQHELVLPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 300
Query: 301 MVMRPTLLDHPSREVRKEQT-HNKTHLATEQESEFTNSESESASSSSWATQQTSESETTD 360
+VM+PTL HPSREVRKEQT HN+ HLA +QESEFTN SESAS SS AT QTSESETTD
Sbjct: 301 IVMKPTLWHHPSREVRKEQTHHNRRHLAAQQESEFTN--SESASCSSPATLQTSESETTD 360
Query: 361 DSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLI 420
DSSSP +Q SP ATGSEASS+Y +SSS+I+ KAFKF+HGKKES A+GR K L+NKL LI
Sbjct: 361 DSSSPDNQSSPTATGSEASSQYGNSSSNITRKAFKFSHGKKESNGAVGRFKSLRNKLGLI 420
Query: 421 FHHH----HHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQ 480
FHHH HHHHHHH+GHN MWKQ+R +FH TD K+L KEE+ GKL+KT IRSV NQ
Sbjct: 421 FHHHQHHQHHHHHHHHGHNSMWKQVRTVFHRTDKKELTSKEEKTGKLRKTTIRSVSRNNQ 480
Query: 481 VGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV 540
VGKFQAL EG+RSHVW+SKAMKKKE R LNCG KKLHWWKM RR RGVK PNKG V
Sbjct: 481 VGKFQALPEGLRSHVWKSKAMKKKEQRGLNCG-----KKLHWWKMIRRRRGVKFPNKGRV 490
Query: 541 KIGYVNRKTQLKMV 548
KIGYVNRK +K++
Sbjct: 541 KIGYVNRKPDVKLI 490
BLAST of Tan0019749 vs. ExPASy TrEMBL
Match:
A0A6J1H2T7 (uncharacterized protein LOC111459571 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459571 PE=4 SV=1)
HSP 1 Score: 605.5 bits (1560), Expect = 2.2e-169
Identity = 355/549 (64.66%), Postives = 401/549 (73.04%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY+ LLK CLRDANSEL + RA+IL KHLLDDA+ G+LEF SK
Sbjct: 1 MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKT 60
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
L FYNFL KDDK T PLDEKVAEWME NQTAR M NPEKIEH P RDRASA+NVA
Sbjct: 61 LP-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKIEHKPGRDRASASNVA 120
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GISSA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+LN
Sbjct: 121 ANDLSSGISSALRRIELHILSLQRY------TRSHISETKLAYYGQSVHQGNESLNH--- 180
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FVHGFRIPL QD +
Sbjct: 181 ---------------------------------QKVKPMVANHCSKFVHGFRIPLTQDKN 240
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+L P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 241 EAM----------KQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 300
Query: 301 MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDD 360
+VMRPTL HNKTHLA +QESE+TNSESESA SSS AT+QTSESETT D
Sbjct: 301 IVMRPTL------------WHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTAD 360
Query: 361 SSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLIF 420
SSSP Q SP ATGSEASS+ +SSS+IS +AFKF+HGKKESK+A+GR K L+NKL LIF
Sbjct: 361 SSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIF 420
Query: 421 HHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQ 480
HHHHHH+HNGHN MWKQ+R++FH T K+L KEE+ G L+KT IRSV NQVGKFQ
Sbjct: 421 --HHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQ 478
Query: 481 ALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYV 540
ALAEG+RSHVW+SKAMKKKE R LNCGK G KKLHWWKM RR RGVKLPNKG VKIGYV
Sbjct: 481 ALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYV 478
Query: 541 NRKTQLKMV 548
N+K +K++
Sbjct: 541 NKKPHVKII 478
BLAST of Tan0019749 vs. ExPASy TrEMBL
Match:
A0A6J1H1S0 (uncharacterized protein LOC111459571 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459571 PE=4 SV=1)
HSP 1 Score: 605.5 bits (1560), Expect = 2.2e-169
Identity = 355/549 (64.66%), Postives = 401/549 (73.04%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDASAGVLEFQSKN 60
ME DELYLDLLALR+LY+ LLK CLRDANSEL + RA+IL KHLLDDA+ G+LEF SK
Sbjct: 26 MEADELYLDLLALRQLYVFLLKCCLRDANSELVVGARAKILFKHLLDDATTGLLEFHSKT 85
Query: 61 LATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVA 120
L FYNFL KDDK T PLDEKVAEWME NQTAR M NPEKIEH P RDRASA+NVA
Sbjct: 86 LP-----FYNFLRKDDKQTKPLDEKVAEWMEHNQTARTMANPEKIEHKPGRDRASASNVA 145
Query: 121 ANDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKD 180
ANDL +GISSA+RRIELHILSLQRY TRSHI+ETK AY GQSV QGNE+LN
Sbjct: 146 ANDLSSGISSALRRIELHILSLQRY------TRSHISETKLAYYGQSVHQGNESLNH--- 205
Query: 181 QSRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDND 240
QK++P+V +HCS+FVHGFRIPL QD +
Sbjct: 206 ---------------------------------QKVKPMVANHCSKFVHGFRIPLTQDKN 265
Query: 241 EAIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQT-IQEKRSHNSYGR 300
EA+ KQH+L P TL+DKSG SKAT R MKLN+T IQEKRS NS GR
Sbjct: 266 EAM----------KQHELALPPTLMDKSGCPEGSKATARRAMKLNRTWIQEKRSKNSRGR 325
Query: 301 MVMRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDD 360
+VMRPTL HNKTHLA +QESE+TNSESESA SSS AT+QTSESETT D
Sbjct: 326 IVMRPTL------------WHNKTHLAAQQESEYTNSESESAPSSSPATRQTSESETTAD 385
Query: 361 SSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLIF 420
SSSP Q SP ATGSEASS+ +SSS+IS +AFKF+HGKKESK+A+GR K L+NKL LIF
Sbjct: 386 SSSPGDQSSPPATGSEASSQCGNSSSNISREAFKFSHGKKESKKAVGRFKSLRNKLGLIF 445
Query: 421 HHHHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQ 480
HHHHHH+HNGHN MWKQ+R++FH T K+L KEE+ G L+KT IRSV NQVGKFQ
Sbjct: 446 --HHHHHHYHNGHNSMWKQVRRMFHRTGKKELTSKEEKNGMLRKTTIRSVSRNNQVGKFQ 503
Query: 481 ALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHVKIGYV 540
ALAEG+RSHVW+SKAMKKKE R LNCGK G KKLHWWKM RR RGVKLPNKG VKIGYV
Sbjct: 506 ALAEGLRSHVWKSKAMKKKEQRGLNCGKTNGGKKLHWWKMIRRRRGVKLPNKGRVKIGYV 503
Query: 541 NRKTQLKMV 548
N+K +K++
Sbjct: 566 NKKPHVKII 503
BLAST of Tan0019749 vs. ExPASy TrEMBL
Match:
A0A6J1ETH9 (protein KOKOPELLI-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435826 PE=4 SV=1)
HSP 1 Score: 577.8 bits (1488), Expect = 4.8e-161
Identity = 342/545 (62.75%), Postives = 396/545 (72.66%), Query Frame = 0
Query: 1 MEVDELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDASAGVLEFQSKNL 60
M+VDE YLDLLALRELYILLLKSCLRDA SELLDERAQILLK+LLDDA+A VLEF KN+
Sbjct: 1 MDVDESYLDLLALRELYILLLKSCLRDAPSELLDERAQILLKNLLDDATAEVLEFLPKNM 60
Query: 61 ATNSGVFYNFLHKDDKHTNPLDEKVAEWMERNQTARKMVNPEKIEHNPKRDRASATNVAA 120
AT+SG+FY FLHKDDK + PLDEKV EWM + PKR R SA+N
Sbjct: 61 ATDSGIFYKFLHKDDKQSKPLDEKVVEWM---------------KPIPKRARGSASNATT 120
Query: 121 NDLLNGISSAIRRIELHILSLQRYTNQSRNTRSHINETKFAYCGQSVLQGNETLNQQKDQ 180
+ +L GISSAIRRIE HILSLQRYT+QS+ RSHI +YCG+SVL+GNET N+QK Q
Sbjct: 121 DLILQGISSAIRRIEHHILSLQRYTSQSK--RSHI-----SYCGRSVLKGNETSNRQKVQ 180
Query: 181 SRTDHSALRTRFAESIKGHNLSSQLRSHLVGGQKIEPIVTSHCSEFVHGFRIPLRQDNDE 240
SRTDHS + R Q++ LVGGQ + +VT HCSEFVHGFR+PL Q + E
Sbjct: 181 SRTDHSTISAR------------QIKGLLVGGQNAKAVVTPHCSEFVHGFRLPLSQGSKE 240
Query: 241 AIKPPTVETCISKQHKLVNPMTLIDKSGYSVESKATVRSGMKLNQTIQEKRSHNSYGRMV 300
KP VET +SKQHKLVNPMTLIDK G SV SKAT+R K +Q+ + K+S NSYG MV
Sbjct: 241 GRKPLAVETHLSKQHKLVNPMTLIDKCGGSVGSKATIRPRKKPSQS-RVKKSQNSYGLMV 300
Query: 301 MRPTLLDHPSREVRKEQTHNKTHLATEQESEFTNSESESASSSSWATQQTSESETTDDSS 360
M+PTLLDHPSREVRKE+T KTHLAT+ ESEFT +SA SSSW TQQTSES T DD S
Sbjct: 301 MKPTLLDHPSREVRKEETQKKTHLATQHESEFT----DSACSSSWTTQQTSESGTLDDFS 360
Query: 361 SPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAKRLKNKLRLIFHH 420
SP HQD A SE SS +++ GKKESKRAIGR KRLKNKL +IF H
Sbjct: 361 SPSHQDERPANSSETSS-------------IRYSQGKKESKRAIGRFKRLKNKLGIIF-H 420
Query: 421 HHHHHHHHNGHNFMWKQLRKIFHCTDNKKLAGKEERYGKLKKTAIRSVPCKNQVGKFQAL 480
HHHHHHHHN H+FMW ++RKIFH T+NKKL E+RY K K TAIRS NQVGKFQA+
Sbjct: 421 HHHHHHHHNSHSFMWNRVRKIFHPTNNKKLTSMEDRYEKGKNTAIRSECRTNQVGKFQAI 480
Query: 481 AEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHRGVKLPNKGHV-KIGYVN 540
A+ +RSHV RSKA+ KK+ + CG KKGVKKLHWWK+FR GV+L NKG + +I YVN
Sbjct: 481 AKELRSHVRRSKALTKKDPWEMKCG-KKGVKKLHWWKLFRDRHGVRLHNKGRIRRIRYVN 491
Query: 541 RKTQL 545
+K QL
Sbjct: 541 KKPQL 491
BLAST of Tan0019749 vs. TAIR 10
Match:
AT5G63720.1 (kokopelli )
HSP 1 Score: 89.4 bits (220), Expect = 1.0e-17
Identity = 81/254 (31.89%), Postives = 124/254 (48.82%), Query Frame = 0
Query: 300 VMRPTLLDH-------PSREVRKEQTHNKTHLATEQE----SEFTNSESESASSSSWATQ 359
+M+PTL+D S E +QT + T +E E S+ + E+ S+S S W TQ
Sbjct: 248 IMKPTLMDQETETFDDDSSETEADQTPSATGSESEDEEVSTSQEYSGETGSSSGSEWETQ 307
Query: 360 QTSESETTDDSSSPHHQDSPLATGSEASSRYRSSSSSISTKAFKFNHGKKESKRAIGRAK 419
+++E+ +SS P D ++ S+S T K+ + +GR K
Sbjct: 308 AENDTESKSESSYPPQNDDSVS---------EVSTSPPHTDRDTSREPGKQRRNVMGRFK 367
Query: 420 RLKNKLRLIFHHHHHHHHHHNGHN----FMWKQLRKIFHCTDNKKLAGKEERYGKLKKTA 479
R+KNK+ IFHHHHHHHHHH+ H+ W +L+ FH +K KE + +
Sbjct: 368 RIKNKIGQIFHHHHHHHHHHHHHDKEKPSAWNKLQSKFHHKHQEK--SKERKRPMSESKG 427
Query: 480 IRSVPCKNQVGKFQALAEGIRSHVWRSKAMKKKELRVLNCGKKKGVKKLHWWKMFRRHR- 537
+ + ++Q G F AL EG+ H SK K + K KK WWK+ ++ +
Sbjct: 428 LTTHKQQHQGGHFHALVEGLVRHRKHSKKQKHQ--------LKSDAKKTEWWKLLKKRQG 482
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FFP2 | 1.4e-16 | 31.89 | Protein KOKOPELLI OS=Arabidopsis thaliana OX=3702 GN=KPL PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
XP_022996027.1 | 1.1e-175 | 66.06 | uncharacterized protein LOC111491355 isoform X2 [Cucurbita maxima] | [more] |
XP_022996025.1 | 1.1e-175 | 66.06 | uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima] >XP_022996026... | [more] |
XP_038877121.1 | 1.6e-171 | 65.54 | protein KOKOPELLI-like isoform X1 [Benincasa hispida] | [more] |
XP_038877123.1 | 1.6e-171 | 65.54 | protein KOKOPELLI-like isoform X3 [Benincasa hispida] >XP_038877124.1 protein KO... | [more] |
XP_022958322.1 | 4.5e-169 | 64.66 | uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata] >XP_0229583... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1K5J4 | 5.3e-176 | 66.06 | uncharacterized protein LOC111491355 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1K0S1 | 5.3e-176 | 66.06 | uncharacterized protein LOC111491355 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1H2T7 | 2.2e-169 | 64.66 | uncharacterized protein LOC111459571 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1H1S0 | 2.2e-169 | 64.66 | uncharacterized protein LOC111459571 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1ETH9 | 4.8e-161 | 62.75 | protein KOKOPELLI-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435826 ... | [more] |