Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTGTGAAGTTCTTGCCTTTTTGCATGGAGACCAATTACCTCATTGTTGGTTTGGCAACGAGCGATAAACACTCGTTGGGTCCCTCTTGTCACTCTAGTTGGTTGGGTGCGTTTGTTGCGGGGCTCCTCCAGATGAAGCTGTATCACCAGACCTGGGTCATCCCCGCTAGAAGCATTCACTCTGCGTCTTTCTTCAGTGCTTCGCCCATTGGCAGAATTGTCGGAGCATTCTTTTCTCATTTGTAATCATAGGTGTTGAATGTGCAGAGCTTACTGACTTTAAACTCTTTCCAAAAGATGGTGCCAATTGTTGATTCCAATTTTTGGGTAGGTGCCACTCTCTTAGCTCGAGTTTTACATCGCATAGAAATCTAGCACTGGTATGTTGCTCAACTTATTGTACTATGATGCCTAAATTAGTACCCAAGGTAGGAGCAAACGACCCAGTGTGAAGCATCTAACATAATCACAAGGATTCGAGTGAAAATTTGTGCAGGAGTGTGTGTAAAGACTTCTAGTACGAATTTGTGCAAGAATTTTTGCCCACCTTGATTCAGGTTGAGGACCATCGACACTCTAATGATCCTCTCTTGTGCAGTAGTGAGGCACGACGACGTCCAGGAAGATATCCTGGCTCGTGCCAGGATTTGCTGCAGGGTGTTAGTGCCTCTCTCGTTTGCTGTCGTACATACGGATCATTTAGGTGTGTCCCACATTTTGATCTGACTAATAACTTTATGTTAATAAAATTTAACCGAGGATATAATGTATATTTAACCGAAATATTATTGTTATGATTTGACTCGGACCAAGTTGCTTGTAGGTGTTTGTTTAGCAAGATAACGTTGTTGTTGAAGTACAATGTATGACCAGGTGATTTGTAAAACATGGCCTATCTAGTTAAATGAATGTGCTTGAATAATTGAGCAATGAATGTACTTATTAAAGTAGAATGCATGGCCATGTGATTTGTAACATTTCTTCGATATTTTGTTTCTATTTTTTAAAAGATATAAACGTTTCGTAATTGTTTTTATTTCTTATTTTTTATTTTTAAGAAATACGGTAAAAAAATGTGCACGGATTTTTTTAGAAACAATAAAAAAATACTTTCTTTTTCTCATTTTCAATCTTTTAAATACACAAATAAAAATGTTATTTTATTATTTTTTACTTAAAAAAGCAAAATAAATATCAACAAGATTATATGTAACATAAAACCTTTAATGTTTTATTTAAAAATTAATTTTTAGGTAAAACTTTTTTTTTATTTTAATATTCCAGATAACAAGAAATAAGAAAGGGTTATATATGTTTGTTTTCTAATTAAAAAAAATAACTAAAAAATAAATAAATAGTTGTCAAATATAATTTTGGTTTTTTTTTTCAGGTGAAAAAAGAAAAAACAAGAAATCAGATTATCAAATAAGTCCATATTTTATAGTTTCTTATAGTTTCTTGTTCCTTCACATTTCCAATTTTTCAAAACATAAATAAAAGACATTGTTTCAACATATTTAACTTATTAAATGCATAGTAAGTATTAAAAATTAAATTTGTTTGGGGTAAAAATACCTAGGACACGTGGAGGACACTTATTGGTGTTTCAAGAATTGAATAAAAAGACTTGCCCAGGTCCACCTCGGCCCCATCGAGGAGACTGGGTTCGCCCATTACAAAAGGCTAAGTATAGACGCCGAGGTTCGACCTGACTGAGGCTCGATCTGCTCGAAACCCGACAGGTTCGACATGAGAAAAGACATATAACCGCCGGTCGTGAGTACTGTCGGATCCACCACCTATAAATAGAGGGACTCATTTCATGCTCAGGTATTGAATCTCACCTCGAACTAAACACGGTGTCCGATCTATACTAACTTGAGCATCGGAGAGTTTACCCTCTTGTGCAGGTTCACTCTTGGGTTCAAGTCGAGCCAAGGATCGGGTTCGAGCTAGGTTCATAGAACAGTGTTGTGCAACTCTTTGCATAAACATTTTGGCGCCGTCTGTGGGGATCGACATCTTAAGTCGTCCCGATCTAGGAAGCCTACGCAAAGATGGTGCAGCCAGCAAACTCTGCTAATACGACGGAACGGAGGGGTTTGAACGCTGATAACGGCACTCAGTGAGACCTTGACGCTAGAATAGTCGAGGACCAGGTCCGAGCAGGACAAGAGGGAGATCTGCCACGAAGATCTACCCGCCATGCGAACCAGGAGTTACCCCCTGCTCACCCGAAACCCTCAAAAGCCAACCGAGGACGAGGTGGGACCTTAAGAAAGCCTCCCGAAGGGCTGACCCGACAGCAGACCCTGAGGCTCTGGCTACCCTCCAGCGCGAGTTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAACACGGGCTAACCAAACAGCGTCTCCCTCCAGAGTCCCAGGCGTACCCAGAGAGAAGGGAGACTAAGTTCCATCTCTCCACCCTGGTGACCGCGAGCCTATTCCCAACAATGAGGAAGTGGATTATAGCTTGCGGGACAATGACCTTCGAAAGCACCTTGCTGATAAAAAGAAGAGAGCGTCGCGAGGACCAGAAGACTCTCCGTCTTACTCCCGAGAGTTCTCCAACTCCGACCTCAAGGCTCAATCAAAGTATAAGCCTCTGACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAACAGGTCGAGGCGCTTAAGGCTAGGTGCGAGAAGAAGGAGAGCTCGCTTGACGATGGCGACCTAGGAGAATCGCCATTCACCTCGGACATTTTGGAGGCTCCAATCCCTCCAAAGTTCAAGACTCCCACCATGAAGTCTTATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAGGCCTCATGGACTTTCAAACGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAGATCGCGCTCACCGACAGCGCGCCTGTGGTATCGAAGACTGCCGGCCAGGTCAATCTCGACCTATTCTCAGTTGAGGAAAGAGTTTATTAGTCAATTCTCTTCTCGGCATTATGATAGAAAGACAGCAACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGTTGAGAGAATATGTCACAAGGTTCCAGGAGGAGCAGCTAAAAGTCGCGCACTGCTCCGATGATTTGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACTCTCACCGTGAAGCTTGGAGAGGAGGCTCCAGCTACCTTTGCCGAAGTGCTACAGAAGGCGAAGAAAGTCATTGATGGACAAGAGCTCCTTCGAACCAAGACTGGCCGACCAGAAAAGAAAATCGACCAGCGGAGAGTTGGCAAAGATAAGGGAAAGACTGATGCCAAGTCCAAAGATAAAGGACCATCCTCCTCCAGTAGCCGAGCTGAGTATCGTAGGTCGGAAAACGGCCCCAGTCGAAGCCGACCTTACGAACGTTATACTCCGATCACCATCCCAATTTCTGAAATACTTACAAACATTGAGGAAACTGGGATGGAAAAGCTCCTTAAGTGACCTGAGAAGCTCCGGGGAGACCCAGAAAAACGTAACAAAGATAAATATTGCCGTTTTCATCACGATCACGGCCATAATACGTCAAATTGCTGGGAACTGAAGTGCCAGATTGAAGATCTCATTCAAGATGGTTACTTCAAAAAATTTGTGGGCAAACCGAGGTCCAACTTGGTGGAAAAGAAAGAAGAGAGGAAGCGTTCAAGGACGCCGCCTCGCCGAGATGACCGACCTGCGGTCATCAACACTATTTTTGGAGGCCCAAGTGGGGGCCAATCTGGAAATAAAAGAAAAGAACTAGCTCGCGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGACCGACCTGCTCCATTACTTTCGACGATACCGACCTGGAGAGGGTCCACTTGCCCCACAATGACGCGCTTGTGATTGCTCCTCTCGTTGATTACGTCGTAGTCAGAAGAGTGCTGGTAGATGGAGGTGCATCCGCCAACATCTTGTCCCTTTCAACATATCTTGCCTTGGGATGGACCAGGTCGCAGCTGAAGAAGAGTCCGACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTGTATCAACTTGCCGATTTCGATTGGGCAAGATGATACACAGGTAACTCAGATGGCCGAGTTCGTCGTGATCGACGGAAGATCGGCCTACAACGCTATCTTTGGGAGACCCATCATCCATTCGTTCCAGGCCGTTCCTTCAACGCTTCATCAAGTCTTGAAATACTCAACTCCCAATGGAGTGGGCACGGTCCGAGGGGAGCAGAAGACTTCAAGGGAGTGTTATGCCTCCGCGCTTAAAGGGTCGTCAGTATGCGCCCTAAAAGAACAAACAATTCGGGACAAGCTGTGAGAGTCCGAGGCCAACCTACCTAGAGAAGCCAAAAGGCAGTTTTCTGCACCAACAGAGGAGCTCGAGCTTGTTCCCGTGCTTAGCCCTGAAAAACAAGTAAGCATAGGAACCAAGCTGGGGCCACTGATAGGGAAGAGCTGATCAACTTCCTCAGGTCCAACTCGGACGTCTTCGCATGATCTCACGAGGACATGCCAAGCATCGACCCAAAGATCATGGTGCATCGCCTCAACATAGACCCATCATTCCGACCTGTAAAGCAAAAAAGAAGACCTGTAAATAAGGAGAGGAGTAATGTAATTGTTGAGGAAGTTAACAAGCTATTAAAAGCTGAATATATTAGAGAAATTCTGTATCCCGAGTGGCTCTCTAATGTTGTATTAGTTAAAAAATCCAACGGGAAGTGGAGAATGTGTGTGGACTTCACAAATTTAAATAAGGCATGCCCAAAAGACTGCTTCCCCCTGCCGAGGATCGACCAGCTCGTGGACGCAACAGCTGGGCACGAGCTGCTCACCTTCATGGATGCCTACTCCGGATACAACCAAATCAAAATGCATGTGTCAGACCAAGATCATACCGCGTTCATAATAGACTAAGGTCTGTACTGCTACAAAGTGATGCCCTTCGGCCTAAAGAATGCAGGCGCGATCTACCAAAGGATGGTGAACAAAATGTTCACCAAGCAGATCGGTCGGAATATGGAAGTATATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCAAAGTCGCATCTTTCCGATCTGACCGAAGCCTTTGGAGTTCTGCGGAAGTATCAAATGAAGCTCAATCCAGCCAAATGTGCCTTTGGAGTCTCCTCGGGAAAGTTCCTTGGTTTTATGGTAAACAACCGTGGGATCGAAGCCAACCCAGAGAAGATAAAGACCGTGCTCGAGATGGAAGCCCCAAAACTCTGAAACAGCTTCAATGCCTCAATGGAAGGATTGCAGCTTTGAACAGGTTTGTTTCAAGGTCGACTGACAAATGCCTTCCTTTCTTCAAAGTCCTAAGAAAGAAGGGACCATTTGAACGGATGGCAGAGTGCGAGCAGGCATTGGAACAATTGAAAAACTATCTCTTCGGCACCTCTGCTCGCAAAACCGTTGCCAGGGGAGAAGCTCCACTTATACCTGGCAGTGTCCGACAGCACTGTCAGCTCGGCCCTAATCAAGCAGGAGGGAGTGCGCCAAAGTCCCGTTTATTACACCAGTAAGGCCATGACTGAGGTCGAGACCAGATATCCCCAAAGTCACCTCGGCCCGGAGACTTAGGCCGTACTTCCAAGCGCACATCGTCGTAGTACTTACCAACTTGCCCCTAAAGAACATCTTCCATAAGCCAGAAACATCAGGGCGGCTGATGAAGTGGGCAATGGAGCTAAGCGAGTACGACATCCAATTCGAGCCCAGAACTGCGTTGAAGGGGCAAGTAGTGGAAGATTTCATCGCAGAGCTTACCCCACAATCTCAGTCGGCTGAATCCGACCTCCCGTGGACGATCTACGTTGACGGATCTTCCAATGAGAGAGGCTGCGGAGCAGGAATCCTCTTACTCGCACCAAGTGGCGTGTGGTTCGAGTACGCCCTGCAGTTCAACTTTCGGACTTCAAATAATGAGGCTGAGTACGAAGCACTTCTGGCAGGCCTACGCATTGGGGGCCATCCACATCAAGGTCTTTAGCGACTCCAAATTGGTCGTAAATCAAATTAAGGAGGAGTATCAAACGAAAGACCCCCGAATGGAAAAATATCTAAGCAAAGTCAGATCGCACCTCGCCCAGTTCGAGACTTACGAGGTGAGTCAGGTTCCAAGGTCAGAAAATTTCAATGCTGATGCCTTAGCCAAATTGGCATCAGCATATGAGACCGACCTGGCCAAATCAATCCCAGTGGAGATCTTGGACAATCCTTAAATCCTGGATCCAGACGTGATGGGGATTGACACTCCATCACCCTCGTGGATGGATCCAATCGTGGAGTTCATCAAAGGAAATCCATCGCAAGATCTAAAGGAGCAAAAGAAGATGGCACGGAAAGCAGCTCGGTTCATACTCCGAGAAGGAGCGTTGTACAAGAAGTGGCTTCTCCCTGCCTCTGCTTAA
mRNA sequence
ATGTTTGTGAAGTTCTTGCCTTTTTGCATGGAGACCAATTACCTCATTGTTGGTTTGGCAACGAGCGATAAACACTCGTTGGGTCCCTCTTGTCACTCTAGTTGGTTGGGTGCGTTTGTTGCGGGGCTCCTCCAGATGAAGCTGTATCACCAGACCTGGGTCATCCCCGCTAGAAGCATTCACTCTGCGTCTTTCTTCAGTGCTTCGCCCATTGGCAGAATTGTCGGAGCATTCTTTTCTCATTTGTTGAGGACCATCGACACTCTAATGATCCTCTCTTGTGCAGTAGTGAGGCACGACGACGTCCAGGAAGATATCCTGGCTCGTGCCAGGATTTGCTGCAGGGTGTTAGTGCCTCTCTCGTTTGCTGTCGTACATACGGATCATTTAGGTTCACTCTTGGGTTCAAGTCGAGCCAAGGATCGGGTTCGAGCTAGGTTCATAGAACAGTGTTGTGCAACTCTTTGCATAAACATTTTGGCGCCGTCTGTGGGGATCGACATCTTAAGTCGTCCCGATCTAGGAAGCCTACGCAAAGATGGTCCGAGCAGGACAAGAGGGAGATCTGCCACGAAGATCTACCCGCCATGCGAACCAGGAGTTACCCCCTGCTCACCCGAAACCCTCAAAAGCCAACCGAGGACGAGGTGGGACCTTAAGAAAGCCTCCCGAAGGGCTGACCCGACAGCAGACCCTGAGGCTCTGGCTACCCTCCAGCGCGAGTTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAACACGGGCTAACCAAACAGCCTTGCGGGACAATGACCTTCGAAAGCACCTTGCTGATAAAAAGAAGAGAGCGTCGCGAGGACCAGAAGACTCTCCGTCTTACTCCCGAGAGTTCTCCAACTCCGACCTCAAGGCTCAATCAAAGTATAAGCCTCTGACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAACAGGTCGAGGCGCTTAAGGCTAGGTGCGAGAAGAAGGAGAGCTCGCTTGACGATGGCGACCTAGGAGAATCGCCATTCACCTCGGACATTTTGGAGGCTCCAATCCCTCCAAAGTTCAAGACTCCCACCATGAAGTCTTATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAGGCCTCATGGACTTTCAAACGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAGATCGCGCTCACCGACAGCGCGCCTGTGGTATCGAAGACTGCCGGCCAGACAGCAACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGTTGAGAGAATATGTCACAAGGTTCCAGGAGGAGCAGCTAAAAGTCGCGCACTGCTCCGATGATTTGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACTCTCACCGTGAAGCTTGGAGAGGAGGCTCCAGCTACCTTTGCCGAAGTGCTACAGAAGGCGAAGAAAGTCATTGATGGACAAGAGCTCCTTCGAACCAAGACTGGCCGACCAGAAAAGAAAATCGACCAGCGGAGAGTTGGCAAAGATAAGGGAAAGACTGATGCCAAGTCCAAAGATAAAGGACCATCCTCCTCCAGTAGCCGAGCTGAGTATCGTAGGTCGGAAAACGGCCCCAGTCGAAGCCGACCTTACGAACAAAAACGTAACAAAGATAAATATTGCCGTTTTCATCACGATCACGGCCATAATACGTCAAATTGCTGGGAACTGAAGTGCCAGATTGAAGATCTCATTCAAGATGGTTACTTCAAAAAATTTGTGGGCAAACCGAGGTCCAACTTGGTGGAAAAGAAAGAAGAGAGGAAGCGTTCAAGGACGCCGCCTCGCCGAGATGACCGACCTGCGGTCATCAACACTATTTTTGGAGGCCCAAGTGGGGGCCAATCTGGAAATAAAAGAAAAGAACTAGCTCGCGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGACCGACCTGCTCCATTACTTTCGACGATACCGACCTGGAGAGGGTCCACTTGCCCCACAATGACGCGCTTGTGATTGCTCCTCTCGTTGATTACGTCGTAGTCAGAAGAGTGCTGGTAGATGGAGGTGCATCCGCCAACATCTTGTCCCTTTCAACATATCTTGCCTTGGGATGGACCAGGTCGCAGCTGAAGAAGAGTCCGACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTACGTGATGGGGATTGACACTCCATCACCCTCGTGGATGGATCCAATCGTGGAGTTCATCAAAGGAAATCCATCGCAAGATCTAAAGGAGCAAAAGAAGATGGCACGGAAAGCAGCTCGGTTCATACTCCGAGAAGGAGCGTTGTACAAGAAGTGGCTTCTCCCTGCCTCTGCTTAA
Coding sequence (CDS)
ATGTTTGTGAAGTTCTTGCCTTTTTGCATGGAGACCAATTACCTCATTGTTGGTTTGGCAACGAGCGATAAACACTCGTTGGGTCCCTCTTGTCACTCTAGTTGGTTGGGTGCGTTTGTTGCGGGGCTCCTCCAGATGAAGCTGTATCACCAGACCTGGGTCATCCCCGCTAGAAGCATTCACTCTGCGTCTTTCTTCAGTGCTTCGCCCATTGGCAGAATTGTCGGAGCATTCTTTTCTCATTTGTTGAGGACCATCGACACTCTAATGATCCTCTCTTGTGCAGTAGTGAGGCACGACGACGTCCAGGAAGATATCCTGGCTCGTGCCAGGATTTGCTGCAGGGTGTTAGTGCCTCTCTCGTTTGCTGTCGTACATACGGATCATTTAGGTTCACTCTTGGGTTCAAGTCGAGCCAAGGATCGGGTTCGAGCTAGGTTCATAGAACAGTGTTGTGCAACTCTTTGCATAAACATTTTGGCGCCGTCTGTGGGGATCGACATCTTAAGTCGTCCCGATCTAGGAAGCCTACGCAAAGATGGTCCGAGCAGGACAAGAGGGAGATCTGCCACGAAGATCTACCCGCCATGCGAACCAGGAGTTACCCCCTGCTCACCCGAAACCCTCAAAAGCCAACCGAGGACGAGGTGGGACCTTAAGAAAGCCTCCCGAAGGGCTGACCCGACAGCAGACCCTGAGGCTCTGGCTACCCTCCAGCGCGAGTTGGATGATATGCGCCATCGATTGCGCACAATGGAAGAAATGTACGCCGAGGCAACACGGGCTAACCAAACAGCCTTGCGGGACAATGACCTTCGAAAGCACCTTGCTGATAAAAAGAAGAGAGCGTCGCGAGGACCAGAAGACTCTCCGTCTTACTCCCGAGAGTTCTCCAACTCCGACCTCAAGGCTCAATCAAAGTATAAGCCTCTGACACCAGAAGCTGTGATCACCAGGGAAGAGTTCGACCTGATGAAACACAGGTTCGACGAACAGGTCGAGGCGCTTAAGGCTAGGTGCGAGAAGAAGGAGAGCTCGCTTGACGATGGCGACCTAGGAGAATCGCCATTCACCTCGGACATTTTGGAGGCTCCAATCCCTCCAAAGTTCAAGACTCCCACCATGAAGTCTTATGATGGGTCTAAGGACCCAAAAGACTATGTTGAGGTCTTCGAAGGCCTCATGGACTTTCAAACGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAGATCGCGCTCACCGACAGCGCGCCTGTGGTATCGAAGACTGCCGGCCAGACAGCAACTCACCTCGCCACCATCAGACAGAAGGAAGGTGAGACGTTGAGAGAATATGTCACAAGGTTCCAGGAGGAGCAGCTAAAAGTCGCGCACTGCTCCGATGATTTGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACTCTCACCGTGAAGCTTGGAGAGGAGGCTCCAGCTACCTTTGCCGAAGTGCTACAGAAGGCGAAGAAAGTCATTGATGGACAAGAGCTCCTTCGAACCAAGACTGGCCGACCAGAAAAGAAAATCGACCAGCGGAGAGTTGGCAAAGATAAGGGAAAGACTGATGCCAAGTCCAAAGATAAAGGACCATCCTCCTCCAGTAGCCGAGCTGAGTATCGTAGGTCGGAAAACGGCCCCAGTCGAAGCCGACCTTACGAACAAAAACGTAACAAAGATAAATATTGCCGTTTTCATCACGATCACGGCCATAATACGTCAAATTGCTGGGAACTGAAGTGCCAGATTGAAGATCTCATTCAAGATGGTTACTTCAAAAAATTTGTGGGCAAACCGAGGTCCAACTTGGTGGAAAAGAAAGAAGAGAGGAAGCGTTCAAGGACGCCGCCTCGCCGAGATGACCGACCTGCGGTCATCAACACTATTTTTGGAGGCCCAAGTGGGGGCCAATCTGGAAATAAAAGAAAAGAACTAGCTCGCGAAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGACCGACCTGCTCCATTACTTTCGACGATACCGACCTGGAGAGGGTCCACTTGCCCCACAATGACGCGCTTGTGATTGCTCCTCTCGTTGATTACGTCGTAGTCAGAAGAGTGCTGGTAGATGGAGGTGCATCCGCCAACATCTTGTCCCTTTCAACATATCTTGCCTTGGGATGGACCAGGTCGCAGCTGAAGAAGAGTCCGACACCCTTGGTTGGATTCTCTGGAGAATCGGTCTCTCCAGAAGGGTACGTGATGGGGATTGACACTCCATCACCCTCGTGGATGGATCCAATCGTGGAGTTCATCAAAGGAAATCCATCGCAAGATCTAAAGGAGCAAAAGAAGATGGCACGGAAAGCAGCTCGGTTCATACTCCGAGAAGGAGCGTTGTACAAGAAGTGGCTTCTCCCTGCCTCTGCTTAA
Protein sequence
MFVKFLPFCMETNYLIVGLATSDKHSLGPSCHSSWLGAFVAGLLQMKLYHQTWVIPARSIHSASFFSASPIGRIVGAFFSHLLRTIDTLMILSCAVVRHDDVQEDILARARICCRVLVPLSFAVVHTDHLGSLLGSSRAKDRVRARFIEQCCATLCINILAPSVGIDILSRPDLGSLRKDGPSRTRGRSATKIYPPCEPGVTPCSPETLKSQPRTRWDLKKASRRADPTADPEALATLQRELDDMRHRLRTMEEMYAEATRANQTALRDNDLRKHLADKKKRASRGPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSAPVVSKTAGQTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYEQKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNLVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESVSPEGYVMGIDTPSPSWMDPIVEFIKGNPSQDLKEQKKMARKAARFILREGALYKKWLLPASA
Homology
BLAST of Moc11g29410 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 682.9 bits (1761), Expect = 3.2e-192
Identity = 369/510 (72.35%), Postives = 399/510 (78.24%), Query Frame = 0
Query: 302 LKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDI 361
+KA+S P TP VITREEFD ++ + D QVEALKA+CE+KE L+DGDLGESPFTSD+
Sbjct: 2 VKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV 61
Query: 362 LEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSA---- 421
LEAPIPPKFK PT+K YDGSKDPKDYVEVFE LMDFQ A+DAIKCRAF+IALT SA
Sbjct: 62 LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWY 121
Query: 422 ---PVVSKTA------------------GQTATHLATIRQKEGETLREYVTRFQEEQLKV 481
P S + +TATHLATIRQKEGETLREYVTRFQEEQLKV
Sbjct: 122 RRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV 181
Query: 482 AHCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKK 541
AHCSDD AMCYFLTGLADE LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE+K
Sbjct: 182 AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERK 241
Query: 542 IDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYE--------------- 601
I + R GKD D KSKDKG S SS RAEYRR+ENGP+RSRPYE
Sbjct: 242 IGRGRSGKDIENADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 301
Query: 602 --------------------QKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKK 661
++R+KDKYCRFH +HGHNTS+ WELK QIE+LIQDGYFKK
Sbjct: 302 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 361
Query: 662 FVGKPRSNLVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVC 721
FVGKPR++ EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG KRKELAR ARREVC
Sbjct: 362 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 421
Query: 722 IIREQRPTCSITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYL 752
IIREQRPTC ITFD DLE VHLPHNDALVIAPL+D+VVV RVLVDGG SANILSL TYL
Sbjct: 422 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 481
BLAST of Moc11g29410 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 661.0 bits (1704), Expect = 1.3e-185
Identity = 366/509 (71.91%), Postives = 390/509 (76.62%), Query Frame = 0
Query: 303 KAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDIL 362
KA+S Y P+TP VITREEFD +K +FD QVEALKARCEKKESS DDGDLGE F+SDIL
Sbjct: 64 KAESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDIL 123
Query: 363 EAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSAPV--- 422
EA IPPKFKTPTMK YDGSKDPKDYVEVFE LMDFQ ATDAIKC AFQIALT SA +
Sbjct: 124 EALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYR 183
Query: 423 -----------------VSKTAG-----QTATHLATIRQKEGETLREYVTRFQEEQLKVA 482
+S+ + +T THLATIRQKEGETLREYVTRF EEQLKVA
Sbjct: 184 RLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVA 243
Query: 483 HCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKKI 542
HCSDD AMCYFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK I
Sbjct: 244 HCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNI 303
Query: 543 DQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYE---------------- 602
DQ R GKDKGK D+KS+DKGPSSSSSR +YRRS + ++SRPYE
Sbjct: 304 DQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNI 363
Query: 603 -------------------QKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKF 662
+KRN DKYCRFH DHGHNTSN WELK QIEDLIQDGYFKKF
Sbjct: 364 EETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKF 423
Query: 663 VGKPRSNLVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCI 722
VGKPRSN VEKKEERKR RTPPRRDDRPAVI NK+KELAREARREVCI
Sbjct: 424 VGKPRSNSVEKKEERKRLRTPPRRDDRPAVI-------------NKKKELAREARREVCI 483
Query: 723 IREQRPTCSITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLA 752
IREQRPT SI F+ DLE VHLPHNDALVIAPL+D V+VRR+LVDGGASANILSLSTYLA
Sbjct: 484 IREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLA 543
BLAST of Moc11g29410 vs. NCBI nr
Match:
XP_022155139.1 (uncharacterized protein LOC111022280 [Momordica charantia])
HSP 1 Score: 653.3 bits (1684), Expect = 2.8e-183
Identity = 362/486 (74.49%), Postives = 391/486 (80.45%), Query Frame = 0
Query: 266 ALRDNDLRKHLADKKKRASRGPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLM 325
+LRDNDLRKHL DKKK+AS PEDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLM
Sbjct: 33 SLRDNDLRKHLTDKKKKASWEPEDSLSYSREFSNSNLKAQSKYKPLIPEAVINREEFDLM 92
Query: 326 KHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDILEAPIPPKFKTPTMKSYDGSKDPK 385
KHRFDEQVEALKARCEKKES DD DLGESPFTSDI+EAPIPPKFKTPTMK YDGSKDPK
Sbjct: 93 KHRFDEQVEALKARCEKKESPFDDDDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPK 152
Query: 386 DYVEVFEGLMDFQTATDAIKCRAFQIALTDSAPVVSKTAGQTATHLATIRQKEGETLREY 445
DYVEVFEGLMDFQ ATDAIKC AFQIALT SA + + A ++T Q E + ++
Sbjct: 153 DYVEVFEGLMDFQAATDAIKCLAFQIALTGSARLWCRRL--PARSISTYSQLRKEFIGQF 212
Query: 446 VTRFQEEQLKVAHCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQEL 505
R + + H + + DETLTVKLGEEAPATFAEVLQ AKKVIDGQEL
Sbjct: 213 SFRHYDRK-TATHLAT-------IRQKEDETLTVKLGEEAPATFAEVLQNAKKVIDGQEL 272
Query: 506 LRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYEQKRN 565
LRTKT RPEK+IDQ+R+ + K K D+KSKDKG SSS SR EYRRSE+GPSRSRPYE+
Sbjct: 273 LRTKTDRPEKQIDQKRLSQKKRKDDSKSKDKGSSSSGSRTEYRRSESGPSRSRPYER--- 332
Query: 566 KDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNLVEKKEERKRSRTPPR 625
CWELK QIEDLIQD YFKKFVGKPRSN VEKKEERKRSRTPPR
Sbjct: 333 -----------------CWELKRQIEDLIQDSYFKKFVGKPRSNSVEKKEERKRSRTPPR 392
Query: 626 RDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDDTDLERVHLP 685
R+DRPAVINTIFGGPSGGQ NKRKELA EARR+V IIREQ+PTCSITF DTDLE VHLP
Sbjct: 393 REDRPAVINTIFGGPSGGQFENKRKELACEARRKVSIIREQKPTCSITFKDTDLEGVHLP 452
Query: 686 HNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESV 745
HNDALVIAPL+D+V+VRRVLVDGGASANILSL TYLAL TRSQLKKSPTPLVGFS ESV
Sbjct: 453 HNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALRGTRSQLKKSPTPLVGFSAESV 488
Query: 746 SPEGYV 752
SPEG +
Sbjct: 513 SPEGCI 488
BLAST of Moc11g29410 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 591.7 bits (1524), Expect = 9.8e-165
Identity = 330/488 (67.62%), Postives = 366/488 (75.00%), Query Frame = 0
Query: 299 NSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFT 358
+S+ +A+S + P TP+ VITREEFD ++ + + QVEALKA+CE+KE L+DGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 359 SDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSAP 418
SD+LEA PT+KSYDGSKDPKDYVEVFEGLMDFQ A+DAIKCRAFQIALT SA
Sbjct: 62 SDVLEA--------PTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 419 VVSKTAGQTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDLAMCYFLTGLADETLT 478
+ FQE+QLKVA SDD AMCYFLTGLADE LT
Sbjct: 122 L----------------------------WFQEDQLKVAQSSDDSAMCYFLTGLADEALT 181
Query: 479 VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGP 538
VKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ R GKD+ K D KSKDKG
Sbjct: 182 VKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDE-KADLKSKDKG- 241
Query: 539 SSSSSRAEYRRSENGPSRSRPYE-----------------------------------QK 598
S SS RAE+RR+ NGP+RSRPYE ++
Sbjct: 242 SFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPER 301
Query: 599 RNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNLVEKKEERKRSRTP 658
RNKDKYCRFH +H HNTS+ WELK QIEDLIQD YFKKFVGKPR++ EKKEERK SRTP
Sbjct: 302 RNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLSRTP 361
Query: 659 PRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDDTDLERVH 718
RR DRPAVINTIFGGPSGGQSG+KRKELAR ARREVCIIREQRPTC ITFD DLE VH
Sbjct: 362 LRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVH 421
Query: 719 LPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGE 752
LPHNDALVIAPL+D+VVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS E
Sbjct: 422 LPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGFSRE 451
BLAST of Moc11g29410 vs. NCBI nr
Match:
XP_022156088.1 (uncharacterized protein LOC111023060 [Momordica charantia])
HSP 1 Score: 581.6 bits (1498), Expect = 1.0e-161
Identity = 343/559 (61.36%), Postives = 381/559 (68.16%), Query Frame = 0
Query: 242 LDDMRHRLRTMEEMY--------AEATRANQTALRD------------------NDLRKH 301
++ MR ++RTMEEMY A + +Q D DLR H
Sbjct: 1 MEAMRTQMRTMEEMYNKMVQIAGARSRSGDQVVHEDVHEQGDLHCDPVDEEHLGGDLRDH 60
Query: 302 LADKKKRASRGPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEA 361
L K+ + RG S + + NS+ +A+S Y P+ PE VITREEF+ +K +FD QVEA
Sbjct: 61 LNRKRNSSHRGERTSTYWHK---NSNQQAESSYNPIAPEGVITREEFNQLKSKFDAQVEA 120
Query: 362 LKARCEKKESSLDDGDLGESPFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLM 421
LK RCEKKES+ DDGDLGESPFTSDILEA IPPKFKTPTMKSYDGSKDPKDYVEVFEGLM
Sbjct: 121 LKVRCEKKESAFDDGDLGESPFTSDILEASIPPKFKTPTMKSYDGSKDPKDYVEVFEGLM 180
Query: 422 DFQTATDAIKCRAFQIALTDSA-------PVVSKTA------------------GQTATH 481
DFQ ATDAIKCRAFQIALT SA P S + +T TH
Sbjct: 181 DFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTTTH 240
Query: 482 LATIRQKEGETLREYVTRFQEEQLKVAHCSDDLAMCYFLTGLADETLTVKLGEEAPATFA 541
LATIRQKEG+TL+EY+TRFQEEQLKV HCSDD +MCYFLTGLADET TVKLGEEA ATFA
Sbjct: 241 LATIRQKEGKTLKEYITRFQEEQLKVVHCSDDSSMCYFLTGLADETPTVKLGEEALATFA 300
Query: 542 EVLQKAKKVIDGQELLRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRS 601
EVLQ KK IDGQELLRTKT RPEK+IDQ++ +DK K D+KSKDKG SSS+SR +Y
Sbjct: 301 EVLQMEKKFIDGQELLRTKTDRPEKQIDQKKSSQDKRKADSKSKDKGSSSSNSRTDY--- 360
Query: 602 ENGPSRSRPYEQKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNL 661
C RSN
Sbjct: 361 -------------------------------------C------------------RSNS 420
Query: 662 VEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTC 721
VEKKEERKRSRTPPR DDRPAVINTIFGGPSGGQSGNKRKELAREA REVCIIREQRPTC
Sbjct: 421 VEKKEERKRSRTPPRLDDRPAVINTIFGGPSGGQSGNKRKELAREASREVCIIREQRPTC 480
Query: 722 SITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQL 750
S+TFDD+DLE VHLP+NDALVIAPL+D+V+VRRVLVDGGASANILS LALGWTRSQL
Sbjct: 481 SVTFDDSDLEGVHLPYNDALVIAPLIDHVLVRRVLVDGGASANILS----LALGWTRSQL 494
BLAST of Moc11g29410 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 682.9 bits (1761), Expect = 1.6e-192
Identity = 369/510 (72.35%), Postives = 399/510 (78.24%), Query Frame = 0
Query: 302 LKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDI 361
+KA+S P TP VITREEFD ++ + D QVEALKA+CE+KE L+DGDLGESPFTSD+
Sbjct: 2 VKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV 61
Query: 362 LEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSA---- 421
LEAPIPPKFK PT+K YDGSKDPKDYVEVFE LMDFQ A+DAIKCRAF+IALT SA
Sbjct: 62 LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWY 121
Query: 422 ---PVVSKTA------------------GQTATHLATIRQKEGETLREYVTRFQEEQLKV 481
P S + +TATHLATIRQKEGETLREYVTRFQEEQLKV
Sbjct: 122 RRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV 181
Query: 482 AHCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKK 541
AHCSDD AMCYFLTGLADE LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE+K
Sbjct: 182 AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERK 241
Query: 542 IDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYE--------------- 601
I + R GKD D KSKDKG S SS RAEYRR+ENGP+RSRPYE
Sbjct: 242 IGRGRSGKDIENADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 301
Query: 602 --------------------QKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKK 661
++R+KDKYCRFH +HGHNTS+ WELK QIE+LIQDGYFKK
Sbjct: 302 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 361
Query: 662 FVGKPRSNLVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVC 721
FVGKPR++ EKKEERKRSRTPPRR DRPAVINTIFGGPSGGQSG KRKELAR ARREVC
Sbjct: 362 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 421
Query: 722 IIREQRPTCSITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYL 752
IIREQRPTC ITFD DLE VHLPHNDALVIAPL+D+VVV RVLVDGG SANILSL TYL
Sbjct: 422 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 481
BLAST of Moc11g29410 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 661.0 bits (1704), Expect = 6.4e-186
Identity = 366/509 (71.91%), Postives = 390/509 (76.62%), Query Frame = 0
Query: 303 KAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDIL 362
KA+S Y P+TP VITREEFD +K +FD QVEALKARCEKKESS DDGDLGE F+SDIL
Sbjct: 64 KAESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDIL 123
Query: 363 EAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSAPV--- 422
EA IPPKFKTPTMK YDGSKDPKDYVEVFE LMDFQ ATDAIKC AFQIALT SA +
Sbjct: 124 EALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYR 183
Query: 423 -----------------VSKTAG-----QTATHLATIRQKEGETLREYVTRFQEEQLKVA 482
+S+ + +T THLATIRQKEGETLREYVTRF EEQLKVA
Sbjct: 184 RLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVA 243
Query: 483 HCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKKI 542
HCSDD AMCYFLTGLADETLTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK I
Sbjct: 244 HCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNI 303
Query: 543 DQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYE---------------- 602
DQ R GKDKGK D+KS+DKGPSSSSSR +YRRS + ++SRPYE
Sbjct: 304 DQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNI 363
Query: 603 -------------------QKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKF 662
+KRN DKYCRFH DHGHNTSN WELK QIEDLIQDGYFKKF
Sbjct: 364 EETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKF 423
Query: 663 VGKPRSNLVEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCI 722
VGKPRSN VEKKEERKR RTPPRRDDRPAVI NK+KELAREARREVCI
Sbjct: 424 VGKPRSNSVEKKEERKRLRTPPRRDDRPAVI-------------NKKKELAREARREVCI 483
Query: 723 IREQRPTCSITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLA 752
IREQRPT SI F+ DLE VHLPHNDALVIAPL+D V+VRR+LVDGGASANILSLSTYLA
Sbjct: 484 IREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLA 543
BLAST of Moc11g29410 vs. ExPASy TrEMBL
Match:
A0A6J1DPC9 (uncharacterized protein LOC111022280 OS=Momordica charantia OX=3673 GN=LOC111022280 PE=4 SV=1)
HSP 1 Score: 653.3 bits (1684), Expect = 1.3e-183
Identity = 362/486 (74.49%), Postives = 391/486 (80.45%), Query Frame = 0
Query: 266 ALRDNDLRKHLADKKKRASRGPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLM 325
+LRDNDLRKHL DKKK+AS PEDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLM
Sbjct: 33 SLRDNDLRKHLTDKKKKASWEPEDSLSYSREFSNSNLKAQSKYKPLIPEAVINREEFDLM 92
Query: 326 KHRFDEQVEALKARCEKKESSLDDGDLGESPFTSDILEAPIPPKFKTPTMKSYDGSKDPK 385
KHRFDEQVEALKARCEKKES DD DLGESPFTSDI+EAPIPPKFKTPTMK YDGSKDPK
Sbjct: 93 KHRFDEQVEALKARCEKKESPFDDDDLGESPFTSDIMEAPIPPKFKTPTMKPYDGSKDPK 152
Query: 386 DYVEVFEGLMDFQTATDAIKCRAFQIALTDSAPVVSKTAGQTATHLATIRQKEGETLREY 445
DYVEVFEGLMDFQ ATDAIKC AFQIALT SA + + A ++T Q E + ++
Sbjct: 153 DYVEVFEGLMDFQAATDAIKCLAFQIALTGSARLWCRRL--PARSISTYSQLRKEFIGQF 212
Query: 446 VTRFQEEQLKVAHCSDDLAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQEL 505
R + + H + + DETLTVKLGEEAPATFAEVLQ AKKVIDGQEL
Sbjct: 213 SFRHYDRK-TATHLAT-------IRQKEDETLTVKLGEEAPATFAEVLQNAKKVIDGQEL 272
Query: 506 LRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRSENGPSRSRPYEQKRN 565
LRTKT RPEK+IDQ+R+ + K K D+KSKDKG SSS SR EYRRSE+GPSRSRPYE+
Sbjct: 273 LRTKTDRPEKQIDQKRLSQKKRKDDSKSKDKGSSSSGSRTEYRRSESGPSRSRPYER--- 332
Query: 566 KDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNLVEKKEERKRSRTPPR 625
CWELK QIEDLIQD YFKKFVGKPRSN VEKKEERKRSRTPPR
Sbjct: 333 -----------------CWELKRQIEDLIQDSYFKKFVGKPRSNSVEKKEERKRSRTPPR 392
Query: 626 RDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDDTDLERVHLP 685
R+DRPAVINTIFGGPSGGQ NKRKELA EARR+V IIREQ+PTCSITF DTDLE VHLP
Sbjct: 393 REDRPAVINTIFGGPSGGQFENKRKELACEARRKVSIIREQKPTCSITFKDTDLEGVHLP 452
Query: 686 HNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESV 745
HNDALVIAPL+D+V+VRRVLVDGGASANILSL TYLAL TRSQLKKSPTPLVGFS ESV
Sbjct: 453 HNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALRGTRSQLKKSPTPLVGFSAESV 488
Query: 746 SPEGYV 752
SPEG +
Sbjct: 513 SPEGCI 488
BLAST of Moc11g29410 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 591.7 bits (1524), Expect = 4.8e-165
Identity = 330/488 (67.62%), Postives = 366/488 (75.00%), Query Frame = 0
Query: 299 NSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEALKARCEKKESSLDDGDLGESPFT 358
+S+ +A+S + P TP+ VITREEFD ++ + + QVEALKA+CE+KE L+DGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 359 SDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLMDFQTATDAIKCRAFQIALTDSAP 418
SD+LEA PT+KSYDGSKDPKDYVEVFEGLMDFQ A+DAIKCRAFQIALT SA
Sbjct: 62 SDVLEA--------PTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 419 VVSKTAGQTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDLAMCYFLTGLADETLT 478
+ FQE+QLKVA SDD AMCYFLTGLADE LT
Sbjct: 122 L----------------------------WFQEDQLKVAQSSDDSAMCYFLTGLADEALT 181
Query: 479 VKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGP 538
VKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ R GKD+ K D KSKDKG
Sbjct: 182 VKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDE-KADLKSKDKG- 241
Query: 539 SSSSSRAEYRRSENGPSRSRPYE-----------------------------------QK 598
S SS RAE+RR+ NGP+RSRPYE ++
Sbjct: 242 SFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPER 301
Query: 599 RNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNLVEKKEERKRSRTP 658
RNKDKYCRFH +H HNTS+ WELK QIEDLIQD YFKKFVGKPR++ EKKEERK SRTP
Sbjct: 302 RNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLSRTP 361
Query: 659 PRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTCSITFDDTDLERVH 718
RR DRPAVINTIFGGPSGGQSG+KRKELAR ARREVCIIREQRPTC ITFD DLE VH
Sbjct: 362 LRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVH 421
Query: 719 LPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGE 752
LPHNDALVIAPL+D+VVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS E
Sbjct: 422 LPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGFSRE 451
BLAST of Moc11g29410 vs. ExPASy TrEMBL
Match:
A0A6J1DPN4 (uncharacterized protein LOC111023060 OS=Momordica charantia OX=3673 GN=LOC111023060 PE=4 SV=1)
HSP 1 Score: 581.6 bits (1498), Expect = 4.9e-162
Identity = 343/559 (61.36%), Postives = 381/559 (68.16%), Query Frame = 0
Query: 242 LDDMRHRLRTMEEMY--------AEATRANQTALRD------------------NDLRKH 301
++ MR ++RTMEEMY A + +Q D DLR H
Sbjct: 1 MEAMRTQMRTMEEMYNKMVQIAGARSRSGDQVVHEDVHEQGDLHCDPVDEEHLGGDLRDH 60
Query: 302 LADKKKRASRGPEDSPSYSREFSNSDLKAQSKYKPLTPEAVITREEFDLMKHRFDEQVEA 361
L K+ + RG S + + NS+ +A+S Y P+ PE VITREEF+ +K +FD QVEA
Sbjct: 61 LNRKRNSSHRGERTSTYWHK---NSNQQAESSYNPIAPEGVITREEFNQLKSKFDAQVEA 120
Query: 362 LKARCEKKESSLDDGDLGESPFTSDILEAPIPPKFKTPTMKSYDGSKDPKDYVEVFEGLM 421
LK RCEKKES+ DDGDLGESPFTSDILEA IPPKFKTPTMKSYDGSKDPKDYVEVFEGLM
Sbjct: 121 LKVRCEKKESAFDDGDLGESPFTSDILEASIPPKFKTPTMKSYDGSKDPKDYVEVFEGLM 180
Query: 422 DFQTATDAIKCRAFQIALTDSA-------PVVSKTA------------------GQTATH 481
DFQ ATDAIKCRAFQIALT SA P S + +T TH
Sbjct: 181 DFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFFSRHYDRKTTTH 240
Query: 482 LATIRQKEGETLREYVTRFQEEQLKVAHCSDDLAMCYFLTGLADETLTVKLGEEAPATFA 541
LATIRQKEG+TL+EY+TRFQEEQLKV HCSDD +MCYFLTGLADET TVKLGEEA ATFA
Sbjct: 241 LATIRQKEGKTLKEYITRFQEEQLKVVHCSDDSSMCYFLTGLADETPTVKLGEEALATFA 300
Query: 542 EVLQKAKKVIDGQELLRTKTGRPEKKIDQRRVGKDKGKTDAKSKDKGPSSSSSRAEYRRS 601
EVLQ KK IDGQELLRTKT RPEK+IDQ++ +DK K D+KSKDKG SSS+SR +Y
Sbjct: 301 EVLQMEKKFIDGQELLRTKTDRPEKQIDQKKSSQDKRKADSKSKDKGSSSSNSRTDY--- 360
Query: 602 ENGPSRSRPYEQKRNKDKYCRFHHDHGHNTSNCWELKCQIEDLIQDGYFKKFVGKPRSNL 661
C RSN
Sbjct: 361 -------------------------------------C------------------RSNS 420
Query: 662 VEKKEERKRSRTPPRRDDRPAVINTIFGGPSGGQSGNKRKELAREARREVCIIREQRPTC 721
VEKKEERKRSRTPPR DDRPAVINTIFGGPSGGQSGNKRKELAREA REVCIIREQRPTC
Sbjct: 421 VEKKEERKRSRTPPRLDDRPAVINTIFGGPSGGQSGNKRKELAREASREVCIIREQRPTC 480
Query: 722 SITFDDTDLERVHLPHNDALVIAPLVDYVVVRRVLVDGGASANILSLSTYLALGWTRSQL 750
S+TFDD+DLE VHLP+NDALVIAPL+D+V+VRRVLVDGGASANILS LALGWTRSQL
Sbjct: 481 SVTFDDSDLEGVHLPYNDALVIAPLIDHVLVRRVLVDGGASANILS----LALGWTRSQL 494
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 1.6e-192 | 72.35 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 6.4e-186 | 71.91 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DPC9 | 1.3e-183 | 74.49 | uncharacterized protein LOC111022280 OS=Momordica charantia OX=3673 GN=LOC111022... | [more] |
A0A6J1D9E1 | 4.8e-165 | 67.62 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DPN4 | 4.9e-162 | 61.36 | uncharacterized protein LOC111023060 OS=Momordica charantia OX=3673 GN=LOC111023... | [more] |
Match Name | E-value | Identity | Description | |