Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCACTTCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAACTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGACGATTCACTGAACGAAGGCGACTTTAGAGAGTCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCATGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGAATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGAGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTACCGCTTCCATCGGGAGCACGACCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGAAAGAGCGAAAGCGTTCAAGGACGCCGCCTCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCAACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACTTCGCCTT
GGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGTCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGGTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCAACCCGCCGAGGAAGGAGTTTGCCGCACCTACAGAGGAGCTCGAGCTTGTTTCGCTGCTTAGTCTCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCTATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATTGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTGAATACATAAGAGAAATTTTGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCACAAGCTGCTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGACCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGATCTGACCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCCGCTAAGTGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTCGGCTTCATGGTGAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGACCAAGATGGAGGCACCGAAGACGCTGAAGCAGCTTCAGTGTCTCAATGGCAGGATTGCGGCCCTGAGCCGGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAATCCTACGAAAGAAAGGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCGGCACCTTTGCTCGCCAAGCCCATGCCGGGGGAAGGGCTCCAATTGTACTTAGCAGTGTCTGACAGTGCCGTCAGCTCGACCCTAATCAGGCAAGAGGAAGCGTGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGGCTTAGACTATACTTCCAAGCCCATACGGTGGTGGTGCTCACTAACTTGCCCCTAAAAAGCATCTTCCATAAGCCGGAAGCTTCTGGACGCCTAATGAAGTGGGCAATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGAACAAGCAGCGGCAGATTTCATACCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGGACCAGGGGACGAGCGATTTGAGTATGCCTTGCGGT
TCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAGTACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGACCAGAAAATTCTAATGCTGACGCCTTGGCCAAGCTAGCATCGGCGTACGAGACCGACTTGGCCAGGTCGGTCCCCATTGAGATCTTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGTGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCAAGATGGGGTATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCAAAAATGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCAAGGCCGAGGCGCTCTCCCACATAACGGAATCCAGAGTCACGTCCTTCGTATGGACGAATATCATATGTCGCTTTGGTATACCGCATGCCATCGTGACAGACAATGGGAAGTAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTTAGCTCGTCCCCCGCCCATTCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGGGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTAGGCCGAGGAGCTACCAGAGGTTCTGTGGTTGTACCGGACCACCCAAAGAGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCAGCATGCCATCTGACAGAGTAGAGCATTACAAGCCTATGGCAAATGAGGAAGAGTTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGTAGTGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTTGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCATGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCACTTCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAACTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGACGATTCACTGAACGAAGGCGACTTTAGAGAGTCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCATGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGAATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGAGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTACCGCTTCCATCGGGAGCACGACCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGAAAGAGCGAAAGCGTTCAAGGACGCCGCCTCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCAACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACTTCGCCTT
GGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGTCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGGTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCAACCCGCCGAGGAAGGAGTTTGCCGCACCTACAGAGGAGCTCGAGCTTGTTTCGCTGCTTAGTCTCGAGAAGCAGACCGACTTGGCCAGGTCGGTCCCCATTGAGATCTTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGTGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCAAGATGGGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCATGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCACTTCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAACTGAGGGGCAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGACGATTCACTGAACGAAGGCGACTTTAGAGAGTCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCATGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCGGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGAATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGAGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTACCGCTTCCATCGGGAGCACGACCATAACACGTCAGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGAAAGAGCGAAAGCGTTCAAGGACGCCGCCTCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCAACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACTTCGCCTT
GGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGTCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCTCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGGTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCCCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCAACCCGCCGAGGAAGGAGTTTGCCGCACCTACAGAGGAGCTCGAGCTTGTTTCGCTGCTTAGTCTCGAGAAGCAGACCGACTTGGCCAGGTCGGTCCCCATTGAGATCTTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGTGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCAAGATGGGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCATGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMMLAAGAGSRSENRVTRVDIREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFRESPFTSDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYFALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSVCALETLPGRDGTLEFEANPPRKEFAAPTEELELVSLLSLEKQTDLARSVPIEILDNPSILEPDLMEIGAPESSWMDPIADFIRGNSPQDPKECRKLARRAARFVVQDGVGHLVLRRVQTHVGALDPAWEGPFEVKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP
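The relationship between the coding sequence and the protein sequence listed above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first 42 bases and the last 15 bases of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons that are not in the table (for example, ones containing N)
    are reported as 'X'. A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 14 codons of the CDS listed above.
cds_start = "ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGG"
# Last 5 codons of the CDS listed above, including the stop codon TGA.
cds_end = "CGTTATTATCCTTGA"

print(translate(cds_start))  # MVQPANSTNTADRR
print(translate(cds_end))    # RYYP
```

Both results agree with the start (MVQPANSTNTADRR...) and the end (...KRYYP) of the protein sequence above.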
Homology
BLAST of Moc09g31960 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 951.0 bits (2457), Expect = 7.5e-273
Identity = 489/528 (92.61%), Positives = 504/528 (95.45%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFRESPFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LN+GD ESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPT+KPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD+E ADPKSKDK SFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLR APERRSKDKY RFHREH HNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYFAL 670
REQ PTCPITF+GADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTY AL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFV 716
GWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
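The percentages in an HSP summary line are the ratio of matching columns to alignment length. The sketch below recomputes them from the raw counts; it assumes the line layout shown in this report, and `parse_hsp_stats` is a hypothetical helper written for illustration, not a function of any BLAST tool.

```python
import re


def parse_hsp_stats(line):
    """Extract 'Name = matched/length' pairs from an HSP summary line.

    Returns a dict mapping each name to a tuple of
    (matched columns, alignment length, percentage rounded to 2 places).
    Fields without a 'matched/length' ratio, such as 'Query Frame = 0',
    are skipped.
    """
    stats = {}
    for name, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matched, length = int(matched), int(length)
        stats[name] = (matched, length, round(100 * matched / length, 2))
    return stats


line = "Identity = 489/528 (92.61%), Positives = 504/528 (95.45%), Query Frame = 0"
for name, (matched, length, percent) in parse_hsp_stats(line).items():
    print(f"{name}: {matched}/{length} = {percent}%")
```

For the first hit, 489 identical columns out of 528 give 92.61% identity, and 504 conservative-or-identical columns out of 528 give 95.45% positives, matching the reported values.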
BLAST of Moc09g31960 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 941.4 bits (2432), Expect = 5.9e-270
Identity = 513/665 (77.14%), Positives = 542/665 (81.50%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFRESPFT 246
SSNQQAESSHNPA G+ITREEFDQLRGKL+AQVEALKAKCEQK+ LN+GD ESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APT+K YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD E+AD KSKDK SFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYF 546
TNIEESGMEKLLKRPEKLR APERR+KDKY RFHREHDHNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRE 606
KKFVGKPRTSSAEKK+ERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 VCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 666
VCIIREQ PTCPITF+ ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YFALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAY 726
Y ALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NVIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSVCALETL 786
N IFGRPIIHSFRAIPSTLHQVLKY T NGVG VRGEQ ASRE YASALKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 601
Query: 787 PGRDGTLEFEANPPRKEFAAPTEELELVSLLSLEKQTDLARSVPIEILDNPSILEPDLME 846
RDGTLEF+AN PR+EFAAPTEELELV LL + ++ ++ + + ++ D+
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDDDIGV 604
Query: 847 IGAPE 849
G PE
Sbjct: 662 EGMPE 604
BLAST of Moc09g31960 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 940.6 bits (2430), Expect = 1.0e-269
Identity = 522/790 (66.08%), Positives = 566/790 (71.65%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHP 60
MVQPANSTNTADRR LAA+ HQREVGA VEGQGH+ L TEPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMMLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRVDIREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFR 240
AESS+NP G+ITREEFDQL+ K DAQVEALKA+CE+K+ S ++GD
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PTMKPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDVERADPKSKDK-ESFSSGRAEYRRAESGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD +AD KS+DK S SS R +YRR+ S +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDL 540
PI EILTNIEE+GMEKLLKRPEKLR PE+R+ DKY RFHR+H HNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA 600
IQDGYFKKFVGKPR++S EKK+ERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I FN ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYFALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVI 720
ILSL TY ALGWTRSQLKKSPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVVI
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSV 780
DGRSAYN IFGRPIIHSFRA+PSTLHQVLKY TLNGVGTVRGE SRE YAS K SSV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALETLPGRD 788
CALE RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc09g31960 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 779.6 bits (2012), Expect = 3.0e-221
Identity = 399/422 (94.55%), Positives = 407/422 (96.45%), Query Frame = 0
Query: 228 KDDSLNEGDFRESPFTSDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASD 287
KDDSLN+GD ES FTSDVLEAPIPPKFKAPT+KPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 407
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSR 467
KVIDGQELLRTKTGRP+RKIGRGRSGKDVERADPKSKDK SFSSGRAEYRRAESGPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDC 527
PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLR APERRSKDKY RFHREH HNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQIEDLIQDGYFKKFVGKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVCIIREQGPTCPITF+GAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 650
VL
Sbjct: 464 VL 465
BLAST of Moc09g31960 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 774.2 bits (1998), Expect = 1.3e-219
Identity = 401/448 (89.51%), Positives = 414/448 (92.41%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 DVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 494
D+E DPKSKDK SFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 554
LKRPEKLR APERRSKDKY RFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC 614
AEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYFALGWTRSQL 674
PITF+ ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY ALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNVIFGRPIIHS 734
KKSPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYN IFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSVCALETLPGRDGTLEFEA 794
FRAIPSTLHQVLKY T NGVGTVRGEQTASRE YAS LKG+SVCALETL RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 NPPRKEFAAPTEELELVSLLSLEKQTDL 823
+ P +EFAAP EELELV LLS EKQ L
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQL 441
BLAST of Moc09g31960 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 951.0 bits (2457), Expect = 3.6e-273
Identity = 489/528 (92.61%), Positives = 504/528 (95.45%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFRESPFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LN+GD ESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPT+KPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD+E ADPKSKDK SFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLR APERRSKDKY RFHREH HNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYFAL 670
REQ PTCPITF+GADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTY AL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFV 716
GWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc09g31960 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 941.4 bits (2432), Expect = 2.9e-270
Identity = 513/665 (77.14%), Positives = 542/665 (81.50%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFRESPFT 246
SSNQQAESSHNPA G+ITREEFDQLRGKL+AQVEALKAKCEQK+ LN+GD ESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APT+K YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD E+AD KSKDK SFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYF 546
TNIEESGMEKLLKRPEKLR APERR+KDKY RFHREHDHNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRE 606
KKFVGKPRTSSAEKK+ERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 VCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 666
VCIIREQ PTCPITF+ ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YFALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAY 726
Y ALGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NVIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSVCALETL 786
N IFGRPIIHSFRAIPSTLHQVLKY T NGVG VRGEQ ASRE YASALKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 601
Query: 787 PGRDGTLEFEANPPRKEFAAPTEELELVSLLSLEKQTDLARSVPIEILDNPSILEPDLME 846
RDGTLEF+AN PR+EFAAPTEELELV LL + ++ ++ + + ++ D+
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDDDIGV 604
Query: 847 IGAPE 849
G PE
Sbjct: 662 EGMPE 604
BLAST of Moc09g31960 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 940.6 bits (2430), Expect = 4.9e-270
Identity = 522/790 (66.08%), Positives = 566/790 (71.65%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHP 60
MVQPANSTNTADRR LAA+ HQREVGA VEGQGH+ L TEPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMYNEMMLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRVDIREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--AGIITREEFDQLRGKLDAQVEALKAKCEQKDDSLNEGDFR 240
AESS+NP G+ITREEFDQL+ K DAQVEALKA+CE+K+ S ++GD
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PTMKPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDVERADPKSKDK-ESFSSGRAEYRRAESGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD +AD KS+DK S SS R +YRR+ S +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDL 540
PI EILTNIEE+GMEKLLKRPEKLR PE+R+ DKY RFHR+H HNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA 600
IQDGYFKKFVGKPR++S EKK+ERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I FN ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYFALGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVI 720
ILSL TY ALGWTRSQLKKSPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVVI
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNVIFGRPIIHSFRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSV 780
DGRSAYN IFGRPIIHSFRA+PSTLHQVLKY TLNGVGTVRGE SRE YAS K SSV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALETLPGRD 788
CALE RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc09g31960 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 779.6 bits (2012), Expect = 1.4e-221
Identity = 399/422 (94.55%), Positives = 407/422 (96.45%), Query Frame = 0
Query: 228 KDDSLNEGDFRESPFTSDVLEAPIPPKFKAPTMKPYDGTKDPKDYVEVFEGLMDFQAASD 287
KDDSLN+GD ES FTSDVLEAPIPPKFKAPT+KPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 407
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKESFSSGRAEYRRAESGPTRSR 467
KVIDGQELLRTKTGRP+RKIGRGRSGKDVERADPKSKDK SFSSGRAEYRRAESGPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRRAPERRSKDKYYRFHREHDHNTSDC 527
PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLR APERRSKDKY RFHREH HNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQIEDLIQDGYFKKFVGKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCIIREQGPTCPITFNGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVCIIREQGPTCPITF+GAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 650
VL
Sbjct: 464 VL 465
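The percentages on each HSP summary line follow directly from the raw counts (for example, 399 identical positions out of 422 aligned gives 94.55%). A minimal sketch of that arithmetic is shown here, assuming the summary line has the layout printed on this page; the names `HSP_STATS` and `hsp_percentages` are hypothetical and are not part of any BLAST tool.

```python
import re

# Matches the statistics line of one HSP as printed on this page (assumed layout).
# The optional "i" accepts both the "Positives" and "Postives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    # Percentages are rounded to two decimals, as in the summary lines.
    return (round(100 * identical / ident_len, 2),
            round(100 * positive / pos_len, 2))


if __name__ == "__main__":
    line = "Identity = 399/422 (94.55%), Positives = 407/422 (96.45%), Query Frame = 0"
    print(hsp_percentages(line))
```

Recomputing the values this way is a quick consistency check between an alignment block and the Identity column of the summary tables.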
BLAST of Moc09g31960 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 774.2 bits (1998), Expect = 6.1e-220
Identity = 401/448 (89.51%), Positives = 414/448 (92.41%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 DVERADPKSKDKESFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 494
D+E DPKSKDK SFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRRAPERRSKDKYYRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 554
LKRPEKLR APERRSKDKY RFHREH HNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC 614
AEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFNGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYFALGWTRSQL 674
PITF+ ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY ALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KKSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNVIFGRPIIHS 734
KKSPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYN IFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRAIPSTLHQVLKYPTLNGVGTVRGEQTASREWYASALKGSSVCALETLPGRDGTLEFEA 794
FRAIPSTLHQVLKY T NGVGTVRGEQTASRE YAS LKG+SVCALETL RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 NPPRKEFAAPTEELELVSLLSLEKQTDL 823
+ P +EFAAP EELELV LLS EKQ L
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQL 441
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022137317.1 | 7.5e-273 | 92.61 | uncharacterized protein LOC111008813 [Momordica charantia]
XP_022150760.1 | 5.9e-270 | 77.14 | uncharacterized protein LOC111018823 [Momordica charantia]
XP_022152854.1 | 1.0e-269 | 66.08 | uncharacterized protein LOC111020479 [Momordica charantia]
XP_022150613.1 | 3.0e-221 | 94.55 | uncharacterized protein LOC111018708, partial [Momordica charantia]
XP_022152110.1 | 1.3e-219 | 89.51 | uncharacterized protein LOC111019899 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A6J1C7X5 | 3.6e-273 | 92.61 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1D9E1 | 2.9e-270 | 77.14 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DHB3 | 4.9e-270 | 66.08 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1D9W7 | 1.4e-221 | 94.55 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DD03 | 6.1e-220 | 89.51 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019...