Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCAGTGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACAGATCACCGCGCCTGCCCTACCGCCTGCGCACCCAAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCTAGAAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGGTGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTCTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGAGACGAAGGACCCCAAGGACTACGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATTTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGTGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACGGCTCCGACGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACAAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGATGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGGACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCATAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACCTCGGGGAGCCCCAGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGATCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACCGACCGAACTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCTGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGATCCACCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCTACCCCCAATGGCGTGGGCGCGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGTCGACCTGTCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGATCCCATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATCGCCACAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGACAGGAGTAATGTAATTGTTGAGGAAGTTAACAAACTTTTGAAAGCTGAATACATAAGAGAAATTTTGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTAAGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCATGAACTGCTCACCTTCATGGACACCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTATATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACCTGGCCGAAGTCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCTAAGTGTGCCTTTGGAGTCTCCTCGGGAAAATTCCTCGGCTTCATGGTGAATAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGACCGAGATAGAGGCACCGAAGACGCTGAAGCAGCTTCAGTGTCTCAATGGCAGGGTTGCGGCCCTGAGCCGGTTTGTTTCAAGAGCGACAGATAAGTGCCTTTCTTTCTTCAAAATCCTACGGAAGAAAGGGTCGTTTGAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCAGCACCTTTGCTCGCCAAGCCCATGCCGCGGGACAAGCTCCAATTGTACTTAGCAGTGTCTGACAGTGCCGTCAGCTCGGTCCTAATCAGACAACAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGGTACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGGCTCAGACCATACTTCCAAGCCCATACGGTCGTGGTGCTCACTAACTTGCCCCTAAAAAGCATCTTCCATAAGCCGGAAGCTTCTGGACGCCTAATGAAGTGGGCAATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCCGAGCGAGTCCGACCTACCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGGACCAGGTGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAATCATTTATTGCCGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAGTACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGAATTCCACGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAACCCCTCGATCTTAGAGCCAGATCAGATGGAGATCAGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCACTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATATTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACAAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAAGCACTCTCCCACATAACGGAATCTAGGGTCACGTCCTTCGTGTGGACGAATATCGTATGTCGCTTTGGTATACCGCAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTCAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGGTTCTATGGTCGTACCGGACCACCCAACGAGAATCGACGGGTGAGACACCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGTCATCTGACAGAGTAGAGCATTACGAGCCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGATCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAAGGCATAGTCCGACCTGGGACGTACACATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGTGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACAGATCACCGCGCCTGCCCTACCGCCTGCGCACCCAAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGAAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGGTGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTCTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGAGACGAAGGACCCCAAGGACTACGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATTTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGTGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACGGCTCCGACGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACAAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGATGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGGACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCATAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACCTCGGGGAGCCCCAGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGATCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACCGACCGAACTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCTGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGATCCACCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCTACCCCCAATGGCGTGGGCGCGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGTCGACCTGTCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAACCCCTCGATCTTAGAGCCAGATCAGATGGAGATCAGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCACTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAAGGCATAGTCCGACCTGGGACGTACACATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGTGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACAGATCACCGCGCCTGCCCTACCGCCTGCGCACCCAAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGATCCAGCTCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGAAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGGTGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTCTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGAGACGAAGGACCCCAAGGACTACGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATTTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAATTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGTGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACGGCTCCGACGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACAAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGATGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGGACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCATAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACCTCGGGGAGCCCCAGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGATCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCGCCCCGGCGCACCGACCGAACTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCTGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGATCCACCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCTACCCCCAATGGCGTGGGCGCGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGTCGACCTGTCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAACCCCTCGATCTTAGAGCCAGATCAGATGGAGATCAGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAAAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCACTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAAGGCATAGTCCGACCTGGGACGTACACATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPVNSTNTTDRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSAQITAPALPPAHPRTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAMRTQMRSMEAMYNEMKRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQLKVAHGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKIGRGISGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRSTIHSFRAIPSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEVDLSRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDQMEISAPESSWMDPIADFIRGNSPQDPKERKKLARRAARFVVRDGALYRRGFSLPLLRCLTPEEGLRVQTHVGALDPAWEGPFEIKGIVRPGTYTLADLKGDVLAHPWNAEHLKRYYP
Homology
BLAST of Moc07g04420 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 945.7 bits (2443), Expect = 3.1e-271
Identity = 489/528 (92.61%), Postives = 501/528 (94.89%), Query Frame = 0
Query: 170 QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFTSDVL 229
+AESS N P G+ITREEFDQLRG+LDAQVEALKAKCEQK+ LN GDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 230 EAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 289
EAPIPPKFKAPTVKPYD +KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 290 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQLKVA 349
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEG TLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 350 HGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKI 409
H SDDSAMCYFLTGLAD+ALTVKLGEEAPATFA+VLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 410 GRGISGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE 469
GRG SGKD E ADPKSKDKGSFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 470 ESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 529
ESGMEKLLKRPEK RGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 530 GKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCII 589
GKPR SSAEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 590 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 649
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 650 GWTRSQLKRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV 694
GWTRSQLK+SPT LVGFSGESVIPEG IDLPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc07g04420 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 945.7 bits (2443), Expect = 3.1e-271
Identity = 506/630 (80.32%), Postives = 526/630 (83.49%), Query Frame = 0
Query: 166 SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFT 225
SSNQQAESSHNP G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LN GDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 226 SDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 285
SDVLE APTVK YD +KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 286 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQ 345
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 346 LKVAHGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRP 405
LKVA SDDSAMCYFLTGLAD+ALTVKLG+EAPATFA+VLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 406 ERKIGRGISGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILT 465
ER I RG SGKDE+AD KSKDKGSFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 466 NIEESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 525
NIEESGMEKLLKRPEK RGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 526 KFVGKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREV 585
KFVGKPR SSAEKKEERK SRTP RR DR AVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 586 CIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 645
CIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 646 LALGWTRSQLKRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYN 705
LALGWTRSQLK+S T LVGFS ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 706 AIFGRSTIHSFRAIPSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLA 765
AIFGR IHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 570
Query: 766 GRDGTLEFEVDLSRKEFAAPTEELELVPLL 793
RDGTLEF+ +L R+EFAAPTEELELVPLL
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc07g04420 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 932.9 bits (2410), Expect = 2.1e-267
Identity = 512/769 (66.58%), Postives = 563/769 (73.21%), Query Frame = 0
Query: 1 MVQPVNSTNTTDRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSAQITAPALPPAHP 60
MVQP NSTNT DRR LAA+ HQREVGA VVEGQGH+ L TEPL RSA+IT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAMRTQMRSMEAMYNEMKRGSHLG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 PAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPV-- 180
AESS+NP+
Sbjct: 121 --------------------------------------------------AESSYNPITP 180
Query: 181 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFTSDVLEAPIPPKFKAPT 240
G+ITREEFDQL+ + DAQVEALKA+CE+K+ S + GDLGE F+SD+LEA IPPKFK PT
Sbjct: 181 GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPT 240
Query: 241 VKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 300
+KPYD +KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQ
Sbjct: 241 MKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQ 300
Query: 301 LRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQLKVAHGSDDSAMCYFL 360
LR+EF++QFSSRHYD+KT THLATIRQKEG TLREYVTRF EEQLKVAH SDDSAMCYFL
Sbjct: 301 LRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFL 360
Query: 361 TGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKIGRGISGKDE-RA 420
TGLAD+ LTVKL EEAPATFA+VLQK KKVIDGQELLRTKTGRPE+ I +G +GKD+ +A
Sbjct: 361 TGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKA 420
Query: 421 DPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRP 480
D KS+DKG S SS R +YRR+ S +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRP
Sbjct: 421 DSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRP 480
Query: 481 EKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRISSAEKK 540
EK RG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR +S EKK
Sbjct: 481 EKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKK 540
Query: 541 EERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITF 600
EERKR RTPPRR DR AVIN K+KELAR ARREVCIIREQ PT I F
Sbjct: 541 EERKRLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAF 600
Query: 601 DGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP 660
+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SP
Sbjct: 601 NHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSP 650
Query: 661 TLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRSTIHSFRAI 720
T LVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVV+DGRSAYNAIFGR IHSFRA+
Sbjct: 661 TPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAV 650
Query: 721 PSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLAGRD 766
PSTLHQVLKY T NGVG VRGE SRECYAS K SSVCALE RD
Sbjct: 721 PSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALEEQTIRD 650
BLAST of Moc07g04420 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 778.1 bits (2008), Expect = 8.7e-221
Identity = 399/422 (94.55%), Postives = 405/422 (95.97%), Query Frame = 0
Query: 207 KDDSLNGGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASD 266
KDDSLN GDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 267 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 326
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 327 EGVTLREYVTRFQEEQLKVAHGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAK 386
EG TLREYVTRFQEEQLKVAH SDDSAMCYFLTGLAD+ALTVKLGE+AP TFA+VLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 387 KVIDGQELLRTKTGRPERKIGRGISGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSR 446
KVIDGQELLRTKTGRP+RKIGRG SGKD ERADPKSKDKGSFSSGRAEYRRAESGPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 447 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDC 506
PYERFTPTTIPISEILTNIEESGMEKLLKRPEK RGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 507 WELKRQIEDLIQDGYFKKFVGKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGG 566
WELKRQIEDLIQDGYFKKFVGKPR SSAEKKEERKRSRTPPRRTDR AVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 567 QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 626
QSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 627 VL 628
VL
Sbjct: 464 VL 465
BLAST of Moc07g04420 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 777.7 bits (2007), Expect = 1.1e-220
Identity = 402/446 (90.13%), Postives = 413/446 (92.60%), Query Frame = 0
Query: 354 MCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKIGRGISGK 413
MCYFLTGLAD+ALTVKL EEAPATFA+VLQKAKKVIDGQELLRT KIG+G SGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 414 D-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 473
D E DPKSKDKGSFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 474 LKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRISS 533
LKRPEK RGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPR SS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 534 AEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC 593
AEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 594 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 653
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 654 KRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRSTIHS 713
K+SPT LVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGR IHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 714 FRAIPSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEV 773
FRAIPSTLHQVLKY TPNGVG VRGEQTASRECYAS LKG+SVCALETL RDGTLEFE
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 774 DLSRKEFAAPTEELELVPLLSPEKQL 799
DL +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc07g04420 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 945.7 bits (2443), Expect = 1.5e-271
Identity = 506/630 (80.32%), Postives = 526/630 (83.49%), Query Frame = 0
Query: 166 SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFT 225
SSNQQAESSHNP G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LN GDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 226 SDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 285
SDVLE APTVK YD +KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 286 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQ 345
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 346 LKVAHGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRP 405
LKVA SDDSAMCYFLTGLAD+ALTVKLG+EAPATFA+VLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 406 ERKIGRGISGKDERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILT 465
ER I RG SGKDE+AD KSKDKGSFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 466 NIEESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 525
NIEESGMEKLLKRPEK RGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 526 KFVGKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREV 585
KFVGKPR SSAEKKEERK SRTP RR DR AVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 586 CIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 645
CIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 646 LALGWTRSQLKRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYN 705
LALGWTRSQLK+S T LVGFS ESVIPEGCIDLPVTLG DQT+VTQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 706 AIFGRSTIHSFRAIPSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLA 765
AIFGR IHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYASALKGSSVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 570
Query: 766 GRDGTLEFEVDLSRKEFAAPTEELELVPLL 793
RDGTLEF+ +L R+EFAAPTEELELVPLL
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc07g04420 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 945.7 bits (2443), Expect = 1.5e-271
Identity = 489/528 (92.61%), Postives = 501/528 (94.89%), Query Frame = 0
Query: 170 QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFTSDVL 229
+AESS N P G+ITREEFDQLRG+LDAQVEALKAKCEQK+ LN GDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 230 EAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 289
EAPIPPKFKAPTVKPYD +KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 290 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQLKVA 349
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEG TLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 350 HGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKI 409
H SDDSAMCYFLTGLAD+ALTVKLGEEAPATFA+VLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 410 GRGISGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE 469
GRG SGKD E ADPKSKDKGSFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 470 ESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 529
ESGMEKLLKRPEK RGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 530 GKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCII 589
GKPR SSAEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 590 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 649
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 650 GWTRSQLKRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV 694
GWTRSQLK+SPT LVGFSGESVIPEG IDLPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc07g04420 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 932.9 bits (2410), Expect = 1.0e-267
Identity = 512/769 (66.58%), Postives = 563/769 (73.21%), Query Frame = 0
Query: 1 MVQPVNSTNTTDRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSAQITAPALPPAHP 60
MVQP NSTNT DRR LAA+ HQREVGA VVEGQGH+ L TEPL RSA+IT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARDPAPAPTSENFDALKREMEAMRTQMRSMEAMYNEMKRGSHLG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 PAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPV-- 180
AESS+NP+
Sbjct: 121 --------------------------------------------------AESSYNPITP 180
Query: 181 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNGGDLGESPFTSDVLEAPIPPKFKAPT 240
G+ITREEFDQL+ + DAQVEALKA+CE+K+ S + GDLGE F+SD+LEA IPPKFK PT
Sbjct: 181 GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPT 240
Query: 241 VKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 300
+KPYD +KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLPAR ISTYSQ
Sbjct: 241 MKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQ 300
Query: 301 LRREFLAQFSSRHYDKKTATHLATIRQKEGVTLREYVTRFQEEQLKVAHGSDDSAMCYFL 360
LR+EF++QFSSRHYD+KT THLATIRQKEG TLREYVTRF EEQLKVAH SDDSAMCYFL
Sbjct: 301 LRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFL 360
Query: 361 TGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKIGRGISGKDE-RA 420
TGLAD+ LTVKL EEAPATFA+VLQK KKVIDGQELLRTKTGRPE+ I +G +GKD+ +A
Sbjct: 361 TGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKA 420
Query: 421 DPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRP 480
D KS+DKG S SS R +YRR+ S +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRP
Sbjct: 421 DSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRP 480
Query: 481 EKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRISSAEKK 540
EK RG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR +S EKK
Sbjct: 481 EKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKK 540
Query: 541 EERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITF 600
EERKR RTPPRR DR AVIN K+KELAR ARREVCIIREQ PT I F
Sbjct: 541 EERKRLRTPPRRDDRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAF 600
Query: 601 DGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSP 660
+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLALGWTRSQLK+SP
Sbjct: 601 NHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSP 650
Query: 661 TLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRSTIHSFRAI 720
T LVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVV+DGRSAYNAIFGR IHSFRA+
Sbjct: 661 TPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAV 650
Query: 721 PSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLAGRD 766
PSTLHQVLKY T NGVG VRGE SRECYAS K SSVCALE RD
Sbjct: 721 PSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALEEQTIRD 650
BLAST of Moc07g04420 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 778.1 bits (2008), Expect = 4.2e-221
Identity = 399/422 (94.55%), Postives = 405/422 (95.97%), Query Frame = 0
Query: 207 KDDSLNGGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASD 266
KDDSLN GDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 267 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 326
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 327 EGVTLREYVTRFQEEQLKVAHGSDDSAMCYFLTGLADKALTVKLGEEAPATFADVLQKAK 386
EG TLREYVTRFQEEQLKVAH SDDSAMCYFLTGLAD+ALTVKLGE+AP TFA+VLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 387 KVIDGQELLRTKTGRPERKIGRGISGKD-ERADPKSKDKGSFSSGRAEYRRAESGPTRSR 446
KVIDGQELLRTKTGRP+RKIGRG SGKD ERADPKSKDKGSFSSGRAEYRRAESGPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 447 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKPRGAPERRSKDKYCRFHREHGHNTSDC 506
PYERFTPTTIPISEILTNIEESGMEKLLKRPEK RGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 507 WELKRQIEDLIQDGYFKKFVGKPRISSAEKKEERKRSRTPPRRTDRTAVINTIFGGPSGG 566
WELKRQIEDLIQDGYFKKFVGKPR SSAEKKEERKRSRTPPRRTDR AVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 567 QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 626
QSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 627 VL 628
VL
Sbjct: 464 VL 465
BLAST of Moc07g04420 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 777.7 bits (2007), Expect = 5.5e-221
Identity = 402/446 (90.13%), Postives = 413/446 (92.60%), Query Frame = 0
Query: 354 MCYFLTGLADKALTVKLGEEAPATFADVLQKAKKVIDGQELLRTKTGRPERKIGRGISGK 413
MCYFLTGLAD+ALTVKL EEAPATFA+VLQKAKKVIDGQELLRT KIG+G SGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 414 D-ERADPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 473
D E DPKSKDKGSFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 474 LKRPEKPRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRISS 533
LKRPEK RGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPR SS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 534 AEKKEERKRSRTPPRRTDRTAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC 593
AEKKEERKRSRTPPRRTDR AVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 594 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 653
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 654 KRSPTLLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRSTIHS 713
K+SPT LVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGR IHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 714 FRAIPSTLHQVLKYPTPNGVGAVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEV 773
FRAIPSTLHQVLKY TPNGVG VRGEQTASRECYAS LKG+SVCALETL RDGTLEFE
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 774 DLSRKEFAAPTEELELVPLLSPEKQL 799
DL +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022137317.1 | 3.1e-271 | 92.61 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022150760.1 | 3.1e-271 | 80.32 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022152854.1 | 2.1e-267 | 66.58 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022150613.1 | 8.7e-221 | 94.55 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
XP_022152110.1 | 1.1e-220 | 90.13 | uncharacterized protein LOC111019899 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D9E1 | 1.5e-271 | 80.32 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 1.5e-271 | 92.61 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 1.0e-267 | 66.58 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9W7 | 4.2e-221 | 94.55 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DD03 | 5.5e-221 | 90.13 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
Match Name | E-value | Identity | Description | |