Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCAGTGGTAGAGGGGCAAGGTCACGATGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCTAAGGCAACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGTGCCCGGGGTCCAGACCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATCTATAACGAAATGATATTAGCTGCCGGTGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAGGGCGTTCCCACCTCGGCCCAGTCGACGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAACAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCACGATGCTCAGGTTAAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGTCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGGTCCTTGAGGGCCTCATGGATTTTCAAGCGACATCAGACGCAATCAAATGTCACGCCTTTCAGATCGCGCTTACTGGCAGCGCGCATTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCAACCCATCTCGCCACCATCAGACAAAAAGAAGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAGGATCGGCCTAGACAGAAGCGGAAAAGGTGAAAAGGCGGATCCCAAGTACAAGGACAAGGGATCTTTCTCCAGTGGACGAGCTGAGTTTCGAAGGGCGGTGAACAGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACAATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCAGAAAGGCGCAGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCAGACTGCTGGGAGTTGAAACGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAAGAGGAGCGTAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGTAGCTAGGCACGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCATACTTGGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCTTTAACGACCTACCTCGTCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGGTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTCCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGATGAGAGAGTTTGCCGCACCCACTGAGGAGCTCAAGCTTGTTCCTCTACTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTTAGATCCAACTCGGACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGCAAATTATGACGCATCGCCTCAGCATAGATCCGTCATTCCGACCTGTAAAACAAAAAAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTGTTGAAAGCTGAATACATAAGAGAAATTTTGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAAGTGGAGAATGTGCGTAGACTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTTCCACTGCCGAGGATTGATCAGCTTGTGGACGCCACAGCCGGGCACGAACTGCTCACTTTCATGGATGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCGGATCAAGATCATACCGCATTCATAACAGACCAAGGTCTGTACTGTTACAAGGTCATGCCCTTCGGTTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAACAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTCGTCAAGAGCAAGCAGTCTAAGTCGCATCTTTCCGATCTGACCGAAGCCTTCGAGGTTCTAAGGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACCACCGGGGGATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGCTCGAGATGGAGGCACCCAAAACGCTGAGACAGCTTTAGTGCCTCAATGGCAGGATTGCAGCCCTGAACCGGTTTGTTTCAAGATCGACAGATAAGTGCCTCCCTTTCTTCAAAGTCGTACGAAAGAAAAGGCCGTTTGAATGGACAGCGGAGTGTGAGCAAGCATTTCAGCAATTGAAGAGCTACCTCTGCTCGGCACCTTTGCTCGCCAAGCCCCTGCTAGGGGACAAGCTCCAGTTGTACTTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACCAGATACCCTCAAATGGAAAAGTTGGCTCTCGCTTTAGTCACCTCGGCCCGACGGCTTAGACCATACTTCCAAGCCCATACTGTGGTGGTGCTCACTAACTTGCCCCTAAAAAACATCTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCCGAGCGAGTCCGACCTACCGTGGACAATCTATGTCGACGGATCCTCCAATGAGAAAGGGTGCGGGGTCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGTCTGCGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGCGACTCTCAGCTGGTTGTGAGCTAGATCAAGGAAGAGTACCAAGCCAAAGACTCCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAAGTTACGATGTAAGCCGGGTTCCCCGAGCAGAAAATTCTAATGCCGACGCCTTGGCCAAGTTAGCATCAACGTACAAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGGTAATCCCTCGATCTCGGAGCCAGATCTGATGGAGGTCGACGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGAAGCGCAGAAAGTTGGCAAGGCAAGCAGCTCGGTTCGTCGTCCGAGGTGGAGCATTGTACCGACGCCGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTATACGTCCTCAGAGAAATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGACCGACCCTCAGCCAGGACACCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAAACGTAATCCACCAACCTCTCGAGCTGCTTACCCCCATCACGGCCCCATGGCCATTCGCGCAGTGGGGGGTAGATATTATTGGTCCTTTCCCTCTGGGCAAAGGCCAGACATAGTTCGCTGTAGTGGCTGTGGATTACTTCACAAAGTGGACCGAGGCCGAGGCGCTCTCCCACATAACGGTATCCAGAGTCACATCCTTAATATGGACAAATATCATATGTCGCTTTGGTATACCGCAGGCCATTGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAGCTTGGCATAAGTCACCTTAGCTCGTCCCCCGCACATCCGCAAGTAAACGGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGTCGAGGAGTTACCAGAGGTTCTATGGTCGTACCGGACCACCCAAAGAGAATTGACGGGTGAAACCCCGTTCTCCATGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGAAATGCCATCTGATAGAGTAGAGCATTACGAGCCTACAACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGTCCAGCTACGCCTGGCAGAATATCAAGGCAGAATGGCCAGACACTACAACGCCCGCGTTCGACCTTTGACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCTGTTTGAGGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCATGA
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCAGTGGTAGAGGGGCAAGGTCACGATGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCTAAGGCAACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGTGCCCGGGGTCCAGACCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATCTATAACGAAATGATATTAGCTGCCGGTGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAGGGCGTTCCCACCTCGGCCCAGTCGACGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAACAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCACGATGCTCAGGTTAAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGTCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGGTCCTTGAGGGCCTCATGGATTTTCAAGCGACATCAGACGCAATCAAATGTCACGCCTTTCAGATCGCGCTTACTGGCAGCGCGCATTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCAACCCATCTCGCCACCATCAGACAAAAAGAAGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAGGATCGGCCTAGACAGAAGCGGAAAAGGTGAAAAGGCGGATCCCAAGTACAAGGACAAGGGATCTTTCTCCAGTGGACGAGCTGAGTTTCGAAGGGCGGTGAACAGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACAATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCAGAAAGGCGCAGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCAGACTGCTGGGAGTTGAAACGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAAGAGGAGCGTAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGTAGCTAGGCACGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCATACTTGGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCTTTAACGACCTACCTCGTCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGGTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTCCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGATGAGAGAGTTTGCCGCACCCACTGAGGAGCTCAAGCTTGTTCCTCTACTTAGTCCCGAGAAGCAAACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGGTAATCCCTCGATCTCGGAGCCAGATCTGATGGAGGTCGACGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGAAGCGCAGAAAGTTGGCAAGGCAAGCAGCTCGGTTCGTCGTCCGAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCTGTTTGAGGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCATGA
Coding sequence (CDS)
ATGGTTCAACCCGCAAACTCGACCAATACGGCAGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCAGTGGTAGAGGGGCAAGGTCACGATGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTTCTACCACCTGCGCACCCCCCAAGGACATCTAAGGCAACCCGTGGCCGAGGTGGAACCTCTAAGAAGGGTGCCCGGGGTCCAGACCCGGCCCCGACAAGTGAGAACTTGGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACGAAAATGCGGTCCATGGAGGAAATCTATAACGAAATGATATTAGCTGCCGGTGCAGGGTCCCGATCTGAGAACCGAGTGACGCGCGTTGGCATACGCGAGCAAGGGCGTTCCCACCTCGGCCCAGTCGACGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTCCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAACAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCACGATGCTCAGGTTAAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGTCATTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCAAAGGACCCTAAGGATTATGTTGAGGTCCTTGAGGGCCTCATGGATTTTCAAGCGACATCAGACGCAATCAAATGTCACGCCTTTCAGATCGCGCTTACTGGCAGCGCGCATTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCAACCCATCTCGCCACCATCAGACAAAAAGAAGGTGAGACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGATGACTCGGCCATGTGCTACTTTCTCACCGGCCTAGCCGACGAAGCCCTCACGGTGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAAGTGCTGCAGAAGGCGAAGAAAGTCATCGATGGGCAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAGGATCGGCCTAGACAGAAGCGGAAAAGGTGAAAAGGCGGATCCCAAGTACAAGGACAAGGGATCTTTCTCCAGTGGACGAGCTGAGTTTCGAAGGGCGGTGAACAGACCCACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACAATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCCGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCAGAAAGGCGCAGTAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCAGACTGCTGGGAGTTGAAACGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTATGGGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAAGAGGAGCGTAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACACAAAAGAAAGGAGTTAGCTCGTGTAGCTAGGCACGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACAGTGCATACTTGGAGGAGGTCCACCTGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCTTTAACGACCTACCTCGTCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGGTGCATCGACTTGCCGGTCACGCTTGGGCAGGACCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGAGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCGCTCAAAGGCTCATCGGTCTCCGCCATCGAAACTCTCGCCAGTGGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGATGAGAGAGTTTGCCGCACCCACTGAGGAGCTCAAGCTTGTTCCTCTACTTAGTCCCGAGAAGCAAACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGGTAATCCCTCGATCTCGGAGCCAGATCTGATGGAGGTCGACGCTCCAGAGCCCTCATGGATGGACCCGATTGTGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGAAGCGCAGAAAGTTGGCAAGGCAAGCAGCTCGGTTCGTCGTCCGAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCTGTTTGAGGTCAAGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCATGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPPRTSKATRGRGGTSKKGARGPDPAPTSENLDALQREMEAMRTKMRSMEEIYNEMILAAGAGSRSENRVTRVGIREQGRSHLGPVDEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQIALTGSAHLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGLDRSGKGEKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEVCIIREQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTYLVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVSAIETLASGDGTLEFEADLPMREFAAPTEELKLVPLLSPEKQTDLARSVPVEILGNPSISEPDLMEVDAPEPSWMDPIVDFIRGNSPQDPKKRRKLARQAARFVVRVGHLVLRKVQTHVGALDPTWEGLFEVKGIVRPGTYILADLKGDVLAHP
Homology
BLAST of Moc04g08840 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 960.7 bits (2482), Expect = 9.3e-276
Identity = 519/665 (78.05%), Postives = 539/665 (81.05%), Query Frame = 0
Query: 188 SSNQQAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGDLGESSFT 247
SSNQQAESSHNP TP GVITR EFDQLRGK +AQV+ALKAKCEQKEGPLNDGDLGES FT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 248 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQIALTGSAH 307
SDVLE APTVK YDGSKDPKDYVEV EGLMDFQA SDAIKC AFQIALTGSA
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 308 LWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 367
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 368 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 427
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 428 ERRIGLDRSGKGEKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILT 487
ER I RSGK EKAD K KDKGSFSSGRAEFRRAVN PTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 488 NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 547
NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 548 KFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEV 607
KF+GKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELAR AR EV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 608 CIIREQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTY 667
CIIREQRPTCPITFDSA LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 668 LVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYN 727
L LGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 728 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVSAIETLA 787
AIFGRPIIHSFRAIPSTLHQVLKYSTPNG G VRGEQ ASRECYASALKGSSV A+ETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 788 SGDGTLEFEADLPMREFAAPTEELKLVPLLSPEKQTDLARSVPVEILGNPSISEPDLMEV 847
S DGTLEF+A+LP REFAAPTEEL+LVPLL + ++ ++ + + + D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDDDIGVE 605
Query: 848 DAPEP 853
PEP
Sbjct: 662 GMPEP 605
BLAST of Moc04g08840 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 951.0 bits (2457), Expect = 7.4e-273
Identity = 488/528 (92.42%), Postives = 497/528 (94.13%), Query Frame = 0
Query: 192 QAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGDLGESSFTSDVL 251
+AESS NP TPAGVITR EFDQLRG+ DAQV+ALKAKCEQKEGPLNDGDLGES FTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 252 EAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQIALTGSAHLWYR 311
EAPIPPKFKAPTVKPYDGSKDPKDYVEV E LMDFQA SDAIKC AF+IALTGSA LWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 312 RLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 371
RLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 372 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRI 431
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER+I
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 432 GLDRSGKG-EKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILTNIE 491
G RSGK E ADPK KDKGSFSSGRAE+RRA N PTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 492 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFM 551
ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKF+
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 552 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEVCII 611
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELAR AR EVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 612 REQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTYLVL 671
REQRPTCPITFD A LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSL TYL L
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 672 GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFV 719
GWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTLGQDQTQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc04g08840 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 935.6 bits (2417), Expect = 3.2e-268
Identity = 520/786 (66.16%), Postives = 566/786 (72.01%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQREV A VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 PRTSKATRGRGGTSKKGARGPDPAPTSENLDALQREMEAMRTKMRSMEEIYNEMILAAGA 120
Sbjct: 61 ------------------------------------------------------------ 120
Query: 121 GSRSENRVTRVGIREQGRSHLGPVDEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 SPSRSHRSSNQQAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGD 240
PS+ AESS+NP TP GVITR EFDQL+ K DAQV+ALKA+CE+KE +DGD
Sbjct: 181 KPSK--------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGD 240
Query: 241 LGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQI 300
LGE SF+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEV E LMDFQA +DAIKC AFQI
Sbjct: 241 LGELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQI 300
Query: 301 ALTGSAHLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYV 360
ALTGSA LWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYV
Sbjct: 301 ALTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYV 360
Query: 361 TRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELL 420
TRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELL
Sbjct: 361 TRFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELL 420
Query: 421 RTKTGRPERRIGLDRSGKGE-KADPKYKDKG-SFSSGRAEFRRAVNRPTRSRPYERFTPT 480
RTKTGRPE+ I R+GK + KAD K +DKG S SS R ++RR+ + +SRPYE +TPT
Sbjct: 421 RTKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPT 480
Query: 481 TIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIE 540
TIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIE
Sbjct: 481 TIPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIE 540
Query: 541 DLIQDGYFKKFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE 600
DLIQDGYFKKF+GKPR++S EKKEERKR RTPPRR DRPAVIN K+KE
Sbjct: 541 DLIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKE 600
Query: 601 LARVARHEVCIIREQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS 660
LAR AR EVCIIREQRPT I F+ A LE VHLPHNDALVIAPLID V+VRR+LVDGGAS
Sbjct: 601 LAREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGAS 644
Query: 661 ANILSLTTYLVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFV 720
ANILSL+TYL LGWTRSQLKKSPTPLVGFSGES+ EGCIDLPV++ QD TQVTQMAEFV
Sbjct: 661 ANILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFV 644
Query: 721 VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGS 780
VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NG GTVRGE SRECYAS K S
Sbjct: 721 VIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRS 644
Query: 781 SVSAIE 785
SV A+E
Sbjct: 781 SVCALE 644
BLAST of Moc04g08840 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 774.2 bits (1998), Expect = 1.2e-219
Identity = 402/448 (89.73%), Postives = 412/448 (91.96%), Query Frame = 0
Query: 379 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGLDRSGK 438
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK IG RSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTK-------IGQGRSGK 60
Query: 439 G-EKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILTNIEESGMEKL 498
E DPK KDKGSFS+GRAE+RRA N PTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 499 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGKPRTSS 558
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKF+GKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 559 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEVCIIREQRPTC 618
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LAR AR EVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 619 PITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTYLVLGWTRSQL 678
PITFD A L EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL TYL LGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 679 KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHS 738
KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 739 FRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVSAIETLASGDGTLEFEA 798
FRAIPSTLHQVLKYSTPNG GTVRGEQTASRECYAS LKG+SV A+ETL S DGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 799 DLPMREFAAPTEELKLVPLLSPEKQTDL 826
DLP REFAAP EEL+LVPLLS EKQ L
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQL 441
BLAST of Moc04g08840 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 759.6 bits (1960), Expect = 3.2e-215
Identity = 404/546 (73.99%), Postives = 442/546 (80.95%), Query Frame = 0
Query: 284 MDFQATSDAIKCHAFQIALTGSAHLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTAT 343
MDFQA +DAIKC AFQIALTGSA LWYRRLPARSISTY+QLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 344 HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 403
HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 404 AEVLQKAKKVIDGQELLRTKTGRPERRIGLDR-SGKGEKADPKYKDKGSFSS-GRAEFRR 463
EVLQKAKKVIDGQELLRTKTGRPE++I + S + KAD K +DKGS SS R E+RR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 464 AVNRPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHR 523
+ P+RSRPYER+T +TIPISEILTNIEESGMEKLLKRPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 524 EHGHNTSDCWELKRQIEDLIQDGYFKKFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN 583
+HGHNT+ CWELKRQIEDLIQDGYFKKF+GKPR++S EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 584 TIFGGPSGGQSGHKRKELARVARHEVCIIREQRPTCPITFDSAYLEEVHLPHNDALVIAP 643
TIFGGP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF A LE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 644 LIDHVVVRRVLVDGGASANILSLTTYLVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLP 703
LIDH +VRRVL+DG GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 704 VTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVR 763
VT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN G VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 764 GEQTASRECYASALKGSSVSAIETLASGDGTLEFEADLP---MREFAAPTEELKLVPLLS 823
GEQ SRECYASALKGS+V A+E + E EADLP R+F PTEEL+LVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 506
Query: 824 PEKQTD 825
PE+Q +
Sbjct: 541 PERQAN 506
BLAST of Moc04g08840 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 960.7 bits (2482), Expect = 4.5e-276
Identity = 519/665 (78.05%), Postives = 539/665 (81.05%), Query Frame = 0
Query: 188 SSNQQAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGDLGESSFT 247
SSNQQAESSHNP TP GVITR EFDQLRGK +AQV+ALKAKCEQKEGPLNDGDLGES FT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 248 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQIALTGSAH 307
SDVLE APTVK YDGSKDPKDYVEV EGLMDFQA SDAIKC AFQIALTGSA
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 308 LWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 367
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 368 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 427
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 428 ERRIGLDRSGKGEKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILT 487
ER I RSGK EKAD K KDKGSFSSGRAEFRRAVN PTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 488 NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 547
NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 548 KFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEV 607
KF+GKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELAR AR EV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 608 CIIREQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTY 667
CIIREQRPTCPITFDSA LEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 668 LVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYN 727
L LGWTRSQLKKS TPLVGFS ESVIPEGCIDLPVTLG DQTQVTQMAEFVVIDGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 728 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVSAIETLA 787
AIFGRPIIHSFRAIPSTLHQVLKYSTPNG G VRGEQ ASRECYASALKGSSV A+ETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 788 SGDGTLEFEADLPMREFAAPTEELKLVPLLSPEKQTDLARSVPVEILGNPSISEPDLMEV 847
S DGTLEF+A+LP REFAAPTEEL+LVPLL + ++ ++ + + + D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDDDIGVE 605
Query: 848 DAPEP 853
PEP
Sbjct: 662 GMPEP 605
BLAST of Moc04g08840 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 951.0 bits (2457), Expect = 3.6e-273
Identity = 488/528 (92.42%), Postives = 497/528 (94.13%), Query Frame = 0
Query: 192 QAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGDLGESSFTSDVL 251
+AESS NP TPAGVITR EFDQLRG+ DAQV+ALKAKCEQKEGPLNDGDLGES FTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 252 EAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQIALTGSAHLWYR 311
EAPIPPKFKAPTVKPYDGSKDPKDYVEV E LMDFQA SDAIKC AF+IALTGSA LWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 312 RLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 371
RLPA SISTY+QLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 372 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRI 431
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPER+I
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 432 GLDRSGKG-EKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILTNIE 491
G RSGK E ADPK KDKGSFSSGRAE+RRA N PTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 492 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFM 551
ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKF+
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 552 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEVCII 611
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELAR AR EVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 612 REQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTYLVL 671
REQRPTCPITFD A LEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSL TYL L
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 672 GWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFV 719
GWTRSQLKKSPTPLVGFSGESVIPEG IDLPVTLGQDQTQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc04g08840 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 935.6 bits (2417), Expect = 1.6e-268
Identity = 520/786 (66.16%), Postives = 566/786 (72.01%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTNTADRR LAA+ HQREV A VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 PRTSKATRGRGGTSKKGARGPDPAPTSENLDALQREMEAMRTKMRSMEEIYNEMILAAGA 120
Sbjct: 61 ------------------------------------------------------------ 120
Query: 121 GSRSENRVTRVGIREQGRSHLGPVDEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQ 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 SPSRSHRSSNQQAESSHNPTTPAGVITRAEFDQLRGKHDAQVKALKAKCEQKEGPLNDGD 240
PS+ AESS+NP TP GVITR EFDQL+ K DAQV+ALKA+CE+KE +DGD
Sbjct: 181 KPSK--------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGD 240
Query: 241 LGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVLEGLMDFQATSDAIKCHAFQI 300
LGE SF+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEV E LMDFQA +DAIKC AFQI
Sbjct: 241 LGELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQI 300
Query: 301 ALTGSAHLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYV 360
ALTGSA LWYRRLPAR ISTY+QLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYV
Sbjct: 301 ALTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYV 360
Query: 361 TRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELL 420
TRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELL
Sbjct: 361 TRFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELL 420
Query: 421 RTKTGRPERRIGLDRSGKGE-KADPKYKDKG-SFSSGRAEFRRAVNRPTRSRPYERFTPT 480
RTKTGRPE+ I R+GK + KAD K +DKG S SS R ++RR+ + +SRPYE +TPT
Sbjct: 421 RTKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPT 480
Query: 481 TIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIE 540
TIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIE
Sbjct: 481 TIPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIE 540
Query: 541 DLIQDGYFKKFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKE 600
DLIQDGYFKKF+GKPR++S EKKEERKR RTPPRR DRPAVIN K+KE
Sbjct: 541 DLIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKE 600
Query: 601 LARVARHEVCIIREQRPTCPITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS 660
LAR AR EVCIIREQRPT I F+ A LE VHLPHNDALVIAPLID V+VRR+LVDGGAS
Sbjct: 601 LAREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGAS 644
Query: 661 ANILSLTTYLVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFV 720
ANILSL+TYL LGWTRSQLKKSPTPLVGFSGES+ EGCIDLPV++ QD TQVTQMAEFV
Sbjct: 661 ANILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFV 644
Query: 721 VIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGS 780
VIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NG GTVRGE SRECYAS K S
Sbjct: 721 VIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRS 644
Query: 781 SVSAIE 785
SV A+E
Sbjct: 781 SVCALE 644
BLAST of Moc04g08840 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 774.2 bits (1998), Expect = 6.0e-220
Identity = 402/448 (89.73%), Postives = 412/448 (91.96%), Query Frame = 0
Query: 379 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERRIGLDRSGK 438
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK IG RSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTK-------IGQGRSGK 60
Query: 439 G-EKADPKYKDKGSFSSGRAEFRRAVNRPTRSRPYERFTPTTIPISEILTNIEESGMEKL 498
E DPK KDKGSFS+GRAE+RRA N PTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 499 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFMGKPRTSS 558
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKF+GKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 559 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARVARHEVCIIREQRPTC 618
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LAR AR EVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 619 PITFDSAYLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLTTYLVLGWTRSQL 678
PITFD A L EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSL TYL LGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 679 KKSPTPLVGFSGESVIPEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHS 738
KKSPTPLVGFSGESV+PEGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 739 FRAIPSTLHQVLKYSTPNGEGTVRGEQTASRECYASALKGSSVSAIETLASGDGTLEFEA 798
FRAIPSTLHQVLKYSTPNG GTVRGEQTASRECYAS LKG+SV A+ETL S DGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 799 DLPMREFAAPTEELKLVPLLSPEKQTDL 826
DLP REFAAP EEL+LVPLLS EKQ L
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQL 441
BLAST of Moc04g08840 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 759.6 bits (1960), Expect = 1.5e-215
Identity = 404/546 (73.99%), Postives = 442/546 (80.95%), Query Frame = 0
Query: 284 MDFQATSDAIKCHAFQIALTGSAHLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTAT 343
MDFQA +DAIKC AFQIALTGSA LWYRRLPARSISTY+QLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 344 HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 403
HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 404 AEVLQKAKKVIDGQELLRTKTGRPERRIGLDR-SGKGEKADPKYKDKGSFSS-GRAEFRR 463
EVLQKAKKVIDGQELLRTKTGRPE++I + S + KAD K +DKGS SS R E+RR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 464 AVNRPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHR 523
+ P+RSRPYER+T +TIPISEILTNIEESGMEKLLKRPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 524 EHGHNTSDCWELKRQIEDLIQDGYFKKFMGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN 583
+HGHNT+ CWELKRQIEDLIQDGYFKKF+GKPR++S EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 584 TIFGGPSGGQSGHKRKELARVARHEVCIIREQRPTCPITFDSAYLEEVHLPHNDALVIAP 643
TIFGGP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF A LE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 644 LIDHVVVRRVLVDGGASANILSLTTYLVLGWTRSQLKKSPTPLVGFSGESVIPEGCIDLP 703
LIDH +VRRVL+DG GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 704 VTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGEGTVR 763
VT+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN G VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 764 GEQTASRECYASALKGSSVSAIETLASGDGTLEFEADLP---MREFAAPTEELKLVPLLS 823
GEQ SRECYASALKGS+V A+E + E EADLP R+F PTEEL+LVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 506
Query: 824 PEKQTD 825
PE+Q +
Sbjct: 541 PERQAN 506
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D9E1 | 4.5e-276 | 78.05 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 3.6e-273 | 92.42 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 1.6e-268 | 66.16 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DD03 | 6.0e-220 | 89.73 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1DZB9 | 1.5e-215 | 73.99 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |