Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTTCAACCAGCAAACTCGACCAATACGGCGGATCGGACGACTCTAGCCGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCGGCGGTGGTAGAGAGGCAAGGCCACGGCGGCCCGGCAACAGAACCCCTCCGCAGGTCGGCACATATCACCGCTCCTGTTCTACCGCCTGCTCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAGCGAGTGAGGACTTTGATGCACTCCAGAGAGAAATGGAAGCAATGCGCACGCAAATGCAGTCCATGGAGGAAATGTATAACGAAATGATACTAGCCGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTTGACATACGGGAGCAAAGAGGTTCCCACCTCGGCCCAGCCGAGGAGGAACATCCCAAGGACAACGAGAGCGAGGGGCACACTCACCAGAGAGGAGACCTTCGTGAGCACCTAACCAGAAAGAGAGACTCATCCCTCCGGAAAAGACAGTCACCATCCTGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGTAACCCAGTAACTCCTGCAGGAATGATTACCAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGTCAACCTTCACCGAGGTGCTTCAGAAAACGAAGAAAGTCATCGATGGACATGAGCTCCTTCGAACCAAAACCGGTCGACCAGAACGAAAAATCAGCCGAGGCAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGGGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGATTATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCTCGGAGAGACGCAGCAAGGACAAATATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAGGATGGCTACTTCAAGAAATTTGTGGGAAAACCCAGGACCAGCTCGGCAGAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCAGGCGCACTGATCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCACAATGATGCAGTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCGTATGCTAACATCCTATCCTTACCGACCTA
CCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGCTGATCACGCTTGGGCAGGACCAAACTCGAGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGGAGATCGGCCTATAACGCCATCTTTGGAAGGCCCATCATCCATTCATTTCGGGCCATTCCCTCGACACTCCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGTACATCGGTCTGCGCCCTTGAAACTCTCACCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGATAAGCAAGTGAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGGTCCAACTCGGATGCCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGAAGATTATGACGCATCGCCTCAGCATAGATTCGTCATTCCGACCTGTAAAGCAAAAAAGAAGACCTATAAACAAGGAGCGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTAAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAAGTTGAGAATGTGCATAGATTTTACAAACTTAAATAAGGCATGCCCAAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCACGAACTACTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATCAAGATCATACCGCATTCATAACAGACCAAGGTCTGTACTGTTACAATGTCATGCCCTTCGGTTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATTGAAGTGTATGTGGACGACATGCTTGTCAAGAGTAAGCAGTCTAAGTCGCATCTCTCCGATCTGACCGAAACCTTCGAGGTTCTGAGGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACAACCGGGGAATCGAAGCCAACCCCGAAAAGATTAAAGCCGTGCTCGAGATGGAGGCACCTAAGACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCCAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAGTCCTACGGAGGAAAGGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCATCTCAGCAGTTGAAAAGCTACCTCGGTTCGGCACCTTTGCTCGCCAAGCCCATGCCGGGGGACAAGCTCCAGTTGTACTTGGCAGTGTCTGACAGCGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACCAGATACCCTCAAATGGAAAAGTTGGCTCTTGCTCTAGTCACCTCGGCCCGGCGGCTTAGACCATACTTCCAAGCCCATACTGTGGTGGTGCTCACTAACCTGCCCCTAAAAAACATCTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGTGGCAGATTTTATAGCCGAGCTCACACCAGCTTCCGAGCTGAGCGAGTCCGACCTGCCGTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTGAGTATG
CCTTGCCGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGGATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGCAACTCCCAGCTGGTTGTGAGCCAGATCAAGGAAGAGTACCAAGCCAAAGACTCTCGAATGGAGAGGTATTTGGGTAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTACGCCGGGTTCCCCGAGTAGAAAATTCTAATGCTGACGCCTTAGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGACGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAGTGTTGTACCGACACGGCTTTTCCCTACCTTTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAAATCCACGAAGGAGTGTGCGGCAATCACTCAAGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATATTATTGGCCGACCCTCAGCCAGGACGCCAAGAAATTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTTCTCACCCCCATCTCGACCCCGTGGCCATTCGCGCAGTGGGGGGTGGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGTTGTGGTTGCTGTGGATTACTTCACAAAGTGGGCCGAGGCCGAAGCGCTCTCCCACATAACGGAATCCAGAGTCACGTCTTTCGTATGGACAAATATCATATGTCGCTTTGGTATACCGCAGGCCATTGTGACGGACAATGGGAGGCAGTTTGACAAAGCCAAGTTCAAAGACTTTTGCACCAAGCTTGGCATAAGTCACCTCAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGTCGAGGAGCTACCAGAGGTTCTATGGTCGTACCGGACCACCCAAAGGGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTCGAGCATTACGAGCCTACGGCAAACGCGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAACAATGGCCCAGCTACGCCTGGCGGAATATTAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCGAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGTCCGTTTGAGGTCAAGGGCATAATCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCGGATCGGACGACTCTAGCCGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCGGCGGTGGTAGAGAGGCAAGGCCACGGCGGCCCGGCAACAGAACCCCTCCGCAGGTCGGCACATATCACCGCTCCTGTTCTACCGCCTGCTCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAGCGAGTGAGGACTTTGATGCACTCCAGAGAGAAATGGAAGCAATGCGCACGCAAATGCAGTCCATGGAGGAAATGTATAACGAAATGATACTAGCCGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTTGACATACGGGAGCAAAGAGGTTCCCACCTCGGCCCAGCCGAGGAGGAACATCCCAAGGACAACGAGAGCGAGGGGCACACTCACCAGAGAGGAGACCTTCGTGAGCACCTAACCAGAAAGAGAGACTCATCCCTCCGGAAAAGACAGTCACCATCCTGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGTAACCCAGTAACTCCTGCAGGAATGATTACCAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGTCAACCTTCACCGAGGTGCTTCAGAAAACGAAGAAAGTCATCGATGGACATGAGCTCCTTCGAACCAAAACCGGTCGACCAGAACGAAAAATCAGCCGAGGCAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGGGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGATTATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCTCGGAGAGACGCAGCAAGGACAAATATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAGGATGGCTACTTCAAGAAATTTGTGGGAAAACCCAGGACCAGCTCGGCAGAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCAGGCGCACTGATCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCACAATGATGCAGTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCGTATGCTAACATCCTATCCTTACCGACCTA
CCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGCTGATCACGCTTGGGCAGGACCAAACTCGAGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGGAGATCGGCCTATAACGCCATCTTTGGAAGGCCCATCATCCATTCATTTCGGGCCATTCCCTCGACACTCCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGTACATCGGTCTGCGCCCTTGAAACTCTCACCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGATAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGACGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCGAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGTCCGTTTGAGGTCAAGGGCATAATCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGCAAACTCGACCAATACGGCGGATCGGACGACTCTAGCCGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCGGCGGTGGTAGAGAGGCAAGGCCACGGCGGCCCGGCAACAGAACCCCTCCGCAGGTCGGCACATATCACCGCTCCTGTTCTACCGCCTGCTCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCTGGGGTCCAGCCCCGGCTCCAGCGAGTGAGGACTTTGATGCACTCCAGAGAGAAATGGAAGCAATGCGCACGCAAATGCAGTCCATGGAGGAAATGTATAACGAAATGATACTAGCCGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTTGACATACGGGAGCAAAGAGGTTCCCACCTCGGCCCAGCCGAGGAGGAACATCCCAAGGACAACGAGAGCGAGGGGCACACTCACCAGAGAGGAGACCTTCGTGAGCACCTAACCAGAAAGAGAGACTCATCCCTCCGGAAAAGACAGTCACCATCCTGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCGTAACCCAGTAACTCCTGCAGGAATGATTACCAGGGAGGAGTTCGACCAGCTGAGGGGCCAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTCGAAGGCCTCATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGTGAAACTCGGAGAGGAGGCCCCGTCAACCTTCACCGAGGTGCTTCAGAAAACGAAGAAAGTCATCGATGGACATGAGCTCCTTCGAACCAAAACCGGTCGACCAGAACGAAAAATCAGCCGAGGCAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGGGCTGAGTATCGGAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGATTATCGAGGAGTCTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCTCGGAGAGACGCAGCAAGGACAAATATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAGGATGGCTACTTCAAGAAATTTGTGGGAAAACCCAGGACCAGCTCGGCAGAAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCAGGCGCACTGATCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTTCACCTGCCCCACAATGATGCAGTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCGTATGCTAACATCCTATCCTTACCGACCTA
CCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGCTGATCACGCTTGGGCAGGACCAAACTCGAGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGGAGATCGGCCTATAACGCCATCTTTGGAAGGCCCATCATCCATTCATTTCGGGCCATTCCCTCGACACTCCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACCGCTTCAAGGGAGTGCTATGCCTCCGCACTCAAAGGTACATCGGTCTGCGCCCTTGAAACTCTCACCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGATAAGCAATTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGACGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAAGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCGAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGTCCGTTTGAGGTCAAGGGCATAATCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTADRTTLAASDAHQREVGAAVVERQGHGGPATEPLRRSAHITAPVLPPAHPRTSKATRGRGGTSKKGAWGPAPAPASEDFDALQREMEAMRTQMQSMEEMYNEMILAAGAGSRSENRVTRVDIREQRGSHLGPAEEEHPKDNESEGHTHQRGDLREHLTRKRDSSLRKRQSPSCSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRPERKISRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVPLLSPDKQLASAYETDLARSVPVEILDNPSISEPDLMETALQSPHGWTRLRTSLRATHHKTPRSAESWQGEQLGSWSEGRMARHYNARVRPRTFQVGHLVLRRVRTHVGALDPTWEGPFEVKGIIRPGTYVLADLKGDVLAHPWNAEHLKRYYP
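The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the choice to render unknown codons as `X` are illustrative and not part of the database page. Whitespace is stripped first because the sequences in this listing are wrapped across several lines.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces introduced by wrapping, then normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X' (an assumption of this sketch).
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGTTCAACCAGCAAACTCGACC"))  # MVQPANST
print(translate("TATTATCCTTGA"))              # YYP
```

The two fragments reproduce the first eight and last three residues of the protein sequence above (`MVQPANST…YYP`); passing the full CDS to `translate` is the natural way to check the whole protein.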
Homology
BLAST of Moc11g30520 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 978.8 bits (2529), Expect = 3.4e-281
Identity = 498/528 (94.32%), Positives = 507/528 (96.02%), Query Frame = 0
Query: 191 QAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 250
+AESSRNP TPAG+ITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQA SDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP+TF EVLQK KKVIDG ELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 SRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTIIE 490
RGRSGKDIE DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT IE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPTYLAL 670
REQRPTC ITFDGADLEEVHLPHNDA+VIAPLIDHVVV RVLVDGG ANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFV 719
GWTRSQLKKSPTPLVGFSGESV+PEG IDL +TLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
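The percentages in each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length. The sketch below recomputes them from the summary line of the hit above. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 498/528 (94.32%), Positives = 507/528 (96.02%)".
# "Posi?tives" also accepts the misspelling "Postives" seen in some reports.
STAT_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(stat_line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for one HSP summary line."""
    match = STAT_RE.search(stat_line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positives, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positives / pos_len, 2),
    )


line = "Identity = 498/528 (94.32%), Positives = 507/528 (96.02%), Query Frame = 0"
print(hsp_percentages(line))  # (94.32, 96.02)
```

The same function can be applied to the summary line of every hit in this section to confirm that the reported percentages match the counts.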
BLAST of Moc11g30520 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 968.8 bits (2503), Expect = 3.6e-278
Identity = 517/631 (81.93%), Positives = 533/631 (84.47%), Query Frame = 0
Query: 187 SSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFT 246
SSNQQAESS NP TP G+ITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDGSKDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAP+TF EVLQK KKVIDG ELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKISRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD EK D KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYF 546
T IEESGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRE 606
KKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 VCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPT 666
VCIIREQRPTC ITFD ADLEEVHLPHNDA+VIAPLIDHVVVRRVLVD G ANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVVIDGRSAY 726
YLALGWTRSQLKKS TPLVGFS ESV+PEGCIDL +TLG DQT+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKG+SVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 TSRDGTLEFEADLPRREFAAPTEELELVPLL 818
SRDGTLEF+A+LPRREFAAPTEELELVPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc11g30520 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 948.0 bits (2449), Expect = 6.5e-272
Identity = 523/791 (66.12%), Positives = 569/791 (71.93%), Query Frame = 0
Query: 1 MVQPANSTNTADRTTLAASDAHQREVGAAVVERQGHGGPATEPLRRSAHITAPVLPPAHP 60
MVQPANSTNTADR LAA+ HQREVGA VVE QGH TEPL RSA IT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGAWGPAPAPASEDFDALQREMEAMRTQMQSMEEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRVDIREQRGSHLGPAEEEHPKDNESEGHTHQRGDLREHLTRKRDSSLRKRQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSCSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDL 240
AESS NP+TP G+ITREEFDQL+ + DAQVEALKA+CE+KE +DGDL
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 GESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIA 300
GE F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQA +DAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT 360
LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLR 420
RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP+TF EVLQKTKKVIDG ELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKTGRPERKISRGRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTT 480
TKTGRPE+ I +GR+GKD K D KS+DKG S SS R +YRR+ + +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPISEILTIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIED 540
IPI EILT IEE+GMEKLLKRPEKLRG E+R+ DKYCRFHR+HGHNTS+ WELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL 600
LIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KEL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKEL 600
Query: 601 ARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYA 660
AR ARREVCIIREQRPT SI F+ ADLE VHLPHNDA+VIAPLID V+VRR+LVDGGA A
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 650
Query: 661 NILSLPTYLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVV 720
NILSL TYLALGWTRSQLKKSPTPLVGFSGES+ EGCIDL +++ QD T+VTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 650
Query: 721 IDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTS 780
IDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE SRECYAS K +S
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 650
Query: 781 VCALETLTSRD 791
VCALE T RD
Sbjct: 781 VCALEEQTIRD 650
BLAST of Moc11g30520 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 798.9 bits (2062), Expect = 4.9e-227
Identity = 410/446 (91.93%), Positives = 420/446 (94.17%), Query Frame = 0
Query: 378 MCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRPERKISRGRSGK 437
MCYFLTGLADEALTVKL EEAP+TF EVLQK KKVIDG ELLRTK G +GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTKIG-------QGRSGK 60
Query: 438 DIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTIIEESGMEKL 497
D+E TDPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISEILT IEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 498 LKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 557
LKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 558 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTC 617
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 618 SITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPTYLALGWTRSQL 677
ITFD ADL EVHLPHNDA+VIAPLIDHVVVRRVLVDGGA ANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 678 KKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 737
KKSPTPLVGFSGESVVPEGCIDL +TLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 738 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTSVCALETLTSRDGTLEFEA 797
FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKGTSVCALETLTSRDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 798 DLPRREFAAPTEELELVPLLSPDKQL 824
DLP REFAAP EELELVPLLS +KQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc11g30520 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 783.1 bits (2021), Expect = 2.8e-222
Identity = 410/544 (75.37%), Positives = 451/544 (82.90%), Query Frame = 0
Query: 283 MDFQATSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 342
MDFQA +DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 343 HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTF 402
HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP+TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 403 TEVLQKTKKVIDGHELLRTKTGRPERKISRGRSGKDIEKTDPKSKDKGSFSS-GRAEYRR 462
EVLQK KKVIDG ELLRTKTGRPE++I + + ++ K D KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 463 AENGPTRSRPYERFTPTTIPISEILTIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHR 522
E+GP+RSRPYER+T +TIPISEILT IEESGMEKLLKRPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 523 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN 582
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 583 TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAP 642
TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTCSITF ADLE VHLPHNDA+VIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 643 LIDHVVVRRVLVDGGAYANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLL 702
LIDH +VRRVL+DG GCIDL
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 703 ITLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVR 762
+T+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 763 GEQTASRECYASALKGTSVCALETLTSRDGTLEFEADLP---RREFAAPTEELELVPLLS 822
GEQ SRECYASALKG++VCALE T+R E EADLP +R+F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 504
BLAST of Moc11g30520 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 978.8 bits (2529), Expect = 1.7e-281
Identity = 498/528 (94.32%), Positives = 507/528 (96.02%), Query Frame = 0
Query: 191 QAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 250
+AESSRNP TPAG+ITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQA SDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP+TF EVLQK KKVIDG ELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 SRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTIIE 490
RGRSGKDIE DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT IE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPTYLAL 670
REQRPTC ITFDGADLEEVHLPHNDA+VIAPLIDHVVV RVLVDGG ANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFV 719
GWTRSQLKKSPTPLVGFSGESV+PEG IDL +TLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc11g30520 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 968.8 bits (2503), Expect = 1.7e-278
Identity = 517/631 (81.93%), Positives = 533/631 (84.47%), Query Frame = 0
Query: 187 SSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFT 246
SSNQQAESS NP TP G+ITREEFDQLRG+L+AQVEALKAKCEQKEGPLNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDGSKDPKDYVEVFEGLMDFQA SDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAP+TF EVLQK KKVIDG ELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKISRGRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD EK D KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYF 546
T IEESGMEKLLKRPEKLRGA ERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRE 606
KKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 VCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPT 666
VCIIREQRPTC ITFD ADLEEVHLPHNDA+VIAPLIDHVVVRRVLVD G ANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVVIDGRSAY 726
YLALGWTRSQLKKS TPLVGFS ESV+PEGCIDL +TLG DQT+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVG VRGEQ ASRECYASALKG+SVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 TSRDGTLEFEADLPRREFAAPTEELELVPLL 818
SRDGTLEF+A+LPRREFAAPTEELELVPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc11g30520 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 948.0 bits (2449), Expect = 3.1e-272
Identity = 523/791 (66.12%), Positives = 569/791 (71.93%), Query Frame = 0
Query: 1 MVQPANSTNTADRTTLAASDAHQREVGAAVVERQGHGGPATEPLRRSAHITAPVLPPAHP 60
MVQPANSTNTADR LAA+ HQREVGA VVE QGH TEPL RSA IT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGAWGPAPAPASEDFDALQREMEAMRTQMQSMEEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRVDIREQRGSHLGPAEEEHPKDNESEGHTHQRGDLREHLTRKRDSSLRKRQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSCSHRSSNQQAESSRNPVTPAGMITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDL 240
AESS NP+TP G+ITREEFDQL+ + DAQVEALKA+CE+KE +DGDL
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 GESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQATSDAIKCRAFQIA 300
GE F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQA +DAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVT 360
LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLR 420
RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP+TF EVLQKTKKVIDG ELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKTGRPERKISRGRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTT 480
TKTGRPE+ I +GR+GKD K D KS+DKG S SS R +YRR+ + +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPISEILTIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIED 540
IPI EILT IEE+GMEKLLKRPEKLRG E+R+ DKYCRFHR+HGHNTS+ WELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKEL 600
LIQDGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KEL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKEL 600
Query: 601 ARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYA 660
AR ARREVCIIREQRPT SI F+ ADLE VHLPHNDA+VIAPLID V+VRR+LVDGGA A
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 650
Query: 661 NILSLPTYLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVV 720
NILSL TYLALGWTRSQLKKSPTPLVGFSGES+ EGCIDL +++ QD T+VTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 650
Query: 721 IDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTS 780
IDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NGVGTVRGE SRECYAS K +S
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 650
Query: 781 VCALETLTSRD 791
VCALE T RD
Sbjct: 781 VCALEEQTIRD 650
BLAST of Moc11g30520 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 798.9 bits (2062), Expect = 2.4e-227
Identity = 410/446 (91.93%), Positives = 420/446 (94.17%), Query Frame = 0
Query: 378 MCYFLTGLADEALTVKLGEEAPSTFTEVLQKTKKVIDGHELLRTKTGRPERKISRGRSGK 437
MCYFLTGLADEALTVKL EEAP+TF EVLQK KKVIDG ELLRTK G +GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRTKIG-------QGRSGK 60
Query: 438 DIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTIIEESGMEKL 497
D+E TDPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISEILT IEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 498 LKRPEKLRGASERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 557
LKRPEKLRGA ERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 558 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTC 617
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQRPTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 618 SITFDGADLEEVHLPHNDAVVIAPLIDHVVVRRVLVDGGAYANILSLPTYLALGWTRSQL 677
ITFD ADL EVHLPHNDA+VIAPLIDHVVVRRVLVDGGA ANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 678 KKSPTPLVGFSGESVVPEGCIDLLITLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 737
KKSPTPLVGFSGESVVPEGCIDL +TLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 738 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASALKGTSVCALETLTSRDGTLEFEA 797
FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAS LKGTSVCALETLTSRDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 798 DLPRREFAAPTEELELVPLLSPDKQL 824
DLP REFAAP EELELVPLLS +KQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc11g30520 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 783.1 bits (2021), Expect = 1.3e-222
Identity = 410/544 (75.37%), Positives = 451/544 (82.90%), Query Frame = 0
Query: 283 MDFQATSDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 342
MDFQA +DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 343 HLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPSTF 402
HLATIRQKE ETLREYVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKLGEEAP+TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 403 TEVLQKTKKVIDGHELLRTKTGRPERKISRGRSGKDIEKTDPKSKDKGSFSS-GRAEYRR 462
EVLQK KKVIDG ELLRTKTGRPE++I + + ++ K D KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 463 AENGPTRSRPYERFTPTTIPISEILTIIEESGMEKLLKRPEKLRGASERRSKDKYCRFHR 522
E+GP+RSRPYER+T +TIPISEILT IEESGMEKLLKRPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 523 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN 582
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 583 TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDGADLEEVHLPHNDAVVIAP 642
TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTCSITF ADLE VHLPHNDA+VIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 643 LIDHVVVRRVLVDGGAYANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVVPEGCIDLL 702
LIDH +VRRVL+DG GCIDL
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 703 ITLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGTVR 762
+T+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN VG VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 763 GEQTASRECYASALKGTSVCALETLTSRDGTLEFEADLP---RREFAAPTEELELVPLLS 822
GEQ SRECYASALKG++VCALE T+R E EADLP +R+F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 504
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1C7X5 | 1.7e-281 | 94.32 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1D9E1 | 1.7e-278 | 81.93 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DHB3 | 3.1e-272 | 66.12 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1DD03 | 2.4e-227 | 91.93 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A6J1DZB9 | 1.3e-222 | 75.37 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024...
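The Identity values in the table agree with the percentages in the HSP summary lines, which are the number of matching positions divided by the alignment length (for example 410/446 for A0A6J1DD03). A minimal sketch for rechecking them is given here; the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Hypothetical helper (not part of BLAST or this database): recompute the
# percentages from the "matches/length" counts in an HSP summary line.
_COUNTS = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(summary_line):
    """Return a dict mapping 'Identity'/'Positives' to a percentage (2 decimals)."""
    return {
        name: round(100.0 * int(matches) / int(length), 2)
        for name, matches, length in _COUNTS.findall(summary_line)
    }


# Summary lines for the two TrEMBL hits whose alignments are shown above.
dd03 = "Identity = 410/446 (91.93%), Positives = 420/446 (94.17%), Query Frame = 0"
dzb9 = "Identity = 410/544 (75.37%), Positives = 451/544 (82.90%), Query Frame = 0"

print(hsp_percentages(dd03))
print(hsp_percentages(dzb9))
```

The recomputed identity percentages (91.93 and 75.37) match the Identity column for A0A6J1DD03 and A0A6J1DZB9.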