Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGACGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATAACCGCGCCTGCCCTATCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGAATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGTAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATGGACGTACGCGAGCAAAAGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGATAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGGCAGTCACCATCCTGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGCTAATCACAAGGGAGGAGTTCGACCAGCTAAGGGGGAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGGGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGCAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCGAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGTTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGTCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATTGGGCGGGGCAGAAGTGGAAAAGATGAAAGGACAGATCCCAAGTCCAAAGACAGGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCGTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGACGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCAGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCGTCGGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTAGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGG
CTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTTGCGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAATGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAATTTTAAAGTATCCCACCCCCAATGGCGTGGGCACAGTCCAAGGGGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCAACCTGCCGAGGAAGGAGTTTGCCGCGTCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATCACCTCAGCATAGTCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAAAGGAGTGATGTAATTGTTGAGGAAGTTAACAAACGTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCAAAAGATTGTTTCCCACTGCCGAGGATCGATCAGCTCGTGGACGCCACAGCCGGGCACGAACTGCTCACTTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCTCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCAACCTACCAGAGAATGGTGAACAAAATGTTCGCCCAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACCTGGCCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCTAAGTGTGCCTTTGGAGTCTCCTCGGGAAAATTCCTTGGCTTCATGGTAAACAATCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGTCGTGATCGAGATGGAGGCACCTAAAACGCTAAAACAGCTTCAGTGCCTCAATGGTAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGGTCAACGGACAAGTGCCTCCCTTTCTTCAAGGTCTTACGAAAGAAAGGGCCGTTTGAATGGACGGCGGAGTGTGAGCAAGCGTGTCAGCAATTGAAGAACTACCTCTGTTCGGCACCCTTGCTTGCCAAGCCTATGCCGGGGGACAAGCTCCAATTGTACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACAGTGGTGGTGCTAACTAACTCGCCCCTTAAAATTATCTTCCACAAGCCGGAAGCTTCCGGACGCCTAATGAAGTGGGCAATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGGGTCCGACCTGCCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGGACCAGGGGGTCAACGATTTGAGTATGTCTTGCGGTTCAGCTTCCG
GACTTCTAACAACAAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGAATCGCTCAAGCATTGAGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCATACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGGGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGACGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAAATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCTGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAAAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCTGTGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAAGCGCTCTCCCACATAACGGAATCCAAGGTCACGTCCTTCGTATGGACGAACATCATATGTCGCTTTGGTATACCACAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTCAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCGGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAAGAGCTACCAGAAGTTCTATGGTCGTACCGGACCACCCAACGGGGGTCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTTAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGACGGAATATCAGGGCAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAAAGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGACGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATAACCGCGCCTGCCCTATCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGAATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGTAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATGGACGTACGCGAGCAAAAGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGATAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGGCAGTCACCATCCTGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGCTAATCACAAGGGAGGAGTTCGACCAGCTAAGGGGGAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGGGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGCAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCGAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGTTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGTCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATTGGGCGGGGCAGAAGTGGAAAAGATGAAAGGACAGATCCCAAGTCCAAAGACAGGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCGTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGACGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCAGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCGTCGGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTAGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGG
CTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTTGCGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAATGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAATTTTAAAGTATCCCACCCCCAATGGCGTGGGCACAGTCCAAGGGGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCAACCTGCCGAGGAAGGAGTTTGCCGCGTCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGACGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAAATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGAAGAGGGCCTGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAAAGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGACGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATAACCGCGCCTGCCCTATCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGAATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGTAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATGGACGTACGCGAGCAAAAGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGATAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGGCAGTCACCATCCTGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGCTAATCACAAGGGAGGAGTTCGACCAGCTAAGGGGGAAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGGGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGCAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCGAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGTTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGTCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATTGGGCGGGGCAGAAGTGGAAAAGATGAAAGGACAGATCCCAAGTCCAAAGACAGGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCGTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGACGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCAGTCATCAATACCATTTTTGGAGGGCCAAACGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCGTCGGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGAGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTAGTGGTCAGGAGAGTGTTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGG
CTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTTGCGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAATGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAATTTTAAAGTATCCCACCCCCAATGGCGTGGGCACAGTCCAAGGGGAACAGACCGCTTCGAGGGAGTGCTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCAACCTGCCGAGGAAGGAGTTTGCCGCGTCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGACGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAAATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAGATGCCTAACCCCTGAAGAGGGCCTGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAAAGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREDGAAAVEGQGHDGLAAEPLRRSARITAPALSPAHPRTSKATRGRGGTSKKGARGPAPTPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAVGAGSRSENRATRMDVREQKGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSCSHRSSNQQAESSHNPAGLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSVCALETLRDGTLEFEANLPRKEFAASTEELELVPLLSPEKQLASTYETDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPLLRCLTPEEGLMARHYNARVRPRAFQVGHLVLRRVQKHVGALDPAWEGPFEIKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP
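The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the relationship can be checked as follows. The function name `translate` and the short test fragments are illustrative; the fragments are the first 21 and last 12 bases of the CDS shown above.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by base in the sequence T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "")
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First seven codons of the CDS above.
print(translate("ATGGTTCAACCAGCGAATTCG"))
# Last four codons of the CDS above, including the TGA stop codon.
print(translate("TATTATCCTTGA"))
```

The two fragments translate to the start (`MVQPANS`) and end (`YYP`, followed by the stop) of the protein sequence listed above.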
Homology
BLAST of Moc05g10080 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 959.5 bits (2479), Expect = 2.2e-275
Identity = 491/528 (92.99%), Positives = 506/528 (95.83%), Query Frame = 0
Query: 191 QAESSHN---PAGLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGES FTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPP+FKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEE+PATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD E DPKSKD+GSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIV 610
GKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GGQSG KRKELARAARREVCI+
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 GEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
EQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFV 715
GWTRSQLK+SPTPLVGFSGESVIPEG IDLPV LGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
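The percentages in each HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic, assuming the summary line keeps the `label = matches/length` layout used in this report (the helper name `parse_hsp_stats` is hypothetical):

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Return {label: percentage} for every 'label = matches/length' pair."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[label] = round(100.0 * int(matches) / int(length), 2)
    return stats


summary = "Identity = 491/528 (92.99%), Positives = 506/528 (95.83%), Query Frame = 0"
print(parse_hsp_stats(summary))
```

For the hit above, 491 identical residues over an alignment length of 528 gives 92.99%, and 506 positive-scoring positions gives 95.83%.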
BLAST of Moc05g10080 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 957.2 bits (2473), Expect = 1.1e-274
Identity = 516/671 (76.90%), Positives = 546/671 (81.37%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESSFT 246
SSNQQAESSHNPA G+ITREEFDQLRGKL+AQVEALKAKCEQK+ LNDGDLGES FT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+E+PATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT 486
ER I RGRSGKDE+ D KSKD+GSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 546
NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREV 606
KFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP+GGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 CIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
CI+ EQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVVDGRSAYN 726
LALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPV LG DQT+VTQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSVCALETL- 786
AIFGRPIIHSFRAIPSTLHQ+LKY TPNGVG V+GEQ ASRECYA+ALKG SVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 -RDGTLEFEANLPRKEFAASTEELELVPLLSPEKQLASTYETDLARSVPVEILDNPSILE 846
RDGTLEF+ANLPR+EFAA TEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
Query: 847 PDLMEIGAPEP 853
D+ G PEP
Sbjct: 662 -DIGVEGMPEP 605
BLAST of Moc05g10080 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 936.0 bits (2418), Expect = 2.6e-268
Identity = 513/790 (64.94%), Positives = 570/790 (72.15%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREDGAAAVEGQGHDGLAAEPLRRSARITAPALSPAHP 60
MVQPANSTNT DRR LAA+ HQRE GA VEGQGH+ L EPL RSARIT P L PAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPTPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAVGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRMDVREQKGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSCSHRSSNQQAESSHNP--AGLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESSFTSDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E SF+SD+LEA IPP+FK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EE+PATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RTDPKSKDRG-SFSSGRAEYRRAENGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD+ + D KS+D+G S SS R +YRR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELA 600
IQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCI+ EQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVV 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVV+
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNAIFGRPIIHSFRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSV 780
DGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKY T NGVGTV+GE SRECYA+ K SV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALE--TLRD 785
CALE T+RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc05g10080 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 789.6 bits (2038), Expect = 3.0e-224
Identity = 399/422 (94.55%), Positives = 410/422 (97.16%), Query Frame = 0
Query: 228 KDDSLNDGDLGESSFTSDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD 287
KDDSLNDGDLGESSFTSDVLEAPIPP+FKAPTVKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAK 407
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE++P TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTGRPERKIGRGRSGKD-ERTDPKSKDRGSFSSGRAEYRRAENGPTRSR 467
KVIDGQELLRTKTGRP+RKIGRGRSGKD ER DPKSKD+GSFSSGRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 527
PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG 587
WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVCI+ EQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 649
VL
Sbjct: 464 VL 465
BLAST of Moc05g10080 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 786.6 bits (2030), Expect = 2.5e-223
Identity = 403/446 (90.36%), Positives = 419/446 (93.95%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EE+PATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 D-ERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKL 494
D E TDPKSKD+GSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE+SGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSS 554
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIVGEQGPTC 614
AEKKEERKRSRTPPRRTDRPAVINTIFGGP+GGQSGHKRK+LARAARREVCI+ EQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 674
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 734
K+SPTPLVGFSGESV+PEGCIDLPV LGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSVCALETL--RDGTLEFEA 794
FRAIPSTLHQ+LKY TPNGVGTV+GEQTASRECYA+ LKG SVCALETL RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 NLPRKEFAASTEELELVPLLSPEKQL 818
+LP +EFAA EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc05g10080 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 959.5 bits (2479), Expect = 1.1e-275
Identity = 491/528 (92.99%), Positives = 506/528 (95.83%), Query Frame = 0
Query: 191 QAESSHN---PAGLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGES FTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPP+FKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEE+PATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD E DPKSKD+GSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIV 610
GKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GGQSG KRKELARAARREVCI+
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 GEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
EQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFV 715
GWTRSQLK+SPTPLVGFSGESVIPEG IDLPV LGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc05g10080 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 957.2 bits (2473), Expect = 5.3e-275
Identity = 516/671 (76.90%), Positives = 546/671 (81.37%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLGESSFT 246
SSNQQAESSHNPA G+ITREEFDQLRGKL+AQVEALKAKCEQK+ LNDGDLGES FT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+E+PATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT 486
ER I RGRSGKDE+ D KSKD+GSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 546
NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREV 606
KFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGP+GGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 CIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
CI+ EQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVVDGRSAYN 726
LALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPV LG DQT+VTQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSVCALETL- 786
AIFGRPIIHSFRAIPSTLHQ+LKY TPNGVG V+GEQ ASRECYA+ALKG SVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 -RDGTLEFEANLPRKEFAASTEELELVPLLSPEKQLASTYETDLARSVPVEILDNPSILE 846
RDGTLEF+ANLPR+EFAA TEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
Query: 847 PDLMEIGAPEP 853
D+ G PEP
Sbjct: 662 -DIGVEGMPEP 605
BLAST of Moc05g10080 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 936.0 bits (2418), Expect = 1.3e-268
Identity = 513/790 (64.94%), Positives = 570/790 (72.15%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREDGAAAVEGQGHDGLAAEPLRRSARITAPALSPAHP 60
MVQPANSTNT DRR LAA+ HQRE GA VEGQGH+ L EPL RSARIT P L PAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPTPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAVGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRMDVREQKGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSCSHRSSNQQAESSHNP--AGLITREEFDQLRGKLDAQVEALKAKCEQKDDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ K DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESSFTSDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E SF+SD+LEA IPP+FK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EE+PATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RTDPKSKDRG-SFSSGRAEYRRAENGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD+ + D KS+D+G S SS R +YRR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELA 600
IQDGYFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCI+ EQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVV 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVV+
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNAIFGRPIIHSFRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSV 780
DGRSAYNAIFGRPIIHSFRA+PSTLHQ+LKY T NGVGTV+GE SRECYA+ K SV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALE--TLRD 785
CALE T+RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc05g10080 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 789.6 bits (2038), Expect = 1.5e-224
Identity = 399/422 (94.55%), Positives = 410/422 (97.16%), Query Frame = 0
Query: 228 KDDSLNDGDLGESSFTSDVLEAPIPPQFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD 287
KDDSLNDGDLGESSFTSDVLEAPIPP+FKAPTVKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAK 407
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE++P TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTGRPERKIGRGRSGKD-ERTDPKSKDRGSFSSGRAEYRRAENGPTRSR 467
KVIDGQELLRTKTGRP+RKIGRGRSGKD ER DPKSKD+GSFSSGRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 527
PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPNGG 587
WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGP+GG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCIVGEQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVCI+ EQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 649
VL
Sbjct: 464 VL 465
BLAST of Moc05g10080 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 786.6 bits (2030), Expect = 1.2e-223
Identity = 403/446 (90.36%), Positives = 419/446 (93.95%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EE+PATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 D-ERTDPKSKDRGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKL 494
D E TDPKSKD+GSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE+SGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSS 554
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKEERKRSRTPPRRTDRPAVINTIFGGPNGGQSGHKRKELARAARREVCIVGEQGPTC 614
AEKKEERKRSRTPPRRTDRPAVINTIFGGP+GGQSGHKRK+LARAARREVCI+ EQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 674
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KRSPTPLVGFSGESVIPEGCIDLPVALGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 734
K+SPTPLVGFSGESV+PEGCIDLPV LGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRAIPSTLHQILKYPTPNGVGTVQGEQTASRECYAAALKGPSVCALETL--RDGTLEFEA 794
FRAIPSTLHQ+LKY TPNGVGTV+GEQTASRECYA+ LKG SVCALETL RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 NLPRKEFAASTEELELVPLLSPEKQL 818
+LP +EFAA EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
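The percentages in each HSP summary line are the matched-residue counts divided by the alignment length, for example Identity = 403/446 (90.36%). As a minimal sketch, the reported figures can be recomputed from the counts as below. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern accepts either spelling of the Positives label.

```python
import re

# Hypothetical pattern for an HSP statistics line of the form
# "Identity = 403/446 (90.36%), Positives = 419/446 (93.95%), ...".
# "Posi?tives" also matches the misspelt label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    # Each percentage is matches / alignment length, rounded to two decimals.
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


if __name__ == "__main__":
    line = "Identity = 399/422 (94.55%), Positives = 410/422 (97.16%), Query Frame = 0"
    print(hsp_percentages(line))
```

Comparing the recomputed values with the percentages printed in the report is a quick consistency check when hit statistics are copied into other tables.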
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022137317.1 | 2.2e-275 | 92.99 | uncharacterized protein LOC111008813 [Momordica charantia]
XP_022150760.1 | 1.1e-274 | 76.90 | uncharacterized protein LOC111018823 [Momordica charantia]
XP_022152854.1 | 2.6e-268 | 64.94 | uncharacterized protein LOC111020479 [Momordica charantia]
XP_022150613.1 | 3.0e-224 | 94.55 | uncharacterized protein LOC111018708, partial [Momordica charantia]
XP_022152110.1 | 2.5e-223 | 90.36 | uncharacterized protein LOC111019899 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A6J1C7X5 | 1.1e-275 | 92.99 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1D9E1 | 5.3e-275 | 76.90 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DHB3 | 1.3e-268 | 64.94 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1D9W7 | 1.5e-224 | 94.55 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DD03 | 1.2e-223 | 90.36 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019...