Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGAACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGTCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCAATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACGATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAATACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGATCGACCAGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGCCGAGCTGAGTCTCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAATGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGAAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGATATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAAAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGGTCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAATGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGGGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTTTCCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGGGAGGAGCTAATCCACTTCCTCATATCCAACTCGGACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAATAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTAGACGCCACAGCCGGGCACGAACTGCTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATACTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACCTGGCCGAAGCCTTCGAGGTCTTGAGGGCATATCAAATGAAGCTCAACCCGGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCTAAAACGCTGAAGCAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGGTCGACAGATAAGTGCCTTCATTTCTTCAAAGTCTTACGAAAGAAAGGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCGGCACCTCTGCTCGCCAAGCCCATGCCGGGGGACAAGCTCCAATTGTACTTAGCAGTGTCTGACAGTGCCGTCATCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGGTACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGGCTCAGACCATACTTTCGAGCCCATACGGTGGTGGTGCTCACTAACTTGCCCCTAAAAAGCATCTTCCATAAGCCGGAAGCTTCTGGACGCCTAATGAAGTGGGCGATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGACAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGGACCAGGGGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAGTACCAAGCCAAAGAAACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTAACGAAGTAAGCCGGATTCCGCGGGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCTAGGTCGGTCCCCGTCGAGATCCCAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCGACTCACCACAAGATCCCAAGGAGCGCATAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTATGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTATAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCAAGCTGCTCACCCCCATCTCGGCCTCGTGGCCATTCGCGCAGTGGGAGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAAGCGCTCTCCCATATAACGGAATCCAGGGTCACGTCCTTCGTATGGACGAATATCATATGTCGCTTTGGTATACCGCAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCGGCAAACTTGGCATAAGTCATCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATCAACCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGATACCAGAGGTTCTATGGTCGTACCGGACCACCCAACGAGAGTCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGACATGCCATCCGACAGAGTAGAGCATTACGAGCCTACGACAAATGAGGATGAACTACTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCATGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGAGACGTACACATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGAACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGTCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCAATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACGATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAATACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGATCGACCAGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGCCGAGCTGAGTCTCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAATGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGAAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGATATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAAAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGGTCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAATGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGGGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTTTCCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCTAGGTCGGTCCCCGTCGAGATCCCAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCGACTCACCACAAGATCCCAAGGAGCGCATAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCCTACGACAAATGAGGATGAACTACTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCATGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGAGACGTACACATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGCCTGCGCACCCGAGAACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGTCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCAAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCAATCTGAAAATCGAGTGACGCGCGTGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACGATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAATACAGCGACCCATCTCGCCACCATCAGGCAAAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGATCGACCAGAGCGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATATAGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGCCGAGCTGAGTCTCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAATGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGAAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGATATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAAAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGCGACCCATCATCCACTCATTTCGGGCCATTCCCTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGGTCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAATGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGCAGGGATGGGCCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTTTCCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCTAGGTCGGTCCCCGTCGAGATCCCAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCGACTCACCACAAGATCCCAAGGAGCGCATAAAGTTGGCAAGGCGGGCAGCTCGAGTAGAGCATTACGAGCCTACGACAAATGAGGATGAACTACTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCATGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGAGACGTACACATTGGCCGATCTGAAAGGAGATGTCCTCGCGCACCCGTGGAACGCGGAACACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPVPAPTSENFDALKREMEAMRTQMRSMEEMYNEMMLAAGAGSQSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHDPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSVCALETLAGRDGPLEFEADLPRKEFAAPTEELELVPFLSPEKQLASAYETDLARSVPVEIPDNPSILEPDLMEIGAPESSWMDPIADFIRGDSPQDPKERIKLARRAARVEHYEPTTNEDELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFHVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPETYTLADLKGDVLAHPWNAEHLKRYYP
Homology
BLAST of Moc08g44710 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 966.8 bits (2498), Expect = 1.4e-277
Identity = 496/528 (93.94%), Postives = 507/528 (96.02%), Query Frame = 0
Query: 191 QAESSHD---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 250
+AESS + PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDK TATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKT RPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEILMNIE 490
GRGRSGKDIE ADPKSKDKGSFSSGRAE RRAENGPTRSRPYERFTPTTIPISEIL NIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCII 610
GKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV 716
GWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g44710 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 955.3 bits (2468), Expect = 4.2e-274
Identity = 513/631 (81.30%), Postives = 532/631 (84.31%), Query Frame = 0
Query: 187 SSNQQAESSHDPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFT 246
SSNQQAESSH+PA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKT RP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD E+AD KSKDKGSFSSGRAE RRA NGPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 MNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYF 546
NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARRE 606
KKFVGKPRTSSAEKK+ERK SRTP RR DRPAVINTIFGGPSGGQSG+KRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 VCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 666
VCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAY 726
YLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ+ASRECYASALK SSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 AGRDGPLEFEADLPRKEFAAPTEELELVPFL 815
RDG LEF+A+LPR+EFAAPTEELELVP L
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc08g44710 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 940.6 bits (2430), Expect = 1.1e-269
Identity = 518/790 (65.57%), Postives = 567/790 (71.77%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA VEGQGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPVPAPTSENFDALKREMEAMRTQMRSMEEMYNEMMLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SQSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHDP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLG 240
AESS++P G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+ T THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP TFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTDRPERKIGRGRSGKDIERADPKSKDKG-SFSSGRAESRRAENGPTRSRPYERFTPTTI 480
KT RPE+ I +GR+GKD +AD KS+DKG S SS R + RR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EIL NIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELA 600
IQDGYFKKFVGKPR++S EKK+ERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVI 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVVI
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSV 780
DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE SRECYAS K SSV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALETLAGRD 788
CALE RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc08g44710 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 792.7 bits (2046), Expect = 3.6e-225
Identity = 402/422 (95.26%), Postives = 410/422 (97.16%), Query Frame = 0
Query: 228 KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD 287
KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y K T THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAK 407
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+APTTFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTDRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSR 467
KVIDGQELLRTKT RP+RKIGRGRSGKD+ERADPKSKDKGSFSSGRAE RRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 527
PYERFTPTTIPISEIL NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQIEDLIQDGYFKKFVGKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGYKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSG+KRKELARAARREVCIIREQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 650
VL
Sbjct: 464 VL 465
BLAST of Moc08g44710 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 792.7 bits (2046), Expect = 3.6e-225
Identity = 405/446 (90.81%), Postives = 417/446 (93.50%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EEAP TFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 DIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKL 494
D+E DPKSKDKGSFS+GRAE RRAENGPTRSRPYERFTPTTIPISEIL NIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 554
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQGPTC 614
AEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG+KRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 674
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 734
K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSVCALETLAGRDGPLEFEA 794
FRAIPSTLHQVLKY TPNGVGTVRGEQ ASRECYAS LK +SVCALETL RDG LEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 DLPRKEFAAPTEELELVPFLSPEKQL 821
DLP +EFAAP EELELVP LS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc08g44710 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 966.8 bits (2498), Expect = 6.7e-278
Identity = 496/528 (93.94%), Postives = 507/528 (96.02%), Query Frame = 0
Query: 191 QAESSHD---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 250
+AESS + PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDK TATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP TFAEVLQKAKKVIDGQELLRTKT RPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEILMNIE 490
GRGRSGKDIE ADPKSKDKGSFSSGRAE RRAENGPTRSRPYERFTPTTIPISEIL NIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCII 610
GKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFV 716
GWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g44710 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 955.3 bits (2468), Expect = 2.0e-274
Identity = 513/631 (81.30%), Postives = 532/631 (84.31%), Query Frame = 0
Query: 187 SSNQQAESSHDPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFT 246
SSNQQAESSH+PA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAP TFAEVLQKAKKVIDGQELLRTKT RP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD E+AD KSKDKGSFSSGRAE RRA NGPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 MNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYF 546
NIEESGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARRE 606
KKFVGKPRTSSAEKK+ERK SRTP RR DRPAVINTIFGGPSGGQSG+KRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 VCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPT 666
VCIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAY 726
YLALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG DQT+VTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ+ASRECYASALK SSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 AGRDGPLEFEADLPRKEFAAPTEELELVPFL 815
RDG LEF+A+LPR+EFAAPTEELELVP L
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc08g44710 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 940.6 bits (2430), Expect = 5.2e-270
Identity = 518/790 (65.57%), Postives = 567/790 (71.77%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA VEGQGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPVPAPTSENFDALKREMEAMRTQMRSMEEMYNEMMLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SQSENRVTRVDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHDP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLG 240
AESS++P G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+ T THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP TFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTDRPERKIGRGRSGKDIERADPKSKDKG-SFSSGRAESRRAENGPTRSRPYERFTPTTI 480
KT RPE+ I +GR+GKD +AD KS+DKG S SS R + RR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDL 540
PI EIL NIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELA 600
IQDGYFKKFVGKPR++S EKK+ERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVI 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVVI
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSV 780
DGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE SRECYAS K SSV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALETLAGRD 788
CALE RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc08g44710 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 792.7 bits (2046), Expect = 1.7e-225
Identity = 402/422 (95.26%), Postives = 410/422 (97.16%), Query Frame = 0
Query: 228 KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD 287
KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKNTATHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y K T THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFAEVLQKAK 407
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+APTTFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTDRPERKIGRGRSGKDIERADPKSKDKGSFSSGRAESRRAENGPTRSR 467
KVIDGQELLRTKT RP+RKIGRGRSGKD+ERADPKSKDKGSFSSGRAE RRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILMNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 527
PYERFTPTTIPISEIL NIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKKERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQIEDLIQDGYFKKFVGKPRTSSAEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGYKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSG+KRKELARAARREVCIIREQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 650
VL
Sbjct: 464 VL 465
BLAST of Moc08g44710 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 792.7 bits (2046), Expect = 1.7e-225
Identity = 405/446 (90.81%), Postives = 417/446 (93.50%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEEAPTTFAEVLQKAKKVIDGQELLRTKTDRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EEAP TFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 DIERADPKSKDKGSFSSGRAESRRAENGPTRSRPYERFTPTTIPISEILMNIEESGMEKL 494
D+E DPKSKDKGSFS+GRAE RRAENGPTRSRPYERFTPTTIPISEIL NIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 554
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKKERKRSRTPPRRTDRPAVINTIFGGPSGGQSGYKRKELARAARREVCIIREQGPTC 614
AEKK+ERKRSRTPPRRTDRPAVINTIFGGPSGGQSG+KRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 674
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KRSPTPLVGFSGESVIPEGCIDLPVTLGQDQTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 734
K+SPTPLVGFSGESV+PEGCIDLPVTLGQDQTRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRAIPSTLHQVLKYPTPNGVGTVRGEQVASRECYASALKCSSVCALETLAGRDGPLEFEA 794
FRAIPSTLHQVLKY TPNGVGTVRGEQ ASRECYAS LK +SVCALETL RDG LEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 DLPRKEFAAPTEELELVPFLSPEKQL 821
DLP +EFAAP EELELVP LS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022137317.1 | 1.4e-277 | 93.94 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022150760.1 | 4.2e-274 | 81.30 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022152854.1 | 1.1e-269 | 65.57 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022150613.1 | 3.6e-225 | 95.26 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
XP_022152110.1 | 3.6e-225 | 90.81 | uncharacterized protein LOC111019899 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 6.7e-278 | 93.94 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1D9E1 | 2.0e-274 | 81.30 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DHB3 | 5.2e-270 | 65.57 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9W7 | 1.7e-225 | 95.26 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DD03 | 1.7e-225 | 90.81 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
Match Name | E-value | Identity | Description | |