Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide
ATGGTTCAACCAGCAAACTCGACCAATATGGCAGATCGAAGGACTCTAGTTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAACAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGAAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGCGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGTCAAATGTGAGTAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCACACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCTCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGTAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGCCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTACTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGAGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGCTAGCTCGTGCAGCAAGGCGCGAGGTGTGCATCATCAGGAAGCAGAGGCCGACCTGCTCAATCACTTTCGACAATACAGACTTAGAGGAGGTCCACCCGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTAC
CTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACTGCTGGTTGGGTTCTCTGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAATTTTGAAGTATTCCACCCCCAATGGCCTGGGCACGTGACTGGGTGTTTATGCTGAACGCACGCGCTAAGCGTGGTAACAATTATATAGATGCAAGCGCACACCTGTCAGAGTAATATAAAATGGATGGAGTATCCAAGTATCGATCCTCAAGGACTTCTAAACAATTTCCAAACTCTAAAGTTTTAAAGAATAGTGAATGTTGGAATGAGTGATTCTTGTAAAATGGAAAGTGGTCACAAATAATGGTTTGGATTTGGGATTTGGACTTAGAATGATAAAACCTAGCCCGAATTACTTCAAGTTAGCCATCAAAGAATCAAGGACATGCTAAGGACAATATGCATTTTACTTTAGAAACCAACGACTACGCTTCTCACTTGACTTATGTCTAAGCAAGTTCAAAGTCCTAAGACATTCCCTAATATGTCTATTATGAGAGATATCTTAGTTTTAGGATGGAAATAGTCGCAACCTCGGGTTAAGGTCCTATGTCTAGGCTACTTAAGCGGAACCCTTATGTCTAAGGCGCTCCTCTCCTCGGTTTAACCACTCAACAATGCAAATAAGCAACCTTTTCAGTATTGACTTGTTAAGACTTACAGTGGCCTTCTCTAAATCCTAACTTTTTGGGAAACATGTTGAGCAATGAATGGAAAGCAAAAACACTACTTCTTAGCAACCCTTTTAGACATAAAAGGCTAACTATCCATAACCGGGCTAGAGACGGGAATTTAGAAAGACATATTGAAGAAAATACAGCAAGTAAAACGGTTGAATGTGGAATGCATGGGTGAAAAACAAAAGTTAACTAACGGAGAGTTTTTATCGGGATTGAGAGCTTCGCCCTTCTCTTTCAAGCAAGGGGGAGTGTCACGTGCTCTTTCCTCCAAAAACACAACTCTCTTAACTAAAAATAAAGTAAAGGCAAAGGGAAGATAAGCAAAGTGCTAGATTTTTATATATTTGAATAAGTAGAAAATGACTAAGTACAAATGGAAATAAAACAAGCTCAATAAAGGAATGTAAAGTGCTCGGGATTTAAAAACCAAAAGTAACTAATGAAAGCAACTCAAAAGTACTTAGAAAGCAACAAAGAAACACTAAAAATGATCAAGAGAAGGAAGCAAGGCTGGATGATTTTGGTGGAGAAGTTGCTGGAATTTATAGCTGTGCAGAATTGGTTAGGTAGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTTTGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTACACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGACGGGTTTCATTAAGGGCAAAGTTGGAAGAAAACCAGTTTTTCTGCAGGCTTCCCAGGCGCCTGGCGCCTCCCATGATACTTTTTCGTCTTTTCGCTTCTTTTGATTAATTTGCACTCCCAAATGGTCCTAAAATGTCATAAATACAATATAAGCCATCCACAAACCATAGTATATGAATTTTGCATGAATTGGAACTCAAACGATCGAGAAAAATCCGAAACAAAGCTGAAACATAGTAAAAACAGCATCTAAAACCTTCTTTTTACGCATGTAAACAAATGTAATCAGCACGGTCCGAGGGGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAGTAAGCATAGGAACCAAGCTGGGGGC
GATCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGATGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGAAAATTATGATGCATCGCCTCAGCATAGATCCGTCATTCCAACCTGTAAAGCAAAAAAGAAGACTTATAAACAAGGAGCAGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAAGTGGAGAATGTGCGTAGATTGTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACACCACAGCCGGGCACGAACTGCTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATCAAGGTCATACCGCATTCATAACAGACCAAGGTCTGTACTGTTACAAGATCATGCCCTTCGGGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAATAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTATATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGATCTGACCGAAGCCTTCGAGGTTCTGAGGACATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTTGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGCTCGAGATGGAGGCACCTAAGACGCTGAAGCAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTAAACCAGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAGTCCTACGGAGGAAAGAGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCATTTCAGCAGTTGAAAAGCTACCTCTGTTCGGCACCTTTGCTCGCCAAGCCCATGTCGGGGGACAAGCTCCAATTGTACTTAGCAGTGTCTGACAGCACCGTCAGCTCGACCCTAATCAGGAAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGTAAGGCTATGACCGAAGCCGAGACCAGATACCATCAAATGGAAAAGTTGGCTCTTGCTTTAGTCACCTCGGCCTGACGGCTTAGACCATACTTCCAAGCCCATACTGTGGTGGTGCTCACTAACTTGCCCCTAAAAAACATCTTCCATAAGCTAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCGTGGACAATCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGGGCCGGGGTCCTCTTGCTCGGACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTGCGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGCGACTCCCAGCTGGTTGTGAGCCAGATCAAGGAAGAGTACCAAGCCAAAGACTCCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGGGTTCCCCGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGACGTACGAGATCGACCTGGCCAGGTCGGTCCCTGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCACAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGTCCGAGGTGGAG
CATTGTACCGACGCGGCTTTTCCCTGCCTCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAAATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGAACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCCTTTCCATTTGGGCAAGGGCCAAACCAAGTTCGCTGTGGATTACTTCACAAAGTGGGCCGAGGCCGAGGCGCTCTCCCACATAACGGAATCCAGAGTCACGTCCTTCGTATGGACAAATATCATATGTCGCTTTGGTATACCGCAGGCCATTGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAGCTTGGCATAAGTCACCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGTAGGTGGAGGCAGTCAACAAGATCATCAAACGAGGCATCAAACTTAGACTGGACTCCAAGAAAAGCAGGTGGGCCGAGGAGCTACTAGAGGTTCTATGGTCGTACCGGACCACCCAAAGAGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTACGACAAATGAGGAAGAGCTGCTCCTCAACCTTGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCTCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGTGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATATGGCAGATCGAAGGACTCTAGTTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAACAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGAAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGCGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGTCAAATCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCACACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCTCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGTAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGCCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTACTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGAGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAACTTAGAGGAGGTCCACCCGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACTGCTGGTTGGGTTCTCTGGTCACCCAAATGGCCGAGTTCGTGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTTTGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTA
CACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGACGGGTTTCATTAAGGGCAAAGTTGGAAGAAAACCAGTTTTTCTGCAGGCTTCCCAGGCGCCTGGCGCCTCCCATGATACTTTTTCCACGGTCCGAGGGGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAATCGACCTGGCCAGGTCGGTCCCTGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCACAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGGCAGAATGGCCAGACATTACAACGCTCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGTGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGCAAACTCGACCAATATGGCAGATCGAAGGACTCTAGTTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAACAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACTCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGAAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCTGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGAGGAGACCTTCGTGAACACCTCAACAGAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCTCACACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGCGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGTCAAATCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCACACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCTCTCACGGTGAAACTTGGAGAGGAGGCCCCGACCACCTTCGTCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGTAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAAAGCCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTACTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCCAGGACCAGCTCGGCAGAAAAAAGGAAGAGCGGAAGCGTTCGAGGACGCCGCCCCGGAGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAACTTAGAGGAGGTCCACCCGCCCCACAATGACGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGATAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGTTGGACGAGGTCGCAATTGACGAAAAGCCCGACACTGCTGGTTGGGTTCTCTGGTCACCCAAATGGCCGAGTTCGTGGTCCTAAAACGATGGTGAATCACCCATTTCCAACTCTTTTGACTGACCAAATCGTGCTCAAACTGATAGTGGAACACTTCTTCTA
CACGTATGCATCGGTTGGAACCACCAAAACGCATTGGAAGACGGGTTTCATTAAGGGCAAAGTTGGAAGAAAACCAGTTTTTCTGCAGGCTTCCCAGGCGCCTGGCGCCTCCCATGATACTTTTTCCACGGTCCGAGGGGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTCGAAACTCTCGCCGGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAAATCGACCTGGCCAGGTCGGTCCCTGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCACAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTGGGCAGAATGGCCAGACATTACAACGCTCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGTGTCCAAACCCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTGAGGTCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNMADRRTLVASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMKEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVKSPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFVGKPRTSSAEKRKSGSVRGRRPGALTDLRSSIPFSEGQAGVNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGWTRSQLTKSPTLLVGFSGHPNGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYASVGTTKTHWKTGFIKGKVGRKPVFLQASQAPGASHDTFSTVRGEQTASRECYASALKGSSVCALETLAGRDGTLEFEADLPRREFAAPTEELELVPLLSPEKQIDLARSVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKEHRKLARRAARFVGRMARHYNARVRPRTFQVGHLVLRSVQTHVGALDPTWEGPFEVKGIVRPGTYVLADLKGDVLAHPWNAEHLKRYYP
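The protein sequence above can be cross-checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    across several lines can be passed in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGGTTCAACCAGCAAACTCGACC"))
print(translate("CGTTATTATCCTTGA"))
```

The two fragments correspond to the start (`MVQPANST`) and the end (`RYYP`, followed by the `TGA` stop codon) of the protein sequence shown above.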
Homology
BLAST of Moc06g15970 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 736.5 bits (1900), Expect = 2.8e-208
Identity = 411/576 (71.35%), Positives = 435/576 (75.52%), Query Frame = 0
Query: 191 QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVK----------------------- 250
+AESS NPATPAGVITR EFDQLRG+LDAQVEALK K
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 -SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
+PIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATI QKEGETLREYVTRFQE+QLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLRGAPE RSKDKYCRFHREHGHNTSDYWELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKRKSGSVRGRRPGALTDLRSSIPF-----SEGQA----------------- 610
GKPRTSSAEK++ R R P TD + I S GQ+
Sbjct: 363 GKPRTSSAEKKEERK-RSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCI 422
Query: 611 -------------GVNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLA 670
G +LEEVH PHNDALVIAPLIDHVVV RVL+DGG SANILSLPTYLA
Sbjct: 423 IREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLA 482
Query: 671 LGWTRSQLTKSPTLLVGFSGH---PNGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFYT 704
LGWTRSQL KSPT LVGFSG P G + P T+ + V + I
Sbjct: 483 LGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFVDRAI------- 542
BLAST of Moc06g15970 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 689.9 bits (1779), Expect = 3.0e-194
Identity = 425/802 (52.99%), Positives = 473/802 (58.98%), Query Frame = 0
Query: 1 MVQPANSTNMADRRTLVASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTN ADRR L A+ HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMKEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALK--------------- 240
AESS+NP TP GVITR EFDQL+ K DAQVEALK
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 ---------VKSPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA 300
+++ IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVT 360
LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATI QKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLR 420
RF E+QLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP TF EVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTT 480
TKTGRPE+ I +GR+GKD KAD KS+DKG S SS R +YRR+ + +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPISEILTNIEESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIED 540
IPI EILTNIEE+GMEKLLKRPEKLRG PE R+ DKYCRFHR+HGHNTS+YWELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVGKPRTSSAEKR------KSGSVRGRRPGALTDLR--------------- 600
LIQDGYFKKFVGKPR++S EK+ ++ R RP + +
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVINKKKELAREARREVCIIRE 600
Query: 601 ----SSIPFSEGQAGVNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYL 660
SSI F+ +LE VH PHNDALVIAPLID V+VRR+L+DGGASANILSL TYL
Sbjct: 601 QRPTSSIAFNH----ADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYL 650
Query: 661 ALGWTRSQLTKSPTLLVGFSGHP---NGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFY 720
ALGWTRSQL KSPT LVGFSG G + P ++ T +T +I Y
Sbjct: 661 ALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSI--RQDDTQVTQMAEFVVIDGRSAY 650
Query: 721 TYASVGTTKTHWKTGFIKGKVGRKPVFLQASQAPGASHDTF--------STVRGEQTASR 742
+ +P+ P H TVRGE SR
Sbjct: 721 ------------------NAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSR 650
BLAST of Moc06g15970 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 642.9 bits (1657), Expect = 4.2e-180
Identity = 396/645 (61.40%), Positives = 421/645 (65.27%), Query Frame = 0
Query: 187 SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVK---------------SPIP 246
SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALK K SP
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 PK-FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA 306
+APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 62 SDVLEAPTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW------ 121
Query: 307 RSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVAHCSD 366
FQE QLKVA SD
Sbjct: 122 -----------------------------------------------FQEDQLKVAQSSD 181
Query: 367 DSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGR 426
DSAMCYFLTGLADEALTVKLG+EAP TF EVLQKAKKVIDGQELLRTKTGRPER I RGR
Sbjct: 182 DSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGR 241
Query: 427 SGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGM 486
SGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGM
Sbjct: 242 SGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGM 301
Query: 487 EKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFVGKPR 546
EKLLKRPEKLRGAPE R+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPR
Sbjct: 302 EKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPR 361
Query: 547 TSSAEKRKSGSV------RGRRPGALTDLRSSIPFSEGQAG------------------- 606
TSSAEK++ + R RP + + S GQ+G
Sbjct: 362 TSSAEKKEERKLSRTPLRRIDRPAVINTIFGGP--SGGQSGHKRKELARAARREVCIIRE 421
Query: 607 -----------VNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGW 666
+LEEVH PHNDALVIAPLIDHVVVRRVL+D G SANI+SL TYLALGW
Sbjct: 422 QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGW 481
Query: 667 TRSQLTKSPTLLVGFSGH---PNGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYAS 726
TRSQL KS T LVGFS P G + P T+ + DQ + + E S
Sbjct: 482 TRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGH--------DQTQVTQMAEFVVIDGRS 541
Query: 727 VGTTKTHWKTGFIKGKVGRKPVFLQASQAPGASHDT--FST------VRGEQTASRECYA 769
+ +P+ P H +ST VRGEQ ASRECYA
Sbjct: 542 A------------YNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYA 570
BLAST of Moc06g15970 vs. NCBI nr
Match:
XP_022156542.1 (uncharacterized protein LOC111023421 [Momordica charantia])
HSP 1 Score: 620.5 bits (1599), Expect = 2.2e-173
Identity = 323/374 (86.36%), Positives = 333/374 (89.04%), Query Frame = 0
Query: 203 GVITRAEFDQLRGKLDAQVEALKVK------------------------SPIPPKFKAPT 262
G+ITR EFDQLRG+LDAQVEALK K +PIPPKFKAPT
Sbjct: 26 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 85
Query: 263 VKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 322
VKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQ
Sbjct: 86 VKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQ 145
Query: 323 LRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFL 382
LRREFLAQFSSRHYDKKTATHLATI QKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFL
Sbjct: 146 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 205
Query: 383 TGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKA 442
TGLADEALTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+A
Sbjct: 206 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERA 265
Query: 443 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE 502
DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Sbjct: 266 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE 325
Query: 503 KLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFVGKPRTSSAEKRK 553
KLRGAPE RSKDKYCRFHREHGHNTSD+WELKRQIEDLIQDGYFKKFVGKPRTSSAEK++
Sbjct: 326 KLRGAPERRSKDKYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE 385
BLAST of Moc06g15970 vs. NCBI nr
Match:
XP_022152033.1 (uncharacterized protein LOC111019842 [Momordica charantia])
HSP 1 Score: 619.8 bits (1597), Expect = 3.8e-173
Identity = 327/372 (87.90%), Positives = 333/372 (89.52%), Query Frame = 0
Query: 170 KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVK-- 229
+RGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITR EFDQLRGKLDAQVEALK K
Sbjct: 28 QRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCE 87
Query: 230 ----------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAAS 289
+PIP KFKAPTVKPYDGS+DPKDYVEVFEGLMDFQAAS
Sbjct: 88 QKEGSLNDGDLGESPFTSDVLEAPIPXKFKAPTVKPYDGSRDPKDYVEVFEGLMDFQAAS 147
Query: 290 DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQ 349
D IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATI Q
Sbjct: 148 DTIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKETATHLATIRQ 207
Query: 350 KEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKA 409
KEGETLREYVTRFQE+QLKV HCSDDSAMCYFLTGLADEA TVKLGEEAP TF EVLQKA
Sbjct: 208 KEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAXTVKLGEEAPATFAEVLQKA 267
Query: 410 KKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS 469
KKVIDGQELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Sbjct: 268 KKVIDGQELLRTKTGRPERKIGRGRSGKDIERADLKSKDKGSFSSDRAGYRRAENGPTRS 327
Query: 470 RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSD 518
RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPE RSKDKYCRFHREHGHNTSD
Sbjct: 328 RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD 387
BLAST of Moc06g15970 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 736.5 bits (1900), Expect = 1.4e-208
Identity = 411/576 (71.35%), Positives = 435/576 (75.52%), Query Frame = 0
Query: 191 QAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVK----------------------- 250
+AESS NPATPAGVITR EFDQLRG+LDAQVEALK K
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 -SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
+PIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATI QKEGETLREYVTRFQE+QLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFV 550
ESGMEKLLKRPEKLRGAPE RSKDKYCRFHREHGHNTSDYWELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKRKSGSVRGRRPGALTDLRSSIPF-----SEGQA----------------- 610
GKPRTSSAEK++ R R P TD + I S GQ+
Sbjct: 363 GKPRTSSAEKKEERK-RSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCI 422
Query: 611 -------------GVNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLA 670
G +LEEVH PHNDALVIAPLIDHVVV RVL+DGG SANILSLPTYLA
Sbjct: 423 IREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLA 482
Query: 671 LGWTRSQLTKSPTLLVGFSGH---PNGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFYT 704
LGWTRSQL KSPT LVGFSG P G + P T+ + V + I
Sbjct: 483 LGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFVDRAI------- 542
BLAST of Moc06g15970 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 689.9 bits (1779), Expect = 1.5e-194
Identity = 425/802 (52.99%), Positives = 473/802 (58.98%), Query Frame = 0
Query: 1 MVQPANSTNMADRRTLVASDAHQREVGATVVEGQGHDGLATEPLRRSARITAPVLPPAHP 60
MVQPANSTN ADRR L A+ HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPPSENFDALQREMEAMRTQMRSMKEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALK--------------- 240
AESS+NP TP GVITR EFDQL+ K DAQVEALK
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 ---------VKSPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA 300
+++ IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVT 360
LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATI QKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLR 420
RF E+QLKVAHCSDDSAMCYFLTGLADE LTVKL EEAP TF EVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKTGRPERKIGRGRSGKDIEKADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTT 480
TKTGRPE+ I +GR+GKD KAD KS+DKG S SS R +YRR+ + +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPISEILTNIEESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIED 540
IPI EILTNIEE+GMEKLLKRPEKLRG PE R+ DKYCRFHR+HGHNTS+YWELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVGKPRTSSAEKR------KSGSVRGRRPGALTDLR--------------- 600
LIQDGYFKKFVGKPR++S EK+ ++ R RP + +
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVINKKKELAREARREVCIIRE 600
Query: 601 ----SSIPFSEGQAGVNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYL 660
SSI F+ +LE VH PHNDALVIAPLID V+VRR+L+DGGASANILSL TYL
Sbjct: 601 QRPTSSIAFNH----ADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSLSTYL 650
Query: 661 ALGWTRSQLTKSPTLLVGFSGHP---NGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFY 720
ALGWTRSQL KSPT LVGFSG G + P ++ T +T +I Y
Sbjct: 661 ALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSI--RQDDTQVTQMAEFVVIDGRSAY 650
Query: 721 TYASVGTTKTHWKTGFIKGKVGRKPVFLQASQAPGASHDTF--------STVRGEQTASR 742
+ +P+ P H TVRGE SR
Sbjct: 721 ------------------NAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSR 650
BLAST of Moc06g15970 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 642.9 bits (1657), Expect = 2.0e-180
Identity = 396/645 (61.40%), Positives = 421/645 (65.27%), Query Frame = 0
Query: 187 SSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVK---------------SPIP 246
SSNQQAESSHNPATP GVITR EFDQLRGKL+AQVEALK K SP
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 PK-FKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPA 306
+APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 62 SDVLEAPTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW------ 121
Query: 307 RSISTYSQLRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVAHCSD 366
FQE QLKVA SD
Sbjct: 122 -----------------------------------------------FQEDQLKVAQSSD 181
Query: 367 DSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGR 426
DSAMCYFLTGLADEALTVKLG+EAP TF EVLQKAKKVIDGQELLRTKTGRPER I RGR
Sbjct: 182 DSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGR 241
Query: 427 SGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGM 486
SGKD EKAD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILTNIEESGM
Sbjct: 242 SGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGM 301
Query: 487 EKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFVGKPR 546
EKLLKRPEKLRGAPE R+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFKKFVGKPR
Sbjct: 302 EKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPR 361
Query: 547 TSSAEKRKSGSV------RGRRPGALTDLRSSIPFSEGQAG------------------- 606
TSSAEK++ + R RP + + S GQ+G
Sbjct: 362 TSSAEKKEERKLSRTPLRRIDRPAVINTIFGGP--SGGQSGHKRKELARAARREVCIIRE 421
Query: 607 -----------VNLEEVHPPHNDALVIAPLIDHVVVRRVLIDGGASANILSLPTYLALGW 666
+LEEVH PHNDALVIAPLIDHVVVRRVL+D G SANI+SL TYLALGW
Sbjct: 422 QRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGW 481
Query: 667 TRSQLTKSPTLLVGFSGH---PNGRVRGPKTMVNHPFPTLLTDQIVLKLIVEHFFYTYAS 726
TRSQL KS T LVGFS P G + P T+ + DQ + + E S
Sbjct: 482 TRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGH--------DQTQVTQMAEFVVIDGRS 541
Query: 727 VGTTKTHWKTGFIKGKVGRKPVFLQASQAPGASHDT--FST------VRGEQTASRECYA 769
+ +P+ P H +ST VRGEQ ASRECYA
Sbjct: 542 A------------YNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYA 570
BLAST of Moc06g15970 vs. ExPASy TrEMBL
Match:
A0A6J1DS95 (uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023421 PE=4 SV=1)
HSP 1 Score: 620.5 bits (1599), Expect = 1.1e-173
Identity = 323/374 (86.36%), Positives = 333/374 (89.04%), Query Frame = 0
Query: 203 GVITRAEFDQLRGKLDAQVEALKVK------------------------SPIPPKFKAPT 262
G+ITR EFDQLRG+LDAQVEALK K +PIPPKFKAPT
Sbjct: 26 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 85
Query: 263 VKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 322
VKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQ
Sbjct: 86 VKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQ 145
Query: 323 LRREFLAQFSSRHYDKKTATHLATITQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFL 382
LRREFLAQFSSRHYDKKTATHLATI QKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFL
Sbjct: 146 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 205
Query: 383 TGLADEALTVKLGEEAPTTFVEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIEKA 442
TGLADEALTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD+E+A
Sbjct: 206 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERA 265
Query: 443 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPE 502
DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIEESGMEKLLKRPE
Sbjct: 266 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE 325
Query: 503 KLRGAPESRSKDKYCRFHREHGHNTSDYWELKRQIEDLIQDGYFKKFVGKPRTSSAEKRK 553
KLRGAPE RSKDKYCRFHREHGHNTSD+WELKRQIEDLIQDGYFKKFVGKPRTSSAEK++
Sbjct: 326 KLRGAPERRSKDKYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE 385
BLAST of Moc06g15970 vs. ExPASy TrEMBL
Match:
A0A6J1DDS5 (uncharacterized protein LOC111019842 OS=Momordica charantia OX=3673 GN=LOC111019842 PE=4 SV=1)
HSP 1 Score: 619.8 bits (1597), Expect = 1.8e-173
Identity = 327/372 (87.90%), Positives = 333/372 (89.52%), Query Frame = 0
Query: 170 KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKVK-- 229
+RGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITR EFDQLRGKLDAQVEALK K
Sbjct: 28 QRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCE 87
Query: 230 ----------------------SPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAAS 289
+PIP KFKAPTVKPYDGS+DPKDYVEVFEGLMDFQAAS
Sbjct: 88 QKEGSLNDGDLGESPFTSDVLEAPIPXKFKAPTVKPYDGSRDPKDYVEVFEGLMDFQAAS 147
Query: 290 DAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATITQ 349
D IKCRAFQIALT SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATI Q
Sbjct: 148 DTIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKETATHLATIRQ 207
Query: 350 KEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPTTFVEVLQKA 409
KEGETLREYVTRFQE+QLKV HCSDDSAMCYFLTGLADEA TVKLGEEAP TF EVLQKA
Sbjct: 208 KEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAXTVKLGEEAPATFAEVLQKA 267
Query: 410 KKVIDGQELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS 469
KKVIDGQELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Sbjct: 268 KKVIDGQELLRTKTGRPERKIGRGRSGKDIERADLKSKDKGSFSSDRAGYRRAENGPTRS 327
Query: 470 RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPESRSKDKYCRFHREHGHNTSD 518
RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPE RSKDKYCRFHREHGHNTSD
Sbjct: 328 RPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD 387
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1C7X5 | 1.4e-208 | 71.35 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1DHB3 | 1.5e-194 | 52.99 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1D9E1 | 2.0e-180 | 61.40 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DS95 | 1.1e-173 | 86.36 | uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DDS5 | 1.8e-173 | 87.90 | uncharacterized protein LOC111019842 OS=Momordica charantia OX=3673 GN=LOC111019...
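The identity values in the summary table match the ratios given in each HSP header, for example 411/576 for A0A6J1C7X5. The following is a minimal sketch of how such a statistics line could be parsed and the percentage recomputed; the function name `parse_hsp_stats` and the returned dictionary layout are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = matched/aligned (pct%)".
# Fields without a ratio, such as "Query Frame = 0", are skipped.
HSP_FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return the reported and recomputed percentage for each ratio field."""
    stats = {}
    for label, matched, aligned, reported in HSP_FIELD.findall(line):
        stats[label] = {
            "matched": int(matched),
            "aligned": int(aligned),
            "reported_pct": float(reported),
            # Percentage = matched positions / alignment length, gaps included.
            "computed_pct": round(100 * int(matched) / int(aligned), 2),
        }
    return stats


# Statistics line of the top hit listed above.
line = "Identity = 411/576 (71.35%), Positives = 435/576 (75.52%), Query Frame = 0"
for label, values in parse_hsp_stats(line).items():
    print(label, values["reported_pct"], values["computed_pct"])
```

Because the function keys on whatever label precedes each ratio, it works regardless of how the second field is spelled in a given report.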