Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAAGTCCACTGAACGATGGTGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCTAAGGATTACGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTACCGGAGACTGCCAGGCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGATAAAAAAACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAAGGTGAGACGCTACGGGAGTACGTCACCAGGTTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGACGACTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAAGCTCTCACGGTGAAGCTTGGAGAGGAGGCTCCAGCCACATTCGTCGAAGTGCTACAGAAGGCGAAGAAAGTCATCGACGGGCAGGAGCTCCTCTGAACCAAAACCGGCCGACCAGAGAGAAAGATCGACTAGGGCAGAGGTGGAAAGGATACAGGAAAGGCGCATCCCAAGTCTAAGGACAAGGGATCTTTCTCCAGTGGTCGAGCTGAGTACCGAAGGGCGGAGAACGGACCCACCAAGAGCAGACCTTACGAACGCTTCACTCCGACCACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCTCAAACGACCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGATTGCTGGGAATTAAAGCGCCAAGTTGAGGATCTTACTCAAGATGGCTACTTCAAGAAATTCGTGGGGAAGCCCAGGACCAACTCGGCAGAAAAGAAGGAAGAAAGGAAGCGTTCGAGGACGCCGCCCCGGCGCATTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGCCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGATAGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACTTGTGATCGCTCGCTTGATTGATCATGTGGTGGTCAGAAGAGTACTAGTAGACGGGGGCGCGTCTGCTAACATCCTGTCCCTACCAACATATCTCGCCCTAGGATGGACAAGATCACAATTAAAGAAAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAGTCGGTCACCCCACAGGGTTGCATCGACTTGCAGGTCACATTTGGGCAAGACAAAACACAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCGTTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCTAATGGCGTGAACACAGTCTGAGGAGAACAGACCGCTTCGAGGGAATGCTACGCCTACGCACTCAAAGGGTCATCGGTATGCACCCTCGAAGAGCAAGCCAGTGGGGATGGGCCGCTTGAGTTCGAGGCCGACCTGCCGAGAAGGGAGTTTTCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAGGTAAGCATAGGAACTAAGCTGGGGGCCACTGACAGAGAGGAGCTAATCCACTTCCTTAGGTCCAACTCGGACGACTTTGCATGGTCTCATGAGGACATGCCTGGCATTGACCCGAAGATTATGACGTATCGCCTCAGCATAGACCCGTCATTCCGACCTGTAAAACAAAAAAGAAGACCTATAAACAAGGAGAGAAGTGATGTAATTGTTGAGGAAGTAAACAAACTATTGAAAGCTGAATACATAAGAGAAATTTTGTATCTCGAGTGGCTCTCTAATGTTGTATTAGTTAATAAATCTAACGGGAAGTGGAGAATGTGCGTAGACTTTACGAACTTAAACAAGGCATGCCCGAAGGACTACTTCCCACTGCCGAGGATCGATCAGCTCGTGGACGCCATAGCTGGGCACGAACTGCTCACCTTCATGGATGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCAAAATCAAGATCATACCGCATTCATAACAGACTAAGGTCTGTATTGTTACAAGGTCATGCCCTTCGGTTTAAAAAACGCAAGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATATGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTTTCCGATCTGACCGAAGCCTTCGAGGTTCTGAGGACTTATCAAATGAAGTTCAACCCAGATAAGTGTGCCTTTGGAGTCTCTTCGGAAAAATTCCTTGGCTTCATGGTGAACAACCGGGGGATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGCTCGAGATGGGAGCACCCAAGACGCTGAAGCAGCTTCAATGCCTCAATGGCAGGATTGCAGCCCTGAACCGGTTTCTTTCAAGGTCGACAGATAAGTGTCTTCCTTTCTTCAAAATCCAACGAAAGAAAGGGCCGTTTGAATGGACAGCGGAGTGCGAGCAAGCATTTCAGCAGTTGAAGAGTTACCTCTGCTCGGCACCTTTGCTCGCCAAGCCCCTGCCAGGGGACAAGCTTCAGTTGTAGATAGCAGTGTCTGACAGCGCCATCAGCTCGGCTCTAATCAGGCAAGAGGAAACGCGGCAAAGCCCAGTCTACTACACAAGCAAGGCTATGACCGAGGCCGAGACCAGATACCCTCAAATGGAAAAGTTGGCTCTCGCTTTAGTCACATTGGCCTGATGGCTCAGATCATACTTCCAAGCCCATACTGTGGTGGTGCTCACTAACTTGCCCCTAAAAAACATCTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAACTGAGTGAGTATGACATCCAGTTCGAACCTCGAACTGCGTTGAAAGGACAAGCAGTGGCAGATTTCATAGCCGAGCTAACGCCACCTTCTGAGCTGAGCGAGTCCGACCTCGCGTGGACAATCTATGTCGACGGATCCTCCAATGAGAGGGGGTGTGGGGCCGGGGTTTTCTTGCTCGCACCAGGAGGCGAGCGATTTGAATATGCCTTGCGGTTTGGCTTCCGGACTTCTAATAACGAGGCCGAGTACGAGGCACTTATTGCCGGTCTACGAATCGCTAGAGCGTTGGGGGCCTCTTGTATCAAGGTCTTCAGCAACTCCCAGCTGGTTGTGAGCCAGATCAAGGAAGAGTACCAAGCCAAAGACTCGCGAATGGAGAAGTATTTGGACAAGGTCAGATCGTACCTCGCCTAGTTCCGAACTTACGAAGTAAGCCGGGTTCCTCGAGCAGAAAATTCTAATGCTGATGTCTTGGCCAAATTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTTGAGATCTTGGATAATCCCTCGATCTGGGAGCCAGATCTGATGGAGATCGGCGCCCCAGAGCCCTCATAGATGGACCCGATTATGGACTTCATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAAGTTAGCGAGGAAGGCAGCTCGGTTCGTAGTCCGAGGTGGAGCGTTGTACCGACGCGGTTTTTCCCTGCCTCTACTGAGATGCCTAACCCCTAAAGAGGGCCTATACGTCCTCAGAGAAATCCACGAGGGAGTATGCGGCAATCACTCAGGCGCCCGGTCGCTGTCGGCCAAGGTGGTTCGACAAGGATACTATTGGCCAACTCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGAAAACGTAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTAGACATTATTGGTCCTTTCCCTTTGGGCAAAGGCCAGACCAAGTTCGCTGTGGTTGTTGTGGATTACTTCACAAAGTGGGCCGAGGCCGAGGTGCTCTCCCACATAACGGAATCCAGAGTCACGTCCTTCATATGGACAAATATCATATGTGGCTTTGGTATACCGAAGGCCATTGTGACAGACAATGGGAAGCAGTTTGACAACGCAAAGTTCAAAGATTTTTGCAGCAAGCTTGGCATAAGTCACCTTAGCTCGTCCCCTGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAATAAGATCATCAAGCGAGGCATGAAACTTAGACTGGACTCCAAGAAAGACAGGTCAGCCGAAGAGCTACCAGAGGTTCTATGGTCTTACCGGACCACCCAAAGAGAGTCGACGGGTGAGACCCCGTTCTCCCTAGCCTTCGGCTCCGAAGCTGTCGTCCCAGTTGAGATCGGCATGCCATCTGACAGAGTAGAGGATTACGAGCCCACAGCAAATGGGGAAGAGCTGCTCCTCAACCTCGACTTATTGGAGGAAAGAAGAGAAATGGCCCAGCTACGCCTGGCGGAATATCAAGGCAGAAATGTCAGGCATTACAACGCCCGCGTTCGACCTCGAACCTTCCAAGTCGGACATCTGGTCTTAAGGAAGGTCCAAACCCATGTGGGTACCCTTGACCCGAACTGGGAGGGGCCGTTTGAAGTCATGGGAATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCTTCGCGCACCCATGGAACGCGGAGCACCTGAAGCGTTATTACCCTTGAAATGCCAAAATGGTTCTCAATGGACTTGTAAAAACTTTTTCAATAGGATTATGGTTGGAATAGATGAGATGATTTAATTTCACGACTCCGAGTTCGACCAGAAATTAAATGGGGGCCACAGACTCCCACACGATCAAATTCCAGCAGTCGGTTAAAATTCAATCCTCCAAAACCTAAGGGTACGAGGTGCGATGCCAAAACCACTGATGAACTTAAAATTCAAACCTTCAAGGTAAAGGGGCGATGTGAAAAGTTCAGAATGATCAAGCCTCTGAACCTAAGGGTACGAAGTGCGATATGAAGGACAGCTACGGACTTAGGATTTTAGCCTTTTAACGTTTTCAAGTTAAGGATGTGATGTTAAAAGTCCAGAGTTGGTGCAATGCATTGAATACTATACGGAGATTCAAATTCAAAGTCTTAAGGAAAGGCGCGATCTGAACAGTAAGAGGCGCGTGTTCACCTTTTTGTGCAAGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGACCGGGTTCGAGCTATGATCAGAAAAAACATTGTTGTGCATATTCTTGCATAAACATACTTGTTTCTTATTTGAGTTTGACACTTTACAATCATTCCTGCTTGTCTTGTTACTTGTTTGCTTCCTTAGCTTCTTGGTTGCAAACTCTTGGATCATCACACTATACTTGAGAGAGATTGCATCCTTGTGAGAGAGCATCATTTTGAGCGTAAACATGAGCGGACCACATTAATTGAGTGAAGCATGTGAGGAAGTGAATTGTGAGGTCCACTTTTCTTTCATCTTTGTTTCTTTTGCAGCCATGTCTCAACAAAGAAGAAGACGCGTTGAAAATCGAAGTATAGATACAAGACTTAGAGAAGCTCTACGAGAAGAATTGGTTGAACTACGCCAATCTCTAAGCAAATTACTTAAAGATTCTATCCAAGAATGGAGATTTATGTGCCAAGACTCGAGCAAGCAAGCACACCAGCCAGAACATCAACTGGACGACCGACCACCCCAAGAAGAAGTAGATCGTCCAATGATACAAGAGGAAGATTTAAATGGTAATGTTTCAGATGAAGAAAGGAGAGCCGATGAGAGTAAAGAAATACATCTTGTTCAATATGAATTAGAGATAGAGAATGAGCAAACTGACACACTCGAAGAAGAAAGCGAGAAAGAAGAATTTGAGAGTATTATACAAAGTAGTGAAAAAAGAGTAGGCGAAATCAAGAGAGAGATTACGACAATAAGGAGGAGGAATACCACTATCCATATCAGAAGTACAAAACGAAGATGTTTCTTAAGAAATATTGCTTCAAGCTTGATAATGTATCCATTGAAGTGCAACGGTTTAGAAGGGATCCAAAAAGAGCTAGATTTTTTACTTCTCATGACTACAAATTTGCTCTCTAAAGTGACAAATTAG
mRNA sequence
ATGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAAGTCCACTGAACGATGGTGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCTAAGGATTACGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTACCGGAGACTGCCAGGCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGATAAAAAAACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAAGGTGAGACGCTACGGGAGTACGTCACCAGGTTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGACGACTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAAGCTCTCACGGTGAAGCTTGGAGAGGAGGCTCCAGCCACATTCGTCGAAGTGCTACAGAAGGCGAAGAAAGGCAGAGGTGGAAAGGATACAGGAAAGGCGCATCCCAAGTCTAAGGACAAGGGATCTTTCTCCAGTGGTCGAGCTGAGTACCGAAGGGCGGAGAACGGACCCACCAAGAGCAGACCTTACGAACGCTTCACTCCGACCACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCTCAAACGACCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGATTGCTGGGAATTAAAGCGCCAAGTTGAGGATCTTACTCAAGATGGCTACTTCAAGAAATTCGTGGGGAAGCCCAGGACCAACTCGGCAGAAAAGAAGGAAGAAAGGAAGCGTTCGAGGACGCCGCCCCGGCGCATTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGCCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGATAGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACTTGTGATCGCTCGCTTGATTGATCATGTGGTGGTCAGAAGAGTACTAGTAGACGGGGGCGCGTCTGCTAACATCCTGTCCCTACCAACATATCTCGCCCTAGGATGGACAAGATCACAATTAAAGAAAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAGTCGGTCACCCCACAGGGTTGCATCGACTTGCAGGTCACATTTGGGCAAGACAAAACACAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGAGAACAGACCGCTTCGAGGGAATGCTACGCCTACGCACTCAAAGGGTCATCGGTATGCACCCTCGAAGAGCAAGCCAGTGGGGATGGGCCGCTTGAGTTCGAGGCCGACCTGCCGAGAAGGGAGTTTTCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAGGTAAGCATAGGAACTAAGCTGGGGGCCACTGACAGAGAGGAGCTAATCCACTTCCTTAGGTCCAACTCGGACGACTTTGCATGGTCTCATGAGGACATGCCTGGCATTGACCCGAAGATTATGACTCCAGAGTTGGTGCAATGCATTGAATACTATACGGAGATTCAAATTCAAAGTCTTAAGGAAAGGCGCGATCTGAACACCATGTCTCAACAAAGAAGAAGACGCGTTGAAAATCGAAGTATAGATACAAGACTTAGAGAAGCTCTACGAGAAGAATTGGTTGAACTACGCCAATCTCTAAGCAAATTACTTAAAGATTCTATCCAAGAATGGAGATTTATGTGCCAAGACTCGAGCAAGCAAGCACACCAGCCAGAACATCAACTGGACGACCGACCACCCCAAGAAGAAGTAGATCGTCCAATGATACAAGAGGAAGATTTAAATGGTAATGTTTCAGATGAAGAAAGGAGAGCCGATGAGAGTAAAGAAATACATCTTGTTCAATATGAATTAGAGATAGAGAATGAGCAAACTGACACACTCGAAGAAGAAAGCGAGAAAGAAGAATTTGAGAGTATTATACAAAGTAGTGAAAAAAGAGTAGGCGAAATCAAGAGAGAGATTACGACAATAAGGAGGAGGAATACCACTATCCATATCAGAAGTACAAAACGAAGATGTTTCTTAAGAAATATTGCTTCAAGCTTGATAATGTATCCATTGAAGTGCAACGGTTTAGAAGGGATCCAAAAAGAGCTAGATTTTTTACTTCTCATGACTACAAATTTGCTCTCTAAAGTGACAAATTAG
Coding sequence (CDS)
ATGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAAGTCCACTGAACGATGGTGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAACCTTATGATGGGTCGAAGGACCCTAAGGATTACGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTACCGGAGACTGCCAGGCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGATAAAAAAACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAAGGTGAGACGCTACGGGAGTACGTCACCAGGTTCCAGGAGGAACAATTGAAGGTCGCACACTGCTCTGACGACTCGGCCATGTGCTACTTTCTCACCGGCCTGGCCGACGAAGCTCTCACGGTGAAGCTTGGAGAGGAGGCTCCAGCCACATTCGTCGAAGTGCTACAGAAGGCGAAGAAAGGCAGAGGTGGAAAGGATACAGGAAAGGCGCATCCCAAGTCTAAGGACAAGGGATCTTTCTCCAGTGGTCGAGCTGAGTACCGAAGGGCGGAGAACGGACCCACCAAGAGCAGACCTTACGAACGCTTCACTCCGACCACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTCCTCAAACGACCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGATTGCTGGGAATTAAAGCGCCAAGTTGAGGATCTTACTCAAGATGGCTACTTCAAGAAATTCGTGGGGAAGCCCAGGACCAACTCGGCAGAAAAGAAGGAAGAAAGGAAGCGTTCGAGGACGCCGCCCCGGCGCATTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGCCAGTCCGGACAGAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGATAGTGCAGACTTGGAGGAGGTCCACCTACCCCACAATGATGCACTTGTGATCGCTCGCTTGATTGATCATGTGGTGGTCAGAAGAGTACTAGTAGACGGGGGCGCGTCTGCTAACATCCTGTCCCTACCAACATATCTCGCCCTAGGATGGACAAGATCACAATTAAAGAAAAGCCCGACACCGCTAGTTGGGTTCTCTGGAGAGTCGGTCACCCCACAGGGTTGCATCGACTTGCAGGTCACATTTGGGCAAGACAAAACACAGGTCACCCAAATGGCCGAGTTCGTGGTGATTGACGGAGAACAGACCGCTTCGAGGGAATGCTACGCCTACGCACTCAAAGGGTCATCGGTATGCACCCTCGAAGAGCAAGCCAGTGGGGATGGGCCGCTTGAGTTCGAGGCCGACCTGCCGAGAAGGGAGTTTTCCGCGCCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAACAGGTAAGCATAGGAACTAAGCTGGGGGCCACTGACAGAGAGGAGCTAATCCACTTCCTTAGGTCCAACTCGGACGACTTTGCATGGTCTCATGAGGACATGCCTGGCATTGACCCGAAGATTATGACTCCAGAGTTGGTGCAATGCATTGAATACTATACGGAGATTCAAATTCAAAGTCTTAAGGAAAGGCGCGATCTGAACACCATGTCTCAACAAAGAAGAAGACGCGTTGAAAATCGAAGTATAGATACAAGACTTAGAGAAGCTCTACGAGAAGAATTGGTTGAACTACGCCAATCTCTAAGCAAATTACTTAAAGATTCTATCCAAGAATGGAGATTTATGTGCCAAGACTCGAGCAAGCAAGCACACCAGCCAGAACATCAACTGGACGACCGACCACCCCAAGAAGAAGTAGATCGTCCAATGATACAAGAGGAAGATTTAAATGGTAATGTTTCAGATGAAGAAAGGAGAGCCGATGAGAGTAAAGAAATACATCTTGTTCAATATGAATTAGAGATAGAGAATGAGCAAACTGACACACTCGAAGAAGAAAGCGAGAAAGAAGAATTTGAGAGTATTATACAAAGTAGTGAAAAAAGAGTAGGCGAAATCAAGAGAGAGATTACGACAATAAGGAGGAGGAATACCACTATCCATATCAGAAGTACAAAACGAAGATGTTTCTTAAGAAATATTGCTTCAAGCTTGATAATGTATCCATTGAAGTGCAACGGTTTAGAAGGGATCCAAAAAGAGCTAGATTTTTTACTTCTCATGACTACAAATTTGCTCTCTAAAGTGACAAATTAG
Protein sequence
MGKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAKKGRGGKDTGKAHPKSKDKGSFSSGRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHNDALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTPQGCIDLQVTFGQDKTQVTQMAEFVVIDGEQTASRECYAYALKGSSVCTLEEQASGDGPLEFEADLPRREFSAPTEELELVPLLSPEKQVSIGTKLGATDREELIHFLRSNSDDFAWSHEDMPGIDPKIMTPELVQCIEYYTEIQIQSLKERRDLNTMSQQRRRRVENRSIDTRLREALREELVELRQSLSKLLKDSIQEWRFMCQDSSKQAHQPEHQLDDRPPQEEVDRPMIQEEDLNGNVSDEERRADESKEIHLVQYELEIENEQTDTLEEESEKEEFESIIQSSEKRVGEIKREITTIRRRNTTIHIRSTKRRCFLRNIASSLIMYPLKCNGLEGIQKELDFLLLMTTNLLSKVTN
Homology
BLAST of Moc03g00750 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 875.2 bits (2260), Expect = 4.5e-250
Identity = 451/504 (89.48%), Postives = 460/504 (91.27%), Query Frame = 0
Query: 2 GKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 61
G+LDAQVEALKAKCEQKE PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD
Sbjct: 27 GQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 86
Query: 62 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSR 121
YVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYRRLP SISTYSQLRREFLA FSSR
Sbjct: 87 YVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYRRLPAXSISTYSQLRREFLAXFSSR 146
Query: 122 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 181
HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
Sbjct: 147 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 206
Query: 182 GEEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKGSFSS 241
GEEAPATF EVLQKAKK GR GKD A PKSKDKGSFSS
Sbjct: 207 GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIENADPKSKDKGSFSS 266
Query: 242 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 301
GRAEYRRAENGPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
Sbjct: 267 GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 326
Query: 302 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 361
KYCRFHREHGHNTSD WELKRQ+E+L QDGYFKKFVGKPRT+SAEKKEERKRSRTPPRR
Sbjct: 327 KYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRT 386
Query: 362 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHN 421
DRPAVINTIFGGPSGGQSG+KRKELARAAR EVCIIREQRPTCPITFD ADLEEVHLPHN
Sbjct: 387 DRPAVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHN 446
Query: 422 DALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTP 481
DALVIA LIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESV P
Sbjct: 447 DALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIP 506
Query: 482 QGCIDLQVTFGQDKTQVTQMAEFV 485
+G IDL VT GQD+TQVTQMAEFV
Sbjct: 507 EGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc03g00750 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 765.4 bits (1975), Expect = 5.1e-217
Identity = 430/603 (71.31%), Postives = 444/603 (73.63%), Query Frame = 0
Query: 2 GKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 61
GKL+AQVEALKAKCEQKE PLNDGDLGESPFTSDVLE APTVK YDGSKDPKD
Sbjct: 30 GKLNAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE--------APTVKSYDGSKDPKD 89
Query: 62 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSR 121
YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 90 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW-------------------------- 149
Query: 122 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 181
FQE+QLKVA SDDSAMCYFLTGLADEALTVKL
Sbjct: 150 ---------------------------FQEDQLKVAQSSDDSAMCYFLTGLADEALTVKL 209
Query: 182 GEEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKGSFSS 241
G+EAPATF EVLQKAKK GR GKD KA KSKDKGSFSS
Sbjct: 210 GKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDE-KADLKSKDKGSFSS 269
Query: 242 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 301
GRAE+RRA NGPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KD
Sbjct: 270 GRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRNKD 329
Query: 302 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 361
KYCRFHREH HNTSD WELKRQ+EDL QD YFKKFVGKPRT+SAEKKEERK SRTP RRI
Sbjct: 330 KYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLSRTPLRRI 389
Query: 362 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHN 421
DRPAVINTIFGGPSGGQSG KRKELARAAR EVCIIREQRPTCPITFDSADLEEVHLPHN
Sbjct: 390 DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHN 449
Query: 422 DALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTP 481
DALVIA LIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV P
Sbjct: 450 DALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGFSRESVIP 509
Query: 482 QGCIDLQVTFGQDKTQVTQMAEFVVID--------------------------------- 541
+GCIDL VT G D+TQVTQMAEFVVID
Sbjct: 510 EGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP 569
Query: 542 -------GEQTASRECYAYALKGSSVCTLEEQASGDGPLEFEADLPRREFSAPTEELELV 544
GEQ ASRECYA ALKGSSVC LE S DG LEF+A+LPRREF+APTEELELV
Sbjct: 570 NGVGMVRGEQIASRECYASALKGSSVCALETLVSRDGTLEFKANLPRREFAAPTEELELV 570
BLAST of Moc03g00750 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 747.7 bits (1929), Expect = 1.1e-211
Identity = 406/576 (70.49%), Postives = 438/576 (76.04%), Query Frame = 0
Query: 3 KLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDY 62
K DAQVEALKA+CE+KES +DGDLGE F+SD+LEA IPPKFK PT+KPYDGSKDPKDY
Sbjct: 88 KFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDY 147
Query: 63 VEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSRH 122
VEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFSSRH
Sbjct: 148 VEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRKEFISQFSSRH 207
Query: 123 YDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG 182
YD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL
Sbjct: 208 YDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLR 267
Query: 183 EEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKG-SFSS 242
EEAPATF EVLQK KK GR GKD GKA KS+DKG S SS
Sbjct: 268 EEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSS 327
Query: 243 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 302
R +YRR+ + +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ D
Sbjct: 328 SRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTD 387
Query: 303 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 362
KYCRFHR+HGHNTS+ WELKRQ+EDL QDGYFKKFVGKPR+NS EKKEERKR RTPPRR
Sbjct: 388 KYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRD 447
Query: 363 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHN 422
DRPAVIN K+KELAR AR EVCIIREQRPT I F+ ADLE VHLPHN
Sbjct: 448 DRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFNHADLEGVHLPHN 507
Query: 423 DALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTP 482
DALVIA LID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES++
Sbjct: 508 DALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESISL 567
Query: 483 QGCIDLQVTFGQDKTQVTQMAEFVVID--------------------------------- 517
+GCIDL V+ QD TQVTQMAEFVVID
Sbjct: 568 EGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTL 627
BLAST of Moc03g00750 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 718.0 bits (1852), Expect = 9.2e-203
Identity = 371/422 (87.91%), Postives = 377/422 (89.34%), Query Frame = 0
Query: 18 KESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD 77
K+ LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 78 AIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 137
AIKCRAFQIALTGSARLWYRRLP RSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 138 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAK 197
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TF EVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 198 K---------------------GRGGKDTGKAHPKSKDKGSFSSGRAEYRRAENGPTKSR 257
K GR GKD +A PKSKDKGSFSSGRAEYRRAE+GPTKSR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 258 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 317
PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 318 WELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGG 377
WELKRQ+EDL QDGYFKKFVGKPRT+SAEKKEERKRSRTPPRR DRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 378 QSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHNDALVIARLIDHVVVRR 419
QSG KRKELARAAR EVCIIREQ PTCPITFD AD EEVHLPHNDA VIA LIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
BLAST of Moc03g00750 vs. NCBI nr
Match:
XP_022156542.1 (uncharacterized protein LOC111023421 [Momordica charantia])
HSP 1 Score: 679.1 bits (1751), Expect = 4.8e-191
Identity = 352/408 (86.27%), Postives = 362/408 (88.73%), Query Frame = 0
Query: 2 GKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 61
G+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKD
Sbjct: 38 GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKD 97
Query: 62 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSR 121
YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQLRREFLAQFSSR
Sbjct: 98 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQLRREFLAQFSSR 157
Query: 122 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 181
HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
Sbjct: 158 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 217
Query: 182 GEEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKGSFSS 241
GEEAPATF EVLQKAKK GR GKD +A PKSKDKGSFSS
Sbjct: 218 GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSS 277
Query: 242 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 301
GRAEYRRAENGPT+SRPYERFTPTTIPI EILTNIEESGMEKLLKRPEKLRGAPERRSKD
Sbjct: 278 GRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKD 337
Query: 302 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 361
KYCRFHREHGHNTSD WELKRQ+EDL QDGYFKKFVGKPRT+SAEKKEERKRSRTPPRR
Sbjct: 338 KYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRT 397
Query: 362 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFD 389
DRPAVINTIFGGPSGGQ G KRKELARAAR E+ +E R + D
Sbjct: 398 DRPAVINTIFGGPSGGQLGHKRKELARAARRELTRNQENRSQLGLIID 445
BLAST of Moc03g00750 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 875.2 bits (2260), Expect = 2.2e-250
Identity = 451/504 (89.48%), Postives = 460/504 (91.27%), Query Frame = 0
Query: 2 GKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 61
G+LDAQVEALKAKCEQKE PLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD
Sbjct: 27 GQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 86
Query: 62 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSR 121
YVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYRRLP SISTYSQLRREFLA FSSR
Sbjct: 87 YVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYRRLPAXSISTYSQLRREFLAXFSSR 146
Query: 122 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 181
HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
Sbjct: 147 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 206
Query: 182 GEEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKGSFSS 241
GEEAPATF EVLQKAKK GR GKD A PKSKDKGSFSS
Sbjct: 207 GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIENADPKSKDKGSFSS 266
Query: 242 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 301
GRAEYRRAENGPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD
Sbjct: 267 GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 326
Query: 302 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 361
KYCRFHREHGHNTSD WELKRQ+E+L QDGYFKKFVGKPRT+SAEKKEERKRSRTPPRR
Sbjct: 327 KYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRT 386
Query: 362 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHN 421
DRPAVINTIFGGPSGGQSG+KRKELARAAR EVCIIREQRPTCPITFD ADLEEVHLPHN
Sbjct: 387 DRPAVINTIFGGPSGGQSGRKRKELARAARREVCIIREQRPTCPITFDGADLEEVHLPHN 446
Query: 422 DALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTP 481
DALVIA LIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSGESV P
Sbjct: 447 DALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVIP 506
Query: 482 QGCIDLQVTFGQDKTQVTQMAEFV 485
+G IDL VT GQD+TQVTQMAEFV
Sbjct: 507 EGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc03g00750 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 765.4 bits (1975), Expect = 2.4e-217
Identity = 430/603 (71.31%), Postives = 444/603 (73.63%), Query Frame = 0
Query: 2 GKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 61
GKL+AQVEALKAKCEQKE PLNDGDLGESPFTSDVLE APTVK YDGSKDPKD
Sbjct: 30 GKLNAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLE--------APTVKSYDGSKDPKD 89
Query: 62 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSR 121
YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW
Sbjct: 90 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLW-------------------------- 149
Query: 122 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 181
FQE+QLKVA SDDSAMCYFLTGLADEALTVKL
Sbjct: 150 ---------------------------FQEDQLKVAQSSDDSAMCYFLTGLADEALTVKL 209
Query: 182 GEEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKGSFSS 241
G+EAPATF EVLQKAKK GR GKD KA KSKDKGSFSS
Sbjct: 210 GKEAPATFAEVLQKAKKVIDGQELLRTKTGRPERGIDRGRSGKDE-KADLKSKDKGSFSS 269
Query: 242 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 301
GRAE+RRA NGPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERR+KD
Sbjct: 270 GRAEFRRAVNGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRNKD 329
Query: 302 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 361
KYCRFHREH HNTSD WELKRQ+EDL QD YFKKFVGKPRT+SAEKKEERK SRTP RRI
Sbjct: 330 KYCRFHREHDHNTSDRWELKRQIEDLIQDDYFKKFVGKPRTSSAEKKEERKLSRTPLRRI 389
Query: 362 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHN 421
DRPAVINTIFGGPSGGQSG KRKELARAAR EVCIIREQRPTCPITFDSADLEEVHLPHN
Sbjct: 390 DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDSADLEEVHLPHN 449
Query: 422 DALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTP 481
DALVIA LIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS ESV P
Sbjct: 450 DALVIAPLIDHVVVRRVLVDEGVSANIVSLLTYLALGWTRSQLKKSTTPLVGFSRESVIP 509
Query: 482 QGCIDLQVTFGQDKTQVTQMAEFVVID--------------------------------- 541
+GCIDL VT G D+TQVTQMAEFVVID
Sbjct: 510 EGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTP 569
Query: 542 -------GEQTASRECYAYALKGSSVCTLEEQASGDGPLEFEADLPRREFSAPTEELELV 544
GEQ ASRECYA ALKGSSVC LE S DG LEF+A+LPRREF+APTEELELV
Sbjct: 570 NGVGMVRGEQIASRECYASALKGSSVCALETLVSRDGTLEFKANLPRREFAAPTEELELV 570
BLAST of Moc03g00750 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 747.7 bits (1929), Expect = 5.3e-212
Identity = 406/576 (70.49%), Postives = 438/576 (76.04%), Query Frame = 0
Query: 3 KLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDY 62
K DAQVEALKA+CE+KES +DGDLGE F+SD+LEA IPPKFK PT+KPYDGSKDPKDY
Sbjct: 88 KFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDY 147
Query: 63 VEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSRH 122
VEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFSSRH
Sbjct: 148 VEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRKEFISQFSSRH 207
Query: 123 YDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLG 182
YD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKL
Sbjct: 208 YDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLR 267
Query: 183 EEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKG-SFSS 242
EEAPATF EVLQK KK GR GKD GKA KS+DKG S SS
Sbjct: 268 EEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSS 327
Query: 243 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 302
R +YRR+ + +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ D
Sbjct: 328 SRVDYRRSNSSHNQSRPYEHYTPTTIPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTD 387
Query: 303 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 362
KYCRFHR+HGHNTS+ WELKRQ+EDL QDGYFKKFVGKPR+NS EKKEERKR RTPPRR
Sbjct: 388 KYCRFHRDHGHNTSNYWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRD 447
Query: 363 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHN 422
DRPAVIN K+KELAR AR EVCIIREQRPT I F+ ADLE VHLPHN
Sbjct: 448 DRPAVIN-------------KKKELAREARREVCIIREQRPTSSIAFNHADLEGVHLPHN 507
Query: 423 DALVIARLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVTP 482
DALVIA LID V+VRR+LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSGES++
Sbjct: 508 DALVIAPLIDLVLVRRILVDGGASANILSLSTYLALGWTRSQLKKSPTPLVGFSGESISL 567
Query: 483 QGCIDLQVTFGQDKTQVTQMAEFVVID--------------------------------- 517
+GCIDL V+ QD TQVTQMAEFVVID
Sbjct: 568 EGCIDLPVSIRQDDTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTL 627
BLAST of Moc03g00750 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 718.0 bits (1852), Expect = 4.5e-203
Identity = 371/422 (87.91%), Postives = 377/422 (89.34%), Query Frame = 0
Query: 18 KESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD 77
K+ LNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 78 AIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 137
AIKCRAFQIALTGSARLWYRRLP RSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 138 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFVEVLQKAK 197
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TF EVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 198 K---------------------GRGGKDTGKAHPKSKDKGSFSSGRAEYRRAENGPTKSR 257
K GR GKD +A PKSKDKGSFSSGRAEYRRAE+GPTKSR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 258 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 317
PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 318 WELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRIDRPAVINTIFGGPSGG 377
WELKRQ+EDL QDGYFKKFVGKPRT+SAEKKEERKRSRTPPRR DRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 378 QSGQKRKELARAARCEVCIIREQRPTCPITFDSADLEEVHLPHNDALVIARLIDHVVVRR 419
QSG KRKELARAAR EVCIIREQ PTCPITFD AD EEVHLPHNDA VIA LIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
BLAST of Moc03g00750 vs. ExPASy TrEMBL
Match:
A0A6J1DS95 (uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023421 PE=4 SV=1)
HSP 1 Score: 679.1 bits (1751), Expect = 2.3e-191
Identity = 352/408 (86.27%), Postives = 362/408 (88.73%), Query Frame = 0
Query: 2 GKLDAQVEALKAKCEQKESPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKD 61
G+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKD
Sbjct: 38 GELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKD 97
Query: 62 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPGRSISTYSQLRREFLAQFSSR 121
YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQLRREFLAQFSSR
Sbjct: 98 YVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQLRREFLAQFSSR 157
Query: 122 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 181
HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL
Sbjct: 158 HYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKL 217
Query: 182 GEEAPATFVEVLQKAKK---------------------GRGGKDTGKAHPKSKDKGSFSS 241
GEEAPATF EVLQKAKK GR GKD +A PKSKDKGSFSS
Sbjct: 218 GEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERADPKSKDKGSFSS 277
Query: 242 GRAEYRRAENGPTKSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKD 301
GRAEYRRAENGPT+SRPYERFTPTTIPI EILTNIEESGMEKLLKRPEKLRGAPERRSKD
Sbjct: 278 GRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPEKLRGAPERRSKD 337
Query: 302 KYCRFHREHGHNTSDCWELKRQVEDLTQDGYFKKFVGKPRTNSAEKKEERKRSRTPPRRI 361
KYCRFHREHGHNTSD WELKRQ+EDL QDGYFKKFVGKPRT+SAEKKEERKRSRTPPRR
Sbjct: 338 KYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRT 397
Query: 362 DRPAVINTIFGGPSGGQSGQKRKELARAARCEVCIIREQRPTCPITFD 389
DRPAVINTIFGGPSGGQ G KRKELARAAR E+ +E R + D
Sbjct: 398 DRPAVINTIFGGPSGGQLGHKRKELARAARRELTRNQENRSQLGLIID 445
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022137317.1 | 4.5e-250 | 89.48 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022150760.1 | 5.1e-217 | 71.31 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022152854.1 | 1.1e-211 | 70.49 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022150613.1 | 9.2e-203 | 87.91 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
XP_022156542.1 | 4.8e-191 | 86.27 | uncharacterized protein LOC111023421 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 2.2e-250 | 89.48 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1D9E1 | 2.4e-217 | 71.31 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DHB3 | 5.3e-212 | 70.49 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9W7 | 4.5e-203 | 87.91 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DS95 | 2.3e-191 | 86.27 | uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023... | [more] |
Match Name | E-value | Identity | Description | |