Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
CCAAATTTATGGGGGCGCAATTTTTGGTAATAAAAGCCCTAGGTCATACGCTCAAGAACAGCAGTGGAATTTAGCTGTAATTCAGAAGTCCTCTCTTCAGACATTTCTTACAAATCACTGAAGGACACACGGGATCTTTGGTCACTTGTCGGAGTCGCTATCCAAATCCATCGACATTCGAACCAAAATCCAACATCCCAACTTCTTTTCCTCTTCCCTTTCTCCTTCTGCAACCTGTTCAATCATCGGAATATTGCCGGAAAAACTCTGCATTCCGTTTTGGATTGCCTGAATTACTGACACTTGCGGCTCTTTTCTGATTTCAAACGATCGTCTCCGTCGGAAACCCATGTCTCGGAATTTTGCCGGATTTTCGGTTTGTGGGTTTTCAGGTGTGTTTACTTCTGTTACTGGAGATGTGTGTTTTTTTTTTAAACCCTTTGTTCATTATGTACCTTCAAGTGTGTTGATCTTTCTGCTCTGGGTCACTTTCGTATTTATTGTATTGGTTTTTTAAACCCTTTTGAGGTTTTTGTTTTTTGTGGGTTCGATTTCGGAGTTGCGGTTCACGATCGTTGATTTTGTTTCTGGTCCAGCTAATTTGATATACTTTGACTTCTGTTAGATGATTTGCTTTGTAGTTTACCCAACCGATATCCTTCTTATTTTACGTTGTTTGTTTATGCCCTTTCCTTAAATGGGTATTTACTCCATGTATATTCTGTCAAGGTTTTTTATATTTATATCATCTATATCCTGTGTAGTTTTCCACCGAAACGTATGCGATAAACTTTCAAAGTTGAGTTTTTCAGGTTGCCTGTGACTTATGTTTGCTTGGATTTTTTGATCTCCCAGGTTTCATTTTCTTAAATGCTGCATTATATTGGAGATAACAAGGAAGAGGAAATGAAGATGAGGAACAAATACAGGAAATCAACCACTTTACGTTGCAATGCAGGGAGTAGATGTTTCATATCTGTGATAATAGGGAGTCTAGTGGGGTGTATTCTTATACTACATATATTTTCTCCTATAAGCCGCAAGGATGAGATAGTTCGGGGCATCGAACTTCAAACAAGTCACCACCTTCGCTTCCGTGAACTTGAAGAGGTAGATGAGGAAAACATTCAAATTCCCCCTCCAAGGGGTAAGAGATCCCCACGTGCAGCGAAGCGAAGACCAAAGAAAACAACCACGCTAATTGATGAATTTCTTGATGAAGATTCACAGATTAGACACAAGTTCTTTCCTGATCATAAAACTTCCGTTGATCCAATGAAAACAGGAAATGATAGTATGTACTATTATCCAGGGAGAGTTTGGCTGGATACTGAAGGGAATCCCATTCAAGCTCATGGAGGTGGAATTTTGTTCGATGAAAGATCTGAATCATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCACGCTCACAAAAAGGGAGCTGCACGGGTAAAAGCATCAATACTCTGCCTTTCTGACCTCCATTTTTTGTAATGTTAGACTAGTCTAATGCTTCCATAAATAAATATAATTCCTATCACATAAACTGTTGGCATGCACCTATACAGATTTGTATGGAAACAACACAGTTTGATTAATACTGCATCCTGTTTTCTTCCATTTTCATTAGTAGCAATAGTTTATTCTTTATTACATGGACTTAAAAAATTTCAATTATTGCTTCGTGGTGGTTTTCTCTGACTTTATGTCTTTACCTTTTTAAATATAAAGCATCGGCTTGAGTGATCATAATATGTTGAATCCTTCTAGTTCATTTTTTACCTTATTCCACTCTCGATAAGAAATTCAGTCATCATTGCTGCATTACAATTAAATCGCTAGGTTCAAGTCCAACTTATGAATCTTATTAGCTTTTGAGATGGTGTGTGATTCTTGAGTGATAAGGTCTTGGAGGATGACTGAGGAAATTTTAAGGAATTGTATTAGTTTTTTCTGTTGCCACCTTGAAATTGGAACACTGAAGAAAAAGAAAAGAAAAGAAAAGAAATTAGTACACTGAAGAATGTCACAAAAACAGTAGAGAGAAAGGACACCAGAAAAATAAATTTGAGGCACTTTTCACCATTGCACCTACACAAACTGCCCGAGTAAGGGAAGAGGGGGGAATGTAAATTTCTACTCTGAATTCTATCATTCATGTTAGTACCAGTTAGCTTTATTTGGCATTTTTGCCCCCATATCATCCTACGAAGATCTCAGGAATGAAGGCTTAGTGGCTGAGATTTACTAGGAAAATTAACCTATTTCTTCAGGGACCAAATTCTGTAATCCAAGGACAAATGACGTACAAGAGAATGTAGTTTTTGTGAATTTTCAATCTTTTTATCCTGAAAGGGTCTGCCATAAGGACCATTGCTTTCTCAACTGCATTCCAGCAGTCCCCCACTCTTTTATGTTTGCTGGTGGCATCTCAAGTTGTGAAGTTTTAGAAACTGCACAAAAGCTTTCTTTATCTTTAAATAGTCTCTCCAGCCAAATGCTATCCTCACTTGAAATTGACATCGAGGTCCAGACCCGGAGGAAATGAACCTTAGATCAGGGTCTCTAGTATGAAACTTCTGAAAGTACCGGACATCTGTCTCAAGGTGCACCTCATTAGAAAAGATAATGCCTGTTTCTTTTTGCATGCTAGTGATGAGGCTTTTCTCTTGCTTGCAGAATAAATTTTATGGAAGAAGTTTGAATTTTTGCCTCCTCCTTCAAGCCATTGTTCGTTTAGTTTCTGTCCCCATCTAATGTCTTATTTTAACTCTAAAACTTCTAATGAGCATATAAGCTCGGTGCTTTTAACTGTATTGCCCGTATGAAAAAGTCTATAGATATAACATTTGTCCTAAGTAATGCATAAGTTTATATTCCTGAGGTTTGCATTTGCATTCTTTTCAATTAAAGTAGGACTGTAGAGCTTCCTGAATATACTATGTGACTGCATTAGCTTTGCTATATTCCCCCATGAGGATCAGAATAGAAAAGAATGATAGGGAGGGAAGAGAGCAAATTAAATGGGGAGTGGTAAAAAAGTACACGTGCTTATCAAAATTCTAAAACATAGTTGATGAGTGACCAATATATTAATATAAACAAAGGTTATCAAGATCTGAGGGTAAATTTGGCACCTTCAATTTTAGCATGATGCTGAGCTCTTTCTCTCTACGCTTATCAAATGGCCCTTGGAAATATATTCTCAAGCACCAAAATCTCATTTACTCGTCTCAATAGGTTAGTTCCTTTCCCGTTTCACTCCTTTTGGGGCATTGTATTGACCCTTTTGTACCTTTTTATTCTCTCACAAAAAACTTTTTCGATTGAAAAAGAAAAGACCTCCTTTACTCTAGAAGTCTAGAATAAAGGTGGCAGTGGCGTGCTTCAACACAGTCTTGGGAAGATCATTGGTTGAGCAATGCCCCCTTTTCAAAGGCGTTTCCAAGACATTATTCTTTATCCAACCGAAACCATCATCTAGAAGAAACGTCATTGGAAGATAAAATCTGAGTGGACGTCCCTCTGACAAATTCCCAACCAAATGACCTCATCCATTTCCATACAAAATGATATGGAGAGTTCACTGTGCGAAGAAAGTCAGAATTTTCGTATGGGAGCTTAGCCACAAGAGTCTAAACACCCATGACAAAGATCCAAAGGTTCCCCCCATGTGCTTTTGCCACTGTGGTGTACTCTTTGCAAGAAAGAATCAGAAATTGAAGACCATCTCTTTCTCAACTGCTCTTTTGCTCTGAAGTTTCTTGCTTTACACATTTTATTGGCATTTGACCCTCCCCAAGGCCCATTGACACAAACACCCTTCTCCTATGATTCTAGTGGGGCATCCCATAAAAAAGACAAGAAGGTTCTATGTTAGAGCCTTTTTCTGGACTTTTTGCCTTGAAAGAAACATTAAGAGCTTCAACAACAAAGAAAAAAAATACTTTGATTGGTTTGTTGAATCTTTTACCCTTTTTGCTCTTTCTTGGTGGAAACTTTGAGTGCATGCGTTAACACTCTTTTTGGCTAATTGAAAGAATTTTTTGTATCTCCCCTCGGCATGGTATTCAGCTTTCTTTCTCCTCTTTTATAATTTCATATCTTCAATGAAATTGTTTTGTCAGAAAGAAATAAAAATCTTAATATAATCTATAGTGATAGCCTAGGTAAGAATTAGAGTAATAGAATACTAAAGTTATAATACTTAGAAAGAATGATTCTCCTGAGGGATATAAATTCTGAATAATCTCAAGTACAATCATGGTATATTGATCTTTTTTTTGAGAATATTACATCATAAATATGGGGTGGGAATTCGACTTTATGACCATTTAGGAGAGGTTGAGATGCTTTAATCATTGAGCTATCAATTAGGTGCCTATTGATCTTTTAAAGTGTAAATAAGTTGAGAAATTCAAAAAACATTTGCGGTCAGTATCTTCAATCCCATCACATCATTGAACGTTTGGTCTGATAAGAATCAGACAAACTACTTGTAGAAGCTTTTTGCCATGGGATGAAAGTTTTTTCTGGAACTAAAATGTCGGAAAGATGTTTGGTGAATTCATTGGTCTAGAGGTTATTTGCTTATTTGATATCTTTTGTACTTCTTGGATATCTGTTTTGTATTATAACTAGATTGTTGATTTATCTTAAATGAAAAATGGGAAGATGATTCCACTGTTCTGAAGTAGGTTGTGGCCGAGCTCTCCCAATATTTATCCAAACTAACTCCCGTCTCTCTTGACTCACCTTACTTCTGTCTAATGGGATCAATTCTCCCTCTCCCTCTGCCGATAACTTTCCCTTTCTGTTGTCTGGACTCTGCACTGGTACCTCACCATGCCAACACTCAAGCTGTAAGTGCTGCTACACTGAGTGCAAACATTTTCCTGTTGATTTTGCCAGAAGTGGAAATAAAGTCAGACTTGAGAGAAACAACTTGAGCCTACATTTCTAGTACCAGTCTGGAACCTTGATCATTAAGAATTGCTCATCACCCCCTCCATCATCATACTCTTTACAAAAAAATCATAACAAATGACTGTTTCCTTTGGCTCCAAGAATCTAAGAGAAAGCATGATTTAGTTGTAAAAATGACAGATATTCAACAATAGAAGGAAGAGGAACCTCCTCATTCCCACCGGAAGTCACTTTCAAGGACGCAAAGATTTTCCCAACGGTCTTCAATTTCCAAACTTCCTTGTTTTCCACAGAAAATCTCAAGTCTCCCTTCTGATAAACAAGCAAATCAGATTTTGTATTATAAAGAAGTATTGGCTGGCTTGCCTGCTATCAGTTCTAAATCAAGCTTGACCTCACAGTTCTAAACCAACCTTTGTTGGAGGAGAACCATTGTTGTGACAAAAAGCAATACTCAGGATGATTGGAGCAAGATCGTAAATGATAGAGAATCCACCAACTCCATCTGCTCCTATTCCTTTTCAACCAGACAAAGTCCATTCTCCTATATCCTTGTGATTTTGAGGCTAACATTACGTGCTACAACAGCTTGTGGTGTACGGCTATACCCTTTACATTGGAGGTTGGAAAAGTGGAATAAGAAAATCCATAGCAAACATTCACCTGCACCCTTTTATGATAGATCGATAAGAATCCACAACCTTCCCCGAGTTAGTGGGCCACCAAGATTTTAAATCAGATTGGAGATAAATGTGGGAGCTTCATCGAATACTCCACTCAAACCGTTAAACCTTTTAGAATATATGGAAGCTTGTATTGAAGTTCAATGTAACTTTTATGGCTATATGCTGACTGAAATCAAGCTAAACAATGGAAAGAGAAGCTTTTCTGTCCATATCAGAAATCAGCGTGATCCACGATTTTATGCTTCTCATGTGTAAGGGTCTGATGTCTATGGCTGCATCTCTCCAGGTGGCAGTTGCAGGATTTTTTGGAGAAGGTGACTCGATTTACCCAATGGCCAAATAGAGGCTGCAAGACCCTTCCTTTTTTTTATCCTAGTTATTAGCCCTGGCCACTTATGCCCTTCCGAAGAAACGAACAAAGCCAAAAAGTTTAAACCTGTTAGTTTCCATAACGATCTGCTCTCCTCTCCTTTAATGTTCACCCATCTACTGAAATCTCCTTATGCGAACCAAATGCGTGTCCCACCTCCAACCAAAACGAAAATGATAACCTAATTCCTCTTCAAATTGCACATCTCCTTTTCAGAGCAAAACATTTCTTCAGGCTCCACCCTTAACCTAACCCAAAAGGACAACCAAAGCCCCGTTTGAAACTTAAGTACCTTTCTTACCCTATCTGCCGTGCCCCCTCTGCCTTTGTTCCTGAAGCAAACTCCAACAAGCCTCTAGGCCCCTCTTTAGACCCAAAGCTGGGCCTCTAAGCATGAGAGATTTACGTCAGGCCTGTCCCTTGCCGCTCTTGACCAATTTCGAACCTCAACTCTCGGCCTAAACCCAAGCAAAGAAGGGCTTTTGTTTGCTGAAGAGATTCGAACTCATACTTCCATTTGCAACCATGGGTCGAAGCCCAAGGGATACTGTCACTTACCCACCCTTCCTCTAGCCAACCAACAAAATAACTCCATTGCTGCATTCTTAAAACCCAAGCTACTCCTTTTGCTGCTGCCTGGGCCTTTCTAATCACCTAAGGGACATACCAGTTCCCGAAATGGAGCTGTCCAAGAAATGCAGCAAAAAGTGTGTGGCGAACTCATTTCTTGTAGCAACTATGAGAGAAGTTCACACTGATTTGCAACTTCCTGCATCCAGCATATTCCATACTGATTTAGTTTACAACAATAGTGGTTCAGCGTTGTGAGACAGAGTGAGCCATCGGATCAAACTGATTCTCAAGTTGACTCCATCTTGATTGCCAATGAGAATCTTTTTGGCATCCAAATTGGAAATTGTTTTCACTAGCTGCTTCAGTTCTACTTGGGTTATTGACAGTCCTTTGAGATACACCTTGTCTAGACTTCTCCAGATTTCTGGCCATAAACTTGCTAGGGTGGGTGATTGTTGGTCCATGAGAAGTAGATCCTGCTGGTTTCTTATCACCCGCATTTGCATCTCCTCTTGCATTTAAGCAAGTCTCAGTTTCGTATTTAAAAAAAAAATGTAATATGTCAGTCAAAGATATGGTTTTAGCACTTTTCCTTTGAATGTGATAATTGTTCTGAGGAGTTGGAAGGTTGTCAGAATAATTGATATTAAGTTGTACAACTTGAATAATTTGGAGCGTGACATTCTTATGATCTAATTATCTTATAACCATCAATTGTTTTGTTGTGACTTGCTTTGAGATTGGATATTTTGTAGTCCTATGCTATTACTAGCTTCAATACAATCACATTTTCTGTCGTCCTTTTGTCCATCACGTGAATCAGGTCCTTTGTTTTAATATTATTACCATTAACTTTTGCTGTCTTCTATTTCACTAACATAATGTATTCTCTACATGGACAACAAATTAGTATCCGAAACTCGACTAAAATATTTTAGTGTTCTTATGATTAGGTCGACATTATAGGAGTTGGTTGCTACTCCTCCAAAGACCTGTGGACATGGAAAAATGAAGGCATTGTTTTGACAGCGGTAGAAACAAATGAGACCCATGATCTTCACAAATCCAATGTACTCGAGAGGCCTAAAGTTATCTACAATTCGAGGACGGGAAAATACGTAATGTGGATGCATATAGACGATGCGAATTATACAAAAGCTTCCGTGGGTGTTGCCATCAGCGATTACCCCACCGGTCCATTCGATTATCTTTACAGCAAAAGACCCCATGGATTTGACAGCAGAGACATGACAATCTTTAAAGATGATGACGGTACAGCGTATCTCATTTACTCATCTGATGACAATAGTGAACTTCATATAGGGCCTCTCACAGAAGATTATCTCGACGTGACCAACGTCGTGAGAAGGATTTTCATTGGCTACCACCGGGAAGCGCCAGCTTTGTTCAAACACCAGGGAACTTACTATATGGTCACGTCGGGATGCACAGGATGGGCCCCGAATGAGGCACTGGCACACGCATCAGAGTCGATGTTGGGTCCATGGGAGACGATGGGAAATCCATGTATAGGAGGAAACAAGATGTTTCGACTAGCTACATTCTTTGCCCAGAGCACATTTGTTCTTCCCCTACCTTCACATCCTAGCTTGTTCATCTTCATGGCAGACCGATGGAACCCCGCAGATCTTAGAGACTCAAGATACGTTTGGTTGCCGTTGATGGTCGGAGGACTTGTCGATGAACCACTCGACTACAATTTCGGGTTCCCTTTGTGGCCAAGAGTGTCCATATATTGGCATAGAAAGTGGAGGCTTCCTCAGGGCAGGAGTCTGTCAAAATGACACATTTTCTTTGATCTCACCCTCAGATGTACCATACGAGAAAGATCTTTGGTTGATTCTTTCACAGTGTCCCATAGTTTTCCTTAACAATTGAATAATGTTTACTAGTTCATATTATAGGCACAGAATTTTCAATCCTTGAATGAATTTATACCCCTAATGGTTAATTAATAATGATGATATTAGGTTATAAATTTCAAT
mRNA sequence
CCAAATTTATGGGGGCGCAATTTTTGGTAATAAAAGCCCTAGGTCATACGCTCAAGAACAGCAGTGGAATTTAGCTGTAATTCAGAAGTCCTCTCTTCAGACATTTCTTACAAATCACTGAAGGACACACGGGATCTTTGGTCACTTGTCGGAGTCGCTATCCAAATCCATCGACATTCGAACCAAAATCCAACATCCCAACTTCTTTTCCTCTTCCCTTTCTCCTTCTGCAACCTGTTCAATCATCGGAATATTGCCGGAAAAACTCTGCATTCCGTTTTGGATTGCCTGAATTACTGACACTTGCGGCTCTTTTCTGATTTCAAACGATCGTCTCCGTCGGAAACCCATGTCTCGGAATTTTGCCGGATTTTCGGTTTGTGGGTTTTCAGGTTTCATTTTCTTAAATGCTGCATTATATTGGAGATAACAAGGAAGAGGAAATGAAGATGAGGAACAAATACAGGAAATCAACCACTTTACGTTGCAATGCAGGGAGTAGATGTTTCATATCTGTGATAATAGGGAGTCTAGTGGGGTGTATTCTTATACTACATATATTTTCTCCTATAAGCCGCAAGGATGAGATAGTTCGGGGCATCGAACTTCAAACAAGTCACCACCTTCGCTTCCGTGAACTTGAAGAGGTAGATGAGGAAAACATTCAAATTCCCCCTCCAAGGGGTAAGAGATCCCCACGTGCAGCGAAGCGAAGACCAAAGAAAACAACCACGCTAATTGATGAATTTCTTGATGAAGATTCACAGATTAGACACAAGTTCTTTCCTGATCATAAAACTTCCGTTGATCCAATGAAAACAGGAAATGATAGTATGTACTATTATCCAGGGAGAGTTTGGCTGGATACTGAAGGGAATCCCATTCAAGCTCATGGAGGTGGAATTTTGTTCGATGAAAGATCTGAATCATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCACGCTCACAAAAAGGGAGCTGCACGGGTCGACATTATAGGAGTTGGTTGCTACTCCTCCAAAGACCTGTGGACATGGAAAAATGAAGGCATTGTTTTGACAGCGGTAGAAACAAATGAGACCCATGATCTTCACAAATCCAATGTACTCGAGAGGCCTAAAGTTATCTACAATTCGAGGACGGGAAAATACGTAATGTGGATGCATATAGACGATGCGAATTATACAAAAGCTTCCGTGGGTGTTGCCATCAGCGATTACCCCACCGGTCCATTCGATTATCTTTACAGCAAAAGACCCCATGGATTTGACAGCAGAGACATGACAATCTTTAAAGATGATGACGGTACAGCGTATCTCATTTACTCATCTGATGACAATAGTGAACTTCATATAGGGCCTCTCACAGAAGATTATCTCGACGTGACCAACGTCGTGAGAAGGATTTTCATTGGCTACCACCGGGAAGCGCCAGCTTTGTTCAAACACCAGGGAACTTACTATATGGTCACGTCGGGATGCACAGGATGGGCCCCGAATGAGGCACTGGCACACGCATCAGAGTCGATGTTGGGTCCATGGGAGACGATGGGAAATCCATGTATAGGAGGAAACAAGATGTTTCGACTAGCTACATTCTTTGCCCAGAGCACATTTGTTCTTCCCCTACCTTCACATCCTAGCTTGTTCATCTTCATGGCAGACCGATGGAACCCCGCAGATCTTAGAGACTCAAGATACGTTTGGTTGCCGTTGATGGTCGGAGGACTTGTCGATGAACCACTCGACTACAATTTCGGGTTCCCTTTGTGGCCAAGAGTGTCCATATATTGGCATAGAAAGTGGAGGCTTCCTCAGGGCAGGAGTCTGTCAAAATGACACATTTTCTTTGATCTCACCCTCAGATGTACCATACGAGAAAGATCTTTGGTTGATTCTTTCACAGTGTCCCATAGTTTTCCTTAACAATTGAATAATGTTTACTAGTTCATATTATAGGCACAGAATTTTCAATCCTTGAATGAATTTATACCCCTAATGGTTAATTAATAATGATGATATTAGGTTATAAATTTCAAT
Coding sequence (CDS)
ATGCTGCATTATATTGGAGATAACAAGGAAGAGGAAATGAAGATGAGGAACAAATACAGGAAATCAACCACTTTACGTTGCAATGCAGGGAGTAGATGTTTCATATCTGTGATAATAGGGAGTCTAGTGGGGTGTATTCTTATACTACATATATTTTCTCCTATAAGCCGCAAGGATGAGATAGTTCGGGGCATCGAACTTCAAACAAGTCACCACCTTCGCTTCCGTGAACTTGAAGAGGTAGATGAGGAAAACATTCAAATTCCCCCTCCAAGGGGTAAGAGATCCCCACGTGCAGCGAAGCGAAGACCAAAGAAAACAACCACGCTAATTGATGAATTTCTTGATGAAGATTCACAGATTAGACACAAGTTCTTTCCTGATCATAAAACTTCCGTTGATCCAATGAAAACAGGAAATGATAGTATGTACTATTATCCAGGGAGAGTTTGGCTGGATACTGAAGGGAATCCCATTCAAGCTCATGGAGGTGGAATTTTGTTCGATGAAAGATCTGAATCATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCACGCTCACAAAAAGGGAGCTGCACGGGTCGACATTATAGGAGTTGGTTGCTACTCCTCCAAAGACCTGTGGACATGGAAAAATGAAGGCATTGTTTTGACAGCGGTAGAAACAAATGAGACCCATGATCTTCACAAATCCAATGTACTCGAGAGGCCTAAAGTTATCTACAATTCGAGGACGGGAAAATACGTAATGTGGATGCATATAGACGATGCGAATTATACAAAAGCTTCCGTGGGTGTTGCCATCAGCGATTACCCCACCGGTCCATTCGATTATCTTTACAGCAAAAGACCCCATGGATTTGACAGCAGAGACATGACAATCTTTAAAGATGATGACGGTACAGCGTATCTCATTTACTCATCTGATGACAATAGTGAACTTCATATAGGGCCTCTCACAGAAGATTATCTCGACGTGACCAACGTCGTGAGAAGGATTTTCATTGGCTACCACCGGGAAGCGCCAGCTTTGTTCAAACACCAGGGAACTTACTATATGGTCACGTCGGGATGCACAGGATGGGCCCCGAATGAGGCACTGGCACACGCATCAGAGTCGATGTTGGGTCCATGGGAGACGATGGGAAATCCATGTATAGGAGGAAACAAGATGTTTCGACTAGCTACATTCTTTGCCCAGAGCACATTTGTTCTTCCCCTACCTTCACATCCTAGCTTGTTCATCTTCATGGCAGACCGATGGAACCCCGCAGATCTTAGAGACTCAAGATACGTTTGGTTGCCGTTGATGGTCGGAGGACTTGTCGATGAACCACTCGACTACAATTTCGGGTTCCCTTTGTGGCCAAGAGTGTCCATATATTGGCATAGAAAGTGGAGGCTTCCTCAGGGCAGGAGTCTGTCAAAATGA
Protein sequence
MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDEIVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
Homology
BLAST of MC02g1116 vs. NCBI nr
Match:
XP_022149504.1 (uncharacterized protein LOC111017920 [Momordica charantia])
HSP 1 Score: 1011 bits (2613), Expect = 0.0
Identity = 479/479 (100.00%), Postives = 479/479 (100.00%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE
Sbjct: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ
Sbjct: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE
Sbjct: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG
Sbjct: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM
Sbjct: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
Sbjct: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
BLAST of MC02g1116 vs. NCBI nr
Match:
XP_022927964.1 (uncharacterized protein LOC111434812 [Cucurbita moschata])
HSP 1 Score: 914 bits (2363), Expect = 0.0
Identity = 421/474 (88.82%), Postives = 455/474 (95.99%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRKDE
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPDHKTSVDPM G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSG
Sbjct: 301 DDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVLPLPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474
ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 473
BLAST of MC02g1116 vs. NCBI nr
Match:
XP_023553112.1 (uncharacterized protein LOC111810610 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 913 bits (2359), Expect = 0.0
Identity = 420/474 (88.61%), Postives = 454/474 (95.78%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHY+GD K+++M MRN+YRKS LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRKDE
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61 IGRGIQLRTSRHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPDHKTSVDPM G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSG
Sbjct: 301 DDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVLPLPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474
ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 473
BLAST of MC02g1116 vs. NCBI nr
Match:
XP_022974783.1 (uncharacterized protein LOC111473520 [Cucurbita maxima])
HSP 1 Score: 904 bits (2336), Expect = 0.0
Identity = 419/474 (88.40%), Postives = 451/474 (95.15%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRK E
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61 IGRGIQLRTSRHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPDHKTSVDPM G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYL YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFK QGTYYM+TSG
Sbjct: 301 DDGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVL LPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474
ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 473
BLAST of MC02g1116 vs. NCBI nr
Match:
XP_004148025.3 (uncharacterized protein LOC101203100 [Cucumis sativus])
HSP 1 Score: 900 bits (2326), Expect = 0.0
Identity = 419/479 (87.47%), Postives = 452/479 (94.36%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
ML+Y+GD K+E MKMRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+L+++S IS DE
Sbjct: 1 MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I +GI L+TSHHL F ELEEV+EENIQIPPPR KRSPRA KRRPKKTTTLIDEFLDEDSQ
Sbjct: 61 IGQGIHLRTSHHLHFPELEEVEEENIQIPPPR-KRSPRATKRRPKKTTTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPD K S+DPM TGNDSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDKKASIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTA ET+ETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRTGKYVMWMHIDD NYTKASVGVAISDYPTGPFDYLYSK+PHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTGKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYLIYSS+DNSELH+G L++DYLDVTNV RR+ IG HREAPALFKHQGTYYMVTSG
Sbjct: 301 DDGTAYLIYSSEDNSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEAL HA+ES++GPWETMGNPCIGGNKMFRLATFF+QSTFVLPLPS+P+LFIFM
Sbjct: 361 CTGWAPNEALTHAAESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLW RVSIYWHRKWRLPQG + K
Sbjct: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWRLPQGWNSLK 478
BLAST of MC02g1116 vs. ExPASy TrEMBL
Match:
A0A6J1D780 (uncharacterized protein LOC111017920 OS=Momordica charantia OX=3673 GN=LOC111017920 PE=3 SV=1)
HSP 1 Score: 1011 bits (2613), Expect = 0.0
Identity = 479/479 (100.00%), Postives = 479/479 (100.00%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE
Sbjct: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ
Sbjct: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE
Sbjct: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG
Sbjct: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM
Sbjct: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
Sbjct: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
BLAST of MC02g1116 vs. ExPASy TrEMBL
Match:
A0A6J1EJG7 (uncharacterized protein LOC111434812 OS=Cucurbita moschata OX=3662 GN=LOC111434812 PE=3 SV=1)
HSP 1 Score: 914 bits (2363), Expect = 0.0
Identity = 421/474 (88.82%), Postives = 455/474 (95.99%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRKDE
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPDHKTSVDPM G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSG
Sbjct: 301 DDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVLPLPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474
ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 473
BLAST of MC02g1116 vs. ExPASy TrEMBL
Match:
A0A6J1IHC3 (uncharacterized protein LOC111473520 OS=Cucurbita maxima OX=3661 GN=LOC111473520 PE=3 SV=1)
HSP 1 Score: 904 bits (2336), Expect = 0.0
Identity = 419/474 (88.40%), Postives = 451/474 (95.15%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRK E
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61 IGRGIQLRTSRHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPDHKTSVDPM G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYL YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFK QGTYYM+TSG
Sbjct: 301 DDGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVL LPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474
ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 473
BLAST of MC02g1116 vs. ExPASy TrEMBL
Match:
A0A0A0LPY3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G285310 PE=3 SV=1)
HSP 1 Score: 896 bits (2315), Expect = 0.0
Identity = 417/479 (87.06%), Postives = 451/479 (94.15%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
ML+Y+GD K+E MKMRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+L+++S IS DE
Sbjct: 1 MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADE 60
Query: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
I +GI L+TSHHL F ELEEV+EENIQIPPPR KRSPRA KRRPKKTTTLIDEFLDEDSQ
Sbjct: 61 IGQGIHLRTSHHLHFPELEEVEEENIQIPPPR-KRSPRATKRRPKKTTTLIDEFLDEDSQ 120
Query: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
+RHKFFPD K S+DPM TGNDSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 LRHKFFPDKKASIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTA ET+ETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERP 240
Query: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRTGKYVMWMHIDD NYTKASVGVAISDYPTGPFDYLYSK+PHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTGKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
DDGTAYLIYSS+DNSELH+G L++DYLDVTNV RR+ IG HREAPALFKHQGTYYMVTSG
Sbjct: 301 DDGTAYLIYSSEDNSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSG 360
Query: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
CTGWAPNEAL HA+ES++GPWETMGNPCIGGNKMFRLATFF+QSTFVLPLPS+P+LFIFM
Sbjct: 361 CTGWAPNEALTHAAESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFM 420
Query: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
ADRWNPADLRDSRYVWLPLMVGGLVD+PLDYNF FPLW RVSIYWHRKWRLPQG + K
Sbjct: 421 ADRWNPADLRDSRYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 478
BLAST of MC02g1116 vs. ExPASy TrEMBL
Match:
A0A6J1L3W6 (uncharacterized protein LOC111499658 OS=Cucurbita maxima OX=3661 GN=LOC111499658 PE=3 SV=1)
HSP 1 Score: 893 bits (2307), Expect = 0.0
Identity = 417/480 (86.88%), Postives = 452/480 (94.17%), Query Frame = 0
Query: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
MLHY GD K+EEMKM+NKYRKSTTLRC+AGS +SV+IGSL+GCIL+L ++SPISRKD
Sbjct: 1 MLHYSGD-KKEEMKMKNKYRKSTTLRCDAGSIRLLSVVIGSLMGCILLLQLYSPISRKDT 60
Query: 61 IV-RGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDS 120
+ R I+L+TSH L FRELEEV+EE IQIPPPRGKRSPRAAKRRPK+TTTLIDEFLDEDS
Sbjct: 61 FIGRDIQLRTSHRLYFRELEEVEEEKIQIPPPRGKRSPRAAKRRPKRTTTLIDEFLDEDS 120
Query: 121 QIRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYG 180
+RHKFFPD KTS+DP TGNDSM+YYPGRVWLDTEGNPIQAHGGG+L+DE SE++YWYG
Sbjct: 121 PLRHKFFPDRKTSIDPTITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLYDEISETFYWYG 180
Query: 181 EYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLER 240
EYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTA ETNETHDLHKSNVLER
Sbjct: 181 EYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLER 240
Query: 241 PKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFK 300
PKVIYNS+TGKYVMWMH+DDANYTKASVG+AISDYPTGPFDYLYSKRPHGFDSRDMTIFK
Sbjct: 241 PKVIYNSKTGKYVMWMHVDDANYTKASVGIAISDYPTGPFDYLYSKRPHGFDSRDMTIFK 300
Query: 301 DDDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTS 360
DDDGTAYL+YSS DNSELHIGPLTEDYLDVTNV RRI IG HREAPALFKHQGTYYM+TS
Sbjct: 301 DDDGTAYLVYSSVDNSELHIGPLTEDYLDVTNVARRILIGQHREAPALFKHQGTYYMITS 360
Query: 361 GCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIF 420
GCTGWAPNEAL HA+ES++GPWETM NPCIGGNKMFRLATFFAQSTFVLPLPS+PSLFIF
Sbjct: 361 GCTGWAPNEALTHAAESIMGPWETMENPCIGGNKMFRLATFFAQSTFVLPLPSYPSLFIF 420
Query: 421 MADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK 479
MADRWNPA+LRDSRY+WLPLMVGGLVDEPLDYNFGFPLW RVSIYWHRKW+LP+G +L K
Sbjct: 421 MADRWNPANLRDSRYIWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWKLPKGWNLLK 479
BLAST of MC02g1116 vs. TAIR 10
Match:
AT3G49880.1 (glycosyl hydrolase family protein 43 )
HSP 1 Score: 719.2 bits (1855), Expect = 2.3e-207
Identity = 333/465 (71.61%), Postives = 389/465 (83.66%), Query Frame = 0
Query: 15 MRNKY-RKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDEIVRGIELQ-TSHH 74
M+NK+ +K+T LRC+ ++ ++VGC+ ++H+ SR + + Q HH
Sbjct: 4 MKNKHNKKATFLRCSPFG------LVSTVVGCVFMIHLTMLYSRSYSVDLDLSPQLLIHH 63
Query: 75 LRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTS 134
RELE V+EENI +PPPR KRSPRA KR+PK TTL++EFLDE+SQIRH FFPD K++
Sbjct: 64 PIVRELERVEEENIHMPPPR-KRSPRAIKRKPKTPTTLVEEFLDENSQIRHLFFPDMKSA 123
Query: 135 VDPMK--TGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAH 194
P K T + S YY+PGR+W DTEGNPIQAHGGGILFD+ S+ YYWYGEYKDGPTY +H
Sbjct: 124 FGPTKEDTNDTSHYYFPGRIWTDTEGNPIQAHGGGILFDDISKVYYWYGEYKDGPTYLSH 183
Query: 195 KKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGK 254
KKGAARVDIIGVGCYSSKDLWTWKNEG+VL A ET+ETHDLHKSNVLERPKVIYNS TGK
Sbjct: 184 KKGAARVDIIGVGCYSSKDLWTWKNEGVVLAAEETDETHDLHKSNVLERPKVIYNSDTGK 243
Query: 255 YVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLIYS 314
YVMWMHIDDANYTKASVGVAISD PTGPFDYLYS+ PHGFDSRDMT++KDDD AYLIYS
Sbjct: 244 YVMWMHIDDANYTKASVGVAISDNPTGPFDYLYSRSPHGFDSRDMTVYKDDDNVAYLIYS 303
Query: 315 SDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEAL 374
S+DNS LHIGPLTE+YLDV V++RI +G HREAPA+FKHQ TYYM+TSGCTGWAPNEAL
Sbjct: 304 SEDNSVLHIGPLTENYLDVKPVMKRIMVGQHREAPAIFKHQNTYYMITSGCTGWAPNEAL 363
Query: 375 AHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFMADRWNPADLR 434
AHA+ES++GPWET+GNPC+GGN +FR TFFAQSTFV+PLP P +FIFMADRWNPADLR
Sbjct: 364 AHAAESIMGPWETLGNPCVGGNSIFRSTTFFAQSTFVIPLPGVPGVFIFMADRWNPADLR 423
Query: 435 DSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGR 476
DSRY+WLPL+VGG D PL+Y+FGFP+W RVS+YWHR+WRLP R
Sbjct: 424 DSRYLWLPLIVGGPADRPLEYSFGFPMWSRVSVYWHRQWRLPSAR 461
BLAST of MC02g1116 vs. TAIR 10
Match:
AT5G67540.2 (Arabinanase/levansucrase/invertase )
HSP 1 Score: 708.4 bits (1827), Expect = 4.0e-204
Identity = 336/472 (71.19%), Postives = 388/472 (82.20%), Query Frame = 0
Query: 13 MKMRNKY-RKSTTLRCN--AGSRCFISVIIGSLVGCILILHIFSPISRKD----EIVRGI 72
MK NKY +KST+L CN G R + I+ ++VG L+ H+ S SRKD + V
Sbjct: 1 MKKNNKYNKKSTSLHCNDAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSD 60
Query: 73 ELQTSHHLR---FRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQIR 132
+LQ HHL REL V+EE +++PPPR KRSPR +KRR +K L++EFLD+ S IR
Sbjct: 61 QLQVVHHLAHPIVRELIRVEEEVLRMPPPR-KRSPRTSKRRSRKPIPLVEEFLDDKSPIR 120
Query: 133 HKFFPDHKTSV-DPMK-TGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 192
H FFP KT+ P K GN++ YY+PG++W+DT+GNPIQAHGGGIL D +S +YYWYGE
Sbjct: 121 HLFFPGIKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGE 180
Query: 193 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 252
YKDGPTYHAHKKG ARVDIIGVGCYSSKDLWTWKNEGIVL A ETN+THDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERP 240
Query: 253 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 312
KVIYN +T KYVMWMHIDDANYTKASVGVAIS+ PTGPF+YLYSKRPHGFDSRDMT+FKD
Sbjct: 241 KVIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKD 300
Query: 313 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 372
DDG AYLIYSS+ NS LHIGPLTEDYLDVT V++R+ +G HREAPA+FKHQ YYMVTS
Sbjct: 301 DDGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSW 360
Query: 373 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 432
CTGWAPNEALAHA+ES++GPWE +GNPCIGGNK+FRL TFFAQST+V+PLP P FIFM
Sbjct: 361 CTGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFM 420
Query: 433 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLP 473
ADRWNPADLRDSRYVWLPL++GG D+PL++NFGFP W RVSIYWH KWRLP
Sbjct: 421 ADRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 471
BLAST of MC02g1116 vs. TAIR 10
Match:
AT5G67540.1 (Arabinanase/levansucrase/invertase )
HSP 1 Score: 696.0 bits (1795), Expect = 2.1e-200
Identity = 330/464 (71.12%), Postives = 380/464 (81.90%), Query Frame = 0
Query: 19 YRKSTTLRCNAGS-RCFISVIIGSLVGCILILHIFSPISRKD----EIVRGIELQTSHHL 78
Y S LR AG R + I+ ++VG L+ H+ S SRKD + V +LQ HHL
Sbjct: 4 YSSSAGLRGFAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHL 63
Query: 79 R---FRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQIRHKFFPDHK 138
REL V+EE +++PPPR KRSPR +KRR +K L++EFLD+ S IRH FFP K
Sbjct: 64 AHPIVRELIRVEEEVLRMPPPR-KRSPRTSKRRSRKPIPLVEEFLDDKSPIRHLFFPGIK 123
Query: 139 TSV-DPMK-TGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYH 198
T+ P K GN++ YY+PG++W+DT+GNPIQAHGGGIL D +S +YYWYGEYKDGPTYH
Sbjct: 124 TAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYH 183
Query: 199 AHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRT 258
AHKKG ARVDIIGVGCYSSKDLWTWKNEGIVL A ETN+THDLHKSNVLERPKVIYN +T
Sbjct: 184 AHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKT 243
Query: 259 GKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLI 318
KYVMWMHIDDANYTKASVGVAIS+ PTGPF+YLYSKRPHGFDSRDMT+FKDDDG AYLI
Sbjct: 244 EKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLI 303
Query: 319 YSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNE 378
YSS+ NS LHIGPLTEDYLDVT V++R+ +G HREAPA+FKHQ YYMVTS CTGWAPNE
Sbjct: 304 YSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNE 363
Query: 379 ALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFMADRWNPAD 438
ALAHA+ES++GPWE +GNPCIGGNK+FRL TFFAQST+V+PLP P FIFMADRWNPAD
Sbjct: 364 ALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPAD 423
Query: 439 LRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLP 473
LRDSRYVWLPL++GG D+PL++NFGFP W RVSIYWH KWRLP
Sbjct: 424 LRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 466
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D780 | 0.0 | 100.00 | uncharacterized protein LOC111017920 OS=Momordica charantia OX=3673 GN=LOC111017... | [more] |
A0A6J1EJG7 | 0.0 | 88.82 | uncharacterized protein LOC111434812 OS=Cucurbita moschata OX=3662 GN=LOC1114348... | [more] |
A0A6J1IHC3 | 0.0 | 88.40 | uncharacterized protein LOC111473520 OS=Cucurbita maxima OX=3661 GN=LOC111473520... | [more] |
A0A0A0LPY3 | 0.0 | 87.06 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G285310 PE=3 SV=1 | [more] |
A0A6J1L3W6 | 0.0 | 86.88 | uncharacterized protein LOC111499658 OS=Cucurbita maxima OX=3661 GN=LOC111499658... | [more] |