Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCCGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTAGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCACGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGTAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTAAAATCTATACTCTGCCTTTCTGAGCCTTCATTTTCTGTAATGATAGTCTAATACTTCCATGGCTAAAAATATTGGTTAAATTATAAACACATCCTAAACCAAGAAGTTGGTATCAATAATGCACCCAATTTTTTTGAAGTGTAATTTTAACTCTTTAAGTTTGAAGTTTGTTTTAAAACAGCCCTCGTAGGCTTTTGCAGTTAAATCTGTAGATGGAAAAGTTTGTATATTCTTATGTGAACAAATCTAAACACTTACATGGAAAAAATAAAGACACATGGTGCATCTGTGAACCTTGTATGTGGCCTCCTTTCAACCACCACCATTTTTTGTTCTTTCGTTTATGATTTTTTTTGCTAATCCTTCACCTTAAATTTACTTCCGGGTGTTTAGATTTGTTCACTTAAGTAATCCATGAGTAGAAAAAAACGTTTCTATCTATAGATTTAACGAAAAAAGTCTTCAAGGACTGTAATTAAAAAGTTCAACCTTCAATGGTTAAAACGAAACTTTTCGAAGTTTAGGGGTATAGCTGAACCCAACCCCATAGTTCAGGAATAATTTTGATAATTTAATCTAAAAATATTCCTATTGCATAAATTGTTGGCATGCATCTCTATGAATTTGTATGGAAACAAAAGATTTTGATTAGTACTCCAGTCTTTTTCCCTCCTTTTCCATTATTAACAATAGTTTCTTCTTTATTATGGCAATTTCCAATCCCAATTATTGCTCCATGGTGGTTTTAATGACTTTATGTCTATATCTTTTTAAATATGAAATATCAGGCTCCATTGATCGTTATATGTTGAATTCCTTCTAGTTCATCTTTTTGTACCTTATTCCACTCTTGATAAGGTCTACGTCCAACTCATGAATCTTTAGTTTTTGAGATGGTATGTGATGCTTGGGAGGTTACTTTATGATCTTGGGAGTATGACAGTGAGAAGAAATTTTAAGGCAGAGGAAGTGAGTGTGCTTTGTGGCCTTTTGGAGCTGATCGAGAGTATGCTGATTTTGAGATTAGAGATCAAGAAAATGGAGTTTAGATAAAACAGGATTTTCTTGGTTAAACGTTCAGTTAAAGAGTCAGTTGATGTTCCCTCTCTTTCTAAAGTTTCGGTAAATGATTTGTGGAAATCAAGAGTCTAATAAATGTGATATTCTAATCTGTTTGGTTTTGGGAAAGCTACAACACAATACATCAAGAAGTTACGATAAAATAATTATCAGACAATGAACCTTGCTTGGGGCAAGGAAAGAAGATGAAATTAGCATTAACCCGACTGCTCAACTTCTTTTATTAAACCCTTGCAGATTAGGACAAGTCCTTGGTGATTAGAGAAACACCAATCTAATTTTTTGAATCAAACAAATGGGGCCACAAGATTGAAGTACCCTTCAGCTAGCCTAATTTTGAAATTCAAGGTCCCAAAAAAGGTGAAAAGTTCTTTTATGGTAGTTAGCCTAAAGAAGTTTGAAGAAGCTTTAAAAGAATTTTCAACTTTCAAAAATAGATGATCTCTCCTTCTATGTGTTGCTTTGCTCACAAGAGGAGGAATCCCTTGATCACATGTTCTTGCAATGTTCTTTTGTGGCTAAAGGGCAGCAGCCTATCTTCAATGTGTTTGGGGTGGATTACTACTTCCCTAAGAACATTAATGATTGGCTTATGGAAGTGCTCAACGGTAGGGTGTTTGGCGGCAAGGGGAAGTCCTTTGGAGGTGTGATATGCCGTCCTTCCTTCAGAGCCTTTCGAAGGAAAGGAATAGCCAGGTTTTTAAAAGTAAGTATTCTTCTTGAGTCCTTTTGGCGTTTGGTGCAATACAAGGCCTCTCTGTGGTGTACAGTTCACACTAAATTCTCTTGCAATTAATGCCTTTTGATGATTCAATTGGAAGGCTCTCATTTTTTAGTTTTGTGGGGAGGGGTGACCACTGTGGGGAGTTGTTGTTTGGTTCTTTTGATTAATACGCGTCACTGCTTTTTATATATAAAATCATGATCTGCCAGATATTTTGGGAAACTTGCAACTAGGAATAGACAGCATGATGTTACTGGTCTAGATGATAGTTATAGCGTGAGGAACACTACCGAGGAGTTTAGAATTGATTTAGAGTTTTTTGATGCACCTCCTTGAAATTGGAATGTTAATTAGTCATGGCCGACAAGTTTATGGTCTTCTGAGCCCCAGAGGAAGGAATGCTGCTTGAGATGAGTTGGGTGATCTATTTCGTCTGTATGGGCCTTGTTGATGTGTGGTGGGGAGCTTTTATGTAATTTGCTTTGGTACTGAGAAGATCTTGGCTGGCTGGACGATGGAATCAATAGGCTCTTCAATTATTTTATTGAGAGAAATAGCCTCTTCGACTCTCCTCTATTGTACGGAAAGTTTACTTGGTTGAATAGTAGGGCACGTACTAGACTTCATAGGTTACTGATATCTAAAGGGTAGATGAATGTTCATGGGAGTGTGAGACAATTGCTAGGCCCTAGAATAACCTTCGACCGGTGTCCAGAGGTGATGCTTGTGTCCTTTCATATTTGAAAATATTTGGCTGAACCATACCTCCTTATAATCTTCCTTTCCTTCATGTTGAATGCGGAAACCACTGGAAATTAAGAAATTCGGAACGGTATCTTCCATGGGGAAGCTAAGGGTCCTCAAAGGAGTGTTATTGTGGAACAAAGACTCACGTAGGAGTTTCAAAATTCTGGAAATTAAAAAGCTCAAAACAAAGCCCTTTTGGTTAAATGGTTATGCCAATTTCCCCTCGGAGTCGATAGAGGATTATAGCAAGCAAATAGGGTCCTCATCCTTCTGAGTGGATGTCAGATGGGCCAAAGGCACTTACAAAAATATTAGACTTCTTAAATCTTAGAACAGAAATTGACATCCAGTGCAGACCTGAGGAAATAAATCTTAGAACAGGATCTCTAGTATTAGACTTCTTAAATTACCCCAATGTCTGTTTGAAGGTGCGCATCATTGGAAAAGGGACGCCCGTTTCTTTTTGCGTACTGATGAGGCCTACACTTCCTTCCAGCATCAATTTTATTGAAGTAATTTGAATTTAAGTCTCTTCCTTTAAGCCATTGTAACCTTCAGTCCTATAGTAGAAAATGAATTTCATGTTGTTCCTCTAGGAGAAATTTGTAAAAGAAAAAGCTTGAATGATATGATTCCTTCTTTAATGACGCGATTTTTGATGCCTTGAATCATAGGTCACCCCGTACACTTGAACTTAAAAAGAGGTGGTAATTTTTTTTTTATGTAAAGCGGGAAACACAAGGAATTTAAAATTCAAATATGCATTCAAAACCAACATAAAGGTAGTTTGAAGGTAATATCCGTGAGTCAAGTCAAAACGGTTTACCAAAAAAATTCACAAGTTTCGATAAGTAGCATTTAAAATAATAATAAAATGACAAGAAGGAAGACGACTCGATCTAAAGGGAACCCCCAATAGCTACACGGTGTCACAGTCATGCTTGTCCTAGTCGTGTTGCCCATGGCCGCATGCTCATTGCATGCCAGAACCCTTGAGTTGTACCACTTCATTGCTTTCACTGCTTTCATGATGTATAGTTTCTTTATTGTATTTTCTAGGTGTATGATGTTTCAATAGAATCATGTATATGCCTGGTAGACCTGAAATGCCTCTGAATGTAGCATAGGGAGATGTGACTAGTCGTCAAATCTTCGTACTCGGAGTTACATGCTATCTAAGCCATTGTGTTGCCTTTTTCGCTTATAGCCTATAACATTGCTTTATTCATTTCTCACAATTTTGTTTTCAAATTCTCTTTCTAAAATCTCTCTCTGGTGCTGTTTTCTAAAACCTTCTCCTTAAAGCAAGGCCAGAGGCTTGAGTATACGTCGTCAGGAAGAAGGCAACATGATTTCGAGTTGGCGGACTCACTCGGTATTGAGAGTGAGTGCCGCACAATCGCTTAAAAAACGTCCCGTAAGACTGCGATTGTGACAGTTGGTATCCGAGTCGAGTTGGCTCCAAATGTGATTCTGTAAACACGGCTACAACCAAGCAGTTGAACAAGTCCCACATCGATCGACTAGTCGACATCGAAGAACAGATGCAATTCTTGTGAGAAGTTCCTTACAATGTTCAGTACTTGGACGAGCGGGTGAAAGAGCTCGCCGACAAGACCAATATGGTTGATGTAGTTGCAGGTTGATTAGATGAGTTACCCATCCAGGAACATATTTACCGAGTAGACAACCTAAAAGCAAAAGCCACAAAGACCGGTGGCTTCGAGCGAGGAGACAGCTTGACGGGCTCTGCTGCCCTATTTTAAGAGTGAGTTGATGGTCTAGACAGCTCCCAAAAAGCAGTTATGTAGATGGTCTCTGAGATACTTGAAGATGTGAAATTAGCCTTCGACGTGGTCAGGGCAGAAATTGCTAGGAATCACGATTCTCCCCAATGGTATGATATTGTTCACTTTGAGCATAAACTTTCATGGTTTTGCTTTGGGCTTTCCTAAAGGTCTCATTCCAATGGAGATCTATTCCTTGCTTATAAATTCATGATCATTCCCTACATTAGCCAATGTGGGACAACCTCCAAACAATCCTCAACATTCCTCCCCTCGAACAAAGTACAACACAGAGCCTCCCCTGAGGCCTATGGAGCCCTTGAATAACCTCTCCTTAATTGAAGCTCAACTCCTCTGGAGCCCTCAAACAAAGTACACTCTTTGTTCAACTCTTTACCTTTTGACTACATCTTTGAGGCTCATATAAGTTTAGGGCATGGCTCTGATACCATGTTAGGAATCACGACTCTCCACAATTGTGATATTGCCTTTTTTGAGCATAAGCTCTCATGGCTTTGCTTTGGGCTTCCCCAAAAGGCCTCATTCCAATGGAGATCTATTTTTTTGCTTATAAATCCATGATCATTCCCTAAATTAGCAAATGTAGGACAACCTCCCAACAATACTCAACATAAATGACGGATCTAAGCACTAGAGTTAATATCACTGTTAGAACGGTGGGAAATCAAACCCCTACAAGGGGAGCCGTTCAGTTCAACAAGATCAAGGTTCCGCAGCCCAAACCCTTCTATGGGGTTCGAGATGCCGAGGTCCTAGCGACCTTGATCAATACTTTCGGGCGATGAACACAACAATAGAGGAAGCAAAGGTCACCTTGGCCACCATACATCAAGCCGAGGATGCGAAATTTTGATGGACATCGAAGTACATTGATATTCAATAGGGCCGGTGCACAATAGACACATGGAAAAGACGGAAACAAGAACTTCAATCTCAATTCTTCCCAGAGAATGTTGAAATCTTGGCTAGAAGGAAATTGCGAGATCTCAAGCACACAAGAAACATCCGAGAGTATGTCAATCAGTTCTCAGTGTTCATGTTGGACATCTGAGATATGTTCGAGAAAGACAAGATCTTCCCGTTCATAGAAGGACTAAAGCCATGAGCGAAGTCCAAACTATATGGGCAGAGAATACAAGACCTCTCCACGACCTATGCTACAGCTGAACGATTGTTTGATCTAAGTAACGAACAATCCCAAGATGTAAGGCGAAGCCAAACCTCCTCGAGTGGATTGTTTGATATGAGCTGAACGATTGTTTGATTTGAATCTACCATTTTACTCTTGGCCAATTTATGATTGACCCATGGTTCTACATACACGAGACCTTTTTCGATCGGCTCTTTTGACTCCCCTACTTTCTTTTGTAGGGCAGATAGAAACTTCAGGGCCCCATACGAGGGTTTGCTCTGTTCTTCGATGACGACTTTTCAGTTTTCCCACCTTCTGACTTTGCCTCTAAGTCTGCGTCCAAAGCTGCTTGGAAGGCTTGTAGTGTCGCTTTCTTGGGGCACTCGTATACTCTATAGTTCTCTCTACATAAGAAGCAAGAGGGAGGTAGCCGATTGAAGTAATTTTGATGGTAGGGCCCTCGACCTAACACTAGTTTTAGGAAAATCACCCTAGGTTGGTCTAGTGGTCCTCAAGGGACATGTAAATAATAAAAGACAAAATAAACTAGCTCAAACCATAGTAGCCGCTTACACAAGATTTAATATCCTATAAGTACCTTGACAACCAAATGTAGTTGGTAATAATCCTACAGGTACCTTGGTTGTGAGAATACTCAAAGTGCTTGTAAGCTTCCCATACACTAATGATATCAGAGATAAAGGAGTTCTAGGGAATCCGTGTTATTGAGAGGACTCTAAGCGCACAATCCTAAAATACATCCTTAGCCATCTTTGGAAGAATCTTATACCTCGTAAAATTTACCTTCCACAATCCTAAGATGTTAAACCTGTGTGCCCCGGTATTCCCTAACATCCCTTGTGTTATGGTCAAGCTATTTTCTCTTCCCTTCTAAGAGTGAAGACTATCCCCACAAACTAACATAAGTCCTTGTAGCATGCTTTATTCTCACTCACATGCAACTAAATTGCTCCAAGCAAAATACGTTTAACTGGAGTTCATATGATTGAGCCACCGAAAATGAATGTACACCTTAATGGGTTAGATAGTGACTTTCAATTCTTTTAATTCTTCATTAGTTATCCTATCATTAGGATCGCTCTCATTCATTATTCTAATCTGTACTTGAAAGATTTTCATTCTCTGTTTCCATCGCTTCTATCCTTTCAATTCTTTTACCAAGTTTTTAAAATTATTCCATGTATGTTACTTTGCAAGTGTGTGCTGTGTAACATCAAACCTACATTTTCAATGAAGTTTGTTCTAATTCAACAACACAATCAAGATTATTCAACATGTGGTAATCCTTTGTTGTGTGTCATGGCAAAAAGGATAGAAGTGATGTTTGGTAAATATTGGAAAGATTTTCTCTTGTGTGTTGCTGTTATGCTAGAATCTCATTATACGCTTGAATTCGTGATGTTTATGCTTACAGATTTATATAATGAAAGATTGCAAACCAAATTAGGAAGAAAAAAAGTTAAAGATTAGTTTTCTCGACACTTAAATATTAAATATTAGTTCTTGCTGTCCTCTTGCTGTCCTCCTTCGACCGCAAGAACAATAGAACAGCTCCTTCTGGGCCATCTTTTCTAAGAATAATGCGAAGATTTTGTCGAAATATTCACTCAAAGGTGTTTTCTGGTATACATGGCTCGAAAGGAACCCCGAAAATTTTAAAAATTCAAGTAGAAACTTCAACCAAAGTTGGAAATGCATGTTAGAACGTTTTCTTGAAGTTGAACTTCTGTTCTTATGTTCTTGCCTATTTTGGGTTGATTGCTTATCTTCAGCTTTTGAACCATCTATTAGGCTGTGTTCTAGGATCCTTCCTGTGATTGAGATGATTGATATCAGGAGTTTGAAGGGCCGTCAGAATTGTTGATACTGAGTTGATCGGAGGATGTAATATTTTTCATCTAATGATCTTAAAACCAACGATTGTTCTGTTGCGACTTTCTTTCAGATCGGATAATCTTGAATCCTGCGCCACTTGCTTTAAATTTATCACATTTTGTCACCATCTTTTGTTCATCATGTGACTTTCAAGAGCAACCAATTAACCTTCAAAACTACACTAACCTGTTGGCATTCTTATGATTAGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTATAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTTGATCAACCCCTGGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGATACAAAATGTTCGTCGTCTCCCACTCGATGTACCATACAAGAAAGTTGTTTGTTTGATTCTTTGGGTGTCCCATAGTTTTCCTTACCAACTCAATAATGTTAACTAGTCCGTATTAAAGGCACATAATTAACAACCCTTGAATGAACTTATAAATCATTTCTCCTCACAACACACTATGCCACCTTTCCAATGTTCAATACATTTAGGTCTAAGGATTTCTAAAAAGTAATATTAGTTTTGATGTGCCA
mRNA sequence
ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCCGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTAGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCACGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGTAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTATAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTTGATCAACCCCTGGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGATACAAAATGTTCGTCGTCTCCCACTCGATGTACCATACAAGAAAGTTGTTTGTTTGATTCTTTGGGTGTCCCATAGTTTTCCTTACCAACTCAATAATGTTAACTAGTCCGTATTAAAGGCACATAATTAACAACCCTTGAATGAACTTATAAATCATTTCTCCTCACAACACACTATGCCACCTTTCCAATGTTCAATACATTTAGGTCTAAGGATTTCTAAAAAGTAATATTAGTTTTGATGTGCCA
Coding sequence (CDS)
ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCCGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTAGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCACGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGTAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTATAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTTGATCAACCCCTGGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA
Protein sequence
MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Homology
BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match:
A0A6J1EJG7 (uncharacterized protein LOC111434812 OS=Cucurbita moschata OX=3662 GN=LOC111434812 PE=3 SV=1)
HSP 1 Score: 1018.8 bits (2633), Expect = 7.2e-294
Identity = 478/478 (100.00%), Postives = 478/478 (100.00%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC
Sbjct: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478
BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match:
A0A6J1IHC3 (uncharacterized protein LOC111473520 OS=Cucurbita maxima OX=3661 GN=LOC111473520 PE=3 SV=1)
HSP 1 Score: 1003.8 bits (2594), Expect = 2.4e-289
Identity = 472/478 (98.74%), Postives = 473/478 (98.95%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRK E
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IGRGIQLRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61 IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYL YSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFK QGTYYMITSGC
Sbjct: 301 DGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVL LPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478
BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match:
A0A0A0LPY3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G285310 PE=3 SV=1)
HSP 1 Score: 940.6 bits (2430), Expect = 2.5e-270
Identity = 434/478 (90.79%), Postives = 458/478 (95.82%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
ML+YLGDKKD++M MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLL+L S +S DE
Sbjct: 1 MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IG+GI LRTS HLHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQL
Sbjct: 61 IGQGIHLRTSHHLHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPD K S+DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDKKASIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRT KYVMWMHIDD NYTKASVGVA+SDYPTGPFDYLYSK+PHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTGKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYL+YSSEDNSELH+G LS+DYLDVTNVA+R+L+GQHREAPALFKHQGTYYM+TSGC
Sbjct: 301 DGTAYLIYSSEDNSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEAL HA+ESIMGPWET+GNPCIGGNK+FRLATFFSQSTFVLPLPS+P LFIFMA
Sbjct: 361 TGWAPNEALTHAAESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRY+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 DRWNPADLRDSRYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 478
BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match:
A0A1S3C6G0 (uncharacterized protein LOC103497430 OS=Cucumis melo OX=3656 GN=LOC103497430 PE=3 SV=1)
HSP 1 Score: 936.4 bits (2419), Expect = 4.7e-269
Identity = 432/478 (90.38%), Postives = 457/478 (95.61%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
ML+YLGDKKD++M MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLL+L S + DE
Sbjct: 1 MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAIRPADE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IG+ I LRTS HLHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQ+
Sbjct: 61 IGQHIHLRTSHHLHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQI 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPD KTS+DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERS TYYWYGEY
Sbjct: 121 RHKFFPDQKTSIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSGTYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRTRKYVMWMHIDD NYTKASVGVA+SDYPTGPFDYLYSKRPHG DSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKRPHGCDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYL+YSSEDNSELH+G LSEDYLDVTNVA+RIL+GQHREAPALFKHQGTYYM+TSGC
Sbjct: 301 DGTAYLIYSSEDNSELHVGSLSEDYLDVTNVARRILIGQHREAPALFKHQGTYYMVTSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEAL HA+ESIMGPWET+GNPC+GGNK+FRLATFFSQSTFVLP+PS+P LFIFMA
Sbjct: 361 TGWAPNEALTHAAESIMGPWETMGNPCMGGNKMFRLATFFSQSTFVLPVPSYPNLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 DRWNPADLRDSRYLWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWRLPQGWNSLK 478
BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match:
A0A6J1D780 (uncharacterized protein LOC111017920 OS=Momordica charantia OX=3673 GN=LOC111017920 PE=3 SV=1)
HSP 1 Score: 923.3 bits (2385), Expect = 4.1e-265
Identity = 421/474 (88.82%), Postives = 455/474 (95.99%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRKDE
Sbjct: 1 MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61 IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120
Query: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
+RHKFFPDHKTSVDPM G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180
Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERP 240
YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240
Query: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
Query: 301 DDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSG 360
DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSG
Sbjct: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360
Query: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFM 420
CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVLPLPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420
Query: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 474
ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474
BLAST of CmoCh20G010210 vs. NCBI nr
Match:
XP_022927964.1 (uncharacterized protein LOC111434812 [Cucurbita moschata])
HSP 1 Score: 1018.8 bits (2633), Expect = 1.5e-293
Identity = 478/478 (100.00%), Postives = 478/478 (100.00%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC
Sbjct: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478
BLAST of CmoCh20G010210 vs. NCBI nr
Match:
XP_023553112.1 (uncharacterized protein LOC111810610 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1015.0 bits (2623), Expect = 2.1e-292
Identity = 476/478 (99.58%), Postives = 476/478 (99.58%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
MLHYLGDKKDQKMNMRNRYRKS ALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IGRGIQLRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61 IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC
Sbjct: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478
BLAST of CmoCh20G010210 vs. NCBI nr
Match:
XP_022974783.1 (uncharacterized protein LOC111473520 [Cucurbita maxima])
HSP 1 Score: 1003.8 bits (2594), Expect = 4.9e-289
Identity = 472/478 (98.74%), Postives = 473/478 (98.95%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRK E
Sbjct: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IGRGIQLRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61 IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYL YSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFK QGTYYMITSGC
Sbjct: 301 DGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVL LPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478
BLAST of CmoCh20G010210 vs. NCBI nr
Match:
KAG7011053.1 (hypothetical protein SDJN02_27851, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 999.6 bits (2583), Expect = 9.3e-288
Identity = 470/472 (99.58%), Postives = 470/472 (99.58%), Query Frame = 0
Query: 7 DKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQ 66
DKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQ
Sbjct: 14 DKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQ 73
Query: 67 LRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFP 126
LRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLR KFFP
Sbjct: 74 LRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRPKFFP 133
Query: 127 DHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTY 186
DHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTY
Sbjct: 134 DHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTY 193
Query: 187 HAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSR 246
HAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSR
Sbjct: 194 HAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSR 253
Query: 247 TRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYL 306
TRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYL
Sbjct: 254 TRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYL 313
Query: 307 VYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPN 366
VYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPN
Sbjct: 314 VYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPN 373
Query: 367 EALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPA 426
EALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPA
Sbjct: 374 EALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPA 433
Query: 427 DLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 434 DLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 485
BLAST of CmoCh20G010210 vs. NCBI nr
Match:
XP_004148025.3 (uncharacterized protein LOC101203100 [Cucumis sativus])
HSP 1 Score: 942.6 bits (2435), Expect = 1.3e-270
Identity = 434/478 (90.79%), Postives = 459/478 (96.03%), Query Frame = 0
Query: 1 MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
ML+YLGDKKD++M MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLL+L S +S DE
Sbjct: 1 MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADE 60
Query: 61 IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
IG+GI LRTS HLHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQL
Sbjct: 61 IGQGIHLRTSHHLHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQL 120
Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
RHKFFPD K S+DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDKKASIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPK 240
Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
VIYNSRT KYVMWMHIDD NYTKASVGVA+SDYPTGPFDYLYSK+PHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTGKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKDD 300
Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
DGTAYL+YSSEDNSELH+G LS+DYLDVTNVA+R+L+GQHREAPALFKHQGTYYM+TSGC
Sbjct: 301 DGTAYLIYSSEDNSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSGC 360
Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
TGWAPNEAL HA+ESIMGPWET+GNPCIGGNK+FRLATFFSQSTFVLPLPS+P LFIFMA
Sbjct: 361 TGWAPNEALTHAAESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFMA 420
Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
DRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 DRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWRLPQGWNSLK 478
BLAST of CmoCh20G010210 vs. TAIR 10
Match:
AT3G49880.1 (glycosyl hydrolase family protein 43 )
HSP 1 Score: 720.3 bits (1858), Expect = 1.0e-207
Identity = 330/461 (71.58%), Postives = 387/461 (83.95%), Query Frame = 0
Query: 15 MRNRY-RKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSSHL 74
M+N++ +K+T LRC ++ +++GC+ ++HL SR + + + H
Sbjct: 4 MKNKHNKKATFLRCSPFG------LVSTVVGCVFMIHLTMLYSRSYSVDLDLSPQLLIHH 63
Query: 75 HF-RELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 134
RELE VEEENI +PPPRKRSPRA KR+PK TL++EFLDE+SQ+RH FFPD K++
Sbjct: 64 PIVRELERVEEENIHMPPPRKRSPRAIKRKPKTPTTLVEEFLDENSQIRHLFFPDMKSAF 123
Query: 135 DPM--IPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHK 194
P D S +Y+PGR+W DTEGNPIQAHGGG+LFD+ S+ YYWYGEYKDGPTY +HK
Sbjct: 124 GPTKEDTNDTSHYYFPGRIWTDTEGNPIQAHGGGILFDDISKVYYWYGEYKDGPTYLSHK 183
Query: 195 KGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKY 254
KGAARVDIIGVGCYSSKDLWTW+NEG+VL AEET+ETHDLHKSNVLERPKVIYNS T KY
Sbjct: 184 KGAARVDIIGVGCYSSKDLWTWKNEGVVLAAEETDETHDLHKSNVLERPKVIYNSDTGKY 243
Query: 255 VMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSS 314
VMWMHIDDANYTKASVGVA+SD PTGPFDYLYS+ PHGFDSRDMT++KDDD AYL+YSS
Sbjct: 244 VMWMHIDDANYTKASVGVAISDNPTGPFDYLYSRSPHGFDSRDMTVYKDDDNVAYLIYSS 303
Query: 315 EDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALA 374
EDNS LHIGPL+E+YLDV V KRI+VGQHREAPA+FKHQ TYYMITSGCTGWAPNEALA
Sbjct: 304 EDNSVLHIGPLTENYLDVKPVMKRIMVGQHREAPAIFKHQNTYYMITSGCTGWAPNEALA 363
Query: 375 HASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRD 434
HA+ESIMGPWETLGNPC+GGN +FR TFF+QSTFV+PLP PG+FIFMADRWNPADLRD
Sbjct: 364 HAAESIMGPWETLGNPCVGGNSIFRSTTFFAQSTFVIPLPGVPGVFIFMADRWNPADLRD 423
Query: 435 SRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 472
SRY+WLPL+VGG D+PL+Y+FGFP+WSRVS+YWHR+WRLP
Sbjct: 424 SRYLWLPLIVGGPADRPLEYSFGFPMWSRVSVYWHRQWRLP 458
BLAST of CmoCh20G010210 vs. TAIR 10
Match:
AT5G67540.2 (Arabinanase/levansucrase/invertase )
HSP 1 Score: 708.0 bits (1826), Expect = 5.2e-204
Identity = 331/471 (70.28%), Postives = 389/471 (82.59%), Query Frame = 0
Query: 13 MNMRNRY-RKSTALRCDAGSRCLISV--VIGSLMGCILLLHLCSPVSRKD-EIGRGI--- 72
M N+Y +KST+L C+ C S+ ++ +++G L+ HL S SRKD I + +
Sbjct: 1 MKKNNKYNKKSTSLHCNDAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSD 60
Query: 73 QLRTSSHLH---FRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRH 132
QL+ HL REL VEEE +++PPPRKRSPR +KRR +K L++EFLD+ S +RH
Sbjct: 61 QLQVVHHLAHPIVRELIRVEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRH 120
Query: 133 KFFPDHKTSV--DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 192
FFP KT+ G+++ +Y+PG++W+DT+GNPIQAHGGG+L D +S TYYWYGEY
Sbjct: 121 LFFPGIKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEY 180
Query: 193 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 252
KDGPTYHAHKKG ARVDIIGVGCYSSKDLWTW+NEGIVL AEETN+THDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPK 240
Query: 253 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 312
VIYN +T KYVMWMHIDDANYTKASVGVA+S+ PTGPF+YLYSKRPHGFDSRDMT+FKDD
Sbjct: 241 VIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDD 300
Query: 313 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 372
DG AYL+YSSE NS LHIGPL+EDYLDVT V KR++VGQHREAPA+FKHQ YYM+TS C
Sbjct: 301 DGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWC 360
Query: 373 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 432
TGWAPNEALAHA+ESIMGPWE LGNPCIGGNK+FRL TFF+QST+V+PLP PG FIFMA
Sbjct: 361 TGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMA 420
Query: 433 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 472
DRWNPADLRDSRY+WLPL++GG DQPL++NFGFP WSRVSIYWH KWRLP
Sbjct: 421 DRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 471
BLAST of CmoCh20G010210 vs. TAIR 10
Match:
AT5G67540.1 (Arabinanase/levansucrase/invertase )
HSP 1 Score: 701.0 bits (1808), Expect = 6.4e-202
Identity = 328/463 (70.84%), Postives = 382/463 (82.51%), Query Frame = 0
Query: 19 YRKSTALRCDAGS-RCLISVVIGSLMGCILLLHLCSPVSRKD-EIGRGI---QLRTSSHL 78
Y S LR AG R + ++ +++G L+ HL S SRKD I + + QL+ HL
Sbjct: 4 YSSSAGLRGFAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHL 63
Query: 79 H---FRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKT 138
REL VEEE +++PPPRKRSPR +KRR +K L++EFLD+ S +RH FFP KT
Sbjct: 64 AHPIVRELIRVEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRHLFFPGIKT 123
Query: 139 SV--DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHA 198
+ G+++ +Y+PG++W+DT+GNPIQAHGGG+L D +S TYYWYGEYKDGPTYHA
Sbjct: 124 AAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHA 183
Query: 199 HKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTR 258
HKKG ARVDIIGVGCYSSKDLWTW+NEGIVL AEETN+THDLHKSNVLERPKVIYN +T
Sbjct: 184 HKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTE 243
Query: 259 KYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVY 318
KYVMWMHIDDANYTKASVGVA+S+ PTGPF+YLYSKRPHGFDSRDMT+FKDDDG AYL+Y
Sbjct: 244 KYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIY 303
Query: 319 SSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEA 378
SSE NS LHIGPL+EDYLDVT V KR++VGQHREAPA+FKHQ YYM+TS CTGWAPNEA
Sbjct: 304 SSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEA 363
Query: 379 LAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADL 438
LAHA+ESIMGPWE LGNPCIGGNK+FRL TFF+QST+V+PLP PG FIFMADRWNPADL
Sbjct: 364 LAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADL 423
Query: 439 RDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 472
RDSRY+WLPL++GG DQPL++NFGFP WSRVSIYWH KWRLP
Sbjct: 424 RDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 466
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1EJG7 | 7.2e-294 | 100.00 | uncharacterized protein LOC111434812 OS=Cucurbita moschata OX=3662 GN=LOC1114348... | [more] |
A0A6J1IHC3 | 2.4e-289 | 98.74 | uncharacterized protein LOC111473520 OS=Cucurbita maxima OX=3661 GN=LOC111473520... | [more] |
A0A0A0LPY3 | 2.5e-270 | 90.79 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G285310 PE=3 SV=1 | [more] |
A0A1S3C6G0 | 4.7e-269 | 90.38 | uncharacterized protein LOC103497430 OS=Cucumis melo OX=3656 GN=LOC103497430 PE=... | [more] |
A0A6J1D780 | 4.1e-265 | 88.82 | uncharacterized protein LOC111017920 OS=Momordica charantia OX=3673 GN=LOC111017... | [more] |
Match Name | E-value | Identity | Description | |
XP_022927964.1 | 1.5e-293 | 100.00 | uncharacterized protein LOC111434812 [Cucurbita moschata] | [more] |
XP_023553112.1 | 2.1e-292 | 99.58 | uncharacterized protein LOC111810610 [Cucurbita pepo subsp. pepo] | [more] |
XP_022974783.1 | 4.9e-289 | 98.74 | uncharacterized protein LOC111473520 [Cucurbita maxima] | [more] |
KAG7011053.1 | 9.3e-288 | 99.58 | hypothetical protein SDJN02_27851, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_004148025.3 | 1.3e-270 | 90.79 | uncharacterized protein LOC101203100 [Cucumis sativus] | [more] |