CmoCh20G010210 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh20G010210
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionArabinanase/levansucrase/invertase
LocationCmo_Chr20: 6762326 .. 6771609 (+)
RNA-Seq ExpressionCmoCh20G010210
SyntenyCmoCh20G010210
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCCGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTAGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCACGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGTAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTAAAATCTATACTCTGCCTTTCTGAGCCTTCATTTTCTGTAATGATAGTCTAATACTTCCATGGCTAAAAATATTGGTTAAATTATAAACACATCCTAAACCAAGAAGTTGGTATCAATAATGCACCCAATTTTTTTGAAGTGTAATTTTAACTCTTTAAGTTTGAAGTTTGTTTTAAAACAGCCCTCGTAGGCTTTTGCAGTTAAATCTGTAGATGGAAAAGTTTGTATATTCTTATGTGAACAAATCTAAACACTTACATGGAAAAAATAAAGACACATGGTGCATCTGTGAACCTTGTATGTGGCCTCCTTTCAACCACCACCATTTTTTGTTCTTTCGTTTATGATTTTTTTTGCTAATCCTTCACCTTAAATTTACTTCCGGGTGTTTAGATTTGTTCACTTAAGTAATCCATGAGTAGAAAAAAACGTTTCTATCTATAGATTTAACGAAAAAAGTCTTCAAGGACTGTAATTAAAAAGTTCAACCTTCAATGGTTAAAACGAAACTTTTCGAAGTTTAGGGGTATAGCTGAACCCAACCCCATAGTTCAGGAATAATTTTGATAATTTAATCTAAAAATATTCCTATTGCATAAATTGTTGGCATGCATCTCTATGAATTTGTATGGAAACAAAAGATTTTGATTAGTACTCCAGTCTTTTTCCCTCCTTTTCCATTATTAACAATAGTTTCTTCTTTATTATGGCAATTTCCAATCCCAATTATTGCTCCATGGTGGTTTTAATGACTTTATGTCTATATCTTTTTAAATATGAAATATCAGGCTCCATTGATCGTTATATGTTGAATTCCTTCTAGTTCATCTTTTTGTACCTTATTCCACTCTTGATAAGGTCTACGTCCAACTCATGAATCTTTAGTTTTTGAGATGGTATGTGATGCTTGGGAGGTTACTTTATGATCTTGGGAGTATGACAGTGAGAAGAAATTTTAAGGCAGAGGAAGTGAGTGTGCTTTGTGGCCTTTTGGAGCTGATCGAGAGTATGCTGATTTTGAGATTAGAGATCAAGAAAATGGAGTTTAGATAAAACAGGATTTTCTTGGTTAAACGTTCAGTTAAAGAGTCAGTTGATGTTCCCTCTCTTTCTAAAGTTTCGGTAAATGATTTGTGGAAATCAAGAGTCTAATAAATGTGATATTCTAATCTGTTTGGTTTTGGGAAAGCTACAACACAATACATCAAGAAGTTACGATAAAATAATTATCAGACAATGAACCTTGCTTGGGGCAAGGAAAGAAGATGAAATTAGCATTAACCCGACTGCTCAACTTCTTTTATTAAACCCTTGCAGATTAGGACAAGTCCTTGGTGATTAGAGAAACACCAATCTAATTTTTTGAATCAAACAAATGGGGCCACAAGATTGAAGTACCCTTCAGCTAGCCTAATTTTGAAATTCAAGGTCCCAAAAAAGGTGAAAAGTTCTTTTATGGTAGTTAGCCTAAAGAAGTTTGAAGAAGCTTTAAAAGAATTTTCAACTTTCAAAAATAGATGATCTCTCCTTCTATGTGTTGCTTTGCTCACAAGAGGAGGAATCCCTTGATCACATGTTCTTGCAATGTTCTTTTGTGGCTAAAGGGCAGCAGCCTATCTTCAATGTGTTTGGGGTGGATTACTACTTCCCTAAGAACATTAATGATTGGCTTATGGAAGTGCTCAACGGTAGGGTGTTTGGCGGCAAGGGGAAGTCCTTTGGAGGTGTGATATGCCGTCCTTCCTTCAGAGCCTTTCGAAGGAAAGGAATAGCCAGGTTTTTAAAAGTAAGTATTCTTCTTGAGTCCTTTTGGCGTTTGGTGCAATACAAGGCCTCTCTGTGGTGTACAGTTCACACTAAATTCTCTTGCAATTAATGCCTTTTGATGATTCAATTGGAAGGCTCTCATTTTTTAGTTTTGTGGGGAGGGGTGACCACTGTGGGGAGTTGTTGTTTGGTTCTTTTGATTAATACGCGTCACTGCTTTTTATATATAAAATCATGATCTGCCAGATATTTTGGGAAACTTGCAACTAGGAATAGACAGCATGATGTTACTGGTCTAGATGATAGTTATAGCGTGAGGAACACTACCGAGGAGTTTAGAATTGATTTAGAGTTTTTTGATGCACCTCCTTGAAATTGGAATGTTAATTAGTCATGGCCGACAAGTTTATGGTCTTCTGAGCCCCAGAGGAAGGAATGCTGCTTGAGATGAGTTGGGTGATCTATTTCGTCTGTATGGGCCTTGTTGATGTGTGGTGGGGAGCTTTTATGTAATTTGCTTTGGTACTGAGAAGATCTTGGCTGGCTGGACGATGGAATCAATAGGCTCTTCAATTATTTTATTGAGAGAAATAGCCTCTTCGACTCTCCTCTATTGTACGGAAAGTTTACTTGGTTGAATAGTAGGGCACGTACTAGACTTCATAGGTTACTGATATCTAAAGGGTAGATGAATGTTCATGGGAGTGTGAGACAATTGCTAGGCCCTAGAATAACCTTCGACCGGTGTCCAGAGGTGATGCTTGTGTCCTTTCATATTTGAAAATATTTGGCTGAACCATACCTCCTTATAATCTTCCTTTCCTTCATGTTGAATGCGGAAACCACTGGAAATTAAGAAATTCGGAACGGTATCTTCCATGGGGAAGCTAAGGGTCCTCAAAGGAGTGTTATTGTGGAACAAAGACTCACGTAGGAGTTTCAAAATTCTGGAAATTAAAAAGCTCAAAACAAAGCCCTTTTGGTTAAATGGTTATGCCAATTTCCCCTCGGAGTCGATAGAGGATTATAGCAAGCAAATAGGGTCCTCATCCTTCTGAGTGGATGTCAGATGGGCCAAAGGCACTTACAAAAATATTAGACTTCTTAAATCTTAGAACAGAAATTGACATCCAGTGCAGACCTGAGGAAATAAATCTTAGAACAGGATCTCTAGTATTAGACTTCTTAAATTACCCCAATGTCTGTTTGAAGGTGCGCATCATTGGAAAAGGGACGCCCGTTTCTTTTTGCGTACTGATGAGGCCTACACTTCCTTCCAGCATCAATTTTATTGAAGTAATTTGAATTTAAGTCTCTTCCTTTAAGCCATTGTAACCTTCAGTCCTATAGTAGAAAATGAATTTCATGTTGTTCCTCTAGGAGAAATTTGTAAAAGAAAAAGCTTGAATGATATGATTCCTTCTTTAATGACGCGATTTTTGATGCCTTGAATCATAGGTCACCCCGTACACTTGAACTTAAAAAGAGGTGGTAATTTTTTTTTTATGTAAAGCGGGAAACACAAGGAATTTAAAATTCAAATATGCATTCAAAACCAACATAAAGGTAGTTTGAAGGTAATATCCGTGAGTCAAGTCAAAACGGTTTACCAAAAAAATTCACAAGTTTCGATAAGTAGCATTTAAAATAATAATAAAATGACAAGAAGGAAGACGACTCGATCTAAAGGGAACCCCCAATAGCTACACGGTGTCACAGTCATGCTTGTCCTAGTCGTGTTGCCCATGGCCGCATGCTCATTGCATGCCAGAACCCTTGAGTTGTACCACTTCATTGCTTTCACTGCTTTCATGATGTATAGTTTCTTTATTGTATTTTCTAGGTGTATGATGTTTCAATAGAATCATGTATATGCCTGGTAGACCTGAAATGCCTCTGAATGTAGCATAGGGAGATGTGACTAGTCGTCAAATCTTCGTACTCGGAGTTACATGCTATCTAAGCCATTGTGTTGCCTTTTTCGCTTATAGCCTATAACATTGCTTTATTCATTTCTCACAATTTTGTTTTCAAATTCTCTTTCTAAAATCTCTCTCTGGTGCTGTTTTCTAAAACCTTCTCCTTAAAGCAAGGCCAGAGGCTTGAGTATACGTCGTCAGGAAGAAGGCAACATGATTTCGAGTTGGCGGACTCACTCGGTATTGAGAGTGAGTGCCGCACAATCGCTTAAAAAACGTCCCGTAAGACTGCGATTGTGACAGTTGGTATCCGAGTCGAGTTGGCTCCAAATGTGATTCTGTAAACACGGCTACAACCAAGCAGTTGAACAAGTCCCACATCGATCGACTAGTCGACATCGAAGAACAGATGCAATTCTTGTGAGAAGTTCCTTACAATGTTCAGTACTTGGACGAGCGGGTGAAAGAGCTCGCCGACAAGACCAATATGGTTGATGTAGTTGCAGGTTGATTAGATGAGTTACCCATCCAGGAACATATTTACCGAGTAGACAACCTAAAAGCAAAAGCCACAAAGACCGGTGGCTTCGAGCGAGGAGACAGCTTGACGGGCTCTGCTGCCCTATTTTAAGAGTGAGTTGATGGTCTAGACAGCTCCCAAAAAGCAGTTATGTAGATGGTCTCTGAGATACTTGAAGATGTGAAATTAGCCTTCGACGTGGTCAGGGCAGAAATTGCTAGGAATCACGATTCTCCCCAATGGTATGATATTGTTCACTTTGAGCATAAACTTTCATGGTTTTGCTTTGGGCTTTCCTAAAGGTCTCATTCCAATGGAGATCTATTCCTTGCTTATAAATTCATGATCATTCCCTACATTAGCCAATGTGGGACAACCTCCAAACAATCCTCAACATTCCTCCCCTCGAACAAAGTACAACACAGAGCCTCCCCTGAGGCCTATGGAGCCCTTGAATAACCTCTCCTTAATTGAAGCTCAACTCCTCTGGAGCCCTCAAACAAAGTACACTCTTTGTTCAACTCTTTACCTTTTGACTACATCTTTGAGGCTCATATAAGTTTAGGGCATGGCTCTGATACCATGTTAGGAATCACGACTCTCCACAATTGTGATATTGCCTTTTTTGAGCATAAGCTCTCATGGCTTTGCTTTGGGCTTCCCCAAAAGGCCTCATTCCAATGGAGATCTATTTTTTTGCTTATAAATCCATGATCATTCCCTAAATTAGCAAATGTAGGACAACCTCCCAACAATACTCAACATAAATGACGGATCTAAGCACTAGAGTTAATATCACTGTTAGAACGGTGGGAAATCAAACCCCTACAAGGGGAGCCGTTCAGTTCAACAAGATCAAGGTTCCGCAGCCCAAACCCTTCTATGGGGTTCGAGATGCCGAGGTCCTAGCGACCTTGATCAATACTTTCGGGCGATGAACACAACAATAGAGGAAGCAAAGGTCACCTTGGCCACCATACATCAAGCCGAGGATGCGAAATTTTGATGGACATCGAAGTACATTGATATTCAATAGGGCCGGTGCACAATAGACACATGGAAAAGACGGAAACAAGAACTTCAATCTCAATTCTTCCCAGAGAATGTTGAAATCTTGGCTAGAAGGAAATTGCGAGATCTCAAGCACACAAGAAACATCCGAGAGTATGTCAATCAGTTCTCAGTGTTCATGTTGGACATCTGAGATATGTTCGAGAAAGACAAGATCTTCCCGTTCATAGAAGGACTAAAGCCATGAGCGAAGTCCAAACTATATGGGCAGAGAATACAAGACCTCTCCACGACCTATGCTACAGCTGAACGATTGTTTGATCTAAGTAACGAACAATCCCAAGATGTAAGGCGAAGCCAAACCTCCTCGAGTGGATTGTTTGATATGAGCTGAACGATTGTTTGATTTGAATCTACCATTTTACTCTTGGCCAATTTATGATTGACCCATGGTTCTACATACACGAGACCTTTTTCGATCGGCTCTTTTGACTCCCCTACTTTCTTTTGTAGGGCAGATAGAAACTTCAGGGCCCCATACGAGGGTTTGCTCTGTTCTTCGATGACGACTTTTCAGTTTTCCCACCTTCTGACTTTGCCTCTAAGTCTGCGTCCAAAGCTGCTTGGAAGGCTTGTAGTGTCGCTTTCTTGGGGCACTCGTATACTCTATAGTTCTCTCTACATAAGAAGCAAGAGGGAGGTAGCCGATTGAAGTAATTTTGATGGTAGGGCCCTCGACCTAACACTAGTTTTAGGAAAATCACCCTAGGTTGGTCTAGTGGTCCTCAAGGGACATGTAAATAATAAAAGACAAAATAAACTAGCTCAAACCATAGTAGCCGCTTACACAAGATTTAATATCCTATAAGTACCTTGACAACCAAATGTAGTTGGTAATAATCCTACAGGTACCTTGGTTGTGAGAATACTCAAAGTGCTTGTAAGCTTCCCATACACTAATGATATCAGAGATAAAGGAGTTCTAGGGAATCCGTGTTATTGAGAGGACTCTAAGCGCACAATCCTAAAATACATCCTTAGCCATCTTTGGAAGAATCTTATACCTCGTAAAATTTACCTTCCACAATCCTAAGATGTTAAACCTGTGTGCCCCGGTATTCCCTAACATCCCTTGTGTTATGGTCAAGCTATTTTCTCTTCCCTTCTAAGAGTGAAGACTATCCCCACAAACTAACATAAGTCCTTGTAGCATGCTTTATTCTCACTCACATGCAACTAAATTGCTCCAAGCAAAATACGTTTAACTGGAGTTCATATGATTGAGCCACCGAAAATGAATGTACACCTTAATGGGTTAGATAGTGACTTTCAATTCTTTTAATTCTTCATTAGTTATCCTATCATTAGGATCGCTCTCATTCATTATTCTAATCTGTACTTGAAAGATTTTCATTCTCTGTTTCCATCGCTTCTATCCTTTCAATTCTTTTACCAAGTTTTTAAAATTATTCCATGTATGTTACTTTGCAAGTGTGTGCTGTGTAACATCAAACCTACATTTTCAATGAAGTTTGTTCTAATTCAACAACACAATCAAGATTATTCAACATGTGGTAATCCTTTGTTGTGTGTCATGGCAAAAAGGATAGAAGTGATGTTTGGTAAATATTGGAAAGATTTTCTCTTGTGTGTTGCTGTTATGCTAGAATCTCATTATACGCTTGAATTCGTGATGTTTATGCTTACAGATTTATATAATGAAAGATTGCAAACCAAATTAGGAAGAAAAAAAGTTAAAGATTAGTTTTCTCGACACTTAAATATTAAATATTAGTTCTTGCTGTCCTCTTGCTGTCCTCCTTCGACCGCAAGAACAATAGAACAGCTCCTTCTGGGCCATCTTTTCTAAGAATAATGCGAAGATTTTGTCGAAATATTCACTCAAAGGTGTTTTCTGGTATACATGGCTCGAAAGGAACCCCGAAAATTTTAAAAATTCAAGTAGAAACTTCAACCAAAGTTGGAAATGCATGTTAGAACGTTTTCTTGAAGTTGAACTTCTGTTCTTATGTTCTTGCCTATTTTGGGTTGATTGCTTATCTTCAGCTTTTGAACCATCTATTAGGCTGTGTTCTAGGATCCTTCCTGTGATTGAGATGATTGATATCAGGAGTTTGAAGGGCCGTCAGAATTGTTGATACTGAGTTGATCGGAGGATGTAATATTTTTCATCTAATGATCTTAAAACCAACGATTGTTCTGTTGCGACTTTCTTTCAGATCGGATAATCTTGAATCCTGCGCCACTTGCTTTAAATTTATCACATTTTGTCACCATCTTTTGTTCATCATGTGACTTTCAAGAGCAACCAATTAACCTTCAAAACTACACTAACCTGTTGGCATTCTTATGATTAGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTATAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTTGATCAACCCCTGGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGATACAAAATGTTCGTCGTCTCCCACTCGATGTACCATACAAGAAAGTTGTTTGTTTGATTCTTTGGGTGTCCCATAGTTTTCCTTACCAACTCAATAATGTTAACTAGTCCGTATTAAAGGCACATAATTAACAACCCTTGAATGAACTTATAAATCATTTCTCCTCACAACACACTATGCCACCTTTCCAATGTTCAATACATTTAGGTCTAAGGATTTCTAAAAAGTAATATTAGTTTTGATGTGCCA

mRNA sequence

ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCCGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTAGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCACGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGTAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTATAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTTGATCAACCCCTGGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGATACAAAATGTTCGTCGTCTCCCACTCGATGTACCATACAAGAAAGTTGTTTGTTTGATTCTTTGGGTGTCCCATAGTTTTCCTTACCAACTCAATAATGTTAACTAGTCCGTATTAAAGGCACATAATTAACAACCCTTGAATGAACTTATAAATCATTTCTCCTCACAACACACTATGCCACCTTTCCAATGTTCAATACATTTAGGTCTAAGGATTTCTAAAAAGTAATATTAGTTTTGATGTGCCA

Coding sequence (CDS)

ATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAACCGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCCGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTAGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCACGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTGGATACTGAGGGTAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGGACTCGAAAATATGTAATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTATAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGATGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCAGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTAGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCTTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTTGATCAACCCCTGGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

Protein sequence

MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Homology
BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match: A0A6J1EJG7 (uncharacterized protein LOC111434812 OS=Cucurbita moschata OX=3662 GN=LOC111434812 PE=3 SV=1)

HSP 1 Score: 1018.8 bits (2633), Expect = 7.2e-294
Identity = 478/478 (100.00%), Postives = 478/478 (100.00%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE
Sbjct: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC
Sbjct: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478

BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match: A0A6J1IHC3 (uncharacterized protein LOC111473520 OS=Cucurbita maxima OX=3661 GN=LOC111473520 PE=3 SV=1)

HSP 1 Score: 1003.8 bits (2594), Expect = 2.4e-289
Identity = 472/478 (98.74%), Postives = 473/478 (98.95%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRK E
Sbjct: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IGRGIQLRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61  IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYL YSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFK QGTYYMITSGC
Sbjct: 301 DGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVL LPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478

BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match: A0A0A0LPY3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G285310 PE=3 SV=1)

HSP 1 Score: 940.6 bits (2430), Expect = 2.5e-270
Identity = 434/478 (90.79%), Postives = 458/478 (95.82%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           ML+YLGDKKD++M MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLL+L S +S  DE
Sbjct: 1   MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IG+GI LRTS HLHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQL
Sbjct: 61  IGQGIHLRTSHHLHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPD K S+DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDKKASIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRT KYVMWMHIDD NYTKASVGVA+SDYPTGPFDYLYSK+PHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTGKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYL+YSSEDNSELH+G LS+DYLDVTNVA+R+L+GQHREAPALFKHQGTYYM+TSGC
Sbjct: 301 DGTAYLIYSSEDNSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEAL HA+ESIMGPWET+GNPCIGGNK+FRLATFFSQSTFVLPLPS+P LFIFMA
Sbjct: 361 TGWAPNEALTHAAESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRY+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 DRWNPADLRDSRYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 478

BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match: A0A1S3C6G0 (uncharacterized protein LOC103497430 OS=Cucumis melo OX=3656 GN=LOC103497430 PE=3 SV=1)

HSP 1 Score: 936.4 bits (2419), Expect = 4.7e-269
Identity = 432/478 (90.38%), Postives = 457/478 (95.61%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           ML+YLGDKKD++M MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLL+L S +   DE
Sbjct: 1   MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAIRPADE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IG+ I LRTS HLHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQ+
Sbjct: 61  IGQHIHLRTSHHLHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQI 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPD KTS+DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERS TYYWYGEY
Sbjct: 121 RHKFFPDQKTSIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSGTYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRTRKYVMWMHIDD NYTKASVGVA+SDYPTGPFDYLYSKRPHG DSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKRPHGCDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYL+YSSEDNSELH+G LSEDYLDVTNVA+RIL+GQHREAPALFKHQGTYYM+TSGC
Sbjct: 301 DGTAYLIYSSEDNSELHVGSLSEDYLDVTNVARRILIGQHREAPALFKHQGTYYMVTSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEAL HA+ESIMGPWET+GNPC+GGNK+FRLATFFSQSTFVLP+PS+P LFIFMA
Sbjct: 361 TGWAPNEALTHAAESIMGPWETMGNPCMGGNKMFRLATFFSQSTFVLPVPSYPNLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 DRWNPADLRDSRYLWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWRLPQGWNSLK 478

BLAST of CmoCh20G010210 vs. ExPASy TrEMBL
Match: A0A6J1D780 (uncharacterized protein LOC111017920 OS=Momordica charantia OX=3673 GN=LOC111017920 PE=3 SV=1)

HSP 1 Score: 923.3 bits (2385), Expect = 4.1e-265
Identity = 421/474 (88.82%), Postives = 455/474 (95.99%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP+SRKDE
Sbjct: 1   MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPISRKDE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQ 120
           I RGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAAKRRPKKT TLIDEFLDEDSQ
Sbjct: 61  IVRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLIDEFLDEDSQ 120

Query: 121 LRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGE 180
           +RHKFFPDHKTSVDPM  G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGE
Sbjct: 121 IRHKFFPDHKTSVDPMKTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGE 180

Query: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERP 240
           YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERP
Sbjct: 181 YKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERP 240

Query: 241 KVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300
           KVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Sbjct: 241 KVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD 300

Query: 301 DDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSG 360
           DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSG
Sbjct: 301 DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSG 360

Query: 361 CTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFM 420
           CTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATFF+QSTFVLPLPSHP LFIFM
Sbjct: 361 CTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFM 420

Query: 421 ADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQG 474
           ADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Sbjct: 421 ADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG 474

BLAST of CmoCh20G010210 vs. NCBI nr
Match: XP_022927964.1 (uncharacterized protein LOC111434812 [Cucurbita moschata])

HSP 1 Score: 1018.8 bits (2633), Expect = 1.5e-293
Identity = 478/478 (100.00%), Postives = 478/478 (100.00%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE
Sbjct: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC
Sbjct: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478

BLAST of CmoCh20G010210 vs. NCBI nr
Match: XP_023553112.1 (uncharacterized protein LOC111810610 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1015.0 bits (2623), Expect = 2.1e-292
Identity = 476/478 (99.58%), Postives = 476/478 (99.58%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           MLHYLGDKKDQKMNMRNRYRKS ALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE
Sbjct: 1   MLHYLGDKKDQKMNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IGRGIQLRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61  IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC
Sbjct: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478

BLAST of CmoCh20G010210 vs. NCBI nr
Match: XP_022974783.1 (uncharacterized protein LOC111473520 [Cucurbita maxima])

HSP 1 Score: 1003.8 bits (2594), Expect = 4.9e-289
Identity = 472/478 (98.74%), Postives = 473/478 (98.95%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRK E
Sbjct: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKYE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IGRGIQLRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL
Sbjct: 61  IGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEETNETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETNETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYL YSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFK QGTYYMITSGC
Sbjct: 301 DGTAYLAYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKDQGTYYMITSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVL LPSHPGLFIFMA
Sbjct: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLRLPSHPGLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 478

BLAST of CmoCh20G010210 vs. NCBI nr
Match: KAG7011053.1 (hypothetical protein SDJN02_27851, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 999.6 bits (2583), Expect = 9.3e-288
Identity = 470/472 (99.58%), Postives = 470/472 (99.58%), Query Frame = 0

Query: 7   DKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQ 66
           DKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQ
Sbjct: 14  DKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQ 73

Query: 67  LRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFP 126
           LRTS HLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLR KFFP
Sbjct: 74  LRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRPKFFP 133

Query: 127 DHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTY 186
           DHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTY
Sbjct: 134 DHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTY 193

Query: 187 HAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSR 246
           HAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSR
Sbjct: 194 HAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSR 253

Query: 247 TRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYL 306
           TRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYL
Sbjct: 254 TRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYL 313

Query: 307 VYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPN 366
           VYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPN
Sbjct: 314 VYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPN 373

Query: 367 EALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPA 426
           EALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPA
Sbjct: 374 EALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPA 433

Query: 427 DLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
Sbjct: 434 DLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 485

BLAST of CmoCh20G010210 vs. NCBI nr
Match: XP_004148025.3 (uncharacterized protein LOC101203100 [Cucumis sativus])

HSP 1 Score: 942.6 bits (2435), Expect = 1.3e-270
Identity = 434/478 (90.79%), Postives = 459/478 (96.03%), Query Frame = 0

Query: 1   MLHYLGDKKDQKMNMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDE 60
           ML+YLGDKKD++M MRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLL+L S +S  DE
Sbjct: 1   MLNYLGDKKDERMKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADE 60

Query: 61  IGRGIQLRTSSHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQL 120
           IG+GI LRTS HLHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQL
Sbjct: 61  IGQGIHLRTSHHLHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQL 120

Query: 121 RHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
           RHKFFPD K S+DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY
Sbjct: 121 RHKFFPDKKASIDPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYNSRT KYVMWMHIDD NYTKASVGVA+SDYPTGPFDYLYSK+PHGFDSRDMTIFKDD
Sbjct: 241 VIYNSRTGKYVMWMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DGTAYL+YSSEDNSELH+G LS+DYLDVTNVA+R+L+GQHREAPALFKHQGTYYM+TSGC
Sbjct: 301 DGTAYLIYSSEDNSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSGC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEAL HA+ESIMGPWET+GNPCIGGNK+FRLATFFSQSTFVLPLPS+P LFIFMA
Sbjct: 361 TGWAPNEALTHAAESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 479
           DRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 DRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWRLPQGWNSLK 478

BLAST of CmoCh20G010210 vs. TAIR 10
Match: AT3G49880.1 (glycosyl hydrolase family protein 43 )

HSP 1 Score: 720.3 bits (1858), Expect = 1.0e-207
Identity = 330/461 (71.58%), Postives = 387/461 (83.95%), Query Frame = 0

Query: 15  MRNRY-RKSTALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSSHL 74
           M+N++ +K+T LRC          ++ +++GC+ ++HL    SR   +   +  +   H 
Sbjct: 4   MKNKHNKKATFLRCSPFG------LVSTVVGCVFMIHLTMLYSRSYSVDLDLSPQLLIHH 63

Query: 75  HF-RELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 134
              RELE VEEENI +PPPRKRSPRA KR+PK   TL++EFLDE+SQ+RH FFPD K++ 
Sbjct: 64  PIVRELERVEEENIHMPPPRKRSPRAIKRKPKTPTTLVEEFLDENSQIRHLFFPDMKSAF 123

Query: 135 DPM--IPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHK 194
            P      D S +Y+PGR+W DTEGNPIQAHGGG+LFD+ S+ YYWYGEYKDGPTY +HK
Sbjct: 124 GPTKEDTNDTSHYYFPGRIWTDTEGNPIQAHGGGILFDDISKVYYWYGEYKDGPTYLSHK 183

Query: 195 KGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKY 254
           KGAARVDIIGVGCYSSKDLWTW+NEG+VL AEET+ETHDLHKSNVLERPKVIYNS T KY
Sbjct: 184 KGAARVDIIGVGCYSSKDLWTWKNEGVVLAAEETDETHDLHKSNVLERPKVIYNSDTGKY 243

Query: 255 VMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSS 314
           VMWMHIDDANYTKASVGVA+SD PTGPFDYLYS+ PHGFDSRDMT++KDDD  AYL+YSS
Sbjct: 244 VMWMHIDDANYTKASVGVAISDNPTGPFDYLYSRSPHGFDSRDMTVYKDDDNVAYLIYSS 303

Query: 315 EDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALA 374
           EDNS LHIGPL+E+YLDV  V KRI+VGQHREAPA+FKHQ TYYMITSGCTGWAPNEALA
Sbjct: 304 EDNSVLHIGPLTENYLDVKPVMKRIMVGQHREAPAIFKHQNTYYMITSGCTGWAPNEALA 363

Query: 375 HASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRD 434
           HA+ESIMGPWETLGNPC+GGN +FR  TFF+QSTFV+PLP  PG+FIFMADRWNPADLRD
Sbjct: 364 HAAESIMGPWETLGNPCVGGNSIFRSTTFFAQSTFVIPLPGVPGVFIFMADRWNPADLRD 423

Query: 435 SRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 472
           SRY+WLPL+VGG  D+PL+Y+FGFP+WSRVS+YWHR+WRLP
Sbjct: 424 SRYLWLPLIVGGPADRPLEYSFGFPMWSRVSVYWHRQWRLP 458

BLAST of CmoCh20G010210 vs. TAIR 10
Match: AT5G67540.2 (Arabinanase/levansucrase/invertase )

HSP 1 Score: 708.0 bits (1826), Expect = 5.2e-204
Identity = 331/471 (70.28%), Postives = 389/471 (82.59%), Query Frame = 0

Query: 13  MNMRNRY-RKSTALRCDAGSRCLISV--VIGSLMGCILLLHLCSPVSRKD-EIGRGI--- 72
           M   N+Y +KST+L C+    C  S+  ++ +++G  L+ HL S  SRKD  I + +   
Sbjct: 1   MKKNNKYNKKSTSLHCNDAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSD 60

Query: 73  QLRTSSHLH---FRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRH 132
           QL+   HL     REL  VEEE +++PPPRKRSPR +KRR +K   L++EFLD+ S +RH
Sbjct: 61  QLQVVHHLAHPIVRELIRVEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRH 120

Query: 133 KFFPDHKTSV--DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 192
            FFP  KT+        G+++ +Y+PG++W+DT+GNPIQAHGGG+L D +S TYYWYGEY
Sbjct: 121 LFFPGIKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEY 180

Query: 193 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 252
           KDGPTYHAHKKG ARVDIIGVGCYSSKDLWTW+NEGIVL AEETN+THDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPK 240

Query: 253 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 312
           VIYN +T KYVMWMHIDDANYTKASVGVA+S+ PTGPF+YLYSKRPHGFDSRDMT+FKDD
Sbjct: 241 VIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDD 300

Query: 313 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 372
           DG AYL+YSSE NS LHIGPL+EDYLDVT V KR++VGQHREAPA+FKHQ  YYM+TS C
Sbjct: 301 DGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWC 360

Query: 373 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 432
           TGWAPNEALAHA+ESIMGPWE LGNPCIGGNK+FRL TFF+QST+V+PLP  PG FIFMA
Sbjct: 361 TGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMA 420

Query: 433 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 472
           DRWNPADLRDSRY+WLPL++GG  DQPL++NFGFP WSRVSIYWH KWRLP
Sbjct: 421 DRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 471

BLAST of CmoCh20G010210 vs. TAIR 10
Match: AT5G67540.1 (Arabinanase/levansucrase/invertase )

HSP 1 Score: 701.0 bits (1808), Expect = 6.4e-202
Identity = 328/463 (70.84%), Postives = 382/463 (82.51%), Query Frame = 0

Query: 19  YRKSTALRCDAGS-RCLISVVIGSLMGCILLLHLCSPVSRKD-EIGRGI---QLRTSSHL 78
           Y  S  LR  AG  R  +  ++ +++G  L+ HL S  SRKD  I + +   QL+   HL
Sbjct: 4   YSSSAGLRGFAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSDQLQVVHHL 63

Query: 79  H---FRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKT 138
                REL  VEEE +++PPPRKRSPR +KRR +K   L++EFLD+ S +RH FFP  KT
Sbjct: 64  AHPIVRELIRVEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRHLFFPGIKT 123

Query: 139 SV--DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHA 198
           +        G+++ +Y+PG++W+DT+GNPIQAHGGG+L D +S TYYWYGEYKDGPTYHA
Sbjct: 124 AAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEYKDGPTYHA 183

Query: 199 HKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTR 258
           HKKG ARVDIIGVGCYSSKDLWTW+NEGIVL AEETN+THDLHKSNVLERPKVIYN +T 
Sbjct: 184 HKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPKVIYNEKTE 243

Query: 259 KYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVY 318
           KYVMWMHIDDANYTKASVGVA+S+ PTGPF+YLYSKRPHGFDSRDMT+FKDDDG AYL+Y
Sbjct: 244 KYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDDDGVAYLIY 303

Query: 319 SSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEA 378
           SSE NS LHIGPL+EDYLDVT V KR++VGQHREAPA+FKHQ  YYM+TS CTGWAPNEA
Sbjct: 304 SSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWCTGWAPNEA 363

Query: 379 LAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADL 438
           LAHA+ESIMGPWE LGNPCIGGNK+FRL TFF+QST+V+PLP  PG FIFMADRWNPADL
Sbjct: 364 LAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMADRWNPADL 423

Query: 439 RDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 472
           RDSRY+WLPL++GG  DQPL++NFGFP WSRVSIYWH KWRLP
Sbjct: 424 RDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 466

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1EJG77.2e-294100.00uncharacterized protein LOC111434812 OS=Cucurbita moschata OX=3662 GN=LOC1114348... [more]
A0A6J1IHC32.4e-28998.74uncharacterized protein LOC111473520 OS=Cucurbita maxima OX=3661 GN=LOC111473520... [more]
A0A0A0LPY32.5e-27090.79Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G285310 PE=3 SV=1[more]
A0A1S3C6G04.7e-26990.38uncharacterized protein LOC103497430 OS=Cucumis melo OX=3656 GN=LOC103497430 PE=... [more]
A0A6J1D7804.1e-26588.82uncharacterized protein LOC111017920 OS=Momordica charantia OX=3673 GN=LOC111017... [more]
Match NameE-valueIdentityDescription
XP_022927964.11.5e-293100.00uncharacterized protein LOC111434812 [Cucurbita moschata][more]
XP_023553112.12.1e-29299.58uncharacterized protein LOC111810610 [Cucurbita pepo subsp. pepo][more]
XP_022974783.14.9e-28998.74uncharacterized protein LOC111473520 [Cucurbita maxima][more]
KAG7011053.19.3e-28899.58hypothetical protein SDJN02_27851, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_004148025.31.3e-27090.79uncharacterized protein LOC101203100 [Cucumis sativus][more]
Match NameE-valueIdentityDescription
AT3G49880.11.0e-20771.58glycosyl hydrolase family protein 43 [more]
AT5G67540.25.2e-20470.28Arabinanase/levansucrase/invertase [more]
AT5G67540.16.4e-20270.84Arabinanase/levansucrase/invertase [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006710Glycoside hydrolase, family 43PFAMPF04616Glyco_hydro_43coord: 199..382
e-value: 9.7E-21
score: 74.3
IPR023296Glycosyl hydrolase, five-bladed beta-propellor domain superfamilyGENE3D2.115.10.20Glycosyl hydrolase domain; family 43coord: 137..449
e-value: 3.0E-98
score: 331.1
IPR023296Glycosyl hydrolase, five-bladed beta-propellor domain superfamilySUPERFAMILY75005Arabinanase/levansucrase/invertasecoord: 156..439
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 80..105
NoneNo IPR availablePANTHERPTHR22925:SF52GLYCOSYL HYDROLASE FAMILY 43 PROTEINcoord: 3..476
NoneNo IPR availablePANTHERPTHR22925GLYCOSYL HYDROLASE 43 FAMILY MEMBERcoord: 3..476
NoneNo IPR availableCDDcd18825GH43_CtGH43-likecoord: 158..440
e-value: 4.16656E-167
score: 470.542

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh20G010210.1CmoCh20G010210.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds