Cp4.1LG01g17280 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g17280
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
Descriptionglycosyl hydrolase family protein 43
LocationCp4.1LG01 : 11896584 .. 11906426 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TAACGTGTTTTTTTAAAGTCAAAATATGGAAAACCCTAGGTCATAGGCGCAAGAAGAGTTGTGGACTTCAGACATTTCTTAAAAATGCGTGCTGGACACACTGAATCTTTGGTTTCAAGTCCAGGTCACCATCTATCCACATTCCTACCAAACTAAGATCCGACATCCCAACTTCTCCTTTTCCGTTCCTCATTCTGCAAATCTGAAAGCGACCCGACCCCATCGGAGGACCTCATCAACAACCCATCTTCAATCATCGTCGCCGGATTCTACTCAATTACTGCACTGCATTTCCGATTTTCCTTTTACTTCAAACCATCTCTTCTCCCCAATCCTATGTCTCCCAACTTCGCTTCATTTTTCAGGTCTGTTTTCCTTCTCAAATCGCTTTTTCTTGCTCTTCTGTTACTGCAATTATTTTCGTTTCTCTTCCTTCTGTTTCTTAATTTCTGTTGATCTATCTGCTCTCTGTCACCATCTTCGTATATTTTGTTGAATCTATGTCGTTGATGCGTTTTATTTCCGACATATTGTTCACGATTGTTCAATTTCCTTTGATTTTGTTTCTCGTATTTACTCATCGAGGGATCCTTCTCGTCTTTCCTCCTTTGTTTATCCTCTTCTTTATCTTGTTGTTTGTTTTTACCTAAATCTTAACTACAATAGTTTATGAGCTTTTGAAATTGGGGTTTTCCTCTCTGTACAGTAGTAGAATGTTGTGACTTATGTCCGTTTGAATTTGTTGAATCCCATAGGTTTCATTTTCTTAAATGCTGCATTATCTTGGAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAAACGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCTGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTCGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCGCGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATCGATGAATTTCTTGATGAAGATTCACAACTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTAGATACTGAGGGGAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTAAAATCTATACTCTGCCTTTCTGAGCCTTCATTTTCTGTAATGATAGTCTAATACTTCCATGGCTAAAATTATTGGTTAAATTATAAAGACATCCTAAACCAAGAAGTTGGTATAAAAAATGCACCCTATTTTTTTGAAGTGTAATTTTAACTCTTTAAGTTTGAAGTTTGTTTTAAAATAGCCCTCGTAGGCTTTTGCAGTTAAATTTGTAGATGGAAAAGTTTGTATATTCTTATGTGAACAAATCTAAACACTTACATGGAAAAAATAAAGACACATGGTGCATCTGTGAACCTTGTATGTGGCCTCCTTTCAACCACCACCATTTTTTGTTCTTTCGTTTATGATTTTTTTTGCTAATCCTTCTCCTAAAATTTACTTCCGGGTGTTTAGATTTGTTCACTTAAGTAATACATAAGTAGAAACAAACGTTCCTATCTATAAATTTAACGAAAAAGTCTTCAAGGACTGTAATTAAAAAGTTCAACCTTCAATGGTTAAAATGAAACTTTTAGAAGTTTAGGGGTATAGCTGAACCCAACCCCATAGTTCAGGAATAATTTTGATAATTTAATCTAGAAATGTTCCTATTGCATAAATTGTTGGCATGCATCTCTATGAATTTGTATGGAAACAAAAGATTTTGATTAGTACTCCAGTCTTTTTCCCTCCTTTTCCATTATTAACAATAGTTTCTTCTTTATTATGGCAATTTCCAATCCCAATTATTGCTCCATGGTGGTTTTAATGACTTTATGTCTATTATCTTTTTAAATATGAAATATCAGCTCCAGTGATCGTTGTATGTTGAATTCCTTATAGTTCATCTTTTTGTACCTTATTCCACTCTTGATAAGGTCTACGTCCAACTCATGAATCTTTAGTTTTTGAGATGGTATGTGATGCTTGGTTGGTTACTTTAAGATCTTGGGAGTATGACAGTGAGAAGAAATTTTAAGGCAGAGGAAGTGAGTGTGCTTTGTGGCCTTTTGGAGCTGATCGAGAGTATGCTGATTTTGAGATTAGAGGTCAAGAAAATGGAATTTAGATAAAACAGGATTTTCTTGGTTAAACGTTCGGTTAAAGAGTCAGTTGATGTTCCCTCTCTTTCTAAAGTTTCGTTAAATGATTTGTGGAAATCAAGAGTCTAATAAATGTGATATTCTAATCTGTTTGGTTTTGGGAAAGCTACAACACAATACATCAAGAAGTTACGAGAAAATAATTATCAGACAATGAACCTTGCTTGGGGCAAGGAAAGAAGATGAAGTTAGCATTAACCCGACTGCTCAACTTCTTTTATTAAACGCTTGCAGATTAGGACAAGTCCTTGGTGATTAGAGAAACACCAATCTAATTTTTTGACTCAAACGAATGAGGCCACAAGATTGAAGTACCCTTGGGCGAGCCTAATTTTGAAATTCAAGGTCCCAAAAACGGTAAAAAGTTCTTTTATGGTAGTTAGCCTAAAGAAGTTTGAAGAAGCTTTAAAAGAATTTTCAACTTTCAAAAATGGATGATCTCTCCTTCTATGAGTTGCCTTTGCTTACAAGAGGAGGAATCCCTTGATCACTTTTTCTTGCAATGTTCTTTTGTGGCTAAAGGGCAGCAGCCTATCTTCAATGTATTTGGGGTGGATTACTACTTCCCTAAGAACATTAATGATTGGCTTATGGAAGTGCTCAACGGTAGGGTGTTTGGCGGCAAGGGGAAAGTCCTTTGGAGGTGTGATATGCCTTCCTTCCTTCAGAGCCTTTGGAAGGAAAGGAATAGCCAGGTTCTTAAAAGTAAGTATTCTTCTTGAGTCCTTTTGGCGTTTGGTGCAATACAAGGCCTCTTGGTGGTGCACAGTTCACACTAAATTCTCTTGCAATTAATGCCTTTTGATGATTCAATTGGAAGGCTCTCATTTGTTAGTTTTGTGGGGAGGGGTGACCACTGTGGGGTTGTTGTTTGGTTCTTTCGATCAATACGCGTCACTGCTTTTTATATATAAAATAATGACCTGCCAGATATTTTGGGAAACTTGCAATTAGGAATAGACAGCATGATGTTACTGGTCTAGATGATAGTTATAGCGTGAGGAACACTACCGAGGAGTTTAGAATTGATTTAGAGTTTTTTGATGCACCTCCTTGAAATTGGAATGTTAATTAGTCATGGCCGACGAGTTTATGGTCCTCTGAGCCCCAGGGGAAGGAATGCTGTTTGAGACGAGTTGGGTGACCTATTTCGTCTGGATGGGCCTTGTTGATGTGTGGTGGGGAGCTTTTATGTAATTTGCTTTGGTACTGAGAAGATCTTGGCTGGCTGGACAATGAAATCAATAGGCTCTTCAATTATTTTATTGAGAGAAATAGCCTCTTCGACTCTCCTCTATTGTACGGAAAGTTTACTTGGTTGAATAGTAGGGCACGTAGTAGACTTCATAGGTTACTAATATCTAAAGGGTAGATGAATGTTCATGGGAGTGTGAGACAATTGCTAGGCCCTAGAATAACCTTCGACAGGTGTCCTGAGGTGATGCTTGTGTCCTTTCATATTTGAAAATATTTGGTTCATATTGAATGCGGAAACCACTGGAAATTAAGAAATTGGGAACGGGTATCTTCCATGGGGAAGCTGACGGTCCTCAAAGGAGTGTTATTGTGGAACAAAGACTCACGTAGGAGTTTTAAAATTCTGGAAATTAAAAAGCTCAAAACAAAGCCCTTTTGGTTAAATGGTTATGCCAATTTCCCCTTGAGTCGATAGAGGATTATAGCAAGCAAATAGGGTCCTCATCCTTCTGAGTGGATGTCAGATGCGTCAAAGGCACTTACAAAAATATTAGACTTCTTAAATCTTAGAACAGAAATTGACATCCAGTGCAGACCTGAGGAAATAAATCTTAGAACAGGATCTCTAGTATTAGACTTATTAAATTATCCCAATGTCTGTTTGAAGGTGAGCATCATTGGAAAAGGGACGCCTGTTTCTTTTTGCGTACTGATGAGGCCTGCACTTCCTTCCAGCATCAATTTTATTGAAGTAGTTTGAATTTAAGCCTCCTCCTTTAAGCCACTGTAACCTTCAGTCCTATAGTAGAAAATGAATTTCATGTTGTTCCTTTTACTTTTGCTTTTCTCTAGACCTTTAGGAAAAATTTGAAAAAGAAAAAGCTTGAATGATATGATTCCTTCTTTAATGACGCGATTTTGATGCCTTGAATCACTGGTCACCACGTACACTTGAACTTAAAAAGAGGTGGTAATTTTTTTTATGTTAGGCGGGAGACACAAGGAATTTTAAATTCAAACATGCATTCAAAACCGACATAAAGGTAGTTTGAAGGTAATATTTGTGAGTCAAGTCAAAATGGTTTACCAAAAAAATTCACAAGTTTTGAGAAGTAGCATTTAAAATAATAATAAAATGACAAGAAGGAAGACGACTTGATCTAAAGGGCACCCTTTATAGCTGCACGGTGTCACAGTCATGCTTGTCCTAGGCGTGTTGCCCATGGCCGCATGCCCATTGCATGCTAGAACCCTTGTGTTGTATCACTTCACTGCTTTCATGATGTATAGTTTCTTTATTGTATTTTCTAGGTGTATGATGTTTCAACAGAATCATGTATATGCTTGATAGACCTGAAATACCTCAAAATGTAGCATAGGAAGATGTGACTAGTTGTCAAATCTTCGTACTCGGAGTTATATGCTATCTAAGCCATTGTGTTACCGTTTTCGCTTATAGCCTATAACATTGCTTTATTCATTTCTCTCCATCTTGTTTTCAAATTCTCTTTCTAAAATCTCTCTCTGGTTCTGTTTTCTAAAACCTTCTCCTTAAAGCAAGGCCAGAGGCTTGAGTATACGTCGTCAGGAAGAAGACAACATGATTTCGAGTTGGCGGACTCACTCGGTATTGAGAGTGAGTGCGCACAATTGCTTAAAAAACGTCTTGTAAGACTGCGATTGTGACAGTTGGTATCAGAGTCGAGTTGGCTCCAAAATTGATTCGGTAAACATGGCTACAACCAAGCAGTTGAACAAGTCCCACGTCGATTGACTAGTCGACATCGAAGAACAGATGCAAGAGCTCGCCGACAAGACTAATATGGTTGATGTAGTTGCAGGTTGATTAGATGAGTTACCCATCCAAGAACTTATGTACCGAGTAGACAACCTAAAAGAAAAAGCCACAAAGACTGGTGGCTTATCGCTGCCCTAATTGAAGAGTGAGTCGATGGTCTAGACATATCCTAAAAGGAGTTATGCAGATGGTCTCTGAGATATTTGAAGATGTGAAATTAGCCTTCGACGTGGTCAGGGCAGAAATTGTTAGGAATCACGGCTCTCCACAATGGTATGTTATTGTCCACTTTGAGCATAAGCTTTCATAGTTTTGTTTTGGGCTTTCCCAAAAGGCCTCATTCCAACGGAGACCTATTCCTTGCTTATAAACCCAGGATCATTCCCTAAATTAGCCAATGTGGGACAACCTCCCAACAATCCTCAACAATCCTCCCCTCGAACAAAATACAACATAGAGTCTCCCTTGAGGCCTATGGAGTCCTCGAATAGCCTCCCCTTAATTGAGGCTCGACTCCTCTGGAGTCCTCAAACAAAGTACACCCTTTGTTCGACTCTTTAGTCATTTTTTACTACATCTTCGAGGCTCACATAAGTTTAGGGCACGACTCTGATACCATGTTAGGAATCACGACTCTCCACAATGGTGATATTGTCCACTTTGAGCATAAGCTCTCATGATTTTGCTTTGGGATTCCCCAAAAGGCCTCATTCCAATGGAGATCTATTCTTTGCTTATAAACCCATGATCATTCCCTAAATTAGCCAATGTAGGACAAACTCCCAACAATACTCAACATAAATGGCAGATCTAAGCACTAGAGTTAATCTCATCGTTAGAGCAGTGTGAAATCAAACCCCTACAGGGGGAGAAGTTCAGTTCAACAAGATCAAGGTTCCGGAGCCCAAACTCTTCTATGGGGTTCGAGCTGCCGAGGTTCTAGCGACCTTGATCAATACTTTCAGGCGATGAACACAGCAACAGATGAAGCAAAGGTCACCTTGGCCACCATACATCAAGCCGAGGATGCAAAATTTTGATGGACATCGTACATTGATATTCAAGAGGGCCGGTGCACAAATGGAAAAGACTGAAACAAGAACTTCAATCTCAATTCTTCCCAGAGAAGGTTGAAATCTTGGCTAGAAGGAAAGTGGGGGATCTCAAGCACACAAGAAACATCTGAGAGTATGTCAAACAGTTCTCAGTGTTCATGTTGGACATCTGAGATATGTTCGAGAAAGACAAGATCTTCCACTTCATAGAAGGACTAAAGCCATGAGCGAAGTCCAAACTATATGAGCAGAGAATACAAGACCTCTCCACGACCTATGGTACAGCTGAACGATTGTTTGATCTAAGTAACGAACAATCCCAAGATGTAAGGCGAAGCCAAACCTCCTCGAGTGGATTGTTTGATATGAGCTGAACGATTGTTTGATTGGAGCTAAACAATTGTTTGATCTGAATCTACCATTTTACTCTTGGCCAATTTATGATTGACCCATGGTTCTACATACATGAGACCTTTTTCGATCGACTCTTTTGACTCCCCGACTTTCTTTTGTAGGGCAGATAGAAACTTTAGGGCCCCATACAAGGGTTTGCTCTATTCTCCGACGACGACTTTTCGGTTTTCGCACCTTCTGACTTTGCCTCGTATACTCTATGTCCAAAGCTAGAAACTTTAGGGCAGATAGAAACTTTAGGGCCCCATACAAGTCTGTGTCCAAAGCTGCTTGGAAGGTTTGTAGTGTCGCTTTCTTGGGGCACTCGTATACTCTATGGTTCTCTCTACATAAGAAGCAGGAGGGAGGTAGCCGATTGAAGTAATTTTGATGATAGGGCCCTCGGCCTAAGACTAGTTTCAGGAAAATCACCCTAGGTTGGCCTAGTGGTCATCAAGGGATATGTAAACAATAAAAGACAAAATAAACTAGCTCAAACCATAGTAGCCGCTTACACAAGATTTAATATCCTACGAGTACCTTGACAACCAAATGTAATTAGTAATAATCCTACAGGTACCTTGGTTGTGAGAATACTCAAAGTGCTTGTAAGCTTCCCATACACTAATGATATTAGAGATAACAGGAGTTCTAGGGAATCCGTGTTATTGAGAGGACTCTAAGTGCACAATCCTAAAATACATCCTTAGCCATCTTTGGAAGAATCTTATACCTCGTAAAATTTACCTTCCACAATCCTAAGATGTCAAACCTGTGTGCCCCGGTATTCCTCAACATCCCCTGTGTCATGGTCAAGCTATTTTTCTCTTCGCTTCTAAGAGTGAAGACTATCCCCAATTTTTCTCTTCGCTTCTAAGAGTGAAGACTATCCCCATTGACTAACATAAGTCCTTCTAGCATGCTTTGTTCTCACTCACTTGTAATTGAAAGAAAATTTTCAGGAAGTCACCCAACATAGAATTGCTCCAAGCAAAACACGTTTAATTGGAGTTCATATGATTGAGCCACCGAAAAGGAACGTACACCTTATTGGTTTAGGTAGCGACTTTCAATTCTTTTAATTCTTCATTAGTTATCCTATCCTTAGGATTGCTCTCATTCATTATCCTAATCTTCACTCCGAAAGATTTTCATTCTCTGTTTCCATCGCTTCTATCCTTTCAATTCTTTTACCAAGTTTTTAAAATTATTCCATGATGTTACTTTGCAAGTGTCTGCTGTGTAACATCAAACCTACATTTTCAATGAAGTTTGTTCTAATTCAACAACACAATCAAGATTATTCAACATGTGGTAATCCTTTGTTGTGTGTCATGGCAAGAAGGATAGAAGCGATGTTTGGTAAATATTGGAAAGATTTTCTCGTGTGTTGCTGTTATGCTAGAATCTCATTATACGCTTGAATTCGTGATGTTTATGCTTACAGATTTATATGATGAAGAGATTGCAAACCAAATTAGGAAGAAAAAAAAGTTAAAAGATTAGTTTTCTCGACACTTAAATATTATATTAGTTCTTGCTGTCCTCCTTCGACAGCAAGAACAATAGAACAGCTCCTTCTGGGCCATCTTTTCTAAGAATAATGCAAAGATTTTGTCGAAATGTTCACTCAAAGGTGTTTTCTGGTAAACGTGGCTTGAAAGGAACCCCGAAAATTTTAAAAATTCAAGTAGAAAAACTTCAACCAAAGTTGGAAATGCATGTTAGAACGTTTTCTTGAAGTTGAACTTCTGTTCTTATGTTCTTGCCTATTTTGGGTTGATTGCTTATCTTCAGCTTTTTCACCATCTATTAGGCTGTGTTCTAGGATCCTTCCTGTGATTGAGATAATTGATGTCAGGAGTTTGGAGGGCCGTCAGAATTGTTGATACTGAGTTGTTCGGAGGATGTAATATTTTTCATCTAATGATCTTAAAACCAACGATTGTTCTGTTGTGACTTTCTTTCAGATCGGATAATCTTGAATCCTGCGCCACTTGCTTTAAATTTATCACATTTTTTCACCATCTTTTGTTCATCATGTGACTTTCAAGAGCAACCAATTAACCTTCAAAACTACACTAACCTGTTGGCATTCTTATGATTAGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGAACTCGAAAATATGTCATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTACAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGACGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCGGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTGGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCCTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTCGATCAACCCCTCGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

mRNA sequence

TAACGTGTTTTTTTAAAGTCAAAATATGGAAAACCCTAGGTCATAGGCGCAAGAAGAGTTGTGGACTTCAGACATTTCTTAAAAATGCGTGCTGGACACACTGAATCTTTGGTTTCAAGTCCAGGTCACCATCTATCCACATTCCTACCAAACTAAGATCCGACATCCCAACTTCTCCTTTTCCGTTCCTCATTCTGCAAATCTGAAAGCGACCCGACCCCATCGGAGGACCTCATCAACAACCCATCTTCAATCATCGTCGCCGGATTCTACTCAATTACTGCACTGCATTTCCGATTTTCCTTTTACTTCAAACCATCTCTTCTCCCCAATCCTATGTCTCCCAACTTCGCTTCATTTTTCAGATAAAAAGGACCAGAAAATGAATATGAGGAACAGATACAGGAAATCAAACGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCTGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTCGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCGCGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATCGATGAATTTCTTGATGAAGATTCACAACTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTAGATACTGAGGGGAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGAACTCGAAAATATGTCATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTACAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGACGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCGGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTGGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCCTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTCGATCAACCCCTCGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

Coding sequence (CDS)

ATGAATATGAGGAACAGATACAGGAAATCAAACGCTTTACGTTGCGATGCCGGGAGCAGATGTTTGATATCTGTGGTAATAGGGAGTCTAATGGGGTGTATTCTTTTGCTACATTTATGTTCTCCTGTAAGCCGTAAGGATGAGATAGGTCGGGGTATCCAACTTCGAACAAGTCGTCACCTTCACTTTCGTGAACTTGAAGAGGTGGAAGAAGAAAACATTCAAATTCCCCCTCCTCGTAAGAGATCTCCGCGTGCAGCAAAGCGAAGACCAAAGAAAACGCCCACACTGATCGATGAATTTCTTGATGAAGATTCACAACTTAGGCACAAATTCTTTCCTGATCATAAAACTTCTGTTGATCCAATGATCCCGGGAGACGATAGCATGTTCTATTATCCGGGGAGAGTTTGGCTAGATACTGAGGGGAATCCTATTCAAGCTCACGGAGGTGGAGTTTTATTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTATAGGAGTCGGTTGCTACTCTTCCAAAGACTTATGGACCTGGAGAAATGAAGGCATTGTTTTGACAGCGGAAGAAACCAACGAGACTCATGATCTTCACAAATCCAACGTGCTCGAGAGGCCGAAAGTAATCTACAACTCAAGAACTCGAAAATATGTCATGTGGATGCATATCGATGATGCGAACTATACGAAGGCTTCTGTTGGTGTTGCCGTCAGTGATTACCCAACCGGTCCGTTCGATTATCTTTACAGCAAAAGACCACATGGATTTGATAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTATCTCGTTTACTCATCTGAAGACAATAGTGAGCTCCATATAGGACCTCTTTCAGAAGATTATCTCGACGTGACCAATGTAGCGAAAAGGATTCTCGTCGGCCAGCACCGGGAAGCACCGGCTTTGTTTAAACACCAGGGAACTTACTATATGATCACATCGGGTTGCACGGGATGGGCACCAAACGAGGCACTGGCACACGCATCAGAGTCGATAATGGGTCCATGGGAGACGTTGGGAAACCCTTGTATAGGTGGAAACAAGTTGTTTCGACTGGCTACCTTCTTCTCTCAGAGCACATTTGTTCTTCCCCTACCTTCACACCCCGGCTTGTTTATTTTCATGGCAGACCGATGGAACCCTGCCGACCTTAGAGACTCGAGGTACATTTGGTTGCCGTTGATGGTTGGAGGACTTGTCGATCAACCCCTCGACTACAATTTTGGGTTCCCTTTGTGGTCAAGAGTGTCGATATATTGGCATAGGAAGTGGAGGCTTCCTCAAGGCTGGAATCCGTTGAAATGA

Protein sequence

MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRHLHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSVDPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK
BLAST of Cp4.1LG01g17280 vs. TrEMBL
Match: A0A0A0LPY3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G285310 PE=3 SV=1)

HSP 1 Score: 919.1 bits (2374), Expect = 2.2e-264
Identity = 424/466 (90.99%), Postives = 445/466 (95.49%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRNRYRKS ALRCDAGSRCLISVVIGSLMGCILLL+L S +S  DEIG+GI LRTS H
Sbjct: 13  MKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADEIGQGIHLRTSHH 72

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQLRHKFFPD K S+
Sbjct: 73  LHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQLRHKFFPDKKASI 132

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG
Sbjct: 133 DPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 192

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPKVIYNSRT KYVM
Sbjct: 193 AARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPKVIYNSRTGKYVM 252

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHIDD NYTKASVGVA+SDYPTGPFDYLYSK+PHGFDSRDMTIFKDDDGTAYL+YSSED
Sbjct: 253 WMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKDDDGTAYLIYSSED 312

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELH+G LS+DYLDVTNVA+R+L+GQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HA
Sbjct: 313 NSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSGCTGWAPNEALTHA 372

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDSR 420
           +ESIMGPWET+GNPCIGGNK+FRLATFFSQSTFVLPLPS+P LFIFMADRWNPADLRDSR
Sbjct: 373 AESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFMADRWNPADLRDSR 432

Query: 421 YIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           Y+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 433 YVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 478

BLAST of Cp4.1LG01g17280 vs. TrEMBL
Match: A0A0A0LJS1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G285330 PE=3 SV=1)

HSP 1 Score: 870.2 bits (2247), Expect = 1.2e-249
Identity = 400/467 (85.65%), Postives = 434/467 (92.93%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRN++RKS  LRCD+ S+CLISVVIGSLM CILLL L SP SRK+E+G+GIQ+RTS H
Sbjct: 38  MEMRNKFRKSTTLRCDSQSKCLISVVIGSLMVCILLLSLLSPTSRKNEMGQGIQIRTSHH 97

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LH REL+EVEEENIQIPPP KR  RA KRRPK+   LIDEFLDEDSQLR KFFPDHKT +
Sbjct: 98  LHLRELQEVEEENIQIPPPHKRPRRAPKRRPKRMTPLIDEFLDEDSQLRRKFFPDHKTFI 157

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGV+FDERSETYYWYGEYKDGPTYHAH+KG
Sbjct: 158 DPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVIFDERSETYYWYGEYKDGPTYHAHEKG 217

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIG+GCYSSKDLW+W+NEGIVL AEET+ETHDLHKSNVLERPKVIYNSRT KYVM
Sbjct: 218 AARVDIIGIGCYSSKDLWSWKNEGIVLAAEETDETHDLHKSNVLERPKVIYNSRTGKYVM 277

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHID+ NYTKASVGVA+SDYP GPF YL SKRPHGFDSRDMTIFKDD+GTAYL+YSS+ 
Sbjct: 278 WMHIDNVNYTKASVGVAISDYPNGPFHYLQSKRPHGFDSRDMTIFKDDNGTAYLIYSSQG 337

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELH+GPLSEDYLDVTNVA+R+L+GQHREAPALFKH+GTYYMITSGCTGWAPNEALAHA
Sbjct: 338 NSELHVGPLSEDYLDVTNVARRVLIGQHREAPALFKHKGTYYMITSGCTGWAPNEALAHA 397

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLP-SHPGLFIFMADRWNPADLRDS 420
           +ESIMGPWET+GNPCIG NK+FRLATF SQSTFV+PLP S+P LFIFMADRWNPADLRDS
Sbjct: 398 AESIMGPWETIGNPCIGENKMFRLATFLSQSTFVIPLPSSYPNLFIFMADRWNPADLRDS 457

Query: 421 RYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           RY+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 458 RYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 504

BLAST of Cp4.1LG01g17280 vs. TrEMBL
Match: A0A0D2T2K4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G083400 PE=3 SV=1)

HSP 1 Score: 798.9 bits (2062), Expect = 3.3e-228
Identity = 363/468 (77.56%), Postives = 417/468 (89.10%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M +RN+YRKS A  C+ GSRC IS+V+ SL+G +L+L + S +S ++ +   I+LR SRH
Sbjct: 1   MRVRNKYRKSTAFPCNVGSRCSISIVVWSLVGFLLMLQIYSLISHRNTVSGDIKLRMSRH 60

Query: 61  LHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTS 120
              RELE+VEEENIQIPPPR KRSPRAAKRRPK+T TL+DEFLDE+SQ+RH FFPD KT+
Sbjct: 61  PLVRELEQVEEENIQIPPPRGKRSPRAAKRRPKRTTTLVDEFLDENSQIRHVFFPDMKTA 120

Query: 121 VDPMIP-GDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHK 180
           +DP+   G+DS +YYPGR+WLDTEGNPIQAHGGG+++DERS TYYWYGEYKDGPTYHAHK
Sbjct: 121 IDPLKDAGNDSFYYYPGRIWLDTEGNPIQAHGGGMIYDERSSTYYWYGEYKDGPTYHAHK 180

Query: 181 KGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKY 240
           KGAARVDIIGVGCYSSKDLWTW+NEGIVLTAEE+NETHDLHKSNVLERPKVIYN  T KY
Sbjct: 181 KGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEESNETHDLHKSNVLERPKVIYNENTGKY 240

Query: 241 VMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSS 300
           VMWMHIDDANYTKA+VG+AVSDYPTGPFDYL S+RPHG++SRDMT+FKD+DG AYL+YSS
Sbjct: 241 VMWMHIDDANYTKAAVGIAVSDYPTGPFDYLGSQRPHGYESRDMTVFKDEDGVAYLIYSS 300

Query: 301 EDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALA 360
           EDNSELHIGPL++DYLDV    +RILVGQHREAPALFK++GTYYMITSGCTGWAPNEALA
Sbjct: 301 EDNSELHIGPLTKDYLDVKPDIRRILVGQHREAPALFKYRGTYYMITSGCTGWAPNEALA 360

Query: 361 HASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRD 420
           HA++SIMGPWET+GNPCIGGNK+FRLATFFSQSTFV+PLP  PG +IFMADRWNPADL D
Sbjct: 361 HAADSIMGPWETMGNPCIGGNKMFRLATFFSQSTFVIPLPGIPGSYIFMADRWNPADLSD 420

Query: 421 SRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           SRY+WLPL+VGG  D+P ++NFGFPLW RVSIYWHRKWRLP  W   K
Sbjct: 421 SRYVWLPLIVGGPADRPFEFNFGFPLWPRVSIYWHRKWRLPSSWRVTK 468

BLAST of Cp4.1LG01g17280 vs. TrEMBL
Match: M5W5C1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005306mg PE=3 SV=1)

HSP 1 Score: 798.5 bits (2061), Expect = 4.3e-228
Identity = 358/467 (76.66%), Postives = 410/467 (87.79%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRN+YRK     C+AGSRC  S V+ SL+GC L+  L S V + D +   +Q R++ H
Sbjct: 1   MRMRNKYRKPTTFHCNAGSRCSTSAVVWSLVGCFLMFQLYSLVHQNDRMRGEMQFRSTHH 60

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
               ELEEVEEENIQIPPPRKRSPRAAKR+P++  TLIDEFLDE+SQ+RH FFP  K  +
Sbjct: 61  PQIHELEEVEEENIQIPPPRKRSPRAAKRKPRRPTTLIDEFLDENSQIRHVFFPGQKHVI 120

Query: 121 DPMIP-GDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKK 180
           DPM   G+DS +YYPGR+WLDT+GNPIQAHGGG+L+D++  TYYWYGEYKDGPTYHAHKK
Sbjct: 121 DPMKDTGNDSYYYYPGRIWLDTDGNPIQAHGGGILYDDKLRTYYWYGEYKDGPTYHAHKK 180

Query: 181 GAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYV 240
           GAARVDIIGVGCYSS+DLW W+NEGIVL AE+TNETHDLH+ NVLERPKVIYN RT KYV
Sbjct: 181 GAARVDIIGVGCYSSRDLWKWKNEGIVLAAEKTNETHDLHELNVLERPKVIYNERTGKYV 240

Query: 241 MWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSE 300
           MWMHIDD NYTKA+VG+A+SDYPTGPFDYLYSKRPHGF+SRDMTIFKDDDG AYL+YSSE
Sbjct: 241 MWMHIDDVNYTKAAVGIAISDYPTGPFDYLYSKRPHGFESRDMTIFKDDDGVAYLIYSSE 300

Query: 301 DNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAH 360
           DNSELHIGPL+EDYLDVTN+ +R+LVGQHREAPALFK++GTYYMITSGCTGWAPNEALAH
Sbjct: 301 DNSELHIGPLTEDYLDVTNIMRRVLVGQHREAPALFKYEGTYYMITSGCTGWAPNEALAH 360

Query: 361 ASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDS 420
           A+ESIMGPWET+GNPC GGNK+ RL TFF+QSTFV+P+P+ PG FIF+ADRWNPADLRDS
Sbjct: 361 AAESIMGPWETMGNPCAGGNKVSRLTTFFAQSTFVVPVPAFPGSFIFIADRWNPADLRDS 420

Query: 421 RYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           RY+WLPL+VGG  D+PLDYNFGFPLWSRVSIYWHRKWRLP+GW+  K
Sbjct: 421 RYVWLPLIVGGPADRPLDYNFGFPLWSRVSIYWHRKWRLPRGWSSSK 467

BLAST of Cp4.1LG01g17280 vs. TrEMBL
Match: A0A061DHP7_THECC (Glycosyl hydrolase family protein 43 isoform 2 OS=Theobroma cacao GN=TCM_000802 PE=3 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 7.4e-228
Identity = 367/464 (79.09%), Postives = 414/464 (89.22%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M +RN+YRK  A  C+AGSRC +S V+ SL+G +L+LHL S VS ++ +G  IQLR SRH
Sbjct: 1   MRVRNKYRKPTAFPCNAGSRCSMSAVVWSLVGFVLMLHLYSLVSHRNPVGGDIQLRMSRH 60

Query: 61  LHFRELEEVEEENIQIPPPR-KRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTS 120
              RELE+VEEENIQIPPPR KRSPRAAKRRPK+T TLIDEFLDE+SQLRH FFPD KT+
Sbjct: 61  PLVRELEQVEEENIQIPPPRGKRSPRAAKRRPKRTTTLIDEFLDENSQLRHVFFPDMKTA 120

Query: 121 VDPMIPG-DDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHK 180
           +DP     +DS +Y+PGR+WLDTEGNPIQAHGGG+L+DERS TYYWYGEYKDGPTYHAHK
Sbjct: 121 IDPTKDARNDSYYYHPGRIWLDTEGNPIQAHGGGILYDERSSTYYWYGEYKDGPTYHAHK 180

Query: 181 KGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKY 240
           KGAARVD+IGVGCYSSKDLWTW+NEGIVL AEET+ETHDLHKSNVLERPKVIYN    KY
Sbjct: 181 KGAARVDVIGVGCYSSKDLWTWKNEGIVLAAEETDETHDLHKSNVLERPKVIYNDNMGKY 240

Query: 241 VMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSS 300
           VMWMHIDDANYTKA+VG+A SDYPTGPF+YL S+RPHG++SRDMTIFKDDDG AYL+YSS
Sbjct: 241 VMWMHIDDANYTKAAVGIASSDYPTGPFEYLRSQRPHGYESRDMTIFKDDDGVAYLIYSS 300

Query: 301 EDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALA 360
           EDNSELHIGPL+EDYLDV    +RILVGQHREAPALFK+QGTYYMITSGCTGWAPNEALA
Sbjct: 301 EDNSELHIGPLTEDYLDVKPDMRRILVGQHREAPALFKYQGTYYMITSGCTGWAPNEALA 360

Query: 361 HASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRD 420
           HA+ESIMGPWET+GNPCIGGNK+FRLATFF+QSTFV+PLP  PG +IFMADRWNPADL+D
Sbjct: 361 HAAESIMGPWETMGNPCIGGNKMFRLATFFAQSTFVIPLPGIPGSYIFMADRWNPADLKD 420

Query: 421 SRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGW 463
           SRY+WLPL+VGG  D+PL++NFGFPLW RVSIYWHRKWRLP  W
Sbjct: 421 SRYVWLPLIVGGPADRPLEFNFGFPLWPRVSIYWHRKWRLPLRW 464

BLAST of Cp4.1LG01g17280 vs. TAIR10
Match: AT3G49880.1 (AT3G49880.1 glycosyl hydrolase family protein 43)

HSP 1 Score: 718.0 bits (1852), Expect = 3.8e-207
Identity = 332/461 (72.02%), Postives = 386/461 (83.73%), Query Frame = 1

Query: 3   MRNRY-RKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTS-RH 62
           M+N++ +K+  LRC      L+S V+G    C+ ++HL    SR   +   +  +    H
Sbjct: 4   MKNKHNKKATFLRCSPFG--LVSTVVG----CVFMIHLTMLYSRSYSVDLDLSPQLLIHH 63

Query: 63  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 122
              RELE VEEENI +PPPRKRSPRA KR+PK   TL++EFLDE+SQ+RH FFPD K++ 
Sbjct: 64  PIVRELERVEEENIHMPPPRKRSPRAIKRKPKTPTTLVEEFLDENSQIRHLFFPDMKSAF 123

Query: 123 DPMIP--GDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHK 182
            P      D S +Y+PGR+W DTEGNPIQAHGGG+LFD+ S+ YYWYGEYKDGPTY +HK
Sbjct: 124 GPTKEDTNDTSHYYFPGRIWTDTEGNPIQAHGGGILFDDISKVYYWYGEYKDGPTYLSHK 183

Query: 183 KGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKY 242
           KGAARVDIIGVGCYSSKDLWTW+NEG+VL AEET+ETHDLHKSNVLERPKVIYNS T KY
Sbjct: 184 KGAARVDIIGVGCYSSKDLWTWKNEGVVLAAEETDETHDLHKSNVLERPKVIYNSDTGKY 243

Query: 243 VMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSS 302
           VMWMHIDDANYTKASVGVA+SD PTGPFDYLYS+ PHGFDSRDMT++KDDD  AYL+YSS
Sbjct: 244 VMWMHIDDANYTKASVGVAISDNPTGPFDYLYSRSPHGFDSRDMTVYKDDDNVAYLIYSS 303

Query: 303 EDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALA 362
           EDNS LHIGPL+E+YLDV  V KRI+VGQHREAPA+FKHQ TYYMITSGCTGWAPNEALA
Sbjct: 304 EDNSVLHIGPLTENYLDVKPVMKRIMVGQHREAPAIFKHQNTYYMITSGCTGWAPNEALA 363

Query: 363 HASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRD 422
           HA+ESIMGPWETLGNPC+GGN +FR  TFF+QSTFV+PLP  PG+FIFMADRWNPADLRD
Sbjct: 364 HAAESIMGPWETLGNPCVGGNSIFRSTTFFAQSTFVIPLPGVPGVFIFMADRWNPADLRD 423

Query: 423 SRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 460
           SRY+WLPL+VGG  D+PL+Y+FGFP+WSRVS+YWHR+WRLP
Sbjct: 424 SRYLWLPLIVGGPADRPLEYSFGFPMWSRVSVYWHRQWRLP 458

BLAST of Cp4.1LG01g17280 vs. TAIR10
Match: AT5G67540.2 (AT5G67540.2 Arabinanase/levansucrase/invertase)

HSP 1 Score: 706.4 bits (1822), Expect = 1.1e-203
Identity = 330/471 (70.06%), Postives = 388/471 (82.38%), Query Frame = 1

Query: 1   MNMRNRY-RKSNALRCDAGSRCLISV--VIGSLMGCILLLHLCSPVSRKDE-IGRGI--- 60
           M   N+Y +KS +L C+    C  S+  ++ +++G  L+ HL S  SRKD  I + +   
Sbjct: 1   MKKNNKYNKKSTSLHCNDAGGCRYSLLTIVWTVVGFFLVAHLISLYSRKDNNIHQQVSSD 60

Query: 61  QLRTSRHLHF---RELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRH 120
           QL+   HL     REL  VEEE +++PPPRKRSPR +KRR +K   L++EFLD+ S +RH
Sbjct: 61  QLQVVHHLAHPIVRELIRVEEEVLRMPPPRKRSPRTSKRRSRKPIPLVEEFLDDKSPIRH 120

Query: 121 KFFPDHKTSV--DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEY 180
            FFP  KT+        G+++ +Y+PG++W+DT+GNPIQAHGGG+L D +S TYYWYGEY
Sbjct: 121 LFFPGIKTAAFGPTKDMGNETSYYFPGKIWMDTQGNPIQAHGGGILLDVKSNTYYWYGEY 180

Query: 181 KDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPK 240
           KDGPTYHAHKKG ARVDIIGVGCYSSKDLWTW+NEGIVL AEETN+THDLHKSNVLERPK
Sbjct: 181 KDGPTYHAHKKGPARVDIIGVGCYSSKDLWTWKNEGIVLGAEETNKTHDLHKSNVLERPK 240

Query: 241 VIYNSRTRKYVMWMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDD 300
           VIYN +T KYVMWMHIDDANYTKASVGVA+S+ PTGPF+YLYSKRPHGFDSRDMT+FKDD
Sbjct: 241 VIYNEKTEKYVMWMHIDDANYTKASVGVAISNSPTGPFEYLYSKRPHGFDSRDMTVFKDD 300

Query: 301 DGTAYLVYSSEDNSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGC 360
           DG AYL+YSSE NS LHIGPL+EDYLDVT V KR++VGQHREAPA+FKHQ  YYM+TS C
Sbjct: 301 DGVAYLIYSSEVNSVLHIGPLTEDYLDVTPVMKRVMVGQHREAPAIFKHQNIYYMVTSWC 360

Query: 361 TGWAPNEALAHASESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMA 420
           TGWAPNEALAHA+ESIMGPWE LGNPCIGGNK+FRL TFF+QST+V+PLP  PG FIFMA
Sbjct: 361 TGWAPNEALAHAAESIMGPWEKLGNPCIGGNKVFRLTTFFAQSTYVIPLPGVPGAFIFMA 420

Query: 421 DRWNPADLRDSRYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLP 460
           DRWNPADLRDSRY+WLPL++GG  DQPL++NFGFP WSRVSIYWH KWRLP
Sbjct: 421 DRWNPADLRDSRYVWLPLVIGGPADQPLEFNFGFPSWSRVSIYWHSKWRLP 471

BLAST of Cp4.1LG01g17280 vs. NCBI nr
Match: gi|778669992|ref|XP_004148025.2| (PREDICTED: uncharacterized protein LOC101203100 [Cucumis sativus])

HSP 1 Score: 919.1 bits (2374), Expect = 3.1e-264
Identity = 424/466 (90.99%), Postives = 445/466 (95.49%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRNRYRKS ALRCDAGSRCLISVVIGSLMGCILLL+L S +S  DEIG+GI LRTS H
Sbjct: 13  MKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAISPADEIGQGIHLRTSHH 72

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQLRHKFFPD K S+
Sbjct: 73  LHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQLRHKFFPDKKASI 132

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG
Sbjct: 133 DPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 192

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPKVIYNSRT KYVM
Sbjct: 193 AARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPKVIYNSRTGKYVM 252

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHIDD NYTKASVGVA+SDYPTGPFDYLYSK+PHGFDSRDMTIFKDDDGTAYL+YSSED
Sbjct: 253 WMHIDDVNYTKASVGVAISDYPTGPFDYLYSKKPHGFDSRDMTIFKDDDGTAYLIYSSED 312

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELH+G LS+DYLDVTNVA+R+L+GQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HA
Sbjct: 313 NSELHVGSLSKDYLDVTNVARRVLIGQHREAPALFKHQGTYYMVTSGCTGWAPNEALTHA 372

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDSR 420
           +ESIMGPWET+GNPCIGGNK+FRLATFFSQSTFVLPLPS+P LFIFMADRWNPADLRDSR
Sbjct: 373 AESIMGPWETMGNPCIGGNKMFRLATFFSQSTFVLPLPSYPNLFIFMADRWNPADLRDSR 432

Query: 421 YIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           Y+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 433 YVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 478

BLAST of Cp4.1LG01g17280 vs. NCBI nr
Match: gi|659115999|ref|XP_008457848.1| (PREDICTED: uncharacterized protein LOC103497430 [Cucumis melo])

HSP 1 Score: 914.8 bits (2363), Expect = 5.9e-263
Identity = 422/466 (90.56%), Postives = 444/466 (95.28%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRNRYRKS ALRCDAGSRCLISVVIGSLMGCILLL+L S +   DEIG+ I LRTS H
Sbjct: 13  MKMRNRYRKSTALRCDAGSRCLISVVIGSLMGCILLLNLYSAIRPADEIGQHIHLRTSHH 72

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LHF ELEEVEEENIQIPPPRKRSPRA KRRPKKT TLIDEFLDEDSQ+RHKFFPD KTS+
Sbjct: 73  LHFPELEEVEEENIQIPPPRKRSPRATKRRPKKTTTLIDEFLDEDSQIRHKFFPDQKTSI 132

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGVLFDERS TYYWYGEYKDGPTYHAHKKG
Sbjct: 133 DPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSGTYYWYGEYKDGPTYHAHKKG 192

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIGVGCYSSKDLWTW+NEGIVLTAEET+ETHDLHKSNVLERPKVIYNSRTRKYVM
Sbjct: 193 AARVDIIGVGCYSSKDLWTWKNEGIVLTAEETDETHDLHKSNVLERPKVIYNSRTRKYVM 252

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHIDD NYTKASVGVA+SDYPTGPFDYLYSKRPHG DSRDMTIFKDDDGTAYL+YSSED
Sbjct: 253 WMHIDDVNYTKASVGVAISDYPTGPFDYLYSKRPHGCDSRDMTIFKDDDGTAYLIYSSED 312

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELH+G LSEDYLDVTNVA+RIL+GQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HA
Sbjct: 313 NSELHVGSLSEDYLDVTNVARRILIGQHREAPALFKHQGTYYMVTSGCTGWAPNEALTHA 372

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLPSHPGLFIFMADRWNPADLRDSR 420
           +ESIMGPWET+GNPC+GGNK+FRLATFFSQSTFVLP+PS+P LFIFMADRWNPADLRDSR
Sbjct: 373 AESIMGPWETMGNPCMGGNKMFRLATFFSQSTFVLPVPSYPNLFIFMADRWNPADLRDSR 432

Query: 421 YIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           Y+WLPLMVGGLVD+PLDYNFGFPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 433 YLWLPLMVGGLVDEPLDYNFGFPLWSRVSIYWHRKWRLPQGWNSLK 478

BLAST of Cp4.1LG01g17280 vs. NCBI nr
Match: gi|778670000|ref|XP_004148027.2| (PREDICTED: uncharacterized protein LOC101203585 isoform X2 [Cucumis sativus])

HSP 1 Score: 870.2 bits (2247), Expect = 1.7e-249
Identity = 400/467 (85.65%), Postives = 434/467 (92.93%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRN++RKS  LRCD+ S+CLISVVIGSLM CILLL L SP SRK+E+G+GIQ+RTS H
Sbjct: 1   MEMRNKFRKSTTLRCDSQSKCLISVVIGSLMVCILLLSLLSPTSRKNEMGQGIQIRTSHH 60

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LH REL+EVEEENIQIPPP KR  RA KRRPK+   LIDEFLDEDSQLR KFFPDHKT +
Sbjct: 61  LHLRELQEVEEENIQIPPPHKRPRRAPKRRPKRMTPLIDEFLDEDSQLRRKFFPDHKTFI 120

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGV+FDERSETYYWYGEYKDGPTYHAH+KG
Sbjct: 121 DPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVIFDERSETYYWYGEYKDGPTYHAHEKG 180

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIG+GCYSSKDLW+W+NEGIVL AEET+ETHDLHKSNVLERPKVIYNSRT KYVM
Sbjct: 181 AARVDIIGIGCYSSKDLWSWKNEGIVLAAEETDETHDLHKSNVLERPKVIYNSRTGKYVM 240

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHID+ NYTKASVGVA+SDYP GPF YL SKRPHGFDSRDMTIFKDD+GTAYL+YSS+ 
Sbjct: 241 WMHIDNVNYTKASVGVAISDYPNGPFHYLQSKRPHGFDSRDMTIFKDDNGTAYLIYSSQG 300

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELH+GPLSEDYLDVTNVA+R+L+GQHREAPALFKH+GTYYMITSGCTGWAPNEALAHA
Sbjct: 301 NSELHVGPLSEDYLDVTNVARRVLIGQHREAPALFKHKGTYYMITSGCTGWAPNEALAHA 360

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLP-SHPGLFIFMADRWNPADLRDS 420
           +ESIMGPWET+GNPCIG NK+FRLATF SQSTFV+PLP S+P LFIFMADRWNPADLRDS
Sbjct: 361 AESIMGPWETIGNPCIGENKMFRLATFLSQSTFVIPLPSSYPNLFIFMADRWNPADLRDS 420

Query: 421 RYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           RY+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 RYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 467

BLAST of Cp4.1LG01g17280 vs. NCBI nr
Match: gi|778669997|ref|XP_011649339.1| (PREDICTED: uncharacterized protein LOC101203585 isoform X1 [Cucumis sativus])

HSP 1 Score: 870.2 bits (2247), Expect = 1.7e-249
Identity = 400/467 (85.65%), Postives = 434/467 (92.93%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M MRN++RKS  LRCD+ S+CLISVVIGSLM CILLL L SP SRK+E+G+GIQ+RTS H
Sbjct: 38  MEMRNKFRKSTTLRCDSQSKCLISVVIGSLMVCILLLSLLSPTSRKNEMGQGIQIRTSHH 97

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LH REL+EVEEENIQIPPP KR  RA KRRPK+   LIDEFLDEDSQLR KFFPDHKT +
Sbjct: 98  LHLRELQEVEEENIQIPPPHKRPRRAPKRRPKRMTPLIDEFLDEDSQLRRKFFPDHKTFI 157

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDTEGNPIQAHGGGV+FDERSETYYWYGEYKDGPTYHAH+KG
Sbjct: 158 DPMITGNDSMFYYPGRVWLDTEGNPIQAHGGGVIFDERSETYYWYGEYKDGPTYHAHEKG 217

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIG+GCYSSKDLW+W+NEGIVL AEET+ETHDLHKSNVLERPKVIYNSRT KYVM
Sbjct: 218 AARVDIIGIGCYSSKDLWSWKNEGIVLAAEETDETHDLHKSNVLERPKVIYNSRTGKYVM 277

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHID+ NYTKASVGVA+SDYP GPF YL SKRPHGFDSRDMTIFKDD+GTAYL+YSS+ 
Sbjct: 278 WMHIDNVNYTKASVGVAISDYPNGPFHYLQSKRPHGFDSRDMTIFKDDNGTAYLIYSSQG 337

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELH+GPLSEDYLDVTNVA+R+L+GQHREAPALFKH+GTYYMITSGCTGWAPNEALAHA
Sbjct: 338 NSELHVGPLSEDYLDVTNVARRVLIGQHREAPALFKHKGTYYMITSGCTGWAPNEALAHA 397

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPLP-SHPGLFIFMADRWNPADLRDS 420
           +ESIMGPWET+GNPCIG NK+FRLATF SQSTFV+PLP S+P LFIFMADRWNPADLRDS
Sbjct: 398 AESIMGPWETIGNPCIGENKMFRLATFLSQSTFVIPLPSSYPNLFIFMADRWNPADLRDS 457

Query: 421 RYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           RY+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 458 RYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 504

BLAST of Cp4.1LG01g17280 vs. NCBI nr
Match: gi|659115993|ref|XP_008457845.1| (PREDICTED: uncharacterized protein LOC103497428 [Cucumis melo])

HSP 1 Score: 864.4 bits (2232), Expect = 9.2e-248
Identity = 400/467 (85.65%), Postives = 433/467 (92.72%), Query Frame = 1

Query: 1   MNMRNRYRKSNALRCDAGSRCLISVVIGSLMGCILLLHLCSPVSRKDEIGRGIQLRTSRH 60
           M  RN++RKS  LRCD+ S+CLISVVIGSLM CILLL+L S ++RKDE+G+GIQ+RTS H
Sbjct: 1   METRNKFRKSTTLRCDSQSKCLISVVIGSLMVCILLLNLLSTITRKDEMGQGIQIRTSHH 60

Query: 61  LHFRELEEVEEENIQIPPPRKRSPRAAKRRPKKTPTLIDEFLDEDSQLRHKFFPDHKTSV 120
           LH REL+EVEEENIQIP P KR  R  KRRPK+T  LIDEFLDEDSQLR KFFPDHKTS+
Sbjct: 61  LHLRELQEVEEENIQIPAPHKRPRRVPKRRPKRTTPLIDEFLDEDSQLRQKFFPDHKTSI 120

Query: 121 DPMIPGDDSMFYYPGRVWLDTEGNPIQAHGGGVLFDERSETYYWYGEYKDGPTYHAHKKG 180
           DPMI G+DSMFYYPGRVWLDT GNPIQAHGGGV+FDERS+TYYWYGEYKDGPTYHAH+KG
Sbjct: 121 DPMIMGNDSMFYYPGRVWLDTGGNPIQAHGGGVIFDERSKTYYWYGEYKDGPTYHAHEKG 180

Query: 181 AARVDIIGVGCYSSKDLWTWRNEGIVLTAEETNETHDLHKSNVLERPKVIYNSRTRKYVM 240
           AARVDIIGVGCYSSKDLWTW+NEGIVL AEET+ETHDLHKSNVLERPKVIYNSRT KYVM
Sbjct: 181 AARVDIIGVGCYSSKDLWTWKNEGIVLAAEETDETHDLHKSNVLERPKVIYNSRTGKYVM 240

Query: 241 WMHIDDANYTKASVGVAVSDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLVYSSED 300
           WMHID+ NYTKASVGVA+SDYP GPF YL+SKRPHGFDSRDMTIFKDD+GTAYL+YSSE 
Sbjct: 241 WMHIDNVNYTKASVGVAISDYPNGPFHYLHSKRPHGFDSRDMTIFKDDNGTAYLIYSSEG 300

Query: 301 NSELHIGPLSEDYLDVTNVAKRILVGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360
           NSELHIGPLSEDYL+VTNVA+RIL+GQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA
Sbjct: 301 NSELHIGPLSEDYLNVTNVARRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA 360

Query: 361 SESIMGPWETLGNPCIGGNKLFRLATFFSQSTFVLPL-PSHPGLFIFMADRWNPADLRDS 420
           +ESIMGPWET+GNPCIG NK+FRLATF SQSTFV+PL  S+P LFIFMADRWNPADLRDS
Sbjct: 361 AESIMGPWETIGNPCIGENKMFRLATFLSQSTFVIPLSSSYPNLFIFMADRWNPADLRDS 420

Query: 421 RYIWLPLMVGGLVDQPLDYNFGFPLWSRVSIYWHRKWRLPQGWNPLK 467
           RY+WLPLMVGGLVDQPLDYNF FPLWSRVSIYWHRKWRLPQGWN LK
Sbjct: 421 RYVWLPLMVGGLVDQPLDYNFRFPLWSRVSIYWHRKWRLPQGWNSLK 467

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0LPY3_CUCSA2.2e-26490.99Uncharacterized protein OS=Cucumis sativus GN=Csa_2G285310 PE=3 SV=1[more]
A0A0A0LJS1_CUCSA1.2e-24985.65Uncharacterized protein OS=Cucumis sativus GN=Csa_2G285330 PE=3 SV=1[more]
A0A0D2T2K4_GOSRA3.3e-22877.56Uncharacterized protein OS=Gossypium raimondii GN=B456_008G083400 PE=3 SV=1[more]
M5W5C1_PRUPE4.3e-22876.66Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005306mg PE=3 SV=1[more]
A0A061DHP7_THECC7.4e-22879.09Glycosyl hydrolase family protein 43 isoform 2 OS=Theobroma cacao GN=TCM_000802 ... [more]
Match NameE-valueIdentityDescription
AT3G49880.13.8e-20772.02 glycosyl hydrolase family protein 43[more]
AT5G67540.21.1e-20370.06 Arabinanase/levansucrase/invertase[more]
Match NameE-valueIdentityDescription
gi|778669992|ref|XP_004148025.2|3.1e-26490.99PREDICTED: uncharacterized protein LOC101203100 [Cucumis sativus][more]
gi|659115999|ref|XP_008457848.1|5.9e-26390.56PREDICTED: uncharacterized protein LOC103497430 [Cucumis melo][more]
gi|778670000|ref|XP_004148027.2|1.7e-24985.65PREDICTED: uncharacterized protein LOC101203585 isoform X2 [Cucumis sativus][more]
gi|778669997|ref|XP_011649339.1|1.7e-24985.65PREDICTED: uncharacterized protein LOC101203585 isoform X1 [Cucumis sativus][more]
gi|659115993|ref|XP_008457845.1|9.2e-24885.65PREDICTED: uncharacterized protein LOC103497428 [Cucumis melo][more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0004553hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: INTERPRO
TermDefinition
IPR023296Glyco_hydro_beta-prop_sf
IPR006710Glyco_hydro_43
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g17280.1Cp4.1LG01g17280.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006710Glycoside hydrolase, family 43PANTHERPTHR22925GLYCOSYL HYDROLASE 43 FAMILY MEMBERcoord: 1..462
score: 8.8E
IPR006710Glycoside hydrolase, family 43PFAMPF04616Glyco_hydro_43coord: 187..370
score: 7.7
IPR023296Glycosyl hydrolase, five-bladed beta-propellor domainGENE3DG3DSA:2.115.10.20coord: 155..380
score: 9.3
IPR023296Glycosyl hydrolase, five-bladed beta-propellor domainunknownSSF75005Arabinanase/levansucrase/invertasecoord: 144..427
score: 1.18
NoneNo IPR availablePANTHERPTHR22925:SF32SUBFAMILY NOT NAMEDcoord: 1..462
score: 8.8E
NoneNo IPR availablePROFILEPS51257PROKAR_LIPOPROTEINcoord: 1..33
score:

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None