CmaCh19G008050 (gene) Cucurbita maxima (Rimu)

Name: CmaCh19G008050
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Glycerol kinase
Location: Cma_Chr19 : 7774374 .. 7776449 (-)
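The location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

# Location string as shown in the record above.
LOCATION = "Cma_Chr19 : 7774374 .. 7776449 (-)"

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its components.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

print(parse_location(LOCATION))
```

Under that assumption the feature spans 2076 bases on the minus strand of Cma_Chr19.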
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, exon
ATGGCGTTCGCTTCTTCTAGTAGTGTTATCTGTCAGAACAGAGCCTTGTCGTCCTCCGTCGTTTCTTCTCCGGGACTTTTACACCATCGCTGCTTCTCACGGCTTCAATCGCAGCGTATTCTTCATTGCAATCGTCGTTTGTCTACGAACATCGGGATAAACGCCGCTCCGTCTCTTTCTTCGGCGCCCTCTTCCGTCGTCGCTAAAACTGCTCTATCCGATGCTCATGTTCAAAGTCAGAGTTCCAGTTCTGCTCCTGGTAGTGGTAAGTGTTGTTGGAATAATTTTCTTTTACTGTTTTATTTCGTGTTCATCATTAGTTCTTATTGGAATGACTTGCAACGTCAGTTTGACTGATTTTCAGTGCTTGGTTTCTAATTATCTGAATCCATCCGTGTTTTGTGGTATTTTTTTTTCATTCTTTTAAATCTTTGACACAAAATTTGGCTTGAGTGTTGGATATTTGGAAAACAGACGATTTATTTTGTGTATCCATTCGTTTTAAGGGTGGTCTGATTTTGCCAAAAACGTCTCTGGCGAGTGGGATGGATATGGTGCGGATTTTTCTTCTGGAGGAACGCCAATTGAACTTCCAGAATTCGTTGTCCCCGATGCTTATAGAGAATGGGAGGTTAAGGTTTTCGACTGGCAGACTCAGTGCCCCACTCTTGCGGAACCTGAGAAGCCCTCTTTCATGTACAAGACAATAAAGCTACTTCCTACAGTGGGATGTGAAGCCGATGCTGCAACCCGTTACAGCATTGATGAGAGAAATGCTGGAAATGGAATTGGTTCAAATGATGAAGTGACTGCCTTTGCGTATCAACGTAGTGGATGTTATGTAGTTGTTTGGCCGGTTAAGGTTGGGGGTTCTTATAAGTTAATGGAGTTGGAGCATTGCCTGGTTAGTCCTCAAGATCGTGAATCCCGTGTGAGGGTTGTTCAGGTTGTCCGAGTCGAAGGCACACGGCTAGTGTTGCAGAGTATCAAAGTTTTCTGCGAGCAGTGGTATGGACCATTCAGAAACGGAGAACAGCTTGGTGGATGTGCCATCCGAGACTCATCATTTGCTTCTACAGCTGCCTTGAAAGCTTCTGAGGTTGTTGGTTCATGGCAGGGTCCTGTCTCTGTTGCCCGTTTTGATGGTTCTCAGATTGTAAGTTATATAATCTTCCTTTGCTTTTGATCATCTTTTATAGCACGGTGAAGTGATCCTAAAATCTCAGCTTTTAGTTCGATGCTACCATTTTTTAGTTTTGCATTGCCTAATGCTTTGGTGTTGATCCATAACTTTCGTTTTTCTTCTCCTTAAAATTCTTAGAAGTCAAATTTTAATCTTTCGCTTATAAGTATGAAATCAAAAGATTGCCTGGCATCATCGACCAAATCATTGATACTATGAATTTTCAGAAAATATCAACAAATTTTGTGACTTTTATGCATTGGTGATTTCTCTTGAATGATTGATTACAATTTACAACATTTTTCTAATGATATTTGCCCACACCCCAATATTTTTTAAATTATGATGGCAAGTTTTGTGTTAAGCTGAGGTCTCGAACCTTACTTTTTTGACCGATGGAAAAATGTGTACCGAAGTGTTTGTCTGGTCAATTATCAGTGGGAACTAGCTTATAGTTTGTACAGCATCTATCCATTCACCAAAATTTTGATAGCATATTTGGAGTAATCCTTGAGCAAATTGTTGTTTACTTTTCTAACAGTTTCCCCATTCCATTTCTTTTTCGAGACATTTAGTCTATATGGTGACACTGTAACCATGCTAAAGTGAATTTAATAGGCACAATATAACTCCGTTTCCTGTTTGAATATGTATTGGAAAATTGACTGCAGAATGTTATACAAGAACTTTTGGCTGACAATGTGCAAAAGTCGGTGAGAACTGAATCAGAACTCAAGGTACTTCCCAAGCAACTATGGTGTTCCCTTAAAGAAAGTGAAGACAGTGGGGATACTTGCTGTGAAGTAGGATGGCTTTT
TGATCACGGACATGCAATTACATCAAGATGCATCTTCTCAAGCTCAGCAAAATTGAAGGCAAGTTTTTGCATTTAA

mRNA sequence

ATGGCGTTCGCTTCTTCTAGTAGTGTTATCTGTCAGAACAGAGCCTTGTCGTCCTCCGTCGTTTCTTCTCCGGGACTTTTACACCATCGCTGCTTCTCACGGCTTCAATCGCAGCGTATTCTTCATTGCAATCGTCGTTTGTCTACGAACATCGGGATAAACGCCGCTCCGTCTCTTTCTTCGGCGCCCTCTTCCGTCGTCGCTAAAACTGCTCTATCCGATGCTCATGTTCAAAGTCAGAGTTCCAGTTCTGCTCCTGGTAGTGGGTGGTCTGATTTTGCCAAAAACGTCTCTGGCGAGTGGGATGGATATGGTGCGGATTTTTCTTCTGGAGGAACGCCAATTGAACTTCCAGAATTCGTTGTCCCCGATGCTTATAGAGAATGGGAGGTTAAGGTTTTCGACTGGCAGACTCAGTGCCCCACTCTTGCGGAACCTGAGAAGCCCTCTTTCATGTACAAGACAATAAAGCTACTTCCTACAGTGGGATGTGAAGCCGATGCTGCAACCCGTTACAGCATTGATGAGAGAAATGCTGGAAATGGAATTGGTTCAAATGATGAAGTGACTGCCTTTGCGTATCAACGTAGTGGATGTTATGTAGTTGTTTGGCCGGTTAAGGTTGGGGGTTCTTATAAGTTAATGGAGTTGGAGCATTGCCTGGTTAGTCCTCAAGATCGTGAATCCCGTGTGAGGGTTGTTCAGGTTGTCCGAGTCGAAGGCACACGGCTAGTGTTGCAGAGTATCAAAGTTTTCTGCGAGCAGTGGTATGGACCATTCAGAAACGGAGAACAGCTTGGTGGATGTGCCATCCGAGACTCATCATTTGCTTCTACAGCTGCCTTGAAAGCTTCTGAGGTTGTTGGTTCATGGCAGGGTCCTGTCTCTGTTGCCCGTTTTGATGGTTCTCAGATTAATGTTATACAAGAACTTTTGGCTGACAATGTGCAAAAGTCGGTGAGAACTGAATCAGAACTCAAGGTACTTCCCAAGCAACTATGGTGTTCCCTTAAAGAAAGTGAAGACAGTGGGGATACTTGCTGTGAAGTAGGATGGCTTTTTGATCACGGACATGCAATTACATCAAGATGCATCTTCTCAAGCTCAGCAAAATTGAAGGCAAGTTTTTGCATTTAA

Coding sequence (CDS)

ATGGCGTTCGCTTCTTCTAGTAGTGTTATCTGTCAGAACAGAGCCTTGTCGTCCTCCGTCGTTTCTTCTCCGGGACTTTTACACCATCGCTGCTTCTCACGGCTTCAATCGCAGCGTATTCTTCATTGCAATCGTCGTTTGTCTACGAACATCGGGATAAACGCCGCTCCGTCTCTTTCTTCGGCGCCCTCTTCCGTCGTCGCTAAAACTGCTCTATCCGATGCTCATGTTCAAAGTCAGAGTTCCAGTTCTGCTCCTGGTAGTGGGTGGTCTGATTTTGCCAAAAACGTCTCTGGCGAGTGGGATGGATATGGTGCGGATTTTTCTTCTGGAGGAACGCCAATTGAACTTCCAGAATTCGTTGTCCCCGATGCTTATAGAGAATGGGAGGTTAAGGTTTTCGACTGGCAGACTCAGTGCCCCACTCTTGCGGAACCTGAGAAGCCCTCTTTCATGTACAAGACAATAAAGCTACTTCCTACAGTGGGATGTGAAGCCGATGCTGCAACCCGTTACAGCATTGATGAGAGAAATGCTGGAAATGGAATTGGTTCAAATGATGAAGTGACTGCCTTTGCGTATCAACGTAGTGGATGTTATGTAGTTGTTTGGCCGGTTAAGGTTGGGGGTTCTTATAAGTTAATGGAGTTGGAGCATTGCCTGGTTAGTCCTCAAGATCGTGAATCCCGTGTGAGGGTTGTTCAGGTTGTCCGAGTCGAAGGCACACGGCTAGTGTTGCAGAGTATCAAAGTTTTCTGCGAGCAGTGGTATGGACCATTCAGAAACGGAGAACAGCTTGGTGGATGTGCCATCCGAGACTCATCATTTGCTTCTACAGCTGCCTTGAAAGCTTCTGAGGTTGTTGGTTCATGGCAGGGTCCTGTCTCTGTTGCCCGTTTTGATGGTTCTCAGATTAATGTTATACAAGAACTTTTGGCTGACAATGTGCAAAAGTCGGTGAGAACTGAATCAGAACTCAAGGTACTTCCCAAGCAACTATGGTGTTCCCTTAAAGAAAGTGAAGACAGTGGGGATACTTGCTGTGAAGTAGGATGGCTTTTTGATCACGGACATGCAATTACATCAAGATGCATCTTCTCAAGCTCAGCAAAATTGAAGGCAAGTTTTTGCATTTAA

Protein sequence

MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAGNGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAITSRCIFSSSAKLKASFCI
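
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard nuclear genetic code (the page does not say which table was used) and uses only the first and last twelve bases of the CDS shown above; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 12 and last 12 bases of the CDS listed above.
print(translate("ATGGCGTTCGCT"))  # start of the protein
print(translate("TTTTGCATTTAA"))  # end of the protein, TAA is the stop codon
```

The two fragments translate to `MAFA` and `FCI`, which agree with the start and end of the protein sequence listed above.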
BLAST of CmaCh19G008050 vs. TrEMBL
Match: A0A0A0K0X9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G026250 PE=4 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 1.4e-184
Identity = 328/373 (87.94%), Positives = 345/373 (92.49%), Query Frame = 1

Query: 1   MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSLS 60
           MAFASSSSVICQNRALSSSVVSS  LLHHRCFSRLQSQRILHCNRR S+NIGINA+P  S
Sbjct: 1   MAFASSSSVICQNRALSSSVVSSSALLHHRCFSRLQSQRILHCNRRSSSNIGINASPGAS 60

Query: 61  SAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEF 120
           S    +VAKTALSDAHVQS SS SAPG GWSDFA+NVSGEWDGYGADFS  GTPIELPE 
Sbjct: 61  S----LVAKTALSDAHVQSYSSCSAPGPGWSDFAQNVSGEWDGYGADFSYEGTPIELPES 120

Query: 121 VVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAG 180
           VVPDAYREWEVKVFDWQTQCPTLAEPE+PS MYKTIKLLPTVGCEADAATRYSIDERN  
Sbjct: 121 VVPDAYREWEVKVFDWQTQCPTLAEPEQPSLMYKTIKLLPTVGCEADAATRYSIDERNIR 180

Query: 181 NGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRVE 240
           +GIG NDEV AF YQRSGCYVVVWP++V GS KLMELEHCLV+P DRESRVRVVQVVRVE
Sbjct: 181 DGIGGNDEVNAFGYQRSGCYVVVWPIEVRGSCKLMELEHCLVNPHDRESRVRVVQVVRVE 240

Query: 241 GTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF 300
           G+RLVLQ+I+VFCEQWYGPFRNGEQLGGCAI DS+FASTAALKASEVVG WQGPVSVARF
Sbjct: 241 GSRLVLQNIRVFCEQWYGPFRNGEQLGGCAIADSAFASTAALKASEVVGEWQGPVSVARF 300

Query: 301 DGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAI 360
           DGSQINVIQELLADNVQKSVRTESELK+LPKQLWCSLKES+DSGDT CEVGWLF HGHAI
Sbjct: 301 DGSQINVIQELLADNVQKSVRTESELKLLPKQLWCSLKESKDSGDTYCEVGWLFAHGHAI 360

Query: 361 TSRCIFSSSAKLK 374
           TSRCIFSS++KLK
Sbjct: 361 TSRCIFSSTSKLK 369
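
The percentages in an HSP statistics line are the listed counts divided by the alignment length. A minimal sketch that recomputes them from such a line follows; the function name `hsp_percentages` is hypothetical, and the parsing assumes the `label = count/length` layout used in this record.

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from 'Label = count/length' pairs in an HSP line."""
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stats_line)
    return {
        label: round(100 * int(count) / int(length), 2)
        for label, count, length in pairs
    }

# Statistics for the first TrEMBL hit (A0A0A0K0X9_CUCSA).
line = "Identity = 328/373 (87.94%), Positives = 345/373 (92.49%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit the recomputed values are 87.94 and 92.49, matching the figures reported in the line itself.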

BLAST of CmaCh19G008050 vs. TrEMBL
Match: A0A061E9F6_THECC (Uncharacterized protein isoform 3 OS=Theobroma cacao GN=TCM_007582 PE=4 SV=1)

HSP 1 Score: 427.9 bits (1099), Expect = 1.2e-116
Identity = 220/376 (58.51%), Positives = 281/376 (74.73%), Query Frame = 1

Query: 5   SSSSVICQNRALSSSVVSSP-----GLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSL 64
           ++SSVI  NR ++S     P      + HH+  S++     LH   RL     +    + 
Sbjct: 4   AASSVISHNRTINSLAFPLPKRRNIAVNHHQHHSKVY----LHSFPRLHD---VYQCSNS 63

Query: 65  SSAPSSVVAKTALSDAHVQSQS-SSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELP 124
            +   S +AKTA+SD  + + + +++A  + WS+FA+NVSGEWDG+GADFS  G+PIELP
Sbjct: 64  KTFNPSCLAKTAVSDVQLHNPNPTTAAAANAWSEFARNVSGEWDGFGADFSIEGSPIELP 123

Query: 125 EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERN 184
           E VVP+AYR+WEVKV+DWQTQCPTLAEP +    YKTIKLLPTVGCEADAATRYS+DERN
Sbjct: 124 ESVVPEAYRDWEVKVYDWQTQCPTLAEPGEKVMTYKTIKLLPTVGCEADAATRYSMDERN 183

Query: 185 AGNGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVR 244
               +G +++V+AFAYQ SGCY  +WPV   G+++L ELEHCL++P+D+ESRVR++QVVR
Sbjct: 184 I---VGVDNKVSAFAYQASGCYTAIWPVADNGTHELWELEHCLINPRDKESRVRIIQVVR 243

Query: 245 VEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVA 304
           V+GT+LVLQ+I+VFCEQWYGPFRNG+QLGGCAIRDS+FA TA  KAS+++G WQGP +VA
Sbjct: 244 VDGTKLVLQNIRVFCEQWYGPFRNGDQLGGCAIRDSAFAPTATTKASDIIGEWQGPNAVA 303

Query: 305 RFDGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGH 364
            FDGS    +QEL  +   KS+R ES L +LPKQLWC++KES   G+TC EVGWLFD G 
Sbjct: 304 TFDGSGDIFLQELKDNGSLKSIRDESNLILLPKQLWCAIKES--GGETCSEVGWLFDQGC 363

Query: 365 AITSRCIFSSSAKLKA 375
           AITSRC FSS  KLKA
Sbjct: 364 AITSRCSFSSEGKLKA 367

BLAST of CmaCh19G008050 vs. TrEMBL
Match: A0A061E2P9_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_007582 PE=4 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 2.8e-116
Identity = 219/375 (58.40%), Positives = 280/375 (74.67%), Query Frame = 1

Query: 5   SSSSVICQNRALSSSVVSSP-----GLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSL 64
           ++SSVI  NR ++S     P      + HH+  S++     LH   RL     +    + 
Sbjct: 4   AASSVISHNRTINSLAFPLPKRRNIAVNHHQHHSKVY----LHSFPRLHD---VYQCSNS 63

Query: 65  SSAPSSVVAKTALSDAHVQSQS-SSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELP 124
            +   S +AKTA+SD  + + + +++A  + WS+FA+NVSGEWDG+GADFS  G+PIELP
Sbjct: 64  KTFNPSCLAKTAVSDVQLHNPNPTTAAAANAWSEFARNVSGEWDGFGADFSIEGSPIELP 123

Query: 125 EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERN 184
           E VVP+AYR+WEVKV+DWQTQCPTLAEP +    YKTIKLLPTVGCEADAATRYS+DERN
Sbjct: 124 ESVVPEAYRDWEVKVYDWQTQCPTLAEPGEKVMTYKTIKLLPTVGCEADAATRYSMDERN 183

Query: 185 AGNGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVR 244
               +G +++V+AFAYQ SGCY  +WPV   G+++L ELEHCL++P+D+ESRVR++QVVR
Sbjct: 184 I---VGVDNKVSAFAYQASGCYTAIWPVADNGTHELWELEHCLINPRDKESRVRIIQVVR 243

Query: 245 VEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVA 304
           V+GT+LVLQ+I+VFCEQWYGPFRNG+QLGGCAIRDS+FA TA  KAS+++G WQGP +VA
Sbjct: 244 VDGTKLVLQNIRVFCEQWYGPFRNGDQLGGCAIRDSAFAPTATTKASDIIGEWQGPNAVA 303

Query: 305 RFDGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGH 364
            FDGS    +QEL  +   KS+R ES L +LPKQLWC++KES   G+TC EVGWLFD G 
Sbjct: 304 TFDGSGDIFLQELKDNGSLKSIRDESNLILLPKQLWCAIKES--GGETCSEVGWLFDQGC 363

Query: 365 AITSRCIFSSSAKLK 374
           AITSRC FSS  KLK
Sbjct: 364 AITSRCSFSSEGKLK 366

BLAST of CmaCh19G008050 vs. TrEMBL
Match: A0A061E3I4_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_007582 PE=4 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 2.8e-116
Identity = 219/375 (58.40%), Positives = 280/375 (74.67%), Query Frame = 1

Query: 5   SSSSVICQNRALSSSVVSSP-----GLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSL 64
           ++SSVI  NR ++S     P      + HH+  S++     LH   RL     +    + 
Sbjct: 4   AASSVISHNRTINSLAFPLPKRRNIAVNHHQHHSKVY----LHSFPRLHD---VYQCSNS 63

Query: 65  SSAPSSVVAKTALSDAHVQSQS-SSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELP 124
            +   S +AKTA+SD  + + + +++A  + WS+FA+NVSGEWDG+GADFS  G+PIELP
Sbjct: 64  KTFNPSCLAKTAVSDVQLHNPNPTTAAAANAWSEFARNVSGEWDGFGADFSIEGSPIELP 123

Query: 125 EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERN 184
           E VVP+AYR+WEVKV+DWQTQCPTLAEP +    YKTIKLLPTVGCEADAATRYS+DERN
Sbjct: 124 ESVVPEAYRDWEVKVYDWQTQCPTLAEPGEKVMTYKTIKLLPTVGCEADAATRYSMDERN 183

Query: 185 AGNGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVR 244
               +G +++V+AFAYQ SGCY  +WPV   G+++L ELEHCL++P+D+ESRVR++QVVR
Sbjct: 184 I---VGVDNKVSAFAYQASGCYTAIWPVADNGTHELWELEHCLINPRDKESRVRIIQVVR 243

Query: 245 VEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVA 304
           V+GT+LVLQ+I+VFCEQWYGPFRNG+QLGGCAIRDS+FA TA  KAS+++G WQGP +VA
Sbjct: 244 VDGTKLVLQNIRVFCEQWYGPFRNGDQLGGCAIRDSAFAPTATTKASDIIGEWQGPNAVA 303

Query: 305 RFDGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGH 364
            FDGS    +QEL  +   KS+R ES L +LPKQLWC++KES   G+TC EVGWLFD G 
Sbjct: 304 TFDGSGDIFLQELKDNGSLKSIRDESNLILLPKQLWCAIKES--GGETCSEVGWLFDQGC 363

Query: 365 AITSRCIFSSSAKLK 374
           AITSRC FSS  KLK
Sbjct: 364 AITSRCSFSSEGKLK 366

BLAST of CmaCh19G008050 vs. TrEMBL
Match: M5W2T0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007176mg PE=4 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 8.1e-116
Identity = 218/342 (63.74%), Positives = 264/342 (77.19%), Query Frame = 1

Query: 44  NRRLSTNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSS-----SAPGSGWSDFAKNVS 103
           NRR ST I  + A S ++        T LSD  +Q Q  S     S   + WS+FA+NVS
Sbjct: 36  NRRQSTQIVQSTASSSTN--------TTLSDVKLQIQEDSNFNPTSTSATAWSEFARNVS 95

Query: 104 GEWDGYGADFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKL 163
           GEWDGYGADF+  G PIELPE VVP AYREWEVKVFDWQTQCPTLA PE+P  +YK I+L
Sbjct: 96  GEWDGYGADFTKEGNPIELPENVVPGAYREWEVKVFDWQTQCPTLANPEEPVLVYKNIEL 155

Query: 164 LPTVGCEADAATRYSIDERNAGNGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELE 223
           LPTVGCEADAATRY++ E+N G   G N+EV+AFAYQ SGCYV VWPV+  G+ KL+ELE
Sbjct: 156 LPTVGCEADAATRYTVIEKNIG---GVNNEVSAFAYQSSGCYVAVWPVEEKGN-KLLELE 215

Query: 224 HCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFAS 283
           +CL++PQD+ESRVR++QV+R++  +++LQ+ +VFCEQWYGPFRNG+QLGGCAIRDS+FAS
Sbjct: 216 YCLINPQDKESRVRIIQVIRIDNMKMMLQNTRVFCEQWYGPFRNGDQLGGCAIRDSAFAS 275

Query: 284 TAALKASEVVGSWQGPVSVARFDG-----SQINVIQELLADNVQKSVRTESELKVLPKQL 343
           TAAL ASEVVG+WQGP ++A FDG      + N  QELL ++ QKSVR ES L +LPKQL
Sbjct: 276 TAALNASEVVGTWQGPRALANFDGYGLENDKQNFFQELLDNSEQKSVRDESGLILLPKQL 335

Query: 344 WCSLKESEDSGDTCCEVGWLFDHGHAITSRCIFSSSAKLKAS 376
           WCSLKE +D GDT  EVGWL DHG AITS+C FSS+A LKA+
Sbjct: 336 WCSLKECKD-GDTYSEVGWLLDHGRAITSKCTFSSTALLKAT 364

BLAST of CmaCh19G008050 vs. TAIR10
Match: AT4G38225.3 (AT4G38225.3 unknown protein)

HSP 1 Score: 345.1 bits (884), Expect = 5.4e-95
Identity = 174/313 (55.59%), Positives = 228/313 (72.84%), Query Frame = 1

Query: 62  APSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEFV 121
           +P S  A+++   A  QSQ +     + WS+FA+NVSGEWDG+GADF+  G P+ELPE V
Sbjct: 48  SPLSCSAESSSEIALAQSQPAPDF--NPWSEFAQNVSGEWDGFGADFTCEGQPLELPESV 107

Query: 122 VPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAGN 181
           VP+A+REWEVKVFDWQTQCPTLA+P   SF+YK+IKLLPTVGCEADAATRYSID+R  G 
Sbjct: 108 VPEAFREWEVKVFDWQTQCPTLAQPNSLSFLYKSIKLLPTVGCEADAATRYSIDQRIIGG 167

Query: 182 GIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRV-E 241
           G  S     AF+Y  +G YV VWP++       +E+EHCL++P+D+ESRVR+ QVV + E
Sbjct: 168 GKSS---ALAFSYSVTGSYVAVWPLR----NNQLEVEHCLINPKDKESRVRIFQVVSLAE 227

Query: 242 GTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF 301
            T + LQS+KVFCEQWYGPFR+G+QLGGCAIR S FA+T    AS V GSW+  ++   F
Sbjct: 228 TTNMSLQSVKVFCEQWYGPFRDGDQLGGCAIRSSGFAATPTTAASVVTGSWRVLLATTSF 287

Query: 302 DGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAI 361
             S    IQ++  + V + VR E++L +LP++LWCSL++ +D  +    VGW+F+ GHAI
Sbjct: 288 HASDFGCIQQVTGEKVIEIVREENDLLLLPQELWCSLQQGKDR-ERVFSVGWVFEPGHAI 347

Query: 362 TSRCIFSSSAKLK 374
           TS C+FSS +KLK
Sbjct: 348 TSSCVFSSDSKLK 350

BLAST of CmaCh19G008050 vs. NCBI nr
Match: gi|659100669|ref|XP_008451209.1| (PREDICTED: uncharacterized protein LOC103492572 [Cucumis melo])

HSP 1 Score: 663.3 bits (1710), Expect = 2.5e-187
Identity = 332/373 (89.01%), Positives = 347/373 (93.03%), Query Frame = 1

Query: 1   MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSLS 60
           MAFASSSSVICQNRALSSSVVSSP L HHRCFSRLQSQRILHCNRR S+NIGINA+P  S
Sbjct: 1   MAFASSSSVICQNRALSSSVVSSPALFHHRCFSRLQSQRILHCNRRSSSNIGINASPGAS 60

Query: 61  SAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEF 120
           S    VVAKTALSDAHVQS SS  AP  GWSDFA+NVSGEWDGYGADFSS GTPIELPE 
Sbjct: 61  S----VVAKTALSDAHVQSYSSCPAPAPGWSDFAQNVSGEWDGYGADFSSEGTPIELPES 120

Query: 121 VVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAG 180
           VVPDAYREWEVKVFDWQTQCPTLAEPE+PS MYKTIKLLPTVGCEADAATRYSIDERN G
Sbjct: 121 VVPDAYREWEVKVFDWQTQCPTLAEPEQPSLMYKTIKLLPTVGCEADAATRYSIDERNIG 180

Query: 181 NGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRVE 240
           +GIG N EVTAF YQRSGCYVVVWP++VGGS KLMELEHCLV+PQDRESRVRVVQVVRVE
Sbjct: 181 DGIGGNGEVTAFGYQRSGCYVVVWPIEVGGSCKLMELEHCLVNPQDRESRVRVVQVVRVE 240

Query: 241 GTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF 300
           G+RLVLQ+IKVFCEQWYGPFRNGEQLGGCAI DS+FASTAALKASEVVG WQGPVSVARF
Sbjct: 241 GSRLVLQNIKVFCEQWYGPFRNGEQLGGCAINDSAFASTAALKASEVVGKWQGPVSVARF 300

Query: 301 DGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAI 360
           DGSQINVIQELLADNVQKSVRTESELK+LPKQLWCSLKES+DSGDT CEVGWLF HGHAI
Sbjct: 301 DGSQINVIQELLADNVQKSVRTESELKLLPKQLWCSLKESKDSGDTYCEVGWLFAHGHAI 360

Query: 361 TSRCIFSSSAKLK 374
           TSRCIFSS++KLK
Sbjct: 361 TSRCIFSSTSKLK 369

BLAST of CmaCh19G008050 vs. NCBI nr
Match: gi|449471978|ref|XP_004153459.1| (PREDICTED: uncharacterized protein LOC101221421 [Cucumis sativus])

HSP 1 Score: 653.7 bits (1685), Expect = 2.0e-184
Identity = 328/373 (87.94%), Positives = 345/373 (92.49%), Query Frame = 1

Query: 1   MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSLS 60
           MAFASSSSVICQNRALSSSVVSS  LLHHRCFSRLQSQRILHCNRR S+NIGINA+P  S
Sbjct: 1   MAFASSSSVICQNRALSSSVVSSSALLHHRCFSRLQSQRILHCNRRSSSNIGINASPGAS 60

Query: 61  SAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEF 120
           S    +VAKTALSDAHVQS SS SAPG GWSDFA+NVSGEWDGYGADFS  GTPIELPE 
Sbjct: 61  S----LVAKTALSDAHVQSYSSCSAPGPGWSDFAQNVSGEWDGYGADFSYEGTPIELPES 120

Query: 121 VVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAG 180
           VVPDAYREWEVKVFDWQTQCPTLAEPE+PS MYKTIKLLPTVGCEADAATRYSIDERN  
Sbjct: 121 VVPDAYREWEVKVFDWQTQCPTLAEPEQPSLMYKTIKLLPTVGCEADAATRYSIDERNIR 180

Query: 181 NGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRVE 240
           +GIG NDEV AF YQRSGCYVVVWP++V GS KLMELEHCLV+P DRESRVRVVQVVRVE
Sbjct: 181 DGIGGNDEVNAFGYQRSGCYVVVWPIEVRGSCKLMELEHCLVNPHDRESRVRVVQVVRVE 240

Query: 241 GTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF 300
           G+RLVLQ+I+VFCEQWYGPFRNGEQLGGCAI DS+FASTAALKASEVVG WQGPVSVARF
Sbjct: 241 GSRLVLQNIRVFCEQWYGPFRNGEQLGGCAIADSAFASTAALKASEVVGEWQGPVSVARF 300

Query: 301 DGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAI 360
           DGSQINVIQELLADNVQKSVRTESELK+LPKQLWCSLKES+DSGDT CEVGWLF HGHAI
Sbjct: 301 DGSQINVIQELLADNVQKSVRTESELKLLPKQLWCSLKESKDSGDTYCEVGWLFAHGHAI 360

Query: 361 TSRCIFSSSAKLK 374
           TSRCIFSS++KLK
Sbjct: 361 TSRCIFSSTSKLK 369

BLAST of CmaCh19G008050 vs. NCBI nr
Match: gi|1009178066|ref|XP_015870320.1| (PREDICTED: uncharacterized protein LOC107407544 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 450.7 bits (1158), Expect = 2.6e-123
Identity = 231/358 (64.53%), Positives = 273/358 (76.26%), Query Frame = 1

Query: 17  SSSVVSSPGLLHHRCFS-RLQSQRILHCNRRLSTNIGINAAPSLSSAPSSVVAKTALSDA 76
           SS   + P L HH   + R + +RI H         G++A+ S  +   S V +     A
Sbjct: 28  SSPNPTHPVLFHHFPHTLRPRIRRIAH---------GVSASKSSVTTTLSDVEQHRQLKA 87

Query: 77  HVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEFVVPDAYREWEVKVFD 136
           +    + +S   + WS+FA+NVSGEWDGYGA+FS+ G PIELPE VVP+AYREWEVK+FD
Sbjct: 88  NDNGSTPTSTSNTAWSEFARNVSGEWDGYGAEFSNEGNPIELPESVVPEAYREWEVKLFD 147

Query: 137 WQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAGNGIGSNDEVTAFAYQ 196
           WQTQCPTLA+PE  +  YK IKLLPTVGCEADAATRYSIDERN G   G ND+V+AFAYQ
Sbjct: 148 WQTQCPTLADPEGSALNYKLIKLLPTVGCEADAATRYSIDERNIG---GLNDKVSAFAYQ 207

Query: 197 RSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQ 256
            SGCYV VWP +  GSY LMELEHCL++PQDRESRVRV+QV+R+E  ++VLQSIKVFCEQ
Sbjct: 208 SSGCYVAVWPTENKGSYNLMELEHCLINPQDRESRVRVIQVIRLESMKMVLQSIKVFCEQ 267

Query: 257 WYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADN 316
           WYGPFRNG+QLGGCAIRDS+FASTAAL+ASEVVG WQGP +VA FD SQ N +QEL+ DN
Sbjct: 268 WYGPFRNGDQLGGCAIRDSAFASTAALEASEVVGIWQGPNAVANFDASQANSLQELVDDN 327

Query: 317 VQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAITSRCIFSSSAKLK 374
           +QKSVR   +L +LPKQLWCSLKES D G TC EVGWL +HGHAITS+C FSS   LK
Sbjct: 328 MQKSVRDGLDLVLLPKQLWCSLKESNDGG-TCSEVGWLLNHGHAITSKCTFSSKTTLK 372

BLAST of CmaCh19G008050 vs. NCBI nr
Match: gi|1009178068|ref|XP_015870321.1| (PREDICTED: uncharacterized protein LOC107407544 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 446.4 bits (1147), Expect = 4.9e-122
Identity = 231/358 (64.53%), Positives = 273/358 (76.26%), Query Frame = 1

Query: 17  SSSVVSSPGLLHHRCFS-RLQSQRILHCNRRLSTNIGINAAPSLSSAPSSVVAKTALSDA 76
           SS   + P L HH   + R + +RI H         G++A+ S  +   S V +     A
Sbjct: 28  SSPNPTHPVLFHHFPHTLRPRIRRIAH---------GVSASKSSVTTTLSDVEQHRQLKA 87

Query: 77  HVQSQSSSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELPEFVVPDAYREWEVKVFD 136
           +    + +S   + WS+FA+NVSGEWDGYGA+FS+ G PIELPE VVP+AYREWEVK+FD
Sbjct: 88  NDNGSTPTSTSNTAWSEFARNVSGEWDGYGAEFSNEGNPIELPESVVPEAYREWEVKLFD 147

Query: 137 WQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNAGNGIGSNDEVTAFAYQ 196
           WQTQCPTLA+PE  +  YK IKLLPTVGCEADAATRYSIDERN G   G ND+V+AFAYQ
Sbjct: 148 WQTQCPTLADPEGSALNYKLIKLLPTVGCEADAATRYSIDERNIG---GLNDKVSAFAYQ 207

Query: 197 RSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQ 256
            SGCYV VWP +  GSY LMELEHCL++PQDRESRVRV+QV+R+E  ++VLQSIKVFCEQ
Sbjct: 208 SSGCYVAVWPTENKGSYNLMELEHCLINPQDRESRVRVIQVIRLESMKMVLQSIKVFCEQ 267

Query: 257 WYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADN 316
           WYGPFRNG+QLGGCAIRDS+FASTAAL+ASEVVG WQGP +VA FD SQ N +QEL+ DN
Sbjct: 268 WYGPFRNGDQLGGCAIRDSAFASTAALEASEVVGIWQGPNAVANFDASQ-NSLQELVDDN 327

Query: 317 VQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGHAITSRCIFSSSAKLK 374
           +QKSVR   +L +LPKQLWCSLKES D G TC EVGWL +HGHAITS+C FSS   LK
Sbjct: 328 MQKSVRDGLDLVLLPKQLWCSLKESNDGG-TCSEVGWLLNHGHAITSKCTFSSKTTLK 371

BLAST of CmaCh19G008050 vs. NCBI nr
Match: gi|590688938|ref|XP_007043087.1| (Uncharacterized protein isoform 3 [Theobroma cacao])

HSP 1 Score: 427.9 bits (1099), Expect = 1.8e-116
Identity = 220/376 (58.51%), Positives = 281/376 (74.73%), Query Frame = 1

Query: 5   SSSSVICQNRALSSSVVSSP-----GLLHHRCFSRLQSQRILHCNRRLSTNIGINAAPSL 64
           ++SSVI  NR ++S     P      + HH+  S++     LH   RL     +    + 
Sbjct: 4   AASSVISHNRTINSLAFPLPKRRNIAVNHHQHHSKVY----LHSFPRLHD---VYQCSNS 63

Query: 65  SSAPSSVVAKTALSDAHVQSQS-SSSAPGSGWSDFAKNVSGEWDGYGADFSSGGTPIELP 124
            +   S +AKTA+SD  + + + +++A  + WS+FA+NVSGEWDG+GADFS  G+PIELP
Sbjct: 64  KTFNPSCLAKTAVSDVQLHNPNPTTAAAANAWSEFARNVSGEWDGFGADFSIEGSPIELP 123

Query: 125 EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERN 184
           E VVP+AYR+WEVKV+DWQTQCPTLAEP +    YKTIKLLPTVGCEADAATRYS+DERN
Sbjct: 124 ESVVPEAYRDWEVKVYDWQTQCPTLAEPGEKVMTYKTIKLLPTVGCEADAATRYSMDERN 183

Query: 185 AGNGIGSNDEVTAFAYQRSGCYVVVWPVKVGGSYKLMELEHCLVSPQDRESRVRVVQVVR 244
               +G +++V+AFAYQ SGCY  +WPV   G+++L ELEHCL++P+D+ESRVR++QVVR
Sbjct: 184 I---VGVDNKVSAFAYQASGCYTAIWPVADNGTHELWELEHCLINPRDKESRVRIIQVVR 243

Query: 245 VEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVA 304
           V+GT+LVLQ+I+VFCEQWYGPFRNG+QLGGCAIRDS+FA TA  KAS+++G WQGP +VA
Sbjct: 244 VDGTKLVLQNIRVFCEQWYGPFRNGDQLGGCAIRDSAFAPTATTKASDIIGEWQGPNAVA 303

Query: 305 RFDGSQINVIQELLADNVQKSVRTESELKVLPKQLWCSLKESEDSGDTCCEVGWLFDHGH 364
            FDGS    +QEL  +   KS+R ES L +LPKQLWC++KES   G+TC EVGWLFD G 
Sbjct: 304 TFDGSGDIFLQELKDNGSLKSIRDESNLILLPKQLWCAIKES--GGETCSEVGWLFDQGC 363

Query: 365 AITSRCIFSSSAKLKA 375
           AITSRC FSS  KLKA
Sbjct: 364 AITSRCSFSSEGKLKA 367

The following BLAST results are available for this feature:
TrEMBL
Match Name                         E-value    Identity  Description
A0A0A0K0X9_CUCSA                   1.4e-184   87.94     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G026250 PE=4 SV=1
A0A061E9F6_THECC                   1.2e-116   58.51     Uncharacterized protein isoform 3 OS=Theobroma cacao GN=TCM_007582 PE=4 SV=1
A0A061E2P9_THECC                   2.8e-116   58.40     Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_007582 PE=4 SV=1
A0A061E3I4_THECC                   2.8e-116   58.40     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_007582 PE=4 SV=1
M5W2T0_PRUPE                       8.1e-116   63.74     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007176mg PE=4 SV=1

TAIR10
Match Name                         E-value    Identity  Description
AT4G38225.3                        5.4e-95    55.59     unknown protein

NCBI nr
Match Name                         E-value    Identity  Description
gi|659100669|ref|XP_008451209.1|   2.5e-187   89.01     PREDICTED: uncharacterized protein LOC103492572 [Cucumis melo]
gi|449471978|ref|XP_004153459.1|   2.0e-184   87.94     PREDICTED: uncharacterized protein LOC101221421 [Cucumis sativus]
gi|1009178066|ref|XP_015870320.1|  2.6e-123   64.53     PREDICTED: uncharacterized protein LOC107407544 isoform X1 [Ziziphus jujuba]
gi|1009178068|ref|XP_015870321.1|  4.9e-122   64.53     PREDICTED: uncharacterized protein LOC107407544 isoform X2 [Ziziphus jujuba]
gi|590688938|ref|XP_007043087.1|   1.8e-116   58.51     Uncharacterized protein isoform 3 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh19G008050.1  CmaCh19G008050.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term     Source Description           Alignment
None      No IPR available  PANTHER  PTHR11014       PEPTIDASE M20 FAMILY MEMBER  coord: 66..184, score: 1.8
None      No IPR available  PANTHER  PTHR11014:SF83  SUBFAMILY NOT NAMED          coord: 66..184, score: 1.8

The following gene(s) are paralogous to this gene:

None