CsGy3G037020 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy3G037020
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionSeed maturation-like protein
LocationGy14Chr3: 34981833 .. 34985109 (-)
RNA-Seq ExpressionCsGy3G037020
SyntenyCsGy3G037020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTTACTAATATTATATTTTTAAATATAATAAAATAAATTAAAATATTTACAAACTAATCGATTTTATATTTTAAAGCTTATACTAATTGAAAATCACTCATCCTAAATGGAAGAAGCGGATCTAAGTGCAATGATAAGGAGGATATTTTTGAAAATTCCATTGTCCACAGTCTAGTTTCTCATTTTACACCTAGAAAAAGAAAATAAATATTCCCACCAACAATTTACACCACAACATTTTTTTTTTCTTTTTTTCCTTTTAATAAACATTATGTTCGACGTGATATATAAATATATTCTTGTATTAATATGTGAAGTGAAGTCGTGAGTTATATTCACGTATTTACAAGGTTCGGAAAGCGAAGCTCAGAGATAGGAACACAAAATGGGATTTTGATTGGTAGAATCTCCAAGAGCAGCGAGTCCACATAATAGAGCGAGCTCGCCGTTGAAGGGATCTTCCAACCTTTTCTTCTTCCTTCCTTCCTTCCTTTCCCATGGCGGTCTCTGCTCGTGCCTTCTTCCTTTCTCGTCTCACTGACTTTTCTATCAAGCCCCGTCTACCTCCCCAACCACCACCGCCGCCTCCGCCCTTGCCTTCTTTTTCCTATTCACATCTCACCCTTCAACGTCGCCGTTTCCCCTCCGCCTCTACGTCTGGTGCAACCACTGTTAGCTGCCTTGTCTCCGGTGTTGATGGTGGTGGAGTTTCCGATGACTTTGTTTCCACTCGGAAGTTGAAATTCGATCGCGGATTCTCTGTAATCGCTAATATGCTTAAGCGGATTGAGCCGCTTCATACATCCGATATCTCCAAGGGTGTTTCTGATGTTGCTAAGGATTCTATGAAGCAGACTATTTCTTCAATGTTGGGTTTGCTTCCTTCTGATCAGTTCTCCGTCACCGTTAGGGTTTCCAAAAGCCCTCTCCATAATCTCCTCTCTTCGTCAATCATCACCGGGTAATTCTTTGCTGTTTTACGCAGGTTTGATGAAATATTTTTGTGTTTAGGTGTTGTTTCCCGTGGAATTCTTTGAATTGAGATGAGTTACTCTGTTTAGGTACACTTTGTGGAACGCAGAGTACCGGTTGTCTTTGATGAGGAATTTCGATATCTCGCCGGATAATTTGACGGGCTTGGACAGGTCGAAGCCATTGGAAGTTTCGGATATTGAGGAGAATCGGGTAGGCGTTGATTCCAATATGGAGGATTTGGATACGAGACCTCGTCTCTTGTCCGATTTGCCACCCGAGGCGTTGAAGTACATCCAACAGTTGCAAACGGAATTATCAAATCTTAAGGATGTAAGGATTTACCGAGTTGCTTACAATCTCCACTCTTGCTTATTGACAACACTTCTGTTCTATTACATTTACGAAATCATTTCTGCACTTTCTTTTATAGAAAAGGATCAGTATCCTGGATTGCGGCAGAACCTTACATCTGTTTCATGATTTTATTTCTAAAAATTAAAAAATGAATGGTTTCATTTACTATGTTAAGATTCAATGTTAGTTGGATACAATTCCTCATAGTTTTATGCTATTAGCTATGTTATTCGGTTCTTTTATGGTTTATTAATTCATACTACTTGCAGCGAGGAGTTACATGTTGTCACTGTTCGACTGAATTCTTAGTTTACTATGCATATTTGTTTTCTCTAATGTTTAGACGAATCTTTTTAGTTAAGCCTTAAGGTTTGTCATTTATTTTGAGTCAAGCTTAAATATAAATGTAAATTCCTCAATTGGTAATTGCTTGTCAGATGCTAGAAATACTGTATGTAGACAAATACAAAATTTTAATAGGTTCTTTGGTTTCTTTTTTTCTCTCTCACTGATCTCTCTTGTATCTCTTTGGATATTTCATTCATCAATGGAATGTTTCTTATCCAAAAAAATTGTCAATATGTTCCCATATGTTATATGTTGAGGGTTTGGGCAATTGAAGTACTACATATTTCCTCCATTTCCACTATTTACCTAGTTTTACAATATTTTGGAAGCTTTTAGTTTTAGATTTATTCAAGCATTTAAATTATTTTCAGTGTTTATCAGGCATGCAGCTAGAGGTCATAAATTTTGTAGAAAAATTATTATGCTAGGTTAAGTGCTGTGATTCATCATGGATATTGGCTCCAATTTTTCAAAAGAAAACCATTAGTTGGCTATCATGTTCTTTTGTTCAATCATATTTGGGCTCCCTTAATTTATATTCAAAGGGTTAAGGCAAAATAAACTAGCGTTTGATTGCTTACATCATTCAGCATATTTCAGTTATCTGTCTTTTAGTATAGTTCATTCAATCAAGGTAGGGAGGTTTTCCTGTTTATTGGGGTGGGATTAAATGTTTTTTCTGCCTTGTTAATGCACATATTTTGCACCTTAATTTAGGAAAAGAAAAGGCTTCATGACTTGATGATTCTTTGTGTTACCTTCCCAATTTTTTGTTTTGAATGTGTCAGGAACTCAATGCTCAGAAGCAAGAAAATATCCATATAGAACATGGCAGAGGAAACAGGAACGATCTGTTGGAATATCTACGGTCGTTGGATTCCGATATGGTACTTCATTTTTTCACTTCTGTAACTATACTGTTGTGCCTATATGCCTTTATTTTTCATCTTGGAAGTAGGCCCAGCTGTTGAACTTCCTTCTGGGGGCTCTCATTTTTACCGCCACGTTTATTATGTAGGTGACTGAACTCTGCAAACCATCTACGTCAGAGGTGGAGGAAATCATTCATGAACTTGTTGGAAACATATTGCAAAGGTTCTTCAAAGATGATGCCAGCTCTAGTTTCATTGAGGATTCAAGTGTGGCGGATTTAGAGAAACTTGCAGATGCTGGTGATGAGTTTTGTGATACTGTAGGCACTTCTCGGGATTACCTAGCAAAGCTTTTATTCTGGTTTGTTGGCAACCATTGGCCTCTATTTTATCTTTTAGTTTATATTCATTTTAATTCTGGAAATTTTTTACCATGGTTAAACCTGAGTGGTCTCGTGCAAACTAAGAACTTTGAAGCTCGTTTGCAACGAAAATGTTATTGACTAGACTTGTTTTAGGTTCCAAACAGATCTTGCGGCCCTTTTCAAGAATCTTAAAATGACTCATGTTCAATGATCTAACTCGTTGATGGAATTTTCCTGCTGCAGGTGTATGTTATTGGGGCATCACATGAGAAGCTTGGAGAATAGGCTGCAGTTAAGCTGTGTCGTTGGGTTGTTATGAGAGAAGGAAGTGTACATTCTTTCTGTACCCTGACCCCCCCCC

mRNA sequence

GTTTTACTAATATTATATTTTTAAATATAATAAAATAAATTAAAATATTTACAAACTAATCGATTTTATATTTTAAAGCTTATACTAATTGAAAATCACTCATCCTAAATGGAAGAAGCGGATCTAAGTGCAATGATAAGGAGGATATTTTTGAAAATTCCATTGTCCACAGTCTAGTTTCTCATTTTACACCTAGAAAAAGAAAATAAATATTCCCACCAACAATTTACACCACAACATTTTTTTTTTCTTTTTTTCCTTTTAATAAACATTATGTTCGACGTGATATATAAATATATTCTTGTATTAATATGTGAAGTGAAGTCGTGAGTTATATTCACGTATTTACAAGGTTCGGAAAGCGAAGCTCAGAGATAGGAACACAAAATGGGATTTTGATTGGTAGAATCTCCAAGAGCAGCGAGTCCACATAATAGAGCGAGCTCGCCGTTGAAGGGATCTTCCAACCTTTTCTTCTTCCTTCCTTCCTTCCTTTCCCATGGCGGTCTCTGCTCGTGCCTTCTTCCTTTCTCGTCTCACTGACTTTTCTATCAAGCCCCGTCTACCTCCCCAACCACCACCGCCGCCTCCGCCCTTGCCTTCTTTTTCCTATTCACATCTCACCCTTCAACGTCGCCGTTTCCCCTCCGCCTCTACGTCTGGTGCAACCACTGTTAGCTGCCTTGTCTCCGGTGTTGATGGTGGTGGAGTTTCCGATGACTTTGTTTCCACTCGGAAGTTGAAATTCGATCGCGGATTCTCTGTAATCGCTAATATGCTTAAGCGGATTGAGCCGCTTCATACATCCGATATCTCCAAGGGTGTTTCTGATGTTGCTAAGGATTCTATGAAGCAGACTATTTCTTCAATGTTGGGTTTGCTTCCTTCTGATCAGTTCTCCGTCACCGTTAGGGTTTCCAAAAGCCCTCTCCATAATCTCCTCTCTTCGTCAATCATCACCGGGTACACTTTGTGGAACGCAGAGTACCGGTTGTCTTTGATGAGGAATTTCGATATCTCGCCGGATAATTTGACGGGCTTGGACAGGTCGAAGCCATTGGAAGTTTCGGATATTGAGGAGAATCGGGTAGGCGTTGATTCCAATATGGAGGATTTGGATACGAGACCTCGTCTCTTGTCCGATTTGCCACCCGAGGCGTTGAAGTACATCCAACAGTTGCAAACGGAATTATCAAATCTTAAGGATGAACTCAATGCTCAGAAGCAAGAAAATATCCATATAGAACATGGCAGAGGAAACAGGAACGATCTGTTGGAATATCTACGGTCGTTGGATTCCGATATGGTGACTGAACTCTGCAAACCATCTACGTCAGAGGTGGAGGAAATCATTCATGAACTTGTTGGAAACATATTGCAAAGGTTCTTCAAAGATGATGCCAGCTCTAGTTTCATTGAGGATTCAAGTGTGGCGGATTTAGAGAAACTTGCAGATGCTGGTGATGAGTTTTGTGATACTGTAGGCACTTCTCGGGATTACCTAGCAAAGCTTTTATTCTGGTGTATGTTATTGGGGCATCACATGAGAAGCTTGGAGAATAGGCTGCAGTTAAGCTGTGTCGTTGGGTTGTTATGAGAGAAGGAAGTGTACATTCTTTCTGTACCCTGACCCCCCCCC

Coding sequence (CDS)

ATGGCGGTCTCTGCTCGTGCCTTCTTCCTTTCTCGTCTCACTGACTTTTCTATCAAGCCCCGTCTACCTCCCCAACCACCACCGCCGCCTCCGCCCTTGCCTTCTTTTTCCTATTCACATCTCACCCTTCAACGTCGCCGTTTCCCCTCCGCCTCTACGTCTGGTGCAACCACTGTTAGCTGCCTTGTCTCCGGTGTTGATGGTGGTGGAGTTTCCGATGACTTTGTTTCCACTCGGAAGTTGAAATTCGATCGCGGATTCTCTGTAATCGCTAATATGCTTAAGCGGATTGAGCCGCTTCATACATCCGATATCTCCAAGGGTGTTTCTGATGTTGCTAAGGATTCTATGAAGCAGACTATTTCTTCAATGTTGGGTTTGCTTCCTTCTGATCAGTTCTCCGTCACCGTTAGGGTTTCCAAAAGCCCTCTCCATAATCTCCTCTCTTCGTCAATCATCACCGGGTACACTTTGTGGAACGCAGAGTACCGGTTGTCTTTGATGAGGAATTTCGATATCTCGCCGGATAATTTGACGGGCTTGGACAGGTCGAAGCCATTGGAAGTTTCGGATATTGAGGAGAATCGGGTAGGCGTTGATTCCAATATGGAGGATTTGGATACGAGACCTCGTCTCTTGTCCGATTTGCCACCCGAGGCGTTGAAGTACATCCAACAGTTGCAAACGGAATTATCAAATCTTAAGGATGAACTCAATGCTCAGAAGCAAGAAAATATCCATATAGAACATGGCAGAGGAAACAGGAACGATCTGTTGGAATATCTACGGTCGTTGGATTCCGATATGGTGACTGAACTCTGCAAACCATCTACGTCAGAGGTGGAGGAAATCATTCATGAACTTGTTGGAAACATATTGCAAAGGTTCTTCAAAGATGATGCCAGCTCTAGTTTCATTGAGGATTCAAGTGTGGCGGATTTAGAGAAACTTGCAGATGCTGGTGATGAGTTTTGTGATACTGTAGGCACTTCTCGGGATTACCTAGCAAAGCTTTTATTCTGGTGTATGTTATTGGGGCATCACATGAGAAGCTTGGAGAATAGGCTGCAGTTAAGCTGTGTCGTTGGGTTGTTATGA

Protein sequence

MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVSCLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQTISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTGLDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNAQKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDDASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSCVVGLL*
Homology
BLAST of CsGy3G037020 vs. NCBI nr
Match: XP_011652254.1 (uncharacterized protein LOC101208572 [Cucumis sativus] >KGN59616.1 hypothetical protein Csa_001093 [Cucumis sativus])

HSP 1 Score: 706 bits (1821), Expect = 6.07e-256
Identity = 365/365 (100.00%), Postives = 365/365 (100.00%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS
Sbjct: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT
Sbjct: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG
Sbjct: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
           LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA
Sbjct: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC
Sbjct: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 365

BLAST of CsGy3G037020 vs. NCBI nr
Match: XP_008443458.1 (PREDICTED: uncharacterized protein LOC103487044 [Cucumis melo] >KAA0053716.1 uncharacterized protein E6C27_scaffold135G00920 [Cucumis melo var. makuwa] >TYK17742.1 uncharacterized protein E5676_scaffold1804G00130 [Cucumis melo var. makuwa])

HSP 1 Score: 681 bits (1757), Expect = 3.20e-246
Identity = 354/365 (96.99%), Postives = 358/365 (98.08%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPS  TSGATTVS
Sbjct: 1   MAASARAFFLSRVTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPS--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT
Sbjct: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNL G
Sbjct: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLMG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
           LDRSKPLEVSDIEEN VGVDSNMEDLDTRPRLLSDLPPEALKYI+QLQTELSNLKDELNA
Sbjct: 181 LDRSKPLEVSDIEENLVGVDSNMEDLDTRPRLLSDLPPEALKYIEQLQTELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQEN  IEHGRGNRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQENFQIEHGRGNRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 363

BLAST of CsGy3G037020 vs. NCBI nr
Match: XP_038905998.1 (uncharacterized protein LOC120091907 [Benincasa hispida])

HSP 1 Score: 652 bits (1682), Expect = 8.58e-235
Identity = 339/365 (92.88%), Postives = 352/365 (96.44%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SAR FFLSR+TDFSIKPRLPPQPPPPPPPLPSFS+SHL+  RRRFPS  TSGATTVS
Sbjct: 1   MASSARTFFLSRVTDFSIKPRLPPQPPPPPPPLPSFSFSHLSFHRRRFPS--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CL+SGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPL TSDISKGVSDVAKDSMKQT
Sbjct: 61  CLISGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLDTSDISKGVSDVAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPD LTG
Sbjct: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDKLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
           L+RSK LEVSDIEE RVGVDSNMEDLDTRPRLL+DLPPEALKYIQQLQ+ELSNLKDELNA
Sbjct: 181 LERSKSLEVSDIEEIRVGVDSNMEDLDTRPRLLTDLPPEALKYIQQLQSELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           +KQEN+ IEHGRGNRNDLLEYLRSLDS+MVTELCKPST EVEEIIHELVGNILQRFFKDD
Sbjct: 241 RKQENMQIEHGRGNRNDLLEYLRSLDSNMVTELCKPSTLEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASS+FIE SSVADLEKLADAG+EFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSTFIEHSSVADLEKLADAGNEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 363

BLAST of CsGy3G037020 vs. NCBI nr
Match: XP_023522197.1 (uncharacterized protein LOC111786074 [Cucurbita pepo subsp. pepo] >XP_023526062.1 uncharacterized protein LOC111789657 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 629 bits (1623), Expect = 7.48e-226
Identity = 329/365 (90.14%), Postives = 344/365 (94.25%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPP   PPPPPLPSFSYSHL LQRRRFPS  TSGATTVS
Sbjct: 1   MAASARAFFLSRITDFSIKPRLPP---PPPPPLPSFSYSHLGLQRRRFPS--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CL+SGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPL TS+IS+GVSD AKDSMKQT
Sbjct: 61  CLISGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLDTSNISQGVSDAAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQF+VTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSL RNFDISPDNLTG
Sbjct: 121 ISSMLGLLPSDQFAVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLTRNFDISPDNLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
            DRSKPLEVSD+EE RVGVDSN+EDLDTRPRLL+   PEALKYIQQLQ+ELSNLKDELNA
Sbjct: 181 FDRSKPLEVSDVEEIRVGVDSNLEDLDTRPRLLTGFSPEALKYIQQLQSELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQ N+ +EHG+ NRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQVNMQMEHGKENRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASS+F EDSSVADLEKLAD G EFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSTFNEDSSVADLEKLADVGSEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 360

BLAST of CsGy3G037020 vs. NCBI nr
Match: XP_022935548.1 (uncharacterized protein LOC111442389 [Cucurbita moschata])

HSP 1 Score: 627 bits (1617), Expect = 6.14e-225
Identity = 328/365 (89.86%), Postives = 343/365 (93.97%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPP   PPPPPLPSFSYSHL LQRRRFP   TSGATTVS
Sbjct: 1   MAASARAFFLSRVTDFSIKPRLPP---PPPPPLPSFSYSHLGLQRRRFPP--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CL+SGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPL TS+IS+GVSD AKDSMKQT
Sbjct: 61  CLISGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLDTSNISQGVSDAAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQF+VTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSL RNFDISPDNLTG
Sbjct: 121 ISSMLGLLPSDQFAVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLTRNFDISPDNLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
            DRSKPLEVSD+ E RVGVDSN+EDLDTRPRLL+D  PEALKYIQQLQ+ELSNLKDELNA
Sbjct: 181 FDRSKPLEVSDVGEIRVGVDSNLEDLDTRPRLLTDFSPEALKYIQQLQSELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQ N+ +EHG+ NRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQVNMQMEHGKENRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASS+F EDSSVADLEKLAD G EFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSTFNEDSSVADLEKLADVGSEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 360

BLAST of CsGy3G037020 vs. ExPASy TrEMBL
Match: A0A0A0LHU8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G829060 PE=4 SV=1)

HSP 1 Score: 706 bits (1821), Expect = 2.94e-256
Identity = 365/365 (100.00%), Postives = 365/365 (100.00%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS
Sbjct: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT
Sbjct: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG
Sbjct: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
           LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA
Sbjct: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC
Sbjct: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 365

BLAST of CsGy3G037020 vs. ExPASy TrEMBL
Match: A0A5A7UHK4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1804G00130 PE=4 SV=1)

HSP 1 Score: 681 bits (1757), Expect = 1.55e-246
Identity = 354/365 (96.99%), Postives = 358/365 (98.08%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPS  TSGATTVS
Sbjct: 1   MAASARAFFLSRVTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPS--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT
Sbjct: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNL G
Sbjct: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLMG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
           LDRSKPLEVSDIEEN VGVDSNMEDLDTRPRLLSDLPPEALKYI+QLQTELSNLKDELNA
Sbjct: 181 LDRSKPLEVSDIEENLVGVDSNMEDLDTRPRLLSDLPPEALKYIEQLQTELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQEN  IEHGRGNRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQENFQIEHGRGNRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 363

BLAST of CsGy3G037020 vs. ExPASy TrEMBL
Match: A0A1S3B853 (uncharacterized protein LOC103487044 OS=Cucumis melo OX=3656 GN=LOC103487044 PE=4 SV=1)

HSP 1 Score: 681 bits (1757), Expect = 1.55e-246
Identity = 354/365 (96.99%), Postives = 358/365 (98.08%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPS  TSGATTVS
Sbjct: 1   MAASARAFFLSRVTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPS--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT
Sbjct: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNL G
Sbjct: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLMG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
           LDRSKPLEVSDIEEN VGVDSNMEDLDTRPRLLSDLPPEALKYI+QLQTELSNLKDELNA
Sbjct: 181 LDRSKPLEVSDIEENLVGVDSNMEDLDTRPRLLSDLPPEALKYIEQLQTELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQEN  IEHGRGNRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQENFQIEHGRGNRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 363

BLAST of CsGy3G037020 vs. ExPASy TrEMBL
Match: A0A6J1FAU6 (uncharacterized protein LOC111442389 OS=Cucurbita moschata OX=3662 GN=LOC111442389 PE=4 SV=1)

HSP 1 Score: 627 bits (1617), Expect = 2.97e-225
Identity = 328/365 (89.86%), Postives = 343/365 (93.97%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPP   PPPPPLPSFSYSHL LQRRRFP   TSGATTVS
Sbjct: 1   MAASARAFFLSRVTDFSIKPRLPP---PPPPPLPSFSYSHLGLQRRRFPP--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CL+SGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPL TS+IS+GVSD AKDSMKQT
Sbjct: 61  CLISGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLDTSNISQGVSDAAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQF+VTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSL RNFDISPDNLTG
Sbjct: 121 ISSMLGLLPSDQFAVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLTRNFDISPDNLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
            DRSKPLEVSD+ E RVGVDSN+EDLDTRPRLL+D  PEALKYIQQLQ+ELSNLKDELNA
Sbjct: 181 FDRSKPLEVSDVGEIRVGVDSNLEDLDTRPRLLTDFSPEALKYIQQLQSELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQ N+ +EHG+ NRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQVNMQMEHGKENRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASS+F EDSSVADLEKLAD G EFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSTFNEDSSVADLEKLADVGSEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 360

BLAST of CsGy3G037020 vs. ExPASy TrEMBL
Match: A0A6J1J6Z6 (uncharacterized protein LOC111482318 OS=Cucurbita maxima OX=3661 GN=LOC111482318 PE=4 SV=1)

HSP 1 Score: 626 bits (1614), Expect = 8.51e-225
Identity = 327/365 (89.59%), Postives = 343/365 (93.97%), Query Frame = 0

Query: 1   MAVSARAFFLSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 60
           MA SARAFFLSR+TDFSIKPRLPP   PPPP LPSFSYSHL LQRRRFP   TSGATTVS
Sbjct: 1   MAASARAFFLSRITDFSIKPRLPP---PPPPSLPSFSYSHLGLQRRRFPP--TSGATTVS 60

Query: 61  CLVSGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQT 120
           CL+SGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPL TS+IS+GVSD AKDSMKQT
Sbjct: 61  CLISGVDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLDTSNISQGVSDAAKDSMKQT 120

Query: 121 ISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTG 180
           ISSMLGLLPSDQF+VTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSL RNFDISPDNLTG
Sbjct: 121 ISSMLGLLPSDQFAVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLTRNFDISPDNLTG 180

Query: 181 LDRSKPLEVSDIEENRVGVDSNMEDLDTRPRLLSDLPPEALKYIQQLQTELSNLKDELNA 240
            DRSKPLEVSD++E RVGVDSN+EDLDTRPRLL+D  PEALKYIQQLQ+ELSNLKDELNA
Sbjct: 181 FDRSKPLEVSDVQEIRVGVDSNLEDLDTRPRLLTDFSPEALKYIQQLQSELSNLKDELNA 240

Query: 241 QKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300
           QKQ N+ +EHG+ NRNDLLEYLRSLDS+MVTELCKPSTSEVEEIIHELVGNILQRFFKDD
Sbjct: 241 QKQVNMQMEHGKENRNDLLEYLRSLDSNMVTELCKPSTSEVEEIIHELVGNILQRFFKDD 300

Query: 301 ASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSC 360
           ASS+F EDSSVADLEKLAD G EFCDTVGTSRDYLAKLLFWCMLLGHH+RSLENRLQLSC
Sbjct: 301 ASSTFNEDSSVADLEKLADVGSEFCDTVGTSRDYLAKLLFWCMLLGHHLRSLENRLQLSC 360

Query: 361 VVGLL 365
           VVGLL
Sbjct: 361 VVGLL 360

BLAST of CsGy3G037020 vs. TAIR 10
Match: AT5G14970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14910.1); Has 579 Blast hits to 397 proteins in 95 species: Archae - 0; Bacteria - 294; Metazoa - 0; Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes - 199 (source: NCBI BLink). )

HSP 1 Score: 345.1 bits (884), Expect = 6.8e-95
Identity = 207/376 (55.05%), Postives = 261/376 (69.41%), Query Frame = 0

Query: 2   AVSARAFF-LSRLTDFSIKPRLPPQPPPPPPPLPSFSYSHLTLQRRRFPSASTSGATTVS 61
           A SARAFF LSR+TD S K  +  QPPP   P     Y+         P+ + S +  +S
Sbjct: 3   AASARAFFMLSRVTDLSKKKLILHQPPPSSSP-HRLPYA---------PNRAVSSSAVIS 62

Query: 62  CLVSGVDGGGVS--DDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMK 121
           CL     GGGVS  D +VSTR+ K DRGF+VIAN++ RI+PL TS ISKG+SD AKDSMK
Sbjct: 63  CL----SGGGVSSDDSYVSTRRSKLDRGFAVIANLVNRIQPLDTSVISKGLSDSAKDSMK 122

Query: 122 QTISSMLGLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNL 181
           QTISSMLGLLPSDQFSV+V +S+ PL+ LL SSIITGYTLWNAEYR+SL RNFDI  D  
Sbjct: 123 QTISSMLGLLPSDQFSVSVTISEQPLYRLLISSIITGYTLWNAEYRVSLRRNFDIPID-- 182

Query: 182 TGLDRSKPLEVSDIEENRVGVDSNM-EDLDT--------RPRLLSDLPPEALKYIQQLQT 241
               R +  + S  +  R G +  M EDL           P++  DL PEAL YIQ LQ+
Sbjct: 183 ---PRKEEEDQSSKDNVRFGSEKGMSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQS 242

Query: 242 ELSNLKDELNAQKQENIHIEHGRGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELV 301
           ELS++K+EL++QK++ + IE  +GNRNDLL+YLRSLD +MVTEL + S+ EVEEI+++LV
Sbjct: 243 ELSSMKEELDSQKKKALRIECEKGNRNDLLDYLRSLDPEMVTELSQLSSPEVEEIVNQLV 302

Query: 302 GNILQRFFKDDASSSFIEDSSVADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHM 361
            N+L+R F+D  +S+F+++  +    +  + GD     V TSRDYLAKLLFWCMLLGHH+
Sbjct: 303 QNVLERLFEDQTTSNFMQNPGI----RTTEGGDGTGRKVDTSRDYLAKLLFWCMLLGHHL 355

Query: 362 RSLENRLQLSCVVGLL 366
           R LENRL LSCVVGLL
Sbjct: 363 RGLENRLHLSCVVGLL 355

BLAST of CsGy3G037020 vs. TAIR 10
Match: AT2G14910.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G14970.1); Has 605 Blast hits to 425 proteins in 102 species: Archae - 0; Bacteria - 300; Metazoa - 25; Fungi - 0; Plants - 89; Viruses - 0; Other Eukaryotes - 191 (source: NCBI BLink). )

HSP 1 Score: 168.7 bits (426), Expect = 8.7e-42
Identity = 143/383 (37.34%), Postives = 201/383 (52.48%), Query Frame = 0

Query: 13  LTDFSIK-PRLPPQPPPPPP---PLPSFSYSHLTLQRRRFPSAS-TSGATTVSCLVSG-- 72
           L+ FS+  P+L  +P  P P    LP F+        RRF S + TS +TT S   S   
Sbjct: 6   LSSFSLSLPQLLHKPTKPLPFLFLLPRFN--------RRFRSLTITSSSTTSSNNFSSNC 65

Query: 73  VDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQTISSML 132
            D G   DDF      +  +   V++++++ IEPL  S I K V     D+MK+TIS ML
Sbjct: 66  GDDGFSLDDFTLHSDSRSPKK-CVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGML 125

Query: 133 GLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTGLDRSK 192
           GLLPSD+F V +     PL  LL SS++TGYTL NAEYRL L +N D+S   L     S+
Sbjct: 126 GLLPSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDS-HASE 185

Query: 193 PLEVSDIEENRVGVDSNMEDLDTRPRLLSD---------LPPEALKYIQQLQTELSNLKD 252
             E  D+E      D      D+R + LS+         +  EA +YI +LQ++LS++K 
Sbjct: 186 NTEY-DMEGTFPDEDHVSSKRDSRTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKK 245

Query: 253 ELNAQKQENIHIEHGR---GNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNIL 312
           EL   +++N  ++  +     +NDLL+YLRSL  + V EL +P+  EV+E IH +V  +L
Sbjct: 246 ELQEMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLL 305

Query: 313 QRFFKDDASSSFIEDSSVADLEKLADAGDEFC------------DTVGTSRDYLAKLLFW 365
                     S    S V   E +    DE C              +  +RDYLA+LLFW
Sbjct: 306 ATL--SPKMHSKFPASEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFW 365

BLAST of CsGy3G037020 vs. TAIR 10
Match: AT2G14910.2 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G14970.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 141.7 bits (356), Expect = 1.1e-33
Identity = 130/360 (36.11%), Postives = 183/360 (50.83%), Query Frame = 0

Query: 13  LTDFSIK-PRLPPQPPPPPP---PLPSFSYSHLTLQRRRFPSAS-TSGATTVSCLVSG-- 72
           L+ FS+  P+L  +P  P P    LP F+        RRF S + TS +TT S   S   
Sbjct: 6   LSSFSLSLPQLLHKPTKPLPFLFLLPRFN--------RRFRSLTITSSSTTSSNNFSSNC 65

Query: 73  VDGGGVSDDFVSTRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQTISSML 132
            D G   DDF      +  +   V++++++ IEPL  S I K V     D+MK+TIS ML
Sbjct: 66  GDDGFSLDDFTLHSDSRSPKK-CVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGML 125

Query: 133 GLLPSDQFSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTGLDRSK 192
           GLLPSD+F V +     PL  LL SS++TGYTL NAEYRL L +N D+S   L     S+
Sbjct: 126 GLLPSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDS-HASE 185

Query: 193 PLEVSDIEENRVGVDSNMEDLDTRPRLLSD---------LPPEALKYIQQLQTELSNLKD 252
             E  D+E      D      D+R + LS+         +  EA +YI +LQ++LS++K 
Sbjct: 186 NTEY-DMEGTFPDEDHVSSKRDSRTQNLSETIDEEGLGRVSSEAQEYILRLQSQLSSVKK 245

Query: 253 ELNAQKQENIHIEHGR---GNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNIL 312
           EL   +++N  ++  +     +NDLL+YLRSL  + V EL +P+  EV+E IH +V  +L
Sbjct: 246 ELQEMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLL 305

Query: 313 QRFFKDDASSSFIEDSSVADLEKLADAGDEFC------------DTVGTSRDYLAKLLFW 342
                     S    S V   E +    DE C              +  +RDYLA+LLFW
Sbjct: 306 ATL--SPKMHSKFPASEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFW 352

BLAST of CsGy3G037020 vs. TAIR 10
Match: AT1G63610.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14910.1); Has 537 Blast hits to 411 proteins in 100 species: Archae - 0; Bacteria - 231; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 212 (source: NCBI BLink). )

HSP 1 Score: 80.5 bits (197), Expect = 3.1e-15
Identity = 91/352 (25.85%), Postives = 152/352 (43.18%), Query Frame = 0

Query: 28  PPPPPLPSFSYSH-----LTLQRRRFPSASTSG--ATTVSCLVSGVDGGGVSDDFVS--- 87
           PP  P PSF  +H      T     FP  + +   + T + LV  V   G S D  +   
Sbjct: 14  PPSRPCPSFLANHEPKLSTTSSSVTFPLKTNTWKCSGTGNLLVLRVKAYGSSSDSSADSS 73

Query: 88  -----TRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQTISSMLGLLPSDQ 147
                TR+ K  R   ++   ++ ++P       K       ++M+QT+++M+G LP   
Sbjct: 74  TPPNGTRQPKSRR--DILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQF 133

Query: 148 FSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTGLDRSKPLEVSDI 207
           F+VTV      L  L+ S ++TGY   NA+YRL L ++ +        L   +  +  D 
Sbjct: 134 FAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLE-----QVALPEPRDQKGGD- 193

Query: 208 EENRVGVDSNMEDLDTRPRLLSDLPP-EALKYIQQLQTELSNLKDELNAQKQENIHIEHG 267
           E+   G   N+     R   +S     +A KYI+ L+ E+    +ELN Q          
Sbjct: 194 EDYAPGTQKNVSGEVIRWNNVSGPEKIDAKKYIELLEAEI----EELNRQVGRK-----S 253

Query: 268 RGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDDASSSFIEDSSV 327
              +N++LEYL+SL+   + EL   +  +V   ++  V  +L                  
Sbjct: 254 ANQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLL------------------ 313

Query: 328 ADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSCVVG 364
                ++D      +   TS   LAKLL+W M++G+ +R++E R  +  V+G
Sbjct: 314 ----AVSDPNQMKTNVTETSAADLAKLLYWLMVVGYSIRNIEVRFDMERVLG 326

BLAST of CsGy3G037020 vs. TAIR 10
Match: AT1G63610.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14910.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 78.6 bits (192), Expect = 1.2e-14
Identity = 90/352 (25.57%), Postives = 151/352 (42.90%), Query Frame = 0

Query: 28  PPPPPLPSFSYSH-----LTLQRRRFPSASTSG--ATTVSCLVSGVDGGGVSDDFVS--- 87
           PP  P PSF  +H      T     FP  + +   + T + LV  V   G S D  +   
Sbjct: 14  PPSRPCPSFLANHEPKLSTTSSSVTFPLKTNTWKCSGTGNLLVLRVKAYGSSSDSSADSS 73

Query: 88  -----TRKLKFDRGFSVIANMLKRIEPLHTSDISKGVSDVAKDSMKQTISSMLGLLPSDQ 147
                TR+    R   ++   ++ ++P       K       ++M+QT+++M+G LP   
Sbjct: 74  TPPNGTRQQPKSRR-DILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQF 133

Query: 148 FSVTVRVSKSPLHNLLSSSIITGYTLWNAEYRLSLMRNFDISPDNLTGLDRSKPLEVSDI 207
           F+VTV      L  L+ S ++TGY   NA+YRL L ++ +        L   +  +  D 
Sbjct: 134 FAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLE-----QVALPEPRDQKGGD- 193

Query: 208 EENRVGVDSNMEDLDTRPRLLSDLPP-EALKYIQQLQTELSNLKDELNAQKQENIHIEHG 267
           E+   G   N+     R   +S     +A KYI+ L+ E+    +ELN Q          
Sbjct: 194 EDYAPGTQKNVSGEVIRWNNVSGPEKIDAKKYIELLEAEI----EELNRQVGRK-----S 253

Query: 268 RGNRNDLLEYLRSLDSDMVTELCKPSTSEVEEIIHELVGNILQRFFKDDASSSFIEDSSV 327
              +N++LEYL+SL+   + EL   +  +V   ++  V  +L                  
Sbjct: 254 ANQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLL------------------ 313

Query: 328 ADLEKLADAGDEFCDTVGTSRDYLAKLLFWCMLLGHHMRSLENRLQLSCVVG 364
                ++D      +   TS   LAKLL+W M++G+ +R++E R  +  V+G
Sbjct: 314 ----AVSDPNQMKTNVTETSAADLAKLLYWLMVVGYSIRNIEVRFDMERVLG 327

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_011652254.16.07e-256100.00uncharacterized protein LOC101208572 [Cucumis sativus] >KGN59616.1 hypothetical ... [more]
XP_008443458.13.20e-24696.99PREDICTED: uncharacterized protein LOC103487044 [Cucumis melo] >KAA0053716.1 unc... [more]
XP_038905998.18.58e-23592.88uncharacterized protein LOC120091907 [Benincasa hispida][more]
XP_023522197.17.48e-22690.14uncharacterized protein LOC111786074 [Cucurbita pepo subsp. pepo] >XP_023526062.... [more]
XP_022935548.16.14e-22589.86uncharacterized protein LOC111442389 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A0A0LHU82.94e-256100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G829060 PE=4 SV=1[more]
A0A5A7UHK41.55e-24696.99Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3B8531.55e-24696.99uncharacterized protein LOC103487044 OS=Cucumis melo OX=3656 GN=LOC103487044 PE=... [more]
A0A6J1FAU62.97e-22589.86uncharacterized protein LOC111442389 OS=Cucurbita moschata OX=3662 GN=LOC1114423... [more]
A0A6J1J6Z68.51e-22589.59uncharacterized protein LOC111482318 OS=Cucurbita maxima OX=3661 GN=LOC111482318... [more]
Match NameE-valueIdentityDescription
AT5G14970.16.8e-9555.05unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G14910.18.7e-4237.34unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXP... [more]
AT2G14910.21.1e-3336.11unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 24 plant structures; EXP... [more]
AT1G63610.13.1e-1525.85unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G63610.21.2e-1425.57unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 224..244
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 20..36
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 19..38
NoneNo IPR availablePANTHERPTHR33598:SF10HOP-INTERACTING PROTEIN THI043coord: 1..365
NoneNo IPR availablePANTHERPTHR33598OS02G0833400 PROTEINcoord: 1..365
IPR008479Protein of unknown function DUF760PFAMPF05542DUF760coord: 92..171
e-value: 1.5E-17
score: 63.7
coord: 257..361
e-value: 1.0E-25
score: 89.9

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy3G037020.2CsGy3G037020.2mRNA