CsGy3G018830 (gene) Cucumber (Gy14) v2

Name: CsGy3G018830
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: glutelin type-A 2-like
Location: Chr3 : 14966462 .. 14967802 (-)
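The location field can be read mechanically. The sketch below parses the coordinate string and computes the span of the feature. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Chr3 : 14966462 .. 14967802 (-)' into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr3 : 14966462 .. 14967802 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 1341 bp on the minus strand of Chr3.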
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGCAATGAATCCCAAGCCTTTCTTTGAGGGAGAAGGTGGTTCATATCACAAATGGCTGCCTTCTGACTATCCCTTGCTGGCTCAGACTAACGTGGCCGGCGGCCGCCTTCTCCTCCGCCCTCGAGGCTTCGCTGTTCCTCACTATTCTGATTGCTCTAAATTTGGCTATGTTCTTCAAGGTAACCTCTCTCCTTTTTTACTCGTTTGATGGCTATTTGAATTTTATTTTGTACATATTGAATAAGGGTTTCTTCATACGTCTTCTAAATTGTTGTGTGATATATTTAGTTGAGTTTGAATTCTCTAATTAATTTTTTAATGAGTTATGATAAAAATTAATTATATTAACAGCACTCAATTAGTAGTAGACCTTCATACTTGTTTTACGACGGTATTTTTAGGTGAGGATGGAGTTACAGGATTCGTGTTTCCAAAAAAATGCAACGAGGTGGTAATAAAGCTAAAGAAAGGAGATCTGATCCCAGTGCCAGCTGGAGTCACGTCGTGGTGGTTCAATGACGGAGACTCTGATTTGGAAATCATCTTTTTGGGTGAAACCAAAAGGGCTCATGTCCCCGGTGACATTACATATTTCATTCTCTCTGGCCCTCGCGGTCTCCTACAAGGCTTCACGCCAGAGTATGTCCAAAAAAGTTGCTCTCTAAACCAGGAAGAAACAAACACATTCCTCAAAAGCCAACCCAATGTCCTAATCTTTACCGTTCAACCATCCCAATCCCTCCCCAAACCCCACAAATATAGCAAACTAGTTTACAACATTGATGCAGCCGCACCGGACAACAGAGCCAAAGTTGGCGACGCTGCCGTTACAATGGTGACGGAATCCACATTTCCATTCATTGGTCAAACTGGGTTGACGCCAGTTCTCGAAAAGCTCGACGCCAATGCCATCCGTTCCCCAGTCTACATTGCTGAGCCGTCCGACCAACTGATCTACGTGACTAAAGGATCCGGGAAGATTCAGGTCGTCGGATTTTCGAGTAAATTTGATGCAGATGTGAAAACAGGTCAGCTGATTTTGGTCCCTCGATACTTCGCCGTCGGGAAAATCGCCGGAGAAGAAGGCTTGGAGTGCATTTCCATGATTGTAGCAACACAGTAATGAAAAGCTTCAACTATTTATTTTTATTTTATTTGGGTTTTTTTTCTTCTGAATATTAAAAGTTATAACATTTCTGATATTATATTGAACAGTCCTATGGTGGAAGAATTGGCCGGAAAGACATCTGTTTTGGAGGCATTGTCGTCGGAGGTTTTTCAAGTTTCTTTCAATGTCACGGCGGAGTTTGAGAAGCTCTTTAGGTCAAAGGTTTAA

mRNA sequence

ATGGAGGCAATGAATCCCAAGCCTTTCTTTGAGGGAGAAGGTGGTTCATATCACAAATGGCTGCCTTCTGACTATCCCTTGCTGGCTCAGACTAACGTGGCCGGCGGCCGCCTTCTCCTCCGCCCTCGAGGCTTCGCTGTTCCTCACTATTCTGATTGCTCTAAATTTGGCTATGTTCTTCAAGGTGAGGATGGAGTTACAGGATTCGTGTTTCCAAAAAAATGCAACGAGGTGGTAATAAAGCTAAAGAAAGGAGATCTGATCCCAGTGCCAGCTGGAGTCACGTCGTGGTGGTTCAATGACGGAGACTCTGATTTGGAAATCATCTTTTTGGGTGAAACCAAAAGGGCTCATGTCCCCGGTGACATTACATATTTCATTCTCTCTGGCCCTCGCGGTCTCCTACAAGGCTTCACGCCAGAGTATGTCCAAAAAAGTTGCTCTCTAAACCAGGAAGAAACAAACACATTCCTCAAAAGCCAACCCAATGTCCTAATCTTTACCGTTCAACCATCCCAATCCCTCCCCAAACCCCACAAATATAGCAAACTAGTTTACAACATTGATGCAGCCGCACCGGACAACAGAGCCAAAGTTGGCGACGCTGCCGTTACAATGGTGACGGAATCCACATTTCCATTCATTGGTCAAACTGGGTTGACGCCAGTTCTCGAAAAGCTCGACGCCAATGCCATCCGTTCCCCAGTCTACATTGCTGAGCCGTCCGACCAACTGATCTACGTGACTAAAGGATCCGGGAAGATTCAGGTCGTCGGATTTTCGAGTAAATTTGATGCAGATGTGAAAACAGGTCAGCTGATTTTGGTCCCTCGATACTTCGCCGTCGGGAAAATCGCCGGAGAAGAAGGCTTGGAGTGCATTTCCATGATTGTAGCAACACATCCTATGGTGGAAGAATTGGCCGGAAAGACATCTGTTTTGGAGGCATTGTCGTCGGAGGTTTTTCAAGTTTCTTTCAATGTCACGGCGGAGTTTGAGAAGCTCTTTAGGTCAAAGGTTTAA

Coding sequence (CDS)

ATGGAGGCAATGAATCCCAAGCCTTTCTTTGAGGGAGAAGGTGGTTCATATCACAAATGGCTGCCTTCTGACTATCCCTTGCTGGCTCAGACTAACGTGGCCGGCGGCCGCCTTCTCCTCCGCCCTCGAGGCTTCGCTGTTCCTCACTATTCTGATTGCTCTAAATTTGGCTATGTTCTTCAAGGTGAGGATGGAGTTACAGGATTCGTGTTTCCAAAAAAATGCAACGAGGTGGTAATAAAGCTAAAGAAAGGAGATCTGATCCCAGTGCCAGCTGGAGTCACGTCGTGGTGGTTCAATGACGGAGACTCTGATTTGGAAATCATCTTTTTGGGTGAAACCAAAAGGGCTCATGTCCCCGGTGACATTACATATTTCATTCTCTCTGGCCCTCGCGGTCTCCTACAAGGCTTCACGCCAGAGTATGTCCAAAAAAGTTGCTCTCTAAACCAGGAAGAAACAAACACATTCCTCAAAAGCCAACCCAATGTCCTAATCTTTACCGTTCAACCATCCCAATCCCTCCCCAAACCCCACAAATATAGCAAACTAGTTTACAACATTGATGCAGCCGCACCGGACAACAGAGCCAAAGTTGGCGACGCTGCCGTTACAATGGTGACGGAATCCACATTTCCATTCATTGGTCAAACTGGGTTGACGCCAGTTCTCGAAAAGCTCGACGCCAATGCCATCCGTTCCCCAGTCTACATTGCTGAGCCGTCCGACCAACTGATCTACGTGACTAAAGGATCCGGGAAGATTCAGGTCGTCGGATTTTCGAGTAAATTTGATGCAGATGTGAAAACAGGTCAGCTGATTTTGGTCCCTCGATACTTCGCCGTCGGGAAAATCGCCGGAGAAGAAGGCTTGGAGTGCATTTCCATGATTGTAGCAACACATCCTATGGTGGAAGAATTGGCCGGAAAGACATCTGTTTTGGAGGCATTGTCGTCGGAGGTTTTTCAAGTTTCTTTCAATGTCACGGCGGAGTTTGAGAAGCTCTTTAGGTCAAAGGTTTAA

Protein sequence

MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPGDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKYSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEPSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV
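The protein sequence is the translation of the coding sequence listed above it. As a minimal sketch of that relationship, the code below translates the first eight codons of the CDS with the standard genetic code. It assumes the standard nuclear code applies and that translation starts at the first base; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS shown in this record.
print(translate("ATGGAGGCAATGAATCCCAAGCCT"))
```

The eight residues produced match the start of the protein sequence in this record (MEAMNPKP).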
BLAST of CsGy3G018830 vs. NCBI nr
Match: XP_004151504.1 (PREDICTED: legumin J [Cucumis sativus] >KGN57580.1 hypothetical protein Csa_3G218160 [Cucumis sativus])

HSP 1 Score: 692.2 bits (1785), Expect = 8.9e-196
Identity = 340/340 (100.00%), Positives = 340/340 (100.00%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP
Sbjct: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK
Sbjct: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
           YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE
Sbjct: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240

Query: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT 300
           PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT 300

Query: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV
Sbjct: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of CsGy3G018830 vs. NCBI nr
Match: XP_008456076.1 (PREDICTED: glutelin type-A 2-like [Cucumis melo])

HSP 1 Score: 653.7 bits (1685), Expect = 3.5e-184
Identity = 322/340 (94.71%), Positives = 329/340 (96.76%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           MEAMNPKPFFEGEGGSY KWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHY+DCSKFGYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVL 60

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           QGEDGVTGFVFP KCNEVV+KLKKGDLIPVP+G+TSWWFNDGDSDLEIIFLGETK AHVP
Sbjct: 61  QGEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVP 120

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDITYFILSGPRGLLQGF PEYVQKS SL+QEETN FLKSQ NVLIFTVQPSQSLPKPHK
Sbjct: 121 GDITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHK 180

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
           +SKLVYNIDAA PDNRAKVG AAVTMVTESTFPFIGQTGLT VLEKLDANAIRSPVYIAE
Sbjct: 181 HSKLVYNIDAAVPDNRAKVGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIAE 240

Query: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT 300
           PSDQLIYVTKGSGKIQVVGFSSKFDADVK GQLILVPRYFAVGK+AGEEGLECISMIVAT
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKIGQLILVPRYFAVGKMAGEEGLECISMIVAT 300

Query: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV
Sbjct: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340
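The percentages in each HSP summary are the match count divided by the alignment length. The sketch below recomputes the identity percentage from the counts of the Cucumis melo hit above. The helper `percent_identity` is hypothetical and only reads the `Identity = a/b` part of the line.

```python
import re

def percent_identity(summary_line):
    """Return (matches, alignment_length, percent) from an 'Identity = a/b' field."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", summary_line)
    if m is None:
        raise ValueError("no 'Identity = a/b' field found")
    matches, length = int(m.group(1)), int(m.group(2))
    # Percent identity is matches over alignment length, rounded to two decimals.
    return matches, length, round(100 * matches / length, 2)

print(percent_identity("Identity = 322/340 (94.71%)"))
```

The recomputed value agrees with the 94.71% reported for this hit.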

BLAST of CsGy3G018830 vs. NCBI nr
Match: XP_022922755.1 (legumin J-like [Cucurbita moschata])

HSP 1 Score: 547.0 bits (1408), Expect = 4.6e-152
Identity = 264/339 (77.88%), Positives = 293/339 (86.43%), Query Frame = 0

Query: 2   EAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQ 61
           + MNPKPF E E GSYHKWLPS+YPLLAQ  VA GRLLLRPRGF VPHY+DCSK GYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAQNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPG 121
           GE+GV G VFP K +EVV+ LKKGDLIPVP GV+SWWFNDGDSDLEIIFLGE+K AHVPG
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKY 181
           DI+YF+LSGP  LL GF+PEYV K+ SLN EET  FLKSQ N LIF++Q +QSLPKP KY
Sbjct: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPSKY 182

Query: 182 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 241
           SK VYNIDAAAPD R K G  AVT VTES FPFIGQ+GLT +LEKL+ANA+RSPVY+AEP
Sbjct: 183 SKFVYNIDAAAPDGRVKGGAGAVTTVTESKFPFIGQSGLTAILEKLNANAVRSPVYVAEP 242

Query: 242 SDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 301
            DQLIYV KG GKIQ+VG SSK DA+VK GQLILVP++FAVGKIAGE+GLECIS+I ATH
Sbjct: 243 YDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITATH 302

Query: 302 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           P+VEELAGKTSVLEALS E+FQVSFNVTAEFEKL RSK+
Sbjct: 303 PVVEELAGKTSVLEALSPEIFQVSFNVTAEFEKLLRSKI 341

BLAST of CsGy3G018830 vs. NCBI nr
Match: XP_022985328.1 (12S seed storage protein CRD-like [Cucurbita maxima])

HSP 1 Score: 544.7 bits (1402), Expect = 2.3e-151
Identity = 264/339 (77.88%), Positives = 292/339 (86.14%), Query Frame = 0

Query: 2   EAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQ 61
           + MNPKPF E E GSYHKWLPS+YPLLA   VA GRLLLRPRGF VPHY+DCSK GYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLAHNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPG 121
           GE+GV G VFP K +EVV+ LKKGDLIPVP GV+SWWFNDGDSDLEIIFLGE+K AHVPG
Sbjct: 63  GENGVAGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKY 181
           DI+YF+LSG   LL GF+PEYV ++ SLN EET  FLKSQ N LIF++Q +QSLPKP KY
Sbjct: 123 DISYFVLSGILSLLNGFSPEYVGETYSLNGEETTQFLKSQSNALIFSIQQTQSLPKPPKY 182

Query: 182 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 241
           SK VYNIDAAAPD R K G  AVT VTES FPFIGQ+GLT +LEKLDANA+RSPVY+AEP
Sbjct: 183 SKFVYNIDAAAPDGRVKGGAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEP 242

Query: 242 SDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 301
            DQLIYV KG GKIQ+VGFSSK DA+VK GQLILVP++FAVGKIAGE+GLECIS+I ATH
Sbjct: 243 YDQLIYVAKGRGKIQIVGFSSKIDAEVKMGQLILVPKFFAVGKIAGEDGLECISIITATH 302

Query: 302 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           P+VEELAGKTSVLEALS EVFQVSFNVTAEFEKL RSK+
Sbjct: 303 PVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKI 341

BLAST of CsGy3G018830 vs. NCBI nr
Match: XP_023552908.1 (12S seed storage globulin 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 538.9 bits (1387), Expect = 1.3e-149
Identity = 261/339 (76.99%), Positives = 290/339 (85.55%), Query Frame = 0

Query: 2   EAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQ 61
           + MNPKPF E E GSYHKWLPS+YPLLA+  VA GRLLLRPRGF VPHY+DCSK GYVLQ
Sbjct: 3   QPMNPKPFTEVEAGSYHKWLPSEYPLLARNKVAAGRLLLRPRGFVVPHYADCSKVGYVLQ 62

Query: 62  GEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPG 121
           GE+GV G VFP K +EVV+ LKKGDLIPVP GV+SWWFNDGDSDLEIIFLGE+K AHVPG
Sbjct: 63  GENGVVGLVFPSKSDEVVVNLKKGDLIPVPNGVSSWWFNDGDSDLEIIFLGESKNAHVPG 122

Query: 122 DITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKY 181
           DI+YF+LSGP  LL GF+PEYV K+ SLN EET  FLKSQ N LI ++Q +QSLPKP K+
Sbjct: 123 DISYFVLSGPLSLLHGFSPEYVGKTYSLNGEETTQFLKSQSNALICSIQQTQSLPKPPKF 182

Query: 182 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 241
           SK VYNIDAAAPD R K    AVT VTES FPFIGQ+GLT +LEKLDANA+RSPVY+AEP
Sbjct: 183 SKFVYNIDAAAPDGRVKGSAGAVTTVTESKFPFIGQSGLTAILEKLDANAVRSPVYVAEP 242

Query: 242 SDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 301
            DQLIYV KG GKIQ+VG SSK DA+VK GQLILVP++FAVGK AGE+GLECIS+I ATH
Sbjct: 243 YDQLIYVAKGRGKIQIVGSSSKIDAEVKMGQLILVPKFFAVGKFAGEDGLECISIITATH 302

Query: 302 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           P+VEELAGKTSVLEALS EVFQVSFNVTAEFEKL RSK+
Sbjct: 303 PVVEELAGKTSVLEALSPEVFQVSFNVTAEFEKLLRSKI 341

BLAST of CsGy3G018830 vs. TAIR10
Match: AT1G07750.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 249.2 bits (635), Expect = 3.6e-66
Identity = 133/337 (39.47%), Positives = 192/337 (56.97%), Query Frame = 0

Query: 6   PKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQGEDG 65
           PK  + G+GGSY  W P + P+L Q N+   +L L   GFAVP YSD SK  YVLQG  G
Sbjct: 10  PKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKVAYVLQG-SG 69

Query: 66  VTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPGDITY 125
             G V P+K  E VI +K+GD I +P GV +WWFN+ D +L I+FLGET + H  G  T 
Sbjct: 70  TAGIVLPEK-EEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKGHKAGQFTE 129

Query: 126 FILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKYSK-- 185
           F L+G  G+  GF+ E+V ++  L++      + SQ    I  +     +P+P + ++  
Sbjct: 130 FYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQPKEENRAG 189

Query: 186 LVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEPSD 245
            V N   A  D   K G   V + T++  P +G+ G    L ++DA+++ SP +  + + 
Sbjct: 190 FVLNCLEAPLDVDIKDGGRVVVLNTKN-LPLVGEVGFGADLVRIDAHSMCSPGFSCDSAL 249

Query: 246 QLIYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 305
           Q+ Y+  GSG++QVVG   K   +  +K G L +VPR+F V KIA  +G+   S++    
Sbjct: 250 QVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWFSIVTTPD 309

Query: 306 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRS 339
           P+   LAG TSV ++LS EV Q +F V  E EK FRS
Sbjct: 310 PIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRS 343

BLAST of CsGy3G018830 vs. TAIR10
Match: AT2G28680.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 248.8 bits (634), Expect = 4.7e-66
Identity = 134/338 (39.64%), Positives = 191/338 (56.51%), Query Frame = 0

Query: 6   PKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQGEDG 65
           PK  + G+GGSY  W P + P+L   N+   +L L   G A+P YSD  K  YVLQG  G
Sbjct: 10  PKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKVAYVLQGA-G 69

Query: 66  VTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPGDITY 125
             G V P+K  E VI +KKGD I +P GV +WWFN+ D++L ++FLGET + H  G  T 
Sbjct: 70  TAGIVLPEK-EEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKGHKAGQFTD 129

Query: 126 FILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHKYSK-- 185
           F L+G  G+  GF+ E+V ++  L++      + SQ    I  V  S  +P+P K  +  
Sbjct: 130 FYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPEPKKGDRKG 189

Query: 186 LVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEPSD 245
            V N   A  D   K G   V + T++  P +G+ G    L ++D +++ SP +  + + 
Sbjct: 190 FVLNCLEAPLDVDIKDGGRVVVLNTKN-LPLVGEVGFGADLVRIDGHSMCSPGFSCDSAL 249

Query: 246 QLIYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATH 305
           Q+ Y+  GSG++Q+VG   K   +  VK G L +VPR+F V KIA  +GL   S++    
Sbjct: 250 QVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWFSIVTTPD 309

Query: 306 PMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSK 340
           P+   LAG+TSV +ALS EV Q +F V  E EK FRSK
Sbjct: 310 PIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSK 344

BLAST of CsGy3G018830 vs. TAIR10
Match: AT1G03890.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 102.8 bits (255), Expect = 4.2e-22
Identity = 98/395 (24.81%), Positives = 157/395 (39.75%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           + ++ P    + E G    W     P L    V   R+ L+P    +P +       YV+
Sbjct: 41  INSLAPAQATKFEAGQMEVW-DHMSPELRCAGVTVARITLQPNSIFLPAFFSPPALAYVV 100

Query: 61  QGEDGVTGFV---FPKKCNEV----------------------VIKLKKGDLIPVPAGVT 120
           QGE GV G +    P+   EV                      +   ++GD+    AGV+
Sbjct: 101 QGE-GVMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVS 160

Query: 121 SWWFNDGDSD-LEIIFLGETKRAHVPGDI-TYFILSGPR--------------GLLQGFT 180
            WW+N GDSD + +I L  T R +    +   F L+G R                  GF 
Sbjct: 161 QWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFD 220

Query: 181 PEYVQKSCSLNQEETNTFLKSQPN---------VLIFTVQP---------SQSLPKPHKY 240
           P  + ++  +N E        + N          L F + P         +  + + +  
Sbjct: 221 PNIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCT 280

Query: 241 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 300
           +K+  NID     +        ++ +     P +    L  +   L +  +  P + A  
Sbjct: 281 AKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTAN- 340

Query: 301 SDQLIYVTKGSGKIQVV--GFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVA 335
           +  ++YVT G  KIQVV     S F+  V  GQ+I++P+ FAV K AGE G E IS    
Sbjct: 341 AHTVLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTN 400

BLAST of CsGy3G018830 vs. TAIR10
Match: AT1G03880.1 (cruciferin 2)

HSP 1 Score: 101.7 bits (252), Expect = 9.3e-22
Identity = 88/400 (22.00%), Positives = 166/400 (41.50%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           + A+ P    + EGG    W     P L  +  A  R ++ P+G  +P + +  K  +V+
Sbjct: 35  LNALEPSQIIKSEGGRIEVW-DHHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVV 94

Query: 61  QGEDGVTGFVFP------------------------KKCNEVVIKLKKGDLIPVPAGVTS 120
            G  G+ G V P                        +  ++ V  L+ GD I  P+GV  
Sbjct: 95  HGR-GLMGRVIPGCAETFMESPVFGEGXXXXXXXGFRDMHQKVEHLRCGDTIATPSGVAQ 154

Query: 121 WWFNDGDSDLEIIFLGE--TKRAHVPGDITYFILSG--PRG--------------LLQGF 180
           W++N+G+  L ++   +  + +  +  ++  F+++G  P+G              +  GF
Sbjct: 155 WFYNNGNEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNIFNGF 214

Query: 181 TPEYVQKSCSLNQEETNTFLKSQPN------------VLIFTVQPSQSLPKPHKYS---- 240
            PE + ++  +N E        Q N            V+   ++  +   +PH+ +    
Sbjct: 215 APEILAQAFKINVETAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPHEIANGLE 274

Query: 241 ------KLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPV 300
                 +   N+D  +  +  K     ++ +     P +    L+ +   +  NA+  P 
Sbjct: 275 ETLCTMRCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLLRLSALRGSIRKNAMVLPQ 334

Query: 301 YIAEPSDQLIYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAGEEGLECI 335
           +    ++  +YVT G   IQ+V  + +  FD ++ +GQL++VP+ F+V K A  E  E I
Sbjct: 335 WNVN-ANAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGEQFEWI 394

BLAST of CsGy3G018830 vs. TAIR10
Match: AT5G44120.3 (RmlC-like cupins superfamily protein)

HSP 1 Score: 83.2 bits (204), Expect = 3.4e-16
Identity = 85/404 (21.04%), Positives = 155/404 (38.37%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           + A+ P    + E G    W     P L  + V+  R ++  +G  +P + + +K  +V 
Sbjct: 41  LNALEPSHVLKSEAGRIEVW-DHHAPQLRCSGVSFARYIIESKGLYLPSFFNTAKLSFVA 100

Query: 61  QGEDGVTGFVFP-------------------------KKCNEVVIKLKKGDLIPVPAGVT 120
           +G  G+ G V P                         +  ++ V  ++ GD I    GV 
Sbjct: 101 KGR-GLMGKVIPGCAETFQDSSEFQPRFEGQGQSQRFRDMHQKVEHIRSGDTIATTPGVA 160

Query: 121 SWWFNDGDSDLEIIFLGE--TKRAHVPGDITYFILSG--PRG--------------LLQG 180
            W++NDG   L I+ + +  + +  +  +   F L+G  P+G              +  G
Sbjct: 161 QWFYNDGQEPLVIVSVFDLASHQNQLDRNPRPFYLAGNNPQGQVWLQGREQQPQKNIFNG 220

Query: 181 FTPEYVQKSCSLNQEETNTFLKSQPN-VLIFTVQPSQSLPKPHK---------------- 240
           F PE + ++  ++ +          N   I  VQ    + +P                  
Sbjct: 221 FGPEVIAQALKIDLQTAQQLQNQDDNRGNIVRVQGPFGVIRPPLRGQXXXXXXXXXXXXX 280

Query: 241 -----------YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDA 300
                       ++   N+D  +  +  K     ++ +     P +    L+ +   +  
Sbjct: 281 XXXXGLEETICSARCTDNLDDPSRADVYKPQLGYISTLNSYDLPILRFIRLSALRGSIRQ 340

Query: 301 NAIRSPVYIAEPSDQLIYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAG 332
           NA+  P + A  ++ ++YVT G  +IQ+V  +    FD  V  GQLI VP+ F+V K A 
Sbjct: 341 NAMVLPQWNAN-ANAILYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQGFSVVKRAT 400

BLAST of CsGy3G018830 vs. Swiss-Prot
Match: sp|P07728|GLUA1_ORYSJ (Glutelin type-A 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA1 PE=1 SV=2)

HSP 1 Score: 104.4 bits (259), Expect = 2.6e-21
Identity = 86/388 (22.16%), Positives = 164/388 (42.27%), Query Frame = 0

Query: 31  TNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQGEDGVTGFVFP------------------ 90
           T V+  R ++ PRG  +PHY++ +   Y++QG  G+TG  FP                  
Sbjct: 80  TGVSVVRRVIEPRGLLLPHYTNGASLVYIIQGR-GITGPTFPGCPESYQQQFQQSGQAQL 139

Query: 91  ----------KKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPGD 150
                     K  ++ + + ++GD+I +PAGV  W +NDG+  +  I++ +        D
Sbjct: 140 TESQSQSQKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDLNNGANQLD 199

Query: 151 ITY--FILSG---------------PRGLLQGFTPEYVQKSCSLNQEETNTF-LKSQPNV 210
                F+L+G                + +  GF+ E + ++  ++ +       ++    
Sbjct: 200 PRQRDFLLAGNKRNPQAYRREVEERSQNIFSGFSTELLSEALGVSSQVARQLQCQNDQRG 259

Query: 211 LIFTVQPSQSLPKPH--------------------KYSKLVY------------------ 270
            I  V+   SL +P+                    +Y +  Y                  
Sbjct: 260 EIVRVEHGLSLLQPYASLQEQEQGQVQSRERYQEGQYQQSQYGSGCSNGLDETFCTLRVR 319

Query: 271 -NIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEPSDQL 330
            NID     +        VT +    FP +    ++ V   L  NA+ SP +    +  +
Sbjct: 320 QNIDNPNRADTYNPRAGRVTNLNTQNFPILSLVQMSAVKVNLYQNALLSPFWNIN-AHSV 379

Query: 331 IYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATHPM 332
           +Y+T+G  ++QVV  + K  F+ +++ GQL+++P+++AV K A  EG   I+     + M
Sbjct: 380 VYITQGRARVQVVNNNGKTVFNGELRRGQLLIIPQHYAVVKKAQREGCAYIAFKTNPNSM 439

BLAST of CsGy3G018830 vs. Swiss-Prot
Match: sp|P07730|GLUA2_ORYSJ (Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 3.4e-21
Identity = 88/388 (22.68%), Positives = 162/388 (41.75%), Query Frame = 0

Query: 31  TNVAGGRLLLRPRGFAVPHYSDCSKFGYVLQGEDGVTGFVFP------------------ 90
           T V+  R ++ PRG  +PHY++ +   Y++QG  G+TG  FP                  
Sbjct: 80  TGVSVVRRVIEPRGLLLPHYTNGASLVYIIQGR-GITGPTFPGCPETYQQQFQQSGQAQL 139

Query: 91  ----------KKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVPGD 150
                     K  ++ + + ++GD+I +PAGV  W +NDG+  +  I++ +        D
Sbjct: 140 TESQSQSHKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDINNGANQLD 199

Query: 151 ITY--FILSG---------------PRGLLQGFTPEYVQKSCSL-NQEETNTFLKSQPNV 210
                F+L+G                + +  GF+ E + ++  + NQ       ++    
Sbjct: 200 PRQRDFLLAGNKRNPQAYRREVEEWSQNIFSGFSTELLSEAFGISNQVARQLQCQNDQRG 259

Query: 211 LIFTVQPSQSLPKPHK--------------------YSKLVY------------------ 270
            I  V+   SL +P+                     Y +  Y                  
Sbjct: 260 EIVRVERGLSLLQPYASLQEQEQGQMQSREHYQEGGYQQSQYGSGCPNGLDETFCTMRVR 319

Query: 271 -NIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEPSDQL 330
            NID     +        VT +    FP +    ++ V   L  NA+ SP +    +  +
Sbjct: 320 QNIDNPNRADTYNPRAGRVTNLNSQNFPILNLVQMSAVKVNLYQNALLSPFWNIN-AHSI 379

Query: 331 IYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVATHPM 332
           +Y+T+G  ++QVV  + K  F+ +++ GQL++VP+++ V K A  EG   I+     + M
Sbjct: 380 VYITQGRAQVQVVNNNGKTVFNGELRRGQLLIVPQHYVVVKKAQREGCAYIAFKTNPNSM 439

BLAST of CsGy3G018830 vs. Swiss-Prot
Match: sp|Q9ZWA9|CRU4_ARATH (12S seed storage protein CRD OS=Arabidopsis thaliana OX=3702 GN=CRD PE=1 SV=1)

HSP 1 Score: 102.8 bits (255), Expect = 7.5e-21
Identity = 98/395 (24.81%), Positives = 157/395 (39.75%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           + ++ P    + E G    W     P L    V   R+ L+P    +P +       YV+
Sbjct: 41  INSLAPAQATKFEAGQMEVW-DHMSPELRCAGVTVARITLQPNSIFLPAFFSPPALAYVV 100

Query: 61  QGEDGVTGFV---FPKKCNEV----------------------VIKLKKGDLIPVPAGVT 120
           QGE GV G +    P+   EV                      +   ++GD+    AGV+
Sbjct: 101 QGE-GVMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVS 160

Query: 121 SWWFNDGDSD-LEIIFLGETKRAHVPGDI-TYFILSGPR--------------GLLQGFT 180
            WW+N GDSD + +I L  T R +    +   F L+G R                  GF 
Sbjct: 161 QWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFD 220

Query: 181 PEYVQKSCSLNQEETNTFLKSQPN---------VLIFTVQP---------SQSLPKPHKY 240
           P  + ++  +N E        + N          L F + P         +  + + +  
Sbjct: 221 PNIIAEAFKINIETAKQLQNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCT 280

Query: 241 SKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAEP 300
           +K+  NID     +        ++ +     P +    L  +   L +  +  P + A  
Sbjct: 281 AKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTAN- 340

Query: 301 SDQLIYVTKGSGKIQVV--GFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVA 335
           +  ++YVT G  KIQVV     S F+  V  GQ+I++P+ FAV K AGE G E IS    
Sbjct: 341 AHTVLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTN 400

BLAST of CsGy3G018830 vs. Swiss-Prot
Match: sp|P15456|CRU2_ARATH (12S seed storage protein CRB OS=Arabidopsis thaliana OX=3702 GN=CRB PE=1 SV=2)

HSP 1 Score: 101.7 bits (252), Expect = 1.7e-20
Identity = 88/400 (22.00%), Positives = 166/400 (41.50%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           + A+ P    + EGG    W     P L  +  A  R ++ P+G  +P + +  K  +V+
Sbjct: 35  LNALEPSQIIKSEGGRIEVW-DHHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVV 94

Query: 61  QGEDGVTGFVFP------------------------KKCNEVVIKLKKGDLIPVPAGVTS 120
            G  G+ G V P                        +  ++ V  L+ GD I  P+GV  
Sbjct: 95  HGR-GLMGRVIPGCAETFMESPVFGEGXXXXXXXGFRDMHQKVEHLRCGDTIATPSGVAQ 154

Query: 121 WWFNDGDSDLEIIFLGE--TKRAHVPGDITYFILSG--PRG--------------LLQGF 180
           W++N+G+  L ++   +  + +  +  ++  F+++G  P+G              +  GF
Sbjct: 155 WFYNNGNEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNIFNGF 214

Query: 181 TPEYVQKSCSLNQEETNTFLKSQPN------------VLIFTVQPSQSLPKPHKYS---- 240
            PE + ++  +N E        Q N            V+   ++  +   +PH+ +    
Sbjct: 215 APEILAQAFKINVETAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPHEIANGLE 274

Query: 241 ------KLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPV 300
                 +   N+D  +  +  K     ++ +     P +    L+ +   +  NA+  P 
Sbjct: 275 ETLCTMRCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLLRLSALRGSIRKNAMVLPQ 334

Query: 301 YIAEPSDQLIYVTKGSGKIQVVGFSSK--FDADVKTGQLILVPRYFAVGKIAGEEGLECI 335
           +    ++  +YVT G   IQ+V  + +  FD ++ +GQL++VP+ F+V K A  E  E I
Sbjct: 335 WNVN-ANAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGEQFEWI 394

BLAST of CsGy3G018830 vs. Swiss-Prot
Match: sp|O23880|13S2_FAGES (13S globulin seed storage protein 2 OS=Fagopyrum esculentum OX=3617 GN=FA18 PE=2 SV=1)

HSP 1 Score: 100.5 bits (249), Expect = 3.7e-20
Identity = 105/428 (24.53%), Positives = 166/428 (38.79%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           + A  P      E G    W   D P    T     R++++P G  +P YS+     +V 
Sbjct: 51  LTASEPSRRVRSEAGVTEIW-DHDTPEFRCTGFVAVRVVIQPGGLLLPSYSNAPYITFVE 110

Query: 61  QGEDGVTGFVFPKKCNEV-----------------------------------VIKLKKG 120
           QG  GV G V P  C E                                    + ++++G
Sbjct: 111 QGR-GVQGVVIP-GCPETFQSDSEFEYPQSQRGRHSRQSESEEESSRGDQHQKIFRIREG 170

Query: 121 DLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAH--VPGDITYFILSGP------------ 180
           D+IP PAGV  W  NDG+ DL  + L +    H  +  ++  F L+G             
Sbjct: 171 DVIPSPAGVVQWTHNDGNDDLISVTLLDANSYHKQLDENVRSFFLAGQSQRETREEGSDR 230

Query: 181 -------------RGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQ-SLPK 240
                          +L GF  E + +       ET + L+ + +   F VQ     L  
Sbjct: 231 QSRESDDDEALLGANILSGFQDEILHELFRDVDRETISKLRGENDQRGFIVQAQDLKLRV 290

Query: 241 PHKYSKLVYNIDAAAPDNRAKVGDAAVTMVTESTF-------------------PFIGQ- 300
           P  + +     +    D R   G +  +   E  F                   P  G+ 
Sbjct: 291 PQDFEE---EYERERGDRRRGQGGSGRSNGVEQGFCNLKFRRNFNTPTNTYVFNPRAGRI 350

Query: 301 ----TGLTPVLEKLDANAIRSPVY---IAEP-----SDQLIYVTKGSGKIQVVGFSSK-- 332
               +   P+LE L  +A    +Y   I  P     +   +YVT+G G++QVVG   K  
Sbjct: 351 NTVNSNSLPILEFLQLSAQHVVLYKNAIIGPRWNLNAHSALYVTRGEGRVQVVGDEGKSV 410

BLAST of CsGy3G018830 vs. TrEMBL
Match: tr|A0A0A0L6K0|A0A0A0L6K0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218160 PE=4 SV=1)

HSP 1 Score: 692.2 bits (1785), Expect = 5.9e-196
Identity = 340/340 (100.00%), Positives = 340/340 (100.00%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP
Sbjct: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK
Sbjct: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
           YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE
Sbjct: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240

Query: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT 300
           PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT 300

Query: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV
Sbjct: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of CsGy3G018830 vs. TrEMBL
Match: tr|A0A1S3C2D5|A0A1S3C2D5_CUCME (glutelin type-A 2-like OS=Cucumis melo OX=3656 GN=LOC103496119 PE=4 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 2.3e-184
Identity = 322/340 (94.71%), Positives = 329/340 (96.76%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           MEAMNPKPFFEGEGGSY KWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHY+DCSKFGYVL
Sbjct: 1   MEAMNPKPFFEGEGGSYLKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYADCSKFGYVL 60

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           QGEDGVTGFVFP KCNEVV+KLKKGDLIPVP+G+TSWWFNDGDSDLEIIFLGETK AHVP
Sbjct: 61  QGEDGVTGFVFPNKCNEVVMKLKKGDLIPVPSGITSWWFNDGDSDLEIIFLGETKNAHVP 120

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDITYFILSGPRGLLQGF PEYVQKS SL+QEETN FLKSQ NVLIFTVQPSQSLPKPHK
Sbjct: 121 GDITYFILSGPRGLLQGFAPEYVQKSYSLSQEETNKFLKSQSNVLIFTVQPSQSLPKPHK 180

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
           +SKLVYNIDAA PDNRAKVG AAVTMVTESTFPFIGQTGLT VLEKLDANAIRSPVYIAE
Sbjct: 181 HSKLVYNIDAAVPDNRAKVGAAAVTMVTESTFPFIGQTGLTAVLEKLDANAIRSPVYIAE 240

Query: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIVAT 300
           PSDQLIYVTKGSGKIQVVGFSSKFDADVK GQLILVPRYFAVGK+AGEEGLECISMIVAT
Sbjct: 241 PSDQLIYVTKGSGKIQVVGFSSKFDADVKIGQLILVPRYFAVGKMAGEEGLECISMIVAT 300

Query: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
           HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV
Sbjct: 301 HPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 340

BLAST of CsGy3G018830 vs. TrEMBL
Match: tr|A0A1S3C332|A0A1S3C332_CUCME (glutelin type-B 5 OS=Cucumis melo OX=3656 GN=LOC103496120 PE=4 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 3.4e-119
Identity = 209/342 (61.11%), Positives = 262/342 (76.61%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           ++ M+P  FF GEGGS+HKW PSD+P++ QT V  GRLLL PRGFAVPH SD SK GYVL
Sbjct: 5   LKPMDPTNFFTGEGGSFHKWFPSDHPIIPQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVL 64

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           QG  GV G VFP K  E V++LKKGD+IPVP GVTSWWFNDGDSD E++ +G+T+ A +P
Sbjct: 65  QG-SGVAGIVFPCKSEEAVVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIP 124

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDITY + +GP G+LQGF+ +Y++K   L +EE    LKSQPN LIF ++  Q+LP+P  
Sbjct: 125 GDITYVVFAGPLGVLQGFSSDYIEKVYDLTEEEREVLLKSQPNGLIFKLKDDQTLPEPDC 184

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
           +S LV+NI  AAPD+  K G   VT++TE  FPFIG++GLT VLEKL+ANA+RSPVY+A+
Sbjct: 185 HSDLVFNIYDAAPDSVVK-GGGTVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVAD 244

Query: 241 PSDQLIYVTKGSGKIQVVG--FSSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIV 300
           PS QLIYV  GSG+IQ+       + DA+VK GQLILVP+YFAVGK+AGEEGLEC ++I 
Sbjct: 245 PSVQLIYVASGSGRIQIAETFMRKQIDAEVKAGQLILVPKYFAVGKMAGEEGLECFTIIT 304

Query: 301 ATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
            THP++EEL GK+S+  A S +VFQ SFNVTA FEKL  SK+
Sbjct: 305 TTHPLLEELGGKSSIFGAFSPQVFQASFNVTAHFEKLLISKI 344

BLAST of CsGy3G018830 vs. TrEMBL
Match: tr|A0A0A0LC21|A0A0A0LC21_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218170 PE=4 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 4.5e-119
Identity = 204/342 (59.65%), Positives = 264/342 (77.19%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           ++ M+P  FF GEGGS+HKW PSD+P+++QT V  GRLLL PRGFAVPH SD SK GYVL
Sbjct: 5   LKPMDPSNFFTGEGGSFHKWFPSDFPIISQTKVGAGRLLLHPRGFAVPHNSDSSKVGYVL 64

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           QG  GV G +FP K  E  ++LKKGD+IPVP GVTSWWFNDGDSD E++ +G+T+ A +P
Sbjct: 65  QG-SGVAGIIFPCKSEEAAVRLKKGDVIPVPEGVTSWWFNDGDSDFEVLLVGDTRNALIP 124

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDITY + +GP G+LQGF+ +Y++K   L ++E    LKSQPN LIF ++  Q+LP+P  
Sbjct: 125 GDITYVVFAGPLGVLQGFSSDYIEKVYDLTEKEREVLLKSQPNGLIFKLKDDQTLPEPDC 184

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
           +S LV+NI   APD   K G  +VT++TE  FPFIG++GLT VLEKL+ANA+RSPVY+A+
Sbjct: 185 HSDLVFNIYHTAPDAVVK-GGGSVTVLTEEKFPFIGKSGLTAVLEKLEANAVRSPVYVAD 244

Query: 241 PSDQLIYVTKGSGKIQVVGFSSKF--DADVKTGQLILVPRYFAVGKIAGEEGLECISMIV 300
           PS QLIYV  GSG++Q+     ++  DA+VK GQL+LVP+YFAVGK+AGEEGLEC ++I 
Sbjct: 245 PSVQLIYVASGSGRVQIAETFMRYQIDAEVKAGQLVLVPKYFAVGKMAGEEGLECFTIIT 304

Query: 301 ATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRSKV 341
            THP++EEL GKTS+  A S +VF+ SFN+TA FEKLFRSK+
Sbjct: 305 TTHPLLEELGGKTSIFGAFSPQVFEASFNLTAHFEKLFRSKI 344

BLAST of CsGy3G018830 vs. TrEMBL
Match: tr|A0A0A0K550|A0A0A0K550_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G337100 PE=4 SV=1)

HSP 1 Score: 424.5 bits (1090), Expect = 2.3e-115
Identity = 207/340 (60.88%), Positives = 257/340 (75.59%), Query Frame = 0

Query: 1   MEAMNPKPFFEGEGGSYHKWLPSDYPLLAQTNVAGGRLLLRPRGFAVPHYSDCSKFGYVL 60
           ++AMNP+  FEG GGSY+KW PSDYPLLAQ+ V  G LLL PRGFA+ HYSD SK GYVL
Sbjct: 6   LKAMNPRKHFEGVGGSYNKWYPSDYPLLAQSKVGAGMLLLHPRGFAILHYSDASKVGYVL 65

Query: 61  QGEDGVTGFVFPKKCNEVVIKLKKGDLIPVPAGVTSWWFNDGDSDLEIIFLGETKRAHVP 120
           +G +GVTGF+FP   NE VIKLKKGD+IPVP GVTSWW+NDGDSDLEI FLGETK AHVP
Sbjct: 66  RGNNGVTGFIFPNTSNEEVIKLKKGDIIPVPTGVTSWWYNDGDSDLEIAFLGETKYAHVP 125

Query: 121 GDITYFILSGPRGLLQGFTPEYVQKSCSLNQEETNTFLKSQPNVLIFTVQPSQSLPKPHK 180
           GDI+Y+ILSGP+G+LQGF+ +YV K+ +LN+ +T+T L SQ N +IF +Q  Q+LP P K
Sbjct: 126 GDISYYILSGPQGILQGFSQDYVAKTFNLNEMDTSTLLNSQQNGMIFKLQEGQTLPTPTK 185

Query: 181 YSKLVYNIDAAAPDNRAKVGDAAVTMVTESTFPFIGQTGLTPVLEKLDANAIRSPVYIAE 240
            +K VYN+D                 V+ES FPFIG+TGL  V+E+L  N +RSPV +  
Sbjct: 186 DTKFVYNLD----------NYDFFMKVSESEFPFIGETGLAVVVERLGPNVVRSPVLLVS 245

Query: 241 PSDQLIYVTKGSGKIQVVGF--SSKFDADVKTGQLILVPRYFAVGKIAGEEGLECISMIV 300
           P+DQLIYV +GSG +Q+VG   SSK +  V++GQLI VP+YFA GKIA E+G+E  S++ 
Sbjct: 246 PADQLIYVARGSGTVQIVGLSSSSKIELHVESGQLIFVPKYFAAGKIAAEQGMEFFSILT 305

Query: 301 ATHPMVEELAGKTSVLEALSSEVFQVSFNVTAEFEKLFRS 339
           A   +V EL GKTSV+EALS+EV  VSFN+TAEFEK+ RS
Sbjct: 306 AKLGLVGELKGKTSVMEALSAEVIAVSFNITAEFEKVLRS 335
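
The percentages in each HSP statistics line above are the matched-residue count divided by the alignment length. As a minimal sketch (the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool), the values can be recomputed from the raw fractions like this:

```python
import re

# Hypothetical pattern for the "Identity = a/b (x%), Positives = c/d (y%)" line.
# "Posi?tives" also accepts the misspelling "Postives" seen in some page exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(header: str) -> dict:
    """Recompute identity and positives percentages from the raw fractions."""
    match = HSP_STATS.search(header)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return {
        "identity": round(100 * ident / ident_len, 2),
        "positives": round(100 * pos / pos_len, 2),
    }

line = "Identity = 209/342 (61.11%), Positives = 262/342 (76.61%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns, which is why it can differ between hits to the same query (342 versus 340 above).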

The following BLAST results are available for this feature:
Match Name       E-value    Identity   Description
XP_004151504.1   8.9e-196   100.00     PREDICTED: legumin J [Cucumis sativus] >KGN57580.1 hypothetical protein Csa_3G21...
XP_008456076.1   3.5e-184   94.71      PREDICTED: glutelin type-A 2-like [Cucumis melo]
XP_022922755.1   4.6e-152   77.88      legumin J-like [Cucurbita moschata]
XP_022985328.1   2.3e-151   77.88      12S seed storage protein CRD-like [Cucurbita maxima]
XP_023552908.1   1.3e-149   76.99      12S seed storage globulin 1-like [Cucurbita pepo subsp. pepo]

Match Name    E-value   Identity   Description
AT1G07750.1   3.6e-66   39.47      RmlC-like cupins superfamily protein
AT2G28680.1   4.7e-66   39.64      RmlC-like cupins superfamily protein
AT1G03890.1   4.2e-22   24.81      RmlC-like cupins superfamily protein
AT1G03880.1   9.3e-22   22.00      cruciferin 2
AT5G44120.3   3.4e-16   21.04      RmlC-like cupins superfamily protein

Match Name              E-value   Identity   Description
sp|P07728|GLUA1_ORYSJ   2.6e-21   22.16      Glutelin type-A 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA1 PE=1 SV=2
sp|P07730|GLUA2_ORYSJ   3.4e-21   22.68      Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1
sp|Q9ZWA9|CRU4_ARATH    7.5e-21   24.81      12S seed storage protein CRD OS=Arabidopsis thaliana OX=3702 GN=CRD PE=1 SV=1
sp|P15456|CRU2_ARATH    1.7e-20   22.00      12S seed storage protein CRB OS=Arabidopsis thaliana OX=3702 GN=CRB PE=1 SV=2
sp|O23880|13S2_FAGES    3.7e-20   24.53      13S globulin seed storage protein 2 OS=Fagopyrum esculentum OX=3617 GN=FA18 PE=2...

Match Name                       E-value    Identity   Description
tr|A0A0A0L6K0|A0A0A0L6K0_CUCSA   5.9e-196   100.00     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218160 PE=4 SV=1
tr|A0A1S3C2D5|A0A1S3C2D5_CUCME   2.3e-184   94.71      glutelin type-A 2-like OS=Cucumis melo OX=3656 GN=LOC103496119 PE=4 SV=1
tr|A0A1S3C332|A0A1S3C332_CUCME   3.4e-119   61.11      glutelin type-B 5 OS=Cucumis melo OX=3656 GN=LOC103496120 PE=4 SV=1
tr|A0A0A0LC21|A0A0A0LC21_CUCSA   4.5e-119   59.65      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G218170 PE=4 SV=1
tr|A0A0A0K550|A0A0A0K550_CUCSA   2.3e-115   60.88      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G337100 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0045735   nutrient reservoir activity
Vocabulary: INTERPRO
Term        Definition
IPR011051   RmlC_Cupin_sf
IPR014710   RmlC-like_jellyroll
IPR006045   Cupin_1
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0045735       nutrient reservoir activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CsGy3G018830.1   CsGy3G018830.1   mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term    IPR Description                     Source        Source Term         Source Description    Alignment
IPR006045   Cupin 1                             SMART         SM00835             Cupin_1_3             coord: 186..334; e-value: 1.0E-11; score: 54.9
IPR006045   Cupin 1                             SMART         SM00835             Cupin_1_3             coord: 2..155; e-value: 1.6E-28; score: 110.7
IPR006045   Cupin 1                             PFAM          PF00190             Cupin_1               coord: 3..154; e-value: 2.5E-19; score: 69.3
IPR006045   Cupin 1                             PFAM          PF00190             Cupin_1               coord: 192..332; e-value: 1.1E-14; score: 54.2
IPR014710   RmlC-like jelly roll fold           GENE3D        G3DSA:2.60.120.10   -                     coord: 1..183; e-value: 6.2E-28; score: 99.5
IPR014710   RmlC-like jelly roll fold           GENE3D        G3DSA:2.60.120.10   -                     coord: 197..340; e-value: 4.5E-23; score: 83.4
None        No IPR available                    PANTHER       PTHR31189:SF0       SUBFAMILY NOT NAMED   coord: 6..339
None        No IPR available                    PANTHER       PTHR31189           FAMILY NOT NAMED      coord: 6..339
IPR011051   RmlC-like cupin domain superfamily  SUPERFAMILY   SSF51182            RmlC-like cupins      coord: 3..336