Cla97C06G113120 (gene) Watermelon (97103) v2

Name: Cla97C06G113120
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: 11-S seed storage protein
Location: Cla97Chr06 : 4148901 .. 4150087 (-)
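
The span of the locus can be worked out from the Location coordinates. The record does not say which coordinate convention it uses, so the sketch below assumes 1-based, inclusive coordinates (the usual convention for GFF-style annotations). The helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field; the strand (-) does not change the length.
gene_length = feature_length(4148901, 4150087)
print(gene_length)
```

Under that assumption the locus spans 1187 bp.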
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACATTGATTTGACCCCTCAATTGCCGAAGAAGATTTATGGTAGCGATGGAGGTTCCTATCATTCATGGTCTCCCAAGGAGCTTCCCATGCTCCGTGAAGGAAACATTGGCGCTTCCAAGCTTGCTATTGAGAAGAATGGCTTCGCTCTCCCTTGTTATTCTGATTCTGCCAAGGTTGCTTATGTTCTTCAAGGTATTTTTTTCTTTTTAATGTGGTTTAGTTTTATATAAATCTCAAATCTGTTCGTTTATATCCATTTTCAGTGTGTTATTTCAACATGAATCTAATTAAAGTGCTAAATGGTTCAGGCAATGGAATAGCTGGAATCATTTTGTCGGAATCAGAGGAGAAGGTTTTTGGGATAAAAAAAGGAGATGCAATTGCTCTTCCATTCGGCATAGTGACGTGGTGGTTCAATAAAGAAGCTACTGATTTGGTAGTTCTATTCTTGGGCAGCACATCAAAGTCTCACAAATCAGGTGAGTTCACTGACTTCTTCCTCACTGGCGCCAACGGAATCTTTACTGGATTCTCAATGGAGTTTGTCAAGCGAGCCTTGGATATGGATGAGGTATCGGTGAAATCTTTAGTGAAAAATCAAACTGGAACCAGAATCGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAGCCAAAAATGGAGCACCGAAACGGAATGGTGCTGAATTGTGAGGAGGCACTGCTAGATGTGGACGTGAAGAACGGAGGACGAGTTGTGGTTCTAAATACGAAGAATCTACCCCTGGTAGGGGAGGTAGGATTGGGAGCAGATTTGGTTCGATTGGATGGAAGTGCGATGTGCTCGCCTGGATTCTCGTGTGATTCGGCGTTGCAGGTGACATATATCGTGAAAGGAAGCGGGAGAGTCGAGGTTGTAGGAGTAGACGGGAAGAAGGTTCTGGAGACAAGAGTGAAAGCTGGAAATTTGTTCATAGTACCAAGGTTTTTTGTGGTATCAAAGATCGGAGATTCTGAAGGAATGGAGTGGTTCTCCATCATTAGTACTCCCAATCCCGTTTTCACTCACTTGGCCGGCAGTATCGGGGTCTGGAAGTCTCTTTCATCGGAAGTTATTCAAGCATCCTTTAATGTAGATGCTAATTTGGTGAAAAAATTTTCTTCCAAGAGAACTTCTGATGCCATCTTCTTCCCTCCTTCTAATTAG

mRNA sequence

ATGGACATTGATTTGACCCCTCAATTGCCGAAGAAGATTTATGGTAGCGATGGAGGTTCCTATCATTCATGGTCTCCCAAGGAGCTTCCCATGCTCCGTGAAGGAAACATTGGCGCTTCCAAGCTTGCTATTGAGAAGAATGGCTTCGCTCTCCCTTGTTATTCTGATTCTGCCAAGGTTGCTTATGTTCTTCAAGGCAATGGAATAGCTGGAATCATTTTGTCGGAATCAGAGGAGAAGGTTTTTGGGATAAAAAAAGGAGATGCAATTGCTCTTCCATTCGGCATAGTGACGTGGTGGTTCAATAAAGAAGCTACTGATTTGGTAGTTCTATTCTTGGGCAGCACATCAAAGTCTCACAAATCAGGTGAGTTCACTGACTTCTTCCTCACTGGCGCCAACGGAATCTTTACTGGATTCTCAATGGAGTTTGTCAAGCGAGCCTTGGATATGGATGAGGTATCGGTGAAATCTTTAGTGAAAAATCAAACTGGAACCAGAATCGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAGCCAAAAATGGAGCACCGAAACGGAATGGTGCTGAATTGTGAGGAGGCACTGCTAGATGTGGACGTGAAGAACGGAGGACGAGTTGTGGTTCTAAATACGAAGAATCTACCCCTGGTAGGGGAGGTAGGATTGGGAGCAGATTTGGTTCGATTGGATGGAAGTGCGATGTGCTCGCCTGGATTCTCGTGTGATTCGGCGTTGCAGGTGACATATATCGTGAAAGGAAGCGGGAGAGTCGAGGTTGTAGGAGTAGACGGGAAGAAGGTTCTGGAGACAAGAGTGAAAGCTGGAAATTTGTTCATAGTACCAAGGTTTTTTGTGGTATCAAAGATCGGAGATTCTGAAGGAATGGAGTGGTTCTCCATCATTAGTACTCCCAATCCCGTTTTCACTCACTTGGCCGGCAGTATCGGGGTCTGGAAGTCTCTTTCATCGGAAGTTATTCAAGCATCCTTTAATGTAGATGCTAATTTGGTGAAAAAATTTTCTTCCAAGAGAACTTCTGATGCCATCTTCTTCCCTCCTTCTAATTAG

Coding sequence (CDS)

ATGGACATTGATTTGACCCCTCAATTGCCGAAGAAGATTTATGGTAGCGATGGAGGTTCCTATCATTCATGGTCTCCCAAGGAGCTTCCCATGCTCCGTGAAGGAAACATTGGCGCTTCCAAGCTTGCTATTGAGAAGAATGGCTTCGCTCTCCCTTGTTATTCTGATTCTGCCAAGGTTGCTTATGTTCTTCAAGGCAATGGAATAGCTGGAATCATTTTGTCGGAATCAGAGGAGAAGGTTTTTGGGATAAAAAAAGGAGATGCAATTGCTCTTCCATTCGGCATAGTGACGTGGTGGTTCAATAAAGAAGCTACTGATTTGGTAGTTCTATTCTTGGGCAGCACATCAAAGTCTCACAAATCAGGTGAGTTCACTGACTTCTTCCTCACTGGCGCCAACGGAATCTTTACTGGATTCTCAATGGAGTTTGTCAAGCGAGCCTTGGATATGGATGAGGTATCGGTGAAATCTTTAGTGAAAAATCAAACTGGAACCAGAATCGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAGCCAAAAATGGAGCACCGAAACGGAATGGTGCTGAATTGTGAGGAGGCACTGCTAGATGTGGACGTGAAGAACGGAGGACGAGTTGTGGTTCTAAATACGAAGAATCTACCCCTGGTAGGGGAGGTAGGATTGGGAGCAGATTTGGTTCGATTGGATGGAAGTGCGATGTGCTCGCCTGGATTCTCGTGTGATTCGGCGTTGCAGGTGACATATATCGTGAAAGGAAGCGGGAGAGTCGAGGTTGTAGGAGTAGACGGGAAGAAGGTTCTGGAGACAAGAGTGAAAGCTGGAAATTTGTTCATAGTACCAAGGTTTTTTGTGGTATCAAAGATCGGAGATTCTGAAGGAATGGAGTGGTTCTCCATCATTAGTACTCCCAATCCCGTTTTCACTCACTTGGCCGGCAGTATCGGGGTCTGGAAGTCTCTTTCATCGGAAGTTATTCAAGCATCCTTTAATGTAGATGCTAATTTGGTGAAAAAATTTTCTTCCAAGAGAACTTCTGATGCCATCTTCTTCCCTCCTTCTAATTAG

Protein sequence

MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPKMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN
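
A rough consistency check between the sequences above: a CDS that ends in a stop codon should contain three nucleotides per residue, plus three for the stop. The sketch below makes two assumptions that the page does not state directly: a protein length of 356 residues (the length implied by the full-length BLAST alignments in this record) and 1-based, inclusive Location coordinates. The helper name `cds_length_from_protein` is hypothetical.

```python
def cds_length_from_protein(n_residues: int, has_stop: bool = True) -> int:
    """Expected CDS length in nucleotides for a protein of n_residues."""
    return 3 * (n_residues + (1 if has_stop else 0))


protein_len = 356                    # assumed residue count
gene_span = 4150087 - 4148901 + 1    # assumes 1-based, inclusive coordinates
cds_len = cds_length_from_protein(protein_len)
intron_total = gene_span - cds_len   # meaningful only if no UTR is annotated
print(cds_len, intron_total)
```

If these assumptions hold, the CDS is 1071 nt and the remaining 116 nt of the gene sequence is intronic. The mRNA and CDS sequences listed above appear identical, which suggests that no UTR is annotated for this gene model.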
BLAST of Cla97C06G113120 vs. NCBI nr
Match: XP_004150394.1 (PREDICTED: glutelin type-B 5-like isoform X1 [Cucumis sativus] >KGN44409.1 hypothetical protein Csa_7G281380 [Cucumis sativus])

HSP 1 Score: 654.8 bits (1688), Expect = 1.6e-184
Identity = 326/356 (91.57%), Positives = 339/356 (95.22%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQLPKKIYGSDGGSY++WSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           IISTPNPVFTHLAGSIGVWK+LS EVI+A+FNV+A+LVK FSSKR+SDAIFFPPSN
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSDAIFFPPSN 356
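
The percentages in the statistics line of the hit above are the reported match counts divided by the alignment length. The first ratio counts identical residues. The second also counts positive-scoring substitutions, which appear as `+` in the middle line of the alignment. A minimal sketch that reproduces the two figures from the reported counts, with `percent` as a hypothetical helper name:

```python
def percent(matches: int, aligned: int) -> str:
    """Format matches/aligned as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / aligned:.2f}%"


# Counts copied from the XP_004150394.1 hit.
print(percent(326, 356))  # identical residues
print(percent(339, 356))  # identical plus positive-scoring residues
```

The same calculation applies to every hit in this record. Only the counts and the alignment length change.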

BLAST of Cla97C06G113120 vs. NCBI nr
Match: XP_008461502.1 (PREDICTED: glutelin type-B 5-like [Cucumis melo])

HSP 1 Score: 653.7 bits (1685), Expect = 3.7e-184
Identity = 326/356 (91.57%), Positives = 338/356 (94.94%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQLPKKIYG DGGSY+SWSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQG+G+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           IISTPNPVFTHLAGSIGVWK+LS EVIQA+FNV+A+LVK FSSKR+SDAIFFPPSN
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSDAIFFPPSN 356

BLAST of Cla97C06G113120 vs. NCBI nr
Match: XP_011659088.1 (PREDICTED: 11S globulin seed storage protein 2-like isoform X2 [Cucumis sativus])

HSP 1 Score: 652.9 bits (1683), Expect = 6.3e-184
Identity = 325/356 (91.29%), Positives = 339/356 (95.22%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQLPKKIYGSDGGSY++WSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQG+G+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           IISTPNPVFTHLAGSIGVWK+LS EVI+A+FNV+A+LVK FSSKR+SDAIFFPPSN
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSDAIFFPPSN 356

BLAST of Cla97C06G113120 vs. NCBI nr
Match: XP_023535755.1 (glutelin type-D 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 648.7 bits (1672), Expect = 1.2e-182
Identity = 324/355 (91.27%), Positives = 335/355 (94.37%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQL KKIYGSDGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLK+G KMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKDGVKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS 356
           IISTPNPVFTHLAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Sbjct: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPS 355

BLAST of Cla97C06G113120 vs. NCBI nr
Match: XP_022976927.1 (glutelin type-D 1-like [Cucurbita maxima])

HSP 1 Score: 644.8 bits (1662), Expect = 1.7e-181
Identity = 322/355 (90.70%), Positives = 334/355 (94.08%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQL KKIYG DGGSY+SWSPKELPMLREGNIGA+KLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGCDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVK+QTGT IVKLK+G KMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKSQTGTGIVKLKDGVKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIV+GSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPS 356
           IISTPNPVFTHLAGSIGVWKSLS EVIQA+FNVDA+LVK FSSKR SDAIFFPPS
Sbjct: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPS 355

BLAST of Cla97C06G113120 vs. TrEMBL
Match: tr|A0A0A0K666|A0A0A0K666_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G281380 PE=4 SV=1)

HSP 1 Score: 654.8 bits (1688), Expect = 1.1e-184
Identity = 326/356 (91.57%), Positives = 339/356 (95.22%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQLPKKIYGSDGGSY++WSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQGNG+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           IISTPNPVFTHLAGSIGVWK+LS EVI+A+FNV+A+LVK FSSKR+SDAIFFPPSN
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSDAIFFPPSN 356

BLAST of Cla97C06G113120 vs. TrEMBL
Match: tr|A0A1S3CG59|A0A1S3CG59_CUCME (glutelin type-B 5-like OS=Cucumis melo OX=3656 GN=LOC103500083 PE=4 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 2.4e-184
Identity = 326/356 (91.57%), Positives = 338/356 (94.94%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQLPKKIYG DGGSY+SWSPKELPMLREGNIGASKLA+EKNGFALP YSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQG+G+AGIIL ESEEKV  IKKGDAIALPFG+VTWWFNKEATDLVVLFLG TSK+H
Sbjct: 61  AYVLQGSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFS EFV RA DMDE SVKSLVKNQTGT IVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHRNGM LNCEEA LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIVKGSGR EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGD EGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           IISTPNPVFTHLAGSIGVWK+LS EVIQA+FNV+A+LVK FSSKR+SDAIFFPPSN
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSDAIFFPPSN 356

BLAST of Cla97C06G113120 vs. TrEMBL
Match: tr|A0A151TM61|A0A151TM61_CAJCA (Glutelin type-A 1 OS=Cajanus cajan OX=3821 GN=KK1_021772 PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 2.2e-161
Identity = 278/356 (78.09%), Positives = 315/356 (88.48%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M+IDLTPQL KK+YG +GGSY++WSP ELPMLREGNIGA+KLA+EKNGFALP YSDS+KV
Sbjct: 1   MEIDLTPQLSKKVYGDNGGSYYAWSPSELPMLREGNIGAAKLALEKNGFALPRYSDSSKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQG+G+AGI+L ESEEKV  IKKGDA+ALPFG+VTWW+NKE T+LV+LFLG TSK H
Sbjct: 61  AYVLQGSGVAGIVLPESEEKVVAIKKGDALALPFGVVTWWYNKEDTELVILFLGDTSKGH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           K+GEFTDFFLTG+NGIFTGFS EFV RA D++E  VK+LV+ QTG  IVKL    K+PEP
Sbjct: 121 KAGEFTDFFLTGSNGIFTGFSTEFVGRAWDLEEKDVKTLVEKQTGKGIVKLDGSIKLPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHR GM LNCEEA LDVD+KNGGRVVVLNTKNLPLVGEVGLGADLVR+DG+AMCSPGF
Sbjct: 181 KEEHRKGMALNCEEAPLDVDIKNGGRVVVLNTKNLPLVGEVGLGADLVRIDGNAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIV+GSGRV+VVG DG++VLET +KAGNLFIVPRFFVVSKI D +GM WFS
Sbjct: 241 SCDSALQVTYIVRGSGRVQVVGADGRRVLETTIKAGNLFIVPRFFVVSKIADPDGMAWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           II+TPNP+FTHLAGSI  WK+LS  V+QASFNVD  + K F SKRTSDAIFFPP N
Sbjct: 301 IITTPNPIFTHLAGSISAWKALSPTVLQASFNVDEGVEKLFRSKRTSDAIFFPPPN 356

BLAST of Cla97C06G113120 vs. TrEMBL
Match: tr|A0A022QLG5|A0A022QLG5_ERYGU (Uncharacterized protein OS=Erythranthe guttata OX=4155 GN=MIMGU_mgv1a008978mg PE=4 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 3.8e-161
Identity = 278/356 (78.09%), Positives = 317/356 (89.04%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M++DLTPQL KK+YG DGG+Y++W P ELPMLREGNIGA KLA+EKNGFALP YSDSAKV
Sbjct: 1   MELDLTPQLAKKLYGGDGGAYYAWCPNELPMLREGNIGAGKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQGNG+AGI+L E EEKV  IKKGDAIALPFG+VTWW+NKE T+LV+LFLG TSK+H
Sbjct: 61  AYVLQGNGVAGIVLPEKEEKVLPIKKGDAIALPFGVVTWWYNKEETELVILFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           KSG FTDFFLTG NGIFTGFS EFV RA D++E +VK+LV +Q+G  IVKL    KMPEP
Sbjct: 121 KSGSFTDFFLTGPNGIFTGFSTEFVGRAWDLEESTVKTLVGSQSGNGIVKLDSTFKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K+EH NGM LNCEEA LDVD+KNGG+VVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KIEHYNGMALNCEEAPLDVDIKNGGKVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIV+GSGRV+VVGVDGK+VLETR+KAGNLFIVPRFFVVSKI D EGM+WFS
Sbjct: 241 SCDSALQVTYIVRGSGRVQVVGVDGKRVLETRLKAGNLFIVPRFFVVSKIADPEGMDWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           II+TPNP+FTHLAG   VWK+LS EV+QA+FNV A++ +KF+SKR ++ IFFPP N
Sbjct: 301 IITTPNPIFTHLAGRTSVWKALSPEVLQAAFNVPADVEEKFTSKRKAEEIFFPPPN 356

BLAST of Cla97C06G113120 vs. TrEMBL
Match: tr|W9RVS5|W9RVS5_9ROSA (Glutelin type-A 1 OS=Morus notabilis OX=981085 GN=L484_018618 PE=4 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 1.1e-160
Identity = 276/356 (77.53%), Positives = 319/356 (89.61%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M IDL+P+L K++YG DGG+Y++WSP ELPMLREGNIGA+KLA+EKNGFALP YSDS+KV
Sbjct: 1   MSIDLSPKLSKRVYGGDGGAYYAWSPSELPMLREGNIGAAKLALEKNGFALPRYSDSSKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQGNG+AGI+L ESEEKV  IKKGD+IALPFG+VTWW+NKE TDLVVLFLG TSK+H
Sbjct: 61  AYVLQGNGVAGIVLPESEEKVVAIKKGDSIALPFGVVTWWYNKEDTDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           K+GEFTDF+LTG NGIFTGFS EFV RA D++E  VK+LV  Q+G  IVKL+EG  +PEP
Sbjct: 121 KAGEFTDFYLTGCNGIFTGFSTEFVGRAWDLEEDVVKTLVGRQSGQGIVKLQEGFNLPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K EHR G+ LNCEEA LDVD+K+GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHREGLALNCEEAPLDVDIKDGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTY+V+GSGRV+VVGVDGK+VLET VKAGNLFIVPRF+VVSKI D +G+EWFS
Sbjct: 241 SCDSALQVTYVVRGSGRVQVVGVDGKRVLETTVKAGNLFIVPRFYVVSKIADPDGLEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           II+TPNPVFTHLAG   VWK+LS +V++ASFNV++++ K F SKRTSDAIFFPP N
Sbjct: 301 IITTPNPVFTHLAGRTSVWKALSPQVLEASFNVESDVEKHFRSKRTSDAIFFPPPN 356

BLAST of Cla97C06G113120 vs. Swiss-Prot
Match: sp|P07728|GLUA1_ORYSJ (Glutelin type-A 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA1 PE=1 SV=2)

HSP 1 Score: 117.5 bits (293), Expect = 3.1e-25
Identity = 97/403 (24.07%), Positives = 168/403 (41.69%), Query Frame = 0

Query: 37  IGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAG-------------------IILSES 96
           +   +  IE  G  LP Y++ A + Y++QG GI G                     L+ES
Sbjct: 82  VSVVRRVIEPRGLLLPHYTNGASLVYIIQGRGITGPTFPGCPESYQQQFQQSGQAQLTES 141

Query: 97  E----------EKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLG--STSKSHKSGEF 156
           +          +K+   ++GD IALP G+  W +N     +V +++   +   +      
Sbjct: 142 QSQSQKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDLNNGANQLDPRQ 201

Query: 157 TDFFLTG---------------ANGIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTRIV 216
            DF L G               +  IF+GFS E +  AL +     + L  +N     IV
Sbjct: 202 RDFLLAGNKRNPQAYRREVEERSQNIFSGFSTELLSEALGVSSQVARQLQCQNDQRGEIV 261

Query: 217 KLKEGTKMPEP---KMEHRNGMVLN-----------------CEEAL----------LDV 276
           +++ G  + +P     E   G V +                 C   L           ++
Sbjct: 262 RVEHGLSLLQPYASLQEQEQGQVQSRERYQEGQYQQSQYGSGCSNGLDETFCTLRVRQNI 321

Query: 277 DVKN--------GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI 336
           D  N         GRV  LNT+N P++  V + A  V L  +A+ SP ++  +A  V YI
Sbjct: 322 DNPNRADTYNPRAGRVTNLNTQNFPILSLVQMSAVKVNLYQNALLSPFWNI-NAHSVVYI 381

Query: 337 VKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH 355
            +G  RV+VV  +GK V    ++ G L I+P+ + V K    EG  + +  + PN + +H
Sbjct: 382 TQGRARVQVVNNNGKTVFNGELRRGQLLIIPQHYAVVKKAQREGCAYIAFKTNPNSMVSH 441

BLAST of Cla97C06G113120 vs. Swiss-Prot
Match: sp|Q09151|GLUA3_ORYSJ (Glutelin type-A 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA3 PE=2 SV=2)

HSP 1 Score: 117.5 bits (293), Expect = 3.1e-25
Identity = 92/398 (23.12%), Positives = 164/398 (41.21%), Query Frame = 0

Query: 44  IEKNGFALPCYSDSAKVAYVLQGNGIAGII-----------------------------L 103
           IE  G  LP YS+ A + YV+QG GI G                                
Sbjct: 88  IEPRGLLLPHYSNGATLVYVIQGRGITGPTFPGCPETYQQQFQQSEQDQQLEGQSQSHKF 147

Query: 104 SESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGS--TSKSHKSGEFTDFFLTG 163
            +  +K+   ++GD +ALP G+  W +N     +V +++     S +       DFFL G
Sbjct: 148 RDEHQKIHRFQQGDVVALPAGVAHWCYNDGDAPIVAIYVTDIYNSANQLDPRHRDFFLAG 207

Query: 164 AN----------------GIFTGFSMEFVKRALDMDEVSVKSL-VKNQTGTRIVKLKEGT 223
            N                 +F GFS+E +  AL +     + L  +N     IV+++ G 
Sbjct: 208 NNKIGQQLYRYEARDNSKNVFGGFSVELLSEALGISSGVARQLQCQNDQRGEIVRVEHGL 267

Query: 224 KMPEP----------KMEHRNGMVLNCEEALLDVDVKNG--------------------- 283
            + +P          +++ R+      ++  L     NG                     
Sbjct: 268 SLLQPYASLQEQQQEQVQSRDYGQTQYQQKQLQGSCSNGLDETFCTMRVRQNIDNPNLAD 327

Query: 284 ------GRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRV 343
                 GR+  LN +  P++  V + A  V L  +A+ SP ++  +A  V YI +G  RV
Sbjct: 328 TYNPRAGRITYLNGQKFPILNLVQMSAVKVNLYQNALLSPFWNI-NAHSVVYITQGRARV 387

Query: 344 EVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGV 357
           +VV  +GK V +  ++ G L I+P+  VV K    EG  + ++ + P+ + +H+AG   +
Sbjct: 388 QVVNNNGKTVFDGELRRGQLLIIPQHHVVIKKAQREGCSYIALKTNPDSMVSHMAGKNSI 447

BLAST of Cla97C06G113120 vs. Swiss-Prot
Match: sp|Q9XHP0|11S2_SESIN (11S globulin seed storage protein 2 OS=Sesamum indicum OX=4182 PE=2 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 4.0e-25
Identity = 92/407 (22.60%), Positives = 176/407 (43.24%), Query Frame = 0

Query: 16  SDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIIL- 75
           S+GG+   W  ++    +   I A +  I  NG +LP Y  S ++ Y+ +G G+  I++ 
Sbjct: 50  SEGGTTELWDERQ-EQFQCAGIVAMRSTIRPNGLSLPNYHPSPRLVYIERGQGLISIMVP 109

Query: 76  --------------------SESE---------EKVFGIKKGDAIALPFGIVTWWFNKEA 135
                               SE +         +KV  +++GD +A+P G   W +N  +
Sbjct: 110 GCAETYQVHRSQRTMERTEASEQQDRGSVRDLHQKVHRLRQGDIVAIPSGAAHWCYNDGS 169

Query: 136 TDLVVLFLGSTS--KSHKSGEFTDFFLTGA---------------NGIFTGFSMEFVKRA 195
            DLV + +   +   +    +F  F+L G                + IF  F  E +  A
Sbjct: 170 EDLVAVSINDVNHLSNQLDQKFRAFYLAGGVPRSGEQEQQARQTFHNIFRAFDAELLSEA 229

Query: 196 LDMDEVSVKSL-VKNQTGTRIVKLKEGTKMPEP-----KMEHRNGMVLN-CEEAL----- 255
            ++ + +++ +  + +    IV  +E      P     + EHR   + N  EE       
Sbjct: 230 FNVPQETIRRMQSEEEERGLIVMARERMTFVRPDEEEGEQEHRGRQLDNGLEETFCTMKF 289

Query: 256 ---------LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQ 315
                     D+  +  GRV V++   LP++  + L A+   L  +A+ SP +S  +   
Sbjct: 290 RTNVESRREADIFSRQAGRVHVVDRNKLPILKYMDLSAEKGNLYSNALVSPDWSM-TGHT 349

Query: 316 VTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNP 355
           + Y+ +G  +V+VV  +G+ ++  RV  G +F+VP+++  +    + G EW +  +T +P
Sbjct: 350 IVYVTRGDAQVQVVDHNGQALMNDRVNQGEMFVVPQYYTSTARAGNNGFEWVAFKTTGSP 409

BLAST of Cla97C06G113120 vs. Swiss-Prot
Match: sp|Q6K508|GLUD1_ORYSJ (Glutelin type-D 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUD1 PE=2 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 4.0e-25
Identity = 91/384 (23.70%), Positives = 172/384 (44.79%), Query Frame = 0

Query: 44  IEKNGFALPCYSDSAKVAYVLQGNGIAGII-------------------------LSESE 103
           IE  G  +P YS++  +AY++QG G  G+                            +  
Sbjct: 81  IEPQGLVVPRYSNTPALAYIIQGKGYVGLTFPGCPATHQQQFQLFEQRQSDQAHKFRDEH 140

Query: 104 EKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFL--------------------GSTSK 163
           +K+   ++GD +ALP  +  W++N   T  VV+++                    G+  +
Sbjct: 141 QKIHEFRQGDVVALPASVAHWFYNGGDTPAVVVYVYDIKSFANQLEPRQKEFLLAGNNQR 200

Query: 164 SHKSGEFTDFFLTGANGIFTGFSMEFVKRALDMD-EVSVKSLVKNQTGTRIVKLKEGTKM 223
             +  E + F  +G N IF+GF+ E +  AL ++ E S +   +N     I+++K G ++
Sbjct: 201 GQQIFEHSIFQHSGQN-IFSGFNTEVLSEALGINTEASKRLQSQNDQRGDIIRVKHGLQL 260

Query: 224 PEPKM-----EHR------------NGMVLNCEEALLDVDVKN----------GGRVVVL 283
            +P +     EHR            NG+  N       V+++N           GR+ +L
Sbjct: 261 LKPTLTQRQEEHRQYQQVQYREGQYNGLDENFCTIKARVNIENPSRADYYNPRAGRITLL 320

Query: 284 NTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRVEVVGVDGKKVLE 343
           N +  P++  +G+GA  V L  +A+ SP ++  +A  V YI++GS RV+V    G+ V  
Sbjct: 321 NNQKFPILNLIGMGAARVNLYQNALLSPFWNI-NAHSVVYIIQGSVRVQVANNQGRSVFN 380

Query: 344 TRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSIGVWKSLSSEVIQAS 355
             +  G L I+P+   V K  +  G ++ +I +  +P  + +AG   + ++L  +VI  +
Sbjct: 381 GVLHQGQLLIIPQNHAVIKKAEHNGCQYVAIKTISDPTVSWVAGKNSILRALPVDVIANA 440

BLAST of Cla97C06G113120 vs. Swiss-Prot
Match: sp|P07730|GLUA2_ORYSJ (Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 3.4e-24
Identity = 91/403 (22.58%), Positives = 171/403 (42.43%), Query Frame = 0

Query: 37  IGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAG-------------------IILSES 96
           +   +  IE  G  LP Y++ A + Y++QG GI G                     L+ES
Sbjct: 82  VSVVRRVIEPRGLLLPHYTNGASLVYIIQGRGITGPTFPGCPETYQQQFQQSGQAQLTES 141

Query: 97  E----------EKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLG--STSKSHKSGEF 156
           +          +K+   ++GD IALP G+  W +N     +V +++   +   +      
Sbjct: 142 QSQSHKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDINNGANQLDPRQ 201

Query: 157 TDFFLTG---------------ANGIFTGFSMEFVKRALDM-DEVSVKSLVKNQTGTRIV 216
            DF L G               +  IF+GFS E +  A  + ++V+ +   +N     IV
Sbjct: 202 RDFLLAGNKRNPQAYRREVEEWSQNIFSGFSTELLSEAFGISNQVARQLQCQNDQRGEIV 261

Query: 217 KLKEGTKMPEP------------------------KMEHRNGMVLNCEEALLDVDVK--- 276
           +++ G  + +P                        + ++ +G     +E    + V+   
Sbjct: 262 RVERGLSLLQPYASLQEQEQGQMQSREHYQEGGYQQSQYGSGCPNGLDETFCTMRVRQNI 321

Query: 277 -----------NGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI 336
                        GRV  LN++N P++  V + A  V L  +A+ SP ++  +A  + YI
Sbjct: 322 DNPNRADTYNPRAGRVTNLNSQNFPILNLVQMSAVKVNLYQNALLSPFWNI-NAHSIVYI 381

Query: 337 VKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTH 355
            +G  +V+VV  +GK V    ++ G L IVP+ +VV K    EG  + +  + PN + +H
Sbjct: 382 TQGRAQVQVVNNNGKTVFNGELRRGQLLIVPQHYVVVKKAQREGCAYIAFKTNPNSMVSH 441

BLAST of Cla97C06G113120 vs. TAIR10
Match: AT2G28680.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 539.7 bits (1389), Expect = 1.4e-153
Identity = 262/356 (73.60%), Positives = 300/356 (84.27%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M++DL+P+LPKK+YG DGGSY +W P+ELPMLR+GNIGASKLA+EK G ALP YSDS KV
Sbjct: 1   MELDLSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQG G AGI+L E EEKV  IKKGD+IALPFG+VTWWFN E T+LVVLFLG T K H
Sbjct: 61  AYVLQGAGTAGIVLPEKEEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKGH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           K+G+FTDF+LTG+NGIFTGFS EFV RA D+DE +VK LV +QTG  IVK+    KMPEP
Sbjct: 121 KAGQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPEP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K   R G VLNC EA LDVD+K+GGRVVVLNTKNLPLVGEVG GADLVR+DG +MCSPGF
Sbjct: 181 KKGDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIV GSGRV++VG DGK+VLET VKAG LFIVPRFFVVSKI DS+G+ WFS
Sbjct: 241 SCDSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           I++TP+P+FTHLAG   VWK+LS EV+QA+F VD  + K F SKRTSDAIFF PSN
Sbjct: 301 IVTTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSDAIFFSPSN 356

BLAST of Cla97C06G113120 vs. TAIR10
Match: AT1G07750.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 537.7 bits (1384), Expect = 5.3e-153
Identity = 258/356 (72.47%), Positives = 304/356 (85.39%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKV 60
           M++DLTP+LPKK+YG DGGSY +W P+ELPML++GNIGA+KLA+EKNGFA+P YSDS+KV
Sbjct: 1   MELDLTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKV 60

Query: 61  AYVLQGNGIAGIILSESEEKVFGIKKGDAIALPFGIVTWWFNKEATDLVVLFLGSTSKSH 120
           AYVLQG+G AGI+L E EEKV  IK+GD+IALPFG+VTWWFN E  +LV+LFLG T K H
Sbjct: 61  AYVLQGSGTAGIVLPEKEEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKGH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEP 180
           K+G+FT+F+LTG NGIFTGFS EFV RA D+DE +VK LV +QTG  IVKL  G KMP+P
Sbjct: 121 KAGQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQP 180

Query: 181 KMEHRNGMVLNCEEALLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K E+R G VLNC EA LDVD+K+GGRVVVLNTKNLPLVGEVG GADLVR+D  +MCSPGF
Sbjct: 181 KEENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFS 300
           SCDSALQVTYIV GSGRV+VVG DGK+VLET +KAG+LFIVPRFFVVSKI D++GM WFS
Sbjct: 241 SCDSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSSEVIQASFNVDANLVKKFSSKRTSDAIFFPPSN 357
           I++TP+P+FTHLAG+  VWKSLS EV+QA+F V   + K F S RTS AIFFPPSN
Sbjct: 301 IVTTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSAIFFPPSN 356

BLAST of Cla97C06G113120 vs. TAIR10
Match: AT1G03890.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 111.3 bits (277), Expect = 1.2e-24
Identity = 91/400 (22.75%), Positives = 165/400 (41.25%), Query Frame = 0

Query: 17  DGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGIAGIILS- 76
           + G    W     P LR   +  +++ ++ N   LP +     +AYV+QG G+ G I S 
Sbjct: 53  EAGQMEVWDHMS-PELRCAGVTVARITLQPNSIFLPAFFSPPALAYVVQGEGVMGTIASG 112

Query: 77  -------------------------ESEEKVFGIKKGDAIALPFGIVTWWFNKEATD-LV 136
                                    +  +K+   ++GD  A   G+  WW+N+  +D ++
Sbjct: 113 CPETFAEVEGSSGRGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVSQWWYNRGDSDAVI 172

Query: 137 VLFLGSTSKSHKSGEFTDFF-LTGA--------------NGIFTGFSMEFVKRALDMDEV 196
           V+ L  T++ ++  +    F L G+              N  F+GF    +  A  ++  
Sbjct: 173 VIVLDVTNRENQLDQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFDPNIIAEAFKINIE 232

Query: 197 SVKSLVKNQTGTR--IVKLKEGTK--MPEPKMEHRNGMVLNCEEALLDVDV--------- 256
           + K L +NQ   R  I++        +P P+   ++G+    EE      +         
Sbjct: 233 TAKQL-QNQKDNRGNIIRANGPLHFVIPPPREWQQDGIANGIEETYCTAKIHENIDDPER 292

Query: 257 -----KNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSG 316
                   GR+  LN+ NLP++  V L A    L    M  P ++  +A  V Y+  G  
Sbjct: 293 SDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTA-NAHTVLYVTGGQA 352

Query: 317 RVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSIISTPNPVFTHLAGSI 357
           +++VV  +G+ V   +V  G + ++P+ F VSK     G EW S  +  N     L+G  
Sbjct: 353 KIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTNDNAYINTLSGQT 412

BLAST of Cla97C06G113120 vs. TAIR10
Match: AT1G03880.1 (cruciferin 2)

HSP 1 Score: 105.9 bits (263), Expect = 5.2e-23
Identity = 98/408 (24.02%), Positives = 166/408 (40.69%), Query Frame = 0

Query: 10  PKKIYGSDGGSYHSWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGI 69
           P +I  S+GG    W     P LR       +  IE  G  LP + ++ K+ +V+ G G+
Sbjct: 40  PSQIIKSEGGRIEVWD-HHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVVHGRGL 99

Query: 70  AGIIL-------------------------SESEEKVFGIKKGDAIALPFGIVTWWFNKE 129
            G ++                          +  +KV  ++ GD IA P G+  W++N  
Sbjct: 100 MGRVIPGCAETFMESPVFGEGXXXXXXXGFRDMHQKVEHLRCGDTIATPSGVAQWFYNNG 159

Query: 130 ATDLVVLFLG--STSKSHKSGEFTDFFLTG----------------ANGIFTGFSMEFVK 189
              L+++     +++++        F + G                 N IF GF+ E + 
Sbjct: 160 NEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNIFNGFAPEILA 219

Query: 190 RALDMDEVSVKSLVKNQTGTR--IVKLK-------------EGTKMPEPKMEHRNGM--- 249
           +A  ++ V     ++NQ   R  IVK+              EG + P    E  NG+   
Sbjct: 220 QAFKIN-VETAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPH---EIANGLEET 279

Query: 250 --VLNCEEAL-----LDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFS 309
              + C E L      DV   + G +  LN+ NLP++  + L A    +  +AM  P ++
Sbjct: 280 LCTMRCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLLRLSALRGSIRKNAMVLPQWN 339

Query: 310 CDSALQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDSEGMEWFSI 348
             +A    Y+  G   +++V  +G++V +  + +G L +VP+ F V K    E  EW   
Sbjct: 340 V-NANAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGEQFEWIEF 399

BLAST of Cla97C06G113120 vs. TAIR10
Match: AT2G28490.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 65.5 bits (158), Expect = 7.7e-11
Identity = 89/414 (21.50%), Positives = 169/414 (40.82%), Query Frame = 0

Query: 11  KKIYGSDGGSYH-SWSPKELPMLREGNIGASKLAIEKNGFALPCYSDSAKVAYVLQGNGI 70
           +++  S+GG      SP+   + +  +IG   L +E     +P Y DS+ + ++ QG   
Sbjct: 92  RQVIKSEGGEMRVVLSPRGRIIEKPMHIGF--LTMEPKTLFVPQYLDSSLLIFIRQGEAT 151

Query: 71  AGIILSE--SEEKVFGIKKGDAIALPFGIVTWWFNKE-ATDLVVLFLGSTSKSHKSGEFT 130
            G+I  +   E K   +K GD   +P G V +  N      L V+     ++S     F 
Sbjct: 152 LGVICKDEFGERK---LKAGDIYWIPAGSVFYLHNTGLGQRLHVICSIDPTQSLGFETFQ 211

Query: 131 DFFLTGA-NGIFTGFSMEFVKRALDMDEVSVKSLVKNQTGTRIVKLKEGTKMPEPK---- 190
            F++ G  + +  GF    +  A ++    ++ ++ +Q    IV + EG + P+P+    
Sbjct: 212 PFYIGGGPSSVLAGFDPHTLTSAFNVSLPELQQMMMSQFRGPIVYVTEGPQ-PQPQSTVW 271

Query: 191 ------------------MEHRNG------------------MVLN-------------C 250
                             +E + G                   +L+             C
Sbjct: 272 TQFLGLRGEEKHKQLKKLLETKQGSPQDQQYSSGWSWRNIVRSILDLTEEKNKGSGSSEC 331

Query: 251 EEALLDVDVKNG-------GRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSA 310
           E++    D K+        G  + L+  +   +   G+G  LV L   AM +P  +  +A
Sbjct: 332 EDSYNIYDKKDKPSFDNKYGWSIALDYDDYKPLKHSGIGVYLVNLTAGAMMAPHMN-PTA 391

Query: 311 LQVTYIVKGSGRVEVVGVDGKKVLETRVKAGNLFIVPRFF----VVSKIGDSEGMEWFSI 356
            +   ++ GSG ++VV  +G   + TRV  G++F +PR+F    + S+ G  E + + + 
Sbjct: 392 TEYGIVLAGSGEIQVVFPNGTSAMNTRVSVGDVFWIPRYFAFCQIASRTGPFEFVGFTTS 451
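
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from the raw counts as a consistency check; the function name `hsp_percentages` and the regular expression are illustrative and not part of any BLAST tooling.

```python
import re

# Matches the statistics line of an HSP header. "Posi?tives" also accepts the
# "Postives" spelling that appears in some renderings of this report.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


# Statistics for the AT1G03890.1 hit, copied from the report.
line = "Identity = 91/400 (22.75%), Positives = 165/400 (41.25%), Query Frame = 0"
print(hsp_percentages(line))
```

The same call on the AT1G03880.1 and AT2G28490.1 headers should reproduce the percentages printed there.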

The following BLAST results are available for this feature:
Match Name        E-value    Identity  Description
XP_004150394.1    1.6e-184   91.57     PREDICTED: glutelin type-B 5-like isoform X1 [Cucumis sativus] >KGN44409.1 hypot...
XP_008461502.1    3.7e-184   91.57     PREDICTED: glutelin type-B 5-like [Cucumis melo]
XP_011659088.1    6.3e-184   91.29     PREDICTED: 11S globulin seed storage protein 2-like isoform X2 [Cucumis sativus]
XP_023535755.1    1.2e-182   91.27     glutelin type-D 1-like [Cucurbita pepo subsp. pepo]
XP_022976927.1    1.7e-181   90.70     glutelin type-D 1-like [Cucurbita maxima]
Match Name                        E-value    Identity  Description
tr|A0A0A0K666|A0A0A0K666_CUCSA    1.1e-184   91.57     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G281380 PE=4 SV=1
tr|A0A1S3CG59|A0A1S3CG59_CUCME    2.4e-184   91.57     glutelin type-B 5-like OS=Cucumis melo OX=3656 GN=LOC103500083 PE=4 SV=1
tr|A0A151TM61|A0A151TM61_CAJCA    2.2e-161   78.09     Glutelin type-A 1 OS=Cajanus cajan OX=3821 GN=KK1_021772 PE=4 SV=1
tr|A0A022QLG5|A0A022QLG5_ERYGU    3.8e-161   78.09     Uncharacterized protein OS=Erythranthe guttata OX=4155 GN=MIMGU_mgv1a008978mg PE...
tr|W9RVS5|W9RVS5_9ROSA            1.1e-160   77.53     Glutelin type-A 1 OS=Morus notabilis OX=981085 GN=L484_018618 PE=4 SV=1
Match Name               E-value   Identity  Description
sp|P07728|GLUA1_ORYSJ    3.1e-25   24.07     Glutelin type-A 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA1 PE=1 SV=2
sp|Q09151|GLUA3_ORYSJ    3.1e-25   23.12     Glutelin type-A 3 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA3 PE=2 SV=2
sp|Q9XHP0|11S2_SESIN     4.0e-25   22.60     11S globulin seed storage protein 2 OS=Sesamum indicum OX=4182 PE=2 SV=1
sp|Q6K508|GLUD1_ORYSJ    4.0e-25   23.70     Glutelin type-D 1 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUD1 PE=2 SV=1
sp|P07730|GLUA2_ORYSJ    3.4e-24   22.58     Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1
Match Name     E-value    Identity  Description
AT2G28680.1    1.4e-153   73.60     RmlC-like cupins superfamily protein
AT1G07750.1    5.3e-153   72.47     RmlC-like cupins superfamily protein
AT1G03890.1    1.2e-24    22.75     RmlC-like cupins superfamily protein
AT1G03880.1    5.2e-23    24.02     cruciferin 2
AT2G28490.1    7.7e-11    21.50     RmlC-like cupins superfamily protein
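
The TAIR10 hits fall into two clearly separated groups by E-value. A minimal sketch of filtering them is shown below; the cutoff of 1e-50 and the name `strong_hits` are assumptions chosen for illustration and are not thresholds used by this database.

```python
# (match name, E-value, identity %) for the TAIR10 hits, copied from the table above.
tair10_hits = [
    ("AT2G28680.1", "1.4e-153", 73.60),
    ("AT1G07750.1", "5.3e-153", 72.47),
    ("AT1G03890.1", "1.2e-24", 22.75),
    ("AT1G03880.1", "5.2e-23", 24.02),
    ("AT2G28490.1", "7.7e-11", 21.50),
]


def strong_hits(hits, max_evalue=1e-50):
    """Return the names of hits whose E-value is at or below the cutoff."""
    # float() parses the scientific notation used in the report directly.
    return [name for name, evalue, _identity in hits if float(evalue) <= max_evalue]


print(strong_hits(tair10_hits))
```

With this illustrative cutoff, only the two hits with about 73% identity are retained.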
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0045735    nutrient reservoir activity
Vocabulary: INTERPRO
Term          Definition
IPR011051     RmlC_Cupin_sf
IPR014710     RmlC-like_jellyroll
IPR006045     Cupin_1
IPR006044     11S_seedstore_pln
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0045735       nutrient reservoir activity

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cla97C06G113120.1    Cla97C06G113120.1    mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term     IPR Description                       Source         Source Term          Source Description     Alignment
IPR006044    11-S seed storage protein, plant      PRINTS         PR00439              11SGLOBULIN
    coord: 258..274, score: 30.67
    coord: 276..291, score: 37.82
    coord: 316..333, score: 21.8
    coord: 211..231, score: 29.85
    coord: 294..312, score: 25.22
IPR006045    Cupin 1                               SMART          SM00835              Cupin_1_3
    coord: 194..339, e-value: 7.7E-16, score: 68.6
    coord: 3..157, e-value: 1.3E-29, score: 114.4
IPR006045    Cupin 1                               PFAM           PF00190              Cupin_1
    coord: 10..153, e-value: 2.6E-25, score: 88.7
    coord: 202..338, e-value: 3.1E-21, score: 75.4
IPR014710    RmlC-like jelly roll fold             GENE3D         G3DSA:2.60.120.10
    coord: 196..356, e-value: 2.1E-32, score: 113.7
IPR014710    RmlC-like jelly roll fold             GENE3D         G3DSA:2.60.120.10
    coord: 3..195, e-value: 1.5E-34, score: 121.4
None         No IPR available                      PANTHER        PTHR31189            FAMILY NOT NAMED
    coord: 1..356
None         No IPR available                      PANTHER        PTHR31189:SF24       SUBFAMILY NOT NAMED
    coord: 1..356
IPR011051    RmlC-like cupin domain superfamily    SUPERFAMILY    SSF51182             RmlC-like cupins
    coord: 9..345
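
The Pfam rows above report two Cupin_1 hits, at 10..153 and 202..338, one in each half of the 356-residue protein. The sketch below extracts `coord:` ranges and computes their lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `domain_spans` is a hypothetical helper name.

```python
import re

# Matches entries such as "coord: 10..153".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text."""
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        # Assumption: coordinates are 1-based and inclusive.
        spans.append((start, end, end - start + 1))
    return spans


# Pfam PF00190 (Cupin_1) alignment entries, copied from the annotation above.
pfam_cupin_1 = (
    "coord: 10..153 e-value: 2.6E-25 score: 88.7 "
    "coord: 202..338 e-value: 3.1E-21 score: 75.4"
)
for start, end, length in domain_spans(pfam_cupin_1):
    print(f"Cupin_1 {start}..{end}: {length} residues")
```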

The following gene(s) are orthologous to this gene:
Gene               Orthologue       Organism                        Block
Cla97C06G113120    Cla001230        Watermelon (97103) v1           wmwmbB409
Cla97C06G113120    ClCG06G003670    Watermelon (Charleston Gray)    wcgwmbB273
Cla97C06G113120    Lsi09G015670     Bottle gourd (USVL1VR-Ls)       lsiwmbB026
Cla97C06G113120    Bhi12G001659     Wax gourd                       wgowmbB099
The following gene(s) are paralogous to this gene:

None