HG10022501 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022501
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: glutelin type-D 1-like
Location: Chr05: 24903334 .. 24904515 (+)
RNA-Seq Expression: HG10022501
Synteny: HG10022501
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACATCGATTTGACCCCTCAATTGGCGAAGAAAGTCTATGGCGGTGATGGAGGCTCCTATTATTCGTGGTCTCCCAAGGAGCTTCCCATGCTCCGTCAAGGAAACATCGGCGCCTCCAAGCTCGCTCTTGAGAAGAATGGCTTCGCTCTTCCTCGCTACTCTGATTCCGCCAAGGTTGCTTACGTTCTTCAAGGTATATTTTCAATGCAAGTTAGTTTTGTGTGAATTTCAAATTTGTTTAGATATATTTTTAGTGTTTTTATTTTAGTATGGATAGATCTAATCGACTTGCTAAATGTTTTAGGCATTGGAGTAGCTGGAATCATTCTTCCGGAATCAGAGGAGAAGGTGATCACAATCAAGAAAGGAGATGCGATCGCTCTTCCATTTGGTGTGGTAACTTGGTGGTTCAACAAAGAAGCCACCGATTTGGTGGTTCTGTTCTTAGGCGACACATCAAAGGCTCACAAATTAGGCGAGTTCACTGACTTCTTCCTAACTGGCGCCAACGGAATCTTCACCGGCTTCTCAACGGAGTTCGTTGGGCGAGCTTGGGATATGGATGAGGCGTCAGTGAAATCTCTAGTACAAAACCAATTTGGAACCAGAATTGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAAGGGAAGAAGGATCATCGAAACGGAATGGTGTTGAATTGCAAGGAGGCGCCGCTGGATGTGGACGTGAAGAATGGAGGACGAGTTGTGGTTTTGAACACGAAGAACCTACCTCTAGTAGGGGAGGTAGGATTGGGAGCAGATCTAGTCCGATTGGACGGAAATGCAATGTGCTCGCCTGGATTCTCATGTGATTCGGCGCTGCAGGTAACGTACATCGTGAAAGGAAGCGGAAGAGCAGAGGTTGTAGGAGTGGATGGGAAGAAGGTTTTGGAGACAAGAGTGGAAGCTGGAAATTTGTTCATAGTACCAAGATTTTTCGTGGTATCGAAGATAGGAGATCCTGAAGGAATGGAGTGGTTCTCTATTATCAGCACTCCCAATCCTGTTTTCACTCACTTGGCTGGTAGCATTGGCGTTTGGAAGTCTCTTTCGCCAGAAGTTATTCAGGCAGCCTTCAATGTGGATGGTGATTTGGTGAAGAACTTCTCTTCCAAGAGGGCTTCTGATGCCATCTTCTTCCCTCCTTCCAATTAG

mRNA sequence

ATGGACATCGATTTGACCCCTCAATTGGCGAAGAAAGTCTATGGCGGTGATGGAGGCTCCTATTATTCGTGGTCTCCCAAGGAGCTTCCCATGCTCCGTCAAGGAAACATCGGCGCCTCCAAGCTCGCTCTTGAGAAGAATGGCTTCGCTCTTCCTCGCTACTCTGATTCCGCCAAGGTTGCTTACGTTCTTCAAGTATGGATAGATCTAATCGACTTGCTAAATGTTTTAGGCATTGGAGTAGCTGGAATCATTCTTCCGGAATCAGAGGAGAAGGTGATCACAATCAAGAAAGGAGATGCGATCGCTCTTCCATTTGGTGTGGTAACTTGGTGGTTCAACAAAGAAGCCACCGATTTGGTGGTTCTGTTCTTAGGCGACACATCAAAGGCTCACAAATTAGGCGAGTTCACTGACTTCTTCCTAACTGGCGCCAACGGAATCTTCACCGGCTTCTCAACGGAGTTCGTTGGGCGAGCTTGGGATATGGATGAGGCGTCAGTGAAATCTCTAGTACAAAACCAATTTGGAACCAGAATTGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAAGGGAAGAAGGATCATCGAAACGGAATGGTGTTGAATTGCAAGGAGGCGCCGCTGGATGTGGACGTGAAGAATGGAGGACGAGTTGTGGTTTTGAACACGAAGAACCTACCTCTAGTAGGGGAGGTAGGATTGGGAGCAGATCTAGTCCGATTGGACGGAAATGCAATGTGCTCGCCTGGATTCTCATGTGATTCGGCGCTGCAGGTAACGTACATCGTGAAAGGAAGCGGAAGAGCAGAGGTTGTAGGAGTGGATGGGAAGAAGGTTTTGGAGACAAGAGTGGAAGCTGGAAATTTGTTCATAGTACCAAGATTTTTCGTGGTATCGAAGATAGGAGATCCTGAAGGAATGGAGTGGTTCTCTATTATCAGCACTCCCAATCCTGTTTTCACTCACTTGGCTGGTAGCATTGGCGTTTGGAAGTCTCTTTCGCCAGAAGTTATTCAGGCAGCCTTCAATGTGGATGGTGATTTGGTGAAGAACTTCTCTTCCAAGAGGGCTTCTGATGCCATCTTCTTCCCTCCTTCCAATTAG

Coding sequence (CDS)

ATGGACATCGATTTGACCCCTCAATTGGCGAAGAAAGTCTATGGCGGTGATGGAGGCTCCTATTATTCGTGGTCTCCCAAGGAGCTTCCCATGCTCCGTCAAGGAAACATCGGCGCCTCCAAGCTCGCTCTTGAGAAGAATGGCTTCGCTCTTCCTCGCTACTCTGATTCCGCCAAGGTTGCTTACGTTCTTCAAGTATGGATAGATCTAATCGACTTGCTAAATGTTTTAGGCATTGGAGTAGCTGGAATCATTCTTCCGGAATCAGAGGAGAAGGTGATCACAATCAAGAAAGGAGATGCGATCGCTCTTCCATTTGGTGTGGTAACTTGGTGGTTCAACAAAGAAGCCACCGATTTGGTGGTTCTGTTCTTAGGCGACACATCAAAGGCTCACAAATTAGGCGAGTTCACTGACTTCTTCCTAACTGGCGCCAACGGAATCTTCACCGGCTTCTCAACGGAGTTCGTTGGGCGAGCTTGGGATATGGATGAGGCGTCAGTGAAATCTCTAGTACAAAACCAATTTGGAACCAGAATTGTGAAGTTGAAGGAAGGAACGAAGATGCCAGAAGGGAAGAAGGATCATCGAAACGGAATGGTGTTGAATTGCAAGGAGGCGCCGCTGGATGTGGACGTGAAGAATGGAGGACGAGTTGTGGTTTTGAACACGAAGAACCTACCTCTAGTAGGGGAGGTAGGATTGGGAGCAGATCTAGTCCGATTGGACGGAAATGCAATGTGCTCGCCTGGATTCTCATGTGATTCGGCGCTGCAGGTAACGTACATCGTGAAAGGAAGCGGAAGAGCAGAGGTTGTAGGAGTGGATGGGAAGAAGGTTTTGGAGACAAGAGTGGAAGCTGGAAATTTGTTCATAGTACCAAGATTTTTCGTGGTATCGAAGATAGGAGATCCTGAAGGAATGGAGTGGTTCTCTATTATCAGCACTCCCAATCCTGTTTTCACTCACTTGGCTGGTAGCATTGGCGTTTGGAAGTCTCTTTCGCCAGAAGTTATTCAGGCAGCCTTCAATGTGGATGGTGATTTGGTGAAGAACTTCTCTTCCAAGAGGGCTTCTGATGCCATCTTCTTCCCTCCTTCCAATTAG

Protein sequence

MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRIVKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASDAIFFPPSN
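
As a rough consistency check on the records above, the following sketch derives the gene span from the Location field and compares it with the coding length implied by the protein. The helper name span_length is hypothetical, and the protein length of 368 residues is counted from the protein sequence listed above rather than reported by the database.

```python
import re

def span_length(location: str) -> int:
    """Length of an inclusive 1-based interval written as 'start .. end'."""
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no coordinate range found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

# Location string as given in the Overview.
location = "Chr05: 24903334 .. 24904515 (+)"
gene_len = span_length(location)

# Assumption: 368 residues, counted from the protein sequence above.
protein_len = 368
# Each residue is one codon; the CDS also carries one stop codon.
cds_len = (protein_len + 1) * 3

# Whatever the gene span contains beyond the CDS is non-coding (intron).
non_coding_len = gene_len - cds_len

print(gene_len, cds_len, non_coding_len)
```

Under these assumptions the gene span exceeds the coding length by a few dozen bases, which is what one would expect if the only difference between the gene sequence and the CDS is a single short intron.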
Homology
BLAST of HG10022501 vs. NCBI nr
Match: XP_008461502.1 (PREDICTED: glutelin type-B 5-like [Cucumis melo] >KAA0052863.1 glutelin type-B 5-like [Cucumis melo var. makuwa] >TYK04362.1 glutelin type-B 5-like [Cucumis melo var. makuwa])

HSP 1 Score: 674.5 bits (1739), Expect = 5.3e-190
Identity = 336/368 (91.30%), Positives = 347/368 (94.29%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KK+YGGDGGSYYSWSPKELPMLR+GNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLKEGTKMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKEGTKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK+LSPEVIQAAFNV+ DLVKNFSSKR+SD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSD 356

Query: 361 AIFFPPSN 369
           AIFFPPSN
Sbjct: 361 AIFFPPSN 356
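
The percentages in an HSP summary line are simply the match counts divided by the alignment length. The sketch below shows one way to recompute them from the first hit's summary line; parse_hsp_stats is a hypothetical helper written for illustration, not part of BLAST or of this database.

```python
import re

def parse_hsp_stats(line: str) -> dict:
    """Extract 'Label = hits/length' fields and recompute each percentage."""
    stats = {}
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, length = int(hits), int(length)
        stats[label] = (hits, length, round(100 * hits / length, 2))
    return stats

# Summary line of the first NCBI nr hit (XP_008461502.1).
line = "Identity = 336/368 (91.30%), Positives = 347/368 (94.29%), Query Frame = 0"
stats = parse_hsp_stats(line)

for label, (hits, length, percent) in stats.items():
    print(f"{label}: {hits}/{length} = {percent}%")
```

The "Query Frame = 0" field is skipped because it has no hits/length fraction. The alignment length (368 here) counts gap columns as well, which is why it can exceed the length of the shorter sequence.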

BLAST of HG10022501 vs. NCBI nr
Match: XP_023535755.1 (glutelin type-D 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 669.8 bits (1727), Expect = 1.3e-188
Identity = 334/367 (91.01%), Positives = 345/367 (94.01%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLAKK+YG DGGSYYSWSPKELPMLR+GNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLK+G KMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKDGVKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIV+GSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVD DLVKNFSSKRASD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASD 355

Query: 361 AIFFPPS 368
           AIFFPPS
Sbjct: 361 AIFFPPS 355

BLAST of HG10022501 vs. NCBI nr
Match: XP_004150394.1 (glutelin type-D 1 [Cucumis sativus] >KGN44409.1 hypothetical protein Csa_015780 [Cucumis sativus])

HSP 1 Score: 669.5 bits (1726), Expect = 1.7e-188
Identity = 333/368 (90.49%), Positives = 346/368 (94.02%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KK+YG DGGSYY+WSPKELPMLR+GNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLKEGTKMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKEGTKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK+LSPEVI+AAFNV+ DLVKNFSSKR+SD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSD 356

Query: 361 AIFFPPSN 369
           AIFFPPSN
Sbjct: 361 AIFFPPSN 356

BLAST of HG10022501 vs. NCBI nr
Match: KAG6592225.1 (Glutelin type-D 1, partial [Cucurbita argyrosperma subsp. sororia] >KAG7025082.1 Glutelin type-B 5 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 668.7 bits (1724), Expect = 2.9e-188
Identity = 333/367 (90.74%), Positives = 345/367 (94.01%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLAKK+YG DGGSYYSWSPKELPMLR+GNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLK+G KMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKDGVKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIV+GSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSII+TPNPVFTHLAGSIGVWKSLSPEVIQAAFNVD DLVKNFSSKRASD
Sbjct: 301 KIGDPEGMEWFSIITTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASD 355

Query: 361 AIFFPPS 368
           AIFFPPS
Sbjct: 361 AIFFPPS 355

BLAST of HG10022501 vs. NCBI nr
Match: XP_022976927.1 (glutelin type-D 1-like [Cucurbita maxima])

HSP 1 Score: 666.8 bits (1719), Expect = 1.1e-187
Identity = 333/367 (90.74%), Positives = 345/367 (94.01%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLAKK+YG DGGSYYSWSPKELPMLR+GNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGCDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV++Q GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKSQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLK+G KMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKDGVKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIV+GSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVD DLVKNFSSKRASD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASD 355

Query: 361 AIFFPPS 368
           AIFFPPS
Sbjct: 361 AIFFPPS 355

BLAST of HG10022501 vs. ExPASy Swiss-Prot
Match: Q8GZP6 (11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occidentale OX=171929 PE=1 SV=1)

HSP 1 Score: 115.5 bits (288), Expect = 1.2e-24
Identity = 100/400 (25.00%), Positives = 165/400 (41.25%), Query Frame = 0

Query: 17  DGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLIDLLNV 76
           + G+  +W P      R   +   +  ++ NG  LP+YS++ ++ YV+Q           
Sbjct: 42  EAGTVEAWDPNH-EQFRCAGVALVRHTIQPNGLLLPQYSNAPQLIYVVQ----------- 101

Query: 77  LGIGVAGIILP----------------------ESEEKVITIKKGDAIALPFGVVTWWFN 136
            G G+ GI  P                      +  +K+   ++GD IA+P GV  W +N
Sbjct: 102 -GEGMTGISYPGCPETYQAPQQGRQQGQSGRFQDRHQKIRRFRRGDIIAIPAGVAHWCYN 161

Query: 137 KEATDLVVLFLGDTS-----------KAHKLGEFTDFF------LTGANGIFTGFSTEFV 196
           +  + +V + L D S           K H  G   D F       +    +F+GF TE +
Sbjct: 162 EGNSPVVTVTLLDVSNSQNQLDRTPRKFHLAGNPKDVFQQQQQHQSRGRNLFSGFDTELL 221

Query: 197 GRAWDMDEASVKSLVQNQFGTRIVKLKE---------------GTKMPEGKKDHR----- 256
             A+ +DE  +K L        IVK+K+               G++  E  +D +     
Sbjct: 222 AEAFQVDERLIKQLKSEDNRGGIVKVKDDELRVIRPSRSQSERGSESEEESEDEKRRWGQ 281

Query: 257 --NGM-----VLNCKE-----APLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGN 316
             NG+      +  KE     A  D+     GR+  LN+ NLP++  + L  +   L  N
Sbjct: 282 RDNGIEETICTMRLKENINDPARADIYTPEVGRLTTLNSLNLPILKWLQLSVEKGVLYKN 341

Query: 317 AMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVSKIGDP 346
           A+  P ++ +S   + Y  KG G+ +VV   G +V +  V  G + +VP+ F V K    
Sbjct: 342 ALVLPHWNLNSH-SIIYGCKGKGQVQVVDNFGNRVFDGEVREGQMLVVPQNFAVVKRARE 401

BLAST of HG10022501 vs. ExPASy Swiss-Prot
Match: Q9XHP0 (11S globulin seed storage protein 2 OS=Sesamum indicum OX=4182 PE=1 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.3e-23
Identity = 87/418 (20.81%), Positives = 180/418 (43.06%), Query Frame = 0

Query: 17  DGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLIDLLNV 76
           +GG+   W  ++    +   I A +  +  NG +LP Y  S ++ Y+ +           
Sbjct: 51  EGGTTELWDERQ-EQFQCAGIVAMRSTIRPNGLSLPNYHPSPRLVYIER----------- 110

Query: 77  LGIGVAGIILP------------------------------ESEEKVITIKKGDAIALPF 136
            G G+  I++P                              +  +KV  +++GD +A+P 
Sbjct: 111 -GQGLISIMVPGCAETYQVHRSQRTMERTEASEQQDRGSVRDLHQKVHRLRQGDIVAIPS 170

Query: 137 GVVTWWFNKEATDLVVLFLGDTSK-AHKLGE-FTDFFLTGA---------------NGIF 196
           G   W +N  + DLV + + D +  +++L + F  F+L G                + IF
Sbjct: 171 GAAHWCYNDGSEDLVAVSINDVNHLSNQLDQKFRAFYLAGGVPRSGEQEQQARQTFHNIF 230

Query: 197 TGFSTEFVGRAWDMDEASVKSL--VQNQFGTRIVKLKEGTKM----PEGKKDHRNGMVLN 256
             F  E +  A+++ + +++ +   + + G  ++  +  T +     EG+++HR   + N
Sbjct: 231 RAFDAELLSEAFNVPQETIRRMQSEEEERGLIVMARERMTFVRPDEEEGEQEHRGRQLDN 290

Query: 257 CKEAPL---------------DVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGNAMC 316
             E                  D+  +  GRV V++   LP++  + L A+   L  NA+ 
Sbjct: 291 GLEETFCTMKFRTNVESRREADIFSRQAGRVHVVDRNKLPILKYMDLSAEKGNLYSNALV 350

Query: 317 SPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVSKIGDPEGM 367
           SP +S  +   + Y+ +G  + +VV  +G+ ++  RV  G +F+VP+++  +      G 
Sbjct: 351 SPDWSM-TGHTIVYVTRGDAQVQVVDHNGQALMNDRVNQGEMFVVPQYYTSTARAGNNGF 410

BLAST of HG10022501 vs. ExPASy Swiss-Prot
Match: P05190 (Legumin type B OS=Vicia faba OX=3906 GN=LEB4 PE=3 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 1.5e-22
Identity = 101/425 (23.76%), Positives = 177/425 (41.65%), Query Frame = 0

Query: 17  DGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLIDLLNV 76
           + G   +W+P   P LR   +   +  ++ NG  LP YS S ++ Y++Q           
Sbjct: 50  EAGLTETWNPNH-PELRCAGVSLIRRTIDPNGLHLPSYSPSPQLIYIIQ----------- 109

Query: 77  LGIGVAGIIL-----------------------PESEEKVITIKKGDAIALPFGVVTWWF 136
            G GV G+ L                       P+S +K+   +KGD IA+P G+  W +
Sbjct: 110 -GKGVIGLTLPGCPQTYQEPRSSQSRQGSRQQQPDSHQKIRRFRKGDIIAIPSGIPYWTY 169

Query: 137 NKEATDLVVLFLGDTSKAHKLGEFTD--FFLTG--------------------------- 196
           N     LV + L DTS      + T   F+L G                           
Sbjct: 170 NNGDEPLVAISLLDTSNIANQLDSTPRVFYLVGNPEVEFPETQEEQQERHQQKHSLPVGR 229

Query: 197 ----------------ANGIFTGFSTEFVGRAWDMDEASVKSL-VQNQFGTRIVKLKEGT 256
                            N + +GFS+EF+   ++ +E + K L        +IV+++ G 
Sbjct: 230 RGGQHQQEEESEEQKDGNSVLSGFSSEFLAHTFNTEEDTAKRLRSPRDKRNQIVRVEGGL 289

Query: 257 KM--PEGKKDH--------------RNGM-----VLNCKE---APLDVDVKN--GGRVVV 316
           ++  PEG+++               RNG+      L  +E    P   D+ N   G +  
Sbjct: 290 RIINPEGQQEEEEEEEEEKQRSEQGRNGLEETICSLKIRENIAQPARADLYNPRAGSIST 349

Query: 317 LNTKNLPLVGEVGLGADLVRLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVL 346
            N+  LP++  + L A+ VRL  N + +P ++  +A  + Y+++G GR  +V   G  V 
Sbjct: 350 ANSLTLPILRYLRLSAEYVRLYRNGIYAPHWNI-NANSLLYVIRGEGRVRIVNSQGNAVF 409

BLAST of HG10022501 vs. ExPASy Swiss-Prot
Match: P12615 (12S seed storage globulin 1 OS=Avena sativa OX=4498 PE=2 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 1.5e-22
Identity = 100/431 (23.20%), Positives = 175/431 (40.60%), Query Frame = 0

Query: 33  RQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLIDLLNVLGIGVAGIILP----- 92
           R   +   +  +E  G  LP+Y ++  + Y+LQ            G G  G+  P     
Sbjct: 77  RCAGVSVIRRVIEPQGLLLPQYHNAPGLVYILQ------------GRGFTGLTFPGCPAT 136

Query: 93  ------------------------ESEEKVITIKKGDAIALPFGVVTWWFNKEATDLVVL 152
                                   +  ++V  IK+GD +ALP G+V W +N     +V +
Sbjct: 137 FQQQFQQFDQARFAQGQSKSQNLKDEHQRVHHIKQGDVVALPAGIVHWCYNDGDAPIVAV 196

Query: 153 FLGD-TSKAHKL-GEFTDFFLTGAN--------GIFTGFSTEFVGRAWDM-DEASVKSLV 212
           ++ D  + A++L     +F L G N         IF+GFS + +  A  +  +A+ K   
Sbjct: 197 YVFDVNNNANQLEPRQKEFLLAGNNKREQQFGQNIFSGFSVQLLSEALGISQQAAQKIQS 256

Query: 213 QNQFGTRIVKLKEGTKM------PEGKKDHR----------------------------- 272
           QN     I+++ +G +        +G  +H+                             
Sbjct: 257 QNDQRGEIIRVSQGLQFLKPFVSQQGPVEHQAYQPIQSQQEQSTQYQVGQSPQYQEGQST 316

Query: 273 ------------NGMVLN-CK-------EAPLDVDVKN--GGRVVVLNTKNLPLVGEVGL 332
                       NG+  N C        E P   D  N   GR+  LN+KN P +  V +
Sbjct: 317 QYQSGQSWDQSFNGLEENFCSLEARQNIENPKRADTYNPRAGRITHLNSKNFPTLNLVQM 376

Query: 333 GADLVRLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPR 367
            A  V L  NA+ SP ++  +A  V ++++G  R +VV   G+ V    +  G L I+P+
Sbjct: 377 SATRVNLYQNAILSPYWNI-NAHSVMHMIQGRARVQVVNNHGQTVFNDILRRGQLLIIPQ 436

BLAST of HG10022501 vs. ExPASy Swiss-Prot
Match: P09800 (Legumin B OS=Gossypium hirsutum OX=3635 GN=LEGB PE=2 SV=1)

HSP 1 Score: 105.5 bits (262), Expect = 1.3e-21
Identity = 100/465 (21.51%), Positives = 180/465 (38.71%), Query Frame = 0

Query: 11  KKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDL 70
           K  +  + G    W   E    +   +   +  +++ G  LP ++ +  + YV Q     
Sbjct: 61  KHRFRSEAGETEFWDQNE-DQFQCAGVAFLRHKIQRKGLLLPSFTSAPMLFYVEQ----- 120

Query: 71  IDLLNVLGIGVAGIILP--------------------ESEEKVITIKKGDAIALPFGVVT 130
                  G G+ G + P                    +  +K+  +K+GD +ALP GV  
Sbjct: 121 -------GEGIHGAVFPGCPETYQSQSQQNIQDRPQRDQHQKLRRLKEGDVVALPAGVAH 180

Query: 131 WWFNKEATDLVVLFLGDT-SKAHKLGE-FTDFFL-------------------------- 190
           W FN   + LV++ L D  + A++L E F  FFL                          
Sbjct: 181 WIFNNGRSQLVLVALVDVGNDANQLDENFRKFFLAGSPQGGVVRGGQSRDRNQRQSRTQR 240

Query: 191 ----------TGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTR--IVKLKEGTKMP 250
                     +G N + +GF    + +A+ +D    + L QN+   R  IV+++ G + P
Sbjct: 241 GEREEEESQESGGNNVLSGFRDNLLAQAFGIDTRLARKL-QNERDNRGAIVRMEHGFEWP 300

Query: 251 E-----------------------------------------GKKDHRNG-------MVL 310
           E                                         G++   NG       M L
Sbjct: 301 EEGQRRQGREEEGEEEREPKWQRRQESQEEGSEEEEREERGRGRRRSGNGLEETFCSMRL 360

Query: 311 NCKEAPLDVDVKN--GGRVVVLNTKNLPLVGEVGLGADLVRLDGNAMCSPGFSCDSALQV 366
             +      DV N  GGR+  +N+ NLP++  + L A+   L  NA+ +P ++  +A  +
Sbjct: 361 KHRTPASSADVFNPRGGRITTVNSFNLPILQYLQLSAERGVLYNNAIYAPHWNM-NAHSI 420

BLAST of HG10022501 vs. ExPASy TrEMBL
Match: A0A5A7UAB0 (Glutelin type-B 5-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold675G00320 PE=3 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 2.6e-190
Identity = 336/368 (91.30%), Positives = 347/368 (94.29%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KK+YGGDGGSYYSWSPKELPMLR+GNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLKEGTKMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKEGTKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK+LSPEVIQAAFNV+ DLVKNFSSKR+SD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSD 356

Query: 361 AIFFPPSN 369
           AIFFPPSN
Sbjct: 361 AIFFPPSN 356

BLAST of HG10022501 vs. ExPASy TrEMBL
Match: A0A1S3CG59 (glutelin type-B 5-like OS=Cucumis melo OX=3656 GN=LOC103500083 PE=3 SV=1)

HSP 1 Score: 674.5 bits (1739), Expect = 2.6e-190
Identity = 336/368 (91.30%), Positives = 347/368 (94.29%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KK+YGGDGGSYYSWSPKELPMLR+GNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLKEGTKMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKEGTKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK+LSPEVIQAAFNV+ DLVKNFSSKR+SD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSD 356

Query: 361 AIFFPPSN 369
           AIFFPPSN
Sbjct: 361 AIFFPPSN 356

BLAST of HG10022501 vs. ExPASy TrEMBL
Match: A0A0A0K666 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G281380 PE=3 SV=1)

HSP 1 Score: 669.5 bits (1726), Expect = 8.2e-189
Identity = 333/368 (90.49%), Positives = 346/368 (94.02%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KK+YG DGGSYY+WSPKELPMLR+GNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLKEGTKMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKEGTKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK+LSPEVI+AAFNV+ DLVKNFSSKR+SD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSD 356

Query: 361 AIFFPPSN 369
           AIFFPPSN
Sbjct: 361 AIFFPPSN 356

BLAST of HG10022501 vs. ExPASy TrEMBL
Match: A0A6J1IH21 (glutelin type-D 1-like OS=Cucurbita maxima OX=3661 GN=LOC111477153 PE=3 SV=1)

HSP 1 Score: 666.8 bits (1719), Expect = 5.3e-188
Identity = 333/367 (90.74%), Positives = 345/367 (94.01%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLAKK+YG DGGSYYSWSPKELPMLR+GNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGCDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV++Q GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKSQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLK+G KMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKDGVKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIV+GSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVD DLVKNFSSKRASD
Sbjct: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASD 355

Query: 361 AIFFPPS 368
           AIFFPPS
Sbjct: 361 AIFFPPS 355

BLAST of HG10022501 vs. ExPASy TrEMBL
Match: A0A6J1EX25 (glutelin type-D 1-like OS=Cucurbita moschata OX=3662 GN=LOC111439037 PE=3 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 3.4e-187
Identity = 331/367 (90.19%), Positives = 344/367 (93.73%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M++DLTPQLAKK+Y  DGGSYYSWSPKELPMLR+GNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEMDLTPQLAKKIYVSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G GVAGIILPESEEKVI IKKGDAIALPFGVVTWWFNKEATDL
Sbjct: 61  AYVLQ------------GNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLGDTSKAHK GEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLV+NQ GT I
Sbjct: 121 VVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKLK+G KMPE KK+HRNGM LNC+EAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV
Sbjct: 181 VKLKDGVKMPEPKKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           RLDG+AMCSPGFSCDSALQVTYIV+GSGRAEVVGVDGKKVLETRV+AGNLFIVPRFFVVS
Sbjct: 241 RLDGSAMCSPGFSCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KIGDPEGMEWFSII+TPNPVFTHLAGSIGVWKSLSPEVIQAAFNVD DLVKNFSSKRASD
Sbjct: 301 KIGDPEGMEWFSIITTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASD 355

Query: 361 AIFFPPS 368
           AIFFPPS
Sbjct: 361 AIFFPPS 355

BLAST of HG10022501 vs. TAIR 10
Match: AT2G28680.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 554.3 bits (1427), Expect = 7.4e-158
Identity = 269/368 (73.10%), Positives = 306/368 (83.15%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M++DL+P+L KKVYGGDGGSY++W P+ELPMLR GNIGASKLALEK G ALPRYSDS KV
Sbjct: 1   MELDLSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G G AGI+LPE EEKVI IKKGD+IALPFGVVTWWFN E T+L
Sbjct: 61  AYVLQ------------GAGTAGIVLPEKEEKVIAIKKGDSIALPFGVVTWWFNNEDTEL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           VVLFLG+T K HK G+FTDF+LTG+NGIFTGFSTEFVGRAWD+DE +VK LV +Q G  I
Sbjct: 121 VVLFLGETHKGHKAGQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VK+    KMPE KK  R G VLNC EAPLDVD+K+GGRVVVLNTKNLPLVGEVG GADLV
Sbjct: 181 VKVDASLKMPEPKKGDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           R+DG++MCSPGFSCDSALQVTYIV GSGR ++VG DGK+VLET V+AG LFIVPRFFVVS
Sbjct: 241 RIDGHSMCSPGFSCDSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KI D +G+ WFSI++TP+P+FTHLAG   VWK+LSPEV+QAAF VD ++ K F SKR SD
Sbjct: 301 KIADSDGLSWFSIVTTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSD 356

Query: 361 AIFFPPSN 369
           AIFF PSN
Sbjct: 361 AIFFSPSN 356

BLAST of HG10022501 vs. TAIR 10
Match: AT1G07750.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 550.4 bits (1417), Expect = 1.1e-156
Identity = 265/368 (72.01%), Positives = 310/368 (84.24%), Query Frame = 0

Query: 1   MDIDLTPQLAKKVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKV 60
           M++DLTP+L KKVYGGDGGSY +W P+ELPML+QGNIGA+KLALEKNGFA+PRYSDS+KV
Sbjct: 1   MELDLTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKV 60

Query: 61  AYVLQVWIDLIDLLNVLGIGVAGIILPESEEKVITIKKGDAIALPFGVVTWWFNKEATDL 120
           AYVLQ            G G AGI+LPE EEKVI IK+GD+IALPFGVVTWWFN E  +L
Sbjct: 61  AYVLQ------------GSGTAGIVLPEKEEKVIAIKQGDSIALPFGVVTWWFNNEDPEL 120

Query: 121 VVLFLGDTSKAHKLGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTRI 180
           V+LFLG+T K HK G+FT+F+LTG NGIFTGFSTEFVGRAWD+DE +VK LV +Q G  I
Sbjct: 121 VILFLGETHKGHKAGQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGI 180

Query: 181 VKLKEGTKMPEGKKDHRNGMVLNCKEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLV 240
           VKL  G KMP+ K+++R G VLNC EAPLDVD+K+GGRVVVLNTKNLPLVGEVG GADLV
Sbjct: 181 VKLDAGFKMPQPKEENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLV 240

Query: 241 RLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVS 300
           R+D ++MCSPGFSCDSALQVTYIV GSGR +VVG DGK+VLET ++AG+LFIVPRFFVVS
Sbjct: 241 RIDAHSMCSPGFSCDSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVS 300

Query: 301 KIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDGDLVKNFSSKRASD 360
           KI D +GM WFSI++TP+P+FTHLAG+  VWKSLSPEV+QAAF V  ++ K+F S R S 
Sbjct: 301 KIADADGMSWFSIVTTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSS 356

Query: 361 AIFFPPSN 369
           AIFFPPSN
Sbjct: 361 AIFFPPSN 356

BLAST of HG10022501 vs. TAIR 10
Match: AT1G03880.1 (cruciferin 2)

HSP 1 Score: 95.9 bits (237), Expect = 7.2e-20
Identity = 99/405 (24.44%), Positives = 163/405 (40.25%), Query Frame = 0

Query: 12  KVYGGDGGSYYSWSPKELPMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLI 71
           ++   +GG    W     P LR       +  +E  G  LP + ++ K+ +V        
Sbjct: 42  QIIKSEGGRIEVWD-HHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFV-------- 101

Query: 72  DLLNVLGIGVAGIILP-------------------------ESEEKVITIKKGDAIALPF 131
               V G G+ G ++P                         +  +KV  ++ GD IA P 
Sbjct: 102 ----VHGRGLMGRVIPGCAETFMESPVFGEGQGQGQSQGFRDMHQKVEHLRCGDTIATPS 161

Query: 132 GVVTWWFNKEATDLVVLFLGD-TSKAHKLG-EFTDFFLTG----------------ANGI 191
           GV  W++N     L+++   D  S  ++L      F + G                 N I
Sbjct: 162 GVAQWFYNNGNEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNI 221

Query: 192 FTGFSTEFVGRAWDMDEASVKSLVQNQFGTR--IVKLKE--GTKMPE------GKKDHR- 251
           F GF+ E + +A+ ++  + + L QNQ   R  IVK+    G   P       G++ H  
Sbjct: 222 FNGFAPEILAQAFKINVETAQQL-QNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPHEI 281

Query: 252 -NGM-----VLNCKE---APLDVDV--KNGGRVVVLNTKNLPLVGEVGLGADLVRLDGNA 311
            NG+      + C E    P D DV   + G +  LN+ NLP++  + L A    +  NA
Sbjct: 282 ANGLEETLCTMRCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLLRLSALRGSIRKNA 341

Query: 312 MCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVSKIGDPE 352
           M  P ++  +A    Y+  G    ++V  +G++V +  + +G L +VP+ F V K    E
Sbjct: 342 MVLPQWNV-NANAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGE 401

BLAST of HG10022501 vs. TAIR 10
Match: AT5G44120.3 (RmlC-like cupins superfamily protein)

HSP 1 Score: 89.0 bits (219), Expect = 8.8e-18
Identity = 90/387 (23.26%), Positives = 150/387 (38.76%), Query Frame = 0

Query: 30  PMLRQGNIGASKLALEKNGFALPRYSDSAKVAYVLQVWIDLIDLLNVLGIGVAGIILP-- 89
           P LR   +  ++  +E  G  LP + ++AK+++V +            G G+ G ++P  
Sbjct: 65  PQLRCSGVSFARYIIESKGLYLPSFFNTAKLSFVAK------------GRGLMGKVIPGC 124

Query: 90  ------------------------ESEEKVITIKKGDAIALPFGVVTWWFNKEATDLVVL 149
                                   +  +KV  I+ GD IA   GV  W++N     LV++
Sbjct: 125 AETFQDSSEFQPRFEGQGQSQRFRDMHQKVEHIRSGDTIATTPGVAQWFYNDGQEPLVIV 184

Query: 150 FLGD-TSKAHKLGEF-TDFFLTGAN----------------GIFTGFSTEFVGRAWDMDE 209
            + D  S  ++L      F+L G N                 IF GF  E + +A  +D 
Sbjct: 185 SVFDLASHQNQLDRNPRPFYLAGNNPQGQVWLQGREQQPQKNIFNGFGPEVIAQALKIDL 244

Query: 210 ASVKSL------------VQNQFGTRIVKLKEGTKMPEGKKDHRNGMVLNCKEAPL---- 269
            + + L            VQ  FG     L+      E +++ R+G   N  E  +    
Sbjct: 245 QTAQQLQNQDDNRGNIVRVQGPFGVIRPPLRGQRPQEEEEEEGRHGRHGNGLEETICSAR 304

Query: 270 -----------DVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGNAMCSPGFSCDSAL 329
                      DV     G +  LN+ +LP++  + L A    +  NAM  P ++  +A 
Sbjct: 305 CTDNLDDPSRADVYKPQLGYISTLNSYDLPILRFIRLSALRGSIRQNAMVLPQWNA-NAN 364

Query: 330 QVTYIVKGSGRAEVVGVDGKKVLETRVEAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPN 346
            + Y+  G  + ++V  +G +V + +V  G L  VP+ F V K       +W    +  N
Sbjct: 365 AILYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQGFSVVKRATSNRFQWVEFKTNAN 424

BLAST of HG10022501 vs. TAIR 10
Match: AT4G28520.1 (cruciferin 3)

HSP 1 Score: 82.8 bits (203), Expect = 6.3e-16
Identity = 76/301 (25.25%), Positives = 129/301 (42.86%), Query Frame = 0

Query: 88  ESEEKVITIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAHKLGEFTD--FFLTGA 147
           +  +KV  +++GD  A   G   W +N     LV++ L D +      +     F L G 
Sbjct: 191 DMHQKVEHVRRGDVFANTPGSAHWIYNSGEQPLVIIALLDIANYQNQLDRNPRVFHLAGN 250

Query: 148 N---------------GIFTGFSTEFVGRAWDMDEASVKSLVQNQFGTR--IVKLK---- 207
           N                +++GF  + + +A  +D    + L QNQ  +R  IV++K    
Sbjct: 251 NQQGGFGGSQQQQEQKNLWSGFDAQVIAQALKIDVQLAQQL-QNQQDSRGNIVRVKGPFQ 310

Query: 208 ----------EGTKMPEGKKDHRNGM---VLNCKE-------APLDVDVKNGGRVVVLNT 267
                     E  +    +    NG+   + + +        A  DV   + GRV  +N+
Sbjct: 311 VVRPPLRQPYESEEWRHPRSPQGNGLEETICSMRSHENIDDPARADVYKPSLGRVTSVNS 370

Query: 268 KNLPLVGEVGLGADLVRLDGNAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETR 327
             LP++  V L A    L GNAM  P ++  +A ++ Y   G GR +VV  +G+ VL+ +
Sbjct: 371 YTLPILEYVRLSATRGVLQGNAMVLPKYNM-NANEILYCTGGQGRIQVVNDNGQNVLDQQ 430

Query: 328 VEAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFN 346
           V+ G L ++P+ F           EW S  +  N + + LAG   + ++L  EVI   F 
Sbjct: 431 VQKGQLVVIPQGFAYVVQSHGNKFEWISFKTNENAMISTLAGRTSLLRALPLEVISNGFQ 489
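
The percentages in the HSP summary lines above follow directly from the reported counts. The sketch below reproduces that arithmetic and counts identities and positives from a single midline row. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution, and positives include identities); the helper names `percent` and `midline_counts` are illustrative and not part of any BLAST tool.

```python
from typing import Tuple


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


def midline_counts(midline: str) -> Tuple[int, int]:
    """Count (identities, positives) in one BLAST midline row.

    Assumption: letters are identical residues, '+' is a conservative
    substitution, and positives are identities plus '+' columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


# Counts reported for the AT1G07750.1 HSP: 265 identities and
# 310 positives over 368 alignment columns.
identity_pct = percent(265, 368)
positive_pct = percent(310, 368)

# Final midline row of an alignment above ("AIFF PSN"): one mismatch column.
last_row = midline_counts("AIFF PSN")

print(identity_pct, positive_pct, last_row)
```

Summing `midline_counts` over every row of an alignment should give the numerators in the corresponding `Identity` and `Positives` fields, provided the rows are copied with their original spacing.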

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_008461502.1  5.3e-190  91.30         PREDICTED: glutelin type-B 5-like [Cucumis melo] >KAA0052863.1 glutelin type-B 5...
XP_023535755.1  1.3e-188  91.01         glutelin type-D 1-like [Cucurbita pepo subsp. pepo]
XP_004150394.1  1.7e-188  90.49         glutelin type-D 1 [Cucumis sativus] >KGN44409.1 hypothetical protein Csa_015780 ...
KAG6592225.1    2.9e-188  90.74         Glutelin type-D 1, partial [Cucurbita argyrosperma subsp. sororia] >KAG7025082.1...
XP_022976927.1  1.1e-187  90.74         glutelin type-D 1-like [Cucurbita maxima]

Match Name  E-value  Identity (%)  Description
Q8GZP6      1.2e-24  25.00         11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occident...
Q9XHP0      2.3e-23  20.81         11S globulin seed storage protein 2 OS=Sesamum indicum OX=4182 PE=1 SV=1
P05190      1.5e-22  23.76         Legumin type B OS=Vicia faba OX=3906 GN=LEB4 PE=3 SV=1
P12615      1.5e-22  23.20         12S seed storage globulin 1 OS=Avena sativa OX=4498 PE=2 SV=1
P09800      1.3e-21  21.51         Legumin B OS=Gossypium hirsutum OX=3635 GN=LEGB PE=2 SV=1

Match Name  E-value   Identity (%)  Description
A0A5A7UAB0  2.6e-190  91.30         Glutelin type-B 5-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold6...
A0A1S3CG59  2.6e-190  91.30         glutelin type-B 5-like OS=Cucumis melo OX=3656 GN=LOC103500083 PE=3 SV=1
A0A0A0K666  8.2e-189  90.49         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G281380 PE=3 SV=1
A0A6J1IH21  5.3e-188  90.74         glutelin type-D 1-like OS=Cucurbita maxima OX=3661 GN=LOC111477153 PE=3 SV=1
A0A6J1EX25  3.4e-187  90.19         glutelin type-D 1-like OS=Cucurbita moschata OX=3662 GN=LOC111439037 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT2G28680.1  7.4e-158  73.10         RmlC-like cupins superfamily protein
AT1G07750.1  1.1e-156  72.01         RmlC-like cupins superfamily protein
AT1G03880.1  7.2e-20   24.44         cruciferin 2
AT5G44120.3  8.8e-18   23.26         RmlC-like cupins superfamily protein
AT4G28520.1  6.3e-16   25.25         cruciferin 3

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source       Source Term     Source Description            Alignment
IPR006044  11-S seed storage protein, plant    PRINTS       PR00439         11SGLOBULIN                   coord: 328..345 (score: 23.21); coord: 223..243 (score: 29.85); coord: 306..324 (score: 25.22); coord: 288..303 (score: 37.82); coord: 270..286 (score: 27.02)
IPR006045  Cupin 1                             SMART        SM00835         Cupin_1_3                     coord: 3..169 (e-value: 3.7E-24, score: 96.3); coord: 209..351 (e-value: 7.7E-16, score: 68.6)
IPR006045  Cupin 1                             PFAM         PF00190         Cupin_1                       coord: 11..167 (e-value: 7.7E-21, score: 74.2); coord: 211..348 (e-value: 2.7E-22, score: 79.0)
IPR014710  RmlC-like jelly roll fold           GENE3D       2.60.120.10     Jelly Rolls                   coord: 207..368 (e-value: 2.3E-47, score: 162.4)
IPR014710  RmlC-like jelly roll fold           GENE3D       2.60.120.10     Jelly Rolls                   coord: 3..206 (e-value: 2.3E-29, score: 104.4)
None       No IPR available                    PANTHER      PTHR31189       OS03G0336100 PROTEIN-RELATED  coord: 2..358
None       No IPR available                    PANTHER      PTHR31189:SF51  SUBFAMILY NOT NAMED           coord: 2..358
None       No IPR available                    CDD          cd02243         cupin_11S_legumin_C           coord: 210..365 (e-value: 4.88442E-73, score: 222.349)
None       No IPR available                    CDD          cd02242         cupin_11S_legumin_N           coord: 4..185 (e-value: 4.51634E-64, score: 201.275)
IPR011051  RmlC-like cupin domain superfamily  SUPERFAMILY  51182           RmlC-like cupins              coord: 16..357
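
The `coord: start..end` entries in the InterPro alignment column are residue ranges on the protein. As a minimal sketch, assuming the ranges are 1-based and inclusive (the usual convention for such domain hits, but not stated on this page), the snippet below extracts them and computes each span's length. The names `parse_coords` and `span_length` are hypothetical helpers, and the input string is the pair of Pfam Cupin_1 hits listed above.

```python
import re

# Matches "coord: 11..167" and captures the two integers.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text: str) -> list:
    """Return every (start, end) pair found in an alignment string."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start: int, end: int) -> int:
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# The two Pfam PF00190 (Cupin_1) hits reported for this gene.
pfam_hits = parse_coords("coord: 11..167 coord: 211..348")
lengths = [span_length(start, end) for start, end in pfam_hits]

# The hits do not overlap if the first ends before the second starts.
non_overlapping = pfam_hits[0][1] < pfam_hits[1][0]

print(pfam_hits, lengths, non_overlapping)
```

The two non-overlapping spans correspond to the N-terminal and C-terminal cupin domains that the CDD hits (`cupin_11S_legumin_N`, `cupin_11S_legumin_C`) also report.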

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10022501.1  HG10022501.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005783      endoplasmic reticulum
cellular_component  GO:0000326      protein storage vacuole
molecular_function  GO:0045735      nutrient reservoir activity