HG10022502 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022502
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: glutelin type-B 5-like
Location: Chr05: 24908733 .. 24909967 (+)
RNA-Seq Expression: HG10022502
Synteny: HG10022502
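
The Location field can be turned into a gene span with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (as in GFF3) and that the field always has the `Chrom: start .. end (strand)` shape shown in this record. The function name `parse_location` is illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr05: 24908733 .. 24909967 (+)")
span = end - start + 1  # assumption: both endpoints are included
print(chrom, strand, span)  # Chr05 + 1235
```

Under that assumption the gene spans 1235 bp on the plus strand of Chr05.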
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATATCGATTTGACTCCTCAATTGCCCAAGAAGATCTATGGTGGTGATGGAGGTTCCTATTATTCATGGTCTCCCAAAGAACTTCCCATGCTCCGTGAAGGAAACATTGGTGCATCCAAGCTCGCTCTTGAGAAGAATGGATTTGCTCTCCCTCGTTACTCTGATTCTGCTAAGGTTGCTTATGTTCTTCAAGGTAATTTGTCAATTTTGATCACATATGGAACTGCTATTGTTGTTTTAAGTTTCTTTGGATCTTAAGATCGGATTTAGTTTTTGTGAATTTCAAATCTGTTTAGATCTATTTAGATCTAGTGTTTGATTTCAGTAAAAGGATCTGATCAGTGTGCAAAAATGTATTAGGTAATGGAGTAGTCGGAATCATTCTGCCGGAAAAGGAGGAGAAGGTGGTCGCAATTAAGAAAGGAGATGCGATTGCTCTTCCATTCGGTGTGGTGACGTGGTGGTTCAATAAAGAAGCCACTGATTTGGTGGTTCTGTTCTTAGGCGATACATCAAAGGCTCACAAGTCAGGTGAGTTTACCGACTTCTTCCTAACCGGTGCCAATGGAATCTTCACCGGCTTCTCCACGGAGTTTGTCGGGCGAGCATGGGATATGGATGAGGCATCTGTGAAATCTCTAGTGAAGAATCAAACTGGAACTGGAATTGTGAAGTTGAAGGAAGGAACAAAGATGCCAGAGCCGAAGAAGGAGCATCGAACTGGAATGGCATTGAATTGTGAGGAGGCGCCCTTGGATGTGGACGTGAAGAATGGAGGACGTGTTGTGGTTCTGAACACTAAGAATCTGCCACTCGTCGGGGAGGTAGGACTAGGAGCCGATCTAGTCCGATTGGATGGTAGTGCGATGTGTTCGCCTGGATTCTCATGTGATTCAGCGCTACAAGTGACATACATCGTGAAAGGGAGTGGAAGAGCGGAAGTTGTAGGAGTGGACGGGAAGAAAGTTTTGGAGACAAGAGTAAAAGCTGGAAATCTGTTCATAGTACCAAGGTTCTTCGTAGTATCGAAGATTGGAGATCCCGAAGGAATGGAGTGGTTCTCCATTATCAGCACTCCCAATCCTGTTTTCACTCACTTGGCTGGTAGCATCGGTGTTTGGAAGTCTCTTTCACCGGAAGTTATTCAAGCAGCTTTCAACGTCGATGCTGATTTGGTGAAGAACTTCTCTTCGAAGAGAGCTTCTGATGCGATCTTCTTCCCTCCAAATTAG

mRNA sequence

ATGGATATCGATTTGACTCCTCAATTGCCCAAGAAGATCTATGGTGGTGATGGAGGTTCCTATTATTCATGGTCTCCCAAAGAACTTCCCATGCTCCGTGAAGGAAACATTGGTGCATCCAAGCTCGCTCTTGAGAAGAATGGATTTGCTCTCCCTCGTTACTCTGATTCTGCTAAGGTTGCTTATGTTCTTCAAGGTAATGGAGTAGTCGGAATCATTCTGCCGGAAAAGGAGGAGAAGGTGGTCGCAATTAAGAAAGGAGATGCGATTGCTCTTCCATTCGGTGTGGTGACGTGGTGGTTCAATAAAGAAGCCACTGATTTGGTGGTTCTGTTCTTAGGCGATACATCAAAGGCTCACAAGTCAGGTGAGTTTACCGACTTCTTCCTAACCGGTGCCAATGGAATCTTCACCGGCTTCTCCACGGAGTTTGTCGGGCGAGCATGGGATATGGATGAGGCATCTGTGAAATCTCTAGTGAAGAATCAAACTGGAACTGGAATTGTGAAGTTGAAGGAAGGAACAAAGATGCCAGAGCCGAAGAAGGAGCATCGAACTGGAATGGCATTGAATTGTGAGGAGGCGCCCTTGGATGTGGACGTGAAGAATGGAGGACGTGTTGTGGTTCTGAACACTAAGAATCTGCCACTCGTCGGGGAGGTAGGACTAGGAGCCGATCTAGTCCGATTGGATGGTAGTGCGATGTGTTCGCCTGGATTCTCATGTGATTCAGCGCTACAAGTGACATACATCGTGAAAGGGAGTGGAAGAGCGGAAGTTGTAGGAGTGGACGGGAAGAAAGTTTTGGAGACAAGAGTAAAAGCTGGAAATCTGTTCATAGTACCAAGGTTCTTCGTAGTATCGAAGATTGGAGATCCCGAAGGAATGGAGTGGTTCTCCATTATCAGCACTCCCAATCCTGTTTTCACTCACTTGGCTGGTAGCATCGGTGTTTGGAAGTCTCTTTCACCGGAAGTTATTCAAGCAGCTTTCAACGTCGATGCTGATTTGGTGAAGAACTTCTCTTCGAAGAGAGCTTCTGATGCGATCTTCTTCCCTCCAAATTAG
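
The gene sequence and the mRNA sequence differ only by the spliced-out intron, so the intron can be located by comparing the two strings. The sketch below is a hedged illustration that assumes exactly one intron and no UTR differences. The helper name `find_single_intron` and the toy sequences are hypothetical, chosen only to keep the example short.

```python
def find_single_intron(gene, mrna):
    """Return (0-based start, length) of the one segment present in gene but not in mrna."""
    gap = len(gene) - len(mrna)
    if gap <= 0:
        raise ValueError("gene sequence must be longer than the mRNA")
    # Length of the longest common prefix of the two sequences.
    p = 0
    while p < len(mrna) and gene[p] == mrna[p]:
        p += 1
    # Removing `gap` bases after the shared prefix must reproduce the mRNA.
    if gene[:p] + gene[p + gap:] != mrna:
        raise ValueError("sequences do not differ by a single contiguous segment")
    return p, gap

toy_gene = "ATGGCC" + "GTAAGTTTCAG" + "GATTAA"  # exon 1 + intron + exon 2
toy_mrna = "ATGGCC" + "GATTAA"
print(find_single_intron(toy_gene, toy_mrna))  # (7, 11)
```

The reported start can sit a few bases to the right of the true splice site when the intron and the downstream exon begin with the same bases (here both start with `G`), so the length is more reliable than the exact boundary.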

Coding sequence (CDS)

ATGGATATCGATTTGACTCCTCAATTGCCCAAGAAGATCTATGGTGGTGATGGAGGTTCCTATTATTCATGGTCTCCCAAAGAACTTCCCATGCTCCGTGAAGGAAACATTGGTGCATCCAAGCTCGCTCTTGAGAAGAATGGATTTGCTCTCCCTCGTTACTCTGATTCTGCTAAGGTTGCTTATGTTCTTCAAGGTAATGGAGTAGTCGGAATCATTCTGCCGGAAAAGGAGGAGAAGGTGGTCGCAATTAAGAAAGGAGATGCGATTGCTCTTCCATTCGGTGTGGTGACGTGGTGGTTCAATAAAGAAGCCACTGATTTGGTGGTTCTGTTCTTAGGCGATACATCAAAGGCTCACAAGTCAGGTGAGTTTACCGACTTCTTCCTAACCGGTGCCAATGGAATCTTCACCGGCTTCTCCACGGAGTTTGTCGGGCGAGCATGGGATATGGATGAGGCATCTGTGAAATCTCTAGTGAAGAATCAAACTGGAACTGGAATTGTGAAGTTGAAGGAAGGAACAAAGATGCCAGAGCCGAAGAAGGAGCATCGAACTGGAATGGCATTGAATTGTGAGGAGGCGCCCTTGGATGTGGACGTGAAGAATGGAGGACGTGTTGTGGTTCTGAACACTAAGAATCTGCCACTCGTCGGGGAGGTAGGACTAGGAGCCGATCTAGTCCGATTGGATGGTAGTGCGATGTGTTCGCCTGGATTCTCATGTGATTCAGCGCTACAAGTGACATACATCGTGAAAGGGAGTGGAAGAGCGGAAGTTGTAGGAGTGGACGGGAAGAAAGTTTTGGAGACAAGAGTAAAAGCTGGAAATCTGTTCATAGTACCAAGGTTCTTCGTAGTATCGAAGATTGGAGATCCCGAAGGAATGGAGTGGTTCTCCATTATCAGCACTCCCAATCCTGTTTTCACTCACTTGGCTGGTAGCATCGGTGTTTGGAAGTCTCTTTCACCGGAAGTTATTCAAGCAGCTTTCAACGTCGATGCTGATTTGGTGAAGAACTTCTCTTCGAAGAGAGCTTCTGATGCGATCTTCTTCCCTCCAAATTAG

Protein sequence

MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKVAYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAHKSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEPKKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN
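
The protein sequence is the translation of the CDS listed above it. As a minimal sketch of that relationship, assuming the standard nuclear genetic code, the following Python translates a CDS and stops at the first stop codon. The name `translate` is illustrative, and the two short inputs are the first and last codons of the CDS in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGATATCGATTTGACTCCT"))  # MDIDLTP
print(translate("CCTCCAAATTAG"))           # PPN
```

Both fragments agree with the start (`MDIDLTP`) and the end (`PPN`) of the protein sequence shown above.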
Homology
BLAST of HG10022502 vs. NCBI nr
Match: XP_038897477.1 (glutelin type-D 1-like [Benincasa hispida])

HSP 1 Score: 704.1 bits (1816), Expect = 6.0e-199
Identity = 346/355 (97.46%), Positives = 351/355 (98.87%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           MD+DLTPQLPKKIYGGDGGSYY+WSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MDVDLTPQLPKKIYGGDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPEKEEKV+AIKKGDAIALPFGVVTWWFNKEA DLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPEKEEKVIAIKKGDAIALPFGVVTWWFNKEAIDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVK+LVKNQTGTGIVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKTLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K+EHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KQEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSA QVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSAFQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN
Sbjct: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 355
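
The percentages in each HSP summary line are the match counts divided by the alignment length. A small Python sketch of that arithmetic, using the counts from the hit above, is given here. The function name `hsp_percentages` is illustrative, and the parsing assumes the summary line contains its `matches/length` pairs in the order identity, then positives.

```python
import re

def hsp_percentages(summary):
    """Recompute percentages from the 'matches/length' pairs in an HSP summary line."""
    pairs = re.findall(r"(\d+)/(\d+)", summary)
    return [round(100 * int(n) / int(d), 2) for n, d in pairs]

line = "Identity = 346/355 (97.46%), Positives = 351/355 (98.87%), Query Frame = 0"
print(hsp_percentages(line))  # [97.46, 98.87]
```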

BLAST of HG10022502 vs. NCBI nr
Match: XP_008461502.1 (PREDICTED: glutelin type-B 5-like [Cucumis melo] >KAA0052863.1 glutelin type-B 5-like [Cucumis melo var. makuwa] >TYK04362.1 glutelin type-B 5-like [Cucumis melo var. makuwa])

HSP 1 Score: 701.0 bits (1808), Expect = 5.1e-198
Identity = 345/355 (97.18%), Positives = 352/355 (99.15%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQG+GV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWK+LSPEVIQAAFNV+ADLVKNFSSKR+SDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSDAIFFPPS 355

BLAST of HG10022502 vs. NCBI nr
Match: XP_004150394.1 (glutelin type-D 1 [Cucumis sativus] >KGN44409.1 hypothetical protein Csa_015780 [Cucumis sativus])

HSP 1 Score: 698.4 bits (1801), Expect = 3.3e-197
Identity = 343/355 (96.62%), Positives = 351/355 (98.87%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLPKKIYG DGGSYY+WSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWK+LSPEVI+AAFNV+ADLVKNFSSKR+SDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSDAIFFPPS 355

BLAST of HG10022502 vs. NCBI nr
Match: XP_023535755.1 (glutelin type-D 1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 696.0 bits (1795), Expect = 1.6e-196
Identity = 343/355 (96.62%), Positives = 349/355 (98.31%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KKIYG DGGSYYSWSPKELPMLREGNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLK+G KMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKDGVKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIV+GSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPS 355

BLAST of HG10022502 vs. NCBI nr
Match: KAG6592225.1 (Glutelin type-D 1, partial [Cucurbita argyrosperma subsp. sororia] >KAG7025082.1 Glutelin type-B 5 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 694.9 bits (1792), Expect = 3.6e-196
Identity = 342/355 (96.34%), Positives = 349/355 (98.31%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KKIYG DGGSYYSWSPKELPMLREGNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLK+G KMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKDGVKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIV+GSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           II+TPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPP+
Sbjct: 301 IITTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPS 355

BLAST of HG10022502 vs. ExPASy Swiss-Prot
Match: Q8GZP6 (11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occidentale OX=171929 PE=1 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.5e-27
Identity = 99/391 (25.32%), Positives = 168/391 (42.97%), Query Frame = 0

Query: 17  DGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKVAYVLQGNGVVGIILP- 76
           + G+  +W P      R   +   +  ++ NG  LP+YS++ ++ YV+QG G+ GI  P 
Sbjct: 42  EAGTVEAWDPNH-EQFRCAGVALVRHTIQPNGLLLPQYSNAPQLIYVVQGEGMTGISYPG 101

Query: 77  ---------------------EKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLG 136
                                ++ +K+   ++GD IA+P GV  W +N+  + +V + L 
Sbjct: 102 CPETYQAPQQGRQQGQSGRFQDRHQKIRRFRRGDIIAIPAGVAHWCYNEGNSPVVTVTLL 161

Query: 137 DTS-----------KAHKSGEFTDFF------LTGANGIFTGFSTEFVGRAWDMDEASVK 196
           D S           K H +G   D F       +    +F+GF TE +  A+ +DE  +K
Sbjct: 162 DVSNSQNQLDRTPRKFHLAGNPKDVFQQQQQHQSRGRNLFSGFDTELLAEAFQVDERLIK 221

Query: 197 SLVKNQTGTGIVKLKE---------------GTKMPEPKKE--HRTGMALN------C-- 256
            L       GIVK+K+               G++  E  ++   R G   N      C  
Sbjct: 222 QLKSEDNRGGIVKVKDDELRVIRPSRSQSERGSESEEESEDEKRRWGQRDNGIEETICTM 281

Query: 257 -------EEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSA 316
                  + A  D+     GR+  LN+ NLP++  + L  +   L  +A+  P ++ +S 
Sbjct: 282 RLKENINDPARADIYTPEVGRLTTLNSLNLPILKWLQLSVEKGVLYKNALVLPHWNLNSH 341

Query: 317 LQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTP 337
             + Y  KG G+ +VV   G +V +  V+ G + +VP+ F V K    E  EW S  +  
Sbjct: 342 -SIIYGCKGKGQVQVVDNFGNRVFDGEVREGQMLVVPQNFAVVKRAREERFEWISFKTND 401

BLAST of HG10022502 vs. ExPASy Swiss-Prot
Match: P14614 (Glutelin type-B 4 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUB4 PE=1 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 9.8e-27
Identity = 95/394 (24.11%), Positives = 168/394 (42.64%), Query Frame = 0

Query: 44  LEKNGFALPRYSDSAKVAYVLQGNGVVGIILP-------------------------EKE 103
           +E  G  +PRYS++  + Y++QG G +G+  P                         ++ 
Sbjct: 88  IEPQGLLVPRYSNTPGMVYIIQGRGSMGLTFPGCPATYQQQFQQFLPEGQSQSQKFRDEH 147

Query: 104 EKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAHKSGE--FTDFFLTGAN-- 163
           +K+   ++GD +ALP GV  W++N+    +V L++ D +      E    +F L G N  
Sbjct: 148 QKIHQFRQGDIVALPAGVAHWFYNEGDAPVVALYVFDLNNNANQLEPRQKEFLLAGNNNR 207

Query: 164 ---------------GIFTGFSTEFVGRAWDMDEASVKSLV-KNQTGTGIVKLKEGTKMP 223
                           IF+GF+ E +  A  ++    K L  +N     I+++K G K+ 
Sbjct: 208 EQQMYGRSIEQHSGQNIFSGFNNELLSEALGVNALVAKRLQGQNDQRGEIIRVKNGLKLL 267

Query: 224 EPKKEHRTGMALNCEEA------------------------------------PLDVDVK 283
            P    +   A   E+A                                    P   D  
Sbjct: 268 RPAFAQQQEQAQQQEQAQAQYQVQYSEEQQPSTRCNGLDENFCTIKARLNIENPSHADTY 327

Query: 284 N--GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRAEV 343
           N   GR+  LN++  P++  V L A  V L  +A+ SP ++  +A  + YIV+G  R +V
Sbjct: 328 NPRAGRITRLNSQKFPILNLVQLSATRVNLYQNAILSPFWNV-NAHSLVYIVQGHARVQV 387

Query: 344 VGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK 355
           V   GK V    ++ G L I+P+ +VV K  + EG ++ S  +  N + +HLAG   +++
Sbjct: 388 VSNLGKTVFNGVLRPGQLLIIPQHYVVLKKAEHEGCQYISFKTNANSMVSHLAGKNSIFR 447

BLAST of HG10022502 vs. ExPASy Swiss-Prot
Match: Q6ERU3 (Glutelin type-B 5 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUB5 PE=2 SV=1)

HSP 1 Score: 122.5 bits (306), Expect = 9.8e-27
Identity = 95/394 (24.11%), Positives = 168/394 (42.64%), Query Frame = 0

Query: 44  LEKNGFALPRYSDSAKVAYVLQGNGVVGIILP-------------------------EKE 103
           +E  G  +PRYS++  + Y++QG G +G+  P                         ++ 
Sbjct: 88  IEPQGLLVPRYSNTPGMVYIIQGRGSMGLTFPGCPATYQQQFQQFLPEGQSQSQKFRDEH 147

Query: 104 EKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAHKSGE--FTDFFLTGAN-- 163
           +K+   ++GD +ALP GV  W++N+    +V L++ D +      E    +F L G N  
Sbjct: 148 QKIHQFRQGDIVALPAGVAHWFYNEGDAPVVALYVFDLNNNANQLEPRQKEFLLAGNNNR 207

Query: 164 ---------------GIFTGFSTEFVGRAWDMDEASVKSLV-KNQTGTGIVKLKEGTKMP 223
                           IF+GF+ E +  A  ++    K L  +N     I+++K G K+ 
Sbjct: 208 EQQMYGRSIEQHSGQNIFSGFNNELLSEALGVNALVAKRLQGQNDQRGEIIRVKNGLKLL 267

Query: 224 EPKKEHRTGMALNCEEA------------------------------------PLDVDVK 283
            P    +   A   E+A                                    P   D  
Sbjct: 268 RPAFAQQQEQAQQQEQAQAQYQVQYSEEQQPSTRCNGLDENFCTIKARLNIENPSHADTY 327

Query: 284 N--GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRAEV 343
           N   GR+  LN++  P++  V L A  V L  +A+ SP ++  +A  + YIV+G  R +V
Sbjct: 328 NPRAGRITRLNSQKFPILNLVQLSATRVNLYQNAILSPFWNV-NAHSLVYIVQGHARVQV 387

Query: 344 VGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTHLAGSIGVWK 355
           V   GK V    ++ G L I+P+ +VV K  + EG ++ S  +  N + +HLAG   +++
Sbjct: 388 VSNLGKTVFNGVLRPGQLLIIPQHYVVLKKAEHEGCQYISFKTNANSMVSHLAGKNSIFR 447

BLAST of HG10022502 vs. ExPASy Swiss-Prot
Match: P07730 (Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 2.8e-26
Identity = 92/403 (22.83%), Positives = 166/403 (41.19%), Query Frame = 0

Query: 37  IGASKLALEKNGFALPRYSDSAKVAYVLQGNGVVGIILP--------------------- 96
           +   +  +E  G  LP Y++ A + Y++QG G+ G   P                     
Sbjct: 82  VSVVRRVIEPRGLLLPHYTNGASLVYIIQGRGITGPTFPGCPETYQQQFQQSGQAQLTES 141

Query: 97  --------EKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKA--HKSGEF 156
                   ++ +K+   ++GD IALP GV  W +N     +V +++ D +          
Sbjct: 142 QSQSHKFKDEHQKIHRFRQGDVIALPAGVAHWCYNDGEVPVVAIYVTDINNGANQLDPRQ 201

Query: 157 TDFFLTG---------------ANGIFTGFSTEFVGRAWDMDEASVKSL-VKNQTGTGIV 216
            DF L G               +  IF+GFSTE +  A+ +     + L  +N     IV
Sbjct: 202 RDFLLAGNKRNPQAYRREVEEWSQNIFSGFSTELLSEAFGISNQVARQLQCQNDQRGEIV 261

Query: 217 KLKEGTKMPEP------------------------KKEHRTGMALNCEEA---------- 276
           +++ G  + +P                        + ++ +G     +E           
Sbjct: 262 RVERGLSLLQPYASLQEQEQGQMQSREHYQEGGYQQSQYGSGCPNGLDETFCTMRVRQNI 321

Query: 277 --PLDVDVKN--GGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYI 336
             P   D  N   GRV  LN++N P++  V + A  V L  +A+ SP ++  +A  + YI
Sbjct: 322 DNPNRADTYNPRAGRVTNLNSQNFPILNLVQMSAVKVNLYQNALLSPFWNI-NAHSIVYI 381

Query: 337 VKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTH 355
            +G  + +VV  +GK V    ++ G L IVP+ +VV K    EG  + +  + PN + +H
Sbjct: 382 TQGRAQVQVVNNNGKTVFNGELRRGQLLIVPQHYVVVKKAQREGCAYIAFKTNPNSMVSH 441

BLAST of HG10022502 vs. ExPASy Swiss-Prot
Match: Q9XHP0 (11S globulin seed storage protein 2 OS=Sesamum indicum OX=4182 PE=1 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 8.3e-26
Identity = 89/408 (21.81%), Positives = 175/408 (42.89%), Query Frame = 0

Query: 17  DGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKVAYVLQGNGVVGIILP- 76
           +GG+   W  ++    +   I A +  +  NG +LP Y  S ++ Y+ +G G++ I++P 
Sbjct: 51  EGGTTELWDERQ-EQFQCAGIVAMRSTIRPNGLSLPNYHPSPRLVYIERGQGLISIMVPG 110

Query: 77  -----------------------------EKEEKVVAIKKGDAIALPFGVVTWWFNKEAT 136
                                        +  +KV  +++GD +A+P G   W +N  + 
Sbjct: 111 CAETYQVHRSQRTMERTEASEQQDRGSVRDLHQKVHRLRQGDIVAIPSGAAHWCYNDGSE 170

Query: 137 DLVVLFLGDTSKAHKSGE----FTDFFLTGA---------------NGIFTGFSTEFVGR 196
           DLV + + D +  H S +    F  F+L G                + IF  F  E +  
Sbjct: 171 DLVAVSINDVN--HLSNQLDQKFRAFYLAGGVPRSGEQEQQARQTFHNIFRAFDAELLSE 230

Query: 197 AWDMDEASVKSLVKNQTGTG-IVKLKEGTKMPEP-----KKEHRTGMALNCEEAPL---- 256
           A+++ + +++ +   +   G IV  +E      P     ++EHR     N  E       
Sbjct: 231 AFNVPQETIRRMQSEEEERGLIVMARERMTFVRPDEEEGEQEHRGRQLDNGLEETFCTMK 290

Query: 257 -----------DVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSAL 316
                      D+  +  GRV V++   LP++  + L A+   L  +A+ SP +S  +  
Sbjct: 291 FRTNVESRREADIFSRQAGRVHVVDRNKLPILKYMDLSAEKGNLYSNALVSPDWSM-TGH 350

Query: 317 QVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPN 355
            + Y+ +G  + +VV  +G+ ++  RV  G +F+VP+++  +      G EW +  +T +
Sbjct: 351 TIVYVTRGDAQVQVVDHNGQALMNDRVNQGEMFVVPQYYTSTARAGNNGFEWVAFKTTGS 410

BLAST of HG10022502 vs. ExPASy TrEMBL
Match: A0A5A7UAB0 (Glutelin type-B 5-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold675G00320 PE=3 SV=1)

HSP 1 Score: 701.0 bits (1808), Expect = 2.5e-198
Identity = 345/355 (97.18%), Positives = 352/355 (99.15%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQG+GV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWK+LSPEVIQAAFNV+ADLVKNFSSKR+SDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSDAIFFPPS 355

BLAST of HG10022502 vs. ExPASy TrEMBL
Match: A0A1S3CG59 (glutelin type-B 5-like OS=Cucumis melo OX=3656 GN=LOC103500083 PE=3 SV=1)

HSP 1 Score: 701.0 bits (1808), Expect = 2.5e-198
Identity = 345/355 (97.18%), Positives = 352/355 (99.15%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQG+GV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGSGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWK+LSPEVIQAAFNV+ADLVKNFSSKR+SDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIQAAFNVEADLVKNFSSKRSSDAIFFPPS 355

BLAST of HG10022502 vs. ExPASy TrEMBL
Match: A0A0A0K666 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G281380 PE=3 SV=1)

HSP 1 Score: 698.4 bits (1801), Expect = 1.6e-197
Identity = 343/355 (96.62%), Positives = 351/355 (98.87%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQLPKKIYG DGGSYY+WSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLPKKIYGSDGGSYYAWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWK+LSPEVI+AAFNV+ADLVKNFSSKR+SDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKALSPEVIEAAFNVEADLVKNFSSKRSSDAIFFPPS 355

BLAST of HG10022502 vs. ExPASy TrEMBL
Match: A0A6J1IH21 (glutelin type-D 1-like OS=Cucurbita maxima OX=3661 GN=LOC111477153 PE=3 SV=1)

HSP 1 Score: 693.0 bits (1787), Expect = 6.7e-196
Identity = 342/355 (96.34%), Positives = 349/355 (98.31%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M+IDLTPQL KKIYG DGGSYYSWSPKELPMLREGNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEIDLTPQLAKKIYGCDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVK+QTGTGIVKLK+G KMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKSQTGTGIVKLKDGVKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIV+GSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPP+
Sbjct: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPS 355

BLAST of HG10022502 vs. ExPASy TrEMBL
Match: A0A6J1EX25 (glutelin type-D 1-like OS=Cucurbita moschata OX=3662 GN=LOC111439037 PE=3 SV=1)

HSP 1 Score: 690.3 bits (1780), Expect = 4.3e-195
Identity = 340/355 (95.77%), Positives = 348/355 (98.03%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M++DLTPQL KKIY  DGGSYYSWSPKELPMLREGNIGA+KLALEKNGFALPRYSDSAKV
Sbjct: 1   MEMDLTPQLAKKIYVSDGGSYYSWSPKELPMLREGNIGAAKLALEKNGFALPRYSDSAKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQGNGV GIILPE EEKV+AIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH
Sbjct: 61  AYVLQGNGVAGIILPESEEKVIAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLK+G KMPEP
Sbjct: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKDGVKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KKEHR GMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF
Sbjct: 181 KKEHRNGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIV+GSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS
Sbjct: 241 SCDSALQVTYIVRGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           II+TPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPP+
Sbjct: 301 IITTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPS 355

BLAST of HG10022502 vs. TAIR 10
Match: AT2G28680.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 570.9 bits (1470), Expect = 7.4e-163
Identity = 270/355 (76.06%), Positives = 310/355 (87.32%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M++DL+P+LPKK+YGGDGGSY++W P+ELPMLR+GNIGASKLALEK G ALPRYSDS KV
Sbjct: 1   MELDLSPRLPKKVYGGDGGSYFAWCPEELPMLRDGNIGASKLALEKYGLALPRYSDSPKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQG G  GI+LPEKEEKV+AIKKGD+IALPFGVVTWWFN E T+LVVLFLG+T K H
Sbjct: 61  AYVLQGAGTAGIVLPEKEEKVIAIKKGDSIALPFGVVTWWFNNEDTELVVLFLGETHKGH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           K+G+FTDF+LTG+NGIFTGFSTEFVGRAWD+DE +VK LV +QTG GIVK+    KMPEP
Sbjct: 121 KAGQFTDFYLTGSNGIFTGFSTEFVGRAWDLDETTVKKLVGSQTGNGIVKVDASLKMPEP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           KK  R G  LNC EAPLDVD+K+GGRVVVLNTKNLPLVGEVG GADLVR+DG +MCSPGF
Sbjct: 181 KKGDRKGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDGHSMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIV GSGR ++VG DGK+VLET VKAG LFIVPRFFVVSKI D +G+ WFS
Sbjct: 241 SCDSALQVTYIVGGSGRVQIVGADGKRVLETHVKAGVLFIVPRFFVVSKIADSDGLSWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           I++TP+P+FTHLAG   VWK+LSPEV+QAAF VD ++ K F SKR SDAIFF P+
Sbjct: 301 IVTTPDPIFTHLAGRTSVWKALSPEVLQAAFKVDPEVEKAFRSKRTSDAIFFSPS 355

BLAST of HG10022502 vs. TAIR 10
Match: AT1G07750.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 569.3 bits (1466), Expect = 2.1e-162
Identity = 266/355 (74.93%), Positives = 314/355 (88.45%), Query Frame = 0

Query: 1   MDIDLTPQLPKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKV 60
           M++DLTP+LPKK+YGGDGGSY +W P+ELPML++GNIGA+KLALEKNGFA+PRYSDS+KV
Sbjct: 1   MELDLTPKLPKKVYGGDGGSYSAWCPEELPMLKQGNIGAAKLALEKNGFAVPRYSDSSKV 60

Query: 61  AYVLQGNGVVGIILPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGDTSKAH 120
           AYVLQG+G  GI+LPEKEEKV+AIK+GD+IALPFGVVTWWFN E  +LV+LFLG+T K H
Sbjct: 61  AYVLQGSGTAGIVLPEKEEKVIAIKQGDSIALPFGVVTWWFNNEDPELVILFLGETHKGH 120

Query: 121 KSGEFTDFFLTGANGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTGIVKLKEGTKMPEP 180
           K+G+FT+F+LTG NGIFTGFSTEFVGRAWD+DE +VK LV +QTG GIVKL  G KMP+P
Sbjct: 121 KAGQFTEFYLTGTNGIFTGFSTEFVGRAWDLDENTVKKLVGSQTGNGIVKLDAGFKMPQP 180

Query: 181 KKEHRTGMALNCEEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGF 240
           K+E+R G  LNC EAPLDVD+K+GGRVVVLNTKNLPLVGEVG GADLVR+D  +MCSPGF
Sbjct: 181 KEENRAGFVLNCLEAPLDVDIKDGGRVVVLNTKNLPLVGEVGFGADLVRIDAHSMCSPGF 240

Query: 241 SCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFS 300
           SCDSALQVTYIV GSGR +VVG DGK+VLET +KAG+LFIVPRFFVVSKI D +GM WFS
Sbjct: 241 SCDSALQVTYIVGGSGRVQVVGGDGKRVLETHIKAGSLFIVPRFFVVSKIADADGMSWFS 300

Query: 301 IISTPNPVFTHLAGSIGVWKSLSPEVIQAAFNVDADLVKNFSSKRASDAIFFPPN 356
           I++TP+P+FTHLAG+  VWKSLSPEV+QAAF V  ++ K+F S R S AIFFPP+
Sbjct: 301 IVTTPDPIFTHLAGNTSVWKSLSPEVLQAAFKVAPEVEKSFRSTRTSSAIFFPPS 355

BLAST of HG10022502 vs. TAIR 10
Match: AT1G03890.1 (RmlC-like cupins superfamily protein)

HSP 1 Score: 113.2 bits (282), Expect = 4.2e-25
Identity = 88/385 (22.86%), Positives = 159/385 (41.30%), Query Frame = 0

Query: 30  PMLREGNIGASKLALEKNGFALPRYSDSAKVAYVLQGNGVVGII---------------- 89
           P LR   +  +++ L+ N   LP +     +AYV+QG GV+G I                
Sbjct: 65  PELRCAGVTVARITLQPNSIFLPAFFSPPALAYVVQGEGVMGTIASGCPETFAEVEGSSG 124

Query: 90  ----------LPEKEEKVVAIKKGDAIALPFGVVTWWFNKEATDLVVLFLGD-TSKAHKS 149
                       +  +K+   ++GD  A   GV  WW+N+  +D V++ + D T++ ++ 
Sbjct: 125 RGGGGDPGRRFEDMHQKLENFRRGDVFASLAGVSQWWYNRGDSDAVIVIVLDVTNRENQL 184

Query: 150 GEFTDFF-LTGA--------------NGIFTGFSTEFVGRAWDMDEASVKSLVKNQTGTG 209
            +    F L G+              N  F+GF    +  A+ ++  + K L   +   G
Sbjct: 185 DQVPRMFQLAGSRTQEEEQPLTWPSGNNAFSGFDPNIIAEAFKINIETAKQLQNQKDNRG 244

Query: 210 IVKLKEGT---KMPEPKKEHRTGMALNCEEAPLDVDV--------------KNGGRVVVL 269
            +    G     +P P++  + G+A   EE      +                 GR+  L
Sbjct: 245 NIIRANGPLHFVIPPPREWQQDGIANGIEETYCTAKIHENIDDPERSDHFSTRAGRISTL 304

Query: 270 NTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSALQVTYIVKGSGRAEVVGVDGKKVLE 329
           N+ NLP++  V L A    L    M  P ++  +A  V Y+  G  + +VV  +G+ V  
Sbjct: 305 NSLNLPVLRLVRLNALRGYLYSGGMVLPQWTA-NAHTVLYVTGGQAKIQVVDDNGQSVFN 364

Query: 330 TRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTPNPVFTHLAGSIGVWKSLSPEVIQAA 356
            +V  G + ++P+ F VSK     G EW S  +  N     L+G     +++  +VI+A+
Sbjct: 365 EQVGQGQIIVIPQGFAVSKTAGETGFEWISFKTNDNAYINTLSGQTSYLRAVPVDVIKAS 424

BLAST of HG10022502 vs. TAIR 10
Match: AT1G03880.1 (cruciferin 2)

HSP 1 Score: 108.2 bits (269), Expect = 1.4e-23
Identity = 93/394 (23.60%), Positives = 158/394 (40.10%), Query Frame = 0

Query: 10  PKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKVAYVLQGNGV 69
           P +I   +GG    W     P LR       +  +E  G  LP + ++ K+ +V+ G G+
Sbjct: 40  PSQIIKSEGGRIEVWD-HHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVVHGRGL 99

Query: 70  VGIILP-------------------------EKEEKVVAIKKGDAIALPFGVVTWWFNKE 129
           +G ++P                         +  +KV  ++ GD IA P GV  W++N  
Sbjct: 100 MGRVIPGCAETFMESPVFGEGQGQGQSQGFRDMHQKVEHLRCGDTIATPSGVAQWFYNNG 159

Query: 130 ATDLVVLFLGD--TSKAHKSGEFTDFFLTG----------------ANGIFTGFSTEFVG 189
              L+++   D  +++         F + G                 N IF GF+ E + 
Sbjct: 160 NEPLILVAAADLASNQNQLDRNLRPFLIAGNNPQGQEWLQGRKQQKQNNIFNGFAPEILA 219

Query: 190 RAWDMDEASVKSLVKNQTGTG-IVKLK-------------EGTKMPEPKKE--HRTGMAL 249
           +A+ ++  + + L   Q   G IVK+              EG + P         T   +
Sbjct: 220 QAFKINVETAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGGQQPHEIANGLEETLCTM 279

Query: 250 NCEE---APLDVDV--KNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSPGFSCDSA 309
            C E    P D DV   + G +  LN+ NLP++  + L A    +  +AM  P ++  +A
Sbjct: 280 RCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLLRLSALRGSIRKNAMVLPQWNV-NA 339

Query: 310 LQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEWFSIISTP 340
               Y+  G    ++V  +G++V +  + +G L +VP+ F V K    E  EW    +  
Sbjct: 340 NAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGFSVMKHAIGEQFEWIEFKTNE 399

BLAST of HG10022502 vs. TAIR 10
Match: AT5G44120.3 (RmlC-like cupins superfamily protein)

HSP 1 Score: 96.7 bits (239), Expect = 4.1e-20
Identity = 88/398 (22.11%), Positives = 159/398 (39.95%), Query Frame = 0

Query: 10  PKKIYGGDGGSYYSWSPKELPMLREGNIGASKLALEKNGFALPRYSDSAKVAYVLQGNGV 69
           P  +   + G    W     P LR   +  ++  +E  G  LP + ++AK+++V +G G+
Sbjct: 46  PSHVLKSEAGRIEVWD-HHAPQLRCSGVSFARYIIESKGLYLPSFFNTAKLSFVAKGRGL 105

Query: 70  VGIILP--------------------------EKEEKVVAIKKGDAIALPFGVVTWWFNK 129
           +G ++P                          +  +KV  I+ GD IA   GV  W++N 
Sbjct: 106 MGKVIPGCAETFQDSSEFQPRFEGQGQSQRFRDMHQKVEHIRSGDTIATTPGVAQWFYND 165

Query: 130 EATDLVVLFLGDTSKAHKSGEFT--DFFLTGAN----------------GIFTGFSTEFV 189
               LV++ + D +      +     F+L G N                 IF GF  E +
Sbjct: 166 GQEPLVIVSVFDLASHQNQLDRNPRPFYLAGNNPQGQVWLQGREQQPQKNIFNGFGPEVI 225

Query: 190 GRAWDMDEASVKSL----------VKNQTGTGIVKLKEGTKMPEPKKE-------HRTGM 249
            +A  +D  + + L          V+ Q   G+++     + P+ ++E       H  G+
Sbjct: 226 AQALKIDLQTAQQLQNQDDNRGNIVRVQGPFGVIRPPLRGQRPQEEEEEEGRHGRHGNGL 285

Query: 250 -----ALNC-----EEAPLDVDVKNGGRVVVLNTKNLPLVGEVGLGADLVRLDGSAMCSP 309
                +  C     + +  DV     G +  LN+ +LP++  + L A    +  +AM  P
Sbjct: 286 EETICSARCTDNLDDPSRADVYKPQLGYISTLNSYDLPILRFIRLSALRGSIRQNAMVLP 345

Query: 310 GFSCDSALQVTYIVKGSGRAEVVGVDGKKVLETRVKAGNLFIVPRFFVVSKIGDPEGMEW 337
            ++  +A  + Y+  G  + ++V  +G +V + +V  G L  VP+ F V K       +W
Sbjct: 346 QWNA-NANAILYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQGFSVVKRATSNRFQW 405

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038897477.1  6.0e-199  97.46         glutelin type-D 1-like [Benincasa hispida]
XP_008461502.1  5.1e-198  97.18         PREDICTED: glutelin type-B 5-like [Cucumis melo] >KAA0052863.1 glutelin type-B 5...
XP_004150394.1  3.3e-197  96.62         glutelin type-D 1 [Cucumis sativus] >KGN44409.1 hypothetical protein Csa_015780 ...
XP_023535755.1  1.6e-196  96.62         glutelin type-D 1-like [Cucurbita pepo subsp. pepo]
KAG6592225.1    3.6e-196  96.34         Glutelin type-D 1, partial [Cucurbita argyrosperma subsp. sororia] >KAG7025082.1...

Match Name  E-value  Identity (%)  Description
Q8GZP6      1.5e-27  25.32         11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occident...
P14614      9.8e-27  24.11         Glutelin type-B 4 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUB4 PE=1 SV=1
Q6ERU3      9.8e-27  24.11         Glutelin type-B 5 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUB5 PE=2 SV=1
P07730      2.8e-26  22.83         Glutelin type-A 2 OS=Oryza sativa subsp. japonica OX=39947 GN=GLUA2 PE=1 SV=1
Q9XHP0      8.3e-26  21.81         11S globulin seed storage protein 2 OS=Sesamum indicum OX=4182 PE=1 SV=1

Match Name  E-value   Identity (%)  Description
A0A5A7UAB0  2.5e-198  97.18         Glutelin type-B 5-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold6...
A0A1S3CG59  2.5e-198  97.18         glutelin type-B 5-like OS=Cucumis melo OX=3656 GN=LOC103500083 PE=3 SV=1
A0A0A0K666  1.6e-197  96.62         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G281380 PE=3 SV=1
A0A6J1IH21  6.7e-196  96.34         glutelin type-D 1-like OS=Cucurbita maxima OX=3661 GN=LOC111477153 PE=3 SV=1
A0A6J1EX25  4.3e-195  95.77         glutelin type-D 1-like OS=Cucurbita moschata OX=3662 GN=LOC111439037 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT2G28680.1  7.4e-163  76.06         RmlC-like cupins superfamily protein
AT1G07750.1  2.1e-162  74.93         RmlC-like cupins superfamily protein
AT1G03890.1  4.2e-25   22.86         RmlC-like cupins superfamily protein
AT1G03880.1  1.4e-23   23.60         cruciferin 2
AT5G44120.3  4.1e-20   22.11         RmlC-like cupins superfamily protein
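
The TAIR 10 hits fall into two clear groups: two close homologs with E-values near 1e-162 and three distant ones above 1e-26. A minimal sketch of separating them by E-value is given below; the function name `strong_hits` and the 1e-100 cutoff are illustrative assumptions, not values taken from the database.

```python
# (match name, E-value, percent identity) copied from the TAIR 10 summary table above.
tair_hits = [
    ("AT2G28680.1", 7.4e-163, 76.06),
    ("AT1G07750.1", 2.1e-162, 74.93),
    ("AT1G03890.1", 4.2e-25, 22.86),
    ("AT1G03880.1", 1.4e-23, 23.60),
    ("AT5G44120.3", 4.1e-20, 22.11),
]


def strong_hits(hits, max_evalue=1e-100):
    """Return names of hits whose E-value is at or below max_evalue.

    The default cutoff is an arbitrary illustrative threshold.
    """
    return [name for name, evalue, _identity in hits if evalue <= max_evalue]


print(strong_hits(tair_hits))
```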
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                     Source       Source Term     Source Description            Alignment
IPR006044  11-S seed storage protein, plant    PRINTS       PR00439         11SGLOBULIN                   coord: 258..274; score: 26.96
IPR006044  11-S seed storage protein, plant    PRINTS       PR00439         11SGLOBULIN                   coord: 276..291; score: 37.82
IPR006044  11-S seed storage protein, plant    PRINTS       PR00439         11SGLOBULIN                   coord: 316..333; score: 23.21
IPR006044  11-S seed storage protein, plant    PRINTS       PR00439         11SGLOBULIN                   coord: 211..231; score: 29.85
IPR006044  11-S seed storage protein, plant    PRINTS       PR00439         11SGLOBULIN                   coord: 294..312; score: 25.22
IPR006045  Cupin 1                             SMART        SM00835         Cupin_1_3                     coord: 197..339; e-value: 3.9E-15; score: 66.3
IPR006045  Cupin 1                             SMART        SM00835         Cupin_1_3                     coord: 3..157; e-value: 2.8E-34; score: 129.9
IPR006045  Cupin 1                             PFAM         PF00190         Cupin_1                       coord: 199..336; e-value: 2.0E-21; score: 76.1
IPR006045  Cupin 1                             PFAM         PF00190         Cupin_1                       coord: 9..155; e-value: 1.2E-28; score: 99.6
IPR014710  RmlC-like jelly roll fold           GENE3D       2.60.120.10     Jelly Rolls                   coord: 3..194; e-value: 5.3E-36; score: 126.2
IPR014710  RmlC-like jelly roll fold           GENE3D       2.60.120.10     Jelly Rolls                   coord: 195..355; e-value: 8.6E-47; score: 160.5
None       No IPR available                    PANTHER      PTHR31189       OS03G0336100 PROTEIN-RELATED  coord: 2..346
None       No IPR available                    PANTHER      PTHR31189:SF51  SUBFAMILY NOT NAMED           coord: 2..346
None       No IPR available                    CDD          cd02242         cupin_11S_legumin_N           coord: 4..173; e-value: 3.78187E-72; score: 221.305
None       No IPR available                    CDD          cd02243         cupin_11S_legumin_C           coord: 198..353; e-value: 2.1114E-72; score: 220.038
IPR011051  RmlC-like cupin domain superfamily  SUPERFAMILY  51182           RmlC-like cupins              coord: 10..345
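
The two Pfam Cupin_1 (PF00190) hits cover separate N-terminal and C-terminal stretches of the protein, which matches the paired CDD hits cupin_11S_legumin_N and cupin_11S_legumin_C. A minimal sketch that checks this from the listed coordinates follows; it assumes the coordinates are 1-based and inclusive, and the helper names are hypothetical.

```python
# Pfam PF00190 (Cupin_1) hit coordinates copied from the InterPro table above.
pfam_cupin_hits = [(199, 336), (9, 155)]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def overlaps(a, b) -> bool:
    """True if two inclusive (start, end) ranges share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


hits = sorted(pfam_cupin_hits)  # order by start coordinate
lengths = [span_length(start, end) for start, end in hits]
domains_overlap = overlaps(hits[0], hits[1])

print("Hit lengths:", lengths)
print("Hits overlap:", domains_overlap)
```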

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10022502.1  HG10022502.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0045735      nutrient reservoir activity