HG10012034 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012034
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: 11S globulin subunit beta-like
Location: Chr01: 16787017 .. 16789112 (+)
RNA-Seq Expression: HG10012034
Synteny: HG10012034
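
The Location field above can be turned into a genomic span length. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page) and that the field always follows the `Chr: start .. end (strand)` layout. The function name `parse_location` is illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr01: 16787017 .. 16789112 (+)'.

    Returns (chromosome, start, end, strand). The layout is an assumption
    based on the single Location field shown in this record.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr01: 16787017 .. 16789112 (+)")

# Inclusive coordinates: both end points count towards the length.
span = end - start + 1
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the span covers exons and introns together, so it is expected to be longer than the coding sequence.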
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCAAACCCTCTTTTACTCTCTCTTTCCCTTTGCTTCTTGGTTCTCTTCAATGCCTGCCTAGCAATCAACGAGAACCTTCACGATGTCTCCCGACACTTCAGCGAGGGCCAGAGTCGGTATCGTGAGTGTCGTCTGGACAGGCTCGAAGCCCTTGAACCCTCCCATCGTATCGACGCTGAAGGTGGTGTGATCGAGATGTGGGATCCTAGCCACGAGATGTTTCGATGCGCCGGTGTGGCTATCCAAAGATATATTATCGATCCGAATGGCCTTCTTCTCCCTCAGTACACCAATGCCCCACGACTTATCTACATTGAGAGAGGTACCAACTTTTATTCTTTTGCCTATCCTTTTTCTGGTTGAGTGTTAGACTATTTAATATTAAATTTACCTTCACCCATCAACTTGAATTTTTTGAATTAGTCCGTGATTTAAGATGATATCAGAGCAGATGGTTTAGGAGGTCCTATGTTCAAACTCTGCATTTTCATCCCAATTAATATTTGATTTCCACTTATTAGGTCTTCTACTTATTTCAAGCCCACAAGTGAGGGAGCGTGATAGATGATATAATGTTAAATTTACCTTTATTCATCAGCTTAAACTTTTGGGTCAATTGATGATTTAAGATTGAGCTAAAAGTTGATAATTAATGATGTGATGAATTTTCTGGAATAGGGAGAGGGTTCGAGGGAGTTGTACTCCCAGGTTGCCCTGAAACGTACGAAGAGTCTCAACAATCGGCTGGGGAGTTCCGAGACCGACATCAAAAGATTCGCCATGTACGTGCTGGCGACCTCTTCGCTGTGCCTGCTGGTTCTGCACATTGGACCTACAATGATGGCAATGAGAGATTGATTGCAGTTGTTCTTCTTGACGTTAGTAACCACGCCAACCAACTTGACTTCCATCCTAGGGTAAGTTAGATTTGAATTCAGGTTGATATAATGTCTTAATTAATTAAATTAAATTTAGAATATTGGTACGATTGTGGATGTGTTACAATAGTAAATTTAATTTCTTTTCTGTCAACTATGTAGGCCTTCTACTTGGCCGGGAACCCAGAAGAGGAGTTTCCAGAGTGGAGGTCAGATTGGAAGCGAGAGCAGGGACGACACGGTAGTCGTAAGGAGGGATCAAGTAACAAGAACAACATCTTCTATGCCTTCGACGACAGAGTTCTTGCAGAAATTCTCAACATAAATACGGAGACAGCGAGGAAGCTTCGCGGAGAAGATGACTTTAGGCGCAACATCATAAAGGTTGAGGGACAACTTGAGGTGATTAGGCCACCAAGATCGCGAGGAGGAGGAAGAGGAGAGGAGCGAGAATGGGAAGAGGAACAAGAAGAGGAGATGGAGAGACAACATGAGCGCCACCAACGTCGTCGATGGACGGACAATGGTTTGGGTGAAACTATTTGCTCTATGAGAATGAAGGAGAATATTGGTGATGCTTCACGCGCTGATATACACACACCTGAAGCTGGTCGTCTTGCCACCACCAACAGCCATCGCTTCCCCATCCTTCGCTGGCTTCAACTTAGTGCCGAGCGAGGTGTTCTTTACAGAGTAAGGAAGCATGTCACATAATTTAGGGCCTGTTTGGAATAAGTTTCTAAATGGCTTAAAAAGTACTTTTAAGCAGGAAAATTGACATAGTTTGTGGTATTACTGCAGAATGCAATGTATGTTCCACACTGGAACCAAAACGCACATAGCGTAATATTCGTAACAAGAGGGCGAGCGAGAGTACAAGTAGTAGACCACAAAGGCCAAACCGTATTCGACGGCGAGCTACAACAACGACAGGTTCTAGTGGTTCCACAAAACTTCGCCATAGTGAAAAAGGCAAGCGAGGAAGGGTTCGAGTGGGTTTCATTCAAGACCAACGATAATGCCATGATTAACACACTGGCCGGTCGCACCTCCGCCATGAGAGCATTCCCAGTTCAAGTCATTGCCAGTGCTTATAGAATGTCGACTGAAGAGGCTCGA
AGGCTCAAATTCAACAGAGAAGAGACCACTTTACTTCCTCCGAGCATGGCCTCATCGGCGCGCAGGGCCAACTACGTCCTAGGGGATGTAATGTAA

mRNA sequence

ATGTCAAACCCTCTTTTACTCTCTCTTTCCCTTTGCTTCTTGGTTCTCTTCAATGCCTGCCTAGCAATCAACGAGAACCTTCACGATGTCTCCCGACACTTCAGCGAGGGCCAGAGTCGGTATCGTGAGTGTCGTCTGGACAGGCTCGAAGCCCTTGAACCCTCCCATCGTATCGACGCTGAAGGTGGTGTGATCGAGATGTGGGATCCTAGCCACGAGATGTTTCGATGCGCCGGTGTGGCTATCCAAAGATATATTATCGATCCGAATGGCCTTCTTCTCCCTCAGTACACCAATGCCCCACGACTTATCTACATTGAGAGAGGGAGAGGGTTCGAGGGAGTTGTACTCCCAGGTTGCCCTGAAACGTACGAAGAGTCTCAACAATCGGCTGGGGAGTTCCGAGACCGACATCAAAAGATTCGCCATGTACGTGCTGGCGACCTCTTCGCTGTGCCTGCTGGTTCTGCACATTGGACCTACAATGATGGCAATGAGAGATTGATTGCAGTTGTTCTTCTTGACGTTAGTAACCACGCCAACCAACTTGACTTCCATCCTAGGGCCTTCTACTTGGCCGGGAACCCAGAAGAGGAGTTTCCAGAGTGGAGGTCAGATTGGAAGCGAGAGCAGGGACGACACGGTAGTCGTAAGGAGGGATCAAGTAACAAGAACAACATCTTCTATGCCTTCGACGACAGAGTTCTTGCAGAAATTCTCAACATAAATACGGAGACAGCGAGGAAGCTTCGCGGAGAAGATGACTTTAGGCGCAACATCATAAAGGTTGAGGGACAACTTGAGGTGATTAGGCCACCAAGATCGCGAGGAGGAGGAAGAGGAGAGGAGCGAGAATGGGAAGAGGAACAAGAAGAGGAGATGGAGAGACAACATGAGCGCCACCAACGTCGTCGATGGACGGACAATGGTTTGGGTGAAACTATTTGCTCTATGAGAATGAAGGAGAATATTGGTGATGCTTCACGCGCTGATATACACACACCTGAAGCTGGTCGTCTTGCCACCACCAACAGCCATCGCTTCCCCATCCTTCGCTGGCTTCAACTTAGTGCCGAGCGAGGTGTTCTTTACAGAAATGCAATGTATGTTCCACACTGGAACCAAAACGCACATAGCGTAATATTCGTAACAAGAGGGCGAGCGAGAGTACAAGTAGTAGACCACAAAGGCCAAACCGTATTCGACGGCGAGCTACAACAACGACAGGTTCTAGTGGTTCCACAAAACTTCGCCATAGTGAAAAAGGCAAGCGAGGAAGGGTTCGAGTGGGTTTCATTCAAGACCAACGATAATGCCATGATTAACACACTGGCCGGTCGCACCTCCGCCATGAGAGCATTCCCAGTTCAAGTCATTGCCAGTGCTTATAGAATGTCGACTGAAGAGGCTCGAAGGCTCAAATTCAACAGAGAAGAGACCACTTTACTTCCTCCGAGCATGGCCTCATCGGCGCGCAGGGCCAACTACGTCCTAGGGGATGTAATGTAA

Coding sequence (CDS)

ATGTCAAACCCTCTTTTACTCTCTCTTTCCCTTTGCTTCTTGGTTCTCTTCAATGCCTGCCTAGCAATCAACGAGAACCTTCACGATGTCTCCCGACACTTCAGCGAGGGCCAGAGTCGGTATCGTGAGTGTCGTCTGGACAGGCTCGAAGCCCTTGAACCCTCCCATCGTATCGACGCTGAAGGTGGTGTGATCGAGATGTGGGATCCTAGCCACGAGATGTTTCGATGCGCCGGTGTGGCTATCCAAAGATATATTATCGATCCGAATGGCCTTCTTCTCCCTCAGTACACCAATGCCCCACGACTTATCTACATTGAGAGAGGGAGAGGGTTCGAGGGAGTTGTACTCCCAGGTTGCCCTGAAACGTACGAAGAGTCTCAACAATCGGCTGGGGAGTTCCGAGACCGACATCAAAAGATTCGCCATGTACGTGCTGGCGACCTCTTCGCTGTGCCTGCTGGTTCTGCACATTGGACCTACAATGATGGCAATGAGAGATTGATTGCAGTTGTTCTTCTTGACGTTAGTAACCACGCCAACCAACTTGACTTCCATCCTAGGGCCTTCTACTTGGCCGGGAACCCAGAAGAGGAGTTTCCAGAGTGGAGGTCAGATTGGAAGCGAGAGCAGGGACGACACGGTAGTCGTAAGGAGGGATCAAGTAACAAGAACAACATCTTCTATGCCTTCGACGACAGAGTTCTTGCAGAAATTCTCAACATAAATACGGAGACAGCGAGGAAGCTTCGCGGAGAAGATGACTTTAGGCGCAACATCATAAAGGTTGAGGGACAACTTGAGGTGATTAGGCCACCAAGATCGCGAGGAGGAGGAAGAGGAGAGGAGCGAGAATGGGAAGAGGAACAAGAAGAGGAGATGGAGAGACAACATGAGCGCCACCAACGTCGTCGATGGACGGACAATGGTTTGGGTGAAACTATTTGCTCTATGAGAATGAAGGAGAATATTGGTGATGCTTCACGCGCTGATATACACACACCTGAAGCTGGTCGTCTTGCCACCACCAACAGCCATCGCTTCCCCATCCTTCGCTGGCTTCAACTTAGTGCCGAGCGAGGTGTTCTTTACAGAAATGCAATGTATGTTCCACACTGGAACCAAAACGCACATAGCGTAATATTCGTAACAAGAGGGCGAGCGAGAGTACAAGTAGTAGACCACAAAGGCCAAACCGTATTCGACGGCGAGCTACAACAACGACAGGTTCTAGTGGTTCCACAAAACTTCGCCATAGTGAAAAAGGCAAGCGAGGAAGGGTTCGAGTGGGTTTCATTCAAGACCAACGATAATGCCATGATTAACACACTGGCCGGTCGCACCTCCGCCATGAGAGCATTCCCAGTTCAAGTCATTGCCAGTGCTTATAGAATGTCGACTGAAGAGGCTCGAAGGCTCAAATTCAACAGAGAAGAGACCACTTTACTTCCTCCGAGCATGGCCTCATCGGCGCGCAGGGCCAACTACGTCCTAGGGGATGTAATGTAA

Protein sequence

MSNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDAEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEILNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHERHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREETTLLPPSMASSARRANYVLGDVM
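
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The sketch below is a hedged illustration in Python using the standard genetic code (an assumption that is reasonable for a nuclear plant gene); the helper name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 12 nucleotides of the CDS listed in this record.
head = translate("ATGTCAAACCCTCTTTTACTCTCTCTTTCC")
tail = translate("GATGTAATGTAA")
print(head, tail)
```

The translated start corresponds to the first ten residues of the protein sequence, and the final codon is a stop, which is not shown in the protein listing.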
Homology
BLAST of HG10012034 vs. NCBI nr
Match: XP_038888918.1 (11S globulin-like [Benincasa hispida])

HSP 1 Score: 910.6 bits (2352), Expect = 6.0e-261
Identity = 448/494 (90.69%), Positives = 468/494 (94.74%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDA 60
           M NPL LSLS CFLVLFN CLA +ENL DVSRHF EG+ RYRECRLDRL+ALEPS RI+A
Sbjct: 1   MGNPLFLSLSFCFLVLFNGCLATDENLRDVSRHFREGERRYRECRLDRLDALEPSRRIEA 60

Query: 61  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 120
           EGGVIEMWDPSHEMFRCAGVA+QRYIIDPNGLLLP YTNAP+LIYIERGRGF+GVVLPGC
Sbjct: 61  EGGVIEMWDPSHEMFRCAGVAVQRYIIDPNGLLLPHYTNAPQLIYIERGRGFKGVVLPGC 120

Query: 121 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 180
           PETY+ESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSA WTYNDGNERLIAVVLLDVSNHA
Sbjct: 121 PETYQESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAQWTYNDGNERLIAVVLLDVSNHA 180

Query: 181 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEIL 240
           NQLDFHPRAFYLAGNPEEEFPEWRS+W++E  R  SRKEGSSNKNNIFYAFDDRVLAEIL
Sbjct: 181 NQLDFHPRAFYLAGNPEEEFPEWRSEWRQEGRRQSSRKEGSSNKNNIFYAFDDRVLAEIL 240

Query: 241 NINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHER 300
           NINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGG RGEEREWEEEQEEEMERQ ER
Sbjct: 241 NINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGHRGEEREWEEEQEEEMERQRER 300

Query: 301 HQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAER 360
           HQ RRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAER
Sbjct: 301 HQGRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAER 360

Query: 361 GVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIV 420
           GVLYRNAMYVPHWNQNAHS+IFVTRGRARVQVVD +GQTVFDGELQQRQVLVVPQNFAIV
Sbjct: 361 GVLYRNAMYVPHWNQNAHSIIFVTRGRARVQVVDCRGQTVFDGELQQRQVLVVPQNFAIV 420

Query: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREET 480
           KKA +EGFEWVSFKTNDNAMINTLAGRTS MRAFPVQV+ASAYRMSTEEARRLKFNR+ET
Sbjct: 421 KKAGDEGFEWVSFKTNDNAMINTLAGRTSVMRAFPVQVLASAYRMSTEEARRLKFNRDET 480

Query: 481 TLLPPSMASSARRA 495
           TLLPP M+SS R A
Sbjct: 481 TLLPPRMSSSRRPA 494
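
The percentages in an HSP summary line are simple ratios over the alignment length, so they can be recomputed from the counts. The following Python sketch parses one such line and recalculates both values; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative choices, and the pattern only assumes the layout used in this report.

```python
import re

# Accepts "Positives" as well as the "Postives" spelling that appears
# in some renderings of this report.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity/positive counts and recompute their percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_pct": (float(ident_pct), float(pos_pct)),
    }

stats = parse_hsp_stats(
    "Identity = 448/494 (90.69%), Positives = 468/494 (94.74%), Query Frame = 0"
)
print(stats)
```

Comparing the recomputed values with `reported_pct` is a quick consistency check when HSP lines are copied between tools.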

BLAST of HG10012034 vs. NCBI nr
Match: XP_008447425.1 (PREDICTED: 11S globulin subunit beta-like [Cucumis melo] >KAA0038056.1 11S globulin subunit beta-like [Cucumis melo var. makuwa] >TYK20548.1 11S globulin subunit beta-like [Cucumis melo var. makuwa])

HSP 1 Score: 910.2 bits (2351), Expect = 7.8e-261
Identity = 449/497 (90.34%), Positives = 468/497 (94.16%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDA 60
           M NPL LSLSLCFLVLFN CLA +ENL +VSR F EGQSRYRECRLDRL+ALEPS RI+A
Sbjct: 1   MGNPLFLSLSLCFLVLFNGCLATDENLREVSRRFGEGQSRYRECRLDRLDALEPSRRIEA 60

Query: 61  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 120
           EGGVIEMWDPSHEMFRCAGVAIQRY+IDPNGLLLPQYTNAPRLIYIERGRGF+GVVLPGC
Sbjct: 61  EGGVIEMWDPSHEMFRCAGVAIQRYVIDPNGLLLPQYTNAPRLIYIERGRGFKGVVLPGC 120

Query: 121 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 180
           PETY+ESQQSAGEFRDRHQKI HVRAGDLFAVPAGSAHWTYNDGNE+LIAVVLLDVSNHA
Sbjct: 121 PETYQESQQSAGEFRDRHQKIHHVRAGDLFAVPAGSAHWTYNDGNEKLIAVVLLDVSNHA 180

Query: 181 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEIL 240
           NQLDFHPRAFYLAGNPEEEFPEWRS WK EQGRH  R+EGSSNKNNIF+AFDDRVLAEIL
Sbjct: 181 NQLDFHPRAFYLAGNPEEEFPEWRSQWKGEQGRHSGRREGSSNKNNIFFAFDDRVLAEIL 240

Query: 241 NINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHER 300
           NIN E ARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGG RGEE+EWEEEQEEEM+RQ ER
Sbjct: 241 NINIELARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGRRGEEQEWEEEQEEEMQRQRER 300

Query: 301 HQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAER 360
           HQRRRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAER
Sbjct: 301 HQRRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAER 360

Query: 361 GVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIV 420
           GVLYRNAMY PHWN NAHSVIFVTRGRARVQVVD +GQTV+DGELQQ QVLVVPQNFAIV
Sbjct: 361 GVLYRNAMYAPHWNLNAHSVIFVTRGRARVQVVDCRGQTVYDGELQQYQVLVVPQNFAIV 420

Query: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREET 480
           KKASEEGFEWVSFKTNDNAMINTLAGRTS MRAFPVQV+ASAYRMSTEEARRLK NREET
Sbjct: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSVMRAFPVQVLASAYRMSTEEARRLKLNREET 480

Query: 481 TLLPPSMASSARRANYV 498
           TLLPP M+SS R AN V
Sbjct: 481 TLLPPRMSSSRRPANPV 497

BLAST of HG10012034 vs. NCBI nr
Match: XP_011651441.2 (11S globulin [Cucumis sativus] >KAE8650657.1 hypothetical protein Csa_009929 [Cucumis sativus])

HSP 1 Score: 900.6 bits (2326), Expect = 6.2e-258
Identity = 445/498 (89.36%), Positives = 467/498 (93.78%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVS-RHFSEGQSRYRECRLDRLEALEPSHRID 60
           M NPL LSLSLCFLVLFN CLA +ENL DVS R++ EGQSRYRECRLDRL+ALEPS RI+
Sbjct: 1   MGNPLFLSLSLCFLVLFNGCLATDENLRDVSRRYYGEGQSRYRECRLDRLDALEPSRRIE 60

Query: 61  AEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPG 120
           AEGG+IEMWDPSHEMFRCAGVA+QRYIIDPNGLLLPQYTNAPRLIY+ERGRG +GVVLPG
Sbjct: 61  AEGGIIEMWDPSHEMFRCAGVAVQRYIIDPNGLLLPQYTNAPRLIYVERGRGIKGVVLPG 120

Query: 121 CPETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNH 180
           CPETY+ESQQSAGEFRDRHQKI HVRAGDLFAVPAGSAHWTYNDGNE+LIAVVLLDVSNH
Sbjct: 121 CPETYQESQQSAGEFRDRHQKIHHVRAGDLFAVPAGSAHWTYNDGNEKLIAVVLLDVSNH 180

Query: 181 ANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEI 240
           ANQLDFHPRAFYLAGNPEEEFPEWRS WK EQGRH SRKEGSSNKNNIFYAFDDRVLAEI
Sbjct: 181 ANQLDFHPRAFYLAGNPEEEFPEWRSQWKGEQGRHSSRKEGSSNKNNIFYAFDDRVLAEI 240

Query: 241 LNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHE 300
           LNIN E A K+RG DDFRRNIIKVEGQL+VIRPPRSRGG RGEE+EWEEEQEEEM+RQ E
Sbjct: 241 LNINIELATKIRGGDDFRRNIIKVEGQLQVIRPPRSRGGRRGEEQEWEEEQEEEMQRQRE 300

Query: 301 RHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAE 360
           RHQ RRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAE
Sbjct: 301 RHQGRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAE 360

Query: 361 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAI 420
           RGVLYRNAMY PHWNQNAHSVIFVTRGRARVQVVD +GQTV+DGELQQRQVLVVPQNFAI
Sbjct: 361 RGVLYRNAMYAPHWNQNAHSVIFVTRGRARVQVVDCRGQTVYDGELQQRQVLVVPQNFAI 420

Query: 421 VKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 480
           VKKASEEGFEWVSFKTNDNAMINTLAGRTS MRAFPVQV+ASAYRMSTEEARRLK NREE
Sbjct: 421 VKKASEEGFEWVSFKTNDNAMINTLAGRTSVMRAFPVQVLASAYRMSTEEARRLKLNREE 480

Query: 481 TTLLPPSMASSARRANYV 498
           TTLL P M+SS R AN V
Sbjct: 481 TTLLAPRMSSSRRPANPV 498

BLAST of HG10012034 vs. NCBI nr
Match: XP_008447426.1 (PREDICTED: 11S globulin subunit beta-like [Cucumis melo] >KAA0038058.1 11S globulin subunit beta-like [Cucumis melo var. makuwa] >TYK20545.1 11S globulin subunit beta-like [Cucumis melo var. makuwa])

HSP 1 Score: 890.6 bits (2300), Expect = 6.4e-255
Identity = 442/490 (90.20%), Positives = 464/490 (94.69%), Query Frame = 0

Query: 2   SNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSE-GQSRYRECRLDRLEALEPSHRIDA 61
           +NPL LSLSLCFLVLFNACLA N+N   VSR F E GQSRYRECRLD+LEA+EPS RI+A
Sbjct: 3   NNPLFLSLSLCFLVLFNACLATNDNFRYVSRRFGEAGQSRYRECRLDKLEAVEPSRRIEA 62

Query: 62  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 121
           EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGF+GVVLPGC
Sbjct: 63  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFKGVVLPGC 122

Query: 122 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 181
           P+TY+ESQQS G FRD+HQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA
Sbjct: 123 PQTYQESQQSGGAFRDQHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 182

Query: 182 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGS-RKEGSSNKNNIFYAFDDRVLAEI 241
           NQLDFHPR FYLAGNPEEEFPEWR  WKREQGRH S RKEGSSNKNNIFYAFDDRVLAEI
Sbjct: 183 NQLDFHPRTFYLAGNPEEEFPEWRLQWKREQGRHMSGRKEGSSNKNNIFYAFDDRVLAEI 242

Query: 242 LNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHE 301
           LNIN E ARKLRGEDDFRRNIIKVEG LEVIRPPRSRGG RGEE+EWEEEQEEEMERQ E
Sbjct: 243 LNINIELARKLRGEDDFRRNIIKVEGGLEVIRPPRSRGGRRGEEQEWEEEQEEEMERQRE 302

Query: 302 RHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAE 361
           RHQR RW +NGL ETICSM+MKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAE
Sbjct: 303 RHQRSRWDENGLDETICSMKMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAE 362

Query: 362 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAI 421
           RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVV+ +GQTVFDGELQQRQVLVVPQNFA+
Sbjct: 363 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVNCRGQTVFDGELQQRQVLVVPQNFAV 422

Query: 422 VKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 481
           +KKASE+GFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE
Sbjct: 423 LKKASEDGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 482

Query: 482 TTLLPPSMAS 490
           TTL+PP M+S
Sbjct: 483 TTLIPPRMSS 492

BLAST of HG10012034 vs. NCBI nr
Match: XP_004152049.1 (11S globulin [Cucumis sativus] >KGN57907.1 hypothetical protein Csa_009548 [Cucumis sativus])

HSP 1 Score: 887.5 bits (2292), Expect = 5.4e-254
Identity = 438/490 (89.39%), Positives = 460/490 (93.88%), Query Frame = 0

Query: 1   MSNPL-LLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRID 60
           M NPL  LSLSLCFLVLFN CLA  EN HDVSR F EGQSRYRECRLD LEALEPS RI+
Sbjct: 1   MGNPLHFLSLSLCFLVLFNGCLATKENFHDVSRRFREGQSRYRECRLDMLEALEPSRRIE 60

Query: 61  AEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPG 120
           AEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRL+YIE GRG +GVVLPG
Sbjct: 61  AEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLMYIESGRGIKGVVLPG 120

Query: 121 CPETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNH 180
           CP+TY+ESQ+SAG FRD+HQKIRHVRAGDLFAVPAGSAHWTYNDGNE+LIAVVLLDVSNH
Sbjct: 121 CPQTYQESQKSAGAFRDQHQKIRHVRAGDLFAVPAGSAHWTYNDGNEKLIAVVLLDVSNH 180

Query: 181 ANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEI 240
           ANQLDFHPRAFYLAGNPEEEFPEWRS WK EQGRH  RKEGSSNKNNIFYAFDDRVLAEI
Sbjct: 181 ANQLDFHPRAFYLAGNPEEEFPEWRSQWKGEQGRHSGRKEGSSNKNNIFYAFDDRVLAEI 240

Query: 241 LNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHE 300
           LNIN E A KLRG DDFRRNIIKVEGQL+VIRPPRSRGG RGEE+EWEEEQEEEM+RQ E
Sbjct: 241 LNINIELASKLRGGDDFRRNIIKVEGQLQVIRPPRSRGGRRGEEQEWEEEQEEEMQRQRE 300

Query: 301 RHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAE 360
           RHQ RRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAE
Sbjct: 301 RHQGRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAE 360

Query: 361 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAI 420
           RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVV+ +GQTVFDGELQQRQVLVVPQNFA+
Sbjct: 361 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVNCRGQTVFDGELQQRQVLVVPQNFAV 420

Query: 421 VKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 480
           +KKAS+EGFEWVSFKTNDNAMINTLAGR SAMRAFPVQVIASAYR+STEEARRLKFNREE
Sbjct: 421 LKKASDEGFEWVSFKTNDNAMINTLAGRISAMRAFPVQVIASAYRVSTEEARRLKFNREE 480

Query: 481 TTLLPPSMAS 490
           T L+PPSM+S
Sbjct: 481 TNLIPPSMSS 490

BLAST of HG10012034 vs. ExPASy Swiss-Prot
Match: A0A1L6K371 (11S globulin OS=Juglans nigra OX=16719 PE=1 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 4.9e-157
Identity = 292/515 (56.70%), Positives = 381/515 (73.98%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDA 60
           M+ P+LLS+SLC + L N CLA         +     Q R+ EC+L RL ALEPS+RI+A
Sbjct: 1   MAKPILLSISLCLVALVNGCLA---------QSGGRQQPRFGECKLKRLVALEPSNRIEA 60

Query: 61  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 120
           E GVIE WDP+++ F+CAGVA+ R  I+PNGLLLPQY+NAP+L+YI +GRG  GV+ PGC
Sbjct: 61  EAGVIESWDPNNQQFQCAGVAVVRRTIEPNGLLLPQYSNAPQLLYIVKGRGITGVLFPGC 120

Query: 121 PETYEESQQ----------SAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIA 180
           PET+EESQQ          SA   RDRHQKIRH R GD+ A PAG AHW YNDG+  ++A
Sbjct: 121 PETFEESQQGQSRIRPSLRSASFQRDRHQKIRHFREGDVIAFPAGVAHWCYNDGDTPVVA 180

Query: 181 VVLLDVSNHANQLDFHPRAFYLAGNPEEEF-PEWRSDW------KREQGRHGS-RKEGSS 240
           V L+D +N+ANQLD +PR FYLAGNP++EF P+ + ++      ++ Q RHG   ++   
Sbjct: 181 VALMDTTNNANQLDQNPRNFYLAGNPDDEFRPQGQQEYEQHRRQQQHQQRHGEPGQQQRG 240

Query: 241 NKNNIFYAFDDRVLAEILNINTETARKLRGEDDFRRNIIKVEG-QLEVIRPPRSRGGGRG 300
           + NN+F  FD   LA+  N++TETAR+L+ E+D RR+I++VEG QL+VIRP  SR     
Sbjct: 241 SGNNVFSGFDADFLADAFNVDTETARRLQSENDHRRSIVRVEGRQLQVIRPRWSR---EE 300

Query: 301 EEREWEEEQEEEMERQHERHQRRRW--TDNGLGETICSMRMKENIGDASRADIHTPEAGR 360
           +ERE  +E+E E E + ER Q RR    DNGL ETIC++R++ENIGD SRADI+T EAGR
Sbjct: 301 QEREERKERERERESESERRQSRRGGRDDNGLEETICTLRLRENIGDPSRADIYTEEAGR 360

Query: 361 LATTNSHRFPILRWLQLSAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQT 420
           ++T NSH  P+LRWLQLSAERG LY +A+YVPHWN NAHSV++  RGRA VQVVD+ GQT
Sbjct: 361 ISTANSHTLPVLRWLQLSAERGALYSDALYVPHWNLNAHSVVYALRGRAEVQVVDNFGQT 420

Query: 421 VFDGELQQRQVLVVPQNFAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVI 480
           VFD EL++ Q+L +PQNFA+VK+A  EGFEWVSFKTN+NAM++ LAGRTSA+RA P +V+
Sbjct: 421 VFDDELREGQLLTIPQNFAVVKRARNEGFEWVSFKTNENAMVSPLAGRTSAIRALPEEVL 480

Query: 481 ASAYRMSTEEARRLKFNREETTLL--PPSMASSAR 493
           A+A ++  E+ARRLKFNR+E+TL+   PS + S+R
Sbjct: 481 ANALQIPREDARRLKFNRQESTLVRSRPSSSRSSR 503

BLAST of HG10012034 vs. ExPASy Swiss-Prot
Match: Q2TPW5 (11S globulin seed storage protein Jug r 4 OS=Juglans regia OX=51240 PE=1 SV=1)

HSP 1 Score: 552.0 bits (1421), Expect = 7.1e-156
Identity = 290/514 (56.42%), Positives = 387/514 (75.29%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLV-LFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRID 60
           M+ P+LLS+ L  +V LFN CLA         +     Q ++ +C+L+RL+ALEP++RI+
Sbjct: 1   MAKPILLSIYLFLIVALFNGCLA---------QSGGRQQQQFGQCQLNRLDALEPTNRIE 60

Query: 61  AEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPG 120
           AE GVIE WDP+++ F+CAGVA+ R  I+PNGLLLPQY+NAP+L+YI RGRG  GV+ PG
Sbjct: 61  AEAGVIESWDPNNQQFQCAGVAVVRRTIEPNGLLLPQYSNAPQLVYIARGRGITGVLFPG 120

Query: 121 CPETYEESQQSA--GEFR----DRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVL 180
           CPET+EESQ+ +  G+ R    DRHQKIRH R GD+ A PAG AHW+YNDG+  ++A+ L
Sbjct: 121 CPETFEESQRQSQQGQSREFQQDRHQKIRHFREGDIIAFPAGVAHWSYNDGSNPVVAISL 180

Query: 181 LDVSNHANQLDFHPRAFYLAGNPEEEF-PEWRSDW---KREQ------GRHGSRKEGSSN 240
           LD +N+ANQLD +PR FYLAGNP++EF P+ + ++   +R+Q      G HG ++ G   
Sbjct: 181 LDTNNNANQLDQNPRNFYLAGNPDDEFRPQGQQEYEQHRRQQQRQQRPGEHGQQQRGLG- 240

Query: 241 KNNIFYAFDDRVLAEILNINTETARKLRGEDDFRRNIIKVEG-QLEVIRPPRSRGGGRGE 300
            NN+F  FD   LA+  N++TETAR+L+ E+D RR+I++VEG QL+VIRP  SR     +
Sbjct: 241 -NNVFSGFDADFLADAFNVDTETARRLQSENDHRRSIVRVEGRQLQVIRPRWSR---EEQ 300

Query: 301 EREWEEEQEEEMERQHERHQRRRW--TDNGLGETICSMRMKENIGDASRADIHTPEAGRL 360
           ERE  +E+E E E + ER Q RR    DNGL ETIC++R++ENIGD SRADI+T EAGR+
Sbjct: 301 EREERKERERERESESERRQSRRGGRDDNGLEETICTLRLRENIGDPSRADIYTEEAGRI 360

Query: 361 ATTNSHRFPILRWLQLSAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTV 420
           +T NSH  P+LRWLQLSAERG LY +A+YVPHWN NAHSV++  RGRA VQVVD+ GQTV
Sbjct: 361 STVNSHTLPVLRWLQLSAERGALYSDALYVPHWNLNAHSVVYALRGRAEVQVVDNFGQTV 420

Query: 421 FDGELQQRQVLVVPQNFAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIA 480
           FD EL++ Q+L +PQNFA+VK+A  EGFEWVSFKTN+NAM++ LAGRTSA+RA P +V+A
Sbjct: 421 FDDELREGQLLTIPQNFAVVKRARNEGFEWVSFKTNENAMVSPLAGRTSAIRALPEEVLA 480

Query: 481 SAYRMSTEEARRLKFNREETTLL--PPSMASSAR 493
           +A+++  E+ARRLKFNR+E+TL+   PS + S+R
Sbjct: 481 TAFQIPREDARRLKFNRQESTLVRSRPSRSRSSR 500

BLAST of HG10012034 vs. ExPASy Swiss-Prot
Match: B5KVH4 (11S globulin seed storage protein 1 OS=Carya illinoinensis OX=32201 PE=1 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 1.9e-153
Identity = 286/511 (55.97%), Positives = 378/511 (73.97%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLV--LFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRI 60
           M+ P+LLS+ LC ++  LFN CLA         +     Q ++ +C+L+RL+ALEP++RI
Sbjct: 1   MAKPILLSIYLCLIIVALFNGCLA---------QSGGRQQHKFGQCQLNRLDALEPTNRI 60

Query: 61  DAEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLP 120
           +AE GVIE WDP+H+  +CAGVA+ R  I+PNGLLLP Y+NAP+L+YI RGRG  GV+ P
Sbjct: 61  EAEAGVIESWDPNHQQLQCAGVAVVRRTIEPNGLLLPHYSNAPQLVYIARGRGITGVLFP 120

Query: 121 GCPETYEESQQSA-----GEF-RDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVV 180
           GCPET+EESQ+ +      EF +DRHQKIRH R GD+ A PAG AHW YNDG+  ++A+ 
Sbjct: 121 GCPETFEESQRQSQQGQRREFQQDRHQKIRHFREGDIIAFPAGVAHWCYNDGSSPVVAIF 180

Query: 181 LLDVSNHANQLDFHPRAFYLAGNPEEEF-PEWRSDWK--REQGRHGSRK--EGSSNK--- 240
           LLD  N+ANQLD +PR FYLAGNP++EF P+ + +++  R Q +H  R+   G   +   
Sbjct: 181 LLDTHNNANQLDQNPRNFYLAGNPDDEFRPQGQQEYEQHRRQQQHQQRRGEHGEQQRDLG 240

Query: 241 NNIFYAFDDRVLAEILNINTETARKLRGEDDFRRNIIKVEG-QLEVIRPPRSRGGGRGEE 300
           NN+F  FD   LA+  N++TETAR+L+ E+D R +I++VEG QL+VIRP  SR     EE
Sbjct: 241 NNVFSGFDAEFLADAFNVDTETARRLQSENDHRGSIVRVEGRQLQVIRPRWSR-----EE 300

Query: 301 REWEEEQEEEMER--QHERHQRRRW--TDNGLGETICSMRMKENIGDASRADIHTPEAGR 360
           +E EE +E E ER  + ER Q RR    DNGL ETIC++ ++ENIGD SRADI+T EAGR
Sbjct: 301 QEHEERKERERERESESERRQSRRGGRDDNGLEETICTLSLRENIGDPSRADIYTEEAGR 360

Query: 361 LATTNSHRFPILRWLQLSAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQT 420
           ++T NSH  PILRWLQLSAERG LY +A+YVPHWN NAHSV++  RGRA VQVVD+ GQT
Sbjct: 361 ISTVNSHNLPILRWLQLSAERGALYSDALYVPHWNLNAHSVVYALRGRAEVQVVDNFGQT 420

Query: 421 VFDGELQQRQVLVVPQNFAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVI 480
           VFD EL++ Q+L +PQNFA+VK+A +EGFEWVSFKTN+NAM++ LAGRTSA+RA P +V+
Sbjct: 421 VFDDELREGQLLTIPQNFAVVKRARDEGFEWVSFKTNENAMVSPLAGRTSAIRALPEEVL 480

Query: 481 ASAYRMSTEEARRLKFNREETTLLPPSMASS 491
            +A+++  E+ARRLKFNR+E+TL+     SS
Sbjct: 481 VNAFQIPREDARRLKFNRQESTLVRSRSRSS 497

BLAST of HG10012034 vs. ExPASy Swiss-Prot
Match: Q8GZP6 (11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occidentale OX=171929 PE=1 SV=1)

HSP 1 Score: 487.3 bits (1253), Expect = 2.1e-136
Identity = 252/482 (52.28%), Positives = 335/482 (69.50%), Query Frame = 0

Query: 9   LSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDAEGGVIEMW 68
           LS+CFL+LF+ CLA        SR   + Q    EC++DRL+ALEP +R++ E G +E W
Sbjct: 1   LSVCFLILFHGCLA--------SRQEWQQQD---ECQIDRLDALEPDNRVEYEAGTVEAW 60

Query: 69  DPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYEESQ 128
           DP+HE FRCAGVA+ R+ I PNGLLLPQY+NAP+LIY+ +G G  G+  PGCPETY+  Q
Sbjct: 61  DPNHEQFRCAGVALVRHTIQPNGLLLPQYSNAPQLIYVVQGEGMTGISYPGCPETYQAPQ 120

Query: 129 Q-----SAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHANQL 188
           Q      +G F+DRHQKIR  R GD+ A+PAG AHW YN+GN  ++ V LLDVSN  NQL
Sbjct: 121 QGRQQGQSGRFQDRHQKIRRFRRGDIIAIPAGVAHWCYNEGNSPVVTVTLLDVSNSQNQL 180

Query: 189 DFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEILNIN 248
           D  PR F+LAGNP++ F        ++Q +H SR        N+F  FD  +LAE   ++
Sbjct: 181 DRTPRKFHLAGNPKDVF--------QQQQQHQSR------GRNLFSGFDTELLAEAFQVD 240

Query: 249 TETARKLRGEDDFRRNIIKV-EGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHERHQ 308
               ++L+ ED+ R  I+KV + +L VIRP RS+   RG E E E E E           
Sbjct: 241 ERLIKQLKSEDN-RGGIVKVKDDELRVIRPSRSQ-SERGSESEEESEDE----------- 300

Query: 309 RRRW--TDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAER 368
           +RRW   DNG+ ETIC+MR+KENI D +RADI+TPE GRL T NS   PIL+WLQLS E+
Sbjct: 301 KRRWGQRDNGIEETICTMRLKENINDPARADIYTPEVGRLTTLNSLNLPILKWLQLSVEK 360

Query: 369 GVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIV 428
           GVLY+NA+ +PHWN N+HS+I+  +G+ +VQVVD+ G  VFDGE+++ Q+LVVPQNFA+V
Sbjct: 361 GVLYKNALVLPHWNLNSHSIIYGCKGKGQVQVVDNFGNRVFDGEVREGQMLVVPQNFAVV 420

Query: 429 KKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREET 483
           K+A EE FEW+SFKTND AM + LAGRTS +   P +V+A+A+++S E+AR++KFN ++T
Sbjct: 421 KRAREERFEWISFKTNDRAMTSPLAGRTSVLGGMPEEVLANAFQISREDARKIKFNNQQT 444

BLAST of HG10012034 vs. ExPASy Swiss-Prot
Match: E3SH28 (Prunin 1 Pru du 6.0101 OS=Prunus dulcis OX=3755 PE=1 SV=1)

HSP 1 Score: 457.6 bits (1176), Expect = 1.8e-127
Identity = 250/558 (44.80%), Positives = 338/558 (60.57%), Query Frame = 0

Query: 10  SLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDAEGGVIEMWD 69
           SLC L++FN CLA  ++            S   +C+L++L+A EP +RI AE G IE W+
Sbjct: 8   SLCLLLVFNGCLAARQS----------QLSPQNQCQLNQLQAREPDNRIQAEAGQIETWN 67

Query: 70  PSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYEESQQ 129
            + E F+CAGVA  R  I  NGL LP Y+NAP+LIYI +GRG  G V  GCPET+EESQQ
Sbjct: 68  FNQEDFQCAGVAASRITIQRNGLHLPSYSNAPQLIYIVQGRGVLGAVFSGCPETFEESQQ 127

Query: 130 SAGEFR------------------------------------------------------ 189
           S+ + R                                                      
Sbjct: 128 SSQQGRQQEQEQERQQQQQGEQGRQQGQQEQQQERQGRQQGRQQQEEGRQQEQQQGQQGR 187

Query: 190 ----------DRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHANQLDF 249
                     DRHQK R +R GD+ A+PAG A+W+YNDG++ L+AV L  VS+  NQLD 
Sbjct: 188 PQQQQQFRQFDRHQKTRRIREGDVVAIPAGVAYWSYNDGDQELVAVNLFHVSSDHNQLDQ 247

Query: 250 HPRAFYLAGNPEEEFPEWRSDWKREQGRHG------------SRKEGSSNKNNIFYAFDD 309
           +PR FYLAGNPE EF +      R+QG  G             ++E   + NN+F  F+ 
Sbjct: 248 NPRKFYLAGNPENEFNQQGQSQPRQQGEQGRPGQHQQPFGRPRQQEQQGSGNNVFSGFNT 307

Query: 310 RVLAEILNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEE 369
           ++LA+ LN+N ETAR L+G++D R  II+V G L+ ++PPR R     +ERE EE Q+E+
Sbjct: 308 QLLAQALNVNEETARNLQGQNDNRNQIIRVRGNLDFVQPPRGR-----QEREHEERQQEQ 367

Query: 370 MERQHERHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRW 429
           ++ Q  + Q  +   NGL ET CS+R+KENIG+  RADI +P AGR++T NSH  PILR+
Sbjct: 368 LQ-QERQQQGGQLMANGLEETFCSLRLKENIGNPERADIFSPRAGRISTLNSHNLPILRF 427

Query: 430 LQLSAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVV 489
           L+LSAERG  YRN +Y PHWN NAHSV++V RG ARVQVV+  G  + D E+QQ Q+ +V
Sbjct: 428 LRLSAERGFFYRNGIYSPHWNVNAHSVVYVIRGNARVQVVNENGDAILDQEVQQGQLFIV 487

Query: 490 PQNFAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRL 492
           PQN  ++++A  +GFE+ +FKT +NA INTLAGRTS +RA P +V+A+AY++S E+AR+L
Sbjct: 488 PQNHGVIQQAGNQGFEYFAFKTEENAFINTLAGRTSFLRALPDEVLANAYQISREQARQL 547

BLAST of HG10012034 vs. ExPASy TrEMBL
Match: A0A5A7T783 (11S globulin subunit beta-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold237G001360 PE=3 SV=1)

HSP 1 Score: 910.2 bits (2351), Expect = 3.8e-261
Identity = 449/497 (90.34%), Positives = 468/497 (94.16%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDA 60
           M NPL LSLSLCFLVLFN CLA +ENL +VSR F EGQSRYRECRLDRL+ALEPS RI+A
Sbjct: 1   MGNPLFLSLSLCFLVLFNGCLATDENLREVSRRFGEGQSRYRECRLDRLDALEPSRRIEA 60

Query: 61  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 120
           EGGVIEMWDPSHEMFRCAGVAIQRY+IDPNGLLLPQYTNAPRLIYIERGRGF+GVVLPGC
Sbjct: 61  EGGVIEMWDPSHEMFRCAGVAIQRYVIDPNGLLLPQYTNAPRLIYIERGRGFKGVVLPGC 120

Query: 121 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 180
           PETY+ESQQSAGEFRDRHQKI HVRAGDLFAVPAGSAHWTYNDGNE+LIAVVLLDVSNHA
Sbjct: 121 PETYQESQQSAGEFRDRHQKIHHVRAGDLFAVPAGSAHWTYNDGNEKLIAVVLLDVSNHA 180

Query: 181 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEIL 240
           NQLDFHPRAFYLAGNPEEEFPEWRS WK EQGRH  R+EGSSNKNNIF+AFDDRVLAEIL
Sbjct: 181 NQLDFHPRAFYLAGNPEEEFPEWRSQWKGEQGRHSGRREGSSNKNNIFFAFDDRVLAEIL 240

Query: 241 NINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHER 300
           NIN E ARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGG RGEE+EWEEEQEEEM+RQ ER
Sbjct: 241 NINIELARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGRRGEEQEWEEEQEEEMQRQRER 300

Query: 301 HQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAER 360
           HQRRRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAER
Sbjct: 301 HQRRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAER 360

Query: 361 GVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIV 420
           GVLYRNAMY PHWN NAHSVIFVTRGRARVQVVD +GQTV+DGELQQ QVLVVPQNFAIV
Sbjct: 361 GVLYRNAMYAPHWNLNAHSVIFVTRGRARVQVVDCRGQTVYDGELQQYQVLVVPQNFAIV 420

Query: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREET 480
           KKASEEGFEWVSFKTNDNAMINTLAGRTS MRAFPVQV+ASAYRMSTEEARRLK NREET
Sbjct: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSVMRAFPVQVLASAYRMSTEEARRLKLNREET 480

Query: 481 TLLPPSMASSARRANYV 498
           TLLPP M+SS R AN V
Sbjct: 481 TLLPPRMSSSRRPANPV 497

BLAST of HG10012034 vs. ExPASy TrEMBL
Match: A0A1S3BGV4 (11S globulin subunit beta-like OS=Cucumis melo OX=3656 GN=LOC103489872 PE=3 SV=1)

HSP 1 Score: 910.2 bits (2351), Expect = 3.8e-261
Identity = 449/497 (90.34%), Positives = 468/497 (94.16%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDA 60
           M NPL LSLSLCFLVLFN CLA +ENL +VSR F EGQSRYRECRLDRL+ALEPS RI+A
Sbjct: 1   MGNPLFLSLSLCFLVLFNGCLATDENLREVSRRFGEGQSRYRECRLDRLDALEPSRRIEA 60

Query: 61  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 120
           EGGVIEMWDPSHEMFRCAGVAIQRY+IDPNGLLLPQYTNAPRLIYIERGRGF+GVVLPGC
Sbjct: 61  EGGVIEMWDPSHEMFRCAGVAIQRYVIDPNGLLLPQYTNAPRLIYIERGRGFKGVVLPGC 120

Query: 121 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 180
           PETY+ESQQSAGEFRDRHQKI HVRAGDLFAVPAGSAHWTYNDGNE+LIAVVLLDVSNHA
Sbjct: 121 PETYQESQQSAGEFRDRHQKIHHVRAGDLFAVPAGSAHWTYNDGNEKLIAVVLLDVSNHA 180

Query: 181 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEIL 240
           NQLDFHPRAFYLAGNPEEEFPEWRS WK EQGRH  R+EGSSNKNNIF+AFDDRVLAEIL
Sbjct: 181 NQLDFHPRAFYLAGNPEEEFPEWRSQWKGEQGRHSGRREGSSNKNNIFFAFDDRVLAEIL 240

Query: 241 NINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHER 300
           NIN E ARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGG RGEE+EWEEEQEEEM+RQ ER
Sbjct: 241 NINIELARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGRRGEEQEWEEEQEEEMQRQRER 300

Query: 301 HQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAER 360
           HQRRRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAER
Sbjct: 301 HQRRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAER 360

Query: 361 GVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIV 420
           GVLYRNAMY PHWN NAHSVIFVTRGRARVQVVD +GQTV+DGELQQ QVLVVPQNFAIV
Sbjct: 361 GVLYRNAMYAPHWNLNAHSVIFVTRGRARVQVVDCRGQTVYDGELQQYQVLVVPQNFAIV 420

Query: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREET 480
           KKASEEGFEWVSFKTNDNAMINTLAGRTS MRAFPVQV+ASAYRMSTEEARRLK NREET
Sbjct: 421 KKASEEGFEWVSFKTNDNAMINTLAGRTSVMRAFPVQVLASAYRMSTEEARRLKLNREET 480

Query: 481 TLLPPSMASSARRANYV 498
           TLLPP M+SS R AN V
Sbjct: 481 TLLPPRMSSSRRPANPV 497
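
The percentages on each HSP statistics line are the quoted counts divided by the alignment length. The sketch below re-derives them as a consistency check. The function name `check_hsp_stats` and the regular expression are illustrative assumptions and not part of any BLAST tool; the pattern accepts both the "Positives" and "Postives" spellings, since report exports differ.

```python
import re

# Pattern for an HSP statistics line; "Posi?tives" tolerates both spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positive percentages and compare to the quoted ones."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident_calc = 100 * int(ident) / int(ident_len)
    pos_calc = 100 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(ident_calc, 2),
        "positives_pct": round(pos_calc, 2),
        # Quoted values carry two decimals, so allow a small tolerance.
        "identity_ok": abs(ident_calc - float(ident_pct)) < 0.006,
        "positives_ok": abs(pos_calc - float(pos_pct)) < 0.006,
    }


stats = check_hsp_stats(
    "Identity = 449/497 (90.34%), Positives = 468/497 (94.16%), Query Frame = 0"
)
print(stats)
```

The same check can be run on any of the other HSP statistics lines in this report.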

BLAST of HG10012034 vs. ExPASy TrEMBL
Match: A0A0A0L7E7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G384800 PE=3 SV=1)

HSP 1 Score: 897.1 bits (2317), Expect = 3.3e-257
Identity = 443/498 (88.96%), Positives = 465/498 (93.37%), Query Frame = 0

Query: 1   MSNPLLLSLSLCFLVLFNACLAINENLHDVS-RHFSEGQSRYRECRLDRLEALEPSHRID 60
           M NPL LSLSLCFLVLFN CLA +ENL DVS R++ EGQSRYRECRLDRL+ALEPS RI+
Sbjct: 1   MGNPLFLSLSLCFLVLFNGCLATDENLRDVSRRYYGEGQSRYRECRLDRLDALEPSRRIE 60

Query: 61  AEGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPG 120
           AEGG+IEMWDPSHEMFRCAGVA+QRYIIDPNGLLLPQYTNAPRLIY+ERGRG +GVVLPG
Sbjct: 61  AEGGIIEMWDPSHEMFRCAGVAVQRYIIDPNGLLLPQYTNAPRLIYVERGRGIKGVVLPG 120

Query: 121 CPETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNH 180
           CPETY+ESQQSAGEFRDRHQKI HVRAGDLFAVPAGSAHW YNDGNE+LIAVVLLDVSNH
Sbjct: 121 CPETYQESQQSAGEFRDRHQKIHHVRAGDLFAVPAGSAHWAYNDGNEKLIAVVLLDVSNH 180

Query: 181 ANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEI 240
           ANQLDFHPRAFYLAGNPEEEFPEWRS WK EQGRH  RKEGSSNKNNIFYAFDDRVLAEI
Sbjct: 181 ANQLDFHPRAFYLAGNPEEEFPEWRSQWKGEQGRHSGRKEGSSNKNNIFYAFDDRVLAEI 240

Query: 241 LNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHE 300
           LNIN E A K+RG DDFRRNIIKVEGQL+VIRPPRSRGG RGEE+EWEEEQEEEM+RQ E
Sbjct: 241 LNINIELATKIRGGDDFRRNIIKVEGQLQVIRPPRSRGGRRGEEQEWEEEQEEEMQRQRE 300

Query: 301 RHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAE 360
           RHQ RRW DNGL ETICSMRMKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAE
Sbjct: 301 RHQGRRWDDNGLDETICSMRMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAE 360

Query: 361 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAI 420
           RGVLYRNAMY PHWNQNAHSVIFVTRGRARVQVVD +GQTV+DGELQQRQVLVVPQNFAI
Sbjct: 361 RGVLYRNAMYAPHWNQNAHSVIFVTRGRARVQVVDCRGQTVYDGELQQRQVLVVPQNFAI 420

Query: 421 VKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 480
           VKKASEEGFEWVSFKTNDNAMINTLAGRTS MRAFPVQV+ASAYRMSTEEARRLK NREE
Sbjct: 421 VKKASEEGFEWVSFKTNDNAMINTLAGRTSVMRAFPVQVLASAYRMSTEEARRLKLNREE 480

Query: 481 TTLLPPSMASSARRANYV 498
           TTLL P M+SS R AN V
Sbjct: 481 TTLLAPRMSSSRRPANPV 498
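
In each three-line alignment block, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch. As a minimal sketch of how the identity and positive counts arise, the helper below tallies these columns for one block. The name `count_columns` is hypothetical, and positives are read from the midline rather than recomputed from a substitution matrix.

```python
def count_columns(query, midline, subject):
    """Tally identities, positives and gap columns for one alignment block."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("the three alignment lines must have equal length")
    # Identity: same residue in both sequences (a shared gap does not count).
    identities = sum(q == s and q != "-" for q, s in zip(query, subject))
    # Positive: any non-blank midline column (identical residue or '+').
    positives = sum(c != " " for c in midline)
    # Gap column: a '-' in either sequence.
    gaps = sum(q == "-" or s == "-" for q, s in zip(query, subject))
    return identities, positives, gaps, len(query)


# Final block (query residues 481..498) of the A0A0A0L7E7 alignment.
block = count_columns(
    "TTLLPPSMASSARRANYV",
    "TTLL P M+SS R AN V",
    "TTLLAPRMSSSRRPANPV",
)
print(block)
```

Summing these tallies over every block of an HSP should give the numerators and the alignment length quoted on its statistics line.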

BLAST of HG10012034 vs. ExPASy TrEMBL
Match: A0A5A7T5D7 (11S globulin subunit beta-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold237G001320 PE=3 SV=1)

HSP 1 Score: 890.6 bits (2300), Expect = 3.1e-255
Identity = 442/490 (90.20%), Positives = 464/490 (94.69%), Query Frame = 0

Query: 2   SNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSE-GQSRYRECRLDRLEALEPSHRIDA 61
           +NPL LSLSLCFLVLFNACLA N+N   VSR F E GQSRYRECRLD+LEA+EPS RI+A
Sbjct: 3   NNPLFLSLSLCFLVLFNACLATNDNFRYVSRRFGEAGQSRYRECRLDKLEAVEPSRRIEA 62

Query: 62  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 121
           EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGF+GVVLPGC
Sbjct: 63  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFKGVVLPGC 122

Query: 122 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 181
           P+TY+ESQQS G FRD+HQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA
Sbjct: 123 PQTYQESQQSGGAFRDQHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 182

Query: 182 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGS-RKEGSSNKNNIFYAFDDRVLAEI 241
           NQLDFHPR FYLAGNPEEEFPEWR  WKREQGRH S RKEGSSNKNNIFYAFDDRVLAEI
Sbjct: 183 NQLDFHPRTFYLAGNPEEEFPEWRLQWKREQGRHMSGRKEGSSNKNNIFYAFDDRVLAEI 242

Query: 242 LNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHE 301
           LNIN E ARKLRGEDDFRRNIIKVEG LEVIRPPRSRGG RGEE+EWEEEQEEEMERQ E
Sbjct: 243 LNINIELARKLRGEDDFRRNIIKVEGGLEVIRPPRSRGGRRGEEQEWEEEQEEEMERQRE 302

Query: 302 RHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAE 361
           RHQR RW +NGL ETICSM+MKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAE
Sbjct: 303 RHQRSRWDENGLDETICSMKMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAE 362

Query: 362 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAI 421
           RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVV+ +GQTVFDGELQQRQVLVVPQNFA+
Sbjct: 363 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVNCRGQTVFDGELQQRQVLVVPQNFAV 422

Query: 422 VKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 481
           +KKASE+GFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE
Sbjct: 423 LKKASEDGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 482

Query: 482 TTLLPPSMAS 490
           TTL+PP M+S
Sbjct: 483 TTLIPPRMSS 492

BLAST of HG10012034 vs. ExPASy TrEMBL
Match: A0A1S3BHF6 (11S globulin subunit beta-like OS=Cucumis melo OX=3656 GN=LOC103489873 PE=3 SV=1)

HSP 1 Score: 890.6 bits (2300), Expect = 3.1e-255
Identity = 442/490 (90.20%), Positives = 464/490 (94.69%), Query Frame = 0

Query: 2   SNPLLLSLSLCFLVLFNACLAINENLHDVSRHFSE-GQSRYRECRLDRLEALEPSHRIDA 61
           +NPL LSLSLCFLVLFNACLA N+N   VSR F E GQSRYRECRLD+LEA+EPS RI+A
Sbjct: 3   NNPLFLSLSLCFLVLFNACLATNDNFRYVSRRFGEAGQSRYRECRLDKLEAVEPSRRIEA 62

Query: 62  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGC 121
           EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGF+GVVLPGC
Sbjct: 63  EGGVIEMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFKGVVLPGC 122

Query: 122 PETYEESQQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 181
           P+TY+ESQQS G FRD+HQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA
Sbjct: 123 PQTYQESQQSGGAFRDQHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHA 182

Query: 182 NQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGS-RKEGSSNKNNIFYAFDDRVLAEI 241
           NQLDFHPR FYLAGNPEEEFPEWR  WKREQGRH S RKEGSSNKNNIFYAFDDRVLAEI
Sbjct: 183 NQLDFHPRTFYLAGNPEEEFPEWRLQWKREQGRHMSGRKEGSSNKNNIFYAFDDRVLAEI 242

Query: 242 LNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHE 301
           LNIN E ARKLRGEDDFRRNIIKVEG LEVIRPPRSRGG RGEE+EWEEEQEEEMERQ E
Sbjct: 243 LNINIELARKLRGEDDFRRNIIKVEGGLEVIRPPRSRGGRRGEEQEWEEEQEEEMERQRE 302

Query: 302 RHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAE 361
           RHQR RW +NGL ETICSM+MKENIGDASRAD++TPEAGRL+TTNSHRFPILRWLQLSAE
Sbjct: 303 RHQRSRWDENGLDETICSMKMKENIGDASRADMYTPEAGRLSTTNSHRFPILRWLQLSAE 362

Query: 362 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAI 421
           RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVV+ +GQTVFDGELQQRQVLVVPQNFA+
Sbjct: 363 RGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVNCRGQTVFDGELQQRQVLVVPQNFAV 422

Query: 422 VKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 481
           +KKASE+GFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE
Sbjct: 423 LKKASEDGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREE 482

Query: 482 TTLLPPSMAS 490
           TTL+PP M+S
Sbjct: 483 TTLIPPRMSS 492

BLAST of HG10012034 vs. TAIR 10
Match: AT5G44120.3 (RmlC-like cupins superfamily protein )

HSP 1 Score: 402.9 bits (1034), Expect = 3.8e-112
Identity = 217/497 (43.66%), Positives = 307/497 (61.77%), Query Frame = 0

Query: 6   LLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDAEGGVI 65
           LLS  L  L+LF+   A         +   +GQ    EC+LD+L ALEPSH + +E G I
Sbjct: 7   LLSFCLTLLILFHGYAA---------QQGQQGQQFPNECQLDQLNALEPSHVLKSEAGRI 66

Query: 66  EMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYE 125
           E+WD      RC+GV+  RYII+  GL LP + N  +L ++ +GRG  G V+PGC ET++
Sbjct: 67  EVWDHHAPQLRCSGVSFARYIIESKGLYLPSFFNTAKLSFVAKGRGLMGKVIPGCAETFQ 126

Query: 126 ES---------QQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDV 185
           +S         Q  +  FRD HQK+ H+R+GD  A   G A W YNDG E L+ V + D+
Sbjct: 127 DSSEFQPRFEGQGQSQRFRDMHQKVEHIRSGDTIATTPGVAQWFYNDGQEPLVIVSVFDL 186

Query: 186 SNHANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVL 245
           ++H NQLD +PR FYLAGN               QG+   +      + NIF  F   V+
Sbjct: 187 ASHQNQLDRNPRPFYLAGN-------------NPQGQVWLQGREQQPQKNIFNGFGPEVI 246

Query: 246 AEILNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMER 305
           A+ L I+ +TA++L+ +DD R NI++V+G   VIRPP      RG+      ++EEE E 
Sbjct: 247 AQALKIDLQTAQQLQNQDDNRGNIVRVQGPFGVIRPPL-----RGQ----RPQEEEEEEG 306

Query: 306 QHERHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQL 365
           +H RH       NGL ETICS R  +N+ D SRAD++ P+ G ++T NS+  PILR+++L
Sbjct: 307 RHGRH------GNGLEETICSARCTDNLDDPSRADVYKPQLGYISTLNSYDLPILRFIRL 366

Query: 366 SAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQN 425
           SA RG + +NAM +P WN NA+++++VT G A++Q+V+  G  VFDG++ Q Q++ VPQ 
Sbjct: 367 SALRGSIRQNAMVLPQWNANANAILYVTDGEAQIQIVNDNGNRVFDGQVSQGQLIAVPQG 426

Query: 426 FAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFN 485
           F++VK+A+   F+WV FKTN NA INTLAGRTS +R  P++VI + +++S EEARR+KFN
Sbjct: 427 FSVVKRATSNRFQWVEFKTNANAQINTLAGRTSVLRGLPLEVITNGFQISPEEARRVKFN 466

Query: 486 REETTLLPPSMASSARR 494
             ETTL   S  +S  R
Sbjct: 487 TLETTLTHSSGPASYGR 466

BLAST of HG10012034 vs. TAIR 10
Match: AT1G03880.1 (cruciferin 2 )

HSP 1 Score: 392.9 bits (1008), Expect = 3.9e-109
Identity = 215/495 (43.43%), Positives = 295/495 (59.60%), Query Frame = 0

Query: 6   LLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDAEGGVI 65
           ++S SL  L+LFN   A               Q    EC+LD+L ALEPS  I +EGG I
Sbjct: 7   IISFSLTLLILFNGYTA---------------QQWPNECQLDQLNALEPSQIIKSEGGRI 66

Query: 66  EMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYE 125
           E+WD      RC+G A +R++I+P GL LP + NA +L ++  GRG  G V+PGC ET+ 
Sbjct: 67  EVWDHHAPQLRCSGFAFERFVIEPQGLFLPTFLNAGKLTFVVHGRGLMGRVIPGCAETFM 126

Query: 126 ES--------QQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVS 185
           ES        Q  +  FRD HQK+ H+R GD  A P+G A W YN+GNE LI V   D++
Sbjct: 127 ESPVFGEGQGQGQSQGFRDMHQKVEHLRCGDTIATPSGVAQWFYNNGNEPLILVAAADLA 186

Query: 186 NHANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLA 245
           ++ NQLD + R F +AGN   +  EW    K+++            +NNIF  F   +LA
Sbjct: 187 SNQNQLDRNLRPFLIAGN-NPQGQEWLQGRKQQK------------QNNIFNGFAPEILA 246

Query: 246 EILNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQ 305
           +   IN ETA++L+ + D R NI+KV G   VIRPP  RG G               ++ 
Sbjct: 247 QAFKINVETAQQLQNQQDNRGNIVKVNGPFGVIRPPLRRGEGG--------------QQP 306

Query: 306 HERHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLS 365
           HE         NGL ET+C+MR  EN+ D S AD++ P  G ++T NS+  PILR L+LS
Sbjct: 307 HE-------IANGLEETLCTMRCTENLDDPSDADVYKPSLGYISTLNSYNLPILRLLRLS 366

Query: 366 AERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNF 425
           A RG + +NAM +P WN NA++ ++VT G+A +Q+V+  G+ VFD E+   Q+LVVPQ F
Sbjct: 367 ALRGSIRKNAMVLPQWNVNANAALYVTNGKAHIQMVNDNGERVFDQEISSGQLLVVPQGF 426

Query: 426 AIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNR 485
           +++K A  E FEW+ FKTN+NA +NTLAGRTS MR  P++VI + Y++S EEA+R+KF+ 
Sbjct: 427 SVMKHAIGEQFEWIEFKTNENAQVNTLAGRTSVMRGLPLEVITNGYQISPEEAKRVKFST 452

Query: 486 EETTLLPPSMASSAR 493
            ETTL   S  S  R
Sbjct: 487 IETTLTHSSPMSYGR 452

BLAST of HG10012034 vs. TAIR 10
Match: AT4G28520.1 (cruciferin 3 )

HSP 1 Score: 367.9 bits (943), Expect = 1.3e-101
Identity = 205/539 (38.03%), Positives = 297/539 (55.10%), Query Frame = 0

Query: 6   LLSLSLCFLVLFNACLAINENLHDVSRHFSEGQSRYRECRLDRLEALEPSHRIDAEGGVI 65
           LL  +   L++ N CLA         +          EC LD L+ L+ +  I +E G I
Sbjct: 7   LLVATFGVLLVLNGCLA--------RQSLGVPPQLQNECNLDNLDVLQATETIKSEAGQI 66

Query: 66  EMWDPSHEMFRCAGVAIQRYIIDPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYE 125
           E WD +H   RC GV++ RY+I+  GL LP +  +P++ Y+ +G G  G V+PGC ET+ 
Sbjct: 67  EYWDHNHPQLRCVGVSVARYVIEQGGLYLPTFFTSPKISYVVQGTGISGRVVPGCAETFM 126

Query: 126 ESQQSAGE---------------------------------------------------- 185
           +SQ   G+                                                    
Sbjct: 127 DSQPMQGQQQGQPWQGRQGQQGQPWEGQGQQGQQGRQGQPWEGQGQQGQQGRQGQQGQPW 186

Query: 186 ----------FRDRHQKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHANQL 245
                     FRD HQK+ HVR GD+FA   GSAHW YN G + L+ + LLD++N+ NQL
Sbjct: 187 EGQGQQGQQGFRDMHQKVEHVRRGDVFANTPGSAHWIYNSGEQPLVIIALLDIANYQNQL 246

Query: 246 DFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEILNIN 305
           D +PR F+LAGN              +QG  G  ++    K N++  FD +V+A+ L I+
Sbjct: 247 DRNPRVFHLAGN-------------NQQGGFGGSQQQQEQK-NLWSGFDAQVIAQALKID 306

Query: 306 TETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHERHQR 365
            + A++L+ + D R NI++V+G  +V+RPP  +     E  EW             RH R
Sbjct: 307 VQLAQQLQNQQDSRGNIVRVKGPFQVVRPPLRQ---PYESEEW-------------RHPR 366

Query: 366 RRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAERGVL 425
                NGL ETICSMR  ENI D +RAD++ P  GR+ + NS+  PIL +++LSA RGVL
Sbjct: 367 SP-QGNGLEETICSMRSHENIDDPARADVYKPSLGRVTSVNSYTLPILEYVRLSATRGVL 426

Query: 426 YRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIVKKA 483
             NAM +P +N NA+ +++ T G+ R+QVV+  GQ V D ++Q+ Q++V+PQ FA V ++
Sbjct: 427 QGNAMVLPKYNMNANEILYCTGGQGRIQVVNDNGQNVLDQQVQKGQLVVIPQGFAYVVQS 486

BLAST of HG10012034 vs. TAIR 10
Match: AT1G03890.1 (RmlC-like cupins superfamily protein )

HSP 1 Score: 351.3 bits (900), Expect = 1.3e-96
Identity = 191/475 (40.21%), Positives = 273/475 (57.47%), Query Frame = 0

Query: 34  FSEGQSRYRE------CRLDRLEALEPSHRIDAEGGVIEMWDPSHEMFRCAGVAIQRYII 93
           F   ++R RE      C   ++ +L P+     E G +E+WD      RCAGV + R  +
Sbjct: 20  FHGAEARQREAPFPNACHFSQINSLAPAQATKFEAGQMEVWDHMSPELRCAGVTVARITL 79

Query: 94  DPNGLLLPQYTNAPRLIYIERGRGFEGVVLPGCPETYEESQQSAG---------EFRDRH 153
            PN + LP + + P L Y+ +G G  G +  GCPET+ E + S+G          F D H
Sbjct: 80  QPNSIFLPAFFSPPALAYVVQGEGVMGTIASGCPETFAEVEGSSGRGGGGDPGRRFEDMH 139

Query: 154 QKIRHVRAGDLFAVPAGSAHWTYNDGNERLIAVVLLDVSNHANQLDFHPRAFYLAGN--P 213
           QK+ + R GD+FA  AG + W YN G+   + V++LDV+N  NQLD  PR F LAG+   
Sbjct: 140 QKLENFRRGDVFASLAGVSQWWYNRGDSDAVIVIVLDVTNRENQLDQVPRMFQLAGSRTQ 199

Query: 214 EEEFP-EWRSDWKREQGRHGSRKEGSSNKNNIFYAFDDRVLAEILNINTETARKLRGEDD 273
           EEE P  W S                   NN F  FD  ++AE   IN ETA++L+ + D
Sbjct: 200 EEEQPLTWPSG------------------NNAFSGFDPNIIAEAFKINIETAKQLQNQKD 259

Query: 274 FRRNIIKVEGQLEVIRPPRSRGGGRGEEREWEEEQEEEMERQHERHQRRRWTDNGLGETI 333
            R NII+  G L  + PP          REW+++                   NG+ ET 
Sbjct: 260 NRGNIIRANGPLHFVIPP---------PREWQQD----------------GIANGIEETY 319

Query: 334 CSMRMKENIGDASRADIHTPEAGRLATTNSHRFPILRWLQLSAERGVLYRNAMYVPHWNQ 393
           C+ ++ ENI D  R+D  +  AGR++T NS   P+LR ++L+A RG LY   M +P W  
Sbjct: 320 CTAKIHENIDDPERSDHFSTRAGRISTLNSLNLPVLRLVRLNALRGYLYSGGMVLPQWTA 379

Query: 394 NAHSVIFVTRGRARVQVVDHKGQTVFDGELQQRQVLVVPQNFAIVKKASEEGFEWVSFKT 453
           NAH+V++VT G+A++QVVD  GQ+VF+ ++ Q Q++V+PQ FA+ K A E GFEW+SFKT
Sbjct: 380 NAHTVLYVTGGQAKIQVVDDNGQSVFNEQVGQGQIIVIPQGFAVSKTAGETGFEWISFKT 439

Query: 454 NDNAMINTLAGRTSAMRAFPVQVIASAYRMSTEEARRLKFNREETTL-LPPSMAS 490
           NDNA INTL+G+TS +RA PV VI ++Y ++ EEA+R+KF+++ET L + PS +S
Sbjct: 440 NDNAYINTLSGQTSYLRAVPVDVIKASYGVNEEEAKRIKFSQQETMLSMTPSSSS 451

BLAST of HG10012034 vs. TAIR 10
Match: AT5G44120.2 (RmlC-like cupins superfamily protein )

HSP 1 Score: 323.6 bits (828), Expect = 2.9e-88
Identity = 173/389 (44.47%), Positives = 246/389 (63.24%), Query Frame = 0

Query: 114 GVVLPGCPETYEES---------QQSAGEFRDRHQKIRHVRAGDLFAVPAGSAHWTYNDG 173
           G V+PGC ET+++S         Q  +  FRD HQK+ H+R+GD  A   G A W YNDG
Sbjct: 2   GKVIPGCAETFQDSSEFQPRFEGQGQSQRFRDMHQKVEHIRSGDTIATTPGVAQWFYNDG 61

Query: 174 NERLIAVVLLDVSNHANQLDFHPRAFYLAGNPEEEFPEWRSDWKREQGRHGSRKEGSSNK 233
            E L+ V + D+++H NQLD +PR FYLAGN               QG+   +      +
Sbjct: 62  QEPLVIVSVFDLASHQNQLDRNPRPFYLAGN-------------NPQGQVWLQGREQQPQ 121

Query: 234 NNIFYAFDDRVLAEILNINTETARKLRGEDDFRRNIIKVEGQLEVIRPPRSRGGGRGEER 293
            NIF  F   V+A+ L I+ +TA++L+ +DD R NI++V+G   VIRPP      RG+  
Sbjct: 122 KNIFNGFGPEVIAQALKIDLQTAQQLQNQDDNRGNIVRVQGPFGVIRPPL-----RGQ-- 181

Query: 294 EWEEEQEEEMERQHERHQRRRWTDNGLGETICSMRMKENIGDASRADIHTPEAGRLATTN 353
               ++EEE E +H RH       NGL ETICS R  +N+ D SRAD++ P+ G ++T N
Sbjct: 182 --RPQEEEEEEGRHGRH------GNGLEETICSARCTDNLDDPSRADVYKPQLGYISTLN 241

Query: 354 SHRFPILRWLQLSAERGVLYRNAMYVPHWNQNAHSVIFVTRGRARVQVVDHKGQTVFDGE 413
           S+  PILR+++LSA RG + +NAM +P WN NA+++++VT G A++Q+V+  G  VFDG+
Sbjct: 242 SYDLPILRFIRLSALRGSIRQNAMVLPQWNANANAILYVTDGEAQIQIVNDNGNRVFDGQ 301

Query: 414 LQQRQVLVVPQNFAIVKKASEEGFEWVSFKTNDNAMINTLAGRTSAMRAFPVQVIASAYR 473
           + Q Q++ VPQ F++VK+A+   F+WV FKTN NA INTLAGRTS +R  P++VI + ++
Sbjct: 302 VSQGQLIAVPQGFSVVKRATSNRFQWVEFKTNANAQINTLAGRTSVLRGLPLEVITNGFQ 361

Query: 474 MSTEEARRLKFNREETTLLPPSMASSARR 494
           +S EEARR+KFN  ETTL   S  +S  R
Sbjct: 362 ISPEEARRVKFNTLETTLTHSSGPASYGR 362

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038888918.1 | 6.0e-261 | 90.69 | 11S globulin-like [Benincasa hispida]
XP_008447425.1 | 7.8e-261 | 90.34 | PREDICTED: 11S globulin subunit beta-like [Cucumis melo] >KAA0038056.1 11S globu...
XP_011651441.2 | 6.2e-258 | 89.36 | 11S globulin [Cucumis sativus] >KAE8650657.1 hypothetical protein Csa_009929 [Cu...
XP_008447426.1 | 6.4e-255 | 90.20 | PREDICTED: 11S globulin subunit beta-like [Cucumis melo] >KAA0038058.1 11S globu...
XP_004152049.1 | 5.4e-254 | 89.39 | 11S globulin [Cucumis sativus] >KGN57907.1 hypothetical protein Csa_009548 [Cucu...
Match Name | E-value | Identity | Description
A0A1L6K371 | 4.9e-157 | 56.70 | 11S globulin OS=Juglans nigra OX=16719 PE=1 SV=1
Q2TPW5 | 7.1e-156 | 56.42 | 11S globulin seed storage protein Jug r 4 OS=Juglans regia OX=51240 PE=1 SV=1
B5KVH4 | 1.9e-153 | 55.97 | 11S globulin seed storage protein 1 OS=Carya illinoinensis OX=32201 PE=1 SV=1
Q8GZP6 | 2.1e-136 | 52.28 | 11S globulin seed storage protein Ana o 2.0101 (Fragment) OS=Anacardium occident...
E3SH28 | 1.8e-127 | 44.80 | Prunin 1 Pru du 6.0101 OS=Prunus dulcis OX=3755 PE=1 SV=1
Match Name | E-value | Identity | Description
A0A5A7T783 | 3.8e-261 | 90.34 | 11S globulin subunit beta-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S3BGV4 | 3.8e-261 | 90.34 | 11S globulin subunit beta-like OS=Cucumis melo OX=3656 GN=LOC103489872 PE=3 SV=1
A0A0A0L7E7 | 3.3e-257 | 88.96 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G384800 PE=3 SV=1
A0A5A7T5D7 | 3.1e-255 | 90.20 | 11S globulin subunit beta-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A1S3BHF6 | 3.1e-255 | 90.20 | 11S globulin subunit beta-like OS=Cucumis melo OX=3656 GN=LOC103489873 PE=3 SV=1
Match Name | E-value | Identity | Description
AT5G44120.3 | 3.8e-112 | 43.66 | RmlC-like cupins superfamily protein
AT1G03880.1 | 3.9e-109 | 43.43 | cruciferin 2
AT4G28520.1 | 1.3e-101 | 38.03 | cruciferin 3
AT1G03890.1 | 1.3e-96 | 40.21 | RmlC-like cupins superfamily protein
AT5G44120.2 | 2.9e-88 | 44.47 | RmlC-like cupins superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
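IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 284..304
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 271..305
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 291..305
None | No IPR available | PANTHER | PTHR31189 | OS03G0336100 PROTEIN-RELATED | coord: 8..492
None | No IPR available | PANTHER | PTHR31189:SF35 | 12S SEED STORAGE PROTEIN CRC | coord: 8..492
None | No IPR available | CDD | cd02242 | cupin_11S_legumin_N | coord: 46..265; e-value: 1.06531E-110; score: 324.924
None | No IPR available | CDD | cd02243 | cupin_11S_legumin_C | coord: 331..484; e-value: 8.9957E-92; score: 274.736
IPR006044 | 11-S seed storage protein, plant | PRINTS | PR00439 | 11SGLOBULIN | coord: 390..406; score: 57.09
IPR006044 | 11-S seed storage protein, plant | PRINTS | PR00439 | 11SGLOBULIN | coord: 426..444; score: 54.56
IPR006044 | 11-S seed storage protein, plant | PRINTS | PR00439 | 11SGLOBULIN | coord: 344..364; score: 58.23
IPR006044 | 11-S seed storage protein, plant | PRINTS | PR00439 | 11SGLOBULIN | coord: 408..423; score: 54.79
IPR006044 | 11-S seed storage protein, plant | PRINTS | PR00439 | 11SGLOBULIN | coord: 321..338; score: 49.18
IPR006044 | 11-S seed storage protein, plant | PRINTS | PR00439 | 11SGLOBULIN | coord: 448..465; score: 43.4
IPR006045 | Cupin 1 | SMART | SM00835 | Cupin_1_3 | coord: 322..471; e-value: 2.0E-57; score: 206.7
IPR006045 | Cupin 1 | SMART | SM00835 | Cupin_1_3 | coord: 49..248; e-value: 1.8E-29; score: 113.9
IPR006045 | Cupin 1 | PFAM | PF00190 | Cupin_1 | coord: 50..197; e-value: 3.2E-32; score: 111.2
IPR006045 | Cupin 1 | PFAM | PF00190 | Cupin_1 | coord: 325..470; e-value: 2.1E-35; score: 121.5
IPR014710 | RmlC-like jelly roll fold | GENE3D | 2.60.120.10 | Jelly Rolls | coord: 32..324; e-value: 1.2E-109; score: 367.9
IPR014710 | RmlC-like jelly roll fold | GENE3D | 2.60.120.10 | Jelly Rolls | coord: 325..493; e-value: 6.3E-77; score: 258.6
IPR011051 | RmlC-like cupin domain superfamily | SUPERFAMILY | 51182 | RmlC-like cupins | coord: 9..480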
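
The `coord` values in the InterPro table are 1-based, inclusive residue ranges on the predicted protein. The sketch below, offered under that assumption, computes the length of a few of the listed matches and tests whether two ranges overlap. The helper names `span_length` and `overlaps` and the `matches` dictionary keys are illustrative only; the coordinates are copied from the table.

```python
def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# A few matches taken from the InterPro table (source term -> coordinates).
matches = {
    "PF00190 Cupin_1 (N-terminal)": (50, 197),
    "PF00190 Cupin_1 (C-terminal)": (325, 470),
    "COILS Coil": (284, 304),
    "mobidb-lite disorder": (271, 305),
}

for name, (start, end) in matches.items():
    print(f"{name}: {start}..{end} ({span_length(start, end)} residues)")

# The coiled-coil prediction lies inside the predicted disordered region,
# between the two cupin domains.
print(overlaps(matches["COILS Coil"], matches["mobidb-lite disorder"]))
```

The two Pfam Cupin_1 matches do not overlap, which fits the two-domain (N- and C-terminal cupin) layout reported by the CDD and Gene3D rows.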

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10012034.1 | HG10012034.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010431 | seed maturation
cellular_component | GO:0043245 | extraorganismal space
molecular_function | GO:0019863 | IgE binding
molecular_function | GO:0045735 | nutrient reservoir activity