HG10003599 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10003599
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: keratin, type II cytoskeletal 3 isoform X1
Location: Chr08: 3921793 .. 3924682 (-)
RNA-Seq Expression: HG10003599
Synteny: HG10003599
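
The Location field can be read programmatically. The following is a minimal sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Parse a string like 'Chr08: 3921793 .. 3924682 (-)'.

    Returns (seqid, start, end, strand) with start and end as integers.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1


seqid, start, end, strand = parse_location("Chr08: 3921793 .. 3924682 (-)")
print(seqid, strand, span_length(start, end))  # Chr08 - 2890
```

Under that assumption the gene spans 2890 bp on the minus strand of Chr08.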
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACGGAAACAGACACCAAGGCGGAGAAGATGAGAACGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGGCGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTTGTCGGAGGTCAATTCTTCATCATTTCTCTCTTTCCCTTTTTGTTTCACTCATAACTAACTTGTTATTCCCCTGTTGGATTCCTTTGGTTTTTCTATTTCAATTATGTTGCTTAAAATCAATTTTCTATGTGGATTGTTCCACTTTACAACTTGCTGATTCCTTTAATCGGAGGTATTCTTGTTGAATTGGAGGGTGAGGTAGGTTGGGTGGGGTTATTCAGTCTGATTATCGTTTTAATTTGTATTTGTGCCTTCTACTGTCTTTGACTCTTATCCTTCACTATTTCATGTTTCTTGCATTCTGGTTGAGAGTGTTTATGGATTTGCTGCTGAAATTCTGTAGCCTTATGACATTTTCCCCAAGTTCAGGAAGCGTTTGACTTTTTTTTGAAGCACAAAGATGGAGTAGACGCAATACAAGATCACAGCATTTCTATTTTTTTTCAAAAAGTAATGATAAAAGATACTATGGTTAAAGCAAATATGTTAAATTTTAGACTTGAGAAAGGGAATGCTATTTTCTAGGGTGGTCTGATGAGCATTCAAAGGATTAAAGTGAACCACCAAGCAAAATACCTAATTTTCCTACAAAATAGCTCAAATATGAATCCCCAAATAGGCCCAGATGTAGTTCATTCAGGTGCTTCTTCTTTTTCTTTTAGTCTATCTGCAGTGTTCATCTTTAACATATTTTTAAATTTTATTTTTGTCCTTTGGGAAGTGCAATGTATTTTTTATCTTCCCAGTGGAGGGAGGATGGTATGAGTGGTATACTTCTGGATACTGTTGACTATTTCATGCTTCATTTTTGTTATTCAACAATTTAGACATGCCTTTTGATGGATATATTCGAGAAGAATAGTAAATCTGTATTAGTGGTCGTGAAATTTATGGACGCAATTTCTTCAAGAAGTTTATTGGGATGTTAAGTGTCATTCGTATGCGATTATGTTTTGTTTGTATAATTGTAATCAATATGATAAGCTTTTATCATTTTCACGAGTTTATCATCCAACACCTTTTTTCTTTGTCTTTTCATTTCTTCTTTCCTGCTAATATATATATTCGGTTTTGTTAACTTTGGAGTCTACAAGCGTCACTTGGCTGCATGTTTTAAGTTCCATCAATCTGTTTAGAAGGATTTTAAGGTGCCAAAAGCTTGAAATGTACTAGGCTTTAAAAATTGGACTTCAAAAAATTGGACTTCACAAAGTACTTCCATGTTGATGAGGAATAAATTCTCTCTATCCTTTGTGATAGTATGGTTTGTAGTCTTTAAAAACTGCCATGGATTTTCTTTTTGATAGTAGACATTATATGGATGACAAAAGAGAGAACAAAGGGAACTGGGAAGATAAAACATTCTTTGCAATTCCCACATGCACACTTATGCACAAACATAGAAGAAGAAGAAGCTATCATGGATTTTGGGTTTTGGTAGAAGGTAGGGCTTGAGTGTCAACTAATCAATATTATCATTCTGTAAATTTTATTTTGAACTTTATACTGTGGTGAACTATCCTTGTTTAGTTTAATTGAAACTACTCAAATATGAATTTGCAAAAACTGTTCTCCACTCTAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAGTTGGCTTAGGATTTGGATATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTTGGAGATCTATTAAATGGTCATCAAAGTATTTTTCCTCAGTAAGTTCCAAACCAACTGTTTTCAACTTGTGACAATTATGTTCTGTTAGTGATGCATTAGGGATAATTAACTTTCTTGTGGCTTTGGTGACCTGGATT
TTTCCTAAGCATGTATATACTAAGATGGCAATTTTTTGTGACGGAAGATTCAAAATAGTCCCATTCTTTATCTTTTTTCACTCCTCTCTCTTCCCACGCTGTCCCATTTCTCTCTCTTTCTGCTTAGTTTTACCCTTCATGCTCTCTCTCCTTCCCTATTTTTCTCAATCTTATGAGTACCTTCCCTCTCTATATTGTGATTTACTTTCAGTTTCCACATCTACCCTTCTAACAGGACTTGTTACATTAAAAATATCCGTGGTCTTTCCATCTTGTCATTTCAACTCAGCTCATCTTCTTTCGATAACCTAGAACAATTGGGGTCAGCTTAGATGTTGATCTTGAAAAATTGCCAATTGTTTTTAAAATATTCATGTTCTTTCAAAATTTGCAATATTACCCTTAGTCTTTTAAAAATGTTTTTTCCACTCAAATTACTAGCATAACTCTTTCTTGTTCCCATTTGACTAGGGGATGCTATCTTTGTCCACCTCAGTTGTCTGGGTGAAATCATTGAACCTAACTGTTAATTGTCCAAATCGTTAAACCACCCTATGTTGTACCATCTTGATTACATTTTTAGGACTGTCAGAATAATAATTCCTTTTCAAATCATCTTGTAAGTTGCATTGCATAGGATTAAAATAATGTTTTTTCTTAAGCTGTAATAACTGGAATCTAAACAGTGAAGTTAGAAATTGATTGATGACATTTCCACGATTTTCAAATACTAGAACTTCAATATTCTGATATCAAGAAATATTTCATGATTTTCATTTTTACTTGTCATCAGTAGGGACGATATTGGCACGCTTGTTGACGAGGTTGTCCTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA

mRNA sequence

ATGAACGGAAACAGACACCAAGGCGGAGAAGATGAGAACGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGGCGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTTGTCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAGTTGGCTTAGGATTTGGATATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTTGGAGATCTATTAAATGGTCATCAAAGTATTTTTCCTCAGGACGATATTGGCACGCTTGTTGACGAGGTTGTCCTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA

Coding sequence (CDS)

ATGAACGGAAACAGACACCAAGGCGGAGAAGATGAGAACGGATTACTCTGGAATCTTCCAGTTCTTAAATCTTCCAGAATCGGAAAGTTAGGTCCCGCCTTCGGTCTGGGCGTTGGTTGTGGCGTCGGCTTTGGCATCGGCCTTGTCGGAGGTGCTGGATTTGGTCCAGGAATTCCTGGCTTACAACTTGGCTTTGGTCTTGGTGCTGGATGTGGAGTTGGCTTAGGATTTGGATATGGTGTTGGCAGGGGCATTGCTCAAGATGACAAACGGAGATACTCTAACGTTGGAGATCTATTAAATGGTCATCAAAGTATTTTTCCTCAGGACGATATTGGCACGCTTGTTGACGAGGTTGTCCTAAATACAAAGAGGCTTATACGAGCTACTTCAAGGGAGATTGACAAGTGGAAAAGATGA

Protein sequence

MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVVLNTKRLIRATSREIDKWKR
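
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under the assumption that the standard nuclear genetic code applies (the page does not state which table was used); `translate` and `CODON_TABLE` are illustrative names, and only the first 30 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above.
print(translate("ATGAACGGAAACAGACACCAAGGCGGAGAA"))  # MNGNRHQGGE
```

The output matches the first ten residues of the protein sequence; passing the full CDS string to `translate` gives the complete comparison.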
Homology
BLAST of HG10003599 vs. NCBI nr
Match: XP_038889093.1 (ctenidin-1 isoform X1 [Benincasa hispida])

HSP 1 Score: 258.1 bits (658), Expect = 4.4e-65
Identity = 131/139 (94.24%), Positives = 134/139 (96.40%), Query Frame = 0

Query: 2   NGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGL 61
           NGNRHQGGEDE GLLWNLPVLKSSR GKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGL
Sbjct: 6   NGNRHQGGEDEGGLLWNLPVLKSSRFGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGL 65

Query: 62  QLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQ-DDIGTLVDEVV 121
           QLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLL+GHQSIFPQ DDI  LVDE+V
Sbjct: 66  QLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLHGHQSIFPQKDDIVALVDELV 125

Query: 122 LNTKRLIRATSREIDKWKR 140
           LN+KRLIRATSREIDKWKR
Sbjct: 126 LNSKRLIRATSREIDKWKR 144
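
The identity and positives percentages in an HSP summary line are the ratios of matching positions to alignment length. The sketch below recomputes them from the line above; the helper name `parse_hsp_stats` is hypothetical, and the pattern is written to accept either spelling of "Positives" that may appear in this kind of report.

```python
import re

# Matches e.g. "Identity = 131/139 (94.24%), Positives = 134/139 (96.40%)".
# "Posi?tives" also tolerates the misspelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return recomputed identity and positives percentages for one HSP line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


line = "Identity = 131/139 (94.24%), Positives = 134/139 (96.40%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For this hit the recomputed values, 94.24 and 96.4, agree with the percentages reported in the summary line.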

BLAST of HG10003599 vs. NCBI nr
Match: KAG6592270.1 (hypothetical protein SDJN03_14616, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 258.1 bits (658), Expect = 4.4e-65
Identity = 125/139 (89.93%), Positives = 134/139 (96.40%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 1   MNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVV 120
           LQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLLNGHQSIFPQD+IG LVDE+ 
Sbjct: 61  LQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGHQSIFPQDEIGALVDELA 120

Query: 121 LNTKRLIRATSREIDKWKR 140
           LNTK+LIR T+REIDKWKR
Sbjct: 121 LNTKKLIRVTAREIDKWKR 139

BLAST of HG10003599 vs. NCBI nr
Match: XP_023521209.1 (fibroin heavy chain isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 255.0 bits (650), Expect = 3.8e-64
Identity = 124/139 (89.21%), Positives = 133/139 (95.68%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNG+RH+G EDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGSRHEGREDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVV 120
           LQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLLNGHQSIFPQD+IG LVDE+ 
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGHQSIFPQDEIGALVDELA 124

Query: 121 LNTKRLIRATSREIDKWKR 140
           LNTK+LIR T+REIDKWKR
Sbjct: 125 LNTKKLIRVTAREIDKWKR 143

BLAST of HG10003599 vs. NCBI nr
Match: XP_016902400.1 (PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo] >XP_016902401.1 PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo])

HSP 1 Score: 253.8 bits (647), Expect = 8.4e-64
Identity = 128/140 (91.43%), Positives = 133/140 (95.00%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNGNR++ GEDE GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGNRNRDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFP-QDDIGTLVDEV 120
           LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGD+L GHQSIFP QDDIG LVD++
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGHQSIFPHQDDIGALVDDL 124

Query: 121 VLNTKRLIRATSREIDKWKR 140
           VLNTKRLIRATSREIDKWKR
Sbjct: 125 VLNTKRLIRATSREIDKWKR 144

BLAST of HG10003599 vs. NCBI nr
Match: XP_022932820.1 (keratin, type II cytoskeletal 2 epidermal isoform X2 [Cucurbita moschata])

HSP 1 Score: 253.4 bits (646), Expect = 1.1e-63
Identity = 123/139 (88.49%), Positives = 133/139 (95.68%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVV 120
           LQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLLNG QSIFPQD+IG LVDE+ 
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPQDEIGALVDELA 124

Query: 121 LNTKRLIRATSREIDKWKR 140
           LNTK+LIR T++EIDKWKR
Sbjct: 125 LNTKKLIRVTAQEIDKWKR 143

BLAST of HG10003599 vs. ExPASy TrEMBL
Match: A0A1S4E347 (glycine-rich cell wall structural protein 1 OS=Cucumis melo OX=3656 GN=LOC103498610 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 4.1e-64
Identity = 128/140 (91.43%), Positives = 133/140 (95.00%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNGNR++ GEDE GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGNRNRDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFP-QDDIGTLVDEV 120
           LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGD+L GHQSIFP QDDIG LVD++
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGHQSIFPHQDDIGALVDDL 124

Query: 121 VLNTKRLIRATSREIDKWKR 140
           VLNTKRLIRATSREIDKWKR
Sbjct: 125 VLNTKRLIRATSREIDKWKR 144

BLAST of HG10003599 vs. ExPASy TrEMBL
Match: A0A6J1F360 (keratin, type II cytoskeletal 2 epidermal isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439279 PE=4 SV=1)

HSP 1 Score: 253.4 bits (646), Expect = 5.3e-64
Identity = 123/139 (88.49%), Positives = 133/139 (95.68%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVV 120
           LQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLLNG QSIFPQD+IG LVDE+ 
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPQDEIGALVDELA 124

Query: 121 LNTKRLIRATSREIDKWKR 140
           LNTK+LIR T++EIDKWKR
Sbjct: 125 LNTKKLIRVTAQEIDKWKR 143

BLAST of HG10003599 vs. ExPASy TrEMBL
Match: A0A0A0KSM8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G646690 PE=4 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 2.6e-63
Identity = 127/140 (90.71%), Positives = 132/140 (94.29%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNGNR+Q GEDE GLLWNLPVLKSSR G LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGNRNQDGEDERGLLWNLPVLKSSRFGNLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFP-QDDIGTLVDEV 120
           LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGD+L G QSIFP QDDIG LVD++
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDVLRGRQSIFPHQDDIGALVDDL 124

Query: 121 VLNTKRLIRATSREIDKWKR 140
           VLNTKRLIRATS+EIDKWKR
Sbjct: 125 VLNTKRLIRATSKEIDKWKR 144

BLAST of HG10003599 vs. ExPASy TrEMBL
Match: A0A6J1IA95 (keratin, type II cytoskeletal 3 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472645 PE=4 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 2.2e-62
Identity = 120/139 (86.33%), Positives = 132/139 (94.96%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNG+R +GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGSRREGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVV 120
           LQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLLNG QSIFPQD+IG +VDE+ 
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPQDEIGAVVDELA 124

Query: 121 LNTKRLIRATSREIDKWKR 140
           LNTK+LI+ T++EIDKWKR
Sbjct: 125 LNTKKLIQVTAKEIDKWKR 143

BLAST of HG10003599 vs. ExPASy TrEMBL
Match: A0A6J1EY31 (keratin, type II cytoskeletal 2 epidermal isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439279 PE=4 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 3.8e-62
Identity = 122/140 (87.14%), Positives = 133/140 (95.00%), Query Frame = 0

Query: 1   MNGNRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 60
           MNG+RH+GGEDE+G+LW LPVLKSSRIG+LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG
Sbjct: 5   MNGSRHEGGEDESGMLWKLPVLKSSRIGRLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPG 64

Query: 61  LQLGFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFP-QDDIGTLVDEV 120
           LQLGFGLGAGCGVGLGFGYG GRGIAQDDKRRYSNVGDLLNG QSIFP +D+IG LVDE+
Sbjct: 65  LQLGFGLGAGCGVGLGFGYGAGRGIAQDDKRRYSNVGDLLNGRQSIFPHRDEIGALVDEL 124

Query: 121 VLNTKRLIRATSREIDKWKR 140
            LNTK+LIR T++EIDKWKR
Sbjct: 125 ALNTKKLIRVTAQEIDKWKR 144

BLAST of HG10003599 vs. TAIR 10
Match: AT4G10330.1 (glycine-rich protein )

HSP 1 Score: 161.0 bits (406), Expect = 6.9e-40
Identity = 78/134 (58.21%), Positives = 100/134 (74.63%), Query Frame = 0

Query: 4   NRHQGGEDENGLLWNLPVLKSSRIGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQL 63
           NR + G+D+ GLLW LP ++   IGK+GPAFGLGVGCG GFG GL+GG GFGPG+PGLQ 
Sbjct: 3   NRRRTGDDDKGLLWKLPQVRIRDIGKVGPAFGLGVGCGFGFGAGLIGGVGFGPGVPGLQF 62

Query: 64  GFGLGAGCGVGLGFGYGVGRGIAQDDKRRYSNVGDLLNGHQSIFPQDDIGTLVDEVVLNT 123
           G G GAGCG+G+GFGYGVGRG A D  R Y NVG            +++ +L+DE+V++T
Sbjct: 63  GLGFGAGCGIGVGFGYGVGRGAAYDHSRSYYNVGKP--------SLNEVDSLIDELVVST 122

Query: 124 KRLIRATSREIDKW 138
           K+L++AT+ EIDKW
Sbjct: 123 KKLVKATTNEIDKW 128

BLAST of HG10003599 vs. TAIR 10
Match: AT1G66820.1 (glycine-rich protein )

HSP 1 Score: 55.1 bits (131), Expect = 5.3e-08
Identity = 36/70 (51.43%), Positives = 43/70 (61.43%), Query Frame = 0

Query: 30  LGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQ-----LGFGLGAGCGVGLGFGYGVGRG 89
           +GP  G G+GCG G GIGL GG G G    GL      LGFG+G G G G G+G+GVG G
Sbjct: 39  VGPGIGGGIGCGAGIGIGLSGGLGIGAS-EGLDHSNVVLGFGIGCGIGFGFGYGFGVGGG 98

Query: 90  IAQDD-KRRY 94
            + DD K R+
Sbjct: 99  YSFDDIKERF 107

BLAST of HG10003599 vs. TAIR 10
Match: AT4G14301.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G23450.1); Has 101 Blast hits to 93 proteins in 34 species: Archae - 0; Bacteria - 27; Metazoa - 16; Fungi - 5; Plants - 34; Viruses - 3; Other Eukaryotes - 16 (source: NCBI BLink). )

HSP 1 Score: 41.6 bits (96), Expect = 6.1e-04
Identity = 32/58 (55.17%), Positives = 36/58 (62.07%), Query Frame = 0

Query: 27  IGKLGPAFGLGVGCGVGFGIGLVGGAGFGPGIPGLQLGFGLGAGCGVGLGFGYGVGRG 85
           IGK G  FG G+G G GFG G+  G GFG GI G   G G G G G G GFG G+G+G
Sbjct: 53  IGKGGGIFGHGIGKGGGFGGGISKGGGFGGGI-GKGGGIGGGIGKGKGWGFGGGIGKG 109

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038889093.1  4.4e-65  94.24         ctenidin-1 isoform X1 [Benincasa hispida]
KAG6592270.1    4.4e-65  89.93         hypothetical protein SDJN03_14616, partial [Cucurbita argyrosperma subsp. sorori...
XP_023521209.1  3.8e-64  89.21         fibroin heavy chain isoform X2 [Cucurbita pepo subsp. pepo]
XP_016902400.1  8.4e-64  91.43         PREDICTED: glycine-rich cell wall structural protein 1 [Cucumis melo] >XP_016902...
XP_022932820.1  1.1e-63  88.49         keratin, type II cytoskeletal 2 epidermal isoform X2 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A1S4E347      4.1e-64  91.43         glycine-rich cell wall structural protein 1 OS=Cucumis melo OX=3656 GN=LOC103498...
A0A6J1F360      5.3e-64  88.49         keratin, type II cytoskeletal 2 epidermal isoform X2 OS=Cucurbita moschata OX=36...
A0A0A0KSM8      2.6e-63  90.71         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G646690 PE=4 SV=1
A0A6J1IA95      2.2e-62  86.33         keratin, type II cytoskeletal 3 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A6J1EY31      3.8e-62  87.14         keratin, type II cytoskeletal 2 epidermal isoform X1 OS=Cucurbita moschata OX=36...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G10330.1     6.9e-40  58.21         glycine-rich protein
AT1G66820.1     5.3e-08  51.43         glycine-rich protein
AT4G14301.1     6.1e-04  55.17         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term    Source Description    Alignment
None      No IPR available  PANTHER  PTHR34201:SF1  GLYCINE-RICH PROTEIN  coord: 1..139
None      No IPR available  PANTHER  PTHR34201      GLYCINE-RICH PROTEIN  coord: 1..139

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10003599.1  HG10003599.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane