HG10022471 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10022471
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionproline-rich proteoglycan 2-like
LocationChr05: 24691728 .. 24692306 (-)
RNA-Seq ExpressionHG10022471
SyntenyHG10022471
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGCATGTTTTCTTCTTCTTTTTGCTTCTTGCTATGGTATCAAATATAAAAAGTGTGGAGGCAAAGAGAGATGAAAGTAGTGATCAACTTATTTTGAGTCTCAAAAGTCTTGTTCCCGGAGGGCCAAATCAAGCTTTTCCTCCCGAAACTCCACCTCACAATAATAAATTTACTTTGGATCTCAAGCACCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCACCTGGAGTTCCTCCTCACAGTATTGTTACGAACTCAAAAGCAAAAAATGGTATTGGCGAGCTTACTTTGGGTCCCAAGAGACTTGTTCCAGGAGGGCCAAATCAAGCTTTCCCTCCCGAAACTCCACCTCACAATAGTAAATTTACTTTGGGTCTCAAGTATCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCTCCTGGAGTTCCTCCTCACAATATTGTTACGAATTCAAAAGCAAAAAATGGTATTGGTGAGCTTACTTTAGGTTCCAAGAGACTTGTTCCAGGCGGGCCAAATCAAGCATTTCCCCCTGAAACTCCTCCTCACAGCATTGTTACCAAGTTAATACCTTGA

mRNA sequence

ATGTTGCATGTTTTCTTCTTCTTTTTGCTTCTTGCTATGGTATCAAATATAAAAAGTGTGGAGGCAAAGAGAGATGAAAGTAGTGATCAACTTATTTTGAGTCTCAAAAGTCTTGTTCCCGGAGGGCCAAATCAAGCTTTTCCTCCCGAAACTCCACCTCACAATAATAAATTTACTTTGGATCTCAAGCACCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCACCTGGAGTTCCTCCTCACAGTATTGTTACGAACTCAAAAGCAAAAAATGGTATTGGCGAGCTTACTTTGGGTCCCAAGAGACTTGTTCCAGGAGGGCCAAATCAAGCTTTCCCTCCCGAAACTCCACCTCACAATAGTAAATTTACTTTGGGTCTCAAGTATCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCTCCTGGAGTTCCTCCTCACAATATTGTTACGAATTCAAAAGCAAAAAATGGTATTGGTGAGCTTACTTTAGGTTCCAAGAGACTTGTTCCAGGCGGGCCAAATCAAGCATTTCCCCCTGAAACTCCTCCTCACAGCATTGTTACCAAGTTAATACCTTGA

Coding sequence (CDS)

ATGTTGCATGTTTTCTTCTTCTTTTTGCTTCTTGCTATGGTATCAAATATAAAAAGTGTGGAGGCAAAGAGAGATGAAAGTAGTGATCAACTTATTTTGAGTCTCAAAAGTCTTGTTCCCGGAGGGCCAAATCAAGCTTTTCCTCCCGAAACTCCACCTCACAATAATAAATTTACTTTGGATCTCAAGCACCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCACCTGGAGTTCCTCCTCACAGTATTGTTACGAACTCAAAAGCAAAAAATGGTATTGGCGAGCTTACTTTGGGTCCCAAGAGACTTGTTCCAGGAGGGCCAAATCAAGCTTTCCCTCCCGAAACTCCACCTCACAATAGTAAATTTACTTTGGGTCTCAAGTATCTTGTTCCCGAAGGGCCAAATCAAGCGTACCCTCCTGGAGTTCCTCCTCACAATATTGTTACGAATTCAAAAGCAAAAAATGGTATTGGTGAGCTTACTTTAGGTTCCAAGAGACTTGTTCCAGGCGGGCCAAATCAAGCATTTCCCCCTGAAACTCCTCCTCACAGCATTGTTACCAAGTTAATACCTTGA

Protein sequence

MLHVFFFFLLLAMVSNIKSVEAKRDESSDQLILSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIGELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSIVTKLIP
Homology
BLAST of HG10022471 vs. NCBI nr
Match: XP_022932063.1 (uncharacterized protein LOC111438388 [Cucurbita moschata])

HSP 1 Score: 156.8 bits (395), Expect = 1.9e-34
Identity = 88/169 (52.07%), Postives = 105/169 (62.13%), Query Frame = 0

Query: 33  LSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNG 92
           L  K LVP GPNQ FPPETP HN    L  K LVP+GPNQ +PP  P H++ +     +G
Sbjct: 681 LGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETPLHNLGSKRLVPDG 740

Query: 93  IGEL--------TLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVP 152
             ++         LG KRLVP GPNQ FPPETP HN    LG K LVP+GPNQ +PP  P
Sbjct: 741 PNQMFPPETPLHNLGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETP 800

Query: 153 PHNIVTNSKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSIVTK-LIP 193
            HN+ +     +G     LGSKRLVPGGPNQ F PETP H++ +K L+P
Sbjct: 801 LHNLGSKRLVPDG---PNLGSKRLVPGGPNQMFSPETPLHNLGSKRLVP 838

BLAST of HG10022471 vs. NCBI nr
Match: XP_022159163.1 (proline-rich proteoglycan 2-like [Momordica charantia])

HSP 1 Score: 117.9 bits (294), Expect = 9.9e-23
Identity = 81/194 (41.75%), Postives = 92/194 (47.42%), Query Frame = 0

Query: 4   VFFFFLLLAMVSNIKSVEAKRDESSDQLILSLKSLVPGGPNQAFPPETPPHNNKFTLDLK 63
           VF F   +A++S+  +VEA+RD + D L L LK  VP GPN A  PE PP    FTL LK
Sbjct: 12  VFLFLSFVAILSDANTVEARRDANGDVLALDLKRRVPMGPNPATSPERPP----FTLGLK 71

Query: 64  HLVPEGPNQAYPPGVPPHSIVTNSKAKNGIGELTLGPKRLVPGGPNQAFPPETPPHNSKF 123
             VP  PN    P  PP               LTLG KR VP GPN A  PE+PP    F
Sbjct: 72  RHVPTDPNPTTSPDRPP---------------LTLGLKRRVPTGPNPATSPESPP----F 131

Query: 124 TLGLKYLVPEGPNQAYPPGVPPHNIVTNSKAKNGIG--------ELTLGSKRLVPGGPNQ 183
             GLK  VP GPN A  P  PP       +   G            T   KR VP GPN 
Sbjct: 132 IFGLKRRVPTGPNPATSPERPPFTFSLKRRVPTGPNPATSPERLPFTFSLKRRVPTGPNP 182

Query: 184 AFPPETPPHSIVTK 190
           A  PE PP ++  K
Sbjct: 192 ATSPERPPFTLGLK 182

BLAST of HG10022471 vs. NCBI nr
Match: VDC94000.1 (unnamed protein product [Brassica oleracea])

HSP 1 Score: 111.7 bits (278), Expect = 7.1e-21
Identity = 70/162 (43.21%), Postives = 81/162 (50.00%), Query Frame = 0

Query: 35  LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
           +K LVP GPN    P + PH+      +K LVP GPN    P  PPHSIV          
Sbjct: 230 VKRLVPSGPNNETSPSSSPHSIA-DFGVKRLVPSGPNNETSPPSPPHSIV---------- 289

Query: 95  ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
               G KRLVP GPN    P +PPH      G+K LVP GPN    P  PPH I     A
Sbjct: 290 --DFGVKRLVPSGPNNETSPPSPPHPIA-DFGVKRLVPSGPNNETSPPSPPHPI-----A 349

Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
             G+  L +G KRLVP GPN    P +PPH I    V +L+P
Sbjct: 350 DFGVKRLNIGVKRLVPSGPNNETSPPSPPHFIADFGVKRLVP 372

BLAST of HG10022471 vs. NCBI nr
Match: KAG2315933.1 (hypothetical protein Bca52824_019055 [Brassica carinata])

HSP 1 Score: 110.2 bits (274), Expect = 2.1e-20
Identity = 72/176 (40.91%), Postives = 85/176 (48.30%), Query Frame = 0

Query: 35  LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
           LK LVP GPN    P +PPH+ +  + +K LVP GPN    P  PPHSI           
Sbjct: 138 LKRLVPSGPNNETSPPSPPHSME-DIRVKRLVPSGPNNETSPPSPPHSIA---------- 197

Query: 95  ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIV---TN 154
               G KRLVP GPN    P +PPH+  F  G+K LVP GPN    P  PPH I      
Sbjct: 198 --EFGLKRLVPSGPNNETSPPSPPHSIAF-FGMKRLVPSGPNNETSPPSPPHFIADFGLK 257

Query: 155 SKAKNGIGELT-----------LGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
               +G    T           +G KRLVP GPN    P +PPHSI    V +L+P
Sbjct: 258 RLVPSGPNNETSPPSPPHSMEDIGVKRLVPSGPNNETSPPSPPHSIADFGVKRLVP 299

BLAST of HG10022471 vs. NCBI nr
Match: XP_033143265.1 (leucine-rich repeat extensin-like protein 5 [Brassica rapa])

HSP 1 Score: 110.2 bits (274), Expect = 2.1e-20
Identity = 69/162 (42.59%), Postives = 80/162 (49.38%), Query Frame = 0

Query: 35  LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
           +K LVP GPN    P +PPH+  F   +K LVP GPN    P  PPHSI           
Sbjct: 79  VKRLVPSGPNNETSPPSPPHSADF--GVKRLVPSGPNNETSPPSPPHSIA---------- 138

Query: 95  ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
               G KRLVP GPN    P +PPH S  + G+K LVP GPN    P  PPH+I      
Sbjct: 139 --DYGVKRLVPSGPNNETSPPSPPH-SIASFGVKRLVPSGPNNETSPPSPPHSIA----- 198

Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
                    G K LVP GPN    P +PPHSI    V +L+P
Sbjct: 199 -------DFGVKGLVPSGPNNETSPPSPPHSIADFGVKRLVP 213

BLAST of HG10022471 vs. ExPASy TrEMBL
Match: A0A6J1F0K6 (uncharacterized protein LOC111438388 OS=Cucurbita moschata OX=3662 GN=LOC111438388 PE=4 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 9.3e-35
Identity = 88/169 (52.07%), Postives = 105/169 (62.13%), Query Frame = 0

Query: 33  LSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNG 92
           L  K LVP GPNQ FPPETP HN    L  K LVP+GPNQ +PP  P H++ +     +G
Sbjct: 681 LGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETPLHNLGSKRLVPDG 740

Query: 93  IGEL--------TLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVP 152
             ++         LG KRLVP GPNQ FPPETP HN    LG K LVP+GPNQ +PP  P
Sbjct: 741 PNQMFPPETPLHNLGSKRLVPDGPNQMFPPETPLHN----LGSKRLVPDGPNQMFPPETP 800

Query: 153 PHNIVTNSKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSIVTK-LIP 193
            HN+ +     +G     LGSKRLVPGGPNQ F PETP H++ +K L+P
Sbjct: 801 LHNLGSKRLVPDG---PNLGSKRLVPGGPNQMFSPETPLHNLGSKRLVP 838

BLAST of HG10022471 vs. ExPASy TrEMBL
Match: A0A6J1DY20 (proline-rich proteoglycan 2-like OS=Momordica charantia OX=3673 GN=LOC111025587 PE=4 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 4.8e-23
Identity = 81/194 (41.75%), Postives = 92/194 (47.42%), Query Frame = 0

Query: 4   VFFFFLLLAMVSNIKSVEAKRDESSDQLILSLKSLVPGGPNQAFPPETPPHNNKFTLDLK 63
           VF F   +A++S+  +VEA+RD + D L L LK  VP GPN A  PE PP    FTL LK
Sbjct: 12  VFLFLSFVAILSDANTVEARRDANGDVLALDLKRRVPMGPNPATSPERPP----FTLGLK 71

Query: 64  HLVPEGPNQAYPPGVPPHSIVTNSKAKNGIGELTLGPKRLVPGGPNQAFPPETPPHNSKF 123
             VP  PN    P  PP               LTLG KR VP GPN A  PE+PP    F
Sbjct: 72  RHVPTDPNPTTSPDRPP---------------LTLGLKRRVPTGPNPATSPESPP----F 131

Query: 124 TLGLKYLVPEGPNQAYPPGVPPHNIVTNSKAKNGIG--------ELTLGSKRLVPGGPNQ 183
             GLK  VP GPN A  P  PP       +   G            T   KR VP GPN 
Sbjct: 132 IFGLKRRVPTGPNPATSPERPPFTFSLKRRVPTGPNPATSPERLPFTFSLKRRVPTGPNP 182

Query: 184 AFPPETPPHSIVTK 190
           A  PE PP ++  K
Sbjct: 192 ATSPERPPFTLGLK 182

BLAST of HG10022471 vs. ExPASy TrEMBL
Match: A0A3P6B7Y9 (Uncharacterized protein OS=Brassica oleracea OX=3712 GN=BOLC3T17342H PE=4 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.4e-21
Identity = 70/162 (43.21%), Postives = 81/162 (50.00%), Query Frame = 0

Query: 35  LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
           +K LVP GPN    P + PH+      +K LVP GPN    P  PPHSIV          
Sbjct: 230 VKRLVPSGPNNETSPSSSPHSIA-DFGVKRLVPSGPNNETSPPSPPHSIV---------- 289

Query: 95  ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
               G KRLVP GPN    P +PPH      G+K LVP GPN    P  PPH I     A
Sbjct: 290 --DFGVKRLVPSGPNNETSPPSPPHPIA-DFGVKRLVPSGPNNETSPPSPPHPI-----A 349

Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
             G+  L +G KRLVP GPN    P +PPH I    V +L+P
Sbjct: 350 DFGVKRLNIGVKRLVPSGPNNETSPPSPPHFIADFGVKRLVP 372

BLAST of HG10022471 vs. ExPASy TrEMBL
Match: M4D9I0 (Uncharacterized protein OS=Brassica rapa subsp. pekinensis OX=51351 PE=4 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 4.5e-21
Identity = 68/165 (41.21%), Postives = 83/165 (50.30%), Query Frame = 0

Query: 33  LSLKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNG 92
           + +K LVP GPN    P +PPH  +  + +K LVP GPN    P  PPH I   +   + 
Sbjct: 25  IGVKRLVPSGPNNETSPPSPPHFIE-DIGVKRLVPSGPNNETSPPSPPHFIEDETSPPSP 84

Query: 93  IGEL-TLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTN 152
              +   G KRLVP GPN    P +PPH S    G+K LVP GPN    P  PPH+I   
Sbjct: 85  PHSIANFGVKRLVPSGPNNETSPPSPPH-SIANFGVKRLVPSGPNNETSPPSPPHSIA-- 144

Query: 153 SKAKNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
                       G KRLVP GPN    P +PPHSI    V +L+P
Sbjct: 145 ----------NFGVKRLVPSGPNNETSPPSPPHSIADFGVKRLVP 175

BLAST of HG10022471 vs. ExPASy TrEMBL
Match: A0A398A230 (Uncharacterized protein OS=Brassica campestris OX=3711 GN=BRARA_C03751 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 3.8e-20
Identity = 68/162 (41.98%), Postives = 80/162 (49.38%), Query Frame = 0

Query: 35  LKSLVPGGPNQAFPPETPPHNNKFTLDLKHLVPEGPNQAYPPGVPPHSIVTNSKAKNGIG 94
           +K LVP GPN    P +PPH  +  +++K LVP GPN    P  PPHSI           
Sbjct: 120 IKRLVPSGPNNETSPPSPPHFIE-DIEVKRLVPSGPNNETSPPSPPHSIA---------- 179

Query: 95  ELTLGPKRLVPGGPNQAFPPETPPHNSKFTLGLKYLVPEGPNQAYPPGVPPHNIVTNSKA 154
               G KRLVP GPN    P +PPH S    G+K LVP GPN    P  PPH+I      
Sbjct: 180 --DFGVKRLVPSGPNNETSPPSPPH-SIADFGVKRLVPSGPNNEASPPSPPHSIA----- 239

Query: 155 KNGIGELTLGSKRLVPGGPNQAFPPETPPHSI----VTKLIP 193
                    G KRLVP GPN    P +PP SI    V +L+P
Sbjct: 240 -------DFGVKRLVPSGPNNETSPPSPPRSIAGFGVKRLVP 255

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022932063.11.9e-3452.07uncharacterized protein LOC111438388 [Cucurbita moschata][more]
XP_022159163.19.9e-2341.75proline-rich proteoglycan 2-like [Momordica charantia][more]
VDC94000.17.1e-2143.21unnamed protein product [Brassica oleracea][more]
KAG2315933.12.1e-2040.91hypothetical protein Bca52824_019055 [Brassica carinata][more]
XP_033143265.12.1e-2042.59leucine-rich repeat extensin-like protein 5 [Brassica rapa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1F0K69.3e-3552.07uncharacterized protein LOC111438388 OS=Cucurbita moschata OX=3662 GN=LOC1114383... [more]
A0A6J1DY204.8e-2341.75proline-rich proteoglycan 2-like OS=Momordica charantia OX=3673 GN=LOC111025587 ... [more]
A0A3P6B7Y93.4e-2143.21Uncharacterized protein OS=Brassica oleracea OX=3712 GN=BOLC3T17342H PE=4 SV=1[more]
M4D9I04.5e-2141.21Uncharacterized protein OS=Brassica rapa subsp. pekinensis OX=51351 PE=4 SV=1[more]
A0A398A2303.8e-2041.98Uncharacterized protein OS=Brassica campestris OX=3711 GN=BRARA_C03751 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 173..192
NoneNo IPR availablePANTHERPTHR37380CLE FAMILY OSCLE501 PROTEINcoord: 4..187

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10022471.1HG10022471.1mRNA