HG10021820 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021820
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionextensin-like isoform X2
LocationChr05: 17171790 .. 17172355 (+)
RNA-Seq ExpressionHG10021820
SyntenyHG10021820
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATTATTTCGAAGCTCGTCTTACCCTCTTCCTCTCATTTGCTCTCATCACTGCAGCTCGACTCACGCCCCATTCAGGTATGCACACTGCTACTGGTTGTTGAAAAGTCTAGATTTATGTAGCTCGTGTTTTGAATAGTGTATAAAAATTAAATCTAATATCATGTCAGAAAAACAAGACACGGCGATTATGATAAAGGACTATACTGATCCAGGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGATTATACTGATCCAGGAGCTAATCCAAGACATGACCCAAACCAGCCGCCAATGATGATAAATGACTATGCTGATCCAAGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGATTATGCTGATCCAGGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGACTATACTGATCCAGGAGCTAATCCAAGACATGATCCAAATCAGCCGCCTCCACCTTCGCCTCGACGATTTGAGGTTAGAACTACCAATGGCCACATTAGCAGAAATCTCTAA

mRNA sequence

ATGGATTATTTCGAAGCTCGTCTTACCCTCTTCCTCTCATTTGCTCTCATCACTGCAGCTCGACTCACGCCCCATTCAGAAAAACAAGACACGGCGATTATGATAAAGGACTATACTGATCCAGGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGATTATACTGATCCAGGAGCTAATCCAAGACATGACCCAAACCAGCCGCCAATGATGATAAATGACTATGCTGATCCAAGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGATTATGCTGATCCAGGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGACTATACTGATCCAGGAGCTAATCCAAGACATGATCCAAATCAGCCGCCTCCACCTTCGCCTCGACGATTTGAGGTTAGAACTACCAATGGCCACATTAGCAGAAATCTCTAA

Coding sequence (CDS)

ATGGATTATTTCGAAGCTCGTCTTACCCTCTTCCTCTCATTTGCTCTCATCACTGCAGCTCGACTCACGCCCCATTCAGAAAAACAAGACACGGCGATTATGATAAAGGACTATACTGATCCAGGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGATTATACTGATCCAGGAGCTAATCCAAGACATGACCCAAACCAGCCGCCAATGATGATAAATGACTATGCTGATCCAAGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGATTATGCTGATCCAGGAGCTAATCCAAGACATGATCCAAACCAACCGCCAATGATGATAAATGACTATACTGATCCAGGAGCTAATCCAAGACATGATCCAAATCAGCCGCCTCCACCTTCGCCTCGACGATTTGAGGTTAGAACTACCAATGGCCACATTAGCAGAAATCTCTAA

Protein sequence

MDYFEARLTLFLSFALITAARLTPHSEKQDTAIMIKDYTDPGANPRHDPNQPPMMINDYTDPGANPRHDPNQPPMMINDYADPRANPRHDPNQPPMMINDYADPGANPRHDPNQPPMMINDYTDPGANPRHDPNQPPPPSPRRFEVRTTNGHISRNL
Homology
BLAST of HG10021820 vs. NCBI nr
Match: XP_038895781.1 (circumsporozoite protein-like [Benincasa hispida])

HSP 1 Score: 253.4 bits (646), Expect = 1.2e-63
Identity = 126/174 (72.41%), Postives = 133/174 (76.44%), Query Frame = 0

Query: 1   MDYFEARLTLF--LSFALITAARLTPHSEKQDTAIMIKDYTDPGANPRHDPNQPPMMIND 60
           M+YF+ARLT F  LSFALIT AR+TPHSE Q+  I I DY DPGANPRHDPNQPPMMIND
Sbjct: 1   MNYFKARLTFFLLLSFALITLARITPHSESQNNTITINDYADPGANPRHDPNQPPMMIND 60

Query: 61  YTDPGANPRHDPNQPPMMINDYADP----------------RANPRHDPNQPPMMINDYA 120
           Y DPGANPRHDPNQ PMMINDY DP                RANPRHDPNQPPMMINDY 
Sbjct: 61  YADPGANPRHDPNQTPMMINDYTDPGANPRHDPNQPPMMINRANPRHDPNQPPMMINDYT 120

Query: 121 DPGANPRHDPNQPPMMINDYTDPGANPRHDPNQPPPPSPRRFEVRTTNGHISRN 157
           DP ANPRHDPNQPPMMINDY D GANPRHD NQ PP  PRRF+V    GH+S+N
Sbjct: 121 DPRANPRHDPNQPPMMINDYADAGANPRHDSNQSPP--PRRFKVSAAGGHVSKN 172

BLAST of HG10021820 vs. NCBI nr
Match: KAF4369213.1 (hypothetical protein G4B88_009511 [Cannabis sativa] >KAF4387559.1 hypothetical protein F8388_011707 [Cannabis sativa])

HSP 1 Score: 107.5 bits (267), Expect = 1.1e-19
Identity = 63/153 (41.18%), Postives = 82/153 (53.59%), Query Frame = 0

Query: 21  RLTPHSEKQDTAIMIKDYTDPGANPRHDPNQPPMMIN-------DYTDPGANPRHDPNQP 80
           +L P     +  + I+DY DPG+NPRH  + PP+M N       DY   G NPRH P+ P
Sbjct: 105 KLPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNENELILQDYPPVGPNPRHKPHHP 164

Query: 81  P------MMINDYADPRANPRHDP------NQPPMMINDYADPGANPRHDPNQPPMMIN- 140
           P      + + DYADP  NPRH P      N+  + I DYADPG+NPRH  + PP+M N 
Sbjct: 165 PLANENKLSVQDYADPEPNPRHKPPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNE 224

Query: 141 ------DYTDPGANPRHDPNQPPPPSPRRFEVR 148
                 DY   G NPRH P+ PP  +  +  V+
Sbjct: 225 NELILQDYPPVGPNPRHKPHHPPLANENKLSVQ 255

BLAST of HG10021820 vs. NCBI nr
Match: XP_022948581.1 (extensin-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 104.0 bits (258), Expect = 1.2e-18
Identity = 78/194 (40.21%), Postives = 94/194 (48.45%), Query Frame = 0

Query: 8   LTLFLSFALITAARLTPHSEKQDTAIMIKDYTDPGANPRHDPNQP-----------PMMI 67
           L   LSFAL+++AR+   S KQ T I+I DY+DPG NP   P  P           P  I
Sbjct: 9   LVFLLSFALVSSARIMSDSGKQATKIVINDYSDPGKNPTLSPVVPKQFPPPPPDFEPFAI 68

Query: 68  NDYTDPGAN-------PRHDPNQPP----MMINDYADPRAN-------PRHDPNQPP--- 127
           NDY+DPG N       P+  P  PP     +INDY+DP  N       P+  P  PP   
Sbjct: 69  NDYSDPGKNPTLSPVVPKQFPPPPPDFEAFVINDYSDPGKNPTLSPVVPKQFPPPPPDFK 128

Query: 128 -MMINDYADPGAN-------PRHDPNQPP----MMINDYTDPGANPRHDPNQP---PPPS 155
             +INDY+DPG N       P+  P  PP     +INDY+DPG NP   P  P   PPP 
Sbjct: 129 AFVINDYSDPGKNPTLSPVVPKQFPPPPPDFKAFVINDYSDPGKNPTLSPVVPKQFPPPP 188

BLAST of HG10021820 vs. NCBI nr
Match: XP_030505386.1 (sporozoite surface protein 2-like [Cannabis sativa])

HSP 1 Score: 103.2 bits (256), Expect = 2.1e-18
Identity = 61/150 (40.67%), Postives = 80/150 (53.33%), Query Frame = 0

Query: 24  PHSEKQDTAIMIKDYTDPGANPRHDPNQPPMMIN-------DYTDPGANPRHDPNQPP-- 83
           P     +  + I+DY DPG+NPRH  + PP+M N       DY   G NPRH P+ PP  
Sbjct: 69  PQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNKNELILQDYPPVGPNPRHKPHHPPLA 128

Query: 84  ----MMINDYADPRANPRHDP------NQPPMMINDYADPGANPRHDPNQPPMMIN---- 143
               + + DYADP  NPRH P      N+  + I DYADPG+NPRH  + PP+M N    
Sbjct: 129 NENKLSVQDYADPEPNPRHKPPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNENEL 188

Query: 144 ---DYTDPGANPRHDPNQPPPPSPRRFEVR 148
              DY   G NPR+ P+ PP  +  +  V+
Sbjct: 189 ILQDYPPVGPNPRYKPHHPPLANENKLSVQ 216

BLAST of HG10021820 vs. NCBI nr
Match: XP_022948580.1 (extensin-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 99.0 bits (245), Expect = 3.9e-17
Identity = 78/196 (39.80%), Postives = 94/196 (47.96%), Query Frame = 0

Query: 8   LTLFLSFALITAARLTPH--SEKQDTAIMIKDYTDPGANPRHDPNQP-----------PM 67
           L   LSFAL+++AR+     S KQ T I+I DY+DPG NP   P  P           P 
Sbjct: 9   LVFLLSFALVSSARIMSDSVSGKQATKIVINDYSDPGKNPTLSPVVPKQFPPPPPDFEPF 68

Query: 68  MINDYTDPGAN-------PRHDPNQPP----MMINDYADPRAN-------PRHDPNQPP- 127
            INDY+DPG N       P+  P  PP     +INDY+DP  N       P+  P  PP 
Sbjct: 69  AINDYSDPGKNPTLSPVVPKQFPPPPPDFEAFVINDYSDPGKNPTLSPVVPKQFPPPPPD 128

Query: 128 ---MMINDYADPGAN-------PRHDPNQPP----MMINDYTDPGANPRHDPNQP---PP 155
               +INDY+DPG N       P+  P  PP     +INDY+DPG NP   P  P   PP
Sbjct: 129 FKAFVINDYSDPGKNPTLSPVVPKQFPPPPPDFKAFVINDYSDPGKNPTLSPVVPKQFPP 188

BLAST of HG10021820 vs. ExPASy TrEMBL
Match: A0A7J6FET4 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_011707 PE=4 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 5.3e-20
Identity = 63/153 (41.18%), Postives = 82/153 (53.59%), Query Frame = 0

Query: 21  RLTPHSEKQDTAIMIKDYTDPGANPRHDPNQPPMMIN-------DYTDPGANPRHDPNQP 80
           +L P     +  + I+DY DPG+NPRH  + PP+M N       DY   G NPRH P+ P
Sbjct: 105 KLPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNENELILQDYPPVGPNPRHKPHHP 164

Query: 81  P------MMINDYADPRANPRHDP------NQPPMMINDYADPGANPRHDPNQPPMMIN- 140
           P      + + DYADP  NPRH P      N+  + I DYADPG+NPRH  + PP+M N 
Sbjct: 165 PLANENKLSVQDYADPEPNPRHKPPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNE 224

Query: 141 ------DYTDPGANPRHDPNQPPPPSPRRFEVR 148
                 DY   G NPRH P+ PP  +  +  V+
Sbjct: 225 NELILQDYPPVGPNPRHKPHHPPLANENKLSVQ 255

BLAST of HG10021820 vs. ExPASy TrEMBL
Match: A0A6J1G9M0 (extensin-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111452213 PE=4 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 5.9e-19
Identity = 78/194 (40.21%), Postives = 94/194 (48.45%), Query Frame = 0

Query: 8   LTLFLSFALITAARLTPHSEKQDTAIMIKDYTDPGANPRHDPNQP-----------PMMI 67
           L   LSFAL+++AR+   S KQ T I+I DY+DPG NP   P  P           P  I
Sbjct: 9   LVFLLSFALVSSARIMSDSGKQATKIVINDYSDPGKNPTLSPVVPKQFPPPPPDFEPFAI 68

Query: 68  NDYTDPGAN-------PRHDPNQPP----MMINDYADPRAN-------PRHDPNQPP--- 127
           NDY+DPG N       P+  P  PP     +INDY+DP  N       P+  P  PP   
Sbjct: 69  NDYSDPGKNPTLSPVVPKQFPPPPPDFEAFVINDYSDPGKNPTLSPVVPKQFPPPPPDFK 128

Query: 128 -MMINDYADPGAN-------PRHDPNQPP----MMINDYTDPGANPRHDPNQP---PPPS 155
             +INDY+DPG N       P+  P  PP     +INDY+DPG NP   P  P   PPP 
Sbjct: 129 AFVINDYSDPGKNPTLSPVVPKQFPPPPPDFKAFVINDYSDPGKNPTLSPVVPKQFPPPP 188

BLAST of HG10021820 vs. ExPASy TrEMBL
Match: A0A6J1GAA0 (extensin-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452213 PE=4 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.9e-17
Identity = 78/196 (39.80%), Postives = 94/196 (47.96%), Query Frame = 0

Query: 8   LTLFLSFALITAARLTPH--SEKQDTAIMIKDYTDPGANPRHDPNQP-----------PM 67
           L   LSFAL+++AR+     S KQ T I+I DY+DPG NP   P  P           P 
Sbjct: 9   LVFLLSFALVSSARIMSDSVSGKQATKIVINDYSDPGKNPTLSPVVPKQFPPPPPDFEPF 68

Query: 68  MINDYTDPGAN-------PRHDPNQPP----MMINDYADPRAN-------PRHDPNQPP- 127
            INDY+DPG N       P+  P  PP     +INDY+DP  N       P+  P  PP 
Sbjct: 69  AINDYSDPGKNPTLSPVVPKQFPPPPPDFEAFVINDYSDPGKNPTLSPVVPKQFPPPPPD 128

Query: 128 ---MMINDYADPGAN-------PRHDPNQPP----MMINDYTDPGANPRHDPNQP---PP 155
               +INDY+DPG N       P+  P  PP     +INDY+DPG NP   P  P   PP
Sbjct: 129 FKAFVINDYSDPGKNPTLSPVVPKQFPPPPPDFKAFVINDYSDPGKNPTLSPVVPKQFPP 188

BLAST of HG10021820 vs. ExPASy TrEMBL
Match: A0A7J6FEU1 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_009514 PE=4 SV=1)

HSP 1 Score: 94.7 bits (234), Expect = 3.6e-16
Identity = 57/143 (39.86%), Postives = 73/143 (51.05%), Query Frame = 0

Query: 33  IMIKDYTDPGANPRHDP------NQPPMMINDYTDPGANPRHD-------------PNQP 92
           + ++DY DP  NPRH P      N+  + I DY DPG+NPRH              P+ P
Sbjct: 85  LSVQDYADPKPNPRHKPPPQQPTNENKLSIQDYADPGSNPRHQHPPPLMTNENEHKPHHP 144

Query: 93  P------MMINDYADPRANPRHDP------NQPPMMINDYADPGANPRHDPNQPPMMIN- 138
           P      + + DYA+P  NPRH P      N+  + I DY DPG+NPRH  + PP+M N 
Sbjct: 145 PLANENQLSVQDYANPEPNPRHKPPPQQPTNENKLSIQDYVDPGSNPRH-KHPPPLMTNE 204

BLAST of HG10021820 vs. ExPASy TrEMBL
Match: A0A0E0DTA0 (Uncharacterized protein OS=Oryza meridionalis OX=40149 PE=4 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 4.6e-16
Identity = 54/140 (38.57%), Postives = 71/140 (50.71%), Query Frame = 0

Query: 31  TAIMIKDYTDPGANPRHDPNQPP---------------MMINDYTDPGANPRHDPNQPP- 90
           T + + DY  PGANPRH+P +PP               + +NDY  PGANPRH+P +PP 
Sbjct: 55  TEVEVNDYPAPGANPRHNPKRPPGREMSVQGMVAMATDVEVNDYPAPGANPRHNPKRPPG 114

Query: 91  --MMINDYADPRANPRHDPNQPPMMINDYADPGANPRHDPNQPP---------------M 138
             M +        N         + +NDY  PGANPRH+P +PP               +
Sbjct: 115 REMSVLGTVAATTN---------VEVNDYPAPGANPRHNPKRPPGREMFAQGMAAATTNV 174

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038895781.11.2e-6372.41circumsporozoite protein-like [Benincasa hispida][more]
KAF4369213.11.1e-1941.18hypothetical protein G4B88_009511 [Cannabis sativa] >KAF4387559.1 hypothetical p... [more]
XP_022948581.11.2e-1840.21extensin-like isoform X2 [Cucurbita moschata][more]
XP_030505386.12.1e-1840.67sporozoite surface protein 2-like [Cannabis sativa][more]
XP_022948580.13.9e-1739.80extensin-like isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A7J6FET45.3e-2041.18Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_011707 PE=4 SV=1[more]
A0A6J1G9M05.9e-1940.21extensin-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111452213 PE=4 SV=1[more]
A0A6J1GAA01.9e-1739.80extensin-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452213 PE=4 SV=1[more]
A0A7J6FEU13.6e-1639.86Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_009514 PE=4 SV=1[more]
A0A0E0DTA04.6e-1638.57Uncharacterized protein OS=Oryza meridionalis OX=40149 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 37..157
IPR034430Protein PSYPANTHERPTHR37177PROTEIN PSY1coord: 8..52

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021820.1HG10021820.1mRNA