HG10000916 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10000916
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
LocationChr09: 11511336 .. 11515644 (-)
RNA-Seq ExpressionHG10000916
SyntenyHG10000916
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAATTTGTACGACTAACTCAAAAAATGGGTCCGATAGGCCAAATGACCCATTTTTTAAAAAAAAACTGGCAAATGCCCTTCTGCTGACGTCACGCAATAGGCTAAATTATGAGCGTGTCCTCACTAAACCCCTTTGTAAGTCTATCGGTGTAATTCAAAATTTACGCGCCAACGACTTCTTCTCGCTCAATCGCTCACTTACTCTTTACTCCCTTACTTGATCTCGCTGCCGTTGAACACCCATCAACTCATCTGCAAATTTAAACATCCATCCGCCGCCGACTTCCAAATCTATCAAAGTAAGTGAATTCGAACTTTGTCAAATCGATTAAGTCTGAAGTTCACTGTGTTTGCATTTGTGGGTTTAGCGATCGACATCATCTCTAGGGTTTACAAAAATTAATAGATAATATCATTGTGTGTTGTGTTTCGAAGGGGTCTGTGGAGATTGCATGAAACACTGAGGTTGAAGACGAGTGTTAGAGAGGGTGAACTGGGTATGTAATTGAGCGAGATCGGTATCCGCAGATGAGCGAGAGCGAGATGAGCGAGATGAGCAATATGAGTGAGATGAGCGATATGAGCGAGATGAGCGAGATCGAAAGCGAGGGAAATAATACAATATTGCTAAAGAGCTAGAACAGCGAGAGTATGTAGAGCGAGACGAGCAAGACTAGTGGGAGCAAGAGCGATACGTGCGTGAATATTGTGGTAGAGAAGAGCGATACCAGCGAGAATAGGGTTTAGGGTTACCAGCGAGACTCAGTTGAGCGAGATCGTGACGAGCGAGATTTGGACAGGATTGCATTACATTGACTTCTTTTTTAGATATATAATAAGTTTCATGTCCACTTATAGTTAAATTATTGCATGCAGGCACAATATGTCAACAGCTTTGAAAATACGTGAAGTCGTTCGCTTTTCAGGACAAGTAACGAGCTTGGCCCACATTGCGAATGCGAACAAGATTGTGAAGGAGAAGCTTATGCCAAGGCAGCTTGAGATGTTCAAGAGAACAGTATTCAGGCGCTTTGTTAACATTGACATAGTATTTAATAGCCCAATCATCCACCACATGTTGCTGAGGGAGGTGAAGGATAGGAGAGTGGACTCGATGAGCTTTTCTATTAGACGGAATGTCGTTACCTTTTCAAAGGACAACTTTTTGTTGATCACTGGTTTGTGGCGGGACCCTACCAGAGTTGATCGGGTCGAGGAGTCTACTGAAGATCTTGAGACGAAGTACTTTGGTAATCGGTCAGACTCATTCCATATCAATCTGCTAGAAGAAGAATATAAGAATTTGGATTTTGAAAATGACTTAGACGCTGTGAAGATTATATTGGTGTACTACACGGAGGTTGCAATGATGGGAAAGAATAAGCAGAAGAGTGCAGTAGATCGTAGACGATTCAAGGAAGTGGAGAACCTTGAATACTATAACAGCATCGACTGGGGAAGCATTATTTGGGAGAGGACTCTTGATGGCCTGAAGACAGCATTGAAAGACAAGGTTAGCATATATAAGAGTAAGGTGAAATCGAACAAAAAATTTATTATCAAGTACTCCCTTCGTGGATTTCCTCAGGCATTCCAGGTACTTGTTATATTTAAATACATTCTCATTATTGGTTTAACATTCCTCATAACTCTTATACTGACTCTGTTTCATATTCAGGTGTGGACATATTAGATCCTGTCATCTACTGTAGGAAACATTGCTACTAGAACTAGCAAGGTAGCTATTCCCCCGCATATTACGATGGTCGTGCTCTCACTCGGTGTCTGCCAAAGTAGTGCAACGAGATATTTTTTATTCTACACATGTAAGTGCCATAAACCTCGTCCACTTATTATTTTAATATTTTCTATGGGTCTTATTGTACTGTATCGTTTATGCAGACAAGGGTCAATGAGAATCTTGTAATGTCAGATGCAGAGAAGGAATTCCGAGACACCCAGTTCATAGGCGACTTGTGATTGGTCCACGAGTGGAGGAAGATACTAGCAATAGTTTAGAAAGTGAAGAGGATGAGCATGATAATGATTCTGACGATGGTGGAGACGATCAGGATGATCGGAAGGGTCAAGATGACCGGGAGGATCGGGAGAACCGGGATGATCGGGAGGACCGAGATGACCGGGATGATGGAGATGCTCAATTTGATAGTGATGATCGGGTTCATGGGGATGATGGAGTTGATTGTGATGATCGGGATGATGTGGATGACGTGAACACATCTGAGCCCACGAATTCGAGAATGCCAACGACTGAGCCCAGTGGATTCGACGCTACACCTAATGTATACTCCTACCTGCGGAGTATCGACAGTTCTGTGTCGAGGTTGGAGGGTCGTGTATTGAAGGTCGATGATGAACTTGGTGTCGTGAAGGCACAACTGACGACCATTACGTCATTATTACAGTCGTTGATTAAGGTTTGTCTTTCTTTCTTTCTTTTATTTGTTTGCTTCAAGTACAGTCTTTGTTTCTTTTATTTGTTGGTTTATTGACATTTCCATATCATGATTGAAGGATCCCTCTATGATGTCGGGTTTGAGGAGGTCACCCACACCACTCCGAGACCCAGATTCGAGGTCACCCTCTCCACCCCGACACACGGATTCGAGGAGGTCACCCACACCACCCCGACCCACGGATTCGAGGAGGTCACCCAAACCACCCTGAGACACGAATTCGAGGTCGCCCATCCCACCACCACCTCGACCTCATTCTTCACCCCGAGAAATGGATTCGCGGAGGTCACCTTCCCCACCTCCACATCTACTTCCTTCCCCACCCCCCCTCCACCTTCTTCCCCACCCCCCTCCTTCTTCACTCCCACCTTTACCTCCTTCTCCACCACAGTTTACCGAGGCCAGTAGTTCACCTCCACTACTTTCACTCCCTTCTACACATCCACTTCATTCTCCACTCCGGGATATAGATGTTACCATTTCACACCACCACTTCCACTTCCCTCTCCACCGCCTTCTACTGTTATAGAGCCAGTGCTGGAGCCCGTTCCTCTTACATGTTCTACCCTAGTAGACAGACATCCATTGGTGAGTGTACTTTGTGATTGCAATTTAACTATATGTTGTTTAGTCCCTGCTATAGAGCCAGTGTTAGTCCTAAAATGTTTTTTAATGTATGTAGGTACGTGGCGATCGTCGAAAAGGTGGCCGTAAAAGGAAGGGTGTTGACAAGTACACTCCAACGGACACACGACCAAAGAAAACGATGAAACATGAGACAGCATCAGCGCAATCGTTCAAGTACCCTCTACCGGGAGTTTATGCACCGATGGGTAGACCAAGGGTCAGCTACAATATAGCACACGACGTTCCATCCGGCGTATTCGACGGTATGATGTCATGGATTGCTAATAAGATTACGGATAGTGAGTTTAGAGTTAATTCATATAAACCATTGTTCAAGAAATTCTTCGAGGAGCTGACTGAACCGAGTTCATGGGTTGAGTGCGATGTAAGTCTACATTCTTAATACTTGCAGTAAATTGACCAACCGAAGTACTTTATGTCTAATTTCTCATTTTGGATGCAGCCGATAAAATTCATGTTCCGGTTCATTGCCGACAAGTTTCACAGTCGTCCGAAAATGTGCATGAATAAGTTCACCGTCATACCAACGGGGATAGTGGTAAATATTTGTATTTTTATGTTTTCGAGGTCATATGTTTACAAACTTCTTTGATTGCTATGTTTTTTTACTTTGTGAATTATTCATGGTCCAGGGTCATCTTAATGCACGCGATGGTCTATATCAACGGATTAAGAATCAACCACAACTGGCACATGCACTGCTATGGGAGGAGGAAGACACATATCCGGACTACGTACGGGGAAAATTTGACACACATAACACCTAGTGGGCGGATATAAACTTTGTGTATAGCGTGGTCAACACCGGCGAACACTGGGTCACTGTTGCATTGGACATGAACAGGCCAGCCAGATATCGTGTTTGATTCGCTTCCATCCGTAACATCAGTGAAGAAGCTGGAAACCCTTCTGGAGCCGCTAAGTCACACCCTACCGTCATTGCTTAACTATTGTGACCTGAAGCGGTTTAAGCCAAATTTGGAGATGAGACGATGGCGCATTTTTCAACCAACGAAGAAGAACATGCAACATAGGTCGCTGGATTGTGGAATTTTTGCTGTTAAACTGCTAGAGCATTTAATTACCGAGGCAAATATACATGTAATTACACAGGAAAAAATGACTGATTATAGGATACAGTTAGCATGTCAACTATGGGCGAATGAACCATTCTTCTGA

mRNA sequence

ATGGGAAATTTATGCAGAGAAGGAATTCCGAGACACCCAGTTCATAGGCGACTTGTGATTGGTCCACGAGTGGAGGAAGATACTAGCAATAGTTTAGAAAGTGAAGAGGATGAGCATGATAATGATTCTGACGATGGTGGAGACGATCAGGATGATCGGAAGGGTCAAGATGACCGGGAGGATCGGGAGAACCGGGATGATCGGGAGGACCGAGATGACCGGGATGATGGAGATGCTCAATTTGATAGTGATGATCGGGTTCATGGGGATGATGGAGTTGATTGTGATGATCGGGATGATGTGGATGACGTGAACACATCTGAGCCCACGAATTCGAGAATGCCAACGACTGAGCCCAGTGGATTCGACGCTACACCTAATGTATACTCCTACCTGCGGAGTATCGACAGTTCTGTGTCGAGGTTGGAGGGTCGTGTATTGAAGGTCGATGATGAACTTGGTGTCGTGAAGGCACAACTGACGACCATTACGTCATTATTACAGTCGTTGATTAAGCGGTTTAAGCCAAATTTGGAGATGAGACGATGGCGCATTTTTCAACCAACGAAGAAGAACATGCAACATAGGTCGCTGGATTGTGGAATTTTTGCTGTTAAACTGCTAGAGCATTTAATTACCGAGGCAAATATACATGTAATTACACAGGAAAAAATGACTGATTATAGGATACAGTTAGCATGTCAACTATGGGCGAATGAACCATTCTTCTGA

Coding sequence (CDS)

ATGGGAAATTTATGCAGAGAAGGAATTCCGAGACACCCAGTTCATAGGCGACTTGTGATTGGTCCACGAGTGGAGGAAGATACTAGCAATAGTTTAGAAAGTGAAGAGGATGAGCATGATAATGATTCTGACGATGGTGGAGACGATCAGGATGATCGGAAGGGTCAAGATGACCGGGAGGATCGGGAGAACCGGGATGATCGGGAGGACCGAGATGACCGGGATGATGGAGATGCTCAATTTGATAGTGATGATCGGGTTCATGGGGATGATGGAGTTGATTGTGATGATCGGGATGATGTGGATGACGTGAACACATCTGAGCCCACGAATTCGAGAATGCCAACGACTGAGCCCAGTGGATTCGACGCTACACCTAATGTATACTCCTACCTGCGGAGTATCGACAGTTCTGTGTCGAGGTTGGAGGGTCGTGTATTGAAGGTCGATGATGAACTTGGTGTCGTGAAGGCACAACTGACGACCATTACGTCATTATTACAGTCGTTGATTAAGCGGTTTAAGCCAAATTTGGAGATGAGACGATGGCGCATTTTTCAACCAACGAAGAAGAACATGCAACATAGGTCGCTGGATTGTGGAATTTTTGCTGTTAAACTGCTAGAGCATTTAATTACCGAGGCAAATATACATGTAATTACACAGGAAAAAATGACTGATTATAGGATACAGTTAGCATGTCAACTATGGGCGAATGAACCATTCTTCTGA

Protein sequence

MGNLCREGIPRHPVHRRLVIGPRVEEDTSNSLESEEDEHDNDSDDGGDDQDDRKGQDDREDRENRDDREDRDDRDDGDAQFDSDDRVHGDDGVDCDDRDDVDDVNTSEPTNSRMPTTEPSGFDATPNVYSYLRSIDSSVSRLEGRVLKVDDELGVVKAQLTTITSLLQSLIKRFKPNLEMRRWRIFQPTKKNMQHRSLDCGIFAVKLLEHLITEANIHVITQEKMTDYRIQLACQLWANEPFF
Homology
BLAST of HG10000916 vs. NCBI nr
Match: XP_038875042.1 (uncharacterized protein LOC120067568 [Benincasa hispida])

HSP 1 Score: 99.4 bits (246), Expect = 4.6e-17
Identity = 46/83 (55.42%), Postives = 62/83 (74.70%), Query Frame = 0

Query: 162 TITSLLQSL-IKRFKPNLEMRRWRIFQPTKKNMQHRSLDCGIFAVKLLEHLITEANIHVI 221
           T+ SLL    +KR+KP+++  RW+I QPT  N Q+  LDCGIF VKLLEHL+T   +  I
Sbjct: 71  TLPSLLHYCDLKRWKPDIKTSRWKISQPTSTNTQYGLLDCGIFVVKLLEHLVTGNALSEI 130

Query: 222 TQEKMTDYRIQLACQLWANEPFF 244
           TQ+K+TDYR++LACQLW N P++
Sbjct: 131 TQDKITDYRMKLACQLWTNTPYY 153

BLAST of HG10000916 vs. NCBI nr
Match: XP_038874902.1 (uncharacterized protein LOC120067405 [Benincasa hispida])

HSP 1 Score: 70.1 bits (170), Expect = 3.0e-08
Identity = 35/70 (50.00%), Postives = 48/70 (68.57%), Query Frame = 0

Query: 162 TITSLLQSL-IKRFKPNLEMRRWRIFQPTKKNMQHRSLDCGIFAVKLLEHLITEANIHVI 221
           T+ SLL    + R K +L   RW+I +P   N+QH SLDCGIF +KLL+HL+T  +  VI
Sbjct: 122 TVLSLLHYCDVDRSKSDLYTVRWKISRPMNVNIQHGSLDCGIFVIKLLKHLVTSTDPSVI 181

Query: 222 TQEKMTDYRI 231
           TQEK+ +YR+
Sbjct: 182 TQEKINEYRM 191

BLAST of HG10000916 vs. ExPASy TrEMBL
Match: A0A6J1DRI2 (uncharacterized protein LOC111022515 OS=Momordica charantia OX=3673 GN=LOC111022515 PE=3 SV=1)

HSP 1 Score: 67.4 bits (163), Expect = 9.4e-08
Identity = 34/84 (40.48%), Postives = 50/84 (59.52%), Query Frame = 0

Query: 162 TITSLLQS--LIKRFKPNLEMRRWRIFQPTKKNMQHRSLDCGIFAVKLLEHLITEANIHV 221
           TI SLL +  L+      L+   WR++ PT    Q  S+DCGIFA K LE+L++  ++  
Sbjct: 68  TIPSLLYACGLMDTTDCKLKKTSWRVYHPTTDTRQKGSIDCGIFACKFLEYLVSGNSLET 127

Query: 222 ITQEKMTDYRIQLACQLWANEPFF 244
           + Q +++  R Q A QLW NEP+F
Sbjct: 128 LVQAEVSHIRRQYATQLWHNEPYF 151

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038875042.14.6e-1755.42uncharacterized protein LOC120067568 [Benincasa hispida][more]
XP_038874902.13.0e-0850.00uncharacterized protein LOC120067405 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1DRI29.4e-0840.48uncharacterized protein LOC111022515 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.395.10Adenoviral Proteinase; Chain Acoord: 131..240
e-value: 3.3E-9
score: 38.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..124
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 48..94
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 107..124
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 9..23
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 30..47
IPR003653Ulp1 protease family, C-terminal catalytic domainPFAMPF02902Peptidase_C48coord: 156..234
e-value: 4.4E-7
score: 30.0
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 165..237

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10000916.1HG10000916.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity