HG10021303 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10021303
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRNase H domain-containing protein
LocationChr05: 7514295 .. 7515410 (-)
RNA-Seq ExpressionHG10021303
SyntenyHG10021303
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCGACTCATAGGAACGACCATAGAACTTGGAGGAGGTCGACTGACCCTATGAGGGTAAAGTTATGCCTAAGTGTTGTTGTACTTTTACACATACTTGGATCACTAGTGAAATAAACCCGACAATTTATAACGGTTAGTAACGACTTGATTAATGACCTTAGAAATTTATTAGCGAATAACCGTTAGGAAACCTATTTCTATATAATCGAGAGGGTTTGTGACATTTGAGGTACGCTAAAATACATTCTAAACTCACTCTCTATATCAATTTATACCTCTAAACTTTAAAGGTTGTATCAATTAAAATCCTGAATTTCGTAAGTGTATTAATTTACACTTTTCTTTAAATTTCATTAAAAGTATCATGTTAGACTTGTAATTATATCAATTAAACTTCTAAATTGTTATAGTGAATCAATTTGTATTATTTATTAAAAATATCTTCAATAATCTTGAATGCATTAGTTCTACATCTATATCTATACTATATTAAATGTGGGCATAAGGAAGAATTTTTACCTTTCCATATTGCCCTTCAATTTTAAGCAAATTCCTATTATACCCTCACATCTCTTAGGTTAGTTACCACTCATCACCCATTCTATTTACTCACACCCTTTCATGTGATTGTTTGTTACGATTGTGTTTTTTCAGTACATCCTCATCTGTATTCAGGGGATTGGTTTTGGTGGAGTGATTGGTTCGGAGTCTGGAGACATTTTGGCTACTCTGGCTGGGCACCGTGAAGGGTTGTGTAGTGTGTTGGGAGCTGAGGCCACTGCTGTGTATGAGGGTCTCAGGCTGGCAGAGAGATTGAGTCTTTCTAATCTTCTAGTTCTTTCGAACTCTTTATCTCTTATTCAAATGCTTAAAGGGACTACTGGTATCTACTGGGAGGTTTCTAATTATATTGAAGATATCAAATGTTGCTTATCCGCTTTTCGGTCAGTGACTTTTCGCCATGTGTCAAGATCTTCCAATCAAACGGCACACTTGTTGGCTCGTGATGGTTCGTCGGGTGTGAGTTTTTTACGACTCAGTTTTTTTTCGGATACGTTGTCCCATTTGTATTCAGGGGATGTTCGTTGTTTGAACAACATTGAGACTATCTAA

mRNA sequence

ATGTCGACTCATAGGAACGACCATAGAACTTGGAGGAGGTCGACTGACCCTATGAGGTACATCCTCATCTGTATTCAGGGGATTGGTTTTGGTGGAGTGATTGGTTCGGAGTCTGGAGACATTTTGGCTACTCTGGCTGGGCACCGTGAAGGGTTGTGTAGTGTGTTGGGAGCTGAGGCCACTGCTGTGTATGAGGGTCTCAGGCTGGCAGAGAGATTGAGTCTTTCTAATCTTCTAGTTCTTTCGAACTCTTTATCTCTTATTCAAATGCTTAAAGGGACTACTGGTATCTACTGGGAGGTTTCTAATTATATTGAAGATATCAAATGTTGCTTATCCGCTTTTCGGTCAGTGACTTTTCGCCATGTGTCAAGATCTTCCAATCAAACGGCACACTTGTTGGCTCGTGATGGTTCGTCGGGTGTGAGTTTTTTACGACTCAGTTTTTTTTCGGATACGTTGTCCCATTTGTATTCAGGGGATGTTCGTTGTTTGAACAACATTGAGACTATCTAA

Coding sequence (CDS)

ATGTCGACTCATAGGAACGACCATAGAACTTGGAGGAGGTCGACTGACCCTATGAGGTACATCCTCATCTGTATTCAGGGGATTGGTTTTGGTGGAGTGATTGGTTCGGAGTCTGGAGACATTTTGGCTACTCTGGCTGGGCACCGTGAAGGGTTGTGTAGTGTGTTGGGAGCTGAGGCCACTGCTGTGTATGAGGGTCTCAGGCTGGCAGAGAGATTGAGTCTTTCTAATCTTCTAGTTCTTTCGAACTCTTTATCTCTTATTCAAATGCTTAAAGGGACTACTGGTATCTACTGGGAGGTTTCTAATTATATTGAAGATATCAAATGTTGCTTATCCGCTTTTCGGTCAGTGACTTTTCGCCATGTGTCAAGATCTTCCAATCAAACGGCACACTTGTTGGCTCGTGATGGTTCGTCGGGTGTGAGTTTTTTACGACTCAGTTTTTTTTCGGATACGTTGTCCCATTTGTATTCAGGGGATGTTCGTTGTTTGAACAACATTGAGACTATCTAA

Protein sequence

MSTHRNDHRTWRRSTDPMRYILICIQGIGFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLIQMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSSGVSFLRLSFFSDTLSHLYSGDVRCLNNIETI
Homology
BLAST of HG10021303 vs. NCBI nr
Match: GAU28886.1 (hypothetical protein TSUD_293380 [Trifolium subterraneum])

HSP 1 Score: 69.7 bits (169), Expect = 2.8e-08
Identity = 45/119 (37.82%), Postives = 69/119 (57.98%), Query Frame = 0

Query: 29  GFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLI 88
           GFGGVI +ESG  L+  +G  +G   +L AE  A+Y+GL LA+ +++  L+  S+SL  I
Sbjct: 145 GFGGVIRNESGFYLSGFSGFIQGSSDILLAELFAIYKGLTLAKNMAIDELVCYSDSLHCI 204

Query: 89  QMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSSGVSFLRL 148
            ++KG +  Y      I+DIK  +S   ++T  H  R  N  A+ LA+ G+S  S L +
Sbjct: 205 NLIKGPSIKYHVYVVLIQDIKELMSQ-SNITLCHTLREGNNCANFLAKLGASSDSDLTI 262

BLAST of HG10021303 vs. NCBI nr
Match: KAF7802400.1 (uncharacterized protein G2W53_041511 [Senna tora])

HSP 1 Score: 68.9 bits (167), Expect = 4.7e-08
Identity = 45/107 (42.06%), Postives = 63/107 (58.88%), Query Frame = 0

Query: 31  GGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLIQM 90
           G +I    G  LA +     G  SV  AEA AVYEG+ +A+R+S+ N+LV  +S S+I++
Sbjct: 630 GCIIRDYMGRCLAAMTKEITGCYSVEMAEALAVYEGMVVAKRMSILNVLVEGDSASVIKL 689

Query: 91  LKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARD 138
           L G       VS  I+DI    S+FRS +F  V R +N+ AH+LA D
Sbjct: 690 LNGEGMDCTYVSFIIKDILQLCSSFRSYSFNWVRREANKVAHILAHD 736

BLAST of HG10021303 vs. NCBI nr
Match: GAU51664.1 (hypothetical protein TSUD_268670 [Trifolium subterraneum])

HSP 1 Score: 67.8 bits (164), Expect = 1.0e-07
Identity = 42/112 (37.50%), Postives = 66/112 (58.93%), Query Frame = 0

Query: 29   GFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLI 88
            GFGGVI +ESG  L+  +G  +G   +L AE  A+Y+GL LA+ +++  L+  S+ L  I
Sbjct: 1090 GFGGVIRNESGFYLSGFSGFIQGSSDILLAELFAIYKGLTLAKNMAIDELVCYSDYLHCI 1149

Query: 89   QMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSS 141
             ++KG +  Y   +  I+DIK  +S   ++T  H  R  N  A+ LA+ G+S
Sbjct: 1150 NLIKGPSIKYHVYAVLIQDIKELMSQ-SNITLCHTLREGNNCANFLAKLGAS 1200

BLAST of HG10021303 vs. NCBI nr
Match: KAF5180544.1 (In chloroplast atpase biogenesis protein, partial [Thalictrum thalictroides])

HSP 1 Score: 67.0 bits (162), Expect = 1.8e-07
Identity = 38/115 (33.04%), Postives = 59/115 (51.30%), Query Frame = 0

Query: 26  QGIGFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSL 85
           QG G+GG++    G  L        G  ++L  E  A+Y+G++LA+ L + N+LV ++S 
Sbjct: 77  QGGGYGGILRDSDGAALYAFTRQSRG-TTILCIELEAIYKGVQLAKELGIKNILVCADSK 136

Query: 86  SLIQMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSS 141
             I  + G   + W   + + +I      F  VTFRH  R +N+ A LLA  G S
Sbjct: 137 QAIDCINGYGPLQWRSRHLVAEIHAAKLGFEQVTFRHFFRETNRAADLLAALGES 190

BLAST of HG10021303 vs. NCBI nr
Match: XP_030483444.1 (uncharacterized protein LOC115700033 [Cannabis sativa])

HSP 1 Score: 66.6 bits (161), Expect = 2.3e-07
Identity = 45/109 (41.28%), Postives = 60/109 (55.05%), Query Frame = 0

Query: 28  IGFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSL 87
           IGFG +I S    + A L+   +G  SV  AEA A+  GL  A+ + L    V S+SLSL
Sbjct: 668 IGFGAIIFSSDKQMKAALSKPLQGSFSVFQAEAVALLVGLCWAQDVGLPVERVFSDSLSL 727

Query: 88  IQMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLAR 137
           +  L+G T    E+     DIK  LS+F  V+  HVSR+ N  AH LA+
Sbjct: 728 VSALEGQTVYLNELGVIFSDIKVLLSSFLGVSLSHVSRNFNVEAHRLAK 776

BLAST of HG10021303 vs. ExPASy TrEMBL
Match: A0A2Z6M8Z8 (RNase H domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_293380 PE=4 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 1.3e-08
Identity = 45/119 (37.82%), Postives = 69/119 (57.98%), Query Frame = 0

Query: 29  GFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLI 88
           GFGGVI +ESG  L+  +G  +G   +L AE  A+Y+GL LA+ +++  L+  S+SL  I
Sbjct: 145 GFGGVIRNESGFYLSGFSGFIQGSSDILLAELFAIYKGLTLAKNMAIDELVCYSDSLHCI 204

Query: 89  QMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSSGVSFLRL 148
            ++KG +  Y      I+DIK  +S   ++T  H  R  N  A+ LA+ G+S  S L +
Sbjct: 205 NLIKGPSIKYHVYVVLIQDIKELMSQ-SNITLCHTLREGNNCANFLAKLGASSDSDLTI 262

BLAST of HG10021303 vs. ExPASy TrEMBL
Match: A0A2Z6P5H6 (Uncharacterized protein OS=Trifolium subterraneum OX=3900 GN=TSUD_268670 PE=4 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 5.1e-08
Identity = 42/112 (37.50%), Postives = 66/112 (58.93%), Query Frame = 0

Query: 29   GFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLI 88
            GFGGVI +ESG  L+  +G  +G   +L AE  A+Y+GL LA+ +++  L+  S+ L  I
Sbjct: 1090 GFGGVIRNESGFYLSGFSGFIQGSSDILLAELFAIYKGLTLAKNMAIDELVCYSDYLHCI 1149

Query: 89   QMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSS 141
             ++KG +  Y   +  I+DIK  +S   ++T  H  R  N  A+ LA+ G+S
Sbjct: 1150 NLIKGPSIKYHVYAVLIQDIKELMSQ-SNITLCHTLREGNNCANFLAKLGAS 1200

BLAST of HG10021303 vs. ExPASy TrEMBL
Match: A0A7J6V620 (In chloroplast atpase biogenesis protein OS=Thalictrum thalictroides OX=46969 GN=FRX31_029867 PE=4 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 8.7e-08
Identity = 38/115 (33.04%), Postives = 59/115 (51.30%), Query Frame = 0

Query: 26  QGIGFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSL 85
           QG G+GG++    G  L        G  ++L  E  A+Y+G++LA+ L + N+LV ++S 
Sbjct: 77  QGGGYGGILRDSDGAALYAFTRQSRG-TTILCIELEAIYKGVQLAKELGIKNILVCADSK 136

Query: 86  SLIQMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSS 141
             I  + G   + W   + + +I      F  VTFRH  R +N+ A LLA  G S
Sbjct: 137 QAIDCINGYGPLQWRSRHLVAEIHAAKLGFEQVTFRHFFRETNRAADLLAALGES 190

BLAST of HG10021303 vs. ExPASy TrEMBL
Match: A0A2Z6P3A9 (RNase H domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_188980 PE=4 SV=1)

HSP 1 Score: 66.2 bits (160), Expect = 1.5e-07
Identity = 43/119 (36.13%), Postives = 67/119 (56.30%), Query Frame = 0

Query: 29  GFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLI 88
           GFGGVI +ESG  L+  +G  +G   +L AE   +Y+ L LA+ +++  L+  S+SL  I
Sbjct: 317 GFGGVIRNESGFYLSGFSGFIQGSSDILLAELFVIYKSLTLAKNMAIDELVCYSDSLHCI 376

Query: 89  QMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSSGVSFLRL 148
            ++KG +  Y   +  I+DIK  +S   ++T  H  R  N  A  LA+ G+S  S L +
Sbjct: 377 NLIKGPSIKYHVYAVLIQDIKELMSQ-SNITLCHTLREGNNCADFLAKLGASSDSDLTI 434

BLAST of HG10021303 vs. ExPASy TrEMBL
Match: A0A2Z6MP33 (RNase H domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_20510 PE=4 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 3.3e-07
Identity = 40/112 (35.71%), Postives = 67/112 (59.82%), Query Frame = 0

Query: 29  GFGGVIGSESGDILATLAGHREGLCSVLGAEATAVYEGLRLAERLSLSNLLVLSNSLSLI 88
           G+GGVI ++SG  L+  +G  +G   +L AE  A+Y+GL LA+ +++  L+  S+SL  I
Sbjct: 36  GYGGVIRNDSGFYLSGFSGFIQGSSDILLAELFAIYKGLTLAKNMAIDELVCYSDSLHYI 95

Query: 89  QMLKGTTGIYWEVSNYIEDIKCCLSAFRSVTFRHVSRSSNQTAHLLARDGSS 141
            ++KG +  Y   +  I+DIK  +S   ++T  H  R  +  A+ LA+ G+S
Sbjct: 96  NLIKGLSIKYHVHAVLIQDIKELMSQ-SNITLCHTLREGSNCANFLAKLGAS 146

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GAU28886.12.8e-0837.82hypothetical protein TSUD_293380 [Trifolium subterraneum][more]
KAF7802400.14.7e-0842.06uncharacterized protein G2W53_041511 [Senna tora][more]
GAU51664.11.0e-0737.50hypothetical protein TSUD_268670 [Trifolium subterraneum][more]
KAF5180544.11.8e-0733.04In chloroplast atpase biogenesis protein, partial [Thalictrum thalictroides][more]
XP_030483444.12.3e-0741.28uncharacterized protein LOC115700033 [Cannabis sativa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A2Z6M8Z81.3e-0837.82RNase H domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_2933... [more]
A0A2Z6P5H65.1e-0837.50Uncharacterized protein OS=Trifolium subterraneum OX=3900 GN=TSUD_268670 PE=4 SV... [more]
A0A7J6V6208.7e-0833.04In chloroplast atpase biogenesis protein OS=Thalictrum thalictroides OX=46969 GN... [more]
A0A2Z6P3A91.5e-0736.13RNase H domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_1889... [more]
A0A2Z6MP333.3e-0735.71RNase H domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_2051... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 29..137
e-value: 3.5E-19
score: 68.8
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 23..139
e-value: 1.7E-14
score: 56.0
NoneNo IPR availablePANTHERPTHR47723OS05G0353850 PROTEINcoord: 29..141
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 29..136
e-value: 6.80128E-12
score: 57.324
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 28..144

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10021303.1HG10021303.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity