HG10014501 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10014501
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Retrotrans_gag domain-containing protein
Location: Chr02: 12853109 .. 12853666 (+)
RNA-Seq Expression: HG10014501
Synteny: HG10014501
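
The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the 185-residue protein plus a stop codon listed under Sequences; the function name `feature_length` is illustrative and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (Chr02, plus strand).
gene_length = feature_length(12853109, 12853666)
print(gene_length)       # 558 nucleotides
print(gene_length // 3)  # 186 codons, including the stop codon
```
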
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCCGCTTGGAGACGATCCTCTAGTACCTCCTCGGAACAACGTTCAGCAAAATGGAGATCAGCAACCACAGCAACAACCTGCTATAGACCAACGAAACTCCATCTTCATAGCAGATGATAGAGATAGAGCAATCAGAGACTATGTTGCGCCTGCATTTCAAACTTTAGACACTGGCATCCTCGACCAACCAATAGATGCGCTCACATTTGAAATGAAACCGTTGATGTTTCAAATGTTGAACTCATTTGGTCAATTTCCCATACTACCCAATGAAGACCCCCACAAACATCTCAATCTTTTTATGAGAATGTGTCGTTATTGTAAAGTTAATGGTGTTTCTGAAGAGGCATTAAGATTTAAGTTGTTTCCTTACTCTTTGCAGGGAGACGCAGAAGCATGGCTTGATTCATTGCAGCCTAATTCCATCACCACATGGGATAACCTCGTGGACAAATTTATAGAAAAATACTTCTCACTAAACAAGAATACTAAGTACAGAGGTGATATCATCGCTTTCAGGCAAGCACCATCAGAATCTGTAGATGGAGCTTAG

mRNA sequence

ATGGATCCGCTTGGAGACGATCCTCTAGTACCTCCTCGGAACAACGTTCAGCAAAATGGAGATCAGCAACCACAGCAACAACCTGCTATAGACCAACGAAACTCCATCTTCATAGCAGATGATAGAGATAGAGCAATCAGAGACTATGTTGCGCCTGCATTTCAAACTTTAGACACTGGCATCCTCGACCAACCAATAGATGCGCTCACATTTGAAATGAAACCGTTGATGTTTCAAATGTTGAACTCATTTGGTCAATTTCCCATACTACCCAATGAAGACCCCCACAAACATCTCAATCTTTTTATGAGAATGTGTCGTTATTGTAAAGTTAATGGTGTTTCTGAAGAGGCATTAAGATTTAAGTTGTTTCCTTACTCTTTGCAGGGAGACGCAGAAGCATGGCTTGATTCATTGCAGCCTAATTCCATCACCACATGGGATAACCTCGTGGACAAATTTATAGAAAAATACTTCTCACTAAACAAGAATACTAAGTACAGAGGTGATATCATCGCTTTCAGGCAAGCACCATCAGAATCTGTAGATGGAGCTTAG

Coding sequence (CDS)

ATGGATCCGCTTGGAGACGATCCTCTAGTACCTCCTCGGAACAACGTTCAGCAAAATGGAGATCAGCAACCACAGCAACAACCTGCTATAGACCAACGAAACTCCATCTTCATAGCAGATGATAGAGATAGAGCAATCAGAGACTATGTTGCGCCTGCATTTCAAACTTTAGACACTGGCATCCTCGACCAACCAATAGATGCGCTCACATTTGAAATGAAACCGTTGATGTTTCAAATGTTGAACTCATTTGGTCAATTTCCCATACTACCCAATGAAGACCCCCACAAACATCTCAATCTTTTTATGAGAATGTGTCGTTATTGTAAAGTTAATGGTGTTTCTGAAGAGGCATTAAGATTTAAGTTGTTTCCTTACTCTTTGCAGGGAGACGCAGAAGCATGGCTTGATTCATTGCAGCCTAATTCCATCACCACATGGGATAACCTCGTGGACAAATTTATAGAAAAATACTTCTCACTAAACAAGAATACTAAGTACAGAGGTGATATCATCGCTTTCAGGCAAGCACCATCAGAATCTGTAGATGGAGCTTAG

Protein sequence

MDPLGDDPLVPPRNNVQQNGDQQPQQQPAIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
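
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS in frame 0. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the helper `translate` is hypothetical, and only the first seven and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
cds_start = "ATGGATCCGCTTGGAGACGAT"
cds_end = "GTAGATGGAGCTTAG"
print(translate(cds_start))  # MDPLGDD
print(translate(cds_end))    # VDGA
```

The two fragments reproduce the first seven and last four residues of the protein sequence; the final codon, TAG, is the stop codon and is not reported.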
Homology
BLAST of HG10014501 vs. NCBI nr
Match: XP_024027611.1 (uncharacterized protein LOC112093437 [Morus notabilis])

HSP 1 Score: 167.5 bits (423), Expect = 1.1e-37
Identity = 77/147 (52.38%), Positives = 101/147 (68.71%), Query Frame = 0

Query: 36  IFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDP 95
           + IA+DRDRAIRDY  P    L  GI+   I A  FE+KP+MFQML + GQF ++  +DP
Sbjct: 50  VVIAEDRDRAIRDYAIPMLDGLHPGIVRPEIQATKFELKPVMFQMLQTVGQFSVMITDDP 109

Query: 96  HKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFI 155
           H HL LF+ +C   K  GV+EEALR KLFPYSL+  A AWL+SL P+S+  W++L +KF+
Sbjct: 110 HMHLRLFIEVCEAFKAPGVTEEALRLKLFPYSLRDRARAWLNSLPPDSVANWNDLAEKFL 169

Query: 156 EKYFSLNKNTKYRGDIIAFRQAPSESV 183
            KYF  NKN K R DI +F+Q   E++
Sbjct: 170 VKYFPPNKNAKLRNDITSFQQLEGEAL 196

BLAST of HG10014501 vs. NCBI nr
Match: ERM93404.1 (hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda])

HSP 1 Score: 165.6 bits (418), Expect = 4.0e-37
Identity = 76/152 (50.00%), Positives = 104/152 (68.42%), Query Frame = 0

Query: 34  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNE 93
           N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF  +P E
Sbjct: 11  NPIILADDRARAIREYAAPMFNELNPGIVRPEIQAPQFELKPVMFQMLQTVGQFSGMPTE 70

Query: 94  DPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDK 153
           DPH HL  F+ +    K+ GVSEE LR KLFP+SL+  A +WL++L P+S+T W++L +K
Sbjct: 71  DPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFPFSLRDRARSWLNTLPPDSVTNWNDLAEK 130

Query: 154 FIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
           F+ KYF   +N K+R +I++F+Q   ES   A
Sbjct: 131 FLRKYFPPTRNAKFRSEIMSFQQLEDESTSDA 162

BLAST of HG10014501 vs. NCBI nr
Match: XP_017233063.1 (PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus])

HSP 1 Score: 164.9 bits (416), Expect = 6.8e-37
Identity = 77/146 (52.74%), Positives = 100/146 (68.49%), Query Frame = 0

Query: 37  FIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPH 96
           FI DD+DRAIR Y AP F+ L++GI+   I A  FE+KP+MFQML + GQF  +P EDPH
Sbjct: 53  FIVDDKDRAIRQYAAPRFEELNSGIIRPNIQATQFELKPVMFQMLQTIGQFSGMPTEDPH 112

Query: 97  KHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFIE 156
            HL LFM +    K  GV E+ALR KLFPYS++  A  WL+SL   S+TTW++L +KF+ 
Sbjct: 113 LHLRLFMEISDSFKFQGVPEDALRLKLFPYSVRDRARTWLNSLPAGSVTTWNDLTEKFLS 172

Query: 157 KYFSLNKNTKYRGDIIAFRQAPSESV 183
           KYF  N N K R +I +F+Q   ES+
Sbjct: 173 KYFPPNMNAKLRNEINSFQQQDDESL 198

BLAST of HG10014501 vs. NCBI nr
Match: XP_030497803.1 (uncharacterized protein LOC115713460 [Cannabis sativa])

HSP 1 Score: 164.5 bits (415), Expect = 8.9e-37
Identity = 77/157 (49.04%), Positives = 108/157 (68.79%), Query Frame = 0

Query: 29  AIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFP 88
           A ++ N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF 
Sbjct: 10  AHNEANPIALADDRTRAIREYAAPMFNELNPGIVRPEIQAPHFELKPVMFQMLQTVGQFG 69

Query: 89  ILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWD 148
             P EDPH H+  F+ +    K+ GVSEEALR KLFP+SL+  A AWL++L P+S+T W+
Sbjct: 70  GSPTEDPHLHIRSFLEVSDSFKLQGVSEEALRLKLFPFSLRDRARAWLNTLPPDSVTNWN 129

Query: 149 NLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
           +L +KF+ KYF   +N K+R +I++F+Q+  E+   A
Sbjct: 130 DLAEKFLRKYFPPTRNAKFRSEIMSFQQSEDETTSDA 166

BLAST of HG10014501 vs. NCBI nr
Match: XP_030508936.1 (uncharacterized protein LOC115723589 [Cannabis sativa])

HSP 1 Score: 162.5 bits (410), Expect = 3.4e-36
Identity = 76/155 (49.03%), Positives = 106/155 (68.39%), Query Frame = 0

Query: 31  DQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPIL 90
           ++ N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF   
Sbjct: 47  NEANPIALADDRARAIREYAAPMFNELNPGIVRPEIQAPHFELKPVMFQMLQTVGQFGGS 106

Query: 91  PNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNL 150
           P EDPH H+  F+ +    K+ GVSEEALR KLFP+SL+  A AWL++L P+S+T W++L
Sbjct: 107 PTEDPHLHIRSFLEVSDSFKLQGVSEEALRLKLFPFSLRDRARAWLNTLPPDSVTNWNDL 166

Query: 151 VDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
            +KF+ KYF   +N K+R +I++F+Q   E+   A
Sbjct: 167 AEKFLRKYFPPTRNAKFRSEIMSFQQLEDETTSDA 201

BLAST of HG10014501 vs. ExPASy TrEMBL
Match: U5CUI2 (Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMTR_s04947p00003620 PE=4 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.9e-37
Identity = 76/152 (50.00%), Positives = 104/152 (68.42%), Query Frame = 0

Query: 34  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNE 93
           N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF  +P E
Sbjct: 11  NPIILADDRARAIREYAAPMFNELNPGIVRPEIQAPQFELKPVMFQMLQTVGQFSGMPTE 70

Query: 94  DPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDK 153
           DPH HL  F+ +    K+ GVSEE LR KLFP+SL+  A +WL++L P+S+T W++L +K
Sbjct: 71  DPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFPFSLRDRARSWLNTLPPDSVTNWNDLAEK 130

Query: 154 FIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
           F+ KYF   +N K+R +I++F+Q   ES   A
Sbjct: 131 FLRKYFPPTRNAKFRSEIMSFQQLEDESTSDA 162

BLAST of HG10014501 vs. ExPASy TrEMBL
Match: A0A6J1E251 (uncharacterized protein LOC111025302 OS=Momordica charantia OX=3673 GN=LOC111025302 PE=4 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 1.7e-33
Identity = 82/187 (43.85%), Positives = 116/187 (62.03%), Query Frame = 0

Query: 1   MDPLGDDPLVPPRNNVQQNGDQQPQQ--QPAIDQRNSIFIADDRDRAIRDYVAPAFQTLD 60
           M+    DP  PP  N   NGD   ++      +  N I +AD+RD A+R+YV  AF  L+
Sbjct: 1   MNRNAQDP--PPPQNPPVNGDMAGEEAANRVGEIPNLILLADNRDVAMRNYVTHAFHNLN 60

Query: 61  TGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEA 120
           +GI +    A  FE+KP+MFQ+L + GQF  L NEDP+ HL  F+ +    ++ G SE+A
Sbjct: 61  SGINNPLPQAAQFELKPVMFQILQTMGQFGGLTNEDPYSHLKSFIEIANAFQLPGNSEDA 120

Query: 121 LRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAP 180
           LR K+FP+SL+  A  W+++L+PNSI TW  L DKF+ KY +L KN   R DI++FRQ  
Sbjct: 121 LRLKMFPFSLRDGARTWINALEPNSINTWAELTDKFLAKYHTLTKNADLREDIVSFRQKE 180

Query: 181 SESVDGA 186
           +E+V  A
Sbjct: 181 NEAVQEA 185

BLAST of HG10014501 vs. ExPASy TrEMBL
Match: A0A6J1H7E4 (uncharacterized protein LOC111461168 OS=Cucurbita moschata OX=3662 GN=LOC111461168 PE=4 SV=1)

HSP 1 Score: 149.1 bits (375), Expect = 1.9e-32
Identity = 68/152 (44.74%), Positives = 103/152 (67.76%), Query Frame = 0

Query: 34  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNE 93
           N+I +ADDR+RAIR Y  PA   L+  I+   + A TFE+KP+MFQML + GQF  LP+E
Sbjct: 31  NAIQLADDRERAIRAYAHPAVDELNPCIIRPEMQATTFELKPVMFQMLQTIGQFHGLPSE 90

Query: 94  DPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDK 153
           DPH HL  F+ +    +  GV ++ +R  LFPYSL+  A++WL++L P +I +W++L +K
Sbjct: 91  DPHLHLKSFLGVSDSFRFQGVDKDVIRLSLFPYSLRDGAKSWLNTLAPRTIDSWNSLAEK 150

Query: 154 FIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
           F+ KYF   +N ++R +I+AF+Q   E++  A
Sbjct: 151 FLIKYFPPTRNARFRNEIVAFQQFEDETLSEA 182

BLAST of HG10014501 vs. ExPASy TrEMBL
Match: A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 7.1e-32
Identity = 70/153 (45.75%), Positives = 104/153 (67.97%), Query Frame = 0

Query: 34  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQ-PIDALTFEMKPLMFQMLNSFGQFPILPN 93
           N I +AD RDRA+RDY A   + L++ +++  P DA  FE KP+M QMLN+ GQF  L +
Sbjct: 8   NPIHVADQRDRAMRDYAAXILEDLNSSVINSFPADA-KFEFKPMMLQMLNNIGQFGGLEH 67

Query: 94  EDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVD 153
           EDP  HL  F+++    ++ G+S++ALR  LFP+S+ G A AWL++   ++ITTW ++VD
Sbjct: 68  EDPRSHLKSFIKVANTFRLPGISDDALRLTLFPFSVSGQATAWLNAFPSDTITTWSDMVD 127

Query: 154 KFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
           KF+ KYF   +N   R +II+FRQ  +E+V+ A
Sbjct: 128 KFLVKYFPPTRNADVREEIISFRQKENEAVNVA 159

BLAST of HG10014501 vs. ExPASy TrEMBL
Match: A0A6J1DSZ5 (uncharacterized protein LOC111024107 OS=Momordica charantia OX=3673 GN=LOC111024107 PE=4 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.6e-31
Identity = 70/153 (45.75%), Positives = 102/153 (66.67%), Query Frame = 0

Query: 34  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQ-PIDALTFEMKPLMFQMLNSFGQFPILPN 93
           N I +AD +DRA+RDY A   + L++ +++  P DA  FE KP+M QMLN   QF  L +
Sbjct: 8   NPIHVADQKDRAMRDYAATILEDLNSSVMNPLPADA-QFEFKPMMLQMLNIICQFGGLEH 67

Query: 94  EDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVD 153
           EDP  HL  F+++   C++ G+S++ALR  LFP+SL G A AWL++    +ITTW ++VD
Sbjct: 68  EDPRSHLKSFIKVANTCRLPGISDDALRLTLFPFSLSGQATAWLNAFPSGTITTWSDMVD 127

Query: 154 KFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA 186
           KF+ KYF   +N   R +II+FRQ  +E+V+ A
Sbjct: 128 KFLVKYFPPTRNADVREEIISFRQKENEAVNVA 159

The following BLAST results are available for this feature:
Match Name      E-value  Identity (%)  Description
XP_024027611.1  1.1e-37  52.38         uncharacterized protein LOC112093437 [Morus notabilis]
ERM93404.1      4.0e-37  50.00         hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]
XP_017233063.1  6.8e-37  52.74         PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]
XP_030497803.1  8.9e-37  49.04         uncharacterized protein LOC115713460 [Cannabis sativa]
XP_030508936.1  3.4e-36  49.03         uncharacterized protein LOC115723589 [Cannabis sativa]
Match Name  E-value  Identity (%)  Description
U5CUI2      1.9e-37  50.00         Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMT...
A0A6J1E251  1.7e-33  43.85         uncharacterized protein LOC111025302 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1H7E4  1.9e-32  44.74         uncharacterized protein LOC111461168 OS=Cucurbita moschata OX=3662 GN=LOC1114611...
A0A6J1DY39  7.1e-32  45.75         uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1DSZ5  1.6e-31  45.75         uncharacterized protein LOC111024107 OS=Momordica charantia OX=3673 GN=LOC111024...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term  Source Description   Alignment
IPR005162  Retrotransposon gag domain  PFAM         PF03732      Retrotrans_gag       coord: 122..183; e-value: 2.8E-10; score: 40.3
None       No IPR available            MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..32
None       No IPR available            MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 12..32

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10014501.1  HG10014501.1  mRNA