Clc08G04357 (gene) Watermelon (cordophanus) v2

Overview
NameClc08G04357
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
LocationClcChr08: 12966006 .. 12967369 (+)
RNA-Seq ExpressionClc08G04357
SyntenyClc08G04357
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTAAATGAGTTAGATGTAGAAGTTAGTTTTTCTTAAGCAATATTAAGTATATTACATCTTGATTAATGCATATCATTAGCTTGCATAACTTTGCCTGACAAACAAAAGTAGTAAGCATGACTGTCTAGCTTGAGAGAGTGGATGGTTAATCTTTAGCATTTATCGAGACTAGATGAGGTGCAGAGATGTAGTTCATCTTTTGCCAACGCAAACTTACCATGCGTCTTAGAGATAAGATTAATGTGGCGTCGAAACATCGAGAGATGGATTCACAAATTAAGTTTGTAGGTATGTAATTATACTTGGATATGCTTAAGTAATCAATTAGCTTCTGCATTAACTCTACTTTTATCGCATGGTTCCATTCATTCTTGACGCATGCTATCTGACACTAAATGACATTTGACTAACCATCGCCGCGTCCACCTCATTGTTAGTTGTAGGAAATTTTATTCATTCTTTATGTATAGTTTTAGACACACAACCAATACATGCACATACTACATGTACAGTTCCCCGCATAGTCAATCATCTAATCTCAATGCCTAGTTATTACAAATCCCTGCGTTCGACCCTAGACTCACCAAGAAACCTATAAATGCTTATACTTGGGCTCTTTATAGGAAAACTTGCTTAATCATGATTACACTTGAGTAATTCTCGACGCATCCTTGTACGGAAATGATAAATCTATTTTTGTGCAACAAGCTAAGAAACTCAATTTAATTCAAATAGAACTTTAACAAACAGTTAGGATAAATTTTCTGCTATTGGCGAGCCTCTTTCTTACCGTGATCACTTGGTATACATCCTTGAAGGGTTGGGTAATGAATATAATTCATTTGTCACCTCTATTCAAAATCGCACTGATAGACCCTCCATTGCAGATGTTTGTAGTCTCGTGTTGGCTTATGAAGCTCGATTGGAAAAACAAACTTCAGTTGATCAGCTCAGTTTAATTCAAGCTAATGTTGCAAGCATGTCTGTCTCAAATAATTCTCGTTGCCCAAAATGGTCTCCATCAGGTAAGTCTTCTTCCTTTTCTCCTATGCAAGTACCTGGAACTATGCCATTTAATAGTGCCACTCTTACTGGTCTTGGTGTTCTTAGGTCCCCTCGTCCTTTTTTATCCTCCTATAAATTGTTCACCATGGCCTTCTCAAAATCGCCCTCCTAATCCTCAATGCCATATATGCTCCAAGTTTGGTCACACAACCTTGTCCTACTATAGCCTTCACAATCTATCCTTCTATACTTTTCTTCAAGCTCGCACTCCAACACATCATGCTGACTTTACTCAAATCGCTTCCCATGTTCCTTCATTTGCTCAACCTTTTGATATGTCTTACTCCAGTCCTTGA

mRNA sequence

ATGCTAAATGAGTTAGATGTAGAAGATAAATTTTCTGCTATTGGCGAGCCTCTTTCTTACCGTGATCACTTGGTATACATCCTTGAAGGGTTGGGTAATGAATATAATTCATTTGTCACCTCTATTCAAAATCGCACTGATAGACCCTCCATTGCAGATGTTTGTAGTCTCGTGTTGGCTTATGAAGCTCGATTGGAAAAACAAACTTCAGTTGATCAGCTCAGTTTAATTCAAGCTAATGTTGCAAGCATGTCTGTCTCAAATAATTCTCGTTGCCCAAAATGGTCTCCATCAGGTCCCCTCGTCCTTTTTTATCCTCCTATAAATTGTTCACCATGGCCTTCTCAAAATCGCCCTCCTAATCCTCAATGCCATATATGCTCCAAGTTTGGTCACACAACCTTGTCCTACTATAGCCTTCACAATCTATCCTTCTATACTTTTCTTCAAGCTCGCACTCCAACACATCATGCTGACTTTACTCAAATCGCTTCCCATGTTCCTTCATTTGCTCAACCTTTTGATATGTCTTACTCCAGTCCTTGA

Coding sequence (CDS)

ATGCTAAATGAGTTAGATGTAGAAGATAAATTTTCTGCTATTGGCGAGCCTCTTTCTTACCGTGATCACTTGGTATACATCCTTGAAGGGTTGGGTAATGAATATAATTCATTTGTCACCTCTATTCAAAATCGCACTGATAGACCCTCCATTGCAGATGTTTGTAGTCTCGTGTTGGCTTATGAAGCTCGATTGGAAAAACAAACTTCAGTTGATCAGCTCAGTTTAATTCAAGCTAATGTTGCAAGCATGTCTGTCTCAAATAATTCTCGTTGCCCAAAATGGTCTCCATCAGGTCCCCTCGTCCTTTTTTATCCTCCTATAAATTGTTCACCATGGCCTTCTCAAAATCGCCCTCCTAATCCTCAATGCCATATATGCTCCAAGTTTGGTCACACAACCTTGTCCTACTATAGCCTTCACAATCTATCCTTCTATACTTTTCTTCAAGCTCGCACTCCAACACATCATGCTGACTTTACTCAAATCGCTTCCCATGTTCCTTCATTTGCTCAACCTTTTGATATGTCTTACTCCAGTCCTTGA

Protein sequence

MLNELDVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARLEKQTSVDQLSLIQANVASMSVSNNSRCPKWSPSGPLVLFYPPINCSPWPSQNRPPNPQCHICSKFGHTTLSYYSLHNLSFYTFLQARTPTHHADFTQIASHVPSFAQPFDMSYSSP
Homology
BLAST of Clc08G04357 vs. NCBI nr
Match: XP_022155181.1 (uncharacterized protein LOC111022315 [Momordica charantia])

HSP 1 Score: 137.1 bits (344), Expect = 1.5e-28
Identity = 71/158 (44.94%), Postives = 103/158 (65.19%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           ++ DKF+A+GEPLSYRDHL ++L+GLG+EYN+FVTSI NR D PS+ DV SL+LAYEARL
Sbjct: 158 EIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARL 217

Query: 66  EKQTSVDQLSLIQANVASMSVSNNSR--CPKWS---------PSGPL------VLFYPPI 125
           +KQ +VDQL++ QAN+ ++S+ +NS+   PK+S         P+ P+       +   P 
Sbjct: 218 DKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQ 277

Query: 126 NCSPWPSQNRPPNPQCHICSKFGHTTLSYYSLHNLSFY 147
           +   WP +      QC IC K GH+    Y   N++++
Sbjct: 278 SVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYH 315

BLAST of Clc08G04357 vs. NCBI nr
Match: XP_022148871.1 (uncharacterized protein LOC111017438 [Momordica charantia])

HSP 1 Score: 134.8 bits (338), Expect = 7.4e-28
Identity = 85/185 (45.95%), Postives = 111/185 (60.00%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           ++  K S+IGEP+S +DH+ YI+EGLG EYN+FVTSIQNR+D  ++ DV +L+LAY+ RL
Sbjct: 27  EITGKLSSIGEPISLKDHISYIIEGLGIEYNAFVTSIQNRSDMXTLEDVRTLLLAYDYRL 86

Query: 66  EKQTSVDQLSLIQANVASMSV------SNNSRCP-KWSP------SGPLVLFYPPINCS- 125
           EKQ SVDQL+++QANVA++ +      + N R P + SP      + P +L  P  N   
Sbjct: 87  EKQNSVDQLNVVQANVANLQLNRQSGHTRNHRVPSQHSPRPFNFKTQPGLLGKPNQNSQP 146

Query: 126 PW-PSQNRPPNP--QCHICSKFGHTTLSYYSLHNLSF-YTFLQARTPTHHADFTQIASHV 173
           PW PSQ  P NP  QC IC K GHTT   Y   NL +   F     P  HA F  + S  
Sbjct: 147 PWPPSQQSPRNPKVQCQICFKLGHTTSKCYHRANLQYKLNFFPTNQPNPHALFHHVNSSS 206

BLAST of Clc08G04357 vs. NCBI nr
Match: XP_038887133.1 (uncharacterized protein LOC120077323 [Benincasa hispida])

HSP 1 Score: 129.4 bits (324), Expect = 3.1e-26
Identity = 61/90 (67.78%), Postives = 77/90 (85.56%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           DV D F+AIGEPLSYRDHL YILEGLG+EYN FV+SI NRT+RPSIADV +L++ Y++RL
Sbjct: 55  DVLDNFAAIGEPLSYRDHLSYILEGLGSEYNPFVSSIHNRTNRPSIADVRNLLITYDSRL 114

Query: 66  EKQTSVDQLSLIQANVASMSVSNNSRCPKW 96
           EKQT+ D L LIQANVA +S+++ +R P+W
Sbjct: 115 EKQTATDHLQLIQANVAHLSINSQNRHPQW 144

BLAST of Clc08G04357 vs. NCBI nr
Match: XP_038891713.1 (uncharacterized protein LOC120081111 [Benincasa hispida])

HSP 1 Score: 120.2 bits (300), Expect = 1.9e-23
Identity = 75/187 (40.11%), Postives = 103/187 (55.08%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           DV DKFS +GE +SYRDHL +IL+GLG+EYN+FVTSIQN  D  S+ DV SL+L+YEA+L
Sbjct: 26  DVADKFSVVGESISYRDHLTHILDGLGSEYNAFVTSIQNHVDNLSVEDVWSLLLSYEAQL 85

Query: 66  EKQTSVDQLSLIQANVASMSVSNNSR----------------CPKWSPSGP----LVLFY 125
           EKQ ++D L++ QA ++ +S  +NS+                 P +SP  P     V   
Sbjct: 86  EKQNAIDHLNIAQAYLSKLSFQHNSKRNALRPFPNHSSLILPSPNFSPILPSTSNSVHAR 145

Query: 126 PPINCSPWPSQNRPPN-PQCHICSKFGH--------TTLSYYSLHNLSFYTFLQARTPTH 164
           P  N   WP    P + PQC I  KFGH         + +Y  ++  +  + +Q  TPT 
Sbjct: 146 PSFN-KKWPPSKPPSSKPQCQIYHKFGHIVPQCHQLASFAYQCMNPQAHVSSVQPPTPTS 205

BLAST of Clc08G04357 vs. NCBI nr
Match: GFS33695.1 (hypothetical protein Acr_00g0030110 [Actinidia rufa])

HSP 1 Score: 107.8 bits (268), Expect = 9.7e-20
Identity = 63/149 (42.28%), Postives = 89/149 (59.73%), Query Frame = 0

Query: 9   DKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARLEKQ 68
           +  ++IGEP++Y DHL+Y L GLG +YN FVTSIQ++  RPSI +V SL+L+Y+ARLE+Q
Sbjct: 70  NSLASIGEPVTYTDHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQ 129

Query: 69  TSVDQLSLIQANVASMSVSNNSRCPKWSPSGPLVLFYPPINCSPWP-SQNR--------- 128
           ++ D LS +QAN+A+++       PK+    P    +P  N    P  QNR         
Sbjct: 130 SATDTLSSLQANLANLTYQK----PKF--KNPSTNSFPNSNSYSHPRGQNRNPSYSPNPS 189

Query: 129 --PPNPQCHICSKFGHTTLSYYSLHNLSF 146
              P P+C IC K GHT    Y   NL++
Sbjct: 190 SPKPRPRCQICLKPGHTANKCYHRTNLNY 212

BLAST of Clc08G04357 vs. ExPASy TrEMBL
Match: A0A6J1DQX7 (uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022315 PE=4 SV=1)

HSP 1 Score: 137.1 bits (344), Expect = 7.2e-29
Identity = 71/158 (44.94%), Postives = 103/158 (65.19%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           ++ DKF+A+GEPLSYRDHL ++L+GLG+EYN+FVTSI NR D PS+ DV SL+LAYEARL
Sbjct: 158 EIADKFAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARL 217

Query: 66  EKQTSVDQLSLIQANVASMSVSNNSR--CPKWS---------PSGPL------VLFYPPI 125
           +KQ +VDQL++ QAN+ ++S+ +NS+   PK+S         P+ P+       +   P 
Sbjct: 218 DKQNTVDQLNIAQANLVNLSLQHNSKRPPPKFSFPNHYKHSFPNSPISAAQSQSILGKPQ 277

Query: 126 NCSPWPSQNRPPNPQCHICSKFGHTTLSYYSLHNLSFY 147
           +   WP +      QC IC K GH+    Y   N++++
Sbjct: 278 SVHKWPPKPSSSKIQCQICGKLGHSAAVCYHRTNIAYH 315

BLAST of Clc08G04357 vs. ExPASy TrEMBL
Match: A0A6J1D6N7 (uncharacterized protein LOC111017438 OS=Momordica charantia OX=3673 GN=LOC111017438 PE=4 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 3.6e-28
Identity = 85/185 (45.95%), Postives = 111/185 (60.00%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           ++  K S+IGEP+S +DH+ YI+EGLG EYN+FVTSIQNR+D  ++ DV +L+LAY+ RL
Sbjct: 27  EITGKLSSIGEPISLKDHISYIIEGLGIEYNAFVTSIQNRSDMXTLEDVRTLLLAYDYRL 86

Query: 66  EKQTSVDQLSLIQANVASMSV------SNNSRCP-KWSP------SGPLVLFYPPINCS- 125
           EKQ SVDQL+++QANVA++ +      + N R P + SP      + P +L  P  N   
Sbjct: 87  EKQNSVDQLNVVQANVANLQLNRQSGHTRNHRVPSQHSPRPFNFKTQPGLLGKPNQNSQP 146

Query: 126 PW-PSQNRPPNP--QCHICSKFGHTTLSYYSLHNLSF-YTFLQARTPTHHADFTQIASHV 173
           PW PSQ  P NP  QC IC K GHTT   Y   NL +   F     P  HA F  + S  
Sbjct: 147 PWPPSQQSPRNPKVQCQICFKLGHTTSKCYHRANLQYKLNFFPTNQPNPHALFHHVNSSS 206

BLAST of Clc08G04357 vs. ExPASy TrEMBL
Match: A0A7J0E8R3 (Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_02g0010880 PE=4 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 4.7e-20
Identity = 63/149 (42.28%), Postives = 89/149 (59.73%), Query Frame = 0

Query: 9   DKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARLEKQ 68
           +  ++IGEP++Y DHL+Y L GLG +YN FVTSIQ++  RPSI +V SL+L+Y+ARLE+Q
Sbjct: 70  NSLASIGEPVTYTDHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQ 129

Query: 69  TSVDQLSLIQANVASMSVSNNSRCPKWSPSGPLVLFYPPINCSPWP-SQNR--------- 128
           ++ D LS +QAN+A+++       PK+    P    +P  N    P  QNR         
Sbjct: 130 SATDTLSSLQANLANLTYQK----PKF--KNPSTNSFPNSNSYSHPRGQNRNPSYSPNPS 189

Query: 129 --PPNPQCHICSKFGHTTLSYYSLHNLSF 146
              P P+C IC K GHT    Y   NL++
Sbjct: 190 SPKPRPRCQICLKPGHTANKCYHRTNLNY 212

BLAST of Clc08G04357 vs. ExPASy TrEMBL
Match: A0A7J0DER3 (Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_00g0030110 PE=4 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 4.7e-20
Identity = 63/149 (42.28%), Postives = 89/149 (59.73%), Query Frame = 0

Query: 9   DKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARLEKQ 68
           +  ++IGEP++Y DHL+Y L GLG +YN FVTSIQ++  RPSI +V SL+L+Y+ARLE+Q
Sbjct: 70  NSLASIGEPVTYTDHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQ 129

Query: 69  TSVDQLSLIQANVASMSVSNNSRCPKWSPSGPLVLFYPPINCSPWP-SQNR--------- 128
           ++ D LS +QAN+A+++       PK+    P    +P  N    P  QNR         
Sbjct: 130 SATDTLSSLQANLANLTYQK----PKF--KNPSTNSFPNSNSYSHPRGQNRNPSYSPNPS 189

Query: 129 --PPNPQCHICSKFGHTTLSYYSLHNLSF 146
              P P+C IC K GHT    Y   NL++
Sbjct: 190 SPKPRPRCQICLKPGHTANKCYHRTNLNY 212

BLAST of Clc08G04357 vs. ExPASy TrEMBL
Match: A5BPS3 (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_013628 PE=4 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 1.7e-17
Identity = 64/173 (36.99%), Postives = 88/173 (50.87%), Query Frame = 0

Query: 6   DVEDKFSAIGEPLSYRDHLVYILEGLGNEYNSFVTSIQNRTDRPSIADVCSLVLAYEARL 65
           +V DK+SA+GEPLSYRD L+Y L GL  EY+ FVTSI NR+D+ S+ +V SL+  Y   L
Sbjct: 92  EVFDKYSAMGEPLSYRDKLMYTLNGLTEEYDGFVTSIYNRSDKSSLKEVHSLLYTYAYWL 151

Query: 66  EKQTSVDQLSLIQANVASMSVSN------NSRCPKWS-------PSGPLVLFY------- 125
           E++ +  QL   Q N+ + S  N          PK+        P+ P+ L+        
Sbjct: 152 EQRNTAQQLQFSQVNLTAYSGQNKFHKNQQPNFPKYPQNSYSSFPNKPINLYQSFRPNSQ 211

Query: 126 ---------PPINCSPW----PSQNRPPNPQCHICSKFGHTTLSYYSLHNLSF 146
                     P + S W     + N  P PQC IC KFGH  L+ Y   NL++
Sbjct: 212 PSILGKPQGQPQSSSKWFQKQGTGNFGPRPQCQICGKFGHMALNCYHRANLNY 264

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022155181.11.5e-2844.94uncharacterized protein LOC111022315 [Momordica charantia][more]
XP_022148871.17.4e-2845.95uncharacterized protein LOC111017438 [Momordica charantia][more]
XP_038887133.13.1e-2667.78uncharacterized protein LOC120077323 [Benincasa hispida][more]
XP_038891713.11.9e-2340.11uncharacterized protein LOC120081111 [Benincasa hispida][more]
GFS33695.19.7e-2042.28hypothetical protein Acr_00g0030110 [Actinidia rufa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1DQX77.2e-2944.94uncharacterized protein LOC111022315 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
A0A6J1D6N73.6e-2845.95uncharacterized protein LOC111017438 OS=Momordica charantia OX=3673 GN=LOC111017... [more]
A0A7J0E8R34.7e-2042.28Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_02g0010880 PE=4 SV=1[more]
A0A7J0DER34.7e-2042.28Uncharacterized protein OS=Actinidia rufa OX=165716 GN=Acr_00g0030110 PE=4 SV=1[more]
A5BPS31.7e-1736.99Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_013628 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 9..67
e-value: 1.8E-7
score: 31.0
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 14..154
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 14..154

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc08G04357.1Clc08G04357.1mRNA