Clc03G10300 (gene) Watermelon (cordophanus) v2

Overview
NameClc03G10300
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon 17.6
LocationClcChr03: 12761832 .. 12762095 (+)
RNA-Seq ExpressionClc03G10300
SyntenyClc03G10300
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGTCAAACGATCACCCAAAATGAAGAAACACGTAAGTCGCTGGAAGCGATGCTCCAGAACCAAATGGGTGAAATAAAAAGCCATGCCATTGCAATCCGAAATTTAGAACTTGAAGTGGGACAAATGGCAAGTGAGCGCAATACCAGACCCCAAGGAGCATTGCCTAGTAATATCGAAGCATCACGTGGTAACGGTAAGGAACAATGTCAAGCTGTGACATCGAGGAGTGGGAAGATTCTTCCCACTGAAACGCAAGATTAG

mRNA sequence

ATGCGTCAAACGATCACCCAAAATGAAGAAACACGTAAGTCGCTGGAAGCGATGCTCCAGAACCAAATGGGTGAAATAAAAAGCCATGCCATTGCAATCCGAAATTTAGAACTTGAAGTGGGACAAATGGCAAGTGAGCGCAATACCAGACCCCAAGGAGCATTGCCTAGTAATATCGAAGCATCACGTGGTAACGGTAAGGAACAATGTCAAGCTGTGACATCGAGGAGTGGGAAGATTCTTCCCACTGAAACGCAAGATTAG

Coding sequence (CDS)

ATGCGTCAAACGATCACCCAAAATGAAGAAACACGTAAGTCGCTGGAAGCGATGCTCCAGAACCAAATGGGTGAAATAAAAAGCCATGCCATTGCAATCCGAAATTTAGAACTTGAAGTGGGACAAATGGCAAGTGAGCGCAATACCAGACCCCAAGGAGCATTGCCTAGTAATATCGAAGCATCACGTGGTAACGGTAAGGAACAATGTCAAGCTGTGACATCGAGGAGTGGGAAGATTCTTCCCACTGAAACGCAAGATTAG

Protein sequence

MRQTITQNEETRKSLEAMLQNQMGEIKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEASRGNGKEQCQAVTSRSGKILPTETQD
Homology
BLAST of Clc03G10300 vs. NCBI nr
Match: WP_217833224.1 (hypothetical protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 78.6 bits (192), Expect = 3.0e-11
Identity = 45/81 (55.56%), Postives = 61/81 (75.31%), Query Frame = 0

Query: 14 SLEAMLQNQMGE----IKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEASRG---NG 73
          S+EA+L+  M +    +++ A +IRNL+L++GQ+A E  TR +G+LPSN EA RG   +G
Sbjct: 3  SMEALLKEYMQKNDALMQTQASSIRNLKLQLGQIAGEFKTRQKGSLPSNTEAPRGMGSSG 62

Query: 74 KEQCQAVTSRSGKILPTETQD 88
          KEQCQAVT RSGK+L TETQD
Sbjct: 63 KEQCQAVTLRSGKVLHTETQD 83

BLAST of Clc03G10300 vs. NCBI nr
Match: XP_022159060.1 (uncharacterized protein LOC111025500 [Momordica charantia])

HSP 1 Score: 72.0 bits (175), Expect = 2.8e-09
Identity = 33/80 (41.25%), Postives = 59/80 (73.75%), Query Frame = 0

Query: 4   TITQNEETRKSLEAMLQNQMGEIKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEASR 63
           T+ + E  ++++   ++N    ++S A+++RNLE++VGQ+A++  ++P+G LPS+I+  +
Sbjct: 204 TLVRVEIRKETMLKYMENNDTTVQSQAVSLRNLEMQVGQLATDLKSKPKGVLPSDIKVPK 263

Query: 64  GNGKEQCQAVTSRSGKILPT 84
            +GKEQC A+T RSGK LPT
Sbjct: 264 RDGKEQCNALTLRSGKTLPT 283

BLAST of Clc03G10300 vs. NCBI nr
Match: XP_017217165.1 (PREDICTED: uncharacterized protein LOC108194733 [Daucus carota subsp. sativus])

HSP 1 Score: 69.7 bits (169), Expect = 1.4e-08
Identity = 37/87 (42.53%), Postives = 57/87 (65.52%), Query Frame = 0

Query: 1   MRQTITQNEETRKSLEAMLQNQMGEIKSHAIAIRNLELEVGQMASERNTRPQGALPSNIE 60
           +++ I +NE +R  +EA++Q+Q       A ++RNLE +VGQ+A+E   RP G LPS+ E
Sbjct: 307 LKEYIIKNEASRSQIEALVQSQ-------AASLRNLENQVGQLANELRNRPHGTLPSDTE 366

Query: 61  ASRGNGKEQCQAVTSRSGKILPTETQD 88
             +G G E C+A+T +SGK+L   T D
Sbjct: 367 KPKGVGNEHCKAMTLKSGKVLGNTTND 386

BLAST of Clc03G10300 vs. NCBI nr
Match: XP_030507648.1 (uncharacterized protein LOC115722545 [Cannabis sativa])

HSP 1 Score: 68.9 bits (167), Expect = 2.4e-08
Identity = 35/72 (48.61%), Postives = 54/72 (75.00%), Query Frame = 0

Query: 14  SLEAMLQNQMGE----IKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEASRGNGKEQ 73
           SLE+++++ M +    I+S A ++RNLE+++GQ+A++   RPQG LPS+ E  R +GKE 
Sbjct: 320 SLESLMRDYMAKNDAVIQSQAASLRNLEVQLGQLANDLKNRPQGTLPSDTENPRRDGKEH 379

Query: 74  CQAVTSRSGKIL 82
           C+A+T RSGKIL
Sbjct: 380 CKAITLRSGKIL 391

BLAST of Clc03G10300 vs. NCBI nr
Match: XP_030509259.1 (uncharacterized protein LOC115723937 [Cannabis sativa])

HSP 1 Score: 68.6 bits (166), Expect = 3.1e-08
Identity = 35/72 (48.61%), Postives = 54/72 (75.00%), Query Frame = 0

Query: 14  SLEAMLQNQMGE----IKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEASRGNGKEQ 73
           SLE+++++ M +    I+S A ++RNLE+++GQ+A++   RPQG LPS+ E  R +GKE 
Sbjct: 389 SLESLMRDYMAKNDAVIQSQAASLRNLEVQLGQLANDLKNRPQGTLPSDTENPRRDGKEH 448

Query: 74  CQAVTSRSGKIL 82
           C+AVT RSGKI+
Sbjct: 449 CKAVTLRSGKII 460

BLAST of Clc03G10300 vs. ExPASy TrEMBL
Match: A0A6J1DXK5 (uncharacterized protein LOC111025500 OS=Momordica charantia OX=3673 GN=LOC111025500 PE=4 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.4e-09
Identity = 33/80 (41.25%), Postives = 59/80 (73.75%), Query Frame = 0

Query: 4   TITQNEETRKSLEAMLQNQMGEIKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEASR 63
           T+ + E  ++++   ++N    ++S A+++RNLE++VGQ+A++  ++P+G LPS+I+  +
Sbjct: 204 TLVRVEIRKETMLKYMENNDTTVQSQAVSLRNLEMQVGQLATDLKSKPKGVLPSDIKVPK 263

Query: 64  GNGKEQCQAVTSRSGKILPT 84
            +GKEQC A+T RSGK LPT
Sbjct: 264 RDGKEQCNALTLRSGKTLPT 283

BLAST of Clc03G10300 vs. ExPASy TrEMBL
Match: A0A6J1DTD1 (uncharacterized protein LOC111024136 OS=Momordica charantia OX=3673 GN=LOC111024136 PE=4 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 9.8e-08
Identity = 36/80 (45.00%), Postives = 54/80 (67.50%), Query Frame = 0

Query: 7   QNEETRKSLEAMLQNQMGE----IKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEAS 66
           Q+E +  SLE +++  M      ++     +RNLEL+VGQ+A++ N+RP GALPS+ E  
Sbjct: 226 QSEGSFASLEKLMKQYMANNDATVERQVSPLRNLELQVGQLATDLNSRPIGALPSDTEVP 285

Query: 67  RGNGKEQCQAVTSRSGKILP 83
           + +GKEQC+A+T  SGK LP
Sbjct: 286 KRDGKEQCKALTLGSGKALP 305

BLAST of Clc03G10300 vs. ExPASy TrEMBL
Match: A0A6J1DWK1 (uncharacterized protein LOC111025053 OS=Momordica charantia OX=3673 GN=LOC111025053 PE=4 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 1.3e-07
Identity = 36/80 (45.00%), Postives = 54/80 (67.50%), Query Frame = 0

Query: 7   QNEETRKSLEAMLQNQMGE----IKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEAS 66
           Q++ +  SLE +++  M      ++S A ++RNLEL+VGQ+A +  +RP GALPS+ E  
Sbjct: 243 QSKGSITSLENIMKQYMANNDATVQSQAASLRNLELQVGQLAMDLKSRPVGALPSDTEVP 302

Query: 67  RGNGKEQCQAVTSRSGKILP 83
           + + KEQC A+T RSGK LP
Sbjct: 303 KRDSKEQCNALTLRSGKALP 322

BLAST of Clc03G10300 vs. ExPASy TrEMBL
Match: A0A5B6X063 (Uncharacterized protein OS=Gossypium australe OX=47621 GN=EPI10_031377 PE=4 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 1.7e-07
Identity = 36/86 (41.86%), Postives = 55/86 (63.95%), Query Frame = 0

Query: 2   RQTITQNEETRKSLEAMLQNQMGEIKSHAIAIRNLELEVGQMASERNTRPQGALPSNIEA 61
           +Q + Q+  +  S+E +L+  +      A+++R LE +VGQ+AS  ++RPQGALP +IE 
Sbjct: 146 QQNVQQSSSSSSSMEVLLKEYI-RAMFQAVSLRALENQVGQIASALSSRPQGALPRDIEN 205

Query: 62  SRGNGKEQCQAVTSRSGKILPTETQD 88
           SR  GKE C+++T RSG  LP    D
Sbjct: 206 SRSQGKEHCKSITLRSGTQLPRVVND 230

BLAST of Clc03G10300 vs. ExPASy TrEMBL
Match: A0A6J1GJ68 (uncharacterized protein LOC111454344 OS=Cucurbita moschata OX=3662 GN=LOC111454344 PE=4 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 1.1e-06
Identity = 35/84 (41.67%), Postives = 56/84 (66.67%), Query Frame = 0

Query: 5   ITQNEETR-KSLEAMLQNQMGE----IKSHAIAIRNLELEVGQMASERNTRPQGALPSNI 64
           ITQ + T   S+E++++  M +    I+S   +++NLE++VGQ+A+E   RP G LP++ 
Sbjct: 31  ITQAQYTSGTSIESLIKEYMAKNDVVIQSQQASLQNLEVQVGQLATELRNRPLGKLPADT 90

Query: 65  EASRGNGKEQCQAVTSRSGKILPT 84
           E  +  GKEQCQA+  RSGK +P+
Sbjct: 91  ETPKREGKEQCQAIELRSGKKIPS 114

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
WP_217833224.13.0e-1155.56hypothetical protein, partial [Synechococcus sp. PCC 7002][more]
XP_022159060.12.8e-0941.25uncharacterized protein LOC111025500 [Momordica charantia][more]
XP_017217165.11.4e-0842.53PREDICTED: uncharacterized protein LOC108194733 [Daucus carota subsp. sativus][more]
XP_030507648.12.4e-0848.61uncharacterized protein LOC115722545 [Cannabis sativa][more]
XP_030509259.13.1e-0848.61uncharacterized protein LOC115723937 [Cannabis sativa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1DXK51.4e-0941.25uncharacterized protein LOC111025500 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A6J1DTD19.8e-0845.00uncharacterized protein LOC111024136 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
A0A6J1DWK11.3e-0745.00uncharacterized protein LOC111025053 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A5B6X0631.7e-0741.86Uncharacterized protein OS=Gossypium australe OX=47621 GN=EPI10_031377 PE=4 SV=1[more]
A0A6J1GJ681.1e-0641.67uncharacterized protein LOC111454344 OS=Cucurbita moschata OX=3662 GN=LOC1114543... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 45..87

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc03G10300.1Clc03G10300.1mRNA