ClCG05G011820 (gene) Watermelon (Charleston Gray)

NameClCG05G011820
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionRetrotransposon protein, putative, unclassified
LocationCG_Chr05 : 14612026 .. 14613544 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACAACTCCTAAGGAAAAAGGTGGAATGAGTTTTTGTGATTTTCCTTTGTTTAATCAAGTGTTGTTGCCCAAGCATACATGGCGTAGTTTAAATCGCGACCAATCTTTGGTGACTCAAATTTTGAAGGCAAGGTACTTTCCTAATTCCCATTTCTTGCTTGTAGGATTGGGAGATCTGTCCCTTGTAGACTTGGCTTAACATTTTGTGGGGTAGAGACCTTGCCGTAAAAGGTGGCAGGTAGAAAGTTGGTGATGACCAATCAATCAATATTTACCATGATCAATGGATTCCTAGAGAGACCACCTTCAAGCCTCTTTCTTCTTCTCAATCTTTGAGTTATCCTTTTGCTAGTGGTTTGATTGCAAATGATGAATGGAATTTGAGTTTAATGGACAAGTTGCAAATGGTGATGCCTGAATCATCAAGCAGATTCCTCTTTATAATTCACCTGGTCCTAACGCCTTCTTCTAACACTATGATAAGAAAGGTATATATTTGGTTAAAAGTGGTTACAAGTTTGCTCAAGTGACTACTATTACTACGTCCTCCTCATCCCTTATGATGAGTAAGATGTGGAAGATGTTGTGGTCTTTGGAAATTCCACCAAAGTCAAAAAATTTCCTTTTAGAGGGTTTGTCTCAACACTTTTCCCACATGTGAGGTTCTTGTAGGTCGTCAGACGTTACATGATAGAAAGTGTCCTTTGTGTCAGCGAGTTTCTAAATCTAACATTCATGCTCTATGGTTTTGTAAGGGAAGTCCAAACATTTGGTCTTCTTGGATCCCTAGTTTGCTTCTTTTTTATATTCACTACTCTAATTATGCTTCTCTTTTTTGTAGGGCAATCAAGATTCTCTGTTCCAGGAACTTAGGATCTTTTCCCTTATCACGTCCTCAATTTGACACGAGCGTAACTTGTTGGTATTTGATAGCAAGCCTTCCAACTCCTTGGATCCAGTTGGTTGGACATTATCTATGGTTGATTCTTTTGATAGTTGGTCAAGATCTTTACTCATGGGTTTGGATGCGATGGATCCTTCTACCGTTGGGGATGATAAATGGAGGACCCCTCCTCCAGAAATTTTCAAGGTGAATTATGATGCAACTATTGGAAGCGAGGAGGTGGGCATTGAACTGTTATTAGTGATTCCAATGGCGAGACCCTGTTGGTTATGGAAGATCCAACCTTTCATTGGTTTAGTGGAGCTTCCTGAGGCTATTGCCCTCTATGAAGGCATGTCTAAGGCTTTGGAAGCTAGTATTTACCCTCTAAGGGCTGAGATTGATTCTTTGATTCCGTGGGGCCTTCTGATGAACTCATCTCAATATTCTAATGATATTCAGTATTTTGTGGATTCTTTACGTGCTTTACATCATTATGGAGCTATTTAAGGCTTCTATTTTGCTTAGTATAATTGCAATATTGTGGCTCACGAGCTTGTGTCTCACGCATGAAGATCTCGAATGTCGTCGATTTGGCTTGAAGATCATCCCCCTTGGGTGGTCTCTTTTTTAA

mRNA sequence

ATGACAACTCCTAAGGAAAAAGGTGGAATGAGTTTTTGTGATTTTCCTTTGTTTAATCAAGTGTTGTTGCCCAAGCATACATGGCGTAGTTTAAATCGCGACCAATCTTTGGTGACTCAAATTTTGAAGGCAAGGTACTTTCCTAATTCCCATTTCTTGCTTACTTGGCTTAACATTTTGTGGGGTAGAGACCTTGCCGTAAAAGGTGGCAGAGAGACCACCTTCAAGCCTCTTTCTTCTTCTCAATCTTTGAGTTATCCTTTTGCTAGTGGTTTGATTGCAAATGATGAATGGAATTTGAGTTTAATGGACAAGTTGCAAATGGTGATGCCTGAATCATCAAGCAGATTCCTCTTTATAATTCACCTGGTCCTAACGCCTTCTTCTAACACTATGATAAGAAAGCGTAACTTGTTGGTATTTGATAGCAAGCCTTCCAACTCCTTGGATCCAGTTGGTTGGACATTATCTATGGTTGATTCTTTTGATAGTTGGTCAAGATCTTTACTCATGGGTTTGGATGCGATGGATCCTTCTACCGTTGGGGATGATAAATGGAGGACCCCTCCTCCAGAAATTTTCAAGGTGAATTATGATGCAACTATTGGAAGCGAGGAGGTGGGCATTGAACTGTTATTAGTGATTCCAATGGCGAGACCCTGTTGGTTATGGAAGATCCAACCTTTCATTGGTTTAGTGGAGCTTCCTGAGGCTATTGCCCTCTATGAAGGCATGTCTAAGGCTTTGGAAGCTAGTATTTACCCTCTAAGGGCTGAGATTGATTCTTTGATTCCGTGGGGCCTTCTGATGAACTCATCTCAATATTCTAATGATATTCAGTATTTTGTGGATTCTTTACATCTCGAATGTCGTCGATTTGGCTTGAAGATCATCCCCCTTGGGTGGTCTCTTTTTTAA

Coding sequence (CDS)

ATGACAACTCCTAAGGAAAAAGGTGGAATGAGTTTTTGTGATTTTCCTTTGTTTAATCAAGTGTTGTTGCCCAAGCATACATGGCGTAGTTTAAATCGCGACCAATCTTTGGTGACTCAAATTTTGAAGGCAAGGTACTTTCCTAATTCCCATTTCTTGCTTACTTGGCTTAACATTTTGTGGGGTAGAGACCTTGCCGTAAAAGGTGGCAGAGAGACCACCTTCAAGCCTCTTTCTTCTTCTCAATCTTTGAGTTATCCTTTTGCTAGTGGTTTGATTGCAAATGATGAATGGAATTTGAGTTTAATGGACAAGTTGCAAATGGTGATGCCTGAATCATCAAGCAGATTCCTCTTTATAATTCACCTGGTCCTAACGCCTTCTTCTAACACTATGATAAGAAAGCGTAACTTGTTGGTATTTGATAGCAAGCCTTCCAACTCCTTGGATCCAGTTGGTTGGACATTATCTATGGTTGATTCTTTTGATAGTTGGTCAAGATCTTTACTCATGGGTTTGGATGCGATGGATCCTTCTACCGTTGGGGATGATAAATGGAGGACCCCTCCTCCAGAAATTTTCAAGGTGAATTATGATGCAACTATTGGAAGCGAGGAGGTGGGCATTGAACTGTTATTAGTGATTCCAATGGCGAGACCCTGTTGGTTATGGAAGATCCAACCTTTCATTGGTTTAGTGGAGCTTCCTGAGGCTATTGCCCTCTATGAAGGCATGTCTAAGGCTTTGGAAGCTAGTATTTACCCTCTAAGGGCTGAGATTGATTCTTTGATTCCGTGGGGCCTTCTGATGAACTCATCTCAATATTCTAATGATATTCAGTATTTTGTGGATTCTTTACATCTCGAATGTCGTCGATTTGGCTTGAAGATCATCCCCCTTGGGTGGTCTCTTTTTTAA

Protein sequence

MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHFLLTWLNILWGRDLAVKGGRETTFKPLSSSQSLSYPFASGLIANDEWNLSLMDKLQMVMPESSSRFLFIIHLVLTPSSNTMIRKRNLLVFDSKPSNSLDPVGWTLSMVDSFDSWSRSLLMGLDAMDPSTVGDDKWRTPPPEIFKVNYDATIGSEEVGIELLLVIPMARPCWLWKIQPFIGLVELPEAIALYEGMSKALEASIYPLRAEIDSLIPWGLLMNSSQYSNDIQYFVDSLHLECRRFGLKIIPLGWSLF
BLAST of ClCG05G011820 vs. TrEMBL
Match: J3L042_ORYBR (Uncharacterized protein OS=Oryza brachyantha PE=4 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 5.3e-09
Identity = 37/80 (46.25%), Postives = 47/80 (58.75%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHF-------- 60
           ++TPK  GGM F DF LFNQ +L K  WR +    SL +++LK RYFP + F        
Sbjct: 27  LSTPKFLGGMGFRDFVLFNQAMLGKQGWRLVTDPDSLCSRVLKGRYFPTTSFWDAAKPRS 86

Query: 61  -LLTWLNILWGRDLAVKGGR 72
              TW +IL+GRDL  KG R
Sbjct: 87  ASFTWRSILFGRDLLKKGVR 106

BLAST of ClCG05G011820 vs. TrEMBL
Match: Q9FW98_ORYSJ (Putative non-LTR retroelement reverse transcriptase OS=Oryza sativa subsp. japonica GN=OSJNBa0026L12.31 PE=4 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 9.1e-09
Identity = 35/80 (43.75%), Postives = 47/80 (58.75%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHF-------- 60
           ++TPK  GGM F +F  FNQ +L +  WR L    SL +++LK RYFPNS F        
Sbjct: 859 LSTPKFLGGMGFREFTTFNQAMLGRQCWRLLTDPDSLCSRVLKGRYFPNSSFWEAAQPKS 918

Query: 61  -LLTWLNILWGRDLAVKGGR 72
              TW ++L+GR+L  KG R
Sbjct: 919 PSFTWRSLLFGRELLAKGVR 938

BLAST of ClCG05G011820 vs. TrEMBL
Match: Q7XCU0_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os10g37170 PE=4 SV=2)

HSP 1 Score: 69.3 bits (168), Expect = 9.1e-09
Identity = 35/80 (43.75%), Postives = 47/80 (58.75%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHF-------- 60
           ++TPK  GGM F +F  FNQ +L +  WR L    SL +++LK RYFPNS F        
Sbjct: 816 LSTPKFLGGMGFREFTTFNQAMLGRQCWRLLTDPDSLCSRVLKGRYFPNSSFWEAAQPKS 875

Query: 61  -LLTWLNILWGRDLAVKGGR 72
              TW ++L+GR+L  KG R
Sbjct: 876 PSFTWRSLLFGRELLAKGVR 895

BLAST of ClCG05G011820 vs. TrEMBL
Match: B8AVA6_ORYSI (Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_14862 PE=4 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 9.1e-09
Identity = 38/80 (47.50%), Postives = 45/80 (56.25%), Query Frame = 1

Query: 1    MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHF-------- 60
            MTTPK  GGM F D  LFNQ +L +  WR +    SL  ++LK RYFPNS          
Sbjct: 1618 MTTPKSLGGMGFRDLGLFNQAMLARQGWRIVTDLVSLCARVLKGRYFPNSDLWNAPKPTA 1677

Query: 61   -LLTWLNILWGRDLAVKGGR 72
               TW +IL+GRDL  KG R
Sbjct: 1678 TSFTWRSILFGRDLLRKGVR 1697

BLAST of ClCG05G011820 vs. TrEMBL
Match: A3AR26_ORYSJ (Uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_13823 PE=4 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 1.2e-08
Identity = 38/80 (47.50%), Postives = 45/80 (56.25%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHF-------- 60
           MTTPK  GGM F D  LFNQ +L +  WR +    SL  ++LK RYFPNS          
Sbjct: 331 MTTPKSLGGMGFRDLGLFNQAMLARQGWRIVTDLVSLCARVLKGRYFPNSDLWNAPKPTA 390

Query: 61  -LLTWLNILWGRDLAVKGGR 72
              TW +IL+GRDL  KG R
Sbjct: 391 TSFTWPSILFGRDLLRKGVR 410

BLAST of ClCG05G011820 vs. NCBI nr
Match: gi|685317536|ref|XP_009149313.1| (PREDICTED: uncharacterized protein LOC103872633 [Brassica rapa])

HSP 1 Score: 74.7 bits (182), Expect = 3.1e-10
Identity = 44/115 (38.26%), Postives = 59/115 (51.30%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHFL------- 60
           +  PK++GGM F D  +FNQ LL K  WR L+   SL+   LK+RYFPN  FL       
Sbjct: 364 LCVPKDRGGMGFKDIEIFNQSLLAKQAWRILSSPSSLLGSFLKSRYFPNKSFLSAPLGSR 423

Query: 61  --LTWLNILWGRDLAVKGGRETTFKPLSSSQSLSYPFASGLIANDEWNLSLMDKL 107
               W +IL+GR+L VKG R      +    SLS   +  L+  D   + LM  +
Sbjct: 424 PSYAWRSILYGRELLVKGLRHM----VGDGCSLSVWSSPWLVDGDRMRIPLMKNI 474

BLAST of ClCG05G011820 vs. NCBI nr
Match: gi|985434686|ref|XP_015382715.1| (PREDICTED: uncharacterized protein LOC107175626 [Citrus sinensis])

HSP 1 Score: 73.9 bits (180), Expect = 5.3e-10
Identity = 73/310 (23.55%), Postives = 129/310 (41.61%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHFLLT----- 60
           M+  K +GG+ F D   FNQ L+ K  WR L     L+ ++L+A+Y+ +S FL T     
Sbjct: 89  MSQVKSRGGLGFRDLTSFNQALVAKQAWRLLQLPNLLLARVLQAKYYKHSLFLNTTVGSN 148

Query: 61  ----WLNILWGRDLAVKGGR-----ETTFKPLSSSQSLSYPFASGLIANDEWNLSLMDKL 120
               W +ILWGR +  K  +      +TF+  +      + +   L +N + N       
Sbjct: 149 LSFIWKSILWGRQVIEKARKTWNLAPSTFEHQNVPGQDIFSYIQNLCSNSKRN------- 208

Query: 121 QMVMPESSSRFLFIIHLVLTPSSNTMIRKRNLLVFDSKPSNSLDPVGWTLSMVDSFDSWS 180
                E+    ++           T+   RN L+F+ K    +       SM++++    
Sbjct: 209 -----EAELMIMYCW---------TIWYARNKLIFERKQIEPMFSAAKAESMLEAYHRVR 268

Query: 181 RSLLMGLDAMDPSTVGDDKWRTPPPEIFKVNYDATIG--SEEVGIELLLVIPMARPCWLW 240
           ++  + +   +   V   +W  PP  + K+N DA     +++VG+  +L     R     
Sbjct: 269 KAGTLHIS--NTREVSQQRWSPPPKNVLKLNVDAATNNKNQKVGLGAVLRDSNGRVVAAG 328

Query: 241 -KIQPFIGLVELPEAIALYEGMSKALEASIYPLRAEIDSLIPWGLLMNSSQYSNDIQYFV 294
            K   F   V   EA A+  G+  A EA+   L  E D      L+ N+    ++I + +
Sbjct: 329 IKQVSFRKDVSYAEAEAIQWGLQIAKEAAATSLIVETDCKDVAELVNNTKGSRSEIFWII 375

BLAST of ClCG05G011820 vs. NCBI nr
Match: gi|923822726|ref|XP_013694835.1| (PREDICTED: uncharacterized protein LOC106398878 [Brassica napus])

HSP 1 Score: 72.4 bits (176), Expect = 1.5e-09
Identity = 42/115 (36.52%), Postives = 61/115 (53.04%), Query Frame = 1

Query: 1   MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHFL------- 60
           +  PK+ GGM F D  LFNQ LL K  WR L+   SL+++ LK+RYFP+ +FL       
Sbjct: 618 LCVPKQLGGMGFKDIELFNQALLAKQAWRILSDQSSLLSRFLKSRYFPSGNFLSASMGSR 677

Query: 61  --LTWLNILWGRDLAVKGGRETTFKPLSSSQSLSYPFASGLIANDEWNLSLMDKL 107
               W +IL GR+L  KG R      + + Q+LS   +  ++  D   + LM  +
Sbjct: 678 PSYAWRSILHGRELLAKGLRHM----VGNGQTLSVWSSPWIVDGDGLRIPLMKNI 728

BLAST of ClCG05G011820 vs. NCBI nr
Match: gi|923682615|ref|XP_013654036.1| (PREDICTED: uncharacterized protein LOC106358780 [Brassica napus])

HSP 1 Score: 72.0 bits (175), Expect = 2.0e-09
Identity = 41/112 (36.61%), Postives = 59/112 (52.68%), Query Frame = 1

Query: 4   PKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHFL---------L 63
           PK++GGM F D   FNQ LL K  WR L+   SL+++ LK+RYFPN  FL          
Sbjct: 517 PKDRGGMGFKDIETFNQALLAKQAWRILSSPSSLISRFLKSRYFPNGSFLPASLGSRPSF 576

Query: 64  TWLNILWGRDLAVKGGRETTFKPLSSSQSLSYPFASGLIANDEWNLSLMDKL 107
            W +IL+GR+L  KG R      + +  S+S   +  L+  +   + LM  +
Sbjct: 577 AWRSILFGRELLSKGLR----LMVGNGSSISVWSSPWLVDGERMRIPLMKNI 624

BLAST of ClCG05G011820 vs. NCBI nr
Match: gi|923616930|ref|XP_013746214.1| (PREDICTED: uncharacterized protein LOC106448947 [Brassica napus])

HSP 1 Score: 72.0 bits (175), Expect = 2.0e-09
Identity = 77/329 (23.40%), Postives = 138/329 (41.95%), Query Frame = 1

Query: 1    MTTPKEKGGMSFCDFPLFNQVLLPKHTWRSLNRDQSLVTQILKARYFPNSHFLLT----- 60
            +T PK  GG+ F D  L+NQ LL K  WR + +   L+++IL+ +Y   + FL       
Sbjct: 725  LTLPKHLGGLGFRDVRLYNQALLAKIAWRLITKPDCLLSRILQGKYCHKTSFLKVTSAPS 784

Query: 61   ----WLNILWGRDLAVKG-------------------GRETTFKP----LSSSQSLSYPF 120
                W  ILWGRDL ++                    G  T  +P    L   Q L   F
Sbjct: 785  SSHGWKGILWGRDLLLRHLGKVIGNGENTRVWADNWIGTSTDLRPHGPVLPQDQDL---F 844

Query: 121  ASGLIAND--EWNLSLMDKLQMVMPESSSRFLFIIHLVLTPSSNTMIRKRNLLVFDSKP- 180
             + L++ +  EWN +L++KL   +P+ S   L +   + +   +T+  +    ++ ++  
Sbjct: 845  VADLLSRETKEWNRALVEKL---LPDLSDHILQLRPSLTSAPDSTIWTQTKNGIYSARSG 904

Query: 181  --SNSLDPVGWTLSMVD------SFDSWSRSLLMGLD------AMDPSTVGDDKWRTPPP 240
              +  +  +  T +++D        + WS +LL  L       A +    GD+  R    
Sbjct: 905  YHTTQVSRIQSTTTLLDRESWNWQKNIWSPNLLPKLKHFLWKCARNCLPTGDNLIRR--- 964

Query: 241  EIFKVNYDAT---IGSEEVGIELLLVIPMARPCWLWKIQPFIGLVELPEAIALYEGMSKA 275
                +N + T    G +E    L  + P A    +W+  P+    +  ++I+  E +  +
Sbjct: 965  ---GINRNTTCLRCGEQETLNHLFFICPFATK--VWETAPWTSTFDSSQSISFREELQTS 1024

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
J3L042_ORYBR5.3e-0946.25Uncharacterized protein OS=Oryza brachyantha PE=4 SV=1[more]
Q9FW98_ORYSJ9.1e-0943.75Putative non-LTR retroelement reverse transcriptase OS=Oryza sativa subsp. japon... [more]
Q7XCU0_ORYSJ9.1e-0943.75Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ... [more]
B8AVA6_ORYSI9.1e-0947.50Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_14862 PE=4... [more]
A3AR26_ORYSJ1.2e-0847.50Uncharacterized protein OS=Oryza sativa subsp. japonica GN=OsJ_13823 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|685317536|ref|XP_009149313.1|3.1e-1038.26PREDICTED: uncharacterized protein LOC103872633 [Brassica rapa][more]
gi|985434686|ref|XP_015382715.1|5.3e-1023.55PREDICTED: uncharacterized protein LOC107175626 [Citrus sinensis][more]
gi|923822726|ref|XP_013694835.1|1.5e-0936.52PREDICTED: uncharacterized protein LOC106398878 [Brassica napus][more]
gi|923682615|ref|XP_013654036.1|2.0e-0936.61PREDICTED: uncharacterized protein LOC106358780 [Brassica napus][more]
gi|923616930|ref|XP_013746214.1|2.0e-0923.40PREDICTED: uncharacterized protein LOC106448947 [Brassica napus][more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0016020 membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG05G011820.1ClCG05G011820.1mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None