ClCG05G010420 (gene) Watermelon (Charleston Gray)

Name: ClCG05G010420
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Retrotransposon protein, putative, unclassified
Location: CG_Chr05 : 11601576 .. 11605279 (-)
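The Location field packs chromosome, start, end, and strand into one string. As a minimal sketch of how it could be unpacked, the hypothetical helper `parse_location` below splits the string with a regular expression and derives the span length. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page.

```python
import re

# Pattern for strings such as "CG_Chr05 : 11601576 .. 11605279 (-)".
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand, length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("CG_Chr05 : 11601576 .. 11605279 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 3704 bp on the minus strand of CG_Chr05.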
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGTTTTTCAGACAACTGACTGATGTTGAGTCTATTTGTGATATGATTGCAAGGCATGTTACTGATCCTTGCTTTTGTGGTGAGCAAACTGATAATAAAGGAAAGGGGTTGATCCTTCCCACCCACATCCCAACTCATATTGGTCGGCAACCCCCTTATTCATCTTTTTGGAAGTTGAATGTTCATGCAGCTTGGAGCGCTAATACCAATTCAGGTGGGTGGGGTTGGGTGCTTCGTGACCATTTGGATCATGTGCGCTTAGTAGGGTTGAAGTTTGTTCCTAGATGCCAAAAGGTGAAGCTCCTTAAAGTTATGGCAATTTGTTTTGGGCTTAAGACTCTATCTTCTATAAGTATATCGAATACATTAGAGGGATCTGATTGTTTGGAGGTCATCATTCTATTGAATGATGTTGCTACAAATCTTTCTGAAATCTTCTTTTTTATCGAGGAGGCTAAGGATAGAGGTCGAGAGCTAGGAATCATCTCCTTTTCCCATGTCTGCCGTAGTTCGACTGTGTTGGCACACTGTGTTGCGCGTAAATCTTTGGAGGGTTGGGAGTCCTCAATTCTGTCTACTCCTTATCCTGAATGGTTTGCTTTGATGATGTCTCATGATATTTTGTGTAGTTGTTTTGGTTGTTTTTGCCTTTTTTCCTCAAAAGAAAAAAAAAAAAGAGCAAGTAAAGTGATCAAATATGGAGATAGAAACGTCCCAGATAATAGCTTTTATCGAAAGGTCTACCAAACTTAGTTGCTTTAATTCTTGAAAAAATTAACACAACAGCTAGATCTATAAGTTGTTTATCTGAATTTCCCCTTCCAAAGTCAATCTTTTCTTGCACCTAGTTTCTAATCTCTACATATTCCATCTAAATGTTTGTTTCCTATAATTTAATATAAAAACACATCCAAGCTTACAATTAACATATAAAAATAGAAAAAAATATGAGCCAAAGAAAACACTTTTCATATCACACCCTCTCCCAGATTACCCTCTTTAACCTGGAAGGAGATGTGAGGATAGCAAACTTCACCCTTTTTGTGAAGTTCACTGTTCATCTGTACCTGACTAGCCTCTTGAAAACATTTGTTTAACATAATCGTCCACAATGTATATTCCGAAATCATAAAACATTTAAATCCACCCGACTAATAATTATAGCAACATGATGATACATATTTATTTCAATAGAAAATCCAACTCGATTACAACCAAACTTAAACTACATACACATTATCCACTCTCACTTCTCAACAGCTAACTAGCTATGTCTACATGCCACATTACTCAAAATTATTTACATATACAACAAGTGTGATGGAAAGTATTCTTTGATGACTTGAAACTTACGCAGGATGAGGCTTAAGTAGGCTAAGCACCAAGAACTCTTTCGCTACCTGGATGGAATAGAAAACATGGAAAAACATGAGCTGGATGTCCAGTAAGTGACGTAATATTGAAAACACAACTTTAGAGCTACATCATGCTTAGTATTAGAGGAAACATAGATTATAACATAAAAATCAAGCTTAGGCAATGAAGAAAACATAAGCAATGCCATTTGATTAAATAAAATTTCTAAACTCATGAAACATTCCTTTGCATTAAAAAATACCCTTAGTTGAGGTGGAGCAAGCTCAACACACTCAAACGTCGAAAACAAACCTTAGTTGGGGAGAAGTAATCTCGACACACCCAACATCGAAAAACCTTAGTTGATGTGGAGTAATCTCAACACAATAGCTTCATTTCTCACTATGTGCACATAGGAACTATCTCGGCTGCCCAAGTACCTTTTCCATGTCCTTCAATACTGGGTCTCTAGGATACTCAAATGCCTTCTCAACGTCCTTCAGCATTGAGTTCCTTCATGTCATAGGGTTGCCCATATGCCTTCTCAACGTCCTTTAACATTGGGTCCCTTATAAAACATTTTAGAAAACAAGTATGCTTTGTACAAAAGTTCATCATAAAACATTTCATGAAAAATCAAA
TCATTTCTCAATAACCAGCCTCAGTTAAACATGCTTTTTAAAGACTTGAGAAAATAGATAAATCATGACTTTCATGCCATAAATTCAAGTATAATAAGTAATCATGCTTAACATTTAGCAAATCATCATAAAATCGAATTTCACAAGAAATCATGGCTAAACATTATAAATCAAATCCTTTTTAAAGAAAGACAAGTGTCACTCACAATCTGGGCCTGTTCCTCGAGGCTTATAGACTTTTCCAATCTCAATTTGACCTGAACTATCATTTACGAGCCATTAGATCATCTCCTTAAAGTCCTTGTAAACATCATTTAACTAAACTCCATCTAAATTCTCCATATTAAACTTAATGTCCACCTTTTTCATCATTTTTACCCCTTTAGCGGCCTTTTTGCAAATTTCTCAAAATTTTCTACAATATCCGAAAATCAGCTTAAAATTCATATTTACCCATTTTAAGGCCAATTATTCAAGAAAATATCAATTAGAGACCAAAAATTACACAAAATGGTCATAGGCACCCTAGATGGGCAATAGGTGGGCGAGGGGTGCTGGCAGGGGCGTGGGCGTGCTGCTGAGTGAATGGGCGACACTGGCGAGCGGAATTGTGGGCAGCAACTCCTGCTCACGAGCGTGGATGGTCGTATAACATGCGGATGGGGTGAGTATGTTGCACAGACGTTCAAGAGTGGGCGCTGAAAGCGCGCACGTATGGGGTTGGGGTGTGTGGGCGTGAGCAACAGCTCTGGGCAGCCTCTAGCTGACTTCGGCCAACAATCTCCTGTTTTCAACCTGATTTTTTAGAGCTCCAAAACACCTTTTCCTTTCAAATAAACTTCAAATAAAAATTATGGACTCAAGGATAACCTCTTAACTTACTTACTAAGAAAATTAAGCTTCAAAACCTACCTAAGCACCCTTGAAAAATTCCTCAAAGTTAACTCATTTGGTTTGCTCGGTTTCAGTGACAACCTTCGGTTTCTTCAAGTTTCCTCAAGCTTCCCAACCTCAAATCATTCTGGTAGCCTAGGTTAGATTCCTAACCTACAAAGCTCTTTTTGGTAAAAAAAACGACGAAAATTGCACGAGTGAATCAGGAGAATCGAAATTGTAATTGAGCTGCTTGCTTTTGGTCCAAGTTGCTGCCCAGTCCAGCAAATCACCTTTGGCCATAAAACCCTTTGCTTCAAAGTGGCTCCTCATCTTTTCTTCTCATCCTAGCCATTTAAATTAAATATTATTTGACTTTAAATAATCATTTTAATTGAATTTAATATTTATTCCTTAATTGAACATTTTCCCTAAATTAATTTCAACTTCTTAACGTACTTTGACTTGAGATTCTCCCCCAACACTTAGCAAAATTTTCCTAAGTCCAGCCAAAATTCCTTGTTCAACTACACCATTATCTCCATCTCTTTTATTCCTTATTAAAAGCTAATGGAAAAATCACAATTACTTAAACATGCACTAGAATTCCATCTACATTTCTACTCATTCAAGCCAACCCTAGCCAACCTTTAACCATGATTTTTAACTTGTGCTAATGCAAAGGAAAATAGAAATTAGGGCTCACATTTCAAGTGTTTTTCACTCAACACCACATGGGTGCGGCAATCTTTTATGTAAGTCAGCTCAGAGTATCTCGAGACTAGAGTAGGGAAAGGCAGGTCGGAGGTCAAGGCCACTGAGCACCTGTAG

mRNA sequence

ATGGTGTTTTTCAGACAACTGACTGATGTTGAGTCTATTTGTGATATGATTGCAAGGCATGTTACTGATCCTTGCTTTTGTGGTGAGCAAACTGATAATAAAGGAAAGGGGTTGATCCTTCCCACCCACATCCCAACTCATATTGGTCGGCAACCCCCTTATTCATCTTTTTGGAAGTTGAATGTTCATGCAGCTTGGAGCGCTAATACCAATTCAGGTGGGTGGGGTTGGGTGCTTCGTGACCATTTGGATCATGTGCGCTTAGTAGGGTTGAAGTTTGTTCCTAGATGCCAAAAGGTGAAGCTCCTTAAAGTTATGGCAATTTGTTTTGGGCTTAAGACTCTATCTTCTATAAGTATATCGAATACATTAGAGGGATCTGATTGTTTGGAGGTCATCATTCTATTGAATGATGTTGCTACAAATCTTTCTGAAATCTTCTTTTTTATCGAGGAGGCTAAGGATAGAGGTCGAGAGCTAGGAATCATCTCCTTTTCCCATGTCTGCCGTAGTTCGACTGTGTTGGCACACTGTGTTGCGCGTAAATCTTTGGAGGGTTGGGAGTCCTCAATTCTGTCTACTCCTTATCCTGAATGCTCAGAGTATCTCGAGACTAGAGTAGGGAAAGGCAGGTCGGAGGTCAAGGCCACTGAGCACCTGTAG

Coding sequence (CDS)

ATGGTGTTTTTCAGACAACTGACTGATGTTGAGTCTATTTGTGATATGATTGCAAGGCATGTTACTGATCCTTGCTTTTGTGGTGAGCAAACTGATAATAAAGGAAAGGGGTTGATCCTTCCCACCCACATCCCAACTCATATTGGTCGGCAACCCCCTTATTCATCTTTTTGGAAGTTGAATGTTCATGCAGCTTGGAGCGCTAATACCAATTCAGGTGGGTGGGGTTGGGTGCTTCGTGACCATTTGGATCATGTGCGCTTAGTAGGGTTGAAGTTTGTTCCTAGATGCCAAAAGGTGAAGCTCCTTAAAGTTATGGCAATTTGTTTTGGGCTTAAGACTCTATCTTCTATAAGTATATCGAATACATTAGAGGGATCTGATTGTTTGGAGGTCATCATTCTATTGAATGATGTTGCTACAAATCTTTCTGAAATCTTCTTTTTTATCGAGGAGGCTAAGGATAGAGGTCGAGAGCTAGGAATCATCTCCTTTTCCCATGTCTGCCGTAGTTCGACTGTGTTGGCACACTGTGTTGCGCGTAAATCTTTGGAGGGTTGGGAGTCCTCAATTCTGTCTACTCCTTATCCTGAATGCTCAGAGTATCTCGAGACTAGAGTAGGGAAAGGCAGGTCGGAGGTCAAGGCCACTGAGCACCTGTAG

Protein sequence

MVFFRQLTDVESICDMIARHVTDPCFCGEQTDNKGKGLILPTHIPTHIGRQPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLEGWESSILSTPYPECSEYLETRVGKGRSEVKATEHL
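
The protein sequence above corresponds to a frame-1 translation of the CDS. The sketch below illustrates that relationship with a hypothetical `translate` function built on the standard genetic code; it is an illustration, not the pipeline that produced this record. Unknown or ambiguous codons are reported as `X`, which is a choice made here for simplicity.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 bases and last 12 bases of the CDS listed above.
print(translate("ATGGTGTTTTTCAGACAACTGACTGATGTTGAG"))
print(translate("GAGCACCTGTAG"))
```

The two calls return `MVFFRQLTDVE` and `EHL`, matching the start and end of the listed protein sequence.
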
BLAST of ClCG05G010420 vs. TrEMBL
Match: B8AMK5_ORYSI (Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_12637 PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 2.3e-06
Identity = 52/175 (29.71%), Positives = 79/175 (45.14%), Query Frame = 1

Query: 16  MIARHVTDPCFCGEQ--TDNKGKGLILPT-HIPTHIGR-----QPPYSSFWKLNVHAAWS 75
           M  +H  +  F   Q   D KGKG+  P  H+   +       +PP   + KLNV  A+S
Sbjct: 358 MFLQHYAETLFMVRQKEVDLKGKGICQPNMHVRPSVPDSRTVWKPPPPGWVKLNVDGAFS 417

Query: 76  ANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSS-ISISNTLEG 135
           A    G  G ++R+      L   +FV RC + + ++++A C GLK  +  + +   LE 
Sbjct: 418 AEQGIGAIGIIIRNSEGKAILSSWRFVRRCAEAEEVELLACCEGLKLAAEWVPLPVELE- 477

Query: 136 SDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVAR 182
           SDC  VI  L     + S   F   E KD    +  +  SH  R    +AH +A+
Sbjct: 478 SDCTTVITRLKSKGEDRSRWAFLWRETKDVMSLVKEVRLSHCKRECNRVAHELAQ 531
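
Each HSP summary line reports counts as a fraction of the alignment length, followed by the same value as a percentage. A minimal sketch of that arithmetic, using a hypothetical helper `hsp_percentages` that reads every `label = n/m` pair from a summary line:

```python
import re

# Matches pairs such as "Identity = 52/175"; "Query Frame = 1" has no
# slash and is therefore skipped.
FRACTION_RE = re.compile(r"(\w+) = (\d+)/(\d+)")


def hsp_percentages(summary_line):
    """Return {label: percentage} for each 'label = n/m' pair, rounded to 2 places."""
    return {
        label: round(100 * int(count) / int(total), 2)
        for label, count, total in FRACTION_RE.findall(summary_line)
    }


line = "Identity = 52/175 (29.71%), Positives = 79/175 (45.14%), Query Frame = 1"
print(hsp_percentages(line))
```

For the first TrEMBL hit this reproduces the reported 29.71% identity and 45.14% positives.
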

BLAST of ClCG05G010420 vs. TrEMBL
Match: M7Z309_TRIUA (Bidirectional sugar transporter SWEET14 OS=Triticum urartu GN=TRIUR3_00534 PE=3 SV=1)

HSP 1 Score: 58.9 bits (141), Expect = 8.8e-06
Identity = 45/157 (28.66%), Positives = 69/157 (43.95%), Query Frame = 1

Query: 40  LPTHIPTHIGRQPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQK 99
           LP     H+ ++P Y    K+NV AA+ A+T SG  G V RD           F+P  + 
Sbjct: 362 LPMRPRDHMWKKPSYGMV-KVNVDAAFHADTLSGASGAVGRDDKGEFIAAASWFIPHVRD 421

Query: 100 VKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRE 159
           V   ++MAI  G+  LSSI  +     SDCL  +  L  +   +      + E K    +
Sbjct: 422 VDSAELMAIRNGIYLLSSIGCTKIEVESDCLFAVETLQSMDDYMGPDSAVVAECKKLSID 481

Query: 160 LGIISFSHVCRSSTVLA-----HCVARKSLEGWESSI 192
              +SF H  R +  +A     HC + +  + W+  I
Sbjct: 482 FNKLSFKHCYREANRVADELAKHCFSIRVPDSWDDVI 517

BLAST of ClCG05G010420 vs. NCBI nr
Match: gi|568876828|ref|XP_006491472.1| (PREDICTED: uncharacterized protein LOC102626455 [Citrus sinensis])

HSP 1 Score: 75.9 bits (185), Expect = 1.0e-10
Identity = 44/147 (29.93%), Positives = 76/147 (51.70%), Query Frame = 1

Query: 51   QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICF 110
            +PP  +  KLNV AA S      G G ++RD    +  VG+K     ++V L +  AI +
Sbjct: 1294 KPPSQNVLKLNVDAAVSTKDQKVGLGAIVRDAEGKILAVGIKQAQFRERVSLAEAEAIHW 1353

Query: 111  GLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCR 170
            GL+  + IS S+ +  SDC EV+ LLN+   + +EI + + + +   +E   + FS + R
Sbjct: 1354 GLQVANQISSSSLIVESDCKEVVELLNNTKGSRTEIHWILSDVRRESKEFKQVQFSFIPR 1413

Query: 171  SSTVLAHCVARKSLEGWESSILSTPYP 198
            +    AH +A+ +L    + +    +P
Sbjct: 1414 TCNTYAHALAKFALRNSSTDVWVGTFP 1440

BLAST of ClCG05G010420 vs. NCBI nr
Match: gi|985429304|ref|XP_015384077.1| (PREDICTED: uncharacterized protein LOC107176301 [Citrus sinensis])

HSP 1 Score: 71.2 bits (173), Expect = 2.5e-09
Identity = 42/138 (30.43%), Positives = 75/138 (54.35%), Query Frame = 1

Query: 44  IPTHIGRQPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLL 103
           +P H  + PP + F K+NV AA ++     G G V+RD  ++    G+      + V   
Sbjct: 215 VPQHKWKPPPKNVF-KVNVDAAINSKRQMAGLGAVIRDSENNFVATGIMQTGMKESVSYA 274

Query: 104 KVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGII 163
           +  AI +GLK     ++S  +  SDCLEV+ L+N+  +N + +++ IEE +++ R    +
Sbjct: 275 EAEAIDWGLKLARRAALSTLIIESDCLEVVELVNNTKSNKTGLWWIIEEIQNQKRTFCNV 334

Query: 164 SFSHVCRSSTVLAHCVAR 182
             +H+ R+  V AH +A+
Sbjct: 335 IVNHIPRTCNVCAHSLAK 351

BLAST of ClCG05G010420 vs. NCBI nr
Match: gi|985461851|ref|XP_015388559.1| (PREDICTED: uncharacterized protein LOC107178191 [Citrus sinensis])

HSP 1 Score: 68.6 bits (166), Expect = 1.6e-08
Identity = 41/141 (29.08%), Positives = 73/141 (51.77%), Query Frame = 1

Query: 51  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICF 110
           +P   +F K+NV AA ++     G G V++D    +   G+K VP  + V      A+ +
Sbjct: 106 EPLPGNFLKVNVDAAINSRNQVSGLGAVIKDPSGKIVAAGIKQVPLREGVSFADAEAMEW 165

Query: 111 GLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCR 170
           GLK     S+S  +  +DC EV+ LLN+   + + I + I + +++ R+   + F H+ R
Sbjct: 166 GLKVAKEFSLSAMIMETDCKEVVDLLNNTKGSRTGISWVISDIQEQQRDFKEVKFRHIPR 225

Query: 171 SSTVLAHCVARKSLEGWESSI 192
           +    AH +A+ +L    S+I
Sbjct: 226 TCNTCAHSLAKLALGANTSAI 246

BLAST of ClCG05G010420 vs. NCBI nr
Match: gi|985448646|ref|XP_015385738.1| (PREDICTED: uncharacterized protein LOC107177034 [Citrus sinensis])

HSP 1 Score: 67.0 bits (162), Expect = 4.7e-08
Identity = 42/146 (28.77%), Positives = 73/146 (50.00%), Query Frame = 1

Query: 52  PPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFG 111
           PP  + +K+NV AA + +    G G V+RD  + V++  +K       V+  +  A+ +G
Sbjct: 147 PPPDNTFKVNVDAAVNFDRQKAGLGVVIRDSSNKVKVAAVKSTLFTGDVQTAEAEAVEWG 206

Query: 112 LKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRS 171
           L    S +    +  SDC EV+ L+N+   + +EI + I E +   ++   IS+ H  RS
Sbjct: 207 LVVAKSAAFQCIMVESDCQEVVKLINNNEGSRTEIMWVISEIQSLSKDFQNISYYHTPRS 266

Query: 172 STVLAHCVARKSLEGWESSILSTPYP 198
               AH +A+ +L   E+ +   P P
Sbjct: 267 CNTHAHSLAKLALRNNETVVWLEPIP 292

BLAST of ClCG05G010420 vs. NCBI nr
Match: gi|985448644|ref|XP_015385737.1| (PREDICTED: uncharacterized protein LOC107177033 [Citrus sinensis])

HSP 1 Score: 67.0 bits (162), Expect = 4.7e-08
Identity = 42/146 (28.77%), Positives = 73/146 (50.00%), Query Frame = 1

Query: 52  PPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFG 111
           PP  + +K+NV AA + +    G G V+RD  + V++  +K       V+  +  A+ +G
Sbjct: 74  PPPDNTFKVNVDAAVNFDRQKAGLGVVIRDSSNKVKVAAVKSTLFTGDVQTAEAEAVEWG 133

Query: 112 LKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRS 171
           L    S +    +  SDC EV+ L+N+   + +EI + I E +   ++   IS+ H  RS
Sbjct: 134 LVVAKSAAFQCIMVESDCQEVVKLINNNEGSRTEIMWVISEIQSLSKDFQNISYYHTPRS 193

Query: 172 STVLAHCVARKSLEGWESSILSTPYP 198
               AH +A+ +L   E+ +   P P
Sbjct: 194 CNTHAHSLAKLALRNNETVVWLEPIP 219

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name     E-value   Identity (%)   Description
B8AMK5_ORYSI   2.3e-06   29.71          Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_12637 PE=4...
M7Z309_TRIUA   8.8e-06   28.66          Bidirectional sugar transporter SWEET14 OS=Triticum urartu GN=TRIUR3_00534 PE=3 ...

BLAST vs. NCBI nr
Match Name                         E-value   Identity (%)   Description
gi|568876828|ref|XP_006491472.1|   1.0e-10   29.93          PREDICTED: uncharacterized protein LOC102626455 [Citrus sinensis]
gi|985429304|ref|XP_015384077.1|   2.5e-09   30.43          PREDICTED: uncharacterized protein LOC107176301 [Citrus sinensis]
gi|985461851|ref|XP_015388559.1|   1.6e-08   29.08          PREDICTED: uncharacterized protein LOC107178191 [Citrus sinensis]
gi|985448646|ref|XP_015385738.1|   4.7e-08   28.77          PREDICTED: uncharacterized protein LOC107177034 [Citrus sinensis]
gi|985448644|ref|XP_015385737.1|   4.7e-08   28.77          PREDICTED: uncharacterized protein LOC107177033 [Citrus sinensis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term   Definition
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
ClCG05G010420.1   ClCG05G010420.1   mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description    Source    Source Term      Source Description    Alignment
None       No IPR available   PANTHER   PTHR33033        FAMILY NOT NAMED      coord: 51..199, score: 1.3
None       No IPR available   PANTHER   PTHR33033:SF19   SUBFAMILY NOT NAMED   coord: 51..199, score: 1.3
None       No IPR available   PFAM      PF13456          RVT_3                 coord: 61..182, score: 1.1

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None