ClCG01G004140.1 (mRNA) Watermelon (Charleston Gray)

Name: ClCG01G004140.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Unknown protein
Location: CG_Chr01 : 4482845 .. 4485833 (+)
Sequence length: 273
The following sequences are available for this feature:

Gene sequence (with intron)

AACGGAGAAGTGAATAAACCGGTTACAGTTCCTCCTCCCTTCACTAAAATCCTCGGCTTCAACCTTGCGATTCTCTCGCTCTCTGATCGACAAATGCCTGTAAGTTTCTTCAACTTTAAGATCCAGATCTTGTAGAACATTAAATCAATTGAAATTCTCAAATTCTAGGGTTCTTCTTTTTCTATAAATTTCGTTTTCTTCTGCAATGGAAATCCTTAATTTCTGATCCGGGAAATTTGATTCCGTTCGTCGCAGTTCAGGACGTTCATTGAAGTGGAACCGCCCAGTCCACTCCGATACATAATTGGGGCTGTCATAATGATGATCGGAGTCGTATTGCCCCTTGGATACATGCTGTTCCGGAATAAGCGTGGACCTTCTTCTTCTTCTTACTCCAAACAGACGTAGGTGTGCCTATGGAATTTACTTGCCGTTTTAGTTTATCAGTTCTTTCCTCTTGATCTGTTGCGTTATCTGTCTAATTTAATCATTACTGTGATCCGTTTATGACTGGAAATCAAGGTCTTCTCCAGGTGTTTTAAATGAAATTGGATTTGTTCTTTAATTGTTCATTGACCTGATTTATGGACTTTCGGAGCTCGGTTTCACTGGAGGAAATTGGAATTGAGCATTTTTTTTCCTTCTAGCGTTAGTTATTAGGATTCTGCGATCTGAATGCACCTTGTAAATTCCCAAATGAGGTTTTGGTGAGATTTCTGTCTTAAAGGTGAAGGGGATTTCTCGTTTTGCGCTGAATTTTTTCTTAGGGATGTTGAGAAAAGAATGATTATGCAGAGTATATGAAAGCAAAATACTTGTCTCTAGCGTAGGAGCTCCTTGTTTTCTTAGCTCAATGGTTCCGGTTGTTTAGGCTCTCTTCCTGAACTTCTTCTGGAAATATCAGAAATCTGGCAAATGATCATGATCCTAGACGAGTTTTCTGGATGGAAGTAGAAGGAGGGTGGGATATTTTGAGTAACTAATTAGTATTTGGATGACCAGGATATTGTGAATTCGTATTTGTATTTTAGATATAATTCGAGTTGGCACATGTTGAAAGTTCGAAAGTTTCAGTGGATGCATTTAGTCATTCAACTTACAAATCATTGGACTTCTTTTTTAAGATCTTCTTTCAGAATTATGGATGGAAGGAATCTGAGTCTTTGAGAGCAAATACCTTCCTTGGTCTGATCGATTTGAGTTTGCTAAACTGAAAGCCTCCCCCTGGCTCTCTCCAAGTCTTTTTTTTGGTTAGATATTAATTTAAATTTGGATGCCTTCATTTTGTGTTAACAATCTTTGCAATTATTTTGCGTTGTTGTTGTTTTAGTTTTAGTCACTCCCTTGTGGAGTTTGTATCATGTATCTTTTGAGCATTACTCTCTTTCACTTCTTGCTTAACAAAAATTCAAAATAGATCATTTCTCCTTTCCTTCGATCCTGCAATATTCTCTTCTCTCTCCCCGTTGCTGCATAAGTTTATCATCTGCTGGTACCTCTATTATTAGCGTTGCCCTCTTGATGGAAAGGAAGTGTTAAAAAGTGGGTGGTAATGGAAGGAGAGGCATAGTGTGGGTGGGACTTTCATCTTCCTTATAATAAATGGTTGTGGAGAGTTGATTAGAACATTCAATTTGCAAAATGTAATACTTCAAGTGTCAACTGTACTTTCATGATGAATCGTAAAACACTCTTTTCTATAGACTATATCTCTATTTTTAAGGTGAAATGTAAAATTTATATGATAGTACCCCCCTTGTTCTTTCTACTCGGTCTATTCCCATTTAACTTCCTTTTTTTTTTTGTTTCCTTTATGAAATCTTAGTTCTTGTATCCCTGTTTTAGTTTGAGTTGAACAACGTTGGAAATTGTACAGACATGATGCATATGTAATTAATGTAGGCATTTCCTATCCCATAAAACATAAAGCTCTCTCCTTTTTTATGGAGGAGTATCATTTTGGATTTTTAAGGGAAGAAGATTCTCCTGTGTGTTGGACG
TTGGGAGATGACCTCTAGGACTAAGATATTGCTCACCTATCTGTGTCAATTGAGAGCTATCCGCGTAAAAGTTTAATGAAAGGACTAAACTGCAGTTTAAAGAAGGGTAGTGTAAGTGATTGTAAGCAAATCATAGATAGGAGCAAGTTAACGAGCTCGATCTTTCGTTGTACATTGCTTTGTTCTGAGCAACACTAATGGAAAATTATTCTGAGCAACACTAATGGAAAATCTGCAATTTGAAATTTCAGGACCAAAGTTTTGATTTAGTTACATAGAGGAAGAGATGCTTCTATTTCAGTTTTGGGCTCTCAGCTAATGGAGGATGATGCTATTCACAGGGTAATGTAAATGCATTGAGATTTCAAATTTCTGTGGAAACTGACATGATTATAGTCCAATTATCGACCTTAAGGCGACACATTTTGTCGTCATCAATGTCATACAACTGTATTTCATCATTTTGTATCGTTGAAGGAATTATGTGTGATTGCTGTTGATGACCCGCATTCAGTATTATGAAGATACGACCACAGAGCCATGAAATTACCATTCGTTGGTCCTTAAAAGTTGAAGTTCTCTATCCTTTCAGTGACTTTGAATTGAAAGTTTTAGGTGTAGTCGAATTATGATGAGAAACCCATCATTGACAAGATCAAGTTCAGTTTTCTTTCGTTCTTTGCTTAAGAAGAATCATTCCAATGATCAATATCCAAACAAATATCATAGGGTGCAGAAATGATTGTTTATAGAAAGATGAGTAAGGGAATGAAAGCAGCAACTCATATTAGTATATTTGTGGTTAAAATTGAAGTGAATAGGCAAAGAAGTTATTCAAGTCAAAGTGTCGTTGAGAATTATTTCCACCTTCCCAGTGCTTGTCCAGCACGGATCCTGTCTCCACGTTTAATAGAAAACTTGAAGTCTGATTTGGAGTTCCCATTTTCGTGTAAATTAGAAACAGGAGCTTGGAAGATCAATACCACCGT

mRNA sequence

AACGGAGAAGTGAATAAACCGGTTACAGTTCCTCCTCCCTTCACTAAAATCCTCGGCTTCAACCTTGCGATTCTCTCGCTCTCTGATCGACAAATGCCTTTCAGGACGTTCATTGAAGTGGAACCGCCCAGTCCACTCCGATACATAATTGGGGCTGTCATAATGATGATCGGAGTCGTATTGCCCCTTGGATACATGCTGTTCCGGAATAAGCGTGGACCTTCTTCTTCTTCTTACTCCAAACAGACGAGCTTGGAAGATCAATACCACCGT

Coding sequence (CDS)

AACGGAGAAGTGAATAAACCGGTTACAGTTCCTCCTCCCTTCACTAAAATCCTCGGCTTCAACCTTGCGATTCTCTCGCTCTCTGATCGACAAATGCCTTTCAGGACGTTCATTGAAGTGGAACCGCCCAGTCCACTCCGATACATAATTGGGGCTGTCATAATGATGATCGGAGTCGTATTGCCCCTTGGATACATGCTGTTCCGGAATAAGCGTGGACCTTCTTCTTCTTCTTACTCCAAACAGACGAGCTTGGAAGATCAATACCACCGT

Protein sequence

NGEVNKPVTVPPPFTKILGFNLAILSLSDRQMPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQTSLEDQYHR
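
The protein above should be recoverable by translating the CDS in frame 1. The Python sketch below is one way to run that check. The names `translate`, `CDS` and `PROTEIN` are illustrative and not part of the database. The codon table is the standard nuclear genetic code, which is an assumption because the page does not state which table was used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the page does not say which table produced the protein.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA sequence in frame 1; unrecognised codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# CDS and protein copied from this record.
CDS = "AACGGAGAAGTGAATAAACCGGTTACAGTTCCTCCTCCCTTCACTAAAATCCTCGGCTTCAACCTTGCGATTCTCTCGCTCTCTGATCGACAAATGCCTTTCAGGACGTTCATTGAAGTGGAACCGCCCAGTCCACTCCGATACATAATTGGGGCTGTCATAATGATGATCGGAGTCGTATTGCCCCTTGGATACATGCTGTTCCGGAATAAGCGTGGACCTTCTTCTTCTTCTTACTCCAAACAGACGAGCTTGGAAGATCAATACCACCGT"
PROTEIN = "NGEVNKPVTVPPPFTKILGFNLAILSLSDRQMPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQTSLEDQYHR"

protein = translate(CDS)
print(len(CDS), len(protein))       # nucleotides, residues
print(protein == PROTEIN)           # does the translation match the record?
print(protein.index("M") + 1)       # 1-based position of the first methionine
```

The last line reports where the first methionine sits in the translated sequence, which is useful when comparing against the query coordinates in the BLAST alignments listed in this record.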
BLAST of ClCG01G004140.1 vs. TrEMBL
Match: A0A0A0KPA4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G162600 PE=4 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 4.2e-18
Identity = 50/53 (94.34%), Positives = 50/53 (94.34%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP-SSSSYSKQT 84
          MPF TFIEVEPPSPLRYI GAVIMMIGVVLPLGYMLFRNKRGP SSSSYSKQT
Sbjct: 1  MPFSTFIEVEPPSPLRYIFGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53
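
The Identity figure of an HSP is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length with gap columns included. As a rough illustration, the Python sketch below recomputes it from the two aligned strings of this HSP. `percent_identity` is a hypothetical helper written for this example, not a BLAST function.

```python
def percent_identity(query, subject):
    """Return (identical columns, alignment length, percent identity).

    Both strings must be the aligned rows of one HSP, with '-' marking gaps.
    A column counts as identical only if both residues match and neither is a gap.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = sum(
        1 for q, s in zip(query, subject) if q == s and q != "-"
    )
    length = len(query)
    return identical, length, round(100.0 * identical / length, 2)


# Aligned rows copied from the HSP against A0A0A0KPA4_CUCSA.
QUERY = "MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP-SSSSYSKQT"
SBJCT = "MPFSTFIEVEPPSPLRYIFGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT"

identical, length, pct = percent_identity(QUERY, SBJCT)
print(f"Identity = {identical}/{length} ({pct}%)")
```

The Positives figure cannot be recomputed this way, because it also depends on the substitution matrix used for the search, which this page does not name.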

BLAST of ClCG01G004140.1 vs. TrEMBL
Match: K7MLL8_SOYBN (Uncharacterized protein OS=Glycine max PE=4 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 5.4e-18
Identity = 48/71 (67.61%), Positives = 57/71 (80.28%), Query Frame = 1

Query: 16  KILGFNLAILSLSDRQ---MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKR 75
           ++  FN   LSLS+R    M  R+ +EVEPPSPLRY+IGA +MMIGVVLP+GYM+FRNKR
Sbjct: 51  RVFNFNFGSLSLSNRNVVAMQIRSLVEVEPPSPLRYLIGAAVMMIGVVLPVGYMMFRNKR 110

Query: 76  GPSSSSYSKQT 84
            PSSSSYSKQT
Sbjct: 111 VPSSSSYSKQT 121

BLAST of ClCG01G004140.1 vs. TrEMBL
Match: A0A0D2PL38_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G234300 PE=4 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 1.6e-17
Identity = 45/53 (84.91%), Positives = 50/53 (94.34%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQTS 85
          MPF+T IEVEPPSPLRYIIGA +MMIGVVLP+GYM+FRNKR PSSSSYSKQT+
Sbjct: 1  MPFKTMIEVEPPSPLRYIIGAAVMMIGVVLPVGYMMFRNKRVPSSSSYSKQTN 53

BLAST of ClCG01G004140.1 vs. TrEMBL
Match: A0A0D2SXG3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G234300 PE=4 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 2.1e-17
Identity = 45/52 (86.54%), Positives = 49/52 (94.23%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQT 84
          MPF+T IEVEPPSPLRYIIGA +MMIGVVLP+GYM+FRNKR PSSSSYSKQT
Sbjct: 1  MPFKTMIEVEPPSPLRYIIGAAVMMIGVVLPVGYMMFRNKRVPSSSSYSKQT 52

BLAST of ClCG01G004140.1 vs. TrEMBL
Match: A0A061G4H5_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_015942 PE=4 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 4.6e-17
Identity = 45/53 (84.91%), Positives = 49/53 (92.45%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQTS 85
          MPFR  IEVEPPSPLRYIIGA IMM+GVVLP+GYM+FRNKR PSSSSYSKQT+
Sbjct: 1  MPFRRMIEVEPPSPLRYIIGAAIMMLGVVLPVGYMMFRNKRVPSSSSYSKQTN 53

BLAST of ClCG01G004140.1 vs. TAIR10
Match: AT4G16695.1 (AT4G16695.1 unknown protein)

HSP 1 Score: 88.2 bits (217), Expect = 2.8e-18
Identity = 41/53 (77.36%), Positives = 46/53 (86.79%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQTS 85
          MPF+T IEVEPPS LRY+IG+ +MMIGVVLP+GYM+FRNKR P SSSYSKQT+
Sbjct: 1  MPFKTVIEVEPPSLLRYLIGSAVMMIGVVLPVGYMMFRNKRVPFSSSYSKQTN 53

BLAST of ClCG01G004140.1 vs. NCBI nr
Match: gi|659073438|ref|XP_008437060.1| (PREDICTED: uncharacterized protein LOC103482600 isoform X1 [Cucumis melo])

HSP 1 Score: 100.5 bits (249), Expect = 1.6e-18
Identity = 51/54 (94.44%), Positives = 52/54 (96.30%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP-SSSSYSKQTS 85
          MPF TFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP SSSSYSKQT+
Sbjct: 1  MPFSTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQTT 54

BLAST of ClCG01G004140.1 vs. NCBI nr
Match: gi|659073440|ref|XP_008437061.1| (PREDICTED: uncharacterized protein LOC103482600 isoform X2 [Cucumis melo])

HSP 1 Score: 100.1 bits (248), Expect = 2.1e-18
Identity = 51/53 (96.23%), Positives = 51/53 (96.23%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP-SSSSYSKQT 84
          MPF TFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP SSSSYSKQT
Sbjct: 1  MPFSTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53

BLAST of ClCG01G004140.1 vs. NCBI nr
Match: gi|778700122|ref|XP_011654818.1| (PREDICTED: uncharacterized protein LOC101203605 isoform X1 [Cucumis sativus])

HSP 1 Score: 99.0 bits (245), Expect = 4.6e-18
Identity = 50/54 (92.59%), Positives = 51/54 (94.44%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP-SSSSYSKQTS 85
          MPF TFIEVEPPSPLRYI GAVIMMIGVVLPLGYMLFRNKRGP SSSSYSKQT+
Sbjct: 1  MPFSTFIEVEPPSPLRYIFGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQTT 54

BLAST of ClCG01G004140.1 vs. NCBI nr
Match: gi|778700125|ref|XP_011654819.1| (PREDICTED: uncharacterized protein LOC101203605 isoform X2 [Cucumis sativus])

HSP 1 Score: 98.6 bits (244), Expect = 6.0e-18
Identity = 50/53 (94.34%), Positives = 50/53 (94.34%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGP-SSSSYSKQT 84
          MPF TFIEVEPPSPLRYI GAVIMMIGVVLPLGYMLFRNKRGP SSSSYSKQT
Sbjct: 1  MPFSTFIEVEPPSPLRYIFGAVIMMIGVVLPLGYMLFRNKRGPSSSSSYSKQT 53

BLAST of ClCG01G004140.1 vs. NCBI nr
Match: gi|823213452|ref|XP_012439470.1| (PREDICTED: uncharacterized protein LOC105765095 isoform X1 [Gossypium raimondii])

HSP 1 Score: 96.7 bits (239), Expect = 2.3e-17
Identity = 45/53 (84.91%), Positives = 50/53 (94.34%), Query Frame = 1

Query: 32 MPFRTFIEVEPPSPLRYIIGAVIMMIGVVLPLGYMLFRNKRGPSSSSYSKQTS 85
          MPF+T IEVEPPSPLRYIIGA +MMIGVVLP+GYM+FRNKR PSSSSYSKQT+
Sbjct: 1  MPFKTMIEVEPPSPLRYIIGAAVMMIGVVLPVGYMMFRNKRVPSSSSYSKQTN 53

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KPA4_CUCSA  4.2e-18  94.34         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G162600 PE=4 SV=1
K7MLL8_SOYBN      5.4e-18  67.61         Uncharacterized protein OS=Glycine max PE=4 SV=1
A0A0D2PL38_GOSRA  1.6e-17  84.91         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G234300 PE=4 SV=1
A0A0D2SXG3_GOSRA  2.1e-17  86.54         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G234300 PE=4 SV=1
A0A061G4H5_THECC  4.6e-17  84.91         Uncharacterized protein OS=Theobroma cacao GN=TCM_015942 PE=4 SV=1

vs. TAIR10
Match Name        E-value  Identity (%)  Description
AT4G16695.1       2.8e-18  77.36         unknown protein

vs. NCBI nr
Match Name                         E-value  Identity (%)  Description
gi|659073438|ref|XP_008437060.1|   1.6e-18  94.44         PREDICTED: uncharacterized protein LOC103482600 isoform X1 [Cucumis melo]
gi|659073440|ref|XP_008437061.1|   2.1e-18  96.23         PREDICTED: uncharacterized protein LOC103482600 isoform X2 [Cucumis melo]
gi|778700122|ref|XP_011654818.1|   4.6e-18  92.59         PREDICTED: uncharacterized protein LOC101203605 isoform X1 [Cucumis sativus]
gi|778700125|ref|XP_011654819.1|   6.0e-18  94.34         PREDICTED: uncharacterized protein LOC101203605 isoform X2 [Cucumis sativus]
gi|823213452|ref|XP_012439470.1|   2.3e-17  84.91         PREDICTED: uncharacterized protein LOC105765095 isoform X1 [Gossypium raimondii]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term  Definition

GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name   Unique Name    Type
ClCG01G004140  ClCG01G004140  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name     Unique Name              Type
ClCG01G004140.1  ClCG01G004140.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
ClCG01G004140.1.cds1  ClCG01G004140.1.cds1  CDS
ClCG01G004140.1.cds2  ClCG01G004140.1.cds2  CDS
ClCG01G004140.1.cds3  ClCG01G004140.1.cds3  CDS


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  PANTHER  PTHR37749    FAMILY NOT NAMED   coord: 17..90, score: 8.0