Cla023304 (gene) Watermelon (97103) v1

Name: Cla023304
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr11 : 19411765 .. 19412298 (+)
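The location field can be turned into a feature length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a 'Chr : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr11 : 19411765 .. 19412298 (+)")
print(chrom, strand, span_length(start, end))  # Chr11 + 534
```

Under that assumption the feature spans 534 bp, a multiple of three.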
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCTCTGTGGAATTTAGAAGACAAATGGAAGCTCTCAACCCAACAAGCTTTCATTCTCTTTACGTGCACGGCGGTCGCGGTTGTTGGGTTCTGCGCGGCGGCGTGGGCGAAGAAGAGGAAGGGAGAGAAGGGAGAGAAGAAAGAGCGCCGAAGGACGACGGCAACGGAGAAGTGGTGGAAATGGGCGGCAGAAAGGCGGAAGGGGAGCGGCGGCGGAGAGACGCCGGTGCGGCTGTTGGGCAGAGAAGGGGAAGGGGAAGAATTGGGAAGCCGGAATTCGACGGCGGCGGTGTGGCAACGGCCGATATTAATGGGGGAAAAGTGTGAGATGCTGAAATACAGTGGGCTTATTCTGTACGACGAAAGGGGACGATTGCTGCAGGAGCAAATCGCCGCCATGGAAAATGGTTACAAGGTGCTGCTTCGTAAGTTCTTTCGGATTTTTTTAATAGGGAAAAAAAAAAAAAAAAAGCGTTCAATGATTAAAAACATATTTACTCATACGACTTTAAAGAAAATGACATTATAA

mRNA sequence

ATGGAAGCTCTGTGGAATTTAGAAGACAAATGGAAGCTCTCAACCCAACAAGCTTTCATTCTCTTTACGTGCACGGCGGTCGCGGTTGTTGGGTTCTGCGCGGCGGCGTGGGCGAAGAAGAGGAAGGGAGAGAAGGGAGAGAAGAAAGAGCGCCGAAGGACGACGGCAACGGAGAAGTGGTGGAAATGGGCGGCAGAAAGGCGGAAGGGGAGCGGCGGCGGAGAGACGCCGGTGCGGCTGTTGGGCAGAGAAGGGGAAGGGGAAGAATTGGGAAGCCGGAATTCGACGGCGGCGGTGTGGCAACGGCCGATATTAATGGGGGAAAAGTGTGAGATGCTGAAATACAGTGGGCTTATTCTGTACGACGAAAGGGGACGATTGCTGCAGGAGCAAATCGCCGCCATGGAAAATGGTTACAAGGTGCTGCTTCGTAAGTTCTTTCGGATTTTTTTAATAGGGAAAAAAAAAAAAAAAAAGCGTTCAATGATTAAAAACATATTTACTCATACGACTTTAAAGAAAATGACATTATAA

Coding sequence (CDS)

ATGGAAGCTCTGTGGAATTTAGAAGACAAATGGAAGCTCTCAACCCAACAAGCTTTCATTCTCTTTACGTGCACGGCGGTCGCGGTTGTTGGGTTCTGCGCGGCGGCGTGGGCGAAGAAGAGGAAGGGAGAGAAGGGAGAGAAGAAAGAGCGCCGAAGGACGACGGCAACGGAGAAGTGGTGGAAATGGGCGGCAGAAAGGCGGAAGGGGAGCGGCGGCGGAGAGACGCCGGTGCGGCTGTTGGGCAGAGAAGGGGAAGGGGAAGAATTGGGAAGCCGGAATTCGACGGCGGCGGTGTGGCAACGGCCGATATTAATGGGGGAAAAGTGTGAGATGCTGAAATACAGTGGGCTTATTCTGTACGACGAAAGGGGACGATTGCTGCAGGAGCAAATCGCCGCCATGGAAAATGGTTACAAGGTGCTGCTTCGTAAGTTCTTTCGGATTTTTTTAATAGGGAAAAAAAAAAAAAAAAAGCGTTCAATGATTAAAAACATATTTACTCATACGACTTTAAAGAAAATGACATTATAA

Protein sequence

MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKWWKWAAERRKGSGGGETPVRLLGREGEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDERGRLLQEQIAAMENGYKVLLRKFFRIFLIGKKKKKKRSMIKNIFTHTTLKKMTL
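
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names, not part of the database. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGGAAGCTCTGTGGAATTTAGAAGACAAA"))  # MEALWNLEDK
print(translate("ATGACATTATAA"))                    # MTL
```

Both fragments agree with the start (MEALWNLEDK) and end (MTL) of the protein sequence given above.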
BLAST of Cla023304 vs. TrEMBL
Match: A0A0A0L376_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G622860 PE=4 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 5.9e-61
Identity = 121/142 (85.21%), Positives = 125/142 (88.03%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKW 60
           MEALWNLEDKWKLSTQQAF+L TCT  AV+G CAAAWAKKRKGEK  K   RRT   ++W
Sbjct: 1   MEALWNLEDKWKLSTQQAFVLLTCTVAAVIGLCAAAWAKKRKGEK--KAHPRRTD--QRW 60

Query: 61  WKWAA--ERRKGSGGGETPVRLLGREGEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120
           WKW A   RRKGSGGGETPVRLLGRE EGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL
Sbjct: 61  WKWPATESRRKGSGGGETPVRLLGREEEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120

Query: 121 ILYDERGRLLQEQIAAMENGYK 141
           ILYDERGRLLQEQIAAMENGYK
Sbjct: 121 ILYDERGRLLQEQIAAMENGYK 138
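
The identity and positive counts in the HSP header can be recomputed from the alignment midline. In BLAST pairwise output a letter on the midline marks an identical residue and `+` marks a conservative substitution. The sketch below applies that rule to the three midline rows of the Cucumis sativus hit above; `count_midline` is a hypothetical helper, and the strings were copied by hand from the alignment.

```python
def count_midline(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one midline row."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")  # positives include identities
    return identities, positives, len(midline)


# Midline rows of the A0A0A0L376_CUCSA alignment, in order.
midlines = [
    "MEALWNLEDKWKLSTQQAF+L TCT  AV+G CAAAWAKKRKGEK  K   RRT   ++W",
    "WKW A   RRKGSGGGETPVRLLGRE EGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL",
    "ILYDERGRLLQEQIAAMENGYK",
]

totals = [sum(column) for column in zip(*(count_midline(m) for m in midlines))]
identities, positives, length = totals
print(identities, positives, length)              # 121 125 142
print(round(100 * identities / length, 2))        # 85.21
print(round(100 * positives / length, 2))         # 88.03
```

The totals match the 121/142 and 125/142 figures reported for this HSP.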

BLAST of Cla023304 vs. TrEMBL
Match: B9N7S2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s10460g PE=4 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 4.0e-25
Identity = 72/165 (43.64%), Positives = 91/165 (55.15%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKW 60
           MEALWNLED+WKL+TQ+A +LF CTA+AV+  CAA   K++   K     +  +T + K 
Sbjct: 1   MEALWNLEDEWKLTTQEAVLLFVCTALAVIALCAAIMLKRKAQTKQRSVNQDPSTGSSKR 60

Query: 61  WK--------WAAERR------------------KGSGGGE---TPVRLLGREGEGEELG 120
           W         W   RR                   GSG G     P  +LG E     +G
Sbjct: 61  WSEPEPGSNNWITIRRVLMESMRWSGASKWDEGSSGSGSGSGMLLPPPVLGLERCESSMG 120

Query: 121 --SRNSTAAVWQRPILMGEKCEMLKYSGLILYDERGRLLQEQIAA 135
             S +S +AVWQRPILMGEKCE+ +YSGLILYDERGRLL   + +
Sbjct: 121 WQSPDSLSAVWQRPILMGEKCELPRYSGLILYDERGRLLDHSLTS 165

BLAST of Cla023304 vs. TrEMBL
Match: A0A0D2PI92_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G332200 PE=4 SV=1)

HSP 1 Score: 119.4 bits (298), Expect = 4.4e-24
Identity = 70/160 (43.75%), Positives = 92/160 (57.50%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKK--RKGEKGEKKERRRTTATE 60
           MEALW+LE+KWKL+TQ+A I+F C A A+VG CAA   KK  RK +  +       +   
Sbjct: 1   MEALWSLEEKWKLTTQEAVIVFVCAASAIVGLCAATVLKKKARKKQVADGDAVGDGSMNA 60

Query: 61  KW------W----------KWAAERRKGSGG---GETPVRLLGREGEGEELG--SRNSTA 120
           KW      W           W+  +R G       E P  LLG E  G+ +G  S NS +
Sbjct: 61  KWREPGCSWVPKKVLMGSVMWSGAKRWGERSFRWEERPPPLLGLEEYGDSVGWRSHNSDS 120

Query: 121 AVWQRPILMGEKCEMLKYSGLILYDERGRLLQEQIAAMEN 138
            VWQRPILMGEKCE+ ++SGLILYDERG+LL + +  + +
Sbjct: 121 PVWQRPILMGEKCELPRFSGLILYDERGQLLDDSVKRLSD 160

BLAST of Cla023304 vs. TrEMBL
Match: A0A061FGE7_THECC (Ribosomal RNA small subunit methyltransferase G, putative OS=Theobroma cacao GN=TCM_032143 PE=4 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 5.8e-24
Identity = 73/154 (47.40%), Positives = 88/154 (57.14%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAA-WAKKRKGEKGEKKERRRTT---A 60
           MEALW+LE+KWKL+TQ+A +L  C A AVVG CAA    KKRK +K +  +R R      
Sbjct: 16  MEALWDLEEKWKLTTQEAVLLLACAASAVVGLCAATVLKKKRKAQKKQMVDRDRVADGAV 75

Query: 61  TEKWWK-----------------WAAERRKGS---GGGETPVRLLGREGEGEELG--SRN 120
             KW +                 W+   R G    G  E P  LLG EG     G  S N
Sbjct: 76  HAKWCEPSCNWVSAKRVFMGSAMWSGANRWGERSFGWEERPPPLLGLEGYDSCAGWRSHN 135

Query: 121 STAAVWQRPILMGEKCEMLKYSGLILYDERGRLL 129
           S + VWQRPILMGEKCE+ ++SGLILYDERG+LL
Sbjct: 136 SDSPVWQRPILMGEKCELPRFSGLILYDERGQLL 169

BLAST of Cla023304 vs. TrEMBL
Match: A0A068UBX3_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00021414001 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 3.8e-23
Identity = 71/162 (43.83%), Positives = 90/162 (55.56%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKG-------------E 60
           MEALW LEDKWKLSTQ+A   F CTA  V+G C A + K+R    G             +
Sbjct: 1   MEALWKLEDKWKLSTQEAVAFFACTAFLVIGVCFATFLKRRAKRSGLVHQEPCMNTEATD 60

Query: 61  KKERRRTTATEKW------------WKWAA--ERRKGSGGGE---TPVRLLGREGEGEEL 120
           + +R      +KW            W  A+  E R+ SG       P+ ++G E   E L
Sbjct: 61  EAKRSDQKQIKKWGAVKELLMGSVRWSGASKLEERRLSGSQRERAAPLLVVGGEKCEENL 120

Query: 121 G--SRNSTAAVWQRPILMGEKCEMLKYSGLILYDERGRLLQE 131
           G  S NS++AVWQRPILMGEKCE+ ++SGLILYDERGR L +
Sbjct: 121 GRLSHNSSSAVWQRPILMGEKCELPRFSGLILYDERGRPLDQ 162

BLAST of Cla023304 vs. NCBI nr
Match: gi|778695623|ref|XP_011654026.1| (PREDICTED: uncharacterized protein LOC105435286 [Cucumis sativus])

HSP 1 Score: 253.1 bits (645), Expect = 3.7e-64
Identity = 128/152 (84.21%), Positives = 132/152 (86.84%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKW 60
           MEALWNLEDKWKLSTQQAF+L TCT  AV+G CAAAWAKKRKGEK  K   RRT   ++W
Sbjct: 1   MEALWNLEDKWKLSTQQAFVLLTCTVAAVIGLCAAAWAKKRKGEK--KAHPRRTD--QRW 60

Query: 61  WKWAA--ERRKGSGGGETPVRLLGREGEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120
           WKW A   RRKGSGGGETPVRLLGRE EGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL
Sbjct: 61  WKWPATESRRKGSGGGETPVRLLGREEEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120

Query: 121 ILYDERGRLLQEQIAAMENGYKVLLRKFFRIF 151
           ILYDERGRLLQEQIAAMENGYKVLLRK F  F
Sbjct: 121 ILYDERGRLLQEQIAAMENGYKVLLRKLFGRF 148

BLAST of Cla023304 vs. NCBI nr
Match: gi|700199869|gb|KGN55027.1| (hypothetical protein Csa_4G622860 [Cucumis sativus])

HSP 1 Score: 241.9 bits (616), Expect = 8.5e-61
Identity = 121/142 (85.21%), Positives = 125/142 (88.03%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKW 60
           MEALWNLEDKWKLSTQQAF+L TCT  AV+G CAAAWAKKRKGEK  K   RRT   ++W
Sbjct: 1   MEALWNLEDKWKLSTQQAFVLLTCTVAAVIGLCAAAWAKKRKGEK--KAHPRRTD--QRW 60

Query: 61  WKWAA--ERRKGSGGGETPVRLLGREGEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120
           WKW A   RRKGSGGGETPVRLLGRE EGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL
Sbjct: 61  WKWPATESRRKGSGGGETPVRLLGREEEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120

Query: 121 ILYDERGRLLQEQIAAMENGYK 141
           ILYDERGRLLQEQIAAMENGYK
Sbjct: 121 ILYDERGRLLQEQIAAMENGYK 138

BLAST of Cla023304 vs. NCBI nr
Match: gi|659131296|ref|XP_008465611.1| (PREDICTED: uncharacterized protein LOC103503252 [Cucumis melo])

HSP 1 Score: 234.2 bits (596), Expect = 1.8e-58
Identity = 119/142 (83.80%), Positives = 125/142 (88.03%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKW 60
           MEALWNLEDKWKLSTQQAF+L TCT  AV+  CAAAWA+KRKGEK  K  RRRT   ++W
Sbjct: 1   MEALWNLEDKWKLSTQQAFVLLTCTLAAVIALCAAAWARKRKGEK--KAHRRRTD--QRW 60

Query: 61  WKWAAERR-KGSGGG-ETPVRLLGREGEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120
           WKWAAE R KGSGGG ETPVRLLG E EGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL
Sbjct: 61  WKWAAEIRWKGSGGGGETPVRLLGEEEEGEELGSRNSTAAVWQRPILMGEKCEMLKYSGL 120

Query: 121 ILYDERGRLLQEQIAAMENGYK 141
           ILYDERGRLLQ+QIAAMENGYK
Sbjct: 121 ILYDERGRLLQDQIAAMENGYK 138

BLAST of Cla023304 vs. NCBI nr
Match: gi|566194776|ref|XP_006377711.1| (hypothetical protein POPTR_0011s10460g [Populus trichocarpa])

HSP 1 Score: 122.9 bits (307), Expect = 5.8e-25
Identity = 72/165 (43.64%), Positives = 91/165 (55.15%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTTATEKW 60
           MEALWNLED+WKL+TQ+A +LF CTA+AV+  CAA   K++   K     +  +T + K 
Sbjct: 1   MEALWNLEDEWKLTTQEAVLLFVCTALAVIALCAAIMLKRKAQTKQRSVNQDPSTGSSKR 60

Query: 61  WK--------WAAERR------------------KGSGGGE---TPVRLLGREGEGEELG 120
           W         W   RR                   GSG G     P  +LG E     +G
Sbjct: 61  WSEPEPGSNNWITIRRVLMESMRWSGASKWDEGSSGSGSGSGMLLPPPVLGLERCESSMG 120

Query: 121 --SRNSTAAVWQRPILMGEKCEMLKYSGLILYDERGRLLQEQIAA 135
             S +S +AVWQRPILMGEKCE+ +YSGLILYDERGRLL   + +
Sbjct: 121 WQSPDSLSAVWQRPILMGEKCELPRYSGLILYDERGRLLDHSLTS 165

BLAST of Cla023304 vs. NCBI nr
Match: gi|743925357|ref|XP_011006814.1| (PREDICTED: uncharacterized protein LOC105112724 [Populus euphratica])

HSP 1 Score: 120.9 bits (302), Expect = 2.2e-24
Identity = 72/163 (44.17%), Positives = 92/163 (56.44%), Query Frame = 1

Query: 1   MEALWNLEDKWKLSTQQAFILFTCTAVAVVGFCAAAWAKKRKGEKGEKKERRRTT----- 60
           MEALWNLED+WKL+TQ+A +LF CTA+AV+  CAA   K++   K     +  +T     
Sbjct: 1   MEALWNLEDEWKLTTQEAVLLFVCTALAVIALCAAIMLKRKAQTKQRAVNQDPSTDFSIR 60

Query: 61  ------ATEKW------------WKWAA---ERRKGSGGGET-PVRLLGREGEGEELG-- 120
                  +  W            W  A+   ERR GSG G   P  +LG E     +G  
Sbjct: 61  RSEPEPGSNNWITIRRVLMESMRWSGASKWDERRSGSGSGMLLPPPVLGSERCESSMGWQ 120

Query: 121 SRNSTAAVWQRPILMGEKCEMLKYSGLILYDERGRLLQEQIAA 135
           S  S +AVWQRPILMGEKCE+ +YSG+ILYDERGRLL   + +
Sbjct: 121 SPESLSAVWQRPILMGEKCELPRYSGVILYDERGRLLDHSLTS 163

The following BLAST results are available for this feature:

BLAST vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0L376_CUCSA  5.9e-61  85.21         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G622860 PE=4 SV=1
B9N7S2_POPTR      4.0e-25  43.64         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0011s10460g PE=4 SV=1
A0A0D2PI92_GOSRA  4.4e-24  43.75         Uncharacterized protein OS=Gossypium raimondii GN=B456_007G332200 PE=4 SV=1
A0A061FGE7_THECC  5.8e-24  47.40         Ribosomal RNA small subunit methyltransferase G, putative OS=Theobroma cacao GN=...
A0A068UBX3_COFCA  3.8e-23  43.83         Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00021414001 PE=4 SV=1

BLAST vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|778695623|ref|XP_011654026.1|  3.7e-64  84.21         PREDICTED: uncharacterized protein LOC105435286 [Cucumis sativus]
gi|700199869|gb|KGN55027.1|       8.5e-61  85.21         hypothetical protein Csa_4G622860 [Cucumis sativus]
gi|659131296|ref|XP_008465611.1|  1.8e-58  83.80         PREDICTED: uncharacterized protein LOC103503252 [Cucumis melo]
gi|566194776|ref|XP_006377711.1|  5.8e-25  43.64         hypothetical protein POPTR_0011s10460g [Populus trichocarpa]
gi|743925357|ref|XP_011006814.1|  2.2e-24  44.17         PREDICTED: uncharacterized protein LOC105112724 [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032259      methylation
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0008168      methyltransferase activity
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla023304     Cla023304.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description                           Alignment
None      No IPR available  PANTHER  PTHR33237      FAMILY NOT NAMED                             coord: 1..132; score: 1.2
None      No IPR available  PANTHER  PTHR33237:SF9  GENOMIC DNA, CHROMOSOME 3, BAC CLONE: T21E2  coord: 1..132; score: 1.2