Cla021273 (gene) Watermelon (97103) v1

Name: Cla021273
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr5 : 1743435 .. 1744100 (+)
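
As a quick sanity check on the location, the span length can be worked out from the two coordinates. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome feature records, but not stated on this page); the variable names are illustrative only.

```python
# Coordinates copied from the Location field.
# Assumption: 1-based, inclusive on both ends.
start, end = 1743435, 1744100

# An inclusive interval [start, end] spans end - start + 1 bases.
gene_length = end - start + 1

# If the whole span were coding, the length should divide evenly into codons.
codons, remainder = divmod(gene_length, 3)

print(gene_length, codons, remainder)  # 666 222 0
```

Under that assumption the span is 666 bases, which divides into 222 codons with nothing left over, that is, 221 residues plus a stop codon if the entire span is coding.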
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCTACTCTCTCATTTCCCCCTCTTCCTTTGAATCGTGAAGGATCCACCAGAATGCTCAAGGATTTTCTTCAGGAAACCAATGCAAATGGAATCGCATCTTCTAAACCTAAACCGGCGTCGTTTAAGACTCTGGCTATCCACGCCGTCGTCGCCGCCGTCAAGAGGATCTCGTTTCCGTCTGTGAAATCGCCAAGGATTTTCCCCAGAAGCTTTTCGCGGAGGCTGTTGAGGAAAAAAGAGAGAGATGAGAGAGAAATCGGAGGCGATTTCGTTGTTAAGATTAAGGACATCATACGGTGGAAATCGTTCAGGGATTTAGTCGACGAGACGGCGGCTCCGCCGCTTGGTTTCGCTGATTCACCGGATCGTTATACGGCCGTTGCCACGACTACGACGACGACGACGACGACTAGCAGTAATAGTTCGAGCTGGTGCGAGAGCGATTTCACGGCGGAGGATTTGCCGTCGCCGTCGTGGAGAGATTGGTCCGACGGCGCAGTGGGAAAAATGTGTTTCTCATGTGTCGGTGAAGATTCGAGGGGAACGAGATCAGCACACGCAGAAAACGACAAAAAGGTGAGAAATAATTCAGTACTTCGCGTTGTAACTTTGGTCAGTTTTTGCCACCCTAGGATCCGGTTTATTTGGTTTGAGATTTAA

mRNA sequence

ATGGCGGCTACTCTCTCATTTCCCCCTCTTCCTTTGAATCGTGAAGGATCCACCAGAATGCTCAAGGATTTTCTTCAGGAAACCAATGCAAATGGAATCGCATCTTCTAAACCTAAACCGGCGTCGTTTAAGACTCTGGCTATCCACGCCGTCGTCGCCGCCGTCAAGAGGATCTCGTTTCCGTCTGTGAAATCGCCAAGGATTTTCCCCAGAAGCTTTTCGCGGAGGCTGTTGAGGAAAAAAGAGAGAGATGAGAGAGAAATCGGAGGCGATTTCGTTGTTAAGATTAAGGACATCATACGGTGGAAATCGTTCAGGGATTTAGTCGACGAGACGGCGGCTCCGCCGCTTGGTTTCGCTGATTCACCGGATCGTTATACGGCCGTTGCCACGACTACGACGACGACGACGACGACTAGCAGTAATAGTTCGAGCTGGTGCGAGAGCGATTTCACGGCGGAGGATTTGCCGTCGCCGTCGTGGAGAGATTGGTCCGACGGCGCAGTGGGAAAAATGTGTTTCTCATGTGTCGGTGAAGATTCGAGGGGAACGAGATCAGCACACGCAGAAAACGACAAAAAGGTGAGAAATAATTCAGTACTTCGCGTTGTAACTTTGGTCAGTTTTTGCCACCCTAGGATCCGGTTTATTTGGTTTGAGATTTAA

Coding sequence (CDS)

ATGGCGGCTACTCTCTCATTTCCCCCTCTTCCTTTGAATCGTGAAGGATCCACCAGAATGCTCAAGGATTTTCTTCAGGAAACCAATGCAAATGGAATCGCATCTTCTAAACCTAAACCGGCGTCGTTTAAGACTCTGGCTATCCACGCCGTCGTCGCCGCCGTCAAGAGGATCTCGTTTCCGTCTGTGAAATCGCCAAGGATTTTCCCCAGAAGCTTTTCGCGGAGGCTGTTGAGGAAAAAAGAGAGAGATGAGAGAGAAATCGGAGGCGATTTCGTTGTTAAGATTAAGGACATCATACGGTGGAAATCGTTCAGGGATTTAGTCGACGAGACGGCGGCTCCGCCGCTTGGTTTCGCTGATTCACCGGATCGTTATACGGCCGTTGCCACGACTACGACGACGACGACGACGACTAGCAGTAATAGTTCGAGCTGGTGCGAGAGCGATTTCACGGCGGAGGATTTGCCGTCGCCGTCGTGGAGAGATTGGTCCGACGGCGCAGTGGGAAAAATGTGTTTCTCATGTGTCGGTGAAGATTCGAGGGGAACGAGATCAGCACACGCAGAAAACGACAAAAAGGTGAGAAATAATTCAGTACTTCGCGTTGTAACTTTGGTCAGTTTTTGCCACCCTAGGATCCGGTTTATTTGGTTTGAGATTTAA

Protein sequence

MAATLSFPPLPLNREGSTRMLKDFLQETNANGIASSKPKPASFKTLAIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKKERDEREIGGDFVVKIKDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTTSSNSSSWCESDFTAEDLPSPSWRDWSDGAVGKMCFSCVGEDSRGTRSAHAENDKKVRNNSVLRVVTLVSFCHPRIRFIWFEI
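
The relationship between the coding sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` and the table construction are illustrative and not part of the database record. Only the first 30 and last 12 bases of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the CDS listed for this feature.
print(translate("ATGGCGGCTACTCTCTCATTTCCCCCTCTT"))  # MAATLSFPPL
print(translate("TTTGAGATTTAA"))                    # FEI
```

The two outputs correspond to the start (MAATLSFPPL) and end (FEI) of the protein sequence shown for this feature.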
BLAST of Cla021273 vs. TrEMBL
Match: A0A0A0L6C4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G124790 PE=4 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 1.1e-80
Identity = 158/203 (77.83%), Positives = 169/203 (83.25%), Query Frame = 1

Query: 2   AATLSFPPLPLNREGSTRMLKDFLQETNANGIASSKPKPASFKTLAIHAVVAAVKRISFP 61
           A TLSFPP PLNRE  TRMLKDFL ETN NG+AS KPKP SFK LA HAVVAAVKRIS P
Sbjct: 3   APTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRISLP 62

Query: 62  SVKSPRIFPRSFSRRLLRKKERDEREIGGDFVVKIKDIIRWKSFRDLVDET--AAPPLGF 121
           SVKSPRIFPRS SRRLL+K ERDERE GGDFVVKIKDIIRWKSFRDL+DET  AAPPL F
Sbjct: 63  SVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPLDF 122

Query: 122 ADSPDRYTAVA------TTTTTTTTTSSNSSSWCESDFTAEDLPSPSWRDWS-DGAVGKM 181
           A+SPDRYT  A      TTTTTTTT+SS SSSWCESDFTAEDL SPSWRDWS DG +GKM
Sbjct: 123 AESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMGKM 182

Query: 182 CFSCVGEDSRGTRSAHAENDKKV 196
            F CVGEDS  T +A+A+ND++V
Sbjct: 183 YFPCVGEDSNETTAAYAQNDEEV 205
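
The percentages on an HSP summary line are simply the identical (or positive-scoring) positions divided by the alignment length. The sketch below parses one such line and recomputes them; the regular expression and variable names are illustrative assumptions, not part of any BLAST tool.

```python
import re

# HSP summary line for the first TrEMBL hit (Cucumis sativus).
hsp_line = "Identity = 158/203 (77.83%), Positives = 169/203 (83.25%), Query Frame = 1"

# The optional "i" lets the pattern also tolerate a "Postives" misspelling.
pattern = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

match = pattern.search(hsp_line)
identical, aligned, positives, _ = map(int, match.groups())

# Percentages are counts over the alignment length (gaps included).
pct_identity = round(100 * identical / aligned, 2)
pct_positives = round(100 * positives / aligned, 2)

print(pct_identity, pct_positives)  # 77.83 83.25
```

The recomputed values agree with the percentages printed in the report for this hit.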

BLAST of Cla021273 vs. TrEMBL
Match: A0A061DUN2_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 1.8e-27
Identity = 80/167 (47.90%), Positives = 100/167 (59.88%), Query Frame = 1

Query: 30  ANGIASSKPKPASFKTLAIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKKERDEREIG 89
           A  +  S+ K AS       A++ AV+ I F SVKSP I PRS SR+L +K  + E E  
Sbjct: 67  AQQLQRSRSKAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKLSKKNSQKETET- 126

Query: 90  GDFVVKIKDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTT-----SSNSS 149
               V++KDIIRWKS RDLV+E   PP  FA SP   T  +TTTTTTT +     SSNSS
Sbjct: 127 -RTTVRVKDIIRWKSSRDLVEE-KFPPADFASSPHHCTTRSTTTTTTTGSKSTPCSSNSS 186

Query: 150 SWCESDFTAEDLPSPSWRDWSDGAVGKMCFSCVGEDSRGTRSAHAEN 192
           SWC+SDFT+E LPS  + + S+  VGK    CVG+D   T +  A N
Sbjct: 187 SWCDSDFTSEYLPSEEYHE-SEVDVGKKFLPCVGKDPMETTTGLAAN 229

BLAST of Cla021273 vs. TrEMBL
Match: A0A061E1G2_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 1.8e-27
Identity = 80/167 (47.90%), Positives = 100/167 (59.88%), Query Frame = 1

Query: 30  ANGIASSKPKPASFKTLAIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKKERDEREIG 89
           A  +  S+ K AS       A++ AV+ I F SVKSP I PRS SR+L +K  + E E  
Sbjct: 67  AQQLQRSRSKAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKLSKKNSQKETET- 126

Query: 90  GDFVVKIKDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTT-----SSNSS 149
               V++KDIIRWKS RDLV+E   PP  FA SP   T  +TTTTTTT +     SSNSS
Sbjct: 127 -RTTVRVKDIIRWKSSRDLVEE-KFPPADFASSPHHCTTRSTTTTTTTGSKSTPCSSNSS 186

Query: 150 SWCESDFTAEDLPSPSWRDWSDGAVGKMCFSCVGEDSRGTRSAHAEN 192
           SWC+SDFT+E LPS  + + S+  VGK    CVG+D   T +  A N
Sbjct: 187 SWCDSDFTSEYLPSEEYHE-SEVDVGKKFLPCVGKDPMETTTGLAAN 229

BLAST of Cla021273 vs. TrEMBL
Match: A0A067K104_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14469 PE=4 SV=1)

HSP 1 Score: 118.6 bits (296), Expect = 9.4e-24
Identity = 70/148 (47.30%), Positives = 84/148 (56.76%), Query Frame = 1

Query: 47  AIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKKERDER----------EIGGDFVVKI 106
           A H+V+ A+K   F SVK+P IFPRS SRRL +K     R          E      V I
Sbjct: 3   AFHSVINALKNFQFTSVKAPSIFPRSISRRLSKKSSASSRDTERASESKLESEVKITVTI 62

Query: 107 KDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTTSSNSSSWCESDFTAEDL 166
           KDIIRWKSFRDL++E + PPL  A SP   T  +T + TTT  SSN SSWC+SDFT+E L
Sbjct: 63  KDIIRWKSFRDLMEEKS-PPLDLASSPHHCTTTSTASATTTPCSSNGSSWCDSDFTSEYL 122

Query: 167 P-----SPSWRDWSDGAVGKMCFSCVGE 180
           P     S  + +     VGK    CVGE
Sbjct: 123 PFWNGNSEEYGENEAMEVGKKDLPCVGE 149

BLAST of Cla021273 vs. TrEMBL
Match: B9GPU1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s15410g PE=4 SV=2)

HSP 1 Score: 115.9 bits (289), Expect = 6.1e-23
Identity = 73/165 (44.24%), Positives = 97/165 (58.79%), Query Frame = 1

Query: 30  ANGIAS-----SKPKPASFKTL-AIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKK-E 89
           +N IAS     S+ K A+  T+ A  A++ AVK + F ++KSP + PRS SRRL +KK +
Sbjct: 74  SNNIASYKLLKSRSKAAASTTISAFQAMMNAVKNVHFIAIKSPSLLPRSLSRRLSKKKCQ 133

Query: 90  RDEREIGGDFVVKIKDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTTSSN 149
             E E+     + +KDIIRWKSFRD+V++  APP     SP   T   TTT +T+TT  +
Sbjct: 134 NKENEV--KMTITVKDIIRWKSFRDIVEDDKAPPSDLPPSPHHCT--TTTTRSTSTTPRS 193

Query: 150 SSSWCESDFTAEDLPSPSWRDWSDGAV------GKMCFSCVGEDS 182
            SSWC+SDF ++ L  PSW    D  V      GK    CVGEDS
Sbjct: 194 GSSWCDSDFNSDYL--PSWNGNFDECVENEVGAGKKFLPCVGEDS 232

BLAST of Cla021273 vs. NCBI nr
Match: gi|659075368|ref|XP_008438108.1| (PREDICTED: uncharacterized protein LOC103483313 isoform X1 [Cucumis melo])

HSP 1 Score: 325.5 bits (833), Expect = 7.3e-86
Identity = 167/208 (80.29%), Positives = 177/208 (85.10%), Query Frame = 1

Query: 2   AATLSFPPLPLNREGSTRMLKDFLQETNANGIASSKPKPASFKTLAIHAVVAAVKRISFP 61
           A +LSFPP PLNREG TRMLKDFL ETN NGIASSKPKP SFK LA HAVVAAVKRISFP
Sbjct: 3   APSLSFPPFPLNREGPTRMLKDFLHETNPNGIASSKPKPTSFKALAFHAVVAAVKRISFP 62

Query: 62  SVKSPRIFPRSFSRRLLRKKERDEREIGGDFVVKIKDIIRWKSFRDLVDET--AAPPLGF 121
           SVKSPRIFPRS SRRLLRK ERDERE GGDFVVKIKDIIRWKSFRDL+DET  AAPPL F
Sbjct: 63  SVKSPRIFPRSLSRRLLRKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPLDF 122

Query: 122 ADSPDRYT----AVATTTTTTTTTSSNSSSWCESDFTAEDLPSPSWRDWS-DGAVGKMCF 181
           A+SPDRYT    A  TTTTTTTT+SS SSSWCESDFTAEDLPSPSWRDWS DG +GKM F
Sbjct: 123 AESPDRYTYTAAATTTTTTTTTTSSSKSSSWCESDFTAEDLPSPSWRDWSDDGTIGKMYF 182

Query: 182 SCVGEDSRGTRSAHAENDKKVRNNSVLR 203
            CVGEDS  T +AHA+NDK+V  N++ R
Sbjct: 183 RCVGEDSTETTAAHAKNDKEVGINALSR 210

BLAST of Cla021273 vs. NCBI nr
Match: gi|659075370|ref|XP_008438109.1| (PREDICTED: uncharacterized protein LOC103483313 isoform X2 [Cucumis melo])

HSP 1 Score: 323.2 bits (827), Expect = 3.6e-85
Identity = 165/201 (82.09%), Positives = 173/201 (86.07%), Query Frame = 1

Query: 2   AATLSFPPLPLNREGSTRMLKDFLQETNANGIASSKPKPASFKTLAIHAVVAAVKRISFP 61
           A +LSFPP PLNREG TRMLKDFL ETN NGIASSKPKP SFK LA HAVVAAVKRISFP
Sbjct: 3   APSLSFPPFPLNREGPTRMLKDFLHETNPNGIASSKPKPTSFKALAFHAVVAAVKRISFP 62

Query: 62  SVKSPRIFPRSFSRRLLRKKERDEREIGGDFVVKIKDIIRWKSFRDLVDET--AAPPLGF 121
           SVKSPRIFPRS SRRLLRK ERDERE GGDFVVKIKDIIRWKSFRDL+DET  AAPPL F
Sbjct: 63  SVKSPRIFPRSLSRRLLRKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPLDF 122

Query: 122 ADSPDRYT----AVATTTTTTTTTSSNSSSWCESDFTAEDLPSPSWRDWS-DGAVGKMCF 181
           A+SPDRYT    A  TTTTTTTT+SS SSSWCESDFTAEDLPSPSWRDWS DG +GKM F
Sbjct: 123 AESPDRYTYTAAATTTTTTTTTTSSSKSSSWCESDFTAEDLPSPSWRDWSDDGTIGKMYF 182

Query: 182 SCVGEDSRGTRSAHAENDKKV 196
            CVGEDS  T +AHA+NDK+V
Sbjct: 183 RCVGEDSTETTAAHAKNDKEV 203

BLAST of Cla021273 vs. NCBI nr
Match: gi|449432203|ref|XP_004133889.1| (PREDICTED: uncharacterized protein LOC101208043 [Cucumis sativus])

HSP 1 Score: 307.8 bits (787), Expect = 1.6e-80
Identity = 158/203 (77.83%), Positives = 169/203 (83.25%), Query Frame = 1

Query: 2   AATLSFPPLPLNREGSTRMLKDFLQETNANGIASSKPKPASFKTLAIHAVVAAVKRISFP 61
           A TLSFPP PLNRE  TRMLKDFL ETN NG+AS KPKP SFK LA HAVVAAVKRIS P
Sbjct: 3   APTLSFPPFPLNREAPTRMLKDFLHETNPNGLASPKPKPTSFKALAFHAVVAAVKRISLP 62

Query: 62  SVKSPRIFPRSFSRRLLRKKERDEREIGGDFVVKIKDIIRWKSFRDLVDET--AAPPLGF 121
           SVKSPRIFPRS SRRLL+K ERDERE GGDFVVKIKDIIRWKSFRDL+DET  AAPPL F
Sbjct: 63  SVKSPRIFPRSLSRRLLKKTERDERETGGDFVVKIKDIIRWKSFRDLIDETTAAAPPLDF 122

Query: 122 ADSPDRYTAVA------TTTTTTTTTSSNSSSWCESDFTAEDLPSPSWRDWS-DGAVGKM 181
           A+SPDRYT  A      TTTTTTTT+SS SSSWCESDFTAEDL SPSWRDWS DG +GKM
Sbjct: 123 AESPDRYTYTAAATTTTTTTTTTTTSSSKSSSWCESDFTAEDLASPSWRDWSDDGTMGKM 182

Query: 182 CFSCVGEDSRGTRSAHAENDKKV 196
            F CVGEDS  T +A+A+ND++V
Sbjct: 183 YFPCVGEDSNETTAAYAQNDEEV 205

BLAST of Cla021273 vs. NCBI nr
Match: gi|590722615|ref|XP_007051945.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 131.0 bits (328), Expect = 2.6e-27
Identity = 80/167 (47.90%), Positives = 100/167 (59.88%), Query Frame = 1

Query: 30  ANGIASSKPKPASFKTLAIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKKERDEREIG 89
           A  +  S+ K AS       A++ AV+ I F SVKSP I PRS SR+L +K  + E E  
Sbjct: 67  AQQLQRSRSKAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKLSKKNSQKETET- 126

Query: 90  GDFVVKIKDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTT-----SSNSS 149
               V++KDIIRWKS RDLV+E   PP  FA SP   T  +TTTTTTT +     SSNSS
Sbjct: 127 -RTTVRVKDIIRWKSSRDLVEE-KFPPADFASSPHHCTTRSTTTTTTTGSKSTPCSSNSS 186

Query: 150 SWCESDFTAEDLPSPSWRDWSDGAVGKMCFSCVGEDSRGTRSAHAEN 192
           SWC+SDFT+E LPS  + + S+  VGK    CVG+D   T +  A N
Sbjct: 187 SWCDSDFTSEYLPSEEYHE-SEVDVGKKFLPCVGKDPMETTTGLAAN 229

BLAST of Cla021273 vs. NCBI nr
Match: gi|590722618|ref|XP_007051946.1| (Uncharacterized protein isoform 2 [Theobroma cacao])

HSP 1 Score: 131.0 bits (328), Expect = 2.6e-27
Identity = 80/167 (47.90%), Positives = 100/167 (59.88%), Query Frame = 1

Query: 30  ANGIASSKPKPASFKTLAIHAVVAAVKRISFPSVKSPRIFPRSFSRRLLRKKERDEREIG 89
           A  +  S+ K AS       A++ AV+ I F SVKSP I PRS SR+L +K  + E E  
Sbjct: 67  AQQLQRSRSKAASTTISTFQAMIKAVRNIHFTSVKSPSILPRSLSRKLSKKNSQKETET- 126

Query: 90  GDFVVKIKDIIRWKSFRDLVDETAAPPLGFADSPDRYTAVATTTTTTTTT-----SSNSS 149
               V++KDIIRWKS RDLV+E   PP  FA SP   T  +TTTTTTT +     SSNSS
Sbjct: 127 -RTTVRVKDIIRWKSSRDLVEE-KFPPADFASSPHHCTTRSTTTTTTTGSKSTPCSSNSS 186

Query: 150 SWCESDFTAEDLPSPSWRDWSDGAVGKMCFSCVGEDSRGTRSAHAEN 192
           SWC+SDFT+E LPS  + + S+  VGK    CVG+D   T +  A N
Sbjct: 187 SWCDSDFTSEYLPSEEYHE-SEVDVGKKFLPCVGKDPMETTTGLAAN 229

The following BLAST results are available for this feature:
BLAST vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0L6C4_CUCSA  1.1e-80  77.83         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G124790 PE=4 SV=1
A0A061DUN2_THECC  1.8e-27  47.90         Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1
A0A061E1G2_THECC  1.8e-27  47.90         Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_005434 PE=4 SV=1
A0A067K104_JATCU  9.4e-24  47.30         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14469 PE=4 SV=1
B9GPU1_POPTR      6.1e-23  44.24         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s15410g PE=4 SV=2

BLAST vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|659075368|ref|XP_008438108.1|  7.3e-86  80.29         PREDICTED: uncharacterized protein LOC103483313 isoform X1 [Cucumis melo]
gi|659075370|ref|XP_008438109.1|  3.6e-85  82.09         PREDICTED: uncharacterized protein LOC103483313 isoform X2 [Cucumis melo]
gi|449432203|ref|XP_004133889.1|  1.6e-80  77.83         PREDICTED: uncharacterized protein LOC101208043 [Cucumis sativus]
gi|590722615|ref|XP_007051945.1|  2.6e-27  47.90         Uncharacterized protein isoform 1 [Theobroma cacao]
gi|590722618|ref|XP_007051946.1|  2.6e-27  47.90         Uncharacterized protein isoform 2 [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla021273     Cla021273.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR33623      FAMILY NOT NAMED     coord: 47..167, score: 1.7
None      No IPR available  PANTHER  PTHR33623:SF4  SUBFAMILY NOT NAMED  coord: 47..167, score: 1.7

The following gene(s) are paralogous to this gene:

None