Cla020943 (gene) Watermelon (97103) v1

Name: Cla020943
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Homology to unknown gene (AHRD V1 **-- Q016Y8_OSTTA)
Location: Chr5 : 25281026 .. 25281854 (-)
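
As a minimal sketch of how the location field can be read programmatically, the snippet below parses the string into chromosome, start, end and strand, and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr5 : 25281026 .. 25281854 (-)'.

    Returns (chromosome, start, end, strand). Hypothetical helper,
    written only for the location format shown on this page.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5 : 25281026 .. 25281854 (-)")
print(chrom, start, end, strand, span_length(start, end))
```

The `(-)` marks the minus strand; because the gene sequence listed on this page begins with ATG, it is presumably already given in the gene's own orientation rather than that of the reference strand.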
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCAGGCTGCTCGGGCATTTGAGAACCAATTGGCAGAATCAATATTGGAGGGCGATGGCCAATTAGCCTTGAAGAAGCTTGCAACTGCTACGCTTGAGAAACTAATGCCCAGAATCGAAGGAAAAGGTGAATTTGGTCAGGCTAGGTGGAGACTAGTGTATGCTCCACAAATCCCAAGTTTACTCTCCTTTCCAACCACTGATCCTCTCAAATCGCTCACCAGTAATATCTCATTTGGTACTGTAGTTGAAGTCCAGCTTGGAAAGCGCATTCAGGTTCTCTTTGCAACTGTTTATTAGTGGGCTAATGTTGTTGCACTGCCTGACTAATCTTTTCTATTTCTGAGGCTTAACAATCTGCCTTAGTTCTGTTTTTTAAAAATCCTTCTTGTTTATTAACCATTTTTTAAATACTAGAAATGAAACTTTTCTTTGATTAACAAGTTATGTTCCGCTTTTCTAAATACTTTCAAAATATCAAAATTTTCAAACACATGTATACTGAAAACTACTTTTAGTTTTGAAATTTTGACTATGATCTTAAGAATGTTTAATAAGGTAGAATGTATGATGAGAAAATCTGTTGATATGGCTGAATGATGATATTCTAAAGTTTCAGATTTTGTTTTATTGTTATATTCTTAATATCTTTCCCTTCTCCGCCCTTTTCTAGGCCTCCATCATTCGGCAAATGAAGGAGTCAGAGATGGCCATGCAATGGACGTTCACGTACAAGCTTACGAGCCGCTTGCGAATGGTCCTTCAATCAGCTCCAGCTCAACGAGCACTTGTGCTCGTAGAATATTCTGCTTCATCACTGGATTAA

mRNA sequence

ATGGTTCAGGCTGCTCGGGCATTTGAGAACCAATTGGCAGAATCAATATTGGAGGGCGATGGCCAATTAGCCTTGAAGAAGCTTGCAACTGCTACGCTTGAGAAACTAATGCCCAGAATCGAAGGAAAAGGTGAATTTGGTCAGGCTAGGTGGAGACTAGTGTATGCTCCACAAATCCCAAGTTTACTCTCCTTTCCAACCACTGATCCTCTCAAATCGCTCACCAGTAATATCTCATTTGGTACTGTAGTTGAAGTCCAGCTTGGAAAGCGCATTCAGGCCTCCATCATTCGGCAAATGAAGGAGTCAGAGATGGCCATGCAATGGACGTTCACGTACAAGCTTACGAGCCGCTTGCGAATGGTCCTTCAATCAGCTCCAGCTCAACGAGCACTTGTGCTCGTAGAATATTCTGCTTCATCACTGGATTAA

Coding sequence (CDS)

ATGGTTCAGGCTGCTCGGGCATTTGAGAACCAATTGGCAGAATCAATATTGGAGGGCGATGGCCAATTAGCCTTGAAGAAGCTTGCAACTGCTACGCTTGAGAAACTAATGCCCAGAATCGAAGGAAAAGGTGAATTTGGTCAGGCTAGGTGGAGACTAGTGTATGCTCCACAAATCCCAAGTTTACTCTCCTTTCCAACCACTGATCCTCTCAAATCGCTCACCAGTAATATCTCATTTGGTACTGTAGTTGAAGTCCAGCTTGGAAAGCGCATTCAGGCCTCCATCATTCGGCAAATGAAGGAGTCAGAGATGGCCATGCAATGGACGTTCACGTACAAGCTTACGAGCCGCTTGCGAATGGTCCTTCAATCAGCTCCAGCTCAACGAGCACTTGTGCTCGTAGAATATTCTGCTTCATCACTGGATTAA

Protein sequence

MVQAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSLLSFPTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRMVLQSAPAQRALVLVEYSASSLD
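
The relationship between the coding sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is an illustrative example that assumes the standard genetic code; the page does not say which tool or table produced the protein sequence, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":  # stop codon ends the protein
            break
        residues.append(amino)
    return "".join(residues)

# First six and last four codons of the CDS listed above.
print(translate("ATGGTTCAGGCTGCTCGG"))
print(translate("TCACTGGATTAA"))
```

Passing the full CDS string to `translate` should give a sequence that can be compared directly against the protein sequence shown above.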
BLAST of Cla020943 vs. TrEMBL
Match: A0A0A0LEE3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G879460 PE=4 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 1.3e-63
Identity = 128/141 (90.78%), Positives = 136/141 (96.45%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +A RAFENQLAESILE  GQLAL+KLATATLEKLMPRIEGKGEFGQA WRLVYAPQIP+L
Sbjct: 1228 EATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQASWRLVYAPQIPTL 1287

Query: 63   LSFPTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRMV 122
            LSFPTTDPL+SLTSNISFGTVVEVQLGKRIQAS+IRQMKE+EMAMQWTFTYKLTSRLRMV
Sbjct: 1288 LSFPTTDPLQSLTSNISFGTVVEVQLGKRIQASMIRQMKETEMAMQWTFTYKLTSRLRMV 1347

Query: 123  LQSAPAQRALVLVEYSASSLD 144
            LQSAPAQR L+LVEYSA+SLD
Sbjct: 1348 LQSAPAQRTLLLVEYSATSLD 1368
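
The identity and positives figures in each HSP header are ratios over the aligned length, and they can be recomputed from the middle line of an alignment block. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap); the helper names are hypothetical.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST alignment midline.

    Assumes letters mark identical residues and '+' marks
    conservative substitutions.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

def percent(matches, aligned_length):
    # Percentage formatted to two decimals, as in the HSP header.
    return f"{100 * matches / aligned_length:.2f}"

# Counts taken from the HSP header of the first hit above.
print(percent(128, 141), percent(136, 141))
```

To check a whole HSP, the midlines of all its blocks would be concatenated before counting.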

BLAST of Cla020943 vs. TrEMBL
Match: A0A0D2V631_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G053900 PE=4 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 2.3e-55
Identity = 118/142 (83.10%), Positives = 128/142 (90.14%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            QAAR FE+QLAESILEGDGQLA KKLATATLE LMPRIEGKGEFGQARWRLVYAPQIPSL
Sbjct: 1982 QAARVFESQLAESILEGDGQLAFKKLATATLETLMPRIEGKGEFGQARWRLVYAPQIPSL 2041

Query: 63   LSF-PTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRM 122
            LS  PT DPLKSL SNISFGT VEVQLGKR+QASI+RQ+KESEMAMQWT  YKLTSRLR+
Sbjct: 2042 LSVDPTADPLKSLASNISFGTEVEVQLGKRLQASIVRQLKESEMAMQWTLIYKLTSRLRV 2101

Query: 123  VLQSAPAQRALVLVEYSASSLD 144
            +LQSAP++R  +L EYSA+S D
Sbjct: 2102 LLQSAPSKR--LLFEYSATSQD 2121

BLAST of Cla020943 vs. TrEMBL
Match: A0A0D2SN74_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G053900 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 5.1e-55
Identity = 117/142 (82.39%), Positives = 128/142 (90.14%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +AAR FE+QLAESILEGDGQLA KKLATATLE LMPRIEGKGEFGQARWRLVYAPQIPSL
Sbjct: 1346 EAARVFESQLAESILEGDGQLAFKKLATATLETLMPRIEGKGEFGQARWRLVYAPQIPSL 1405

Query: 63   LSF-PTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRM 122
            LS  PT DPLKSL SNISFGT VEVQLGKR+QASI+RQ+KESEMAMQWT  YKLTSRLR+
Sbjct: 1406 LSVDPTADPLKSLASNISFGTEVEVQLGKRLQASIVRQLKESEMAMQWTLIYKLTSRLRV 1465

Query: 123  VLQSAPAQRALVLVEYSASSLD 144
            +LQSAP++R  +L EYSA+S D
Sbjct: 1466 LLQSAPSKR--LLFEYSATSQD 1485

BLAST of Cla020943 vs. TrEMBL
Match: A0A0D2U6V0_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_010G053900 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 5.1e-55
Identity = 117/142 (82.39%), Positives = 128/142 (90.14%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +AAR FE+QLAESILEGDGQLA KKLATATLE LMPRIEGKGEFGQARWRLVYAPQIPSL
Sbjct: 2046 EAARVFESQLAESILEGDGQLAFKKLATATLETLMPRIEGKGEFGQARWRLVYAPQIPSL 2105

Query: 63   LSF-PTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRM 122
            LS  PT DPLKSL SNISFGT VEVQLGKR+QASI+RQ+KESEMAMQWT  YKLTSRLR+
Sbjct: 2106 LSVDPTADPLKSLASNISFGTEVEVQLGKRLQASIVRQLKESEMAMQWTLIYKLTSRLRV 2165

Query: 123  VLQSAPAQRALVLVEYSASSLD 144
            +LQSAP++R  +L EYSA+S D
Sbjct: 2166 LLQSAPSKR--LLFEYSATSQD 2185

BLAST of Cla020943 vs. TrEMBL
Match: A0A0B0MVS8_GOSAR (Acetolactate synthase large subunit OS=Gossypium arboreum GN=F383_27597 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 5.1e-55
Identity = 117/142 (82.39%), Positives = 128/142 (90.14%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +AAR FE+QLAESILEGDGQLA KKLATATLE LMPRIEGKGEFGQARWRLVYAPQIPSL
Sbjct: 1060 EAARVFESQLAESILEGDGQLAFKKLATATLETLMPRIEGKGEFGQARWRLVYAPQIPSL 1119

Query: 63   LSF-PTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRM 122
            LS  PT DPLKSL SNISFGT VEVQLGKR+QASI+RQ+KESEMAMQWT  YKLTSRLR+
Sbjct: 1120 LSVDPTADPLKSLASNISFGTEVEVQLGKRLQASIVRQLKESEMAMQWTLIYKLTSRLRV 1179

Query: 123  VLQSAPAQRALVLVEYSASSLD 144
            +LQSAP++R  +L EYSA+S D
Sbjct: 1180 LLQSAPSKR--LLFEYSATSQD 1199

BLAST of Cla020943 vs. NCBI nr
Match: gi|778687064|ref|XP_011652499.1| (PREDICTED: uncharacterized protein LOC101203544 [Cucumis sativus])

HSP 1 Score: 250.4 bits (638), Expect = 1.9e-63
Identity = 128/141 (90.78%), Positives = 136/141 (96.45%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +A RAFENQLAESILE  GQLAL+KLATATLEKLMPRIEGKGEFGQA WRLVYAPQIP+L
Sbjct: 2013 EATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQASWRLVYAPQIPTL 2072

Query: 63   LSFPTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRMV 122
            LSFPTTDPL+SLTSNISFGTVVEVQLGKRIQAS+IRQMKE+EMAMQWTFTYKLTSRLRMV
Sbjct: 2073 LSFPTTDPLQSLTSNISFGTVVEVQLGKRIQASMIRQMKETEMAMQWTFTYKLTSRLRMV 2132

Query: 123  LQSAPAQRALVLVEYSASSLD 144
            LQSAPAQR L+LVEYSA+SLD
Sbjct: 2133 LQSAPAQRTLLLVEYSATSLD 2153

BLAST of Cla020943 vs. NCBI nr
Match: gi|700204995|gb|KGN60128.1| (hypothetical protein Csa_3G879460 [Cucumis sativus])

HSP 1 Score: 250.4 bits (638), Expect = 1.9e-63
Identity = 128/141 (90.78%), Positives = 136/141 (96.45%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +A RAFENQLAESILE  GQLAL+KLATATLEKLMPRIEGKGEFGQA WRLVYAPQIP+L
Sbjct: 1228 EATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQASWRLVYAPQIPTL 1287

Query: 63   LSFPTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRMV 122
            LSFPTTDPL+SLTSNISFGTVVEVQLGKRIQAS+IRQMKE+EMAMQWTFTYKLTSRLRMV
Sbjct: 1288 LSFPTTDPLQSLTSNISFGTVVEVQLGKRIQASMIRQMKETEMAMQWTFTYKLTSRLRMV 1347

Query: 123  LQSAPAQRALVLVEYSASSLD 144
            LQSAPAQR L+LVEYSA+SLD
Sbjct: 1348 LQSAPAQRTLLLVEYSATSLD 1368

BLAST of Cla020943 vs. NCBI nr
Match: gi|659132755|ref|XP_008466367.1| (PREDICTED: uncharacterized protein LOC103503795 isoform X3 [Cucumis melo])

HSP 1 Score: 244.6 bits (623), Expect = 1.1e-61
Identity = 126/141 (89.36%), Positives = 133/141 (94.33%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +A RAFENQLAESILE  GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVYAPQIP+L
Sbjct: 1627 EATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTL 1686

Query: 63   LSFPTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRMV 122
            LSFPTTDPL SLTSNIS GTVVEVQLGKRIQAS+IRQMKE+EMAMQW  TYKLTSRLRMV
Sbjct: 1687 LSFPTTDPLLSLTSNISIGTVVEVQLGKRIQASMIRQMKETEMAMQWMITYKLTSRLRMV 1746

Query: 123  LQSAPAQRALVLVEYSASSLD 144
            LQSAPAQR L+LVEYSA+SLD
Sbjct: 1747 LQSAPAQRTLLLVEYSATSLD 1767

BLAST of Cla020943 vs. NCBI nr
Match: gi|659132751|ref|XP_008466365.1| (PREDICTED: uncharacterized protein LOC103503795 isoform X1 [Cucumis melo])

HSP 1 Score: 244.6 bits (623), Expect = 1.1e-61
Identity = 126/141 (89.36%), Positives = 133/141 (94.33%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            +A RAFENQLAESILE  GQLAL+KLATATLEKLMPRIEGKGEFGQARWRLVYAPQIP+L
Sbjct: 2013 EATRAFENQLAESILESGGQLALEKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPTL 2072

Query: 63   LSFPTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRMV 122
            LSFPTTDPL SLTSNIS GTVVEVQLGKRIQAS+IRQMKE+EMAMQW  TYKLTSRLRMV
Sbjct: 2073 LSFPTTDPLLSLTSNISIGTVVEVQLGKRIQASMIRQMKETEMAMQWMITYKLTSRLRMV 2132

Query: 123  LQSAPAQRALVLVEYSASSLD 144
            LQSAPAQR L+LVEYSA+SLD
Sbjct: 2133 LQSAPAQRTLLLVEYSATSLD 2153

BLAST of Cla020943 vs. NCBI nr
Match: gi|763797600|gb|KJB64555.1| (hypothetical protein B456_010G053900 [Gossypium raimondii])

HSP 1 Score: 223.0 bits (567), Expect = 3.3e-55
Identity = 118/142 (83.10%), Positives = 128/142 (90.14%), Query Frame = 1

Query: 3    QAARAFENQLAESILEGDGQLALKKLATATLEKLMPRIEGKGEFGQARWRLVYAPQIPSL 62
            QAAR FE+QLAESILEGDGQLA KKLATATLE LMPRIEGKGEFGQARWRLVYAPQIPSL
Sbjct: 1982 QAARVFESQLAESILEGDGQLAFKKLATATLETLMPRIEGKGEFGQARWRLVYAPQIPSL 2041

Query: 63   LSF-PTTDPLKSLTSNISFGTVVEVQLGKRIQASIIRQMKESEMAMQWTFTYKLTSRLRM 122
            LS  PT DPLKSL SNISFGT VEVQLGKR+QASI+RQ+KESEMAMQWT  YKLTSRLR+
Sbjct: 2042 LSVDPTADPLKSLASNISFGTEVEVQLGKRLQASIVRQLKESEMAMQWTLIYKLTSRLRV 2101

Query: 123  VLQSAPAQRALVLVEYSASSLD 144
            +LQSAP++R  +L EYSA+S D
Sbjct: 2102 LLQSAPSKR--LLFEYSATSQD 2121

The following BLAST results are available for this feature:

vs. TrEMBL:
Match Name | E-value | Identity | Description
A0A0A0LEE3_CUCSA | 1.3e-63 | 90.78 | Uncharacterized protein OS=Cucumis sativus GN=Csa_3G879460 PE=4 SV=1
A0A0D2V631_GOSRA | 2.3e-55 | 83.10 | Uncharacterized protein OS=Gossypium raimondii GN=B456_010G053900 PE=4 SV=1
A0A0D2SN74_GOSRA | 5.1e-55 | 82.39 | Uncharacterized protein OS=Gossypium raimondii GN=B456_010G053900 PE=4 SV=1
A0A0D2U6V0_GOSRA | 5.1e-55 | 82.39 | Uncharacterized protein OS=Gossypium raimondii GN=B456_010G053900 PE=4 SV=1
A0A0B0MVS8_GOSAR | 5.1e-55 | 82.39 | Acetolactate synthase large subunit OS=Gossypium arboreum GN=F383_27597 PE=4 SV=...

vs. NCBI nr:
Match Name | E-value | Identity | Description
gi|778687064|ref|XP_011652499.1| | 1.9e-63 | 90.78 | PREDICTED: uncharacterized protein LOC101203544 [Cucumis sativus]
gi|700204995|gb|KGN60128.1| | 1.9e-63 | 90.78 | hypothetical protein Csa_3G879460 [Cucumis sativus]
gi|659132755|ref|XP_008466367.1| | 1.1e-61 | 89.36 | PREDICTED: uncharacterized protein LOC103503795 isoform X3 [Cucumis melo]
gi|659132751|ref|XP_008466365.1| | 1.1e-61 | 89.36 | PREDICTED: uncharacterized protein LOC103503795 isoform X1 [Cucumis melo]
gi|763797600|gb|KJB64555.1| | 3.3e-55 | 83.10 | hypothetical protein B456_010G053900 [Gossypium raimondii]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition

GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0009306 | protein secretion
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
cellular_component | GO:0016020 | membrane
cellular_component | GO:0048046 | apoplast
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005887 | integral component of plasma membrane
molecular_function | GO:0003674 | molecular_function

This gene is associated with the following unigenes:
Unigene Name | Analysis Name | Sequence type in Unigene
WMU44520 | watermelon EST collection version 2.0 | transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla020943 | Cla020943.1 | mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name | Unique Name | Type
WMU44520 | WMU44520 | transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | PANTHER | PTHR34457 | FAMILY NOT NAMED | coord: 15..143; score: 1.3
None | No IPR available | PANTHER | PTHR34457:SF1 | EMBRYO DEFECTIVE 2410 PROTEIN | coord: 15..143; score: 1.3

The following gene(s) are paralogous to this gene:

None