Cla004822 (gene) Watermelon (97103) v1

Name: Cla004822
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr10: 11076098..11076430 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGTTTTTTTCACTCTCTCATCTTTACCTTCCACATTCTCGCCTCCACTTCTATCTCCGCCCAGCGCACCGCCGCCGCCGCCCCTCCCTGCCGATTATATGCGGCGCGCTCACCGTCAAGTATCCGTTTGGCACAGGCTACGGCTGCGGTTCTCCGCGGTTTTCCTCTCACGTGACTTGCTCCTCAGACGACCGACTCCTCCTCAACACACACACCGGCGATTATCCAATCACATCGATTTCATACTCCGATTCAACCGTCGTAATCTCGCCGCCGTCCATGTCGACTTGCTCTAAAATGCACGAATTTAGAACCCTAGGCATCGACTAG

mRNA sequence

ATGGAGTTTTTTTCACTCTCTCATCTTTACCTTCCACATTCTCGCCTCCACTTCTATCTCCGCCCAGCGCACCGCCGCCGCCGCCCCTCCCTGCCGATTATATGCGGCGCGCTCACCGTCAAGTATCCGTTTGGCACAGGCTACGGCTGCGGTTCTCCGCGGTTTTCCTCTCACGTGACTTGCTCCTCAGACGACCGACTCCTCCTCAACACACACACCGGCGATTATCCAATCACATCGATTTCATACTCCGATTCAACCGTCGTAATCTCGCCGCCGTCCATGTCGACTTGCTCTAAAATGCACGAATTTAGAACCCTAGGCATCGACTAG

Coding sequence (CDS)

ATGGAGTTTTTTTCACTCTCTCATCTTTACCTTCCACATTCTCGCCTCCACTTCTATCTCCGCCCAGCGCACCGCCGCCGCCGCCCCTCCCTGCCGATTATATGCGGCGCGCTCACCGTCAAGTATCCGTTTGGCACAGGCTACGGCTGCGGTTCTCCGCGGTTTTCCTCTCACGTGACTTGCTCCTCAGACGACCGACTCCTCCTCAACACACACACCGGCGATTATCCAATCACATCGATTTCATACTCCGATTCAACCGTCGTAATCTCGCCGCCGTCCATGTCGACTTGCTCTAAAATGCACGAATTTAGAACCCTAGGCATCGACTAG

Protein sequence

MEFFSLSHLYLPHSRLHFYLRPAHRRRRPSLPIICGALTVKYPFGTGYGCGSPRFSSHVTCSSDDRLLLNTHTGDYPITSISYSDSTVVISPPSMSTCSKMHEFRTLGID
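The relationship between the location, the CDS, and the protein sequence above can be checked with a short script. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and 1-based inclusive genome coordinates; the helper names `span_length` and `translate` are illustrative and not part of the database. Only the first 30 nt of the CDS are used in the demonstration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Chr10: 11076098..11076430 spans 333 bp, i.e. 111 codons including the stop.
    print(span_length(11076098, 11076430))
    # First 30 nt of the CDS listed above.
    print(translate("ATGGAGTTTTTTTCACTCTCTCATCTTTAC"))  # MEFFSLSHLY
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above if the assumptions hold.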
BLAST of Cla004822 vs. TrEMBL
Match: A0A061DHT7_THECC (Membrane lipoprotein OS=Theobroma cacao GN=TCM_001086 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 9.5e-25
Identity = 55/76 (72.37%), Positives = 63/76 (82.89%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSDDRLLLNTHTGDYPITSISYSDSTVVISPPS 94
           CG+L VKYPFGTGYGCGSPRF  +VTCSSD RLLL THTG YPITS+SY DST++I+PP 
Sbjct: 33  CGSLQVKYPFGTGYGCGSPRFQPYVTCSSD-RLLLTTHTGSYPITSVSYKDSTLIITPPY 92

Query: 95  MSTCSKMHEFRTLGID 111
           MSTCS M +   LG+D
Sbjct: 93  MSTCSSMQQSPNLGLD 107

BLAST of Cla004822 vs. TrEMBL
Match: B9T3R3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0038740 PE=4 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 3.0e-23
Identity = 52/77 (67.53%), Positives = 63/77 (81.82%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLNTHTGDYPITSISYSDSTVVISPP 94
           CG+L VKYPFGT YGCGSPRF  ++TC+S  D+LLL THTG YPITSISY+ +T+ ISPP
Sbjct: 33  CGSLQVKYPFGTAYGCGSPRFYPYITCASGGDQLLLTTHTGSYPITSISYTATTITISPP 92

Query: 95  SMSTCSKMHEFRTLGID 111
           SMSTC+ MH+   LG+D
Sbjct: 93  SMSTCTSMHQSPNLGLD 109

BLAST of Cla004822 vs. TrEMBL
Match: A0A059AMD5_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00510 PE=4 SV=1)

HSP 1 Score: 113.6 bits (283), Expect = 1.5e-22
Identity = 56/98 (57.14%), Positives = 68/98 (69.39%), Query Frame = 1

Query: 16  LHFYLRPAHRRRRPSLPIICGALTVKYPFGTGYGCGSPRFSSHVTC---SSDDRLLLNTH 75
           L  +LRPA     P+    CG+LTVKYPFGTGYGCGSPRF  +VTC     DD L+L TH
Sbjct: 13  LLLHLRPAESPN-PACRNACGSLTVKYPFGTGYGCGSPRFHPYVTCVHGQDDDTLVLTTH 72

Query: 76  TGDYPITSISYSDSTVVISPPSMSTCSKMHEFRTLGID 111
           TG YP+TSISY+ S+++ISP  MSTC+ M     LG+D
Sbjct: 73  TGSYPVTSISYTTSSLIISPSDMSTCTSMKPSPNLGLD 109

BLAST of Cla004822 vs. TrEMBL
Match: B9HFF1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05260g PE=4 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 8.3e-21
Identity = 47/77 (61.04%), Positives = 61/77 (79.22%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLNTHTGDYPITSISYSDSTVVISPP 94
           CG++ VKYPFG+G+GCGSPRF  ++ CS + D+LLL THTG YPITSISY+ ST +I+PP
Sbjct: 37  CGSIQVKYPFGSGHGCGSPRFHPYIACSPEGDQLLLTTHTGSYPITSISYTTSTFIITPP 96

Query: 95  SMSTCSKMHEFRTLGID 111
            MSTC+ M +   LG+D
Sbjct: 97  HMSTCTSMQQSPNLGLD 113

BLAST of Cla004822 vs. TrEMBL
Match: A0A0D2PS33_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G096700 PE=4 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 2.4e-20
Identity = 47/76 (61.84%), Positives = 60/76 (78.95%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSDDRLLLNTHTGDYPITSISYSDSTVVISPPS 94
           CG+L +KYPFGTGYGCGSPRF  ++TC S ++LLL THTG Y IT+ISY DST+ I+P +
Sbjct: 34  CGSLQIKYPFGTGYGCGSPRFEPYITCKS-NQLLLTTHTGSYLITAISYKDSTLTITPSA 93

Query: 95  MSTCSKMHEFRTLGID 111
           MSTC+ M +   LG+D
Sbjct: 94  MSTCNSMQQSPNLGLD 108

BLAST of Cla004822 vs. NCBI nr
Match: gi|590707106|ref|XP_007047912.1| (Membrane lipoprotein [Theobroma cacao])

HSP 1 Score: 120.9 bits (302), Expect = 1.4e-24
Identity = 55/76 (72.37%), Positives = 63/76 (82.89%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSDDRLLLNTHTGDYPITSISYSDSTVVISPPS 94
           CG+L VKYPFGTGYGCGSPRF  +VTCSSD RLLL THTG YPITS+SY DST++I+PP 
Sbjct: 33  CGSLQVKYPFGTGYGCGSPRFQPYVTCSSD-RLLLTTHTGSYPITSVSYKDSTLIITPPY 92

Query: 95  MSTCSKMHEFRTLGID 111
           MSTCS M +   LG+D
Sbjct: 93  MSTCSSMQQSPNLGLD 107

BLAST of Cla004822 vs. NCBI nr
Match: gi|255584291|ref|XP_002532882.1| (PREDICTED: uncharacterized protein LOC8281252 [Ricinus communis])

HSP 1 Score: 115.9 bits (289), Expect = 4.4e-23
Identity = 52/77 (67.53%), Positives = 63/77 (81.82%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLNTHTGDYPITSISYSDSTVVISPP 94
           CG+L VKYPFGT YGCGSPRF  ++TC+S  D+LLL THTG YPITSISY+ +T+ ISPP
Sbjct: 33  CGSLQVKYPFGTAYGCGSPRFYPYITCASGGDQLLLTTHTGSYPITSISYTATTITISPP 92

Query: 95  SMSTCSKMHEFRTLGID 111
           SMSTC+ MH+   LG+D
Sbjct: 93  SMSTCTSMHQSPNLGLD 109

BLAST of Cla004822 vs. NCBI nr
Match: gi|629088297|gb|KCW54550.1| (hypothetical protein EUGRSUZ_I00510 [Eucalyptus grandis])

HSP 1 Score: 113.6 bits (283), Expect = 2.2e-22
Identity = 56/98 (57.14%), Positives = 68/98 (69.39%), Query Frame = 1

Query: 16  LHFYLRPAHRRRRPSLPIICGALTVKYPFGTGYGCGSPRFSSHVTC---SSDDRLLLNTH 75
           L  +LRPA     P+    CG+LTVKYPFGTGYGCGSPRF  +VTC     DD L+L TH
Sbjct: 13  LLLHLRPAESPN-PACRNACGSLTVKYPFGTGYGCGSPRFHPYVTCVHGQDDDTLVLTTH 72

Query: 76  TGDYPITSISYSDSTVVISPPSMSTCSKMHEFRTLGID 111
           TG YP+TSISY+ S+++ISP  MSTC+ M     LG+D
Sbjct: 73  TGSYPVTSISYTTSSLIISPSDMSTCTSMKPSPNLGLD 109

BLAST of Cla004822 vs. NCBI nr
Match: gi|702460129|ref|XP_010027908.1| (PREDICTED: wall-associated receptor kinase-like 20 [Eucalyptus grandis])

HSP 1 Score: 113.6 bits (283), Expect = 2.2e-22
Identity = 56/98 (57.14%), Positives = 68/98 (69.39%), Query Frame = 1

Query: 16  LHFYLRPAHRRRRPSLPIICGALTVKYPFGTGYGCGSPRFSSHVTC---SSDDRLLLNTH 75
           L  +LRPA     P+    CG+LTVKYPFGTGYGCGSPRF  +VTC     DD L+L TH
Sbjct: 43  LLLHLRPAESPN-PACRNACGSLTVKYPFGTGYGCGSPRFHPYVTCVHGQDDDTLVLTTH 102

Query: 76  TGDYPITSISYSDSTVVISPPSMSTCSKMHEFRTLGID 111
           TG YP+TSISY+ S+++ISP  MSTC+ M     LG+D
Sbjct: 103 TGSYPVTSISYTTSSLIISPSDMSTCTSMKPSPNLGLD 139

BLAST of Cla004822 vs. NCBI nr
Match: gi|224093746|ref|XP_002309973.1| (hypothetical protein POPTR_0007s05260g [Populus trichocarpa])

HSP 1 Score: 107.8 bits (268), Expect = 1.2e-20
Identity = 47/77 (61.04%), Positives = 61/77 (79.22%), Query Frame = 1

Query: 35  CGALTVKYPFGTGYGCGSPRFSSHVTCSSD-DRLLLNTHTGDYPITSISYSDSTVVISPP 94
           CG++ VKYPFG+G+GCGSPRF  ++ CS + D+LLL THTG YPITSISY+ ST +I+PP
Sbjct: 37  CGSIQVKYPFGSGHGCGSPRFHPYIACSPEGDQLLLTTHTGSYPITSISYTTSTFIITPP 96

Query: 95  SMSTCSKMHEFRTLGID 111
            MSTC+ M +   LG+D
Sbjct: 97  HMSTCTSMQQSPNLGLD 113

The following BLAST results are available for this feature:
BLAST of Cla004822 vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A061DHT7_THECC  9.5e-25  72.37         Membrane lipoprotein OS=Theobroma cacao GN=TCM_001086 PE=4 SV=1
B9T3R3_RICCO      3.0e-23  67.53         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0038740 PE=4 SV=1
A0A059AMD5_EUCGR  1.5e-22  57.14         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_I00510 PE=4 SV=1
B9HFF1_POPTR      8.3e-21  61.04         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05260g PE=4 SV=1
A0A0D2PS33_GOSRA  2.4e-20  61.84         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G096700 PE=4 SV=1

BLAST of Cla004822 vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|590707106|ref|XP_007047912.1|  1.4e-24  72.37         Membrane lipoprotein [Theobroma cacao]
gi|255584291|ref|XP_002532882.1|  4.4e-23  67.53         PREDICTED: uncharacterized protein LOC8281252 [Ricinus communis]
gi|629088297|gb|KCW54550.1|       2.2e-22  57.14         hypothetical protein EUGRSUZ_I00510 [Eucalyptus grandis]
gi|702460129|ref|XP_010027908.1|  2.2e-22  57.14         PREDICTED: wall-associated receptor kinase-like 20 [Eucalyptus grandis]
gi|224093746|ref|XP_002309973.1|  1.2e-20  61.04         hypothetical protein POPTR_0007s05260g [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR025287  WAK_GUB
Vocabulary: Molecular Function
Term        Definition
GO:0030247  polysaccharide binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0030247 polysaccharide binding

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla004822     Cla004822.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                                               Source   Source Term    Source Description   Alignment
IPR025287  Wall-associated receptor kinase, galacturonan-binding domain  PFAM     PF13947        GUB_WAK_bind         coord: 34..102; score: 1.2
None       No IPR available                                              PANTHER  PTHR33355      FAMILY NOT NAMED     coord: 35..110; score: 5.6
None       No IPR available                                              PANTHER  PTHR33355:SF4  SUBFAMILY NOT NAMED  coord: 35..110; score: 5.6
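
The alignment coordinates in the InterPro table can be mapped back onto the protein sequence to see which residues each annotation covers. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for InterProScan output, but not stated on this page); `domain_slice` is a hypothetical helper name, and the protein string is copied from the Protein sequence section.

```python
# Protein sequence of Cla004822 as listed in the Protein sequence section.
PROTEIN = (
    "MEFFSLSHLYLPHSRLHFYLRPAHRRRRPSLPIICGALTVKYPFGTGYGCGSPRF"
    "SSHVTCSSDDRLLLNTHTGDYPITSISYSDSTVVISPPSMSTCSKMHEFRTLGID"
)


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


if __name__ == "__main__":
    # PF13947 (GUB_WAK_bind), coord: 34..102
    print(domain_slice(PROTEIN, 34, 102))
    # PTHR33355, coord: 35..110
    print(domain_slice(PROTEIN, 35, 110))
```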

The following gene(s) are orthologous to this gene:
Gene       Orthologue       Organism                   Block
Cla004822  Cla97C10G192240  Watermelon (97103) v2      wmwmbB059
Cla004822  Lsi03G009300     Bottle gourd (USVL1VR-Ls)  lsiwmB241
Cla004822  Bhi11G000836     Wax gourd                  wgowmB029

The following gene(s) are paralogous to this gene:

None