Cla002661 (gene) Watermelon (97103) v1

Name: Cla002661
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr3 : 13373687 .. 13374577 (+)
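
As a rough illustration of how the location line can be read, the sketch below parses it and computes the span length. It assumes the coordinates are 1-based and inclusive, as is common for GFF-style annotation; the function names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr3 : 13373687 .. 13374577 (+)'.

    Returns (chromosome, start, end, strand). The pattern is an assumption
    based only on the single location line shown in this record.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr3 : 13373687 .. 13374577 (+)")
length = span_length(start, end)
print(chrom, strand, length)
```

Under that assumption the feature spans 891 bp on the plus strand of Chr3, which would correspond to 297 codons if the whole span is coding.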
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGACGCCTACAGACGCTGAAATGAATGACGCCCTCACAATCAACGCCAAATCGGGGTCAGAGTGGAACACTTCCCCAAAGGGCATTCAATCATTGGCGCCTAACTGCTTGATCGCCGAGGCCAACCTGTGGCTTTATTTCATCAAGCGGTTGCTGCTTCCCACAACGCATGACGCCTCAATTTCAAGAGACCGGGCCATGGTCATTTACTGCATCATGTGGGGAATTCAGCTGGACGTGGGACGCATCATTGCCCCACAGATTCGGGGGTTATTCTTCAAGCCAAGGGGTCAGTTATTCTTCCCCTTTCTTGTGACTAGAATGTGCGCCAACACACAATTCATGGAGGATGCGCCAATCAGATCAGTAAACGGAGTCCTTTCAGCGCAGAGTTTGAGACGAATATTGAAGGACTCACCGCATCTGCTTGACGTGGCCAACTCAAAGAAGAGGCCGCCAAAGTCACAAGAACCATCATCCCCCCAACCTCCTCAACCAAAGAAGAGGAAACTAGTCAAGAAGAATTTCGAGATTCAGGCGTCAAGTTCACAACCTGGTGAAGCCGAAGAAGCTCTTCAGCTCGACGCATACACCCAAGCCTTGACCATTTATACTCATCCCATCGCCCCAATCACTGAGGAGCCCTCTTCACCACCAACTTCCCCTTATCCTTCCCCGAAAATCCAAAATGAGCCACTCCATCTCCCTACCACCCAACGCAGCATTCCAACGCCTCTTCAGCTCCCTATCGCAGACTTAAATGAAGAGACGAAGGAAGTGTCGCCGCCTTCTTCGCCCCACCTGGAGTTGACCCTCTCCCCGCCTCAAGTATCCACCTTTCGAGAGGCAACTCCACCACCACCGCACCACCCGTTGAGCCGACCCTGA

mRNA sequence

ATGTCGACGCCTACAGACGCTGAAATGAATGACGCCCTCACAATCAACGCCAAATCGGGGTCAGAGTGGAACACTTCCCCAAAGGGCATTCAATCATTGGCGCCTAACTGCTTGATCGCCGAGGCCAACCTGTGGCTTTATTTCATCAAGCGGTTGCTGCTTCCCACAACGCATGACGCCTCAATTTCAAGAGACCGGGCCATGGTCATTTACTGCATCATGTGGGGAATTCAGCTGGACGTGGGACGCATCATTGCCCCACAGATTCGGGGGTTATTCTTCAAGCCAAGGGGTCAGTTATTCTTCCCCTTTCTTGTGACTAGAATGTGCGCCAACACACAATTCATGGAGGATGCGCCAATCAGATCAGTAAACGGAGTCCTTTCAGCGCAGAGTTTGAGACGAATATTGAAGGACTCACCGCATCTGCTTGACGTGGCCAACTCAAAGAAGAGGCCGCCAAAGTCACAAGAACCATCATCCCCCCAACCTCCTCAACCAAAGAAGAGGAAACTAGTCAAGAAGAATTTCGAGATTCAGGCGTCAAGTTCACAACCTGGTGAAGCCGAAGAAGCTCTTCAGCTCGACGCATACACCCAAGCCTTGACCATTTATACTCATCCCATCGCCCCAATCACTGAGGAGCCCTCTTCACCACCAACTTCCCCTTATCCTTCCCCGAAAATCCAAAATGAGCCACTCCATCTCCCTACCACCCAACGCAGCATTCCAACGCCTCTTCAGCTCCCTATCGCAGACTTAAATGAAGAGACGAAGGAAGTGTCGCCGCCTTCTTCGCCCCACCTGGAGTTGACCCTCTCCCCGCCTCAAGTATCCACCTTTCGAGAGGCAACTCCACCACCACCGCACCACCCGTTGAGCCGACCCTGA

Coding sequence (CDS)

ATGTCGACGCCTACAGACGCTGAAATGAATGACGCCCTCACAATCAACGCCAAATCGGGGTCAGAGTGGAACACTTCCCCAAAGGGCATTCAATCATTGGCGCCTAACTGCTTGATCGCCGAGGCCAACCTGTGGCTTTATTTCATCAAGCGGTTGCTGCTTCCCACAACGCATGACGCCTCAATTTCAAGAGACCGGGCCATGGTCATTTACTGCATCATGTGGGGAATTCAGCTGGACGTGGGACGCATCATTGCCCCACAGATTCGGGGGTTATTCTTCAAGCCAAGGGGTCAGTTATTCTTCCCCTTTCTTGTGACTAGAATGTGCGCCAACACACAATTCATGGAGGATGCGCCAATCAGATCAGTAAACGGAGTCCTTTCAGCGCAGAGTTTGAGACGAATATTGAAGGACTCACCGCATCTGCTTGACGTGGCCAACTCAAAGAAGAGGCCGCCAAAGTCACAAGAACCATCATCCCCCCAACCTCCTCAACCAAAGAAGAGGAAACTAGTCAAGAAGAATTTCGAGATTCAGGCGTCAAGTTCACAACCTGGTGAAGCCGAAGAAGCTCTTCAGCTCGACGCATACACCCAAGCCTTGACCATTTATACTCATCCCATCGCCCCAATCACTGAGGAGCCCTCTTCACCACCAACTTCCCCTTATCCTTCCCCGAAAATCCAAAATGAGCCACTCCATCTCCCTACCACCCAACGCAGCATTCCAACGCCTCTTCAGCTCCCTATCGCAGACTTAAATGAAGAGACGAAGGAAGTGTCGCCGCCTTCTTCGCCCCACCTGGAGTTGACCCTCTCCCCGCCTCAAGTATCCACCTTTCGAGAGGCAACTCCACCACCACCGCACCACCCGTTGAGCCGACCCTGA

Protein sequence

MSTPTDAEMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRDRAMVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRMCANTQFMEDAPIRSVNGVLSAQSLRRILKDSPHLLDVANSKKRPPKSQEPSSPQPPQPKKRKLVKKNFEIQASSSQPGEAEEALQLDAYTQALTIYTHPIAPITEEPSSPPTSPYPSPKIQNEPLHLPTTQRSIPTPLQLPIADLNEETKEVSPPSSPHLELTLSPPQVSTFREATPPPPHHPLSRP
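
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and a hypothetical helper named `translate`; only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last five codons of the CDS listed above.
cds_start = "ATGTCGACGCCTACAGACGCTGAA"
cds_end = "TTGAGCCGACCCTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MSTPTDAE` and `LSRP`, matching the start and end of the protein sequence in this record.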
BLAST of Cla002661 vs. TrEMBL
Match: W9RBS1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000844 PE=4 SV=1)

HSP 1 Score: 75.5 bits (184), Expect = 1.2e-10
Identity = 45/157 (28.66%), Positives = 80/157 (50.96%), Query Frame = 1

Query: 6   DAEMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRD 65
           + ++ + L   A  G++W  S KG  +   + L   A +W +F+   LL +TH  +ISR+
Sbjct: 154 EEQLKEVLKTIAILGAQWLLSAKGSYTCNRHELQPAAKVWYHFLASRLLLSTHGKTISRN 213

Query: 66  RAMVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRMCANTQFMEDAPIRSVN 125
           RA+++Y ++ G  ++VGR+I  QIR    K +G L+FP L++ +C  +    +A    + 
Sbjct: 214 RAILLYAVLVGKPINVGRLIIDQIRACAEKGKGGLYFPSLISELCIQSHVAWEASEPRLR 273

Query: 126 GVLSAQSLRRILKDSPHLLDVANSKKRPPKSQEPSSP 163
               A  L  I + S    + +   +   +  EPS P
Sbjct: 274 NT-GAMDLVAITRISSGRSEKSEKGEEEEEQDEPSRP 309
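
The percentages on the HSP summary line are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows; the helper name `hsp_percentages` is hypothetical, and the regular expression is an assumption based on the summary lines shown in this record.

```python
import re


def hsp_percentages(summary_line):
    """Return {label: percentage string} for each 'Label = n/len' pair."""
    results = {}
    for label, count, aligned in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        # Percentage = matching positions / aligned positions * 100
        results[label] = f"{100.0 * int(count) / int(aligned):.2f}"
    return results


line = "Identity = 45/157 (28.66%), Positives = 80/157 (50.96%), Query Frame = 1"
print(hsp_percentages(line))
```

For the first TrEMBL hit this reproduces the reported values: 45 identical positions out of 157 aligned gives 28.66%, and 80 positive-scoring positions out of 157 gives 50.96%.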

BLAST of Cla002661 vs. TrEMBL
Match: W9QTD9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 4.7e-10
Identity = 41/138 (29.71%), Positives = 73/138 (52.90%), Query Frame = 1

Query: 2   STPTDAEMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDAS 61
           S  TD ++   L   A  G+ W  SP+G  +     L   A +W +F+    +P+TH  +
Sbjct: 78  SEVTDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWYHFLTARFMPSTHGKT 137

Query: 62  ISRDRAMVIYCIMWGIQLDVGRIIAPQIRGL-FFKPRGQLFFPFLVTRMC--ANTQFMED 121
           +++DR +++Y I+ GI +++  I   +I+     + RG L+FP L+T++   AN  + +D
Sbjct: 138 VAKDRVLLLYSILTGISVNIEEITIKEIKACSAARKRGGLYFPSLITQLWLKANVPYHKD 197

Query: 122 APIRSVNGVLSAQSLRRI 137
             I    G +S  S+ RI
Sbjct: 198 EAIVHNAGAISTLSISRI 215

BLAST of Cla002661 vs. TrEMBL
Match: W9S7D3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006967 PE=4 SV=1)

HSP 1 Score: 70.5 bits (171), Expect = 4.0e-09
Identity = 30/103 (29.13%), Positives = 61/103 (59.22%), Query Frame = 1

Query: 8   EMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRDRA 67
           ++++ +      G+EW  + +G  +    CL     +W +F++  L+P++H   + ++RA
Sbjct: 123 QLDEVINELCVEGTEWWRATRGSMTFPKECLQPGPKIWYHFLRFRLMPSSHYRLVHKERA 182

Query: 68  MVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRMC 111
           +++YC+M G  L+VGR+I  Q+     +  G L+FP L+T++C
Sbjct: 183 ILLYCMMKGRPLNVGRMIRQQVGVCAGRKNGGLWFPSLITQLC 225

BLAST of Cla002661 vs. TrEMBL
Match: A0A0A0LAN0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G263250 PE=4 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 2.2e-07
Identity = 34/96 (35.42%), Positives = 57/96 (59.38%), Query Frame = 1

Query: 44  LWLYFIKRLLLPTTHDASISRDRAMVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFP 103
           +WL  IK+ ++PT H ++IS +R M++YCIM  I +++G+II+  I  L   PRG   F 
Sbjct: 28  IWLVLIKKKIMPTRHVSTISMERVMLVYCIMKKILVNIGKIISNHIIALVKHPRGARPFS 87

Query: 104 FLVTRMCANTQFM-EDAP-IRSVNGVLSAQSLRRIL 138
           +L+ ++C     M E  P +   +G+    +L RI+
Sbjct: 88  YLIEQLCLRACLMLEKLPQVEVKDGIWLPSTLHRII 123

BLAST of Cla002661 vs. TrEMBL
Match: A0A061F2U9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_024087 PE=4 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 1.4e-06
Identity = 31/92 (33.70%), Positives = 53/92 (57.61%), Query Frame = 1

Query: 20  GSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRDRAMVIYCIMWGIQL 79
           G++W TS     S   + +  E  +WL+F+   LL +TH + +++DRA++IY I+    +
Sbjct: 96  GAQWKTSHDEPVSFKRSVMKKELQVWLHFVAARLLSSTHISDVTKDRAVLIYAIVAHKSI 155

Query: 80  DVGRIIAPQIRGLFFKPRGQLFFPFLVTRMCA 112
           DVG++I+  I       R  + FP L+T +CA
Sbjct: 156 DVGKVISHAILHTGRTKRDGIGFPSLITALCA 187

BLAST of Cla002661 vs. NCBI nr
Match: gi|703082781|ref|XP_010092041.1| (hypothetical protein L484_000844 [Morus notabilis])

HSP 1 Score: 75.5 bits (184), Expect = 1.8e-10
Identity = 45/157 (28.66%), Positives = 80/157 (50.96%), Query Frame = 1

Query: 6   DAEMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRD 65
           + ++ + L   A  G++W  S KG  +   + L   A +W +F+   LL +TH  +ISR+
Sbjct: 154 EEQLKEVLKTIAILGAQWLLSAKGSYTCNRHELQPAAKVWYHFLASRLLLSTHGKTISRN 213

Query: 66  RAMVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRMCANTQFMEDAPIRSVN 125
           RA+++Y ++ G  ++VGR+I  QIR    K +G L+FP L++ +C  +    +A    + 
Sbjct: 214 RAILLYAVLVGKPINVGRLIIDQIRACAEKGKGGLYFPSLISELCIQSHVAWEASEPRLR 273

Query: 126 GVLSAQSLRRILKDSPHLLDVANSKKRPPKSQEPSSP 163
               A  L  I + S    + +   +   +  EPS P
Sbjct: 274 NT-GAMDLVAITRISSGRSEKSEKGEEEEEQDEPSRP 309

BLAST of Cla002661 vs. NCBI nr
Match: gi|703087370|ref|XP_010093253.1| (hypothetical protein L484_022412 [Morus notabilis])

HSP 1 Score: 73.6 bits (179), Expect = 6.7e-10
Identity = 41/138 (29.71%), Positives = 73/138 (52.90%), Query Frame = 1

Query: 2   STPTDAEMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDAS 61
           S  TD ++   L   A  G+ W  SP+G  +     L   A +W +F+    +P+TH  +
Sbjct: 78  SEVTDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWYHFLTARFMPSTHGKT 137

Query: 62  ISRDRAMVIYCIMWGIQLDVGRIIAPQIRGL-FFKPRGQLFFPFLVTRMC--ANTQFMED 121
           +++DR +++Y I+ GI +++  I   +I+     + RG L+FP L+T++   AN  + +D
Sbjct: 138 VAKDRVLLLYSILTGISVNIEEITIKEIKACSAARKRGGLYFPSLITQLWLKANVPYHKD 197

Query: 122 APIRSVNGVLSAQSLRRI 137
             I    G +S  S+ RI
Sbjct: 198 EAIVHNAGAISTLSISRI 215

BLAST of Cla002661 vs. NCBI nr
Match: gi|703121924|ref|XP_010102456.1| (hypothetical protein L484_006967 [Morus notabilis])

HSP 1 Score: 70.5 bits (171), Expect = 5.7e-09
Identity = 30/103 (29.13%), Positives = 61/103 (59.22%), Query Frame = 1

Query: 8   EMNDALTINAKSGSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRDRA 67
           ++++ +      G+EW  + +G  +    CL     +W +F++  L+P++H   + ++RA
Sbjct: 123 QLDEVINELCVEGTEWWRATRGSMTFPKECLQPGPKIWYHFLRFRLMPSSHYRLVHKERA 182

Query: 68  MVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRMC 111
           +++YC+M G  L+VGR+I  Q+     +  G L+FP L+T++C
Sbjct: 183 ILLYCMMKGRPLNVGRMIRQQVGVCAGRKNGGLWFPSLITQLC 225

BLAST of Cla002661 vs. NCBI nr
Match: gi|700202594|gb|KGN57727.1| (hypothetical protein Csa_3G263250 [Cucumis sativus])

HSP 1 Score: 64.7 bits (156), Expect = 3.1e-07
Identity = 34/96 (35.42%), Positives = 57/96 (59.38%), Query Frame = 1

Query: 44  LWLYFIKRLLLPTTHDASISRDRAMVIYCIMWGIQLDVGRIIAPQIRGLFFKPRGQLFFP 103
           +WL  IK+ ++PT H ++IS +R M++YCIM  I +++G+II+  I  L   PRG   F 
Sbjct: 28  IWLVLIKKKIMPTRHVSTISMERVMLVYCIMKKILVNIGKIISNHIIALVKHPRGARPFS 87

Query: 104 FLVTRMCANTQFM-EDAP-IRSVNGVLSAQSLRRIL 138
           +L+ ++C     M E  P +   +G+    +L RI+
Sbjct: 88  YLIEQLCLRACLMLEKLPQVEVKDGIWLPSTLHRII 123

BLAST of Cla002661 vs. NCBI nr
Match: gi|590634332|ref|XP_007028347.1| (Uncharacterized protein TCM_024087 [Theobroma cacao])

HSP 1 Score: 62.0 bits (149), Expect = 2.0e-06
Identity = 31/92 (33.70%), Positives = 53/92 (57.61%), Query Frame = 1

Query: 20  GSEWNTSPKGIQSLAPNCLIAEANLWLYFIKRLLLPTTHDASISRDRAMVIYCIMWGIQL 79
           G++W TS     S   + +  E  +WL+F+   LL +TH + +++DRA++IY I+    +
Sbjct: 96  GAQWKTSHDEPVSFKRSVMKKELQVWLHFVAARLLSSTHISDVTKDRAVLIYAIVAHKSI 155

Query: 80  DVGRIIAPQIRGLFFKPRGQLFFPFLVTRMCA 112
           DVG++I+  I       R  + FP L+T +CA
Sbjct: 156 DVGKVISHAILHTGRTKRDGIGFPSLITALCA 187

The following BLAST results are available for this feature:
Match Name          E-value   Identity (%)   Description
W9RBS1_9ROSA        1.2e-10   28.66          Uncharacterized protein OS=Morus notabilis GN=L484_000844 PE=4 SV=1
W9QTD9_9ROSA        4.7e-10   29.71          Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1
W9S7D3_9ROSA        4.0e-09   29.13          Uncharacterized protein OS=Morus notabilis GN=L484_006967 PE=4 SV=1
A0A0A0LAN0_CUCSA    2.2e-07   35.42          Uncharacterized protein OS=Cucumis sativus GN=Csa_3G263250 PE=4 SV=1
A0A061F2U9_THECC    1.4e-06   33.70          Uncharacterized protein OS=Theobroma cacao GN=TCM_024087 PE=4 SV=1

Match Name                          E-value   Identity (%)   Description
gi|703082781|ref|XP_010092041.1|    1.8e-10   28.66          hypothetical protein L484_000844 [Morus notabilis]
gi|703087370|ref|XP_010093253.1|    6.7e-10   29.71          hypothetical protein L484_022412 [Morus notabilis]
gi|703121924|ref|XP_010102456.1|    5.7e-09   29.13          hypothetical protein L484_006967 [Morus notabilis]
gi|700202594|gb|KGN57727.1|         3.1e-07   35.42          hypothetical protein Csa_3G263250 [Cucumis sativus]
gi|590634332|ref|XP_007028347.1|    2.0e-06   33.70          Uncharacterized protein TCM_024087 [Theobroma cacao]

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
cellular_component    GO:0005575       cellular_component
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Cla002661      Cla002661.1   mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None