Cla003479 (gene) Watermelon (97103) v1

Name: Cla003479
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Unknown Protein (AHRD V1)
Location: Chr9 : 28049757 .. 28050452 (-)
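
As a quick consistency check, the length of the feature can be derived from the Location field. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for records of this kind but is not stated on this page; `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(text):
    """Split a location string such as 'Chr9 : 28049757 .. 28050452 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr9 : 28049757 .. 28050452 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the span is 696 bp, which is divisible by three and would be consistent with a single-exon gene encoding 231 residues plus a stop codon.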
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCAATGCCTACAGAAGCCGAAATGAATGATGCCCTCACAACCATCGCCAAACCAGGGTCAGAGTGGAACACTTCCCCAAAGGGCATTCAAACATTGGCGCCAACTTACTTGATCGCCGAGGCCAACCAGTGGCTTTACTTCATCAAGCGGTCGCTAATCCCCACAACGCATGACGCCTCAATTTCAAAGGATCGTGCCTTGGTCATTTACTGCATCATGCGAGGAATTCGGCTAGACGTGGGACGCATCATTGCCCCACAGATTCGGGGTTTGTTTTTCAAGCCAAGGGGTCAGCTATTCTTCCCCTTTCTTGTGACAAGACTGTGCGCCAACGCAGAGTTCATGGAGAATGCGCCAGTCAGAGCAGTAAATGGAGTCCTTTCAGCCCAAAGTCTCAGACAGATATTAAAGGACTCCCAGCATCTACTTGAGGTGGCCAACTCAAAGAAGAGGACGACCAAGTCACAAGAACCATCATCTCCCCAACCTTCTCAACCCAAGAAGAGGAAACTGGTCAAGAAGAATTTTGAGATACAGACGTCGAGTTCACAACCTAGTGAAGCCGAAGAAACTGTTCAGCTCGACGCCAACACACAAGCCTTGACCATTTATACTCCTCCCATTGACCCAATCACTGAGGAGCCCTCTTCACCACCAACTTCCCCTTCATCTTCCCCTAAAATCCAAAAATAG

mRNA sequence

ATGTCAATGCCTACAGAAGCCGAAATGAATGATGCCCTCACAACCATCGCCAAACCAGGGTCAGAGTGGAACACTTCCCCAAAGGGCATTCAAACATTGGCGCCAACTTACTTGATCGCCGAGGCCAACCAGTGGCTTTACTTCATCAAGCGGTCGCTAATCCCCACAACGCATGACGCCTCAATTTCAAAGGATCGTGCCTTGGTCATTTACTGCATCATGCGAGGAATTCGGCTAGACGTGGGACGCATCATTGCCCCACAGATTCGGGGTTTGTTTTTCAAGCCAAGGGGTCAGCTATTCTTCCCCTTTCTTGTGACAAGACTGTGCGCCAACGCAGAGTTCATGGAGAATGCGCCAGTCAGAGCAGTAAATGGAGTCCTTTCAGCCCAAAGTCTCAGACAGATATTAAAGGACTCCCAGCATCTACTTGAGGTGGCCAACTCAAAGAAGAGGACGACCAAGTCACAAGAACCATCATCTCCCCAACCTTCTCAACCCAAGAAGAGGAAACTGGTCAAGAAGAATTTTGAGATACAGACGTCGAGTTCACAACCTAGTGAAGCCGAAGAAACTGTTCAGCTCGACGCCAACACACAAGCCTTGACCATTTATACTCCTCCCATTGACCCAATCACTGAGGAGCCCTCTTCACCACCAACTTCCCCTTCATCTTCCCCTAAAATCCAAAAATAG

Coding sequence (CDS)

ATGTCAATGCCTACAGAAGCCGAAATGAATGATGCCCTCACAACCATCGCCAAACCAGGGTCAGAGTGGAACACTTCCCCAAAGGGCATTCAAACATTGGCGCCAACTTACTTGATCGCCGAGGCCAACCAGTGGCTTTACTTCATCAAGCGGTCGCTAATCCCCACAACGCATGACGCCTCAATTTCAAAGGATCGTGCCTTGGTCATTTACTGCATCATGCGAGGAATTCGGCTAGACGTGGGACGCATCATTGCCCCACAGATTCGGGGTTTGTTTTTCAAGCCAAGGGGTCAGCTATTCTTCCCCTTTCTTGTGACAAGACTGTGCGCCAACGCAGAGTTCATGGAGAATGCGCCAGTCAGAGCAGTAAATGGAGTCCTTTCAGCCCAAAGTCTCAGACAGATATTAAAGGACTCCCAGCATCTACTTGAGGTGGCCAACTCAAAGAAGAGGACGACCAAGTCACAAGAACCATCATCTCCCCAACCTTCTCAACCCAAGAAGAGGAAACTGGTCAAGAAGAATTTTGAGATACAGACGTCGAGTTCACAACCTAGTGAAGCCGAAGAAACTGTTCAGCTCGACGCCAACACACAAGCCTTGACCATTTATACTCCTCCCATTGACCCAATCACTGAGGAGCCCTCTTCACCACCAACTTCCCCTTCATCTTCCCCTAAAATCCAAAAATAG

Protein sequence

MSMPTEAEMNDALTTIAKPGSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISKDRALVIYCIMRGIRLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRLCANAEFMENAPVRAVNGVLSAQSLRQILKDSQHLLEVANSKKRTTKSQEPSSPQPSQPKKRKLVKKNFEIQTSSSQPSEAEETVQLDANTQALTIYTPPIDPITEEPSSPPTSPSSSPKIQK
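
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, and only the first twelve codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

# First 12 codons of the CDS listed above.
print(translate("ATGTCAATGCCTACAGAAGCCGAAATGAATGATGCC"))
```

Passing the full CDS string to the same function is a simple way to check it against the protein sequence shown on this page.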
BLAST of Cla003479 vs. TrEMBL
Match: W9RBS1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000844 PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 6.0e-13
Identity = 50/160 (31.25%), Positives = 80/160 (50.00%), Query Frame = 1

Query: 6   EAEMNDALTTIAKPGSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISKD 65
           E ++ + L TIA  G++W  S KG  T     L   A  W +F+   L+ +TH  +IS++
Sbjct: 154 EEQLKEVLKTIAILGAQWLLSAKGSYTCNRHELQPAAKVWYHFLASRLLLSTHGKTISRN 213

Query: 66  RALVIYCIMRGIRLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRLCANAEFMENAPVRAVN 125
           RA+++Y ++ G  ++VGR+I  QIR    K +G L+FP L++ LC  +     A    + 
Sbjct: 214 RAILLYAVLVGKPINVGRLIIDQIRACAEKGKGGLYFPSLISELCIQSHVAWEASEPRLR 273

Query: 126 GVLSAQSLRQILKDSQHLLEVANSKKRTTKSQEPSSPQPS 166
               A  L  I + S    E +   +   +  EPS P  S
Sbjct: 274 NT-GAMDLVAITRISSGRSEKSEKGEEEEEQDEPSRPSTS 312

BLAST of Cla003479 vs. TrEMBL
Match: W9QTD9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 6.2e-10
Identity = 40/135 (29.63%), Positives = 72/135 (53.33%), Query Frame = 1

Query: 5   TEAEMNDALTTIAKPGSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISK 64
           T+ ++   L  +A  G+ W  SP+G  T     L   A  W +F+    +P+TH  +++K
Sbjct: 81  TDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWYHFLTARFMPSTHGKTVAK 140

Query: 65  DRALVIYCIMRGIRLDVGRIIAPQIRGL-FFKPRGQLFFPFLVTRLC--ANAEFMENAPV 124
           DR L++Y I+ GI +++  I   +I+     + RG L+FP L+T+L   AN  + ++  +
Sbjct: 141 DRVLLLYSILTGISVNIEEITIKEIKACSAARKRGGLYFPSLITQLWLKANVPYHKDEAI 200

Query: 125 RAVNGVLSAQSLRQI 137
               G +S  S+ +I
Sbjct: 201 VHNAGAISTLSISRI 215

BLAST of Cla003479 vs. TrEMBL
Match: A0A0A0LAN0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G263250 PE=4 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 2.4e-09
Identity = 44/159 (27.67%), Positives = 83/159 (52.20%), Query Frame = 1

Query: 45  WLYFIKRSLIPTTHDASISKDRALVIYCIMRGIRLDVGRIIAPQIRGLFFKPRGQLFFPF 104
           WL  IK+ ++PT H ++IS +R +++YCIM+ I +++G+II+  I  L   PRG   F +
Sbjct: 29  WLVLIKKKIMPTRHVSTISMERVMLVYCIMKKILVNIGKIISNHIIALVKHPRGARPFSY 88

Query: 105 LVTRLCANAEFM-ENAP-VRAVNGVLSAQSLRQILKDSQHLLEVANSKKR---------T 164
           L+ +LC  A  M E  P V   +G+    +L +I+   ++  ++   K +          
Sbjct: 89  LIEQLCLRACLMLEKLPQVEVKDGIWLPSTLHRIIAIHKNKAKIKCLKTKEGCKVVKEID 148

Query: 165 TKSQEPSSPQPSQPKKRKLVKKNFEIQTSSSQPSEAEET 193
               E    + + P+KRK   K  ++ +  ++ S+ E++
Sbjct: 149 DDDVEEEDKKDNIPQKRKRQDKEDDLGSKKAKSSKIEDS 187

BLAST of Cla003479 vs. TrEMBL
Match: A0A061F2U9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_024087 PE=4 SV=1)

HSP 1 Score: 63.2 bits (152), Expect = 4.9e-07
Identity = 32/94 (34.04%), Positives = 53/94 (56.38%), Query Frame = 1

Query: 20  GSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISKDRALVIYCIMRGIRL 79
           G++W TS     +   + +  E   WL+F+   L+ +TH + ++KDRA++IY I+    +
Sbjct: 96  GAQWKTSHDEPVSFKRSVMKKELQVWLHFVAARLLSSTHISDVTKDRAVLIYAIVAHKSI 155

Query: 80  DVGRIIAPQIRGLFFKPRGQLFFPFLVTRLCANA 114
           DVG++I+  I       R  + FP L+T LCA A
Sbjct: 156 DVGKVISHAILHTGRTKRDGIGFPSLITALCARA 189

BLAST of Cla003479 vs. TrEMBL
Match: W9QMD0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000655 PE=4 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 6.4e-07
Identity = 51/203 (25.12%), Positives = 84/203 (41.38%), Query Frame = 1

Query: 6   EAEMNDALTTIAKPGSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISKD 65
           + +  + L  +    + W             YL      W +F+   LIP+TH + ++KD
Sbjct: 281 DRDPQEILEALCDGPTRWTIKQNTESAFEARYLTNYTKVWFHFVCTRLIPSTHISEVTKD 340

Query: 66  RALVIYCIMRGIRLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRL-----CANAEFMENAP 125
           RALV+  I RG  L+VG II   I     K    L +P L+T L      A  +     P
Sbjct: 341 RALVLLAIERGEPLNVGAIINSGIHHALRKHNISLPYPSLLTELFLAAGVALPDAHLEKP 400

Query: 126 VRAVNGVLSAQSLRQILKDSQHLLEVANSKKRTTKSQEPSSPQPSQPKKRKLVKKNFEIQ 185
           +RA               D   ++ +A+ +  + +     S QP QPK+++    + E  
Sbjct: 401 IRAF--------------DLNSIMRIASGRAASEQDGGAGSSQPPQPKRKRAATTSRE-- 460

Query: 186 TSSSQPSEAEETVQLDANTQALT 204
             +S+  + EE +    +  ALT
Sbjct: 461 DFASRVDQHEEQIADFQSQSALT 467

BLAST of Cla003479 vs. NCBI nr
Match: gi|703082781|ref|XP_010092041.1| (hypothetical protein L484_000844 [Morus notabilis])

HSP 1 Score: 82.8 bits (203), Expect = 8.6e-13
Identity = 50/160 (31.25%), Positives = 80/160 (50.00%), Query Frame = 1

Query: 6   EAEMNDALTTIAKPGSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISKD 65
           E ++ + L TIA  G++W  S KG  T     L   A  W +F+   L+ +TH  +IS++
Sbjct: 154 EEQLKEVLKTIAILGAQWLLSAKGSYTCNRHELQPAAKVWYHFLASRLLLSTHGKTISRN 213

Query: 66  RALVIYCIMRGIRLDVGRIIAPQIRGLFFKPRGQLFFPFLVTRLCANAEFMENAPVRAVN 125
           RA+++Y ++ G  ++VGR+I  QIR    K +G L+FP L++ LC  +     A    + 
Sbjct: 214 RAILLYAVLVGKPINVGRLIIDQIRACAEKGKGGLYFPSLISELCIQSHVAWEASEPRLR 273

Query: 126 GVLSAQSLRQILKDSQHLLEVANSKKRTTKSQEPSSPQPS 166
               A  L  I + S    E +   +   +  EPS P  S
Sbjct: 274 NT-GAMDLVAITRISSGRSEKSEKGEEEEEQDEPSRPSTS 312

BLAST of Cla003479 vs. NCBI nr
Match: gi|703087370|ref|XP_010093253.1| (hypothetical protein L484_022412 [Morus notabilis])

HSP 1 Score: 72.8 bits (177), Expect = 8.9e-10
Identity = 40/135 (29.63%), Positives = 72/135 (53.33%), Query Frame = 1

Query: 5   TEAEMNDALTTIAKPGSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISK 64
           T+ ++   L  +A  G+ W  SP+G  T     L   A  W +F+    +P+TH  +++K
Sbjct: 81  TDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWYHFLTARFMPSTHGKTVAK 140

Query: 65  DRALVIYCIMRGIRLDVGRIIAPQIRGL-FFKPRGQLFFPFLVTRLC--ANAEFMENAPV 124
           DR L++Y I+ GI +++  I   +I+     + RG L+FP L+T+L   AN  + ++  +
Sbjct: 141 DRVLLLYSILTGISVNIEEITIKEIKACSAARKRGGLYFPSLITQLWLKANVPYHKDEAI 200

Query: 125 RAVNGVLSAQSLRQI 137
               G +S  S+ +I
Sbjct: 201 VHNAGAISTLSISRI 215

BLAST of Cla003479 vs. NCBI nr
Match: gi|700202594|gb|KGN57727.1| (hypothetical protein Csa_3G263250 [Cucumis sativus])

HSP 1 Score: 70.9 bits (172), Expect = 3.4e-09
Identity = 44/159 (27.67%), Positives = 83/159 (52.20%), Query Frame = 1

Query: 45  WLYFIKRSLIPTTHDASISKDRALVIYCIMRGIRLDVGRIIAPQIRGLFFKPRGQLFFPF 104
           WL  IK+ ++PT H ++IS +R +++YCIM+ I +++G+II+  I  L   PRG   F +
Sbjct: 29  WLVLIKKKIMPTRHVSTISMERVMLVYCIMKKILVNIGKIISNHIIALVKHPRGARPFSY 88

Query: 105 LVTRLCANAEFM-ENAP-VRAVNGVLSAQSLRQILKDSQHLLEVANSKKR---------T 164
           L+ +LC  A  M E  P V   +G+    +L +I+   ++  ++   K +          
Sbjct: 89  LIEQLCLRACLMLEKLPQVEVKDGIWLPSTLHRIIAIHKNKAKIKCLKTKEGCKVVKEID 148

Query: 165 TKSQEPSSPQPSQPKKRKLVKKNFEIQTSSSQPSEAEET 193
               E    + + P+KRK   K  ++ +  ++ S+ E++
Sbjct: 149 DDDVEEEDKKDNIPQKRKRQDKEDDLGSKKAKSSKIEDS 187

BLAST of Cla003479 vs. NCBI nr
Match: gi|1021475585|ref|XP_016165875.1| (PREDICTED: involucrin-like [Arachis ipaensis])

HSP 1 Score: 66.2 bits (160), Expect = 8.3e-08
Identity = 46/170 (27.06%), Positives = 83/170 (48.82%), Query Frame = 1

Query: 38  LIAEANQWLYFIKRSLIPTTHDASISKDRALVIYCIMRGIRLDVGRIIAPQIRGLFFKPR 97
           L  +A  W   ++RSLIPT++++ ++ +RA++I+CIM+G  ++VG +IA  I  +  K +
Sbjct: 19  LTPQAKGWHDIVRRSLIPTSNNSEVTVNRAIMIHCIMKGGEINVGEVIANNIIDIAEKVK 78

Query: 98  --GQLFFPFLVTRLCANA-----EFMENAPVRAVNGVLSAQSLRQILKDSQHLLEVANSK 157
               L +P  + RLC  A     EF E   V ++   L+ + L  +         +A  K
Sbjct: 79  QDSWLGYPSTILRLCKEARVPLEEFKETDMV-SIGKPLTKERLEFVTTTQLERQPLARRK 138

Query: 158 KRTTKSQEPSSPQPSQPKKRKLVKKNFEIQTSSSQPSEAEETVQLDANTQ 201
           KR    QE    +  +P    + +    ++  S Q S+ + + +  A  Q
Sbjct: 139 KRKEARQEEERQEMEEPSTLNMNQLQAALEGISGQYSQIQRSQEEQAQQQ 187

BLAST of Cla003479 vs. NCBI nr
Match: gi|590634332|ref|XP_007028347.1| (Uncharacterized protein TCM_024087 [Theobroma cacao])

HSP 1 Score: 63.2 bits (152), Expect = 7.1e-07
Identity = 32/94 (34.04%), Positives = 53/94 (56.38%), Query Frame = 1

Query: 20  GSEWNTSPKGIQTLAPTYLIAEANQWLYFIKRSLIPTTHDASISKDRALVIYCIMRGIRL 79
           G++W TS     +   + +  E   WL+F+   L+ +TH + ++KDRA++IY I+    +
Sbjct: 96  GAQWKTSHDEPVSFKRSVMKKELQVWLHFVAARLLSSTHISDVTKDRAVLIYAIVAHKSI 155

Query: 80  DVGRIIAPQIRGLFFKPRGQLFFPFLVTRLCANA 114
           DVG++I+  I       R  + FP L+T LCA A
Sbjct: 156 DVGKVISHAILHTGRTKRDGIGFPSLITALCARA 189
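
The percentages on each HSP statistics line are the matched positions divided by the alignment length. The sketch below recomputes them from the raw fractions; `hsp_fractions` is a hypothetical helper, and it deliberately ignores the field labels and simply reads every `a/b` pair on the line.

```python
import re

def hsp_fractions(stats_line):
    """Return (matches, length, percent) for each 'a/b' fraction on the line."""
    results = []
    for num, den in re.findall(r"(\d+)/(\d+)", stats_line):
        n, d = int(num), int(den)
        results.append((n, d, round(100.0 * n / d, 2)))
    return results

# Statistics line of the first HSP (Morus notabilis, L484_000844).
line = "Identity = 50/160 (31.25%), Positives = 80/160 (50.00%), Query Frame = 1"
for n, d, pct in hsp_fractions(line):
    print(f"{n}/{d} = {pct:.2f}%")
```

The first fraction is the identity and the second the positives (identical plus conservatively substituted positions), so the second value is never smaller than the first.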

The following BLAST results are available for this feature:
TrEMBL:
Match Name        E-value  Identity (%)  Description
W9RBS1_9ROSA      6.0e-13  31.25         Uncharacterized protein OS=Morus notabilis GN=L484_000844 PE=4 SV=1
W9QTD9_9ROSA      6.2e-10  29.63         Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1
A0A0A0LAN0_CUCSA  2.4e-09  27.67         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G263250 PE=4 SV=1
A0A061F2U9_THECC  4.9e-07  34.04         Uncharacterized protein OS=Theobroma cacao GN=TCM_024087 PE=4 SV=1
W9QMD0_9ROSA      6.4e-07  25.12         Uncharacterized protein OS=Morus notabilis GN=L484_000655 PE=4 SV=1

NCBI nr:
Match Name                         E-value  Identity (%)  Description
gi|703082781|ref|XP_010092041.1|   8.6e-13  31.25         hypothetical protein L484_000844 [Morus notabilis]
gi|703087370|ref|XP_010093253.1|   8.9e-10  29.63         hypothetical protein L484_022412 [Morus notabilis]
gi|700202594|gb|KGN57727.1|        3.4e-09  27.67         hypothetical protein Csa_3G263250 [Cucumis sativus]
gi|1021475585|ref|XP_016165875.1|  8.3e-08  27.06         PREDICTED: involucrin-like [Arachis ipaensis]
gi|590634332|ref|XP_007028347.1|   7.1e-07  34.04         Uncharacterized protein TCM_024087 [Theobroma cacao]
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla003479     Cla003479.1  mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene       Organism                      Block
Cla003479  Watermelon (97103) v2         wmwmbB046
Cla003479  Wax gourd                     wgowmB249
Cla003479  Watermelon (Charleston Gray)  wcgwmB011