Cla000407 (gene) Watermelon (97103) v1

NameCla000407
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionUnknown Protein (AHRD V1)
LocationChr0 : 5396130 .. 5397251 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGTCACCTTTTTCCCAGAGGAAGCCCCATTGCCACCATACATCACAGACATCATTGATCGCCACGGCTGGCAGAACTTCTGTGCCGGCCCATCTTGGATTCAACCTGCGTTAGTCCGAAGTTTCTACAACGGACACATTGATGAAGAGGAAGATGTGGTGGTAGTTAAGGGAAGTGAAGTCCCATTCAACGCAAGGGAAATCAATGCCATATTTCAGTTGAGGGACGACTCAGACACAGAGGGCAACAGGCTCATTGAATCGAAAATGCCTGAATACATGCAAGACACTCTACGGGTCATGGCCAAGCGAAGGTCAAAATGGGACGTATCACCCACAGGCATCCGAACCTTGTCTGCCAGCAGCTTCGTGCCTGAGGCCAACTTGTGGGTATACTTTGTGAAGAAGTGGCTCATCCCGACAACGCATGACAAGACAGTATCAAGGGTTCGAATGATGGCAGCATATTGCATTATGCGAGGAATCCGTTTGGACGTCAGATGCATAATAGCAGCTCAGATTAGAGGCGTCTTCAGTAAGCCAAGGGGGCAACTATTTTTCCCTTGGCTCATTATGAGCTTATGTGCTAATGCTGCCATCGAAAGTGATAAGGAAATGTTAGACGCAGATCAGATGATGGAGGTGAATAGTGTTTTTGCCAACAAAACTCTGCGTCGGCTGATGAGAGGATCACAGCATCAACCAGAGCCCACCAAGGCCACCAAAGCATCCCATCAGCCAGCACCCCCAGCTGAATCACTTCCAAAGGCGTTAAAGAGGAAAGAAAAAGGGAAGTCATCCAAACTAAAATGCAAGACTGTGGCAAAGCCTGATGACACCATTAGAACTTCCCCTTCCTCAAGTCCTTCACCCCCACTCCCATTGACCGCCAGAGAGCCAACGCCTTCTCCACCTCCTCCAGAAATGCAACCTCAACCATCCACTTTCCCATCAATCCAAACTTCTCTTTCTTCCTTTTTCCAAAGAGTTGATGACTCACACTATGAGGATTTTAGTGCGACTACTATGCATCCATCATCCCCAGCGTTTGACGAAACTCCATCAACCTCTCGCCCCCAGCCAAACCGTCCTCACCCACAACCACAGCCTCAACCTTAA

mRNA sequence

ATGGAGGTCACCTTTTTCCCAGAGGAAGCCCCATTGCCACCATACATCACAGACATCATTGATCGCCACGGCTGGCAGAACTTCTGTGCCGGCCCATCTTGGATTCAACCTGCGTTAGTCCGAAGTTTCTACAACGGACACATTGATGAAGAGGAAGATGTGGTGGTAGTTAAGGGAAGTGAAGTCCCATTCAACGCAAGGGAAATCAATGCCATATTTCAGTTGAGGGACGACTCAGACACAGAGGGCAACAGGCTCATTGAATCGAAAATGCCTGAATACATGCAAGACACTCTACGGGTCATGGCCAAGCGAAGGTCAAAATGGGACGTATCACCCACAGGCATCCGAACCTTGTCTGCCAGCAGCTTCGTGCCTGAGGCCAACTTGTGGGTATACTTTGTGAAGAAGTGGCTCATCCCGACAACGCATGACAAGACAGTATCAAGGGTTCGAATGATGGCAGCATATTGCATTATGCGAGGAATCCGTTTGGACGTCAGATGCATAATAGCAGCTCAGATTAGAGGCGTCTTCAGTAAGCCAAGGGGGCAACTATTTTTCCCTTGGCTCATTATGAGCTTATGTGCTAATGCTGCCATCGAAAGTGATAAGGAAATGTTAGACGCAGATCAGATGATGGAGGTGAATAGTGTTTTTGCCAACAAAACTCTGCGTCGGCTGATGAGAGGATCACAGCATCAACCAGAGCCCACCAAGGCCACCAAAGCATCCCATCAGCCAGCACCCCCAGCTGAATCACTTCCAAAGGCGTTAAAGAGGAAAGAAAAAGGGAAGTCATCCAAACTAAAATGCAAGACTGTGGCAAAGCCTGATGACACCATTAGAACTTCCCCTTCCTCAAGTCCTTCACCCCCACTCCCATTGACCGCCAGAGAGCCAACGCCTTCTCCACCTCCTCCAGAAATGCAACCTCAACCATCCACTTTCCCATCAATCCAAACTTCTCTTTCTTCCTTTTTCCAAAGAGTTGATGACTCACACTATGAGGATTTTAGTGCGACTACTATGCATCCATCATCCCCAGCGTTTGACGAAACTCCATCAACCTCTCGCCCCCAGCCAAACCGTCCTCACCCACAACCACAGCCTCAACCTTAA

Coding sequence (CDS)

ATGGAGGTCACCTTTTTCCCAGAGGAAGCCCCATTGCCACCATACATCACAGACATCATTGATCGCCACGGCTGGCAGAACTTCTGTGCCGGCCCATCTTGGATTCAACCTGCGTTAGTCCGAAGTTTCTACAACGGACACATTGATGAAGAGGAAGATGTGGTGGTAGTTAAGGGAAGTGAAGTCCCATTCAACGCAAGGGAAATCAATGCCATATTTCAGTTGAGGGACGACTCAGACACAGAGGGCAACAGGCTCATTGAATCGAAAATGCCTGAATACATGCAAGACACTCTACGGGTCATGGCCAAGCGAAGGTCAAAATGGGACGTATCACCCACAGGCATCCGAACCTTGTCTGCCAGCAGCTTCGTGCCTGAGGCCAACTTGTGGGTATACTTTGTGAAGAAGTGGCTCATCCCGACAACGCATGACAAGACAGTATCAAGGGTTCGAATGATGGCAGCATATTGCATTATGCGAGGAATCCGTTTGGACGTCAGATGCATAATAGCAGCTCAGATTAGAGGCGTCTTCAGTAAGCCAAGGGGGCAACTATTTTTCCCTTGGCTCATTATGAGCTTATGTGCTAATGCTGCCATCGAAAGTGATAAGGAAATGTTAGACGCAGATCAGATGATGGAGGTGAATAGTGTTTTTGCCAACAAAACTCTGCGTCGGCTGATGAGAGGATCACAGCATCAACCAGAGCCCACCAAGGCCACCAAAGCATCCCATCAGCCAGCACCCCCAGCTGAATCACTTCCAAAGGCGTTAAAGAGGAAAGAAAAAGGGAAGTCATCCAAACTAAAATGCAAGACTGTGGCAAAGCCTGATGACACCATTAGAACTTCCCCTTCCTCAAGTCCTTCACCCCCACTCCCATTGACCGCCAGAGAGCCAACGCCTTCTCCACCTCCTCCAGAAATGCAACCTCAACCATCCACTTTCCCATCAATCCAAACTTCTCTTTCTTCCTTTTTCCAAAGAGTTGATGACTCACACTATGAGGATTTTAGTGCGACTACTATGCATCCATCATCCCCAGCGTTTGACGAAACTCCATCAACCTCTCGCCCCCAGCCAAACCGTCCTCACCCACAACCACAGCCTCAACCTTAA

Protein sequence

MEVTFFPEEAPLPPYITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAIFQLRDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWVYFVKKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGVFSKPRGQLFFPWLIMSLCANAAIESDKEMLDADQMMEVNSVFANKTLRRLMRGSQHQPEPTKATKASHQPAPPAESLPKALKRKEKGKSSKLKCKTVAKPDDTIRTSPSSSPSPPLPLTAREPTPSPPPPEMQPQPSTFPSIQTSLSSFFQRVDDSHYEDFSATTMHPSSPAFDETPSTSRPQPNRPHPQPQPQP
BLAST of Cla000407 vs. TrEMBL
Match: W9QTD9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 2.5e-21
Identity = 65/195 (33.33%), Postives = 101/195 (51.79%), Query Frame = 1

Query: 13  PPYITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAI 72
           P +IT +I +HGW+ FC  PS     LVR FY   +D  ++ V V+  +VPF AR IN+I
Sbjct: 4   PAFITRVIHQHGWRQFCQHPSNPIVPLVREFYANLLDFNQETVFVQNVKVPFTARAINSI 63

Query: 73  FQLRDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWV 132
           F L +  D E          E ++  L  +A   + W +SP G  T         A +W 
Sbjct: 64  FGLEEVVD-EYVDFASEVTDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWY 123

Query: 133 YFVKKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGV-FSKPRGQLFFPWL 192
           +F+    +P+TH KTV++ R++  Y I+ GI +++  I   +I+    ++ RG L+FP L
Sbjct: 124 HFLTARFMPSTHGKTVAKDRVLLLYSILTGISVNIEEITIKEIKACSAARKRGGLYFPSL 183

Query: 193 IMSLCANAAIESDKE 207
           I  L   A +   K+
Sbjct: 184 ITQLWLKANVPYHKD 197

BLAST of Cla000407 vs. TrEMBL
Match: A0A0L0M384_9BURK (Uncharacterized protein OS=Candidatus Burkholderia verschuerenii GN=BVER_02375c PE=4 SV=1)

HSP 1 Score: 60.8 bits (146), Expect = 3.9e-06
Identity = 71/272 (26.10%), Postives = 113/272 (41.54%), Query Frame = 1

Query: 9   EAPLPPYITDIIDRHGWQNFCAGPSWIQP---ALVRSFYNGHIDEEEDVVVVKGSEVPFN 68
           E PL  YI    +R  W  F  G   ++P   +L+   Y     E+++ V + G  V F+
Sbjct: 18  EPPLQAYI----ERRRWTAF-VGHDRLRPGDRSLIYEVYARLHVEKQEEVTIDGHTVDFS 77

Query: 69  AREINAIFQLRDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFV 128
           AR INA F L D       R ++ +    +   L    +  + W       R +  S   
Sbjct: 78  ARAINAFFDLPDLGIHYDQRPVDER---DIYRELTGQEQNHADW-------RHIRKSELR 137

Query: 129 PEANLWVYFVKKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGV--FSKPR 188
            +A L  +FV+  L+PT+ +  V ++R    Y I   + +DV  +I  Q+      SK  
Sbjct: 138 AKARLINHFVRCRLMPTSSNPEVLKMRAHLIYAISTDVAVDVGRVIRDQMVAATKTSKSN 197

Query: 189 GQLFFPWLIMSLCANAAIESDKEMLDADQMMEVNSVFANKTLRRLMRGSQHQPEPTKATK 248
             L FP LI ++C  A I    +     Q+M V    A K   R     QH+  P  +  
Sbjct: 198 LSLGFPNLITAMCIKAGIRPRGDPTTFIQLMSVLRFEAIKD--RQPDPLQHE-SPPPSPP 257

Query: 249 ASHQPAPPAESLPKALKRKEKGKSSKLKCKTV 276
             HQ  PP    P     +++  S + + ++V
Sbjct: 258 HQHQTLPPPSPPPSPPHEQQQQDSEQPEPESV 271

BLAST of Cla000407 vs. NCBI nr
Match: gi|703087370|ref|XP_010093253.1| (hypothetical protein L484_022412 [Morus notabilis])

HSP 1 Score: 111.3 bits (277), Expect = 3.7e-21
Identity = 65/195 (33.33%), Postives = 101/195 (51.79%), Query Frame = 1

Query: 13  PPYITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAI 72
           P +IT +I +HGW+ FC  PS     LVR FY   +D  ++ V V+  +VPF AR IN+I
Sbjct: 4   PAFITRVIHQHGWRQFCQHPSNPIVPLVREFYANLLDFNQETVFVQNVKVPFTARAINSI 63

Query: 73  FQLRDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWV 132
           F L +  D E          E ++  L  +A   + W +SP G  T         A +W 
Sbjct: 64  FGLEEVVD-EYVDFASEVTDEQLEVVLAEVAIEGATWQISPQGAYTCIRKELKRHAQIWY 123

Query: 133 YFVKKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGV-FSKPRGQLFFPWL 192
           +F+    +P+TH KTV++ R++  Y I+ GI +++  I   +I+    ++ RG L+FP L
Sbjct: 124 HFLTARFMPSTHGKTVAKDRVLLLYSILTGISVNIEEITIKEIKACSAARKRGGLYFPSL 183

Query: 193 IMSLCANAAIESDKE 207
           I  L   A +   K+
Sbjct: 184 ITQLWLKANVPYHKD 197

BLAST of Cla000407 vs. NCBI nr
Match: gi|703082781|ref|XP_010092041.1| (hypothetical protein L484_000844 [Morus notabilis])

HSP 1 Score: 104.4 bits (259), Expect = 4.5e-19
Identity = 68/244 (27.87%), Postives = 121/244 (49.59%), Query Frame = 1

Query: 13  PPYITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAI 72
           P +I+D+I   GWQ FC  P      LV+ FY    ++ ++ V V   ++ F +  IN +
Sbjct: 76  PGFISDVIISRGWQIFCRHPIDPIVPLVKEFYANLQNQGQNTVFVWEIDITFTSNYINGV 135

Query: 73  FQLRDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWV 132
             + +  D E   LI   + E +++ L+ +A   ++W +S  G  T +     P A +W 
Sbjct: 136 LGIPNQDD-EFVELITDAIEEQLKEVLKTIAILGAQWLLSAKGSYTCNRHELQPAAKVWY 195

Query: 133 YFVKKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGVFSKPRGQLFFPWLI 192
           +F+   L+ +TH KT+SR R +  Y ++ G  ++V  +I  QIR    K +G L+FP LI
Sbjct: 196 HFLASRLLLSTHGKTISRNRAILLYAVLVGKPINVGRLIIDQIRACAEKGKGGLYFPSLI 255

Query: 193 MSLC--ANAAIESDKEMLDADQMMEVNSVFANKTLRRLMRGSQHQPEPTKATKASHQPAP 252
             LC  ++ A E+ +  L     M++ ++       R+  G   + E  +  +   +P+ 
Sbjct: 256 SELCIQSHVAWEASEPRLRNTGAMDLVAI------TRISSGRSEKSEKGEEEEEQDEPSR 312

Query: 253 PAES 255
           P+ S
Sbjct: 316 PSTS 312

BLAST of Cla000407 vs. NCBI nr
Match: gi|848854240|ref|XP_012847380.1| (PREDICTED: uncharacterized protein LOC105967324 [Erythranthe guttata])

HSP 1 Score: 78.2 bits (191), Expect = 3.4e-11
Identity = 82/309 (26.54%), Postives = 120/309 (38.83%), Query Frame = 1

Query: 16  ITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAIFQL 75
           I +  DR  W+     P      LVR FY         VVV++  EV ++A  IN  F L
Sbjct: 128 IIENFDRRQWKYLAKRPEDYNENLVRLFYANAGFHPGSVVVLRDKEVRYDAATINKFFHL 187

Query: 76  RDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWVYFV 135
            +       +L   KM    ++ ++ +  R SKW+      R    S    EA +W+YFV
Sbjct: 188 PEVPTEPFQQL---KMATTYEELIQTLCPRGSKWEKGEHWFRRGELSR---EAKVWMYFV 247

Query: 136 KKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGVFSKPRGQLFFPWLIMSL 195
              L+PT         R+   Y IM+   ++V  II A+IR         L FP LI +L
Sbjct: 248 SSRLMPTKSTAKTYAPRLTLMYAIMKRTPVNVGAIIHAEIRRCLGDTSLGLMFPSLITTL 307

Query: 196 CANAAI-----------------------------ESDKEMLDADQMMEVNSVFANKTLR 255
           C  A +                             E D ++ DAD     + V A +  +
Sbjct: 308 CDRAGVPTYADDRLTRSPQPIFPYHKLQPASGPLGEGDSDLPDADP----SGVHAARPQQ 367

Query: 256 RLMRGSQHQPEPTKATKASHQPAPPAESLPKALKRKEKGKSSKLKCKTVAKPDDTIRTSP 296
           R  RG Q + EP +  +   Q     E      K+  K  ++ L+C+T    DD +    
Sbjct: 368 RARRGRQ-EAEP-QLPEYVGQWMDVMEENVGWFKKMMKAMAAALQCRT----DDIV---- 416

BLAST of Cla000407 vs. NCBI nr
Match: gi|848851984|ref|XP_012838115.1| (PREDICTED: uncharacterized protein LOC105958644 [Erythranthe guttata])

HSP 1 Score: 77.0 bits (188), Expect = 7.6e-11
Identity = 57/186 (30.65%), Postives = 80/186 (43.01%), Query Frame = 1

Query: 16  ITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAIFQL 75
           I +  DR  W+     P      LVR FY         VVV++  EV ++A  IN  F L
Sbjct: 128 IFENFDRRQWKYLAKRPEDYNENLVRLFYANAGFHPGSVVVLRNQEVRYDAATINKFFHL 187

Query: 76  RDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWVYFV 135
            + S     +L   KM    ++ ++ +  R SKW       R    S    EA +W+YFV
Sbjct: 188 PEVSTEPFQQL---KMATTYEELIQTLCPRGSKWQKGEHWFRRGELSR---EAKVWMYFV 247

Query: 136 KKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGVFSKPRGQLFFPWLIMSL 195
              L+PT         R+   Y IM+   ++V  II A+IR         L FP LI +L
Sbjct: 248 SSRLMPTKSTAKTYAPRLTLMYAIMKRTPVNVGAIIHAEIRRCLGDTSLGLMFPSLITTL 307

Query: 196 CANAAI 202
           C  A +
Sbjct: 308 CDRAGV 307

BLAST of Cla000407 vs. NCBI nr
Match: gi|848932492|ref|XP_012829106.1| (PREDICTED: uncharacterized protein LOC105950298 [Erythranthe guttata])

HSP 1 Score: 76.3 bits (186), Expect = 1.3e-10
Identity = 57/186 (30.65%), Postives = 80/186 (43.01%), Query Frame = 1

Query: 16  ITDIIDRHGWQNFCAGPSWIQPALVRSFYNGHIDEEEDVVVVKGSEVPFNAREINAIFQL 75
           I +  DR  W+     P      LVR FY         VVV++  EV ++A  IN  F L
Sbjct: 128 IIENFDRRQWKYLAKRPEDYNENLVRLFYANAGFHPGSVVVLRDIEVRYDAATINKFFHL 187

Query: 76  RDDSDTEGNRLIESKMPEYMQDTLRVMAKRRSKWDVSPTGIRTLSASSFVPEANLWVYFV 135
            + S     +L   KM    ++ ++ +  R SKW       R    S    EA +W+YFV
Sbjct: 188 PEVSTEPFQQL---KMATTYEELIQTLCPRGSKWQKGEHWFRRGELSR---EAKVWMYFV 247

Query: 136 KKWLIPTTHDKTVSRVRMMAAYCIMRGIRLDVRCIIAAQIRGVFSKPRGQLFFPWLIMSL 195
              L+PT         R+   Y IM+   ++V  II A+IR         L FP LI +L
Sbjct: 248 SSRLMPTKSTAKTYAPRLTLMYAIMKRTPVNVGAIIHAEIRRCLGDTSLGLMFPSLITTL 307

Query: 196 CANAAI 202
           C  A +
Sbjct: 308 CDRAGV 307

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
W9QTD9_9ROSA2.5e-2133.33Uncharacterized protein OS=Morus notabilis GN=L484_022412 PE=4 SV=1[more]
A0A0L0M384_9BURK3.9e-0626.10Uncharacterized protein OS=Candidatus Burkholderia verschuerenii GN=BVER_02375c ... [more]
Match NameE-valueIdentityDescription
gi|703087370|ref|XP_010093253.1|3.7e-2133.33hypothetical protein L484_022412 [Morus notabilis][more]
gi|703082781|ref|XP_010092041.1|4.5e-1927.87hypothetical protein L484_000844 [Morus notabilis][more]
gi|848854240|ref|XP_012847380.1|3.4e-1126.54PREDICTED: uncharacterized protein LOC105967324 [Erythranthe guttata][more]
gi|848851984|ref|XP_012838115.1|7.6e-1130.65PREDICTED: uncharacterized protein LOC105958644 [Erythranthe guttata][more]
gi|848932492|ref|XP_012829106.1|1.3e-1030.65PREDICTED: uncharacterized protein LOC105950298 [Erythranthe guttata][more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla000407Cla000407.1mRNA


The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla000407Watermelon (97103) v1wmwmB000
Cla000407Watermelon (97103) v1wmwmB001