Cla018856 (gene) Watermelon (97103) v1

Name: Cla018856
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: PHD and RING finger domain-containing protein 1 (AHRD V1 *--- PHRF1_HUMAN); contains Interpro domain(s) IPR019787 Zinc finger, PHD-finger
Location: Chr6 : 22886118 .. 22887209 (-)
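
The location string can be turned into a feature length with a few lines of Python. This is only a sketch: the helper name `parse_location` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr6 : 22886118 .. 22887209 (-)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr6 : 22886118 .. 22887209 (-)")

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
length = end - start + 1
print(chrom, strand, length)  # Chr6 - 1092
```

Under that assumption the feature spans 1092 bp, a multiple of three, which is what a single uninterrupted open reading frame would require.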
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCAATCATAGAGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCAAGGGCCGTTCCCCAGATATCAACTGTTAGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAACTAAGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATATCTTTGATTAGTTGCGATGGTCATTTGGGAGAAGACAAAGAGCAAGCTGCAGCTTCTCAACATAACCACAAAAGTGAAATTGTTGGAAATTGTGTCCTACCTTTTCCTGTTTACGATGGAAAAACTCAAGTTTCAGAATTAGAATCAGTCAATGGTTGTACCATTGGGGAAGGGCATGGTTCAGACGAAACACCTAATAATATCCTGCAAAAAAGTTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATGGAACTTGTTTCAACTTCTGTGAAAGTAGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCAAGTTATGGAGGATACGGTTGAGGATATTTCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCAATGGGCTTTTGTCTTCTATGGCTCATGCTCCTGAGGAAGAAAGTGATTTTAGAAACGACAATAACTGTTTTCGATTATGCAAAACTTGTGGCTCTTCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGTGAAGATGCATTTCATGTGTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAATTCATGTCTGAAGAAGAAGCATAAAATTTTGAAGGAAACGATCTCAAAGAAATTGGCAAACACCTTGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCGTATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGAGTGGTCTGGCCCGATTTCAGAGTATGTTATTTTCTTCAATTATATATATTCTTTCTTCATGCAAACATAA

mRNA sequence

ATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCAATCATAGAGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCAAGGGCCGTTCCCCAGATATCAACTGTTAGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAACTAAGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATATCTTTGATTAGTTGCGATGGTCATTTGGGAGAAGACAAAGAGCAAGCTGCAGCTTCTCAACATAACCACAAAAGTGAAATTGTTGGAAATTGTGTCCTACCTTTTCCTGTTTACGATGGAAAAACTCAAGTTTCAGAATTAGAATCAGTCAATGGTTGTACCATTGGGGAAGGGCATGGTTCAGACGAAACACCTAATAATATCCTGCAAAAAAGTTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATGGAACTTGTTTCAACTTCTGTGAAAGTAGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCAAGTTATGGAGGATACGGTTGAGGATATTTCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCAATGGGCTTTTGTCTTCTATGGCTCATGCTCCTGAGGAAGAAAGTGATTTTAGAAACGACAATAACTGTTTTCGATTATGCAAAACTTGTGGCTCTTCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGTGAAGATGCATTTCATGTGTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAATTCATGTCTGAAGAAGAAGCATAAAATTTTGAAGGAAACGATCTCAAAGAAATTGGCAAACACCTTGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCGTATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGAGTGGTCTGGCCCGATTTCAGAGTATGTTATTTTCTTCAATTATATATATTCTTTCTTCATGCAAACATAA

Coding sequence (CDS)

ATGTGCCCCCATTGTGATGAATTCTCTCATGATGGCTGCAGAAAAGCTGGACCAATCATAGAGGAAAAGAAGAACAATGGTGGATTGCGTTGCTTAAATTTTCCAAGGGCCGTTCCCCAGATATCAACTGTTAGTACGATGCCTGAAGGTTCAAAATCTAATGTAGTATATAGGAGAAAGAAACTAAGAGGCAATTCTGATTCCAGGTTGTTGGCTAATGGGACAGATTGTATATCTTTGATTAGTTGCGATGGTCATTTGGGAGAAGACAAAGAGCAAGCTGCAGCTTCTCAACATAACCACAAAAGTGAAATTGTTGGAAATTGTGTCCTACCTTTTCCTGTTTACGATGGAAAAACTCAAGTTTCAGAATTAGAATCAGTCAATGGTTGTACCATTGGGGAAGGGCATGGTTCAGACGAAACACCTAATAATATCCTGCAAAAAAGTTTGGAGGTTGACAGCATTAATGATAGCTGCTCCTCATCTAAGTCAAACATGGAACTTGTTTCAACTTCTGTGAAAGTAGAAGTGGATGACACAGGTGAGTGCTCCTCTTCTAGTATTCAAGTTATGGAGGATACGGTTGAGGATATTTCAGGAAGAGATCTATGCATCTCTATCCTTAGAAGCAATGGGCTTTTGTCTTCTATGGCTCATGCTCCTGAGGAAGAAAGTGATTTTAGAAACGACAATAACTGTTTTCGATTATGCAAAACTTGTGGCTCTTCAGAATCAGTCTTGAAGATGTTAATCTGTGATCATTGTGAAGATGCATTTCATGTGTCATGTTGCAATCATCGCATGAAGAAAGTGTCAAATGATGAGTGGTATTGCAATTCATGTCTGAAGAAGAAGCATAAAATTTTGAAGGAAACGATCTCAAAGAAATTGGCAAACACCTTGAGTAGAAATGGATCTTCTAAGGGTGAATCAAATTCCATAGCATTAATGTTAAAGGACACAGAACCGTATACAACTGGTGTTCGGATTGGCAAAGGTTTTCAAGCAGAAGTTCCAGAGTGGTCTGGCCCGATTTCAGAGTATGTTATTTTCTTCAATTATATATATTCTTTCTTCATGCAAACATAA

Protein sequence

MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRKKLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKTQVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLANTLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISEYVIFFNYIYSFFMQT
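
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGTGCCCCCATTGTGATGAATTC"))  # MCPHCDEF
# Last five codons of the CDS, including the TAA stop.
print(translate("TTCATGCAAACATAA"))           # FMQT
```

Both fragments agree with the start (MCPHCDEF) and end (FMQT) of the protein sequence listed above.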
BLAST of Cla018856 vs. Swiss-Prot
Match: LID2_SCHPO (Lid2 complex component lid2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=lid2 PE=1 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 3.2e-06
Identity = 30/85 (35.29%), Positives = 43/85 (50.59%), Query Frame = 1

Query: 238 CKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSC------------LKK 297
           C+ CG  ++   +L+CD CE A+H SC +  +  +  ++WYC++C             K 
Sbjct: 271 CEYCGLDKNPETILLCDGCEAAYHTSCLDPPLTSIPKEDWYCDACKFNISDYDPRKGFKW 330

Query: 298 KHKILKETISKKLANTL-SRNGSSK 310
           K   LKE  S ++ NTL  RN SSK
Sbjct: 331 KLSSLKER-SAEIFNTLGERNSSSK 354
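
The identity and positives percentages in an HSP header are the match counts divided by the alignment length. A small sketch (the helper name `percent` is hypothetical) reproduces the figures reported for the hit above:

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)

# Values from the LID2_SCHPO HSP: 30 identical and 43 positive positions out of 85.
print(percent(30, 85))  # 35.29
print(percent(43, 85))  # 50.59
```

The alignment length (85 here) counts gap columns as well, which is why it can exceed the number of query residues covered.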

BLAST of Cla018856 vs. TrEMBL
Match: A0A0A0LU88_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G425890 PE=4 SV=1)

HSP 1 Score: 652.5 bits (1682), Expect = 3.0e-184
Identity = 320/363 (88.15%), Positives = 332/363 (91.46%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           MCPHCDEFSHDGCRKAG I EEKKN+GGLRCLNFPR  P   TV  MPEGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGRI-EEKKNSGGLRCLNFPRTFP---TVIMMPEGSKSNVVYRRK 60

Query: 61  KLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKT 120
           KLRG+SDSR LANGTDCISLISCDG+L EDKEQAAASQHNH+ EIVGN V PFPV DGKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGNLAEDKEQAAASQHNHEREIVGNAVPPFPVCDGKT 120

Query: 121 QVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180
           QVSELES NGC  GEGHGSDETPNN LQKSLEVDSINDSCSSSKSNMELVS S+KVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSASLKVEVDD 180

Query: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKT 240
           TGECSSSSIQVM D +EDISGRDLCISILRSNGLLSS  HAPEEESDFR+DNNCFRLCKT
Sbjct: 181 TGECSSSSIQVMGDAIEDISGRDLCISILRSNGLLSSTTHAPEEESDFRSDNNCFRLCKT 240

Query: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300
           CGSSESVLKMLICDHCEDAFHVSCCNHRMK+VSNDEW CNSCLKK HKILKE ISKKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKRVSNDEWCCNSCLKKNHKILKEAISKKLTN 300

Query: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISEYVIFFNYIYSFF 360
           T SRNGSSKGESNSIALMLKDT+PYTT +RIGKGFQAEVP+WSGPISEYVIFFNY+Y FF
Sbjct: 301 TSSRNGSSKGESNSIALMLKDTKPYTTCIRIGKGFQAEVPDWSGPISEYVIFFNYMYIFF 359

Query: 361 MQT 364
           MQT
Sbjct: 361 MQT 359

BLAST of Cla018856 vs. TrEMBL
Match: F6I5N1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0103g00660 PE=4 SV=1)

HSP 1 Score: 307.8 bits (787), Expect = 1.8e-80
Identity = 172/367 (46.87%), Positives = 224/367 (61.04%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           M   CD+    GCRKA PI E +KN+     LNFP++  Q+ST+S M EGS  N VY+R+
Sbjct: 9   MYSKCDKHPQGGCRKAVPITEGRKNDDDSCSLNFPQSCSQLSTISVMSEGSVPNFVYQRR 68

Query: 61  KLRGNSDSRLLANGT-------DCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPF 120
           KLR    +   A  +        C+S +S +      K++    Q   ++E V + V+  
Sbjct: 69  KLRRKHATIFSAQASADTKASAGCLSAVSSEAPSVAAKDENGVPQVGLETETVRDLVI-L 128

Query: 121 PVYDGKTQVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTS 180
           PV   +    + +S++ C++ E HGSD+ P N + K  E  S NDSCSSSKSNMEL S S
Sbjct: 129 PVECNREP--KAQSIDRCSVREEHGSDDAPKNSMSKVNEFYSANDSCSSSKSNMELGSAS 188

Query: 181 VKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSS---MAHAPEEESDFRN 240
           +K +VDDT ECSSSS+  M    EDIS +DLCIS+LRS GLL         P +   + +
Sbjct: 189 MKNDVDDTAECSSSSVLGMATMGEDISEKDLCISVLRSEGLLRGCLPSRSCPTDGVSYGS 248

Query: 241 DNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKIL 300
           D+   R CK CG SE  LK+LICDHCE+AFH+ CCN  +KK+  DEW+C+SCLKK  K+L
Sbjct: 249 DSRGSRTCKVCGKSEISLKILICDHCEEAFHMFCCNPSIKKIPVDEWFCHSCLKKTRKML 308

Query: 301 KETISKKLAN-----TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGP 353
           KET  ++  N     + SRN + KGE   IA MLKDT PYTTGVRIGKGFQAEV +WS P
Sbjct: 309 KETTIRRSLNINGETSRSRNTTCKGELGPIAFMLKDTGPYTTGVRIGKGFQAEVADWSSP 368

BLAST of Cla018856 vs. TrEMBL
Match: B9SP83_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0594380 PE=4 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 7.1e-77
Identity = 170/379 (44.85%), Positives = 230/379 (60.69%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQIS-TVSTMPEGSKSNVVYRR 60
           MCP   E+SH G R+   I E+K  N    C +      Q+S TV T P+ S    VY R
Sbjct: 1   MCPKYHEYSHIGSREGASITEDK--NEVYSCPSSIVTSLQLSRTVCTTPQSSVPMFVYSR 60

Query: 61  KKLRGNSDSRLL----------ANGTDCISLIS-CDGHLGEDKEQAAASQHNHKSEIVGN 120
           +KL+GN+ +  +           +G DC+S++S C   L   KEQ   SQ     E    
Sbjct: 61  RKLQGNASTSAVFSAQDPASTKRSGEDCVSVVSYCAPSL---KEQHVVSQAELAIEDPN- 120

Query: 121 CVLPFPVYDGKTQVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNME 180
             +P+    G++ V +LES+NGC++ E   SD+   +  QK +EVDSINDSCSSSKS+ME
Sbjct: 121 --MPYIAGKGESCVMKLESLNGCSLVEERVSDQASKSTEQKIIEVDSINDSCSSSKSDME 180

Query: 181 LVSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLS----SMAHAPEE 240
           LVS S+  + +DT ECSSSS   +E   E++S +DLC S++RS G+      S  H   E
Sbjct: 181 LVSASMHTQAEDTSECSSSSAMFVEALGEELSEKDLCTSVVRSKGVFGRVWPSRTHGSAE 240

Query: 241 ESDFRNDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLK 300
                + ++  R CK C   ES LKMLICD+CE++FH+SCCN R+K++  DEW+C+SC K
Sbjct: 241 GVGDSSASSSSRFCKLCAHLESPLKMLICDNCEESFHLSCCNPRIKRIPQDEWFCHSCAK 300

Query: 301 KKHKILKETISKKLANTLSRNGSS----KGESNSIALMLKDTEPYTTGVRIGKGFQAEVP 360
           K+ KIL ET+S + +N +   G S      ESN IALML+DT PYTTGVRIGKGFQAEV 
Sbjct: 301 KRRKILTETVSTRFSNMIGEKGRSGNSYTDESNPIALMLRDTPPYTTGVRIGKGFQAEVS 360

BLAST of Cla018856 vs. TrEMBL
Match: A0A061FTE8_THECC (RING/FYVE/PHD zinc finger superfamily protein, putative isoform 6 (Fragment) OS=Theobroma cacao GN=TCM_045436 PE=4 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 1.4e-72
Identity = 163/374 (43.58%), Positives = 216/374 (57.75%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVY-RR 60
           M P CD  S   C++A              C   PR   Q S+VS M EG    +VY RR
Sbjct: 39  MSPKCDVPSQSECKEAE-----------YSCPPLPRTGSQQSSVSVMSEGPVPTLVYSRR 98

Query: 61  KKLRGNSDSRLLA-------------NGTDCISLISCDGHLGEDKEQAAASQHNHKSEIV 120
           KK RG+S S   A                DC+S++S D       EQ   SQ  H +   
Sbjct: 99  KKRRGSSSSASAAVANFCAEAPVNSKRSGDCLSVVSSDALSVAVMEQNGVSQVGHGNVAT 158

Query: 121 GNCVLPFPVYDGKTQVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSN 180
           G+ + P      +  +S+ E  NG +  + HGSD+    + QK+++VDSINDSCSSSKSN
Sbjct: 159 GDLLTPLAC-SREPHISKYEFANGFSGVDNHGSDDVRKTVRQKTIDVDSINDSCSSSKSN 218

Query: 181 MELVSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSM--AHAP-E 240
           MEL   S+K E+D+ GEC SSS+   E   ED+S +D C SILR+ G +  +  + AP  
Sbjct: 219 MELALASIKGEMDENGECCSSSVIAAEVVREDLSEKDRCFSILRNQGNVEEVGPSRAPLN 278

Query: 241 EESDFRNDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCL 300
           EE      ++C R+CK CG SE+  KMLICD+CE+AFH+ CCN R+KKV  DEWYC SC+
Sbjct: 279 EEIGTSGASSCSRVCKICGRSETAQKMLICDNCEEAFHLRCCNPRIKKVPVDEWYCFSCM 338

Query: 301 KKKHKILKETISKKLANTLS-----RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAE 353
           KKK  ++K+T ++  ++        R  SS+GES+ I LML+D EPY T VRIGKGFQA+
Sbjct: 339 KKKRIMVKDTTARNSSSITGCMGRCRGVSSEGESSPIELMLRDAEPYRTSVRIGKGFQAD 398

BLAST of Cla018856 vs. TrEMBL
Match: A0A061FRS6_THECC (RING/FYVE/PHD zinc finger superfamily protein, putative isoform 4 OS=Theobroma cacao GN=TCM_045436 PE=4 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 4.0e-72
Identity = 162/370 (43.78%), Positives = 214/370 (57.84%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVY-RR 60
           M P CD  S   C++A              C   PR   Q S+VS M EG    +VY RR
Sbjct: 39  MSPKCDVPSQSECKEAE-----------YSCPPLPRTGSQQSSVSVMSEGPVPTLVYSRR 98

Query: 61  KKLRGNSDSRLLA-------------NGTDCISLISCDGHLGEDKEQAAASQHNHKSEIV 120
           KK RG+S S   A                DC+S++S D       EQ   SQ  H +   
Sbjct: 99  KKRRGSSSSASAAVANFCAEAPVNSKRSGDCLSVVSSDALSVAVMEQNGVSQVGHGNVAT 158

Query: 121 GNCVLPFPVYDGKTQVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSN 180
           G+ + P      +  +S+ E  NG +  + HGSD+    + QK+++VDSINDSCSSSKSN
Sbjct: 159 GDLLTPLAC-SREPHISKYEFANGFSGVDNHGSDDVRKTVRQKTIDVDSINDSCSSSKSN 218

Query: 181 MELVSTSVKVEVDDTGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSM--AHAP-E 240
           MEL   S+K E+D+ GEC SSS+   E   ED+S +D C SILR+ G +  +  + AP  
Sbjct: 219 MELALASIKGEMDENGECCSSSVIAAEVVREDLSEKDRCFSILRNQGNVEEVGPSRAPLN 278

Query: 241 EESDFRNDNNCFRLCKTCGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCL 300
           EE      ++C R+CK CG SE+  KMLICD+CE+AFH+ CCN R+KKV  DEWYC SC+
Sbjct: 279 EEIGTSGASSCSRVCKICGRSETAQKMLICDNCEEAFHLRCCNPRIKKVPVDEWYCFSCM 338

Query: 301 KKKHKILKETISKKLANTLS-----RNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAE 349
           KKK  ++K+T ++  ++        R  SS+GES+ I LML+D EPY T VRIGKGFQA+
Sbjct: 339 KKKRIMVKDTTARNSSSITGCMGRCRGVSSEGESSPIELMLRDAEPYRTSVRIGKGFQAD 396

BLAST of Cla018856 vs. NCBI nr
Match: gi|700210389|gb|KGN65485.1| (hypothetical protein Csa_1G425890 [Cucumis sativus])

HSP 1 Score: 652.5 bits (1682), Expect = 4.3e-184
Identity = 320/363 (88.15%), Positives = 332/363 (91.46%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           MCPHCDEFSHDGCRKAG I EEKKN+GGLRCLNFPR  P   TV  MPEGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGRI-EEKKNSGGLRCLNFPRTFP---TVIMMPEGSKSNVVYRRK 60

Query: 61  KLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKT 120
           KLRG+SDSR LANGTDCISLISCDG+L EDKEQAAASQHNH+ EIVGN V PFPV DGKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGNLAEDKEQAAASQHNHEREIVGNAVPPFPVCDGKT 120

Query: 121 QVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180
           QVSELES NGC  GEGHGSDETPNN LQKSLEVDSINDSCSSSKSNMELVS S+KVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSASLKVEVDD 180

Query: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKT 240
           TGECSSSSIQVM D +EDISGRDLCISILRSNGLLSS  HAPEEESDFR+DNNCFRLCKT
Sbjct: 181 TGECSSSSIQVMGDAIEDISGRDLCISILRSNGLLSSTTHAPEEESDFRSDNNCFRLCKT 240

Query: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300
           CGSSESVLKMLICDHCEDAFHVSCCNHRMK+VSNDEW CNSCLKK HKILKE ISKKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKRVSNDEWCCNSCLKKNHKILKEAISKKLTN 300

Query: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISEYVIFFNYIYSFF 360
           T SRNGSSKGESNSIALMLKDT+PYTT +RIGKGFQAEVP+WSGPISEYVIFFNY+Y FF
Sbjct: 301 TSSRNGSSKGESNSIALMLKDTKPYTTCIRIGKGFQAEVPDWSGPISEYVIFFNYMYIFF 359

Query: 361 MQT 364
           MQT
Sbjct: 361 MQT 359

BLAST of Cla018856 vs. NCBI nr
Match: gi|659107176|ref|XP_008453560.1| (PREDICTED: uncharacterized protein LOC103494237 isoform X2 [Cucumis melo])

HSP 1 Score: 649.8 bits (1675), Expect = 2.8e-183
Identity = 317/348 (91.09%), Positives = 324/348 (93.10%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPR  P   T   M EGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFP---TAIMMSEGSKSNVVYRRK 60

Query: 61  KLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKT 120
           KLRG+SDSR LANGTDCISLISCDGHL EDKEQAAASQ NH+ EIVGN V PFPV DGKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKT 120

Query: 121 QVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180
           QVSELES NGC  GEGHGSDETPNN LQKSLEVDSINDSCSSSKSNMELVSTS+KVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDD 180

Query: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKT 240
           TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAH PEEESD R+DNNCFRLCKT
Sbjct: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKT 240

Query: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300
           CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN 300

Query: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISE 349
           TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVP+WSGPIS+
Sbjct: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISD 345

BLAST of Cla018856 vs. NCBI nr
Match: gi|659107172|ref|XP_008453557.1| (PREDICTED: uncharacterized protein LOC103494237 isoform X1 [Cucumis melo])

HSP 1 Score: 649.8 bits (1675), Expect = 2.8e-183
Identity = 317/348 (91.09%), Positives = 324/348 (93.10%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPR  P   T   M EGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRTFP---TAIMMSEGSKSNVVYRRK 60

Query: 61  KLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKT 120
           KLRG+SDSR LANGTDCISLISCDGHL EDKEQAAASQ NH+ EIVGN V PFPV DGKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGHLAEDKEQAAASQRNHEHEIVGNAVPPFPVCDGKT 120

Query: 121 QVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180
           QVSELES NGC  GEGHGSDETPNN LQKSLEVDSINDSCSSSKSNMELVSTS+KVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSTSLKVEVDD 180

Query: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKT 240
           TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAH PEEESD R+DNNCFRLCKT
Sbjct: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHVPEEESDSRSDNNCFRLCKT 240

Query: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300
           CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHK+LKE ISKKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKVLKEAISKKLTN 300

Query: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISE 349
           TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVP+WSGPIS+
Sbjct: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPDWSGPISD 345

BLAST of Cla018856 vs. NCBI nr
Match: gi|778660829|ref|XP_011656986.1| (PREDICTED: uncharacterized protein LOC101212408 isoform X1 [Cucumis sativus])

HSP 1 Score: 623.2 bits (1606), Expect = 2.8e-175
Identity = 306/348 (87.93%), Positives = 318/348 (91.38%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           MCPHCDEFSHDGCRKAG I EEKKN+GGLRCLNFPR  P   TV  MPEGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGRI-EEKKNSGGLRCLNFPRTFP---TVIMMPEGSKSNVVYRRK 60

Query: 61  KLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKT 120
           KLRG+SDSR LANGTDCISLISCDG+L EDKEQAAASQHNH+ EIVGN V PFPV DGKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGNLAEDKEQAAASQHNHEREIVGNAVPPFPVCDGKT 120

Query: 121 QVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180
           QVSELES NGC  GEGHGSDETPNN LQKSLEVDSINDSCSSSKSNMELVS S+KVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSASLKVEVDD 180

Query: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKT 240
           TGECSSSSIQVM D +EDISGRDLCISILRSNGLLSS  HAPEEESDFR+DNNCFRLCKT
Sbjct: 181 TGECSSSSIQVMGDAIEDISGRDLCISILRSNGLLSSTTHAPEEESDFRSDNNCFRLCKT 240

Query: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300
           CGSSESVLKMLICDHCEDAFHVSCCNHRMK+VSNDEW CNSCLKK HKILKE ISKKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKRVSNDEWCCNSCLKKNHKILKEAISKKLTN 300

Query: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISE 349
           T SRNGSSKGESNSIALMLKDT+PYTT +RIGKGFQAEVP+WSGPIS+
Sbjct: 301 TSSRNGSSKGESNSIALMLKDTKPYTTCIRIGKGFQAEVPDWSGPISD 344

BLAST of Cla018856 vs. NCBI nr
Match: gi|778660832|ref|XP_011656989.1| (PREDICTED: uncharacterized protein LOC101212408 isoform X2 [Cucumis sativus])

HSP 1 Score: 623.2 bits (1606), Expect = 2.8e-175
Identity = 306/348 (87.93%), Positives = 318/348 (91.38%), Query Frame = 1

Query: 1   MCPHCDEFSHDGCRKAGPIIEEKKNNGGLRCLNFPRAVPQISTVSTMPEGSKSNVVYRRK 60
           MCPHCDEFSHDGCRKAG I EEKKN+GGLRCLNFPR  P   TV  MPEGSKSNVVYRRK
Sbjct: 1   MCPHCDEFSHDGCRKAGRI-EEKKNSGGLRCLNFPRTFP---TVIMMPEGSKSNVVYRRK 60

Query: 61  KLRGNSDSRLLANGTDCISLISCDGHLGEDKEQAAASQHNHKSEIVGNCVLPFPVYDGKT 120
           KLRG+SDSR LANGTDCISLISCDG+L EDKEQAAASQHNH+ EIVGN V PFPV DGKT
Sbjct: 61  KLRGSSDSRFLANGTDCISLISCDGNLAEDKEQAAASQHNHEREIVGNAVPPFPVCDGKT 120

Query: 121 QVSELESVNGCTIGEGHGSDETPNNILQKSLEVDSINDSCSSSKSNMELVSTSVKVEVDD 180
           QVSELES NGC  GEGHGSDETPNN LQKSLEVDSINDSCSSSKSNMELVS S+KVEVDD
Sbjct: 121 QVSELESANGCIFGEGHGSDETPNNNLQKSLEVDSINDSCSSSKSNMELVSASLKVEVDD 180

Query: 181 TGECSSSSIQVMEDTVEDISGRDLCISILRSNGLLSSMAHAPEEESDFRNDNNCFRLCKT 240
           TGECSSSSIQVM D +EDISGRDLCISILRSNGLLSS  HAPEEESDFR+DNNCFRLCKT
Sbjct: 181 TGECSSSSIQVMGDAIEDISGRDLCISILRSNGLLSSTTHAPEEESDFRSDNNCFRLCKT 240

Query: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKKVSNDEWYCNSCLKKKHKILKETISKKLAN 300
           CGSSESVLKMLICDHCEDAFHVSCCNHRMK+VSNDEW CNSCLKK HKILKE ISKKL N
Sbjct: 241 CGSSESVLKMLICDHCEDAFHVSCCNHRMKRVSNDEWCCNSCLKKNHKILKEAISKKLTN 300

Query: 301 TLSRNGSSKGESNSIALMLKDTEPYTTGVRIGKGFQAEVPEWSGPISE 349
           T SRNGSSKGESNSIALMLKDT+PYTT +RIGKGFQAEVP+WSGPIS+
Sbjct: 301 TSSRNGSSKGESNSIALMLKDTKPYTTCIRIGKGFQAEVPDWSGPISD 344

The following BLAST results are available for this feature:

Swiss-Prot
Match Name  E-value  Identity (%)  Description
LID2_SCHPO  3.2e-06  35.29         Lid2 complex component lid2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 2484...

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LU88_CUCSA  3.0e-184  88.15         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G425890 PE=4 SV=1
F6I5N1_VITVI      1.8e-80   46.87         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0103g00660 PE=4 SV=...
B9SP83_RICCO      7.1e-77   44.85         Putative uncharacterized protein OS=Ricinus communis GN=RCOM_0594380 PE=4 SV=1
A0A061FTE8_THECC  1.4e-72   43.58         RING/FYVE/PHD zinc finger superfamily protein, putative isoform 6 (Fragment) OS=...
A0A061FRS6_THECC  4.0e-72   43.78         RING/FYVE/PHD zinc finger superfamily protein, putative isoform 4 OS=Theobroma c...

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|700210389|gb|KGN65485.1|       4.3e-184  88.15         hypothetical protein Csa_1G425890 [Cucumis sativus]
gi|659107176|ref|XP_008453560.1|  2.8e-183  91.09         PREDICTED: uncharacterized protein LOC103494237 isoform X2 [Cucumis melo]
gi|659107172|ref|XP_008453557.1|  2.8e-183  91.09         PREDICTED: uncharacterized protein LOC103494237 isoform X1 [Cucumis melo]
gi|778660829|ref|XP_011656986.1|  2.8e-175  87.93         PREDICTED: uncharacterized protein LOC101212408 isoform X1 [Cucumis sativus]
gi|778660832|ref|XP_011656989.1|  2.8e-175  87.93         PREDICTED: uncharacterized protein LOC101212408 isoform X2 [Cucumis sativus]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR001965  Znf_PHD
IPR011011  Znf_FYVE_PHD
IPR013083  Znf_RING/FYVE/PHD
IPR019786  Zinc_finger_PHD-type_CS
IPR019787  Znf_PHD-finger

Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO:0008270  zinc ion binding

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0016573 histone acetylation
biological_process GO:0008150 biological_process
biological_process GO:0045892 negative regulation of transcription, DNA-templated
biological_process GO:0045944 positive regulation of transcription from RNA polymerase II promoter
cellular_component GO:0005575 cellular_component
cellular_component GO:0000123 histone acetyltransferase complex
cellular_component GO:0000790 nuclear chromatin
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0004402 histone acetyltransferase activity
molecular_function GO:0042393 histone binding

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla018856     Cla018856.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source   Source Term       Source Description         Alignment
IPR001965  Zinc finger, PHD-type                  SMART    SM00249           PHD_3                      coord: 237..283; score: 1.
IPR011011  Zinc finger, FYVE/PHD-type             unknown  SSF57903          FYVE/PHD zinc finger       coord: 224..289; score: 1.48
IPR013083  Zinc finger, RING/FYVE/PHD-type        GENE3D   G3DSA:3.30.40.10                             coord: 221..295; score: 2.6
IPR019786  Zinc finger, PHD-type, conserved site  PROSITE  PS01359           ZF_PHD_1                   coord: 238..282; scor
IPR019787  Zinc finger, PHD-finger                PFAM     PF00628           PHD                        coord: 238..284; score: 4.
IPR019787  Zinc finger, PHD-finger                PROFILE  PS50016           ZF_PHD_2                   coord: 235..285; score: 9
None       No IPR available                       PANTHER  PTHR10615         HISTONE ACETYLTRANSFERASE  coord: 189..352; score: 1.5
None       No IPR available                       PANTHER  PTHR10615:SF117   SUBFAMILY NOT NAMED        coord: 189..352; score: 1.5
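
The zinc-finger hits in the InterPro table overlap on the protein. As a rough sketch, the region supported by every signature and the widest region covered by any of them can be derived from the coordinates alone. The dictionary keys are just labels chosen for this example, coordinates are assumed to be 1-based inclusive residue positions, and the two PANTHER rows are left out because they carry no IPR term.

```python
# Residue ranges (start, end) copied from the InterPro table above.
phd_hits = {
    "SMART SM00249": (237, 283),
    "SSF57903": (224, 289),
    "G3DSA:3.30.40.10": (221, 295),
    "PROSITE PS01359": (238, 282),
    "PFAM PF00628": (238, 284),
    "PROFILE PS50016": (235, 285),
}

starts = [start for start, _ in phd_hits.values()]
ends = [end for _, end in phd_hits.values()]

# Region shared by all signatures: latest start to earliest end.
core = (max(starts), min(ends))
# Widest region covered by at least one signature.
span = (min(starts), max(ends))

print(core)  # (238, 282)
print(span)  # (221, 295)
```

The shared core, residues 238..282, coincides with the PROSITE conserved-site match, the narrowest of the six hits.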

The following gene(s) are paralogous to this gene:

None