Cla015998 (gene) Watermelon (97103) v1

Name: Cla015998
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: WW domain-binding protein 4 (Fragment) (AHRD V1 *--* E2A9B9_CAMFO); contains InterPro domain(s) IPR000690 Zinc finger, C2H2-type matrin
Location: Chr2: 6015410..6019352 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTGAGGTATTCATTTCTTCAAATTCCTTGTTGTTTTCCCGCTAATCCTTTTTGTTGGGGTTCTTGCGTATATTTACTTTTTGATGAACTTGAAATAATCTGGAACACCTAAGCTACTTGATCTCCTTCCATGTTTAAAATTCACGTAGCTATCTAATTGTATATGTAATGTAATTCTGGCACCCATCTTGCAGATGTTACGTGGAAATGTTATTCTGATTCAAATACTGGCATTTTCCGCTAGATTTTAATATTGCCGATTTATTGTTTGGTTTGATCGTTTGTTGATCTTTTGTTGAGGATGAGAGAGAGGCAAAGCATTTTGCATATTTTATGTTCTTTCTTTTTTGCTTCATTGATTAGTTATGGAAAGCATGTATAATGTGTAATTCCTGAAAGCTTTTGCATTTGGAAATTGAAATATTGCTTCACGTTTACGTTGGTTGTGATGCCAATTCTCATTTCAATTTTTGGTGCAAGTTATAGTTTGGTTCCATGTATGCATTTTCATATGTACATATGTGTGTGTGTACATATTAACCTTCCATTATGAACTCTATTATTTTGAATTGTTGTCTGTGTTTTGATTCACATATTGGGAATGAAACCTGTGATCTTATATATATATATATATATATTCAGCGTTTGGAAGTTCAGATGTGTGGATAAACATTTATTTTACTTAGAATGATAATCCATGACAGCAAGACTGCTAAAGAGGTTTTTTCTTTTTCTTTTTTTTCTTTTTTTAAATCCCCCTTCCCAGTATTGGGTTAGCCAAGGTAACAAATGGTGCGACTTCTGCAAAATATTTATTTCAAATAATCCGTCCACAATCCGAAATCATGAGCTCGGTCAACGTCATAAGGACAATGTTGCCAAAAAGCTTGCAAACATGAGAAAAGAGAATGCTGCCAAGGACAAAGAACAAAAGGAAGCAGTACGTGCCATTGAGCAGATTGAGGCAGTAAGTGAATGAACATTCTATTCATTGTATCTTTGTTTTTTTTTTTTTTTTTTTGGCAAATAAAAATTCATATAAAAACAGAATCTAAGACATCACTCAACGTTTTCAACATAAAAAATTTGAATTTCTTTCAATTAAATGCATCTAGGTGGTGTCACATTGCATAAAGGATATTATTATATTATGCCTCAGTTACTGGGAACTCAAAGGCAACATCTTTGCCACAAAGCTTCCTGCAAACTGCAGCGAAACTCTCCAACTTGTACTCGGTGTTGTTCCACTCCTTTGGTTCCAAGAAAACCTTCTCGTTTTTTGATCCGTCAACCCTGTATCTGGTTTGTTTCCCCACAATCTCTGCAGGCAAAAGAATATCTTTGTTGATGACTAAGTATAGTAGTATGCATCTTTTCATCAAAAAAGAAATATAGTAGTATGTGTCTGTTTTAACTTTCAAATCTTGGGGGCTTGGTTCAATTCAGAAGTATTAAAGTAAGCCTTAATGTACTAAAAGTTATGAATTCTCATGTCAACTTTATTGTAGTTAGAGATTGGTTAAAGACATAAAGTTAAATTATCTGCTTGCTTCGCATCAGTACTGCAAAGGAACCAAGCTGTTTACATGAATCCTTAATCTTTAGCTTAACAAAGGTTAGCTTATTAGTGAATTGTGCAAGATGAATGTAGTTTCTGCTTTGAATTTGACCTGTTAATTTCGTGTAGTTTCCCTAACAGGAGTTCTTAAAATCTAATTGTTGTATGTTGATTATTACACTTGAGTCTTGAGTATATAATGTGGGTTCTTGAGTCTGATGTACTTGTACATGCCTAGTAAGAATAGATAGTAATGTCCTACTATAAATAATTTCTCACGGTTGATTTTTCTTTCTAGAGTAAGGATTCTTTTGAGTATCTTATCCAGAAAAACAAAAAAGGATCCTTTCTAAGCATGGTTGTCCACATAACGAGTATGCCAATCTCTGGCGACTTTCTTATTTAAGGAGTTTGTTTTAATGGGATTGTGATATGGATA
CTTTTAACAATGGTAAAGAAGAGAATTGAAGTTGCTGAAACTTTAAATAACAGGGATAAGTGCCAAATTTTTAAGTTCACGCAGAGTCCTCTAAGTGTATGGCCTCAATATGTTCATCGTGTATGAGCAAGAAGAATAGTGTGCTTAATCCATGAAACCATCATGGGTTGGCCTAGTGGTAAATAAGGGGGGTATGACCTTGATAAAGGGGCTAAGAGGTCATGGGTTCAGTCCATGGTGGTCACCTACCTAGGATTTAATATCCTACGAGTTTCCTTGGCACCCAAATTTTGTAGGGTCAGGCGGGTTGTTCTATGAGATTAGTCGAGGTGCGTGTAAGTTGGCTCGGACACTCATGGATATAAAAAAAGTGTGTTTAATCCAATTTTTCTCTCAATTCTGCAGAAAGCCAATCGTAGTTATCAGAAGGACGTAGCAAATTTACGGGAAGCTAGAGATTCTCATGCACTTCCTGTTGATGTTCATGAAAATGGTGAAGAGAGTAAGTACTGTAGAGTTGCAATTTATTTGTTTTAAGAAATGCCCTTCTATTTCTATCTCTCTCCAGAACTACCTGAAAATTAATGCCTTATGTTTGATGTGAATTTGAGCATCTGTTGACAATGCTCTGAGAAATATCATTCCATTGATATTCTTTTGTGACAACACTAAATTCCTTGCTTTTTTGTGGTGCTCAAAAAATAGGAATTTCTTTTGTAATTATAGCTTGTCCATGATTCAAATGAATTGGAAAGCTCTCTTGTAGGCGTTTTGTTGGGTGAGAGTTTTTTCTTCTCCTTTGCCTTTAGGTTGTTTTGTTTTGTGCCTTTTTCTCATACTTCATAATATACTTCCTTTTAAAAAAAAAGTTGATCTACTTAAGTGAAAGGTCTATGTAATGGGACATAATATTATGTGCATATTTATTATGCATGGGTGTTTATATAGATAATCAATAAATTGGGCCTTCCATTGTTCAGGACACTCCTATATCGGTGTAAGGGTTTATATCAATTTATTGGTTAAGGAACATGTCATGCATTATTGTTGTAAGTTCTAACCACTAACAACTTCAGTATGAGACCAGAAGGCTGGATAAATTATTTGTGTTTTTGTTTTGTTTCTCTGTGCTTTTCACTTCAAGTGGTAAAGAAAAGTCTTCTTTTTGCTTTGGTAAGAAACAAGAGATTCCCTTAATAAAAAAATGAAGGAAGAAGGGACAGGAAGACACATTTTATATTCAATGTTTCTTTGATCCATCAACTTATCCATTTGGATAATTCTTTTCTTTTACCCTTTGTGAGCCTGGTACTGATCATTATCATTTATTATCTTCTTGATCGTGTTCTTCAAACTGATACAGCTCCCAATGATGACAGAATGGGAGCTTGACAGCACTTCGGGCTATTATTATAATGAAAGCAATGGTTTTTACTATGATTCAAGTTCAGGCTTCTACTACTCTGATGCCATTGGTACTGATTGGTTTTTGGGAAGATGAACTTTGAGCTTCTATCTTTTACATCTTAGAATACGACTATACAAGTATTTGTTGCTTACCAAGTTAGTTGTTTGCAACAGGCAAGTGGGTAACACAGGAAGAGGCACATTCTTCGCCTCAATTCTTTTTGAACTCCAAACACAAGAAACCAGTTTTAGCAAAGCCATCATCAGCCTCTGCAAGTGCAGATATAAAAGATAAAAATGTGGATAAAGGCGAAAGTGGGCCGCCGCCTGGGTTAGTTGTTTCAGCTTCTTTAAACCCGAAGCGATCTGTTAAAGGTGCTCCTTCATCGATTGCTGTTGGTAAGAGGAAGAGGCCAGATGATAAGCAAAAGGCTATATCTGAAGAAGAAAAAGCTGCACTCAAAGCAAGGGAGGCTGCGAAGAAGAGGGTTGAAAAAAGGGAGAAGCCACTTCTCGGCCTCTACAAATTGCCTTGA

mRNA sequence

ATGACTGAGTATTGGGTTAGCCAAGGTAACAAATGGTGCGACTTCTGCAAAATATTTATTTCAAATAATCCGTCCACAATCCGAAATCATGAGCTCGGTCAACGTCATAAGGACAATGTTGCCAAAAAGCTTGCAAACATGAGAAAAGAGAATGCTGCCAAGGACAAAGAACAAAAGGAAGCAGTACGTGCCATTGAGCAGATTGAGGCAAAAGCCAATCGTAGTTATCAGAAGGACGTAGCAAATTTACGGGAAGCTAGAGATTCTCATGCACTTCCTGTTGATGTTCATGAAAATGGTGAAGAGAAATGGGAGCTTGACAGCACTTCGGGCTATTATTATAATGAAAGCAATGGTTTTTACTATGATTCAAGTTCAGGCTTCTACTACTCTGATGCCATTGGCAAGTGGGTAACACAGGAAGAGGCACATTCTTCGCCTCAATTCTTTTTGAACTCCAAACACAAGAAACCAGTTTTAGCAAAGCCATCATCAGCCTCTGCAAGTGCAGATATAAAAGATAAAAATGTGGATAAAGGCGAAAGTGGGCCGCCGCCTGGGTTAGTTGTTTCAGCTTCTTTAAACCCGAAGCGATCTGTTAAAGGTGCTCCTTCATCGATTGCTGTTGGTAAGAGGAAGAGGCCAGATGATAAGCAAAAGGCTATATCTGAAGAAGAAAAAGCTGCACTCAAAGCAAGGGAGGCTGCGAAGAAGAGGGTTGAAAAAAGGGAGAAGCCACTTCTCGGCCTCTACAAATTGCCTTGA

Coding sequence (CDS)

ATGACTGAGTATTGGGTTAGCCAAGGTAACAAATGGTGCGACTTCTGCAAAATATTTATTTCAAATAATCCGTCCACAATCCGAAATCATGAGCTCGGTCAACGTCATAAGGACAATGTTGCCAAAAAGCTTGCAAACATGAGAAAAGAGAATGCTGCCAAGGACAAAGAACAAAAGGAAGCAGTACGTGCCATTGAGCAGATTGAGGCAAAAGCCAATCGTAGTTATCAGAAGGACGTAGCAAATTTACGGGAAGCTAGAGATTCTCATGCACTTCCTGTTGATGTTCATGAAAATGGTGAAGAGAAATGGGAGCTTGACAGCACTTCGGGCTATTATTATAATGAAAGCAATGGTTTTTACTATGATTCAAGTTCAGGCTTCTACTACTCTGATGCCATTGGCAAGTGGGTAACACAGGAAGAGGCACATTCTTCGCCTCAATTCTTTTTGAACTCCAAACACAAGAAACCAGTTTTAGCAAAGCCATCATCAGCCTCTGCAAGTGCAGATATAAAAGATAAAAATGTGGATAAAGGCGAAAGTGGGCCGCCGCCTGGGTTAGTTGTTTCAGCTTCTTTAAACCCGAAGCGATCTGTTAAAGGTGCTCCTTCATCGATTGCTGTTGGTAAGAGGAAGAGGCCAGATGATAAGCAAAAGGCTATATCTGAAGAAGAAAAAGCTGCACTCAAAGCAAGGGAGGCTGCGAAGAAGAGGGTTGAAAAAAGGGAGAAGCCACTTCTCGGCCTCTACAAATTGCCTTGA

Protein sequence

MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKEAVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGFYYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKGESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRVEKREKPLLGLYKLP
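
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch (not the database's own pipeline), the Python below translates the first 30 and last 12 nucleotides of the CDS with the standard genetic code. The helper name `translate` and the compact codon-table construction are illustrative assumptions, not part of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# (Hypothetical helper for illustration; not taken from the database.)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGACTGAGTATTGGGTTAGCCAAGGTAAC"  # first 30 nt of the CDS
cds_end = "AAATTGCCTTGA"                      # last 12 nt of the CDS

print(translate(cds_start))  # MTEYWVSQGN
print(translate(cds_end))    # KLP
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record.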
BLAST of Cla015998 vs. Swiss-Prot
Match: ZOP1_ARATH (Zinc finger protein ZOP1 OS=Arabidopsis thaliana GN=ZOP1 PE=1 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 3.8e-78
Identity = 148/256 (57.81%), Positives = 189/256 (73.83%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWC+FCKI+I NNP++IRNH+LG+RH++ V KKL +MR+ +AAKDKE K+
Sbjct: 1   MTEYWVSQGNKWCEFCKIWIQNNPTSIRNHDLGKRHRECVDKKLTDMRERSAAKDKELKK 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
             + ++QIEAKA RSYQKD+A  ++   ++  P    E+G   W LDS SGYYYN++NG 
Sbjct: 61  NEKLLQQIEAKATRSYQKDIATAQQVAKANGAP----EDGTSDWMLDSASGYYYNQTNGL 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           +YDS SGFYYSD+IG WVTQ+EA+++ +   +S  K P++ KP S+S +           
Sbjct: 121 HYDSQSGFYYSDSIGHWVTQDEAYAAVK--TSSGTKVPLVKKPVSSSGAGP--------- 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVG--KRKRPDDKQKAISEEEKAALKAREAAKK 240
             G PPG +V+ASLNPKR+VKGA SS+ +G  KRKR D+K K +S EEKAALKAREAA+K
Sbjct: 181 SVGKPPGRLVTASLNPKRAVKGAASSVDLGNNKRKRQDEKPKKVSAEEKAALKAREAARK 240

Query: 241 RVEKREKPLLGLYKLP 255
           RVE REKPLLGLY  P
Sbjct: 241 RVEDREKPLLGLYNRP 241

BLAST of Cla015998 vs. Swiss-Prot
Match: WBP4_HUMAN (WW domain-binding protein 4 OS=Homo sapiens GN=WBP4 PE=1 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-13
Identity = 32/83 (38.55%), Positives = 57/83 (68.67%), Query Frame = 1

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  ++  HE G+ HK+NVAK+++ +++++  K KE+++
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSVEFHERGKNHKENVAKRISEIKQKSLDKAKEEEK 60

Query: 61 AVRAIEQIEAKANRSYQKDVANL 84
          A +    +EA A ++YQ+D+  L
Sbjct: 61 ASKEFAAMEAAALKAYQEDLKRL 83

BLAST of Cla015998 vs. Swiss-Prot
Match: WBP4_RAT (WW domain-binding protein 4 OS=Rattus norvegicus GN=Wbp4 PE=2 SV=1)

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-13
Identity = 32/83 (38.55%), Positives = 57/83 (68.67%), Query Frame = 1

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  ++  HE G+ HK+NVA+K++ +++++  K KE+++
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSVEFHERGKNHKENVARKISEIKQKSLDKAKEEEK 60

Query: 61 AVRAIEQIEAKANRSYQKDVANL 84
          A +    +EA A ++YQ+D+  L
Sbjct: 61 ASKEFAAMEAAALKAYQEDLKRL 83

BLAST of Cla015998 vs. Swiss-Prot
Match: WBP4_MOUSE (WW domain-binding protein 4 OS=Mus musculus GN=Wbp4 PE=1 SV=4)

HSP 1 Score: 78.2 bits (191), Expect = 1.5e-13
Identity = 35/98 (35.71%), Positives = 62/98 (63.27%), Query Frame = 1

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  ++  HE G+ HK+NVA++++ +++++  K KE+++
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSVEFHERGKNHKENVARRISEIKQKSLDKAKEEEK 60

Query: 61 AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHE 99
          A +    +EA A ++YQ+D+  L        LP D+ E
Sbjct: 61 ASKEFAAMEAAALKAYQEDLKRL-----GLPLPSDISE 93

BLAST of Cla015998 vs. Swiss-Prot
Match: WBP4_CHICK (WW domain-binding protein 4 OS=Gallus gallus GN=WBP4 PE=2 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 2.5e-13
Identity = 33/83 (39.76%), Positives = 54/83 (65.06%), Query Frame = 1

Query: 1  MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
          M +YW SQ  K+CD+CK +I++N  +I  HE G+ HK+NVAK+++ +RK++  K KE++ 
Sbjct: 1  MADYWKSQPKKFCDYCKCWIADNRPSIDFHERGKNHKENVAKRISEIRKKSMEKAKEEEN 60

Query: 61 AVRAIEQIEAKANRSYQKDVANL 84
            +    +E  A ++YQ+D+  L
Sbjct: 61 MSKEFAAMEEAAMKAYQEDLKRL 83

BLAST of Cla015998 vs. TrEMBL
Match: A0A0A0K4T4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G074280 PE=4 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 1.2e-136
Identity = 240/254 (94.49%), Positives = 248/254 (97.64%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKANRSYQKD+AN REARDSHALPVDV E G+EKWELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANFREARDSHALPVDVQETGDEKWELDSTSGYYYNESNGF 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           YYDS+SGFYYSDAIGKWVTQEEAHSSPQFFL+SKHKKP+LAKPSSASAS  IKDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHSSPQFFLDSKHKKPILAKPSSASASTAIKDKNVDKG 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           E GPPPGLVVSASLNPKRS+KGAPSSIAVGKRKRPD+KQKAISEEEKAALKAREAAKKRV
Sbjct: 181 EGGPPPGLVVSASLNPKRSIKGAPSSIAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EKREKPLLGLYKLP 255
           EKREKPLLGLY+LP
Sbjct: 241 EKREKPLLGLYRLP 254
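
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The sketch below recomputes them from the header of the Cucumis sativus hit above. The function name `hsp_percentages` and the regular expression are illustrative assumptions; the pattern also tolerates the misspelling "Postives" that some BLAST report renderings emit.

```python
import re

# Matches e.g. "Identity = 240/254 (94.49%), Positives = 248/254 (97.64%)".
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+) .*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(header: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from an HSP header line."""
    match = HSP_STATS.search(header)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 240/254 (94.49%), Positives = 248/254 (97.64%), Query Frame = 1"
print(hsp_percentages(line))  # (94.49, 97.64)
```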

BLAST of Cla015998 vs. TrEMBL
Match: A9PJ87_9ROSI (Putative uncharacterized protein OS=Populus trichocarpa x Populus deltoides PE=2 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 4.2e-100
Identity = 183/252 (72.62%), Positives = 214/252 (84.92%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNP++IRNHELGQRHKDNVAKKL +MRK+N AK+K+QKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPTSIRNHELGQRHKDNVAKKLDSMRKDNIAKEKQQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           A RA+EQIEAKANRSYQKDVANL+EA    AL  D+ E+G+EKW+ DSTSGYYYN+SNG 
Sbjct: 61  AARALEQIEAKANRSYQKDVANLKEASSLRAL--DIQEDGQEKWDYDSTSGYYYNQSNGL 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           +YD +SGFYYSDAIGKWVTQEEA+++ Q    S++K+    KP  ASA + +K+  V   
Sbjct: 121 HYDPNSGFYYSDAIGKWVTQEEAYAAVQISSGSRNKESSFKKPLPASAVSSVKENKV-AA 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           +SGPPPG VVSASLNP+RSVKGAPS  AV KRKRPD+K KA+S EEKAALKAREAA+KRV
Sbjct: 181 QSGPPPGPVVSASLNPRRSVKGAPSKFAVNKRKRPDEKPKAVSVEEKAALKAREAARKRV 240

Query: 241 EKREKPLLGLYK 253
           E+REK LLGLY+
Sbjct: 241 EEREKSLLGLYQ 249

BLAST of Cla015998 vs. TrEMBL
Match: M5VKZ1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010352mg PE=4 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 7.1e-100
Identity = 180/253 (71.15%), Positives = 217/253 (85.77%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKI+ISNNPS+IRNHELGQRHKDNVAKKLA+MRKE  AK+KE+KE
Sbjct: 1   MTEYWVSQGNKWCDFCKIYISNNPSSIRNHELGQRHKDNVAKKLADMRKEKVAKEKEEKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           A RA+ QIEAKA RSYQKDVA+ +EARD+ A   DV ++G+EKW+ +STSGYYYN+SNGF
Sbjct: 61  AERALLQIEAKAKRSYQKDVASFQEARDARAF--DVEDDGQEKWQYNSTSGYYYNQSNGF 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSAS-ASADIKDKNVDK 180
           YYD++SGFYYSDAIGKWV QEEA+++PQF  N+ +K+ +L  P S S A    ++K  DK
Sbjct: 121 YYDANSGFYYSDAIGKWVAQEEAYANPQFLSNAGYKETILKNPGSTSGAGPATENKRADK 180

Query: 181 GESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKR 240
            ++GPPPG VVSASLNP RSVKGA SS++VGKRKR ++K +AIS +E AALKAREAA+KR
Sbjct: 181 SQNGPPPGPVVSASLNPMRSVKGARSSVSVGKRKRQEEKPRAISAQEAAALKAREAARKR 240

Query: 241 VEKREKPLLGLYK 253
           VE+REKPLLGLY+
Sbjct: 241 VEEREKPLLGLYR 251

BLAST of Cla015998 vs. TrEMBL
Match: W9RY93_9ROSA (WW domain-binding protein 4 OS=Morus notabilis GN=L484_027564 PE=4 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 7.1e-100
Identity = 179/250 (71.60%), Positives = 213/250 (85.20%), Query Frame = 1

Query: 4   YWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKEAVR 63
           YWVSQGNKWCDFCKI++SNNP++IRNHELGQRHKDNVAK+LA+MRKE AAK+KE+KEA R
Sbjct: 23  YWVSQGNKWCDFCKIYLSNNPASIRNHELGQRHKDNVAKRLADMRKEKAAKEKEEKEAAR 82

Query: 64  AIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGFYYD 123
            +EQIEAKA RSYQKD+ NL++ARDSHAL +D    G+E+WE DSTSGYYYN++NGFYYD
Sbjct: 83  LLEQIEAKARRSYQKDMTNLKDARDSHALDID----GQEEWEYDSTSGYYYNQTNGFYYD 142

Query: 124 SSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKD-KNVDKGES 183
           S SGFYYSD IGKWVTQEEA+++PQF  N+ HK+  L KP S+S    +K+ K   KG++
Sbjct: 143 SKSGFYYSDTIGKWVTQEEAYATPQFSSNAGHKEKTLKKPVSSSELGLVKENKTAAKGQN 202

Query: 184 GPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRVEK 243
           G PPG VVS S+NPKRSVKGAPSS+ VGKRKRPD+K KA+S+EE AALKAR AAKKRVE+
Sbjct: 203 GAPPGPVVSGSVNPKRSVKGAPSSLTVGKRKRPDEKPKAVSKEEAAALKARVAAKKRVEE 262

Query: 244 REKPLLGLYK 253
           REK L GLY+
Sbjct: 263 REKSLHGLYR 268

BLAST of Cla015998 vs. TrEMBL
Match: B9HRX6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s10850g PE=4 SV=2)

HSP 1 Score: 370.9 bits (951), Expect = 1.2e-99
Identity = 182/252 (72.22%), Positives = 214/252 (84.92%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNP++IRNHELGQRHKDNVAKKL +MRK+N AK+K+QKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPTSIRNHELGQRHKDNVAKKLDSMRKDNIAKEKQQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           A RA+EQIEAKANRSYQKDVANL+EA    AL  D+ E+G+EKW+ DSTSGYYYN+SNG 
Sbjct: 61  AARALEQIEAKANRSYQKDVANLKEASSLRAL--DIQEDGQEKWDYDSTSGYYYNQSNGL 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           +YD +SGFYYSDAIGKWVTQEEA+++ +    S++K+    KP  ASA + +K+  V   
Sbjct: 121 HYDPNSGFYYSDAIGKWVTQEEAYAAVRISSGSRNKESSFKKPLPASAVSSVKENKV-AA 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           +SGPPPG VVSASLNP+RSVKGAPS  AV KRKRPD+K KA+S EEKAALKAREAA+KRV
Sbjct: 181 QSGPPPGPVVSASLNPRRSVKGAPSKFAVNKRKRPDEKPKAVSVEEKAALKAREAARKRV 240

Query: 241 EKREKPLLGLYK 253
           E+REK LLGLY+
Sbjct: 241 EEREKSLLGLYQ 249

BLAST of Cla015998 vs. NCBI nr
Match: gi|778725157|ref|XP_011658907.1| (PREDICTED: uncharacterized protein C18H10.07 [Cucumis sativus])

HSP 1 Score: 493.8 bits (1270), Expect = 1.8e-136
Identity = 240/254 (94.49%), Positives = 248/254 (97.64%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKANRSYQKD+AN REARDSHALPVDV E G+EKWELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANFREARDSHALPVDVQETGDEKWELDSTSGYYYNESNGF 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           YYDS+SGFYYSDAIGKWVTQEEAHSSPQFFL+SKHKKP+LAKPSSASAS  IKDKNVDKG
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHSSPQFFLDSKHKKPILAKPSSASASTAIKDKNVDKG 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           E GPPPGLVVSASLNPKRS+KGAPSSIAVGKRKRPD+KQKAISEEEKAALKAREAAKKRV
Sbjct: 181 EGGPPPGLVVSASLNPKRSIKGAPSSIAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EKREKPLLGLYKLP 255
           EKREKPLLGLY+LP
Sbjct: 241 EKREKPLLGLYRLP 254

BLAST of Cla015998 vs. NCBI nr
Match: gi|659109879|ref|XP_008454929.1| (PREDICTED: uncharacterized protein LOC103495224 [Cucumis melo])

HSP 1 Score: 486.5 bits (1251), Expect = 2.8e-134
Identity = 237/254 (93.31%), Positives = 244/254 (96.06%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNV KKLANMRKENAAKDKEQKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVTKKLANMRKENAAKDKEQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           AVRAIEQIEAKANRSYQKD+AN REARDSHALPVDV E G+EKWELDSTSGYYYNESNGF
Sbjct: 61  AVRAIEQIEAKANRSYQKDIANFREARDSHALPVDVQETGDEKWELDSTSGYYYNESNGF 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           YYDS+SGFYYSDAIGKWVTQEEAHSSPQFFL+SKHKKP+L  PSSASAS  IKDKNVDK 
Sbjct: 121 YYDSNSGFYYSDAIGKWVTQEEAHSSPQFFLDSKHKKPILGMPSSASASTAIKDKNVDKA 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           E GPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPD+KQKAISEEEKAALKAREAAKKRV
Sbjct: 181 EGGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDEKQKAISEEEKAALKAREAAKKRV 240

Query: 241 EKREKPLLGLYKLP 255
           EKREKPLLGLY+LP
Sbjct: 241 EKREKPLLGLYRLP 254

BLAST of Cla015998 vs. NCBI nr
Match: gi|118489270|gb|ABK96440.1| (unknown [Populus trichocarpa x Populus deltoides])

HSP 1 Score: 372.5 bits (955), Expect = 6.0e-100
Identity = 183/252 (72.62%), Positives = 214/252 (84.92%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNP++IRNHELGQRHKDNVAKKL +MRK+N AK+K+QKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPTSIRNHELGQRHKDNVAKKLDSMRKDNIAKEKQQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           A RA+EQIEAKANRSYQKDVANL+EA    AL  D+ E+G+EKW+ DSTSGYYYN+SNG 
Sbjct: 61  AARALEQIEAKANRSYQKDVANLKEASSLRAL--DIQEDGQEKWDYDSTSGYYYNQSNGL 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           +YD +SGFYYSDAIGKWVTQEEA+++ Q    S++K+    KP  ASA + +K+  V   
Sbjct: 121 HYDPNSGFYYSDAIGKWVTQEEAYAAVQISSGSRNKESSFKKPLPASAVSSVKENKV-AA 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           +SGPPPG VVSASLNP+RSVKGAPS  AV KRKRPD+K KA+S EEKAALKAREAA+KRV
Sbjct: 181 QSGPPPGPVVSASLNPRRSVKGAPSKFAVNKRKRPDEKPKAVSVEEKAALKAREAARKRV 240

Query: 241 EKREKPLLGLYK 253
           E+REK LLGLY+
Sbjct: 241 EEREKSLLGLYQ 249

BLAST of Cla015998 vs. NCBI nr
Match: gi|694332726|ref|XP_009356986.1| (PREDICTED: RNA-binding protein 5 [Pyrus x bretschneideri])

HSP 1 Score: 372.5 bits (955), Expect = 6.0e-100
Identity = 183/253 (72.33%), Positives = 221/253 (87.35%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKI+IS+NPS++RNHELGQRHKDNVAKKLA+MRKE AAK+KE+KE
Sbjct: 1   MTEYWVSQGNKWCDFCKIYISHNPSSVRNHELGQRHKDNVAKKLADMRKEKAAKEKEEKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           A RA+ QIEAKA RSYQKDVANL+EARDS A  VD  ++  EKW+ +STSGYYYN+SNGF
Sbjct: 61  AERALLQIEAKAKRSYQKDVANLKEARDSRAFEVD--DDDLEKWQYNSTSGYYYNQSNGF 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSAS-ASADIKDKNVDK 180
           YYD +SGFYYSD+IGKWVTQEEA+++PQF  N+ +K+ +L KP SAS A +  ++K  DK
Sbjct: 121 YYDPNSGFYYSDSIGKWVTQEEAYANPQFLSNAGYKETILKKPVSASGAGSATENKGADK 180

Query: 181 GESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKR 240
            ++GPPPG VVS+S+NPKRSVKGA SS+AVGKRKR D+K +A+S EE AALKAREAA+KR
Sbjct: 181 TQNGPPPGPVVSSSINPKRSVKGAASSVAVGKRKR-DEKPRAVSAEEAAALKAREAARKR 240

Query: 241 VEKREKPLLGLYK 253
           V++REKPLLGLY+
Sbjct: 241 VQEREKPLLGLYR 250

BLAST of Cla015998 vs. NCBI nr
Match: gi|743794344|ref|XP_011000045.1| (PREDICTED: RNA-binding protein 5 isoform X1 [Populus euphratica])

HSP 1 Score: 372.1 bits (954), Expect = 7.8e-100
Identity = 183/252 (72.62%), Positives = 214/252 (84.92%), Query Frame = 1

Query: 1   MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE 60
           MTEYWVSQGNKWCDFCKIFISNNP++IRNHELGQRHKDNVAKKL +MRK+N AK+K+QKE
Sbjct: 1   MTEYWVSQGNKWCDFCKIFISNNPTSIRNHELGQRHKDNVAKKLDSMRKDNIAKEKQQKE 60

Query: 61  AVRAIEQIEAKANRSYQKDVANLREARDSHALPVDVHENGEEKWELDSTSGYYYNESNGF 120
           A RA+EQIEAKANRSYQKDVANL+EA    AL  D+ E+G+EKW+ DSTSGYYYN+SNG 
Sbjct: 61  ATRALEQIEAKANRSYQKDVANLKEASSLRAL--DIQEDGQEKWDYDSTSGYYYNQSNGL 120

Query: 121 YYDSSSGFYYSDAIGKWVTQEEAHSSPQFFLNSKHKKPVLAKPSSASASADIKDKNVDKG 180
           +YD +SGFYYSDAIGKWVTQEEA+++ +    SK+K+    KP  ASA + +K+  V   
Sbjct: 121 HYDPNSGFYYSDAIGKWVTQEEAYAAVRISSGSKNKESSFKKPLPASAVSSVKENKV-AA 180

Query: 181 ESGPPPGLVVSASLNPKRSVKGAPSSIAVGKRKRPDDKQKAISEEEKAALKAREAAKKRV 240
           +SGPPPG VVSASLNP+RS KGAPS IAV KRKRPD+K KA+S EEKAALKAREAA+KRV
Sbjct: 181 QSGPPPGPVVSASLNPRRSAKGAPSKIAVNKRKRPDEKPKAVSVEEKAALKAREAARKRV 240

Query: 241 EKREKPLLGLYK 253
           E+REK LLGLY+
Sbjct: 241 EEREKSLLGLYQ 249

The following BLAST results are available for this feature:

Swiss-Prot
Match Name  E-value  Identity (%)  Description
ZOP1_ARATH  3.8e-78  57.81         Zinc finger protein ZOP1 OS=Arabidopsis thaliana GN=ZOP1 PE=1 SV=1
WBP4_HUMAN  1.1e-13  38.55         WW domain-binding protein 4 OS=Homo sapiens GN=WBP4 PE=1 SV=1
WBP4_RAT    1.1e-13  38.55         WW domain-binding protein 4 OS=Rattus norvegicus GN=Wbp4 PE=2 SV=1
WBP4_MOUSE  1.5e-13  35.71         WW domain-binding protein 4 OS=Mus musculus GN=Wbp4 PE=1 SV=4
WBP4_CHICK  2.5e-13  39.76         WW domain-binding protein 4 OS=Gallus gallus GN=WBP4 PE=2 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0K4T4_CUCSA  1.2e-136  94.49         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G074280 PE=4 SV=1
A9PJ87_9ROSI      4.2e-100  72.62         Putative uncharacterized protein OS=Populus trichocarpa x Populus deltoides PE=2...
M5VKZ1_PRUPE      7.1e-100  71.15         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010352mg PE=4 SV=1
W9RY93_9ROSA      7.1e-100  71.60         WW domain-binding protein 4 OS=Morus notabilis GN=L484_027564 PE=4 SV=1
B9HRX6_POPTR      1.2e-99   72.22         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s10850g PE=4 SV=2

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|778725157|ref|XP_011658907.1|  1.8e-136  94.49         PREDICTED: uncharacterized protein C18H10.07 [Cucumis sativus]
gi|659109879|ref|XP_008454929.1|  2.8e-134  93.31         PREDICTED: uncharacterized protein LOC103495224 [Cucumis melo]
gi|118489270|gb|ABK96440.1|       6.0e-100  72.62         unknown [Populus trichocarpa x Populus deltoides]
gi|694332726|ref|XP_009356986.1|  6.0e-100  72.33         PREDICTED: RNA-binding protein 5 [Pyrus x bretschneideri]
gi|743794344|ref|XP_011000045.1|  7.8e-100  72.62         PREDICTED: RNA-binding protein 5 isoform X1 [Populus euphratica]

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR000690  Matrin/U1-C_Znf_C2H2
IPR003604  Matrin/U1-like-C_Znf_C2H2
IPR013085  U1-CZ_Znf_C2H2

Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO:0008270  zinc ion binding

Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0080188 RNA-directed DNA methylation
biological_process GO:0008380 RNA splicing
biological_process GO:0008150 biological_process
biological_process GO:0044699 single-organism process
biological_process GO:0090304 nucleic acid metabolic process
biological_process GO:0010467 gene expression
biological_process GO:0044260 cellular macromolecule metabolic process
biological_process GO:0000398 mRNA splicing, via spliceosome
biological_process GO:0009845 seed germination
cellular_component GO:0005634 nucleus
cellular_component GO:0015030 Cajal body
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003725 double-stranded RNA binding
molecular_function GO:0003690 double-stranded DNA binding
molecular_function GO:0046872 metal ion binding
molecular_function GO:0003676 nucleic acid binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU76202      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla015998     Cla015998.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU76202      WMU76202     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                Source   Source Term  Source Description            Alignment
IPR000690  Zinc finger, C2H2-type matrin  PROFILE  PS50171      ZF_MATRIN                     coord: 11..42; score: 9
IPR003604  Zinc finger, U1-type           SMART    SM00451      ZnF_U1_5                      coord: 8..43; score: 2.
IPR013085  Zinc finger, U1-C type         PFAM     PF06220      zf-U1                         coord: 11..42; score: 4.
None       No IPR available               unknown  Coil         Coil                          coord: 37..74; scor
None       No IPR available               PANTHER  PTHR13173    WW DOMAIN BINDING PROTEIN 4   coord: 1..252; score: 2.8
None       No IPR available               unknown  SSF57667     beta-beta-alpha zinc fingers  coord: 12..57; score: 5.6
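
The `coord` values above can be mapped back onto the protein sequence. The sketch below assumes they are 1-based, inclusive residue positions (the usual InterProScan convention, not stated in this record) and extracts the 11..42 zinc-finger region from the first 60 residues of the protein. The helper name `extract_region` is hypothetical.

```python
# First 60 residues of the Cla015998 protein, copied from this record.
PROTEIN_PREFIX = "MTEYWVSQGNKWCDFCKIFISNNPSTIRNHELGQRHKDNVAKKLANMRKENAAKDKEQKE"


def extract_region(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("coordinates fall outside the sequence")
    return protein[start - 1:end]


zinc_finger = extract_region(PROTEIN_PREFIX, 11, 42)
print(zinc_finger)  # KWCDFCKIFISNNPSTIRNHELGQRHKDNVAK

# A C2H2-type finger is expected to carry two Cys and two His residues.
print(zinc_finger.count("C"), zinc_finger.count("H"))  # 2 2
```

Under that coordinate assumption, the extracted 32-residue region contains exactly two cysteines and two histidines, consistent with the C2H2-type annotation.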

The following gene(s) are paralogous to this gene:

None