Cla006953 (gene) Watermelon (97103) v1

Name: Cla006953
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: AT-hook DNA-binding protein (Fragment) (AHRD V1 *--- A1DR81_CATRO); contains InterPro domain(s) IPR014476 Predicted AT-hook DNA-binding
Location: Chr6: 315844..316683 (-)
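The location gives a start and end coordinate on chromosome 6, minus strand. As a quick sanity check, the interval length can be computed as in the sketch below. It assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of display but is not stated on the page; `span_length` is a hypothetical helper name.

```python
def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (Chr6, minus strand).
length_nt = span_length(315844, 316683)
print(length_nt)       # 840
print(length_nt // 3)  # 280 codons if the whole span is coding
```

Under that assumption the gene spans 840 nucleotides, which would correspond to 279 residues plus a stop codon if the entire interval is coding.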
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGGAATTACACCCCACAATCGCCCTCTTCCTCCTCCTTTCCTTTCCAAAGATCTCCATCTCCACCATGGATTATTCCACCACCACCAAAACTCCGACGACGACCAAACCGCCGGAGCTAAGAGAGATAGAGAGGACGAAAACCCCACCACGGCGGCGATGGCGGACAGCAAAGAATTATCTAATTCTTCATCTCGACGCCCACGTGGCCGACCAGCTGGTTCCAAAAACAAGCCAAAGCCACCCATTATAATTACTAGAGACAGCGCTAACGCCCTCAGATCTCATCTCATCGAAATCTCCACTGCCTCCGACATCGTAGATTCCCTCGCTGCCTTCGCACGCCGCCGCCAGCGTGGCGTTTGTATTCTAAGTGCCACGGGTACCGTCGCCAACGTCACTCTCCGGCAGCCGGCCTCCCCCGGCGCTGTCATCACCTTACATGGGAGATTTGAAATTCTCTCCCTCTCCGGCTCTTTCCTCCCACCGCCGGCTCCCCCGGCCGCCTCTGGTCTGACCGTCTACTTGGCTGGCGGGCAGGGGCAGGTCGTCGGTGGGAATGTAATCGGCCCACTTTCGGCGTCCGGTCCGGTGATTATCATGGCGGCTTCTTTTGGTAATGCTGCGTATGAACGGCTACCCATTGATGACGACGACGAAACGTCACCGGCAGCAGAGATTGCCGGACAACAAGCAGCACCGCCGCCGCCACAATTGATGGGGGATCCTAATGGGGGATTGTTTCATGGGATGGCGCAGAATGTTGTGAATTCTTCATGCCAATTGCCGGCAGAGGCGGCGGCGGCGTTTTGGGGAGGAGGTCGGCCACCGTATTGA

mRNA sequence

ATGGATGGAATTACACCCCACAATCGCCCTCTTCCTCCTCCTTTCCTTTCCAAAGATCTCCATCTCCACCATGGATTATTCCACCACCACCAAAACTCCGACGACGACCAAACCGCCGGAGCTAAGAGAGATAGAGAGGACGAAAACCCCACCACGGCGGCGATGGCGGACAGCAAAGAATTATCTAATTCTTCATCTCGACGCCCACGTGGCCGACCAGCTGGTTCCAAAAACAAGCCAAAGCCACCCATTATAATTACTAGAGACAGCGCTAACGCCCTCAGATCTCATCTCATCGAAATCTCCACTGCCTCCGACATCGTAGATTCCCTCGCTGCCTTCGCACGCCGCCGCCAGCGTGGCGTTTGTATTCTAAGTGCCACGGGTACCGTCGCCAACGTCACTCTCCGGCAGCCGGCCTCCCCCGGCGCTGTCATCACCTTACATGGGAGATTTGAAATTCTCTCCCTCTCCGGCTCTTTCCTCCCACCGCCGGCTCCCCCGGCCGCCTCTGGTCTGACCGTCTACTTGGCTGGCGGGCAGGGGCAGGTCGTCGGTGGGAATGTAATCGGCCCACTTTCGGCGTCCGGTCCGGTGATTATCATGGCGGCTTCTTTTGGTAATGCTGCGTATGAACGGCTACCCATTGATGACGACGACGAAACGTCACCGGCAGCAGAGATTGCCGGACAACAAGCAGCACCGCCGCCGCCACAATTGATGGGGGATCCTAATGGGGGATTGTTTCATGGGATGGCGCAGAATGTTGTGAATTCTTCATGCCAATTGCCGGCAGAGGCGGCGGCGGCGTTTTGGGGAGGAGGTCGGCCACCGTATTGA

Coding sequence (CDS)

ATGGATGGAATTACACCCCACAATCGCCCTCTTCCTCCTCCTTTCCTTTCCAAAGATCTCCATCTCCACCATGGATTATTCCACCACCACCAAAACTCCGACGACGACCAAACCGCCGGAGCTAAGAGAGATAGAGAGGACGAAAACCCCACCACGGCGGCGATGGCGGACAGCAAAGAATTATCTAATTCTTCATCTCGACGCCCACGTGGCCGACCAGCTGGTTCCAAAAACAAGCCAAAGCCACCCATTATAATTACTAGAGACAGCGCTAACGCCCTCAGATCTCATCTCATCGAAATCTCCACTGCCTCCGACATCGTAGATTCCCTCGCTGCCTTCGCACGCCGCCGCCAGCGTGGCGTTTGTATTCTAAGTGCCACGGGTACCGTCGCCAACGTCACTCTCCGGCAGCCGGCCTCCCCCGGCGCTGTCATCACCTTACATGGGAGATTTGAAATTCTCTCCCTCTCCGGCTCTTTCCTCCCACCGCCGGCTCCCCCGGCCGCCTCTGGTCTGACCGTCTACTTGGCTGGCGGGCAGGGGCAGGTCGTCGGTGGGAATGTAATCGGCCCACTTTCGGCGTCCGGTCCGGTGATTATCATGGCGGCTTCTTTTGGTAATGCTGCGTATGAACGGCTACCCATTGATGACGACGACGAAACGTCACCGGCAGCAGAGATTGCCGGACAACAAGCAGCACCGCCGCCGCCACAATTGATGGGGGATCCTAATGGGGGATTGTTTCATGGGATGGCGCAGAATGTTGTGAATTCTTCATGCCAATTGCCGGCAGAGGCGGCGGCGGCGTTTTGGGGAGGAGGTCGGCCACCGTATTGA

Protein sequence

MDGITPHNRPLPPPFLSKDLHLHHGLFHHHQNSDDDQTAGAKRDREDENPTTAAMADSKELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSPAAEIAGQQAAPPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGGRPPY
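The protein sequence is the translation of the coding sequence listed above it. The following is a minimal sketch of that step, assuming the standard genetic code; `translate` is a hypothetical helper written for illustration, not the tool the database used. It is applied only to the first 30 and last 15 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 15 nt of the CDS for Cla006953.
print(translate("ATGGATGGAATTACACCCCACAATCGCCCT"))  # MDGITPHNRP
print(translate("CGGCCACCGTATTGA"))                 # RPPY
```

Both fragments agree with the start (MDGITPHNRP) and end (RPPY) of the protein sequence listed above.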
BLAST of Cla006953 vs. Swiss-Prot
Match: AHL22_ARATH (AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 1.5e-80
Identity = 171/318 (53.77%), Positives = 204/318 (64.15%), Query Frame = 1

Query: 8   NRPLPPPFLSKDLHLH-HGLFHHHQNS---------DDDQTAGAKRDRE---DENPTTAA 67
           +R LPPPFLS+DLHLH H  F H Q           D  +  G KRDR+   D N  ++A
Sbjct: 5   SRSLPPPFLSRDLHLHPHHQFQHQQQQQQQNHGHDIDQHRIGGLKRDRDADIDPNEHSSA 64

Query: 68  MADSKELS------------NSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEIS 127
             D                 N  +RRPRGRPAGSKNKPKPPIIITRDSANAL+SH++E++
Sbjct: 65  GKDQSTPGSGGESGGGGGGDNHITRRPRGRPAGSKNKPKPPIIITRDSANALKSHVMEVA 124

Query: 128 TASDIVDSLAAFARRRQRGVCILSATGTVANVTLRQPAS-PG---AVITLHGRFEILSLS 187
              D+++S+  FARRRQRG+C+LS  G V NVT+RQPAS PG   +V+ LHGRFEILSLS
Sbjct: 125 NGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGRFEILSLS 184

Query: 188 GSFLPPPAPPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDD 247
           GSFLPPPAPPAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP+++
Sbjct: 185 GSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASGPVVIMAASFGNAAYERLPLEE 244

Query: 248 DDETSPAA-----------------EIAGQQAAPPPPQLMGDPNGGLFHGMAQNVVNSSC 280
           DD+    A                 +   Q       QLM DP      G+  N++N S 
Sbjct: 245 DDQEEQTAGAVANNIDGNATMGGGTQTQTQTQQQQQQQLMQDPT-SFIQGLPPNLMN-SV 304

BLAST of Cla006953 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 1.2e-74
Identity = 163/325 (50.15%), Positives = 199/325 (61.23%), Query Frame = 1

Query: 1   MDGITPHNRP--LPPPFLSKDLHLH------HGLFHHHQN----SDDDQTAGA------K 60
           MD +  H     LPPPF ++D  LH          HHHQ     +D DQ  G+      K
Sbjct: 1   MDPVQSHGSQSSLPPPFHARDFQLHLQQQQQEFFLHHHQQQRNQTDGDQQGGSGGNRQIK 60

Query: 61  RDRED--------------ENPTTAAMADSKELSNSS------SRRPRGRPAGSKNKPKP 120
            DRE+              E         S E    S      +RRPRGRPAGSKNKPKP
Sbjct: 61  MDREETSDNIDNIANNSGSEGKDIDIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKP 120

Query: 121 PIIITRDSANALRSHLIEISTASDIVDSLAAFARRRQRGVCILSATGTVANVTLRQPA-- 180
           PIIITRDSANALR+H++EI    D+V+S+A FARRRQRGVC++S TG V NVT+RQP   
Sbjct: 121 PIIITRDSANALRTHVMEIGDGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSH 180

Query: 181 -SPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLAGGQGQVVGGNVIGPLSASGPV 240
            SPG+V++LHGRFEILSLSGSFLPPPAPP A+GL+VYLAGGQGQVVGG+V+GPL  +GPV
Sbjct: 181 PSPGSVVSLHGRFEILSLSGSFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPV 240

Query: 241 IIMAASFGNAAYERLPIDDDDETSPAAEIAGQQAAPPPP----QLMGDPNGGLFH-GMAQ 280
           ++MAASF NAAYERLP+++D+  +P     G  +   PP    QL         H G+  
Sbjct: 241 VVMAASFSNAAYERLPLEEDEMQTPVHGGGGGGSLESPPMMGQQLQHQQQAMSGHQGLPP 300

BLAST of Cla006953 vs. Swiss-Prot
Match: AHL26_ARATH (AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 2.4e-73
Identity = 164/339 (48.38%), Positives = 200/339 (59.00%), Query Frame = 1

Query: 1   MDGITPHNRP--LPPPFLSKDLHLH-----------------HGLFHHHQ------NSDD 60
           MD +  H     LPPPF ++D  LH                     HHHQ      + D 
Sbjct: 1   MDPVQSHGSQSSLPPPFHARDFQLHLQQQQQHQQQHQQQQQQQFFLHHHQQPQRNLDQDH 60

Query: 61  DQTAGA------KRDRE------DENPTTAAMADSKELS--------------NSSSRRP 120
           +Q  G+      K DRE      D    T + ++ KE+S                 +RRP
Sbjct: 61  EQQGGSILNRSIKMDREETSDNMDNIANTNSGSEGKEMSLHGGEGGSGGGGSGEQMTRRP 120

Query: 121 RGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRRQRGVCILSATG 180
           RGRPAGSKNKPK PIIITRDSANALR+H++EI    DIVD +A FARRRQRGVC++S TG
Sbjct: 121 RGRPAGSKNKPKAPIIITRDSANALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMSGTG 180

Query: 181 TVANVTLRQPASP-GAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLAGGQGQVVGGN 240
           +V NVT+RQP SP G+V++LHGRFEILSLSGSFLPPPAPPAA+GL+VYLAGGQGQVVGG+
Sbjct: 181 SVTNVTIRQPGSPPGSVVSLHGRFEILSLSGSFLPPPAPPAATGLSVYLAGGQGQVVGGS 240

Query: 241 VIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSP---AAEIAGQQAAPPPPQLMGDPN 280
           V+GPL  SGPV++MAASF NAAYERLP+++D+  +P        G       P +MG   
Sbjct: 241 VVGPLLCSGPVVVMAASFSNAAYERLPLEEDEMQTPVQGGGGGGGGGGGMGSPPMMGQQQ 300

BLAST of Cla006953 vs. Swiss-Prot
Match: AHL18_ARATH (AT-hook motif nuclear-localized protein 18 OS=Arabidopsis thaliana GN=AHL18 PE=2 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 4.4e-72
Identity = 152/276 (55.07%), Positives = 183/276 (66.30%), Query Frame = 1

Query: 8   NRPLPPPFLSKDLHLHHGLFHHHQNSDDDQTAGAKRDREDENPTTAAMADSKELSNSSSR 67
           +R   P FLS D H H+    HHQN+   +    +   E  N             N   R
Sbjct: 5   SRSHTPQFLSSD-HQHY----HHQNAGRQKRGREEEGVEPNNIGEDLATFPSGEENIKKR 64

Query: 68  RPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRRQRGVCILSA 127
           RPRGRPAGSKNKPK PII+TRDSANA R H++EI+ A D+++SLA FARRRQRGVC+L+ 
Sbjct: 65  RPRGRPAGSKNKPKAPIIVTRDSANAFRCHVMEITNACDVMESLAVFARRRQRGVCVLTG 124

Query: 128 TGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLAGGQGQVVGG 187
            G V NVT+RQP   G V++LHGRFEILSLSGSFLPPPAPPAASGL VYLAGGQGQV+GG
Sbjct: 125 NGAVTNVTVRQPG--GGVVSLHGRFEILSLSGSFLPPPAPPAASGLKVYLAGGQGQVIGG 184

Query: 188 NVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSPAAEIAGQQA----APPPPQLMGD 247
           +V+GPL+AS PV++MAASFGNA+YERLP+++++ET    EI G  A         QLM D
Sbjct: 185 SVVGPLTASSPVVVMAASFGNASYERLPLEEEEETE--REIDGNAARAIGTQTQKQLMQD 244

Query: 248 PNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGGRPPY 280
                F G   N++N S  LP E   A+WG  RP +
Sbjct: 245 ATS--FIGSPSNLIN-SVSLPGE---AYWGTQRPSF 265

BLAST of Cla006953 vs. Swiss-Prot
Match: AHL25_ARATH (AT-hook motif nuclear-localized protein 25 OS=Arabidopsis thaliana GN=AHL25 PE=1 SV=1)

HSP 1 Score: 250.4 bits (638), Expect = 2.4e-65
Identity = 135/266 (50.75%), Positives = 172/266 (64.66%), Query Frame = 1

Query: 14  PFLSKDLHLHH---GLFHHHQNSDDDQTAGAKRDREDENPTTAAMADSKELSNSSSRRPR 73
           P L ++LHL           QN+ +   + A   + +  PT  A + +    +SS RRPR
Sbjct: 7   PLLGQELHLQRPEDSRTPPDQNNMELNRSEADEAKAETTPTGGATSSATASGSSSGRRPR 66

Query: 74  GRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRRQRGVCILSATGT 133
           GRPAGSKNKPKPP IITRDS N LRSH++E+++ SDI ++++ +A RR  GVCI+S TG 
Sbjct: 67  GRPAGSKNKPKPPTIITRDSPNVLRSHVLEVTSGSDISEAVSTYATRRGCGVCIISGTGA 126

Query: 134 VANVTLRQPASP--GAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLAGGQGQVVGGN 193
           V NVT+RQPA+P  G VITLHGRF+ILSL+G+ LPPPAPP A GLTVYLAGGQGQVVGGN
Sbjct: 127 VTNVTIRQPAAPAGGGVITLHGRFDILSLTGTALPPPAPPGAGGLTVYLAGGQGQVVGGN 186

Query: 194 VIGPLSASGPVIIMAASFGNAAYERLPIDDDD--------------ETSPAAEIAGQQAA 253
           V G L ASGPV++MAASF NA Y+RLPI++++              E S ++E+ G  A 
Sbjct: 187 VAGSLIASGPVVLMAASFANAVYDRLPIEEEETPPPRTTGVQQQQPEASQSSEVTGSGAQ 246

Query: 254 PPPPQLMGDPNGG--LFHGMAQNVVN 259
                L G   GG   F+ +  N+ N
Sbjct: 247 ACESNLQGGNGGGGVAFYNLGMNMNN 272

BLAST of Cla006953 vs. TrEMBL
Match: A0A0A0KBK2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G076820 PE=4 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 9.1e-141
Identity = 259/284 (91.20%), Positives = 267/284 (94.01%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFH-HHQNSDDDQTAGAKRDRE-DENPTTAAMADS 60
           MDGITPHNRPLPPPFLSKDLHLHHGLFH HHQNSDDD T G KRDR+ D+NPT     D+
Sbjct: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFHAHHQNSDDDHTPGPKRDRDSDDNPTMD--DDT 60

Query: 61  KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRR 120
           KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLA FARRR
Sbjct: 61  KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLATFARRR 120

Query: 121 QRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLA 180
           QRGVCILSATGTVANVTLRQP+SPGAVITL GRFEILSLSGSFLPPPAPPAASGLTVYLA
Sbjct: 121 QRGVCILSATGTVANVTLRQPSSPGAVITLPGRFEILSLSGSFLPPPAPPAASGLTVYLA 180

Query: 181 GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSPAA-EIAGQQ--AAP 240
           GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDD+DETSPA  ++AGQQ  AAP
Sbjct: 181 GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDEDETSPAPDQMAGQQAAAAP 240

Query: 241 PPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGGRPPY 280
           PPPQL+GDPNGGLFHGMAQNVVNSSCQLP EAAAAFWGGGRPPY
Sbjct: 241 PPPQLLGDPNGGLFHGMAQNVVNSSCQLPGEAAAAFWGGGRPPY 282

BLAST of Cla006953 vs. TrEMBL
Match: A0A061DTV5_THECC (AT-hook motif nuclear-localized protein 22 OS=Theobroma cacao GN=TCM_005492 PE=4 SV=1)

HSP 1 Score: 369.0 bits (946), Expect = 5.1e-99
Identity = 193/308 (62.66%), Positives = 232/308 (75.32%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHL--HHGLFHHHQ--NSDDDQTAGAKRDREDENPTTAAMA 60
           MD +T H RPLPPPFL++DLHL  HH   HHHQ  NS+++Q  G KRDRE+   TT A A
Sbjct: 23  MDPVTAHGRPLPPPFLTRDLHLNPHHQFQHHHQQENSEEEQNRGQKRDREETATTTTATA 82

Query: 61  -----DSKELS------NSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTAS 120
                + KEL+         +RRPRGRP+GSKNKPKPPIIITRDSANALRSH++EI+   
Sbjct: 83  TTDTSEGKELAIIPGTEGEITRRPRGRPSGSKNKPKPPIIITRDSANALRSHVMEIANGC 142

Query: 121 DIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPP 180
           DI++S++ FARRRQRGVCILS +GTV NVTLRQP +PGAV+TLHGRFEILSLSGSFLPPP
Sbjct: 143 DIMESISTFARRRQRGVCILSGSGTVTNVTLRQPGAPGAVVTLHGRFEILSLSGSFLPPP 202

Query: 181 APPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDET--- 240
           APPAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP++++++    
Sbjct: 203 APPAASGLTIYLAGGQGQVVGGSVVGPLVASGPVVIMAASFGNAAYERLPLEEEEQPVAP 262

Query: 241 --------SPAAEIAGQQAAPPPP---QLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAF 280
                   SP++ +  QQ    PP   QL+ DPNG    G+  N++N S QLPAE   A+
Sbjct: 263 IPGSGPLGSPSSMVGQQQQQQQPPQQQQLLQDPNGSFVQGLPPNLLN-SVQLPAE---AY 322

BLAST of Cla006953 vs. TrEMBL
Match: B9RDE5_RICCO (ESC, putative OS=Ricinus communis GN=RCOM_1612340 PE=4 SV=1)

HSP 1 Score: 358.2 bits (918), Expect = 8.9e-96
Identity = 192/306 (62.75%), Positives = 223/306 (72.88%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLH--HGLFHHHQ------NSDDDQTAGAK----RDREDE 60
           MD +  H RPLPPPF ++DLHLH  H   HHHQ      NS+D+QT        + RE +
Sbjct: 1   MDPVAAHGRPLPPPFHTRDLHLHPHHQFQHHHQQQQQQQNSEDEQTGNGSINRGQKREHD 60

Query: 61  NPTTAAMADSKEL-------SNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEI 120
             TT    + KEL           +RRPRGRPAGSKNKPKPPIIITRDSANALRSH++EI
Sbjct: 61  EITTP---EGKELVPTTGGGDGEMTRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEI 120

Query: 121 STASDIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSF 180
           +  SDI++S++ FARRRQRGVCILS TGTV NVTLRQPASPGAV+TLHGRFEILSLSGSF
Sbjct: 121 ANGSDIMESVSTFARRRQRGVCILSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSF 180

Query: 181 LPPPAPPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDE 240
           LPPPAPPAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP+++DD 
Sbjct: 181 LPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPLEEDDG 240

Query: 241 TSP--------AAEIAGQQAAPPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWG 280
             P        +  + GQ     P QLM DPN  LF G+  N++N S QLPAE   A+WG
Sbjct: 241 QVPVPGSGPLDSPGVVGQTQPQQPQQLMQDPNPPLFQGLPPNLLN-SVQLPAE---AYWG 299

BLAST of Cla006953 vs. TrEMBL
Match: B9IC09_POPTR (DNA-binding family protein OS=Populus trichocarpa GN=POPTR_0014s06650g PE=4 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 7.6e-95
Identity = 190/304 (62.50%), Positives = 225/304 (74.01%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFHHHQNSDDDQTA------GAKRDREDENPTTAA 60
           MD ++ H RPLPPPF ++D HLH    H  QNS+D+Q+       G KR+ ++ N     
Sbjct: 1   MDPVSAHGRPLPPPFHTRDFHLHQFQHHQQQNSEDEQSGNGDLNRGQKREHDEINNNNNT 60

Query: 61  MADSKELSNSS------SRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIV 120
           +   + + +SS      SRRPRGRPAGSKNKPKPPIIITRDSANALRSH++EI+T SDI+
Sbjct: 61  VEGLELVPSSSGGEGEISRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIATGSDIM 120

Query: 121 DSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPP 180
           +S++ FARRRQRGVCILS TGTV NVTL+QPASPGAV+TLHGRFEILSLSGSFLPPPAPP
Sbjct: 121 ESVSTFARRRQRGVCILSGTGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPPPAPP 180

Query: 181 AASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDET------ 240
           AASGLTVYLAGGQGQV+GG+V GPL ASGPV++MAASFGNAAYERLP+++D E+      
Sbjct: 181 AASGLTVYLAGGQGQVIGGSVAGPLLASGPVVVMAASFGNAAYERLPLEEDIESQTPMLG 240

Query: 241 -----SPAAEIAGQQAA-PPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWG-GG 280
                SP     GQQ       QLM DP   LF G+ QN++N S QLPAE   A+WG GG
Sbjct: 241 SGPLGSPGINNIGQQQQNQQQQQLMQDPKTSLFQGLPQNLLN-SVQLPAE---AYWGTGG 300

BLAST of Cla006953 vs. TrEMBL
Match: B9GPH9_POPTR (DNA-binding family protein OS=Populus trichocarpa GN=POPTR_0002s15030g PE=4 SV=1)

HSP 1 Score: 353.6 bits (906), Expect = 2.2e-94
Identity = 190/303 (62.71%), Positives = 222/303 (73.27%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFHHHQNSDDDQTA------GAKRDREDENPTTAA 60
           MD +  H RPLPPPF ++D HLH       QNS+D+Q+       G KR+  +       
Sbjct: 1   MDPVAAHGRPLPPPFHTRDFHLHQFQHQQQQNSEDEQSGNGNLNRGQKREHAEIATNNNN 60

Query: 61  MADSKELSNSSS-------RRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDI 120
            A+ KEL  SS+       RRPRGRPAGSKNKPKPPIIITRDS NALRSH++EI+T  DI
Sbjct: 61  TAEGKELVPSSAGGEGEITRRPRGRPAGSKNKPKPPIIITRDSPNALRSHVMEIATGCDI 120

Query: 121 VDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAP 180
           ++S++ FARRRQRGVCILSATGTV NVTL+QPASPGAV+TLHGRFEILSLSGSFLPPPAP
Sbjct: 121 MESVSTFARRRQRGVCILSATGTVTNVTLKQPASPGAVVTLHGRFEILSLSGSFLPPPAP 180

Query: 181 PAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDET----- 240
           PAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP+++D+       
Sbjct: 181 PAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPLEEDESQTPVPG 240

Query: 241 -----SPAAEIAGQQAAPPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWG-GGR 280
                SP     GQQ      QLM DPN  LF G+ QN++N S QLP+E   A+WG GGR
Sbjct: 241 TGPLGSPGVSSIGQQ-NQQQHQLMQDPNTSLFQGLPQNLLN-SVQLPSE---AYWGTGGR 298

BLAST of Cla006953 vs. NCBI nr
Match: gi|778711158|ref|XP_011656695.1| (PREDICTED: AT-hook motif nuclear-localized protein 18-like [Cucumis sativus])

HSP 1 Score: 507.7 bits (1306), Expect = 1.3e-140
Identity = 259/284 (91.20%), Positives = 267/284 (94.01%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFH-HHQNSDDDQTAGAKRDRE-DENPTTAAMADS 60
           MDGITPHNRPLPPPFLSKDLHLHHGLFH HHQNSDDD T G KRDR+ D+NPT     D+
Sbjct: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFHAHHQNSDDDHTPGPKRDRDSDDNPTMD--DDT 60

Query: 61  KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRR 120
           KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLA FARRR
Sbjct: 61  KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLATFARRR 120

Query: 121 QRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLA 180
           QRGVCILSATGTVANVTLRQP+SPGAVITL GRFEILSLSGSFLPPPAPPAASGLTVYLA
Sbjct: 121 QRGVCILSATGTVANVTLRQPSSPGAVITLPGRFEILSLSGSFLPPPAPPAASGLTVYLA 180

Query: 181 GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSPAA-EIAGQQ--AAP 240
           GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDD+DETSPA  ++AGQQ  AAP
Sbjct: 181 GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDEDETSPAPDQMAGQQAAAAP 240

Query: 241 PPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGGRPPY 280
           PPPQL+GDPNGGLFHGMAQNVVNSSCQLP EAAAAFWGGGRPPY
Sbjct: 241 PPPQLLGDPNGGLFHGMAQNVVNSSCQLPGEAAAAFWGGGRPPY 282

BLAST of Cla006953 vs. NCBI nr
Match: gi|659134198|ref|XP_008467078.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 506.1 bits (1302), Expect = 3.8e-140
Identity = 259/284 (91.20%), Positives = 266/284 (93.66%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFH-HHQNSDDDQTAGAKRDRE-DENPTTAAMADS 60
           MDGITPHNRPLPPPFLSKDLHLHHGLFH HHQNSDDD T G KRDR+ D+NPT     D+
Sbjct: 1   MDGITPHNRPLPPPFLSKDLHLHHGLFHAHHQNSDDDHTPGPKRDRDSDDNPTMD--DDT 60

Query: 61  KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRR 120
           KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLA FARRR
Sbjct: 61  KELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLATFARRR 120

Query: 121 QRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLA 180
           QRGVCILSATGTVANVTLRQPASPGAVITL GRFEILSLSGSFLPPPAPPAASGLTVYLA
Sbjct: 121 QRGVCILSATGTVANVTLRQPASPGAVITLPGRFEILSLSGSFLPPPAPPAASGLTVYLA 180

Query: 181 GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSPAA-EIAGQQ--AAP 240
           GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDD+DETSPAA ++AGQQ  AA 
Sbjct: 181 GGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDEDETSPAADQMAGQQAAAAA 240

Query: 241 PPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGGRPPY 280
           PPPQL+GDPNGGLFHGMAQN VNSSCQLP EAAAAFWGGGRPPY
Sbjct: 241 PPPQLLGDPNGGLFHGMAQNAVNSSCQLPGEAAAAFWGGGRPPY 282

BLAST of Cla006953 vs. NCBI nr
Match: gi|590722914|ref|XP_007052032.1| (AT-hook motif nuclear-localized protein 22 [Theobroma cacao])

HSP 1 Score: 369.0 bits (946), Expect = 7.3e-99
Identity = 193/308 (62.66%), Positives = 232/308 (75.32%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHL--HHGLFHHHQ--NSDDDQTAGAKRDREDENPTTAAMA 60
           MD +T H RPLPPPFL++DLHL  HH   HHHQ  NS+++Q  G KRDRE+   TT A A
Sbjct: 23  MDPVTAHGRPLPPPFLTRDLHLNPHHQFQHHHQQENSEEEQNRGQKRDREETATTTTATA 82

Query: 61  -----DSKELS------NSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTAS 120
                + KEL+         +RRPRGRP+GSKNKPKPPIIITRDSANALRSH++EI+   
Sbjct: 83  TTDTSEGKELAIIPGTEGEITRRPRGRPSGSKNKPKPPIIITRDSANALRSHVMEIANGC 142

Query: 121 DIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPP 180
           DI++S++ FARRRQRGVCILS +GTV NVTLRQP +PGAV+TLHGRFEILSLSGSFLPPP
Sbjct: 143 DIMESISTFARRRQRGVCILSGSGTVTNVTLRQPGAPGAVVTLHGRFEILSLSGSFLPPP 202

Query: 181 APPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDET--- 240
           APPAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP++++++    
Sbjct: 203 APPAASGLTIYLAGGQGQVVGGSVVGPLVASGPVVIMAASFGNAAYERLPLEEEEQPVAP 262

Query: 241 --------SPAAEIAGQQAAPPPP---QLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAF 280
                   SP++ +  QQ    PP   QL+ DPNG    G+  N++N S QLPAE   A+
Sbjct: 263 IPGSGPLGSPSSMVGQQQQQQQPPQQQQLLQDPNGSFVQGLPPNLLN-SVQLPAE---AY 322

BLAST of Cla006953 vs. NCBI nr
Match: gi|225453933|ref|XP_002279636.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera])

HSP 1 Score: 361.3 bits (926), Expect = 1.5e-96
Identity = 193/304 (63.49%), Positives = 227/304 (74.67%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLHHG-LFHHHQ--NSDDDQTAGA------KRDREDENPT 60
           MD +T H RPLPPPF ++DL LHH   + HH   NS+D+Q+  +      KRDR++ N T
Sbjct: 1   MDPVTAHGRPLPPPFHTRDLQLHHHHQYQHHPQANSEDEQSGSSSLNRAQKRDRDESNAT 60

Query: 61  T-AAMADSKELSNSS-----SRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTAS 120
              +  D KE   SS     +RRPRGRPAGSKNKPKPPIIITRDSANALRSH++EI+T  
Sbjct: 61  NNTSPIDGKEFGTSSGDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEIATGC 120

Query: 121 DIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPP 180
           DI+DSL  FARRRQRG+CILS +GTV NVTLRQPASPGAV+TLHGRFEILSLSGSFLPPP
Sbjct: 121 DIMDSLNTFARRRQRGICILSGSGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSFLPPP 180

Query: 181 APPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSP- 240
           APPAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP++D++   P 
Sbjct: 181 APPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPLEDEEPQVPI 240

Query: 241 -------AAEIAGQ--QAAPPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGG 280
                  +  + GQ  Q      QL+ DPN  LF G+  N++N SCQLPAE   A+WG  
Sbjct: 241 PGSGPLGSPGMVGQQPQQQQQQQQLLPDPNASLFQGLPPNLLN-SCQLPAE---AYWGTA 300

BLAST of Cla006953 vs. NCBI nr
Match: gi|255541340|ref|XP_002511734.1| (PREDICTED: AT-hook motif nuclear-localized protein 22 [Ricinus communis])

HSP 1 Score: 358.2 bits (918), Expect = 1.3e-95
Identity = 192/306 (62.75%), Positives = 223/306 (72.88%), Query Frame = 1

Query: 1   MDGITPHNRPLPPPFLSKDLHLH--HGLFHHHQ------NSDDDQTAGAK----RDREDE 60
           MD +  H RPLPPPF ++DLHLH  H   HHHQ      NS+D+QT        + RE +
Sbjct: 1   MDPVAAHGRPLPPPFHTRDLHLHPHHQFQHHHQQQQQQQNSEDEQTGNGSINRGQKREHD 60

Query: 61  NPTTAAMADSKEL-------SNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEI 120
             TT    + KEL           +RRPRGRPAGSKNKPKPPIIITRDSANALRSH++EI
Sbjct: 61  EITTP---EGKELVPTTGGGDGEMTRRPRGRPAGSKNKPKPPIIITRDSANALRSHVMEI 120

Query: 121 STASDIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSF 180
           +  SDI++S++ FARRRQRGVCILS TGTV NVTLRQPASPGAV+TLHGRFEILSLSGSF
Sbjct: 121 ANGSDIMESVSTFARRRQRGVCILSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLSGSF 180

Query: 181 LPPPAPPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDE 240
           LPPPAPPAASGLT+YLAGGQGQVVGG+V+GPL ASGPV+IMAASFGNAAYERLP+++DD 
Sbjct: 181 LPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLLASGPVVIMAASFGNAAYERLPLEEDDG 240

Query: 241 TSP--------AAEIAGQQAAPPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWG 280
             P        +  + GQ     P QLM DPN  LF G+  N++N S QLPAE   A+WG
Sbjct: 241 QVPVPGSGPLDSPGVVGQTQPQQPQQLMQDPNPPLFQGLPPNLLN-SVQLPAE---AYWG 299

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
AHL22_ARATH  1.5e-80  53.77     AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1...
AHL24_ARATH  1.2e-74  50.15     AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2...
AHL26_ARATH  2.4e-73  48.38     AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2...
AHL18_ARATH  4.4e-72  55.07     AT-hook motif nuclear-localized protein 18 OS=Arabidopsis thaliana GN=AHL18 PE=2...
AHL25_ARATH  2.4e-65  50.75     AT-hook motif nuclear-localized protein 25 OS=Arabidopsis thaliana GN=AHL25 PE=1...

Match Name        E-value   Identity  Description
A0A0A0KBK2_CUCSA  9.1e-141  91.20     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G076820 PE=4 SV=1
A0A061DTV5_THECC  5.1e-99   62.66     AT-hook motif nuclear-localized protein 22 OS=Theobroma cacao GN=TCM_005492 PE=4...
B9RDE5_RICCO      8.9e-96   62.75     ESC, putative OS=Ricinus communis GN=RCOM_1612340 PE=4 SV=1
B9IC09_POPTR      7.6e-95   62.50     DNA-binding family protein OS=Populus trichocarpa GN=POPTR_0014s06650g PE=4 SV=1
B9GPH9_POPTR      2.2e-94   62.71     DNA-binding family protein OS=Populus trichocarpa GN=POPTR_0002s15030g PE=4 SV=1

Match Name                        E-value   Identity  Description
gi|778711158|ref|XP_011656695.1|  1.3e-140  91.20     PREDICTED: AT-hook motif nuclear-localized protein 18-like [Cucumis sativus]
gi|659134198|ref|XP_008467078.1|  3.8e-140  91.20     PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo]
gi|590722914|ref|XP_007052032.1|  7.3e-99   62.66     AT-hook motif nuclear-localized protein 22 [Theobroma cacao]
gi|225453933|ref|XP_002279636.1|  1.5e-96   63.49     PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]
gi|255541340|ref|XP_002511734.1|  1.3e-95   62.75     PREDICTED: AT-hook motif nuclear-localized protein 22 [Ricinus communis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005175  PPC_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0010228      vegetative to reproductive phase transition of meristem
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003680      AT DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla006953     Cla006953.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term         Source Description                                  Alignment
IPR005175  PPC domain        PFAM     PF03479             DUF296                                              coord: 95..207, score: 7.1
IPR005175  PPC domain        PROFILE  PS51742             PPC                                                 coord: 91..231, score: 39
None       No IPR available  GENE3D   G3DSA:3.30.1330.80                                                      coord: 94..220, score: 4.6
None       No IPR available  PANTHER  PTHR31100           FAMILY NOT NAMED                                    coord: 1..279, score: 2.6E
None       No IPR available  PANTHER  PTHR31100:SF2       AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 18-RELATED  coord: 1..279, score: 2.6E
None       No IPR available  unknown  SSF117856           AF0104/ALDC/Ptd012-like                             coord: 91..218, score: 7.32
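The `coord` values in the InterPro table locate each match on the protein. As a rough illustration, the sketch below extracts the PFAM PPC domain match (95..207) from the protein sequence of Cla006953. It assumes the coordinates are 1-based and inclusive, which the page does not state; `domain_slice` is a hypothetical helper name.

```python
# Protein sequence of Cla006953, copied from the "Protein sequence" section.
PROTEIN = (
    "MDGITPHNRPLPPPFLSKDLHLHHGLFHHHQNSDDDQTAGAKRDREDENPTTAAMADSKELSNSSSRRPRGRPAGSKNKPKPPIIITRDSANALRSHLIEISTASDIVDSLAAFARRRQRGVCILSATGTVANVTLRQPASPGAVITLHGRFEILSLSGSFLPPPAPPAASGLTVYLAGGQGQVVGGNVIGPLSASGPVIIMAASFGNAAYERLPIDDDDETSPAAEIAGQQAAPPPPQLMGDPNGGLFHGMAQNVVNSSCQLPAEAAAAFWGGGRPPY"
)


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# PFAM match PF03479 (DUF296), coord: 95..207.
pfam_ppc = domain_slice(PROTEIN, 95, 207)
print(len(pfam_ppc))  # 113
```

The same helper applies to the other rows, for example `domain_slice(PROTEIN, 91, 231)` for the PROFILE match PS51742.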

The following gene(s) are paralogous to this gene:
Gene       Paralogue  Organism               Block
Cla006953  Cla021299  Watermelon (97103) v1  wmwmB137