Cp4.1LG13g01990 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g01990
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: AT hook motif DNA-binding family protein
Location: Cp4.1LG13 : 1532186 .. 1533055 (-)
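
The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is common for this kind of record but is not stated on the page.

```python
import re

def parse_location(text):
    """Parse 'seqid : start .. end (strand)' into a dictionary."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG13 : 1532186 .. 1533055 (-)")
print(loc["strand"], loc["length"])  # prints: - 870
```

Under that assumption the feature spans 870 bp on the minus strand.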
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGCTCACTCCCTTCCTCCTCCCTTCCACACCCGAGATTTCCATCTCCAGACTCCGTTTCACCATAACAACAACAACCACAACAACAACAATTCCGAAGAAGAACATAGCGCCACCACCAGCCGTCTCAAGCGTGATCGCGACGACGACCATCACCCCAACAACTCTGCCCCCAACAGCGGTGGAGATCCCGACGGAGAAATCACTCGTCGCCCTAGAGGCAGGCCTGCCGGATCCAAAAACAAACCCAAACCCCCTATTATCATTACTCGTGACAGTGCTAACGCCCTTCGGACCCATGTCATCGAGGTCACCGATGGCTGCGACATCGTCGACAGTGTTGCTACCTTCGCTCGCCGCCGACAACGTGGTGTTTGTATCATGAGCGGTACCGGCACCGTTACCAACGTCACTCTCCGCCAACCTGCCTCCCCCGGTGCAATTGTCAACTTACACGGTCGTTTCGAGATCTTGTCCCTAGCCGGGTCTTTCCTCCCTCCTCCGGCTCCTCCTGCCGCTACCACCTTAACCATTTACCTCGCCGGCGGCCAGGGTCAAGTCGTCGGCGGTAGCGTCGTCGGTACTTTAATCGCTTCTGGCCCTGTCGTCATCATGGCTGCCTCGTTTAGTAACGCCGCTTATGAACGGCTCCCGTTGGAGGAGGACGACCAGCCGCCACTTCCGTCGATGCAAGGCGGCGGCGGAATTGGGTCTCCCGATGAAATTGGACAGCCCCAACAACAGCAGCAACTCGTCGGCGACGGAAATGGGCCGTTGTTTCAAGGTTTGCCTCCGAATCTGCTAAATTCAATTCAAATGCCACCATCGGAAGCAGCTTATTGGGCAAATTCCNATAATGTCAGTTGA

mRNA sequence

ATGGATGCTCACTCCCTTCCTCCTCCCTTCCACACCCGAGATTTCCATCTCCAGACTCCGTTTCACCATAACAACAACAACCACAACAACAACAATTCCGAAGAAGAACATAGCGCCACCACCAGCCGTCTCAAGCGTGATCGCGACGACGACCATCACCCCAACAACTCTGCCCCCAACAGCGGTGGAGATCCCGACGGAGAAATCACTCGTCGCCCTAGAGGCAGGCCTGCCGGATCCAAAAACAAACCCAAACCCCCTATTATCATTACTCGTGACAGTGCTAACGCCCTTCGGACCCATGTCATCGAGGTCACCGATGGCTGCGACATCGTCGACAGTGTTGCTACCTTCGCTCGCCGCCGACAACGTGGTGTTTGTATCATGAGCGGTACCGGCACCGTTACCAACGTCACTCTCCGCCAACCTGCCTCCCCCGGTGCAATTGTCAACTTACACGGTCGTTTCGAGATCTTGTCCCTAGCCGGGTCTTTCCTCCCTCCTCCGGCTCCTCCTGCCGCTACCACCTTAACCATTTACCTCGCCGGCGGCCAGGGTCAAGTCGTCGGCGGTAGCGTCGTCGGTACTTTAATCGCTTCTGGCCCTGTCGTCATCATGGCTGCCTCGTTTAGTAACGCCGCTTATGAACGGCTCCCGTTGGAGGAGGACGACCAGCCGCCACTTCCGTCGATGCAAGGCGGCGGCGGAATTGGGTCTCCCGATGAAATTGGACAGCCCCAACAACAGCAGCAACTCGTCGGCGACGGAAATGGGCCGTTGTTTCAAGGTTTGCCTCCGAATCTGCTAAATTCAATTCAAATGCCACCATCGGAAGCAGCTTATTGGGCAAATTCCNATAATGTCAGTTGA

Coding sequence (CDS)

ATGGATGCTCACTCCCTTCCTCCTCCCTTCCACACCCGAGATTTCCATCTCCAGACTCCGTTTCACCATAACAACAACAACCACAACAACAACAATTCCGAAGAAGAACATAGCGCCACCACCAGCCGTCTCAAGCGTGATCGCGACGACGACCATCACCCCAACAACTCTGCCCCCAACAGCGGTGGAGATCCCGACGGAGAAATCACTCGTCGCCCTAGAGGCAGGCCTGCCGGATCCAAAAACAAACCCAAACCCCCTATTATCATTACTCGTGACAGTGCTAACGCCCTTCGGACCCATGTCATCGAGGTCACCGATGGCTGCGACATCGTCGACAGTGTTGCTACCTTCGCTCGCCGCCGACAACGTGGTGTTTGTATCATGAGCGGTACCGGCACCGTTACCAACGTCACTCTCCGCCAACCTGCCTCCCCCGGTGCAATTGTCAACTTACACGGTCGTTTCGAGATCTTGTCCCTAGCCGGGTCTTTCCTCCCTCCTCCGGCTCCTCCTGCCGCTACCACCTTAACCATTTACCTCGCCGGCGGCCAGGGTCAAGTCGTCGGCGGTAGCGTCGTCGGTACTTTAATCGCTTCTGGCCCTGTCGTCATCATGGCTGCCTCGTTTAGTAACGCCGCTTATGAACGGCTCCCGTTGGAGGAGGACGACCAGCCGCCACTTCCGTCGATGCAAGGCGGCGGCGGAATTGGGTCTCCCGATGAAATTGGACAGCCCCAACAACAGCAGCAACTCGTCGGCGACGGAAATGGGCCGTTGTTTCAAGGTTTGCCTCCGAATCTGCTAAATTCAATTCAAATGCCACCATCGGAAGCAGCTTATTGGGCAAATTCCNATAATGTCAGTTGA

Protein sequence

MDAHSLPPPFHTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAPNSGGDPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGGIGSPDEIGQPQQQQQLVGDGNGPLFQGLPPNLLNSIQMPPSEAAYWANSXNVS
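
The protein sequence corresponds to a translation of the CDS, with the ambiguous codon `NAT` near the 3' end appearing as `X`. A minimal sketch of that translation follows. It assumes the standard genetic code and maps any codon containing a non-ACGT base to `X`; the names `CODON_TABLE` and `translate` are illustrative, and the pipeline actually used to produce the record is not stated on the page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are not in the table -> 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last codons of the CDS shown above.
print(translate("ATGGATGCTCACTCC"))           # prints: MDAHS
print(translate("GCAAATTCCNATAATGTCAGTTGA"))  # prints: ANSXNVS
```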
BLAST of Cp4.1LG13g01990 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 312.8 bits (800), Expect = 4.0e-84
Identity = 184/311 (59.16%), Positives = 223/311 (71.70%), Query Frame = 1

Query: 5   SLPPPFHTRDF--HLQTP-----FHHNNNNHNNNNSEEEHSATTSR-LKRDRDD-----D 64
           SLPPPFH RDF  HLQ        HH+    N  + +++  +  +R +K DR++     D
Sbjct: 12  SLPPPFHARDFQLHLQQQQQEFFLHHHQQQRNQTDGDQQGGSGGNRQIKMDREETSDNID 71

Query: 65  HHPNNSAPNS------GGDPDG--------EITRRPRGRPAGSKNKPKPPIIITRDSANA 124
           +  NNS          GG  +G        ++TRRPRGRPAGSKNKPKPPIIITRDSANA
Sbjct: 72  NIANNSGSEGKDIDIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANA 131

Query: 125 LRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPA---SPGAIVNLHG 184
           LRTHV+E+ DGCD+V+SVATFARRRQRGVC+MSGTG VTNVT+RQP    SPG++V+LHG
Sbjct: 132 LRTHVMEIGDGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHG 191

Query: 185 RFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAA 244
           RFEILSL+GSFLPPPAPP AT L++YLAGGQGQVVGGSVVG L+ +GPVV+MAASFSNAA
Sbjct: 192 RFEILSLSGSFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAA 251

Query: 245 YERLPLEEDD-QPPLPSMQGGGGIGSPDEIGQPQQQQQLVGDGNGPLFQGLPPNLLNSIQ 284
           YERLPLEED+ Q P+    GGG + SP  +GQ  Q QQ    G+    QGLPPNLL S+Q
Sbjct: 252 YERLPLEEDEMQTPVHGGGGGGSLESPPMMGQQLQHQQQAMSGH----QGLPPNLLGSVQ 311
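
Each HSP header reports identities and positives as a count over the alignment length, followed by the percentage. A minimal sketch for reading those values and checking that the percentage matches the counts is shown here. The function name `parse_hsp_stats` is hypothetical, and the pattern accepts both the spelling "Positives" and the variant "Postives" that some BLAST report renderings emit.

```python
import re

# Matches e.g. "Identity = 184/311 (59.16%)"; tolerates "Postives".
STAT = re.compile(r"(Identity|Pos\w*tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {'identity': (hits, length, pct), 'positives': (...)}."""
    stats = {}
    for label, hits, length, pct in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(hits), int(length), float(pct))
    return stats

line = "Identity = 184/311 (59.16%), Positives = 223/311 (71.70%), Query Frame = 1"
for name, (hits, length, pct) in parse_hsp_stats(line).items():
    # The reported percentage should agree with hits / length.
    consistent = abs(100 * hits / length - pct) < 0.01
    print(name, hits, length, pct, consistent)
```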

BLAST of Cp4.1LG13g01990 vs. Swiss-Prot
Match: AHL22_ARATH (AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 2.6e-83
Identity = 185/309 (59.87%), Positives = 212/309 (68.61%), Query Frame = 1

Query: 3   AHSLPPPFHTRDFHLQT--PFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNN---- 62
           + SLPPPF +RD HL     F H       N+  +        LKRDRD D  PN     
Sbjct: 5   SRSLPPPFLSRDLHLHPHHQFQHQQQQQQQNHGHDIDQHRIGGLKRDRDADIDPNEHSSA 64

Query: 63  ----SAPNSGGDP------DGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVT 122
               S P SGG+       D  ITRRPRGRPAGSKNKPKPPIIITRDSANAL++HV+EV 
Sbjct: 65  GKDQSTPGSGGESGGGGGGDNHITRRPRGRPAGSKNKPKPPIIITRDSANALKSHVMEVA 124

Query: 123 DGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPAS-PG---AIVNLHGRFEILSLA 182
           +GCD+++SV  FARRRQRG+C++SG G VTNVT+RQPAS PG   ++VNLHGRFEILSL+
Sbjct: 125 NGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGRFEILSLS 184

Query: 183 GSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEE 242
           GSFLPPPAPPAA+ LTIYLAGGQGQVVGGSVVG L+ASGPVVIMAASF NAAYERLPLEE
Sbjct: 185 GSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASGPVVIMAASFGNAAYERLPLEE 244

Query: 243 DDQPPLPSMQGGGGIGSPDEIG---------QPQQQQQLVGDGNGPLFQGLPPNLLNSIQ 283
           DDQ    +      I     +G         Q QQQQQL+ D      QGLPPNL+NS+Q
Sbjct: 245 DDQEEQTAGAVANNIDGNATMGGGTQTQTQTQQQQQQQLMQDPTS-FIQGLPPNLMNSVQ 304

BLAST of Cp4.1LG13g01990 vs. Swiss-Prot
Match: AHL26_ARATH (AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 1.4e-81
Identity = 188/328 (57.32%), Positives = 221/328 (67.38%), Query Frame = 1

Query: 5   SLPPPFHTRDF--HLQTP----------------FHHNNNNHNNNNSEEEH---SATTSR 64
           SLPPPFH RDF  HLQ                   HH+     N + + E    S     
Sbjct: 12  SLPPPFHARDFQLHLQQQQQHQQQHQQQQQQQFFLHHHQQPQRNLDQDHEQQGGSILNRS 71

Query: 65  LKRDRDD--DHHPNNSAPNSG----------------GDPDGE-ITRRPRGRPAGSKNKP 124
           +K DR++  D+  N +  NSG                G   GE +TRRPRGRPAGSKNKP
Sbjct: 72  IKMDREETSDNMDNIANTNSGSEGKEMSLHGGEGGSGGGGSGEQMTRRPRGRPAGSKNKP 131

Query: 125 KPPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPA 184
           K PIIITRDSANALRTHV+E+ DGCDIVD +ATFARRRQRGVC+MSGTG+VTNVT+RQP 
Sbjct: 132 KAPIIITRDSANALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMSGTGSVTNVTIRQPG 191

Query: 185 SP-GAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPV 244
           SP G++V+LHGRFEILSL+GSFLPPPAPPAAT L++YLAGGQGQVVGGSVVG L+ SGPV
Sbjct: 192 SPPGSVVSLHGRFEILSLSGSFLPPPAPPAATGLSVYLAGGQGQVVGGSVVGPLLCSGPV 251

Query: 245 VIMAASFSNAAYERLPLEEDD-QPPLP----SMQGGGGIGSPDEIGQPQQQQQLVGDGNG 284
           V+MAASFSNAAYERLPLEED+ Q P+        GGGG+GSP  +GQ Q    +      
Sbjct: 252 VVMAASFSNAAYERLPLEEDEMQTPVQGGGGGGGGGGGMGSPPMMGQQQAMAAMAA---- 311

BLAST of Cp4.1LG13g01990 vs. Swiss-Prot
Match: AHL18_ARATH (AT-hook motif nuclear-localized protein 18 OS=Arabidopsis thaliana GN=AHL18 PE=2 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 1.4e-65
Identity = 144/258 (55.81%), Positives = 181/258 (70.16%), Query Frame = 1

Query: 30  NNNSEEEHSATTSRLKRDRDDDH-HPNNSAPNSGGDPDGEIT---RRPRGRPAGSKNKPK 89
           +++ +  H     R KR R+++   PNN   +    P GE     RRPRGRPAGSKNKPK
Sbjct: 14  SSDHQHYHHQNAGRQKRGREEEGVEPNNIGEDLATFPSGEENIKKRRPRGRPAGSKNKPK 73

Query: 90  PPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPAS 149
            PII+TRDSANA R HV+E+T+ CD+++S+A FARRRQRGVC+++G G VTNVT+RQP  
Sbjct: 74  APIIVTRDSANAFRCHVMEITNACDVMESLAVFARRRQRGVCVLTGNGAVTNVTVRQPG- 133

Query: 150 PGAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVI 209
            G +V+LHGRFEILSL+GSFLPPPAPPAA+ L +YLAGGQGQV+GGSVVG L AS PVV+
Sbjct: 134 -GGVVSLHGRFEILSLSGSFLPPPAPPAASGLKVYLAGGQGQVIGGSVVGPLTASSPVVV 193

Query: 210 MAASFSNAAYERLPLEEDDQPPLP-SMQGGGGIGSPDEIGQPQQQQQLVGDGNGPLFQGL 269
           MAASF NA+YERLPLEE+++            IG+       Q Q+QL+ D     F G 
Sbjct: 194 MAASFGNASYERLPLEEEEETEREIDGNAARAIGT-------QTQKQLMQDATS--FIGS 253

Query: 270 PPNLLNSIQMPPSEAAYW 283
           P NL+NS+ +P    AYW
Sbjct: 254 PSNLINSVSLPGE--AYW 258

BLAST of Cp4.1LG13g01990 vs. Swiss-Prot
Match: AHL23_ARATH (AT-hook motif nuclear-localized protein 23 OS=Arabidopsis thaliana GN=AHL23 PE=1 SV=1)

HSP 1 Score: 245.4 bits (625), Expect = 7.9e-64
Identity = 144/275 (52.36%), Positives = 183/275 (66.55%), Query Frame = 1

Query: 11  HTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAPNSGGDPDGE-- 70
           H  D HL    HHN+++ +          T      D +++H   + A   G    G   
Sbjct: 18  HRPDLHL----HHNSSSDDVTPGAGMGHFTVD--DEDNNNNHQGLDLASGGGSGSSGGGG 77

Query: 71  --------ITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVATFAR 130
                   + RRPRGRP GSKNKPKPP+IITR+SAN LR H++EVT+GCD+ D VAT+AR
Sbjct: 78  GHGGGGDVVGRRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVTNGCDVFDCVATYAR 137

Query: 131 RRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIY 190
           RRQRG+C++SG+GTVTNV++RQP++ GA+V L G FEILSL+GSFLPPPAPP AT+LTI+
Sbjct: 138 RRQRGICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILSLSGSFLPPPAPPGATSLTIF 197

Query: 191 LAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGGIGSP 250
           LAGGQGQVVGGSVVG L A+GPV+++AASF+N AYERLPLEED+Q       GGG  G  
Sbjct: 198 LAGGQGQVVGGSVVGELTAAGPVIVIAASFTNVAYERLPLEEDEQ---QQQLGGGSNG-- 257

Query: 251 DEIGQPQQQQQLVGDGNGPLFQGLPPNLLNSIQMP 276
              G     +   G G G  F  LP N+  ++Q+P
Sbjct: 258 ---GGNLFPEVAAGGGGGLPFFNLPMNMQPNVQLP 278

BLAST of Cp4.1LG13g01990 vs. TrEMBL
Match: A0A0A0KKQ4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G157420 PE=4 SV=1)

HSP 1 Score: 480.7 bits (1236), Expect = 1.2e-132
Identity = 259/304 (85.20%), Positives = 266/304 (87.50%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHLQT-PFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAP 60
           MD+HSLPPPFHTRDFHLQ  PFH N    NNNNSEEEHS TT+RLKRDRDDD   NNS P
Sbjct: 1   MDSHSLPPPFHTRDFHLQQHPFHTNT---NNNNSEEEHSTTTTRLKRDRDDD--TNNSNP 60

Query: 61  NSGGD--PDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT 120
           NS GD  PDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT
Sbjct: 61  NSAGDPTPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT 120

Query: 121 FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL 180
           FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL
Sbjct: 121 FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL 180

Query: 181 TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGGI 240
           TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQP LPS+QGGGGI
Sbjct: 181 TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPQLPSLQGGGGI 240

Query: 241 GSPDEIGQ----------------PQQQQQLVGDGNGPLFQGLPPNLLNSIQMPPSEAAY 286
           GSPDE+GQ                 QQQQQL+ DGN PLF GLPPNLLNSIQMPPSE+ Y
Sbjct: 241 GSPDEVGQSQITAQTAHHQQQQQNQQQQQQLLNDGNAPLFHGLPPNLLNSIQMPPSESPY 299

BLAST of Cp4.1LG13g01990 vs. TrEMBL
Match: B9GKT4_POPTR (DNA-binding family protein OS=Populus trichocarpa GN=POPTR_0001s08190g PE=4 SV=2)

HSP 1 Score: 374.4 bits (960), Expect = 1.2e-100
Identity = 218/296 (73.65%), Positives = 237/296 (80.07%), Query Frame = 1

Query: 4   HSLPPPFHTRDF--HLQTPFHHNNNNHNNNNSEEEHSATTS----RLKRDRDDDHHPNNS 63
           HSLPPPFHTRDF  H Q    H  ++    NSE+E S ++S     LKR+RD+    NNS
Sbjct: 9   HSLPPPFHTRDFQLHHQQQQQHQFHHQQQQNSEDEQSGSSSGLNKSLKRERDES---NNS 68

Query: 64  APN-------SGGDPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCD 123
             N       + GD DGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTH++EV DGCD
Sbjct: 69  MGNREGQELITSGDGDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHLMEVADGCD 128

Query: 124 IVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPA 183
           IV+SVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIV LHGRFEILSLAGSFLPPPA
Sbjct: 129 IVESVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVTLHGRFEILSLAGSFLPPPA 188

Query: 184 PPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPS 243
           PPAAT LTIYLAGGQGQVVGGSVVGTL ASGPVVIMAASFSNAAYERLPLEE+D P +P 
Sbjct: 189 PPAATGLTIYLAGGQGQVVGGSVVGTLTASGPVVIMAASFSNAAYERLPLEEED-PQMP- 248

Query: 244 MQGGGGIGSPDEIGQPQ---QQQQLVGDGNGPLFQGLPPNLLNSIQMPPSEAAYWA 284
           MQ GGG+GSP  +GQ Q   QQ Q++ + N  LF GLPPNLLNSIQ+P    AYWA
Sbjct: 249 MQ-GGGMGSPGGVGQQQQQPQQHQVMAEQNAQLFHGLPPNLLNSIQLPAE--AYWA 296

BLAST of Cp4.1LG13g01990 vs. TrEMBL
Match: W9SN10_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023188 PE=4 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 4.7e-100
Identity = 212/300 (70.67%), Positives = 228/300 (76.00%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHLQTP---FHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNS 60
           MDAHSLPPPFHTRDFH Q     FHH ++  +   S    S    R +   DDD+    +
Sbjct: 1   MDAHSLPPPFHTRDFHFQQQQQQFHHQSSEDDQTGSCSNLSKAPKRDRSGGDDDNDSGGN 60

Query: 61  APNS------GGDPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDI 120
             N       GGD   ++TRRPRGRPAGSKNKPKPPIIITRDSANALRTHV+E+ DGCDI
Sbjct: 61  QSNQDLVIVGGGDDHDKMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIADGCDI 120

Query: 121 VDSVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAP 180
           V+SV+TFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIV LHGRFEILSLAGSFLPPPAP
Sbjct: 121 VESVSTFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVTLHGRFEILSLAGSFLPPPAP 180

Query: 181 PAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSM 240
           PAAT LTIYLAGGQGQVVGGSVVGTLIASGPVV+MAASFSNAAYERLPLEEDDQ  LP  
Sbjct: 181 PAATGLTIYLAGGQGQVVGGSVVGTLIASGPVVVMAASFSNAAYERLPLEEDDQTQLPLQ 240

Query: 241 QGGGGIGSPDEIGQPQQ------QQQLVGDGNGP-LFQGLPPNLLNSIQMPPSEAAYWAN 285
            GGGG+GSP   GQ QQ        +    G  P LF GLPPNLLNSIQMP    AYWA+
Sbjct: 241 GGGGGVGSPTG-GQQQQIMAAAAAAEAANSGGAPTLFHGLPPNLLNSIQMPAE--AYWAS 297

BLAST of Cp4.1LG13g01990 vs. TrEMBL
Match: A1DR80_CATRO (AT-hook DNA-binding protein OS=Catharanthus roseus PE=2 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 8.1e-100
Identity = 214/294 (72.79%), Positives = 237/294 (80.61%), Query Frame = 1

Query: 4   HSLPPPFHTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAPNSGG 63
           HSLPPPF+TRDF+LQ  FHH N+N     SE+E S T S LKRDRD+ +  +    + GG
Sbjct: 3   HSLPPPFNTRDFNLQHQFHHQNHN-----SEDEQSGT-SGLKRDRDEKND-SGDGKDGGG 62

Query: 64  DP-DGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRR 123
           DP  GE+TRRPRGRPAGSKNKPKPPIIITRDSANALRTHV+E+ DGCDI++SVATFARRR
Sbjct: 63  DPGSGEMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIADGCDIMESVATFARRR 122

Query: 124 QRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIYLA 183
           QRGVCIMSG+GTVTNVTLRQPASPGA+V LHGRFEILSLAGSFLPPPAPPAAT+LTIYLA
Sbjct: 123 QRGVCIMSGSGTVTNVTLRQPASPGAVVTLHGRFEILSLAGSFLPPPAPPAATSLTIYLA 182

Query: 184 GGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPL-EEDDQPPLPS-----MQGGGG 243
           GGQGQVVGGSVVG L+ASGPVVIMAASFSNAAYERLPL EE++  PL S       GGGG
Sbjct: 183 GGQGQVVGGSVVGALLASGPVVIMAASFSNAAYERLPLDEEENSIPLQSGGSLGSPGGGG 242

Query: 244 IGSPDEIGQP-----QQQQQLVGDGNGP--LFQGLPPNLLNSIQMPPSEAAYWA 284
            G+    GQ      QQQQQL+G G  P  LF G+PPNLLNSIQ+P    AYWA
Sbjct: 243 GGNGGVPGQQQQQGGQQQQQLLGGGGDPSALFHGMPPNLLNSIQLP--NEAYWA 287

BLAST of Cp4.1LG13g01990 vs. TrEMBL
Match: A0A067JNW3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20733 PE=4 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 1.4e-99
Identity = 212/288 (73.61%), Positives = 231/288 (80.21%), Query Frame = 1

Query: 4   HSLPPPFHTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSRL-------KRDRDDDHHPNN 63
           HSLPPPFHTRDF L   F H+  +  NNNSE+E S ++S         KR+RD+ ++   
Sbjct: 9   HSLPPPFHTRDFQLHHQFPHHQQH--NNNSEDEQSGSSSGAAGLNKSQKRERDEGNNSEG 68

Query: 64  SAPNSGGDPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVA 123
                 G   GEITRRPRGRPAGSKNKPKPPIIITRDSANALRTH++EV DGCDIV+SVA
Sbjct: 69  KELIPTGS-QGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHLMEVADGCDIVESVA 128

Query: 124 TFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATT 183
           TFARRRQRGV IMSGTGTVTNVTLRQPASPGA+V LHGRFEILSLAGSFLPPPAPPAAT 
Sbjct: 129 TFARRRQRGVSIMSGTGTVTNVTLRQPASPGAVVTLHGRFEILSLAGSFLPPPAPPAATG 188

Query: 184 LTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGG 243
           LTIYLAGGQGQVVGGSVVGTL ASGPVVIMAASFSNAAYERLPLEE+D P LP    GGG
Sbjct: 189 LTIYLAGGQGQVVGGSVVGTLTASGPVVIMAASFSNAAYERLPLEEED-PQLPMQ--GGG 248

Query: 244 IGSPDEIGQPQQQ-QQLVGDGNGPLFQGLPPNLLNSIQMPPSEAAYWA 284
           IGSP  +GQ QQQ QQ++G+ N  LF GL PNLLNSIQ+P    AYWA
Sbjct: 249 IGSPGAVGQQQQQPQQVLGEANAQLFHGLQPNLLNSIQLPTE--AYWA 288

BLAST of Cp4.1LG13g01990 vs. TAIR10
Match: AT4G22810.1 (AT4G22810.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 312.8 bits (800), Expect = 2.3e-85
Identity = 184/311 (59.16%), Positives = 223/311 (71.70%), Query Frame = 1

Query: 5   SLPPPFHTRDF--HLQTP-----FHHNNNNHNNNNSEEEHSATTSR-LKRDRDD-----D 64
           SLPPPFH RDF  HLQ        HH+    N  + +++  +  +R +K DR++     D
Sbjct: 12  SLPPPFHARDFQLHLQQQQQEFFLHHHQQQRNQTDGDQQGGSGGNRQIKMDREETSDNID 71

Query: 65  HHPNNSAPNS------GGDPDG--------EITRRPRGRPAGSKNKPKPPIIITRDSANA 124
           +  NNS          GG  +G        ++TRRPRGRPAGSKNKPKPPIIITRDSANA
Sbjct: 72  NIANNSGSEGKDIDIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANA 131

Query: 125 LRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPA---SPGAIVNLHG 184
           LRTHV+E+ DGCD+V+SVATFARRRQRGVC+MSGTG VTNVT+RQP    SPG++V+LHG
Sbjct: 132 LRTHVMEIGDGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHG 191

Query: 185 RFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAA 244
           RFEILSL+GSFLPPPAPP AT L++YLAGGQGQVVGGSVVG L+ +GPVV+MAASFSNAA
Sbjct: 192 RFEILSLSGSFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAA 251

Query: 245 YERLPLEEDD-QPPLPSMQGGGGIGSPDEIGQPQQQQQLVGDGNGPLFQGLPPNLLNSIQ 284
           YERLPLEED+ Q P+    GGG + SP  +GQ  Q QQ    G+    QGLPPNLL S+Q
Sbjct: 252 YERLPLEEDEMQTPVHGGGGGGSLESPPMMGQQLQHQQQAMSGH----QGLPPNLLGSVQ 311

BLAST of Cp4.1LG13g01990 vs. TAIR10
Match: AT2G45430.1 (AT2G45430.1 AT-hook motif nuclear-localized protein 22)

HSP 1 Score: 310.1 bits (793), Expect = 1.5e-84
Identity = 185/309 (59.87%), Positives = 212/309 (68.61%), Query Frame = 1

Query: 3   AHSLPPPFHTRDFHLQT--PFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNN---- 62
           + SLPPPF +RD HL     F H       N+  +        LKRDRD D  PN     
Sbjct: 5   SRSLPPPFLSRDLHLHPHHQFQHQQQQQQQNHGHDIDQHRIGGLKRDRDADIDPNEHSSA 64

Query: 63  ----SAPNSGGDP------DGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVT 122
               S P SGG+       D  ITRRPRGRPAGSKNKPKPPIIITRDSANAL++HV+EV 
Sbjct: 65  GKDQSTPGSGGESGGGGGGDNHITRRPRGRPAGSKNKPKPPIIITRDSANALKSHVMEVA 124

Query: 123 DGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPAS-PG---AIVNLHGRFEILSLA 182
           +GCD+++SV  FARRRQRG+C++SG G VTNVT+RQPAS PG   ++VNLHGRFEILSL+
Sbjct: 125 NGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGRFEILSLS 184

Query: 183 GSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEE 242
           GSFLPPPAPPAA+ LTIYLAGGQGQVVGGSVVG L+ASGPVVIMAASF NAAYERLPLEE
Sbjct: 185 GSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASGPVVIMAASFGNAAYERLPLEE 244

Query: 243 DDQPPLPSMQGGGGIGSPDEIG---------QPQQQQQLVGDGNGPLFQGLPPNLLNSIQ 283
           DDQ    +      I     +G         Q QQQQQL+ D      QGLPPNL+NS+Q
Sbjct: 245 DDQEEQTAGAVANNIDGNATMGGGTQTQTQTQQQQQQQLMQDPTS-FIQGLPPNLMNSVQ 304

BLAST of Cp4.1LG13g01990 vs. TAIR10
Match: AT4G12050.1 (AT4G12050.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 304.3 bits (778), Expect = 8.0e-83
Identity = 188/328 (57.32%), Positives = 221/328 (67.38%), Query Frame = 1

Query: 5   SLPPPFHTRDF--HLQTP----------------FHHNNNNHNNNNSEEEH---SATTSR 64
           SLPPPFH RDF  HLQ                   HH+     N + + E    S     
Sbjct: 12  SLPPPFHARDFQLHLQQQQQHQQQHQQQQQQQFFLHHHQQPQRNLDQDHEQQGGSILNRS 71

Query: 65  LKRDRDD--DHHPNNSAPNSG----------------GDPDGE-ITRRPRGRPAGSKNKP 124
           +K DR++  D+  N +  NSG                G   GE +TRRPRGRPAGSKNKP
Sbjct: 72  IKMDREETSDNMDNIANTNSGSEGKEMSLHGGEGGSGGGGSGEQMTRRPRGRPAGSKNKP 131

Query: 125 KPPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPA 184
           K PIIITRDSANALRTHV+E+ DGCDIVD +ATFARRRQRGVC+MSGTG+VTNVT+RQP 
Sbjct: 132 KAPIIITRDSANALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMSGTGSVTNVTIRQPG 191

Query: 185 SP-GAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPV 244
           SP G++V+LHGRFEILSL+GSFLPPPAPPAAT L++YLAGGQGQVVGGSVVG L+ SGPV
Sbjct: 192 SPPGSVVSLHGRFEILSLSGSFLPPPAPPAATGLSVYLAGGQGQVVGGSVVGPLLCSGPV 251

Query: 245 VIMAASFSNAAYERLPLEEDD-QPPLP----SMQGGGGIGSPDEIGQPQQQQQLVGDGNG 284
           V+MAASFSNAAYERLPLEED+ Q P+        GGGG+GSP  +GQ Q    +      
Sbjct: 252 VVMAASFSNAAYERLPLEEDEMQTPVQGGGGGGGGGGGMGSPPMMGQQQAMAAMAA---- 311

BLAST of Cp4.1LG13g01990 vs. TAIR10
Match: AT3G60870.1 (AT3G60870.1 AT-hook motif nuclear-localized protein 18)

HSP 1 Score: 251.1 bits (640), Expect = 8.1e-67
Identity = 144/258 (55.81%), Positives = 181/258 (70.16%), Query Frame = 1

Query: 30  NNNSEEEHSATTSRLKRDRDDDH-HPNNSAPNSGGDPDGEIT---RRPRGRPAGSKNKPK 89
           +++ +  H     R KR R+++   PNN   +    P GE     RRPRGRPAGSKNKPK
Sbjct: 14  SSDHQHYHHQNAGRQKRGREEEGVEPNNIGEDLATFPSGEENIKKRRPRGRPAGSKNKPK 73

Query: 90  PPIIITRDSANALRTHVIEVTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPAS 149
            PII+TRDSANA R HV+E+T+ CD+++S+A FARRRQRGVC+++G G VTNVT+RQP  
Sbjct: 74  APIIVTRDSANAFRCHVMEITNACDVMESLAVFARRRQRGVCVLTGNGAVTNVTVRQPG- 133

Query: 150 PGAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVI 209
            G +V+LHGRFEILSL+GSFLPPPAPPAA+ L +YLAGGQGQV+GGSVVG L AS PVV+
Sbjct: 134 -GGVVSLHGRFEILSLSGSFLPPPAPPAASGLKVYLAGGQGQVIGGSVVGPLTASSPVVV 193

Query: 210 MAASFSNAAYERLPLEEDDQPPLP-SMQGGGGIGSPDEIGQPQQQQQLVGDGNGPLFQGL 269
           MAASF NA+YERLPLEE+++            IG+       Q Q+QL+ D     F G 
Sbjct: 194 MAASFGNASYERLPLEEEEETEREIDGNAARAIGT-------QTQKQLMQDATS--FIGS 253

Query: 270 PPNLLNSIQMPPSEAAYW 283
           P NL+NS+ +P    AYW
Sbjct: 254 PSNLINSVSLPGE--AYW 258

BLAST of Cp4.1LG13g01990 vs. TAIR10
Match: AT4G17800.1 (AT4G17800.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 245.4 bits (625), Expect = 4.4e-65
Identity = 144/275 (52.36%), Positives = 183/275 (66.55%), Query Frame = 1

Query: 11  HTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAPNSGGDPDGE-- 70
           H  D HL    HHN+++ +          T      D +++H   + A   G    G   
Sbjct: 18  HRPDLHL----HHNSSSDDVTPGAGMGHFTVD--DEDNNNNHQGLDLASGGGSGSSGGGG 77

Query: 71  --------ITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVATFAR 130
                   + RRPRGRP GSKNKPKPP+IITR+SAN LR H++EVT+GCD+ D VAT+AR
Sbjct: 78  GHGGGGDVVGRRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVTNGCDVFDCVATYAR 137

Query: 131 RRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTLTIY 190
           RRQRG+C++SG+GTVTNV++RQP++ GA+V L G FEILSL+GSFLPPPAPP AT+LTI+
Sbjct: 138 RRQRGICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILSLSGSFLPPPAPPGATSLTIF 197

Query: 191 LAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGGIGSP 250
           LAGGQGQVVGGSVVG L A+GPV+++AASF+N AYERLPLEED+Q       GGG  G  
Sbjct: 198 LAGGQGQVVGGSVVGELTAAGPVIVIAASFTNVAYERLPLEEDEQ---QQQLGGGSNG-- 257

Query: 251 DEIGQPQQQQQLVGDGNGPLFQGLPPNLLNSIQMP 276
              G     +   G G G  F  LP N+  ++Q+P
Sbjct: 258 ---GGNLFPEVAAGGGGGLPFFNLPMNMQPNVQLP 278

BLAST of Cp4.1LG13g01990 vs. NCBI nr
Match: gi|659073628|ref|XP_008437165.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 483.4 bits (1243), Expect = 2.7e-133
Identity = 259/304 (85.20%), Positives = 267/304 (87.83%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHLQT-PFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAP 60
           MD+HSLPPPFHTRDFHLQ  PFH N    NNNNSEEEHS TT+RLKRDRDDD   NNS P
Sbjct: 1   MDSHSLPPPFHTRDFHLQQHPFHTNT---NNNNSEEEHSTTTTRLKRDRDDD--TNNSNP 60

Query: 61  NSGGD--PDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT 120
           NS GD  PDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT
Sbjct: 61  NSAGDPTPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT 120

Query: 121 FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL 180
           FARRRQRGVCIMSGTGTVTNVTLRQPASPGAI+NLHGRFEILSLAGSFLPPPAPPAATTL
Sbjct: 121 FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIINLHGRFEILSLAGSFLPPPAPPAATTL 180

Query: 181 TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGGI 240
           TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPS+QGGGGI
Sbjct: 181 TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSLQGGGGI 240

Query: 241 GSPDEIGQ----------------PQQQQQLVGDGNGPLFQGLPPNLLNSIQMPPSEAAY 286
           GSPDE+GQ                 QQQQQL+ DGN PLF GLPPNLLNSIQMPPSE+ Y
Sbjct: 241 GSPDEVGQSQITAQTAHHQQQQQNQQQQQQLLNDGNAPLFHGLPPNLLNSIQMPPSESPY 299

BLAST of Cp4.1LG13g01990 vs. NCBI nr
Match: gi|778699867|ref|XP_004147565.2| (PREDICTED: AT-hook motif nuclear-localized protein 22 [Cucumis sativus])

HSP 1 Score: 480.7 bits (1236), Expect = 1.8e-132
Identity = 259/304 (85.20%), Positives = 266/304 (87.50%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHLQT-PFHHNNNNHNNNNSEEEHSATTSRLKRDRDDDHHPNNSAP 60
           MD+HSLPPPFHTRDFHLQ  PFH N    NNNNSEEEHS TT+RLKRDRDDD   NNS P
Sbjct: 1   MDSHSLPPPFHTRDFHLQQHPFHTNT---NNNNSEEEHSTTTTRLKRDRDDD--TNNSNP 60

Query: 61  NSGGD--PDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT 120
           NS GD  PDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT
Sbjct: 61  NSAGDPTPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVTDGCDIVDSVAT 120

Query: 121 FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL 180
           FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL
Sbjct: 121 FARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFLPPPAPPAATTL 180

Query: 181 TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPPLPSMQGGGGI 240
           TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQP LPS+QGGGGI
Sbjct: 181 TIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQPQLPSLQGGGGI 240

Query: 241 GSPDEIGQ----------------PQQQQQLVGDGNGPLFQGLPPNLLNSIQMPPSEAAY 286
           GSPDE+GQ                 QQQQQL+ DGN PLF GLPPNLLNSIQMPPSE+ Y
Sbjct: 241 GSPDEVGQSQITAQTAHHQQQQQNQQQQQQLLNDGNAPLFHGLPPNLLNSIQMPPSESPY 299

BLAST of Cp4.1LG13g01990 vs. NCBI nr
Match: gi|1009169690|ref|XP_015865801.1| (PREDICTED: AT-hook motif nuclear-localized protein 24-like [Ziziphus jujuba])

HSP 1 Score: 387.9 bits (995), Expect = 1.6e-104
Identity = 224/311 (72.03%), Positives = 242/311 (77.81%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHL---QTPFHHNNNNHNNNNSEEEHSATTSRL----KRDRDDDHH 60
           MDAHSLPPPFHTRDFHL   Q PFHH        NSE+E + ++  +    KR+RDD   
Sbjct: 1   MDAHSLPPPFHTRDFHLHHQQQPFHHQQQQ----NSEDEQTGSSGLINKAQKRERDD--- 60

Query: 61  PNNSAPNSGGDP---------DGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIE 120
            NN + N  GD          +GE+TRRPRGRPAGSKNKPKPPIIITRDSANALRTHV+E
Sbjct: 61  -NNDSGNGNGDDTDLAVGAGGEGEMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVME 120

Query: 121 VTDGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGS 180
           + DGCDIV+SV+TFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIV LHGRFEILSL+GS
Sbjct: 121 IADGCDIVESVSTFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVTLHGRFEILSLSGS 180

Query: 181 FLPPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDD 240
           FLPPPAPPAAT LTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPL EDD
Sbjct: 181 FLPPPAPPAATGLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPL-EDD 240

Query: 241 QPPLPSMQGGGGIGSP--------DEIGQPQQQQQLVGDGNG----PLFQGLPPNLLNSI 284
           + PLP MQGGG IGSP         +    QQQ QL+G G+G    PLF GLPPNLLNSI
Sbjct: 241 EAPLP-MQGGGSIGSPPGGSGVGHQQQAAQQQQHQLLGGGDGNTSAPLFHGLPPNLLNSI 299

BLAST of Cp4.1LG13g01990 vs. NCBI nr
Match: gi|657964492|ref|XP_008373872.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Malus domestica])

HSP 1 Score: 375.2 bits (962), Expect = 1.1e-100
Identity = 217/308 (70.45%), Positives = 238/308 (77.27%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSR---LKRDRDDDHHPNNS 60
           MD HSLPPPFHTRDFHL     H   +H   NSE+E + ++      KR+RD D+  N+S
Sbjct: 1   MDTHSLPPPFHTRDFHLHHQQQHPQFHHQQQNSEDEQTGSSGLNKGQKRERDIDN--NDS 60

Query: 61  APNSG-----------GDPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVT 120
             N G           G    E+TRRPRGRPAGSKNKPKPPIIITRDSANALRTHV+E+ 
Sbjct: 61  GGNGGELGKELNVTMSGADGSEMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIA 120

Query: 121 DGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFL 180
           DGCDIV+SVATFARRRQRGVCIMSGTGTVTNVTLRQPASPG+IV LHGRFEILSLAGSFL
Sbjct: 121 DGCDIVESVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGSIVTLHGRFEILSLAGSFL 180

Query: 181 PPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQP 240
           PPPAPPAAT LTIYLAGGQGQVVGGSVVGTL ASGPVVIMAASFSNAAYERLPLEED+  
Sbjct: 181 PPPAPPAATGLTIYLAGGQGQVVGGSVVGTLXASGPVVIMAASFSNAAYERLPLEEDEGQ 240

Query: 241 PLPSMQGGGG-IGSPDEIG-------QPQQQQQLVGDG---NGPLFQGLPPNLLNSIQMP 284
            LP   GGGG +GSP  +G       Q QQQQQL+ +    N PLF GLPPNLLNS+Q+ 
Sbjct: 241 -LPMQGGGGGSLGSPSGVGHQNQQQQQQQQQQQLMAEAANSNPPLFHGLPPNLLNSMQL- 300

BLAST of Cp4.1LG13g01990 vs. NCBI nr
Match: gi|658022364|ref|XP_008346594.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Malus domestica])

HSP 1 Score: 374.8 bits (961), Expect = 1.4e-100
Identity = 217/308 (70.45%), Positives = 237/308 (76.95%), Query Frame = 1

Query: 1   MDAHSLPPPFHTRDFHLQTPFHHNNNNHNNNNSEEEHSATTSR---LKRDRDDDHHPNNS 60
           MD HSLPPPFHTRDFHL     H    H   NSE+E + ++      KR+RD D+  N+S
Sbjct: 1   MDTHSLPPPFHTRDFHLHHQQQHPQFLHQQQNSEDEQTGSSGLNKGQKRERDXDN--NDS 60

Query: 61  APNSG-----------GDPDGEITRRPRGRPAGSKNKPKPPIIITRDSANALRTHVIEVT 120
             N G           G    E+TRRPRGRPAGSKNKPKPPIIITRDSANALRTHV+E+ 
Sbjct: 61  GGNGGELGKELNVTMSGXDGSEMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIA 120

Query: 121 DGCDIVDSVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGAIVNLHGRFEILSLAGSFL 180
           DGCDIV+SVATFARRRQRGVCIMSGTGTVTNVTLRQPASPG+IV LHGRFEILSLAGSFL
Sbjct: 121 DGCDIVESVATFARRRQRGVCIMSGTGTVTNVTLRQPASPGSIVTLHGRFEILSLAGSFL 180

Query: 181 PPPAPPAATTLTIYLAGGQGQVVGGSVVGTLIASGPVVIMAASFSNAAYERLPLEEDDQP 240
           PPPAPPAAT LTIYLAGGQGQVVGGSVVGTL ASGPVVIMAASFSNAAYERLPLEED+  
Sbjct: 181 PPPAPPAATGLTIYLAGGQGQVVGGSVVGTLTASGPVVIMAASFSNAAYERLPLEEDEGQ 240

Query: 241 PLPSMQGGGG-IGSPDEIG-------QPQQQQQLVGDG---NGPLFQGLPPNLLNSIQMP 284
            LP   GGGG +GSP  +G       Q QQQQQL+ +    N PLF GLPPNLLNS+Q+ 
Sbjct: 241 -LPMQGGGGGSLGSPSGVGHQNQQQQQQQQQQQLMAEAANSNPPLFHGLPPNLLNSMQL- 300

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value   Identity  Description
AHL24_ARATH   4.0e-84   59.16     AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2...
AHL22_ARATH   2.6e-83   59.87     AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1...
AHL26_ARATH   1.4e-81   57.32     AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2...
AHL18_ARATH   1.4e-65   55.81     AT-hook motif nuclear-localized protein 18 OS=Arabidopsis thaliana GN=AHL18 PE=2...
AHL23_ARATH   7.9e-64   52.36     AT-hook motif nuclear-localized protein 23 OS=Arabidopsis thaliana GN=AHL23 PE=1...

vs. TrEMBL
Match Name         E-value    Identity  Description
A0A0A0KKQ4_CUCSA   1.2e-132   85.20     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G157420 PE=4 SV=1
B9GKT4_POPTR       1.2e-100   73.65     DNA-binding family protein OS=Populus trichocarpa GN=POPTR_0001s08190g PE=4 SV=2
W9SN10_9ROSA       4.7e-100   70.67     Uncharacterized protein OS=Morus notabilis GN=L484_023188 PE=4 SV=1
A1DR80_CATRO       8.1e-100   72.79     AT-hook DNA-binding protein OS=Catharanthus roseus PE=2 SV=1
A0A067JNW3_JATCU   1.4e-99    73.61     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_20733 PE=4 SV=1

vs. TAIR10
Match Name    E-value   Identity  Description
AT4G22810.1   2.3e-85   59.16     Predicted AT-hook DNA-binding family protein
AT2G45430.1   1.5e-84   59.87     AT-hook motif nuclear-localized protein 22
AT4G12050.1   8.0e-83   57.32     Predicted AT-hook DNA-binding family protein
AT3G60870.1   8.1e-67   55.81     AT-hook motif nuclear-localized protein 18
AT4G17800.1   4.4e-65   52.36     Predicted AT-hook DNA-binding family protein

vs. NCBI nr
Match Name                          E-value    Identity  Description
gi|659073628|ref|XP_008437165.1|    2.7e-133   85.20     PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo]
gi|778699867|ref|XP_004147565.2|    1.8e-132   85.20     PREDICTED: AT-hook motif nuclear-localized protein 22 [Cucumis sativus]
gi|1009169690|ref|XP_015865801.1|   1.6e-104   72.03     PREDICTED: AT-hook motif nuclear-localized protein 24-like [Ziziphus jujuba]
gi|657964492|ref|XP_008373872.1|    1.1e-100   70.45     PREDICTED: putative DNA-binding protein ESCAROLA [Malus domestica]
gi|658022364|ref|XP_008346594.1|    1.4e-100   70.45     PREDICTED: putative DNA-binding protein ESCAROLA [Malus domestica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR014476   AT-hook_nuclear
IPR005175   PPC_dom

GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0006355       regulation of transcription, DNA-templated
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0005634       nucleus
molecular_function   GO:0003674       molecular_function
molecular_function   GO:0003680       AT DNA binding
molecular_function   GO:0003700       transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG13g01990.1   Cp4.1LG13g01990.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                          Source   Source Term         Source Description                                  Alignment
IPR005175  PPC domain                               PFAM     PF03479             DUF296                                              coord: 99..211; score: 5.3
IPR005175  PPC domain                               PROFILE  PS51742             PPC                                                 coord: 95..231; score: 40
IPR014476  AT-hook motif nuclear-localised protein  PIR      PIRSF016021         ESCAROLA                                            coord: 11..288; score: 1.2E
None       No IPR available                         GENE3D   G3DSA:3.30.1330.80  -                                                   coord: 98..224; score: 5.1
None       No IPR available                         PANTHER  PTHR31100           FAMILY NOT NAMED                                    coord: 4..283; score: 6.8E
None       No IPR available                         PANTHER  PTHR31100:SF15      AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 24-RELATED  coord: 4..283; score: 6.8E
None       No IPR available                         unknown  SSF117856           AF0104/ALDC/Ptd012-like                             coord: 95..226; score: 3.14
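
The `coord` ranges in the InterPro annotations refer to positions in the protein sequence. A minimal sketch for turning such a range into a domain length and a subsequence is given below. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly, and the helper names `domain_length` and `domain_slice` are hypothetical.

```python
def domain_length(start, end):
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1

def domain_slice(protein, start, end):
    """Return residues start..end (1-based, inclusive) of a protein."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]

# First 20 residues of the protein sequence in this record.
prefix = "MDAHSLPPPFHTRDFHLQTP"
print(domain_slice(prefix, 11, 15))  # prints: HTRDF
print(domain_length(95, 231))        # PROFILE PPC hit; prints: 137
```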

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG13g01990   Cp4.1LG05g02610   Cucurbita pepo (Zucchini)   cpecpeB223