Cp4.1LG02g05220 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG02g05220
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionAT hook motif DNA-binding family protein
LocationCp4.1LG02 : 1843698 .. 1844558 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTCCCAAAACCCCACCATCAACCACCGCCCCATTTCCAGCCCTTTCCCCGCTCCTTCGCCGTCGCGGCCACTCCGCCACCGCGCGATTCTTCCACCGCCGCCGATGACGATTCCACCACCGGCCACTCGCCTCTCACGCACGCTATGCTCCCCAAAGAACCGGCCTCCGGCGGCGACGGAGCTTCTATCGAGGTCGTTCGCCGTCCCCGTGGTCGACCTCCGGGATCCAAGAACAAGCCGAAGCCAGCCGCCGTTGTCGTCGCTGCACGCGACGCCGAGCCGGCGATGAGTCCTTATGTGCTCGAAGTTCCTGGCGGCAGTGATATCGTCGAGGCTATTTCCCGATTCTGCCGCCGCAGAAACACAGGGCTCTGTATTCTCAACGCTTATGGAACCGTCGCCGATGTCACACTCCGGCAGCCGTCCTCATCTCCGGTGGCGACCGTCACTTTCCATGGACGATTCGATATCCTTTCCGTCTGCGCTACTTTTGTTCCTCATACGACGTCGTTTCCAATCCCTAACGGATTCACCATCACGCTAGCAGGGCCGCAGGGCCAGATCTTCGGCGGTTTGGTTGCCGGAGCTTTAATCGGCGCCGGAACCGTCTTTGTAATAGCCGCCTCATTCAACAATCCCTCCTATCAACGGCTTCCATCAGAGGACGAGGTCAGAAAATCAGGCTTCAGCGATGTCGAAGAAGGTCACTCTCCACCGATCTCTGGTGGAAAAGATAGCGAAAACACCAACGCCGCAGCCGATACCTGCGGCCTGCTGCCGATGTACAGTACTCACTCTTCCTCCGACGCAATATGGTCTCGACAACCAGCGGCGCCGTCGCACCATCGCCAGTACTAA

mRNA sequence

ATGTTCCCAAAACCCCACCATCAACCACCGCCCCATTTCCAGCCCTTTCCCCGCTCCTTCGCCGTCGCGGCCACTCCGCCACCGCGCGATTCTTCCACCGCCGCCGATGACGATTCCACCACCGGCCACTCGCCTCTCACGCACGCTATGCTCCCCAAAGAACCGGCCTCCGGCGGCGACGGAGCTTCTATCGAGGTCGTTCGCCGTCCCCGTGGTCGACCTCCGGGATCCAAGAACAAGCCGAAGCCAGCCGCCGTTGTCGTCGCTGCACGCGACGCCGAGCCGGCGATGAGTCCTTATGTGCTCGAAGTTCCTGGCGGCAGTGATATCGTCGAGGCTATTTCCCGATTCTGCCGCCGCAGAAACACAGGGCTCTGTATTCTCAACGCTTATGGAACCGTCGCCGATGTCACACTCCGGCAGCCGTCCTCATCTCCGGTGGCGACCGTCACTTTCCATGGACGATTCGATATCCTTTCCGTCTGCGCTACTTTTGTTCCTCATACGACGTCGTTTCCAATCCCTAACGGATTCACCATCACGCTAGCAGGGCCGCAGGGCCAGATCTTCGGCGGTTTGGTTGCCGGAGCTTTAATCGGCGCCGGAACCGTCTTTGTAATAGCCGCCTCATTCAACAATCCCTCCTATCAACGGCTTCCATCAGAGGACGAGGTCAGAAAATCAGGCTTCAGCGATGTCGAAGAAGGTCACTCTCCACCGATCTCTGGTGGAAAAGATAGCGAAAACACCAACGCCGCAGCCGATACCTGCGGCCTGCTGCCGATGTACAGTACTCACTCTTCCTCCGACGCAATATGGTCTCGACAACCAGCGGCGCCGTCGCACCATCGCCAGTACTAA

Coding sequence (CDS)

ATGTTCCCAAAACCCCACCATCAACCACCGCCCCATTTCCAGCCCTTTCCCCGCTCCTTCGCCGTCGCGGCCACTCCGCCACCGCGCGATTCTTCCACCGCCGCCGATGACGATTCCACCACCGGCCACTCGCCTCTCACGCACGCTATGCTCCCCAAAGAACCGGCCTCCGGCGGCGACGGAGCTTCTATCGAGGTCGTTCGCCGTCCCCGTGGTCGACCTCCGGGATCCAAGAACAAGCCGAAGCCAGCCGCCGTTGTCGTCGCTGCACGCGACGCCGAGCCGGCGATGAGTCCTTATGTGCTCGAAGTTCCTGGCGGCAGTGATATCGTCGAGGCTATTTCCCGATTCTGCCGCCGCAGAAACACAGGGCTCTGTATTCTCAACGCTTATGGAACCGTCGCCGATGTCACACTCCGGCAGCCGTCCTCATCTCCGGTGGCGACCGTCACTTTCCATGGACGATTCGATATCCTTTCCGTCTGCGCTACTTTTGTTCCTCATACGACGTCGTTTCCAATCCCTAACGGATTCACCATCACGCTAGCAGGGCCGCAGGGCCAGATCTTCGGCGGTTTGGTTGCCGGAGCTTTAATCGGCGCCGGAACCGTCTTTGTAATAGCCGCCTCATTCAACAATCCCTCCTATCAACGGCTTCCATCAGAGGACGAGGTCAGAAAATCAGGCTTCAGCGATGTCGAAGAAGGTCACTCTCCACCGATCTCTGGTGGAAAAGATAGCGAAAACACCAACGCCGCAGCCGATACCTGCGGCCTGCTGCCGATGTACAGTACTCACTCTTCCTCCGACGCAATATGGTCTCGACAACCAGCGGCGCCGTCGCACCATCGCCAGTACTAA

Protein sequence

MFPKPHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAMLPKEPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGHSPPISGGKDSENTNAAADTCGLLPMYSTHSSSDAIWSRQPAAPSHHRQY
BLAST of Cp4.1LG02g05220 vs. Swiss-Prot
Match: AHL17_ARATH (AT-hook motif nuclear-localized protein 17 OS=Arabidopsis thaliana GN=AHL17 PE=2 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 6.4e-66
Identity = 137/246 (55.69%), Postives = 175/246 (71.14%), Query Frame = 1

Query: 43  HSPLTHAMLPKEPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVL 102
           HS  +H  L        D +SIEVVRRPRGRPPGSKNKPKP   V   RD +P MSPY+L
Sbjct: 31  HSLTSHFHLSSTVTPTVDDSSIEVVRRPRGRPPGSKNKPKPPVFVT--RDTDPPMSPYIL 90

Query: 103 EVPGGSDIVEAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPV-ATVTFHGRFDILSV 162
           EVP G+D+VEAI+RFCRR++ G+C+L+  G+VA+VTLRQPS + + +T+TFHG+FD+LSV
Sbjct: 91  EVPSGNDVVEAINRFCRRKSIGVCVLSGSGSVANVTLRQPSPAALGSTITFHGKFDLLSV 150

Query: 163 CATFVPH----TTSFPIPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQ 222
            ATF+P     + S P+ N FT++LAGPQGQI GG VAG LI AGTV+VIAASFNNPSY 
Sbjct: 151 SATFLPPPPRTSLSPPVSNFFTVSLAGPQGQIIGGFVAGPLISAGTVYVIAASFNNPSYH 210

Query: 223 RLPSEDEVRKSGFSDVEEGHSPPISGG--KDSENTNAAADTCGLLPMYSTH-SSSDAIWS 281
           RLP+E+E + S  +   EG SPP+SGG  +  +   +  ++CG + MYS H   SD IW+
Sbjct: 211 RLPAEEEQKHSAGTGEREGQSPPVSGGGEESGQMAGSGGESCG-VSMYSCHMGGSDVIWA 270

BLAST of Cp4.1LG02g05220 vs. Swiss-Prot
Match: AHL28_ARATH (AT-hook motif nuclear-localized protein 28 OS=Arabidopsis thaliana GN=AHL28 PE=2 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 8.1e-53
Identity = 114/223 (51.12%), Postives = 147/223 (65.92%), Query Frame = 1

Query: 64  IEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRRRNT 123
           +E V RPRGRP GSKNKPK    V      +P MSPY+LEVP G+D+VEA++RFCR +  
Sbjct: 1   METVGRPRGRPRGSKNKPKAPIFVTI----DPPMSPYILEVPSGNDVVEALNRFCRGKAI 60

Query: 124 GLCILNAYGTVADVTLRQPS-SSPVATVTFHGRFDILSVCATFVPH----TTSFPIPNGF 183
           G C+L+  G+VADVTLRQPS ++P +T+TFHG+FD+LSV ATF+P     + S P+ N F
Sbjct: 61  GFCVLSGSGSVADVTLRQPSPAAPGSTITFHGKFDLLSVSATFLPPLPPTSLSPPVSNFF 120

Query: 184 TITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGHS 243
           T++LAGPQG++ GG VAG L+ AGTV+ +A SF NPSY RLP+ +E +++     EEG S
Sbjct: 121 TVSLAGPQGKVIGGFVAGPLVAAGTVYFVATSFKNPSYHRLPATEEEQRNSAEGEEEGQS 180

Query: 244 PPISGGKDSENTNAAADTCGLLPMYSTHSSSDAIWSRQPAAPS 282
           PP+SGG             G   MY     SD IW     APS
Sbjct: 181 PPVSGG-------------GGESMYV--GGSDVIWDPNAKAPS 204

BLAST of Cp4.1LG02g05220 vs. Swiss-Prot
Match: AHL26_ARATH (AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.6e-37
Identity = 91/215 (42.33%), Postives = 133/215 (61.86%), Query Frame = 1

Query: 54  EPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEA 113
           E  SGG G+  ++ RRPRGRP GSKNKPK  A ++  RD+  A+  +V+E+  G DIV+ 
Sbjct: 104 EGGSGGGGSGEQMTRRPRGRPAGSKNKPK--APIIITRDSANALRTHVMEIGDGCDIVDC 163

Query: 114 ISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFP 173
           ++ F RRR  G+C+++  G+V +VT+RQP S P + V+ HGRF+ILS+  +F+P     P
Sbjct: 164 MATFARRRQRGVCVMSGTGSVTNVTIRQPGSPPGSVVSLHGRFEILSLSGSFLPPPAP-P 223

Query: 174 IPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDV 233
              G ++ LAG QGQ+ GG V G L+ +G V V+AASF+N +Y+RLP E++  ++     
Sbjct: 224 AATGLSVYLAGGQGQVVGGSVVGPLLCSGPVVVMAASFSNAAYERLPLEEDEMQTPVQGG 283

Query: 234 EEG-------HSPPISGGKDSENTNAAADTCGLLP 262
             G        SPP+ G + +    AAA   GL P
Sbjct: 284 GGGGGGGGGMGSPPMMGQQQAMAAMAAAQ--GLPP 313

BLAST of Cp4.1LG02g05220 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.6e-37
Identity = 89/209 (42.58%), Postives = 129/209 (61.72%), Query Frame = 1

Query: 58  GGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRF 117
           GG G   ++ RRPRGRP GSKNKPKP  ++   RD+  A+  +V+E+  G D+VE+++ F
Sbjct: 95  GGSGGDHQMTRRPRGRPAGSKNKPKPPIIIT--RDSANALRTHVMEIGDGCDLVESVATF 154

Query: 118 CRRRNTGLCILNAYGTVADVTLRQPSS--SPVATVTFHGRFDILSVCATFVPHTTSFPIP 177
            RRR  G+C+++  G V +VT+RQP S  SP + V+ HGRF+ILS+  +F+P     P  
Sbjct: 155 ARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLPPPAP-PTA 214

Query: 178 NGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEE 237
            G ++ LAG QGQ+ GG V G L+ AG V V+AASF+N +Y+RLP E++  ++       
Sbjct: 215 TGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLEEDEMQTPVHGGGG 274

Query: 238 G---HSPPISGGKDSENTNAAADTCGLLP 262
           G    SPP+ G +      A +   GL P
Sbjct: 275 GGSLESPPMMGQQLQHQQQAMSGHQGLPP 300

BLAST of Cp4.1LG02g05220 vs. Swiss-Prot
Match: AHL22_ARATH (AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.6e-37
Identity = 87/200 (43.50%), Postives = 123/200 (61.50%), Query Frame = 1

Query: 28  PRDSSTAADDDSTTGHSPLTHAMLPKEPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVV 87
           P + S+A  D ST G             + GG G    + RRPRGRP GSKNKPKP  ++
Sbjct: 58  PNEHSSAGKDQSTPGSGG---------ESGGGGGGDNHITRRPRGRPAGSKNKPKPPIII 117

Query: 88  VAARDAEPAMSPYVLEVPGGSDIVEAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSP- 147
              RD+  A+  +V+EV  G D++E+++ F RRR  G+C+L+  G V +VT+RQP+S P 
Sbjct: 118 T--RDSANALKSHVMEVANGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPG 177

Query: 148 --VATVTFHGRFDILSVCATFVPHTTSFPIPNGFTITLAGPQGQIFGGLVAGALIGAGTV 207
              + V  HGRF+ILS+  +F+P     P  +G TI LAG QGQ+ GG V G L+ +G V
Sbjct: 178 GGSSVVNLHGRFEILSLSGSFLPPPAP-PAASGLTIYLAGGQGQVVGGSVVGPLMASGPV 237

Query: 208 FVIAASFNNPSYQRLPSEDE 225
            ++AASF N +Y+RLP E++
Sbjct: 238 VIMAASFGNAAYERLPLEED 245

BLAST of Cp4.1LG02g05220 vs. TrEMBL
Match: A0A0A0LJP1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G890120 PE=4 SV=1)

HSP 1 Score: 465.7 bits (1197), Expect = 4.1e-128
Identity = 247/291 (84.88%), Postives = 254/291 (87.29%), Query Frame = 1

Query: 1   MFPKPHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAMLPKEPASGGD 60
           MFPKPHHQPP H QPFPRSF V ATPPPRDS    DDDS    SPLTHA++ KEP S  D
Sbjct: 1   MFPKPHHQPPSHSQPFPRSFPVTATPPPRDSP---DDDSPAAPSPLTHAIVLKEPPSSAD 60

Query: 61  GASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRR 120
           GASIEV RRPRGRPPGSKNKPKPAAVVVA RDAEP MSPYVLEVPGGSDIVEAISRFCRR
Sbjct: 61  GASIEVARRPRGRPPGSKNKPKPAAVVVANRDAEPPMSPYVLEVPGGSDIVEAISRFCRR 120

Query: 121 RNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIPNGFTI 180
           RNTGLCILNAYGTV DVTLRQP+SSPV TVTFHGRFDILSVCATFVP TTSFPIPNGFTI
Sbjct: 121 RNTGLCILNAYGTVGDVTLRQPASSPVGTVTFHGRFDILSVCATFVPQTTSFPIPNGFTI 180

Query: 181 TLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGHSPP 240
           TLAGPQGQIFGGLVAG+LIG GTV+VIAASFNNPSYQRLPSEDEVRK  FSDVEEGHS P
Sbjct: 181 TLAGPQGQIFGGLVAGSLIGVGTVYVIAASFNNPSYQRLPSEDEVRKLTFSDVEEGHS-P 240

Query: 241 ISGGKDSENTNA--AADTCGLLPMYSTHSSSDAIW---SRQPAAPSHHRQY 287
           ISGGKDSENT A  A +TCGLLPMYSTHSSSD IW   +RQPA   HHRQY
Sbjct: 241 ISGGKDSENTAAGGAQETCGLLPMYSTHSSSDVIWTPAARQPA--HHHRQY 285

BLAST of Cp4.1LG02g05220 vs. TrEMBL
Match: B9GN33_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s10520g PE=4 SV=2)

HSP 1 Score: 289.3 bits (739), Expect = 5.2e-75
Identity = 162/293 (55.29%), Postives = 206/293 (70.31%), Query Frame = 1

Query: 1   MFPKPHHQPPPHFQPFPRSFAVAATPPPRDS-STAADDDSTTGHSPLTHAMLPKEPASGG 60
           MF K H +   H  PF + +  +      D+ ST A   +T   +P T      EP S G
Sbjct: 19  MFSKLHPRHHQHL-PFSQQYQFSRESEEEDTRSTGAA--ATPNLTPTTQKQKLNEPNSSG 78

Query: 61  --DGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRF 120
             DGA+IEVVRRPRGRPPGSKNKPKP  ++   R++EP+MSPY+LEVPGG+D+VEA+SRF
Sbjct: 79  GTDGATIEVVRRPRGRPPGSKNKPKPPVIIT--RESEPSMSPYILEVPGGNDVVEALSRF 138

Query: 121 CRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIPNG 180
           CRR+N G+C+L   GTVA+VTLRQPS++P AT+TFHGRFDILS+ ATF+P T S+P+PN 
Sbjct: 139 CRRKNMGICVLTGSGTVANVTLRQPSATPGATITFHGRFDILSISATFLPQTASYPVPNS 198

Query: 181 FTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGH 240
           FTI+LAGPQGQI GG+VAG+L+ AGTVFV+AASFNNPSY RLP E+E R SG     EG 
Sbjct: 199 FTISLAGPQGQIVGGIVAGSLVAAGTVFVVAASFNNPSYHRLPLEEEGRTSGSDGGGEGQ 258

Query: 241 SPPISGGKDSENTNAAA-----DTCGLLPMYSTHSSSDAIW---SRQPAAPSH 283
           SP +SG    E+ +AA+     ++CG + MYS H  +D IW   +R P  P +
Sbjct: 259 SPAVSGAGGGESGHAASGGGGGESCG-IAMYSCHMPNDVIWAPAARPPPPPPY 305

BLAST of Cp4.1LG02g05220 vs. TrEMBL
Match: B9RJW1_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1039230 PE=4 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 1.7e-73
Identity = 162/291 (55.67%), Postives = 197/291 (67.70%), Query Frame = 1

Query: 1   MFPKPHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAML------PKE 60
           MF K HH    H  P P S     +    D  T +   +TT  +  T          P E
Sbjct: 23  MFSKLHH----HHHPLPFSTHFQLSRDSEDDDTRSTGAATTAITTATATATTTTPTRPTE 82

Query: 61  P--ASGG-DGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIV 120
           P  +SGG DGA+IEVVRRPRGRPPGSKNKPKP  ++   RD EPAMSPY+LEV GGSD+V
Sbjct: 83  PPNSSGGTDGATIEVVRRPRGRPPGSKNKPKPPVIIT--RDPEPAMSPYILEVCGGSDVV 142

Query: 121 EAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTS 180
           EAISRFCRR+N G+C+L   GTVA+VTLRQPS++P +T+TFHGRFDILS+ ATF+P T S
Sbjct: 143 EAISRFCRRKNIGICVLTGSGTVANVTLRQPSTTPGSTITFHGRFDILSISATFMPQTVS 202

Query: 181 FPIPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGF- 240
           +P+PN FTI+LAGPQGQI GGLVAG+LI AGTV+++AA+FNNPSY RLP +DE R SG  
Sbjct: 203 YPVPNTFTISLAGPQGQIVGGLVAGSLIAAGTVYIMAATFNNPSYHRLPVDDEGRNSGSG 262

Query: 241 SDVEEGHSPPISG-GKDSENTNAAADTCGLLPMYSTHSSSDAIWSRQPAAP 281
               EG SP +SG G   ++     D+ G + MYS H  SD IW+     P
Sbjct: 263 GGGGEGQSPAVSGAGGGGDSGGGGGDSGGGMVMYSGHLPSDVIWAPTARPP 307

BLAST of Cp4.1LG02g05220 vs. TrEMBL
Match: M5XN52_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022234mg PE=4 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 2.2e-73
Identity = 167/288 (57.99%), Postives = 204/288 (70.83%), Query Frame = 1

Query: 2   FPKPHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAML-PKEPASGGD 61
           FPKP  +P P      ++      P    S TA    + +  +P + A   P  P++  D
Sbjct: 14  FPKPIPKPSPTNHRECQTSEEERHPAATSSGTATVTTNPSAQNPKSSAAADPSNPSA--D 73

Query: 62  GASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRR 121
           GA+IEV+RRPRGRPPGSKNKPKP  ++   RD+EP MSPY+LEVPGGSDIVEA+SRFC R
Sbjct: 74  GATIEVIRRPRGRPPGSKNKPKPPVIIT--RDSEPPMSPYILEVPGGSDIVEAVSRFCCR 133

Query: 122 RNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTT-SFP--IPNG 181
           +N GLCIL   GTVA+VTLRQPS++P ATVTFHGRFDILS+ ATF+P TT S P  +P+G
Sbjct: 134 KNIGLCILTGSGTVANVTLRQPSTTPGATVTFHGRFDILSISATFLPQTTPSCPVSVPSG 193

Query: 182 FTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDE-VRKSGFSDVEEG 241
           FTI+LAGPQGQI GGLVAGAL+ AGTV+VIAASFNNPSY RLP EDE VR SG  D    
Sbjct: 194 FTISLAGPQGQIVGGLVAGALVAAGTVYVIAASFNNPSYHRLPGEDEAVRNSGSGD---A 253

Query: 242 HSPPISGGKDS-ENTNAAADTCGLLPMYSTHSSSDAIW---SRQPAAP 281
           HSPP+SGG +S  +   ++ +CG + MYS H  +D +W   +RQP  P
Sbjct: 254 HSPPLSGGVESGGHAPPSSQSCG-MSMYSCHLPTDVLWAPTARQPPPP 293

BLAST of Cp4.1LG02g05220 vs. TrEMBL
Match: A0A067LAD4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15877 PE=4 SV=1)

HSP 1 Score: 281.2 bits (718), Expect = 1.4e-72
Identity = 163/298 (54.70%), Postives = 202/298 (67.79%), Query Frame = 1

Query: 6   HHQPPPHFQPFPRSFAVAATPPPRDS-STAADDDSTTGHSPL---------THAMLPKEP 65
           HH    H  PF + F ++      D+ ST A    TT  +           T    P EP
Sbjct: 34  HHHQQHHHLPFSQHFPLSRDSEDDDTRSTGAAAAVTTPSTAAAAAAAAITPTQKQKPIEP 93

Query: 66  ASGG--DGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEA 125
            S G  DGA+IEVVRRPRGRPPGSKNKPKP  ++   RD EPAMSPY+LEVPGGSD+VEA
Sbjct: 94  NSSGGTDGATIEVVRRPRGRPPGSKNKPKPPVIIT--RDPEPAMSPYILEVPGGSDVVEA 153

Query: 126 ISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFP 185
           ISRFCRR+N G+C+L   GTVA+VTLRQPS++P +T+TFHGRFDILS+ ATF+P T S+P
Sbjct: 154 ISRFCRRKNIGICVLTGSGTVANVTLRQPSTTPGSTITFHGRFDILSISATFMPQTVSYP 213

Query: 186 IPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGF--S 245
           +PN FTI+LAGPQGQI GGLVAG+L+ AGTV+V+AA+ NNPS+ RLP+++E R SG    
Sbjct: 214 VPNTFTISLAGPQGQIVGGLVAGSLVAAGTVYVVAATLNNPSHHRLPADEEGRNSGSGGG 273

Query: 246 DVEEGHSPPIS----GGKDSENT--NAAADTCGLLPMYSTHSSSDAIW---SRQPAAP 281
              EG SP +S    GG DS +T      D+CG++ MYS H  S+ IW   +R P  P
Sbjct: 274 GGGEGSSPAVSGAGGGGGDSGHTQGGGGGDSCGMV-MYSCHLPSEVIWAPTARPPPPP 328

BLAST of Cp4.1LG02g05220 vs. TAIR10
Match: AT5G49700.1 (AT5G49700.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 252.3 bits (643), Expect = 3.6e-67
Identity = 137/246 (55.69%), Postives = 175/246 (71.14%), Query Frame = 1

Query: 43  HSPLTHAMLPKEPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVL 102
           HS  +H  L        D +SIEVVRRPRGRPPGSKNKPKP   V   RD +P MSPY+L
Sbjct: 31  HSLTSHFHLSSTVTPTVDDSSIEVVRRPRGRPPGSKNKPKPPVFVT--RDTDPPMSPYIL 90

Query: 103 EVPGGSDIVEAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPV-ATVTFHGRFDILSV 162
           EVP G+D+VEAI+RFCRR++ G+C+L+  G+VA+VTLRQPS + + +T+TFHG+FD+LSV
Sbjct: 91  EVPSGNDVVEAINRFCRRKSIGVCVLSGSGSVANVTLRQPSPAALGSTITFHGKFDLLSV 150

Query: 163 CATFVPH----TTSFPIPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQ 222
            ATF+P     + S P+ N FT++LAGPQGQI GG VAG LI AGTV+VIAASFNNPSY 
Sbjct: 151 SATFLPPPPRTSLSPPVSNFFTVSLAGPQGQIIGGFVAGPLISAGTVYVIAASFNNPSYH 210

Query: 223 RLPSEDEVRKSGFSDVEEGHSPPISGG--KDSENTNAAADTCGLLPMYSTH-SSSDAIWS 281
           RLP+E+E + S  +   EG SPP+SGG  +  +   +  ++CG + MYS H   SD IW+
Sbjct: 211 RLPAEEEQKHSAGTGEREGQSPPVSGGGEESGQMAGSGGESCG-VSMYSCHMGGSDVIWA 270

BLAST of Cp4.1LG02g05220 vs. TAIR10
Match: AT1G14490.1 (AT1G14490.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 208.8 bits (530), Expect = 4.5e-54
Identity = 114/223 (51.12%), Postives = 147/223 (65.92%), Query Frame = 1

Query: 64  IEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRRRNT 123
           +E V RPRGRP GSKNKPK    V      +P MSPY+LEVP G+D+VEA++RFCR +  
Sbjct: 1   METVGRPRGRPRGSKNKPKAPIFVTI----DPPMSPYILEVPSGNDVVEALNRFCRGKAI 60

Query: 124 GLCILNAYGTVADVTLRQPS-SSPVATVTFHGRFDILSVCATFVPH----TTSFPIPNGF 183
           G C+L+  G+VADVTLRQPS ++P +T+TFHG+FD+LSV ATF+P     + S P+ N F
Sbjct: 61  GFCVLSGSGSVADVTLRQPSPAAPGSTITFHGKFDLLSVSATFLPPLPPTSLSPPVSNFF 120

Query: 184 TITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGHS 243
           T++LAGPQG++ GG VAG L+ AGTV+ +A SF NPSY RLP+ +E +++     EEG S
Sbjct: 121 TVSLAGPQGKVIGGFVAGPLVAAGTVYFVATSFKNPSYHRLPATEEEQRNSAEGEEEGQS 180

Query: 244 PPISGGKDSENTNAAADTCGLLPMYSTHSSSDAIWSRQPAAPS 282
           PP+SGG             G   MY     SD IW     APS
Sbjct: 181 PPVSGG-------------GGESMYV--GGSDVIWDPNAKAPS 204

BLAST of Cp4.1LG02g05220 vs. TAIR10
Match: AT4G12050.1 (AT4G12050.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 157.9 bits (398), Expect = 9.2e-39
Identity = 91/215 (42.33%), Postives = 133/215 (61.86%), Query Frame = 1

Query: 54  EPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEA 113
           E  SGG G+  ++ RRPRGRP GSKNKPK  A ++  RD+  A+  +V+E+  G DIV+ 
Sbjct: 104 EGGSGGGGSGEQMTRRPRGRPAGSKNKPK--APIIITRDSANALRTHVMEIGDGCDIVDC 163

Query: 114 ISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFP 173
           ++ F RRR  G+C+++  G+V +VT+RQP S P + V+ HGRF+ILS+  +F+P     P
Sbjct: 164 MATFARRRQRGVCVMSGTGSVTNVTIRQPGSPPGSVVSLHGRFEILSLSGSFLPPPAP-P 223

Query: 174 IPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDV 233
              G ++ LAG QGQ+ GG V G L+ +G V V+AASF+N +Y+RLP E++  ++     
Sbjct: 224 AATGLSVYLAGGQGQVVGGSVVGPLLCSGPVVVMAASFSNAAYERLPLEEDEMQTPVQGG 283

Query: 234 EEG-------HSPPISGGKDSENTNAAADTCGLLP 262
             G        SPP+ G + +    AAA   GL P
Sbjct: 284 GGGGGGGGGMGSPPMMGQQQAMAAMAAAQ--GLPP 313

BLAST of Cp4.1LG02g05220 vs. TAIR10
Match: AT4G22810.1 (AT4G22810.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 157.9 bits (398), Expect = 9.2e-39
Identity = 89/209 (42.58%), Postives = 129/209 (61.72%), Query Frame = 1

Query: 58  GGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRF 117
           GG G   ++ RRPRGRP GSKNKPKP  ++   RD+  A+  +V+E+  G D+VE+++ F
Sbjct: 95  GGSGGDHQMTRRPRGRPAGSKNKPKPPIIIT--RDSANALRTHVMEIGDGCDLVESVATF 154

Query: 118 CRRRNTGLCILNAYGTVADVTLRQPSS--SPVATVTFHGRFDILSVCATFVPHTTSFPIP 177
            RRR  G+C+++  G V +VT+RQP S  SP + V+ HGRF+ILS+  +F+P     P  
Sbjct: 155 ARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLPPPAP-PTA 214

Query: 178 NGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEE 237
            G ++ LAG QGQ+ GG V G L+ AG V V+AASF+N +Y+RLP E++  ++       
Sbjct: 215 TGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLEEDEMQTPVHGGGG 274

Query: 238 G---HSPPISGGKDSENTNAAADTCGLLP 262
           G    SPP+ G +      A +   GL P
Sbjct: 275 GGSLESPPMMGQQLQHQQQAMSGHQGLPP 300

BLAST of Cp4.1LG02g05220 vs. TAIR10
Match: AT2G45430.1 (AT2G45430.1 AT-hook motif nuclear-localized protein 22)

HSP 1 Score: 157.9 bits (398), Expect = 9.2e-39
Identity = 87/200 (43.50%), Postives = 123/200 (61.50%), Query Frame = 1

Query: 28  PRDSSTAADDDSTTGHSPLTHAMLPKEPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVV 87
           P + S+A  D ST G             + GG G    + RRPRGRP GSKNKPKP  ++
Sbjct: 58  PNEHSSAGKDQSTPGSGG---------ESGGGGGGDNHITRRPRGRPAGSKNKPKPPIII 117

Query: 88  VAARDAEPAMSPYVLEVPGGSDIVEAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSP- 147
              RD+  A+  +V+EV  G D++E+++ F RRR  G+C+L+  G V +VT+RQP+S P 
Sbjct: 118 T--RDSANALKSHVMEVANGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPG 177

Query: 148 --VATVTFHGRFDILSVCATFVPHTTSFPIPNGFTITLAGPQGQIFGGLVAGALIGAGTV 207
              + V  HGRF+ILS+  +F+P     P  +G TI LAG QGQ+ GG V G L+ +G V
Sbjct: 178 GGSSVVNLHGRFEILSLSGSFLPPPAP-PAASGLTIYLAGGQGQVVGGSVVGPLMASGPV 237

Query: 208 FVIAASFNNPSYQRLPSEDE 225
            ++AASF N +Y+RLP E++
Sbjct: 238 VIMAASFGNAAYERLPLEED 245

BLAST of Cp4.1LG02g05220 vs. NCBI nr
Match: gi|700205123|gb|KGN60256.1| (hypothetical protein Csa_3G890120 [Cucumis sativus])

HSP 1 Score: 465.7 bits (1197), Expect = 5.9e-128
Identity = 247/291 (84.88%), Postives = 254/291 (87.29%), Query Frame = 1

Query: 1   MFPKPHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAMLPKEPASGGD 60
           MFPKPHHQPP H QPFPRSF V ATPPPRDS    DDDS    SPLTHA++ KEP S  D
Sbjct: 1   MFPKPHHQPPSHSQPFPRSFPVTATPPPRDSP---DDDSPAAPSPLTHAIVLKEPPSSAD 60

Query: 61  GASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRR 120
           GASIEV RRPRGRPPGSKNKPKPAAVVVA RDAEP MSPYVLEVPGGSDIVEAISRFCRR
Sbjct: 61  GASIEVARRPRGRPPGSKNKPKPAAVVVANRDAEPPMSPYVLEVPGGSDIVEAISRFCRR 120

Query: 121 RNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIPNGFTI 180
           RNTGLCILNAYGTV DVTLRQP+SSPV TVTFHGRFDILSVCATFVP TTSFPIPNGFTI
Sbjct: 121 RNTGLCILNAYGTVGDVTLRQPASSPVGTVTFHGRFDILSVCATFVPQTTSFPIPNGFTI 180

Query: 181 TLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGHSPP 240
           TLAGPQGQIFGGLVAG+LIG GTV+VIAASFNNPSYQRLPSEDEVRK  FSDVEEGHS P
Sbjct: 181 TLAGPQGQIFGGLVAGSLIGVGTVYVIAASFNNPSYQRLPSEDEVRKLTFSDVEEGHS-P 240

Query: 241 ISGGKDSENTNA--AADTCGLLPMYSTHSSSDAIW---SRQPAAPSHHRQY 287
           ISGGKDSENT A  A +TCGLLPMYSTHSSSD IW   +RQPA   HHRQY
Sbjct: 241 ISGGKDSENTAAGGAQETCGLLPMYSTHSSSDVIWTPAARQPA--HHHRQY 285

BLAST of Cp4.1LG02g05220 vs. NCBI nr
Match: gi|659132378|ref|XP_008466166.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 464.2 bits (1193), Expect = 1.7e-127
Identity = 243/291 (83.51%), Postives = 252/291 (86.60%), Query Frame = 1

Query: 1   MFPKPHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAMLPKEPASGGD 60
           MFPKPHHQPP H QPFPRSF V ATPPPRDS    DDDS    SPL+HA++ KEP +  D
Sbjct: 1   MFPKPHHQPPSHSQPFPRSFPVTATPPPRDSP---DDDSAAAPSPLSHAIVLKEPPASAD 60

Query: 61  GASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRFCRR 120
           GASIEVVRRPRGRPPGSKNKPKP AVVVA RDAEP MSPYVLEVPGGSDIVEAISRFCRR
Sbjct: 61  GASIEVVRRPRGRPPGSKNKPKPTAVVVAGRDAEPPMSPYVLEVPGGSDIVEAISRFCRR 120

Query: 121 RNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIPNGFTI 180
           RNTGLCILNAYGTV DVTLRQP+SS V TVTFHGRFDILSVCATFVP TTSFPIPNGFTI
Sbjct: 121 RNTGLCILNAYGTVGDVTLRQPASSSVGTVTFHGRFDILSVCATFVPQTTSFPIPNGFTI 180

Query: 181 TLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGHSPP 240
           TLAGPQGQIFGGLVAG+LIG GTV+VIAASFNNPSYQRLPSEDEVRK  FSDVEE HS P
Sbjct: 181 TLAGPQGQIFGGLVAGSLIGVGTVYVIAASFNNPSYQRLPSEDEVRKLTFSDVEEAHS-P 240

Query: 241 ISGGKDSENTNA--AADTCGLLPMYSTHSSSDAIW---SRQPAAPSHHRQY 287
           ISGGKDSENT A  A +TCGLLPMYSTHSSSD IW   +RQP  P HHRQY
Sbjct: 241 ISGGKDSENTAAGGAPETCGLLPMYSTHSSSDVIWTPAARQPPPPHHHRQY 287

BLAST of Cp4.1LG02g05220 vs. NCBI nr
Match: gi|1009137266|ref|XP_015885961.1| (PREDICTED: AT-hook motif nuclear-localized protein 17-like [Ziziphus jujuba])

HSP 1 Score: 306.2 bits (783), Expect = 5.9e-80
Identity = 184/304 (60.53%), Postives = 210/304 (69.08%), Query Frame = 1

Query: 1   MFPKPHHQPP-----------PH-FQPFPRSFAVAATP--------PPRDSSTAADDDST 60
           MF K HH  P           PH   PF   F V              R SS AA   +T
Sbjct: 15  MFSKLHHPHPHQHQHQHQQHHPHQHHPFANPFQVITRECQTSEEDDTSRSSSGAATATTT 74

Query: 61  TGHSPLTHAMLPKEPASGGDGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPY 120
           T  +P   A   K+P   GDGA+IEVVRRPRGRPPGSKNKPKP  ++   RD EPAMSPY
Sbjct: 75  TTSNPT--AQRAKDPNPPGDGATIEVVRRPRGRPPGSKNKPKPPVIIT--RDTEPAMSPY 134

Query: 121 VLEVPGGSDIVEAISRFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILS 180
           +LEVPGG+DIVEAI+RFCRR+N GLC+L   GTVA+VTLRQPS++P ATVTFHGRFDILS
Sbjct: 135 ILEVPGGNDIVEAIARFCRRKNMGLCVLTGSGTVANVTLRQPSTTPGATVTFHGRFDILS 194

Query: 181 VCATFVPHTT-SFPIPNGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRL 240
           +CATF+P TT S P+PNGFTI+LAGPQGQI GGLVAG LI AGTV+VI ASFNNPSY RL
Sbjct: 195 ICATFLPQTTASCPVPNGFTISLAGPQGQIVGGLVAGTLIAAGTVYVIGASFNNPSYHRL 254

Query: 241 PSEDEVRKSGFSDVEEGHSPPISGGKDSENTNAAADTCGLLPMYSTHSSSDAIW---SRQ 281
           P+EDEVR SG +D   GHSP  SGG DS   +A A++CG + MYS H  SD IW   +RQ
Sbjct: 255 PAEDEVRNSGSAD---GHSPQ-SGGGDS-GGHAPAESCG-MSMYSCHLPSDVIWAPTARQ 308

BLAST of Cp4.1LG02g05220 vs. NCBI nr
Match: gi|566157292|ref|XP_002302339.2| (hypothetical protein POPTR_0002s10520g [Populus trichocarpa])

HSP 1 Score: 289.3 bits (739), Expect = 7.5e-75
Identity = 162/293 (55.29%), Postives = 206/293 (70.31%), Query Frame = 1

Query: 1   MFPKPHHQPPPHFQPFPRSFAVAATPPPRDS-STAADDDSTTGHSPLTHAMLPKEPASGG 60
           MF K H +   H  PF + +  +      D+ ST A   +T   +P T      EP S G
Sbjct: 19  MFSKLHPRHHQHL-PFSQQYQFSRESEEEDTRSTGAA--ATPNLTPTTQKQKLNEPNSSG 78

Query: 61  --DGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAISRF 120
             DGA+IEVVRRPRGRPPGSKNKPKP  ++   R++EP+MSPY+LEVPGG+D+VEA+SRF
Sbjct: 79  GTDGATIEVVRRPRGRPPGSKNKPKPPVIIT--RESEPSMSPYILEVPGGNDVVEALSRF 138

Query: 121 CRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIPNG 180
           CRR+N G+C+L   GTVA+VTLRQPS++P AT+TFHGRFDILS+ ATF+P T S+P+PN 
Sbjct: 139 CRRKNMGICVLTGSGTVANVTLRQPSATPGATITFHGRFDILSISATFLPQTASYPVPNS 198

Query: 181 FTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEEGH 240
           FTI+LAGPQGQI GG+VAG+L+ AGTVFV+AASFNNPSY RLP E+E R SG     EG 
Sbjct: 199 FTISLAGPQGQIVGGIVAGSLVAAGTVFVVAASFNNPSYHRLPLEEEGRTSGSDGGGEGQ 258

Query: 241 SPPISGGKDSENTNAAA-----DTCGLLPMYSTHSSSDAIW---SRQPAAPSH 283
           SP +SG    E+ +AA+     ++CG + MYS H  +D IW   +R P  P +
Sbjct: 259 SPAVSGAGGGESGHAASGGGGGESCG-IAMYSCHMPNDVIWAPAARPPPPPPY 305

BLAST of Cp4.1LG02g05220 vs. NCBI nr
Match: gi|743802489|ref|XP_011016916.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Populus euphratica])

HSP 1 Score: 286.2 bits (731), Expect = 6.4e-74
Identity = 160/294 (54.42%), Postives = 202/294 (68.71%), Query Frame = 1

Query: 1   MFPK---PHHQPPPHFQPFPRSFAVAATPPPRDSSTAADDDSTTGHSPLTHAMLPKEPAS 60
           MF K   PHHQ  P  Q +  S         R+S       +T   +P T      EP S
Sbjct: 19  MFSKLQPPHHQHLPFSQQYQFS---------RESEEEDTRSTTPNLTPTTQKQKLNEPNS 78

Query: 61  GG--DGASIEVVRRPRGRPPGSKNKPKPAAVVVAARDAEPAMSPYVLEVPGGSDIVEAIS 120
            G  DGA+IEVVRRPRGRPPGSKNKPKP  ++   R+ EP+MSPY+LEVPGG+D+VEA+S
Sbjct: 79  SGGTDGATIEVVRRPRGRPPGSKNKPKPPVIIT--REPEPSMSPYILEVPGGNDVVEALS 138

Query: 121 RFCRRRNTGLCILNAYGTVADVTLRQPSSSPVATVTFHGRFDILSVCATFVPHTTSFPIP 180
           RFCRR+N G+C+L   GTVA+VTLRQPS++P AT+TFHGRFDILS+ ATF+P T S+ +P
Sbjct: 139 RFCRRKNMGICVLTGSGTVANVTLRQPSATPGATITFHGRFDILSISATFLPQTASYLVP 198

Query: 181 NGFTITLAGPQGQIFGGLVAGALIGAGTVFVIAASFNNPSYQRLPSEDEVRKSGFSDVEE 240
           + FTI+LAGPQGQI GG+VAG+L+ AGT+FV+AASFNNPSY RLP E+E R SG     E
Sbjct: 199 SSFTISLAGPQGQIVGGIVAGSLVAAGTIFVVAASFNNPSYHRLPLEEERRTSGSGGGGE 258

Query: 241 GHSPPISGGKDSENTNAAA----DTCGLLPMYSTHSSSDAIW---SRQPAAPSH 283
           G SP +SG    E+ +AA+    ++CG + MYS H  +D IW   +R P  P +
Sbjct: 259 GQSPAVSGAGGEESGHAASGGGGESCG-IAMYSCHMPNDVIWTPAARPPPPPPY 300

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AHL17_ARATH6.4e-6655.69AT-hook motif nuclear-localized protein 17 OS=Arabidopsis thaliana GN=AHL17 PE=2... [more]
AHL28_ARATH8.1e-5351.12AT-hook motif nuclear-localized protein 28 OS=Arabidopsis thaliana GN=AHL28 PE=2... [more]
AHL26_ARATH1.6e-3742.33AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2... [more]
AHL24_ARATH1.6e-3742.58AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2... [more]
AHL22_ARATH1.6e-3743.50AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1... [more]
Match NameE-valueIdentityDescription
A0A0A0LJP1_CUCSA4.1e-12884.88Uncharacterized protein OS=Cucumis sativus GN=Csa_3G890120 PE=4 SV=1[more]
B9GN33_POPTR5.2e-7555.29Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s10520g PE=4 SV=2[more]
B9RJW1_RICCO1.7e-7355.67DNA binding protein, putative OS=Ricinus communis GN=RCOM_1039230 PE=4 SV=1[more]
M5XN52_PRUPE2.2e-7357.99Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022234mg PE=4 SV=1[more]
A0A067LAD4_JATCU1.4e-7254.70Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15877 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G49700.13.6e-6755.69 Predicted AT-hook DNA-binding family protein[more]
AT1G14490.14.5e-5451.12 Predicted AT-hook DNA-binding family protein[more]
AT4G12050.19.2e-3942.33 Predicted AT-hook DNA-binding family protein[more]
AT4G22810.19.2e-3942.58 Predicted AT-hook DNA-binding family protein[more]
AT2G45430.19.2e-3943.50 AT-hook motif nuclear-localized protein 22[more]
Match NameE-valueIdentityDescription
gi|700205123|gb|KGN60256.1|5.9e-12884.88hypothetical protein Csa_3G890120 [Cucumis sativus][more]
gi|659132378|ref|XP_008466166.1|1.7e-12783.51PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo][more]
gi|1009137266|ref|XP_015885961.1|5.9e-8060.53PREDICTED: AT-hook motif nuclear-localized protein 17-like [Ziziphus jujuba][more]
gi|566157292|ref|XP_002302339.2|7.5e-7555.29hypothetical protein POPTR_0002s10520g [Populus trichocarpa][more]
gi|743802489|ref|XP_011016916.1|6.4e-7454.42PREDICTED: putative DNA-binding protein ESCAROLA [Populus euphratica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005175PPC_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005634 nucleus
molecular_function GO:0003674 molecular_function
molecular_function GO:0003680 AT DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG02g05220.1Cp4.1LG02g05220.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005175PPC domainPFAMPF03479DUF296coord: 99..212
score: 1.8
IPR005175PPC domainPROFILEPS51742PPCcoord: 94..234
score: 31
NoneNo IPR availableGENE3DG3DSA:3.30.1330.80coord: 95..223
score: 9.1
NoneNo IPR availablePANTHERPTHR31100FAMILY NOT NAMEDcoord: 7..278
score: 2.1E
NoneNo IPR availablePANTHERPTHR31100:SF1AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 17-RELATEDcoord: 7..278
score: 2.1E
NoneNo IPR availableunknownSSF117856AF0104/ALDC/Ptd012-likecoord: 96..223
score: 2.59