Cp4.1LG05g07180 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG05g07180
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: AT hook motif DNA-binding family protein
Location: Cp4.1LG05 : 4376541 .. 4382154 (-)
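
The location field packs the scaffold name, start and end coordinates, and strand into one string. A minimal Python sketch for unpacking it is shown here; the names `Location` and `parse_location` are hypothetical, and the span length assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # scaffold / linkage group name
    start: int   # leftmost coordinate as printed
    end: int     # rightmost coordinate as printed
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a 'SEQID : START .. END (STRAND)' string as printed in the record."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG05 : 4376541 .. 4382154 (-)")
# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

Under that assumption the gene spans 5614 bp on the minus strand of Cp4.1LG05.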
The following sequences are available for this feature:

Gene sequence (with intron)

TCTTAATTTTCTCTTTTTTTTTTTTTCTGATTTAATTTTATTTAATTTTTTCCATTCCTTTTTTCTACTTTATTTATTTCTCCTCATTTCCTCTGCAAAACCCTAATTTTCTACATCTCGAAAAAATTTCTGTGATTGTTCTGTATTTTTTGGAATTTTTTTCTTGTATTTCTGAGATTTTTTTTACTGTTTTTATTTTATTTTTATTTTTTTTTACGATTATTGGGGTAAGTGATTTGTTGGATTTTTCCTAAACTGGAGCTTTGAGTAGTGGTCTGTAATGGCGGAGATTTAGGATACTGAGAAGAAACTGAGTTGAAGTTGTTGTACGGAAGCTCTGGTTCTTGATCTACAAGTAGTTCTACTGGAGGTTTTGGTTTTGGGGATTGGTGATAGGTTTTGGATTTCATTTCACTTCATTGGCCCTGTTTTTGTGTTCTTATTTCACCGTCTTCATCATTGAAACCTTCTTCGTCGGTTTGTTTGTAAAATATTGTATCTGAAGTTTGTAAACCGAAGAATGGAGGTTTAAGAAGCGGTTACGTAGTTTTGTTTTGTAACCAATTTCTTTTCTAGACTGTAAAAATGGAGGAAAAAGAGGGTGTTGATTTCGGATTTGCATTAAAAGTGAGCCAAGCTCCAGATAGCTTCGGGATGATGGATTCGAGACCTGAAAATACAAGCACGGACAGCGAGCCACCTCCTCCGCCTCCGCCGCCGGCGACGGCGACGGTGGAAGCAGCTCTGACGCTGGCAGGTACAGATGGAAAGAAGAAAAGAGGGAGGCCGAGAAAGTACGGACCCGATGGGACTGTAGTGCCAACATTGTCGCCAATGCCGATTTCATCATCGATTCCGCTGACGGGAGAGTTTCCGGGATGGAAACGGGGAAGGGGGAGGTCAGTAGAATCGCTCAAGAAGTCGCGGAAGTACGATTATGAGATTCCAGGTAAAATATCAATGGTCTATTATATATTTGGAATCATTTGATTTGAAATTCTGTTGGAATCGTTGGATCATTTGAAATTCTGTTATGCTGGTTTACTTTACTTGTGTTTACTGTTACATTTGAGTGCCGATCCGTTTGATAGCAACAACTAATATGGATTGAAATCAGCTTTAAAACTTCGATATATATTCTTCTCTCTCTCTCTCTCTCTTTATTAGTGTGAGAGAATCTCTGTACTTGCAGGAAAAGATAAGAGGGGTCGGTCTAGTTATCTGGCGTAGAAAGCGATGGTTTTGAACGTATATTTGTGGGTTTTGTACCGTAGAATTATGAGGGGTTGTTTATTGGGTTTTGATGTGATAGTGGACTGCATCGTTGGTTTCACTTCAGATCTGCCACAAATAGTACTTAATGGATCAAAAGTGACTTTGGAGCTTTCTGGAAAATGGAGGGAACTTTCCTCCTTAGGATAGGGATAGCCTTTGAGAAAATAGCCATTGTTTGGAGTTTGGACGTGAGTGACCTGCTGTTTATTATGAACAGAAAATTTGTGGTCTAACATGGCTTAAACTTTCCGAACTGGGCTGCTTAAAATGAACAGAAAATGGAAGGAAAATGCATGGTACGTTATTATGAGAGAGTTTTTTTTGGAAATTTCAAATGAAAACTGTAATTGGGAGATGAAATGATTTAGAGCATGTGGGTAACTGTTATCTGTTGGTGGTATTAGGTAATTGAAATTAGTGCTTTTTTTGTAACTGAAGAATATGGAAAATATTCTTATATTATCTTAAATATATAGCATGTTGATTAAGATATTCTGGAGTATTAGTTTAATTGGGAGTGGATGAAGCGAGGAAGGTGAGTATTAATTGATTGAAAATAAATAATCATGAAAACCCCATTTTATTCCCTTTTTCTGTTCCCTTAAATGGAAAACCAATGCCTTATATCATGAAAACAGAGTATGGTTCAACAGAGGATAAGCTATCCACAAAAGATGCTCAAAGTTACGAGCTAATGATTGTATTTTAAGTCGGAGTGAAATGAA
AATGTTAGCAGATATGAAGTCCAGCTAAAAGTTTAAATTTTCAACCATTGTAATTTAAACTAGATGAAAGCATGCATTCAGGATTTAGGAAATCAGAGTTGGTTGCCCTTCTGTACTAAACTGTGGGGCGGGTTATTAGAGTTTCCAAGAAGGATTTTTTTTTTTGTCTCGGTCGTATGAATTCTGATATGAGATGAGGGTGATTGGATTAATTGCATTTTTCTCATTGACATTTTGAATTGATCAGGAAATATGTTTTGCGTTGTGTTCGAATTTAGGCTGATTTCCTTTGTGGTAAAGGAGATAATTATTTGTGTTTTTCATCATTCAAAAGCATAACCTATCTGAACATTGATTTATGCTTGATTGTAAATTATTCAATGGTGAGAGTTGTAATTAATGTCGTAAATAAATATTCTGAAGTTAGGTATTTACGACGTCTGGACCTAGTCATGCAGATGCAATTTCATTGTAAAAGGAGAAATTTTATCTCATTTCAGCATCATTCTTACATAATGCTTCGGTTACCCAGGTAACAAGGTTGCCTTCTTTGCTGGAGCAGATTTCACACCTCACGTGATCACTGTTAATATTGGTGAGGTAAGTCTTAGAATCCCTTGTCAAGGAGCACCTATGCTGCCATTAGTGAATTAGTATTGGATCAAATAATATGGTTTAGCAACATGATTGTTTGTATTTTATTCAATTTGGTGCGGGAGATAACAGCAGCTGTCACGATATAACTGATGTGTTATTTCACGAGTTACTGTACAATACGACATGGGCATGATCATGTGATCATTTTCACTTTAATGAATTTGTTCAATGTGAAATTTAAACCTGCTTGAATTTTGGGTAGTATTGAAAACCTTGAGGCTCAAATGACCACTTGTTTCTGGCTTGTCGAAACAGAAGGGTTTTCGCCGGTCCTGCTATTGATATACTAAACGAAACTTCTGGAAGTAAATTTTATTTGTTGAATTGTGCAAAAAATTACCAAGACCACCTGCAACTTTGCTTGTGCGAACCAGATTAAACATCGATAATCAGTTAAATAATTTGGAGGAATTAATTACAAAAGTTTTCAGTGATTTGCCATTGCAATGATCTCACAATTTCTGTTCATTCTTGATACATTGTTGCCACTTGTTCCTCATACAACTTTTTCTACTCCTTTAGGATGTTAACTTGAAAGTAATGTCATTTTCTCAACAAGGATCTCGGGCCATTTGTATACTCTCTGCTAATGGTATGGTTTCAAATGTCACACTTCGGCAGTCAACATCTTCTGGGGGTACCCTAACATATGAGGTAGAGCACCTTCCTTGATAATATAACATCTAAGATCCTTTGTATTGATGAGTCTATGGGTCCACTACGTGCATATTCTATAAAATGAGACCACGTTTGATCTTATAGCTTATTCCATGCTCTGTTTTCTTTCCGAGGCTCATTTTGTTTTTTGTTCTTCAAGTCATATATCTGACTGCACCGAGATCTGATGCGTATTTGGCTGTGTTTTCTTTCCGATGTTTCATTTTAATTGTTATGCGTGCATATGAACATTTTTAGGATATTGAACTTTATTTCTTTTACTAATAAATTCTTAAGAACAACCTATGAATTACGAAATAAGGACTGCTGTCTAGAACTTGATGCTCTCCGAAACAAAGCCCCACTCAACAAAACTCTTGCTCCTCTAACTCCTTTATTCTCCTCAGCAGGACCTATAGGTGCTTCCCTTGATTTTTCACTCTCTTTTTCCTTGAATACTCAAGCCTAGTGAATCCTGTAATCTAACTTTTGTAATCCCAGTGTGATAAGAATTTTCTTGCTTATTTGAACTTTTTTTTCTGACAAATGAGTGTCCAAAACTGCTTTCACTTGTACATTGTTTTAAAAAAAAGTTGCTGCTTTCAGTTTTTAGTTTAAGTTTTTTAAAAACAATATTTTTTAATTTTACATAATTTATTTGTTTAAAAAAATTAATTTGTTTTCTC
AAAATTTCTTTATTTTGTATTTCACATTGTTAAAACATACTTTGCTTTCCTAGCTAGTTTTAAANGAACAAGGTAGGTTGCAGATTCTGACTCTGACACTTCAAATTGCTAAAAATTTGAATGCTCTGTTCGTAAGAGTGCGAATTGTTTCCTTCTTGTGTGTAACTAAGCTTGTTCTTTAGAAACAGACCCAAAAATAGAATAATTATAAAAGAGGCCCTAAATTTTATTGTTCTTTTTTCAAATTTTTTGAGGAAGACAACTATCTCATAATTTTAACATATTTGTAAAGAGGAAAAAAAAACGAAAAAACTCCTCATATTCTAGATATTTTCTTGAGAGCGTAGCAATTATCATTTATCTGTGTAGTGTTTTACATTAACTCTATCGAAATAAATGTTCTCTTGAAGCATAATTGATGGTTAGCCAAGTTGAAAATTTCAAATGATCTTGATGCAAAGTTTTCTTGCATCCATGATTGCTCAAAAGTCAAACTTGTGAACTCGTTTATCGCATGTTACTTGTTACAATTAGCCACTCTATGCAATTTTATCCCTGCATCTTATCTTCAACTGTCATGGTAGCATTAGTTGTAGAGGAACCTCATGAACTGTTATGATTATTAGGGTCGATTTGAGATACTTTCGTTATCTGGATCATATATGCCTTCCAAGATTGGTGGAACAAAGAGCCGATCCGGAGGGATGAGTGTCTCTTTGGCTTGCCCAGATGGCCGAGTAATGGGTGGAGGACTTTCCGGCATGCTGATAGCAGCTGGTCCAGTGCAGGTATTATTTTCTCCAAATGAGTCGAATTTATAAGGGCTTAAGTTCAGTACCCAGAGGCACAAATCCATGATTTGTTTTAATAGTAGTTTGTTCGTAAAAATATAAACATACATATATGTAGAGTTCGATTTTGAAGAAAATATAACTTACATAAATACAAAATAAATTGTGTGCCTTCCATGGTCGCCAGGTGGTGGTGGGCAGTTTCCAACCACCAGGCCACCAACAGGAGAATAAACCAAAGAAGAGCAGGATGGAACCTACATCAAATGCAATTTCACCTCCTCCCGCCATTAAACCCATAGGAGATAAAGCAGATTCTTTAGACCCAAATCCAGCTTTTACAACTTCACCAGTCAACGAAAAACCACCTTCTCCAGAAGAATCAAGAGTTGTCCTCAACCATTCAAACCATGAGGTATCTTGTTGATCGCCATCACCTGCATTTAGAATCTTTCATGTAGATTAGCTGGTGCATTGACATTTTCTCAACCTTGATCATCAGCGGTATGAGCTCATTGATTCTGTGAAGGTATGAGCTCATTGATTCTGTAGAGGGCTAACTAAGAGTGTTTAAACAACTGACGTATCTGTCGTCATCAGGTAACATCTCTAGATAGGAGTTTAAGCCTAATAGATAGATAGGAGTTTGATCATTAGGCTTATTTGTTGTTTGTTCTCAGCATGTAGGATCTCGTGTGTTGTATCCCACTTAACGTTATAGGAGTTTTAAGTGCAGAAATTTGTGTAAATTATTCATGTTCTAGAATTCTCTGTATCTGAAATAATAACATTTTAAGTTCAATTTCAAAGTCTGTTTCCCAGT

mRNA sequence

TCTTAATTTTCTCTTTTTTTTTTTTTCTGATTTAATTTTATTTAATTTTTTCCATTCCTTTTTTCTACTTTATTTATTTCTCCTCATTTCCTCTGCAAAACCCTAATTTTCTACATCTCGAAAAAATTTCTGTGATTGTTCTGTATTTTTTGGAATTTTTTTCTTGTATTTCTGAGATTTTTTTTACTGTTTTTATTTTATTTTTATTTTTTTTTACGATTATTGGGGTAAGTGATTTGTTGGATTTTTCCTAAACTGGAGCTTTGAGTAGTGGTCTGTAATGGCGGAGATTTAGGATACTGAGAAGAAACTGAGTTGAAGTTGTTGTACGGAAGCTCTGGTTCTTGATCTACAAGTAGTTCTACTGGAGGTTTTGGTTTTGGGGATTGGTGATAGGTTTTGGATTTCATTTCACTTCATTGGCCCTGTTTTTGTGTTCTTATTTCACCGTCTTCATCATTGAAACCTTCTTCGTCGGTTTGTTTGTAAAATATTGTATCTGAAGTTTGTAAACCGAAGAATGGAGGTTTAAGAAGCGGTTACGTAGTTTTGTTTTGTAACCAATTTCTTTTCTAGACTGTAAAAATGGAGGAAAAAGAGGGTGTTGATTTCGGATTTGCATTAAAAGTGAGCCAAGCTCCAGATAGCTTCGGGATGATGGATTCGAGACCTGAAAATACAAGCACGGACAGCGAGCCACCTCCTCCGCCTCCGCCGCCGGCGACGGCGACGGTGGAAGCAGCTCTGACGCTGGCAGGTACAGATGGAAAGAAGAAAAGAGGGAGGCCGAGAAAGTACGGACCCGATGGGACTGTAGTGCCAACATTGTCGCCAATGCCGATTTCATCATCGATTCCGCTGACGGGAGAGTTTCCGGGATGGAAACGGGGAAGGGGGAGGTCAGTAGAATCGCTCAAGAAGTCGCGGAAGTACGATTATGAGATTCCAGGTAACAAGGTTGCCTTCTTTGCTGGAGCAGATTTCACACCTCACGTGATCACTGTTAATATTGGTGAGGGTCGATTTGAGATACTTTCGTTATCTGGATCATATATGCCTTCCAAGATTGGTGGAACAAAGAGCCGATCCGGAGGGATGAGTGTCTCTTTGGCTTGCCCAGATGGCCGAGTAATGGGTGGAGGACTTTCCGGCATGCTGATAGCAGCTGGTCCAGTGCAGGTGGTGGTGGGCAGTTTCCAACCACCAGGCCACCAACAGGAGAATAAACCAAAGAAGAGCAGGATGGAACCTACATCAAATGCAATTTCACCTCCTCCCGCCATTAAACCCATAGGAGATAAAGCAGATTCTTTAGACCCAAATCCAGCTTTTACAACTTCACCAGTCAACGAAAAACCACCTTCTCCAGAAGAATCAAGAGTTGTCCTCAACCATTCAAACCATGAGCGGTATGAGCTCATTGATTCTGTGAAGGTATGAGCTCATTGATTCTGTAGAGGGCTAACTAAGAGTGTTTAAACAACTGACGTATCTGTCGTCATCAGGTAACATCTCTAGATAGGAGTTTAAGCCTAATAGATAGATAGGAGTTTGATCATTAGGCTTATTTGTTGTTTGTTCTCAGCATGTAGGATCTCGTGTGTTGTATCCCACTTAACGTTATAGGAGTTTTAAGTGCAGAAATTTGTGTAAATTATTCATGTTCTAGAATTCTCTGTATCTGAAATAATAACATTTTAAGTTCAATTTCAAAGTCTGTTTCCCAGT

Coding sequence (CDS)

ATGGAGGAAAAAGAGGGTGTTGATTTCGGATTTGCATTAAAAGTGAGCCAAGCTCCAGATAGCTTCGGGATGATGGATTCGAGACCTGAAAATACAAGCACGGACAGCGAGCCACCTCCTCCGCCTCCGCCGCCGGCGACGGCGACGGTGGAAGCAGCTCTGACGCTGGCAGGTACAGATGGAAAGAAGAAAAGAGGGAGGCCGAGAAAGTACGGACCCGATGGGACTGTAGTGCCAACATTGTCGCCAATGCCGATTTCATCATCGATTCCGCTGACGGGAGAGTTTCCGGGATGGAAACGGGGAAGGGGGAGGTCAGTAGAATCGCTCAAGAAGTCGCGGAAGTACGATTATGAGATTCCAGGTAACAAGGTTGCCTTCTTTGCTGGAGCAGATTTCACACCTCACGTGATCACTGTTAATATTGGTGAGGGTCGATTTGAGATACTTTCGTTATCTGGATCATATATGCCTTCCAAGATTGGTGGAACAAAGAGCCGATCCGGAGGGATGAGTGTCTCTTTGGCTTGCCCAGATGGCCGAGTAATGGGTGGAGGACTTTCCGGCATGCTGATAGCAGCTGGTCCAGTGCAGGTGGTGGTGGGCAGTTTCCAACCACCAGGCCACCAACAGGAGAATAAACCAAAGAAGAGCAGGATGGAACCTACATCAAATGCAATTTCACCTCCTCCCGCCATTAAACCCATAGGAGATAAAGCAGATTCTTTAGACCCAAATCCAGCTTTTACAACTTCACCAGTCAACGAAAAACCACCTTCTCCAGAAGAATCAAGAGTTGTCCTCAACCATTCAAACCATGAGCGGTATGAGCTCATTGATTCTGTGAAGGTATGA

Protein sequence

MEEKEGVDFGFALKVSQAPDSFGMMDSRPENTSTDSEPPPPPPPPATATVEAALTLAGTDGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVESLKKSRKYDYEIPGNKVAFFAGADFTPHVITVNIGEGRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIAAGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPPPAIKPIGDKADSLDPNPAFTTSPVNEKPPSPEESRVVLNHSNHERYELIDSVKV
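
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 27 and last 18 nucleotides of the CDS listed above; `translate` is a hypothetical helper, not part of any database tooling.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGAGGAAAAAGAGGGTGTTGATTTC"  # first 27 nt of the CDS
cds_end = "GATTCTGTGAAGGTATGA"             # last 18 nt, including the TGA stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MEEKEGVDF and DSVKV, matching the start and end of the protein sequence in the record.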
BLAST of Cp4.1LG05g07180 vs. Swiss-Prot
Match: AHL3_ARATH (AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana GN=AHL3 PE=1 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 3.5e-40
Identity = 123/296 (41.55%), Positives = 152/296 (51.35%), Query Frame = 1

Query: 1   MEEKEGVDFG------FALKVSQ---APDSFGMMDSRPENTSTDSEPPPPPPPPATATVE 60
           MEE+EG +        F LK      A D    MD  P   + +    PP   PA ATV 
Sbjct: 1   MEEREGTNINNNITSSFGLKQQHEAAASDGGYSMDPPPRPENPNPFLVPPTTVPAAATVA 60

Query: 61  AALT---------------LAGTDGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEF 120
           AA+T                +    KKKRGRPRKY PDGT+V TLSPMPISSS+PLT EF
Sbjct: 61  AAVTENAATPFSLTMPTENTSAEQLKKKRGRPRKYNPDGTLVVTLSPMPISSSVPLTSEF 120

Query: 121 PGWKRGRGRSVES--LKKSRKYDYE-------IPGNKVAFFAGADFTPHV--------IT 180
           P  KRGRGR   +  LKKS+ + ++       + G   A F GA+FTPHV        +T
Sbjct: 121 PPRKRGRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPHVLIVNAGEDVT 180

Query: 181 VNIG------------------------------------EGRFEILSLSGSYMPSKIGG 220
           + I                                     EGRFEILSL+GS+M +  GG
Sbjct: 181 MKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSLTGSFMQNDSGG 240

BLAST of Cp4.1LG05g07180 vs. Swiss-Prot
Match: AHL1_ARATH (AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana GN=AHL1 PE=1 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 5.6e-38
Identity = 124/309 (40.13%), Positives = 151/309 (48.87%), Query Frame = 1

Query: 6   GVDFGFALKVSQAPDSFGMMD-SRPENTSTDSEPPPPP---------PPPATATVEAALT 65
           G D G  +  S AP  F +   S   N S  S  PPPP         PP   +TV    T
Sbjct: 17  GNDGGITVVRSDAPSDFHVAQRSESSNQSPTSVTPPPPQPSSHHTAPPPLQISTVTTTTT 76

Query: 66  LAGTDG------KKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFP-----------G 125
            A  +G      KKKRGRPRKYGPDGTVV  LSP PISS+ P     P            
Sbjct: 77  TAAMEGISGGLMKKKRGRPRKYGPDGTVV-ALSPKPISSA-PAPSHLPPPSSHVIDFSAS 136

Query: 126 WKRGRGRSVESLKKSRKYDYEIP--GNKVAFFAGADFTPHVITVN--------------- 185
            KR + +   S  ++ KY +++   G       G +FTPH+ITVN               
Sbjct: 137 EKRSKVKPTNSFNRT-KYHHQVENLGEWAPCSVGGNFTPHIITVNTGEDVTMKIISFSQQ 196

Query: 186 -----------------------------IGEGRFEILSLSGSYMPSKIGGTKSRSGGMS 242
                                          EGRFEILSLSGS+MP+  GGT+SR+GGMS
Sbjct: 197 GPRSICVLSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRTGGMS 256

BLAST of Cp4.1LG05g07180 vs. Swiss-Prot
Match: AHL2_ARATH (AT-hook motif nuclear-localized protein 2 OS=Arabidopsis thaliana GN=AHL2 PE=2 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 2.6e-35
Identity = 122/321 (38.01%), Positives = 158/321 (49.22%), Query Frame = 1

Query: 6   GVDFGFALKVSQAPDSFGMMD-SRPENTSTDSEPPPPPPPPATATVEAALTLAGTDG--K 65
           G D G  +  S AP  F M   S   NT  +S  PPPPPPP  +   +A     + G  K
Sbjct: 13  GSDGGVTVVRSNAPSDFHMAPRSETSNTPPNSVAPPPPPPPQNSFTPSAAMDGFSSGPIK 72

Query: 66  KKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGW-----KRGRGRSVESLKKS---R 125
           K+RGRPRKYG DG  V TLSP PISS+ P T     +     KRG+ +       S    
Sbjct: 73  KRRGRPRKYGHDGAAV-TLSPNPISSAAPTTSHVIDFSTTSEKRGKMKPATPTPSSFIRP 132

Query: 126 KYDYEIPGNKVAFFAGADFTPHVI---------------------------------TVN 185
           KY  E  G      A A+FTPH+I                                 +V 
Sbjct: 133 KYQVENLGEWSPSSAAANFTPHIITVNAGEDVTKRIISFSQQGSLAICVLCANGVVSSVT 192

Query: 186 IG-----------EGRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGM 245
           +            EGRFEILSLSG++MPS   GT+SR+GGMSVSLA PDGRV+GGG++G+
Sbjct: 193 LRQPDSSGGTLTYEGRFEILSLSGTFMPSDSDGTRSRTGGMSVSLASPDGRVVGGGVAGL 252

Query: 246 LIAAGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPPPAIKPIGDKADSLDPNPAFT 271
           L+AA P+QVVVG+F    +QQE  PK     P ++     P +    + AD     P  +
Sbjct: 253 LVAATPIQVVVGTFLGGTNQQEQTPK-----PHNHNFMSSPLMPTSSNVADHRTIRPMTS 312

BLAST of Cp4.1LG05g07180 vs. Swiss-Prot
Match: AHL7_ARATH (AT-hook motif nuclear-localized protein 7 OS=Arabidopsis thaliana GN=AHL7 PE=2 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 8.4e-26
Identity = 101/272 (37.13%), Positives = 131/272 (48.16%), Query Frame = 1

Query: 40  PPPPPPATATVEAALTLAGTDGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGW 99
           PP   P  ++ EA+       GKK+RGRPRKY  +G  +P       SSS+PL  +    
Sbjct: 41  PPMEAPMPSSGEAS-------GKKRRGRPRKYEANGAPLP-------SSSVPLVKKRV-- 100

Query: 100 KRGRGRSVESLKKSRKYDYEIPGNK------VAFFAGADFTPHVI--------TVNI--- 159
            RG+    +  K  +   +   G +      V    G++FTPHVI        T+ I   
Sbjct: 101 -RGKLNGFDMKKMHKTIGFHSSGERFGVGGGVGGGVGSNFTPHVITVNTGEDITMRIISF 160

Query: 160 ---------------------------------GEGRFEILSLSGSYMPSKIGGTKSRSG 219
                                             EGRFEILSLSGS+M ++  G+K RSG
Sbjct: 161 SQQGPRAICILSANGVISNVTLRQPDSCGGTLTYEGRFEILSLSGSFMETENQGSKGRSG 220

Query: 220 GMSVSLACPDGRVMGGGLSGMLIAAGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAI-S 261
           GMSVSLA PDGRV+GGG++G+LIAA P+QVVVGSF     Q   KP+K R+E    A+ S
Sbjct: 221 GMSVSLAGPDGRVVGGGVAGLLIAATPIQVVVGSFITSDQQDHQKPRKQRVEHAPAAVMS 280

BLAST of Cp4.1LG05g07180 vs. Swiss-Prot
Match: AHL6_ARATH (AT-hook motif nuclear-localized protein 6 OS=Arabidopsis thaliana GN=AHL6 PE=2 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 2.8e-21
Identity = 63/117 (53.85%), Positives = 79/117 (67.52%), Query Frame = 1

Query: 144 EGRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIAAGPVQVVVGS 203
           EGRFEILSLSGS+MP++ GGTK R+GGMS+SLA P+G + GGGL+GMLIAAGPVQVV+GS
Sbjct: 215 EGRFEILSLSGSFMPTENGGTKGRAGGMSISLAGPNGNIFGGGLAGMLIAAGPVQVVMGS 274

Query: 204 FQPPGHQQENKPKKSRMEPTSNAISPPPAIKPIGDKADSLDPNPAFTTSPVNEKPPS 261
           F      ++N+ KK R+     A +PP    P   +     P P FT + VN   PS
Sbjct: 275 FIVMHQAEQNQKKKPRV---MEAFAPPQPQAP--PQLQQQQP-PTFTITTVNSTSPS 325

BLAST of Cp4.1LG05g07180 vs. TrEMBL
Match: A0A0A0KN92_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G169100 PE=4 SV=1)

HSP 1 Score: 229.9 bits (585), Expect = 3.7e-57
Identity = 132/202 (65.35%), Positives = 147/202 (72.77%), Query Frame = 1

Query: 1   MEEKEG-VDFGFALKVSQAPDSFGMMDSRPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           MEEKEG VDFGFA+KVSQAP+SFGMMD+RPEN+STD E PP  PP +  T  AA      
Sbjct: 1   MEEKEGGVDFGFAVKVSQAPESFGMMDTRPENSSTDGETPPQQPPASVPTAGAA------ 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVESLKKSRKYDYE 120
           DGKKKRGRPRKYGPDGTV PTLSPMPISSSIPL GEF GWKRGRGRSVES+KKSRK++YE
Sbjct: 61  DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLAGEFAGWKRGRGRSVESIKKSRKFEYE 120

Query: 121 IPGNKVAFFAGADFTPHVITVNIGE------------GRFEILSLSGSYMPSKIGGTKSR 180
           IPGNKVAFFAGADFTPHVITVNIGE            G   I  LS + M S +   +S 
Sbjct: 121 IPGNKVAFFAGADFTPHVITVNIGEDVNLKVMSFSQQGSRAICILSANGMVSNVTLRQST 180

Query: 181 SGGMSVSLACPDGRVMGGGLSG 190
           S G +++    +GR     LSG
Sbjct: 181 SSGGTLTY---EGRFEILSLSG 193

BLAST of Cp4.1LG05g07180 vs. TrEMBL
Match: A0A0D2UBA6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G283600 PE=4 SV=1)

HSP 1 Score: 224.2 bits (570), Expect = 2.1e-55
Identity = 148/328 (45.12%), Positives = 184/328 (56.10%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDS-RPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           ME+KEG   G  +K  +  D F +     P   S  + PP    PP        +     
Sbjct: 1   MEDKEGPTLGVTVKGDECLDGFHVASYVGPTAPSIVAPPPSMVVPPGHGNGSEMM----- 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVES-LKKSRKYDY 120
             KKKRGRPRKY PDG++  TLSPMPIS+SIP+ G+FPGW +G+ + V++ +KKS  Y+ 
Sbjct: 61  --KKKRGRPRKYAPDGSLAITLSPMPISASIPMNGDFPGWNQGKAQPVDTFIKKSLNYEL 120

Query: 121 EI-PGNKVAFFAGADFTPHVITVNIGE--------------------------------- 180
           E  PG+K+A+F G +FTPHVITVN GE                                 
Sbjct: 121 ETNPGDKIAYFVGTNFTPHVITVNAGEDVSMKVMFFSQQGAHAICVLSANGTISNVTLRQ 180

Query: 181 -----------GRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIA 240
                      G FEILSLSGSYMP+  GGTKSRSGGMS+SLA PDGRV+GGGL+G+L+A
Sbjct: 181 PTSSGGTLTYEGHFEILSLSGSYMPTNNGGTKSRSGGMSISLAGPDGRVLGGGLAGLLVA 240

Query: 241 AGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPP-----PA--------IKPI---- 263
           AGPVQVVVGSF  PGH+QE K KK R+EPT   ISP      PA        IKPI    
Sbjct: 241 AGPVQVVVGSFL-PGHKQEQKHKKQRIEPTVTIISPTATHSMPADINVSYGRIKPIFTYS 300

BLAST of Cp4.1LG05g07180 vs. TrEMBL
Match: A0A0B0PG27_GOSAR (Putative DNA-binding ESCAROLA-like protein OS=Gossypium arboreum GN=F383_07309 PE=4 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 4.6e-55
Identity = 147/328 (44.82%), Positives = 184/328 (56.10%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDS-RPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           ME+KEG   G  +K  +  D F +     P   S  + PP    PP   T    +     
Sbjct: 1   MEDKEGPTLGVTVKGDEGLDGFHVASYVGPTVPSIVAPPPSMVVPPGHGTGSEMM----- 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVES-LKKSRKYDY 120
             KKKRGRPRKY PDG++  TLSPMPIS+SIP+ G+FPGW +GR + V++ +KKS  Y+ 
Sbjct: 61  --KKKRGRPRKYAPDGSLAITLSPMPISASIPMNGDFPGWNQGRAQPVDTFIKKSLNYEL 120

Query: 121 EI-PGNKVAFFAGADFTPHVITVNIGE--------------------------------- 180
           E  PG+++A+F G +FTPHVITVN GE                                 
Sbjct: 121 ETNPGDRIAYFVGTNFTPHVITVNAGEDVSMKVMFFSQQGAHAICVLSANGTISNVTLRQ 180

Query: 181 -----------GRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIA 240
                      G FEILSLSGSYMP+  GGTKSRSGGMS+SLA PDGRV+GGGL+G+L+A
Sbjct: 181 PTSSGGTLTYEGHFEILSLSGSYMPTNNGGTKSRSGGMSISLAGPDGRVLGGGLAGLLVA 240

Query: 241 AGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPP-----PA--------IKPI---- 263
           AGPVQVVVGSF  P H+QE K KK R+EPT   +SP      PA        IKPI    
Sbjct: 241 AGPVQVVVGSFL-PSHKQEQKHKKQRIEPTVTIVSPTATDSMPADINVSYGGIKPILTYS 300

BLAST of Cp4.1LG05g07180 vs. TrEMBL
Match: M5X1Y0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008335mg PE=4 SV=1)

HSP 1 Score: 219.9 bits (559), Expect = 3.9e-54
Identity = 155/331 (46.83%), Positives = 188/331 (56.80%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDSRPENTSTDSEPPPPPPPPATATVEA-ALTLAGT 60
           MEEK+ +  G A+   +APD++ +   R EN S    P   P   A AT    +L L GT
Sbjct: 1   MEEKDNLVSGVAVSGEEAPDTYRIAP-RNENPS----PSGGPTMAAAATASPMSLALTGT 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVESLKKSRKYD-Y 120
           + KKKRGRPRKYGPD TV   LSPMPISSSIPLTGEF  WKRGRGR V+S+KKS KYD +
Sbjct: 61  EVKKKRGRPRKYGPDKTVSSALSPMPISSSIPLTGEFSAWKRGRGRPVDSVKKSHKYDVF 120

Query: 121 EIPGNKVAFFAGADFTPHVITVNIGEG--------------RFEILSLSGSY------MP 180
           E  G K+A+  GA+FTPHV+TV+ GE                  ILS +G+        P
Sbjct: 121 ESSGEKIAYSVGANFTPHVLTVHAGEDVTMKIMSFSQQGSRAICILSANGTISNVTLRQP 180

Query: 181 SKIGGTKS------------------------RSGGMSVSLACPDGRVMGGGLSGMLIAA 240
           S  GGT +                        RSGGMSV+LA PDGRV+GGGL+GMLIAA
Sbjct: 181 SSSGGTLTYEGRFEILSLSGSYIAIENAGTKSRSGGMSVALAGPDGRVVGGGLAGMLIAA 240

Query: 241 GPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISP-------------PPAIKPI----- 266
           GPVQVVVGSF  PGHQQE KPKK R+EP S++I P                +KPI     
Sbjct: 241 GPVQVVVGSFL-PGHQQEQKPKKQRLEPVSSSIVPIVVNAVSGEEMKVCGGVKPILTSPS 300

BLAST of Cp4.1LG05g07180 vs. TrEMBL
Match: F6HAK6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0022g01810 PE=4 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 4.3e-53
Identity = 135/289 (46.71%), Positives = 166/289 (57.44%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDSRPENTS------------------TDSEPPPPP 60
           ME  EG++ G  +K  +APD++ +  +R EN S                  T +  P P 
Sbjct: 1   MEGTEGINSGVTVKGEEAPDTYRVA-ARSENPSEFGGSTMTAVSPVAAPAPTPATAPAPA 60

Query: 61  PPPATATVEAALTLAGTDGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRG 120
           P  A      ++ +  ++ KKKRGRPRKYGP G++   LSPMPISSSIPLTGEF  WKRG
Sbjct: 61  PGSAPTPAPVSVAMPSSEMKKKRGRPRKYGPGGSLTMALSPMPISSSIPLTGEFSAWKRG 120

Query: 121 RGRSVESLKKSRKYDYEIPGNKVAFFAGA-------------DFTPHVITVN-------- 180
           RGR V+S KK  K + E  G +VA+  GA             D T  +I+ +        
Sbjct: 121 RGRPVDSFKKQHKSESESAGERVAYSVGANFTPHVITVNAGEDVTMKIISFSQQGSRAIC 180

Query: 181 -----------------------IGEGRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACP 228
                                    EGRFEILSLSGS+MPS+ GGTKSRSGGMSVSLA P
Sbjct: 181 ILSANGAISNVTLRQPNSSGGTLTYEGRFEILSLSGSFMPSESGGTKSRSGGMSVSLAGP 240

BLAST of Cp4.1LG05g07180 vs. TAIR10
Match: AT4G25320.1 (AT4G25320.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 166.8 bits (421), Expect = 2.0e-41
Identity = 123/296 (41.55%), Positives = 152/296 (51.35%), Query Frame = 1

Query: 1   MEEKEGVDFG------FALKVSQ---APDSFGMMDSRPENTSTDSEPPPPPPPPATATVE 60
           MEE+EG +        F LK      A D    MD  P   + +    PP   PA ATV 
Sbjct: 1   MEEREGTNINNNITSSFGLKQQHEAAASDGGYSMDPPPRPENPNPFLVPPTTVPAAATVA 60

Query: 61  AALT---------------LAGTDGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEF 120
           AA+T                +    KKKRGRPRKY PDGT+V TLSPMPISSS+PLT EF
Sbjct: 61  AAVTENAATPFSLTMPTENTSAEQLKKKRGRPRKYNPDGTLVVTLSPMPISSSVPLTSEF 120

Query: 121 PGWKRGRGRSVES--LKKSRKYDYE-------IPGNKVAFFAGADFTPHV--------IT 180
           P  KRGRGR   +  LKKS+ + ++       + G   A F GA+FTPHV        +T
Sbjct: 121 PPRKRGRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPHVLIVNAGEDVT 180

Query: 181 VNIG------------------------------------EGRFEILSLSGSYMPSKIGG 220
           + I                                     EGRFEILSL+GS+M +  GG
Sbjct: 181 MKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSLTGSFMQNDSGG 240

BLAST of Cp4.1LG05g07180 vs. TAIR10
Match: AT4G12080.1 (AT4G12080.1 AT-hook motif nuclear-localized protein 1)

HSP 1 Score: 159.5 bits (402), Expect = 3.1e-39
Identity = 124/309 (40.13%), Positives = 151/309 (48.87%), Query Frame = 1

Query: 6   GVDFGFALKVSQAPDSFGMMD-SRPENTSTDSEPPPPP---------PPPATATVEAALT 65
           G D G  +  S AP  F +   S   N S  S  PPPP         PP   +TV    T
Sbjct: 17  GNDGGITVVRSDAPSDFHVAQRSESSNQSPTSVTPPPPQPSSHHTAPPPLQISTVTTTTT 76

Query: 66  LAGTDG------KKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFP-----------G 125
            A  +G      KKKRGRPRKYGPDGTVV  LSP PISS+ P     P            
Sbjct: 77  TAAMEGISGGLMKKKRGRPRKYGPDGTVV-ALSPKPISSA-PAPSHLPPPSSHVIDFSAS 136

Query: 126 WKRGRGRSVESLKKSRKYDYEIP--GNKVAFFAGADFTPHVITVN--------------- 185
            KR + +   S  ++ KY +++   G       G +FTPH+ITVN               
Sbjct: 137 EKRSKVKPTNSFNRT-KYHHQVENLGEWAPCSVGGNFTPHIITVNTGEDVTMKIISFSQQ 196

Query: 186 -----------------------------IGEGRFEILSLSGSYMPSKIGGTKSRSGGMS 242
                                          EGRFEILSLSGS+MP+  GGT+SR+GGMS
Sbjct: 197 GPRSICVLSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRTGGMS 256

BLAST of Cp4.1LG05g07180 vs. TAIR10
Match: AT4G22770.1 (AT4G22770.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 150.6 bits (379), Expect = 1.5e-36
Identity = 122/321 (38.01%), Positives = 158/321 (49.22%), Query Frame = 1

Query: 6   GVDFGFALKVSQAPDSFGMMD-SRPENTSTDSEPPPPPPPPATATVEAALTLAGTDG--K 65
           G D G  +  S AP  F M   S   NT  +S  PPPPPPP  +   +A     + G  K
Sbjct: 13  GSDGGVTVVRSNAPSDFHMAPRSETSNTPPNSVAPPPPPPPQNSFTPSAAMDGFSSGPIK 72

Query: 66  KKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGW-----KRGRGRSVESLKKS---R 125
           K+RGRPRKYG DG  V TLSP PISS+ P T     +     KRG+ +       S    
Sbjct: 73  KRRGRPRKYGHDGAAV-TLSPNPISSAAPTTSHVIDFSTTSEKRGKMKPATPTPSSFIRP 132

Query: 126 KYDYEIPGNKVAFFAGADFTPHVI---------------------------------TVN 185
           KY  E  G      A A+FTPH+I                                 +V 
Sbjct: 133 KYQVENLGEWSPSSAAANFTPHIITVNAGEDVTKRIISFSQQGSLAICVLCANGVVSSVT 192

Query: 186 IG-----------EGRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGM 245
           +            EGRFEILSLSG++MPS   GT+SR+GGMSVSLA PDGRV+GGG++G+
Sbjct: 193 LRQPDSSGGTLTYEGRFEILSLSGTFMPSDSDGTRSRTGGMSVSLASPDGRVVGGGVAGL 252

Query: 246 LIAAGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPPPAIKPIGDKADSLDPNPAFT 271
           L+AA P+QVVVG+F    +QQE  PK     P ++     P +    + AD     P  +
Sbjct: 253 LVAATPIQVVVGTFLGGTNQQEQTPK-----PHNHNFMSSPLMPTSSNVADHRTIRPMTS 312

BLAST of Cp4.1LG05g07180 vs. TAIR10
Match: AT4G00200.1 (AT4G00200.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 119.0 bits (297), Expect = 4.7e-27
Identity = 101/272 (37.13%), Positives = 131/272 (48.16%), Query Frame = 1

Query: 40  PPPPPPATATVEAALTLAGTDGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGW 99
           PP   P  ++ EA+       GKK+RGRPRKY  +G  +P       SSS+PL  +    
Sbjct: 41  PPMEAPMPSSGEAS-------GKKRRGRPRKYEANGAPLP-------SSSVPLVKKRV-- 100

Query: 100 KRGRGRSVESLKKSRKYDYEIPGNK------VAFFAGADFTPHVI--------TVNI--- 159
            RG+    +  K  +   +   G +      V    G++FTPHVI        T+ I   
Sbjct: 101 -RGKLNGFDMKKMHKTIGFHSSGERFGVGGGVGGGVGSNFTPHVITVNTGEDITMRIISF 160

Query: 160 ---------------------------------GEGRFEILSLSGSYMPSKIGGTKSRSG 219
                                             EGRFEILSLSGS+M ++  G+K RSG
Sbjct: 161 SQQGPRAICILSANGVISNVTLRQPDSCGGTLTYEGRFEILSLSGSFMETENQGSKGRSG 220

Query: 220 GMSVSLACPDGRVMGGGLSGMLIAAGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAI-S 261
           GMSVSLA PDGRV+GGG++G+LIAA P+QVVVGSF     Q   KP+K R+E    A+ S
Sbjct: 221 GMSVSLAGPDGRVVGGGVAGLLIAATPIQVVVGSFITSDQQDHQKPRKQRVEHAPAAVMS 280

BLAST of Cp4.1LG05g07180 vs. TAIR10
Match: AT5G62260.1 (AT5G62260.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 104.0 bits (258), Expect = 1.6e-22
Identity = 63/117 (53.85%), Positives = 79/117 (67.52%), Query Frame = 1

Query: 144 EGRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIAAGPVQVVVGS 203
           EGRFEILSLSGS+MP++ GGTK R+GGMS+SLA P+G + GGGL+GMLIAAGPVQVV+GS
Sbjct: 215 EGRFEILSLSGSFMPTENGGTKGRAGGMSISLAGPNGNIFGGGLAGMLIAAGPVQVVMGS 274

Query: 204 FQPPGHQQENKPKKSRMEPTSNAISPPPAIKPIGDKADSLDPNPAFTTSPVNEKPPS 261
           F      ++N+ KK R+     A +PP    P   +     P P FT + VN   PS
Sbjct: 275 FIVMHQAEQNQKKKPRV---MEAFAPPQPQAP--PQLQQQQP-PTFTITTVNSTSPS 325

BLAST of Cp4.1LG05g07180 vs. NCBI nr
Match: gi|449451944|ref|XP_004143720.1| (PREDICTED: AT-hook motif nuclear-localized protein 3 [Cucumis sativus])

HSP 1 Score: 229.9 bits (585), Expect = 5.4e-57
Identity = 132/202 (65.35%), Positives = 147/202 (72.77%), Query Frame = 1

Query: 1   MEEKEG-VDFGFALKVSQAPDSFGMMDSRPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           MEEKEG VDFGFA+KVSQAP+SFGMMD+RPEN+STD E PP  PP +  T  AA      
Sbjct: 1   MEEKEGGVDFGFAVKVSQAPESFGMMDTRPENSSTDGETPPQQPPASVPTAGAA------ 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVESLKKSRKYDYE 120
           DGKKKRGRPRKYGPDGTV PTLSPMPISSSIPL GEF GWKRGRGRSVES+KKSRK++YE
Sbjct: 61  DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLAGEFAGWKRGRGRSVESIKKSRKFEYE 120

Query: 121 IPGNKVAFFAGADFTPHVITVNIGE------------GRFEILSLSGSYMPSKIGGTKSR 180
           IPGNKVAFFAGADFTPHVITVNIGE            G   I  LS + M S +   +S 
Sbjct: 121 IPGNKVAFFAGADFTPHVITVNIGEDVNLKVMSFSQQGSRAICILSANGMVSNVTLRQST 180

Query: 181 SGGMSVSLACPDGRVMGGGLSG 190
           S G +++    +GR     LSG
Sbjct: 181 SSGGTLTY---EGRFEILSLSG 193

BLAST of Cp4.1LG05g07180 vs. NCBI nr
Match: gi|659073161|ref|XP_008467286.1| (PREDICTED: uncharacterized protein LOC103504671 [Cucumis melo])

HSP 1 Score: 229.2 bits (583), Expect = 9.2e-57
Identity = 132/202 (65.35%), Positives = 147/202 (72.77%), Query Frame = 1

Query: 1   MEEKEG-VDFGFALKVSQAPDSFGMMDSRPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           MEEKEG VDFGFA+KVSQAP+SFGMMDSRPEN+STD E PP  PP +  +  AA      
Sbjct: 1   MEEKEGGVDFGFAVKVSQAPESFGMMDSRPENSSTDGETPPQQPPASVPSAGAA------ 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVESLKKSRKYDYE 120
           DGKKKRGRPRKYGPDGTV PTLSPMPISSSIPL GEF GWKRGRGRSVES+KKSRK++YE
Sbjct: 61  DGKKKRGRPRKYGPDGTVAPTLSPMPISSSIPLPGEFAGWKRGRGRSVESIKKSRKFEYE 120

Query: 121 IPGNKVAFFAGADFTPHVITVNIGE------------GRFEILSLSGSYMPSKIGGTKSR 180
           IPGNKVAFFAGADFTPHVITVNIGE            G   I  LS + M S +   +S 
Sbjct: 121 IPGNKVAFFAGADFTPHVITVNIGEDVNLKVMSFSQQGSRAICILSANGMVSNVTLRQST 180

Query: 181 SGGMSVSLACPDGRVMGGGLSG 190
           S G +++    +GR     LSG
Sbjct: 181 SSGGTLTY---EGRFEILSLSG 193

BLAST of Cp4.1LG05g07180 vs. NCBI nr
Match: gi|823214975|ref|XP_012440250.1| (PREDICTED: AT-hook motif nuclear-localized protein 6-like [Gossypium raimondii])

HSP 1 Score: 224.2 bits (570), Expect = 2.9e-55
Identity = 148/328 (45.12%), Positives = 184/328 (56.10%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDS-RPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           ME+KEG   G  +K  +  D F +     P   S  + PP    PP        +     
Sbjct: 1   MEDKEGPTLGVTVKGDECLDGFHVASYVGPTAPSIVAPPPSMVVPPGHGNGSEMM----- 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVES-LKKSRKYDY 120
             KKKRGRPRKY PDG++  TLSPMPIS+SIP+ G+FPGW +G+ + V++ +KKS  Y+ 
Sbjct: 61  --KKKRGRPRKYAPDGSLAITLSPMPISASIPMNGDFPGWNQGKAQPVDTFIKKSLNYEL 120

Query: 121 EI-PGNKVAFFAGADFTPHVITVNIGE--------------------------------- 180
           E  PG+K+A+F G +FTPHVITVN GE                                 
Sbjct: 121 ETNPGDKIAYFVGTNFTPHVITVNAGEDVSMKVMFFSQQGAHAICVLSANGTISNVTLRQ 180

Query: 181 -----------GRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIA 240
                      G FEILSLSGSYMP+  GGTKSRSGGMS+SLA PDGRV+GGGL+G+L+A
Sbjct: 181 PTSSGGTLTYEGHFEILSLSGSYMPTNNGGTKSRSGGMSISLAGPDGRVLGGGLAGLLVA 240

Query: 241 AGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPP-----PA--------IKPI---- 263
           AGPVQVVVGSF  PGH+QE K KK R+EPT   ISP      PA        IKPI    
Sbjct: 241 AGPVQVVVGSFL-PGHKQEQKHKKQRIEPTVTIISPTATHSMPADINVSYGRIKPIFTYS 300

BLAST of Cp4.1LG05g07180 vs. NCBI nr
Match: gi|728844434|gb|KHG23877.1| (Putative DNA-binding ESCAROLA -like protein [Gossypium arboreum])

HSP 1 Score: 223.0 bits (567), Expect = 6.6e-55
Identity = 147/328 (44.82%), Positives = 184/328 (56.10%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDS-RPENTSTDSEPPPPPPPPATATVEAALTLAGT 60
           ME+KEG   G  +K  +  D F +     P   S  + PP    PP   T    +     
Sbjct: 1   MEDKEGPTLGVTVKGDEGLDGFHVASYVGPTVPSIVAPPPSMVVPPGHGTGSEMM----- 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVES-LKKSRKYDY 120
             KKKRGRPRKY PDG++  TLSPMPIS+SIP+ G+FPGW +GR + V++ +KKS  Y+ 
Sbjct: 61  --KKKRGRPRKYAPDGSLAITLSPMPISASIPMNGDFPGWNQGRAQPVDTFIKKSLNYEL 120

Query: 121 EI-PGNKVAFFAGADFTPHVITVNIGE--------------------------------- 180
           E  PG+++A+F G +FTPHVITVN GE                                 
Sbjct: 121 ETNPGDRIAYFVGTNFTPHVITVNAGEDVSMKVMFFSQQGAHAICVLSANGTISNVTLRQ 180

Query: 181 -----------GRFEILSLSGSYMPSKIGGTKSRSGGMSVSLACPDGRVMGGGLSGMLIA 240
                      G FEILSLSGSYMP+  GGTKSRSGGMS+SLA PDGRV+GGGL+G+L+A
Sbjct: 181 PTSSGGTLTYEGHFEILSLSGSYMPTNNGGTKSRSGGMSISLAGPDGRVLGGGLAGLLVA 240

Query: 241 AGPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISPP-----PA--------IKPI---- 263
           AGPVQVVVGSF  P H+QE K KK R+EPT   +SP      PA        IKPI    
Sbjct: 241 AGPVQVVVGSFL-PSHKQEQKHKKQRIEPTVTIVSPTATDSMPADINVSYGGIKPILTYS 300

BLAST of Cp4.1LG05g07180 vs. NCBI nr
Match: gi|596003907|ref|XP_007218277.1| (hypothetical protein PRUPE_ppa008335mg [Prunus persica])

HSP 1 Score: 219.9 bits (559), Expect = 5.6e-54
Identity = 155/331 (46.83%), Positives = 188/331 (56.80%), Query Frame = 1

Query: 1   MEEKEGVDFGFALKVSQAPDSFGMMDSRPENTSTDSEPPPPPPPPATATVEA-ALTLAGT 60
           MEEK+ +  G A+   +APD++ +   R EN S    P   P   A AT    +L L GT
Sbjct: 1   MEEKDNLVSGVAVSGEEAPDTYRIAP-RNENPS----PSGGPTMAAAATASPMSLALTGT 60

Query: 61  DGKKKRGRPRKYGPDGTVVPTLSPMPISSSIPLTGEFPGWKRGRGRSVESLKKSRKYD-Y 120
           + KKKRGRPRKYGPD TV   LSPMPISSSIPLTGEF  WKRGRGR V+S+KKS KYD +
Sbjct: 61  EVKKKRGRPRKYGPDKTVSSALSPMPISSSIPLTGEFSAWKRGRGRPVDSVKKSHKYDVF 120

Query: 121 EIPGNKVAFFAGADFTPHVITVNIGEG--------------RFEILSLSGSY------MP 180
           E  G K+A+  GA+FTPHV+TV+ GE                  ILS +G+        P
Sbjct: 121 ESSGEKIAYSVGANFTPHVLTVHAGEDVTMKIMSFSQQGSRAICILSANGTISNVTLRQP 180

Query: 181 SKIGGTKS------------------------RSGGMSVSLACPDGRVMGGGLSGMLIAA 240
           S  GGT +                        RSGGMSV+LA PDGRV+GGGL+GMLIAA
Sbjct: 181 SSSGGTLTYEGRFEILSLSGSYIAIENAGTKSRSGGMSVALAGPDGRVVGGGLAGMLIAA 240

Query: 241 GPVQVVVGSFQPPGHQQENKPKKSRMEPTSNAISP-------------PPAIKPI----- 266
           GPVQVVVGSF  PGHQQE KPKK R+EP S++I P                +KPI     
Sbjct: 241 GPVQVVVGSFL-PGHQQEQKPKKQRLEPVSSSIVPIVVNAVSGEEMKVCGGVKPILTSPS 300

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
AHL3_ARATH	3.5e-40	41.55	AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana GN=AHL3 PE=1 S...
AHL1_ARATH	5.6e-38	40.13	AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana GN=AHL1 PE=1 S...
AHL2_ARATH	2.6e-35	38.01	AT-hook motif nuclear-localized protein 2 OS=Arabidopsis thaliana GN=AHL2 PE=2 S...
AHL7_ARATH	8.4e-26	37.13	AT-hook motif nuclear-localized protein 7 OS=Arabidopsis thaliana GN=AHL7 PE=2 S...
AHL6_ARATH	2.8e-21	53.85	AT-hook motif nuclear-localized protein 6 OS=Arabidopsis thaliana GN=AHL6 PE=2 S...

Match Name	E-value	Identity	Description
A0A0A0KN92_CUCSA	3.7e-57	65.35	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G169100 PE=4 SV=1
A0A0D2UBA6_GOSRA	2.1e-55	45.12	Uncharacterized protein OS=Gossypium raimondii GN=B456_008G283600 PE=4 SV=1
A0A0B0PG27_GOSAR	4.6e-55	44.82	Putative DNA-binding ESCAROLA-like protein OS=Gossypium arboreum GN=F383_07309 P...
M5X1Y0_PRUPE	3.9e-54	46.83	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008335mg PE=4 SV=1
F6HAK6_VITVI	4.3e-53	46.71	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0022g01810 PE=4 SV=...

Match Name	E-value	Identity	Description
AT4G25320.1	2.0e-41	41.55	AT hook motif DNA-binding family protein
AT4G12080.1	3.1e-39	40.13	AT-hook motif nuclear-localized protein 1
AT4G22770.1	1.5e-36	38.01	AT hook motif DNA-binding family protein
AT4G00200.1	4.7e-27	37.13	AT hook motif DNA-binding family protein
AT5G62260.1	1.6e-22	53.85	AT hook motif DNA-binding family protein

Match Name	E-value	Identity	Description
gi|449451944|ref|XP_004143720.1|	5.4e-57	65.35	PREDICTED: AT-hook motif nuclear-localized protein 3 [Cucumis sativus]
gi|659073161|ref|XP_008467286.1|	9.2e-57	65.35	PREDICTED: uncharacterized protein LOC103504671 [Cucumis melo]
gi|823214975|ref|XP_012440250.1|	2.9e-55	45.12	PREDICTED: AT-hook motif nuclear-localized protein 6-like [Gossypium raimondii]
gi|728844434|gb|KHG23877.1|	6.6e-55	44.82	Putative DNA-binding ESCAROLA -like protein [Gossypium arboreum]
gi|596003907|ref|XP_007218277.1|	5.6e-54	46.83	hypothetical protein PRUPE_ppa008335mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR005175	PPC_dom
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0003674	molecular_function
molecular_function	GO:0003677	DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG05g07180.1	Cp4.1LG05g07180.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005175	PPC domain	PFAM	PF03479	DUF296	coord: 135..205; score: 1.0
IPR005175	PPC domain	PROFILE	PS51742	PPC	coord: 86..225; score: 11
None	No IPR available	GENE3D	G3DSA:3.30.1330.80		coord: 144..220; score: 3.0
None	No IPR available	PANTHER	PTHR31500	FAMILY NOT NAMED	coord: 19..260; score: 2.2
None	No IPR available	PANTHER	PTHR31500:SF18	AT HOOK MOTIF DNA-BINDING FAMILY PROTEIN-RELATED	coord: 19..260; score: 2.2
None	No IPR available	unknown	SSF117856	AF0104/ALDC/Ptd012-like	coord: 142..217; score: 2.17

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
Cp4.1LG05g07180	Cucsa.128840	Cucumber (Gy14) v1	cgycpeB0340
Cp4.1LG05g07180	Cucsa.251430	Cucumber (Gy14) v1	cgycpeB0684
Cp4.1LG05g07180	CmaCh02G010070	Cucurbita maxima (Rimu)	cmacpeB646
Cp4.1LG05g07180	CmaCh15G013810	Cucurbita maxima (Rimu)	cmacpeB331
Cp4.1LG05g07180	CmaCh18G011940	Cucurbita maxima (Rimu)	cmacpeB453
Cp4.1LG05g07180	CmoCh18G012150	Cucurbita moschata (Rifu)	cmocpeB414
Cp4.1LG05g07180	CmoCh15G014440	Cucurbita moschata (Rifu)	cmocpeB295
Cp4.1LG05g07180	CmoCh02G010220	Cucurbita moschata (Rifu)	cmocpeB595
Cp4.1LG05g07180	Cla003576	Watermelon (97103) v1	cpewmB710
Cp4.1LG05g07180	Cla006251	Watermelon (97103) v1	cpewmB705
Cp4.1LG05g07180	Csa3G176860	Cucumber (Chinese Long) v2	cpecuB725
Cp4.1LG05g07180	Csa5G169100	Cucumber (Chinese Long) v2	cpecuB737
Cp4.1LG05g07180	MELO3C006833	Melon (DHL92) v3.5.1	cpemeB701
Cp4.1LG05g07180	MELO3C005419	Melon (DHL92) v3.5.1	cpemeB648
Cp4.1LG05g07180	ClCG01G005100	Watermelon (Charleston Gray)	cpewcgB640
Cp4.1LG05g07180	ClCG05G007480	Watermelon (Charleston Gray)	cpewcgB678
Cp4.1LG05g07180	CSPI05G07950	Wild cucumber (PI 183967)	cpecpiB739
Cp4.1LG05g07180	CSPI03G16150	Wild cucumber (PI 183967)	cpecpiB725
Cp4.1LG05g07180	Lsi09G005390	Bottle gourd (USVL1VR-Ls)	cpelsiB583
Cp4.1LG05g07180	MELO3C005419.2	Melon (DHL92) v3.6.1	cpemedB775
Cp4.1LG05g07180	MELO3C006833.2	Melon (DHL92) v3.6.1	cpemedB835
Cp4.1LG05g07180	CsaV3_5G005390	Cucumber (Chinese Long) v3	cpecucB0916
Cp4.1LG05g07180	CsaV3_3G016340	Cucumber (Chinese Long) v3	cpecucB0903
Cp4.1LG05g07180	Bhi12G000507	Wax gourd	cpewgoB0918
Cp4.1LG05g07180	CsGy5G005350	Cucumber (Gy14) v2	cgybcpeB692
Cp4.1LG05g07180	Carg20835	Silver-seed gourd	carcpeB1440
Cp4.1LG05g07180	Carg08470	Silver-seed gourd	carcpeB1002
The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG05g07180	Cp4.1LG09g01350	Cucurbita pepo (Zucchini)	cpecpeB057
Cp4.1LG05g07180	Cp4.1LG13g00830	Cucurbita pepo (Zucchini)	cpecpeB222
The following block(s) are covering this gene:
Gene	Organism	Block
Cp4.1LG05g07180	Cucurbita pepo (Zucchini)	cpecpeB245
Cp4.1LG05g07180	Cucurbita moschata (Rifu)	cmocpeB322
Cp4.1LG05g07180	Wax gourd	cpewgoB0919