Cucsa.023030.3 (mRNA) Cucumber (Gy14) v1

NameCucsa.023030.3
TypemRNA
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionAT hook motif DNA-binding family protein
Locationscaffold00320 : 322184 .. 325291 (-)
Sequence length1165
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGGGAGGACAAGGGGATACCGGAAATGGGATTGAGGCGGTGGCTGGATCAAATAATCCAAGACCGGAGACCGCAGCTCCACCATCGGAAGGGGGTGGTCCGGCGGGTTCTGCAGCGGAGGCAGGTAAGAAGAAGAGGGGGAGACCGAGGAAGTATGGGCCGGACGGGAAGTTGAATGTGGCGGCACTGTCGCCGAAGCCCATATCGGCGTCGGCACCGGCACCGGCAGCTGTTATTGATTTCTCGGCGGAAAAACGTGGAAAAGTGCGGCCGGCGAGTTCCTTGACCAAAACCAAATATGAAGTGGAGAATTTAGGTAATAAACTTTTGACTTAAACAGTGTTGAGTTTGTGTGGACCTTTGATTTTGAGTCATATAGATGGAAGAAAAAAGGCTATAGCTATTTTGTAGGATCACTAATAACTGTTACATATAAATTCAGAGTATTTTTTCTAAGAAAAGTTTATCTCTATCATAAATAGTGAAAAATTACTAATCATATTGAAGTAACCTAATATATTCTTGAGACCTGAGGGACAACAGTTGTCCATGCTTTCTATGGTTTTTATTAAATTCCTTTTATGTTAATGAAAAAATCTCAAAGTCTTGATGTCAAGTAATCAATTAGCAGTGTGATTGGAAATCTTAATATGATCATAAAACTATTTTTGAATATGGTAGCCGACTTAGTTCATATATGAAAGATTAATTAGCTTAAACGACGATTGAAATTAATGAATGTAGATTTGCTTGCGAAATATGGTTTAGACCAGTTTAAGCACAAACATAATGCTGATTGGAAGAAAAAAAGGTCTTGATAGTAGAACTAAGTAAATTTTCAATAGACCAAAAAAAACTTCCTTTTATCATTACTAGATCATGATAGACCTAGATTATCTAAAAGACAGAATAATATAATACAATACATCCAAATATATATCTATAATTATGTTTATGCTTATACATATTGAAAATAGTTTTACCATTTAAAATAAAATAATTTATCTTTATTAAAATGAGGGTTGAAACATAAAATCTAAATATGTGATTTTATATTATTTTAGTGAGTGTTGAGTTGAGTTATTTTTTCATTTCATTGTAGGTGAATGGGTACCTTGTTCTGTTGGTGCCAATTTTACACCCCATATCATCACCGTAAGCAGTGGGGAGGTAATTTTGTTATGAATTTTCTAAATTTTGTAGCTCAATGTTTAATTAGATTAGTCAACACATTTCTACGAGGATGTGACAATATATATAAGATGTTATTTTGTGATAATTTTAGGACGTAACAATGAAGGTTCTCTCATTCTCTCAACAAGGACCTCGAGCAATCTGCATTCTGTCTGCTAACGGCGTGATTTCGAGCGTAACGCTTCGTCAACCCGACTCCTCCGGTGGAACCTTAACATATGAGGTTATAACTTTATTAATCTCCCATTTATGCATGTATAACTGAAATGAAATAAAACATATTTGAAAATGATTATGAGAGAGTTAAACGTTAATTTTGTCATTTTAAAAATTACTCTAAAATATGTAATTTTAATCATTTTTTGTCGTATTAGCAAATATATATAGCAGAGTGCAAAATAATTTGCATATATAACAAAGTAATTTAGATCCAATTGTTGGAGTCAATCACTAGTATGAGTTTTTCATCCATAGATTGTGGTATAATTATAAATCTTTTATTAGATGATGTTATACACTTAATTATTAATCCTAATAGTATTATTCATTGCACTTAGCTGATCTTAAGTTAATTTGAGTGACTAAATACATTTTTTTAGAGTGATATTAGACATTAACAAAGTGATTTTAACAATTTCAAAATAATTATGTAATTTTAGGGTCGTTTTGAAATATTGTCGTTGTCGGGATCATTCATGCCGAGTGACAGTATAGGAACAAAGAGCAGAATTGGTGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATGAGTGTCTCCTTAGCAAGCCCAGACGGACGAGTTGTTGGTGGTGGAGTTGCTGGTTTGTTAGTAGCTGCAAGTCCAGTTCAAGTATGTGCCTTCTCCTTTCACTACCCGTAATTTTAAACTACGAAACAAAATTTTGTTGTTTGAGCTTATATTTCTTCTTTTATTACTACTTTCGAAAATGATAGGTGGTTGTAGGAAGCTTTATATCTGGTAACCAACATGAGCAAAAGCCGAAGAAGCCAAAACACGATGTTGTATTACCGGTTTCTACATTTCCAATCTCTAGTGTTGAACCAAAATCATACAAGACGACGACAACTATGACAACATCTTCGTTTCGTGCGGAAACATGGTCACCTAATGTAGTTCCAGATTTAAGAAGTCAACCAACTGATATCAATGTATCATTAACTAGTGGTTGATATATATTGTCGTTGTATTGTTCAACCATTACTGACCACATCGTTGTCGAGGTTCTATCCTGAGTAGGTGTCATTGTTTGTTGTTTACGTTGATGGACGATTGACAATTTTTTTTTTCTTTTTACAAGTAGTATTTTAGTTTTGACACTCCTTTTTAGGTAGTAATTGAAAGATATATCATTGTCTAGGGCTTCAACTATGTTACGTAATGTAGAATATATAGTTTGATTGCAAAATTCTAGATTCAACCTTTTAATTTAAACTTGAT

mRNA sequence

ATGACGGGAGGACAAGGGGATACCGGAAATGGGATTGAGGCGGTGGCTGGATCAAATAATCCAAGACCGGAGACCGCAGCTCCACCATCGGAAGGGGGTGGTCCGGCGGGTTCTGCAGCGGAGGCAGGTAAGAAGAAGAGGGGGAGACCGAGGAAGTATGGGCCGGACGGGAAGTTGAATGTGGCGGCACTGTCGCCGAAGCCCATATCGGCGTCGGCACCGGCACCGGCAGCTGTTATTGATTTCTCGGCGGAAAAACGTGGAAAAGTGCGGCCGGCGAGTTCCTTGACCAAAACCAAATATGAAGTGGAGAATTTAGGTGAATGGGTACCTTGTTCTGTTGGTGCCAATTTTACACCCCATATCATCACCGTAAGCAGTGGGGAGGACGTAACAATGAAGGTTCTCTCATTCTCTCAACAAGGACCTCGAGCAATCTGCATTCTGTCTGCTAACGGCGTGATTTCGAGCGTAACGCTTCGTCAACCCGACTCCTCCGGTGGAACCTTAACATATGAGGGTCGTTTTGAAATATTGTCGTTGTCGGGATCATTCATGCCGAGTGACAGTATAGGAACAAAGAGCAGAATTGCAAGCCCAGACGGACGAGTTGTTGGTGGTGGAGTTGCTGGTTTGTTAGTAGCTGCAAGTCCAGTTCAAGTGGTTGTAGGAAGCTTTATATCTGGTAACCAACATGAGCAAAAGCCGAAGAAGCCAAAACACGATGTTGTATTACCGGTTTCTACATTTCCAATCTCTAGTGTTGAACCAAAATCATACAAGACGACGACAACTATGACAACATCTTCGTTTCGTGCGGAAACATGGTCACCTAATGTAGTTCCAGATTTAAGAAGTCAACCAACTGATATCAATGTATCATTAACTAGTGGTTGATATATATTGTCGTTGTATTGTTCAACCATTACTGACCACATCGTTGTCGAGGTTCTATCCTGAGTAGGTGTCATTGTTTGTTGTTTACGTTGATGGACGATTGACAATTTTTTTTTTCTTTTTACAAGTAGTATTTTAGTTTTGACACTCCTTTTTAGGTAGTAATTGAAAGATATATCATTGTCTAGGGCTTCAACTATGTTACGTAATGTAGAATATATAGTTTGATTGCAAAATTCTAGATTCAACCTTTTAATTTAAACTTGAT

Coding sequence (CDS)

ATGACGGGAGGACAAGGGGATACCGGAAATGGGATTGAGGCGGTGGCTGGATCAAATAATCCAAGACCGGAGACCGCAGCTCCACCATCGGAAGGGGGTGGTCCGGCGGGTTCTGCAGCGGAGGCAGGTAAGAAGAAGAGGGGGAGACCGAGGAAGTATGGGCCGGACGGGAAGTTGAATGTGGCGGCACTGTCGCCGAAGCCCATATCGGCGTCGGCACCGGCACCGGCAGCTGTTATTGATTTCTCGGCGGAAAAACGTGGAAAAGTGCGGCCGGCGAGTTCCTTGACCAAAACCAAATATGAAGTGGAGAATTTAGGTGAATGGGTACCTTGTTCTGTTGGTGCCAATTTTACACCCCATATCATCACCGTAAGCAGTGGGGAGGACGTAACAATGAAGGTTCTCTCATTCTCTCAACAAGGACCTCGAGCAATCTGCATTCTGTCTGCTAACGGCGTGATTTCGAGCGTAACGCTTCGTCAACCCGACTCCTCCGGTGGAACCTTAACATATGAGGGTCGTTTTGAAATATTGTCGTTGTCGGGATCATTCATGCCGAGTGACAGTATAGGAACAAAGAGCAGAATTGCAAGCCCAGACGGACGAGTTGTTGGTGGTGGAGTTGCTGGTTTGTTAGTAGCTGCAAGTCCAGTTCAAGTGGTTGTAGGAAGCTTTATATCTGGTAACCAACATGAGCAAAAGCCGAAGAAGCCAAAACACGATGTTGTATTACCGGTTTCTACATTTCCAATCTCTAGTGTTGAACCAAAATCATACAAGACGACGACAACTATGACAACATCTTCGTTTCGTGCGGAAACATGGTCACCTAATGTAGTTCCAGATTTAAGAAGTCAACCAACTGATATCAATGTATCATTAACTAGTGGTTGA

Protein sequence

MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSRIASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQKPKKPKHDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINVSLTSG*
BLAST of Cucsa.023030.3 vs. Swiss-Prot
Match: AHL1_ARATH (AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana GN=AHL1 PE=1 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 1.9e-89
Identity = 182/273 (66.67%), Postives = 214/273 (78.39%), Query Frame = 1

Query: 44  KKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAA-------VIDFSA-EKRGKVRPASS 103
           KKKRGRPRKYGPDG   V ALSPKPIS SAPAP+        VIDFSA EKR KV+P +S
Sbjct: 89  KKKRGRPRKYGPDG--TVVALSPKPIS-SAPAPSHLPPPSSHVIDFSASEKRSKVKPTNS 148

Query: 104 LTKTKY--EVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFSQQGPRAICILSANG 163
             +TKY  +VENLGEW PCSVG NFTPHIITV++GEDVTMK++SFSQQGPR+IC+LSANG
Sbjct: 149 FNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGEDVTMKIISFSQQGPRSICVLSANG 208

Query: 164 VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSR-------IASPDGRVVG 223
           VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMP+DS GT+SR       +ASPDGRVVG
Sbjct: 209 VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRTGGMSVSLASPDGRVVG 268

Query: 224 GGVAGLLVAASPVQVVVGSFISGNQH-EQKPKKPKHDVVL--PVSTFPISSVEPKSYKTT 283
           GG+AGLLVAASPVQVVVGSF++G  H +QKPKK KHD +L  P +  PISS     ++T 
Sbjct: 269 GGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDFMLSSPTAAIPISSA--ADHRTI 328

Query: 284 TTMTTSSFRAETWSPNVVPDLRSQPTDINVSLT 297
            ++++      TW  ++  D R++ TDINV++T
Sbjct: 329 HSVSSLPVNNNTWQTSLASDPRNKHTDINVNVT 356

BLAST of Cucsa.023030.3 vs. Swiss-Prot
Match: AHL2_ARATH (AT-hook motif nuclear-localized protein 2 OS=Arabidopsis thaliana GN=AHL2 PE=2 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 7.6e-78
Identity = 168/297 (56.57%), Postives = 206/297 (69.36%), Query Frame = 1

Query: 19  NNPRPETAAPPSEGGGPA----GSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAP 78
           N+  P    PP     P+    G ++   KK+RGRPRKYG DG      LSP PIS++AP
Sbjct: 43  NSVAPPPPPPPQNSFTPSAAMDGFSSGPIKKRRGRPRKYGHDGA--AVTLSPNPISSAAP 102

Query: 79  APAAVIDFS--AEKRGKVRPA----SSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSG 138
             + VIDFS  +EKRGK++PA    SS  + KY+VENLGEW P S  ANFTPHIITV++G
Sbjct: 103 TTSHVIDFSTTSEKRGKMKPATPTPSSFIRPKYQVENLGEWSPSSAAANFTPHIITVNAG 162

Query: 139 EDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPS 198
           EDVT +++SFSQQG  AIC+L ANGV+SSVTLRQPDSSGGTLTYEGRFEILSLSG+FMPS
Sbjct: 163 EDVTKRIISFSQQGSLAICVLCANGVVSSVTLRQPDSSGGTLTYEGRFEILSLSGTFMPS 222

Query: 199 DSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISG-NQHEQKPKKPK 258
           DS GT+SR       +ASPDGRVVGGGVAGLLVAA+P+QVVVG+F+ G NQ EQ PK   
Sbjct: 223 DSDGTRSRTGGMSVSLASPDGRVVGGGVAGLLVAATPIQVVVGTFLGGTNQQEQTPKPHN 282

Query: 259 HDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPT-DINVSLT 297
           H+ +   S    +S     ++T   M TSS    TW+P+   D R + + D N++LT
Sbjct: 283 HNFM--SSPLMPTSSNVADHRTIRPM-TSSLPISTWTPSFPSDSRHKHSHDFNITLT 334

BLAST of Cucsa.023030.3 vs. Swiss-Prot
Match: AHL7_ARATH (AT-hook motif nuclear-localized protein 7 OS=Arabidopsis thaliana GN=AHL7 PE=2 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 5.6e-57
Identity = 141/292 (48.29%), Postives = 182/292 (62.33%), Query Frame = 1

Query: 28  PPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAAVIDFSAEK- 87
           PP E   P+   A +GKK+RGRPRKY  +G               AP P++ +    ++ 
Sbjct: 41  PPMEAPMPSSGEA-SGKKRRGRPRKYEANG---------------APLPSSSVPLVKKRV 100

Query: 88  RGKVR--PASSLTKT-----KYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFS 147
           RGK+       + KT       E   +G  V   VG+NFTPH+ITV++GED+TM+++SFS
Sbjct: 101 RGKLNGFDMKKMHKTIGFHSSGERFGVGGGVGGGVGSNFTPHVITVNTGEDITMRIISFS 160

Query: 148 QQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSR--- 207
           QQGPRAICILSANGVIS+VTLRQPDS GGTLTYEGRFEILSLSGSFM +++ G+K R   
Sbjct: 161 QQGPRAICILSANGVISNVTLRQPDSCGGTLTYEGRFEILSLSGSFMETENQGSKGRSGG 220

Query: 208 ----IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE-QKPKKPK--HDVVLPVST 267
               +A PDGRVVGGGVAGLL+AA+P+QVVVGSFI+ +Q + QKP+K +  H     +S 
Sbjct: 221 MSVSLAGPDGRVVGGGVAGLLIAATPIQVVVGSFITSDQQDHQKPRKQRVEHAPAAVMSV 280

Query: 268 FPISSVEPKSYKTTT------TMTTSSFRAETWSPNVVPDLRSQPTDINVSL 296
            P  S  P +    +          SSF   +W+ N     R+  TDIN+SL
Sbjct: 281 PPPPSPPPPAASVFSPTNPDREQPPSSFGISSWT-NGQDMPRNSATDINISL 315

BLAST of Cucsa.023030.3 vs. Swiss-Prot
Match: AHL3_ARATH (AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana GN=AHL3 PE=1 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 3.1e-55
Identity = 131/269 (48.70%), Postives = 169/269 (62.83%), Query Frame = 1

Query: 12  IEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISA 71
           + A    N   P +   P+E      ++AE  KKKRGRPRKY PDG L V  LSP PIS+
Sbjct: 59  VAAAVTENAATPFSLTMPTEN-----TSAEQLKKKRGRPRKYNPDGTL-VVTLSPMPISS 118

Query: 72  SAPAPAAVIDFSAEKRGKVRPASS----------LTKTKYEVENLGEWVPCSVGANFTPH 131
           S P  +   +F   KRG+ R  S+            ++  +    G      VGANFTPH
Sbjct: 119 SVPLTS---EFPPRKRGRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPH 178

Query: 132 IITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 191
           ++ V++GEDVTMK+++FSQQG RAICILSANG IS+VTLRQ  +SGGTLTYEGRFEILSL
Sbjct: 179 VLIVNAGEDVTMKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSL 238

Query: 192 SGSFMPSDSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQ 251
           +GSFM +DS GT+SR       +A PDGRV GGG+AGL +AA PVQV+VG+FI+G +  Q
Sbjct: 239 TGSFMQNDSGGTRSRAGGMSVCLAGPDGRVFGGGLAGLFLAAGPVQVMVGTFIAGQEQSQ 298

Query: 252 ----KPKKPKHDVVLPVSTFPISSVEPKS 260
               K ++ +        +F IS+ E K+
Sbjct: 299 LELAKERRLRFGAQPSSISFNISAEERKA 318

BLAST of Cucsa.023030.3 vs. Swiss-Prot
Match: AHL6_ARATH (AT-hook motif nuclear-localized protein 6 OS=Arabidopsis thaliana GN=AHL6 PE=2 SV=1)

HSP 1 Score: 213.8 bits (543), Expect = 2.6e-54
Identity = 135/280 (48.21%), Postives = 170/280 (60.71%), Query Frame = 1

Query: 16  AGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVA----ALSPKPISA 75
           A S+ P P T  P   G   A + ++  KKKRGRPRKY PDG LN       LSP PIS+
Sbjct: 51  APSSAPVPTTVTP---GSATASTGSDPTKKKRGRPRKYAPDGSLNPRFLRPTLSPTPISS 110

Query: 76  SAPAPAAVIDFSAEKRGKVR----PASSLTKT-KYEVENLGEWVP-----CSVGANFTPH 135
           S P      D+   KRGK +    P   + K+ K+E  +     P     C VGANFT H
Sbjct: 111 SIPLSG---DYQW-KRGKAQQQHQPLEFVKKSHKFEYGSPAPTPPLPGLSCYVGANFTTH 170

Query: 136 IITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 195
             TV+ GEDVTMKV+ +SQQG RAICILSA G IS+VTL QP ++GGTLTYEGRFEILSL
Sbjct: 171 QFTVNGGEDVTMKVMPYSQQGSRAICILSATGSISNVTLGQPTNAGGTLTYEGRFEILSL 230

Query: 196 SGSFMPSDSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQ 255
           SGSFMP+++ GTK R       +A P+G + GGG+AG+L+AA PVQVV+GSFI  +Q EQ
Sbjct: 231 SGSFMPTENGGTKGRAGGMSISLAGPNGNIFGGGLAGMLIAAGPVQVVMGSFIVMHQAEQ 290

Query: 256 KPKKPKHDVVL-----PVSTFPISSVEPKSYKTTTTMTTS 270
             KK    +       P +   +   +P ++  TT  +TS
Sbjct: 291 NQKKKPRVMEAFAPPQPQAPPQLQQQQPPTFTITTVNSTS 323

BLAST of Cucsa.023030.3 vs. TrEMBL
Match: A0A0A0KL67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G010650 PE=4 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 5.3e-163
Identity = 297/305 (97.38%), Postives = 297/305 (97.38%), Query Frame = 1

Query: 1   MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60
           MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN
Sbjct: 1   MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60

Query: 61  VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP 120
           VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP
Sbjct: 61  VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP 120

Query: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180
           HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS
Sbjct: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180

Query: 181 LSGSFMPSDSIGTKSRI-------ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE 240
           LSGSFMPSDSIGTKSRI       ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE
Sbjct: 181 LSGSFMPSDSIGTKSRIGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE 240

Query: 241 QKPKKPKHDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV 299
           QKPKKPKHDVVLPV TFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV
Sbjct: 241 QKPKKPKHDVVLPVYTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV 300

BLAST of Cucsa.023030.3 vs. TrEMBL
Match: M5WUW0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008806mg PE=4 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 2.9e-100
Identity = 205/293 (69.97%), Postives = 230/293 (78.50%), Query Frame = 1

Query: 16  AGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPA 75
           AGS  P P   APP     PA ++    KKKRGRPRKYGPDG + +A LSPKPIS+SAP 
Sbjct: 36  AGSTPPAP--VAPPPAAALPAAASLPM-KKKRGRPRKYGPDGSVTMA-LSPKPISSSAPP 95

Query: 76  PAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKV 135
           P  VIDFSAEKRGKV+P SS++KTKYEVENLGEWV CSVGANFTPHIITV+SGEDV MK+
Sbjct: 96  P--VIDFSAEKRGKVKPTSSVSKTKYEVENLGEWVACSVGANFTPHIITVNSGEDVMMKI 155

Query: 136 LSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKS 195
           +SFSQQGPRAIC+LSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMP+++ GT+S
Sbjct: 156 ISFSQQGPRAICVLSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNETGGTRS 215

Query: 196 R-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQKPKKPKHDVV---L 255
           R       +ASPDGRVVGGGVAGLLVAASPVQVVVGSF+SGNQHEQKPKK KHD +    
Sbjct: 216 RSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLSGNQHEQKPKKQKHDYISNAT 275

Query: 256 PVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINVSLTSG 299
           P    PISSV+PK   +++T    SFR + WS   +P      TDINVSL  G
Sbjct: 276 PTMAVPISSVDPKPNFSSST----SFRGDNWSS--LPSDPKTKTDINVSLPGG 316

BLAST of Cucsa.023030.3 vs. TrEMBL
Match: A0A0A0KP42_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G157940 PE=4 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 4.9e-100
Identity = 202/306 (66.01%), Postives = 238/306 (77.78%), Query Frame = 1

Query: 2   TGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNV 61
           TGG   T  G ++ +  +     +  PP     P  +++  GKKKRGRPRKYGPDG +++
Sbjct: 37  TGGS-TTPPGTQSTSTPSASAQVSGQPPP----PTAASSVPGKKKRGRPRKYGPDGSVSM 96

Query: 62  AALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPH 121
           A LSPKPIS S P P  VIDFS EK+GKVRPAS+++K+K+EV+NLG+WVPCS+GANFTPH
Sbjct: 97  A-LSPKPISLSVPPP--VIDFSTEKKGKVRPASAVSKSKFEVDNLGDWVPCSLGANFTPH 156

Query: 122 IITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 181
           IITV++GEDVTMK++SFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL
Sbjct: 157 IITVNAGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 216

Query: 182 SGSFMPSDSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQ 241
           SGSFMPSD+  T+SR       +ASPDGRVVGGGVAGLLVAASPVQVVVGSF+SGNQHEQ
Sbjct: 217 SGSFMPSDNGATRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLSGNQHEQ 276

Query: 242 KPKKPKHDVVL---PVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDI 298
           KPKKPKHD +    P +  PIS V+PKS        +SSFR + WS  +  D R++ TDI
Sbjct: 277 KPKKPKHDTISPAPPTAAIPISCVDPKS----NLSPSSSFRGDNWS-MLPTDSRNKSTDI 329

BLAST of Cucsa.023030.3 vs. TrEMBL
Match: A0A061GBU0_THECC (AT-hook motif nuclear-localized protein 1 isoform 2 OS=Theobroma cacao GN=TCM_016097 PE=4 SV=1)

HSP 1 Score: 372.1 bits (954), Expect = 6.4e-100
Identity = 199/288 (69.10%), Postives = 233/288 (80.90%), Query Frame = 1

Query: 21  PRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAAVI 80
           P P+TAA P     P   +    KKKRGRPRKYGPDG + +A LSPKPIS +AP P  +I
Sbjct: 55  PPPQTAAQPVPP--PVSVSGLPVKKKRGRPRKYGPDGSVTMA-LSPKPISTAAPPP--LI 114

Query: 81  DFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFSQ 140
           DFSA KRGKV+  +S++K KYE+ENLGEWV CSVGANFTPHIITV++GEDVTMK++SFSQ
Sbjct: 115 DFSAGKRGKVKSPTSVSKAKYELENLGEWVACSVGANFTPHIITVNAGEDVTMKIISFSQ 174

Query: 141 QGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSR---- 200
           QGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDS GT+SR    
Sbjct: 175 QGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRSRSGGM 234

Query: 201 ---IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQKPKKPKHDVV---LPVSTF 260
              +ASPDGRVVGGGVAGLLVAASPVQVVVGSF++GNQHEQKPKK KH+ +    P++  
Sbjct: 235 SVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKPKKQKHEPISAATPMAAI 294

Query: 261 PISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINVSLTSG 299
           P+SS +PKS      ++TSSFR ++WS ++  D R++PTDINVSL +G
Sbjct: 295 PVSSADPKS-----NLSTSSFRGDSWS-SLPSDSRNKPTDINVSLPAG 331

BLAST of Cucsa.023030.3 vs. TrEMBL
Match: V4TPS8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10021222mg PE=4 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 4.2e-99
Identity = 201/293 (68.60%), Postives = 234/293 (79.86%), Query Frame = 1

Query: 18  SNNPRPETAAPP--SEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPA 77
           S NP   +A PP  ++   PA   A   KKKRGRPRKYGPDG + +A LSPKPIS++AP+
Sbjct: 34  SENPTLTSAPPPPATQPPAPAPPPALPLKKKRGRPRKYGPDGTVTMA-LSPKPISSAAPS 93

Query: 78  PAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKV 137
           P  VIDFSAEK  KV+PASS +K+KYEVEN+GEWV CSVGANFTPHIITV++GEDVTMK+
Sbjct: 94  PP-VIDFSAEKPRKVKPASSFSKSKYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKI 153

Query: 138 LSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKS 197
           +SFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDS GT+S
Sbjct: 154 ISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRS 213

Query: 198 R-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQKPKKPKHD---VVL 257
           R       +ASPDGRVVGGGVAGLLVAASPVQVVVGSF++GNQHEQK KK K++   +  
Sbjct: 214 RSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKHKKQKNEPISIAT 273

Query: 258 PVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINVSLTSG 299
           P +  PISS +PK      ++TTS+FR + WS ++  D R++PTDIN SL  G
Sbjct: 274 PTAAIPISSADPKG-----SLTTSTFRGDGWS-SLPSDSRNKPTDINASLPVG 318

BLAST of Cucsa.023030.3 vs. TAIR10
Match: AT4G12080.1 (AT4G12080.1 AT-hook motif nuclear-localized protein 1)

HSP 1 Score: 330.5 bits (846), Expect = 1.1e-90
Identity = 182/273 (66.67%), Postives = 214/273 (78.39%), Query Frame = 1

Query: 44  KKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAA-------VIDFSA-EKRGKVRPASS 103
           KKKRGRPRKYGPDG   V ALSPKPIS SAPAP+        VIDFSA EKR KV+P +S
Sbjct: 89  KKKRGRPRKYGPDG--TVVALSPKPIS-SAPAPSHLPPPSSHVIDFSASEKRSKVKPTNS 148

Query: 104 LTKTKY--EVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFSQQGPRAICILSANG 163
             +TKY  +VENLGEW PCSVG NFTPHIITV++GEDVTMK++SFSQQGPR+IC+LSANG
Sbjct: 149 FNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGEDVTMKIISFSQQGPRSICVLSANG 208

Query: 164 VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSR-------IASPDGRVVG 223
           VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMP+DS GT+SR       +ASPDGRVVG
Sbjct: 209 VISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPNDSGGTRSRTGGMSVSLASPDGRVVG 268

Query: 224 GGVAGLLVAASPVQVVVGSFISGNQH-EQKPKKPKHDVVL--PVSTFPISSVEPKSYKTT 283
           GG+AGLLVAASPVQVVVGSF++G  H +QKPKK KHD +L  P +  PISS     ++T 
Sbjct: 269 GGLAGLLVAASPVQVVVGSFLAGTDHQDQKPKKNKHDFMLSSPTAAIPISSA--ADHRTI 328

Query: 284 TTMTTSSFRAETWSPNVVPDLRSQPTDINVSLT 297
            ++++      TW  ++  D R++ TDINV++T
Sbjct: 329 HSVSSLPVNNNTWQTSLASDPRNKHTDINVNVT 356

BLAST of Cucsa.023030.3 vs. TAIR10
Match: AT4G22770.1 (AT4G22770.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 292.0 bits (746), Expect = 4.3e-79
Identity = 168/297 (56.57%), Postives = 206/297 (69.36%), Query Frame = 1

Query: 19  NNPRPETAAPPSEGGGPA----GSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAP 78
           N+  P    PP     P+    G ++   KK+RGRPRKYG DG      LSP PIS++AP
Sbjct: 43  NSVAPPPPPPPQNSFTPSAAMDGFSSGPIKKRRGRPRKYGHDGA--AVTLSPNPISSAAP 102

Query: 79  APAAVIDFS--AEKRGKVRPA----SSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSG 138
             + VIDFS  +EKRGK++PA    SS  + KY+VENLGEW P S  ANFTPHIITV++G
Sbjct: 103 TTSHVIDFSTTSEKRGKMKPATPTPSSFIRPKYQVENLGEWSPSSAAANFTPHIITVNAG 162

Query: 139 EDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPS 198
           EDVT +++SFSQQG  AIC+L ANGV+SSVTLRQPDSSGGTLTYEGRFEILSLSG+FMPS
Sbjct: 163 EDVTKRIISFSQQGSLAICVLCANGVVSSVTLRQPDSSGGTLTYEGRFEILSLSGTFMPS 222

Query: 199 DSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISG-NQHEQKPKKPK 258
           DS GT+SR       +ASPDGRVVGGGVAGLLVAA+P+QVVVG+F+ G NQ EQ PK   
Sbjct: 223 DSDGTRSRTGGMSVSLASPDGRVVGGGVAGLLVAATPIQVVVGTFLGGTNQQEQTPKPHN 282

Query: 259 HDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPT-DINVSLT 297
           H+ +   S    +S     ++T   M TSS    TW+P+   D R + + D N++LT
Sbjct: 283 HNFM--SSPLMPTSSNVADHRTIRPM-TSSLPISTWTPSFPSDSRHKHSHDFNITLT 334

BLAST of Cucsa.023030.3 vs. TAIR10
Match: AT4G00200.1 (AT4G00200.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 222.6 bits (566), Expect = 3.2e-58
Identity = 141/292 (48.29%), Postives = 182/292 (62.33%), Query Frame = 1

Query: 28  PPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPAPAAVIDFSAEK- 87
           PP E   P+   A +GKK+RGRPRKY  +G               AP P++ +    ++ 
Sbjct: 41  PPMEAPMPSSGEA-SGKKRRGRPRKYEANG---------------APLPSSSVPLVKKRV 100

Query: 88  RGKVR--PASSLTKT-----KYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKVLSFS 147
           RGK+       + KT       E   +G  V   VG+NFTPH+ITV++GED+TM+++SFS
Sbjct: 101 RGKLNGFDMKKMHKTIGFHSSGERFGVGGGVGGGVGSNFTPHVITVNTGEDITMRIISFS 160

Query: 148 QQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKSR--- 207
           QQGPRAICILSANGVIS+VTLRQPDS GGTLTYEGRFEILSLSGSFM +++ G+K R   
Sbjct: 161 QQGPRAICILSANGVISNVTLRQPDSCGGTLTYEGRFEILSLSGSFMETENQGSKGRSGG 220

Query: 208 ----IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE-QKPKKPK--HDVVLPVST 267
               +A PDGRVVGGGVAGLL+AA+P+QVVVGSFI+ +Q + QKP+K +  H     +S 
Sbjct: 221 MSVSLAGPDGRVVGGGVAGLLIAATPIQVVVGSFITSDQQDHQKPRKQRVEHAPAAVMSV 280

Query: 268 FPISSVEPKSYKTTT------TMTTSSFRAETWSPNVVPDLRSQPTDINVSL 296
            P  S  P +    +          SSF   +W+ N     R+  TDIN+SL
Sbjct: 281 PPPPSPPPPAASVFSPTNPDREQPPSSFGISSWT-NGQDMPRNSATDINISL 315

BLAST of Cucsa.023030.3 vs. TAIR10
Match: AT4G25320.1 (AT4G25320.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 216.9 bits (551), Expect = 1.7e-56
Identity = 131/269 (48.70%), Postives = 169/269 (62.83%), Query Frame = 1

Query: 12  IEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISA 71
           + A    N   P +   P+E      ++AE  KKKRGRPRKY PDG L V  LSP PIS+
Sbjct: 59  VAAAVTENAATPFSLTMPTEN-----TSAEQLKKKRGRPRKYNPDGTL-VVTLSPMPISS 118

Query: 72  SAPAPAAVIDFSAEKRGKVRPASS----------LTKTKYEVENLGEWVPCSVGANFTPH 131
           S P  +   +F   KRG+ R  S+            ++  +    G      VGANFTPH
Sbjct: 119 SVPLTS---EFPPRKRGRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPH 178

Query: 132 IITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 191
           ++ V++GEDVTMK+++FSQQG RAICILSANG IS+VTLRQ  +SGGTLTYEGRFEILSL
Sbjct: 179 VLIVNAGEDVTMKIMTFSQQGSRAICILSANGPISNVTLRQSMTSGGTLTYEGRFEILSL 238

Query: 192 SGSFMPSDSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQ 251
           +GSFM +DS GT+SR       +A PDGRV GGG+AGL +AA PVQV+VG+FI+G +  Q
Sbjct: 239 TGSFMQNDSGGTRSRAGGMSVCLAGPDGRVFGGGLAGLFLAAGPVQVMVGTFIAGQEQSQ 298

Query: 252 ----KPKKPKHDVVLPVSTFPISSVEPKS 260
               K ++ +        +F IS+ E K+
Sbjct: 299 LELAKERRLRFGAQPSSISFNISAEERKA 318

BLAST of Cucsa.023030.3 vs. TAIR10
Match: AT5G62260.1 (AT5G62260.1 AT hook motif DNA-binding family protein)

HSP 1 Score: 213.8 bits (543), Expect = 1.5e-55
Identity = 135/280 (48.21%), Postives = 170/280 (60.71%), Query Frame = 1

Query: 16  AGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVA----ALSPKPISA 75
           A S+ P P T  P   G   A + ++  KKKRGRPRKY PDG LN       LSP PIS+
Sbjct: 51  APSSAPVPTTVTP---GSATASTGSDPTKKKRGRPRKYAPDGSLNPRFLRPTLSPTPISS 110

Query: 76  SAPAPAAVIDFSAEKRGKVR----PASSLTKT-KYEVENLGEWVP-----CSVGANFTPH 135
           S P      D+   KRGK +    P   + K+ K+E  +     P     C VGANFT H
Sbjct: 111 SIPLSG---DYQW-KRGKAQQQHQPLEFVKKSHKFEYGSPAPTPPLPGLSCYVGANFTTH 170

Query: 136 IITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 195
             TV+ GEDVTMKV+ +SQQG RAICILSA G IS+VTL QP ++GGTLTYEGRFEILSL
Sbjct: 171 QFTVNGGEDVTMKVMPYSQQGSRAICILSATGSISNVTLGQPTNAGGTLTYEGRFEILSL 230

Query: 196 SGSFMPSDSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQ 255
           SGSFMP+++ GTK R       +A P+G + GGG+AG+L+AA PVQVV+GSFI  +Q EQ
Sbjct: 231 SGSFMPTENGGTKGRAGGMSISLAGPNGNIFGGGLAGMLIAAGPVQVVMGSFIVMHQAEQ 290

Query: 256 KPKKPKHDVVL-----PVSTFPISSVEPKSYKTTTTMTTS 270
             KK    +       P +   +   +P ++  TT  +TS
Sbjct: 291 NQKKKPRVMEAFAPPQPQAPPQLQQQQPPTFTITTVNSTS 323

BLAST of Cucsa.023030.3 vs. NCBI nr
Match: gi|778697973|ref|XP_004149134.2| (PREDICTED: AT-hook motif nuclear-localized protein 1-like [Cucumis sativus])

HSP 1 Score: 581.6 bits (1498), Expect = 7.6e-163
Identity = 297/305 (97.38%), Postives = 297/305 (97.38%), Query Frame = 1

Query: 1   MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60
           MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN
Sbjct: 1   MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60

Query: 61  VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP 120
           VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP
Sbjct: 61  VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP 120

Query: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180
           HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS
Sbjct: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180

Query: 181 LSGSFMPSDSIGTKSRI-------ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE 240
           LSGSFMPSDSIGTKSRI       ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE
Sbjct: 181 LSGSFMPSDSIGTKSRIGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE 240

Query: 241 QKPKKPKHDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV 299
           QKPKKPKHDVVLPV TFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV
Sbjct: 241 QKPKKPKHDVVLPVYTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV 300

BLAST of Cucsa.023030.3 vs. NCBI nr
Match: gi|659121342|ref|XP_008460610.1| (PREDICTED: putative DNA-binding protein ESCAROLA isoform X2 [Cucumis melo])

HSP 1 Score: 552.7 bits (1423), Expect = 3.8e-154
Identity = 285/305 (93.44%), Postives = 290/305 (95.08%), Query Frame = 1

Query: 1   MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60
           MTGG+GDTGNGIEAVAGSNNPR ETAAPP E GGPAGSAAEAGKKKRGRPRKYGPDGKLN
Sbjct: 1   MTGGKGDTGNGIEAVAGSNNPRQETAAPPPEAGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60

Query: 61  VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP 120
           VAALSPKPISASAPAP AVIDFSAEKRGKVRPASSLTKTKYEVE+LGEWVPCSVGANFTP
Sbjct: 61  VAALSPKPISASAPAPTAVIDFSAEKRGKVRPASSLTKTKYEVESLGEWVPCSVGANFTP 120

Query: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180
           HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS
Sbjct: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180

Query: 181 LSGSFMPSDSIGTKSRI-------ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE 240
           LSGSFMPSDS+GTKSRI       ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQ E
Sbjct: 181 LSGSFMPSDSVGTKSRIGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQTE 240

Query: 241 QKPKKPKHDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV 299
           QKPKK +HDVVLPVSTFPISSVEPKS+KTTT  TTSSFRAETWSPNVVPDLRSQPTDINV
Sbjct: 241 QKPKKSRHDVVLPVSTFPISSVEPKSFKTTT--TTSSFRAETWSPNVVPDLRSQPTDINV 300

BLAST of Cucsa.023030.3 vs. NCBI nr
Match: gi|659121340|ref|XP_008460609.1| (PREDICTED: putative DNA-binding protein ESCAROLA isoform X1 [Cucumis melo])

HSP 1 Score: 552.7 bits (1423), Expect = 3.8e-154
Identity = 285/305 (93.44%), Postives = 290/305 (95.08%), Query Frame = 1

Query: 1   MTGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLN 60
           MTGG+GDTGNGIEAVAGSNNPR ETAAPP E GGPAGSAAEAGKKKRGRPRKYGPDGKLN
Sbjct: 8   MTGGKGDTGNGIEAVAGSNNPRQETAAPPPEAGGPAGSAAEAGKKKRGRPRKYGPDGKLN 67

Query: 61  VAALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTP 120
           VAALSPKPISASAPAP AVIDFSAEKRGKVRPASSLTKTKYEVE+LGEWVPCSVGANFTP
Sbjct: 68  VAALSPKPISASAPAPTAVIDFSAEKRGKVRPASSLTKTKYEVESLGEWVPCSVGANFTP 127

Query: 121 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 180
           HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS
Sbjct: 128 HIITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILS 187

Query: 181 LSGSFMPSDSIGTKSRI-------ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHE 240
           LSGSFMPSDS+GTKSRI       ASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQ E
Sbjct: 188 LSGSFMPSDSVGTKSRIGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQTE 247

Query: 241 QKPKKPKHDVVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINV 299
           QKPKK +HDVVLPVSTFPISSVEPKS+KTTT  TTSSFRAETWSPNVVPDLRSQPTDINV
Sbjct: 248 QKPKKSRHDVVLPVSTFPISSVEPKSFKTTT--TTSSFRAETWSPNVVPDLRSQPTDINV 307

BLAST of Cucsa.023030.3 vs. NCBI nr
Match: gi|1009175138|ref|XP_015868718.1| (PREDICTED: AT-hook motif nuclear-localized protein 1-like isoform X1 [Ziziphus jujuba])

HSP 1 Score: 374.8 bits (961), Expect = 1.4e-100
Identity = 208/307 (67.75%), Postives = 235/307 (76.55%), Query Frame = 1

Query: 2   TGGQGDTGNGIEAVAGSNNPRPETAAPPSEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNV 61
           TGG   T   + A    + P   TAA P+ GG     A    KKKRGRPRKYGPDG + +
Sbjct: 34  TGGSTTTTPAV-APPPPSAPAVATAATPAAGG-----ATMPEKKKRGRPRKYGPDGTVTM 93

Query: 62  AALSPKPISASAPAPAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPH 121
           A LSPKPIS+SAP P  VIDFSAEK  KVRPASS++K KYE+ENLGEWV CSVGANFTPH
Sbjct: 94  A-LSPKPISSSAPPP--VIDFSAEKSRKVRPASSVSKAKYELENLGEWVACSVGANFTPH 153

Query: 122 IITVSSGEDVTMKVLSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 181
           IITV++GEDVTMK++SFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL
Sbjct: 154 IITVNAGEDVTMKIISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSL 213

Query: 182 SGSFMPSDSIGTKSR-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQ 241
           SGSFMPS++ GT+SR       +ASPDGRVVGGGVAGLLVAASPVQVVVGSF++GNQHEQ
Sbjct: 214 SGSFMPSETGGTRSRSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQ 273

Query: 242 KPKKPKHD---VVLPVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDI 299
           KPKK KHD   V  P +  PISS +PK+  T    +T SFR ++WS       R++ TDI
Sbjct: 274 KPKKQKHDYISVATPTAAIPISSADPKANLT----STGSFRGDSWSSLPQDSTRNKSTDI 327

BLAST of Cucsa.023030.3 vs. NCBI nr
Match: gi|568846955|ref|XP_006477307.1| (PREDICTED: AT-hook motif nuclear-localized protein 1 [Citrus sinensis])

HSP 1 Score: 374.4 bits (960), Expect = 1.9e-100
Identity = 203/293 (69.28%), Postives = 235/293 (80.20%), Query Frame = 1

Query: 18  SNNPRPETAAPP--SEGGGPAGSAAEAGKKKRGRPRKYGPDGKLNVAALSPKPISASAPA 77
           S NP P +A PP  ++   PA   A   KKKRGRPRKYGPDG + +A LSPKPIS++AP+
Sbjct: 34  SENPTPTSAPPPPATQPPAPAPPPALPLKKKRGRPRKYGPDGTVTMA-LSPKPISSAAPS 93

Query: 78  PAAVIDFSAEKRGKVRPASSLTKTKYEVENLGEWVPCSVGANFTPHIITVSSGEDVTMKV 137
           P  VIDFSAEK  KV+PASS +K+KYEVEN+GEWV CSVGANFTPHIITV++GEDVTMK+
Sbjct: 94  PP-VIDFSAEKPRKVKPASSFSKSKYEVENIGEWVACSVGANFTPHIITVNTGEDVTMKI 153

Query: 138 LSFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSIGTKS 197
           +SFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDS GT+S
Sbjct: 154 ISFSQQGPRAICILSANGVISSVTLRQPDSSGGTLTYEGRFEILSLSGSFMPSDSGGTRS 213

Query: 198 R-------IASPDGRVVGGGVAGLLVAASPVQVVVGSFISGNQHEQKPKKPKHD---VVL 257
           R       +ASPDGRVVGGGVAGLLVAASPVQVVVGSF++GNQHEQK KK K++   +  
Sbjct: 214 RSGGMSVSLASPDGRVVGGGVAGLLVAASPVQVVVGSFLAGNQHEQKHKKQKNEPISIAT 273

Query: 258 PVSTFPISSVEPKSYKTTTTMTTSSFRAETWSPNVVPDLRSQPTDINVSLTSG 299
           P +  PISS +PK      ++TTSSFR + WS ++  D R++PTDIN SL  G
Sbjct: 274 PTAAIPISSADPKG-----SLTTSSFRGDGWS-SLPSDSRNKPTDINASLPVG 318

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AHL1_ARATH1.9e-8966.67AT-hook motif nuclear-localized protein 1 OS=Arabidopsis thaliana GN=AHL1 PE=1 S... [more]
AHL2_ARATH7.6e-7856.57AT-hook motif nuclear-localized protein 2 OS=Arabidopsis thaliana GN=AHL2 PE=2 S... [more]
AHL7_ARATH5.6e-5748.29AT-hook motif nuclear-localized protein 7 OS=Arabidopsis thaliana GN=AHL7 PE=2 S... [more]
AHL3_ARATH3.1e-5548.70AT-hook motif nuclear-localized protein 3 OS=Arabidopsis thaliana GN=AHL3 PE=1 S... [more]
AHL6_ARATH2.6e-5448.21AT-hook motif nuclear-localized protein 6 OS=Arabidopsis thaliana GN=AHL6 PE=2 S... [more]
Match NameE-valueIdentityDescription
A0A0A0KL67_CUCSA5.3e-16397.38Uncharacterized protein OS=Cucumis sativus GN=Csa_5G010650 PE=4 SV=1[more]
M5WUW0_PRUPE2.9e-10069.97Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008806mg PE=4 SV=1[more]
A0A0A0KP42_CUCSA4.9e-10066.01Uncharacterized protein OS=Cucumis sativus GN=Csa_5G157940 PE=4 SV=1[more]
A0A061GBU0_THECC6.4e-10069.10AT-hook motif nuclear-localized protein 1 isoform 2 OS=Theobroma cacao GN=TCM_01... [more]
V4TPS8_9ROSI4.2e-9968.60Uncharacterized protein OS=Citrus clementina GN=CICLE_v10021222mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G12080.11.1e-9066.67 AT-hook motif nuclear-localized protein 1[more]
AT4G22770.14.3e-7956.57 AT hook motif DNA-binding family protein[more]
AT4G00200.13.2e-5848.29 AT hook motif DNA-binding family protein[more]
AT4G25320.11.7e-5648.70 AT hook motif DNA-binding family protein[more]
AT5G62260.11.5e-5548.21 AT hook motif DNA-binding family protein[more]
Match NameE-valueIdentityDescription
gi|778697973|ref|XP_004149134.2|7.6e-16397.38PREDICTED: AT-hook motif nuclear-localized protein 1-like [Cucumis sativus][more]
gi|659121342|ref|XP_008460610.1|3.8e-15493.44PREDICTED: putative DNA-binding protein ESCAROLA isoform X2 [Cucumis melo][more]
gi|659121340|ref|XP_008460609.1|3.8e-15493.44PREDICTED: putative DNA-binding protein ESCAROLA isoform X1 [Cucumis melo][more]
gi|1009175138|ref|XP_015868718.1|1.4e-10067.75PREDICTED: AT-hook motif nuclear-localized protein 1-like isoform X1 [Ziziphus j... [more]
gi|568846955|ref|XP_006477307.1|1.9e-10069.28PREDICTED: AT-hook motif nuclear-localized protein 1 [Citrus sinensis][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR005175PPC_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cucsa.023030Cucsa.023030gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cucsa.023030.3Cucsa.023030.3-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.023030.3.CDS.6Cucsa.023030.3.CDS.6CDS
Cucsa.023030.3.CDS.5Cucsa.023030.3.CDS.5CDS
Cucsa.023030.3.CDS.4Cucsa.023030.3.CDS.4CDS
Cucsa.023030.3.CDS.3Cucsa.023030.3.CDS.3CDS
Cucsa.023030.3.CDS.2Cucsa.023030.3.CDS.2CDS
Cucsa.023030.3.CDS.1Cucsa.023030.3.CDS.1CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cucsa.023030.3.three_prime_UTR.1Cucsa.023030.3.three_prime_UTR.1three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005175PPC domainPFAMPF03479DUF296coord: 119..226
score: 4.9
IPR005175PPC domainPROFILEPS51742PPCcoord: 115..253
score: 38
NoneNo IPR availableGENE3DG3DSA:3.30.1330.80coord: 118..233
score: 3.5
NoneNo IPR availablePANTHERPTHR31500FAMILY NOT NAMEDcoord: 12..298
score: 4.7E
NoneNo IPR availablePANTHERPTHR31500:SF7AT HOOK MOTIF DNA-BINDING FAMILY PROTEIN-RELATEDcoord: 12..298
score: 4.7E
NoneNo IPR availableunknownSSF117856AF0104/ALDC/Ptd012-likecoord: 117..232
score: 4.19