CmoCh04G002020 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G002020
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: AT hook motif DNA-binding family protein
Location: Cmo_Chr04 : 1004403 .. 1005206 (+)
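
The location field gives a start and an end coordinate on the plus strand of Cmo_Chr04. A minimal sketch of how the feature length follows from those two numbers, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but an assumption here; `feature_length` is a hypothetical helper name):

```python
# Hypothetical helper: length of a feature whose coordinates are
# assumed to be 1-based and inclusive on both ends.
def feature_length(start: int, end: int) -> int:
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = feature_length(1004403, 1005206)
print(gene_length)       # 804
print(gene_length % 3)   # 0, i.e. a whole number of codons
```

Under that assumption the feature spans 804 bp, which is 268 codons.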
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAGTCGATGGTGGGCTGAAAATATACCCACCACCACAGATAGCAGCACCACCAACCCTTATTCGACTCCATTAAAGCAAAGCTTAGAAGCTGCGGATGAAGAAAACAACTCCGGCAGCTATGAGAGAGTTGAACCAGGCACCGGCTCCACCACGCGCCGCCCACGTGGTCGGCCGCCGGGATCCAAAAACAAGATGAAGCCTCCGGTGGTAGTCACCAAGGAGAGCCCCGATGCTCTCCGGAGCCACGTTTTAGAAATCGGCAGCGGCAGCGATATCGTCGAAAGCATCTCGAACTTCGCTCAACGCCGCCAGCGAGGGGTTTCTGTCCTCAGTGGAAATGGGGTTGTCGCCAATGTCACTCTAAGGCATCCGGCCGCCCCTGGCAACGTTATTACTCTACAGGGACGGTTTGATATATTGTCGCTCTCCGGTGCCTTTTTGCCTTCCCCTGCTCCACCAGGGGCTACCGGACTAACAGTCTACTTGGCTGGTGGGCAGGGGCAGGTCGTTGGTGGCATTGTCGTCGGTGCTCTCATCGCAACCGGTCCAGTTATTGTCATCGCTGCTACCTTCACCAATGCCACATACGAGCAGCTACCGCTTGAGGATGAGGAGGCAGCTGCCGGAGACAAGTCCGGAAATAGCCAAAACAATTCAACTTCTCAGAGCATGGGGGATCAGCAACAGCAGCAGCCTTCAATGGGAAATTACAATATGACTTCAAATTTGGTGCCAAATGGTCAGGTTTCTTCGCATGATGTGTTTTGGAGTCCTCCTCGAGCCCCACCTCCATTCTAA

mRNA sequence

ATGGCAAGTCGATGGTGGGCTGAAAATATACCCACCACCACAGATAGCAGCACCACCAACCCTTATTCGACTCCATTAAAGCAAAGCTTAGAAGCTGCGGATGAAGAAAACAACTCCGGCAGCTATGAGAGAGTTGAACCAGGCACCGGCTCCACCACGCGCCGCCCACGTGGTCGGCCGCCGGGATCCAAAAACAAGATGAAGCCTCCGGTGGTAGTCACCAAGGAGAGCCCCGATGCTCTCCGGAGCCACGTTTTAGAAATCGGCAGCGGCAGCGATATCGTCGAAAGCATCTCGAACTTCGCTCAACGCCGCCAGCGAGGGGTTTCTGTCCTCAGTGGAAATGGGGTTGTCGCCAATGTCACTCTAAGGCATCCGGCCGCCCCTGGCAACGTTATTACTCTACAGGGACGGTTTGATATATTGTCGCTCTCCGGTGCCTTTTTGCCTTCCCCTGCTCCACCAGGGGCTACCGGACTAACAGTCTACTTGGCTGGTGGGCAGGGGCAGGTCGTTGGTGGCATTGTCGTCGGTGCTCTCATCGCAACCGGTCCAGTTATTGTCATCGCTGCTACCTTCACCAATGCCACATACGAGCAGCTACCGCTTGAGGATGAGGAGGCAGCTGCCGGAGACAAGTCCGGAAATAGCCAAAACAATTCAACTTCTCAGAGCATGGGGGATCAGCAACAGCAGCAGCCTTCAATGGGAAATTACAATATGACTTCAAATTTGGTGCCAAATGGTCAGGTTTCTTCGCATGATGTGTTTTGGAGTCCTCCTCGAGCCCCACCTCCATTCTAA

Coding sequence (CDS)

ATGGCAAGTCGATGGTGGGCTGAAAATATACCCACCACCACAGATAGCAGCACCACCAACCCTTATTCGACTCCATTAAAGCAAAGCTTAGAAGCTGCGGATGAAGAAAACAACTCCGGCAGCTATGAGAGAGTTGAACCAGGCACCGGCTCCACCACGCGCCGCCCACGTGGTCGGCCGCCGGGATCCAAAAACAAGATGAAGCCTCCGGTGGTAGTCACCAAGGAGAGCCCCGATGCTCTCCGGAGCCACGTTTTAGAAATCGGCAGCGGCAGCGATATCGTCGAAAGCATCTCGAACTTCGCTCAACGCCGCCAGCGAGGGGTTTCTGTCCTCAGTGGAAATGGGGTTGTCGCCAATGTCACTCTAAGGCATCCGGCCGCCCCTGGCAACGTTATTACTCTACAGGGACGGTTTGATATATTGTCGCTCTCCGGTGCCTTTTTGCCTTCCCCTGCTCCACCAGGGGCTACCGGACTAACAGTCTACTTGGCTGGTGGGCAGGGGCAGGTCGTTGGTGGCATTGTCGTCGGTGCTCTCATCGCAACCGGTCCAGTTATTGTCATCGCTGCTACCTTCACCAATGCCACATACGAGCAGCTACCGCTTGAGGATGAGGAGGCAGCTGCCGGAGACAAGTCCGGAAATAGCCAAAACAATTCAACTTCTCAGAGCATGGGGGATCAGCAACAGCAGCAGCCTTCAATGGGAAATTACAATATGACTTCAAATTTGGTGCCAAATGGTCAGGTTTCTTCGCATGATGTGTTTTGGAGTCCTCCTCGAGCCCCACCTCCATTCTAA
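
The coding sequence above can be checked against the protein shown as "Query" in the BLAST alignments. The following is a minimal sketch that translates only the first 33 bases of the CDS with the standard genetic code; the helper name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS listed in this record.
cds_prefix = "ATGGCAAGTCGATGGTGGGCTGAAAATATACCC"
print(translate(cds_prefix))  # MASRWWAENIP
```

The result matches the first residues of the query protein in the alignments (MASRWWAENIP...).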
BLAST of CmoCh04G002020 vs. Swiss-Prot
Match: AHL15_ARATH (AT-hook motif nuclear-localized protein 15 OS=Arabidopsis thaliana GN=AHL15 PE=2 SV=1)

HSP 1 Score: 253.4 bits (646), Expect = 2.7e-66
Identity = 154/279 (55.20%), Positives = 197/279 (70.61%), Query Frame = 1

Query: 9   NIPTTTDSS-------TTNPYSTPLKQSLEAADEENNSGSYERVEPGTGS--TTRRPRGR 68
           N PT T S        TTN   +P  Q+ ++ +E+N+      VEPG+GS  T RRPRGR
Sbjct: 35  NPPTMTRSDPRLDHDFTTNNSGSPNTQT-QSQEEQNSRDEQPAVEPGSGSGSTGRRPRGR 94

Query: 69  PPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVA 128
           PPGSKNK K PVVVTKESP++L+SHVLEI +G+D+ ES++ FA+RR RGVSVLSG+G+V 
Sbjct: 95  PPGSKNKPKSPVVVTKESPNSLQSHVLEIATGADVAESLNAFARRRGRGVSVLSGSGLVT 154

Query: 129 NVTLRHPAAPGNVITLQGRFDILSLSGAFLP-SPAPPGATGLTVYLAGGQGQVVGGIVVG 188
           NVTLR PAA G V++L+G+F+ILS+ GAFLP S +P  A GLT+YLAG QGQVVGG V G
Sbjct: 155 NVTLRQPAASGGVVSLRGQFEILSMCGAFLPTSGSPAAAAGLTIYLAGAQGQVVGGGVAG 214

Query: 189 ALIATGPVIVIAATFTNATYEQLPLEDEEAAA-------GDKSGNSQNNSTSQSMGDQQQ 248
            LIA+GPVIVIAATF NATYE+LP+E+E+          G K     +++ S + G++  
Sbjct: 215 PLIASGPVIVIAATFCNATYERLPIEEEQQQEQPLQLEDGKKQKEENDDNESGNNGNEGS 274

Query: 249 QQPSMGNYNMTSNLVPNG-QVSSHDVFWS--PPRAPPPF 268
            QP M  YNM  N +PNG Q++ HDV+W   PPRAPP +
Sbjct: 275 MQPPM--YNMPPNFIPNGHQMAQHDVYWGGPPPRAPPSY 310
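
Each HSP block starts with two summary lines (score and E-value, then identity and positives). A minimal sketch of how those lines could be parsed and the percentages recomputed from the raw counts; the regular expressions and the `parse_hsp` helper are illustrative assumptions, not part of any BLAST tooling, and the pattern tolerates the "Postives" spelling found in this output.

```python
import re

# Patterns written for the two summary lines as they appear in this record.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract bit score, E-value and recomputed percentages for one HSP."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    ident, ident_len, pos, pos_len = (int(x) for x in i.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


# Summary lines of the AHL15_ARATH hit, copied from this record.
hsp = parse_hsp(
    "HSP 1 Score: 253.4 bits (646), Expect = 2.7e-66",
    "Identity = 154/279 (55.20%), Postives = 197/279 (70.61%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages from the counts (154/279 and 197/279) gives a quick consistency check against the values printed in parentheses.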

BLAST of CmoCh04G002020 vs. Swiss-Prot
Match: AHL19_ARATH (AT-hook motif nuclear-localized protein 19 OS=Arabidopsis thaliana GN=AHL19 PE=2 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 2.9e-60
Identity = 153/317 (48.26%), Positives = 195/317 (61.51%), Query Frame = 1

Query: 1   MASRWWAENIPTTTDSSTTNPYSTPLKQ-----SLEAA---------------------D 60
           MA+ WW   +   +   TT P S+ LK+     S+  A                     D
Sbjct: 1   MANPWWTGQV-NLSGLETTPPGSSQLKKPDLHISMNMAMDSGHNNHHHHQEVDNNNNDDD 60

Query: 61  EENNSGS-YERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSD 120
            +N SG  +E  E    + TRRPRGRP GSKNK KPP+ VT++SP+AL+SHV+EI SG+D
Sbjct: 61  RDNLSGDDHEPREGAVEAPTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTD 120

Query: 121 IVESISNFAQRRQRGVSVLSGNGVVANVTLRHP------AAPGN--VITLQGRFDILSLS 180
           ++E+++ FA+RRQRG+ +LSGNG VANVTLR P      AAPG   V+ LQGRF+ILSL+
Sbjct: 121 VIETLATFARRRQRGICILSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLT 180

Query: 181 GAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLED 240
           G+FLP PAPPG+TGLT+YLAGGQGQVVGG VVG L+A GPV++IAATF+NATYE+LPLE+
Sbjct: 181 GSFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNATYERLPLEE 240

Query: 241 EEAA------------AGDKSGNSQNNSTSQSMGDQQQQQPSMGNYNMTSNLVPN----- 264
           EEAA             G   G     S+    GD  Q  P    YNM  NLV N     
Sbjct: 241 EEAAERGGGGGSGGVVPGQLGGGGSPLSSGAGGGDGNQGLPV---YNMPGNLVSNGGSGG 300

BLAST of CmoCh04G002020 vs. Swiss-Prot
Match: AHL20_ARATH (AT-hook motif nuclear-localized protein 20 OS=Arabidopsis thaliana GN=AHL20 PE=2 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 7.0e-59
Identity = 131/239 (54.81%), Positives = 168/239 (70.29%), Query Frame = 1

Query: 32  AADEENNSGSYERVEPGTGST---TRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEI 91
           A ++  ++   E  +P  G+     RRPRGRPPGSKNK K P+ VT++SP+ALRSHVLEI
Sbjct: 42  AMNQSQDNDQDEEDDPREGAVEVVNRRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEI 101

Query: 92  GSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVITLQGRFDILSLSGAF 151
             GSD+ ++I++F++RRQRGV VLSG G VANVTLR  AAPG V++LQGRF+ILSL+GAF
Sbjct: 102 SDGSDVADTIAHFSRRRQRGVCVLSGTGSVANVTLRQAAAPGGVVSLQGRFEILSLTGAF 161

Query: 152 LPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLEDEEA 211
           LP P+PPG+TGLTVYLAG QGQVVGG VVG L+A G V+VIAATF+NATYE+LP+E+EE 
Sbjct: 162 LPGPSPPGSTGLTVYLAGVQGQVVGGSVVGPLLAIGSVMVIAATFSNATYERLPMEEEED 221

Query: 212 AAGDKSGNSQNNSTSQSMGDQQQQQPSMG-NYNMTSNLVPN--GQVSSHDVFWSPPRAP 265
             G +  +   +S  +   +        G  YNM  +L+PN  GQ+      W   R P
Sbjct: 222 GGGSRQIHGGGDSPPRIGSNLPDLSGMAGPGYNMPPHLIPNGAGQLGHEPYTWVHARPP 280

BLAST of CmoCh04G002020 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 2.5e-56
Identity = 128/239 (53.56%), Positives = 164/239 (68.62%), Query Frame = 1

Query: 34  DEENNSGSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSD 93
           D    SG       G    TRRPRGRP GSKNK KPP+++T++S +ALR+HV+EIG G D
Sbjct: 85  DIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIGDGCD 144

Query: 94  IVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAA---PGNVITLQGRFDILSLSGAFLP 153
           +VES++ FA+RRQRGV V+SG G V NVT+R P +   PG+V++L GRF+ILSLSG+FLP
Sbjct: 145 LVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLP 204

Query: 154 SPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLEDEEAAA 213
            PAPP ATGL+VYLAGGQGQVVGG VVG L+  GPV+V+AA+F+NA YE+LPLE++E   
Sbjct: 205 PPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLEEDEMQT 264

Query: 214 GDKSGNSQNNSTSQSMGDQQ---QQQPSMGNYNMTSNLVPNGQV-SSHD-VFWSPPRAP 265
               G    +  S  M  QQ   QQQ   G+  +  NL+ + Q+   HD  +WS  R P
Sbjct: 265 PVHGGGGGGSLESPPMMGQQLQHQQQAMSGHQGLPPNLLGSVQLQQQHDQSYWSTGRPP 323

BLAST of CmoCh04G002020 vs. Swiss-Prot
Match: AHL22_ARATH (AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 1.2e-55
Identity = 128/256 (50.00%), Positives = 177/256 (69.14%), Query Frame = 1

Query: 36  ENNSGSYERVEPGTGST-----------TRRPRGRPPGSKNKMKPPVVVTKESPDALRSH 95
           E++S   ++  PG+G             TRRPRGRP GSKNK KPP+++T++S +AL+SH
Sbjct: 60  EHSSAGKDQSTPGSGGESGGGGGGDNHITRRPRGRPAGSKNKPKPPIIITRDSANALKSH 119

Query: 96  VLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAA-PG---NVITLQGRFD 155
           V+E+ +G D++ES++ FA+RRQRG+ VLSGNG V NVT+R PA+ PG   +V+ L GRF+
Sbjct: 120 VMEVANGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGRFE 179

Query: 156 ILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQ 215
           ILSLSG+FLP PAPP A+GLT+YLAGGQGQVVGG VVG L+A+GPV+++AA+F NA YE+
Sbjct: 180 ILSLSGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASGPVVIMAASFGNAAYER 239

Query: 216 LPLED---EEAAAGDKSGNSQNNST------SQSMGDQQQQQ-----PSMGNYNMTSNLV 263
           LPLE+   EE  AG  + N   N+T      +Q+   QQQQQ     P+     +  NL+
Sbjct: 240 LPLEEDDQEEQTAGAVANNIDGNATMGGGTQTQTQTQQQQQQQLMQDPTSFIQGLPPNLM 299

BLAST of CmoCh04G002020 vs. TrEMBL
Match: A0A0A0KLQ0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G182750 PE=4 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 6.1e-126
Identity = 241/269 (89.59%), Positives = 252/269 (93.68%), Query Frame = 1

Query: 1   MASRWWAENIPTTTDSSTTNPYSTPLKQSLEAADEENNSGSYERVEPGTGSTTRRPRGRP 60
           MA+RWWAENIPTTTDSST+NPYSTPLKQSLE ADEENNSGS+ER EPGT S+TRRPRGRP
Sbjct: 1   MANRWWAENIPTTTDSSTSNPYSTPLKQSLEVADEENNSGSHERAEPGTSSSTRRPRGRP 60

Query: 61  PGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVAN 120
           PGSKNK KPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVL GNGVVAN
Sbjct: 61  PGSKNKPKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLGGNGVVAN 120

Query: 121 VTLRHPAAPGNVITLQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGAL 180
           VTLRHP A G VITLQGRFDILSLSGAFLP+PAPPGATGLTVYLAGGQGQVVGGIVVGAL
Sbjct: 121 VTLRHPGASGGVITLQGRFDILSLSGAFLPAPAPPGATGLTVYLAGGQGQVVGGIVVGAL 180

Query: 181 IATGPVIVIAATFTNATYEQLPLEDEEAAAGDKSGNSQNNSTSQSMGD--QQQQQPSMGN 240
           +ATGPVIVIAATFTNAT+E+LPLEDEE AAGDKSG SQNNSTSQSMG+  QQQQQPSMG 
Sbjct: 181 VATGPVIVIAATFTNATFERLPLEDEEVAAGDKSGTSQNNSTSQSMGEQQQQQQQPSMGV 240

Query: 241 YNMTSNLVPNGQVSSHDVFWSPPRAPPPF 268
           YNMT NLV NGQVSSH++ WS PRAPPPF
Sbjct: 241 YNMTPNLVTNGQVSSHEMIWSLPRAPPPF 269

BLAST of CmoCh04G002020 vs. TrEMBL
Match: M5XDF2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008610mg PE=4 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 8.6e-80
Identity = 185/326 (56.75%), Positives = 219/326 (67.18%), Query Frame = 1

Query: 1   MASRWWAENIP--------------------------------TTTDSSTTNPYSTPLKQ 60
           MA+RWWA N+                                 T T+SST+NP +TP KQ
Sbjct: 1   MANRWWAGNVAMGGGHVDSISSTPPSLHLRNTEEQLDDHNTTNTPTNSSTSNPATTPNKQ 60

Query: 61  S------------LEAADEENN--SGSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVT 120
           +            LEA +++ N  SGS++ +EPG  S+ RRPRGRPPGSKNK KPP+++T
Sbjct: 61  NDEHHEDGRDNNDLEADNQDPNTGSGSHDSLEPG--SSNRRPRGRPPGSKNKPKPPIIIT 120

Query: 121 KESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVIT 180
           KESP+ALRSHVLEI SGSDIV+SI+ FAQRR RGVSVLSG+G+VANVTLRHPAAP  VIT
Sbjct: 121 KESPNALRSHVLEISSGSDIVDSIATFAQRRHRGVSVLSGSGIVANVTLRHPAAPSGVIT 180

Query: 181 LQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFT 240
           L GRF+ILSLSGAFLPSP+PPGATGLTVYLAGGQGQVVGG V+GAL+A+GPV+V+AATFT
Sbjct: 181 LHGRFEILSLSGAFLPSPSPPGATGLTVYLAGGQGQVVGGTVMGALVASGPVMVVAATFT 240

Query: 241 NATYEQLPLEDEEAAAGDKSGNSQNNSTSQS----------MGDQ--QQQQPSMGNYNMT 267
           NATYE+LPLEDE+A  G      Q     QS           G Q   +   SM  YN+ 
Sbjct: 241 NATYERLPLEDEQAGEGGMQVQQQQQQQQQSGVNSAGTGGNSGSQGLVEHTSSMAIYNLP 300

BLAST of CmoCh04G002020 vs. TrEMBL
Match: F6HNV0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0019g02080 PE=4 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 8.0e-78
Identity = 173/269 (64.31%), Positives = 199/269 (73.98%), Query Frame = 1

Query: 12  TTTDSSTTNPYSTPLKQS--------LEAADEENNSGSYERVEPGTGSTTRRPRGRPPGS 71
           T + SS TNP  +   Q+        ++  D E N+G +E  EP   S  RRPRGRPPGS
Sbjct: 62  TNSSSSNTNPNPSAANQNPEEEDSREIDLEDSEQNAGGHEIAEPS--SAGRRPRGRPPGS 121

Query: 72  KNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTL 131
           KNK KPP+V+TKESP+ALRSHVLEI SGSDI ESI+NFAQRR RGVSVLS +G+V NVTL
Sbjct: 122 KNKPKPPIVITKESPNALRSHVLEISSGSDIAESIANFAQRRHRGVSVLSASGIVNNVTL 181

Query: 132 RHPAAPGNVITLQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIAT 191
           R PAAPG VITLQGRF+ILSLSGAFLP+P+PPGATGLTVYLAGGQGQVVGG VVGAL+A+
Sbjct: 182 RQPAAPGGVITLQGRFEILSLSGAFLPAPSPPGATGLTVYLAGGQGQVVGGSVVGALMAS 241

Query: 192 GPVIVIAATFTNATYEQLPLEDEEAAAG----DKSG-NSQNNSTSQSMGDQQQQQPSMGN 251
           GPVIVIAATF+NAT+E+LPLEDE A  G      SG NS    TS           SM  
Sbjct: 242 GPVIVIAATFSNATFERLPLEDEPANEGIQMPQTSGVNSGTGGTSAPQSHGLVDPSSMPI 301

Query: 252 YNMTSNLVPNGQVSSHDVFWSPPRAPPPF 268
           YN+  NL+PNGQ+  HDVFW+PP  PPP+
Sbjct: 302 YNLPPNLLPNGQM-PHDVFWAPPPRPPPY 327

BLAST of CmoCh04G002020 vs. TrEMBL
Match: A5AJT5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_028561 PE=4 SV=1)

HSP 1 Score: 295.0 bits (754), Expect = 8.9e-77
Identity = 176/296 (59.46%), Positives = 201/296 (67.91%), Query Frame = 1

Query: 1   MASRWWAENIP---------------TTTDSSTTNPYSTPLK---------QSLEAADEE 60
           M  RWWA N+                T  D    N   T  +         + ++  D E
Sbjct: 1   MTHRWWAGNVAMRDPMSSAPSLHLRNTEDDQGGLNRLGTQKRARNPEEEDSREIDLEDSE 60

Query: 61  NNSGSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVE 120
            N+G +E  EP   S  RRPRGRPPGSKNK KPP+V+TKESP+ALRSHVLEI SGSDI E
Sbjct: 61  QNAGGHEIAEPS--SAGRRPRGRPPGSKNKPKPPIVITKESPNALRSHVLEISSGSDIAE 120

Query: 121 SISNFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVITLQGRFDILSLSGAFLPSPAPPG 180
           SI+NFAQRR RGVSVLS +G+V NVTLR PAAPG VITLQGRF+ILSLSGAFLP+P+PPG
Sbjct: 121 SIANFAQRRHRGVSVLSASGIVNNVTLRQPAAPGGVITLQGRFEILSLSGAFLPAPSPPG 180

Query: 181 ATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLEDEEAAAG----D 240
           ATGLTVYLAGGQGQVVGG VVGAL+A+GPVIVIAATF+NAT+E+LPLEDE A  G     
Sbjct: 181 ATGLTVYLAGGQGQVVGGSVVGALMASGPVIVIAATFSNATFERLPLEDEPANEGIQMPQ 240

Query: 241 KSG-NSQNNSTSQSMGDQQQQQPSMGNYNMTSNLVPNGQVSSHDVFWSPPRAPPPF 268
            SG NS    TS           SM  YN   NL+PNGQ+  HDVFW+PP  PPP+
Sbjct: 241 TSGVNSGTGGTSAPQSHGLVDPSSMPIYNXPPNLLPNGQM-PHDVFWAPPPRPPPY 293

BLAST of CmoCh04G002020 vs. TrEMBL
Match: A0A0S3RZJ9_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G362300 PE=4 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 1.2e-76
Identity = 170/293 (58.02%), Positives = 204/293 (69.62%), Query Frame = 1

Query: 1   MASRWWAENI------------------PTTTDSSTTNPYSTPLKQSLE---AADEENNS 60
           MA+RWWA N+                  PT + +S TN  +   ++ +      D+  N 
Sbjct: 1   MANRWWAGNVGMIREQELMENNNNANTTPTNSSNSNTNANTNTTEEEVSRDNGDDQNQNL 60

Query: 61  GSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESIS 120
            S+E  EPG+G   RRPRGRPPGSKNK KPP+++TKESP+ALRSHVLEI SGSD+ ESI+
Sbjct: 61  VSHEGSEPGSGG--RRPRGRPPGSKNKPKPPIIITKESPNALRSHVLEIASGSDVAESIA 120

Query: 121 NFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVITLQGRFDILSLSGAFLPSPAPPGATG 180
            FA RR RGVSVLSG+G+V NVTLR PAAP  VITL GRF+ILSLSGAFLPSP+PPGATG
Sbjct: 121 AFANRRHRGVSVLSGSGIVTNVTLRQPAAPAGVITLHGRFEILSLSGAFLPSPSPPGATG 180

Query: 181 LTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLEDEEAAAGDKSGNSQN 240
           L+VYLAGGQGQVVGG V G+L+A+GPV+VIAATF NATYE+LPLEDE+   G++    Q 
Sbjct: 181 LSVYLAGGQGQVVGGTVAGSLVASGPVMVIAATFANATYERLPLEDEQ---GEEEMQVQQ 240

Query: 241 NSTSQSMGDQQQ---QQPSMGNYNMTSNLVPNGQVSSHDVFW--SPPRAPPPF 268
            S  Q    Q Q   +Q SM  YN+  NL+ NGQ   HDVFW  +PPR PP F
Sbjct: 241 QSQPQQQQQQSQGLGEQVSMAMYNLPPNLLHNGQNMPHDVFWGAAPPRPPPSF 288

BLAST of CmoCh04G002020 vs. TAIR10
Match: AT3G55560.1 (AT3G55560.1 AT-hook protein of GA feedback 2)

HSP 1 Score: 253.4 bits (646), Expect = 1.5e-67
Identity = 154/279 (55.20%), Positives = 197/279 (70.61%), Query Frame = 1

Query: 9   NIPTTTDSS-------TTNPYSTPLKQSLEAADEENNSGSYERVEPGTGS--TTRRPRGR 68
           N PT T S        TTN   +P  Q+ ++ +E+N+      VEPG+GS  T RRPRGR
Sbjct: 35  NPPTMTRSDPRLDHDFTTNNSGSPNTQT-QSQEEQNSRDEQPAVEPGSGSGSTGRRPRGR 94

Query: 69  PPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVA 128
           PPGSKNK K PVVVTKESP++L+SHVLEI +G+D+ ES++ FA+RR RGVSVLSG+G+V 
Sbjct: 95  PPGSKNKPKSPVVVTKESPNSLQSHVLEIATGADVAESLNAFARRRGRGVSVLSGSGLVT 154

Query: 129 NVTLRHPAAPGNVITLQGRFDILSLSGAFLP-SPAPPGATGLTVYLAGGQGQVVGGIVVG 188
           NVTLR PAA G V++L+G+F+ILS+ GAFLP S +P  A GLT+YLAG QGQVVGG V G
Sbjct: 155 NVTLRQPAASGGVVSLRGQFEILSMCGAFLPTSGSPAAAAGLTIYLAGAQGQVVGGGVAG 214

Query: 189 ALIATGPVIVIAATFTNATYEQLPLEDEEAAA-------GDKSGNSQNNSTSQSMGDQQQ 248
            LIA+GPVIVIAATF NATYE+LP+E+E+          G K     +++ S + G++  
Sbjct: 215 PLIASGPVIVIAATFCNATYERLPIEEEQQQEQPLQLEDGKKQKEENDDNESGNNGNEGS 274

Query: 249 QQPSMGNYNMTSNLVPNG-QVSSHDVFWS--PPRAPPPF 268
            QP M  YNM  N +PNG Q++ HDV+W   PPRAPP +
Sbjct: 275 MQPPM--YNMPPNFIPNGHQMAQHDVYWGGPPPRAPPSY 310

BLAST of CmoCh04G002020 vs. TAIR10
Match: AT3G04570.1 (AT3G04570.1 AT-hook motif nuclear-localized protein 19)

HSP 1 Score: 233.4 bits (594), Expect = 1.6e-61
Identity = 153/317 (48.26%), Positives = 195/317 (61.51%), Query Frame = 1

Query: 1   MASRWWAENIPTTTDSSTTNPYSTPLKQ-----SLEAA---------------------D 60
           MA+ WW   +   +   TT P S+ LK+     S+  A                     D
Sbjct: 1   MANPWWTGQV-NLSGLETTPPGSSQLKKPDLHISMNMAMDSGHNNHHHHQEVDNNNNDDD 60

Query: 61  EENNSGS-YERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSD 120
            +N SG  +E  E    + TRRPRGRP GSKNK KPP+ VT++SP+AL+SHV+EI SG+D
Sbjct: 61  RDNLSGDDHEPREGAVEAPTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTD 120

Query: 121 IVESISNFAQRRQRGVSVLSGNGVVANVTLRHP------AAPGN--VITLQGRFDILSLS 180
           ++E+++ FA+RRQRG+ +LSGNG VANVTLR P      AAPG   V+ LQGRF+ILSL+
Sbjct: 121 VIETLATFARRRQRGICILSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLT 180

Query: 181 GAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLED 240
           G+FLP PAPPG+TGLT+YLAGGQGQVVGG VVG L+A GPV++IAATF+NATYE+LPLE+
Sbjct: 181 GSFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNATYERLPLEE 240

Query: 241 EEAA------------AGDKSGNSQNNSTSQSMGDQQQQQPSMGNYNMTSNLVPN----- 264
           EEAA             G   G     S+    GD  Q  P    YNM  NLV N     
Sbjct: 241 EEAAERGGGGGSGGVVPGQLGGGGSPLSSGAGGGDGNQGLPV---YNMPGNLVSNGGSGG 300

BLAST of CmoCh04G002020 vs. TAIR10
Match: AT4G14465.1 (AT4G14465.1 AT-hook motif nuclear-localized protein 20)

HSP 1 Score: 228.8 bits (582), Expect = 4.0e-60
Identity = 131/239 (54.81%), Positives = 168/239 (70.29%), Query Frame = 1

Query: 32  AADEENNSGSYERVEPGTGST---TRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEI 91
           A ++  ++   E  +P  G+     RRPRGRPPGSKNK K P+ VT++SP+ALRSHVLEI
Sbjct: 42  AMNQSQDNDQDEEDDPREGAVEVVNRRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEI 101

Query: 92  GSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVITLQGRFDILSLSGAF 151
             GSD+ ++I++F++RRQRGV VLSG G VANVTLR  AAPG V++LQGRF+ILSL+GAF
Sbjct: 102 SDGSDVADTIAHFSRRRQRGVCVLSGTGSVANVTLRQAAAPGGVVSLQGRFEILSLTGAF 161

Query: 152 LPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLEDEEA 211
           LP P+PPG+TGLTVYLAG QGQVVGG VVG L+A G V+VIAATF+NATYE+LP+E+EE 
Sbjct: 162 LPGPSPPGSTGLTVYLAGVQGQVVGGSVVGPLLAIGSVMVIAATFSNATYERLPMEEEED 221

Query: 212 AAGDKSGNSQNNSTSQSMGDQQQQQPSMG-NYNMTSNLVPN--GQVSSHDVFWSPPRAP 265
             G +  +   +S  +   +        G  YNM  +L+PN  GQ+      W   R P
Sbjct: 222 GGGSRQIHGGGDSPPRIGSNLPDLSGMAGPGYNMPPHLIPNGAGQLGHEPYTWVHARPP 280

BLAST of CmoCh04G002020 vs. TAIR10
Match: AT4G22810.1 (AT4G22810.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 220.3 bits (560), Expect = 1.4e-57
Identity = 128/239 (53.56%), Positives = 164/239 (68.62%), Query Frame = 1

Query: 34  DEENNSGSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVTKESPDALRSHVLEIGSGSD 93
           D    SG       G    TRRPRGRP GSKNK KPP+++T++S +ALR+HV+EIG G D
Sbjct: 85  DIHGGSGEGGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIGDGCD 144

Query: 94  IVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAA---PGNVITLQGRFDILSLSGAFLP 153
           +VES++ FA+RRQRGV V+SG G V NVT+R P +   PG+V++L GRF+ILSLSG+FLP
Sbjct: 145 LVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFLP 204

Query: 154 SPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQLPLEDEEAAA 213
            PAPP ATGL+VYLAGGQGQVVGG VVG L+  GPV+V+AA+F+NA YE+LPLE++E   
Sbjct: 205 PPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLEEDEMQT 264

Query: 214 GDKSGNSQNNSTSQSMGDQQ---QQQPSMGNYNMTSNLVPNGQV-SSHD-VFWSPPRAP 265
               G    +  S  M  QQ   QQQ   G+  +  NL+ + Q+   HD  +WS  R P
Sbjct: 265 PVHGGGGGGSLESPPMMGQQLQHQQQAMSGHQGLPPNLLGSVQLQQQHDQSYWSTGRPP 323

BLAST of CmoCh04G002020 vs. TAIR10
Match: AT2G45430.1 (AT2G45430.1 AT-hook motif nuclear-localized protein 22)

HSP 1 Score: 218.0 bits (554), Expect = 7.0e-57
Identity = 128/256 (50.00%), Positives = 177/256 (69.14%), Query Frame = 1

Query: 36  ENNSGSYERVEPGTGST-----------TRRPRGRPPGSKNKMKPPVVVTKESPDALRSH 95
           E++S   ++  PG+G             TRRPRGRP GSKNK KPP+++T++S +AL+SH
Sbjct: 60  EHSSAGKDQSTPGSGGESGGGGGGDNHITRRPRGRPAGSKNKPKPPIIITRDSANALKSH 119

Query: 96  VLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAA-PG---NVITLQGRFD 155
           V+E+ +G D++ES++ FA+RRQRG+ VLSGNG V NVT+R PA+ PG   +V+ L GRF+
Sbjct: 120 VMEVANGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGRFE 179

Query: 156 ILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFTNATYEQ 215
           ILSLSG+FLP PAPP A+GLT+YLAGGQGQVVGG VVG L+A+GPV+++AA+F NA YE+
Sbjct: 180 ILSLSGSFLPPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASGPVVIMAASFGNAAYER 239

Query: 216 LPLED---EEAAAGDKSGNSQNNST------SQSMGDQQQQQ-----PSMGNYNMTSNLV 263
           LPLE+   EE  AG  + N   N+T      +Q+   QQQQQ     P+     +  NL+
Sbjct: 240 LPLEEDDQEEQTAGAVANNIDGNATMGGGTQTQTQTQQQQQQQLMQDPTSFIQGLPPNLM 299

BLAST of CmoCh04G002020 vs. NCBI nr
Match: gi|659123496|ref|XP_008461695.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 463.8 bits (1192), Expect = 2.1e-127
Identity = 242/267 (90.64%), Positives = 252/267 (94.38%), Query Frame = 1

Query: 1   MASRWWAENIPTTTDSSTTNPYSTPLKQSLEAADEENNSGSYERVEPGTGSTTRRPRGRP 60
           MA+RWWAENIPTTTDSST+NPYSTPLKQSLEAADEENNSGS+ER EPGT S+TRRPRGRP
Sbjct: 1   MANRWWAENIPTTTDSSTSNPYSTPLKQSLEAADEENNSGSHERAEPGTSSSTRRPRGRP 60

Query: 61  PGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVAN 120
           PGSKNK KPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVAN
Sbjct: 61  PGSKNKPKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVAN 120

Query: 121 VTLRHPAAPGNVITLQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGAL 180
           VTLRHP A G VITLQGRFDILSLSGAFLP+PAPPGATGLTVYLAGGQGQVVGGIVVGAL
Sbjct: 121 VTLRHPGASGGVITLQGRFDILSLSGAFLPAPAPPGATGLTVYLAGGQGQVVGGIVVGAL 180

Query: 181 IATGPVIVIAATFTNATYEQLPLEDEEAAAGDKSGNSQNNSTSQSMGDQQQQQPSMGNYN 240
           +ATGPVIVIAATFTNAT+E+LPLEDEE AAGDKSG SQNNSTSQSMG+QQQQ PSMG YN
Sbjct: 181 VATGPVIVIAATFTNATFERLPLEDEEVAAGDKSGTSQNNSTSQSMGEQQQQPPSMGVYN 240

Query: 241 MTSNLVPNGQVSSHDVFWSPPRAPPPF 268
           M  NLV NGQVSSHD+ WS PRAPPPF
Sbjct: 241 MAPNLVANGQVSSHDMIWSLPRAPPPF 267

BLAST of CmoCh04G002020 vs. NCBI nr
Match: gi|778701070|ref|XP_011654959.1| (PREDICTED: AT-hook motif nuclear-localized protein 15-like [Cucumis sativus])

HSP 1 Score: 458.4 bits (1178), Expect = 8.7e-126
Identity = 241/269 (89.59%), Positives = 252/269 (93.68%), Query Frame = 1

Query: 1   MASRWWAENIPTTTDSSTTNPYSTPLKQSLEAADEENNSGSYERVEPGTGSTTRRPRGRP 60
           MA+RWWAENIPTTTDSST+NPYSTPLKQSLE ADEENNSGS+ER EPGT S+TRRPRGRP
Sbjct: 1   MANRWWAENIPTTTDSSTSNPYSTPLKQSLEVADEENNSGSHERAEPGTSSSTRRPRGRP 60

Query: 61  PGSKNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVAN 120
           PGSKNK KPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVL GNGVVAN
Sbjct: 61  PGSKNKPKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLGGNGVVAN 120

Query: 121 VTLRHPAAPGNVITLQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGAL 180
           VTLRHP A G VITLQGRFDILSLSGAFLP+PAPPGATGLTVYLAGGQGQVVGGIVVGAL
Sbjct: 121 VTLRHPGASGGVITLQGRFDILSLSGAFLPAPAPPGATGLTVYLAGGQGQVVGGIVVGAL 180

Query: 181 IATGPVIVIAATFTNATYEQLPLEDEEAAAGDKSGNSQNNSTSQSMGD--QQQQQPSMGN 240
           +ATGPVIVIAATFTNAT+E+LPLEDEE AAGDKSG SQNNSTSQSMG+  QQQQQPSMG 
Sbjct: 181 VATGPVIVIAATFTNATFERLPLEDEEVAAGDKSGTSQNNSTSQSMGEQQQQQQQPSMGV 240

Query: 241 YNMTSNLVPNGQVSSHDVFWSPPRAPPPF 268
           YNMT NLV NGQVSSH++ WS PRAPPPF
Sbjct: 241 YNMTPNLVTNGQVSSHEMIWSLPRAPPPF 269

BLAST of CmoCh04G002020 vs. NCBI nr
Match: gi|645255784|ref|XP_008233653.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Prunus mume])

HSP 1 Score: 308.5 bits (789), Expect = 1.1e-80
Identity = 187/330 (56.67%), Positives = 223/330 (67.58%), Query Frame = 1

Query: 1   MASRWWAENIP--------------------------------TTTDSSTTNPYSTPLKQ 60
           MA+RWWA N+                                 T T+SST+NP +TP KQ
Sbjct: 1   MANRWWAGNVAMGGGHVDSISSTPPSLHLRNTEEQLDDHNTTNTPTNSSTSNPTTTPNKQ 60

Query: 61  S------------LEAADEENN--SGSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVT 120
           +            LEA +++ N  SGS++ +EPG  S+ RRPRGRPPGSKNK KPP+++T
Sbjct: 61  NHEQHEDGRDNDDLEADNQDPNTGSGSHDSLEPG--SSNRRPRGRPPGSKNKPKPPIIIT 120

Query: 121 KESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVIT 180
           KESP+ALRSHVLEI SGSDIV+SI+ FAQRR RGVSVLSG+G+VANVTLRHPAAP  VIT
Sbjct: 121 KESPNALRSHVLEISSGSDIVDSIATFAQRRHRGVSVLSGSGIVANVTLRHPAAPSGVIT 180

Query: 181 LQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFT 240
           L GRF+ILSLSGAFLPSP+PPGATGLTVYLAGGQGQVVGG V+GAL+A+GPV+VIAATFT
Sbjct: 181 LHGRFEILSLSGAFLPSPSPPGATGLTVYLAGGQGQVVGGTVMGALVASGPVMVIAATFT 240

Query: 241 NATYEQLPLEDEEAA----------------AGDKSGNSQNNSTSQSMGDQQQQQPSMGN 267
           NATYE+LPLEDE+A                 +G  S  +  NS SQ +G+      SM  
Sbjct: 241 NATYERLPLEDEQAGEGGMQVQPQQQQHQQQSGVNSAGTGGNSGSQGLGEHTS---SMAI 300

BLAST of CmoCh04G002020 vs. NCBI nr
Match: gi|596005031|ref|XP_007218339.1| (hypothetical protein PRUPE_ppa008610mg [Prunus persica])

HSP 1 Score: 305.1 bits (780), Expect = 1.2e-79
Identity = 185/326 (56.75%), Positives = 219/326 (67.18%), Query Frame = 1

Query: 1   MASRWWAENIP--------------------------------TTTDSSTTNPYSTPLKQ 60
           MA+RWWA N+                                 T T+SST+NP +TP KQ
Sbjct: 1   MANRWWAGNVAMGGGHVDSISSTPPSLHLRNTEEQLDDHNTTNTPTNSSTSNPATTPNKQ 60

Query: 61  S------------LEAADEENN--SGSYERVEPGTGSTTRRPRGRPPGSKNKMKPPVVVT 120
           +            LEA +++ N  SGS++ +EPG  S+ RRPRGRPPGSKNK KPP+++T
Sbjct: 61  NDEHHEDGRDNNDLEADNQDPNTGSGSHDSLEPG--SSNRRPRGRPPGSKNKPKPPIIIT 120

Query: 121 KESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTLRHPAAPGNVIT 180
           KESP+ALRSHVLEI SGSDIV+SI+ FAQRR RGVSVLSG+G+VANVTLRHPAAP  VIT
Sbjct: 121 KESPNALRSHVLEISSGSDIVDSIATFAQRRHRGVSVLSGSGIVANVTLRHPAAPSGVIT 180

Query: 181 LQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIATGPVIVIAATFT 240
           L GRF+ILSLSGAFLPSP+PPGATGLTVYLAGGQGQVVGG V+GAL+A+GPV+V+AATFT
Sbjct: 181 LHGRFEILSLSGAFLPSPSPPGATGLTVYLAGGQGQVVGGTVMGALVASGPVMVVAATFT 240

Query: 241 NATYEQLPLEDEEAAAGDKSGNSQNNSTSQS----------MGDQ--QQQQPSMGNYNMT 267
           NATYE+LPLEDE+A  G      Q     QS           G Q   +   SM  YN+ 
Sbjct: 241 NATYERLPLEDEQAGEGGMQVQQQQQQQQQSGVNSAGTGGNSGSQGLVEHTSSMAIYNLP 300

BLAST of CmoCh04G002020 vs. NCBI nr
Match: gi|731412935|ref|XP_010658536.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera])

HSP 1 Score: 298.5 bits (763), Expect = 1.2e-77
Identity = 173/269 (64.31%), Positives = 199/269 (73.98%), Query Frame = 1

Query: 12  TTTDSSTTNPYSTPLKQS--------LEAADEENNSGSYERVEPGTGSTTRRPRGRPPGS 71
           T + SS TNP  +   Q+        ++  D E N+G +E  EP   S  RRPRGRPPGS
Sbjct: 62  TNSSSSNTNPNPSAANQNPEEEDSREIDLEDSEQNAGGHEIAEPS--SAGRRPRGRPPGS 121

Query: 72  KNKMKPPVVVTKESPDALRSHVLEIGSGSDIVESISNFAQRRQRGVSVLSGNGVVANVTL 131
           KNK KPP+V+TKESP+ALRSHVLEI SGSDI ESI+NFAQRR RGVSVLS +G+V NVTL
Sbjct: 122 KNKPKPPIVITKESPNALRSHVLEISSGSDIAESIANFAQRRHRGVSVLSASGIVNNVTL 181

Query: 132 RHPAAPGNVITLQGRFDILSLSGAFLPSPAPPGATGLTVYLAGGQGQVVGGIVVGALIAT 191
           R PAAPG VITLQGRF+ILSLSGAFLP+P+PPGATGLTVYLAGGQGQVVGG VVGAL+A+
Sbjct: 182 RQPAAPGGVITLQGRFEILSLSGAFLPAPSPPGATGLTVYLAGGQGQVVGGSVVGALMAS 241

Query: 192 GPVIVIAATFTNATYEQLPLEDEEAAAG----DKSG-NSQNNSTSQSMGDQQQQQPSMGN 251
           GPVIVIAATF+NAT+E+LPLEDE A  G      SG NS    TS           SM  
Sbjct: 242 GPVIVIAATFSNATFERLPLEDEPANEGIQMPQTSGVNSGTGGTSAPQSHGLVDPSSMPI 301

Query: 252 YNMTSNLVPNGQVSSHDVFWSPPRAPPPF 268
           YN+  NL+PNGQ+  HDVFW+PP  PPP+
Sbjct: 302 YNLPPNLLPNGQM-PHDVFWAPPPRPPPY 327

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
AHL15_ARATH  2.7e-66  55.20         AT-hook motif nuclear-localized protein 15 OS=Arabidopsis thaliana GN=AHL15 PE=2...
AHL19_ARATH  2.9e-60  48.26         AT-hook motif nuclear-localized protein 19 OS=Arabidopsis thaliana GN=AHL19 PE=2...
AHL20_ARATH  7.0e-59  54.81         AT-hook motif nuclear-localized protein 20 OS=Arabidopsis thaliana GN=AHL20 PE=2...
AHL24_ARATH  2.5e-56  53.56         AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2...
AHL22_ARATH  1.2e-55  50.00         AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1...

Match Name        E-value   Identity (%)  Description
A0A0A0KLQ0_CUCSA  6.1e-126  89.59         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G182750 PE=4 SV=1
M5XDF2_PRUPE      8.6e-80   56.75         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008610mg PE=4 SV=1
F6HNV0_VITVI      8.0e-78   64.31         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0019g02080 PE=4 SV=...
A5AJT5_VITVI      8.9e-77   59.46         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_028561 PE=4 SV=1
A0A0S3RZJ9_PHAAN  1.2e-76   58.02         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G362300 PE=...

Match Name   E-value  Identity (%)  Description
AT3G55560.1  1.5e-67  55.20         AT-hook protein of GA feedback 2
AT3G04570.1  1.6e-61  48.26         AT-hook motif nuclear-localized protein 19
AT4G14465.1  4.0e-60  54.81         AT-hook motif nuclear-localized protein 20
AT4G22810.1  1.4e-57  53.56         Predicted AT-hook DNA-binding family protein
AT2G45430.1  7.0e-57  50.00         AT-hook motif nuclear-localized protein 22

Match Name                        E-value   Identity (%)  Description
gi|659123496|ref|XP_008461695.1|  2.1e-127  90.64         PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo]
gi|778701070|ref|XP_011654959.1|  8.7e-126  89.59         PREDICTED: AT-hook motif nuclear-localized protein 15-like [Cucumis sativus]
gi|645255784|ref|XP_008233653.1|  1.1e-80   56.67         PREDICTED: putative DNA-binding protein ESCAROLA [Prunus mume]
gi|596005031|ref|XP_007218339.1|  1.2e-79   56.75         hypothetical protein PRUPE_ppa008610mg [Prunus persica]
gi|731412935|ref|XP_010658536.1|  1.2e-77   64.31         PREDICTED: putative DNA-binding protein ESCAROLA [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005175  PPC_dom
IPR014476  AT-hook_nuclear
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003680      AT DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G002020.1  CmoCh04G002020.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                          Source   Source Term         Source Description       Alignment
IPR005175  PPC domain                               PFAM     PF03479             DUF296                   coord: 82..194, score: 3.4
IPR005175  PPC domain                               PROFILE  PS51742             PPC                      coord: 78..216, score: 37
IPR014476  AT-hook motif nuclear-localised protein  PIR      PIRSF016021         ESCAROLA                 coord: 8..265, score: 1.6E
None       No IPR available                         GENE3D   G3DSA:3.30.1330.80  -                        coord: 81..209, score: 4.3
None       No IPR available                         PANTHER  PTHR31100           FAMILY NOT NAMED         coord: 20..267, score: 1.8E
None       No IPR available                         unknown  SSF117856           AF0104/ALDC/Ptd012-like  coord: 79..208, score: 4.19
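
The InterPro hits above report overlapping coordinate ranges from different source databases. A minimal sketch of how those ranges could be compared, assuming the coordinates are 1-based, inclusive positions on the protein (an assumption; the record does not state the convention). The `Hit` type and the helper functions are hypothetical names introduced for illustration.

```python
from typing import NamedTuple, Optional, Sequence, Tuple


class Hit(NamedTuple):
    source: str
    accession: str
    start: int
    end: int


# Coordinates copied from the InterPro annotation table of this record.
HITS = [
    Hit("PFAM", "PF03479", 82, 194),
    Hit("PROFILE", "PS51742", 78, 216),
    Hit("PIR", "PIRSF016021", 8, 265),
    Hit("GENE3D", "G3DSA:3.30.1330.80", 81, 209),
    Hit("PANTHER", "PTHR31100", 20, 267),
    Hit("unknown", "SSF117856", 79, 208),
]


def span_length(hit: Hit) -> int:
    """Number of positions covered, assuming inclusive coordinates."""
    return hit.end - hit.start + 1


def contains(outer: Hit, inner: Hit) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


def common_region(hits: Sequence[Hit]) -> Optional[Tuple[int, int]]:
    """Interval shared by all hits, or None if they do not all overlap."""
    start = max(h.start for h in hits)
    end = min(h.end for h in hits)
    return (start, end) if start <= end else None


print(span_length(HITS[0]))        # 113
print(contains(HITS[1], HITS[0]))  # True
print(common_region(HITS))         # (82, 194)
```

With these numbers, the PFAM hit (82..194) lies inside every other reported range, so it is also the region common to all six hits.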

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                      Block
CmoCh04G002020  Cucsa.136170     Cucumber (Gy14) v1            cgycmoB0373
CmoCh04G002020  Cucsa.165130     Cucumber (Gy14) v1            cgycmoB0474
CmoCh04G002020  CmaCh04G002000   Cucurbita maxima (Rimu)       cmacmoB728
CmoCh04G002020  CmaCh04G011690   Cucurbita maxima (Rimu)       cmacmoB731
CmoCh04G002020  Cla003993        Watermelon (97103) v1         cmowmB673
CmoCh04G002020  Cla006543        Watermelon (97103) v1         cmowmB737
CmoCh04G002020  Csa6G008690      Cucumber (Chinese Long) v2    cmocuB767
CmoCh04G002020  Csa5G182750      Cucumber (Chinese Long) v2    cmocuB737
CmoCh04G002020  MELO3C023881     Melon (DHL92) v3.5.1          cmomeB646
CmoCh04G002020  MELO3C008232     Melon (DHL92) v3.5.1          cmomeB676
CmoCh04G002020  ClCG11G006720    Watermelon (Charleston Gray)  cmowcgB630
CmoCh04G002020  ClCG07G003540    Watermelon (Charleston Gray)  cmowcgB676
CmoCh04G002020  CSPI06G01190     Wild cucumber (PI 183967)     cmocpiB776
CmoCh04G002020  CSPI05G09840     Wild cucumber (PI 183967)     cmocpiB747
CmoCh04G002020  Lsi04G009030     Bottle gourd (USVL1VR-Ls)     cmolsiB670
CmoCh04G002020  Cp4.1LG01g05040  Cucurbita pepo (Zucchini)     cmocpeB673
CmoCh04G002020  Cp4.1LG01g10920  Cucurbita pepo (Zucchini)     cmocpeB674
CmoCh04G002020  MELO3C023881.2   Melon (DHL92) v3.6.1          cmomedB733
CmoCh04G002020  CsaV3_5G008290   Cucumber (Chinese Long) v3    cmocucB0873
CmoCh04G002020  Cla97C07G131830  Watermelon (97103) v2         cmowmbB755
CmoCh04G002020  Bhi06G000866     Wax gourd                     cmowgoB0837
CmoCh04G002020  CsGy5G007200     Cucumber (Gy14) v2            cgybcmoB652
CmoCh04G002020  Carg00252        Silver-seed gourd             carcmoB0924
CmoCh04G002020  Carg02091        Silver-seed gourd             carcmoB0372
CmoCh04G002020  Carg18334        Silver-seed gourd             carcmoB0123
The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                   Block
CmoCh04G002020  CmoCh15G001110  Cucurbita moschata (Rifu)  cmocmoB267
CmoCh04G002020  CmoCh04G012390  Cucurbita moschata (Rifu)  cmocmoB464