CSPI02G02660 (gene) Wild cucumber (PI 183967)

Name: CSPI02G02660
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: AT hook motif DNA-binding family protein
Location: Chr2 : 1835748 .. 1837462 (-)
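
The location gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this kind of display, but not stated on this page), the span of the feature can be computed as end - start + 1. The helper name `parse_location` is hypothetical and does not belong to any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr2 : 1835748 .. 1837462 (-)'.

    Returns (chromosome, start, end, strand). The layout is assumed
    from the record above; other pages may format it differently.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr2 : 1835748 .. 1837462 (-)")
# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1
print(chrom, strand, span)
```

A "-" strand means the gene is read from the reverse complement of the reference, so the displayed sequence runs from the higher coordinate to the lower one.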
The following sequences are available for this feature:

Gene sequence (with intron)

AGAACACTCACATTACCCTCAACATTTCTAAAAAAACCCCATTTCTCTCTTTCTCTCTCCTCACATACTTTCTTGTCTATCTCAACTCTCTAGGCATCTCTTTGCCTTTTCTTTCCCTCTCCTTCCCCTTCATCATCACTTCAACCTCCCCTTTCCTTAATCCTTCCATTTAATTTACACCTCCCAATACTCTCTACTTATTAATTTAATCCTTTCTTCTCTCTATTTTTTTCCTTTTTTTTAATTATTATTTCTTCTTCTTCTTCCCCCTTTCAACCCCACAATACCATCTTAATTCTTCTTCAGCAGATAAATTATTTTTTTTTATAAAAGAATTTTAACTTTTAAAACTCTGAAAATTTTCAAGATCTGAAAAAAAAAAAAAAGATCTGAATTTTTTGATGATGGCAAACCGGTGGTGGACCTCCGGCCAGATGGGTCTTCCCGGAGTTGATCATACTTCAACAAGCTCCTCTGCTATGAGAAAACCAGATCTGGGAATCTCCATGAACGACAACGGTGGTCCTGTTCACAGTGGTGGTGATGACGATGACGATAGGGATAACGGTGGGGATGAACCTAAAGAAGGAGCAGTTGAGGTTCCAACGCGTCGTCCCCGTGGCCGACCGCCGGGATCCAAGAATAAGCCTAAGCCACCTATCTTTGTTACTAGAGATAGCCCTAATGCCCTAAAAAGCCATGTCATGGAGATTTCTAATGGAGCTGATATTGCCGAGAGCGTTGCTCAATTCGCTCGACGGCGACAGAGAGGTGAAAAAGCTTAAGCTAATTAATAGGTTGCGGTGAATTTAATTATTGATTTTAATTATCTATATATCCATGAGTCACAGAGTTTGTACTTTGAAATTAGGTTAATTTAGTTCACATGAATTCGTGATCCATAACTGGCATGTATGATTGACTTCAGAGAAAAATATTTTCATTAACAGGTGTTTCTGTGCTTAGTGGTAGCGGTACGGTTACAAATGTCACACTCCGACAACCGTCTGCGCCTGGTGCAGTCTTAGCCCTCCAGGGACGATTCGAGATACTTTCTTTAACTGGAACTTTCCTCCCTGGACCAGCCCCACCTGGCTCAACCGGACTAACGATCTACTTGGCTGGTGGACAAGGGCAAGTGGTGGGTGGCAGCGTCGTCGGGCCACTCACCGCTGCTGGCCCAGTGATGGTGATTGCTGCAACATTTTCCAACGCAACATACGAAAGATTACCCTTAGAAGAGGAAGAAGAAGGCGGCGGAGTAGGAGCACAAGGGCACACATCGGCAGGCGGTGGCGGCGCAGGCGACGGTTCACCACAAGGCATCGGAGGCGGAGTCGGGGACCCATCAGCTATGACTCCACTGTACAATTTACCACCAAATTTACTACCGAATGGTGGCGGAGGGCAGTTGAACCAAGAGGCCTATTCTTGGGCTCACGGCGGCCGGCCGTCATTTTAAAGCTTCTGATGGGAAAAAAAAACAAAGGTTAGAAAATGTTTTAAGATTGGAGGCTGTTTTGTTATTGCAGCCATGGTTTAGGAAGTTGAAGGTATAAAAGATGAAAGAGATGAGTCTCTATATATTTCATGTCTTAGTGTTGATTATAAATATTCAGTGTGAAAAAAAAAACCCTTAAATTAATTGTTCTATGCAAATACAAGAAGGAGAGTTAATTAACTTCCTCTCTTTTTCTTCTAATTTAATTTTTATT

mRNA sequence

ATGATGGCAAACCGGTGGTGGACCTCCGGCCAGATGGGTCTTCCCGGAGTTGATCATACTTCAACAAGCTCCTCTGCTATGAGAAAACCAGATCTGGGAATCTCCATGAACGACAACGGTGGTCCTGTTCACAGTGGTGGTGATGACGATGACGATAGGGATAACGGTGGGGATGAACCTAAAGAAGGAGCAGTTGAGGTTCCAACGCGTCGTCCCCGTGGCCGACCGCCGGGATCCAAGAATAAGCCTAAGCCACCTATCTTTGTTACTAGAGATAGCCCTAATGCCCTAAAAAGCCATGTCATGGAGATTTCTAATGGAGCTGATATTGCCGAGAGCGTTGCTCAATTCGCTCGACGGCGACAGAGAGGTGTTTCTGTGCTTAGTGGTAGCGGTACGGTTACAAATGTCACACTCCGACAACCGTCTGCGCCTGGTGCAGTCTTAGCCCTCCAGGGACGATTCGAGATACTTTCTTTAACTGGAACTTTCCTCCCTGGACCAGCCCCACCTGGCTCAACCGGACTAACGATCTACTTGGCTGGTGGACAAGGGCAAGTGGTGGGTGGCAGCGTCGTCGGGCCACTCACCGCTGCTGGCCCAGTGATGGTGATTGCTGCAACATTTTCCAACGCAACATACGAAAGATTACCCTTAGAAGAGGAAGAAGAAGGCGGCGGAGTAGGAGCACAAGGGCACACATCGGCAGGCGGTGGCGGCGCAGGCGACGGTTCACCACAAGGCATCGGAGGCGGAGTCGGGGACCCATCAGCTATGACTCCACTGTACAATTTACCACCAAATTTACTACCGAATGGTGGCGGAGGGCAGTTGAACCAAGAGGCCTATTCTTGGGCTCACGGCGGCCGGCCGTCATTTTAA

Coding sequence (CDS)

ATGATGGCAAACCGGTGGTGGACCTCCGGCCAGATGGGTCTTCCCGGAGTTGATCATACTTCAACAAGCTCCTCTGCTATGAGAAAACCAGATCTGGGAATCTCCATGAACGACAACGGTGGTCCTGTTCACAGTGGTGGTGATGACGATGACGATAGGGATAACGGTGGGGATGAACCTAAAGAAGGAGCAGTTGAGGTTCCAACGCGTCGTCCCCGTGGCCGACCGCCGGGATCCAAGAATAAGCCTAAGCCACCTATCTTTGTTACTAGAGATAGCCCTAATGCCCTAAAAAGCCATGTCATGGAGATTTCTAATGGAGCTGATATTGCCGAGAGCGTTGCTCAATTCGCTCGACGGCGACAGAGAGGTGTTTCTGTGCTTAGTGGTAGCGGTACGGTTACAAATGTCACACTCCGACAACCGTCTGCGCCTGGTGCAGTCTTAGCCCTCCAGGGACGATTCGAGATACTTTCTTTAACTGGAACTTTCCTCCCTGGACCAGCCCCACCTGGCTCAACCGGACTAACGATCTACTTGGCTGGTGGACAAGGGCAAGTGGTGGGTGGCAGCGTCGTCGGGCCACTCACCGCTGCTGGCCCAGTGATGGTGATTGCTGCAACATTTTCCAACGCAACATACGAAAGATTACCCTTAGAAGAGGAAGAAGAAGGCGGCGGAGTAGGAGCACAAGGGCACACATCGGCAGGCGGTGGCGGCGCAGGCGACGGTTCACCACAAGGCATCGGAGGCGGAGTCGGGGACCCATCAGCTATGACTCCACTGTACAATTTACCACCAAATTTACTACCGAATGGTGGCGGAGGGCAGTTGAACCAAGAGGCCTATTCTTGGGCTCACGGCGGCCGGCCGTCATTTTAA
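
The query protein in the alignments below is the translation of this CDS. As a rough illustration, the sketch below translates the first few codons and the last few codons of the CDS with the standard genetic code. The function name `translate` is hypothetical, and the standard code is an assumption (it is the usual choice for nuclear plant genes).

```python
# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons and last four codons of the CDS shown above.
print(translate("ATGATGGCAAACCGGTGGTGGACC"))  # MMANRWWT
print(translate("CCGTCATTTTAA"))              # PSF
```

Both fragments agree with the start (MMANRWWT...) and end (...GGRPSF) of the query sequence in the alignments.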
BLAST of CSPI02G02660 vs. Swiss-Prot
Match: AHL19_ARATH (AT-hook motif nuclear-localized protein 19 OS=Arabidopsis thaliana GN=AHL19 PE=2 SV=1)

HSP 1 Score: 347.1 bits (889), Expect = 2.0e-94
Identity = 211/311 (67.85%), Positives = 236/311 (75.88%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMN---DNGGPVH------SGGDDDDD 61
           MAN WWT GQ+ L G++ T   SS ++KPDL ISMN   D+G   H         ++DDD
Sbjct: 1   MANPWWT-GQVNLSGLETTPPGSSQLKKPDLHISMNMAMDSGHNNHHHHQEVDNNNNDDD 60

Query: 62  RDN-GGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGAD 121
           RDN  GD  EP+EGAVE PTRRPRGRP GSKNKPKPPIFVTRDSPNALKSHVMEI++G D
Sbjct: 61  RDNLSGDDHEPREGAVEAPTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTD 120

Query: 122 IAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS------APG--AVLALQGRFEILSLT 181
           + E++A FARRRQRG+ +LSG+GTV NVTLRQPS      APG  AVLALQGRFEILSLT
Sbjct: 121 VIETLATFARRRQRGICILSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLT 180

Query: 182 GTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEE 241
           G+FLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPL AAGPVM+IAATFSNATYERLPLEE
Sbjct: 181 GSFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNATYERLPLEE 240

Query: 242 EE--EGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGG 287
           EE  E GG G  G    G  G G GSP   G G GD +   P+YN+P NL+ N   GGGG
Sbjct: 241 EEAAERGGGGGSGGVVPGQLGGG-GSPLSSGAGGGDGNQGLPVYNMPGNLVSNGGSGGGG 300
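
The percentages in each HSP header are the ratio of identical (or positively scoring) aligned positions to the alignment length, gaps included. As a minimal sketch, assuming the header layout shown above, the reported values can be re-derived from the counts. The name `check_hsp_stats` is hypothetical, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 211/311 (67.85%), Positives = 236/311 (75.88%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line):
    """Return (computed identity %, computed positives %, reported identity %, reported positives %)."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    computed_ident = round(100 * int(ident) / int(ident_len), 2)
    computed_pos = round(100 * int(pos) / int(pos_len), 2)
    return computed_ident, computed_pos, float(ident_pct), float(pos_pct)

stats = check_hsp_stats(
    "Identity = 211/311 (67.85%), Positives = 236/311 (75.88%), Query Frame = 1"
)
print(stats)
```

For the AHL19_ARATH hit, 211/311 and 236/311 reproduce the reported 67.85% and 75.88%.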

BLAST of CSPI02G02660 vs. Swiss-Prot
Match: AHL20_ARATH (AT-hook motif nuclear-localized protein 20 OS=Arabidopsis thaliana GN=AHL20 PE=2 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 1.0e-87
Identity = 191/300 (63.67%), Positives = 224/300 (74.67%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPG-VDHTSTSS--------SAMRKPDLGISMNDNGGPVHSGGDDDDD 61
           MAN WWT+ Q GL G VDH+ +S         S + K DLGI+MN +        D+D D
Sbjct: 1   MANPWWTN-QSGLAGMVDHSVSSGHHQNHHHQSLLTKGDLGIAMNQSQ-------DNDQD 60

Query: 62  RDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAE 121
            +   D+P+EGAVEV  RRPRGRPPGSKNKPK PIFVTRDSPNAL+SHV+EIS+G+D+A+
Sbjct: 61  EE---DDPREGAVEVVNRRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEISDGSDVAD 120

Query: 122 SVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPG 181
           ++A F+RRRQRGV VLSG+G+V NVTLRQ +APG V++LQGRFEILSLTG FLPGP+PPG
Sbjct: 121 TIAHFSRRRQRGVCVLSGTGSVANVTLRQAAAPGGVVSLQGRFEILSLTGAFLPGPSPPG 180

Query: 182 STGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQG 241
           STGLT+YLAG QGQVVGGSVVGPL A G VMVIAATFSNATYERLP+EEEE+GGG   Q 
Sbjct: 181 STGLTVYLAGVQGQVVGGSVVGPLLAIGSVMVIAATFSNATYERLPMEEEEDGGG-SRQI 240

Query: 242 HTSAGGGGAGDGSPQGIGGGVGDPSAMT-PLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
           H      G GD SP  IG  + D S M  P YN+PP+L+PN G GQL  E Y+W H   P
Sbjct: 241 H------GGGD-SPPRIGSNLPDLSGMAGPGYNMPPHLIPN-GAGQLGHEPYTWVHARPP 280

BLAST of CSPI02G02660 vs. Swiss-Prot
Match: AHL23_ARATH (AT-hook motif nuclear-localized protein 23 OS=Arabidopsis thaliana GN=AHL23 PE=1 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 7.2e-65
Identity = 150/289 (51.90%), Positives = 188/289 (65.05%), Query Frame = 1

Query: 27  MRKPDLGISMNDNGGPVHSGG-------DDDDDRDN---------------GGDEPKEGA 86
           + +PDL +  N +   V  G        DD+D+ +N               GG     G 
Sbjct: 17  LHRPDLHLHHNSSSDDVTPGAGMGHFTVDDEDNNNNHQGLDLASGGGSGSSGGGGGHGGG 76

Query: 87  VEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRG 146
            +V  RRPRGRPPGSKNKPKPP+ +TR+S N L++H++E++NG D+ + VA +ARRRQRG
Sbjct: 77  GDVVGRRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVTNGCDVFDCVATYARRRQRG 136

Query: 147 VSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQ 206
           + VLSGSGTVTNV++RQPSA GAV+ LQG FEILSL+G+FLP PAPPG+T LTI+LAGGQ
Sbjct: 137 ICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILSLSGSFLPPPAPPGATSLTIFLAGGQ 196

Query: 207 GQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDG 266
           GQVVGGSVVG LTAAGPV+VIAA+F+N  YERLPLEE+E+      Q     G  G G+ 
Sbjct: 197 GQVVGGSVVGELTAAGPVIVIAASFTNVAYERLPLEEDEQ------QQQLGGGSNGGGNL 256

Query: 267 SPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 294
            P+   GG G      P +NLP N+ PN    QL  E +    GGR  F
Sbjct: 257 FPEVAAGGGGG----LPFFNLPMNMQPN---VQLPVEGWPGNSGGRGPF 292

BLAST of CSPI02G02660 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 1.3e-61
Identity = 147/253 (58.10%), Positives = 176/253 (69.57%), Query Frame = 1

Query: 46  GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEIS 105
           GG  +    +GGD          TRRPRGRP GSKNKPKPPI +TRDS NAL++HVMEI 
Sbjct: 88  GGSGEGGGGSGGDHQM-------TRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIG 147

Query: 106 NGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSA---PGAVLALQGRFEILSLTG 165
           +G D+ ESVA FARRRQRGV V+SG+G VTNVT+RQP +   PG+V++L GRFEILSL+G
Sbjct: 148 DGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSG 207

Query: 166 TFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEE 225
           +FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  AGPV+V+AA+FSNA YERLPLEE+
Sbjct: 208 SFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLEED 267

Query: 226 EEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQE 285
           E       Q     GGGG    SP  +G  +     AM+    LPPNLL   G  QL Q+
Sbjct: 268 E------MQTPVHGGGGGGSLESPPMMGQQLQHQQQAMSGHQGLPPNLL---GSVQLQQQ 324

Query: 286 -AYSWAHGGRPSF 294
              S+   GRP +
Sbjct: 328 HDQSYWSTGRPPY 324

BLAST of CSPI02G02660 vs. Swiss-Prot
Match: AHL26_ARATH (AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2 SV=1)

HSP 1 Score: 237.7 bits (605), Expect = 1.7e-61
Identity = 151/262 (57.63%), Positives = 184/262 (70.23%), Query Frame = 1

Query: 38  DNGGPVHSGGDDDDDRDNGGDEPKEG--AVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPN 97
           DN    +SG +  +   +GG+    G  + E  TRRPRGRP GSKNKPK PI +TRDS N
Sbjct: 84  DNIANTNSGSEGKEMSLHGGEGGSGGGGSGEQMTRRPRGRPAGSKNKPKAPIIITRDSAN 143

Query: 98  ALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQP-SAPGAVLALQGR 157
           AL++HVMEI +G DI + +A FARRRQRGV V+SG+G+VTNVT+RQP S PG+V++L GR
Sbjct: 144 ALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMSGTGSVTNVTIRQPGSPPGSVVSLHGR 203

Query: 158 FEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATY 217
           FEILSL+G+FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  +GPV+V+AA+FSNA Y
Sbjct: 204 FEILSLSGSFLPPPAPPAATGLSVYLAGGQGQVVGGSVVGPLLCSGPVVVMAASFSNAAY 263

Query: 218 ERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGG 277
           ERLPLEE+E    V  QG    GGGG G GSP  +G      +AM     LPPNLL   G
Sbjct: 264 ERLPLEEDEMQTPV--QGGGGGGGGGGGMGSPPMMGQQQA-MAAMAAAQGLPPNLL---G 323

Query: 278 GGQL-----NQEAYSWAHGGRP 292
             QL     N + Y W+ G  P
Sbjct: 324 SVQLPPPQQNDQQY-WSTGRPP 338

BLAST of CSPI02G02660 vs. TrEMBL
Match: A0A0A0LL85_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G010120 PE=4 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 2.4e-160
Identity = 293/293 (100.00%), Positives = 293/293 (100.00%), Query Frame = 1

Query: 1   MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP 60
           MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP
Sbjct: 1   MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP 60

Query: 61  KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR 120
           KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR
Sbjct: 61  KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR 120

Query: 121 RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL 180
           RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL
Sbjct: 121 RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL 180

Query: 181 AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG 240
           AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG
Sbjct: 181 AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG 240

Query: 241 AGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 294
           AGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
Sbjct: 241 AGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 293

BLAST of CSPI02G02660 vs. TrEMBL
Match: F2Y9E5_COFAR (DNA-binding protein OS=Coffea arabica GN=MA17P03.7 PE=4 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 5.1e-110
Identity = 225/296 (76.01%), Positives = 252/296 (85.14%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISMNDNGGPVHSGG--DDDDDRDNGGD 61
           MANRWWT GQ+GLPGV+  +ST S  ++KPDLGISMNDN G     G  DD+D+R+N  D
Sbjct: 1   MANRWWT-GQVGLPGVETSSSTGSPVLKKPDLGISMNDNSGSGGGSGGRDDEDERENSTD 60

Query: 62  EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFA 121
           EPKEGAVEV TRRPRGRPPGSKNKPKPPIFVTRDSPNAL+SHVME++NG+DIAES+AQFA
Sbjct: 61  EPKEGAVEVATRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFA 120

Query: 122 RRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTI 181
           RRRQRGV VLS SGTVTNVTLRQPSAPGAV+AL GRFEILSLTG FLPGPAPPG+TGLTI
Sbjct: 121 RRRQRGVCVLSASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGATGLTI 180

Query: 182 YLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGG 241
           YLAGGQGQVVGGSVVG L A+GPVMVIA+TFSNATYERLP+EE+EEGGG  AQG     G
Sbjct: 181 YLAGGQGQVVGGSVVGSLVASGPVMVIASTFSNATYERLPIEEDEEGGGA-AQGQLGGNG 240

Query: 242 G---GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
               G+G G+PQ   GG+GDPS+M P+YNLPPNL+PN  GGQLN EA++WAHG  P
Sbjct: 241 SPPLGSG-GAPQQ--GGLGDPSSM-PVYNLPPNLMPN--GGQLNHEAFAWAHGRPP 288

BLAST of CSPI02G02660 vs. TrEMBL
Match: C0ILP0_COFCA (Uncharacterized protein OS=Coffea canephora GN=46C02.8 PE=4 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 5.1e-110
Identity = 225/296 (76.01%), Positives = 252/296 (85.14%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISMNDNGGPVHSGG--DDDDDRDNGGD 61
           MANRWWT GQ+GLPGV+  +ST S  ++KPDLGISMNDN G     G  DD+D+R+N  D
Sbjct: 1   MANRWWT-GQVGLPGVETSSSTGSPVLKKPDLGISMNDNSGSGGGSGGRDDEDERENSTD 60

Query: 62  EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFA 121
           EPKEGAVEV TRRPRGRPPGSKNKPKPPIFVTRDSPNAL+SHVME++NG+DIAES+AQFA
Sbjct: 61  EPKEGAVEVATRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFA 120

Query: 122 RRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTI 181
           RRRQRGV VLS SGTVTNVTLRQPSAPGAV+AL GRFEILSLTG FLPGPAPPG+TGLTI
Sbjct: 121 RRRQRGVCVLSASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGATGLTI 180

Query: 182 YLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGG 241
           YLAGGQGQVVGGSVVG L A+GPVMVIA+TFSNATYERLP+EE+EEGGG  AQG     G
Sbjct: 181 YLAGGQGQVVGGSVVGSLVASGPVMVIASTFSNATYERLPIEEDEEGGGA-AQGQLGGNG 240

Query: 242 G---GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
               G+G G+PQ   GG+GDPS+M P+YNLPPNL+PN  GGQLN EA++WAHG  P
Sbjct: 241 SPPLGSG-GAPQQ--GGLGDPSSM-PVYNLPPNLMPN--GGQLNHEAFAWAHGRPP 288

BLAST of CSPI02G02660 vs. TrEMBL
Match: F1DGA1_COFAR (DNA-binding protein OS=Coffea arabica GN=MA29G21.8 PE=4 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 2.0e-109
Identity = 224/296 (75.68%), Positives = 252/296 (85.14%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISMNDNGGPVHSGG--DDDDDRDNGGD 61
           MANRWWT GQ+GLPGV+  +ST S  ++KPDLGISMNDN G     G  DD+D+R+N  D
Sbjct: 1   MANRWWT-GQVGLPGVETSSSTGSPVLKKPDLGISMNDNSGSGGGSGGRDDEDERENSTD 60

Query: 62  EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFA 121
           EPKEGAVEV TRRPRGRPPGSKNKPKPPIFVTRDSPNAL+SHVME++NG+DIAES+AQFA
Sbjct: 61  EPKEGAVEVATRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFA 120

Query: 122 RRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTI 181
           RRRQRGV VLS SGTVTNVTLRQPSAPGAV+AL GRFEILSLTG FLPGPAPPG+TGLTI
Sbjct: 121 RRRQRGVCVLSASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGATGLTI 180

Query: 182 YLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGG 241
           YLAGGQGQVVGGSVVG L A+GPVMVIA+TFSNATYERLP+EE+EEGGG  AQG     G
Sbjct: 181 YLAGGQGQVVGGSVVGSLVASGPVMVIASTFSNATYERLPIEEDEEGGGA-AQGQLGGNG 240

Query: 242 G---GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
               G+G G+PQ   GG+GDPS+M P+Y+LPPNL+PN  GGQLN EA++WAHG  P
Sbjct: 241 SPPLGSG-GAPQQ--GGLGDPSSM-PVYSLPPNLMPN--GGQLNHEAFAWAHGRPP 288

BLAST of CSPI02G02660 vs. TrEMBL
Match: A0A067DA75_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022331mg PE=4 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 1.7e-108
Identity = 226/305 (74.10%), Positives = 251/305 (82.30%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISM--NDNG----GPVHSGGDDDDDRD 61
           MANRWWT GQ+GLPG+D  T+TSSS M+KPDLGIS+  N+NG    G    GGD++DDR+
Sbjct: 1   MANRWWT-GQVGLPGMDGSTATSSSPMKKPDLGISIMANNNGESGSGGGGGGGDEEDDRE 60

Query: 62  NGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESV 121
           +  DEP+EGA+E+ TRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEI+NGAD+AE++
Sbjct: 61  HS-DEPREGAIEISTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEIANGADVAETL 120

Query: 122 AQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGST 181
           A FARRRQRGV VLSGSGTVTNVTLRQPS P AV+A+ GRFEILSLTG FLPGPAPPGST
Sbjct: 121 ANFARRRQRGVCVLSGSGTVTNVTLRQPSDPSAVMAIHGRFEILSLTGAFLPGPAPPGST 180

Query: 182 GLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHT 241
           GLTIYLAGGQGQVVGGSVVG L A+GPVMVIAATFSNATYERLPL+EEEE GG GAQG  
Sbjct: 181 GLTIYLAGGQGQVVGGSVVGSLVASGPVMVIAATFSNATYERLPLDEEEE-GGAGAQGPL 240

Query: 242 SAGGG------GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHG 294
             GGG      G G G   G GGG+GDPS M    NLPPNL+ N  GGQL+ EAY WAH 
Sbjct: 241 GGGGGGGSGSSGGGGGGAGGGGGGIGDPSGMGVYNNLPPNLVAN--GGQLSHEAYGWAH- 299

BLAST of CSPI02G02660 vs. TAIR10
Match: AT3G04570.1 (AT3G04570.1 AT-hook motif nuclear-localized protein 19)

HSP 1 Score: 347.1 bits (889), Expect = 1.1e-95
Identity = 211/311 (67.85%), Positives = 236/311 (75.88%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMN---DNGGPVH------SGGDDDDD 61
           MAN WWT GQ+ L G++ T   SS ++KPDL ISMN   D+G   H         ++DDD
Sbjct: 1   MANPWWT-GQVNLSGLETTPPGSSQLKKPDLHISMNMAMDSGHNNHHHHQEVDNNNNDDD 60

Query: 62  RDN-GGD--EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGAD 121
           RDN  GD  EP+EGAVE PTRRPRGRP GSKNKPKPPIFVTRDSPNALKSHVMEI++G D
Sbjct: 61  RDNLSGDDHEPREGAVEAPTRRPRGRPAGSKNKPKPPIFVTRDSPNALKSHVMEIASGTD 120

Query: 122 IAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPS------APG--AVLALQGRFEILSLT 181
           + E++A FARRRQRG+ +LSG+GTV NVTLRQPS      APG  AVLALQGRFEILSLT
Sbjct: 121 VIETLATFARRRQRGICILSGNGTVANVTLRQPSTAAVAAAPGGAAVLALQGRFEILSLT 180

Query: 182 GTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEE 241
           G+FLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPL AAGPVM+IAATFSNATYERLPLEE
Sbjct: 181 GSFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLMAAGPVMLIAATFSNATYERLPLEE 240

Query: 242 EE--EGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPN---GGGG 287
           EE  E GG G  G    G  G G GSP   G G GD +   P+YN+P NL+ N   GGGG
Sbjct: 241 EEAAERGGGGGSGGVVPGQLGGG-GSPLSSGAGGGDGNQGLPVYNMPGNLVSNGGSGGGG 300

BLAST of CSPI02G02660 vs. TAIR10
Match: AT4G14465.1 (AT4G14465.1 AT-hook motif nuclear-localized protein 20)

HSP 1 Score: 324.7 bits (831), Expect = 5.8e-89
Identity = 191/300 (63.67%), Positives = 224/300 (74.67%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPG-VDHTSTSS--------SAMRKPDLGISMNDNGGPVHSGGDDDDD 61
           MAN WWT+ Q GL G VDH+ +S         S + K DLGI+MN +        D+D D
Sbjct: 1   MANPWWTN-QSGLAGMVDHSVSSGHHQNHHHQSLLTKGDLGIAMNQSQ-------DNDQD 60

Query: 62  RDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAE 121
            +   D+P+EGAVEV  RRPRGRPPGSKNKPK PIFVTRDSPNAL+SHV+EIS+G+D+A+
Sbjct: 61  EE---DDPREGAVEVVNRRPRGRPPGSKNKPKAPIFVTRDSPNALRSHVLEISDGSDVAD 120

Query: 122 SVAQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPG 181
           ++A F+RRRQRGV VLSG+G+V NVTLRQ +APG V++LQGRFEILSLTG FLPGP+PPG
Sbjct: 121 TIAHFSRRRQRGVCVLSGTGSVANVTLRQAAAPGGVVSLQGRFEILSLTGAFLPGPSPPG 180

Query: 182 STGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQG 241
           STGLT+YLAG QGQVVGGSVVGPL A G VMVIAATFSNATYERLP+EEEE+GGG   Q 
Sbjct: 181 STGLTVYLAGVQGQVVGGSVVGPLLAIGSVMVIAATFSNATYERLPMEEEEDGGG-SRQI 240

Query: 242 HTSAGGGGAGDGSPQGIGGGVGDPSAMT-PLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
           H      G GD SP  IG  + D S M  P YN+PP+L+PN G GQL  E Y+W H   P
Sbjct: 241 H------GGGD-SPPRIGSNLPDLSGMAGPGYNMPPHLIPN-GAGQLGHEPYTWVHARPP 280

BLAST of CSPI02G02660 vs. TAIR10
Match: AT4G17800.1 (AT4G17800.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 248.8 bits (634), Expect = 4.1e-66
Identity = 150/289 (51.90%), Positives = 188/289 (65.05%), Query Frame = 1

Query: 27  MRKPDLGISMNDNGGPVHSGG-------DDDDDRDN---------------GGDEPKEGA 86
           + +PDL +  N +   V  G        DD+D+ +N               GG     G 
Sbjct: 17  LHRPDLHLHHNSSSDDVTPGAGMGHFTVDDEDNNNNHQGLDLASGGGSGSSGGGGGHGGG 76

Query: 87  VEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARRRQRG 146
            +V  RRPRGRPPGSKNKPKPP+ +TR+S N L++H++E++NG D+ + VA +ARRRQRG
Sbjct: 77  GDVVGRRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVTNGCDVFDCVATYARRRQRG 136

Query: 147 VSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYLAGGQ 206
           + VLSGSGTVTNV++RQPSA GAV+ LQG FEILSL+G+FLP PAPPG+T LTI+LAGGQ
Sbjct: 137 ICVLSGSGTVTNVSIRQPSAAGAVVTLQGTFEILSLSGSFLPPPAPPGATSLTIFLAGGQ 196

Query: 207 GQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGGAGDG 266
           GQVVGGSVVG LTAAGPV+VIAA+F+N  YERLPLEE+E+      Q     G  G G+ 
Sbjct: 197 GQVVGGSVVGELTAAGPVIVIAASFTNVAYERLPLEEDEQ------QQQLGGGSNGGGNL 256

Query: 267 SPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 294
            P+   GG G      P +NLP N+ PN    QL  E +    GGR  F
Sbjct: 257 FPEVAAGGGGG----LPFFNLPMNMQPN---VQLPVEGWPGNSGGRGPF 292

BLAST of CSPI02G02660 vs. TAIR10
Match: AT4G22810.1 (AT4G22810.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 238.0 bits (606), Expect = 7.2e-63
Identity = 147/253 (58.10%), Positives = 176/253 (69.57%), Query Frame = 1

Query: 46  GGDDDDDRDNGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEIS 105
           GG  +    +GGD          TRRPRGRP GSKNKPKPPI +TRDS NAL++HVMEI 
Sbjct: 88  GGSGEGGGGSGGDHQM-------TRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIG 147

Query: 106 NGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQPSA---PGAVLALQGRFEILSLTG 165
           +G D+ ESVA FARRRQRGV V+SG+G VTNVT+RQP +   PG+V++L GRFEILSL+G
Sbjct: 148 DGCDLVESVATFARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSG 207

Query: 166 TFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEE 225
           +FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  AGPV+V+AA+FSNA YERLPLEE+
Sbjct: 208 SFLPPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLEED 267

Query: 226 EEGGGVGAQGHTSAGGGGAGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQE 285
           E       Q     GGGG    SP  +G  +     AM+    LPPNLL   G  QL Q+
Sbjct: 268 E------MQTPVHGGGGGGSLESPPMMGQQLQHQQQAMSGHQGLPPNLL---GSVQLQQQ 324

Query: 286 -AYSWAHGGRPSF 294
              S+   GRP +
Sbjct: 328 HDQSYWSTGRPPY 324

BLAST of CSPI02G02660 vs. TAIR10
Match: AT4G12050.1 (AT4G12050.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 237.7 bits (605), Expect = 9.4e-63
Identity = 151/262 (57.63%), Positives = 184/262 (70.23%), Query Frame = 1

Query: 38  DNGGPVHSGGDDDDDRDNGGDEPKEG--AVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPN 97
           DN    +SG +  +   +GG+    G  + E  TRRPRGRP GSKNKPK PI +TRDS N
Sbjct: 84  DNIANTNSGSEGKEMSLHGGEGGSGGGGSGEQMTRRPRGRPAGSKNKPKAPIIITRDSAN 143

Query: 98  ALKSHVMEISNGADIAESVAQFARRRQRGVSVLSGSGTVTNVTLRQP-SAPGAVLALQGR 157
           AL++HVMEI +G DI + +A FARRRQRGV V+SG+G+VTNVT+RQP S PG+V++L GR
Sbjct: 144 ALRTHVMEIGDGCDIVDCMATFARRRQRGVCVMSGTGSVTNVTIRQPGSPPGSVVSLHGR 203

Query: 158 FEILSLTGTFLPGPAPPGSTGLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATY 217
           FEILSL+G+FLP PAPP +TGL++YLAGGQGQVVGGSVVGPL  +GPV+V+AA+FSNA Y
Sbjct: 204 FEILSLSGSFLPPPAPPAATGLSVYLAGGQGQVVGGSVVGPLLCSGPVVVMAASFSNAAY 263

Query: 218 ERLPLEEEEEGGGVGAQGHTSAGGGGAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGG 277
           ERLPLEE+E    V  QG    GGGG G GSP  +G      +AM     LPPNLL   G
Sbjct: 264 ERLPLEEDEMQTPV--QGGGGGGGGGGGMGSPPMMGQQQA-MAAMAAAQGLPPNLL---G 323

Query: 278 GGQL-----NQEAYSWAHGGRP 292
             QL     N + Y W+ G  P
Sbjct: 324 SVQLPPPQQNDQQY-WSTGRPP 338

BLAST of CSPI02G02660 vs. NCBI nr
Match: gi|449443241|ref|XP_004139388.1| (PREDICTED: AT-hook motif nuclear-localized protein 19 [Cucumis sativus])

HSP 1 Score: 572.8 bits (1475), Expect = 3.5e-160
Identity = 293/293 (100.00%), Positives = 293/293 (100.00%), Query Frame = 1

Query: 1   MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP 60
           MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP
Sbjct: 1   MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP 60

Query: 61  KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR 120
           KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR
Sbjct: 61  KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR 120

Query: 121 RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL 180
           RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL
Sbjct: 121 RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL 180

Query: 181 AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG 240
           AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG
Sbjct: 181 AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG 240

Query: 241 AGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 294
           AGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF
Sbjct: 241 AGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 293

BLAST of CSPI02G02660 vs. NCBI nr
Match: gi|659070721|ref|XP_008456342.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 558.9 bits (1439), Expect = 5.2e-156
Identity = 290/294 (98.64%), Positives = 291/294 (98.98%), Query Frame = 1

Query: 1   MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP 60
           MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP
Sbjct: 1   MMANRWWTSGQMGLPGVDHTSTSSSAMRKPDLGISMNDNGGPVHSGGDDDDDRDNGGDEP 60

Query: 61  KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR 120
           KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR
Sbjct: 61  KEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFARR 120

Query: 121 RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL 180
           RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL
Sbjct: 121 RQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTIYL 180

Query: 181 AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGGGG 240
           AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSA GGG
Sbjct: 181 AGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSA-GGG 240

Query: 241 AGDGSPQGIGGGV-GDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRPSF 294
           AGD SPQGIGGGV GDPSAMTPLYNLPPNLLPNGGGGQ+NQEAYSWAHGGRPSF
Sbjct: 241 AGDSSPQGIGGGVGGDPSAMTPLYNLPPNLLPNGGGGQMNQEAYSWAHGGRPSF 293

BLAST of CSPI02G02660 vs. NCBI nr
Match: gi|167600640|gb|ABZ89182.1| (putative protein [Coffea canephora])

HSP 1 Score: 405.6 bits (1041), Expect = 7.4e-110
Identity = 225/296 (76.01%), Positives = 252/296 (85.14%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISMNDNGGPVHSGG--DDDDDRDNGGD 61
           MANRWWT GQ+GLPGV+  +ST S  ++KPDLGISMNDN G     G  DD+D+R+N  D
Sbjct: 1   MANRWWT-GQVGLPGVETSSSTGSPVLKKPDLGISMNDNSGSGGGSGGRDDEDERENSTD 60

Query: 62  EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFA 121
           EPKEGAVEV TRRPRGRPPGSKNKPKPPIFVTRDSPNAL+SHVME++NG+DIAES+AQFA
Sbjct: 61  EPKEGAVEVATRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFA 120

Query: 122 RRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTI 181
           RRRQRGV VLS SGTVTNVTLRQPSAPGAV+AL GRFEILSLTG FLPGPAPPG+TGLTI
Sbjct: 121 RRRQRGVCVLSASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGATGLTI 180

Query: 182 YLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGG 241
           YLAGGQGQVVGGSVVG L A+GPVMVIA+TFSNATYERLP+EE+EEGGG  AQG     G
Sbjct: 181 YLAGGQGQVVGGSVVGSLVASGPVMVIASTFSNATYERLPIEEDEEGGGA-AQGQLGGNG 240

Query: 242 G---GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
               G+G G+PQ   GG+GDPS+M P+YNLPPNL+PN  GGQLN EA++WAHG  P
Sbjct: 241 SPPLGSG-GAPQQ--GGLGDPSSM-PVYNLPPNLMPN--GGQLNHEAFAWAHGRPP 288

BLAST of CSPI02G02660 vs. NCBI nr
Match: gi|324388027|gb|ADY38789.1| (DNA-binding protein [Coffea arabica])

HSP 1 Score: 403.7 bits (1036), Expect = 2.8e-109
Identity = 224/296 (75.68%), Positives = 252/296 (85.14%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISMNDNGGPVHSGG--DDDDDRDNGGD 61
           MANRWWT GQ+GLPGV+  +ST S  ++KPDLGISMNDN G     G  DD+D+R+N  D
Sbjct: 1   MANRWWT-GQVGLPGVETSSSTGSPVLKKPDLGISMNDNSGSGGGSGGRDDEDERENSTD 60

Query: 62  EPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESVAQFA 121
           EPKEGAVEV TRRPRGRPPGSKNKPKPPIFVTRDSPNAL+SHVME++NG+DIAES+AQFA
Sbjct: 61  EPKEGAVEVATRRPRGRPPGSKNKPKPPIFVTRDSPNALRSHVMEVANGSDIAESIAQFA 120

Query: 122 RRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGSTGLTI 181
           RRRQRGV VLS SGTVTNVTLRQPSAPGAV+AL GRFEILSLTG FLPGPAPPG+TGLTI
Sbjct: 121 RRRQRGVCVLSASGTVTNVTLRQPSAPGAVMALHGRFEILSLTGAFLPGPAPPGATGLTI 180

Query: 182 YLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHTSAGG 241
           YLAGGQGQVVGGSVVG L A+GPVMVIA+TFSNATYERLP+EE+EEGGG  AQG     G
Sbjct: 181 YLAGGQGQVVGGSVVGSLVASGPVMVIASTFSNATYERLPIEEDEEGGGA-AQGQLGGNG 240

Query: 242 G---GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHGGRP 292
               G+G G+PQ   GG+GDPS+M P+Y+LPPNL+PN  GGQLN EA++WAHG  P
Sbjct: 241 SPPLGSG-GAPQQ--GGLGDPSSM-PVYSLPPNLMPN--GGQLNHEAFAWAHGRPP 288

BLAST of CSPI02G02660 vs. NCBI nr
Match: gi|568869307|ref|XP_006487869.1| (PREDICTED: AT-hook motif nuclear-localized protein 19 [Citrus sinensis])

HSP 1 Score: 400.6 bits (1028), Expect = 2.4e-108
Identity = 226/305 (74.10%), Positives = 251/305 (82.30%), Query Frame = 1

Query: 2   MANRWWTSGQMGLPGVD-HTSTSSSAMRKPDLGISM--NDNG----GPVHSGGDDDDDRD 61
           MANRWWT GQ+GLPG+D  T+TSSS M+KPDLGIS+  N+NG    G    GGD++DDR+
Sbjct: 1   MANRWWT-GQVGLPGMDGSTATSSSPMKKPDLGISIMANNNGESGSGGGGGGGDEEDDRE 60

Query: 62  NGGDEPKEGAVEVPTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEISNGADIAESV 121
           +  DEP+EGA+E+ TRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEI+NGAD+AE++
Sbjct: 61  HS-DEPREGAIEISTRRPRGRPPGSKNKPKPPIFVTRDSPNALKSHVMEIANGADVAETL 120

Query: 122 AQFARRRQRGVSVLSGSGTVTNVTLRQPSAPGAVLALQGRFEILSLTGTFLPGPAPPGST 181
           A FARRRQRGV VLSGSGTVTNVTLRQPS P AV+A+ GRFEILSLTG FLPGPAPPGST
Sbjct: 121 ANFARRRQRGVCVLSGSGTVTNVTLRQPSDPSAVMAIHGRFEILSLTGAFLPGPAPPGST 180

Query: 182 GLTIYLAGGQGQVVGGSVVGPLTAAGPVMVIAATFSNATYERLPLEEEEEGGGVGAQGHT 241
           GLTIYLAGGQGQVVGGSVVG L A+GPVMVIAATFSNATYERLPL+EEEE GG GAQG  
Sbjct: 181 GLTIYLAGGQGQVVGGSVVGSLVASGPVMVIAATFSNATYERLPLDEEEE-GGAGAQGPL 240

Query: 242 SAGGG------GAGDGSPQGIGGGVGDPSAMTPLYNLPPNLLPNGGGGQLNQEAYSWAHG 294
             GGG      G G G   G GGG+GDPS M    NLPPNL+ N  GGQL+ EAY WAH 
Sbjct: 241 GGGGGGGSGSSGGGGGGAGGGGGGIGDPSGMGVYNNLPPNLVAN--GGQLSHEAYGWAH- 299

The following BLAST results are available for this feature:

BLAST vs. Swiss-Prot
Match Name    E-value    Identity (%)   Description
AHL19_ARATH   2.0e-94    67.85          AT-hook motif nuclear-localized protein 19 OS=Arabidopsis thaliana GN=AHL19 PE=2...
AHL20_ARATH   1.0e-87    63.67          AT-hook motif nuclear-localized protein 20 OS=Arabidopsis thaliana GN=AHL20 PE=2...
AHL23_ARATH   7.2e-65    51.90          AT-hook motif nuclear-localized protein 23 OS=Arabidopsis thaliana GN=AHL23 PE=1...
AHL24_ARATH   1.3e-61    58.10          AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2...
AHL26_ARATH   1.7e-61    57.63          AT-hook motif nuclear-localized protein 26 OS=Arabidopsis thaliana GN=AHL26 PE=2...

BLAST vs. TrEMBL
Match Name         E-value     Identity (%)   Description
A0A0A0LL85_CUCSA   2.4e-160    100.00         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G010120 PE=4 SV=1
F2Y9E5_COFAR       5.1e-110    76.01          DNA-binding protein OS=Coffea arabica GN=MA17P03.7 PE=4 SV=1
C0ILP0_COFCA       5.1e-110    76.01          Uncharacterized protein OS=Coffea canephora GN=46C02.8 PE=4 SV=1
F1DGA1_COFAR       2.0e-109    75.68          DNA-binding protein OS=Coffea arabica GN=MA29G21.8 PE=4 SV=1
A0A067DA75_CITSI   1.7e-108    74.10          Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022331mg PE=4 SV=1

BLAST vs. TAIR10
Match Name    E-value    Identity (%)   Description
AT3G04570.1   1.1e-95    67.85          AT-hook motif nuclear-localized protein 19
AT4G14465.1   5.8e-89    63.67          AT-hook motif nuclear-localized protein 20
AT4G17800.1   4.1e-66    51.90          Predicted AT-hook DNA-binding family protein
AT4G22810.1   7.2e-63    58.10          Predicted AT-hook DNA-binding family protein
AT4G12050.1   9.4e-63    57.63          Predicted AT-hook DNA-binding family protein
Match Name	E-value	Identity	Description
gi|449443241|ref|XP_004139388.1|	3.5e-160	100.00	PREDICTED: AT-hook motif nuclear-localized protein 19 [Cucumis sativus]
gi|659070721|ref|XP_008456342.1|	5.2e-156	98.64	PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo]
gi|167600640|gb|ABZ89182.1|	7.4e-110	76.01	putative protein [Coffea canephora]
gi|324388027|gb|ADY38789.1|	2.8e-109	75.68	DNA-binding protein [Coffea arabica]
gi|568869307|ref|XP_006487869.1|	2.4e-108	74.10	PREDICTED: AT-hook motif nuclear-localized protein 19 [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR005175	PPC_dom
IPR014476	AT-hook_nuclear
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0050832	defense response to fungus
biological_process	GO:0010359	regulation of anion channel activity
biological_process	GO:0000041	transition metal ion transport
biological_process	GO:0006811	ion transport
biological_process	GO:0006355	regulation of transcription, DNA-templated
biological_process	GO:1900425	negative regulation of defense response to bacterium
biological_process	GO:0045824	negative regulation of innate immune response
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003677	DNA binding
molecular_function	GO:0003674	molecular_function
molecular_function	GO:0003680	AT DNA binding
molecular_function	GO:0003700	transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI02G02660.1	CSPI02G02660.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005175	PPC domain	PFAM	PF03479	DUF296	coord: 99..210; score: 1.5
IPR005175	PPC domain	PROFILE	PS51742	PPC	coord: 94..232; score: 3
IPR014476	AT-hook motif nuclear-localised protein	PIR	PIRSF016021	ESCAROLA	coord: 11..293; score: 6.3E
None	No IPR available	GENE3D	G3DSA:3.30.1330.80		coord: 98..222; score: 2.1
None	No IPR available	PANTHER	PTHR31100	FAMILY NOT NAMED	coord: 1..293; score: 1.6E
None	No IPR available	PANTHER	PTHR31100:SF8	SUBFAMILY NOT NAMED	coord: 1..293; score: 1.6E
None	No IPR available	unknown	SSF117856	AF0104/ALDC/Ptd012-like	coord: 95..222; score: 1.83

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
CSPI02G02660	Cla013628	Watermelon (97103) v1	cpiwmB149
CSPI02G02660	Cla021549	Watermelon (97103) v1	cpiwmB127
CSPI02G02660	Csa2G010120	Cucumber (Chinese Long) v2	cpicuB065
CSPI02G02660	Csa3G141840	Cucumber (Chinese Long) v2	cpicuB074
CSPI02G02660	MELO3C006450	Melon (DHL92) v3.5.1	cpimeB146
CSPI02G02660	MELO3C004413	Melon (DHL92) v3.5.1	cpimeB139
CSPI02G02660	ClCG02G016940	Watermelon (Charleston Gray)	cpiwcgB128
CSPI02G02660	Lsi11G004150	Bottle gourd (USVL1VR-Ls)	cpilsiB096
CSPI02G02660	MELO3C004413.2	Melon (DHL92) v3.6.1	cpimedB132
CSPI02G02660	MELO3C006450.2	Melon (DHL92) v3.6.1	cpimedB138
CSPI02G02660	CsaV3_2G003830	Cucumber (Chinese Long) v3	cpicucB080
CSPI02G02660	CsaV3_3G012380	Cucumber (Chinese Long) v3	cpicucB092
CSPI02G02660	Cla97C02G042660	Watermelon (97103) v2	cpiwmbB120
CSPI02G02660	Cla97C05G085410	Watermelon (97103) v2	cpiwmbB136
CSPI02G02660	Bhi06G001561	Wax gourd	cpiwgoB135
CSPI02G02660	Cucsa.132980	Cucumber (Gy14) v1	cgycpiB187
CSPI02G02660	CmaCh12G011430	Cucurbita maxima (Rimu)	cmacpiB174
CSPI02G02660	CmaCh05G012270	Cucurbita maxima (Rimu)	cmacpiB800
CSPI02G02660	CmoCh05G012540	Cucurbita moschata (Rifu)	cmocpiB787
CSPI02G02660	CmoCh12G011640	Cucurbita moschata (Rifu)	cmocpiB159
CSPI02G02660	Cp4.1LG07g09820	Cucurbita pepo (Zucchini)	cpecpiB821
CSPI02G02660	CsGy2G002770	Cucumber (Gy14) v2	cgybcpiB056
CSPI02G02660	CsGy3G012360	Cucumber (Gy14) v2	cgybcpiB103
CSPI02G02660	Carg23002	Silver-seed gourd	carcpiB1109
CSPI02G02660	Carg17436	Silver-seed gourd	carcpiB0286
The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
CSPI02G02660	CSPI03G12260	Wild cucumber (PI 183967)	cpicpiB062
The following block(s) cover this gene:
Gene	Organism	Block
CSPI02G02660	Wax gourd	cpiwgoB137
CSPI02G02660	Cucurbita pepo (Zucchini)	cpecpiB100