CSPI02G22120 (gene) Wild cucumber (PI 183967)

Name: CSPI02G22120
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: GATA transcription factor-like protein
Location: Chr2 : 19314231 .. 19316039 (-)
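
The location above can be turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the helper name parse_location and its regular expression are illustrative and not part of the database.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr2 : 19314231 .. 19316039 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr2 : 19314231 .. 19316039 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr2 - 1809
```

Under that assumption the gene spans 1809 bp on the minus strand of Chr2.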
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
ATTTTTTCCCCCCAAAAAAACACCAAAAACACACAAACACACACACACACATACACACAAAAACCTCCTTAATCATGGAAGCTCCTGAATATTTCCAGATCAATGCCTACTCCTCCCAATTCTCCTCCCCCGACGACGCCGATGCCACCACCACCGCCGCCGCTGCCGCTGCACCCGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACAACGAAGATGACGCCGTCCTTACTGACTCTGGAGGAGGAGGAGGAGGAGGAGGTGGAGGAGGAGGAGGATTGTTTTACAACAATAATAATACTTCTACTAATGACCATAACAATAACAATAATTCAACGGAATCTTCCGCCGTTACCGTGATGGAGAGTTGCAATTCCTCATCCTCCTTCTTTGAAGATATTAGTGGCTCTAATTTAGGCGATGCCCATTTCTCCAGCGAACTCTGCGTTCCGGTAATTCCCCCTCTCCAAAACCATTCGTATTTAGGCGATGCCCATTTCTTCCCCTTTTTTTCTTTTCCAGATTTCTTACCTTTTTTCTCTTTACGTATACATTCAATATTTTCTTTTTTCTTTTTTCTTTTTTTTTTTACCTTTTTTGTATTATATATGTATAGATATAACATTATGACACACATATATATAATATATACTTTGGTATTTGTATTGTTTTTGTCTATATATGTTACTTCTAAAATTAATCTGTTAGATATATTTCAATGGTACTTGAAGAACTTATTACTAGAAAGATATATCATTTGAATTTTTGTTGAGTTTTATTTGTCATGGTAGAATTAAGTGACAAATTTACATAATTAAATATCATGAAAATTTATGTAAAATAATCAAAACTGTTTCTTAATAAATATTTAGATATAATTTTTGACAACTAAGAATGTACAATTATTTGTAATGTCAAATTGTAAAACTTAACTTAGTTTACTTTTATTTGTGTAAATTTCTTGAAAGAAAGAAAGGAAAATTTGTAGTGGCCACGTATTTGAAGCCAATAAAAGTGTGGTACGTGGCAATTGTTTTTTTAATATAGGTATTTTTTGTGGTACAGTATGACGATTTAGCTGAGCTGGAATGGCTTTCAAACTTTGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTAGAACTCATCTCGGGAGTCAAAGTCAAATCCGACGAACCCCCCACTCAATCCCCACAACCCACGGCCACCCGAAGCGCAGCAGCAATTTTCAAACCGGAAATTGTTTCCGTTCCGGCTAAAGCCCGCAGCAAACGCTCACGTGCCCTCCCATCTAATTGGAACAACTCCGCCCTCCTTCCTCTTTCTTCTCCCACTGCCGAACCCGAAACTACACCACCCATCGAACAACCACATCCCATTAAAAAAACCCTTCCCAAGGCGGCAGCTACGGCTAAGAAGAAGGACAGTCCAGACTTAGGATTTTCATCCGGAGAGGGGCGTAAGTGCATGCACTGCGCCACCGACAAGACACCCCAGTGGCGTACTGGCCCAATGGGCCCAAAAACATTGTGTAATGCTTGTGGGGTTCGGTACAAATCCGGCCGCTTGGTTCCGGAGTATCGCCCTGCCGCAAGCCCCACCTTTGTTTTAACGAAACACTCTAATTCTCATCGGAAAGTTTTGGAGCTTCGACGACAAAAGGAGATTCTTAGAGCCCAACAACAGCAACCACAGCATTTGCTTTTGGATCATCGTCAGGATATGATCTTTGATGCATCAAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCAGATTTCCGGCAGCTGATCTGA

mRNA sequence

ATGGAAGCTCCTGAATATTTCCAGATCAATGCCTACTCCTCCCAATTCTCCTCCCCCGACGACGCCGATGCCACCACCACCGCCGCCGCTGCCGCTGCACCCGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACAACGAAGATGACGCCGTCCTTACTGACTCTGGAGGAGGAGGAGGAGGAGGAGGTGGAGGAGGAGGAGGATTGTTTTACAACAATAATAATACTTCTACTAATGACCATAACAATAACAATAATTCAACGGAATCTTCCGCCGTTACCGTGATGGAGAGTTGCAATTCCTCATCCTCCTTCTTTGAAGATATTAGTGGCTCTAATTTAGGCGATGCCCATTTCTCCAGCGAACTCTGCGTTCCGTATGACGATTTAGCTGAGCTGGAATGGCTTTCAAACTTTGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTAGAACTCATCTCGGGAGTCAAAGTCAAATCCGACGAACCCCCCACTCAATCCCCACAACCCACGGCCACCCGAAGCGCAGCAGCAATTTTCAAACCGGAAATTGTTTCCGTTCCGGCTAAAGCCCGCAGCAAACGCTCACGTGCCCTCCCATCTAATTGGAACAACTCCGCCCTCCTTCCTCTTTCTTCTCCCACTGCCGAACCCGAAACTACACCACCCATCGAACAACCACATCCCATTAAAAAAACCCTTCCCAAGGCGGCAGCTACGGCTAAGAAGAAGGACAGTCCAGACTTAGGATTTTCATCCGGAGAGGGGCGTAAGTGCATGCACTGCGCCACCGACAAGACACCCCAGTGGCGTACTGGCCCAATGGGCCCAAAAACATTGTGTAATGCTTGTGGGGTTCGGTACAAATCCGGCCGCTTGGTTCCGGAGTATCGCCCTGCCGCAAGCCCCACCTTTGTTTTAACGAAACACTCTAATTCTCATCGGAAAGTTTTGGAGCTTCGACGACAAAAGGAGATTCTTAGAGCCCAACAACAGCAACCACAGCATTTGCTTTTGGATCATCGTCAGGATATGATCTTTGATGCATCAAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCAGATTTCCGGCAGCTGATCTGA

Coding sequence (CDS)

ATGGAAGCTCCTGAATATTTCCAGATCAATGCCTACTCCTCCCAATTCTCCTCCCCCGACGACGCCGATGCCACCACCACCGCCGCCGCTGCCGCTGCACCCGACCATTTCATCGTCGAGGAGCTTCTCGATTTCTCCAACAACGAAGATGACGCCGTCCTTACTGACTCTGGAGGAGGAGGAGGAGGAGGAGGTGGAGGAGGAGGAGGATTGTTTTACAACAATAATAATACTTCTACTAATGACCATAACAATAACAATAATTCAACGGAATCTTCCGCCGTTACCGTGATGGAGAGTTGCAATTCCTCATCCTCCTTCTTTGAAGATATTAGTGGCTCTAATTTAGGCGATGCCCATTTCTCCAGCGAACTCTGCGTTCCGTATGACGATTTAGCTGAGCTGGAATGGCTTTCAAACTTTGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTAGAACTCATCTCGGGAGTCAAAGTCAAATCCGACGAACCCCCCACTCAATCCCCACAACCCACGGCCACCCGAAGCGCAGCAGCAATTTTCAAACCGGAAATTGTTTCCGTTCCGGCTAAAGCCCGCAGCAAACGCTCACGTGCCCTCCCATCTAATTGGAACAACTCCGCCCTCCTTCCTCTTTCTTCTCCCACTGCCGAACCCGAAACTACACCACCCATCGAACAACCACATCCCATTAAAAAAACCCTTCCCAAGGCGGCAGCTACGGCTAAGAAGAAGGACAGTCCAGACTTAGGATTTTCATCCGGAGAGGGGCGTAAGTGCATGCACTGCGCCACCGACAAGACACCCCAGTGGCGTACTGGCCCAATGGGCCCAAAAACATTGTGTAATGCTTGTGGGGTTCGGTACAAATCCGGCCGCTTGGTTCCGGAGTATCGCCCTGCCGCAAGCCCCACCTTTGTTTTAACGAAACACTCTAATTCTCATCGGAAAGTTTTGGAGCTTCGACGACAAAAGGAGATTCTTAGAGCCCAACAACAGCAACCACAGCATTTGCTTTTGGATCATCGTCAGGATATGATCTTTGATGCATCAAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCAGATTTCCGGCAGCTGATCTGA
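
The coding sequence can be translated to check it against the query protein shown in the alignments below. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name translate and the variable cds_prefix are illustrative, and only the first ten codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGGAAGCTCCTGAATATTTCCAGATCAAT"
print(translate(cds_prefix))  # MEAPEYFQIN
```

The result matches the first ten residues of the query sequence (MEAPEYFQIN) in the full-length alignments.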
BLAST of CSPI02G22120 vs. Swiss-Prot
Match: GAT12_ARATH (GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 4.7e-69
Identity = 167/316 (52.85%), Positives = 203/316 (64.24%), Query Frame = 1

Query: 78  TSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLA-ELE 137
           ++ +D  N+  +  ++  T+ +S N S++      G       FS +LC+P DDLA ELE
Sbjct: 24  SNDDDEENDVVADSTTTTTITDSSNFSAADLPSFHGDVQDGTSFSGDLCIPSDDLADELE 83

Query: 138 WLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSAAAIFKPEIVSVPAKAR 197
           WLSN V+ES S ED+ KLELISG K + D P + +  P    S++ IF  + VSVPAKAR
Sbjct: 84  WLSNIVDESLSPEDVHKLELISGFKSRPD-PKSDTGSPENPNSSSPIFTTD-VSVPAKAR 143

Query: 198 SKRSRALPSNWNNSALL-------PLSSPTA-------EPETTPPIEQPHPIKKTLPKAA 257
           SKRSRA   NW +  LL       P +  T         P T+PP+    P+ K      
Sbjct: 144 SKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQHLSPPTSPPLLMA-PLGKKQAVDG 203

Query: 258 ATAKKKD--SPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 317
              +KKD  SP+ G    E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 204 GHRRKKDVSSPESG--GAEERRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 263

Query: 318 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQD--MIFD-ASNGD 374
           YRPAASPTFVL KHSNSHRKV+ELRRQKE+ RA  +   H    H  D  MIFD +S+GD
Sbjct: 264 YRPAASPTFVLAKHSNSHRKVMELRRQKEMSRAHHEFIHH---HHGTDTAMIFDVSSDGD 323
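
The percentages in each HSP header are plain ratios over the alignment length. A minimal sketch of recomputing them from a header line follows; the function name hsp_percentages is illustrative, and the pattern also accepts the "Postives" spelling that appears in some exports.

```python
import re

def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from an HSP statistics line."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", stats_line)
    if m is None:
        raise ValueError(f"unrecognized HSP line: {stats_line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 167/316 (52.85%), Positives = 203/316 (64.24%), Query Frame = 1"
print(hsp_percentages(line))  # (52.85, 64.24)
```

Identity counts exactly matching residues, while positives also count conservative substitutions (the "+" marks in the middle line of the alignment).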

BLAST of CSPI02G22120 vs. Swiss-Prot
Match: GATA9_ARATH (GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 5.2e-60
Identity = 167/352 (47.44%), Positives = 208/352 (59.09%), Query Frame = 1

Query: 31  AAAPDHFIVEELLDFSNNEDDAVLTDSGGGGGGGGGGGGGLFYNNNNTSTNDHNNNNNST 90
           A  PD F+V++LLDFSN  DD  + D             GL            N   +S+
Sbjct: 12  AGNPDSFVVDDLLDFSN--DDGEVDD-------------GL------------NTLPDSS 71

Query: 91  ESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSED 150
             S  T+ +S NSSS F          D    S+L +P DD+AELEWLSNFVEESF+ ED
Sbjct: 72  TLSTGTLTDSSNSSSLFT---------DGTGFSDLYIPNDDIAELEWLSNFVEESFAGED 131

Query: 151 MQKLELISGVKVKSDEPPTQS----PQPTATRSAAAIFKPEIVSVPAKARSKRSRALPSN 210
             KL L SG+K       T +    P+P        I +   V+VPAKARSKRSR+  S 
Sbjct: 132 QDKLHLFSGLKNPQTTGSTLTHLIKPEPELDHQFIDIDESN-VAVPAKARSKRSRSAAST 191

Query: 211 WNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHC 270
           W  S LL L    A+ + T P ++   +K+        A   D  D G  SG GR+C+HC
Sbjct: 192 WA-SRLLSL----ADSDETNPKKKQRRVKEQ-----DFAGDMDV-DCG-ESGGGRRCLHC 251

Query: 271 ATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRR 330
           AT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRR
Sbjct: 252 ATEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRR 308

Query: 331 QKEILRAQQQQPQHLLLDHR-QDMIFD-ASNGDDYLIH---QHVGPDFRQLI 374
           QKE+      + +HLL   R ++++ D  SNG+D+L+H    HV PDFR LI
Sbjct: 312 QKEM------RDEHLLSQLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of CSPI02G22120 vs. Swiss-Prot
Match: GATA2_ARATH (GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 3.3e-38
Identity = 114/282 (40.43%), Positives = 148/282 (52.48%), Query Frame = 1

Query: 72  FYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDD 131
           F N +  S +    +  +T SS+    ++     SF      S+     F  ++CVP DD
Sbjct: 20  FSNEDIFSASSSGGSTAATSSSSFPPPQN----PSFHHHHLPSSADHHSFLHDICVPSDD 79

Query: 132 LAELEWLSNFVEESFSSEDMQKLE-LISGVKVKSDEPPTQSPQPTATRSAAAIFKPEIVS 191
            A LEWLS FV++SF+      L   ++ VK ++                         S
Sbjct: 80  AAHLEWLSQFVDDSFADFPANPLGGTMTSVKTET-------------------------S 139

Query: 192 VPAKARSKRSRALPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPKAAATA----- 251
            P K RSKRSRA        + +PL S           +Q H   K  PK   +      
Sbjct: 140 FPGKPRSKRSRAPAPFAGTWSPMPLESEH---------QQLHSAAKFKPKKEQSGGGGGG 199

Query: 252 --KKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP 311
             + + S       G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRP
Sbjct: 200 GGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVPEYRP 259

Query: 312 AASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDH 346
           A+SPTFVLT+HSNSHRKV+ELRRQKE++R    QPQ + L H
Sbjct: 260 ASSPTFVLTQHSNSHRKVMELRRQKEVMR----QPQQVQLHH 259

BLAST of CSPI02G22120 vs. Swiss-Prot
Match: GATA4_ARATH (GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 2.8e-37
Identity = 116/269 (43.12%), Positives = 143/269 (53.16%), Query Frame = 1

Query: 80  TNDHNNNNNST-ESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLAELEWL 139
           +ND   +++ST  SSA +   S  +  SF      S      F+ +LCVP DD A LEWL
Sbjct: 21  SNDEIFSSSSTVTSSAASSAASSENPFSFPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWL 80

Query: 140 SNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSAAAIFKPEIVSVPAKARSK 199
           S FV++SFS      L +                            +PEI S   K RS+
Sbjct: 81  SRFVDDSFSDFPANPLTMT--------------------------VRPEI-SFTGKPRSR 140

Query: 200 RSRA----LPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPKAAATAKKKDSPDLG 259
           RSRA    +   W         +P +E E    + +P P KK     + TA         
Sbjct: 141 RSRAPAPSVAGTW---------APMSESELCHSVAKPKP-KKVYNAESVTADG------- 200

Query: 260 FSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKH 319
                 R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+H
Sbjct: 201 -----ARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQH 240

Query: 320 SNSHRKVLELRRQKE----ILRAQQQQPQ 340
           SNSHRKV+ELRRQKE     +R    QPQ
Sbjct: 261 SNSHRKVMELRRQKEQQESCVRIPPFQPQ 240

BLAST of CSPI02G22120 vs. Swiss-Prot
Match: GATA5_ARATH (GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 8.7e-31
Identity = 125/330 (37.88%), Positives = 160/330 (48.48%), Query Frame = 1

Query: 24  ATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGGGGGGGGGGGGLFYNNNNTSTNDH 83
           A TTA    + D F V++LLD SN   D V  D                +     S+ + 
Sbjct: 28  AVTTAQNGFSVDDFSVDDLLDLSN---DDVFADEETDLKAQ--------HEMVRVSSEEP 87

Query: 84  NNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLAELEWLSNFVE 143
           N++ ++   S+               D SG +   +  +SEL +P DDLA LEWLS+FVE
Sbjct: 88  NDDGDALRRSS---------------DFSGCDDFGSLPTSELSLPADDLANLEWLSHFVE 147

Query: 144 ESFSSEDMQKLELISGVKVKSDEPPTQSP--------QPTATRSAAAIFKPEIVSVPAKA 203
           +SF+          SG  +     PT+ P         P    +    FK     VPAKA
Sbjct: 148 DSFTE--------YSGPNLTGT--PTEKPAWLTGDRKHPVTAVTEETCFKSP---VPAKA 207

Query: 204 RSKRSRALPSNWN---NSALLPLSSPTAEPETTPP-------IEQPHPIKKTLPKAAATA 263
           RSKR+R     W+   +S+  P SS +    ++ P        E   P+  +        
Sbjct: 208 RSKRNRNGLKVWSLGSSSSSGPSSSGSTSSSSSGPSSPWFSGAELLEPVVTSERPPFPKK 267

Query: 264 KKKDSPDLGFSSGE------GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 323
            KK S +  FS GE       RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+P
Sbjct: 268 HKKRSAESVFS-GELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLP 317

Query: 324 EYRPAASPTFVLTKHSNSHRKVLELRRQKE 330
           EYRPA SPTF    HSN HRKV+E+RR+KE
Sbjct: 328 EYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of CSPI02G22120 vs. TrEMBL
Match: A0A0A0LPR5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G373450 PE=4 SV=1)

HSP 1 Score: 715.7 bits (1846), Expect = 3.0e-203
Identity = 372/374 (99.47%), Positives = 372/374 (99.47%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG
Sbjct: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60

Query: 61  GGGGGGGG-GGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA 120
           GGGGGGGG GGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA
Sbjct: 61  GGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA 120

Query: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRS 180
           HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRS
Sbjct: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRS 180

Query: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLP 240
           AAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAE ETTPPIEQPHPIKKTLP
Sbjct: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAESETTPPIEQPHPIKKTLP 240

Query: 241 KAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 300
           KAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP
Sbjct: 241 KAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 300

Query: 301 EYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDY 360
           EYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDY
Sbjct: 301 EYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDY 360

Query: 361 LIHQHVGPDFRQLI 374
           LIHQHVGPDFRQLI
Sbjct: 361 LIHQHVGPDFRQLI 374

BLAST of CSPI02G22120 vs. TrEMBL
Match: M5Y804_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015753mg PE=4 SV=1)

HSP 1 Score: 369.8 bits (948), Expect = 4.0e-99
Identity = 232/410 (56.59%), Positives = 272/410 (66.34%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFS-----SPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLT 60
           MEAPEYFQ N++  QF+     S D+ +   T       DHF+VE+LLDFSN  DDAV+T
Sbjct: 1   MEAPEYFQ-NSFCPQFTPEKRHSFDNNNNKATNGGGGGGDHFMVEDLLDFSN--DDAVIT 60

Query: 61  DSGGGGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSS------SFFE 120
           D           GG  F           N   NST+SS +TV++SCNSSS      +   
Sbjct: 61  D-----------GGATF----------DNATGNSTDSSTITVIDSCNSSSLSGSEPNVIP 120

Query: 121 DISGSNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPT 180
           DI   N+ +  FSS+LCVPYDDLAELEWLSNFVEESFSSEDMQKL+LISG+K + DE  +
Sbjct: 121 DIGSRNITEGPFSSDLCVPYDDLAELEWLSNFVEESFSSEDMQKLQLISGMKARPDEAAS 180

Query: 181 QS----PQPTATRSAA------AIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPT 240
           ++    P+P    +A        IF P+ VSVPAKARSKRSR  P NW  S LL LS PT
Sbjct: 181 ETRQFQPEPNRNDNAHNTTTNNPIFNPD-VSVPAKARSKRSRGAPCNWT-SRLLLLSQPT 240

Query: 241 AEPETTPPI----EQPHPIKKTLPKAAATA--KKKDSPD-LGFSSGEGRKCMHCATDKTP 300
           +  E +  +    E P P   T  K    +  KKK+SP+ LG   G+GRKC+HCATDKTP
Sbjct: 241 SSSEQSDVVSGAPESPLPPPSTTGKKTVKSVPKKKESPEGLGGGPGDGRKCLHCATDKTP 300

Query: 301 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILR 360
           QWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLTKHSNSHRKVLELRRQKE++R
Sbjct: 301 QWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTKHSNSHRKVLELRRQKEMVR 360

Query: 361 AQQQ-----QPQ----HLLLDHRQDMIFDASNGDDYLIHQHVGPDFRQLI 374
           AQQQ      PQ    H    H Q+M+FD SNG DYLIHQH+GPDFRQLI
Sbjct: 361 AQQQFIHQVPPQQHHHHHHHHHHQNMVFDVSNGGDYLIHQHMGPDFRQLI 384

BLAST of CSPI02G22120 vs. TrEMBL
Match: V4WC04_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008690mg PE=4 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 2.4e-96
Identity = 227/402 (56.47%), Positives = 263/402 (65.42%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           ME PE+FQ  +Y +QFS+       +  ++    DHFIVEELLDFSN  +DA+LTD+   
Sbjct: 1   MEVPEFFQ-GSYCAQFSAEKHHSLDSNKSSNGG-DHFIVEELLDFSN--EDAILTDAAA- 60

Query: 61  GGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSS------SFFEDISG- 120
                                  +   NST+SS VTV++SCNSSS      +F  + +G 
Sbjct: 61  ---------------------FDDVTANSTDSSTVTVVDSCNSSSFSGCGPNFPGENNGC 120

Query: 121 SNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEP------ 180
            N  DAHFS +LCVPYDDLAELEWLSN VEESFS ED+QKL+LISG+K +SD        
Sbjct: 121 RNFSDAHFSGDLCVPYDDLAELEWLSNIVEESFSCEDLQKLQLISGMKARSDHSSETCQF 180

Query: 181 ----------PTQSPQPTATRSAAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSS 240
                      T +   T       +F PE ++VPAKARSKRSRA P +W  S LL LS 
Sbjct: 181 QPGTNRINHGSTNTSNNTNANPNNPVFNPE-MAVPAKARSKRSRAAPCSW-ASRLLVLSP 240

Query: 241 P--TAEPETTPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWR 300
           P  T+EPE  P    P P++      A  +KKKDS D G  +GEGRKC+HCATDKTPQWR
Sbjct: 241 PESTSEPEIIPTGLPPPPLQGKKSVKACGSKKKDSGDEG--NGEGRKCLHCATDKTPQWR 300

Query: 301 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQ 360
           TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+ RAQQ
Sbjct: 301 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRAQQ 360

Query: 361 QQPQHLLL----DHRQDMIFDASNGDDYLIHQHVGPDFRQLI 374
           QQ Q         H Q+M+FD SNGDDYLIHQHVGPDFRQLI
Sbjct: 361 QQHQQQQFMHHHHHHQNMMFDLSNGDDYLIHQHVGPDFRQLI 372

BLAST of CSPI02G22120 vs. TrEMBL
Match: A0A067GJP0_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017390mg PE=4 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 7.0e-96
Identity = 228/402 (56.72%), Positives = 265/402 (65.92%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           ME PE+FQ  +Y +QFS+       +  ++    DHFIVEELLDFSN  +DA+LTD+   
Sbjct: 1   MEVPEFFQ-GSYCAQFSAEKHHSLDSNKSSNGG-DHFIVEELLDFSN--EDAILTDAAAF 60

Query: 61  GGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSS------SFFEDISGS 120
                                  +   NST+SS VTV++SCNSSS      +F  + +G 
Sbjct: 61  D----------------------DVTANSTDSSTVTVVDSCNSSSFSGCGPNFPGENNGC 120

Query: 121 -NLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPP-TQSP 180
            N  DAHFS +LCVPYDDLAELEWLSN VEESFS ED+QKL+LISG+K +SD    T+  
Sbjct: 121 RNFSDAHFSGDLCVPYDDLAELEWLSNIVEESFSCEDLQKLQLISGMKARSDHSSETRQF 180

Query: 181 QPTATRSAAA---------------IFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSS 240
           QP   R                   +F PE+ +VPAKARSKRSRA P +W  S LL LS 
Sbjct: 181 QPGTNRIYHGSTNTSNNTNANPNNPVFNPEM-AVPAKARSKRSRAAPCSWA-SRLLVLSP 240

Query: 241 P--TAEPETTPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWR 300
           P  T+EPE  P    P P++      A  +KKKDS D G  +GEGRKC+HCATDKTPQWR
Sbjct: 241 PESTSEPEIIPTGPPPPPLQGKKSVKACGSKKKDSGDEG--NGEGRKCLHCATDKTPQWR 300

Query: 301 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQ 360
           TGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLTKHSNSHRKVLELRRQKE+ RAQQ
Sbjct: 301 TGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTKHSNSHRKVLELRRQKELQRAQQ 360

Query: 361 QQPQHLLL----DHRQDMIFDASNGDDYLIHQHVGPDFRQLI 374
           QQ Q         H Q+M+FD SNGDDYLIHQHVGPDFRQLI
Sbjct: 361 QQHQQQQFMHHHHHHQNMMFDLSNGDDYLIHQHVGPDFRQLI 372

BLAST of CSPI02G22120 vs. TrEMBL
Match: B9H8Y3_POPTR (Zinc finger family protein OS=Populus trichocarpa GN=POPTR_0006s25410g PE=4 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 1.0e-94
Identity = 224/387 (57.88%), Positives = 264/387 (68.22%), Query Frame = 1

Query: 1   MEAPEYFQINAY-SSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGG 60
           MEAPE +    + SSQF+S +   +  +  +    DHFIVE+LLDFSN ++DA++TD   
Sbjct: 1   MEAPELYGTTGFCSSQFTSNEKHHSLDSNKSIGGGDHFIVEDLLDFSNEDEDAMVTDP-- 60

Query: 61  GGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA 120
                         +NNN  T       NST+SS VT ++SCNSSS    + SG N GD 
Sbjct: 61  --------------SNNNIVTP----TTNSTDSSTVTFVDSCNSSSFSGCEPSGFN-GDI 120

Query: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPP-TQSPQPTATR 180
               ELCVPYDDLAELEWLSNFVEESFSSED+Q+L+LISG+K + DE   T+  Q     
Sbjct: 121 ---GELCVPYDDLAELEWLSNFVEESFSSEDLQRLQLISGMKARPDESSETRHFQSDDNN 180

Query: 181 SAAA--------IFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTA--EPETTPPI 240
           +           +F PE+ +VPAKARSKRSRA P NW  S LL LS  T+  EPE  P  
Sbjct: 181 NGNVSNICNNNTMFNPEM-AVPAKARSKRSRAAPGNWA-SRLLVLSRTTSSSEPEIIPGS 240

Query: 241 EQ-PHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNA 300
            Q P+  KKT+ K A   KK+D    G   G+GRKC+HCATDKTPQWRTGPMGPKTLCNA
Sbjct: 241 TQHPNSGKKTI-KGAVGLKKRDGDVEG---GDGRKCLHCATDKTPQWRTGPMGPKTLCNA 300

Query: 301 CGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQ 360
           CGVRYKSGRLVPEYRPAASPTF+LTKHSNSHRKVLELRRQKE++RAQQQ   H  L H Q
Sbjct: 301 CGVRYKSGRLVPEYRPAASPTFMLTKHSNSHRKVLELRRQKEMVRAQQQHQHHQYLHHHQ 357

Query: 361 DMIFDASN-GDDYLIHQHVGPDFRQLI 374
           +M+FD SN GDDYLIHQHVGPDFR++I
Sbjct: 361 NMVFDVSNGGDDYLIHQHVGPDFRRMI 357

BLAST of CSPI02G22120 vs. TAIR10
Match: AT5G25830.1 (AT5G25830.1 GATA transcription factor 12)

HSP 1 Score: 263.1 bits (671), Expect = 2.7e-70
Identity = 167/316 (52.85%), Positives = 203/316 (64.24%), Query Frame = 1

Query: 78  TSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLA-ELE 137
           ++ +D  N+  +  ++  T+ +S N S++      G       FS +LC+P DDLA ELE
Sbjct: 24  SNDDDEENDVVADSTTTTTITDSSNFSAADLPSFHGDVQDGTSFSGDLCIPSDDLADELE 83

Query: 138 WLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSAAAIFKPEIVSVPAKAR 197
           WLSN V+ES S ED+ KLELISG K + D P + +  P    S++ IF  + VSVPAKAR
Sbjct: 84  WLSNIVDESLSPEDVHKLELISGFKSRPD-PKSDTGSPENPNSSSPIFTTD-VSVPAKAR 143

Query: 198 SKRSRALPSNWNNSALL-------PLSSPTA-------EPETTPPIEQPHPIKKTLPKAA 257
           SKRSRA   NW +  LL       P +  T         P T+PP+    P+ K      
Sbjct: 144 SKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQHLSPPTSPPLLMA-PLGKKQAVDG 203

Query: 258 ATAKKKD--SPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 317
              +KKD  SP+ G    E R+C+HCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 204 GHRRKKDVSSPESG--GAEERRCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 263

Query: 318 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQD--MIFD-ASNGD 374
           YRPAASPTFVL KHSNSHRKV+ELRRQKE+ RA  +   H    H  D  MIFD +S+GD
Sbjct: 264 YRPAASPTFVLAKHSNSHRKVMELRRQKEMSRAHHEFIHH---HHGTDTAMIFDVSSDGD 323

BLAST of CSPI02G22120 vs. TAIR10
Match: AT4G32890.1 (AT4G32890.1 GATA transcription factor 9)

HSP 1 Score: 233.0 bits (593), Expect = 2.9e-61
Identity = 167/352 (47.44%), Positives = 208/352 (59.09%), Query Frame = 1

Query: 31  AAAPDHFIVEELLDFSNNEDDAVLTDSGGGGGGGGGGGGGLFYNNNNTSTNDHNNNNNST 90
           A  PD F+V++LLDFSN  DD  + D             GL            N   +S+
Sbjct: 12  AGNPDSFVVDDLLDFSN--DDGEVDD-------------GL------------NTLPDSS 71

Query: 91  ESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSED 150
             S  T+ +S NSSS F          D    S+L +P DD+AELEWLSNFVEESF+ ED
Sbjct: 72  TLSTGTLTDSSNSSSLFT---------DGTGFSDLYIPNDDIAELEWLSNFVEESFAGED 131

Query: 151 MQKLELISGVKVKSDEPPTQS----PQPTATRSAAAIFKPEIVSVPAKARSKRSRALPSN 210
             KL L SG+K       T +    P+P        I +   V+VPAKARSKRSR+  S 
Sbjct: 132 QDKLHLFSGLKNPQTTGSTLTHLIKPEPELDHQFIDIDESN-VAVPAKARSKRSRSAAST 191

Query: 211 WNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHC 270
           W  S LL L    A+ + T P ++   +K+        A   D  D G  SG GR+C+HC
Sbjct: 192 WA-SRLLSL----ADSDETNPKKKQRRVKEQ-----DFAGDMDV-DCG-ESGGGRRCLHC 251

Query: 271 ATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRR 330
           AT+KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRR
Sbjct: 252 ATEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRR 308

Query: 331 QKEILRAQQQQPQHLLLDHR-QDMIFD-ASNGDDYLIH---QHVGPDFRQLI 374
           QKE+      + +HLL   R ++++ D  SNG+D+L+H    HV PDFR LI
Sbjct: 312 QKEM------RDEHLLSQLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of CSPI02G22120 vs. TAIR10
Match: AT2G45050.1 (AT2G45050.1 GATA transcription factor 2)

HSP 1 Score: 160.6 bits (405), Expect = 1.9e-39
Identity = 114/282 (40.43%), Positives = 148/282 (52.48%), Query Frame = 1

Query: 72  FYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDD 131
           F N +  S +    +  +T SS+    ++     SF      S+     F  ++CVP DD
Sbjct: 20  FSNEDIFSASSSGGSTAATSSSSFPPPQN----PSFHHHHLPSSADHHSFLHDICVPSDD 79

Query: 132 LAELEWLSNFVEESFSSEDMQKLE-LISGVKVKSDEPPTQSPQPTATRSAAAIFKPEIVS 191
            A LEWLS FV++SF+      L   ++ VK ++                         S
Sbjct: 80  AAHLEWLSQFVDDSFADFPANPLGGTMTSVKTET-------------------------S 139

Query: 192 VPAKARSKRSRALPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPKAAATA----- 251
            P K RSKRSRA        + +PL S           +Q H   K  PK   +      
Sbjct: 140 FPGKPRSKRSRAPAPFAGTWSPMPLESEH---------QQLHSAAKFKPKKEQSGGGGGG 199

Query: 252 --KKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP 311
             + + S       G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+KSGRLVPEYRP
Sbjct: 200 GGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFKSGRLVPEYRP 259

Query: 312 AASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDH 346
           A+SPTFVLT+HSNSHRKV+ELRRQKE++R    QPQ + L H
Sbjct: 260 ASSPTFVLTQHSNSHRKVMELRRQKEVMR----QPQQVQLHH 259

BLAST of CSPI02G22120 vs. TAIR10
Match: AT3G60530.1 (AT3G60530.1 GATA transcription factor 4)

HSP 1 Score: 157.5 bits (397), Expect = 1.6e-38
Identity = 116/269 (43.12%), Positives = 143/269 (53.16%), Query Frame = 1

Query: 80  TNDHNNNNNST-ESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLAELEWL 139
           +ND   +++ST  SSA +   S  +  SF      S      F+ +LCVP DD A LEWL
Sbjct: 21  SNDEIFSSSSTVTSSAASSAASSENPFSFPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWL 80

Query: 140 SNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSAAAIFKPEIVSVPAKARSK 199
           S FV++SFS      L +                            +PEI S   K RS+
Sbjct: 81  SRFVDDSFSDFPANPLTMT--------------------------VRPEI-SFTGKPRSR 140

Query: 200 RSRA----LPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPKAAATAKKKDSPDLG 259
           RSRA    +   W         +P +E E    + +P P KK     + TA         
Sbjct: 141 RSRAPAPSVAGTW---------APMSESELCHSVAKPKP-KKVYNAESVTADG------- 200

Query: 260 FSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKH 319
                 R+C HCA++KTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLT+H
Sbjct: 201 -----ARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQH 240

Query: 320 SNSHRKVLELRRQKE----ILRAQQQQPQ 340
           SNSHRKV+ELRRQKE     +R    QPQ
Sbjct: 261 SNSHRKVMELRRQKEQQESCVRIPPFQPQ 240

BLAST of CSPI02G22120 vs. TAIR10
Match: AT5G66320.1 (AT5G66320.1 GATA transcription factor 5)

HSP 1 Score: 136.0 bits (341), Expect = 4.9e-32
Identity = 125/330 (37.88%), Positives = 160/330 (48.48%), Query Frame = 1

Query: 24  ATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGGGGGGGGGGGGLFYNNNNTSTNDH 83
           A TTA    + D F V++LLD SN   D V  D                +     S+ + 
Sbjct: 28  AVTTAQNGFSVDDFSVDDLLDLSN---DDVFADEETDLKAQ--------HEMVRVSSEEP 87

Query: 84  NNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAHFSSELCVPYDDLAELEWLSNFVE 143
           N++ ++   S+               D SG +   +  +SEL +P DDLA LEWLS+FVE
Sbjct: 88  NDDGDALRRSS---------------DFSGCDDFGSLPTSELSLPADDLANLEWLSHFVE 147

Query: 144 ESFSSEDMQKLELISGVKVKSDEPPTQSP--------QPTATRSAAAIFKPEIVSVPAKA 203
           +SF+          SG  +     PT+ P         P    +    FK     VPAKA
Sbjct: 148 DSFTE--------YSGPNLTGT--PTEKPAWLTGDRKHPVTAVTEETCFKSP---VPAKA 207

Query: 204 RSKRSRALPSNWN---NSALLPLSSPTAEPETTPP-------IEQPHPIKKTLPKAAATA 263
           RSKR+R     W+   +S+  P SS +    ++ P        E   P+  +        
Sbjct: 208 RSKRNRNGLKVWSLGSSSSSGPSSSGSTSSSSSGPSSPWFSGAELLEPVVTSERPPFPKK 267

Query: 264 KKKDSPDLGFSSGE------GRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 323
            KK S +  FS GE       RKC HC   KTPQWR GPMG KTLCNACGVRYKSGRL+P
Sbjct: 268 HKKRSAESVFS-GELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLP 317

Query: 324 EYRPAASPTFVLTKHSNSHRKVLELRRQKE 330
           EYRPA SPTF    HSN HRKV+E+RR+KE
Sbjct: 328 EYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of CSPI02G22120 vs. NCBI nr
Match: gi|700207683|gb|KGN62802.1| (hypothetical protein Csa_2G373450 [Cucumis sativus])

HSP 1 Score: 715.7 bits (1846), Expect = 4.2e-203
Identity = 372/374 (99.47%), Positives = 372/374 (99.47%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG
Sbjct: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60

Query: 61  GGGGGGGG-GGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA 120
           GGGGGGGG GGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA
Sbjct: 61  GGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDA 120

Query: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRS 180
           HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRS
Sbjct: 121 HFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRS 180

Query: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLP 240
           AAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAE ETTPPIEQPHPIKKTLP
Sbjct: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAESETTPPIEQPHPIKKTLP 240

Query: 241 KAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 300
           KAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP
Sbjct: 241 KAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 300

Query: 301 EYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDY 360
           EYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDY
Sbjct: 301 EYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDY 360

Query: 361 LIHQHVGPDFRQLI 374
           LIHQHVGPDFRQLI
Sbjct: 361 LIHQHVGPDFRQLI 374

BLAST of CSPI02G22120 vs. NCBI nr
Match: gi|778674365|ref|XP_011650196.1| (PREDICTED: GATA transcription factor 12-like [Cucumis sativus])

HSP 1 Score: 682.9 bits (1761), Expect = 3.0e-193
Identity = 356/373 (95.44%), Positives = 356/373 (95.44%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG
Sbjct: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60

Query: 61  GGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAH 120
           GGGG                NDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAH
Sbjct: 61  GGGG----------------NDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAH 120

Query: 121 FSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSA 180
           FSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSA
Sbjct: 121 FSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSA 180

Query: 181 AAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPK 240
           AAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAE ETTPPIEQPHPIKKTLPK
Sbjct: 181 AAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAESETTPPIEQPHPIKKTLPK 240

Query: 241 AAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300
           AAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 241 AAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300

Query: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDYL 360
           YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDYL
Sbjct: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDYL 357

Query: 361 IHQHVGPDFRQLI 374
           IHQHVGPDFRQLI
Sbjct: 361 IHQHVGPDFRQLI 357

BLAST of CSPI02G22120 vs. NCBI nr
Match: gi|659088475|ref|XP_008445001.1| (PREDICTED: GATA transcription factor 12-like [Cucumis melo])

HSP 1 Score: 651.0 bits (1678), Expect = 1.3e-183
Identity = 340/373 (91.15%), Positives = 353/373 (94.64%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           MEAPEYFQINAYSSQFSSPD ADA+TTAAAA  P+HFIVEELLDFSNNEDDAV TD+GGG
Sbjct: 1   MEAPEYFQINAYSSQFSSPDHADASTTAAAA--PEHFIVEELLDFSNNEDDAVFTDAGGG 60

Query: 61  GGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSNLGDAH 120
           GGGGG     LFYNNNNT++NDHNNNNNS ESSA+TVMESCNSSSSFFEDISGSNLGDAH
Sbjct: 61  GGGGG-----LFYNNNNTTSNDHNNNNNSAESSAITVMESCNSSSSFFEDISGSNLGDAH 120

Query: 121 FSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPTATRSA 180
           FSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLEL+SGVKVKSDE P QSPQPTATR+A
Sbjct: 121 FSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTATRTA 180

Query: 181 AAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAEPETTPPIEQPHPIKKTLPK 240
           AAIFKPEIVSVPAKARSKRSRALPSNWNNS+LLPLS PTAEPE T PI QP+ IKK LPK
Sbjct: 181 AAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLS-PTAEPEITAPIGQPYSIKKPLPK 240

Query: 241 AAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300
            AATAKKKD+PD+GFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 241 VAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300

Query: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDASNGDDYL 360
           YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQ QHLLLDHRQDMIFDASNGDDYL
Sbjct: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDHRQDMIFDASNGDDYL 360

Query: 361 IHQHVGPDFRQLI 374
           IHQHVGPDFRQ+I
Sbjct: 361 IHQHVGPDFRQMI 365
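
The Identity and Positives percentages in the HSP header follow directly from the alignment midline: a letter marks an identical residue, a `+` marks a conservative substitution, and a blank marks a mismatch or gap. The Python sketch below illustrates that arithmetic using the last alignment block of the Cucumis melo hit and the totals from its header line. The function names `alignment_stats` and `percent` are hypothetical helpers introduced for illustration, not part of any BLAST tool.

```python
def alignment_stats(query, midline):
    """Count identities and positives in one BLAST alignment block.

    `query` is the aligned query string and `midline` is the line between
    Query and Sbjct. The midline is padded to the query length because
    trailing blanks are often trimmed in text exports.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, len(query)


def percent(part, whole):
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100 * part / whole, 2)


# Last block of the Cucumis melo alignment (query residues 361 onward)
print(alignment_stats("IHQHVGPDFRQLI", "IHQHVGPDFRQ+I"))

# Totals taken from the header line: Identity = 340/373, Positives = 353/373
print(percent(340, 373), percent(353, 373))
```

Summing the per-block counts over every block of an HSP should give the numerators reported in the header, provided the midline has been preserved exactly, including its blanks.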

BLAST of CSPI02G22120 vs. NCBI nr
Match: gi|1009135304|ref|XP_015884916.1| (PREDICTED: GATA transcription factor 12 [Ziziphus jujuba])

HSP 1 Score: 377.9 bits (969), Expect = 2.1e-101
Identity = 232/397 (58.44%), Positives = 270/397 (68.01%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFSSPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLTDSGGG 60
           MEAPE++Q N++  QF  P+   +T    A  A DHFIVE+LLDFSNN  DAV+TD    
Sbjct: 1   MEAPEFYQ-NSFCPQFV-PEKRHSTDNKTAGGA-DHFIVEDLLDFSNN--DAVITD---- 60

Query: 61  GGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSS------SFFEDISGS 120
                    G F           +   NST+SS VTV++SCNSSS      +F  DI   
Sbjct: 61  ---------GAF----------DSVTGNSTDSSTVTVVDSCNSSSFSGCEPNFVGDIGCR 120

Query: 121 NLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVK----SDEPPTQ 180
           +  D +FS +LCVPYDDLAELEWLSNFVEESFSS+D+Q+L+LISG+K       +   T+
Sbjct: 121 SFTDGNFSGDLCVPYDDLAELEWLSNFVEESFSSDDLQRLQLISGMKASRTTDEEASDTR 180

Query: 181 SPQPTATRSAAAIFKPEIVSVPAKARSKRSRALPSNWNNSALL--PLSSPTAEPET---- 240
             QP   R+ A IF  E +SVPAKARSKRSRA P NW +  LL  P ++ T    T    
Sbjct: 181 HFQPEPNRN-APIFNSE-MSVPAKARSKRSRAAPCNWTSRLLLLSPTTTTTTASTTSSSE 240

Query: 241 ------TPPIEQPHPIKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGP 300
                 TPP   P+P KKT+    A  KKK+SPD    SG+GRKC+HCATDKTPQWRTGP
Sbjct: 241 ADVVVSTPPPPPPNPGKKTV---KAPQKKKESPDSAAGSGDGRKCLHCATDKTPQWRTGP 300

Query: 301 MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQP 360
           MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE++RAQQQQ 
Sbjct: 301 MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMMRAQQQQQ 360

Query: 361 QHLL--LDHRQDMIFDASNGDDYLIHQHVGPDFRQLI 374
           Q  L    H Q+M+FD SNGDDYLIHQHVGPDFRQ+I
Sbjct: 361 QQFLHHHHHHQNMVFDVSNGDDYLIHQHVGPDFRQMI 364

BLAST of CSPI02G22120 vs. NCBI nr
Match: gi|596295002|ref|XP_007226931.1| (hypothetical protein PRUPE_ppa015753mg [Prunus persica])

HSP 1 Score: 369.8 bits (948), Expect = 5.7e-99
Identity = 232/410 (56.59%), Positives = 272/410 (66.34%), Query Frame = 1

Query: 1   MEAPEYFQINAYSSQFS-----SPDDADATTTAAAAAAPDHFIVEELLDFSNNEDDAVLT 60
           MEAPEYFQ N++  QF+     S D+ +   T       DHF+VE+LLDFSN  DDAV+T
Sbjct: 1   MEAPEYFQ-NSFCPQFTPEKRHSFDNNNNKATNGGGGGGDHFMVEDLLDFSN--DDAVIT 60

Query: 61  DSGGGGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSS------SFFE 120
           D           GG  F           N   NST+SS +TV++SCNSSS      +   
Sbjct: 61  D-----------GGATF----------DNATGNSTDSSTITVIDSCNSSSLSGSEPNVIP 120

Query: 121 DISGSNLGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPT 180
           DI   N+ +  FSS+LCVPYDDLAELEWLSNFVEESFSSEDMQKL+LISG+K + DE  +
Sbjct: 121 DIGSRNITEGPFSSDLCVPYDDLAELEWLSNFVEESFSSEDMQKLQLISGMKARPDEAAS 180

Query: 181 QS----PQPTATRSAA------AIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPT 240
           ++    P+P    +A        IF P+ VSVPAKARSKRSR  P NW  S LL LS PT
Sbjct: 181 ETRQFQPEPNRNDNAHNTTTNNPIFNPD-VSVPAKARSKRSRGAPCNWT-SRLLLLSQPT 240

Query: 241 AEPETTPPI----EQPHPIKKTLPKAAATA--KKKDSPD-LGFSSGEGRKCMHCATDKTP 300
           +  E +  +    E P P   T  K    +  KKK+SP+ LG   G+GRKC+HCATDKTP
Sbjct: 241 SSSEQSDVVSGAPESPLPPPSTTGKKTVKSVPKKKESPEGLGGGPGDGRKCLHCATDKTP 300

Query: 301 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILR 360
           QWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA+SPTFVLTKHSNSHRKVLELRRQKE++R
Sbjct: 301 QWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTKHSNSHRKVLELRRQKEMVR 360

Query: 361 AQQQ-----QPQ----HLLLDHRQDMIFDASNGDDYLIHQHVGPDFRQLI 374
           AQQQ      PQ    H    H Q+M+FD SNG DYLIHQH+GPDFRQLI
Sbjct: 361 AQQQFIHQVPPQQHHHHHHHHHHQNMVFDVSNGGDYLIHQHMGPDFRQLI 384

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
GAT12_ARATH  4.7e-69  52.85     GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2 SV=1
GATA9_ARATH  5.2e-60  47.44     GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2 SV=1
GATA2_ARATH  3.3e-38  40.43     GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1
GATA4_ARATH  2.8e-37  43.12     GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1
GATA5_ARATH  8.7e-31  37.88     GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1
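
The E-value and the percent identity in the table above do not always rank hits the same way, because the E-value reflects the alignment score over the aligned length while identity is only a ratio. The short Python sketch below shows this with the five Arabidopsis hits copied from the table. The variable names are illustrative only.

```python
# (match name, E-value, percent identity) copied from the table above
hits = [
    ("GAT12_ARATH", 4.7e-69, 52.85),
    ("GATA9_ARATH", 5.2e-60, 47.44),
    ("GATA2_ARATH", 3.3e-38, 40.43),
    ("GATA4_ARATH", 2.8e-37, 43.12),
    ("GATA5_ARATH", 8.7e-31, 37.88),
]

# Lower E-value = more significant hit
by_evalue = [name for name, _, _ in sorted(hits, key=lambda h: h[1])]

# Higher identity = more similar over the aligned region
by_identity = [name for name, _, _ in sorted(hits, key=lambda h: h[2], reverse=True)]

print(by_evalue)
print(by_identity)
print("rankings agree:", by_evalue == by_identity)
```

Here GATA2 and GATA4 swap places between the two orderings, so a "best hit" should state which criterion was used.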
Match Name        E-value   Identity  Description
A0A0A0LPR5_CUCSA  3.0e-203  99.47     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G373450 PE=4 SV=1
M5Y804_PRUPE      4.0e-99   56.59     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa015753mg PE=4 SV=1
V4WC04_9ROSI      2.4e-96   56.47     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008690mg PE=4 SV=1
A0A067GJP0_CITSI  7.0e-96   56.72     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017390mg PE=4 SV=1
B9H8Y3_POPTR      1.0e-94   57.88     Zinc finger family protein OS=Populus trichocarpa GN=POPTR_0006s25410g PE=4 SV=1
Match Name   E-value  Identity  Description
AT5G25830.1  2.7e-70  52.85     GATA transcription factor 12
AT4G32890.1  2.9e-61  47.44     GATA transcription factor 9
AT2G45050.1  1.9e-39  40.43     GATA transcription factor 2
AT3G60530.1  1.6e-38  43.12     GATA transcription factor 4
AT5G66320.1  4.9e-32  37.88     GATA transcription factor 5
Match Name                         E-value   Identity  Description
gi|700207683|gb|KGN62802.1|        4.2e-203  99.47     hypothetical protein Csa_2G373450 [Cucumis sativus]
gi|778674365|ref|XP_011650196.1|   3.0e-193  95.44     PREDICTED: GATA transcription factor 12-like [Cucumis sativus]
gi|659088475|ref|XP_008445001.1|   1.3e-183  91.15     PREDICTED: GATA transcription factor 12-like [Cucumis melo]
gi|1009135304|ref|XP_015884916.1|  2.1e-101  58.44     PREDICTED: GATA transcription factor 12 [Ziziphus jujuba]
gi|596295002|ref|XP_007226931.1|   5.7e-99   56.59     hypothetical protein PRUPE_ppa015753mg [Prunus persica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000679  Znf_GATA
IPR013088  Znf_NHR/GATA
Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0008270  zinc ion binding
GO:0043565  sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0045893      positive regulation of transcription, DNA-templated
biological_process  GO:0030154      cell differentiation
biological_process  GO:0045944      positive regulation of transcription from RNA polymerase II promoter
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005634      nucleus
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003682      chromatin binding
molecular_function  GO:0000977      RNA polymerase II regulatory region sequence-specific DNA binding
molecular_function  GO:0001085      RNA polymerase II transcription factor binding
molecular_function  GO:0001228      transcriptional activator activity, RNA polymerase II transcription regulatory region sequence-specific binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G22120.1  CSPI02G22120.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description             Source   Source Term       Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type      PFAM     PF00320           GATA                                               coord: 263..297; score: 7.8
IPR000679  Zinc finger, GATA-type      SMART    SM00401           GATA_3                                             coord: 257..307; score: 1.2
IPR000679  Zinc finger, GATA-type      PROSITE  PS00344           GATA_ZN_FINGER_1                                   coord: 263..288; score:
IPR000679  Zinc finger, GATA-type      PROFILE  PS50114           GATA_ZN_FINGER_2                                   coord: 257..293; score: 12
IPR013088  Zinc finger, NHR/GATA-type  GENE3D   G3DSA:3.30.50.10  -                                                  coord: 257..294; score: 4.8
None       No IPR available            PANTHER  PTHR10071         TRANSCRIPTION FACTOR GATA GATA BINDING FACTOR      coord: 74..370; score: 8.7E-145 / coord: 1..57; score: 8.7E
None       No IPR available            PANTHER  PTHR10071:SF196   SUBFAMILY NOT NAMED                                coord: 74..370; score: 8.7E-145 / coord: 1..57; score: 8.7E
None       No IPR available            unknown  SSF57716          Glucocorticoid receptor-like (DNA-binding domain)  coord: 259..321; score: 1.57
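
The GATA zinc-finger coordinates reported above can be sanity-checked against the protein sequence itself. The Python sketch below searches query residues 241 to 300 (as shown in the BLAST alignments) for four cysteines with the loose C-x2-C-x17-20-C-x2-C spacing commonly described for GATA-type zinc fingers. That regular expression is an assumption made for illustration and is much less strict than the actual PROSITE PS00344 pattern. The helper name `find_fingers` is hypothetical.

```python
import re

# Simplified stand-in for a GATA-type zinc finger: four Cys residues with
# C-x2-C-x17-20-C-x2-C spacing. This is NOT the real PROSITE pattern.
GATA_FINGER = re.compile(r"C.{2}C.{17,20}C.{2}C")


def find_fingers(segment, start):
    """Return (first, last, match) tuples in 1-based protein coordinates.

    `start` is the protein position of the first residue in `segment`.
    """
    return [
        (start + m.start(), start + m.end() - 1, m.group())
        for m in GATA_FINGER.finditer(segment)
    ]


# Query residues 241..300, copied from the BLAST alignment blocks
segment = "AAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE"

for first, last, motif in find_fingers(segment, 241):
    print(first, last, motif)
```

Under these assumptions the single match spans residues 263 to 288, the same range listed for the PROSITE GATA_ZN_FINGER_1 entry, with an 18-residue loop between the two cysteine pairs.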

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                      Block
CSPI02G22120  Cla022200        Watermelon (97103) v1         cpiwmB112
CSPI02G22120  Cla023109        Watermelon (97103) v1         cpiwmB164
CSPI02G22120  Csa2G373450      Cucumber (Chinese Long) v2    cpicuB065
CSPI02G22120  MELO3C003466     Melon (DHL92) v3.5.1          cpimeB132
CSPI02G22120  MELO3C011130     Melon (DHL92) v3.5.1          cpimeB118
CSPI02G22120  ClCG08G012130    Watermelon (Charleston Gray)  cpiwcgB168
CSPI02G22120  ClCG11G015130    Watermelon (Charleston Gray)  cpiwcgB109
CSPI02G22120  Lsi08G010740     Bottle gourd (USVL1VR-Ls)     cpilsiB139
CSPI02G22120  Lsi04G023280     Bottle gourd (USVL1VR-Ls)     cpilsiB118
CSPI02G22120  MELO3C011130.2   Melon (DHL92) v3.6.1          cpimedB114
CSPI02G22120  CsaV3_2G030750   Cucumber (Chinese Long) v3    cpicucB081
CSPI02G22120  CsaV3_3G048530   Cucumber (Chinese Long) v3    cpicucB094
CSPI02G22120  Cla97C11G221130  Watermelon (97103) v2         cpiwmbB109
CSPI02G22120  Cla97C08G155280  Watermelon (97103) v2         cpiwmbB162
CSPI02G22120  Bhi04G000711     Wax gourd                     cpiwgoB162
CSPI02G22120  Cucsa.161160     Cucumber (Gy14) v1            cgycpiB239
CSPI02G22120  Cucsa.312530     Cucumber (Gy14) v1            cgycpiB468
CSPI02G22120  CmaCh12G002430   Cucurbita maxima (Rimu)       cmacpiB165
CSPI02G22120  CmaCh05G005840   Cucurbita maxima (Rimu)       cmacpiB794
CSPI02G22120  CmoCh12G001980   Cucurbita moschata (Rifu)     cmocpiB150
CSPI02G22120  CmoCh05G006090   Cucurbita moschata (Rifu)     cmocpiB783
CSPI02G22120  Cp4.1LG07g02390  Cucurbita pepo (Zucchini)     cpecpiB811
CSPI02G22120  Cp4.1LG11g05190  Cucurbita pepo (Zucchini)     cpecpiB093
CSPI02G22120  CsGy2G021850     Cucumber (Gy14) v2            cgybcpiB057
CSPI02G22120  Carg03833        Silver-seed gourd             carcpiB1010
The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism                   Block
CSPI02G22120  CSPI03G46080  Wild cucumber (PI 183967)  cpicpiB061
The following block(s) are covering this gene:
Gene          Organism           Block
CSPI02G22120  Silver-seed gourd  carcpiB0063