Cucsa.109180 (gene) Cucumber (Gy14) v1

NameCucsa.109180
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionGATA transcription factor, putative
Locationscaffold00934 : 99748 .. 101506 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAAGGAAGAAGAAGAAGCCGGCTAAGCCTGCTGTCCATTAAGACTAAGAAAAGTCAGCCCCACCAAATGATAAAACCAAATGTCCAAAAATTAATCAAAATCAATATTTTTTTAATTTTCTTTTTCAATTCTGATATAAATTTCAAAAGGGGTTAATTTATCCTTTCAAACTTTGGAATCTGCAACTGCCCATCTGACCCATCTTTCCATTTCTCTCCCTTTGCAATTATTTTTTGTTTTTCTTCCCCCTTTCTCAAACAGGTAATAATCATTAATTCAAACATACAAAGTTTAGAAATGAATTTTGTGTGTGTTTTTTTTAAGGAAAAATAATTATTTGTTTTTGTGTTTTGGTTGTAGAAGAAATGGAGTTTTTGGAAGCTAAAGCTTTGAAATCAAGTTTCCATTGGGAATTAGCCATGAAATCTGCTCAACAAGATGCTTTGGTTGAGGAAGTTTGGTGTTTGAACGGTCCTAATCTTGTTTCCGGTGAGGATTTTGAGATCGAAGAGTTTTTAAACTTCCCTAATGGCGATTTAGAACATGGGTCTTCTTTGAGACTTCAAGAAGATGACGATTGTGAAGAGTTTGAGAAGAATCGGTTCTCTGTTTCGTCTAATTCGAACCAGTCCGATGGGTCTCCGGTCGTCGGAGAGGAGGATTCTAAGTCGCTTCTTGCTGTTGAACTTGCTTTTCCGGTAAATTTAATTTCTGGGCTTTTTTTTTTTTAATGTTTTTGGTTCTAATTGCTTGAATCGAAAATTCAAAATCAAAATCCCACTTTTCAGGGCGATTCTCTGACGGACCTTGAATGGGTTTCTCAATTTGTCGATGATTCTTCCTCGGAATTTTCCTGTGCCGCCGTGGCTTTCAACCGCTCCGAACCGGAGAAGAAACTCACCGGAACGGTTATTTCGTGTTTGCCGACGTTTTTTCCGGTCAGACCGAGGACAAAAAGGTCTAGACAGTCTCGTCAAGCGAAATCCGCAGGTTCTTCTCTCAATCAATCACCGTCGTCGTCCTCCTCCTCGACCTCCTCCGGTGTTTCCTCCGCCGCACCTCGGTTTATATTCTCCGACGCCGGCGAGAACGTGGACTTTTTGAACGTAACCGGTGAGCCACCGAAGAAGCAGAGGAAAAAGCCATCGTCGCCGTCGCCATCGTCAACGGGTCTTCTACCCACCGGTTCAACTGGTCAAATTCCGCGGCGGTGCAGCCATTGTCTGGTTCAAAAGACCCCACAGTGGCGGACCGGTCCAAACGGAGCCAAAACGCTTTGTAACGCTTGTGGTGTCCGGTATAAATCCGGTCGGCTCTTCCCGGAATATAGACCGGCGTTGAGTCCCACTTTTTGCAGCGGCGTTCACTCAAACAGTCATCGAAAAGTACTTGAAATGAGGAAGACGAAGGAAGTTCCACAACCGGCGACCGAGTTGGCCCCAATGGTCCCGAGTTACTAAACACCAACTGAACCGGGTTCAGTTGAACCGGGCAGTAGATGATGATGATTAAGAAGGAAATGACCAAAAGGGCAGATTTTTGTTGTGACGCTAAAAAGGAATCCATTTGTATTTGTAGGGTTAGGCGAGGAATTAGATTTAGGGCATTTAATTTTAGTGTAGTATTTTTGTGAAATTAATTTCATTAATTTAGGTTTGATTAGATATGCTTAGGTAGGCAAAAGAAGAAGATTTTTGTATTTTTGTATTACTTTAGCTAGATCACGAATATAAAATACTCTATTTCTTCTCACGAGAA

mRNA sequence

GGAAGGAAGAAGAAGAAGCCGGCTAAGCCTGCTGTCCATTAAGACTAAGAAAAGTCAGCCCCACCAAATGATAAAACCAAATGTCCAAAAATTAATCAAAATCAATATTTTTTTAATTTTCTTTTTCAATTCTGATATAAATTTCAAAAGGGGTTAATTTATCCTTTCAAACTTTGGAATCTGCAACTGCCCATCTGACCCATCTTTCCATTTCTCTCCCTTTGCAATTATTTTTTGTTTTTCTTCCCCCTTTCTCAAACAGAAATGGAGTTTTTGGAAGCTAAAGCTTTGAAATCAAGTTTCCATTGGGAATTAGCCATGAAATCTGCTCAACAAGATGCTTTGGTTGAGGAAGTTTGGTGTTTGAACGGTCCTAATCTTGTTTCCGGTGAGGATTTTGAGATCGAAGAGTTTTTAAACTTCCCTAATGGCGATTTAGAACATGGGTCTTCTTTGAGACTTCAAGAAGATGACGATTGTGAAGAGTTTGAGAAGAATCGGTTCTCTGTTTCGTCTAATTCGAACCAGTCCGATGGGTCTCCGGTCGTCGGAGAGGAGGATTCTAAGTCGCTTCTTGCTGTTGAACTTGCTTTTCCGGGCGATTCTCTGACGGACCTTGAATGGGTTTCTCAATTTGTCGATGATTCTTCCTCGGAATTTTCCTGTGCCGCCGTGGCTTTCAACCGCTCCGAACCGGAGAAGAAACTCACCGGAACGGTTATTTCGTGTTTGCCGACGTTTTTTCCGGTCAGACCGAGGACAAAAAGGTCTAGACAGTCTCGTCAAGCGAAATCCGCAGGTTCTTCTCTCAATCAATCACCGTCGTCGTCCTCCTCCTCGACCTCCTCCGGTGTTTCCTCCGCCGCACCTCGGTTTATATTCTCCGACGCCGGCGAGAACGTGGACTTTTTGAACGTAACCGGTGAGCCACCGAAGAAGCAGAGGAAAAAGCCATCGTCGCCGTCGCCATCGTCAACGGGTCTTCTACCCACCGGTTCAACTGGTCAAATTCCGCGGCGGTGCAGCCATTGTCTGGTTCAAAAGACCCCACAGTGGCGGACCGGTCCAAACGGAGCCAAAACGCTTTGTAACGCTTGTGGTGTCCGGTATAAATCCGGTCGGCTCTTCCCGGAATATAGACCGGCGTTGAGTCCCACTTTTTGCAGCGGCGTTCACTCAAACAGTCATCGAAAAGTACTTGAAATGAGGAAGACGAAGGAAGTTCCACAACCGGCGACCGAGTTGGCCCCAATGGTCCCGAGTTACTAAACACCAACTGAACCGGGTTCAGTTGAACCGGGCAGTAGATGATGATGATTAAGAAGGAAATGACCAAAAGGGCAGATTTTTGTTGTGACGCTAAAAAGGAATCCATTTGTATTTGTAGGGTTAGGCGAGGAATTAGATTTAGGGCATTTAATTTTAGTGTAGTATTTTTGTGAAATTAATTTCATTAATTTAGGTTTGATTAGATATGCTTAGGTAGGCAAAAGAAGAAGATTTTTGTATTTTTGTATTACTTTAGCTAGATCACGAATATAAAATACTCTATTTCTTCTCACGAGAA

Coding sequence (CDS)

ATGGAGTTTTTGGAAGCTAAAGCTTTGAAATCAAGTTTCCATTGGGAATTAGCCATGAAATCTGCTCAACAAGATGCTTTGGTTGAGGAAGTTTGGTGTTTGAACGGTCCTAATCTTGTTTCCGGTGAGGATTTTGAGATCGAAGAGTTTTTAAACTTCCCTAATGGCGATTTAGAACATGGGTCTTCTTTGAGACTTCAAGAAGATGACGATTGTGAAGAGTTTGAGAAGAATCGGTTCTCTGTTTCGTCTAATTCGAACCAGTCCGATGGGTCTCCGGTCGTCGGAGAGGAGGATTCTAAGTCGCTTCTTGCTGTTGAACTTGCTTTTCCGGGCGATTCTCTGACGGACCTTGAATGGGTTTCTCAATTTGTCGATGATTCTTCCTCGGAATTTTCCTGTGCCGCCGTGGCTTTCAACCGCTCCGAACCGGAGAAGAAACTCACCGGAACGGTTATTTCGTGTTTGCCGACGTTTTTTCCGGTCAGACCGAGGACAAAAAGGTCTAGACAGTCTCGTCAAGCGAAATCCGCAGGTTCTTCTCTCAATCAATCACCGTCGTCGTCCTCCTCCTCGACCTCCTCCGGTGTTTCCTCCGCCGCACCTCGGTTTATATTCTCCGACGCCGGCGAGAACGTGGACTTTTTGAACGTAACCGGTGAGCCACCGAAGAAGCAGAGGAAAAAGCCATCGTCGCCGTCGCCATCGTCAACGGGTCTTCTACCCACCGGTTCAACTGGTCAAATTCCGCGGCGGTGCAGCCATTGTCTGGTTCAAAAGACCCCACAGTGGCGGACCGGTCCAAACGGAGCCAAAACGCTTTGTAACGCTTGTGGTGTCCGGTATAAATCCGGTCGGCTCTTCCCGGAATATAGACCGGCGTTGAGTCCCACTTTTTGCAGCGGCGTTCACTCAAACAGTCATCGAAAAGTACTTGAAATGAGGAAGACGAAGGAAGTTCCACAACCGGCGACCGAGTTGGCCCCAATGGTCCCGAGTTACTAA

Protein sequence

MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY*
BLAST of Cucsa.109180 vs. Swiss-Prot
Match: GATA5_ARATH (GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.4e-51
Identity = 138/335 (41.19%), Postives = 181/335 (54.03%), Query Frame = 1

Query: 4   LEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDL--EHG 63
           +E  ALKSS   E+A+K+       E +      N  S +DF +++ L+  N D+  +  
Sbjct: 1   MEQAALKSSVRKEMALKTTSP-VYEEFLAVTTAQNGFSVDDFSVDDLLDLSNDDVFADEE 60

Query: 64  SSLRLQEDDDCEEFEKNRFSVSSNSNQSDG------SPVVGEEDSKSLLAVELAFPGDSL 123
           + L+ Q +            VSS     DG      S   G +D  SL   EL+ P D L
Sbjct: 61  TDLKAQHE---------MVRVSSEEPNDDGDALRRSSDFSGCDDFGSLPTSELSLPADDL 120

Query: 124 TDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLTG---------TVISCLPTFFPVRPRT 183
            +LEW+S FV+DS +E+S   +    +E    LTG         T  +C  +  P + R+
Sbjct: 121 ANLEWLSHFVEDSFTEYSGPNLTGTPTEKPAWLTGDRKHPVTAVTEETCFKSPVPAKARS 180

Query: 184 KRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPK-- 243
           KR+R   +  S GSS +  PSSS S++SS    ++P F  ++  E V    VT E P   
Sbjct: 181 KRNRNGLKVWSLGSSSSSGPSSSGSTSSSSSGPSSPWFSGAELLEPV----VTSERPPFP 240

Query: 244 KQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKS 303
           K+ KK S+ S  S  L       Q  R+CSHC VQKTPQWR GP GAKTLCNACGVRYKS
Sbjct: 241 KKHKKRSAESVFSGELQQL----QPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKS 300

Query: 304 GRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKE 320
           GRL PEYRPA SPTF S +HSN HRKV+EMR+ KE
Sbjct: 301 GRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cucsa.109180 vs. Swiss-Prot
Match: GATA6_ARATH (GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 4.1e-40
Identity = 119/299 (39.80%), Postives = 161/299 (53.85%), Query Frame = 1

Query: 41  SGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEED- 100
           +G+DF +++ L+F           + +EDDD    ++    V      SD + +    D 
Sbjct: 24  NGDDFSVDDLLDFS----------KEEEDDDVLVEDEAELKVQRKRGVSDENTLHRSNDF 83

Query: 101 -SKSLLAVELAFPGDSLTDLEWVSQFVDDSS-SEFSCAA-----VAFNRS---EPEKKLT 160
            +       L+ P D + +LEW+S FVDDSS + +S        +  NR    +P K+ T
Sbjct: 84  STADFHTSGLSVPMDDIAELEWLSNFVDDSSFTPYSAPTNKPVWLTGNRRHLVQPVKEET 143

Query: 161 GTVISCLPTFFP-VRPRTKRSRQSRQAKSAGS-SLNQSPSSSSSSTSSGVSSAAPRFIFS 220
                C  +  P V+ R KR+R   +  S GS SL  S SSS++S+SS    ++P ++ S
Sbjct: 144 -----CFKSQHPAVKTRPKRARTGVRVWSHGSQSLTDSSSSSTTSSSSSPRPSSPLWLAS 203

Query: 221 DAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTG 280
             G+ +D      EP  K +KK       + G   T +  Q  R+C HC VQKTPQWR G
Sbjct: 204 --GQFLD------EPMTKTQKKKKVWK--NAGQTQTQTQTQT-RQCGHCGVQKTPQWRAG 263

Query: 281 PNGAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATE 327
           P GAKTLCNACGVRYKSGRL PEYRPA SPTF S +HSN H KV+EMR+ KE    A E
Sbjct: 264 PLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMRRKKETSDGAEE 296

BLAST of Cucsa.109180 vs. Swiss-Prot
Match: GATA7_ARATH (GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 5.8e-34
Identity = 109/289 (37.72%), Postives = 150/289 (51.90%), Query Frame = 1

Query: 44  DFEIEEFLNFPNGD--LEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSK 103
           DF +++ L+  N D  LE  SS R +++ + E+F+       S S+QS  + +   ED  
Sbjct: 10  DFSVDDLLDLSNADTSLESSSSQRKEDEQEREKFK-------SFSDQS--TRLSPPEDL- 69

Query: 104 SLLAVELAFPGDS----LTDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVI--SC 163
                 L+FPGD+    L DLEW+S FV+DS SE   ++       P   +    +   C
Sbjct: 70  ------LSFPGDAPVGDLEDLEWLSNFVEDSFSESYISS-----DFPVNPVASVEVRRQC 129

Query: 164 LPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDF 223
           +    PV+PR+KR R + +  S      +SPS   S+  +                    
Sbjct: 130 V----PVKPRSKRRRTNGRIWSM-----ESPSPLLSTAVA-------------------- 189

Query: 224 LNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLC 283
                   +++++       S  G++      Q+ R CSHC VQKTPQWR GP GAKTLC
Sbjct: 190 --------RRKKRGRQKVDASYGGVV---QQQQLRRCCSHCGVQKTPQWRMGPLGAKTLC 236

Query: 284 NACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPA 325
           NACGVR+KSGRL PEYRPA SPTF + +HSNSHRKVLE+R  K V  PA
Sbjct: 250 NACGVRFKSGRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK-VADPA 236

BLAST of Cucsa.109180 vs. Swiss-Prot
Match: GATA2_ARATH (GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 5.8e-34
Identity = 107/298 (35.91%), Postives = 142/298 (47.65%), Query Frame = 1

Query: 30  EVWCLNGPNLVSGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQS 89
           +V+ L+ P+L+      I++ L+F N D+   SS              +  + SS+S   
Sbjct: 2   DVYGLSSPDLL-----RIDDLLDFSNEDIFSASSSG-----------GSTAATSSSSFPP 61

Query: 90  DGSP------VVGEEDSKSLLAVELAFPGDSLTDLEWVSQFVDDSSSEFSCAAVAFNRSE 149
             +P      +    D  S L  ++  P D    LEW+SQFVDDS ++F           
Sbjct: 62  PQNPSFHHHHLPSSADHHSFLH-DICVPSDDAAHLEWLSQFVDDSFADF----------- 121

Query: 150 PEKKLTGTVISC-LPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAP 209
           P   L GT+ S    T FP +PR+KRSR         S +   P  S        +   P
Sbjct: 122 PANPLGGTMTSVKTETSFPGKPRSKRSRAPAPFAGTWSPM---PLESEHQQLHSAAKFKP 181

Query: 210 RFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTP 269
           +   S  G               + +  SS +    G+          RRC+HC  +KTP
Sbjct: 182 KKEQSGGGGGGG----------GRHQSSSSETTEGGGM----------RRCTHCASEKTP 241

Query: 270 QWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEV 321
           QWRTGP G KTLCNACGVR+KSGRL PEYRPA SPTF    HSNSHRKV+E+R+ KEV
Sbjct: 242 QWRTGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEV 248

BLAST of Cucsa.109180 vs. Swiss-Prot
Match: GATA4_ARATH (GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 2.2e-33
Identity = 100/303 (33.00%), Postives = 148/303 (48.84%), Query Frame = 1

Query: 30  EVWCLNGPNLVSGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQS 89
           +V+ ++ P+L+      I++ L+F N ++   SS             +N FS  S++  S
Sbjct: 2   DVYGMSSPDLL-----RIDDLLDFSNDEIFSSSSTVTSSAASSAASSENPFSFPSSTYTS 61

Query: 90  DGSPVVGEEDSKSLLAVELAFPGDSLTDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLT 149
              P +  + +      +L  P D    LEW+S+FVDDS S+F    +      PE   T
Sbjct: 62  ---PTLLTDFTH-----DLCVPSDDAAHLEWLSRFVDDSFSDFPANPLTMT-VRPEISFT 121

Query: 150 GTVISCLPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDA 209
           G            +PR++R              +++P+ S + T + +S +         
Sbjct: 122 G------------KPRSRR--------------SRAPAPSVAGTWAPMSES--------- 181

Query: 210 GENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPN 269
               +  +   +P  K +K  ++ S ++ G           RRC+HC  +KTPQWRTGP 
Sbjct: 182 ----ELCHSVAKP--KPKKVYNAESVTADGA----------RRCTHCASEKTPQWRTGPL 239

Query: 270 GAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAP 329
           G KTLCNACGVRYKSGRL PEYRPA SPTF    HSNSHRKV+E+R+ KE  +    + P
Sbjct: 242 GPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEQQESCVRIPP 239

Query: 330 MVP 333
             P
Sbjct: 302 FQP 239

BLAST of Cucsa.109180 vs. TrEMBL
Match: A0A0A0LLB3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G251490 PE=4 SV=1)

HSP 1 Score: 676.8 bits (1745), Expect = 1.4e-191
Identity = 334/334 (100.00%), Postives = 334/334 (100.00%), Query Frame = 1

Query: 1   MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH 60
           MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH
Sbjct: 1   MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH 60

Query: 61  GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW 120
           GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW
Sbjct: 61  GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW 120

Query: 121 VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS 180
           VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS
Sbjct: 121 VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS 180

Query: 181 SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL 240
           SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL
Sbjct: 181 SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL 240

Query: 241 LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC 300
           LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC
Sbjct: 241 LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC 300

Query: 301 SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY
Sbjct: 301 SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 334

BLAST of Cucsa.109180 vs. TrEMBL
Match: A0A068TLY9_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00014204001 PE=4 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 3.8e-69
Identity = 166/352 (47.16%), Postives = 220/352 (62.50%), Query Frame = 1

Query: 1   MEFLEAKALKSSFHWELA-MKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLE 60
           M+ +EAKALKSS   ++  MKS+QQ   V+++WC+ G N VS +DF +++ L+F + D +
Sbjct: 1   MDCMEAKALKSSLLSDIGGMKSSQQQGFVDDIWCVTGLNNVSCDDFSVDDLLDFSDKDFK 60

Query: 61  HGSSLRLQEDDDCEEFEKNRFSVSSNSNQ-----SDGSPVVGEEDSKSLLAVELAFPGDS 120
            G    L+ED+D     K+  S+SS+ +      S+ S     +D  SLLA ELA P + 
Sbjct: 61  DGP---LKEDEDF----KDTLSLSSSQHHHHHRNSNFSSFSETDDFGSLLAAELAVPAEE 120

Query: 121 LTDLEWVSQFVDDSSSEFS--CAAVAF--NRSEPEKKLTGTVIS-----CLPTFFPVRPR 180
           + +LEW+SQFVDDS SE S  C A +F  N+    +K +   +      C P   PV+PR
Sbjct: 121 MENLEWLSQFVDDSRSEVSLLCPAGSFKDNKGRLTEKWSEPAVHMIRVPCFPLHVPVKPR 180

Query: 181 TKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPP-K 240
           +KRSR + +  S   SL  + SSS+SS+S G S+ +P FI S+  ++ + L+   +PP K
Sbjct: 181 SKRSRPNGRVWSGSPSLTTTESSSTSSSSYGSSALSP-FILSNPVQDSEMLSSVEKPPAK 240

Query: 241 KQRKKPSSPSPSSTGLLPTGSTG-QIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYK 300
           K +KKP++ + S       GS G Q  RRCSHC V KTPQWRTGP G KTLCNACGVRYK
Sbjct: 241 KHKKKPATDTGS-------GSIGSQTSRRCSHCQVNKTPQWRTGPLGPKTLCNACGVRYK 300

Query: 301 SGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVP-QPATELAPMVPSY 335
           SGRLFPEYRPA SPTF   VHSNSHRKVLEMR+ KE        L PMV S+
Sbjct: 301 SGRLFPEYRPACSPTFSQEVHSNSHRKVLEMRRKKEATGHVEAGLTPMVSSF 337

BLAST of Cucsa.109180 vs. TrEMBL
Match: B9HRG7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s12620g PE=4 SV=2)

HSP 1 Score: 269.2 bits (687), Expect = 6.6e-69
Identity = 169/346 (48.84%), Postives = 209/346 (60.40%), Query Frame = 1

Query: 1   MEF-LEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVS-GEDFEIEEFLNFPNGDL 60
           MEF +E +ALKSS   EL  K+  + A  E+   LN P +VS  +DF ++ FL+F NG+ 
Sbjct: 1   MEFRVEERALKSSLLRELDTKTTSEQAFCEDFLALNTPGVVSFDQDFSVDCFLDFSNGEF 60

Query: 61  EHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDL 120
             G    +QE ++    EK+  SVSS     D         S S LA ELA P D + +L
Sbjct: 61  NDGY---VQEQEE----EKDSISVSSQDRVDDDFNSNSSSFSDSFLASELAVPTDDIAEL 120

Query: 121 EWVSQFVDDSSSEFSCAAVAF---------NRSEPEKKLTGTVISCL-PTFFPVRPRTKR 180
           EWVS FVDDS S+ S    A          NR EPE K T    SCL P+  P + RTKR
Sbjct: 121 EWVSHFVDDSVSDVSLLVPACKGSSKRHAKNRFEPETKPTFAKTSCLFPSRVPSKARTKR 180

Query: 181 SRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRK 240
           SR + +  SAGS+ +++PSSS+SSTSS      P  + ++  +  D L+   E P K  K
Sbjct: 181 SRPTGRTWSAGSNQSETPSSSTSSTSS-----MPCLVATNTVQTADSLSWLSEQPMKISK 240

Query: 241 KPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLF 300
           K   P+  ++GL+   ++ Q  RRCSHC VQKTPQWRTGP GAKTLCNACGVRYKSGRLF
Sbjct: 241 K--RPAVHTSGLM---ASTQFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLF 300

Query: 301 PEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           PEYRPA SPTF S VHSNSHRKVLEMR+ KEV      L  MVPS+
Sbjct: 301 PEYRPACSPTFSSEVHSNSHRKVLEMRRKKEVAGAEPRLNQMVPSF 329

BLAST of Cucsa.109180 vs. TrEMBL
Match: F6GWQ6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0023g02880 PE=4 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 8.6e-69
Identity = 166/350 (47.43%), Postives = 214/350 (61.14%), Query Frame = 1

Query: 1   MEFLEAKALKSSF-HWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLE 60
           ME +E KALKSS    ELA K  QQ A ++++   NG + VSG+DF I++ L+F NG + 
Sbjct: 1   MECVE-KALKSSVVRPELAFKLTQQPACMDDMCMGNGQSGVSGDDFSIDDLLDFTNGGI- 60

Query: 61  HGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVG-----EEDSKSLLAVELAFPGDS 120
            G  L  +ED++ E+      S      ++D S +       +++  S+ A EL  P D 
Sbjct: 61  -GEGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTTTTFSVKDEFPSVPATELTVPADD 120

Query: 121 LTDLEWVSQFVDDSSSEFSC-------AAVAFNRSE--PEKKLTGTVISCLPTFFPVRPR 180
           L DLEW+S FV+DS SE+S           A N++E  PE +    + SCL T FP + R
Sbjct: 121 LADLEWLSHFVEDSFSEYSAPFPHGTLTEKAQNQTENPPEPETPLQIKSCLKTPFPAKAR 180

Query: 181 TKRSRQSRQAKSAGS-SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPK 240
           +KR+R   +  S GS SL +S SSSSSS+SS +SS  P  I+ +  +NV+  +   +PP 
Sbjct: 181 SKRARTGGRVWSMGSPSLTESSSSSSSSSSSSLSS--PWLIYPNTCQNVESFHSAVKPPA 240

Query: 241 KQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKS 300
           K+ KK   P  S       GS    P RCSHC VQKTPQWRTGP GAKTLCNACGVRYKS
Sbjct: 241 KKHKKRLDPEAS-------GSAQPTPHRCSHCGVQKTPQWRTGPLGAKTLCNACGVRYKS 300

Query: 301 GRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           GRL PEYRPA SPTF S +HSN HRKVLEMR+ KEV +P + LAP VPS+
Sbjct: 301 GRLLPEYRPACSPTFSSEIHSNHHRKVLEMRRKKEVTRPESGLAPAVPSF 338

BLAST of Cucsa.109180 vs. TrEMBL
Match: D9ZIZ0_MALDO (GATA domain class transcription factor OS=Malus domestica GN=GATA3 PE=2 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 7.3e-68
Identity = 168/345 (48.70%), Postives = 208/345 (60.29%), Query Frame = 1

Query: 1   MEF-LEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLE 60
           ME+ +EA+ALKSS   ELA+KS Q   L+EE+WC  G + V  EDF +++ L+  N +  
Sbjct: 1   MEYCMEARALKSSLRRELAVKSTQH-VLLEELWCATGISGVPSEDFSVDDLLDLSNDEFG 60

Query: 61  HGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLE 120
           +GS   ++E+ +    E++  SV   ++ S  S +    DS S LA +L  P D L +LE
Sbjct: 61  NGS---VEEEGE----ERDSVSVDDETSNSSNSVLA---DSDSGLATQLVVPDDDLAELE 120

Query: 121 WVSQFVDDSSSEFSCA---------AVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSR 180
           WVS FVDDS  + S           A+  NRSE E K      S  P   PV+PRTKR R
Sbjct: 121 WVSHFVDDSLPDLSLLHTIGVQKPEALLANRSESEPKPAQLRASLFPFEVPVKPRTKRCR 180

Query: 181 QSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPP-KKQRKK 240
            + +  S  SS   SPSS SSS+ SG+S + P  IF+       F+   GEP  KKQ+KK
Sbjct: 181 LASRDWSLSSS--SSPSSPSSSSGSGLSFSTPCLIFNPVQSMHVFV---GEPAAKKQKKK 240

Query: 241 PSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFP 300
           P+      TG    G  GQ  RRCSHC VQKTPQWRTGP G KTLCNACGVR+KSGRLFP
Sbjct: 241 PAV----QTGEGSIG--GQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFP 300

Query: 301 EYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           EYRPA SPTF   VHSNSHRKVLEMRK KEV +P   L  M+ S+
Sbjct: 301 EYRPACSPTFSGDVHSNSHRKVLEMRKRKEVGEPEPRLNRMIRSF 323

BLAST of Cucsa.109180 vs. TAIR10
Match: AT5G66320.1 (AT5G66320.1 GATA transcription factor 5)

HSP 1 Score: 204.9 bits (520), Expect = 7.7e-53
Identity = 138/335 (41.19%), Postives = 181/335 (54.03%), Query Frame = 1

Query: 4   LEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDL--EHG 63
           +E  ALKSS   E+A+K+       E +      N  S +DF +++ L+  N D+  +  
Sbjct: 1   MEQAALKSSVRKEMALKTTSP-VYEEFLAVTTAQNGFSVDDFSVDDLLDLSNDDVFADEE 60

Query: 64  SSLRLQEDDDCEEFEKNRFSVSSNSNQSDG------SPVVGEEDSKSLLAVELAFPGDSL 123
           + L+ Q +            VSS     DG      S   G +D  SL   EL+ P D L
Sbjct: 61  TDLKAQHE---------MVRVSSEEPNDDGDALRRSSDFSGCDDFGSLPTSELSLPADDL 120

Query: 124 TDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLTG---------TVISCLPTFFPVRPRT 183
            +LEW+S FV+DS +E+S   +    +E    LTG         T  +C  +  P + R+
Sbjct: 121 ANLEWLSHFVEDSFTEYSGPNLTGTPTEKPAWLTGDRKHPVTAVTEETCFKSPVPAKARS 180

Query: 184 KRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPK-- 243
           KR+R   +  S GSS +  PSSS S++SS    ++P F  ++  E V    VT E P   
Sbjct: 181 KRNRNGLKVWSLGSSSSSGPSSSGSTSSSSSGPSSPWFSGAELLEPV----VTSERPPFP 240

Query: 244 KQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKS 303
           K+ KK S+ S  S  L       Q  R+CSHC VQKTPQWR GP GAKTLCNACGVRYKS
Sbjct: 241 KKHKKRSAESVFSGELQQL----QPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKS 300

Query: 304 GRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKE 320
           GRL PEYRPA SPTF S +HSN HRKV+EMR+ KE
Sbjct: 301 GRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cucsa.109180 vs. TAIR10
Match: AT3G51080.1 (AT3G51080.1 GATA transcription factor 6)

HSP 1 Score: 166.8 bits (421), Expect = 2.3e-41
Identity = 119/299 (39.80%), Postives = 161/299 (53.85%), Query Frame = 1

Query: 41  SGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEED- 100
           +G+DF +++ L+F           + +EDDD    ++    V      SD + +    D 
Sbjct: 24  NGDDFSVDDLLDFS----------KEEEDDDVLVEDEAELKVQRKRGVSDENTLHRSNDF 83

Query: 101 -SKSLLAVELAFPGDSLTDLEWVSQFVDDSS-SEFSCAA-----VAFNRS---EPEKKLT 160
            +       L+ P D + +LEW+S FVDDSS + +S        +  NR    +P K+ T
Sbjct: 84  STADFHTSGLSVPMDDIAELEWLSNFVDDSSFTPYSAPTNKPVWLTGNRRHLVQPVKEET 143

Query: 161 GTVISCLPTFFP-VRPRTKRSRQSRQAKSAGS-SLNQSPSSSSSSTSSGVSSAAPRFIFS 220
                C  +  P V+ R KR+R   +  S GS SL  S SSS++S+SS    ++P ++ S
Sbjct: 144 -----CFKSQHPAVKTRPKRARTGVRVWSHGSQSLTDSSSSSTTSSSSSPRPSSPLWLAS 203

Query: 221 DAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTG 280
             G+ +D      EP  K +KK       + G   T +  Q  R+C HC VQKTPQWR G
Sbjct: 204 --GQFLD------EPMTKTQKKKKVWK--NAGQTQTQTQTQT-RQCGHCGVQKTPQWRAG 263

Query: 281 PNGAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATE 327
           P GAKTLCNACGVRYKSGRL PEYRPA SPTF S +HSN H KV+EMR+ KE    A E
Sbjct: 264 PLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMRRKKETSDGAEE 296

BLAST of Cucsa.109180 vs. TAIR10
Match: AT4G36240.1 (AT4G36240.1 GATA transcription factor 7)

HSP 1 Score: 146.4 bits (368), Expect = 3.2e-35
Identity = 109/289 (37.72%), Postives = 150/289 (51.90%), Query Frame = 1

Query: 44  DFEIEEFLNFPNGD--LEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSK 103
           DF +++ L+  N D  LE  SS R +++ + E+F+       S S+QS  + +   ED  
Sbjct: 10  DFSVDDLLDLSNADTSLESSSSQRKEDEQEREKFK-------SFSDQS--TRLSPPEDL- 69

Query: 104 SLLAVELAFPGDS----LTDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVI--SC 163
                 L+FPGD+    L DLEW+S FV+DS SE   ++       P   +    +   C
Sbjct: 70  ------LSFPGDAPVGDLEDLEWLSNFVEDSFSESYISS-----DFPVNPVASVEVRRQC 129

Query: 164 LPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDF 223
           +    PV+PR+KR R + +  S      +SPS   S+  +                    
Sbjct: 130 V----PVKPRSKRRRTNGRIWSM-----ESPSPLLSTAVA-------------------- 189

Query: 224 LNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLC 283
                   +++++       S  G++      Q+ R CSHC VQKTPQWR GP GAKTLC
Sbjct: 190 --------RRKKRGRQKVDASYGGVV---QQQQLRRCCSHCGVQKTPQWRMGPLGAKTLC 236

Query: 284 NACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPA 325
           NACGVR+KSGRL PEYRPA SPTF + +HSNSHRKVLE+R  K V  PA
Sbjct: 250 NACGVRFKSGRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK-VADPA 236

BLAST of Cucsa.109180 vs. TAIR10
Match: AT2G45050.1 (AT2G45050.1 GATA transcription factor 2)

HSP 1 Score: 146.4 bits (368), Expect = 3.2e-35
Identity = 107/298 (35.91%), Postives = 142/298 (47.65%), Query Frame = 1

Query: 30  EVWCLNGPNLVSGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQS 89
           +V+ L+ P+L+      I++ L+F N D+   SS              +  + SS+S   
Sbjct: 2   DVYGLSSPDLL-----RIDDLLDFSNEDIFSASSSG-----------GSTAATSSSSFPP 61

Query: 90  DGSP------VVGEEDSKSLLAVELAFPGDSLTDLEWVSQFVDDSSSEFSCAAVAFNRSE 149
             +P      +    D  S L  ++  P D    LEW+SQFVDDS ++F           
Sbjct: 62  PQNPSFHHHHLPSSADHHSFLH-DICVPSDDAAHLEWLSQFVDDSFADF----------- 121

Query: 150 PEKKLTGTVISC-LPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAP 209
           P   L GT+ S    T FP +PR+KRSR         S +   P  S        +   P
Sbjct: 122 PANPLGGTMTSVKTETSFPGKPRSKRSRAPAPFAGTWSPM---PLESEHQQLHSAAKFKP 181

Query: 210 RFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTP 269
           +   S  G               + +  SS +    G+          RRC+HC  +KTP
Sbjct: 182 KKEQSGGGGGGG----------GRHQSSSSETTEGGGM----------RRCTHCASEKTP 241

Query: 270 QWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEV 321
           QWRTGP G KTLCNACGVR+KSGRL PEYRPA SPTF    HSNSHRKV+E+R+ KEV
Sbjct: 242 QWRTGPLGPKTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEV 248

BLAST of Cucsa.109180 vs. TAIR10
Match: AT3G60530.1 (AT3G60530.1 GATA transcription factor 4)

HSP 1 Score: 144.4 bits (363), Expect = 1.2e-34
Identity = 100/303 (33.00%), Postives = 148/303 (48.84%), Query Frame = 1

Query: 30  EVWCLNGPNLVSGEDFEIEEFLNFPNGDLEHGSSLRLQEDDDCEEFEKNRFSVSSNSNQS 89
           +V+ ++ P+L+      I++ L+F N ++   SS             +N FS  S++  S
Sbjct: 2   DVYGMSSPDLL-----RIDDLLDFSNDEIFSSSSTVTSSAASSAASSENPFSFPSSTYTS 61

Query: 90  DGSPVVGEEDSKSLLAVELAFPGDSLTDLEWVSQFVDDSSSEFSCAAVAFNRSEPEKKLT 149
              P +  + +      +L  P D    LEW+S+FVDDS S+F    +      PE   T
Sbjct: 62  ---PTLLTDFTH-----DLCVPSDDAAHLEWLSRFVDDSFSDFPANPLTMT-VRPEISFT 121

Query: 150 GTVISCLPTFFPVRPRTKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDA 209
           G            +PR++R              +++P+ S + T + +S +         
Sbjct: 122 G------------KPRSRR--------------SRAPAPSVAGTWAPMSES--------- 181

Query: 210 GENVDFLNVTGEPPKKQRKKPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPN 269
               +  +   +P  K +K  ++ S ++ G           RRC+HC  +KTPQWRTGP 
Sbjct: 182 ----ELCHSVAKP--KPKKVYNAESVTADGA----------RRCTHCASEKTPQWRTGPL 239

Query: 270 GAKTLCNACGVRYKSGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAP 329
           G KTLCNACGVRYKSGRL PEYRPA SPTF    HSNSHRKV+E+R+ KE  +    + P
Sbjct: 242 GPKTLCNACGVRYKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEQQESCVRIPP 239

Query: 330 MVP 333
             P
Sbjct: 302 FQP 239

BLAST of Cucsa.109180 vs. NCBI nr
Match: gi|449464846|ref|XP_004150140.1| (PREDICTED: GATA transcription factor 5-like [Cucumis sativus])

HSP 1 Score: 676.8 bits (1745), Expect = 2.0e-191
Identity = 334/334 (100.00%), Postives = 334/334 (100.00%), Query Frame = 1

Query: 1   MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH 60
           MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH
Sbjct: 1   MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH 60

Query: 61  GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW 120
           GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW
Sbjct: 61  GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW 120

Query: 121 VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS 180
           VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS
Sbjct: 121 VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS 180

Query: 181 SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL 240
           SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL
Sbjct: 181 SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL 240

Query: 241 LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC 300
           LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC
Sbjct: 241 LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC 300

Query: 301 SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY
Sbjct: 301 SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 334

BLAST of Cucsa.109180 vs. NCBI nr
Match: gi|659122191|ref|XP_008461014.1| (PREDICTED: GATA transcription factor 5-like [Cucumis melo])

HSP 1 Score: 650.6 bits (1677), Expect = 1.5e-183
Identity = 323/334 (96.71%), Postives = 326/334 (97.60%), Query Frame = 1

Query: 1   MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEH 60
           MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNG NLVSGEDFEIEEFLNF NGDLEH
Sbjct: 1   MEFLEAKALKSSFHWELAMKSAQQDALVEEVWCLNGANLVSGEDFEIEEFLNFSNGDLEH 60

Query: 61  GSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEW 120
           GSSLR QEDDDCEEFEKNRFS+SSNSNQ+ GSPVVG+EDSKSLLAVELAFPGDSLTDLEW
Sbjct: 61  GSSLRPQEDDDCEEFEKNRFSLSSNSNQTGGSPVVGDEDSKSLLAVELAFPGDSLTDLEW 120

Query: 121 VSQFVDDSSSEFSCAAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS 180
           VSQFVDDSSSEFSC AVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS
Sbjct: 121 VSQFVDDSSSEFSCPAVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQAKSAGS 180

Query: 181 SLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRKKPSSPSPSSTGL 240
           SLNQSPSSSSSSTSSGVSSAAP FIFSDAGENVD LNVTGEPPKKQRKKPSSPSPSSTGL
Sbjct: 181 SLNQSPSSSSSSTSSGVSSAAPWFIFSDAGENVDSLNVTGEPPKKQRKKPSSPSPSSTGL 240

Query: 241 LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC 300
           LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC
Sbjct: 241 LPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYRPALSPTFC 300

Query: 301 SGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           SGVHSNSHRKVLEMRKTKEV QPATELAPMVPSY
Sbjct: 301 SGVHSNSHRKVLEMRKTKEVSQPATELAPMVPSY 334

BLAST of Cucsa.109180 vs. NCBI nr
Match: gi|645262647|ref|XP_008236855.1| (PREDICTED: GATA transcription factor 5-like [Prunus mume])

HSP 1 Score: 279.6 bits (714), Expect = 7.0e-72
Identity = 173/342 (50.58%), Postives = 210/342 (61.40%), Query Frame = 1

Query: 4   LEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLEHGSS 63
           +EAKALKSS   ELA+KS QQ AL++E WC  G + V  EDF +++ L+  NG+ E GS 
Sbjct: 5   MEAKALKSSLRSELALKSNQQ-ALIDEFWCATGISGVPSEDFSVDDLLDLSNGEFEDGSV 64

Query: 64  LRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDLEWVSQ 123
                    EE E+ + SVS +   S+ S  V   DS+S LA +L  P D L  LEWVS 
Sbjct: 65  E--------EEEEEEKDSVSVDDESSNSSNFVSA-DSESSLASQLLVPDDDLAGLEWVSH 124

Query: 124 FVDDSSSEFSCA---------AVAFNRSEPEKKLTGTVISCLPTFFPVRPRTKRSRQSRQ 183
           FVDDS  + S           A+A  RSE E KL  +  +  P+  PV+PRTKR R + +
Sbjct: 125 FVDDSMLDLSLLHPVGTQKPEALALTRSEAEAKLVQSTPTWFPSQVPVKPRTKRCRAASR 184

Query: 184 AKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPP-KKQRKKPSSP 243
             S  SS   S  SSSSS SSG S + P  IF++  ++ D L   GEP  KKQ+KKP+  
Sbjct: 185 VWSYPSS---SSPSSSSSCSSGFSFSTPCLIFNNPVQSTDVL--VGEPATKKQKKKPAVQ 244

Query: 244 SPSSTGLLPTGSTG-QIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLFPEYR 303
           + +       GS G Q  RRCSHC VQKTPQWRTGP GAKTLCNACGVR+KSGRLFPEYR
Sbjct: 245 TGAD------GSVGVQFQRRCSHCHVQKTPQWRTGPLGAKTLCNACGVRFKSGRLFPEYR 304

Query: 304 PALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           PA SPTF   VHSNSHRKVLEMRK KE   P   L  ++PS+
Sbjct: 305 PACSPTFSGDVHSNSHRKVLEMRKRKEAGGPEPGLNRVIPSF 325

BLAST of Cucsa.109180 vs. NCBI nr
Match: gi|661899003|emb|CDO96997.1| (unnamed protein product [Coffea canephora])

HSP 1 Score: 270.0 bits (689), Expect = 5.5e-69
Identity = 166/352 (47.16%), Postives = 220/352 (62.50%), Query Frame = 1

Query: 1   MEFLEAKALKSSFHWELA-MKSAQQDALVEEVWCLNGPNLVSGEDFEIEEFLNFPNGDLE 60
           M+ +EAKALKSS   ++  MKS+QQ   V+++WC+ G N VS +DF +++ L+F + D +
Sbjct: 1   MDCMEAKALKSSLLSDIGGMKSSQQQGFVDDIWCVTGLNNVSCDDFSVDDLLDFSDKDFK 60

Query: 61  HGSSLRLQEDDDCEEFEKNRFSVSSNSNQ-----SDGSPVVGEEDSKSLLAVELAFPGDS 120
            G    L+ED+D     K+  S+SS+ +      S+ S     +D  SLLA ELA P + 
Sbjct: 61  DGP---LKEDEDF----KDTLSLSSSQHHHHHRNSNFSSFSETDDFGSLLAAELAVPAEE 120

Query: 121 LTDLEWVSQFVDDSSSEFS--CAAVAF--NRSEPEKKLTGTVIS-----CLPTFFPVRPR 180
           + +LEW+SQFVDDS SE S  C A +F  N+    +K +   +      C P   PV+PR
Sbjct: 121 MENLEWLSQFVDDSRSEVSLLCPAGSFKDNKGRLTEKWSEPAVHMIRVPCFPLHVPVKPR 180

Query: 181 TKRSRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPP-K 240
           +KRSR + +  S   SL  + SSS+SS+S G S+ +P FI S+  ++ + L+   +PP K
Sbjct: 181 SKRSRPNGRVWSGSPSLTTTESSSTSSSSYGSSALSP-FILSNPVQDSEMLSSVEKPPAK 240

Query: 241 KQRKKPSSPSPSSTGLLPTGSTG-QIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYK 300
           K +KKP++ + S       GS G Q  RRCSHC V KTPQWRTGP G KTLCNACGVRYK
Sbjct: 241 KHKKKPATDTGS-------GSIGSQTSRRCSHCQVNKTPQWRTGPLGPKTLCNACGVRYK 300

Query: 301 SGRLFPEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVP-QPATELAPMVPSY 335
           SGRLFPEYRPA SPTF   VHSNSHRKVLEMR+ KE        L PMV S+
Sbjct: 301 SGRLFPEYRPACSPTFSQEVHSNSHRKVLEMRRKKEATGHVEAGLTPMVSSF 337

BLAST of Cucsa.109180 vs. NCBI nr
Match: gi|566187676|ref|XP_002313763.2| (hypothetical protein POPTR_0009s12620g [Populus trichocarpa])

HSP 1 Score: 269.2 bits (687), Expect = 9.4e-69
Identity = 169/346 (48.84%), Postives = 209/346 (60.40%), Query Frame = 1

Query: 1   MEF-LEAKALKSSFHWELAMKSAQQDALVEEVWCLNGPNLVS-GEDFEIEEFLNFPNGDL 60
           MEF +E +ALKSS   EL  K+  + A  E+   LN P +VS  +DF ++ FL+F NG+ 
Sbjct: 1   MEFRVEERALKSSLLRELDTKTTSEQAFCEDFLALNTPGVVSFDQDFSVDCFLDFSNGEF 60

Query: 61  EHGSSLRLQEDDDCEEFEKNRFSVSSNSNQSDGSPVVGEEDSKSLLAVELAFPGDSLTDL 120
             G    +QE ++    EK+  SVSS     D         S S LA ELA P D + +L
Sbjct: 61  NDGY---VQEQEE----EKDSISVSSQDRVDDDFNSNSSSFSDSFLASELAVPTDDIAEL 120

Query: 121 EWVSQFVDDSSSEFSCAAVAF---------NRSEPEKKLTGTVISCL-PTFFPVRPRTKR 180
           EWVS FVDDS S+ S    A          NR EPE K T    SCL P+  P + RTKR
Sbjct: 121 EWVSHFVDDSVSDVSLLVPACKGSSKRHAKNRFEPETKPTFAKTSCLFPSRVPSKARTKR 180

Query: 181 SRQSRQAKSAGSSLNQSPSSSSSSTSSGVSSAAPRFIFSDAGENVDFLNVTGEPPKKQRK 240
           SR + +  SAGS+ +++PSSS+SSTSS      P  + ++  +  D L+   E P K  K
Sbjct: 181 SRPTGRTWSAGSNQSETPSSSTSSTSS-----MPCLVATNTVQTADSLSWLSEQPMKISK 240

Query: 241 KPSSPSPSSTGLLPTGSTGQIPRRCSHCLVQKTPQWRTGPNGAKTLCNACGVRYKSGRLF 300
           K   P+  ++GL+   ++ Q  RRCSHC VQKTPQWRTGP GAKTLCNACGVRYKSGRLF
Sbjct: 241 K--RPAVHTSGLM---ASTQFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLF 300

Query: 301 PEYRPALSPTFCSGVHSNSHRKVLEMRKTKEVPQPATELAPMVPSY 335
           PEYRPA SPTF S VHSNSHRKVLEMR+ KEV      L  MVPS+
Sbjct: 301 PEYRPACSPTFSSEVHSNSHRKVLEMRRKKEVAGAEPRLNQMVPSF 329

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GATA5_ARATH1.4e-5141.19GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1[more]
GATA6_ARATH4.1e-4039.80GATA transcription factor 6 OS=Arabidopsis thaliana GN=GATA6 PE=2 SV=1[more]
GATA7_ARATH5.8e-3437.72GATA transcription factor 7 OS=Arabidopsis thaliana GN=GATA7 PE=2 SV=1[more]
GATA2_ARATH5.8e-3435.91GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1[more]
GATA4_ARATH2.2e-3333.00GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LLB3_CUCSA1.4e-191100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_2G251490 PE=4 SV=1[more]
A0A068TLY9_COFCA3.8e-6947.16Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00014204001 PE=4 SV=1[more]
B9HRG7_POPTR6.6e-6948.84Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0009s12620g PE=4 SV=2[more]
F6GWQ6_VITVI8.6e-6947.43Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0023g02880 PE=4 SV=... [more]
D9ZIZ0_MALDO7.3e-6848.70GATA domain class transcription factor OS=Malus domestica GN=GATA3 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT5G66320.17.7e-5341.19 GATA transcription factor 5[more]
AT3G51080.12.3e-4139.80 GATA transcription factor 6[more]
AT4G36240.13.2e-3537.72 GATA transcription factor 7[more]
AT2G45050.13.2e-3535.91 GATA transcription factor 2[more]
AT3G60530.11.2e-3433.00 GATA transcription factor 4[more]
Match NameE-valueIdentityDescription
gi|449464846|ref|XP_004150140.1|2.0e-191100.00PREDICTED: GATA transcription factor 5-like [Cucumis sativus][more]
gi|659122191|ref|XP_008461014.1|1.5e-18396.71PREDICTED: GATA transcription factor 5-like [Cucumis melo][more]
gi|645262647|ref|XP_008236855.1|7.0e-7250.58PREDICTED: GATA transcription factor 5-like [Prunus mume][more]
gi|661899003|emb|CDO96997.1|5.5e-6947.16unnamed protein product [Coffea canephora][more]
gi|566187676|ref|XP_002313763.2|9.4e-6948.84hypothetical protein POPTR_0009s12620g [Populus trichocarpa][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000679Znf_GATA
IPR013088Znf_NHR/GATA
IPR016679TF_GATA_pln
IPR000679Znf_GATA
IPR013088Znf_NHR/GATA
IPR016679TF_GATA_pln
IPR000679Znf_GATA
IPR013088Znf_NHR/GATA
IPR016679TF_GATA_pln
Vocabulary: Molecular Function
TermDefinition
GO:0003700transcription factor activity, sequence-specific DNA binding
GO:0008270zinc ion binding
GO:0043565sequence-specific DNA binding
GO:0003677DNA binding
GO:0003700transcription factor activity, sequence-specific DNA binding
GO:0008270zinc ion binding
GO:0043565sequence-specific DNA binding
GO:0003677DNA binding
GO:0003700transcription factor activity, sequence-specific DNA binding
GO:0008270zinc ion binding
GO:0043565sequence-specific DNA binding
GO:0003677DNA binding
Vocabulary: Biological Process
TermDefinition
GO:0006355regulation of transcription, DNA-templated
GO:0045893positive regulation of transcription, DNA-templated
GO:0006355regulation of transcription, DNA-templated
GO:0045893positive regulation of transcription, DNA-templated
GO:0006355regulation of transcription, DNA-templated
GO:0045893positive regulation of transcription, DNA-templated
Vocabulary: Cellular Component
TermDefinition
GO:0005634nucleus
GO:0005634nucleus
GO:0005634nucleus
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.109180.3Cucsa.109180.3mRNA
Cucsa.109180.2Cucsa.109180.2mRNA
Cucsa.109180.1Cucsa.109180.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000679Zinc finger, GATA-typePFAMPF00320GATAcoord: 253..286
score: 6.3
IPR000679Zinc finger, GATA-typeSMARTSM00401GATA_3coord: 247..297
score: 5.8
IPR000679Zinc finger, GATA-typePROSITEPS00344GATA_ZN_FINGER_1coord: 253..278
scor
IPR000679Zinc finger, GATA-typePROFILEPS50114GATA_ZN_FINGER_2coord: 251..283
score: 11
IPR013088Zinc finger, NHR/GATA-typeGENE3DG3DSA:3.30.50.10coord: 251..284
score: 2.3
IPR016679Transcription factor, GATA, plantPIRPIRSF016992Txn_fac_GATA_plantcoord: 8..330
score: 1.4
NoneNo IPR availablePANTHERPTHR10071TRANSCRIPTION FACTOR GATA GATA BINDING FACTORcoord: 24..322
score: 6.7
NoneNo IPR availablePANTHERPTHR10071:SF163GATA TRANSCRIPTION FACTOR 14-RELATEDcoord: 24..322
score: 6.7
NoneNo IPR availableunknownSSF57716Glucocorticoid receptor-like (DNA-binding domain)coord: 248..310
score: 5.7

The following gene(s) are paralogous to this gene:

None