Cla97C08G155280 (gene) Watermelon (97103) v2.5

Overview
NameCla97C08G155280
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionGATA transcription factor
LocationCla97Chr08: 23306442 .. 23308257 (-)
RNA-Seq ExpressionCla97C08G155280
SyntenyCla97C08G155280
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGCTCCTGAATATTTCCAGATCAATGGCTACTGTTCTCAATTCGCCACCCACTCCTCCTCCGACAACGACTCCGCCACCGCCACCGCCACAGCCACTGCCGCCGGACCGGAGCATTTCATCGTGGAGGAGCTTCTCGATTTCTCCAACGATGACGACGCCGTTATTGGTGACGGTGGAGGATTGTTTTACAATAATAATAACAATGGGAATAATAATTCAACGGAATGTTCCGCCGTTACGGTGATTGAGAGTTGCAATTCGTCGTCGTTTTTGGAAGATATTAGTGGCTCTAATTTAACCGACGCCCATTTCTCCAGCGAACTCTGCGTTCCGGTAATTTCAAAACCTCGCCTTTTCTTACCTTTTTTTCTTTTTTTTCCACGTACATTCAATATTTATTTAGTTTTGAAGAAAGGATTATCAATTCAAACTTTTTTTTTTTTTTTTTTAACCTTTTTAATTCTAAGCTTCAACTTTTTACCTTATTTGCATTACATTTATATACATATAACATTAAGACATATATTGTTGAGCTCCTTTGTCTTGTACCAATTCTTTTCTTTTAGTTTGATTCTTTGAAGAACTATAAAAATTTACTTTTGTCAATCAAATTTGTTATATTAATAGGTTTGAAATGTCACTTTGGACAACTCACTAGAAATAAGTGAAAATTTGAGTGATAAATAAGAAGACATAACTTTGATAAACGACTTAGAGGTAATCTGTTTAATTCATAGTAGTCACCAACTTCAATTTTTTTTTATACCCAAATATTGTCATGTGTGATATTAGGTGAGGTGTTTTATTAAAAAAATTTAACAATTATTTCACTCGAAAACTTTTTATATTTTTTAAGTAAAAAGAAATAGTAGTATTGGTATGCACCTTATGATTAGTAGGTAAGAAGTTGAATAAAATAAAAGTGTGAATAGATAATTTATGTAATTTTGTTTGAGGAAAATTTATGCGATTCAGGTAGTTTACTTATTTATATGTGCAATGAAGTCACAAAGAAACTGTGTAATGGATATTTACTGTCATCATCATTATTATTAAGAAATTGTTTTAATTTTGGGTACAGTACGACGATTTAGCTGAGTTGGAATGGCTTTCACATTTCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTGGAATTAATCTCCGGAGTCAAAGTCAAAGAACCCGCCCAATCCCCACAACCCACCGTCTCTCACGGCCGAAAAGCCGCCGCAATTTTCAAACCGGACATCGTTTCCGTTCCGGCCAAAGCCCGTAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTTCCTCTTTCTCCCACCACCGAACCCGAAATTACCACCACCGCGGGACCACCGCACCCCATCAAAAAACCCCCTCCCAAGGCGGCGACAGCCAAGAAGAAGGACAGCCCGGAGGTCGGAGTGTCCTCCGGCGAGGGGCGAAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACCGGCCCAATGGGCCCGAAAACGCTGTGTAATGCTTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAGTACCGCCCCGCCGCTAGCCCCACCTTCGTTTTAACCAAACACTCCAATTCTCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAAGAGCTTCTTAGAGCCCAACAACAGCAACAACAACATTTGCTTTTGGATCATCATCAGGATATGATCTTTGATGCATCCAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCCGATTTCCGGCAGCTGATCTGA

mRNA sequence

ATGGAAGCTCCTGAATATTTCCAGATCAATGGCTACTGTTCTCAATTCGCCACCCACTCCTCCTCCGACAACGACTCCGCCACCGCCACCGCCACAGCCACTGCCGCCGGACCGGAGCATTTCATCGTGGAGGAGCTTCTCGATTTCTCCAACGATGACGACGCCGTTATTGGTGACGGTGGAGGATTGTTTTACAATAATAATAACAATGGGAATAATAATTCAACGGAATGTTCCGCCGTTACGGTGATTGAGAGTTGCAATTCGTCGTCGTTTTTGGAAGATATTAGTGGCTCTAATTTAACCGACGCCCATTTCTCCAGCGAACTCTGCGTTCCGTACGACGATTTAGCTGAGTTGGAATGGCTTTCACATTTCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTGGAATTAATCTCCGGAGTCAAAGTCAAAGAACCCGCCCAATCCCCACAACCCACCGTCTCTCACGGCCGAAAAGCCGCCGCAATTTTCAAACCGGACATCGTTTCCGTTCCGGCCAAAGCCCGTAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTTCCTCTTTCTCCCACCACCGAACCCGAAATTACCACCACCGCGGGACCACCGCACCCCATCAAAAAACCCCCTCCCAAGGCGGCGACAGCCAAGAAGAAGGACAGCCCGGAGGTCGGAGTGTCCTCCGGCGAGGGGCGAAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACCGGCCCAATGGGCCCGAAAACGCTGTGTAATGCTTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAGTACCGCCCCGCCGCTAGCCCCACCTTCGTTTTAACCAAACACTCCAATTCTCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAAGAGCTTCTTAGAGCCCAACAACAGCAACAACAACATTTGCTTTTGGATCATCATCAGGATATGATCTTTGATGCATCCAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCCGATTTCCGGCAGCTGATCTGA

Coding sequence (CDS)

ATGGAAGCTCCTGAATATTTCCAGATCAATGGCTACTGTTCTCAATTCGCCACCCACTCCTCCTCCGACAACGACTCCGCCACCGCCACCGCCACAGCCACTGCCGCCGGACCGGAGCATTTCATCGTGGAGGAGCTTCTCGATTTCTCCAACGATGACGACGCCGTTATTGGTGACGGTGGAGGATTGTTTTACAATAATAATAACAATGGGAATAATAATTCAACGGAATGTTCCGCCGTTACGGTGATTGAGAGTTGCAATTCGTCGTCGTTTTTGGAAGATATTAGTGGCTCTAATTTAACCGACGCCCATTTCTCCAGCGAACTCTGCGTTCCGTACGACGATTTAGCTGAGTTGGAATGGCTTTCACATTTCGTAGAGGAATCATTTTCCAGCGAGGACATGCAAAAGTTGGAATTAATCTCCGGAGTCAAAGTCAAAGAACCCGCCCAATCCCCACAACCCACCGTCTCTCACGGCCGAAAAGCCGCCGCAATTTTCAAACCGGACATCGTTTCCGTTCCGGCCAAAGCCCGTAGCAAACGCTCACGCGCCGTCCCATCCAATTGGAACAACTCCCGCCTCCTTCCTCTTTCTCCCACCACCGAACCCGAAATTACCACCACCGCGGGACCACCGCACCCCATCAAAAAACCCCCTCCCAAGGCGGCGACAGCCAAGAAGAAGGACAGCCCGGAGGTCGGAGTGTCCTCCGGCGAGGGGCGAAAGTGCATGCACTGCGCCACCGACAAGACGCCGCAGTGGCGGACCGGCCCAATGGGCCCGAAAACGCTGTGTAATGCTTGCGGCGTTCGGTACAAATCCGGCCGCCTGGTGCCGGAGTACCGCCCCGCCGCTAGCCCCACCTTCGTTTTAACCAAACACTCCAATTCTCACCGGAAAGTTTTGGAGCTCCGGCGGCAGAAAGAGCTTCTTAGAGCCCAACAACAGCAACAACAACATTTGCTTTTGGATCATCATCAGGATATGATCTTTGATGCATCCAACGGTGATGATTATCTCATTCATCAACATGTGGGCCCCGATTTCCGGCAGCTGATCTGA

Protein sequence

MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQSPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHVGPDFRQLI
Homology
BLAST of Cla97C08G155280 vs. NCBI nr
Match: XP_038886306.1 (GATA transcription factor 12-like [Benincasa hispida])

HSP 1 Score: 627.1 bits (1616), Expect = 9.3e-176
Identity = 330/368 (89.67%), Postives = 336/368 (91.30%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDG 60
           MEAPEYFQINGYCSQF+THSSSD D+ TATAT   AGPEHFIVEELLDFSNDDD V+GDG
Sbjct: 1   MEAPEYFQINGYCSQFSTHSSSDTDTTTATAT---AGPEHFIVEELLDFSNDDDGVVGDG 60

Query: 61  GGLFY--NNNNNGNNNSTECSAVTVIESCNS---------SSFLEDISGSNLTDAHFSSE 120
           GGLFY  NN NN NNNSTE SAVTVIESCNS         SSFLEDISGSNL DAHFSSE
Sbjct: 61  GGLFYNTNNGNNNNNNSTESSAVTVIESCNSSSFSGCEPNSSFLEDISGSNLADAHFSSE 120

Query: 121 LCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRKAAAI 180
           LCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKV+  EP  S QPT +  R AAAI
Sbjct: 121 LCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVRSDEPTNSRQPTAT--RNAAAI 180

Query: 181 FKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPKAATA 240
           FKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEIT TAGPPHPIKK PPKAATA
Sbjct: 181 FKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITATAGPPHPIKKNPPKAATA 240

Query: 241 KKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA 300
           KKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA
Sbjct: 241 KKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAA 300

Query: 301 SPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHV 356
           SPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQH+
Sbjct: 301 SPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYLIHQHM 360

BLAST of Cla97C08G155280 vs. NCBI nr
Match: XP_008445001.1 (PREDICTED: GATA transcription factor 12-like [Cucumis melo] >KAA0065050.1 GATA transcription factor 12-like [Cucumis melo var. makuwa])

HSP 1 Score: 553.9 bits (1426), Expect = 1.0e-153
Identity = 305/373 (81.77%), Postives = 320/373 (85.79%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI-- 60
           MEAPEYFQIN Y SQF   SS D+  A+ TA   AA PEHFIVEELLDFS N+DDAV   
Sbjct: 1   MEAPEYFQINAYSSQF---SSPDHADASTTA---AAAPEHFIVEELLDFSNNEDDAVFTD 60

Query: 61  ----GDGGGLFYNNNN------NGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHF 120
               G GGGLFYNNNN      N NNNS E SA+TV+ESCN SSSF EDISGSNL DAHF
Sbjct: 61  AGGGGGGGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSSSSFFEDISGSNLGDAHF 120

Query: 121 SSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRK 180
           SSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKVK    PAQSPQPT +  R 
Sbjct: 121 SSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTAT--RT 180

Query: 181 AAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK 240
           AAAIFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT EPEIT   G P+ IKKP PK
Sbjct: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPTAEPEITAPIGQPYSIKKPLPK 240

Query: 241 -AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300
            AATAKKKD+P+VG SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 241 VAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300

Query: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYL 356
           YRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQHLLLDH QDMIFDASNGDDYL
Sbjct: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDHRQDMIFDASNGDDYL 360

BLAST of Cla97C08G155280 vs. NCBI nr
Match: XP_031736569.1 (GATA transcription factor 12 [Cucumis sativus])

HSP 1 Score: 544.7 bits (1402), Expect = 6.1e-151
Identity = 306/380 (80.53%), Postives = 318/380 (83.68%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI-- 60
           MEAPEYFQIN Y SQF   SS D+  AT TA A AA P+HFIVEELLDFS N+DDAV+  
Sbjct: 1   MEAPEYFQINAYSSQF---SSPDDADATTTA-AAAAAPDHFIVEELLDFSNNEDDAVLTD 60

Query: 61  ----------GDGGGLFYNNNN------NGNNNSTECSAVTVIESCN-SSSFLEDISGSN 120
                     G GGGLFYNNNN      N NNNSTE SAVTV+ESCN SSSF EDISGSN
Sbjct: 61  SGGGGGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSN 120

Query: 121 LTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPT 180
           L DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK    P QSPQPT
Sbjct: 121 LGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPT 180

Query: 181 VSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL-SPTTEPEITTTAGPPHP 240
            +  R AAAIFKP+IVSVPAKARSKRSRA+PSNWNNS LLPL SPT E E T     PHP
Sbjct: 181 AT--RSAAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAESETTPPIEQPHP 240

Query: 241 IKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300
           IKK  PK AATAKKKDSP++G SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Sbjct: 241 IKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300

Query: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDA 356
           SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ QHLLLDH QDMIFDA
Sbjct: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDA 360

BLAST of Cla97C08G155280 vs. NCBI nr
Match: XP_022132107.1 (GATA transcription factor 12 [Momordica charantia])

HSP 1 Score: 468.0 bits (1203), Expect = 7.2e-128
Identity = 274/379 (72.30%), Postives = 290/379 (76.52%), Query Frame = 0

Query: 1   MEAPEYFQIN--GYC-SQFAT---HSSSDNDSATATATATAAGPEHFIVEELLDFSNDDD 60
           ME P+YFQIN   YC SQF     HSSSDND      T    G EHFIVEELLDFSN DD
Sbjct: 1   MELPDYFQINNAAYCSSQFVAETRHSSSDND------TDGGCGGEHFIVEELLDFSN-DD 60

Query: 61  AVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS---------SFLEDISGSNLTDAH 120
            V  D          NGN+N+    +V+VIESCNSS         SFL+DI+ SNL DA 
Sbjct: 61  GVAADVSSF------NGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAK 120

Query: 121 FSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRK 180
           FS+ELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK  E  Q  QP+ +    
Sbjct: 121 FSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVA 180

Query: 181 AAAIFKPDIVSVPAKARSKRSR-AVPSNWNNSRLLPLSPTTEPE----ITTTAGPPHPIK 240
           AA IFKPDIVSVPAKARSKRSR AVP+NWNNSRLLPLSPTT       +   A PPHP K
Sbjct: 181 AAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGK 240

Query: 241 KPPPKA-ATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300
           K   KA  TAKKKD P+ G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Sbjct: 241 KATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300

Query: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQH-LLLDHHQDMIFDAS 356
           RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ QH L+LDHHQ+MIFDAS
Sbjct: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDAS 360

BLAST of Cla97C08G155280 vs. NCBI nr
Match: XP_023002390.1 (GATA transcription factor 12-like [Cucurbita maxima])

HSP 1 Score: 453.0 bits (1164), Expect = 2.4e-123
Identity = 255/381 (66.93%), Postives = 288/381 (75.59%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDG 60
           MEAPEYF  N YCSQF    +SD D+A +TATATA   +HFIVEELLDFSNDDD+ I D 
Sbjct: 1   MEAPEYFHNNAYCSQF----TSDKDAAASTATATATA-DHFIVEELLDFSNDDDSAIADS 60

Query: 61  GGLFYNNNNNGNNNSTECSAVTVIESCNSS--------SFLEDISGSNLTDAHFSSELCV 120
           GG F N     N NS+E SA T +ES NSS        SF +D+SGS+L D  FS ++ +
Sbjct: 61  GGFFNNVTCFLNGNSSESSAATAVESSNSSSFSGCERTSFFDDVSGSSLADVRFSDDIFI 120

Query: 121 PYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPT-----VSHGRKAA 180
           PY++L ELEWL+ F EE FSSEDMQKLELI+GVKVK  EP QS  PT      SHGR AA
Sbjct: 121 PYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPLQSQHPTTAVSAASHGRNAA 180

Query: 181 A-IFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPT---TEPEITTTAGPPHPIKKPP 240
           A IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT   +E +I  T  PPHP+KK P
Sbjct: 181 AEIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSEQDIPATEPPPHPVKKVP 240

Query: 241 PKAATAKKK----DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300
           PK A A KK     S E G+S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Sbjct: 241 PKVAAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300

Query: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---QQHLLLDHHQDMIFD 356
           RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +AQ+QQ     H    HHQ+M+FD
Sbjct: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKAQEQQALMMDHHHHHHHQEMMFD 360

BLAST of Cla97C08G155280 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 1.3e-68
Identity = 177/340 (52.06%), Postives = 208/340 (61.18%), Query Frame = 0

Query: 41  FIVEELL-DFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCN-SSSFLEDISG 100
           F V++LL DFSNDDD              N+   +ST  +  T+ +S N S++ L    G
Sbjct: 14  FAVDDLLVDFSNDDD------------EENDVVADST--TTTTITDSSNFSAADLPSFHG 73

Query: 101 SNLTDAHFSSELCVPYDDLA-ELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQSPQPT 160
                  FS +LC+P DDLA ELEWLS+ V+ES S ED+ KLELISG K +   +S   +
Sbjct: 74  DVQDGTSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKSRPDPKSDTGS 133

Query: 161 VSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL----SPTTEPEITTTAGP 220
             +   ++ IF  D VSVPAKARSKRSRA   NW +  LL      SP T   I ++   
Sbjct: 134 PENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQH 193

Query: 221 PHPIKKPPPKAATAKKK-------------DSPEVGVSSGEGRKCMHCATDKTPQWRTGP 280
             P   PP   A   KK              SPE G    E R+C+HCATDKTPQWRTGP
Sbjct: 194 LSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESG--GAEERRCLHCATDKTPQWRTGP 253

Query: 281 MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQ 340
           MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ RA     
Sbjct: 254 MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMSRA----- 313

Query: 341 QHLLLDHHQD----MIFD-ASNGDDYLIHQHVGPDFRQLI 356
            H  + HH      MIFD +S+GDDYLIH +VGPDFRQLI
Sbjct: 314 HHEFIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of Cla97C08G155280 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 3.4e-64
Identity = 161/337 (47.77%), Postives = 207/337 (61.42%), Query Frame = 0

Query: 35  AAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLE 94
           A  P+ F+V++LLDFSNDD  V         ++  N   +S+  S  T+ +S NSSS   
Sbjct: 12  AGNPDSFVVDDLLDFSNDDGEV---------DDGLNTLPDSSTLSTGTLTDSSNSSSLFT 71

Query: 95  DISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQS- 154
           D +G         S+L +P DD+AELEWLS+FVEESF+ ED  KL L SG+K  +   S 
Sbjct: 72  DGTG--------FSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLKNPQTTGST 131

Query: 155 ------PQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI 214
                 P+P + H            V+VPAKARSKRSR+  S W  SRLL L+ + E   
Sbjct: 132 LTHLIKPEPELDH---QFIDIDESNVAVPAKARSKRSRSAASTW-ASRLLSLADSDE--- 191

Query: 215 TTTAGPPHPIKKPPPKAATAKKKD-SPEVGVSSGE---GRKCMHCATDKTPQWRTGPMGP 274
                       P  K    K++D + ++ V  GE   GR+C+HCAT+KTPQWRTGPMGP
Sbjct: 192 ----------TNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGP 251

Query: 275 KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHL 334
           KTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+      + +HL
Sbjct: 252 KTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM------RDEHL 308

Query: 335 LLD-HHQDMIFD-ASNGDDYLIH---QHVGPDFRQLI 356
           L     ++++ D  SNG+D+L+H    HV PDFR LI
Sbjct: 312 LSQLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of Cla97C08G155280 vs. ExPASy Swiss-Prot
Match: O49743 (GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 9.7e-43
Identity = 121/280 (43.21%), Postives = 153/280 (54.64%), Query Frame = 0

Query: 36  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLED 95
           + P+   +++LLDFSND+         +F     + ++  T  +A +   S N  SF   
Sbjct: 7   SSPDLLRIDDLLDFSNDE---------IF-----SSSSTVTSSAASSAASSENPFSFPSS 66

Query: 96  ISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQSPQ 155
              S      F+ +LCVP DD A LEWLS FV++SFS                 PA    
Sbjct: 67  TYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFSD---------------FPANPLT 126

Query: 156 PTVSHGRKAAAIFKPDIVSVPAKARSKRSRA----VPSNWNNSRLLPLSPTTEPEITTTA 215
            TV          +P+I S   K RS+RSRA    V   W        +P +E E+    
Sbjct: 127 MTV----------RPEI-SFTGKPRSRRSRAPAPSVAGTW--------APMSESELC--- 186

Query: 216 GPPHPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACG 275
              H + KP P      KK      V++   R+C HCA++KTPQWRTGP+GPKTLCNACG
Sbjct: 187 ---HSVAKPKP------KKVYNAESVTADGARRCTHCASEKTPQWRTGPLGPKTLCNACG 226

Query: 276 VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 312
           VRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Sbjct: 247 VRYKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKE 226

BLAST of Cla97C08G155280 vs. ExPASy Swiss-Prot
Match: O49741 (GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 8.2e-42
Identity = 116/294 (39.46%), Postives = 151/294 (51.36%), Query Frame = 0

Query: 36  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLED 95
           + P+   +++LLDFSN+D       GG            ST  ++ +      + SF   
Sbjct: 7   SSPDLLRIDDLLDFSNEDIFSASSSGG------------STAATSSSSFPPPQNPSFHHH 66

Query: 96  ISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKL-ELISGVKVKEPAQSP 155
              S+     F  ++CVP DD A LEWLS FV++SF+      L   ++ VK +      
Sbjct: 67  HLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMTSVKTE------ 126

Query: 156 QPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPP 215
                              S P K RSKRSRA          +PL    +   +     P
Sbjct: 127 ------------------TSFPGKPRSKRSRAPAPFAGTWSPMPLESEHQQLHSAAKFKP 186

Query: 216 HPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY 275
              +         + + S       G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+
Sbjct: 187 KKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRF 246

Query: 276 KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHH 329
           KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R  QQ Q H    HH
Sbjct: 247 KSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQVQLH----HH 260

BLAST of Cla97C08G155280 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 1.5e-35
Identity = 119/297 (40.07%), Postives = 152/297 (51.18%), Query Frame = 0

Query: 39  EHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISG 98
           + F V++LLD SNDD  V  D        +    +     S+    +  ++     D SG
Sbjct: 39  DDFSVDDLLDLSNDD--VFAD-----EETDLKAQHEMVRVSSEEPNDDGDALRRSSDFSG 98

Query: 99  SNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPA------Q 158
            +   +  +SEL +P DDLA LEWLSHFVE+SF+      L   +G   ++PA      +
Sbjct: 99  CDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYSGPNL---TGTPTEKPAWLTGDRK 158

Query: 159 SPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI-TTTA 218
            P   V+        FK     VPAKARSKR+R     W+        P++     ++++
Sbjct: 159 HPVTAVTE----ETCFKS---PVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGSTSSSSS 218

Query: 219 GPPHP-----------IKKPPPKAATAKKKDSPEVGVSSGE------GRKCMHCATDKTP 278
           GP  P           +    P      KK S E  V SGE       RKC HC   KTP
Sbjct: 219 GPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAE-SVFSGELQQLQPQRKCSHCGVQKTP 278

Query: 279 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 312
           QWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE
Sbjct: 279 QWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cla97C08G155280 vs. ExPASy TrEMBL
Match: A0A5A7VCX1 (GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003790 PE=3 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 4.8e-154
Identity = 305/373 (81.77%), Postives = 320/373 (85.79%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI-- 60
           MEAPEYFQIN Y SQF   SS D+  A+ TA   AA PEHFIVEELLDFS N+DDAV   
Sbjct: 1   MEAPEYFQINAYSSQF---SSPDHADASTTA---AAAPEHFIVEELLDFSNNEDDAVFTD 60

Query: 61  ----GDGGGLFYNNNN------NGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHF 120
               G GGGLFYNNNN      N NNNS E SA+TV+ESCN SSSF EDISGSNL DAHF
Sbjct: 61  AGGGGGGGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSSSSFFEDISGSNLGDAHF 120

Query: 121 SSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRK 180
           SSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKVK    PAQSPQPT +  R 
Sbjct: 121 SSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTAT--RT 180

Query: 181 AAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK 240
           AAAIFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT EPEIT   G P+ IKKP PK
Sbjct: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPTAEPEITAPIGQPYSIKKPLPK 240

Query: 241 -AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300
            AATAKKKD+P+VG SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 241 VAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300

Query: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYL 356
           YRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQHLLLDH QDMIFDASNGDDYL
Sbjct: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDHRQDMIFDASNGDDYL 360

BLAST of Cla97C08G155280 vs. ExPASy TrEMBL
Match: A0A1S3BBN7 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103488171 PE=3 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 4.8e-154
Identity = 305/373 (81.77%), Postives = 320/373 (85.79%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI-- 60
           MEAPEYFQIN Y SQF   SS D+  A+ TA   AA PEHFIVEELLDFS N+DDAV   
Sbjct: 1   MEAPEYFQINAYSSQF---SSPDHADASTTA---AAAPEHFIVEELLDFSNNEDDAVFTD 60

Query: 61  ----GDGGGLFYNNNN------NGNNNSTECSAVTVIESCN-SSSFLEDISGSNLTDAHF 120
               G GGGLFYNNNN      N NNNS E SA+TV+ESCN SSSF EDISGSNL DAHF
Sbjct: 61  AGGGGGGGGLFYNNNNTTSNDHNNNNNSAESSAITVMESCNSSSSFFEDISGSNLGDAHF 120

Query: 121 SSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPTVSHGRK 180
           SSELCVPYDDLAELEWLS+FVEESFSSEDMQKLEL+SGVKVK    PAQSPQPT +  R 
Sbjct: 121 SSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELLSGVKVKSDEIPAQSPQPTAT--RT 180

Query: 181 AAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPPHPIKKPPPK 240
           AAAIFKP+IVSVPAKARSKRSRA+PSNWNNS LLPLSPT EPEIT   G P+ IKKP PK
Sbjct: 181 AAAIFKPEIVSVPAKARSKRSRALPSNWNNSSLLPLSPTAEPEITAPIGQPYSIKKPLPK 240

Query: 241 -AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300
            AATAKKKD+P+VG SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE
Sbjct: 241 VAATAKKKDNPDVGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPE 300

Query: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDASNGDDYL 356
           YRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQQQHLLLDH QDMIFDASNGDDYL
Sbjct: 301 YRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQQQHLLLDHRQDMIFDASNGDDYL 360

BLAST of Cla97C08G155280 vs. ExPASy TrEMBL
Match: A0A0A0LPR5 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_2G373450 PE=3 SV=1)

HSP 1 Score: 544.7 bits (1402), Expect = 2.9e-151
Identity = 306/380 (80.53%), Postives = 318/380 (83.68%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFS-NDDDAVI-- 60
           MEAPEYFQIN Y SQF   SS D+  AT TA A AA P+HFIVEELLDFS N+DDAV+  
Sbjct: 1   MEAPEYFQINAYSSQF---SSPDDADATTTA-AAAAAPDHFIVEELLDFSNNEDDAVLTD 60

Query: 61  ----------GDGGGLFYNNNN------NGNNNSTECSAVTVIESCN-SSSFLEDISGSN 120
                     G GGGLFYNNNN      N NNNSTE SAVTV+ESCN SSSF EDISGSN
Sbjct: 61  SGGGGGGGGGGGGGGLFYNNNNTSTNDHNNNNNSTESSAVTVMESCNSSSSFFEDISGSN 120

Query: 121 LTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKE---PAQSPQPT 180
           L DAHFSSELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK    P QSPQPT
Sbjct: 121 LGDAHFSSELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDEPPTQSPQPT 180

Query: 181 VSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL-SPTTEPEITTTAGPPHP 240
            +  R AAAIFKP+IVSVPAKARSKRSRA+PSNWNNS LLPL SPT E E T     PHP
Sbjct: 181 AT--RSAAAIFKPEIVSVPAKARSKRSRALPSNWNNSALLPLSSPTAESETTPPIEQPHP 240

Query: 241 IKKPPPK-AATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300
           IKK  PK AATAKKKDSP++G SSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK
Sbjct: 241 IKKTLPKAAATAKKKDSPDLGFSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYK 300

Query: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHHQDMIFDA 356
           SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE+LRAQQQQ QHLLLDH QDMIFDA
Sbjct: 301 SGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEILRAQQQQPQHLLLDHRQDMIFDA 360

BLAST of Cla97C08G155280 vs. ExPASy TrEMBL
Match: A0A6J1BSX6 (GATA transcription factor OS=Momordica charantia OX=3673 GN=LOC111005058 PE=3 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 3.5e-128
Identity = 274/379 (72.30%), Postives = 290/379 (76.52%), Query Frame = 0

Query: 1   MEAPEYFQIN--GYC-SQFAT---HSSSDNDSATATATATAAGPEHFIVEELLDFSNDDD 60
           ME P+YFQIN   YC SQF     HSSSDND      T    G EHFIVEELLDFSN DD
Sbjct: 1   MELPDYFQINNAAYCSSQFVAETRHSSSDND------TDGGCGGEHFIVEELLDFSN-DD 60

Query: 61  AVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSS---------SFLEDISGSNLTDAH 120
            V  D          NGN+N+    +V+VIESCNSS         SFL+DI+ SNL DA 
Sbjct: 61  GVAADVSSF------NGNDNNNPSVSVSVIESCNSSNSFSCCEPNSFLDDITHSNLGDAK 120

Query: 121 FSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPTVSHGRK 180
           FS+ELCVPYDDLAELEWLS+FVEESFSSEDMQKLELISGVKVK  E  Q  QP+ +    
Sbjct: 121 FSTELCVPYDDLAELEWLSNFVEESFSSEDMQKLELISGVKVKSDETFQIRQPSPAVAVA 180

Query: 181 AAAIFKPDIVSVPAKARSKRSR-AVPSNWNNSRLLPLSPTTEPE----ITTTAGPPHPIK 240
           AA IFKPDIVSVPAKARSKRSR AVP+NWNNSRLLPLSPTT       +   A PPHP K
Sbjct: 181 AAEIFKPDIVSVPAKARSKRSRAAVPTNWNNSRLLPLSPTTSSSELDVLAVAATPPHPGK 240

Query: 241 KPPPKA-ATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300
           K   KA  TAKKKD P+ G S GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Sbjct: 241 KATIKATVTAKKKDCPDAGASPGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300

Query: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQH-LLLDHHQDMIFDAS 356
           RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL R QQQQ QH L+LDHHQ+MIFDAS
Sbjct: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQRTQQQQHQHQLILDHHQNMIFDAS 360

BLAST of Cla97C08G155280 vs. ExPASy TrEMBL
Match: A0A6J1KNT0 (GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111496245 PE=3 SV=1)

HSP 1 Score: 453.0 bits (1164), Expect = 1.2e-123
Identity = 255/381 (66.93%), Postives = 288/381 (75.59%), Query Frame = 0

Query: 1   MEAPEYFQINGYCSQFATHSSSDNDSATATATATAAGPEHFIVEELLDFSNDDDAVIGDG 60
           MEAPEYF  N YCSQF    +SD D+A +TATATA   +HFIVEELLDFSNDDD+ I D 
Sbjct: 1   MEAPEYFHNNAYCSQF----TSDKDAAASTATATATA-DHFIVEELLDFSNDDDSAIADS 60

Query: 61  GGLFYNNNNNGNNNSTECSAVTVIESCNSS--------SFLEDISGSNLTDAHFSSELCV 120
           GG F N     N NS+E SA T +ES NSS        SF +D+SGS+L D  FS ++ +
Sbjct: 61  GGFFNNVTCFLNGNSSESSAATAVESSNSSSFSGCERTSFFDDVSGSSLADVRFSDDIFI 120

Query: 121 PYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVK--EPAQSPQPT-----VSHGRKAA 180
           PY++L ELEWL+ F EE FSSEDMQKLELI+GVKVK  EP QS  PT      SHGR AA
Sbjct: 121 PYNELVELEWLASFEEEPFSSEDMQKLELITGVKVKPDEPLQSQHPTTAVSAASHGRNAA 180

Query: 181 A-IFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPT---TEPEITTTAGPPHPIKKPP 240
           A IFKPDIV+VPAKARSKRSR +PSNWNNSRLLPLSPT   +E +I  T  PPHP+KK P
Sbjct: 181 AEIFKPDIVAVPAKARSKRSRIIPSNWNNSRLLPLSPTSSSSEQDIPATEPPPHPVKKVP 240

Query: 241 PKAATAKKK----DSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300
           PK A A KK     S E G+S+GEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG
Sbjct: 241 PKVAAAVKKKESSSSSETGMSAGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRYKSG 300

Query: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQ---QQHLLLDHHQDMIFD 356
           RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEL +AQ+QQ     H    HHQ+M+FD
Sbjct: 301 RLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELQKAQEQQALMMDHHHHHHHQEMMFD 360

BLAST of Cla97C08G155280 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12 )

HSP 1 Score: 261.5 bits (667), Expect = 9.6e-70
Identity = 177/340 (52.06%), Postives = 208/340 (61.18%), Query Frame = 0

Query: 41  FIVEELL-DFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCN-SSSFLEDISG 100
           F V++LL DFSNDDD              N+   +ST  +  T+ +S N S++ L    G
Sbjct: 14  FAVDDLLVDFSNDDD------------EENDVVADST--TTTTITDSSNFSAADLPSFHG 73

Query: 101 SNLTDAHFSSELCVPYDDLA-ELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQSPQPT 160
                  FS +LC+P DDLA ELEWLS+ V+ES S ED+ KLELISG K +   +S   +
Sbjct: 74  DVQDGTSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKSRPDPKSDTGS 133

Query: 161 VSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPL----SPTTEPEITTTAGP 220
             +   ++ IF  D VSVPAKARSKRSRA   NW +  LL      SP T   I ++   
Sbjct: 134 PENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQH 193

Query: 221 PHPIKKPPPKAATAKKK-------------DSPEVGVSSGEGRKCMHCATDKTPQWRTGP 280
             P   PP   A   KK              SPE G    E R+C+HCATDKTPQWRTGP
Sbjct: 194 LSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESG--GAEERRCLHCATDKTPQWRTGP 253

Query: 281 MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQ 340
           MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKV+ELRRQKE+ RA     
Sbjct: 254 MGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMSRA----- 313

Query: 341 QHLLLDHHQD----MIFD-ASNGDDYLIHQHVGPDFRQLI 356
            H  + HH      MIFD +S+GDDYLIH +VGPDFRQLI
Sbjct: 314 HHEFIHHHHGTDTAMIFDVSSDGDDYLIHHNVGPDFRQLI 331

BLAST of Cla97C08G155280 vs. TAIR 10
Match: AT4G32890.1 (GATA transcription factor 9 )

HSP 1 Score: 246.9 bits (629), Expect = 2.4e-65
Identity = 161/337 (47.77%), Postives = 207/337 (61.42%), Query Frame = 0

Query: 35  AAGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLE 94
           A  P+ F+V++LLDFSNDD  V         ++  N   +S+  S  T+ +S NSSS   
Sbjct: 12  AGNPDSFVVDDLLDFSNDDGEV---------DDGLNTLPDSSTLSTGTLTDSSNSSSLFT 71

Query: 95  DISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQS- 154
           D +G         S+L +P DD+AELEWLS+FVEESF+ ED  KL L SG+K  +   S 
Sbjct: 72  DGTG--------FSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLKNPQTTGST 131

Query: 155 ------PQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI 214
                 P+P + H            V+VPAKARSKRSR+  S W  SRLL L+ + E   
Sbjct: 132 LTHLIKPEPELDH---QFIDIDESNVAVPAKARSKRSRSAASTW-ASRLLSLADSDE--- 191

Query: 215 TTTAGPPHPIKKPPPKAATAKKKD-SPEVGVSSGE---GRKCMHCATDKTPQWRTGPMGP 274
                       P  K    K++D + ++ V  GE   GR+C+HCAT+KTPQWRTGPMGP
Sbjct: 192 ----------TNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGP 251

Query: 275 KTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHL 334
           KTLCNACGVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKV+ELRRQKE+      + +HL
Sbjct: 252 KTLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM------RDEHL 308

Query: 335 LLD-HHQDMIFD-ASNGDDYLIH---QHVGPDFRQLI 356
           L     ++++ D  SNG+D+L+H    HV PDFR LI
Sbjct: 312 LSQLRCENLLMDIRSNGEDFLMHNNTNHVAPDFRHLI 308

BLAST of Cla97C08G155280 vs. TAIR 10
Match: AT3G60530.1 (GATA transcription factor 4 )

HSP 1 Score: 175.6 bits (444), Expect = 6.9e-44
Identity = 121/280 (43.21%), Postives = 153/280 (54.64%), Query Frame = 0

Query: 36  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLED 95
           + P+   +++LLDFSND+         +F     + ++  T  +A +   S N  SF   
Sbjct: 7   SSPDLLRIDDLLDFSNDE---------IF-----SSSSTVTSSAASSAASSENPFSFPSS 66

Query: 96  ISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPAQSPQ 155
              S      F+ +LCVP DD A LEWLS FV++SFS                 PA    
Sbjct: 67  TYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFSD---------------FPANPLT 126

Query: 156 PTVSHGRKAAAIFKPDIVSVPAKARSKRSRA----VPSNWNNSRLLPLSPTTEPEITTTA 215
            TV          +P+I S   K RS+RSRA    V   W        +P +E E+    
Sbjct: 127 MTV----------RPEI-SFTGKPRSRRSRAPAPSVAGTW--------APMSESELC--- 186

Query: 216 GPPHPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACG 275
              H + KP P      KK      V++   R+C HCA++KTPQWRTGP+GPKTLCNACG
Sbjct: 187 ---HSVAKPKP------KKVYNAESVTADGARRCTHCASEKTPQWRTGPLGPKTLCNACG 226

Query: 276 VRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 312
           VRYKSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE
Sbjct: 247 VRYKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKE 226

BLAST of Cla97C08G155280 vs. TAIR 10
Match: AT2G45050.1 (GATA transcription factor 2 )

HSP 1 Score: 172.6 bits (436), Expect = 5.8e-43
Identity = 116/294 (39.46%), Postives = 151/294 (51.36%), Query Frame = 0

Query: 36  AGPEHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLED 95
           + P+   +++LLDFSN+D       GG            ST  ++ +      + SF   
Sbjct: 7   SSPDLLRIDDLLDFSNEDIFSASSSGG------------STAATSSSSFPPPQNPSFHHH 66

Query: 96  ISGSNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKL-ELISGVKVKEPAQSP 155
              S+     F  ++CVP DD A LEWLS FV++SF+      L   ++ VK +      
Sbjct: 67  HLPSSADHHSFLHDICVPSDDAAHLEWLSQFVDDSFADFPANPLGGTMTSVKTE------ 126

Query: 156 QPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEITTTAGPP 215
                              S P K RSKRSRA          +PL    +   +     P
Sbjct: 127 ------------------TSFPGKPRSKRSRAPAPFAGTWSPMPLESEHQQLHSAAKFKP 186

Query: 216 HPIKKPPPKAATAKKKDSPEVGVSSGEGRKCMHCATDKTPQWRTGPMGPKTLCNACGVRY 275
              +         + + S       G  R+C HCA++KTPQWRTGP+GPKTLCNACGVR+
Sbjct: 187 KKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRF 246

Query: 276 KSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKELLRAQQQQQQHLLLDHH 329
           KSGRLVPEYRPA+SPTFVLT+HSNSHRKV+ELRRQKE++R  QQ Q H    HH
Sbjct: 247 KSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQVQLH----HH 260

BLAST of Cla97C08G155280 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5 )

HSP 1 Score: 151.8 bits (382), Expect = 1.1e-36
Identity = 119/297 (40.07%), Postives = 152/297 (51.18%), Query Frame = 0

Query: 39  EHFIVEELLDFSNDDDAVIGDGGGLFYNNNNNGNNNSTECSAVTVIESCNSSSFLEDISG 98
           + F V++LLD SNDD  V  D        +    +     S+    +  ++     D SG
Sbjct: 39  DDFSVDDLLDLSNDD--VFAD-----EETDLKAQHEMVRVSSEEPNDDGDALRRSSDFSG 98

Query: 99  SNLTDAHFSSELCVPYDDLAELEWLSHFVEESFSSEDMQKLELISGVKVKEPA------Q 158
            +   +  +SEL +P DDLA LEWLSHFVE+SF+      L   +G   ++PA      +
Sbjct: 99  CDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYSGPNL---TGTPTEKPAWLTGDRK 158

Query: 159 SPQPTVSHGRKAAAIFKPDIVSVPAKARSKRSRAVPSNWNNSRLLPLSPTTEPEI-TTTA 218
            P   V+        FK     VPAKARSKR+R     W+        P++     ++++
Sbjct: 159 HPVTAVTE----ETCFKS---PVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGSTSSSSS 218

Query: 219 GPPHP-----------IKKPPPKAATAKKKDSPEVGVSSGE------GRKCMHCATDKTP 278
           GP  P           +    P      KK S E  V SGE       RKC HC   KTP
Sbjct: 219 GPSSPWFSGAELLEPVVTSERPPFPKKHKKRSAE-SVFSGELQQLQPQRKCSHCGVQKTP 278

Query: 279 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 312
           QWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE
Sbjct: 279 QWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886306.19.3e-17689.67GATA transcription factor 12-like [Benincasa hispida][more]
XP_008445001.11.0e-15381.77PREDICTED: GATA transcription factor 12-like [Cucumis melo] >KAA0065050.1 GATA t... [more]
XP_031736569.16.1e-15180.53GATA transcription factor 12 [Cucumis sativus][more]
XP_022132107.17.2e-12872.30GATA transcription factor 12 [Momordica charantia][more]
XP_023002390.12.4e-12366.93GATA transcription factor 12-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
P697811.3e-6852.06GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1[more]
O826323.4e-6447.77GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1[more]
O497439.7e-4343.21GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1[more]
O497418.2e-4239.46GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1[more]
Q9FH571.5e-3540.07GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7VCX14.8e-15481.77GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffo... [more]
A0A1S3BBN74.8e-15481.77GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103488171 PE=3 SV=1[more]
A0A0A0LPR52.9e-15180.53GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_2G373450 PE=3 SV=1[more]
A0A6J1BSX63.5e-12872.30GATA transcription factor OS=Momordica charantia OX=3673 GN=LOC111005058 PE=3 SV... [more]
A0A6J1KNT01.2e-12366.93GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111496245 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G25830.19.6e-7052.06GATA transcription factor 12 [more]
AT4G32890.12.4e-6547.77GATA transcription factor 9 [more]
AT3G60530.16.9e-4443.21GATA transcription factor 4 [more]
AT2G45050.15.8e-4339.46GATA transcription factor 2 [more]
AT5G66320.11.1e-3640.07GATA transcription factor 5 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000679Zinc finger, GATA-typeSMARTSM00401GATA_3coord: 239..289
e-value: 1.2E-17
score: 74.7
IPR000679Zinc finger, GATA-typePFAMPF00320GATAcoord: 245..278
e-value: 1.8E-15
score: 56.2
IPR000679Zinc finger, GATA-typePROSITEPS00344GATA_ZN_FINGER_1coord: 245..270
IPR000679Zinc finger, GATA-typePROSITEPS50114GATA_ZN_FINGER_2coord: 239..275
score: 12.637311
IPR000679Zinc finger, GATA-typeCDDcd00202ZnF_GATAcoord: 244..291
e-value: 3.44761E-15
score: 67.0126
IPR013088Zinc finger, NHR/GATA-typeGENE3D3.30.50.10coord: 238..289
e-value: 6.0E-16
score: 59.8
IPR016679Transcription factor, GATA, plantPIRSFPIRSF016992Txn_fac_GATA_plantcoord: 22..334
e-value: 8.8E-83
score: 276.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 178..243
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 185..209
NoneNo IPR availablePANTHERPTHR45658:SF43GATA TRANSCRIPTION FACTORcoord: 1..355
NoneNo IPR availablePANTHERPTHR45658GATA TRANSCRIPTION FACTORcoord: 1..355
NoneNo IPR availableSUPERFAMILY57716Glucocorticoid receptor-like (DNA-binding domain)coord: 241..303

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C08G155280.1Cla97C08G155280.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding