CmoCh14G013290 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh14G013290
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: GATA transcription factor
Location: Cmo_Chr14: 11109848 .. 11111046 (-)
RNA-Seq Expression: CmoCh14G013290
Synteny: CmoCh14G013290
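
The Location field packs the chromosome, start, end and strand into one string. A minimal sketch of splitting it apart and computing the genomic span follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SeqID: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr14: 11109848 .. 11111046 (-)")
print(seqid, start, end, strand, span_length(start, end))
```

Under that assumption the gene spans 1199 bp on the minus strand of Cmo_Chr14.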
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACCCCCCTTTCCACTCTTCTCTGGCAACTCAAATTACTCTCCAGTAACTTGCTCTGCGTTAATGGAGGTGCCCGAGTATCTTGTCGGTGGCTACTACGGCACCGAGGGCGGTCAATTTTCGCCGGACAAGCAGAAATCCGCCGCCGAGCAGTTCGCTGTTGATGAATATTTATTGGATTTCTCCAATGAAGATATGGCAATGCAGAGCGGTTGCTTCGATAACGTTGCCGGAAATTGCTGTGATTCTTCGACGGTGACTGCGATTGAGAGCTGCAATTCGTCGGCCTCCAGCGGCGATAACTTGGTGTTGGGAAATTTTGGGTCTGGAGGCTTCTGTGAAGCTCAATTCTCTAACGAACTCTGCATTCCGGTATTAGAAATTGGAGCGTTTTTGTTTGATTGTGAAATTCTGTGAAATTTTTTGGATTTGATTCTGTTTTTTTTTACAGTGCGACGATTTGGCGGAGCTCGAATGGCTGTCGAATTTCGTTGAAGGATCATTTTCGACGGAGGAGATTGAGAAGGATTTTCAGGCGATTCCATTTCTCTCCGGGGGGATCGGTACGGCGGTGACTCTAGAAGCGCCGTCGTCTTCAGGAGAGACGGCGTTTGGTTACGGAAGTGGAAAAACGACATCGTTTTTTCACGGTGAAGCTCTTACTCTCCCCGGCAAAGCCCGAAGCAAACGATCACGCGGTTCTCCTTGCGACTGGTCCACGCGCGTCCTCAGGGCGACGGCTCCAGAGGCGGGAAAGTCCGAAATGACGTCAGGCCGGAAATGCCAGCATTGCGCGGCGGAGAAGACGCCGCAGTGGCGGACTGGGCCTATGGGCCCAAAAACGCTTTGTAATGCTTGTGGGGTCCGGTATAAGTCGGGTCGACTGGTACCGGAATACCGACCTGCTGCGAGCCCGACATTTATGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAACTTCGACGGCAAAATGAGGTAGAACAGTTGGGGAGATGGAAGGGGGGTGAGGAGTACTTGATCCACCGCCACAACGGCGGTGATTTCAGTCGGATGATGTAGAATTTTGCAGTAGAAGGGGCGGGTAGTTTTTAACCCTTCGAATTCGGTAATGATTATGATCATGGTGGGACTGAGTAGATGATGTACGGACTGTTTCCAGGCATCTTTTTTATGATAAATATAGTTCATGGGCCC

mRNA sequence

ACCCCCCTTTCCACTCTTCTCTGGCAACTCAAATTACTCTCCAGTAACTTGCTCTGCGTTAATGGAGGTGCCCGAGTATCTTGTCGGTGGCTACTACGGCACCGAGGGCGGTCAATTTTCGCCGGACAAGCAGAAATCCGCCGCCGAGCAGTTCGCTGTTGATGAATATTTATTGGATTTCTCCAATGAAGATATGGCAATGCAGAGCGGTTGCTTCGATAACGTTGCCGGAAATTGCTGTGATTCTTCGACGGTGACTGCGATTGAGAGCTGCAATTCGTCGGCCTCCAGCGGCGATAACTTGGTGTTGGGAAATTTTGGGTCTGGAGGCTTCTGTGAAGCTCAATTCTCTAACGAACTCTGCATTCCGTGCGACGATTTGGCGGAGCTCGAATGGCTGTCGAATTTCGTTGAAGGATCATTTTCGACGGAGGAGATTGAGAAGGATTTTCAGGCGATTCCATTTCTCTCCGGGGGGATCGGTACGGCGGTGACTCTAGAAGCGCCGTCGTCTTCAGGAGAGACGGCGTTTGGTTACGGAAGTGGAAAAACGACATCGTTTTTTCACGGTGAAGCTCTTACTCTCCCCGGCAAAGCCCGAAGCAAACGATCACGCGGTTCTCCTTGCGACTGGTCCACGCGCGTCCTCAGGGCGACGGCTCCAGAGGCGGGAAAGTCCGAAATGACGTCAGGCCGGAAATGCCAGCATTGCGCGGCGGAGAAGACGCCGCAGTGGCGGACTGGGCCTATGGGCCCAAAAACGCTTTGTAATGCTTGTGGGGTCCGGTATAAGTCGGGTCGACTGGTACCGGAATACCGACCTGCTGCGAGCCCGACATTTATGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAACTTCGACGGCAAAATGAGGTAGAACAGTTGGGGAGATGGAAGGGGGGTGAGGAGTACTTGATCCACCGCCACAACGGCGGTGATTTCAGTCGGATGATGTAGAATTTTGCAGTAGAAGGGGCGGGTAGTTTTTAACCCTTCGAATTCGGTAATGATTATGATCATGGTGGGACTGAGTAGATGATGTACGGACTGTTTCCAGGCATCTTTTTTATGATAAATATAGTTCATGGGCCC

Coding sequence (CDS)

ATGGAGGTGCCCGAGTATCTTGTCGGTGGCTACTACGGCACCGAGGGCGGTCAATTTTCGCCGGACAAGCAGAAATCCGCCGCCGAGCAGTTCGCTGTTGATGAATATTTATTGGATTTCTCCAATGAAGATATGGCAATGCAGAGCGGTTGCTTCGATAACGTTGCCGGAAATTGCTGTGATTCTTCGACGGTGACTGCGATTGAGAGCTGCAATTCGTCGGCCTCCAGCGGCGATAACTTGGTGTTGGGAAATTTTGGGTCTGGAGGCTTCTGTGAAGCTCAATTCTCTAACGAACTCTGCATTCCGTGCGACGATTTGGCGGAGCTCGAATGGCTGTCGAATTTCGTTGAAGGATCATTTTCGACGGAGGAGATTGAGAAGGATTTTCAGGCGATTCCATTTCTCTCCGGGGGGATCGGTACGGCGGTGACTCTAGAAGCGCCGTCGTCTTCAGGAGAGACGGCGTTTGGTTACGGAAGTGGAAAAACGACATCGTTTTTTCACGGTGAAGCTCTTACTCTCCCCGGCAAAGCCCGAAGCAAACGATCACGCGGTTCTCCTTGCGACTGGTCCACGCGCGTCCTCAGGGCGACGGCTCCAGAGGCGGGAAAGTCCGAAATGACGTCAGGCCGGAAATGCCAGCATTGCGCGGCGGAGAAGACGCCGCAGTGGCGGACTGGGCCTATGGGCCCAAAAACGCTTTGTAATGCTTGTGGGGTCCGGTATAAGTCGGGTCGACTGGTACCGGAATACCGACCTGCTGCGAGCCCGACATTTATGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAACTTCGACGGCAAAATGAGGTAGAACAGTTGGGGAGATGGAAGGGGGGTGAGGAGTACTTGATCCACCGCCACAACGGCGGTGATTTCAGTCGGATGATGTAG

Protein sequence

MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNGGDFSRMM
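
The protein sequence above should be the translation of the CDS. As a rough check, the sketch below translates the first and last codons of the CDS with the standard genetic code and compares them with the ends of the protein. The use of the standard code is an assumption (the page does not name a translation table), and `translate` is a hypothetical helper written for this illustration.

```python
# Standard genetic code, bases ordered T, C, A, G (assumed to apply to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGGAGGTGCCCGAGTATCTTGTCGGTGGC"))  # start of the protein
print(translate("AGTCGGATGATGTAG"))                  # end of the protein, before the stop
```

The two fragments translate to `MEVPEYLVGG` and `SRMM`, matching the first ten and last four residues of the protein sequence.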
Homology
BLAST of CmoCh14G013290 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 6.6e-48
Identity = 135/345 (39.13%), Positives = 174/345 (50.43%), Query Frame = 0

Query: 31  FAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAI-ESCNSSASSGDNLVLGNFGSG 90
           FAVD+ L+DFSN+D        D       DS+T T I +S N SA+      L +F   
Sbjct: 14  FAVDDLLVDFSNDD--------DEENDVVADSTTTTTITDSSNFSAAD-----LPSFHGD 73

Query: 91  GFCEAQFSNELCIPCDDLA-ELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 150
                 FS +LCIP DDLA ELEWLSN V+ S S E++ K    +  +SG          
Sbjct: 74  VQDGTSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHK----LELISG------FKSR 133

Query: 151 PSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRGSPCDWSTRVL------------ 210
           P    +T        ++  F  + +++P KARSKRSR + C+W++R L            
Sbjct: 134 PDPKSDTGSPENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTG 193

Query: 211 -------------------------------------RATAPEAGKSEMTSGRKCQHCAA 270
                                                  ++PE+G +E    R+C HCA 
Sbjct: 194 ETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAE---ERRCLHCAT 253

Query: 271 EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQN 308
           +KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTF+  KHSNSHRKVMELRRQ 
Sbjct: 254 DKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQK 313

BLAST of CmoCh14G013290 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 5.3e-45
Identity = 133/304 (43.75%), Positives = 168/304 (55.26%), Query Frame = 0

Query: 29  EQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGNFGS 88
           + F VD+ LLDFSN+D  +  G   N   +    ST T  +S NSS+   D       G+
Sbjct: 16  DSFVVDD-LLDFSNDDGEVDDGL--NTLPDSSTLSTGTLTDSSNSSSLFTD-------GT 75

Query: 89  GGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 148
           G        ++L IP DD+AELEWLSNFVE SF+ E+ +K    +   SG       L+ 
Sbjct: 76  G-------FSDLYIPNDDIAELEWLSNFVEESFAGEDQDK----LHLFSG-------LKN 135

Query: 149 PSSSGETAFGYGSGKTT---SFFHGEA--LTLPGKARSKRSRGSPCDWSTRVLR-----A 208
           P ++G T       +      F   +   + +P KARSKRSR +   W++R+L       
Sbjct: 136 PQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDE 195

Query: 209 TAPE-----------AGK-----SEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVR 268
           T P+           AG       E   GR+C HCA EKTPQWRTGPMGPKTLCNACGVR
Sbjct: 196 TNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVR 255

Query: 269 YKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ---LGRWKGGEEYLIHRHN 304
           YKSGRLVPEYRPA+SPTF+  +HSNSHRKVMELRRQ E+     L + +     +  R N
Sbjct: 256 YKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSN 291

BLAST of CmoCh14G013290 vs. ExPASy Swiss-Prot
Match: O49741 (GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 6.7e-40
Identity = 122/307 (39.74%), Positives = 154/307 (50.16%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSS--ASSGDNLVL 85
           S+ +   +D+ LLDFSNED+   S    + A     S       S +     SS D+   
Sbjct: 7   SSPDLLRIDD-LLDFSNEDIFSASSSGGSTAATSSSSFPPPQNPSFHHHHLPSSADH--- 66

Query: 86  GNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTA 145
                       F +++C+P DD A LEWLS FV+ SF+      DF A P   GG  T+
Sbjct: 67  ----------HSFLHDICVPSDDAAHLEWLSQFVDDSFA------DFPANPL--GGTMTS 126

Query: 146 VTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRG----------SPCDWST 205
           V  E                 TSF        PGK RSKRSR            P +   
Sbjct: 127 VKTE-----------------TSF--------PGKPRSKRSRAPAPFAGTWSPMPLESEH 186

Query: 206 RVLRATAP------------------EAGKSEMTSG---RKCQHCAAEKTPQWRTGPMGP 265
           + L + A                   ++  SE T G   R+C HCA+EKTPQWRTGP+GP
Sbjct: 187 QQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGP 246

Query: 266 KTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEE 300
           KTLCNACGVR+KSGRLVPEYRPA+SPTF+ T+HSNSHRKVMELRRQ EV      +  ++
Sbjct: 247 KTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEV-----MRQPQQ 261

BLAST of CmoCh14G013290 vs. ExPASy Swiss-Prot
Match: O49743 (GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 8.7e-40
Identity = 113/277 (40.79%), Positives = 147/277 (53.07%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGN 85
           S+ +   +D+ LLDFSN+++                SS+ T   S  SSA+S +N    +
Sbjct: 7   SSPDLLRIDD-LLDFSNDEIF---------------SSSSTVTSSAASSAASSENPF--S 66

Query: 86  FGSGGFCE----AQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIG 145
           F S  +        F+++LC+P DD A LEWLS FV+ SFS      DF A P       
Sbjct: 67  FPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS------DFPANPL------ 126

Query: 146 TAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRG---------SPCDWS 205
             +T+                          ++  GK RS+RSR          +P   S
Sbjct: 127 -TMTVR-----------------------PEISFTGKPRSRRSRAPAPSVAGTWAPMSES 186

Query: 206 TRVLRATAPEAGK----SEMTS--GRKCQHCAAEKTPQWRTGPMGPKTLCNACGVRYKSG 265
                   P+  K      +T+   R+C HCA+EKTPQWRTGP+GPKTLCNACGVRYKSG
Sbjct: 187 ELCHSVAKPKPKKVYNAESVTADGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSG 229

Query: 266 RLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ 284
           RLVPEYRPA+SPTF+ T+HSNSHRKVMELRRQ E ++
Sbjct: 247 RLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEQQE 229

BLAST of CmoCh14G013290 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 3.0e-32
Identity = 111/294 (37.76%), Positives = 144/294 (48.98%), Query Frame = 0

Query: 27  AAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGNF 86
           + + F+VD+ LLD SN+D+                 S+    +  ++   S D     +F
Sbjct: 37  SVDDFSVDD-LLDLSNDDVFADEETDLKAQHEMVRVSSEEPNDDGDALRRSSDFSGCDDF 96

Query: 87  GSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIP-----FLSGGIG 146
           GS        ++EL +P DDLA LEWLS+FVE SF TE    +    P     +L+G   
Sbjct: 97  GS------LPTSELSLPADDLANLEWLSHFVEDSF-TEYSGPNLTGTPTEKPAWLTGDRK 156

Query: 147 ---TAVTLE-------------------------------APSSSGETAFGYGSGKTTSF 206
              TAVT E                                PSSSG T+    SG ++ +
Sbjct: 157 HPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGSTS-SSSSGPSSPW 216

Query: 207 FHGEALTLPGKARSKRSRGSPCDWSTRVLRATAPEAGK-SEMTSGRKCQHCAAEKTPQWR 266
           F G  L  P       S   P     +   A +  +G+  ++   RKC HC  +KTPQWR
Sbjct: 217 FSGAELLEP----VVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWR 276

Query: 267 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNE 281
            GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E+RR+ E
Sbjct: 277 AGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of CmoCh14G013290 vs. ExPASy TrEMBL
Match: A0A6J1GV45 (GATA transcription factor 4-like OS=Cucurbita moschata OX=3662 GN=LOC111457821 PE=4 SV=1)

HSP 1 Score: 629.0 bits (1621), Expect = 1.0e-176
Identity = 307/307 (100.00%), Positives = 307/307 (100.00%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 98

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 308
           GDFSRMM
Sbjct: 339 GDFSRMM 345

BLAST of CmoCh14G013290 vs. ExPASy TrEMBL
Match: A0A6J1IP77 (GATA transcription factor 9-like OS=Cucurbita maxima OX=3661 GN=LOC111479297 PE=4 SV=1)

HSP 1 Score: 596.7 bits (1537), Expect = 5.6e-167
Identity = 293/307 (95.44%), Positives = 295/307 (96.09%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYY T+G QFSPDKQ SAAE FAVDEYLL+FSNEDMAM SGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYDTDGAQFSPDKQNSAAENFAVDEYLLNFSNEDMAMHSGCFDNVAGNCC 98

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEIEKDFQAIPFLSGGI TAVTLEA SSSG TA GY S KTTSFFHGEALTLPGKAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGISTAVTLEAQSSSGATALGYRSEKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 308
           GD SRMM
Sbjct: 339 GDLSRMM 345

BLAST of CmoCh14G013290 vs. ExPASy TrEMBL
Match: A0A5A7U6E0 (GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G00650 PE=3 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 1.1e-133
Identity = 245/323 (75.85%), Positives = 266/323 (82.35%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           ME+PEYLVGGYYGT   QFSP  +KS +E F VDEYLLDFSNED+AM  G FDNVAGNC 
Sbjct: 1   MELPEYLVGGYYGTGASQFSPHNKKSTSEHFPVDEYLLDFSNEDVAMHGGFFDNVAGNCS 60

Query: 61  D-SSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEG 120
           D SST+TAI+SCNSS S GDN +LG F SG FCEAQFS+ELCIPCDDLAELEWLSNFVE 
Sbjct: 61  DNSSTLTAIDSCNSSVSGGDNQLLGKFESGSFCEAQFSSELCIPCDDLAELEWLSNFVEE 120

Query: 121 SFSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKA 180
           SFSTEEI+KDF AIPF+SGGI +A T E  SSSG TAFGYG+ KTT+FFH EALTLPGKA
Sbjct: 121 SFSTEEIDKDFPAIPFISGGISSAATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKA 180

Query: 181 RSKRSRGSPCDWSTRVLRATAPE-----AGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKT 240
           RSKRSR +PCDWSTR+L+ATAPE      GK E TSGRKC HCAAEKTPQWRTGPMGPKT
Sbjct: 181 RSKRSRATPCDWSTRLLQATAPEKTEGAMGKPETTSGRKCLHCAAEKTPQWRTGPMGPKT 240

Query: 241 LCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ----------L 300
           LCNACGVRYKSGRLVPEYRPA+SPTF+STKHSNSHRKVMELRRQ E++            
Sbjct: 241 LCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF 300

Query: 301 GRWKGGEEYLIHRHNGGDFSRMM 308
            R  G +EYLIHRHNGGDFS MM
Sbjct: 301 SRSNGCDEYLIHRHNGGDFSHMM 323

BLAST of CmoCh14G013290 vs. ExPASy TrEMBL
Match: A0A1S3BI99 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103489961 PE=3 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 1.1e-133
Identity = 245/323 (75.85%), Positives = 266/323 (82.35%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           ME+PEYLVGGYYGT   QFSP  +KS +E F VDEYLLDFSNED+AM  G FDNVAGNC 
Sbjct: 1   MELPEYLVGGYYGTGASQFSPHNKKSTSEHFPVDEYLLDFSNEDVAMHGGFFDNVAGNCS 60

Query: 61  D-SSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEG 120
           D SST+TAI+SCNSS S GDN +LG F SG FCEAQFS+ELCIPCDDLAELEWLSNFVE 
Sbjct: 61  DNSSTLTAIDSCNSSVSGGDNQLLGKFESGSFCEAQFSSELCIPCDDLAELEWLSNFVEE 120

Query: 121 SFSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKA 180
           SFSTEEI+KDF AIPF+SGGI +A T E  SSSG TAFGYG+ KTT+FFH EALTLPGKA
Sbjct: 121 SFSTEEIDKDFPAIPFISGGISSAATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKA 180

Query: 181 RSKRSRGSPCDWSTRVLRATAPE-----AGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKT 240
           RSKRSR +PCDWSTR+L+ATAPE      GK E TSGRKC HCAAEKTPQWRTGPMGPKT
Sbjct: 181 RSKRSRATPCDWSTRLLQATAPEKTEGAMGKPETTSGRKCLHCAAEKTPQWRTGPMGPKT 240

Query: 241 LCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ----------L 300
           LCNACGVRYKSGRLVPEYRPA+SPTF+STKHSNSHRKVMELRRQ E++            
Sbjct: 241 LCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF 300

Query: 301 GRWKGGEEYLIHRHNGGDFSRMM 308
            R  G +EYLIHRHNGGDFS MM
Sbjct: 301 SRSNGCDEYLIHRHNGGDFSHMM 323

BLAST of CmoCh14G013290 vs. ExPASy TrEMBL
Match: A0A0A0L802 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_3G457670 PE=3 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 1.1e-127
Identity = 235/311 (75.56%), Positives = 255/311 (81.99%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           ME+P YLVGGYYGT   QFSPD +KS AE F +DEYLLDFSNED+AM SG FDNVAGNC 
Sbjct: 1   MELPGYLVGGYYGTGAPQFSPDNKKSTAEHFPLDEYLLDFSNEDVAMHSGFFDNVAGNCS 60

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSST+TAI+SCNSS S GDN +L  F SG FCEAQFS+ELCIPCDDLAELEWLSNFVE S
Sbjct: 61  DSSTLTAIDSCNSSVSGGDNQLLAKFESGSFCEAQFSSELCIPCDDLAELEWLSNFVEES 120

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEI+KDF AIPFLSGGI +A T E  SSSG TAFGYG+ KTT+FFH EALTLPGKAR
Sbjct: 121 FSTEEIDKDFPAIPFLSGGISSAATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKAR 180

Query: 181 SKRSRGSPCDWSTRVLRATAPE-----AGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTL 240
           SKRSR +PCDWSTR+L+ATAPE       K E TSGRKC HCAAEKTPQWRTGPMGPKTL
Sbjct: 181 SKRSRATPCDWSTRLLQATAPEKTEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTL 240

Query: 241 CNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ----------LG 297
           CNACGVRYKSGRLVPEYRPA+SPTF+STKHSNSHRKVMELRRQ E++             
Sbjct: 241 CNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS 300

BLAST of CmoCh14G013290 vs. NCBI nr
Match: XP_022955987.1 (GATA transcription factor 4-like [Cucurbita moschata])

HSP 1 Score: 629.0 bits (1621), Expect = 2.1e-176
Identity = 307/307 (100.00%), Positives = 307/307 (100.00%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 98

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 308
           GDFSRMM
Sbjct: 339 GDFSRMM 345

BLAST of CmoCh14G013290 vs. NCBI nr
Match: KAG7018317.1 (GATA transcription factor 9 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 626.3 bits (1614), Expect = 1.4e-175
Identity = 305/307 (99.35%), Positives = 307/307 (100.00%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC
Sbjct: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPC+DLAELEWLSNFVEG+
Sbjct: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCEDLAELEWLSNFVEGT 120

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR
Sbjct: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG
Sbjct: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300

Query: 301 GDFSRMM 308
           GDFSRMM
Sbjct: 301 GDFSRMM 307

BLAST of CmoCh14G013290 vs. NCBI nr
Match: KAG6581881.1 (GATA transcription factor 9, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 623.2 bits (1606), Expect = 1.2e-174
Identity = 304/307 (99.02%), Positives = 306/307 (99.67%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC
Sbjct: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSSTVTAIESCNSSASSGDNLVLG+FGSGGFCEAQFSNELCIPC+DLAELEWLSNFVEGS
Sbjct: 61  DSSTVTAIESCNSSASSGDNLVLGDFGSGGFCEAQFSNELCIPCEDLAELEWLSNFVEGS 120

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FS EEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR
Sbjct: 121 FSMEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG
Sbjct: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300

Query: 301 GDFSRMM 308
           GDFSRMM
Sbjct: 301 GDFSRMM 307

BLAST of CmoCh14G013290 vs. NCBI nr
Match: XP_023528501.1 (GATA transcription factor 4-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 605.9 bits (1561), Expect = 1.9e-169
Identity = 297/307 (96.74%), Positives = 301/307 (98.05%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPD QKSAAEQFAVDEYLLDFSNED+AM SGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 98

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLGNFGSGG+CEAQFS+ELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLP KAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 338

Query: 301 GDFSRMM 308
           GDFSRMM
Sbjct: 339 GDFSRMM 345

BLAST of CmoCh14G013290 vs. NCBI nr
Match: XP_022979622.1 (GATA transcription factor 9-like [Cucurbita maxima])

HSP 1 Score: 596.7 bits (1537), Expect = 1.2e-166
Identity = 293/307 (95.44%), Positives = 295/307 (96.09%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60
           MEVPEYLVGGYY T+G QFSPDKQ SAAE FAVDEYLL+FSNEDMAM SGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYDTDGAQFSPDKQNSAAENFAVDEYLLNFSNEDMAMHSGCFDNVAGNCC 98

Query: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 120
           DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180
           FSTEEIEKDFQAIPFLSGGI TAVTLEA SSSG TA GY S KTTSFFHGEALTLPGKAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGISTAVTLEAQSSSGATALGYRSEKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300
           VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 308
           GD SRMM
Sbjct: 339 GDLSRMM 345

BLAST of CmoCh14G013290 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12 )

HSP 1 Score: 192.6 bits (488), Expect = 4.7e-49
Identity = 135/345 (39.13%), Positives = 174/345 (50.43%), Query Frame = 0

Query: 31  FAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAI-ESCNSSASSGDNLVLGNFGSG 90
           FAVD+ L+DFSN+D        D       DS+T T I +S N SA+      L +F   
Sbjct: 14  FAVDDLLVDFSNDD--------DEENDVVADSTTTTTITDSSNFSAAD-----LPSFHGD 73

Query: 91  GFCEAQFSNELCIPCDDLA-ELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 150
                 FS +LCIP DDLA ELEWLSN V+ S S E++ K    +  +SG          
Sbjct: 74  VQDGTSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHK----LELISG------FKSR 133

Query: 151 PSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRGSPCDWSTRVL------------ 210
           P    +T        ++  F  + +++P KARSKRSR + C+W++R L            
Sbjct: 134 PDPKSDTGSPENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTG 193

Query: 211 -------------------------------------RATAPEAGKSEMTSGRKCQHCAA 270
                                                  ++PE+G +E    R+C HCA 
Sbjct: 194 ETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAE---ERRCLHCAT 253

Query: 271 EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQN 308
           +KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTF+  KHSNSHRKVMELRRQ 
Sbjct: 254 DKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQK 313

BLAST of CmoCh14G013290 vs. TAIR 10
Match: AT4G32890.1 (GATA transcription factor 9 )

HSP 1 Score: 183.0 bits (463), Expect = 3.7e-46
Identity = 133/304 (43.75%), Positives = 168/304 (55.26%), Query Frame = 0

Query: 29  EQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGNFGS 88
           + F VD+ LLDFSN+D  +  G   N   +    ST T  +S NSS+   D       G+
Sbjct: 16  DSFVVDD-LLDFSNDDGEVDDGL--NTLPDSSTLSTGTLTDSSNSSSLFTD-------GT 75

Query: 89  GGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 148
           G        ++L IP DD+AELEWLSNFVE SF+ E+ +K    +   SG       L+ 
Sbjct: 76  G-------FSDLYIPNDDIAELEWLSNFVEESFAGEDQDK----LHLFSG-------LKN 135

Query: 149 PSSSGETAFGYGSGKTT---SFFHGEA--LTLPGKARSKRSRGSPCDWSTRVLR-----A 208
           P ++G T       +      F   +   + +P KARSKRSR +   W++R+L       
Sbjct: 136 PQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDE 195

Query: 209 TAPE-----------AGK-----SEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVR 268
           T P+           AG       E   GR+C HCA EKTPQWRTGPMGPKTLCNACGVR
Sbjct: 196 TNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVR 255

Query: 269 YKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ---LGRWKGGEEYLIHRHN 304
           YKSGRLVPEYRPA+SPTF+  +HSNSHRKVMELRRQ E+     L + +     +  R N
Sbjct: 256 YKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSN 291

BLAST of CmoCh14G013290 vs. TAIR 10
Match: AT2G45050.1 (GATA transcription factor 2 )

HSP 1 Score: 166.0 bits (419), Expect = 4.7e-41
Identity = 122/307 (39.74%), Positives = 154/307 (50.16%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSS--ASSGDNLVL 85
           S+ +   +D+ LLDFSNED+   S    + A     S       S +     SS D+   
Sbjct: 7   SSPDLLRIDD-LLDFSNEDIFSASSSGGSTAATSSSSFPPPQNPSFHHHHLPSSADH--- 66

Query: 86  GNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTA 145
                       F +++C+P DD A LEWLS FV+ SF+      DF A P   GG  T+
Sbjct: 67  ----------HSFLHDICVPSDDAAHLEWLSQFVDDSFA------DFPANPL--GGTMTS 126

Query: 146 VTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRG----------SPCDWST 205
           V  E                 TSF        PGK RSKRSR            P +   
Sbjct: 127 VKTE-----------------TSF--------PGKPRSKRSRAPAPFAGTWSPMPLESEH 186

Query: 206 RVLRATAP------------------EAGKSEMTSG---RKCQHCAAEKTPQWRTGPMGP 265
           + L + A                   ++  SE T G   R+C HCA+EKTPQWRTGP+GP
Sbjct: 187 QQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGP 246

Query: 266 KTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEE 300
           KTLCNACGVR+KSGRLVPEYRPA+SPTF+ T+HSNSHRKVMELRRQ EV      +  ++
Sbjct: 247 KTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEV-----MRQPQQ 261

BLAST of CmoCh14G013290 vs. TAIR 10
Match: AT3G60530.1 (GATA transcription factor 4 )

HSP 1 Score: 165.6 bits (418), Expect = 6.2e-41
Identity = 113/277 (40.79%), Positives = 147/277 (53.07%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGN 85
           S+ +   +D+ LLDFSN+++                SS+ T   S  SSA+S +N    +
Sbjct: 7   SSPDLLRIDD-LLDFSNDEIF---------------SSSSTVTSSAASSAASSENPF--S 66

Query: 86  FGSGGFCE----AQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIG 145
           F S  +        F+++LC+P DD A LEWLS FV+ SFS      DF A P       
Sbjct: 67  FPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS------DFPANPL------ 126

Query: 146 TAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKARSKRSRG---------SPCDWS 205
             +T+                          ++  GK RS+RSR          +P   S
Sbjct: 127 -TMTVR-----------------------PEISFTGKPRSRRSRAPAPSVAGTWAPMSES 186

Query: 206 TRVLRATAPEAGK----SEMTS--GRKCQHCAAEKTPQWRTGPMGPKTLCNACGVRYKSG 265
                   P+  K      +T+   R+C HCA+EKTPQWRTGP+GPKTLCNACGVRYKSG
Sbjct: 187 ELCHSVAKPKPKKVYNAESVTADGARRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSG 229

Query: 266 RLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQ 284
           RLVPEYRPA+SPTF+ T+HSNSHRKVMELRRQ E ++
Sbjct: 247 RLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEQQE 229

BLAST of CmoCh14G013290 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5 )

HSP 1 Score: 140.6 bits (353), Expect = 2.1e-33
Identity = 111/294 (37.76%), Positives = 144/294 (48.98%), Query Frame = 0

Query: 27  AAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCCDSSTVTAIESCNSSASSGDNLVLGNF 86
           + + F+VD+ LLD SN+D+                 S+    +  ++   S D     +F
Sbjct: 37  SVDDFSVDD-LLDLSNDDVFADEETDLKAQHEMVRVSSEEPNDDGDALRRSSDFSGCDDF 96

Query: 87  GSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIP-----FLSGGIG 146
           GS        ++EL +P DDLA LEWLS+FVE SF TE    +    P     +L+G   
Sbjct: 97  GS------LPTSELSLPADDLANLEWLSHFVEDSF-TEYSGPNLTGTPTEKPAWLTGDRK 156

Query: 147 ---TAVTLE-------------------------------APSSSGETAFGYGSGKTTSF 206
              TAVT E                                PSSSG T+    SG ++ +
Sbjct: 157 HPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGSTS-SSSSGPSSPW 216

Query: 207 FHGEALTLPGKARSKRSRGSPCDWSTRVLRATAPEAGK-SEMTSGRKCQHCAAEKTPQWR 266
           F G  L  P       S   P     +   A +  +G+  ++   RKC HC  +KTPQWR
Sbjct: 217 FSGAELLEP----VVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWR 276

Query: 267 TGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNE 281
            GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E+RR+ E
Sbjct: 277 AGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

The following BLAST results are available for this feature:

BLAST of CmoCh14G013290 vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
P69781          6.6e-48   39.13         GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1
O82632          5.3e-45   43.75         GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1
O49741          6.7e-40   39.74         GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1
O49743          8.7e-40   40.79         GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1
Q9FH57          3.0e-32   37.76         GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1

BLAST of CmoCh14G013290 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1GV45      1.0e-176  100.00        GATA transcription factor 4-like OS=Cucurbita moschata OX=3662 GN=LOC111457821 P...
A0A6J1IP77      5.6e-167  95.44         GATA transcription factor 9-like OS=Cucurbita maxima OX=3661 GN=LOC111479297 PE=...
A0A5A7U6E0      1.1e-133  75.85         GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo...
A0A1S3BI99      1.1e-133  75.85         GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103489961 PE=3 SV=1
A0A0A0L802      1.1e-127  75.56         GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_3G457670 PE=3 SV=1

BLAST of CmoCh14G013290 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022955987.1  2.1e-176  100.00        GATA transcription factor 4-like [Cucurbita moschata]
KAG7018317.1    1.4e-175  99.35         GATA transcription factor 9 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6581881.1    1.2e-174  99.02         GATA transcription factor 9, partial [Cucurbita argyrosperma subsp. sororia]
XP_023528501.1  1.9e-169  96.74         GATA transcription factor 4-like [Cucurbita pepo subsp. pepo]
XP_022979622.1  1.2e-166  95.44         GATA transcription factor 9-like [Cucurbita maxima]

BLAST of CmoCh14G013290 vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G25830.1     4.7e-49   39.13         GATA transcription factor 12
AT4G32890.1     3.7e-46   43.75         GATA transcription factor 9
AT2G45050.1     4.7e-41   39.74         GATA transcription factor 2
AT3G60530.1     6.2e-41   40.79         GATA transcription factor 4
AT5G66320.1     2.1e-33   37.76         GATA transcription factor 5

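
The identity percentages in these summaries are the ratios reported on each HSP statistics line (identical positions divided by alignment length). A small sketch of recomputing them from such a line is shown here. `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout used on this page rather than a general BLAST parser.

```python
import re

# Matches "Identity = 135/345" and the positives counter (any spelling starting with "Pos").
HSP_STAT = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Return identity/positives percentages, rounded to two decimals."""
    result = {}
    for label, matched, length in HSP_STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result

# Statistics line of the first Swiss-Prot hit (P69781).
line = "Identity = 135/345 (39.13%), Positives = 174/345 (50.43%), Query Frame = 0"
print(hsp_percentages(line))
```

For the P69781 hit this reproduces the 39.13% identity listed in the summary.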
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                    Source       Source Term     Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type             SMART        SM00401         GATA_3                                             coord: 208..258; e-value: 5.9E-17; score: 72.3
IPR000679  Zinc finger, GATA-type             PFAM         PF00320         GATA                                               coord: 214..247; e-value: 2.2E-15; score: 56.0
IPR000679  Zinc finger, GATA-type             PROSITE      PS00344         GATA_ZN_FINGER_1                                   coord: 214..239
IPR000679  Zinc finger, GATA-type             PROSITE      PS50114         GATA_ZN_FINGER_2                                   coord: 208..244; score: 12.37361
IPR000679  Zinc finger, GATA-type             CDD          cd00202         ZnF_GATA                                           coord: 213..260; e-value: 2.50152E-13; score: 61.6198
IPR016679  Transcription factor, GATA, plant  PIRSF        PIRSF016992     Txn_fac_GATA_plant                                 coord: 14..295; e-value: 7.4E-68; score: 227.3
IPR013088  Zinc finger, NHR/GATA-type         GENE3D       3.30.50.10      -                                                  coord: 209..281; e-value: 2.3E-15; score: 58.4
None       No IPR available                   PANTHER      PTHR45658:SF46  GATA TRANSCRIPTION FACTOR 9                        coord: 1..283
None       No IPR available                   PANTHER      PTHR45658       GATA TRANSCRIPTION FACTOR                          coord: 1..283
None       No IPR available                   SUPERFAMILY  57716           Glucocorticoid receptor-like (DNA-binding domain)  coord: 211..272
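
The `coord` values locate each domain hit on the protein sequence. The sketch below extracts the residues covered by the PROSITE PS00344 hit (214..239), assuming the coordinates are 1-based and inclusive, which the page does not state. `domain_slice` is a hypothetical helper, and `PROTEIN` is the protein sequence from this page split into 60-residue chunks.

```python
# Protein sequence of CmoCh14G013290 as listed on this page (307 residues).
PROTEIN = (
    "MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC"
    "DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS"
    "FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR"
    "SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG"
    "VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG"
    "GDFSRMM"
)

def domain_slice(protein, start, end):
    """Residues start..end, ASSUMING 1-based inclusive coordinates."""
    return protein[start - 1:end]

finger = domain_slice(PROTEIN, 214, 239)  # PROSITE PS00344 (GATA_ZN_FINGER_1)
print(finger, len(finger), finger.count("C"))
```

Under that assumption the slice is 26 residues long, begins and ends with cysteine, and contains four cysteines in total.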

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G013290.1  CmoCh14G013290.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030154      cell differentiation
biological_process  GO:0045893      positive regulation of transcription, DNA-templated
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003677      DNA binding