Cp4.1LG03g14100 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g14100
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: GATA transcription factor
Location: Cp4.1LG03: 9070556 .. 9071817 (-)
RNA-Seq Expression: Cp4.1LG03g14100
Synteny: Cp4.1LG03g14100
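
The Location field can be read as a sequence identifier, a start and end coordinate, and a strand. The sketch below parses such a string and computes the span of the feature. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); the function name `parse_location` and the regular expression are illustrative only.

```python
import re


def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG03: 9070556 .. 9071817 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG03 - 1262
```

Under that assumption the gene spans 1262 bp on the minus strand of Cp4.1LG03.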
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATATTTGCGTTTACACCCCGCCTCTGCCTTCTCAAAAACCCCCCTTTCCACTCCTCTCCGGCAACTCAAATTACTCTCTAGTAACTTGCTCTGTATTTATGGAGGTGCCCGAGTATCTTGTCGGTGGCTACTACGGCACCGAGGGCGGTCAATTTTCGCCGGACAATCAGAAATCCGCTGCCGAGCAGTTCGCTGTCGATGAATATTTATTGGATTTCTCCAATGAAGATGTGGCAATGCATAGCGGTTGCTTCGATAATGTTGCCGGAAATTGCTGTGATTCTTCGATGGTGACTGCGATTGAGAGCTGCAATTCGTCGGCCTCCAGCGGCGATAACTTGGTGTTAGGAAATTTTGGGTCCGGAGGCTACTGTGAAGCTCAATTCTCTGACGAACTCTGCATTCCGGTATTTGAAATTGGAGCGTTTTCGTTTGATTGTGAAATTCTGTGAAATTTTTTGGATTTGATTCTGTTTTTTTTTTTTTTACAGTGCGACGATTTGGCGGAACTCGAATGGCTGTCGAATTTCGTTGAAGGATCGTTTTCGACGGAGGAGATTGAGAAGGATTTTCAGGCGATTCCATTTCTCTCCGGGGGGATCGGTACGGCGGTGACTTTAGAAGCGCCGTCGTCTTCAGGAGAGACGGCGTTTGGTTACGGAAGTGGAAAAACGACATCGTTTTTTCACGGCGAAGCTCTTACTCTCCCCTGCAAAGCCCGTAGCAAACGATCACGCGGTTCTCCTTGCGACTGGTCCACGCGCGTCCTCAGGGCGACGGCTCCAGAGGCAGGAAAGTCCGAAATGACGTCAGGCCGGAAATGCCAGCATTGCGCGGCGGAGAAGACGCCGCAATGGCGGACTGGGCCTATGGGCCCAAAAACGCTTTGTAATGCTTGTGGGGTCCGGTATAAGTCGGGTCGACTGGTACCGGAATACCGACCTGCTGCGAGCCCGACATTTTTGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAACTTCGACGGCAAAATGAGGTAGAACAGTTGGGGAGATGGAAGGGGGTCGAGGAGTACTTGATCCACCGCATCAACGGCGGTGATTTCAGTCGGATGATGTAGAATTTTGCAGTAAAAGGGGCGCGTAGTTTTTAACCCTTCCAATTCGGTAATGATTATGATCATGGTGGGACTGAGTAGACGATGTACGGACTGTTTCCAGGCATCTTTTTTTTATGATAAATATACTTCATGGGCCCACATATTTTTTTTTAAATT

mRNA sequence

ATATTTGCGTTTACACCCCGCCTCTGCCTTCTCAAAAACCCCCCTTTCCACTCCTCTCCGGCAACTCAAATTACTCTCTAGTAACTTGCTCTGTATTTATGGAGGTGCCCGAGTATCTTGTCGGTGGCTACTACGGCACCGAGGGCGGTCAATTTTCGCCGGACAATCAGAAATCCGCTGCCGAGCAGTTCGCTGTCGATGAATATTTATTGGATTTCTCCAATGAAGATGTGGCAATGCATAGCGGTTGCTTCGATAATGTTGCCGGAAATTGCTGTGATTCTTCGATGGTGACTGCGATTGAGAGCTGCAATTCGTCGGCCTCCAGCGGCGATAACTTGGTGTTAGGAAATTTTGGGTCCGGAGGCTACTGTGAAGCTCAATTCTCTGACGAACTCTGCATTCCGTGCGACGATTTGGCGGAACTCGAATGGCTGTCGAATTTCGTTGAAGGATCGTTTTCGACGGAGGAGATTGAGAAGGATTTTCAGGCGATTCCATTTCTCTCCGGGGGGATCGGTACGGCGGTGACTTTAGAAGCGCCGTCGTCTTCAGGAGAGACGGCGTTTGGTTACGGAAGTGGAAAAACGACATCGTTTTTTCACGGCGAAGCTCTTACTCTCCCCTGCAAAGCCCGTAGCAAACGATCACGCGGTTCTCCTTGCGACTGGTCCACGCGCGTCCTCAGGGCGACGGCTCCAGAGGCAGGAAAGTCCGAAATGACGTCAGGCCGGAAATGCCAGCATTGCGCGGCGGAGAAGACGCCGCAATGGCGGACTGGGCCTATGGGCCCAAAAACGCTTTGTAATGCTTGTGGGGTCCGGTATAAGTCGGGTCGACTGGTACCGGAATACCGACCTGCTGCGAGCCCGACATTTTTGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAACTTCGACGGCAAAATGAGGTAGAACAGTTGGGGAGATGGAAGGGGGTCGAGGAGTACTTGATCCACCGCATCAACGGCGGTGATTTCAGTCGGATGATGTAGAATTTTGCAGTAAAAGGGGCGCGTAGTTTTTAACCCTTCCAATTCGGTAATGATTATGATCATGGTGGGACTGAGTAGACGATGTACGGACTGTTTCCAGGCATCTTTTTTTTATGATAAATATACTTCATGGGCCCACATATTTTTTTTTAAATT

Coding sequence (CDS)

ATGGAGGTGCCCGAGTATCTTGTCGGTGGCTACTACGGCACCGAGGGCGGTCAATTTTCGCCGGACAATCAGAAATCCGCTGCCGAGCAGTTCGCTGTCGATGAATATTTATTGGATTTCTCCAATGAAGATGTGGCAATGCATAGCGGTTGCTTCGATAATGTTGCCGGAAATTGCTGTGATTCTTCGATGGTGACTGCGATTGAGAGCTGCAATTCGTCGGCCTCCAGCGGCGATAACTTGGTGTTAGGAAATTTTGGGTCCGGAGGCTACTGTGAAGCTCAATTCTCTGACGAACTCTGCATTCCGTGCGACGATTTGGCGGAACTCGAATGGCTGTCGAATTTCGTTGAAGGATCGTTTTCGACGGAGGAGATTGAGAAGGATTTTCAGGCGATTCCATTTCTCTCCGGGGGGATCGGTACGGCGGTGACTTTAGAAGCGCCGTCGTCTTCAGGAGAGACGGCGTTTGGTTACGGAAGTGGAAAAACGACATCGTTTTTTCACGGCGAAGCTCTTACTCTCCCCTGCAAAGCCCGTAGCAAACGATCACGCGGTTCTCCTTGCGACTGGTCCACGCGCGTCCTCAGGGCGACGGCTCCAGAGGCAGGAAAGTCCGAAATGACGTCAGGCCGGAAATGCCAGCATTGCGCGGCGGAGAAGACGCCGCAATGGCGGACTGGGCCTATGGGCCCAAAAACGCTTTGTAATGCTTGTGGGGTCCGGTATAAGTCGGGTCGACTGGTACCGGAATACCGACCTGCTGCGAGCCCGACATTTTTGTCGACGAAGCACTCGAATTCTCACCGGAAGGTGATGGAACTTCGACGGCAAAATGAGGTAGAACAGTTGGGGAGATGGAAGGGGGTCGAGGAGTACTTGATCCACCGCATCAACGGCGGTGATTTCAGTCGGATGATGTAG

Protein sequence

MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRINGGDFSRMM
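
The protein sequence is the translation of the CDS above. As a minimal sketch, assuming the standard genetic code, the following translates the first ten codons and the last four codons of the CDS; the names `CODON_TABLE` and `translate` are illustrative and not part of any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS shown above.
print(translate("ATGGAGGTGCCCGAGTATCTTGTCGGTGGC"))  # MEVPEYLVGG
# Last four codons of the CDS, ending in the TAG stop codon.
print(translate("CGGATGATGTAG"))  # RMM*
```

Both fragments agree with the start (MEVPEYLVGG) and end (RMM) of the protein sequence listed here.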
Homology
BLAST of Cp4.1LG03g14100 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 5.6e-47
Identity = 135/345 (39.13%), Positives = 173/345 (50.14%), Query Frame = 0

Query: 31  FAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAI-ESCNSSASSGDNLVLGNFGSG 90
           FAVD+ L+DFSN+D        D       DS+  T I +S N SA+      L +F   
Sbjct: 14  FAVDDLLVDFSNDD--------DEENDVVADSTTTTTITDSSNFSAAD-----LPSFHGD 73

Query: 91  GYCEAQFSDELCIPCDDLA-ELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 150
                 FS +LCIP DDLA ELEWLSN V+ S S E++ K    +  +SG          
Sbjct: 74  VQDGTSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHK----LELISG------FKSR 133

Query: 151 PSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRGSPCDWSTRVL------------ 210
           P    +T        ++  F  + +++P KARSKRSR + C+W++R L            
Sbjct: 134 PDPKSDTGSPENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTG 193

Query: 211 -------------------------------------RATAPEAGKSEMTSGRKCQHCAA 270
                                                  ++PE+G +E    R+C HCA 
Sbjct: 194 ETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAE---ERRCLHCAT 253

Query: 271 EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQN 308
           +KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTF+  KHSNSHRKVMELRRQ 
Sbjct: 254 DKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQK 313
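
The percentages in the HSP header are the ratio of matching alignment columns to the alignment length, which includes gap columns. A small sketch, using the counts reported for this hit; the helper name `percent` is hypothetical.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts reported for the hit against P69781 (GATA12).
print(percent(135, 345))  # 39.13  (identical residues)
print(percent(173, 345))  # 50.14  (identical or similar residues)
```

Because the denominator is the alignment length rather than the query length, a hit with long gaps (as here) can show a low percentage even when the conserved zinc finger region matches almost exactly.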

BLAST of Cp4.1LG03g14100 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 1.4e-45
Identity = 138/304 (45.39%), Positives = 169/304 (55.59%), Query Frame = 0

Query: 29  EQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGNFGS 88
           + F VD+ LLDFSN+D  +  G   N   +    S  T  +S NSS+   D       G+
Sbjct: 16  DSFVVDD-LLDFSNDDGEVDDGL--NTLPDSSTLSTGTLTDSSNSSSLFTD-------GT 75

Query: 89  GGYCEAQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 148
           G      FSD L IP DD+AELEWLSNFVE SF+ E+ +K    +   SG       L+ 
Sbjct: 76  G------FSD-LYIPNDDIAELEWLSNFVEESFAGEDQDK----LHLFSG-------LKN 135

Query: 149 PSSSGETAFGYGSGKTT---SFFHGEA--LTLPCKARSKRSRGSPCDWSTRVLR-----A 208
           P ++G T       +      F   +   + +P KARSKRSR +   W++R+L       
Sbjct: 136 PQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDE 195

Query: 209 TAPE-----------AGK-----SEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVR 268
           T P+           AG       E   GR+C HCA EKTPQWRTGPMGPKTLCNACGVR
Sbjct: 196 TNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVR 255

Query: 269 YKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEV--EQLGRWKGVEEYLIH-RIN 304
           YKSGRLVPEYRPA+SPTF+  +HSNSHRKVMELRRQ E+  E L      E  L+  R N
Sbjct: 256 YKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSN 291

BLAST of Cp4.1LG03g14100 vs. ExPASy Swiss-Prot
Match: O49743 (GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 4.3e-39
Identity = 111/268 (41.42%), Positives = 140/268 (52.24%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGN 85
           S+ +   +D+ LLDFSN+++                SS  T   S  SSA+S +N    +
Sbjct: 7   SSPDLLRIDD-LLDFSNDEIF---------------SSSSTVTSSAASSAASSENPF--S 66

Query: 86  FGSGGYCE----AQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLS---- 145
           F S  Y        F+ +LC+P DD A LEWLS FV+ SFS      DF A P       
Sbjct: 67  FPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS------DFPANPLTMTVRP 126

Query: 146 --GGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRGSPCDWSTRV 205
                G   +  + + +   A  +     +   H  A   P K  +  S           
Sbjct: 127 EISFTGKPRSRRSRAPAPSVAGTWAPMSESELCHSVAKPKPKKVYNAES----------- 186

Query: 206 LRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA 265
              TA  A        R+C HCA+EKTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA
Sbjct: 187 --VTADGA--------RRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPA 229

Query: 266 ASPTFLSTKHSNSHRKVMELRRQNEVEQ 284
           +SPTF+ T+HSNSHRKVMELRRQ E ++
Sbjct: 247 SSPTFVLTQHSNSHRKVMELRRQKEQQE 229

BLAST of Cp4.1LG03g14100 vs. ExPASy Swiss-Prot
Match: O49741 (GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 5.6e-39
Identity = 119/289 (41.18%), Positives = 145/289 (50.17%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSS--ASSGDNLVL 85
           S+ +   +D+ LLDFSNED+   S    + A     S       S +     SS D+   
Sbjct: 7   SSPDLLRIDD-LLDFSNEDIFSASSSGGSTAATSSSSFPPPQNPSFHHHHLPSSADH--- 66

Query: 86  GNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTA 145
                       F  ++C+P DD A LEWLS FV+ SF+      DF A P   GG  T+
Sbjct: 67  ----------HSFLHDICVPSDDAAHLEWLSQFVDDSFA------DFPANPL--GGTMTS 126

Query: 146 VTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRG----------SPCDWST 205
           V  E                 TSF        P K RSKRSR            P +   
Sbjct: 127 VKTE-----------------TSF--------PGKPRSKRSRAPAPFAGTWSPMPLESEH 186

Query: 206 RVLRATAP------------------EAGKSEMTSG---RKCQHCAAEKTPQWRTGPMGP 265
           + L + A                   ++  SE T G   R+C HCA+EKTPQWRTGP+GP
Sbjct: 187 QQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGP 246

Query: 266 KTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEV 282
           KTLCNACGVR+KSGRLVPEYRPA+SPTF+ T+HSNSHRKVMELRRQ EV
Sbjct: 247 KTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEV 248

BLAST of Cp4.1LG03g14100 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 1.0e-32
Identity = 116/297 (39.06%), Positives = 145/297 (48.82%), Query Frame = 0

Query: 27  AAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGNF 86
           + + F+VD+ LLD SN+DV       D          MV    S       GD L   + 
Sbjct: 37  SVDDFSVDD-LLDLSNDDVFA-----DEETDLKAQHEMVRV--SSEEPNDDGDALRRSSD 96

Query: 87  GSGGYCE---AQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIP-----FLSG 146
            SG  C+   +  + EL +P DDLA LEWLS+FVE SF TE    +    P     +L+G
Sbjct: 97  FSG--CDDFGSLPTSELSLPADDLANLEWLSHFVEDSF-TEYSGPNLTGTPTEKPAWLTG 156

Query: 147 GIG---TAVTLE-------------------------------APSSSGETAFGYGSGKT 206
                 TAVT E                                PSSSG T+    SG +
Sbjct: 157 DRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGSTS-SSSSGPS 216

Query: 207 TSFFHGEALTLPCKARSKRSRGSPCDWSTRVLRATAPEAGK-SEMTSGRKCQHCAAEKTP 266
           + +F G  L  P       S   P     +   A +  +G+  ++   RKC HC  +KTP
Sbjct: 217 SPWFSGAELLEPVVT----SERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTP 276

Query: 267 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNE 281
           QWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E+RR+ E
Sbjct: 277 QWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cp4.1LG03g14100 vs. NCBI nr
Match: XP_023528501.1 (GATA transcription factor 4-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 632 bits (1630), Expect = 3.42e-228
Identity = 307/307 (100.00%), Positives = 307/307 (100.00%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 98

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING
Sbjct: 279 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 338

Query: 301 GDFSRMM 307
           GDFSRMM
Sbjct: 339 GDFSRMM 345

BLAST of Cp4.1LG03g14100 vs. NCBI nr
Match: XP_022955987.1 (GATA transcription factor 4-like [Cucurbita moschata])

HSP 1 Score: 609 bits (1570), Expect = 4.78e-219
Identity = 297/307 (96.74%), Positives = 301/307 (98.05%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPD QKSAAEQFAVDEYLLDFSNED+AM SGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 98

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLGNFGSGG+CEAQFS+ELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLP KAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 307
           GDFSRMM
Sbjct: 339 GDFSRMM 345

BLAST of Cp4.1LG03g14100 vs. NCBI nr
Match: KAG7018317.1 (GATA transcription factor 9 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 606 bits (1563), Expect = 1.36e-218
Identity = 295/307 (96.09%), Positives = 301/307 (98.05%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPD QKSAAEQFAVDEYLLDFSNED+AM SGCFDNVAGNCC
Sbjct: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLGNFGSGG+CEAQFS+ELCIPC+DLAELEWLSNFVEG+
Sbjct: 61  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCEDLAELEWLSNFVEGT 120

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLP KAR
Sbjct: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300

Query: 301 GDFSRMM 307
           GDFSRMM
Sbjct: 301 GDFSRMM 307

BLAST of Cp4.1LG03g14100 vs. NCBI nr
Match: KAG6581881.1 (GATA transcription factor 9, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 603 bits (1555), Expect = 2.25e-217
Identity = 294/307 (95.77%), Positives = 300/307 (97.72%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPD QKSAAEQFAVDEYLLDFSNED+AM SGCFDNVAGNCC
Sbjct: 1   MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 60

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLG+FGSGG+CEAQFS+ELCIPC+DLAELEWLSNFVEGS
Sbjct: 61  DSSTVTAIESCNSSASSGDNLVLGDFGSGGFCEAQFSNELCIPCEDLAELEWLSNFVEGS 120

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FS EEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLP KAR
Sbjct: 121 FSMEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 180

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 241 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 300

Query: 301 GDFSRMM 307
           GDFSRMM
Sbjct: 301 GDFSRMM 307

BLAST of Cp4.1LG03g14100 vs. NCBI nr
Match: XP_022979622.1 (GATA transcription factor 9-like [Cucurbita maxima])

HSP 1 Score: 582 bits (1499), Expect = 3.15e-208
Identity = 285/307 (92.83%), Positives = 291/307 (94.79%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYY T+G QFSPD Q SAAE FAVDEYLL+FSNED+AMHSGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYDTDGAQFSPDKQNSAAENFAVDEYLLNFSNEDMAMHSGCFDNVAGNCC 98

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLGNFGSGG+CEAQFS+ELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEIEKDFQAIPFLSGGI TAVTLEA SSSG TA GY S KTTSFFHGEALTLP KAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGISTAVTLEAQSSSGATALGYRSEKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 307
           GD SRMM
Sbjct: 339 GDLSRMM 345

BLAST of Cp4.1LG03g14100 vs. ExPASy TrEMBL
Match: A0A6J1GV45 (GATA transcription factor 4-like OS=Cucurbita moschata OX=3662 GN=LOC111457821 PE=4 SV=1)

HSP 1 Score: 609 bits (1570), Expect = 2.31e-219
Identity = 297/307 (96.74%), Positives = 301/307 (98.05%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYYGTEGGQFSPD QKSAAEQFAVDEYLLDFSNED+AM SGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYGTEGGQFSPDKQKSAAEQFAVDEYLLDFSNEDMAMQSGCFDNVAGNCC 98

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLGNFGSGG+CEAQFS+ELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLP KAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 307
           GDFSRMM
Sbjct: 339 GDFSRMM 345

BLAST of Cp4.1LG03g14100 vs. ExPASy TrEMBL
Match: A0A6J1IP77 (GATA transcription factor 9-like OS=Cucurbita maxima OX=3661 GN=LOC111479297 PE=4 SV=1)

HSP 1 Score: 582 bits (1499), Expect = 1.53e-208
Identity = 285/307 (92.83%), Positives = 291/307 (94.79%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           MEVPEYLVGGYY T+G QFSPD Q SAAE FAVDEYLL+FSNED+AMHSGCFDNVAGNCC
Sbjct: 39  MEVPEYLVGGYYDTDGAQFSPDKQNSAAENFAVDEYLLNFSNEDMAMHSGCFDNVAGNCC 98

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS VTAIESCNSSASSGDNLVLGNFGSGG+CEAQFS+ELCIPCDDLAELEWLSNFVEGS
Sbjct: 99  DSSTVTAIESCNSSASSGDNLVLGNFGSGGFCEAQFSNELCIPCDDLAELEWLSNFVEGS 158

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEIEKDFQAIPFLSGGI TAVTLEA SSSG TA GY S KTTSFFHGEALTLP KAR
Sbjct: 159 FSTEEIEKDFQAIPFLSGGISTAVTLEAQSSSGATALGYRSEKTTSFFHGEALTLPGKAR 218

Query: 181 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 240
           SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG
Sbjct: 219 SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG 278

Query: 241 VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING 300
           VRYKSGRLVPEYRPAASPTF+STKHSNSHRKVMELRRQNEVEQLGRWKG EEYLIHR NG
Sbjct: 279 VRYKSGRLVPEYRPAASPTFMSTKHSNSHRKVMELRRQNEVEQLGRWKGGEEYLIHRHNG 338

Query: 301 GDFSRMM 307
           GD SRMM
Sbjct: 339 GDLSRMM 345

BLAST of Cp4.1LG03g14100 vs. ExPASy TrEMBL
Match: A0A5A7U6E0 (GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G00650 PE=3 SV=1)

HSP 1 Score: 485 bits (1248), Expect = 1.16e-170
Identity = 244/323 (75.54%), Positives = 264/323 (81.73%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           ME+PEYLVGGYYGT   QFSP N+KS +E F VDEYLLDFSNEDVAMH G FDNVAGNC 
Sbjct: 1   MELPEYLVGGYYGTGASQFSPHNKKSTSEHFPVDEYLLDFSNEDVAMHGGFFDNVAGNCS 60

Query: 61  D-SSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEG 120
           D SS +TAI+SCNSS S GDN +LG F SG +CEAQFS ELCIPCDDLAELEWLSNFVE 
Sbjct: 61  DNSSTLTAIDSCNSSVSGGDNQLLGKFESGSFCEAQFSSELCIPCDDLAELEWLSNFVEE 120

Query: 121 SFSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKA 180
           SFSTEEI+KDF AIPF+SGGI +A T E  SSSG TAFGYG+ KTT+FFH EALTLP KA
Sbjct: 121 SFSTEEIDKDFPAIPFISGGISSAATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKA 180

Query: 181 RSKRSRGSPCDWSTRVLRATAPEA-----GKSEMTSGRKCQHCAAEKTPQWRTGPMGPKT 240
           RSKRSR +PCDWSTR+L+ATAPE      GK E TSGRKC HCAAEKTPQWRTGPMGPKT
Sbjct: 181 RSKRSRATPCDWSTRLLQATAPEKTEGAMGKPETTSGRKCLHCAAEKTPQWRTGPMGPKT 240

Query: 241 LCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQ----------L 300
           LCNACGVRYKSGRLVPEYRPA+SPTF+STKHSNSHRKVMELRRQ E++            
Sbjct: 241 LCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF 300

Query: 301 GRWKGVEEYLIHRINGGDFSRMM 307
            R  G +EYLIHR NGGDFS MM
Sbjct: 301 SRSNGCDEYLIHRHNGGDFSHMM 323

BLAST of Cp4.1LG03g14100 vs. ExPASy TrEMBL
Match: A0A1S3BI99 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103489961 PE=3 SV=1)

HSP 1 Score: 485 bits (1248), Expect = 1.16e-170
Identity = 244/323 (75.54%), Positives = 264/323 (81.73%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           ME+PEYLVGGYYGT   QFSP N+KS +E F VDEYLLDFSNEDVAMH G FDNVAGNC 
Sbjct: 1   MELPEYLVGGYYGTGASQFSPHNKKSTSEHFPVDEYLLDFSNEDVAMHGGFFDNVAGNCS 60

Query: 61  D-SSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEG 120
           D SS +TAI+SCNSS S GDN +LG F SG +CEAQFS ELCIPCDDLAELEWLSNFVE 
Sbjct: 61  DNSSTLTAIDSCNSSVSGGDNQLLGKFESGSFCEAQFSSELCIPCDDLAELEWLSNFVEE 120

Query: 121 SFSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKA 180
           SFSTEEI+KDF AIPF+SGGI +A T E  SSSG TAFGYG+ KTT+FFH EALTLP KA
Sbjct: 121 SFSTEEIDKDFPAIPFISGGISSAATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKA 180

Query: 181 RSKRSRGSPCDWSTRVLRATAPEA-----GKSEMTSGRKCQHCAAEKTPQWRTGPMGPKT 240
           RSKRSR +PCDWSTR+L+ATAPE      GK E TSGRKC HCAAEKTPQWRTGPMGPKT
Sbjct: 181 RSKRSRATPCDWSTRLLQATAPEKTEGAMGKPETTSGRKCLHCAAEKTPQWRTGPMGPKT 240

Query: 241 LCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQ----------L 300
           LCNACGVRYKSGRLVPEYRPA+SPTF+STKHSNSHRKVMELRRQ E++            
Sbjct: 241 LCNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIF 300

Query: 301 GRWKGVEEYLIHRINGGDFSRMM 307
            R  G +EYLIHR NGGDFS MM
Sbjct: 301 SRSNGCDEYLIHRHNGGDFSHMM 323

BLAST of Cp4.1LG03g14100 vs. ExPASy TrEMBL
Match: A0A0A0L802 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_3G457670 PE=3 SV=1)

HSP 1 Score: 468 bits (1205), Expect = 3.53e-164
Identity = 235/311 (75.56%), Positives = 254/311 (81.67%), Query Frame = 0

Query: 1   MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC 60
           ME+P YLVGGYYGT   QFSPDN+KS AE F +DEYLLDFSNEDVAMHSG FDNVAGNC 
Sbjct: 1   MELPGYLVGGYYGTGAPQFSPDNKKSTAEHFPLDEYLLDFSNEDVAMHSGFFDNVAGNCS 60

Query: 61  DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS 120
           DSS +TAI+SCNSS S GDN +L  F SG +CEAQFS ELCIPCDDLAELEWLSNFVE S
Sbjct: 61  DSSTLTAIDSCNSSVSGGDNQLLAKFESGSFCEAQFSSELCIPCDDLAELEWLSNFVEES 120

Query: 121 FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR 180
           FSTEEI+KDF AIPFLSGGI +A T E  SSSG TAFGYG+ KTT+FFH EALTLP KAR
Sbjct: 121 FSTEEIDKDFPAIPFLSGGISSAATPETSSSSGATAFGYGNAKTTTFFHSEALTLPGKAR 180

Query: 181 SKRSRGSPCDWSTRVLRATAPEA-----GKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTL 240
           SKRSR +PCDWSTR+L+ATAPE       K E TSGRKC HCAAEKTPQWRTGPMGPKTL
Sbjct: 181 SKRSRATPCDWSTRLLQATAPEKTEGTMAKPETTSGRKCLHCAAEKTPQWRTGPMGPKTL 240

Query: 241 CNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQ----------LG 296
           CNACGVRYKSGRLVPEYRPA+SPTF+STKHSNSHRKVMELRRQ E++             
Sbjct: 241 CNACGVRYKSGRLVPEYRPASSPTFVSTKHSNSHRKVMELRRQKEMQHQEQFVSQSSIFS 300

BLAST of Cp4.1LG03g14100 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12 )

HSP 1 Score: 189.5 bits (480), Expect = 4.0e-48
Identity = 135/345 (39.13%), Positives = 173/345 (50.14%), Query Frame = 0

Query: 31  FAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAI-ESCNSSASSGDNLVLGNFGSG 90
           FAVD+ L+DFSN+D        D       DS+  T I +S N SA+      L +F   
Sbjct: 14  FAVDDLLVDFSNDD--------DEENDVVADSTTTTTITDSSNFSAAD-----LPSFHGD 73

Query: 91  GYCEAQFSDELCIPCDDLA-ELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 150
                 FS +LCIP DDLA ELEWLSN V+ S S E++ K    +  +SG          
Sbjct: 74  VQDGTSFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHK----LELISG------FKSR 133

Query: 151 PSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRGSPCDWSTRVL------------ 210
           P    +T        ++  F  + +++P KARSKRSR + C+W++R L            
Sbjct: 134 PDPKSDTGSPENPNSSSPIFTTD-VSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTG 193

Query: 211 -------------------------------------RATAPEAGKSEMTSGRKCQHCAA 270
                                                  ++PE+G +E    R+C HCA 
Sbjct: 194 ETILSSQQHLSPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAE---ERRCLHCAT 253

Query: 271 EKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQN 308
           +KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTF+  KHSNSHRKVMELRRQ 
Sbjct: 254 DKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQK 313

BLAST of Cp4.1LG03g14100 vs. TAIR 10
Match: AT4G32890.1 (GATA transcription factor 9 )

HSP 1 Score: 184.9 bits (468), Expect = 9.8e-47
Identity = 138/304 (45.39%), Positives = 169/304 (55.59%), Query Frame = 0

Query: 29  EQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGNFGS 88
           + F VD+ LLDFSN+D  +  G   N   +    S  T  +S NSS+   D       G+
Sbjct: 16  DSFVVDD-LLDFSNDDGEVDDGL--NTLPDSSTLSTGTLTDSSNSSSLFTD-------GT 75

Query: 89  GGYCEAQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTAVTLEA 148
           G      FSD L IP DD+AELEWLSNFVE SF+ E+ +K    +   SG       L+ 
Sbjct: 76  G------FSD-LYIPNDDIAELEWLSNFVEESFAGEDQDK----LHLFSG-------LKN 135

Query: 149 PSSSGETAFGYGSGKTT---SFFHGEA--LTLPCKARSKRSRGSPCDWSTRVLR-----A 208
           P ++G T       +      F   +   + +P KARSKRSR +   W++R+L       
Sbjct: 136 PQTTGSTLTHLIKPEPELDHQFIDIDESNVAVPAKARSKRSRSAASTWASRLLSLADSDE 195

Query: 209 TAPE-----------AGK-----SEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVR 268
           T P+           AG       E   GR+C HCA EKTPQWRTGPMGPKTLCNACGVR
Sbjct: 196 TNPKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNACGVR 255

Query: 269 YKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEV--EQLGRWKGVEEYLIH-RIN 304
           YKSGRLVPEYRPA+SPTF+  +HSNSHRKVMELRRQ E+  E L      E  L+  R N
Sbjct: 256 YKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEMRDEHLLSQLRCENLLMDIRSN 291

BLAST of Cp4.1LG03g14100 vs. TAIR 10
Match: AT3G60530.1 (GATA transcription factor 4 )

HSP 1 Score: 163.3 bits (412), Expect = 3.1e-40
Identity = 111/268 (41.42%), Positives = 140/268 (52.24%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGN 85
           S+ +   +D+ LLDFSN+++                SS  T   S  SSA+S +N    +
Sbjct: 7   SSPDLLRIDD-LLDFSNDEIF---------------SSSSTVTSSAASSAASSENPF--S 66

Query: 86  FGSGGYCE----AQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLS---- 145
           F S  Y        F+ +LC+P DD A LEWLS FV+ SFS      DF A P       
Sbjct: 67  FPSSTYTSPTLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS------DFPANPLTMTVRP 126

Query: 146 --GGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRGSPCDWSTRV 205
                G   +  + + +   A  +     +   H  A   P K  +  S           
Sbjct: 127 EISFTGKPRSRRSRAPAPSVAGTWAPMSESELCHSVAKPKPKKVYNAES----------- 186

Query: 206 LRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPA 265
              TA  A        R+C HCA+EKTPQWRTGP+GPKTLCNACGVRYKSGRLVPEYRPA
Sbjct: 187 --VTADGA--------RRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPA 229

Query: 266 ASPTFLSTKHSNSHRKVMELRRQNEVEQ 284
           +SPTF+ T+HSNSHRKVMELRRQ E ++
Sbjct: 247 SSPTFVLTQHSNSHRKVMELRRQKEQQE 229

BLAST of Cp4.1LG03g14100 vs. TAIR 10
Match: AT2G45050.1 (GATA transcription factor 2 )

HSP 1 Score: 162.9 bits (411), Expect = 4.0e-40
Identity = 119/289 (41.18%), Positives = 145/289 (50.17%), Query Frame = 0

Query: 26  SAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSS--ASSGDNLVL 85
           S+ +   +D+ LLDFSNED+   S    + A     S       S +     SS D+   
Sbjct: 7   SSPDLLRIDD-LLDFSNEDIFSASSSGGSTAATSSSSFPPPQNPSFHHHHLPSSADH--- 66

Query: 86  GNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIPFLSGGIGTA 145
                       F  ++C+P DD A LEWLS FV+ SF+      DF A P   GG  T+
Sbjct: 67  ----------HSFLHDICVPSDDAAHLEWLSQFVDDSFA------DFPANPL--GGTMTS 126

Query: 146 VTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKARSKRSRG----------SPCDWST 205
           V  E                 TSF        P K RSKRSR            P +   
Sbjct: 127 VKTE-----------------TSF--------PGKPRSKRSRAPAPFAGTWSPMPLESEH 186

Query: 206 RVLRATAP------------------EAGKSEMTSG---RKCQHCAAEKTPQWRTGPMGP 265
           + L + A                   ++  SE T G   R+C HCA+EKTPQWRTGP+GP
Sbjct: 187 QQLHSAAKFKPKKEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGP 246

Query: 266 KTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEV 282
           KTLCNACGVR+KSGRLVPEYRPA+SPTF+ T+HSNSHRKVMELRRQ EV
Sbjct: 247 KTLCNACGVRFKSGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEV 248

BLAST of Cp4.1LG03g14100 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5 )

HSP 1 Score: 142.1 bits (357), Expect = 7.3e-34
Identity = 116/297 (39.06%), Positives = 145/297 (48.82%), Query Frame = 0

Query: 27  AAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCCDSSMVTAIESCNSSASSGDNLVLGNF 86
           + + F+VD+ LLD SN+DV       D          MV    S       GD L   + 
Sbjct: 37  SVDDFSVDD-LLDLSNDDVFA-----DEETDLKAQHEMVRV--SSEEPNDDGDALRRSSD 96

Query: 87  GSGGYCE---AQFSDELCIPCDDLAELEWLSNFVEGSFSTEEIEKDFQAIP-----FLSG 146
            SG  C+   +  + EL +P DDLA LEWLS+FVE SF TE    +    P     +L+G
Sbjct: 97  FSG--CDDFGSLPTSELSLPADDLANLEWLSHFVEDSF-TEYSGPNLTGTPTEKPAWLTG 156

Query: 147 GIG---TAVTLE-------------------------------APSSSGETAFGYGSGKT 206
                 TAVT E                                PSSSG T+    SG +
Sbjct: 157 DRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGSTS-SSSSGPS 216

Query: 207 TSFFHGEALTLPCKARSKRSRGSPCDWSTRVLRATAPEAGK-SEMTSGRKCQHCAAEKTP 266
           + +F G  L  P       S   P     +   A +  +G+  ++   RKC HC  +KTP
Sbjct: 217 SPWFSGAELLEPVVT----SERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTP 276

Query: 267 QWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNE 281
           QWR GPMG KTLCNACGVRYKSGRL+PEYRPA SPTF S  HSN HRKV+E+RR+ E
Sbjct: 277 QWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P69781 | 5.6e-47 | 39.13 | GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1
O82632 | 1.4e-45 | 45.39 | GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1
O49743 | 4.3e-39 | 41.42 | GATA transcription factor 4 OS=Arabidopsis thaliana OX=3702 GN=GATA4 PE=2 SV=1
O49741 | 5.6e-39 | 41.18 | GATA transcription factor 2 OS=Arabidopsis thaliana OX=3702 GN=GATA2 PE=2 SV=1
Q9FH57 | 1.0e-32 | 39.06 | GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023528501.1 | 3.42e-228 | 100.00 | GATA transcription factor 4-like [Cucurbita pepo subsp. pepo]
XP_022955987.1 | 4.78e-219 | 96.74 | GATA transcription factor 4-like [Cucurbita moschata]
KAG7018317.1 | 1.36e-218 | 96.09 | GATA transcription factor 9 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6581881.1 | 2.25e-217 | 95.77 | GATA transcription factor 9, partial [Cucurbita argyrosperma subsp. sororia]
XP_022979622.1 | 3.15e-208 | 92.83 | GATA transcription factor 9-like [Cucurbita maxima]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GV45 | 2.31e-219 | 96.74 | GATA transcription factor 4-like OS=Cucurbita moschata OX=3662 GN=LOC111457821 P...
A0A6J1IP77 | 1.53e-208 | 92.83 | GATA transcription factor 9-like OS=Cucurbita maxima OX=3661 GN=LOC111479297 PE=...
A0A5A7U6E0 | 1.16e-170 | 75.54 | GATA transcription factor OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffo...
A0A1S3BI99 | 1.16e-170 | 75.54 | GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103489961 PE=3 SV=1
A0A0A0L802 | 3.53e-164 | 75.56 | GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_3G457670 PE=3 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G25830.1 | 4.0e-48 | 39.13 | GATA transcription factor 12
AT4G32890.1 | 9.8e-47 | 45.39 | GATA transcription factor 9
AT3G60530.1 | 3.1e-40 | 41.42 | GATA transcription factor 4
AT2G45050.1 | 4.0e-40 | 41.18 | GATA transcription factor 2
AT5G66320.1 | 7.3e-34 | 39.06 | GATA transcription factor 5
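
The same Arabidopsis proteins appear in both the Swiss-Prot and TAIR 10 results with identical bit scores and identities but different E-values. Under the usual Karlin-Altschul relation E = (search space) x 2^(-bit score), the E-value scales with the size of the database searched. The sketch below back-calculates the search space implied by a reported bit score and E-value; it assumes that relation holds for these reports and that the printed values are rounded, so the result is only approximate. The function name is hypothetical.

```python
def implied_search_space(bit_score, e_value):
    """Search space implied by E = space * 2**(-bit_score).

    This is a rough back-calculation from rounded report values,
    not a figure taken from the BLAST output itself.
    """
    return e_value * 2.0 ** bit_score


# GATA12 hit: 189.5 bits in both databases, but different E-values.
swissprot = implied_search_space(189.5, 5.6e-47)
tair = implied_search_space(189.5, 4.0e-48)
print(f"Swiss-Prot / TAIR 10 search-space ratio: {swissprot / tair:.1f}")
```

For this hit the ratio comes out at about 14, which would be consistent with the Swiss-Prot search covering a database roughly an order of magnitude larger than the TAIR 10 protein set.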
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000679 | Zinc finger, GATA-type | SMART | SM00401 | GATA_3 | coord: 208..258; e-value: 5.9E-17; score: 72.3
IPR000679 | Zinc finger, GATA-type | PFAM | PF00320 | GATA | coord: 214..247; e-value: 2.2E-15; score: 56.0
IPR000679 | Zinc finger, GATA-type | PROSITE | PS00344 | GATA_ZN_FINGER_1 | coord: 214..239
IPR000679 | Zinc finger, GATA-type | PROSITE | PS50114 | GATA_ZN_FINGER_2 | coord: 208..244; score: 12.37361
IPR000679 | Zinc finger, GATA-type | CDD | cd00202 | ZnF_GATA | coord: 213..264; e-value: 2.63036E-13; score: 61.6198
IPR016679 | Transcription factor, GATA, plant | PIRSF | PIRSF016992 | Txn_fac_GATA_plant | coord: 14..294; e-value: 1.1E-66; score: 223.5
IPR013088 | Zinc finger, NHR/GATA-type | GENE3D | 3.30.50.10 | (none) | coord: 204..280; e-value: 1.9E-15; score: 58.3
None | No IPR available | PANTHER | PTHR45658:SF46 | GATA TRANSCRIPTION FACTOR 9 | coord: 1..283
None | No IPR available | PANTHER | PTHR45658 | GATA TRANSCRIPTION FACTOR | coord: 1..283
None | No IPR available | SUPERFAMILY | 57716 | Glucocorticoid receptor-like (DNA-binding domain) | coord: 211..272
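
The InterPro coordinates refer to residue positions in the 307-residue protein listed under Sequences. As a minimal sketch, assuming those coordinates are 1-based and inclusive, the following extracts the PROSITE PS00344 match (214..239); the helper name `region` is illustrative.

```python
# Protein sequence of Cp4.1LG03g14100, split into 60-residue chunks.
PROTEIN = (
    "MEVPEYLVGGYYGTEGGQFSPDNQKSAAEQFAVDEYLLDFSNEDVAMHSGCFDNVAGNCC"
    "DSSMVTAIESCNSSASSGDNLVLGNFGSGGYCEAQFSDELCIPCDDLAELEWLSNFVEGS"
    "FSTEEIEKDFQAIPFLSGGIGTAVTLEAPSSSGETAFGYGSGKTTSFFHGEALTLPCKAR"
    "SKRSRGSPCDWSTRVLRATAPEAGKSEMTSGRKCQHCAAEKTPQWRTGPMGPKTLCNACG"
    "VRYKSGRLVPEYRPAASPTFLSTKHSNSHRKVMELRRQNEVEQLGRWKGVEEYLIHRING"
    "GDFSRMM"
)


def region(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


finger = region(PROTEIN, 214, 239)
print(len(PROTEIN))  # 307
print(finger)        # CQHCAAEKTPQWRTGPMGPKTLCNAC
```

The extracted segment shows four cysteines spaced as C-x2-C-x18-C-x2-C, the arrangement expected for a GATA-type zinc finger, and it coincides with the region that is nearly invariant across the BLAST alignments above.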

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG03g14100.1 | Cp4.1LG03g14100.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0030154 | cell differentiation
biological_process | GO:0045893 | positive regulation of transcription, DNA-templated
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0043565 | sequence-specific DNA binding
molecular_function | GO:0008270 | zinc ion binding
molecular_function | GO:0003677 | DNA binding