Cp4.1LG14g01630 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG14g01630
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: GATA transcription factor
Location: Cp4.1LG14: 3491573 .. 3493071 (+)
RNA-Seq Expression: Cp4.1LG14g01630
Synteny: Cp4.1LG14g01630
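
The Location field packs the sequence ID, coordinates, and strand into one string. A minimal sketch of pulling it apart is shown here; the `Location` type and `parse_location` function are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG14: 3491573 .. 3493071 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG14: 3491573 .. 3493071 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1499 bases on the plus strand of Cp4.1LG14.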
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAGGGGCAGTTTTAGCATAGGGGGGCACGGACGCTTTCAGGCCTCATTCAATGCTCCTTACAGAACTCACCATCCATTTTTGTTCCCAAACAAACTCCATTTTCATCCTTCTCTCTCTCTCCTCTTCTCTGCAACCTGAGCTCTCTCTCTCTCTCTTCACCGGCTTATCGGACCATGGAATGTGTTGATGGAGCTCTCGAGATTCTTCAATTGAGCCCACAGCTCTGTTTTCGCGATAATGGCTGTTTTAATCACCAGAATCTTCCCACCTCCGATGATTTCTTTGTCGACCAACTCCTCGATTTCTCCAATGATGATCAATTTGTTCAAGACCAGACCCCCGACGAGGACGAACACGACCACGACGACTCTGTTTCTCTTTCCGGTCGAGAAATTCATCAGAACTCCATTGTTTCCGATCATCCTTCTTTACCCTCCGGCGAACTTACCGTTCCGGTGTTGTTTTTGTTTCCTTTCCCCTGTTTTTTGAACAGAGTTGCGGGGGTTGTTTTTTTAATTTCTGTAATATTGATGAAGGTTTTGCTGTTTTTAACAGGTGGATGATTTAGCAGACCTCGAATGGTTATCTCATTTCGTTGAGGATTCTTTCTCTGGATTCTCGGCTTCCTTCCCCTCCGCCGGAATTTCTTCCTTGGTGAAATCGTCAAAGGACTCCGCCGCCGTAGACAACCAACCGAGCAACGGTGGTTCCATTTCGCCGCCGGAGAACTGTTTCAAAACCCCCATTCCGGTTAAGGCTAGAACCAAACGGACGAGAACTGGCGGTCGAGTTTGGTGCCTCGCCTCACCGTCGTTGACCGAGTCATCCTCCAGTTCCACAACGTCGTCGTCCTCCTCCTCGTCGCCGGCTAGTCCTTGGCTTATACTTCCCGACCGTTTCGAACCGGAAATTCCAAAGAAGAAACCAAGGAGAAAATCGTTATCAGAAAAGCCCAAAACCAACGTCGGAGCTCAGCCTCCACGGCGGTGCAGCCATTGCGGAGTCCAGAAAACCCCCCAATGGAGAACCGGCCCCCTCGGAGCCAAAACTGTCTGCAACGCTTGCGGCGTCCGATTCAAATCGGGTCGACTATTACCCGAATACCGACCCGCCTGTAGTCCAACTTTCTCCAGCGAATTGCACTCCAACCACCACCGGAAAGTCCTCGAGATGCGCCGTAAAAAGGAAATCGCCGCCCCGGCCGAGTTATTAACCTTAGAACAGAAATAGCATACCATAGTTGTAAGAGGAGGAAGAGAAGAGTAAAGTTTAGGTAAGGAACCGGATTTATCGAACCCGGTTCAATTAATCCGAGTTAGTATACATTTTCCCCCTCCTCGGTTCGCCGTAGGTTTGTAGCAATGTACGTACGTAACCGTCTAGAGAAAAAAACAAAAATTATTATGTATTTCTAAATTTGAATTTCTTTTATTATTAAAATTTTAGATTAAATCACATTTAACTGCAAAATGTATTTTTCACAATAAGTTCCGTATT

mRNA sequence

AAAGGGGCAGTTTTAGCATAGGGGGGCACGGACGCTTTCAGGCCTCATTCAATGCTCCTTACAGAACTCACCATCCATTTTTGTTCCCAAACAAACTCCATTTTCATCCTTCTCTCTCTCTCCTCTTCTCTGCAACCTGAGCTCTCTCTCTCTCTCTTCACCGGCTTATCGGACCATGGAATGTGTTGATGGAGCTCTCGAGATTCTTCAATTGAGCCCACAGCTCTGTTTTCGCGATAATGGCTGTTTTAATCACCAGAATCTTCCCACCTCCGATGATTTCTTTGTCGACCAACTCCTCGATTTCTCCAATGATGATCAATTTGTTCAAGACCAGACCCCCGACGAGGACGAACACGACCACGACGACTCTGTTTCTCTTTCCGGTCGAGAAATTCATCAGAACTCCATTGTTTCCGATCATCCTTCTTTACCCTCCGGCGAACTTACCGTTCCGGTGGATGATTTAGCAGACCTCGAATGGTTATCTCATTTCGTTGAGGATTCTTTCTCTGGATTCTCGGCTTCCTTCCCCTCCGCCGGAATTTCTTCCTTGGTGAAATCGTCAAAGGACTCCGCCGCCGTAGACAACCAACCGAGCAACGGTGGTTCCATTTCGCCGCCGGAGAACTGTTTCAAAACCCCCATTCCGGTTAAGGCTAGAACCAAACGGACGAGAACTGGCGGTCGAGTTTGGTGCCTCGCCTCACCGTCGTTGACCGAGTCATCCTCCAGTTCCACAACGTCGTCGTCCTCCTCCTCGTCGCCGGCTAGTCCTTGGCTTATACTTCCCGACCGTTTCGAACCGGAAATTCCAAAGAAGAAACCAAGGAGAAAATCGTTATCAGAAAAGCCCAAAACCAACGTCGGAGCTCAGCCTCCACGGCGGTGCAGCCATTGCGGAGTCCAGAAAACCCCCCAATGGAGAACCGGCCCCCTCGGAGCCAAAACTGTCTGCAACGCTTGCGGCGTCCGATTCAAATCGGGTCGACTATTACCCGAATACCGACCCGCCTGTAGTCCAACTTTCTCCAGCGAATTGCACTCCAACCACCACCGGAAAGTCCTCGAGATGCGCCGTAAAAAGGAAATCGCCGCCCCGGCCGAGTTATTAACCTTAGAACAGAAATAGCATACCATAGTTGTAAGAGGAGGAAGAGAAGAGTAAAGTTTAGGTAAGGAACCGGATTTATCGAACCCGGTTCAATTAATCCGAGTTAGTATACATTTTCCCCCTCCTCGGTTCGCCGTAGGTTTGTAGCAATGTACGTACGTAACCGTCTAGAGAAAAAAACAAAAATTATTATGTATTTCTAAATTTGAATTTCTTTTATTATTAAAATTTTAGATTAAATCACATTTAACTGCAAAATGTATTTTTCACAATAAGTTCCGTATT

Coding sequence (CDS)

ATGGAATGTGTTGATGGAGCTCTCGAGATTCTTCAATTGAGCCCACAGCTCTGTTTTCGCGATAATGGCTGTTTTAATCACCAGAATCTTCCCACCTCCGATGATTTCTTTGTCGACCAACTCCTCGATTTCTCCAATGATGATCAATTTGTTCAAGACCAGACCCCCGACGAGGACGAACACGACCACGACGACTCTGTTTCTCTTTCCGGTCGAGAAATTCATCAGAACTCCATTGTTTCCGATCATCCTTCTTTACCCTCCGGCGAACTTACCGTTCCGGTGGATGATTTAGCAGACCTCGAATGGTTATCTCATTTCGTTGAGGATTCTTTCTCTGGATTCTCGGCTTCCTTCCCCTCCGCCGGAATTTCTTCCTTGGTGAAATCGTCAAAGGACTCCGCCGCCGTAGACAACCAACCGAGCAACGGTGGTTCCATTTCGCCGCCGGAGAACTGTTTCAAAACCCCCATTCCGGTTAAGGCTAGAACCAAACGGACGAGAACTGGCGGTCGAGTTTGGTGCCTCGCCTCACCGTCGTTGACCGAGTCATCCTCCAGTTCCACAACGTCGTCGTCCTCCTCCTCGTCGCCGGCTAGTCCTTGGCTTATACTTCCCGACCGTTTCGAACCGGAAATTCCAAAGAAGAAACCAAGGAGAAAATCGTTATCAGAAAAGCCCAAAACCAACGTCGGAGCTCAGCCTCCACGGCGGTGCAGCCATTGCGGAGTCCAGAAAACCCCCCAATGGAGAACCGGCCCCCTCGGAGCCAAAACTGTCTGCAACGCTTGCGGCGTCCGATTCAAATCGGGTCGACTATTACCCGAATACCGACCCGCCTGTAGTCCAACTTTCTCCAGCGAATTGCACTCCAACCACCACCGGAAAGTCCTCGAGATGCGCCGTAAAAAGGAAATCGCCGCCCCGGCCGAGTTATTAACCTTAGAACAGAAATAG

Protein sequence

MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFPSAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIAAPAELLTLEQK
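
The protein sequence is the translation of the coding sequence under the standard genetic code. The sketch below illustrates this on the first and last codons of the CDS listed above; `translate` is a hypothetical helper written for this example (not part of any library), and it assumes the standard nuclear code with translation stopping at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS above.
print(translate("ATGGAATGTGTTGATGGAGCTCTCGAGATT"))  # MECVDGALEI
print(translate("TTAGAACAGAAATAG"))                 # LEQK
```

The two fragments reproduce the start (MECVDGALEI) and the end (LEQK) of the protein sequence; passing the full CDS string to the same function would be the natural next check.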
Homology
BLAST of Cp4.1LG14g01630 vs. ExPASy Swiss-Prot
Match: Q9FH57 (GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 1.1e-64
Identity = 151/294 (51.36%), Positives = 187/294 (63.61%), Query Frame = 0

Query: 28  QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHD---------HDDSVSLSGREIHQNS 87
           QN  + DDF VD LLD SNDD F  ++T  + +H+         +DD  +L  R     S
Sbjct: 33  QNGFSVDDFSVDDLLDLSNDDVFADEETDLKAQHEMVRVSSEEPNDDGDAL--RRSSDFS 92

Query: 88  IVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFPSAGISSLVKSSKDSAAVD 147
              D  SLP+ EL++P DDLA+LEWLSHFVEDSF+ +S      G +     ++  A + 
Sbjct: 93  GCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYS------GPNLTGTPTEKPAWLT 152

Query: 148 NQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSP 207
               +  +    E CFK+P+P KAR+KR R G +VW L S S +  SSS +T SSSSS P
Sbjct: 153 GDRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGST-SSSSSGP 212

Query: 208 ASPWLILPDRFEPEI-------PKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWR 267
           +SPW    +  EP +       PKK  +R + S         QP R+CSHCGVQKTPQWR
Sbjct: 213 SSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWR 272

Query: 268 TGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKE 306
            GP+GAKT+CNACGVR+KSGRLLPEYRPACSPTFSSELHSNHHRKV+EMRRKKE
Sbjct: 273 AGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cp4.1LG14g01630 vs. ExPASy Swiss-Prot
Match: Q9SD38 (GATA transcription factor 6 OS=Arabidopsis thaliana OX=3702 GN=GATA6 PE=2 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 2.6e-55
Identity = 146/294 (49.66%), Positives = 180/294 (61.22%), Query Frame = 0

Query: 34  DDFFVDQLLDFSNDDQFVQDQTPDEDEHDHDDSVSLSGRE-IHQNSIVSDHPSLPSGELT 93
           DDF VD LLDFS +++       DE E        +S    +H+++  S      SG L+
Sbjct: 26  DDFSVDDLLDFSKEEEDDDVLVEDEAELKVQRKRGVSDENTLHRSNDFSTADFHTSG-LS 85

Query: 94  VPVDDLADLEWLSHFVED-SFSGFSAS-----FPSAGISSLVKSSKDSAAVDNQPSNGGS 153
           VP+DD+A+LEWLS+FV+D SF+ +SA      + +     LV+  K+             
Sbjct: 86  VPMDDIAELEWLSNFVDDSSFTPYSAPTNKPVWLTGNRRHLVQPVKE------------- 145

Query: 154 ISPPENCFKTPIP-VKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSPASP-WLI 213
               E CFK+  P VK R KR RTG RVW   S SLT+SSSSSTTSSSSS  P+SP WL 
Sbjct: 146 ----ETCFKSQHPAVKTRPKRARTGVRVWSHGSQSLTDSSSSSTTSSSSSPRPSSPLWLA 205

Query: 214 LPDRFEPEIPKKKPRRKSLSEKPKTNVGAQ-PPRRCSHCGVQKTPQWRTGPLGAKTVCNA 273
                +  + K + ++K      +T    Q   R+C HCGVQKTPQWR GPLGAKT+CNA
Sbjct: 206 SGQFLDEPMTKTQKKKKVWKNAGQTQTQTQTQTRQCGHCGVQKTPQWRAGPLGAKTLCNA 265

Query: 274 CGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIAAPAELLTLEQ 318
           CGVR+KSGRLLPEYRPACSPTFSSELHSNHH KV+EMRRKKE +  AE   L Q
Sbjct: 266 CGVRYKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMRRKKETSDGAEETGLNQ 301

BLAST of Cp4.1LG14g01630 vs. ExPASy Swiss-Prot
Match: O65515 (GATA transcription factor 7 OS=Arabidopsis thaliana OX=3702 GN=GATA7 PE=2 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 7.4e-42
Identity = 122/282 (43.26%), Positives = 152/282 (53.90%), Query Frame = 0

Query: 35  DFFVDQLLDFSNDDQFVQDQTPD--EDEHDHDDSVSLSGREIHQNSIVSDHPSLPSGELT 94
           DF VD LLD SN D  ++  +    EDE + +   S S     Q++ +S    L S    
Sbjct: 10  DFSVDDLLDLSNADTSLESSSSQRKEDEQEREKFKSFS----DQSTRLSPPEDLLSFPGD 69

Query: 95  VPVDDLADLEWLSHFVEDSFSG--FSASFPSAGISSLVKSSKDSAAVDNQPSNGGSISPP 154
            PV DL DLEWLS+FVEDSFS    S+ FP   ++                    S+   
Sbjct: 70  APVGDLEDLEWLSNFVEDSFSESYISSDFPVNPVA--------------------SVEVR 129

Query: 155 ENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSPASPWLILPDRFE 214
             C    +PVK R+KR RT GR+W + SPS   S++ +                      
Sbjct: 130 RQC----VPVKPRSKRRRTNGRIWSMESPSPLLSTAVARR-------------------- 189

Query: 215 PEIPKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWRTGPLGAKTVCNACGVRFKS 274
               KK+ R+K  +         Q  R CSHCGVQKTPQWR GPLGAKT+CNACGVRFKS
Sbjct: 190 ----KKRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLCNACGVRFKS 238

Query: 275 GRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIAAPAEL 313
           GRLLPEYRPACSPTF++E+HSN HRKVLE+R  K +A PA +
Sbjct: 250 GRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK-VADPARV 238

BLAST of Cp4.1LG14g01630 vs. ExPASy Swiss-Prot
Match: P69781 (GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.9e-37
Identity = 122/292 (41.78%), Positives = 156/292 (53.42%), Query Frame = 0

Query: 27  HQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHDH-DDSVSLSGREIHQ-NSIVSDHP 86
           H+   TSD    D L+DFSNDD    D   D        DS + S  ++   +  V D  
Sbjct: 6   HEFFHTSDFAVDDLLVDFSNDDDEENDVVADSTTTTTITDSSNFSAADLPSFHGDVQDGT 65

Query: 87  SLPSGELTVPVDDLAD-LEWLSHFVEDSFSGFSASFPSAGISSLVKSSKDSAAVDNQPSN 146
           S  SG+L +P DDLAD LEWLS+ V++S S          + S  KS  D  +    P N
Sbjct: 66  SF-SGDLCIPSDDLADELEWLSNIVDESLS--PEDVHKLELISGFKSRPDPKSDTGSPEN 125

Query: 147 GGSISPPENCFKT--PIPVKARTKRTRTGGRVWC---LASPSLTESSSSSTTSSSSS--- 206
             S SP    F T   +P KAR+KR+R     W    L   +  +S  +  T  SS    
Sbjct: 126 PNSSSP---IFTTDVSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQHL 185

Query: 207 SSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWRTGPL 266
           S P SP L++    + +      RRK     P++  G    RRC HC   KTPQWRTGP+
Sbjct: 186 SPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPES--GGAEERRCLHCATDKTPQWRTGPM 245

Query: 267 GAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIA 308
           G KT+CNACGVR+KSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE++
Sbjct: 246 GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMS 289

BLAST of Cp4.1LG14g01630 vs. ExPASy Swiss-Prot
Match: O82632 (GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 4.6e-36
Identity = 116/288 (40.28%), Positives = 154/288 (53.47%), Query Frame = 0

Query: 34  DDFFVDQLLDFSNDDQFVQDQTPDEDEHDHDDSVSLSGREIHQNSIVSDHPSLPSG--EL 93
           D F VD LLDFSNDD  V     D+  +   DS +LS   +  +S  S   +  +G  +L
Sbjct: 16  DSFVVDDLLDFSNDDGEV-----DDGLNTLPDSSTLSTGTLTDSSNSSSLFTDGTGFSDL 75

Query: 94  TVPVDDLADLEWLSHFVEDSFSG--------FSA-SFPSAGISSLVKSSKDSAAVDNQPS 153
            +P DD+A+LEWLS+FVE+SF+G        FS    P    S+L    K    +D+Q  
Sbjct: 76  YIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLKNPQTTGSTLTHLIKPEPELDHQ-- 135

Query: 154 NGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSPASPW 213
               I   E+     +P KAR+KR+R+    W     SL +S  ++              
Sbjct: 136 ---FIDIDES--NVAVPAKARSKRSRSAASTWASRLLSLADSDETN-------------- 195

Query: 214 LILPDRFEPEIPKKKPRR---KSLSEKPKTNVG-AQPPRRCSHCGVQKTPQWRTGPLGAK 273
                      PKKK RR   +  +     + G +   RRC HC  +KTPQWRTGP+G K
Sbjct: 196 -----------PKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPK 255

Query: 274 TVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEI 307
           T+CNACGVR+KSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE+
Sbjct: 256 TLCNACGVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM 266
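
The Swiss-Prot match descriptions above follow the UniProt header convention, in which the protein name is followed by `OS=` (organism), `OX=` (taxonomy ID), `GN=` (gene name), `PE=` (protein existence level), and `SV=` (sequence version). A minimal sketch of splitting such a description into fields follows; `parse_uniprot_description` is a hypothetical helper name, and the code assumes the tags appear only as field markers, never inside the protein name.

```python
import re


def parse_uniprot_description(desc: str) -> dict:
    """Split 'Name OS=... OX=... GN=... PE=... SV=...' into a dict of fields."""
    # Split on whitespace that is immediately followed by a known tag.
    parts = re.split(r"\s+(?=(?:OS|OX|GN|PE|SV)=)", desc.strip())
    fields = {"name": parts[0]}
    for part in parts[1:]:
        key, _, value = part.partition("=")
        fields[key] = value
    return fields


info = parse_uniprot_description(
    "GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1"
)
print(info["name"], info["OS"], info["GN"])
```

Splitting on the tag boundaries rather than on every space keeps multi-word values such as the organism name intact.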

BLAST of Cp4.1LG14g01630 vs. NCBI nr
Match: XP_023551796.1 (GATA transcription factor 5-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 632 bits (1631), Expect = 1.37e-228
Identity = 318/318 (100.00%), Positives = 318/318 (100.00%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS
Sbjct: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS
Sbjct: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 301 RRKKEIAAPAELLTLEQK 318

BLAST of Cp4.1LG14g01630 vs. NCBI nr
Match: XP_022929334.1 (GATA transcription factor 5-like [Cucurbita moschata])

HSP 1 Score: 620 bits (1599), Expect = 1.04e-223
Identity = 312/318 (98.11%), Positives = 316/318 (99.37%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFN QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 1   MECVDGALEILQLSPQLCFRDNGCFNPQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVD +PS+GGSISPPENCFKTPIPVKAR+KRTRTGGRVWCLASPS
Sbjct: 121 SAGISSLVKSSKDSAAVDKKPSHGGSISPPENCFKTPIPVKARSKRTRTGGRVWCLASPS 180

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDR+EPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS
Sbjct: 181 LTESSSSSTTSSSSSSSPASPWLILPDRYEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 301 RRKKEIAAPAELLTLEQK 318

BLAST of Cp4.1LG14g01630 vs. NCBI nr
Match: KAG7015332.1 (GATA transcription factor 5, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 621 bits (1602), Expect = 1.15e-223
Identity = 313/318 (98.43%), Positives = 316/318 (99.37%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFN QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 32  MECVDGALEILQLSPQLCFRDNGCFNPQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 91

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 92  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 151

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVD +PS+GGSISPPENCFKTPIPVKAR+KRTRTGGRVWCLASPS
Sbjct: 152 SAGISSLVKSSKDSAAVDKKPSHGGSISPPENCFKTPIPVKARSKRTRTGGRVWCLASPS 211

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS
Sbjct: 212 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 271

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 272 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 331

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 332 RRKKEIAAPAELLTLEQK 349

BLAST of Cp4.1LG14g01630 vs. NCBI nr
Match: KAG6577242.1 (GATA transcription factor 5, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 619 bits (1596), Expect = 2.98e-223
Identity = 311/318 (97.80%), Positives = 316/318 (99.37%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFN QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 1   MECVDGALEILQLSPQLCFRDNGCFNPQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVD +PS+GGSISPPENCFKTPIPVKAR+KRTRTGGRVWCLASPS
Sbjct: 121 SAGISSLVKSSKDSAAVDKKPSHGGSISPPENCFKTPIPVKARSKRTRTGGRVWCLASPS 180

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDR+EPEIPKKKPRRKSLSE+PKTNVGAQPPRRCS
Sbjct: 181 LTESSSSSTTSSSSSSSPASPWLILPDRYEPEIPKKKPRRKSLSERPKTNVGAQPPRRCS 240

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 301 RRKKEIAAPAELLTLEQK 318

BLAST of Cp4.1LG14g01630 vs. NCBI nr
Match: XP_022985049.1 (GATA transcription factor 5-like [Cucurbita maxima])

HSP 1 Score: 617 bits (1592), Expect = 1.21e-222
Identity = 311/318 (97.80%), Positives = 314/318 (98.74%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFN QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 1   MECVDGALEILQLSPQLCFRDNGCFNPQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSG+EIH NSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 61  HDHDDSVSLSGQEIHPNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVD QP NGGSISPPENCFKTPIPVKAR+KRTRTGGRVWCLASPS
Sbjct: 121 SAGISSLVKSSKDSAAVDKQPGNGGSISPPENCFKTPIPVKARSKRTRTGGRVWCLASPS 180

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKT+VGAQPPRRCS
Sbjct: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTSVGAQPPRRCS 240

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 301 RRKKEIAAPAELLTLEQK 318

BLAST of Cp4.1LG14g01630 vs. ExPASy TrEMBL
Match: A0A6J1ENF7 (GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111435940 PE=3 SV=1)

HSP 1 Score: 620 bits (1599), Expect = 5.03e-224
Identity = 312/318 (98.11%), Positives = 316/318 (99.37%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFN QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 1   MECVDGALEILQLSPQLCFRDNGCFNPQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVD +PS+GGSISPPENCFKTPIPVKAR+KRTRTGGRVWCLASPS
Sbjct: 121 SAGISSLVKSSKDSAAVDKKPSHGGSISPPENCFKTPIPVKARSKRTRTGGRVWCLASPS 180

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDR+EPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS
Sbjct: 181 LTESSSSSTTSSSSSSSPASPWLILPDRYEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 301 RRKKEIAAPAELLTLEQK 318

BLAST of Cp4.1LG14g01630 vs. ExPASy TrEMBL
Match: A0A6J1JC71 (GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111483139 PE=3 SV=1)

HSP 1 Score: 617 bits (1592), Expect = 5.86e-223
Identity = 311/318 (97.80%), Positives = 314/318 (98.74%), Query Frame = 0

Query: 1   MECVDGALEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60
           MECVDGALEILQLSPQLCFRDNGCFN QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE
Sbjct: 1   MECVDGALEILQLSPQLCFRDNGCFNPQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDE 60

Query: 61  HDHDDSVSLSGREIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120
           HDHDDSVSLSG+EIH NSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP
Sbjct: 61  HDHDDSVSLSGQEIHPNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFP 120

Query: 121 SAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPS 180
           SAGISSLVKSSKDSAAVD QP NGGSISPPENCFKTPIPVKAR+KRTRTGGRVWCLASPS
Sbjct: 121 SAGISSLVKSSKDSAAVDKQPGNGGSISPPENCFKTPIPVKARSKRTRTGGRVWCLASPS 180

Query: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCS 240
           LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKT+VGAQPPRRCS
Sbjct: 181 LTESSSSSTTSSSSSSSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTSVGAQPPRRCS 240

Query: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300
           HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM
Sbjct: 241 HCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEM 300

Query: 301 RRKKEIAAPAELLTLEQK 318
           RRKKEIAAPAELLTLEQK
Sbjct: 301 RRKKEIAAPAELLTLEQK 318

BLAST of Cp4.1LG14g01630 vs. ExPASy TrEMBL
Match: A0A1S3CPB9 (GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103503274 PE=3 SV=1)

HSP 1 Score: 438 bits (1127), Expect = 2.35e-152
Identity = 234/315 (74.29%), Positives = 263/315 (83.49%), Query Frame = 0

Query: 8   LEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHDHDDSV 67
           +E ++LSPQLCF      N QN+ +SDDFFVDQLLD S+ D+F+QDQTPD+D+ D   SV
Sbjct: 1   MECVRLSPQLCF------NPQNVVSSDDFFVDQLLDLSDHDEFLQDQTPDDDDDDDKPSV 60

Query: 68  SLSG----REIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFPSAG 127
           SLS     +EIHQ+SIVSD PSLPS ELTVP DDL DLEWLSHFVEDSFSGFSA FPS  
Sbjct: 61  SLSNFVSAQEIHQDSIVSDLPSLPSSELTVPADDLEDLEWLSHFVEDSFSGFSAPFPS-- 120

Query: 128 ISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLTE 187
              L+KSSK+ + ++ Q  + GS+SPPE CFKTPIPVKAR+KR RT GRVWCL SPSLT+
Sbjct: 121 ---LMKSSKEISTLEEQLEDDGSVSPPEPCFKTPIPVKARSKRRRTSGRVWCLRSPSLTD 180

Query: 188 SSSSSTTSSSSSSSPASPWLILPDRFEPEIP-KKKPRRKSLSEKPKTNVGAQPPRRCSHC 247
           SSS STTSSSSSS PASPWLI+ +RFEPEIP  KK RRKS SEK +  +GAQPPRRCSHC
Sbjct: 181 SSSCSTTSSSSSS-PASPWLIISNRFEPEIPVTKKRRRKSPSEKSRITIGAQPPRRCSHC 240

Query: 248 GVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRR 307
           GVQKTPQWRTGPLGAKT+CNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRR
Sbjct: 241 GVQKTPQWRTGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRR 300

Query: 308 KKEIAAPAELLTLEQ 317
           KKE+ APAE LT+E+
Sbjct: 301 KKEVTAPAEFLTVEK 303

BLAST of Cp4.1LG14g01630 vs. ExPASy TrEMBL
Match: A0A0A0KUP5 (GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_4G043890 PE=3 SV=1)

HSP 1 Score: 426 bits (1095), Expect = 1.93e-147
Identity = 230/316 (72.78%), Positives = 258/316 (81.65%), Query Frame = 0

Query: 8   LEILQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHDHDDSV 67
           +E ++LSPQLCF      N QN+ +SDDFFVDQLLD S+ D+F+QDQTPD+D+ D   SV
Sbjct: 3   MECVRLSPQLCF------NPQNVVSSDDFFVDQLLDLSDHDEFLQDQTPDDDDDDDKPSV 62

Query: 68  SLSG----REIHQNSIVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFPSAG 127
           SLS     +EIHQ+SIVSD PSLP+ ELTVP DDL DLEWLSHFVEDSFSGFSA FPS  
Sbjct: 63  SLSNLVSAQEIHQDSIVSDFPSLPTSELTVPADDLEDLEWLSHFVEDSFSGFSAPFPSP- 122

Query: 128 ISSLVKSSKDSAAVDNQ-PSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLT 187
               +KSSK+ A  + Q   + GS+SPPE CFKTPIP KAR+KR RT GRVWCL SPSLT
Sbjct: 123 ----MKSSKEIATSEEQLVEDDGSVSPPEPCFKTPIPAKARSKRRRTSGRVWCLRSPSLT 182

Query: 188 ESSSSSTTSSSSSSSPASPWLILPDRFEPEIPK-KKPRRKSLSEKPKTNVGAQPPRRCSH 247
           +SSS STTSSSSSS PASPWLI+ DRFEPEIP  KK RRKS SEK +  +GAQPPRRCSH
Sbjct: 183 DSSSCSTTSSSSSS-PASPWLIISDRFEPEIPATKKRRRKSPSEKSRITIGAQPPRRCSH 242

Query: 248 CGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMR 307
           CGVQKTPQWRTGPLGAKT+CNACGVRFKSGRLLPEYRPACSP FSSELHSNHHRKVLEMR
Sbjct: 243 CGVQKTPQWRTGPLGAKTLCNACGVRFKSGRLLPEYRPACSPNFSSELHSNHHRKVLEMR 302

Query: 308 RKKEIAAPAELLTLEQ 317
           RKKE+ AP E L++E+
Sbjct: 303 RKKEVTAPDEFLSVEK 306

BLAST of Cp4.1LG14g01630 vs. ExPASy TrEMBL
Match: A0A2N9GQS4 (GATA-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29680 PE=4 SV=1)

HSP 1 Score: 321 bits (822), Expect = 8.46e-105
Identity = 190/328 (57.93%), Positives = 226/328 (68.90%), Query Frame = 0

Query: 1   MECVDGALEI-------LQLSPQLCFRDNGCFNHQNLPTSDDFFVDQLLDFSNDDQFVQD 60
           MECV+ AL+        L+ SPQ  F D    N QN    DD FVD+LLDFSN+D FV+ 
Sbjct: 56  MECVEAALKTSLRKEMALKSSPQAVFEDMWAVNGQNGVACDDLFVDELLDFSNEDGFVK- 115

Query: 61  QTPDEDEHDHDDSVSLSGREIHQNS------IVSDHPSLPSGELTVPVDDLADLEWLSHF 120
              +E+E +    VS+S ++ H+NS      +  +  S+P+ EL VP DDLA+LEWLSHF
Sbjct: 116 -AEEEEEEEDKGFVSVSPQQDHENSNTNTFTVKDEFGSVPTSELAVPADDLANLEWLSHF 175

Query: 121 VEDSFSGFSASFPSAGISSLVKSSKDSAAVDNQPSNGGSISPPENCFKTPIPVKARTKRT 180
           VEDSFS FSA +P  GI  L++  K+ A+     +      P   CFKTP+P KAR+KRT
Sbjct: 176 VEDSFSEFSAPYP-PGI--LIEKPKNEASEPEPETPSDETKP---CFKTPVPAKARSKRT 235

Query: 181 RTGGRVWCLASPSLTESSSSSTTSSSSSSSPASPWLIL--------PDRFEPEIPKKKPR 240
           RTGGRVW L SPSLTESSSSS TSSSSSSSP+S  LI         P   E + P KK +
Sbjct: 236 RTGGRVWSLGSPSLTESSSSS-TSSSSSSSPSSSGLIYANPAQNSEPVNMEGKPPLKKQK 295

Query: 241 RKSLSEKPKTNVG-AQPPRRCSHCGVQKTPQWRTGPLGAKTVCNACGVRFKSGRLLPEYR 300
           +K  +E      G AQPPRRCSHCGVQKTPQWRTGP+GAKT+CNACGVR+KSGRLLPEYR
Sbjct: 296 KKLAAEGGVVGSGGAQPPRRCSHCGVQKTPQWRTGPMGAKTLCNACGVRYKSGRLLPEYR 355

Query: 301 PACSPTFSSELHSNHHRKVLEMRRKKEI 306
           PACSPTFS+ELHSNHHRKVLEMRRKKE+
Sbjct: 356 PACSPTFSTELHSNHHRKVLEMRRKKEV 374

BLAST of Cp4.1LG14g01630 vs. TAIR 10
Match: AT5G66320.1 (GATA transcription factor 5)

HSP 1 Score: 248.4 bits (633), Expect = 7.5e-66
Identity = 151/294 (51.36%), Positives = 187/294 (63.61%), Query Frame = 0

Query: 28  QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHD---------HDDSVSLSGREIHQNS 87
           QN  + DDF VD LLD SNDD F  ++T  + +H+         +DD  +L  R     S
Sbjct: 33  QNGFSVDDFSVDDLLDLSNDDVFADEETDLKAQHEMVRVSSEEPNDDGDAL--RRSSDFS 92

Query: 88  IVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFPSAGISSLVKSSKDSAAVD 147
              D  SLP+ EL++P DDLA+LEWLSHFVEDSF+ +S      G +     ++  A + 
Sbjct: 93  GCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYS------GPNLTGTPTEKPAWLT 152

Query: 148 NQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSP 207
               +  +    E CFK+P+P KAR+KR R G +VW L S S +  SSS +T SSSSS P
Sbjct: 153 GDRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGST-SSSSSGP 212

Query: 208 ASPWLILPDRFEPEI-------PKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWR 267
           +SPW    +  EP +       PKK  +R + S         QP R+CSHCGVQKTPQWR
Sbjct: 213 SSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWR 272

Query: 268 TGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKE 306
            GP+GAKT+CNACGVR+KSGRLLPEYRPACSPTFSSELHSNHHRKV+EMRRKKE
Sbjct: 273 AGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cp4.1LG14g01630 vs. TAIR 10
Match: AT5G66320.2 (GATA transcription factor 5)

HSP 1 Score: 248.4 bits (633), Expect = 7.5e-66
Identity = 151/294 (51.36%), Positives = 187/294 (63.61%), Query Frame = 0

Query: 28  QNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHD---------HDDSVSLSGREIHQNS 87
           QN  + DDF VD LLD SNDD F  ++T  + +H+         +DD  +L  R     S
Sbjct: 33  QNGFSVDDFSVDDLLDLSNDDVFADEETDLKAQHEMVRVSSEEPNDDGDAL--RRSSDFS 92

Query: 88  IVSDHPSLPSGELTVPVDDLADLEWLSHFVEDSFSGFSASFPSAGISSLVKSSKDSAAVD 147
              D  SLP+ EL++P DDLA+LEWLSHFVEDSF+ +S      G +     ++  A + 
Sbjct: 93  GCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYS------GPNLTGTPTEKPAWLT 152

Query: 148 NQPSNGGSISPPENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSP 207
               +  +    E CFK+P+P KAR+KR R G +VW L S S +  SSS +T SSSSS P
Sbjct: 153 GDRKHPVTAVTEETCFKSPVPAKARSKRNRNGLKVWSLGSSSSSGPSSSGST-SSSSSGP 212

Query: 208 ASPWLILPDRFEPEI-------PKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWR 267
           +SPW    +  EP +       PKK  +R + S         QP R+CSHCGVQKTPQWR
Sbjct: 213 SSPWFSGAELLEPVVTSERPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWR 272

Query: 268 TGPLGAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKE 306
            GP+GAKT+CNACGVR+KSGRLLPEYRPACSPTFSSELHSNHHRKV+EMRRKKE
Sbjct: 273 AGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

BLAST of Cp4.1LG14g01630 vs. TAIR 10
Match: AT3G51080.1 (GATA transcription factor 6)

HSP 1 Score: 217.2 bits (552), Expect = 1.9e-56
Identity = 146/294 (49.66%), Positives = 180/294 (61.22%), Query Frame = 0

Query: 34  DDFFVDQLLDFSNDDQFVQDQTPDEDEHDHDDSVSLSGRE-IHQNSIVSDHPSLPSGELT 93
           DDF VD LLDFS +++       DE E        +S    +H+++  S      SG L+
Sbjct: 26  DDFSVDDLLDFSKEEEDDDVLVEDEAELKVQRKRGVSDENTLHRSNDFSTADFHTSG-LS 85

Query: 94  VPVDDLADLEWLSHFVED-SFSGFSAS-----FPSAGISSLVKSSKDSAAVDNQPSNGGS 153
           VP+DD+A+LEWLS+FV+D SF+ +SA      + +     LV+  K+             
Sbjct: 86  VPMDDIAELEWLSNFVDDSSFTPYSAPTNKPVWLTGNRRHLVQPVKE------------- 145

Query: 154 ISPPENCFKTPIP-VKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSPASP-WLI 213
               E CFK+  P VK R KR RTG RVW   S SLT+SSSSSTTSSSSS  P+SP WL 
Sbjct: 146 ----ETCFKSQHPAVKTRPKRARTGVRVWSHGSQSLTDSSSSSTTSSSSSPRPSSPLWLA 205

Query: 214 LPDRFEPEIPKKKPRRKSLSEKPKTNVGAQ-PPRRCSHCGVQKTPQWRTGPLGAKTVCNA 273
                +  + K + ++K      +T    Q   R+C HCGVQKTPQWR GPLGAKT+CNA
Sbjct: 206 SGQFLDEPMTKTQKKKKVWKNAGQTQTQTQTQTRQCGHCGVQKTPQWRAGPLGAKTLCNA 265

Query: 274 CGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIAAPAELLTLEQ 318
           CGVR+KSGRLLPEYRPACSPTFSSELHSNHH KV+EMRRKKE +  AE   L Q
Sbjct: 266 CGVRYKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMRRKKETSDGAEETGLNQ 301

BLAST of Cp4.1LG14g01630 vs. TAIR 10
Match: AT4G36240.1 (GATA transcription factor 7)

HSP 1 Score: 172.6 bits (436), Expect = 5.2e-43
Identity = 122/282 (43.26%), Positives = 152/282 (53.90%), Query Frame = 0

Query: 35  DFFVDQLLDFSNDDQFVQDQTPD--EDEHDHDDSVSLSGREIHQNSIVSDHPSLPSGELT 94
           DF VD LLD SN D  ++  +    EDE + +   S S     Q++ +S    L S    
Sbjct: 10  DFSVDDLLDLSNADTSLESSSSQRKEDEQEREKFKSFS----DQSTRLSPPEDLLSFPGD 69

Query: 95  VPVDDLADLEWLSHFVEDSFSG--FSASFPSAGISSLVKSSKDSAAVDNQPSNGGSISPP 154
            PV DL DLEWLS+FVEDSFS    S+ FP   ++                    S+   
Sbjct: 70  APVGDLEDLEWLSNFVEDSFSESYISSDFPVNPVA--------------------SVEVR 129

Query: 155 ENCFKTPIPVKARTKRTRTGGRVWCLASPSLTESSSSSTTSSSSSSSPASPWLILPDRFE 214
             C    +PVK R+KR RT GR+W + SPS   S++ +                      
Sbjct: 130 RQC----VPVKPRSKRRRTNGRIWSMESPSPLLSTAVARR-------------------- 189

Query: 215 PEIPKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWRTGPLGAKTVCNACGVRFKS 274
               KK+ R+K  +         Q  R CSHCGVQKTPQWR GPLGAKT+CNACGVRFKS
Sbjct: 190 ----KKRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLCNACGVRFKS 238

Query: 275 GRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIAAPAEL 313
           GRLLPEYRPACSPTF++E+HSN HRKVLE+R  K +A PA +
Sbjct: 250 GRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK-VADPARV 238

BLAST of Cp4.1LG14g01630 vs. TAIR 10
Match: AT5G25830.1 (GATA transcription factor 12)

HSP 1 Score: 157.9 bits (398), Expect = 1.3e-38
Identity = 122/292 (41.78%), Positives = 156/292 (53.42%), Query Frame = 0

Query: 27  HQNLPTSDDFFVDQLLDFSNDDQFVQDQTPDEDEHDH-DDSVSLSGREIHQ-NSIVSDHP 86
           H+   TSD    D L+DFSNDD    D   D        DS + S  ++   +  V D  
Sbjct: 6   HEFFHTSDFAVDDLLVDFSNDDDEENDVVADSTTTTTITDSSNFSAADLPSFHGDVQDGT 65

Query: 87  SLPSGELTVPVDDLAD-LEWLSHFVEDSFSGFSASFPSAGISSLVKSSKDSAAVDNQPSN 146
           S  SG+L +P DDLAD LEWLS+ V++S S          + S  KS  D  +    P N
Sbjct: 66  SF-SGDLCIPSDDLADELEWLSNIVDESLS--PEDVHKLELISGFKSRPDPKSDTGSPEN 125

Query: 147 GGSISPPENCFKT--PIPVKARTKRTRTGGRVWC---LASPSLTESSSSSTTSSSSS--- 206
             S SP    F T   +P KAR+KR+R     W    L   +  +S  +  T  SS    
Sbjct: 126 PNSSSP---IFTTDVSVPAKARSKRSRAAACNWASRGLLKETFYDSPFTGETILSSQQHL 185

Query: 207 SSPASPWLILPDRFEPEIPKKKPRRKSLSEKPKTNVGAQPPRRCSHCGVQKTPQWRTGPL 266
           S P SP L++    + +      RRK     P++  G    RRC HC   KTPQWRTGP+
Sbjct: 186 SPPTSPPLLMAPLGKKQAVDGGHRRKKDVSSPES--GGAEERRCLHCATDKTPQWRTGPM 245

Query: 267 GAKTVCNACGVRFKSGRLLPEYRPACSPTFSSELHSNHHRKVLEMRRKKEIA 308
           G KT+CNACGVR+KSGRL+PEYRPA SPTF    HSN HRKV+E+RR+KE++
Sbjct: 246 GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMS 289
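
The Identity and Positives percentages in the two TAIR 10 HSP summaries above are the match counts divided by the aligned length (gap columns included). A minimal Python sketch that reproduces them from the counts printed in those summary lines; the helper name `percent` and the `hsps` dictionary are illustrative and not part of the database output:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# (identical, positive, aligned length), copied from the HSP summary lines above
hsps = {
    "AT4G36240.1": (122, 152, 282),
    "AT5G25830.1": (122, 156, 292),
}

for name, (identical, positive, length) in hsps.items():
    # Prints the hit name followed by percent identity and percent positives
    print(name, percent(identical, length), percent(positive, length))
```

The denominator is the alignment length, not the length of either protein, which is why the two hits share the same identical-residue count (122) but report different identity percentages.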

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
Q9FH57          1.1e-64    51.36         GATA transcription factor 5 OS=Arabidopsis thaliana OX=3702 GN=GATA5 PE=2 SV=1
Q9SD38          2.6e-55    49.66         GATA transcription factor 6 OS=Arabidopsis thaliana OX=3702 GN=GATA6 PE=2 SV=1
O65515          7.4e-42    43.26         GATA transcription factor 7 OS=Arabidopsis thaliana OX=3702 GN=GATA7 PE=2 SV=1
P69781          1.9e-37    41.78         GATA transcription factor 12 OS=Arabidopsis thaliana OX=3702 GN=GATA12 PE=2 SV=1
O82632          4.6e-36    40.28         GATA transcription factor 9 OS=Arabidopsis thaliana OX=3702 GN=GATA9 PE=2 SV=1

Match Name      E-value    Identity (%)  Description
XP_023551796.1  1.37e-228  100.00        GATA transcription factor 5-like [Cucurbita pepo subsp. pepo]
XP_022929334.1  1.04e-223  98.11         GATA transcription factor 5-like [Cucurbita moschata]
KAG7015332.1    1.15e-223  98.43         GATA transcription factor 5, partial [Cucurbita argyrosperma subsp. argyrosperma...
KAG6577242.1    2.98e-223  97.80         GATA transcription factor 5, partial [Cucurbita argyrosperma subsp. sororia]
XP_022985049.1  1.21e-222  97.80         GATA transcription factor 5-like [Cucurbita maxima]

Match Name      E-value    Identity (%)  Description
A0A6J1ENF7      5.03e-224  98.11         GATA transcription factor OS=Cucurbita moschata OX=3662 GN=LOC111435940 PE=3 SV=...
A0A6J1JC71      5.86e-223  97.80         GATA transcription factor OS=Cucurbita maxima OX=3661 GN=LOC111483139 PE=3 SV=1
A0A1S3CPB9      2.35e-152  74.29         GATA transcription factor OS=Cucumis melo OX=3656 GN=LOC103503274 PE=3 SV=1
A0A0A0KUP5      1.93e-147  72.78         GATA transcription factor OS=Cucumis sativus OX=3659 GN=Csa_4G043890 PE=3 SV=1
A0A2N9GQS4      8.46e-105  57.93         GATA-type domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2968...

Match Name      E-value    Identity (%)  Description
AT5G66320.1     7.5e-66    51.36         GATA transcription factor 5
AT5G66320.2     7.5e-66    51.36         GATA transcription factor 5
AT3G51080.1     1.9e-56    49.66         GATA transcription factor 6
AT4G36240.1     5.2e-43    43.26         GATA transcription factor 7
AT5G25830.1     1.3e-38    41.78         GATA transcription factor 12

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                    Source       Source Term     Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type             SMART        SM00401         GATA_3                                             coord: 233..287; e-value: 6.8E-16; score: 68.8
IPR000679  Zinc finger, GATA-type             PFAM         PF00320         GATA                                               coord: 239..273; e-value: 1.4E-15; score: 56.6
IPR000679  Zinc finger, GATA-type             PROSITE      PS00344         GATA_ZN_FINGER_1                                   coord: 239..264
IPR000679  Zinc finger, GATA-type             PROSITE      PS50114         GATA_ZN_FINGER_2                                   coord: 233..269; score: 11.648429
IPR000679  Zinc finger, GATA-type             CDD          cd00202         ZnF_GATA                                           coord: 238..286; e-value: 1.9621E-13; score: 62.005
IPR016679  Transcription factor, GATA, plant  PIRSF        PIRSF016992     Txn_fac_GATA_plant                                 coord: 11..318; e-value: 3.8E-85; score: 284.0
IPR013088  Zinc finger, NHR/GATA-type         GENE3D       3.30.50.10      -                                                  coord: 230..306; e-value: 2.6E-15; score: 57.9
None       No IPR available                   MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 181..201
None       No IPR available                   MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 134..150
None       No IPR available                   MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 181..246
None       No IPR available                   MOBIDB_LITE  mobidb-lite     disorder_prediction                                coord: 134..166
None       No IPR available                   PANTHER      PTHR45658       GATA TRANSCRIPTION FACTOR                          coord: 1..307
None       No IPR available                   PANTHER      PTHR45658:SF88  GATA TRANSCRIPTION FACTOR                          coord: 1..307
None       No IPR available                   SUPERFAMILY  57716           Glucocorticoid receptor-like (DNA-binding domain)  coord: 233..295
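
The five IPR000679 (Zinc finger, GATA-type) hits above come from different member databases and report slightly different boundaries. As a rough sketch, assuming the coordinates are 1-based inclusive residue positions on the predicted protein, the region shared by all five hits and the outermost span can be computed as follows. The function names `common_core` and `union_span` are illustrative only:

```python
from typing import Dict, Optional, Tuple

Interval = Tuple[int, int]  # (start, end), assumed 1-based and inclusive

# IPR000679 hit coordinates copied from the InterPro table above
gata_hits: Dict[str, Interval] = {
    "SMART SM00401": (233, 287),
    "PFAM PF00320": (239, 273),
    "PROSITE PS00344": (239, 264),
    "PROSITE PS50114": (233, 269),
    "CDD cd00202": (238, 286),
}


def common_core(hits: Dict[str, Interval]) -> Optional[Interval]:
    """Return the interval covered by every hit, or None if they do not all overlap."""
    start = max(s for s, _ in hits.values())
    end = min(e for _, e in hits.values())
    return (start, end) if start <= end else None


def union_span(hits: Dict[str, Interval]) -> Interval:
    """Return the outermost span from the smallest start to the largest end."""
    return (min(s for s, _ in hits.values()), max(e for _, e in hits.values()))


print("core:", common_core(gata_hits))
print("span:", union_span(gata_hits))
```

Under that assumption the shared core coincides with the PROSITE PS00344 pattern match, the shortest of the five hits, and the outermost span coincides with the SMART hit.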

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g01630.1  Cp4.1LG14g01630.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030154      cell differentiation
biological_process  GO:0045893      positive regulation of transcription, DNA-templated
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0003677      DNA binding