Cp4.1LG15g03820 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG15g03820
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: GATA transcription factor, putative
Location: Cp4.1LG15 : 5041268 .. 5047572 (+)
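
The location field packs a sequence ID, start and end coordinates, and a strand into one string. The following is a minimal sketch of how it could be parsed and the span length computed. The function name parse_location is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG15 : 5041268 .. 5047572 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 6305 bases on the forward strand of Cp4.1LG15.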
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TGGATCTGCAACTAAGATCAAGATGAAGCAACCCTTTGTTGCAGCACCTTCTTCCCCATTTCTGCTTTTCGTCGTCTCGCCGGCGCTGTTTCGCCGCCGGTTTTTCTATCGTTCCGGCGACCTTGAACTGAAGTGGAGTGGAGTGGAGTGGCATCAACCGTAAAAAACCCCCAAATTCAAGGTCATTTTCCAGCCCATGTTATCTTCCTCCTTTTTCCCTATGAAAATTTAGATTCCCAATTTTCCATATCAATCTTCGATTATCCTCTTCCTGCTGCTGATTTTTTTTGTCCAACGCTTTTCACTTTCACTCTGCTTTCTTCTTCACCCTTCTGATATTTACTTGGATTTTCTAGGGTTTACTTTACTTGAATTTTTTGTTGCATTTTTCGCGATTGAATTTGGATTTGGAATCGCTTGCTTTTACTTCAATTCCAGTTTCTTTTGTGTTATTGGGCGGGGTTGATTAATTCCCCTCTCTTTTTGTTGAGCTATCCTGTTCGAAATTTGAAATCCAGGGCTTTGAATTGTCTGTTTCTCATACATAATTTTTCTGCTGACCTTTTCGATCGTCCGTCGTGTTTGGTCAATAAGTTTCTGGAATTTTTTGGTTCCTTGTTAAATTAGCGTGTTCTTGGCCTTCCTATGTAATTCTTGAAGAAATGCCGGACTCCAATTTCCAAGACGCGATGTACGGTTCCGCCGTCGTGAACAACGGTGGCCGGGTTTTGGATAGCATCCAGAACCGAGTTGGCGATGAAGGTGACGACATTACTGTCGGGGAAGAGTCCATGGACAATCCTCAGATGCGCTTCGAAGATTCTGGGGGGATGAACCGAGTCCAGGACATTGTTCCTTCGTCGTATGTTTCCGGCTCCGATTACAACCCTTTGACTGGAAACGGTGGTGCTGACCAACTTACACTGTCGTTCCGCGGGGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTTTGGTTTTCAATTGGGTGTCCATACTCTTCGTAAATTATGCACTAGTTGTGTAATTCTTCTTCTACTTACTGAAGATATATTGTATTTGAAGCTGGGTTTGAGTAATTTGCGAACAATATGATGAACTTCGAACAATTTGACTCAATGCTTCGTTATGACATTCAACTTCAAGATCATTCAAATTGGGGAGGGATTTTGTAAAATGTAATATTGTTGGTTTAAAGGATTTTCTGAACTGTTATAGAATGGGTATAATTTTAAAACCCTGTTTTAGTTTCCTACAGGGACCTAACTGTGTTGTGTTGCTGTATTGTTGGAAATTTGGACGATGAATGGAAAGCTTTTTCTTTTGAAATATGATTGAAAAAGAAGAAAGGAATATGATGATAAATGAACAGATTGTGCCAGTTGAACTGAATACTACTGTTGTCAAATTAGATGTATGAAGGTTTTGGTGCGAAGTATACATATATGCCATGGCATGCCATACTATATGAAGGAGTTGTTTTGAGGTTCGAGTACATGAATACGATCACGATAGATTTAGGAAGGATTTGGTGCGAAGTATACGTATATGCCATGCCATGCCATGCCATACTATTATGAAGGAGTTGTTTTGAGGTTTGAGTACATGAATTCGAACACGAGGGGTTTTAAGTTTTTCGTGGTTAAATTTGACAGGTTCAAGCCGTGCTTTTGCTTTTAGGTGGATATGAAATTCCTTCTGGGATTCCTGCTGTTGGAAGCGTTCCCGTCAACCAACAGGTATACGTGCACGTTATTATTTTAATGGTGTATTGATTGAAACCATCTGGTGTTCTTGATTTCCTACCTCAGTTAACTTTTGTTCTAATTCTTACAGGGTACCAATGGCTTTCCTGTCAGGTCGGTTCAACCACAAAGAGCTGCTTCATTGAATAGGTTTAGGGAGAAGAGAAAAGAAAGGTGTTTTGAGAAGAAAATTCGTTACACCGTGCGAAAAGAAGTTGCACTCAGGTATAACTTACAATAGATGCCTGTTTA
TTTATTTATTAGATGGTTGAAGTTAAGTCAAGGGAGCGCTCCCTTCCTCTCCCGCTCTCACTCAGAGATCTAAGTAAAAAATTACCAAAAAGATATCCGATGAGATCCCACATCAATTGGGGAGGAGAACGAAACATTCTTTGTAAGGGTGTGGAAACCTCTCCCTAGCAGACGTGTTTTAAAAATCTTGAGGGGAAGCCCGAAAGGGAAAGCCCAAAGAGGACAATATCTACTAGCGGTGGGCTTGGGCTGTTACATATCCTTTGCCAGACTCTCTCCTCTCTATATTCCCTCTTTGTCAGCCTTACTGACTAATACGCTTGAGTCGTGCACAAACTATGACATCATCCTACATGTCTCACCCTTTCCTCGTTTACCAGCATACACTTCAGTGATTGGGGCCTATCATTATATTAAGTAGAGTCAACAATTAGAAAACAGCAGGGATAGGATAGTGTGAAACAGCTATATTATGATTTTGTGGATCGTCTCTGATCTTGGTTGTGGTCAAATGGGATTTTCTAAAACATTAGTTTTTGAGGCAGGTAGCATTTCAATTTTAGATTTTCAGATGAGAAAAGGTGAAGGAAGTTTCTTGTAACAGCCCAATCCCACCGCTAACAGATATTGTCATTTTTGAGCTTTCCCTTCTGGCTTCCCCCTCAAAGTTTTAAAACGCATCTGTTAGGGAAAAATTTCTACACTCTTGTAAAGAGAGCTTTGTTCCCCTCTCCAACCATTGTAGGATCTCACAATCCACTCTCTTTGGGGGCCCAACGTTCATCGTTGGCACACCGCCCGATAACCAACTTCGTGAGACTGACGGTAATACGTAATGGGCCAAAGCGGACAATATCTGTTAGCGATGTGCTTGGGTTGTTACATTTCTAACGTGGGAAAAATGCTTTATTGATGTAACTTAGAGGTGTTATAGATGTTCGATGGGCCGCTCATAATATTTAAGTTTTTTTTTACTTGAAATAAAGAAAATAAGAGCTGCTAATAGAAAGGAAATTACATATCTGAAGAAGATGGAAGTTGATAGGAATAACTTTCTTTAATCTTGTGAAATATATATATTTCTAATAACTTTCAGAAAAAGAGAGATGAATTTATAACTGTGATGATATAGTGCAAAAAGCATATCTTTAGTGTACTTCGAAAGATTTCTTTTAATCAATTCAATGTTGTTTCTGTTGTATGCAATTCAACGTGATATATTCGTCTCCTTCATCATCCAGAGTATATAAACTAAGTTACTTTATTGCATTATTCCATTCTTATGGTTTGTAGTGCTCAGCATGTGACAATTCAAGAATTTAAACTAGCTCTAGTCAAATGCTCAAATCATCAAAGGACAGAAAAAATCATTTTGTTCTCGAGACATGTTCTAACAACTCTTGTCTGGATTCATCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCCTCTAAAGCTAATGCAGATGAAGTGGGCTCATCTTCACTTTTGTCTCAAACGTTGGATCCTGGACACGATGATGGCTTGTTGGAAACCTCGTAAGTATTGATCGATCCTCACAAGTCCCATTTTCTAACTTTGAATGACCTTGCTGGTAGTCTGAGTCTCATCAATTTGTCACCTTACGCTTGAGTTGCGAGTTGGTTTTGCATCCAACTGGAATACCAAAATTCTAGAAGGAATAGAAAAATTCTCAATTCTCATTCATACTGGAACAAATTCAGTGATCTGTTTCCCTAATAATGTCTTTCATTATTTGTTTATGCAGATGTACACATTGTGGAACCAGTTCAAAATCTACTCCGATGATGCGTCGTGGGCCTGCTGGCCCTAGGACTCTGTGCAATGCATGTGGGCTCAAGTGGGCGAATAAGGTTTGTTAATCGAGTCCTTTAGGTGATATAACATCGTCCATGAAATCGAGTCCTTTAGGTGATATAACATCGTCCATGAAATCGAGTCCTTTACAGTTGTCTTGGGCTGAAACTTGCATAATTATTT
GGAGATCAGTGATGGGAGCGAAGTATGTTGTGTTTAACATTTCATTATCAAAGACCTGCGGTTTTGATTCAGGAGATTCTTACAGTTGGTTGATGATGGTTTTAAGTTAGTTTTGCAGATCAGTCTTGGTTTTTAGATTTTCATTGGTTAGGAAGGAATACCAATAGGAACCGAACCTCGAGACGGTTTCTTCCTTTGCGAGTCTTTTCCCGTATGCCAGTAGAAAGACTTGGATGCAATCGGTTGCCTTAAATATTTAACTTTGTTCTTGACTATTGAGTTCCACCACACCACCTCTTGCCTTAAAATCTGCAGTAGAGAGTTTAGTTCATAGAGTATGTTTTTTCCCACAACTTTTCGAATGAGCTTTTCGATTGCGATTCACAGAGATGCATTAGCTGTTGTAGCAAACTAAATGTTCGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGAACTTTGAATGCAATAGGGAAACCTGCAGGAAATAGTTAAGTTCATAGAGTATGTCTGTTTTTTCCCACAACTTCTCAAATGAGCTTTTTGATTGCCATTCACCGAGATATATTACCTGTTGTAGCAAACTAAATGTTCGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGAACTTTGAATGCAATAGGGAAACCTGCAGTAAATGGTTAAGTTCATAGAGTATGTCTGTTTTTTCCCACAACTTTTCAAATGAGCTTTTCGATTGCCATTCACCGAGATATATTACCTGTCGTAGCAAACTAAATGTTCGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGAACTTTGAATGCAATAGGGAAACCTGCAGTAAATAGTTAAGTTCATAGAGTATGTCTGTTTTTTCCCACAACTTCTCAAATGAGCTTTTTGATTGCCATTCACCGAGATATATTACCTGTTGTAGCAAACTAAATGTTCGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGAACTTTGAATGCAATAGGGAAACCTGCAGTAAATGGTTAAGTTCATAGAGTATGTCTGTTTTTTCCCACAACTTTTCAAATGAGCTTTTCGATTGCCATTCACCGAGATATATTACCTGTCGTAGCAAACTAAATGTTCGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGAACTTTGAATGCAATAGGGAAACCTGCAGTAAATGGTTAAGTTCATAGAGTATGTCTGTTTTTTCCCACAACTTTTCAAATGAGCTTTTCGATTGCCATTCACCGAGATATATTACCTGTCGTAGCAAACTAAATGTTCGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGAACTTTGAATGCAATAGGGAAACCTGCAGTAAATGGTTAAGTTCATAGAGTATGTCTGTTTTTTCCCACAACTTTTCGATTGCGATTCACCGTGATATATTTACCTGTTGTAGCAAACTAAATGTTCGACGACCTGTCAATGGCCTCGAGTTCTAATTAGTGAACTTTGAATGCTATAGGGAATTTTGAGGGATCTTTCCAAGGTTTCGAACACTGGCGTCCAGGAACTCTCCGTGAAGGATACCGAACAGGTAGACATTTAAACGGAGTTAAGAAAGACTCAATAGTCAAATAATATCATACTTCCATACATTTTAATACTGTTTTGGTGTCTGTTATTCCTATCCATTTTTTCGTATATCAGAGCGATGGCGAAGCTAATGAATCCGACGCTGCAGTTAGCTAATGTGGATATTCTCGCTTCTAATGGCGATAATTAGGCCATAAACCACAAAAATTGATCAACTAACTCTCTGATAAGTTCGTGCTGTGGTTGAAGTTGGATCATTTGAGTATTCCAACAAATTATATGGAAATTCATGCTATGCTCTGTGCAGTTTAGAGAGTATGAGAAGAACCATAAGCAATGGTAGAATGGGTTCCCCATTTGTCACCAATCATGTAGCTGAAAATAGATGGGATTCAGAAGGTGGGTTTCAATGGCGGCCCTTTGG
TCGAAGTATGAAAGAACAAAAGCTCTGATCAAAAGGGCTGGTGAGGTGAGTTGCTGCCTCCACTTTATGATATGATCTTGTTCTTGTTCTACAATGTTTGGTTAAGATTTGTGCAATCTTCCCCATGTAAAAGGGCTGCTGTTAAGTGCTTCAGATCGCATTTTTTTTAGCTTATTTATGGCTTTGTATTTAAGGGAATTTTGTGTTGTGTTTTGATTTTGTGGAATTTTGAAAATTATAGTATTATGATCGAGTGATCTAAGTTAAGGTAGAGAGTAAAGTTGTGTTAAGTATAGAGCTTATTG

mRNA sequence

TGGATCTGCAACTAAGATCAAGATGAAGCAACCCTTTGTTGCAGCACCTTCTTCCCCATTTCTGCTTTTCGTCGTCTCGCCGGCGCTGTTTCGCCGCCGGTTTTTCTATCGTTCCGGCGACCTTGAACTGAAGTGGAGTGGAGTGGAGTGGCATCAACCGTAAAAAACCCCCAAATTCAAGAAATGCCGGACTCCAATTTCCAAGACGCGATGTACGGTTCCGCCGTCGTGAACAACGGTGGCCGGGTTTTGGATAGCATCCAGAACCGAGTTGGCGATGAAGGTGACGACATTACTGTCGGGGAAGAGTCCATGGACAATCCTCAGATGCGCTTCGAAGATTCTGGGGGGATGAACCGAGTCCAGGACATTGTTCCTTCGTCGTATGTTTCCGGCTCCGATTACAACCCTTTGACTGGAAACGGTGGTGCTGACCAACTTACACTGTCGTTCCGCGGGGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGATGTATGAAGGTTTTGGTGCGAAGTATACATATATGCCATGGCATGCCATACTATATGAAGGAGTTGTTTTGAGGTTCGAGTACATGAATACGATCACGATAGATTTAGGAAGGATTTGGTGCGAAGTTCAAGCCGTGCTTTTGCTTTTAGGTGGATATGAAATTCCTTCTGGGATTCCTGCTGTTGGAAGCGTTCCCGTCAACCAACAGGGTACCAATGGCTTTCCTGTCAGGTCGGTTCAACCACAAAGAGCTGCTTCATTGAATAGGTTTAGGGAGAAGAGAAAAGAAAGGTGTTTTGAGAAGAAAATTCGTTACACCGTGCGAAAAGAAGTTGCACTCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCCTCTAAAGCTAATGCAGATGAAGTGGGCTCATCTTCACTTTTGTCTCAAACGTTGGATCCTGGACACGATGATGGCTTGTTGGAAACCTCATGTACACATTGTGGAACCAGTTCAAAATCTACTCCGATGATGCGTCGTGGGCCTGCTGGCCCTAGGACTCTGTGCAATGCATGTGGGCTCAAGTGGGCGAATAAGGGAATTTTGAGGGATCTTTCCAAGGTTTCGAACACTGGCGTCCAGGAACTCTCCGTGAAGGATACCGAACAGAGCGATGGCGAAGCTAATGAATCCGACGCTGCAGTTAGCTAATGTGGATATTCTCGCTTCTAATGGCGATAATTAGGCCATAAACCACAAAAATTGATCAACTAACTCTCTGATAAGTTCGTGCTGTGGTTGAAGTTGGATCATTTGAGTATTCCAACAAATTATATGGAAATTCATGCTATGCTCTGTGCAGTTTAGAGAGTATGAGAAGAACCATAAGCAATGGTAGAATGGGTTCCCCATTTGTCACCAATCATGTAGCTGAAAATAGATGGGATTCAGAAGGTGGGTTTCAATGGCGGCCCTTTGGTCGAAGTATGAAAGAACAAAAGCTCTGATCAAAAGGGCTGGTGAGGTGAGTTGCTGCCTCCACTTTATGATATGATCTTGTTCTTGTTCTACAATGTTTGGTTAAGATTTGTGCAATCTTCCCCATGTAAAAGGGCTGCTGTTAAGTGCTTCAGATCGCATTTTTTTTAGCTTATTTATGGCTTTGTATTTAAGGGAATTTTGTGTTGTGTTTTGATTTTGTGGAATTTTGAAAATTATAGTATTATGATCGAGTGATCTAAGTTAAGGTAGAGAGTAAAGTTGTGTTAAGTATAGAGCTTATTG

Coding sequence (CDS)

ATGCCGGACTCCAATTTCCAAGACGCGATGTACGGTTCCGCCGTCGTGAACAACGGTGGCCGGGTTTTGGATAGCATCCAGAACCGAGTTGGCGATGAAGGTGACGACATTACTGTCGGGGAAGAGTCCATGGACAATCCTCAGATGCGCTTCGAAGATTCTGGGGGGATGAACCGAGTCCAGGACATTGTTCCTTCGTCGTATGTTTCCGGCTCCGATTACAACCCTTTGACTGGAAACGGTGGTGCTGACCAACTTACACTGTCGTTCCGCGGGGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGATGTATGAAGGTTTTGGTGCGAAGTATACATATATGCCATGGCATGCCATACTATATGAAGGAGTTGTTTTGAGGTTCGAGTACATGAATACGATCACGATAGATTTAGGAAGGATTTGGTGCGAAGTTCAAGCCGTGCTTTTGCTTTTAGGTGGATATGAAATTCCTTCTGGGATTCCTGCTGTTGGAAGCGTTCCCGTCAACCAACAGGGTACCAATGGCTTTCCTGTCAGGTCGGTTCAACCACAAAGAGCTGCTTCATTGAATAGGTTTAGGGAGAAGAGAAAAGAAAGGTGTTTTGAGAAGAAAATTCGTTACACCGTGCGAAAAGAAGTTGCACTCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCCTCTAAAGCTAATGCAGATGAAGTGGGCTCATCTTCACTTTTGTCTCAAACGTTGGATCCTGGACACGATGATGGCTTGTTGGAAACCTCATGTACACATTGTGGAACCAGTTCAAAATCTACTCCGATGATGCGTCGTGGGCCTGCTGGCCCTAGGACTCTGTGCAATGCATGTGGGCTCAAGTGGGCGAATAAGGGAATTTTGAGGGATCTTTCCAAGGTTTCGAACACTGGCGTCCAGGAACTCTCCGTGAAGGATACCGAACAGAGCGATGGCGAAGCTAATGAATCCGACGCTGCAGTTAGCTAA

Protein sequence

MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMNRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKYTYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGVQELSVKDTEQSDGEANESDAAVS
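
The protein sequence above corresponds to the coding sequence read codon by codon. As a rough sketch, assuming the standard genetic code applies, the first and last codons of the CDS can be translated with a small lookup table. The names CODON_TABLE and translate are illustrative and not part of any database tooling; the two short strings are copied from the start and end of the CDS above.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGCCGGACTCCAATTTCCAAGACGCG"  # first 27 nt of the CDS
cds_end = "GCTGCAGTTAGCTAA"                # last 15 nt, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two results, MPDSNFQDA and AAVS, match the beginning and end of the protein sequence listed above.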
BLAST of Cp4.1LG15g03820 vs. Swiss-Prot
Match: GAT24_ARATH (GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2)

HSP 1 Score: 175.3 bits (443), Expect = 1.1e-42
Identity = 110/189 (58.20%), Positives = 130/189 (68.78%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIPA-VGSVPVNQQ--GTNGFPVRSVQPQRAASLNRFREKRKER 205
           +VQAVLLLLGG E+P  +P  +GS   N +  G +G P R   PQR ASL RFREKRK R
Sbjct: 98  KVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRLSVPQRLASLLRFREKRKGR 157

Query: 206 CFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSS-----SLLSQTLDPGHDDGLLE 265
            F+K IRYTVRKEVALRMQRKKGQF S+K++ D+ GS+     S  S  ++ G +    E
Sbjct: 158 NFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSDWGSNQSWAVE-GTETQKPE 217

Query: 266 TSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGV-QELSVKDT 325
             C HCGTS KSTPMMRRGP GPRTLCNACGL WANKG LRDLSKV      Q LS+   
Sbjct: 218 VLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRDLSKVPPPQTPQHLSLNKN 277
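
Each HSP statistics line reports identical and positive-scoring positions as a fraction of the alignment length, followed by the percentage. The sketch below recomputes those percentages from the fractions; it assumes the `name = matches/length (pct%)` layout used on this page, and parse_hsp_stats is a hypothetical helper name.

```python
import re

def parse_hsp_stats(line):
    """Extract 'name = matches/length (pct%)' fields from an HSP statistics line."""
    stats = {}
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    for name, matches, length, pct in re.findall(pattern, line):
        stats[name] = {
            "matches": int(matches),
            "length": int(length),
            "reported_pct": float(pct),
            # Percentage recomputed from the fraction, rounded to two decimals.
            "computed_pct": round(100 * int(matches) / int(length), 2),
        }
    return stats

line = "Identity = 110/189 (58.20%), Positives = 130/189 (68.78%), Query Frame = 1"
stats = parse_hsp_stats(line)
for name, values in stats.items():
    print(name, values["reported_pct"], values["computed_pct"])
```

For this hit, 110 identical positions over an alignment length of 189 give 58.20%, and 130 positive positions give 68.78%, in agreement with the reported values.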

BLAST of Cp4.1LG15g03820 vs. Swiss-Prot
Match: GAT28_ARATH (GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 4.3e-42
Identity = 108/186 (58.06%), Positives = 123/186 (66.13%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIP-AVGSVPVNQQGTN--GFPVRSVQPQRAASLNRFREKRKER 205
           +VQAVLLLLGG E+P   P  +GS   N + ++  G P R   PQR ASL RFREKRK R
Sbjct: 102 KVQAVLLLLGGRELPQAAPPGLGSPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGR 161

Query: 206 CFEKKIRYTVRKEVALRMQRKKGQFISSKANADEV---GSSSLLSQTLDPGHDDGL-LET 265
            F+KKIRYTVRKEVALRMQR KGQF S+K+N DE    GSS   +QT      +    E 
Sbjct: 162 NFDKKIRYTVRKEVALRMQRNKGQFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQEI 221

Query: 266 SCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGVQELSVKDTEQ 325
           SC HCG   KSTPMMRRGPAGPRTLCNACGL WANKG  RDLSK S    Q L +   E 
Sbjct: 222 SCRHCGIGEKSTPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQTAQNLPLNKNED 281

BLAST of Cp4.1LG15g03820 vs. Swiss-Prot
Match: GAT20_ORYSJ (GATA transcription factor 20 OS=Oryza sativa subsp. japonica GN=GATA20 PE=2 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 1.9e-37
Identity = 98/177 (55.37%), Positives = 113/177 (63.84%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLNRFREKRKERCFE 205
           +VQAVLLLLGG E+  G+ +  S          FP       R ASL RFREKRKER F+
Sbjct: 146 KVQAVLLLLGGRELNPGLGSGASSSAPYSKRLNFP------HRVASLMRFREKRKERNFD 205

Query: 206 KKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGHDDGLLE------TS 265
           KKIRY+VRKEVALRMQR +GQF SSK   DE  S    S   D   + G +E        
Sbjct: 206 KKIRYSVRKEVALRMQRNRGQFTSSKPKGDEATSELTAS---DGSPNWGSVEGRPPSAAE 265

Query: 266 CTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGVQEL-SVKD 316
           C HCG ++K+TPMMRRGP GPRTLCNACGL WANKG+LRDLSK   T +Q + SV D
Sbjct: 266 CHHCGINAKATPMMRRGPDGPRTLCNACGLMWANKGMLRDLSKAPPTPIQVVASVND 313

BLAST of Cp4.1LG15g03820 vs. Swiss-Prot
Match: GAT17_ORYSJ (GATA transcription factor 17 OS=Oryza sativa subsp. japonica GN=GATA17 PE=2 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 4.7e-36
Identity = 93/162 (57.41%), Positives = 109/162 (67.28%), Query Frame = 1

Query: 147 VQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLNRFREKRKERCFEK 206
           VQAVLLLLGG E+    P  GSVP     +  +  +   P R ASL RFREKRKER F+K
Sbjct: 126 VQAVLLLLGGRELA---PGSGSVP---SSSAAYSKKMNFPHRMASLMRFREKRKERNFDK 185

Query: 207 KIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGHDDGLLE------TSC 266
           KIRYTVRKEVALRMQR +GQF SSK+ A+E  +S + S    P    G +E        C
Sbjct: 186 KIRYTVRKEVALRMQRNRGQFTSSKSKAEE-ATSVITSSEGSPNW--GAVEGRPPSAAEC 245

Query: 267 THCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSK 303
            HCG S+ STPMMRRGP GPRTLCNACGL WANKG +R+++K
Sbjct: 246 HHCGISAASTPMMRRGPDGPRTLCNACGLMWANKGTMREVTK 278

BLAST of Cp4.1LG15g03820 vs. Swiss-Prot
Match: GAT25_ARATH (GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2)

HSP 1 Score: 151.8 bits (382), Expect = 1.4e-35
Identity = 97/183 (53.01%), Positives = 109/183 (59.56%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQ-----PQRAASLNRFREKRK 205
           +V AVL LLGG    +  P V  +   Q   N  PV   Q     PQRA SL+RFR+KR 
Sbjct: 102 KVDAVLSLLGGSTELAPGPQVMELAQQQ---NHMPVVEYQSRCSLPQRAQSLDRFRKKRN 161

Query: 206 ERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGHDDGLLETSC 265
            RCFEKK+RY VR+EVALRM R KGQF SSK       S +      D   DD   E SC
Sbjct: 162 ARCFEKKVRYGVRQEVALRMARNKGQFTSSKMTDGAYNSGT----DQDSAQDDAHPEISC 221

Query: 266 THCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGVQELSVKDTEQSD 324
           THCG SSK TPMMRRGP+GPRTLCNACGL WAN+G LRDLSK +      L   D   S 
Sbjct: 222 THCGISSKCTPMMRRGPSGPRTLCNACGLFWANRGTLRDLSKKTEENQLALMKPDDGGSV 277

BLAST of Cp4.1LG15g03820 vs. TrEMBL
Match: A0A0A0K2L9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G064580 PE=4 SV=1)

HSP 1 Score: 412.1 bits (1058), Expect = 6.2e-112
Identity = 229/339 (67.55%), Positives = 258/339 (76.11%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
               NRV+D+VPS+Y+SGSDYNPLT                    +   D++        
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLT-------------------GNGGADQL-------- 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
                  + + G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+GS PVNQ
Sbjct: 121 ------TLSFRGEVYAFD---SVSPD------KVQAVLLLLGGYEIPSGIPAIGSAPVNQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           QG +GF VRSVQPQRAASL+RFREKRKERCFEKKIRY+VRKEVALRMQRKKGQFISSKA 
Sbjct: 181 QGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAI 240

Query: 241 ADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
            DEVGSSS+LSQTLD G DDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 GDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 297

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGE-ANESDAAVS 332
           KGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 301 KGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297

BLAST of Cp4.1LG15g03820 vs. TrEMBL
Match: A0A061E2X9_THECC (Zim-like 2 OS=Theobroma cacao GN=TCM_007360 PE=4 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 4.3e-73
Identity = 177/341 (51.91%), Positives = 221/341 (64.81%), Query Frame = 1

Query: 1   MPDSNFQD-AMYGSAVVNNGGRVLDSIQNRVGDEGDDITVG-----EESMDNPQMRFEDS 60
           M +SN Q  +MYGS  +N        +Q  + +E DD+  G     EES+DNPQ+ ++++
Sbjct: 1   MANSNHQPTSMYGSGAMN--------MQQNLEEEDDDVPGGTGGGGEESVDNPQIGYQET 60

Query: 61  GGMNRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKYT 120
           GG   V  ++               N G ++ +      +Y            G G+  T
Sbjct: 61  GG---VVTVM---------------NNGMEEAS---HANIY------------GQGSDLT 120

Query: 121 YMPWHA------ILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGS 180
            +P +       + ++G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+G+
Sbjct: 121 VVPGNGGSDQLTLSFQGEVYVFD---SVSPD------KVQAVLLLLGGYEIPSGIPALGT 180

Query: 181 VPVNQQGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFI 240
           VPV Q+G   FP R++QPQRAASLNRFREKRKERCF+KKIRYTVRKEVALRMQRKKGQF 
Sbjct: 181 VPVTQRGLGDFPGRAIQPQRAASLNRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQFT 240

Query: 241 SSKANADEVGS-SSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNAC 300
           SSKA +DEV S SS  S T   G D+ + ETSCTHCG SSKSTPMMRRGP GPRTLCNAC
Sbjct: 241 SSKAISDEVASASSGWSVTPGSGQDESMEETSCTHCGISSKSTPMMRRGPTGPRTLCNAC 291

Query: 301 GLKWANKGILRDLSKVSNTGVQELSVKDTEQSDGEANESDA 329
           GLKWANKG+LRDLSKVS   +Q+ S K TEQSD EAN+S+A
Sbjct: 301 GLKWANKGVLRDLSKVSTIPIQDASAKPTEQSDAEANDSEA 291

BLAST of Cp4.1LG15g03820 vs. TrEMBL
Match: A0A0B0MJ71_GOSAR (GATA transcription factor 24-like protein OS=Gossypium arboreum GN=F383_19720 PE=4 SV=1)

HSP 1 Score: 279.6 bits (714), Expect = 4.8e-72
Identity = 177/335 (52.84%), Positives = 213/335 (63.58%), Query Frame = 1

Query: 1   MPDSNFQD-AMYGSAVVNNGGRVLDSIQNRVGDEGDDITVG-----EESMDNPQMRFEDS 60
           M +SN Q  +MYGS   N    + +       +E DD+ VG     EES+DNPQ+ F+++
Sbjct: 1   MANSNHQPTSMYGSGAANMQRNIDE-------EEDDDVPVGAGGGGEESVDNPQIGFQEN 60

Query: 61  GGMNRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKYT 120
           G +  V                   N G D+ +      VY   S S      G   + T
Sbjct: 61  GAVVAVM------------------NNGMDEAS---HAHVYGQGSDSTSAPGNGGADQLT 120

Query: 121 YMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQQ 180
                 + ++G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+G+V V Q+
Sbjct: 121 ------LSFQGEVYVFD---SVSPD------KVQAVLLLLGGYEIPSGIPAMGTVSVTQR 180

Query: 181 GTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANA 240
           G N FP RS+QPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQF SSKA +
Sbjct: 181 GLNDFPGRSIQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFTSSKAIS 240

Query: 241 DEVGS-SSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
           +EV S SS  S T   G D+ + E  CTHCG SSK TPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 EEVASASSGWSGTPGSGQDENMQEVLCTHCGISSKRTPMMRRGPAGPRTLCNACGLKWAN 292

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGEANESDA 329
           KG+LRDLSKVS   + + +VK  EQSD EANES+A
Sbjct: 301 KGVLRDLSKVSTVVIPDPTVKTAEQSDAEANESEA 292

BLAST of Cp4.1LG15g03820 vs. TrEMBL
Match: A0A0D2PM93_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G017800 PE=4 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 1.4e-71
Identity = 176/336 (52.38%), Positives = 211/336 (62.80%), Query Frame = 1

Query: 1   MPDSNFQDA-MYGSAVVNNGGRVLDSIQNRVGDEGDDITVG------EESMDNPQMRFED 60
           M +SN Q   MYGS   N        +Q  + +E DD   G      EES+DNPQ+ F++
Sbjct: 1   MANSNHQRTPMYGSGAAN--------MQRNIDEEEDDDVPGGAGGGGEESVDNPQIGFQE 60

Query: 61  SGGMNRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
           +G +  V                   N G D+ +      VY   S S      G   + 
Sbjct: 61  NGAVVAVM------------------NNGMDEAS---HAHVYGQGSDSTSAPGNGGADQL 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
           T      + ++G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+G+V V Q
Sbjct: 121 T------LSFQGEVYVFD---SVSPD------KVQAVLLLLGGYEIPSGIPAMGTVSVTQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           +G + FP RS+QPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQF SSKA 
Sbjct: 181 RGLSDFPGRSIQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFTSSKAI 240

Query: 241 ADEVGS-SSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWA 300
           ++EV S SS  S T   G D+ + E  CTHCG SSK TPMMRRGPAGPRTLCNACGLKWA
Sbjct: 241 SEEVASASSGWSGTPGSGQDENIQEVLCTHCGISSKKTPMMRRGPAGPRTLCNACGLKWA 292

Query: 301 NKGILRDLSKVSNTGVQELSVKDTEQSDGEANESDA 329
           NKG+LRDLSKVS   + + +VK  EQSD EANES+A
Sbjct: 301 NKGVLRDLSKVSTVAIPDPTVKTAEQSDAEANESEA 292

BLAST of Cp4.1LG15g03820 vs. TrEMBL
Match: A0A0D2V9Q1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G055000 PE=4 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 2.9e-69
Identity = 173/330 (52.42%), Positives = 216/330 (65.45%), Query Frame = 1

Query: 1   MPDSNFQD-AMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMNR 60
           M +SN    ++YGS  +N      D  ++  G  G     GEES+DNPQ+ +++SGG   
Sbjct: 1   MENSNHHSTSLYGSGAMNMQRNPEDEEEDVPGGGGGG---GEESVDNPQIGYQESGG--G 60

Query: 61  VQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKYTYMPWH 120
           V  ++ +     S  N L G G    LT + +G      + S D++              
Sbjct: 61  VVTVMNNGMEEASHAN-LYGQGS--DLT-AVQG------NSSADQL-------------- 120

Query: 121 AILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGF 180
            + ++G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+ + P+ Q+G   F
Sbjct: 121 TLSFQGEVYVFD---SVSPD------KVQAVLLLLGGYEIPSGIPALAATPIAQRGMGDF 180

Query: 181 PVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGS 240
           P RS+QP RAASLNRFREKRKERCF+KKIRYTVRKEVALRMQRKKGQF S+KA +DEV S
Sbjct: 181 PGRSIQPHRAASLNRFREKRKERCFDKKIRYTVRKEVALRMQRKKGQFTSAKAISDEVAS 240

Query: 241 -SSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILR 300
            SS  S T   G D+ + ET C+HCG SSK TPMMRRGPAGPRTLCNACGLKWANKG+LR
Sbjct: 241 ASSGWSGTPGSGQDESMQETLCSHCGISSKKTPMMRRGPAGPRTLCNACGLKWANKGVLR 292

Query: 301 DLSKVSNTGVQELSVKDTEQSDGEANESDA 329
           DLSKVS   +Q+ +VK TEQSD EANES+A
Sbjct: 301 DLSKVSMVAIQDPTVKTTEQSDAEANESEA 292

BLAST of Cp4.1LG15g03820 vs. TAIR10
Match: AT3G21175.1 (AT3G21175.1 ZIM-like 1)

HSP 1 Score: 175.3 bits (443), Expect = 6.4e-44
Identity = 110/189 (58.20%), Positives = 130/189 (68.78%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIPA-VGSVPVNQQ--GTNGFPVRSVQPQRAASLNRFREKRKER 205
           +VQAVLLLLGG E+P  +P  +GS   N +  G +G P R   PQR ASL RFREKRK R
Sbjct: 98  KVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRLSVPQRLASLLRFREKRKGR 157

Query: 206 CFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSS-----SLLSQTLDPGHDDGLLE 265
            F+K IRYTVRKEVALRMQRKKGQF S+K++ D+ GS+     S  S  ++ G +    E
Sbjct: 158 NFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSDWGSNQSWAVE-GTETQKPE 217

Query: 266 TSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGV-QELSVKDT 325
             C HCGTS KSTPMMRRGP GPRTLCNACGL WANKG LRDLSKV      Q LS+   
Sbjct: 218 VLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRDLSKVPPPQTPQHLSLNKN 277

BLAST of Cp4.1LG15g03820 vs. TAIR10
Match: AT1G51600.1 (AT1G51600.1 ZIM-LIKE 2)

HSP 1 Score: 173.3 bits (438), Expect = 2.5e-43
Identity = 108/186 (58.06%), Positives = 123/186 (66.13%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIP-AVGSVPVNQQGTN--GFPVRSVQPQRAASLNRFREKRKER 205
           +VQAVLLLLGG E+P   P  +GS   N + ++  G P R   PQR ASL RFREKRK R
Sbjct: 102 KVQAVLLLLGGRELPQAAPPGLGSPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGR 161

Query: 206 CFEKKIRYTVRKEVALRMQRKKGQFISSKANADEV---GSSSLLSQTLDPGHDDGL-LET 265
            F+KKIRYTVRKEVALRMQR KGQF S+K+N DE    GSS   +QT      +    E 
Sbjct: 162 NFDKKIRYTVRKEVALRMQRNKGQFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQEI 221

Query: 266 SCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNTGVQELSVKDTEQ 325
           SC HCG   KSTPMMRRGPAGPRTLCNACGL WANKG  RDLSK S    Q L +   E 
Sbjct: 222 SCRHCGIGEKSTPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQTAQNLPLNKNED 281

BLAST of Cp4.1LG15g03820 vs. TAIR10
Match: AT4G24470.3 (AT4G24470.3 GATA-type zinc finger protein with TIFY domain)

HSP 1 Score: 154.1 bits (388), Expect = 1.5e-37
Identity = 93/162 (57.41%), Positives = 103/162 (63.58%), Query Frame = 1

Query: 146 EVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQ-----PQRAASLNRFREKRK 205
           +V AVL LLGG    +  P V  +   Q   N  PV   Q     PQRA SL+RFR+KR 
Sbjct: 102 KVDAVLSLLGGSTELAPGPQVMELAQQQ---NHMPVVEYQSRCSLPQRAQSLDRFRKKRN 161

Query: 206 ERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGHDDGLLETSC 265
            RCFEKK+RY VR+EVALRM R KGQF SSK       S +      D   DD   E SC
Sbjct: 162 ARCFEKKVRYGVRQEVALRMARNKGQFTSSKMTDGAYNSGT----DQDSAQDDAHPEISC 221

Query: 266 THCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSK 303
           THCG SSK TPMMRRGP+GPRTLCNACGL WAN+G LRDLSK
Sbjct: 222 THCGISSKCTPMMRRGPSGPRTLCNACGLFWANRGTLRDLSK 256

BLAST of Cp4.1LG15g03820 vs. TAIR10
Match: AT1G08010.1 (AT1G08010.1 GATA transcription factor 11)

HSP 1 Score: 53.9 bits (128), Expect = 2.2e-07
Identity = 31/90 (34.44%), Positives = 50/90 (55.56%), Query Frame = 1

Query: 243 LSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRD--- 302
           +S TL+  + DG++   CTHC T+   TP  R GP+GP+TLCNACG+++ +  ++ +   
Sbjct: 206 VSSTLEASNSDGIVR-KCTHCETTK--TPQWREGPSGPKTLCNACGVRFRSGRLVPEYRP 265

Query: 303 ---------LSKVSNTGVQELSVKDTEQSD 321
                    +   S+  + E+  KD EQ D
Sbjct: 266 ASSPTFIPAVHSNSHRKIIEMRRKDDEQFD 292

BLAST of Cp4.1LG15g03820 vs. TAIR10
Match: AT1G08000.1 (AT1G08000.1 GATA transcription factor 10)

HSP 1 Score: 53.1 bits (126), Expect = 3.7e-07
Identity = 25/62 (40.32%), Positives = 40/62 (64.52%), Query Frame = 1

Query: 244 SQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKV 303
           S TL+    DG++   CTHC T +  TP  R+GP+GP+TLCNACG+++ +  ++ +    
Sbjct: 205 SSTLESSKSDGIVRI-CTHCETIT--TPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPA 263

Query: 304 SN 306
           S+
Sbjct: 265 SS 263

BLAST of Cp4.1LG15g03820 vs. NCBI nr
Match: gi|659110318|ref|XP_008455164.1| (PREDICTED: GATA transcription factor 24-like isoform X2 [Cucumis melo])

HSP 1 Score: 418.3 bits (1074), Expect = 1.2e-113
Identity = 233/339 (68.73%), Positives = 261/339 (76.99%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMN-- 60
           MPDSNFQDAMYGS V+N+GGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM+  
Sbjct: 1   MPDSNFQDAMYGSGVMNDGGRDLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  -----RVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
                RV+D+VPS+YVSGSDYNPLT                    +   D++        
Sbjct: 61  VSVISRVEDVVPSTYVSGSDYNPLT-------------------GNGGADQL-------- 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
                  + + G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+GSVPVNQ
Sbjct: 121 ------TLSFRGEVYAFD---SVSPD------KVQAVLLLLGGYEIPSGIPAIGSVPVNQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           QG +GFPVRSVQPQRAASL+RFREKRKERCFEKKIRY+VRKEVALRMQRKKGQFISSKA 
Sbjct: 181 QGADGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAI 240

Query: 241 ADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
            DEVGSSS+LSQTLD G DDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 GDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 297

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGE-ANESDAAVS 332
           KGILRDLSKVSN  +QE S K+ EQSDGE ANES+AA++
Sbjct: 301 KGILRDLSKVSNPSIQEPSAKEIEQSDGEAANESNAAIN 297

BLAST of Cp4.1LG15g03820 vs. NCBI nr
Match: gi|659110314|ref|XP_008455162.1| (PREDICTED: GATA transcription factor 24-like isoform X1 [Cucumis melo])

HSP 1 Score: 418.3 bits (1074), Expect = 1.2e-113
Identity = 233/339 (68.73%), Positives = 261/339 (76.99%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMN-- 60
           MPDSNFQDAMYGS V+N+GGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM+  
Sbjct: 1   MPDSNFQDAMYGSGVMNDGGRDLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  -----RVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
                RV+D+VPS+YVSGSDYNPLT                    +   D++        
Sbjct: 61  VSVISRVEDVVPSTYVSGSDYNPLT-------------------GNGGADQL-------- 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
                  + + G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+GSVPVNQ
Sbjct: 121 ------TLSFRGEVYAFD---SVSPD------KVQAVLLLLGGYEIPSGIPAIGSVPVNQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           QG +GFPVRSVQPQRAASL+RFREKRKERCFEKKIRY+VRKEVALRMQRKKGQFISSKA 
Sbjct: 181 QGADGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAI 240

Query: 241 ADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
            DEVGSSS+LSQTLD G DDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 GDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 297

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGE-ANESDAAVS 332
           KGILRDLSKVSN  +QE S K+ EQSDGE ANES+AA++
Sbjct: 301 KGILRDLSKVSNPSIQEPSAKEIEQSDGEAANESNAAIN 297

BLAST of Cp4.1LG15g03820 vs. NCBI nr
Match: gi|449438218|ref|XP_004136886.1| (PREDICTED: GATA transcription factor 24 isoform X1 [Cucumis sativus])

HSP 1 Score: 412.1 bits (1058), Expect = 8.9e-112
Identity = 229/339 (67.55%), Positives = 258/339 (76.11%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
               NRV+D+VPS+Y+SGSDYNPLT                    +   D++        
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLT-------------------GNGGADQL-------- 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
                  + + G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+GS PVNQ
Sbjct: 121 ------TLSFRGEVYAFD---SVSPD------KVQAVLLLLGGYEIPSGIPAIGSAPVNQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           QG +GF VRSVQPQRAASL+RFREKRKERCFEKKIRY+VRKEVALRMQRKKGQFISSKA 
Sbjct: 181 QGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAI 240

Query: 241 ADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
            DEVGSSS+LSQTLD G DDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 GDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 297

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGE-ANESDAAVS 332
           KGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 301 KGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297

BLAST of Cp4.1LG15g03820 vs. NCBI nr
Match: gi|778724486|ref|XP_011658814.1| (PREDICTED: GATA transcription factor 24 isoform X2 [Cucumis sativus])

HSP 1 Score: 412.1 bits (1058), Expect = 8.9e-112
Identity = 229/339 (67.55%), Positives = 258/339 (76.11%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
               NRV+D+VPS+Y+SGSDYNPLT                    +   D++        
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLT-------------------GNGGADQL-------- 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
                  + + G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+GS PVNQ
Sbjct: 121 ------TLSFRGEVYAFD---SVSPD------KVQAVLLLLGGYEIPSGIPAIGSAPVNQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           QG +GF VRSVQPQRAASL+RFREKRKERCFEKKIRY+VRKEVALRMQRKKGQFISSKA 
Sbjct: 181 QGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAI 240

Query: 241 ADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
            DEVGSSS+LSQTLD G DDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 GDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 297

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGE-ANESDAAVS 332
           KGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 301 KGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297

BLAST of Cp4.1LG15g03820 vs. NCBI nr
Match: gi|700188515|gb|KGN43748.1| (hypothetical protein Csa_7G064580 [Cucumis sativus])

HSP 1 Score: 412.1 bits (1058), Expect = 8.9e-112
Identity = 229/339 (67.55%), Positives = 258/339 (76.11%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKMYEGFGAKY 120
               NRV+D+VPS+Y+SGSDYNPLT                    +   D++        
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLT-------------------GNGGADQL-------- 120

Query: 121 TYMPWHAILYEGVVLRFEYMNTITIDLGRIWCEVQAVLLLLGGYEIPSGIPAVGSVPVNQ 180
                  + + G V  F+   +++ D      +VQAVLLLLGGYEIPSGIPA+GS PVNQ
Sbjct: 121 ------TLSFRGEVYAFD---SVSPD------KVQAVLLLLGGYEIPSGIPAIGSAPVNQ 180

Query: 181 QGTNGFPVRSVQPQRAASLNRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKAN 240
           QG +GF VRSVQPQRAASL+RFREKRKERCFEKKIRY+VRKEVALRMQRKKGQFISSKA 
Sbjct: 181 QGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKEVALRMQRKKGQFISSKAI 240

Query: 241 ADEVGSSSLLSQTLDPGHDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 300
            DEVGSSS+LSQTLD G DDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN
Sbjct: 241 GDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWAN 297

Query: 301 KGILRDLSKVSNTGVQELSVKDTEQSDGE-ANESDAAVS 332
           KGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 301 KGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297
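
The percentages in an HSP summary line can be recomputed from the match counts it reports. The following is a minimal sketch, assuming the summary line is available as a plain string; the helper names `parse_hsp_fractions` and `recompute_percent` are hypothetical and are not part of any BLAST tool.

```python
import re

# Summary line for the 412.1-bit HSP reported above (label spelled "Positives").
HSP_LINE = "Identity = 229/339 (67.55%), Positives = 258/339 (76.11%), Query Frame = 1"

# Matches fields of the form "Label = matches/length (percent%)".
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FIELD.findall(line)
    }


def recompute_percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


fields = parse_hsp_fractions(HSP_LINE)
for label, (num, den, reported) in fields.items():
    # Print the recomputed value next to the reported one for comparison.
    print(label, recompute_percent(num, den), reported)
```

The "Query Frame" field is skipped because it does not carry a matches/length fraction.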

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
GAT24_ARATH  1.1e-42  58.20     GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2
GAT28_ARATH  4.3e-42  58.06     GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1
GAT20_ORYSJ  1.9e-37  55.37     GATA transcription factor 20 OS=Oryza sativa subsp. japonica GN=GATA20 PE=2 SV=1
GAT17_ORYSJ  4.7e-36  57.41     GATA transcription factor 17 OS=Oryza sativa subsp. japonica GN=GATA17 PE=2 SV=1
GAT25_ARATH  1.4e-35  53.01     GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2

Match Name        E-value   Identity  Description
A0A0A0K2L9_CUCSA  6.2e-112  67.55     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G064580 PE=4 SV=1
A0A061E2X9_THECC  4.3e-73   51.91     Zim-like 2 OS=Theobroma cacao GN=TCM_007360 PE=4 SV=1
A0A0B0MJ71_GOSAR  4.8e-72   52.84     GATA transcription factor 24-like protein OS=Gossypium arboreum GN=F383_19720 PE...
A0A0D2PM93_GOSRA  1.4e-71   52.38     Uncharacterized protein OS=Gossypium raimondii GN=B456_008G017800 PE=4 SV=1
A0A0D2V9Q1_GOSRA  2.9e-69   52.42     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G055000 PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G21175.1  6.4e-44  58.20     ZIM-like 1
AT1G51600.1  2.5e-43  58.06     ZIM-LIKE 2
AT4G24470.3  1.5e-37  57.41     GATA-type zinc finger protein with TIFY domain
AT1G08010.1  2.2e-07  34.44     GATA transcription factor 11
AT1G08000.1  3.7e-07  40.32     GATA transcription factor 10

Match Name                        E-value   Identity  Description
gi|659110318|ref|XP_008455164.1|  1.2e-113  68.73     PREDICTED: GATA transcription factor 24-like isoform X2 [Cucumis melo]
gi|659110314|ref|XP_008455162.1|  1.2e-113  68.73     PREDICTED: GATA transcription factor 24-like isoform X1 [Cucumis melo]
gi|449438218|ref|XP_004136886.1|  8.9e-112  67.55     PREDICTED: GATA transcription factor 24 isoform X1 [Cucumis sativus]
gi|778724486|ref|XP_011658814.1|  8.9e-112  67.55     PREDICTED: GATA transcription factor 24 isoform X2 [Cucumis sativus]
gi|700188515|gb|KGN43748.1|       8.9e-112  67.55     hypothetical protein Csa_7G064580 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO:0043565  sequence-specific DNA binding
GO:0008270  zinc ion binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term       Definition
IPR013088  Znf_NHR/GATA
IPR010402  CCT_domain
IPR010399  Tify_dom
IPR000679  Znf_GATA
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016765 transferase activity, transferring alkyl or aryl (other than methyl) groups

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG15g03820.1  Cp4.1LG15g03820.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description             Source   Source Term       Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type      PFAM     PF00320           GATA                                               coord: 260..295, score: 1.1
IPR000679  Zinc finger, GATA-type      SMART    SM00401           GATA_3                                             coord: 254..307, score: 3.3
IPR000679  Zinc finger, GATA-type      PROSITE  PS00344           GATA_ZN_FINGER_1                                   coord: 260..287, scor
IPR000679  Zinc finger, GATA-type      PROFILE  PS50114           GATA_ZN_FINGER_2                                   coord: 259..311, score: 9
IPR010399  Tify domain                 PFAM     PF06200           tify                                               coord: 83..104, score: 8.
IPR010399  Tify domain                 PROFILE  PS51320           TIFY                                               coord: 79..114, score: 9
IPR010402  CCT domain                  PFAM     PF06203           CCT                                                coord: 188..230, score: 3.3
IPR010402  CCT domain                  PROFILE  PS51017           CCT                                                coord: 188..230, score: 13
IPR013088  Zinc finger, NHR/GATA-type  GENE3D   G3DSA:3.30.50.10                                                     coord: 258..301, score: 9.1
None       No IPR available            PANTHER  PTHR10071         TRANSCRIPTION FACTOR GATA GATA BINDING FACTOR      coord: 14..105, score: 3.2E-100; coord: 148..305, score: 3.2E
None       No IPR available            PANTHER  PTHR10071:SF186   SUBFAMILY NOT NAMED                                coord: 14..105, score: 3.2E-100; coord: 148..305, score: 3.2E
None       No IPR available            unknown  SSF57716          Glucocorticoid receptor-like (DNA-binding domain)  coord: 257..306, score: 5.7
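
Several signatures in the InterPro table report overlapping coordinates for the same IPR term. The sketch below collapses them into one region per term; it assumes the coordinates are inclusive residue positions on the protein and leaves out the rows without an IPR term. The function names `merge_intervals` and `regions_by_term` are hypothetical.

```python
from collections import defaultdict

# (IPR term, start, end) copied from the InterPro table above.
HITS = [
    ("IPR000679", 260, 295),
    ("IPR000679", 254, 307),
    ("IPR000679", 260, 287),
    ("IPR000679", 259, 311),
    ("IPR010399", 83, 104),
    ("IPR010399", 79, 114),
    ("IPR010402", 188, 230),
    ("IPR010402", 188, 230),
    ("IPR013088", 258, 301),
]


def merge_intervals(intervals):
    """Merge inclusive intervals that overlap or are directly adjacent."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current region if this hit reaches further.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


def regions_by_term(hits):
    """Group hits by IPR term and merge the coordinates within each group."""
    grouped = defaultdict(list)
    for term, start, end in hits:
        grouped[term].append((start, end))
    return {term: merge_intervals(spans) for term, spans in grouped.items()}


regions = regions_by_term(HITS)
for term, spans in sorted(regions.items()):
    print(term, spans)
```

With these inputs the four IPR000679 hits collapse into a single region spanning 254..311, and the Tify and CCT hits reduce to 79..114 and 188..230.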

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG15g03820  Cp4.1LG04g07180  Cucurbita pepo (Zucchini)  cpecpeB269