CmaCh19G004830 (gene) Cucurbita maxima (Rimu)

Name: CmaCh19G004830
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: GATA transcription factor 28
Location: Cma_Chr19 : 5772608 .. 5777536 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGGACTCCAATTTCCAAGACGCGATGTACGGTTCCGCCGTCGTGAACAACGGTGGCCGGGTTTTGGATAGCATTCAGAACCGAGTTGGCGATGAAGGTGACGACATTACTGTGGGGGAAGAGTCCATGGACAATCCTCAGATGCGCTTCGAAGATTCCGGGGGAATGAACCGAGTCCAGGACATTGTTCCTTCATCGTATGTTTCCGGCTCCGATTACAACCCTTTGACTGGAAACGGTGGTGCTGACCAACTTACGCTGTCGTTCCGCGGAGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTTTGGTTTTCAATTGGGTGTCGATAGTCTTCGTAAATTATGCACTAGTTGTGTAATTCTTCTTCTACTTACTGAAGATATATAGTCTTTGAAGCTGGGTTTGGGTAATTTGCGAACAATATGATGAACTTTGAACAATTTGACTCAATGCTTCGTTATGACATTCAACTTCAAGCTCATTCAAATTGGGGAGGGATTTTGTAAAGTAATATTGTTGGTTTAAAGGATTTTCTGAATTGTTAGAGTGGGTATAATTTTAAAACCCTGTTTTAGTTTCCTACAGGGACCTAATTGTGTTGTGTTGCTGTATTGTTGGAACATTTGGATGATGAATGGGAAGTTTTTTCTTTTGAAATATGATTGAAAAAGAAGAAAGGAATATGATGATAAATGAACCGATTGTGCCAGTTGAACTGAATACTGCTGTTGTCAAATTAGATGTAGGAAGGATTTTGTGCGAAGTATACATATATGCCATGCCATGCCATACTATATGAAGGAGTTTTTTTGAGGTTCGAGTACATGAATACGATCACGATAGATTTAGGAAGGATTTGGTACGAAGTATACATATATGCCATGCCATACTATATGAAGGAGTTGTTTTGAGGTTCGAGTACATGAACACGAACACGAGGGGTTTTAAGTTTTTTGTGGTTAAATTTGACAGGTGCAAGCCGTGCTTTTGCTTTTAGGTGGATATGAAATTCCTTCTGGGATTCCTGCTGTTGGAAGTGTTCCCGTCAACCAACAGGTATACGTGCACATTATTATTTTAATGGTGTATTGATTGAAACCATCTGGTGTTCTTGATTTCCTACCTCGGTTAACTTTTGTTCTAATTCTTTCAGGGTACCAATGGCTTTCCTGTCAGGTCGGTTCAACCACAAAGAGCTGCTTCATTGAGTAGGTTTAGAGAGAAGAGAAAAGAAAGGTGTTTTGAGAAGAAAATTCGTTACACCGTGCGAAAAGAAGTTGCACTCAGGTATAACTTATAAATAGATACCTGTTTATTTATTTATTAGACGGTTGAAGTTAAGTCAAGGGAGCATAGAGAGATCCCTTCCTCTCCCGCTCTCACTCAGAGATCAAGTAAAAAATTACCAAAAAGATATCATGTGAGATCCCACATTGATTGGGGAGGAGAACAAAACATTCTTTATAAGGGTGTCGAAACTTTTCCCTAGCAGACGTGTTTTAAAAATTTTGAGGGGAAGCCCGAAAGGGAAAGCCCAAAGAGAACAATATCTGCTAGCGGTGGGCTTGGGCGGTTACATATCCTTTGCCAGACCCTCTCTCCTCTCTATATTCCCTCTCTGTCAGCCTTACTAACTAATATGCTTGAGTCGTGCACAAACTATGTCATCATCCTACATGTCTCACCCTTTCCTCGTTTACCAGCATACACTTCAGTGATTGGGGCCTATCAGTATATTAAGTAGAATCAACAACTAGAAAACAGCAGGGATAGGATAGTGTGAAACAGCTATATTATGATTTTGTGGATCGTCTCTGATCTTGGTTGTGGTCAAATGGGATTTTCTAAAACATTAGTTTTTGAGGCAGGTTGCATTTCAATTTTAGATCTTCAGATGAGAAAAGGTGAAGGAAGTTTCTTGTAACAACCCAAGCCCACCGCTAAATAATATTGTCCTTTTTGGGTTTTCCGTTCTGGCTTCTCCTCAAAG
TTTTAAAACGCGTCTCTTAGGGAAAAATTTCTACACTCTTGTAAATAGCGCTTTGTTCCCCTCTCCAACCAATGTAGGATCTCACAATCCACTCTCCTTAGGGGCCCAACGTTCATCGCTGGCACACTGCCCGATAACCGACTTCATGAGGCTGACGGTAATACGTAACGGGCCAAAGCGGACAATAACTGTTAGCGGTGGGCTTGGGTTGTTATTTCTAATGTGGGAAAAATGTTTTATTGATGTATCTTAGAGGTGTTATAGATGTTCGATGGGCTGCTCATAATATTTAAGTTTTTTTTACTTGAAATGAAGAAAATAAGAGCTGCTAATAGAAAGGAAATTACATATCTGAAGAAGATGGAAGTTGATTGGAATAACTTTCTTTAATCTTGTGCATATATATATATATATATATTTCTAATAACTTTCAGAAAAAGAAAGATGAATTTATAACTGTGATGATATAGTCCAAAAAGCATATCTTAGTGTACTTCGAAAGATATCTTCTAATCTATTCAATGTTGTTTCTGTTGTATGCAATTCAATGTGATATATTCGTCTCCTTCATCATCCAGAGTATATAAACTAAGTTACTTTATTGCATTATTCCATTCTTATTGTTTGTAGTGCTCAGCATTTGACAATTCAAGAATTTAAACTAGCTCTAGTCAAATGCTCAAATCATCAAAGGACAGAAAAAATCATTTTGTTCTCGAGACATGTTCTAACAACTCTTGTCTGGATTCATCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCCTCTAAAGCTAATGCAGATGAAGTGGGCTCATCTTCACTTTTGTCTCAAACATTGGATCCTGGACAAGATGATGGCTTGTTGGAAACCTCGTAAGTATTGATCGATCCTCACAAGTCCCATTTTCTAACTTTGAATGACCCTGCTGGTAGTCTGAGGCTCATCAATTTGTAACCTTACGCTTGAGTTGCGAGTTGGTTTTGCATCTAACTGGAATACCAAAATTCTAGAAGGAATAGAAAAATTATCAATTCTCATTCATACTGGAACAAATTCAATGATCTGTTTTACTAATAATGTCTTTCATTATTTGTTTATGCAGATGTACACATTGTGGAACCAGTTCAAAATCTACTCCGATGATGCGTCGTGGGCCTGCTGGCCCTAGGACTCTGTGCAATGCATGTGGGCTCAAGTGGGCCAATAAGGTTTGTTAATCGAGTCCTTTAGGTGATATAACATCGTCCATGAAATCGAGTCCTTTACAGTTGTCTTGGGCTGAAGCCTGCATAATTATTTGGAGATAAGTGATGTGAGCGAAGTATGTTGTTTAACATTTCATTATCAAAGACACCTGTGCTAGGCTTGTGGGTTTGATTTAGGAGATGCTTTCAGCTGGTTGATGATGGTTTTAAGTTAGTTTTGTAGATCATTCTTGGTTTATAGATTTTCATTGGTTAGGAAGGAATACCAATAGGAACCGAACCTCGAGGCGGTTTCTTCCTTTGTGAATCTTTTCCTGTATGCCAGTAGAAAGACTTGGATGCAATCGGTTGCCCGAGATATTTAACTTCGTTCTTGACTATTGAGTTCCACCACACCACCTCTTGCCTTAAAATCTGCAGTAGATAGCCAAGTTCATAAAGTATATCTGTTTTATCCCACAACTTTTCAAATGAGCTTTTCTATTGCGATTCACAGAGATACATTAGCTGTTGTAGCAAACTAAATGTTTGACGACTTGTCAATGGCCTCGAGTTCTCATTAGTGATCTTCGAATGCAATAGGGAAATCTGCAGTAGATAGATAAGTTCATAGAGTATGTCTGTTTTTTACCACAACTTTCGAGTGAGCTTTTCGATTGCGGTTCACAGAGATATATTAGCTGTTGTAACAAACTAAATGTTCGACGACTTATCAATGGCCTCGAGTTCTCATTAGTGAACTCTGAATGCAATAGGGAAACCTGCAGTAAATAGTTAATTCCATAGGTGTATG
TCTGTTTTTTCCCACAACTTTTCGTTTGCGATTCACCGAGATATATTACCTGTTGTAGCAAACTAAATGTTCGATGACTTGTCAATGGCCTCGAGTTCTAATTAGTGAACTTTGAATGGAATAGGGAATTTTGAGGGATCTTTCCAAGGTTTCGAACTCGGGCGTTCAAGAACTCTCCGTGAAGGATACCGAACAGGTAGACATTTAAACAGAGTTAAGAAAGACTCAATAGTTAAATAATATCATACTTCCATACATTTTAATACTGTTTTGGTGTCTGTTATTGCTATCCATCTTTTCCTATATCAGAGCGATGGCGAAGCTAATGAATCCGACGCTGCAGTTAGCTAATGTGGATATTCTCGCTTCTAATGGCGATAATTAGGCCATAAACCACAAAAATTGATCAACTAACTCTCTGATAAGTTCGTGCTGTGGTTGAAGTTGGATCATTTGAGTATTCCAACAAATTATATGGAAATTCATGCTATGCTCTGTGCAGTTTAGAGAGTATAAGAAATAAGCAATGGTAGAATGGGTTCCCCATTTGTCACCAATCATGTAGCTGAAAATAGATGGGATTCAGAAGGTGGGTTTCAATGGCGGCCTTTTGGTCGAAGTATCAAAGAACAAAAGCTCTGATCAAAGGGCTGGTGAGGTGAGTTGCTGCCTCCACTTTAGGATATGATCTTATTCTTGTTCTACAATGTTTGGTTAAGATTTGTGCAATCTTTTCCCCATGTAAATGGGCAGTTAAGTGCTTCAGATCACATTTTTTTTAGCTTATTTATGGCTTAGAAGTGAAGGTAAATAATTTGTAGTGTAATTTCTGGCCTCTTTGTTACTTTGTATTTAAGGGAATTTTGTGTTGTGTTTTGATTTTGTGAAATTTTGAAAATTATAGTATTATGATCGAGTGATCAAGTTAT

mRNA sequence

ATGCCGGACTCCAATTTCCAAGACGCGATGTACGGTTCCGCCGTCGTGAACAACGGTGGCCGGGTTTTGGATAGCATTCAGAACCGAGTTGGCGATGAAGGTGACGACATTACTGTGGGGGAAGAGTCCATGGACAATCCTCAGATGCGCTTCGAAGATTCCGGGGGAATGAACCGAGTCCAGGACATTGTTCCTTCATCGTATGTTTCCGGCTCCGATTACAACCCTTTGACTGGAAACGGTGGTGCTGACCAACTTACGCTGTCGTTCCGCGGAGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTGCAAGCCGTGCTTTTGCTTTTAGGTGGATATGAAATTCCTTCTGGGATTCCTGCTGTTGGAAGTGTTCCCGTCAACCAACAGGGTACCAATGGCTTTCCTGTCAGGTCGGTTCAACCACAAAGAGCTGCTTCATTGAGTAGGTTTAGAGAGAAGAGAAAAGAAAGGTGTTTTGAGAAGAAAATTCGTTACACCGTGCGAAAAGAAGTTGCACTCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCCTCTAAAGCTAATGCAGATGAAGTGGGCTCATCTTCACTTTTGTCTCAAACATTGGATCCTGGACAAGATGATGGCTTGTTGGAAACCTCATGTACACATTGTGGAACCAGTTCAAAATCTACTCCGATGATGCGTCGTGGGCCTGCTGGCCCTAGGACTCTGTGCAATGCATGTGGGCTCAAGTGGGCCAATAAGGGAATTTTGAGGGATCTTTCCAAGGTTTCGAACTCGGGCGTTCAAGAACTCTCCGTGAAGGATACCGAACAGAGCGATGGCGAAGCTAATGAATCCGACGCTGCAGTTAGCTAATGTGGATATTCTCGCTTCTAATGGCGATAATTAGGCCATAAACCACAAAAATTGATCAACTAACTCTCTGATAAGTTCGTGCTGTGGTTGAAGTTGGATCATTTGAGTATTCCAACAAATTATATGGAAATTCATGCTATGCTCTGTGCAGTTTAGAGAGTATAAGAAATAAGCAATGGTAGAATGGGTTCCCCATTTGTCACCAATCATGTAGCTGAAAATAGATGGGATTCAGAAGGTGGGTTTCAATGGCGGCCTTTTGGTCGAAGTATCAAAGAACAAAAGCTCTGATCAAAGGGCTGGTGAGGTGAGTTGCTGCCTCCACTTTAGGATATGATCTTATTCTTGTTCTACAATGTTTGGTTAAGATTTGTGCAATCTTTTCCCCATGTAAATGGGCAGTTAAGTGCTTCAGATCACATTTTTTTTAGCTTATTTATGGCTTAGAAGTGAAGGTAAATAATTTGTAGTGTAATTTCTGGCCTCTTTGTTACTTTGTATTTAAGGGAATTTTGTGTTGTGTTTTGATTTTGTGAAATTTTGAAAATTATAGTATTATGATCGAGTGATCAAGTTAT

Coding sequence (CDS)

ATGCCGGACTCCAATTTCCAAGACGCGATGTACGGTTCCGCCGTCGTGAACAACGGTGGCCGGGTTTTGGATAGCATTCAGAACCGAGTTGGCGATGAAGGTGACGACATTACTGTGGGGGAAGAGTCCATGGACAATCCTCAGATGCGCTTCGAAGATTCCGGGGGAATGAACCGAGTCCAGGACATTGTTCCTTCATCGTATGTTTCCGGCTCCGATTACAACCCTTTGACTGGAAACGGTGGTGCTGACCAACTTACGCTGTCGTTCCGCGGAGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTGCAAGCCGTGCTTTTGCTTTTAGGTGGATATGAAATTCCTTCTGGGATTCCTGCTGTTGGAAGTGTTCCCGTCAACCAACAGGGTACCAATGGCTTTCCTGTCAGGTCGGTTCAACCACAAAGAGCTGCTTCATTGAGTAGGTTTAGAGAGAAGAGAAAAGAAAGGTGTTTTGAGAAGAAAATTCGTTACACCGTGCGAAAAGAAGTTGCACTCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCCTCTAAAGCTAATGCAGATGAAGTGGGCTCATCTTCACTTTTGTCTCAAACATTGGATCCTGGACAAGATGATGGCTTGTTGGAAACCTCATGTACACATTGTGGAACCAGTTCAAAATCTACTCCGATGATGCGTCGTGGGCCTGCTGGCCCTAGGACTCTGTGCAATGCATGTGGGCTCAAGTGGGCCAATAAGGGAATTTTGAGGGATCTTTCCAAGGTTTCGAACTCGGGCGTTCAAGAACTCTCCGTGAAGGATACCGAACAGAGCGATGGCGAAGCTAATGAATCCGACGCTGCAGTTAGCTAA

Protein sequence

MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMNRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEANESDAAVS
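
The protein sequence above should follow from the coding sequence by codon-wise translation. The sketch below illustrates that relationship, assuming the standard nuclear genetic code; the helper name `translate` and the table-building approach are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard nuclear code applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGCCGGACTCCAATTTCCAAGACGCGATG"))  # MPDSNFQDAM
```

Passing the full CDS string to `translate` is expected to give the complete protein sequence shown above.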
BLAST of CmaCh19G004830 vs. Swiss-Prot
Match: GAT24_ARATH (GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2)

HSP 1 Score: 211.8 bits (538), Expect = 9.6e-54
Identity = 141/279 (50.54%), Positives = 175/279 (62.72%), Query Frame = 1

Query: 18  NGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMNR-VQDIVPSSYVSGSDYNP 77
           NG   +   QN +  + +D  +     +N  M     GGM+  V+  +PS   + +D   
Sbjct: 8   NGRMHIGVAQNPMHVQYEDHGLHHIDNENSMMDDHADGGMDEGVETDIPSHPGNSADNRG 67

Query: 78  LTGNGG---ADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPA-VGSVPVNQQ 137
              + G    DQLTLSF+G+VY FD VSP+KVQAVLLLLGG E+P  +P  +GS   N +
Sbjct: 68  EVVDRGIENGDQLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNR 127

Query: 138 --GTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKA 197
             G +G P R   PQR ASL RFREKRK R F+K IRYTVRKEVALRMQRKKGQF S+K+
Sbjct: 128 VLGLSGTPQRLSVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKS 187

Query: 198 NADEVGSS-----SLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNAC 257
           + D+ GS+     S  S  ++ G +    E  C HCGTS KSTPMMRRGP GPRTLCNAC
Sbjct: 188 SNDDSGSTGSDWGSNQSWAVE-GTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNAC 247

Query: 258 GLKWANKGILRDLSKVSNSGV-QELSVKDTEQSDGEANE 284
           GL WANKG LRDLSKV      Q LS+   E ++ EA++
Sbjct: 248 GLMWANKGTLRDLSKVPPPQTPQHLSLNKNEDANLEADQ 285
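
The Identity and Positives figures in each HSP header are the matching (or conservatively substituted) positions divided by the alignment length. A minimal sketch of that arithmetic follows; the function name `percent` is a hypothetical helper, not part of any BLAST tool.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


# Values taken from the GAT24_ARATH HSP header above.
print(percent(141, 279))  # identity
print(percent(175, 279))  # positives
```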

BLAST of CmaCh19G004830 vs. Swiss-Prot
Match: GAT28_ARATH (GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1)

HSP 1 Score: 208.0 bits (528), Expect = 1.4e-52
Identity = 132/241 (54.77%), Positives = 158/241 (65.56%), Query Frame = 1

Query: 53  DSGGMNR-VQDIVPSSYVSGSDYNPLTGNGGA---DQLTLSFRGEVYAFDSVSPDKVQAV 112
           ++GGM+  V+  +PS   + +D      + G+   DQLTLSF+G+VY FDSV P+KVQAV
Sbjct: 47  NAGGMSEGVETDIPSHPGNVTDNRGEVVDRGSEQGDQLTLSFQGQVYVFDSVLPEKVQAV 106

Query: 113 LLLLGGYEIPSGIP-AVGSVPVNQQGTN--GFPVRSVQPQRAASLSRFREKRKERCFEKK 172
           LLLLGG E+P   P  +GS   N + ++  G P R   PQR ASL RFREKRK R F+KK
Sbjct: 107 LLLLGGRELPQAAPPGLGSPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGRNFDKK 166

Query: 173 IRYTVRKEVALRMQRKKGQFISSKANADEV---GSSSLLSQTLDPGQDDGL-LETSCTHC 232
           IRYTVRKEVALRMQR KGQF S+K+N DE    GSS   +QT      +    E SC HC
Sbjct: 167 IRYTVRKEVALRMQRNKGQFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQEISCRHC 226

Query: 233 GTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEA 283
           G   KSTPMMRRGPAGPRTLCNACGL WANKG  RDLSK S    Q L +   E ++ E 
Sbjct: 227 GIGEKSTPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQTAQNLPLNKNEDANLET 286

BLAST of CmaCh19G004830 vs. Swiss-Prot
Match: GAT20_ORYSJ (GATA transcription factor 20 OS=Oryza sativa subsp. japonica GN=GATA20 PE=2 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 2.7e-48
Identity = 129/260 (49.62%), Positives = 155/260 (59.62%), Query Frame = 1

Query: 32  DEGDDITVGEESM--DNPQMRFEDSGGMNRVQDIVPSSYVSGSDYNP----LTGNGG--- 91
           +E D++   EE M  D      E  GG    +  VP    + +  +P    L  +G    
Sbjct: 67  EEEDELEEEEEEMEEDEDAQHHEGVGG----EVAVPMDAEAAAQLDPHGGMLAASGAVQP 126

Query: 92  --ADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVR 151
             ++QLTLSF+GEVY FDSVSPDKVQAVLLLLGG E+  G+ +  S          FP  
Sbjct: 127 MASNQLTLSFQGEVYVFDSVSPDKVQAVLLLLGGRELNPGLGSGASSSAPYSKRLNFP-- 186

Query: 152 SVQPQRAASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSL 211
                R ASL RFREKRKER F+KKIRY+VRKEVALRMQR +GQF SSK   DE  S   
Sbjct: 187 ----HRVASLMRFREKRKERNFDKKIRYSVRKEVALRMQRNRGQFTSSKPKGDEATSELT 246

Query: 212 LSQTLDPGQDDGLLE------TSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGI 271
            S   D   + G +E        C HCG ++K+TPMMRRGP GPRTLCNACGL WANKG+
Sbjct: 247 AS---DGSPNWGSVEGRPPSAAECHHCGINAKATPMMRRGPDGPRTLCNACGLMWANKGM 306

Query: 272 LRDLSKVSNSGVQEL-SVKD 274
           LRDLSK   + +Q + SV D
Sbjct: 307 LRDLSKAPPTPIQVVASVND 313

BLAST of CmaCh19G004830 vs. Swiss-Prot
Match: GAT25_ARATH (GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2)

HSP 1 Score: 187.6 bits (475), Expect = 1.9e-46
Identity = 114/205 (55.61%), Positives = 128/205 (62.44%), Query Frame = 1

Query: 82  GADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRS 141
           GA+QLT+SFRG+VY FD+V  DKV AVL LLGG    +  P V  +   Q   N  PV  
Sbjct: 80  GANQLTISFRGQVYVFDAVGADKVDAVLSLLGGSTELAPGPQVMELAQQQ---NHMPVVE 139

Query: 142 VQ-----PQRAASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVG 201
            Q     PQRA SL RFR+KR  RCFEKK+RY VR+EVALRM R KGQF SSK       
Sbjct: 140 YQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYGVRQEVALRMARNKGQFTSSKMTDGAYN 199

Query: 202 SSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILR 261
           S +      D  QDD   E SCTHCG SSK TPMMRRGP+GPRTLCNACGL WAN+G LR
Sbjct: 200 SGT----DQDSAQDDAHPEISCTHCGISSKCTPMMRRGPSGPRTLCNACGLFWANRGTLR 259

Query: 262 DLSKVSNSGVQELSVKDTEQSDGEA 282
           DLSK +      L   D   S  +A
Sbjct: 260 DLSKKTEENQLALMKPDDGGSVADA 277

BLAST of CmaCh19G004830 vs. Swiss-Prot
Match: GAT18_ORYSI (GATA transcription factor 18 OS=Oryza sativa subsp. indica GN=GATA18 PE=3 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 2.8e-45
Identity = 121/257 (47.08%), Positives = 150/257 (58.37%), Query Frame = 1

Query: 32  DEGDDITVGEESMDNPQMRFEDSGGMNRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFR 91
           D+GDD T  +E  D+ +   E+        ++ P+   +  +       G  +QLTL F+
Sbjct: 35  DDGDDGTEEDEEEDDDEEGDEE--------ELPPAEDPAAPEPVSALLPGSPNQLTLLFQ 94

Query: 92  GEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPV----RSVQPQRA 151
           GEVY F+SV+P+KVQAVLLLLG  E+P G+     V  NQ+   G+        +  +R 
Sbjct: 95  GEVYVFESVTPEKVQAVLLLLGSCEMPPGL--ANMVLPNQRENRGYDDLLQRTDIPAKRV 154

Query: 152 ASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVGSSSLLSQTLDP 211
           ASL RFREKRKER F+KKIRY VRKEVALRMQR+KGQF        E  S      +   
Sbjct: 155 ASLIRFREKRKERNFDKKIRYAVRKEVALRMQRRKGQFAGRANMEGESLSPGCELASQGS 214

Query: 212 GQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQ 271
           GQD    E+ C +CGTS K TP MRRGPAGPRTLCNACGL WANKG LR+  K       
Sbjct: 215 GQDFLSRESKCQNCGTSEKMTPAMRRGPAGPRTLCNACGLMWANKGTLRNCPKAK----V 274

Query: 272 ELSVKDTEQSDGEANES 285
           E SV  TEQS+   + S
Sbjct: 275 ESSVVATEQSNAAVSPS 277

BLAST of CmaCh19G004830 vs. TrEMBL
Match: A0A0A0K2L9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G064580 PE=4 SV=1)

HSP 1 Score: 486.9 bits (1252), Expect = 1.7e-134
Identity = 252/297 (84.85%), Positives = 270/297 (90.91%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
               NRV+D+VPS+Y+SGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKE 180
           GYEIPSGIPA+GS PVNQQG +GF VRSVQPQRAASLSRFREKRKERCFEKKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSS+LSQTLD GQDDGLLETSCTHCGTSSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGE-ANESDAAVS 290
           GPAGPRTLCNACGLKWANKGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297

BLAST of CmaCh19G004830 vs. TrEMBL
Match: W9SKI1_9ROSA (GATA transcription factor 28 OS=Morus notabilis GN=L484_014769 PE=4 SV=1)

HSP 1 Score: 365.2 bits (936), Expect = 7.6e-98
Identity = 194/294 (65.99%), Positives = 236/294 (80.27%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGG-MNR 60
           MP+ N Q +MYG A +       +    +V D+ +D+T GEES+DNPQ+RF+D+   MN 
Sbjct: 1   MPEPNQQASMYGRAAM---ATTTNMQSGQVDDDDNDVTAGEESIDNPQIRFDDAAAAMNG 60

Query: 61  VQDIVPSS--YVSG-SDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYE 120
           +QD VPS+  YV G +DY P+  NGG+DQLTLSF+GEVY FD+VSPDKVQAVLLLLGGYE
Sbjct: 61  IQD-VPSNALYVPGVADYAPVAENGGSDQLTLSFQGEVYVFDAVSPDKVQAVLLLLGGYE 120

Query: 121 IPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKEVAL 180
           IPSGIPA+G+ P+ Q+G N F  + +QPQRAASL+RFREKRKERCF+KKIRY VRKEVA+
Sbjct: 121 IPSGIPAMGATPIGQRGMNQFVAKPIQPQRAASLNRFREKRKERCFDKKIRYNVRKEVAM 180

Query: 181 RMQRKKGQFISSKANADEVGS-SSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGP 240
           RMQRKKGQF S+K +++E+GS SS+ + T   GQD+ + ETSCTHCG SSKSTPMMRRGP
Sbjct: 181 RMQRKKGQFTSAKTSSEELGSASSVWNATPGSGQDENMQETSCTHCGISSKSTPMMRRGP 240

Query: 241 AGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEANESDAAVS 290
           AGPRTLCNACGLKWANKGILRDLSKV N  VQ+ SVK+TEQSDG+AN+S A  +
Sbjct: 241 AGPRTLCNACGLKWANKGILRDLSKVLNGNVQDASVKETEQSDGDANDSAAVTT 290

BLAST of CmaCh19G004830 vs. TrEMBL
Match: A0A061E2X9_THECC (Zim-like 2 OS=Theobroma cacao GN=TCM_007360 PE=4 SV=1)

HSP 1 Score: 347.4 bits (890), Expect = 1.6e-92
Identity = 193/299 (64.55%), Positives = 230/299 (76.92%), Query Frame = 1

Query: 1   MPDSNFQD-AMYGSAVVNNGGRVLDSIQNRVGDEGDDITVG-----EESMDNPQMRFEDS 60
           M +SN Q  +MYGS  +N        +Q  + +E DD+  G     EES+DNPQ+ ++++
Sbjct: 1   MANSNHQPTSMYGSGAMN--------MQQNLEEEDDDVPGGTGGGGEESVDNPQIGYQET 60

Query: 61  GGM-----NRVQDIVPSS-YVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAV 120
           GG+     N +++   ++ Y  GSD   + GNGG+DQLTLSF+GEVY FDSVSPDKVQAV
Sbjct: 61  GGVVTVMNNGMEEASHANIYGQGSDLTVVPGNGGSDQLTLSFQGEVYVFDSVSPDKVQAV 120

Query: 121 LLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRY 180
           LLLLGGYEIPSGIPA+G+VPV Q+G   FP R++QPQRAASL+RFREKRKERCF+KKIRY
Sbjct: 121 LLLLGGYEIPSGIPALGTVPVTQRGLGDFPGRAIQPQRAASLNRFREKRKERCFDKKIRY 180

Query: 181 TVRKEVALRMQRKKGQFISSKANADEVGS-SSLLSQTLDPGQDDGLLETSCTHCGTSSKS 240
           TVRKEVALRMQRKKGQF SSKA +DEV S SS  S T   GQD+ + ETSCTHCG SSKS
Sbjct: 181 TVRKEVALRMQRKKGQFTSSKAISDEVASASSGWSVTPGSGQDESMEETSCTHCGISSKS 240

Query: 241 TPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEANESDA 287
           TPMMRRGP GPRTLCNACGLKWANKG+LRDLSKVS   +Q+ S K TEQSD EAN+S+A
Sbjct: 241 TPMMRRGPTGPRTLCNACGLKWANKGVLRDLSKVSTIPIQDASAKPTEQSDAEANDSEA 291

BLAST of CmaCh19G004830 vs. TrEMBL
Match: A0A0B0MJ71_GOSAR (GATA transcription factor 24-like protein OS=Gossypium arboreum GN=F383_19720 PE=4 SV=1)

HSP 1 Score: 339.3 bits (869), Expect = 4.5e-90
Identity = 194/299 (64.88%), Positives = 222/299 (74.25%), Query Frame = 1

Query: 1   MPDSNFQD-AMYGSAVVNNGGRVLDSIQNRVGDEGDDITVG-----EESMDNPQMRFEDS 60
           M +SN Q  +MYGS   N    + +       +E DD+ VG     EES+DNPQ+ F+++
Sbjct: 1   MANSNHQPTSMYGSGAANMQRNIDE-------EEDDDVPVGAGGGGEESVDNPQIGFQEN 60

Query: 61  GG----MNRVQDIVPSSYV--SGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAV 120
           G     MN   D    ++V   GSD     GNGGADQLTLSF+GEVY FDSVSPDKVQAV
Sbjct: 61  GAVVAVMNNGMDEASHAHVYGQGSDSTSAPGNGGADQLTLSFQGEVYVFDSVSPDKVQAV 120

Query: 121 LLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRY 180
           LLLLGGYEIPSGIPA+G+V V Q+G N FP RS+QPQRAASL+RFREKRKERCFEKKIRY
Sbjct: 121 LLLLGGYEIPSGIPAMGTVSVTQRGLNDFPGRSIQPQRAASLNRFREKRKERCFEKKIRY 180

Query: 181 TVRKEVALRMQRKKGQFISSKANADEVGS-SSLLSQTLDPGQDDGLLETSCTHCGTSSKS 240
           TVRKEVALRMQRKKGQF SSKA ++EV S SS  S T   GQD+ + E  CTHCG SSK 
Sbjct: 181 TVRKEVALRMQRKKGQFTSSKAISEEVASASSGWSGTPGSGQDENMQEVLCTHCGISSKR 240

Query: 241 TPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEANESDA 287
           TPMMRRGPAGPRTLCNACGLKWANKG+LRDLSKVS   + + +VK  EQSD EANES+A
Sbjct: 241 TPMMRRGPAGPRTLCNACGLKWANKGVLRDLSKVSTVVIPDPTVKTAEQSDAEANESEA 292

BLAST of CmaCh19G004830 vs. TrEMBL
Match: A0A0D2PM93_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G017800 PE=4 SV=1)

HSP 1 Score: 337.8 bits (865), Expect = 1.3e-89
Identity = 193/300 (64.33%), Positives = 220/300 (73.33%), Query Frame = 1

Query: 1   MPDSNFQDA-MYGSAVVNNGGRVLDSIQNRVGDEGDDITVG------EESMDNPQMRFED 60
           M +SN Q   MYGS   N        +Q  + +E DD   G      EES+DNPQ+ F++
Sbjct: 1   MANSNHQRTPMYGSGAAN--------MQRNIDEEEDDDVPGGAGGGGEESVDNPQIGFQE 60

Query: 61  SGG----MNRVQDIVPSSYV--SGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQA 120
           +G     MN   D    ++V   GSD     GNGGADQLTLSF+GEVY FDSVSPDKVQA
Sbjct: 61  NGAVVAVMNNGMDEASHAHVYGQGSDSTSAPGNGGADQLTLSFQGEVYVFDSVSPDKVQA 120

Query: 121 VLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIR 180
           VLLLLGGYEIPSGIPA+G+V V Q+G + FP RS+QPQRAASL+RFREKRKERCFEKKIR
Sbjct: 121 VLLLLGGYEIPSGIPAMGTVSVTQRGLSDFPGRSIQPQRAASLNRFREKRKERCFEKKIR 180

Query: 181 YTVRKEVALRMQRKKGQFISSKANADEVGS-SSLLSQTLDPGQDDGLLETSCTHCGTSSK 240
           YTVRKEVALRMQRKKGQF SSKA ++EV S SS  S T   GQD+ + E  CTHCG SSK
Sbjct: 181 YTVRKEVALRMQRKKGQFTSSKAISEEVASASSGWSGTPGSGQDENIQEVLCTHCGISSK 240

Query: 241 STPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEANESDA 287
            TPMMRRGPAGPRTLCNACGLKWANKG+LRDLSKVS   + + +VK  EQSD EANES+A
Sbjct: 241 KTPMMRRGPAGPRTLCNACGLKWANKGVLRDLSKVSTVAIPDPTVKTAEQSDAEANESEA 292

BLAST of CmaCh19G004830 vs. TAIR10
Match: AT3G21175.1 (AT3G21175.1 ZIM-like 1)

HSP 1 Score: 211.8 bits (538), Expect = 5.4e-55
Identity = 141/279 (50.54%), Positives = 175/279 (62.72%), Query Frame = 1

Query: 18  NGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMNR-VQDIVPSSYVSGSDYNP 77
           NG   +   QN +  + +D  +     +N  M     GGM+  V+  +PS   + +D   
Sbjct: 8   NGRMHIGVAQNPMHVQYEDHGLHHIDNENSMMDDHADGGMDEGVETDIPSHPGNSADNRG 67

Query: 78  LTGNGG---ADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPA-VGSVPVNQQ 137
              + G    DQLTLSF+G+VY FD VSP+KVQAVLLLLGG E+P  +P  +GS   N +
Sbjct: 68  EVVDRGIENGDQLTLSFQGQVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNR 127

Query: 138 --GTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKA 197
             G +G P R   PQR ASL RFREKRK R F+K IRYTVRKEVALRMQRKKGQF S+K+
Sbjct: 128 VLGLSGTPQRLSVPQRLASLLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKS 187

Query: 198 NADEVGSS-----SLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNAC 257
           + D+ GS+     S  S  ++ G +    E  C HCGTS KSTPMMRRGP GPRTLCNAC
Sbjct: 188 SNDDSGSTGSDWGSNQSWAVE-GTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNAC 247

Query: 258 GLKWANKGILRDLSKVSNSGV-QELSVKDTEQSDGEANE 284
           GL WANKG LRDLSKV      Q LS+   E ++ EA++
Sbjct: 248 GLMWANKGTLRDLSKVPPPQTPQHLSLNKNEDANLEADQ 285

BLAST of CmaCh19G004830 vs. TAIR10
Match: AT1G51600.1 (AT1G51600.1 ZIM-LIKE 2)

HSP 1 Score: 208.0 bits (528), Expect = 7.8e-54
Identity = 132/241 (54.77%), Positives = 158/241 (65.56%), Query Frame = 1

Query: 53  DSGGMNR-VQDIVPSSYVSGSDYNPLTGNGGA---DQLTLSFRGEVYAFDSVSPDKVQAV 112
           ++GGM+  V+  +PS   + +D      + G+   DQLTLSF+G+VY FDSV P+KVQAV
Sbjct: 47  NAGGMSEGVETDIPSHPGNVTDNRGEVVDRGSEQGDQLTLSFQGQVYVFDSVLPEKVQAV 106

Query: 113 LLLLGGYEIPSGIP-AVGSVPVNQQGTN--GFPVRSVQPQRAASLSRFREKRKERCFEKK 172
           LLLLGG E+P   P  +GS   N + ++  G P R   PQR ASL RFREKRK R F+KK
Sbjct: 107 LLLLGGRELPQAAPPGLGSPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGRNFDKK 166

Query: 173 IRYTVRKEVALRMQRKKGQFISSKANADEV---GSSSLLSQTLDPGQDDGL-LETSCTHC 232
           IRYTVRKEVALRMQR KGQF S+K+N DE    GSS   +QT      +    E SC HC
Sbjct: 167 IRYTVRKEVALRMQRNKGQFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQEISCRHC 226

Query: 233 GTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGEA 283
           G   KSTPMMRRGPAGPRTLCNACGL WANKG  RDLSK S    Q L +   E ++ E 
Sbjct: 227 GIGEKSTPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQTAQNLPLNKNEDANLET 286

BLAST of CmaCh19G004830 vs. TAIR10
Match: AT4G24470.3 (AT4G24470.3 GATA-type zinc finger protein with TIFY domain)

HSP 1 Score: 189.5 bits (480), Expect = 2.9e-48
Identity = 110/184 (59.78%), Positives = 122/184 (66.30%), Query Frame = 1

Query: 82  GADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEIPSGIPAVGSVPVNQQGTNGFPVRS 141
           GA+QLT+SFRG+VY FD+V  DKV AVL LLGG    +  P V  +   Q   N  PV  
Sbjct: 80  GANQLTISFRGQVYVFDAVGADKVDAVLSLLGGSTELAPGPQVMELAQQQ---NHMPVVE 139

Query: 142 VQ-----PQRAASLSRFREKRKERCFEKKIRYTVRKEVALRMQRKKGQFISSKANADEVG 201
            Q     PQRA SL RFR+KR  RCFEKK+RY VR+EVALRM R KGQF SSK       
Sbjct: 140 YQSRCSLPQRAQSLDRFRKKRNARCFEKKVRYGVRQEVALRMARNKGQFTSSKMTDGAYN 199

Query: 202 SSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILR 261
           S +      D  QDD   E SCTHCG SSK TPMMRRGP+GPRTLCNACGL WAN+G LR
Sbjct: 200 SGT----DQDSAQDDAHPEISCTHCGISSKCTPMMRRGPSGPRTLCNACGLFWANRGTLR 256

BLAST of CmaCh19G004830 vs. TAIR10
Match: AT1G08000.1 (AT1G08000.1 GATA transcription factor 10)

HSP 1 Score: 53.9 bits (128), Expect = 1.9e-07
Identity = 25/62 (40.32%), Positives = 41/62 (66.13%), Query Frame = 1

Query: 202 SQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKV 261
           S TL+  + DG++   CTHC T +  TP  R+GP+GP+TLCNACG+++ +  ++ +    
Sbjct: 205 SSTLESSKSDGIVRI-CTHCETIT--TPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPA 263

Query: 262 SN 264
           S+
Sbjct: 265 SS 263

BLAST of CmaCh19G004830 vs. TAIR10
Match: AT1G08010.1 (AT1G08010.1 GATA transcription factor 11)

HSP 1 Score: 53.1 bits (126), Expect = 3.2e-07
Identity = 31/90 (34.44%), Positives = 49/90 (54.44%), Query Frame = 1

Query: 201 LSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRD--- 260
           +S TL+    DG++   CTHC T+   TP  R GP+GP+TLCNACG+++ +  ++ +   
Sbjct: 206 VSSTLEASNSDGIVR-KCTHCETTK--TPQWREGPSGPKTLCNACGVRFRSGRLVPEYRP 265

Query: 261 ---------LSKVSNSGVQELSVKDTEQSD 279
                    +   S+  + E+  KD EQ D
Sbjct: 266 ASSPTFIPAVHSNSHRKIIEMRRKDDEQFD 292

BLAST of CmaCh19G004830 vs. NCBI nr
Match: gi|659110314|ref|XP_008455162.1| (PREDICTED: GATA transcription factor 24-like isoform X1 [Cucumis melo])

HSP 1 Score: 493.0 bits (1268), Expect = 3.5e-136
Identity = 256/297 (86.20%), Positives = 273/297 (91.92%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMN-- 60
           MPDSNFQDAMYGS V+N+GGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM+  
Sbjct: 1   MPDSNFQDAMYGSGVMNDGGRDLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  -----RVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
                RV+D+VPS+YVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVISRVEDVVPSTYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKE 180
           GYEIPSGIPA+GSVPVNQQG +GFPVRSVQPQRAASLSRFREKRKERCFEKKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSVPVNQQGADGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSS+LSQTLD GQDDGLLETSCTHCGTSSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGE-ANESDAAVS 290
           GPAGPRTLCNACGLKWANKGILRDLSKVSN  +QE S K+ EQSDGE ANES+AA++
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANESNAAIN 297

BLAST of CmaCh19G004830 vs. NCBI nr
Match: gi|659110318|ref|XP_008455164.1| (PREDICTED: GATA transcription factor 24-like isoform X2 [Cucumis melo])

HSP 1 Score: 493.0 bits (1268), Expect = 3.5e-136
Identity = 256/297 (86.20%), Positives = 273/297 (91.92%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGMN-- 60
           MPDSNFQDAMYGS V+N+GGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM+  
Sbjct: 1   MPDSNFQDAMYGSGVMNDGGRDLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  -----RVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
                RV+D+VPS+YVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVISRVEDVVPSTYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKE 180
           GYEIPSGIPA+GSVPVNQQG +GFPVRSVQPQRAASLSRFREKRKERCFEKKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSVPVNQQGADGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSS+LSQTLD GQDDGLLETSCTHCGTSSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGE-ANESDAAVS 290
           GPAGPRTLCNACGLKWANKGILRDLSKVSN  +QE S K+ EQSDGE ANES+AA++
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANESNAAIN 297

BLAST of CmaCh19G004830 vs. NCBI nr
Match: gi|449438218|ref|XP_004136886.1| (PREDICTED: GATA transcription factor 24 isoform X1 [Cucumis sativus])

HSP 1 Score: 486.9 bits (1252), Expect = 2.5e-134
Identity = 252/297 (84.85%), Positives = 270/297 (90.91%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
               NRV+D+VPS+Y+SGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKE 180
           GYEIPSGIPA+GS PVNQQG +GF VRSVQPQRAASLSRFREKRKERCFEKKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSS+LSQTLD GQDDGLLETSCTHCGTSSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGE-ANESDAAVS 290
           GPAGPRTLCNACGLKWANKGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297

BLAST of CmaCh19G004830 vs. NCBI nr
Match: gi|778724486|ref|XP_011658814.1| (PREDICTED: GATA transcription factor 24 isoform X2 [Cucumis sativus])

HSP 1 Score: 486.9 bits (1252), Expect = 2.5e-134
Identity = 252/297 (84.85%), Positives = 270/297 (90.91%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
               NRV+D+VPS+Y+SGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKE 180
           GYEIPSGIPA+GS PVNQQG +GF VRSVQPQRAASLSRFREKRKERCFEKKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSS+LSQTLD GQDDGLLETSCTHCGTSSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGE-ANESDAAVS 290
           GPAGPRTLCNACGLKWANKGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297

BLAST of CmaCh19G004830 vs. NCBI nr
Match: gi|700188515|gb|KGN43748.1| (hypothetical protein Csa_7G064580 [Cucumis sativus])

HSP 1 Score: 486.9 bits (1252), Expect = 2.5e-134
Identity = 252/297 (84.85%), Positives = 270/297 (90.91%), Query Frame = 1

Query: 1   MPDSNFQDAMYGSAVVNNGGRVLDSIQNRVGDEGDDITVGEESMDNPQMRFEDSGGM--- 60
           MPDSNF+DAMYGS V++NGGR L +IQNRV DE DDI  GEES+DNPQMRFEDSGGM   
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  ----NRVQDIVPSSYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
               NRV+D+VPS+Y+SGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEIPSGIPAVGSVPVNQQGTNGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYTVRKE 180
           GYEIPSGIPA+GS PVNQQG +GF VRSVQPQRAASLSRFREKRKERCFEKKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKANADEVGSSSLLSQTLDPGQDDGLLETSCTHCGTSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSS+LSQTLD GQDDGLLETSCTHCGTSSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNSGVQELSVKDTEQSDGE-ANESDAAVS 290
           GPAGPRTLCNACGLKWANKGILRDLSKVSN  +QE S K+ EQSDGE ANE +AA++
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAIN 297
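
The percentages in the HSP header follow directly from the reported counts. As a minimal sketch (the helper name `percent` is illustrative and not part of the BLAST output), the identity and positives values for this hit can be recomputed from the 252/297 and 270/297 counts:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP header above (alignment length: 297 columns).
identity = percent(252, 297)
positives = percent(270, 297)
print(f"Identity = {identity}%, Positives = {positives}%")
```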

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
GAT24_ARATH  9.6e-54  50.54     GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2
GAT28_ARATH  1.4e-52  54.77     GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1
GAT20_ORYSJ  2.7e-48  49.62     GATA transcription factor 20 OS=Oryza sativa subsp. japonica GN=GATA20 PE=2 SV=1
GAT25_ARATH  1.9e-46  55.61     GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2
GAT18_ORYSI  2.8e-45  47.08     GATA transcription factor 18 OS=Oryza sativa subsp. indica GN=GATA18 PE=3 SV=1

Match Name        E-value   Identity  Description
A0A0A0K2L9_CUCSA  1.7e-134  84.85     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G064580 PE=4 SV=1
W9SKI1_9ROSA      7.6e-98   65.99     GATA transcription factor 28 OS=Morus notabilis GN=L484_014769 PE=4 SV=1
A0A061E2X9_THECC  1.6e-92   64.55     Zim-like 2 OS=Theobroma cacao GN=TCM_007360 PE=4 SV=1
A0A0B0MJ71_GOSAR  4.5e-90   64.88     GATA transcription factor 24-like protein OS=Gossypium arboreum GN=F383_19720 PE...
A0A0D2PM93_GOSRA  1.3e-89   64.33     Uncharacterized protein OS=Gossypium raimondii GN=B456_008G017800 PE=4 SV=1

Match Name   E-value  Identity  Description
AT3G21175.1  5.4e-55  50.54     ZIM-like 1
AT1G51600.1  7.8e-54  54.77     ZIM-LIKE 2
AT4G24470.3  2.9e-48  59.78     GATA-type zinc finger protein with TIFY domain
AT1G08000.1  1.9e-07  40.32     GATA transcription factor 10
AT1G08010.1  3.2e-07  34.44     GATA transcription factor 11

Match Name                        E-value   Identity  Description
gi|659110314|ref|XP_008455162.1|  3.5e-136  86.20     PREDICTED: GATA transcription factor 24-like isoform X1 [Cucumis melo]
gi|659110318|ref|XP_008455164.1|  3.5e-136  86.20     PREDICTED: GATA transcription factor 24-like isoform X2 [Cucumis melo]
gi|449438218|ref|XP_004136886.1|  2.5e-134  84.85     PREDICTED: GATA transcription factor 24 isoform X1 [Cucumis sativus]
gi|778724486|ref|XP_011658814.1|  2.5e-134  84.85     PREDICTED: GATA transcription factor 24 isoform X2 [Cucumis sativus]
gi|700188515|gb|KGN43748.1|       2.5e-134  84.85     hypothetical protein Csa_7G064580 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000679  Znf_GATA
IPR010399  Tify_dom
IPR010402  CCT_domain
IPR013088  Znf_NHR/GATA
Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0008270  zinc ion binding
GO:0043565  sequence-specific DNA binding
GO:0005515  protein binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005634      nucleus
molecular_function  GO:0005515      protein binding
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh19G004830.1  CmaCh19G004830.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description             Source   Source Term       Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type      PFAM     PF00320           GATA                                               coord: 218..253; score: 9.0
IPR000679  Zinc finger, GATA-type      SMART    SM00401           GATA_3                                             coord: 212..265; score: 2.7
IPR000679  Zinc finger, GATA-type      PROSITE  PS00344           GATA_ZN_FINGER_1                                   coord: 218..245; scor
IPR000679  Zinc finger, GATA-type      PROFILE  PS50114           GATA_ZN_FINGER_2                                   coord: 217..269; score: 9
IPR010399  Tify domain                 PFAM     PF06200           tify                                               coord: 83..113; score: 1.1
IPR010399  Tify domain                 SMART    SM00979           tify_2                                             coord: 79..114; score: 5.
IPR010399  Tify domain                 PROFILE  PS51320           TIFY                                               coord: 79..114; score: 13
IPR010402  CCT domain                  PFAM     PF06203           CCT                                                coord: 146..188; score: 3.2
IPR010402  CCT domain                  PROFILE  PS51017           CCT                                                coord: 146..188; score: 13
IPR013088  Zinc finger, NHR/GATA-type  GENE3D   G3DSA:3.30.50.10  -                                                  coord: 216..259; score: 7.5
None       No IPR available            PANTHER  PTHR10071         TRANSCRIPTION FACTOR GATA GATA BINDING FACTOR      coord: 14..264; score: 3.3E
None       No IPR available            PANTHER  PTHR10071:SF186   SUBFAMILY NOT NAMED                                coord: 14..264; score: 3.3E
None       No IPR available            unknown  SSF57716          Glucocorticoid receptor-like (DNA-binding domain)  coord: 215..264; score: 3.61
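
The domain hits above can be placed along the protein from their coordinates. The following is a rough sketch, assuming the coordinates are 1-based, inclusive residue positions (an assumption, not stated on this page); `parse_coord` and `pfam_hits` are illustrative names, and the values are copied from the three Pfam rows of the table:

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a string such as 'coord: 218..253'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinates in {text!r}")
    return int(m.group(1)), int(m.group(2))


# Pfam rows only, copied from the InterPro annotation table above.
pfam_hits = {
    "GATA (PF00320)": "coord: 218..253",
    "tify (PF06200)": "coord: 83..113",
    "CCT (PF06203)": "coord: 146..188",
}

# Sort by start position; length assumes inclusive coordinates.
domains = sorted(parse_coord(c) + (name,) for name, c in pfam_hits.items())
for start, end, name in domains:
    print(f"{name}: {start}-{end} ({end - start + 1} aa)")
```

With these values the hits are listed in the order tify, CCT, GATA, and the three Pfam intervals do not overlap.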

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh19G004830  CmaCh11G016740  Cucurbita maxima (Rimu)  cmacmaB149