CmoCh11G014550 (gene) Cucurbita moschata (Rifu)

Name: CmoCh11G014550
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: GATA transcription factor 28
Location: Cmo_Chr11 : 10243116 .. 10247757 (-)
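
The length of the annotated span follows from the Location field. The sketch below is a minimal illustration in Python, assuming the coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page); `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cmo_Chr11 : 10243116 .. 10247757 (-)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature covers 4642 bp on the minus strand of Cmo_Chr11.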
The following sequences are available for this feature:

Gene sequence (with intron)

TTTTTTTTTTTTTTTTTTTTTTTGCTTCGGATTTTTTCGTTTCTTGTTAAATTAGCGTGTTCTTGGCCTATCTATGTGATTCTACAAGAAATGCCGGACTCCAATTTCCAAGACGTGATGTACGGTTCCGGAGTCATGAACAACGTTGCCCAGGATTTGGATAACGTTCAGACTCAAGTTGGCGATGAAGACGACAACGTTACCGGTCGGGAAGAGTCCATTGACAACCCTCAGATGCGCTTTGAAGACTCCGGGGGGATGAGTGGCTCGGTCTCAGCGATGAACCGAGTACAGGATATTGTTCCTTCGGCGTATATTTCCAGCTCTGATTACAACCCTTTGATTGGAAACGGTGGTGCCGATCAGCTCACGCTGTCATTCAGAGGGGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTTTGTTTTGAATTGGGTGTTGATGATGACCTTTGTAAATTGTCCATTACTTATGCAATTATTCTGCTACTTGCTGAAGATTTATACCATTTGAAGTTGAACTACTTGATCTAGCCAAGCCTAGTGCTTCTTTATGGTATTCAATTCCAAGCTCGTTTTGAGATTGAGATTCTTTTTGTAACTTACTCAAATTAGGGAGCGTTTATGTAAGATAAATCATTGTTGGGTTACAAGTTTTAGACTATTTTAGTTTCCTAGTGGGACTTTTTGATGATGAATGGATTGTTGTTTCTTGTAAAAGATTATCGAAAAAGAAGAAATATGATGACGAATGAACAAAATTGTGCAAGTTGAATCGAAACCAAGACTGGTTTTTCGAATTAGATTTAGGAAGCATTTGGTTCGATGTTCAAATACATGGATACGAACTCTAGGGGTTTAAGTTTTTTGTGGTTAAATTTGACAGGTGCAAGCCGTACTTTTGCTTTTAGGTGGATATGAAGTTCCTTCTGGTATTCCTGCTGTTGGAAGTATTCCTGTCAATCAACAGGTATCCATACAAACTATCATTTGAATGGTTTACCTTACAGATATAGGTTTAAATCATCTGGTGCTGTTGATTTTCTAACTCACTTCTGCTGTAATCCTGCAGGGTGCTGATGGCTTTCCTGTCAGGTCGGCGCAACCGCAAAGAGCTGCTTCATTGAGTAGGTTTCGAGAGAAGAGGAAAGAGAGGTGTTTCGGGAAGAAAATCCGTTACAATGTGCGAAAAGAAGTTGCACTCAGGTATATGATAGATACCAGTTTATATATATGTTAGAAGGTACAGTTCAAGTCAAGAGACCGTAGAGAGGTCCCTCTCTATCTCCTCTCTATATTCCCTCTCTGTCAACTGTGTAACTAACTAATATGCTTGGGTCCGACACAAATACATCATCCCACACCGCACTCTCCCTTCCTGTTTACGCGCACGCACCTTAGTCATTGGGGGTCTGTCGGTATATCAAGTAGAATTGAGTATTGAAAAATAGTAGGGATTAGTATAACACAGCTATTTCATGGTTACTTGGAAAGGAAATTACAATATTTGAGTAAGATAGAAGTTCATCGGAATAACTTATGATTTCTTTGATCTTGTGAAAACAAATTTATATTTCTAATAACTTTCCGAAAAGGAAAGAAGAGAAATTTATAATGGTAACAGTCAAAGCCCACTGTTAGCAGATATTGTCCTCTTTGGAATTTCTTTTTCGGACTTTTCCTCAATGTTTTTAAAACGCGTCTTCTAGGGAGAGGTTTCCACACCCTTGTAAAGAATGCTTCGTTCTCCTCCTCAACCGATGTAGTATCTCATAATTCATCCCCCTTCAAGGCTCAGTGTCCTCGCTAGCACTCGCTACCTTCTCCAATCGATGTAAGACTCCCTCAATTCACCCTCCTTTGAGGCCCACGTTCTTGCTGGCACACTGTCTCGTGTCCACCCCTTCTCGGGGCTCGGCTTCCTTGTTGGTATATCGCTTGATATCTGGCTCTAATACCATTTGTAACAGCCAAAACCTACCGGTAACAGAAATTGT
CCTCTTTGGGCTTTCCCTTTCAGGTTTTCCCTCAAGGTTTTTAAAACGAGTTTGCTAGGGAGAGGTTTTCACACCCTTATAAAGAATGCTTCGTTCTCCTCTCCAACCGATGTGAGATCTCACAATAGCTGTTATGATGTATTGCAGAAAGCATATCTTAGTGTACTTTTAAAGATACATTTTCATTATTAAATCAATGCTGTTTCTGTTGTATGCAATTCAATGTGATATTCCTTCATCTCCTTCTTTATCCAGAGCACATATACTAAATTACTTTGTTGCATTATTCCCATCTTATTGTCTGTTGGGCTCAGCGTATGACAGTTAAGAGTTTAAGATCTAATCATGGAAGGAGAGAAAAGATGGTTGTGTTCTTGAGACTTGTTCTAACAACTTGTCTGGACTCAACAGAATGCAGCGGAAGAAGGGCCAGTTTATATCATCTAAAGCTAGTGCAGATGAAGTGGGCTCATCTTCAGTCTTGGCTCTAACTTTGGATTCTGGTCAAGATGATGGCTTGTTGGAAACTTCGTAAGTATTGTCGATTATCACGAGTCCCATTTTCTAACTTTCGATAACCCTGCTGGTAGTCTGAGGATCATCTTTGTGTCACCTTGCCGGTTTGTTTCGCATCCAACTGGAATGTCAAGAAGCTCAATTCTCATTCAGAAAGGAAATTGTAAATCCGCTAATTTTTTTACGATATTGTCTTCCATGATTCTTATGCAGATGCACACATTGTGGAGTCAGTTCAAAATCTACCCCAATGATGCGTCGAGGGCCAGCAGGCCCGAGGACTCTGTGCAATGCATGTGGGCTGAAGTGGGCCAATAAGGTTTGTTTATTAATCGAAATTTAAGAGTTCTTTAGGTGATATAACATCATCCACGAAGTGATTTGCAGTATCTTGGGGTGAAATCTGCATGATATGCATCTGTGTTCAACATTTCATTGTCAAATGTGCTAGATTTGTGGTTTTCATCTAGAAGATGCTTTCATTTAGTTGATAAAGGTTTTGACTTCCAAGTTAGTTTAGCAGATCAGTCTTTGTTAATAGCTTTTCATTGGTTAGGACGGAAAACTTGTTTGAACCAGAGCATTCGGGTACGGTGGATTGCGTTTTGTTGTAGCATTAAAATTTTCTGGGCGGTTTGCATATCCGTGCTGTGTTTTAGTTGTAAGATTTTCATACACTCAGTTTGTTCTTGTCTATAGAGTTCCACCAAACCACCTTTCGCCACGAAATGTCTGTTTTTCCCCCGATTTTTTCGCGTGAGCTTTTCAATGGTGTTTCTTAGAGTCATATTGGTTCTGTTCTTTGTAGTAAACTAAATGTTCCATGGCATGTCAATGGATGAGCTTTACTGCTCGAATTCTCATTAGTGAACTCTGAATGCAATAGGGAATTTTGAGGGATCTTTCCAAGGTTTCAACCGCTGGCATTCAGGAGCCCTCTGTGAAGGAGACTGAACAGGTACACGTTCAAATCAAGTTGAGAAAGAATCAATAATTAGATACGAACAGCTACTCTAAAGAATCATCAACGATCAAAATACTGTTTTTATTTCCTTCTCTTTTTCTTATCCTGTGAACTGCTTTAGAAGAAATCATCTCGATCCCCATTGAACAACAAACTTCTATACGAACAGTCTAGGGGTTAGGAACGTTGTTTTTGTTTGACTTGAAAACGTTAGCCTCTAATCGTAAGTTTCCGTTTTCTCTTGTATCAAAGTAGAACATCGTGTTTGTTTGACGTGAAAATGTTAGTCTCTGATCGTAAACCTTCCATTTCTTCTTTTATCAGAGAGGAATGTCGTGTTTGTTTGACGTGAAAATGTTAGCATCTGATCGTAAACTTTCCATTTTTCTTGTATCAGAACGGATTGTCATATTTGTTTGACGTGAAAATGTTAGCCTCTGATCGTAAACTTTCTATTTTTTCTTGTATTAGAGCGAAGTGTCATCTTTGTTTGACATAAAAATATTAGCCTCTGATCGTG
AACTTTCCATTTTTTCTTGTACCAGAGCAGAACATCATGTTTGTTTGACGTGAAAATATTAGCCTCTAATCGTAAACTTTCCATTTTTTCTTGTATCACAGCGGTGTTCGTTTGACGTGAAAACGTTAGCCTCTGATCTTAAACTTTCCATTTTTTCTTGTAACAGAGTGACGGTGAAGCTAACGAGTCCGATGCTGCTATAAACGTGGATGCCCTCGCTTCTAATGGTGGTAAATAGGGCGATAAACCAGAGAAGGTAGATGTACAAAAGTTGAAGAAGGAGCCACCAATTTGCAACCAATCTTATATCCATAGGTGTAAATATAGATGAGATTTAGAAGGTTGGTGCTCAAAATTCTTGCAGTTTTCAATGGTGGGATCAAAGGCTCTCATAAGTTGATTACTAAGCTATCTTATTTATTCTCTACAATGTTTGGTTTAGGATGTAATCTTCCATCCCTCTCTGTAAATGGGCGATTAAATGCTTCACATTCATCACACTCCTTTCATTTTTTGTTTTTCTTCTTATTAGCTTACGAATAAATTAGATAATAGTGTAATTTCTGGCCTCCAACCAGGGGTATTTTAGTTATTTAAGGGTAAATCCGTAATTTCCTCTTCTTGCTTCTCATCCAATTATAA

mRNA sequence

TTTTTTTTTTTTTTTTTTTTTTTGCTTCGGATTTTTTCGTTTCTTGTTAAATTAGCGTGTTCTTGGCCTATCTATGTGATTCTACAAGAAATGCCGGACTCCAATTTCCAAGACGTGATGTACGGTTCCGGAGTCATGAACAACGTTGCCCAGGATTTGGATAACGTTCAGACTCAAGTTGGCGATGAAGACGACAACGTTACCGGTCGGGAAGAGTCCATTGACAACCCTCAGATGCGCTTTGAAGACTCCGGGGGGATGAGTGGCTCGGTCTCAGCGATGAACCGAGTACAGGATATTGTTCCTTCGGCGTATATTTCCAGCTCTGATTACAACCCTTTGATTGGAAACGGTGGTGCCGATCAGCTCACGCTGTCATTCAGAGGGGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTGCAAGCCGTACTTTTGCTTTTAGGTGGATATGAAGTTCCTTCTGGTATTCCTGCTGTTGGAAGTATTCCTGTCAATCAACAGGGTGCTGATGGCTTTCCTGTCAGGTCGGCGCAACCGCAAAGAGCTGCTTCATTGAGTAGGTTTCGAGAGAAGAGGAAAGAGAGGTGTTTCGGGAAGAAAATCCGTTACAATGTGCGAAAAGAAGTTGCACTCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCATCTAAAGCTAGTGCAGATGAAGTGGGCTCATCTTCAGTCTTGGCTCTAACTTTGGATTCTGGTCAAGATGATGGCTTGTTGGAAACTTCATGCACACATTGTGGAGTCAGTTCAAAATCTACCCCAATGATGCGTCGAGGGCCAGCAGGCCCGAGGACTCTGTGCAATGCATGTGGGCTGAAGTGGGCCAATAAGGGAATTTTGAGGGATCTTTCCAAGGTTTCAACCGCTGGCATTCAGGAGCCCTCTGTGAAGGAGACTGAACAGAGTGACGGTGAAGCTAACGAGTCCGATGCTGCTATAAACGTGGATGCCCTCGCTTCTAATGGTGGTAAATAGGGCGATAAACCAGAGAAGGTAGATGTACAAAAGTTGAAGAAGGAGCCACCAATTTGCAACCAATCTTATATCCATAGGTGTAAATATAGATGAGATTTAGAAGGTTGGTGCTCAAAATTCTTGCAGTTTTCAATGGTGGGATCAAAGGCTCTCATAAGTTGATTACTAAGCTATCTTATTTATTCTCTACAATGTTTGGTTTAGGATGTAATCTTCCATCCCTCTCTGTAAATGGGCGATTAAATGCTTCACATTCATCACACTCCTTTCATTTTTTGTTTTTCTTCTTATTAGCTTACGAATAAATTAGATAATAGTGTAATTTCTGGCCTCCAACCAGGGGTATTTTAGTTATTTAAGGGTAAATCCGTAATTTCCTCTTCTTGCTTCTCATCCAATTATAA

Coding sequence (CDS)

ATGCCGGACTCCAATTTCCAAGACGTGATGTACGGTTCCGGAGTCATGAACAACGTTGCCCAGGATTTGGATAACGTTCAGACTCAAGTTGGCGATGAAGACGACAACGTTACCGGTCGGGAAGAGTCCATTGACAACCCTCAGATGCGCTTTGAAGACTCCGGGGGGATGAGTGGCTCGGTCTCAGCGATGAACCGAGTACAGGATATTGTTCCTTCGGCGTATATTTCCAGCTCTGATTACAACCCTTTGATTGGAAACGGTGGTGCCGATCAGCTCACGCTGTCATTCAGAGGGGAGGTTTACGCTTTTGACTCTGTATCGCCGGACAAGGTGCAAGCCGTACTTTTGCTTTTAGGTGGATATGAAGTTCCTTCTGGTATTCCTGCTGTTGGAAGTATTCCTGTCAATCAACAGGGTGCTGATGGCTTTCCTGTCAGGTCGGCGCAACCGCAAAGAGCTGCTTCATTGAGTAGGTTTCGAGAGAAGAGGAAAGAGAGGTGTTTCGGGAAGAAAATCCGTTACAATGTGCGAAAAGAAGTTGCACTCAGAATGCAGCGGAAGAAGGGCCAGTTTATATCATCTAAAGCTAGTGCAGATGAAGTGGGCTCATCTTCAGTCTTGGCTCTAACTTTGGATTCTGGTCAAGATGATGGCTTGTTGGAAACTTCATGCACACATTGTGGAGTCAGTTCAAAATCTACCCCAATGATGCGTCGAGGGCCAGCAGGCCCGAGGACTCTGTGCAATGCATGTGGGCTGAAGTGGGCCAATAAGGGAATTTTGAGGGATCTTTCCAAGGTTTCAACCGCTGGCATTCAGGAGCCCTCTGTGAAGGAGACTGAACAGAGTGACGGTGAAGCTAACGAGTCCGATGCTGCTATAAACGTGGATGCCCTCGCTTCTAATGGTGGTAAATAG
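
The link between the CDS and the protein used as the BLAST query can be checked by translating the first codons. This is a minimal Python sketch, assuming the standard nuclear genetic code; `translate` is an illustrative helper, and only the first 30 nucleotides of the CDS above are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt of the CDS listed above.
cds_start = "ATGCCGGACTCCAATTTCCAAGACGTGATG"
peptide = translate(cds_start)
print(peptide)
```

The resulting peptide, MPDSNFQDVM, agrees with the start of the Query lines in the full-length alignments further down the page.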
BLAST of CmoCh11G014550 vs. Swiss-Prot
Match: GAT24_ARATH (GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2)

HSP 1 Score: 206.1 bits (523), Expect = 5.6e-52
Identity = 136/259 (52.51%), Positives = 160/259 (61.78%), Query Frame = 1

Query: 44  IDNPQMRFED--SGGMSGSVSAMNRVQDIVPSAYISSSDYNPLIGNG--GADQLTLSFRG 103
           IDN     +D   GGM   V       DI      S+ +   ++  G    DQLTLSF+G
Sbjct: 32  IDNENSMMDDHADGGMDEGVET-----DIPSHPGNSADNRGEVVDRGIENGDQLTLSFQG 91

Query: 104 EVYAFDSVSPDKVQAVLLLLGGYEVPSGIPA-VGSIPVNQQ--GADGFPVRSAQPQRAAS 163
           +VY FD VSP+KVQAVLLLLGG EVP  +P  +GS   N +  G  G P R + PQR AS
Sbjct: 92  QVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRLSVPQRLAS 151

Query: 164 LSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADEVGSSSVLALTLDS-- 223
           L RFREKRK R F K IRY VRKEVALRMQRKKGQF S+K+S D+ GS+     +  S  
Sbjct: 152 LLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSDWGSNQSWA 211

Query: 224 --GQDDGLLETSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAG 283
             G +    E  C HCG S KSTPMMRRGP GPRTLCNACGL WANKG LRDLSKV    
Sbjct: 212 VEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRDLSKVPPPQ 271

Query: 284 I-QEPSVKETEQSDGEANE 291
             Q  S+ + E ++ EA++
Sbjct: 272 TPQHLSLNKNEDANLEADQ 285
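
The percentages in each HSP header are the identical (or positive-scoring) positions divided by the alignment length. This is a minimal Python sketch of that arithmetic for the hit above; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout on this page, not an official BLAST parser.

```python
import re

# Accepts both "Positives" and the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Recompute identity and positives percentages from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return round(100 * identical / length, 2), round(100 * positives / length, 2)

line = "Identity = 136/259 (52.51%), Positives = 160/259 (61.78%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For this hit, 136/259 and 160/259 reproduce the reported 52.51% and 61.78%.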

BLAST of CmoCh11G014550 vs. Swiss-Prot
Match: GAT28_ARATH (GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 2.4e-50
Identity = 135/295 (45.76%), Positives = 172/295 (58.31%), Query Frame = 1

Query: 8   DVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGSVSAMNRV 67
           D ++GS    ++ +  D +  Q      +       + + Q    ++GGMS  V      
Sbjct: 2   DDLHGSNARMHIREAQDPMHVQFEHHALHHIHNGSGMVDDQADDGNAGGMSEGV------ 61

Query: 68  QDIVPSAYISSSDYNPLIGNGGA---DQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEV 127
           +  +PS   + +D    + + G+   DQLTLSF+G+VY FDSV P+KVQAVLLLLGG E+
Sbjct: 62  ETDIPSHPGNVTDNRGEVVDRGSEQGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGGREL 121

Query: 128 PSGIP-AVGSIPVNQQGAD--GFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKEV 187
           P   P  +GS   N + +   G P R + PQR ASL RFREKRK R F KKIRY VRKEV
Sbjct: 122 PQAAPPGLGSPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGRNFDKKIRYTVRKEV 181

Query: 188 ALRMQRKKGQFISSKASADEV-------GSSSVLALTLDSGQDDGLLETSCTHCGVSSKS 247
           ALRMQR KGQF S+K++ DE        GS+   A+     Q     E SC HCG+  KS
Sbjct: 182 ALRMQRNKGQFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQ---EISCRHCGIGEKS 241

Query: 248 TPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGEAN 290
           TPMMRRGPAGPRTLCNACGL WANKG  RDLSK S    Q   + + E ++ E +
Sbjct: 242 TPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQTAQNLPLNKNEDANLETD 287

BLAST of CmoCh11G014550 vs. Swiss-Prot
Match: GAT20_ORYSJ (GATA transcription factor 20 OS=Oryza sativa subsp. japonica GN=GATA20 PE=2 SV=1)

HSP 1 Score: 191.4 bits (485), Expect = 1.4e-47
Identity = 120/223 (53.81%), Positives = 140/223 (62.78%), Query Frame = 1

Query: 90  ADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEVPSGIPAVGSIPVNQQGADGFPVRSA 149
           ++QLTLSF+GEVY FDSVSPDKVQAVLLLLGG E+  G+ +  S       +  +  R  
Sbjct: 125 SNQLTLSFQGEVYVFDSVSPDKVQAVLLLLGGRELNPGLGSGAS------SSAPYSKRLN 184

Query: 150 QPQRAASLSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADEVGSSSVLA 209
            P R ASL RFREKRKER F KKIRY+VRKEVALRMQR +GQF SSK   DE  S     
Sbjct: 185 FPHRVASLMRFREKRKERNFDKKIRYSVRKEVALRMQRNRGQFTSSKPKGDEATSE---- 244

Query: 210 LTLDSGQDD-GLLE------TSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGIL 269
           LT   G  + G +E        C HCG+++K+TPMMRRGP GPRTLCNACGL WANKG+L
Sbjct: 245 LTASDGSPNWGSVEGRPPSAAECHHCGINAKATPMMRRGPDGPRTLCNACGLMWANKGML 304

Query: 270 RDLSKVSTAGIQEPSVKETEQSDGEANESDAAINVDALASNGG 306
           RDLSK     IQ   V      +G A        + A A+  G
Sbjct: 305 RDLSKAPPTPIQ--VVASVNDGNGSAAAPTTEQEIPAPATVNG 335

BLAST of CmoCh11G014550 vs. Swiss-Prot
Match: GAT25_ARATH (GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2)

HSP 1 Score: 190.3 bits (482), Expect = 3.2e-47
Identity = 115/217 (53.00%), Positives = 141/217 (64.98%), Query Frame = 1

Query: 89  GADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEVPSGIPAVGSIPVNQQGAD--GFPV 148
           GA+QLT+SFRG+VY FD+V  DKV AVL LLGG    +  P V  +   Q       +  
Sbjct: 80  GANQLTISFRGQVYVFDAVGADKVDAVLSLLGGSTELAPGPQVMELAQQQNHMPVVEYQS 139

Query: 149 RSAQPQRAASLSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADEVGSSS 208
           R + PQRA SL RFR+KR  RCF KK+RY VR+EVALRM R KGQF SSK +     S +
Sbjct: 140 RCSLPQRAQSLDRFRKKRNARCFEKKVRYGVRQEVALRMARNKGQFTSSKMTDGAYNSGT 199

Query: 209 VLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLS 268
                 DS QDD   E SCTHCG+SSK TPMMRRGP+GPRTLCNACGL WAN+G LRDLS
Sbjct: 200 ----DQDSAQDDAHPEISCTHCGISSKCTPMMRRGPSGPRTLCNACGLFWANRGTLRDLS 259

Query: 269 KVSTAGIQEPSVKETEQSDGEANESDAAINVDALASN 304
           K +    +E  +   +  DG  + +DAA N++  A++
Sbjct: 260 KKT----EENQLALMKPDDG-GSVADAANNLNTEAAS 287

BLAST of CmoCh11G014550 vs. Swiss-Prot
Match: GAT17_ORYSJ (GATA transcription factor 17 OS=Oryza sativa subsp. japonica GN=GATA17 PE=2 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 1.9e-44
Identity = 124/287 (43.21%), Positives = 167/287 (58.19%), Query Frame = 1

Query: 29  QVGDEDDNVTGREESIDNPQMRFEDSGGMSGSVSAMNRVQDIVPSAYISSSDYNPLIGNG 88
           Q  + ++   G EE  +  +    D+   + +V+ M+   ++VP A   +    P + + 
Sbjct: 46  QEQEYEEGEEGEEEEYEGGEGVPMDADASAAAVAGMDPHGEMVPVAGGEAGGGYPHVAS- 105

Query: 89  GADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEVPSGIPAVGSIPVNQQGADGFPVRS 148
             + LTLSF+GEVY F+SVS ++VQAVLLLLGG E+    P  GS+P     +  +  + 
Sbjct: 106 --NTLTLSFQGEVYVFESVSAERVQAVLLLLGGREL---APGSGSVP---SSSAAYSKKM 165

Query: 149 AQPQRAASLSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADEVGSSSVL 208
             P R ASL RFREKRKER F KKIRY VRKEVALRMQR +GQF SSK+ A+E  S    
Sbjct: 166 NFPHRMASLMRFREKRKERNFDKKIRYTVRKEVALRMQRNRGQFTSSKSKAEEATS---- 225

Query: 209 ALTLDSGQDD-GLLE------TSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGI 268
            +T   G  + G +E        C HCG+S+ STPMMRRGP GPRTLCNACGL WANKG 
Sbjct: 226 VITSSEGSPNWGAVEGRPPSAAECHHCGISAASTPMMRRGPDGPRTLCNACGLMWANKGT 285

Query: 269 LRDLSK----------VSTAGIQEPSVKET--EQSDGEANESDAAIN 297
           +R+++K           +T  +Q   V+ T  EQ +    E+ +A N
Sbjct: 286 MREVTKGPPVPLQIVPAATNDVQNGIVEATGVEQHNSAVEEAVSAAN 319

BLAST of CmoCh11G014550 vs. TrEMBL
Match: A0A0A0K2L9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G064580 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 1.4e-139
Identity = 261/307 (85.02%), Positives = 276/307 (89.90%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGS 60
           MPDSNF+D MYGSGVM+N  + L N+Q +V DEDD++ G EESIDNPQMRFEDSGGMSGS
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  VSAMNRVQDIVPSAYISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
           VS +NRV+D+VPS YIS SDYNPL GNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKE 180
           GYE+PSGIPA+GS PVNQQGADGF VRS QPQRAASLSRFREKRKERCF KKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKASADEVGSSSVLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSSVL+ TLDSGQDDGLLETSCTHCG SSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGE-ANESDAAINVDA 300
           GPAGPRTLCNACGLKWANKGILRDLSKVS   IQEPS KE EQSDGE ANE +AAINVD 
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAINVDI 300

Query: 301 LASNGGK 307
           L SNG K
Sbjct: 301 LTSNGDK 307

BLAST of CmoCh11G014550 vs. TrEMBL
Match: W9SKI1_9ROSA (GATA transcription factor 28 OS=Morus notabilis GN=L484_014769 PE=4 SV=1)

HSP 1 Score: 365.2 bits (936), Expect = 8.0e-98
Identity = 199/309 (64.40%), Positives = 240/309 (77.67%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQT-QVGDEDDNVTGREESIDNPQMRFEDSGGMSG 60
           MP+ N Q  MYG   M        N+Q+ QV D+D++VT  EESIDNPQ+RF+D      
Sbjct: 1   MPEPNQQASMYGRAAMATTT----NMQSGQVDDDDNDVTAGEESIDNPQIRFDD------ 60

Query: 61  SVSAMNRVQDIVPSA-YISS-SDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLL 120
           + +AMN +QD+  +A Y+   +DY P+  NGG+DQLTLSF+GEVY FD+VSPDKVQAVLL
Sbjct: 61  AAAAMNGIQDVPSNALYVPGVADYAPVAENGGSDQLTLSFQGEVYVFDAVSPDKVQAVLL 120

Query: 121 LLGGYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNV 180
           LLGGYE+PSGIPA+G+ P+ Q+G + F  +  QPQRAASL+RFREKRKERCF KKIRYNV
Sbjct: 121 LLGGYEIPSGIPAMGATPIGQRGMNQFVAKPIQPQRAASLNRFREKRKERCFDKKIRYNV 180

Query: 181 RKEVALRMQRKKGQFISSKASADEVGS-SSVLALTLDSGQDDGLLETSCTHCGVSSKSTP 240
           RKEVA+RMQRKKGQF S+K S++E+GS SSV   T  SGQD+ + ETSCTHCG+SSKSTP
Sbjct: 181 RKEVAMRMQRKKGQFTSAKTSSEELGSASSVWNATPGSGQDENMQETSCTHCGISSKSTP 240

Query: 241 MMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGEANESDAAIN 300
           MMRRGPAGPRTLCNACGLKWANKGILRDLSKV    +Q+ SVKETEQSDG+AN+S A   
Sbjct: 241 MMRRGPAGPRTLCNACGLKWANKGILRDLSKVLNGNVQDASVKETEQSDGDANDSAAVTT 299

Query: 301 VDALASNGG 306
              +AS+ G
Sbjct: 301 TANIASSNG 299

BLAST of CmoCh11G014550 vs. TrEMBL
Match: A0A061E2X9_THECC (Zim-like 2 OS=Theobroma cacao GN=TCM_007360 PE=4 SV=1)

HSP 1 Score: 355.1 bits (910), Expect = 8.3e-95
Identity = 200/315 (63.49%), Positives = 238/315 (75.56%), Query Frame = 1

Query: 1   MPDSNFQDV-MYGSGVMNNVAQDLDNVQTQVGDEDDNVTGR-----EESIDNPQMRFEDS 60
           M +SN Q   MYGSG MN        +Q  + +EDD+V G      EES+DNPQ+ ++++
Sbjct: 1   MANSNHQPTSMYGSGAMN--------MQQNLEEEDDDVPGGTGGGGEESVDNPQIGYQET 60

Query: 61  GGMSGSVSAMNRVQDIVPSA--YISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKV 120
           GG+   V+ MN   +    A  Y   SD   + GNGG+DQLTLSF+GEVY FDSVSPDKV
Sbjct: 61  GGV---VTVMNNGMEEASHANIYGQGSDLTVVPGNGGSDQLTLSFQGEVYVFDSVSPDKV 120

Query: 121 QAVLLLLGGYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKK 180
           QAVLLLLGGYE+PSGIPA+G++PV Q+G   FP R+ QPQRAASL+RFREKRKERCF KK
Sbjct: 121 QAVLLLLGGYEIPSGIPALGTVPVTQRGLGDFPGRAIQPQRAASLNRFREKRKERCFDKK 180

Query: 181 IRYNVRKEVALRMQRKKGQFISSKASADEVGS-SSVLALTLDSGQDDGLLETSCTHCGVS 240
           IRY VRKEVALRMQRKKGQF SSKA +DEV S SS  ++T  SGQD+ + ETSCTHCG+S
Sbjct: 181 IRYTVRKEVALRMQRKKGQFTSSKAISDEVASASSGWSVTPGSGQDESMEETSCTHCGIS 240

Query: 241 SKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGEANES 300
           SKSTPMMRRGP GPRTLCNACGLKWANKG+LRDLSKVST  IQ+ S K TEQSD EAN+S
Sbjct: 241 SKSTPMMRRGPTGPRTLCNACGLKWANKGVLRDLSKVSTIPIQDASAKPTEQSDAEANDS 300

Query: 301 DA-AINVDALASNGG 306
           +A  +  D ++S+ G
Sbjct: 301 EAVTVTTDVVSSSNG 304

BLAST of CmoCh11G014550 vs. TrEMBL
Match: A0A0B0MJ71_GOSAR (GATA transcription factor 24-like protein OS=Gossypium arboreum GN=F383_19720 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 2.5e-91
Identity = 197/315 (62.54%), Positives = 235/315 (74.60%), Query Frame = 1

Query: 1   MPDSNFQDV-MYGSGVMNNVAQDLDNVQTQVGDEDDNVT-----GREESIDNPQMRFEDS 60
           M +SN Q   MYGSG  N + +++D       +EDD+V      G EES+DNPQ+ F+++
Sbjct: 1   MANSNHQPTSMYGSGAAN-MQRNIDE------EEDDDVPVGAGGGGEESVDNPQIGFQEN 60

Query: 61  GGMSGSVSAMNRVQDIVPSAYI--SSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKV 120
           G +   V+ MN   D    A++    SD     GNGGADQLTLSF+GEVY FDSVSPDKV
Sbjct: 61  GAV---VAVMNNGMDEASHAHVYGQGSDSTSAPGNGGADQLTLSFQGEVYVFDSVSPDKV 120

Query: 121 QAVLLLLGGYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKK 180
           QAVLLLLGGYE+PSGIPA+G++ V Q+G + FP RS QPQRAASL+RFREKRKERCF KK
Sbjct: 121 QAVLLLLGGYEIPSGIPAMGTVSVTQRGLNDFPGRSIQPQRAASLNRFREKRKERCFEKK 180

Query: 181 IRYNVRKEVALRMQRKKGQFISSKASADEVGS-SSVLALTLDSGQDDGLLETSCTHCGVS 240
           IRY VRKEVALRMQRKKGQF SSKA ++EV S SS  + T  SGQD+ + E  CTHCG+S
Sbjct: 181 IRYTVRKEVALRMQRKKGQFTSSKAISEEVASASSGWSGTPGSGQDENMQEVLCTHCGIS 240

Query: 241 SKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGEANES 300
           SK TPMMRRGPAGPRTLCNACGLKWANKG+LRDLSKVST  I +P+VK  EQSD EANES
Sbjct: 241 SKRTPMMRRGPAGPRTLCNACGLKWANKGVLRDLSKVSTVVIPDPTVKTAEQSDAEANES 300

Query: 301 DA-AINVDALASNGG 306
           +A  +  D ++S+ G
Sbjct: 301 EAVTVTTDVVSSSNG 305

BLAST of CmoCh11G014550 vs. TrEMBL
Match: A0A0D2PM93_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G017800 PE=4 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 2.5e-91
Identity = 197/315 (62.54%), Positives = 234/315 (74.29%), Query Frame = 1

Query: 1   MPDSNFQDV-MYGSGVMNNVAQDLDNVQTQVGDEDDNVTGR-----EESIDNPQMRFEDS 60
           M +SN Q   MYGSG  N + +++D       +EDD+V G      EES+DNPQ+ F+++
Sbjct: 1   MANSNHQRTPMYGSGAAN-MQRNIDE------EEDDDVPGGAGGGGEESVDNPQIGFQEN 60

Query: 61  GGMSGSVSAMNRVQDIVPSAYI--SSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKV 120
           G +   V+ MN   D    A++    SD     GNGGADQLTLSF+GEVY FDSVSPDKV
Sbjct: 61  GAV---VAVMNNGMDEASHAHVYGQGSDSTSAPGNGGADQLTLSFQGEVYVFDSVSPDKV 120

Query: 121 QAVLLLLGGYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKK 180
           QAVLLLLGGYE+PSGIPA+G++ V Q+G   FP RS QPQRAASL+RFREKRKERCF KK
Sbjct: 121 QAVLLLLGGYEIPSGIPAMGTVSVTQRGLSDFPGRSIQPQRAASLNRFREKRKERCFEKK 180

Query: 181 IRYNVRKEVALRMQRKKGQFISSKASADEVGS-SSVLALTLDSGQDDGLLETSCTHCGVS 240
           IRY VRKEVALRMQRKKGQF SSKA ++EV S SS  + T  SGQD+ + E  CTHCG+S
Sbjct: 181 IRYTVRKEVALRMQRKKGQFTSSKAISEEVASASSGWSGTPGSGQDENIQEVLCTHCGIS 240

Query: 241 SKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGEANES 300
           SK TPMMRRGPAGPRTLCNACGLKWANKG+LRDLSKVST  I +P+VK  EQSD EANES
Sbjct: 241 SKKTPMMRRGPAGPRTLCNACGLKWANKGVLRDLSKVSTVAIPDPTVKTAEQSDAEANES 300

Query: 301 DA-AINVDALASNGG 306
           +A  +  D ++S+ G
Sbjct: 301 EAVTVTPDVVSSSNG 305

BLAST of CmoCh11G014550 vs. TAIR10
Match: AT3G21175.1 (AT3G21175.1 ZIM-like 1)

HSP 1 Score: 206.1 bits (523), Expect = 3.2e-53
Identity = 136/259 (52.51%), Positives = 160/259 (61.78%), Query Frame = 1

Query: 44  IDNPQMRFED--SGGMSGSVSAMNRVQDIVPSAYISSSDYNPLIGNG--GADQLTLSFRG 103
           IDN     +D   GGM   V       DI      S+ +   ++  G    DQLTLSF+G
Sbjct: 32  IDNENSMMDDHADGGMDEGVET-----DIPSHPGNSADNRGEVVDRGIENGDQLTLSFQG 91

Query: 104 EVYAFDSVSPDKVQAVLLLLGGYEVPSGIPA-VGSIPVNQQ--GADGFPVRSAQPQRAAS 163
           +VY FD VSP+KVQAVLLLLGG EVP  +P  +GS   N +  G  G P R + PQR AS
Sbjct: 92  QVYVFDRVSPEKVQAVLLLLGGREVPHTLPTTLGSPHQNNRVLGLSGTPQRLSVPQRLAS 151

Query: 164 LSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADEVGSSSVLALTLDS-- 223
           L RFREKRK R F K IRY VRKEVALRMQRKKGQF S+K+S D+ GS+     +  S  
Sbjct: 152 LLRFREKRKGRNFDKTIRYTVRKEVALRMQRKKGQFTSAKSSNDDSGSTGSDWGSNQSWA 211

Query: 224 --GQDDGLLETSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAG 283
             G +    E  C HCG S KSTPMMRRGP GPRTLCNACGL WANKG LRDLSKV    
Sbjct: 212 VEGTETQKPEVLCRHCGTSEKSTPMMRRGPDGPRTLCNACGLMWANKGTLRDLSKVPPPQ 271

Query: 284 I-QEPSVKETEQSDGEANE 291
             Q  S+ + E ++ EA++
Sbjct: 272 TPQHLSLNKNEDANLEADQ 285

BLAST of CmoCh11G014550 vs. TAIR10
Match: AT1G51600.1 (AT1G51600.1 ZIM-LIKE 2)

HSP 1 Score: 200.7 bits (509), Expect = 1.3e-51
Identity = 135/295 (45.76%), Positives = 172/295 (58.31%), Query Frame = 1

Query: 8   DVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGSVSAMNRV 67
           D ++GS    ++ +  D +  Q      +       + + Q    ++GGMS  V      
Sbjct: 2   DDLHGSNARMHIREAQDPMHVQFEHHALHHIHNGSGMVDDQADDGNAGGMSEGV------ 61

Query: 68  QDIVPSAYISSSDYNPLIGNGGA---DQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEV 127
           +  +PS   + +D    + + G+   DQLTLSF+G+VY FDSV P+KVQAVLLLLGG E+
Sbjct: 62  ETDIPSHPGNVTDNRGEVVDRGSEQGDQLTLSFQGQVYVFDSVLPEKVQAVLLLLGGREL 121

Query: 128 PSGIP-AVGSIPVNQQGAD--GFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKEV 187
           P   P  +GS   N + +   G P R + PQR ASL RFREKRK R F KKIRY VRKEV
Sbjct: 122 PQAAPPGLGSPHQNNRVSSLPGTPQRFSIPQRLASLVRFREKRKGRNFDKKIRYTVRKEV 181

Query: 188 ALRMQRKKGQFISSKASADEV-------GSSSVLALTLDSGQDDGLLETSCTHCGVSSKS 247
           ALRMQR KGQF S+K++ DE        GS+   A+     Q     E SC HCG+  KS
Sbjct: 182 ALRMQRNKGQFTSAKSNNDEAASAGSSWGSNQTWAIESSEAQHQ---EISCRHCGIGEKS 241

Query: 248 TPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGEAN 290
           TPMMRRGPAGPRTLCNACGL WANKG  RDLSK S    Q   + + E ++ E +
Sbjct: 242 TPMMRRGPAGPRTLCNACGLMWANKGAFRDLSKASPQTAQNLPLNKNEDANLETD 287

BLAST of CmoCh11G014550 vs. TAIR10
Match: AT4G24470.3 (AT4G24470.3 GATA-type zinc finger protein with TIFY domain)

HSP 1 Score: 190.7 bits (483), Expect = 1.4e-48
Identity = 114/220 (51.82%), Positives = 139/220 (63.18%), Query Frame = 1

Query: 89  GADQLTLSFRGEVYAFDSVSPDKVQAVLLLLGGYEVPSGIPAVGSIPVNQQGAD--GFPV 148
           GA+QLT+SFRG+VY FD+V  DKV AVL LLGG    +  P V  +   Q       +  
Sbjct: 80  GANQLTISFRGQVYVFDAVGADKVDAVLSLLGGSTELAPGPQVMELAQQQNHMPVVEYQS 139

Query: 149 RSAQPQRAASLSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADEVGSSS 208
           R + PQRA SL RFR+KR  RCF KK+RY VR+EVALRM R KGQF SSK +     S +
Sbjct: 140 RCSLPQRAQSLDRFRKKRNARCFEKKVRYGVRQEVALRMARNKGQFTSSKMTDGAYNSGT 199

Query: 209 VLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLS 268
                 DS QDD   E SCTHCG+SSK TPMMRRGP+GPRTLCNACGL WAN+G LRDLS
Sbjct: 200 ----DQDSAQDDAHPEISCTHCGISSKCTPMMRRGPSGPRTLCNACGLFWANRGTLRDLS 259

Query: 269 K---VSTAGIQEPSVKETEQSDGEANESDAAINVDALASN 304
           K    +   + +P        D   + +DAA N++  A++
Sbjct: 260 KKTEENQLALMKPVSSYKYHPDDGGSVADAANNLNTEAAS 295

BLAST of CmoCh11G014550 vs. TAIR10
Match: AT1G08000.1 (AT1G08000.1 GATA transcription factor 10)

HSP 1 Score: 53.9 bits (128), Expect = 2.0e-07
Identity = 24/60 (40.00%), Positives = 41/60 (68.33%), Query Frame = 1

Query: 211 TLDSGQDDGLLETSCTHCGVSSKSTPMMRRGPAGPRTLCNACGLKWANKGILRDLSKVST 270
           TL+S + DG++   CTHC   + +TP  R+GP+GP+TLCNACG+++ +  ++ +    S+
Sbjct: 207 TLESSKSDGIVRI-CTHC--ETITTPQWRQGPSGPKTLCNACGVRFKSGRLVPEYRPASS 263

BLAST of CmoCh11G014550 vs. TAIR10
Match: AT5G02810.1 (AT5G02810.1 pseudo-response regulator 7)

HSP 1 Score: 53.1 bits (126), Expect = 3.4e-07
Identity = 26/50 (52.00%), Positives = 38/50 (76.00%), Query Frame = 1

Query: 152 QRAASLSRFREKRKERCFGKKIRYNVRKEVALRMQRKKGQFISSKASADE 202
           QR A+L++FR+KRKERCF KK+RY  RK++A +  R +GQF+   A+A +
Sbjct: 668 QREAALTKFRQKRKERCFRKKVRYQSRKKLAEQRPRVRGQFVRKTAAATD 717

BLAST of CmoCh11G014550 vs. NCBI nr
Match: gi|659110314|ref|XP_008455162.1| (PREDICTED: GATA transcription factor 24-like isoform X1 [Cucumis melo])

HSP 1 Score: 511.9 bits (1317), Expect = 7.6e-142
Identity = 263/307 (85.67%), Positives = 280/307 (91.21%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGS 60
           MPDSNFQD MYGSGVMN+  +DL N+Q +V DEDD++ G EESIDNPQMRFEDSGGMSGS
Sbjct: 1   MPDSNFQDAMYGSGVMNDGGRDLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  VSAMNRVQDIVPSAYISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
           VS ++RV+D+VPS Y+S SDYNPL GNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVISRVEDVVPSTYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKE 180
           GYE+PSGIPA+GS+PVNQQGADGFPVRS QPQRAASLSRFREKRKERCF KKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSVPVNQQGADGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKASADEVGSSSVLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSSVL+ TLDSGQDDGLLETSCTHCG SSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGE-ANESDAAINVDA 300
           GPAGPRTLCNACGLKWANKGILRDLSKVS   IQEPS KE EQSDGE ANES+AAINVD 
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANESNAAINVDI 300

Query: 301 LASNGGK 307
           L SNG K
Sbjct: 301 LTSNGDK 307

BLAST of CmoCh11G014550 vs. NCBI nr
Match: gi|659110318|ref|XP_008455164.1| (PREDICTED: GATA transcription factor 24-like isoform X2 [Cucumis melo])

HSP 1 Score: 511.9 bits (1317), Expect = 7.6e-142
Identity = 263/307 (85.67%), Positives = 280/307 (91.21%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGS 60
           MPDSNFQD MYGSGVMN+  +DL N+Q +V DEDD++ G EESIDNPQMRFEDSGGMSGS
Sbjct: 1   MPDSNFQDAMYGSGVMNDGGRDLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  VSAMNRVQDIVPSAYISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
           VS ++RV+D+VPS Y+S SDYNPL GNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVISRVEDVVPSTYVSGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKE 180
           GYE+PSGIPA+GS+PVNQQGADGFPVRS QPQRAASLSRFREKRKERCF KKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSVPVNQQGADGFPVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKASADEVGSSSVLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSSVL+ TLDSGQDDGLLETSCTHCG SSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGE-ANESDAAINVDA 300
           GPAGPRTLCNACGLKWANKGILRDLSKVS   IQEPS KE EQSDGE ANES+AAINVD 
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANESNAAINVDI 300

Query: 301 LASNGGK 307
           L SNG K
Sbjct: 301 LTSNGDK 307

BLAST of CmoCh11G014550 vs. NCBI nr
Match: gi|449438218|ref|XP_004136886.1| (PREDICTED: GATA transcription factor 24 isoform X1 [Cucumis sativus])

HSP 1 Score: 503.8 bits (1296), Expect = 2.1e-139
Identity = 261/307 (85.02%), Positives = 276/307 (89.90%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGS 60
           MPDSNF+D MYGSGVM+N  + L N+Q +V DEDD++ G EESIDNPQMRFEDSGGMSGS
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  VSAMNRVQDIVPSAYISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
           VS +NRV+D+VPS YIS SDYNPL GNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKE 180
           GYE+PSGIPA+GS PVNQQGADGF VRS QPQRAASLSRFREKRKERCF KKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKASADEVGSSSVLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSSVL+ TLDSGQDDGLLETSCTHCG SSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGE-ANESDAAINVDA 300
           GPAGPRTLCNACGLKWANKGILRDLSKVS   IQEPS KE EQSDGE ANE +AAINVD 
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAINVDI 300

Query: 301 LASNGGK 307
           L SNG K
Sbjct: 301 LTSNGDK 307

BLAST of CmoCh11G014550 vs. NCBI nr
Match: gi|778724486|ref|XP_011658814.1| (PREDICTED: GATA transcription factor 24 isoform X2 [Cucumis sativus])

HSP 1 Score: 503.8 bits (1296), Expect = 2.1e-139
Identity = 261/307 (85.02%), Positives = 276/307 (89.90%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGS 60
           MPDSNF+D MYGSGVM+N  + L N+Q +V DEDD++ G EESIDNPQMRFEDSGGMSGS
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  VSAMNRVQDIVPSAYISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
           VS +NRV+D+VPS YIS SDYNPL GNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKE 180
           GYE+PSGIPA+GS PVNQQGADGF VRS QPQRAASLSRFREKRKERCF KKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKASADEVGSSSVLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSSVL+ TLDSGQDDGLLETSCTHCG SSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGE-ANESDAAINVDA 300
           GPAGPRTLCNACGLKWANKGILRDLSKVS   IQEPS KE EQSDGE ANE +AAINVD 
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAINVDI 300

Query: 301 LASNGGK 307
           L SNG K
Sbjct: 301 LTSNGDK 307

BLAST of CmoCh11G014550 vs. NCBI nr
Match: gi|700188515|gb|KGN43748.1| (hypothetical protein Csa_7G064580 [Cucumis sativus])

HSP 1 Score: 503.8 bits (1296), Expect = 2.1e-139
Identity = 261/307 (85.02%), Positives = 276/307 (89.90%), Query Frame = 1

Query: 1   MPDSNFQDVMYGSGVMNNVAQDLDNVQTQVGDEDDNVTGREESIDNPQMRFEDSGGMSGS 60
           MPDSNF+D MYGSGVM+N  + L N+Q +V DEDD++ G EESIDNPQMRFEDSGGMSGS
Sbjct: 1   MPDSNFEDAMYGSGVMDNGGRGLGNIQNRVDDEDDDINGGEESIDNPQMRFEDSGGMSGS 60

Query: 61  VSAMNRVQDIVPSAYISSSDYNPLIGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120
           VS +NRV+D+VPS YIS SDYNPL GNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG
Sbjct: 61  VSVINRVEDVVPSTYISGSDYNPLTGNGGADQLTLSFRGEVYAFDSVSPDKVQAVLLLLG 120

Query: 121 GYEVPSGIPAVGSIPVNQQGADGFPVRSAQPQRAASLSRFREKRKERCFGKKIRYNVRKE 180
           GYE+PSGIPA+GS PVNQQGADGF VRS QPQRAASLSRFREKRKERCF KKIRY+VRKE
Sbjct: 121 GYEIPSGIPAIGSAPVNQQGADGFTVRSVQPQRAASLSRFREKRKERCFEKKIRYSVRKE 180

Query: 181 VALRMQRKKGQFISSKASADEVGSSSVLALTLDSGQDDGLLETSCTHCGVSSKSTPMMRR 240
           VALRMQRKKGQFISSKA  DEVGSSSVL+ TLDSGQDDGLLETSCTHCG SSKSTPMMRR
Sbjct: 181 VALRMQRKKGQFISSKAIGDEVGSSSVLSQTLDSGQDDGLLETSCTHCGTSSKSTPMMRR 240

Query: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSTAGIQEPSVKETEQSDGE-ANESDAAINVDA 300
           GPAGPRTLCNACGLKWANKGILRDLSKVS   IQEPS KE EQSDGE ANE +AAINVD 
Sbjct: 241 GPAGPRTLCNACGLKWANKGILRDLSKVSNPSIQEPSAKEIEQSDGEAANEHNAAINVDI 300

Query: 301 LASNGGK 307
           L SNG K
Sbjct: 301 LTSNGDK 307
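The percentages on the HSP summary line follow directly from the reported counts (matched positions divided by alignment length). A minimal sketch of that arithmetic, assuming the summary line has the layout shown above; the helper name `parse_fraction` and the constant `HSP_LINE` are illustrative and not part of any BLAST tool:

```python
import re

# Summary line as reported for this HSP (copied from the record above).
HSP_LINE = "Identity = 261/307 (85.02%), Positives = 276/307 (89.90%), Query Frame = 1"


def parse_fraction(line, label):
    """Return (matched, aligned_length, percent) for a 'Label = a/b' field.

    Hypothetical helper: it only understands the 'Label = a/b' layout used
    in this record, not BLAST output in general.
    """
    match = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if match is None:
        raise ValueError(f"no '{label} = a/b' field found")
    matched, length = int(match.group(1)), int(match.group(2))
    # Percent is simply matched positions over alignment length.
    return matched, length, round(100.0 * matched / length, 2)


identity = parse_fraction(HSP_LINE, "Identity")
positives = parse_fraction(HSP_LINE, "Positives")
print(identity)
print(positives)
```

Identical residues are a subset of the positive (conservatively substituted) positions, which is why the positives count is never lower than the identity count.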

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
GAT24_ARATH	5.6e-52	52.51	GATA transcription factor 24 OS=Arabidopsis thaliana GN=GATA24 PE=2 SV=2
GAT28_ARATH	2.4e-50	45.76	GATA transcription factor 28 OS=Arabidopsis thaliana GN=GATA28 PE=2 SV=1
GAT20_ORYSJ	1.4e-47	53.81	GATA transcription factor 20 OS=Oryza sativa subsp. japonica GN=GATA20 PE=2 SV=1
GAT25_ARATH	3.2e-47	53.00	GATA transcription factor 25 OS=Arabidopsis thaliana GN=GATA25 PE=2 SV=2
GAT17_ORYSJ	1.9e-44	43.21	GATA transcription factor 17 OS=Oryza sativa subsp. japonica GN=GATA17 PE=2 SV=1
Match Name	E-value	Identity	Description
A0A0A0K2L9_CUCSA	1.4e-139	85.02	Uncharacterized protein OS=Cucumis sativus GN=Csa_7G064580 PE=4 SV=1
W9SKI1_9ROSA	8.0e-98	64.40	GATA transcription factor 28 OS=Morus notabilis GN=L484_014769 PE=4 SV=1
A0A061E2X9_THECC	8.3e-95	63.49	Zim-like 2 OS=Theobroma cacao GN=TCM_007360 PE=4 SV=1
A0A0B0MJ71_GOSAR	2.5e-91	62.54	GATA transcription factor 24-like protein OS=Gossypium arboreum GN=F383_19720 PE...
A0A0D2PM93_GOSRA	2.5e-91	62.54	Uncharacterized protein OS=Gossypium raimondii GN=B456_008G017800 PE=4 SV=1
Match Name	E-value	Identity	Description
AT3G21175.1	3.2e-53	52.51	ZIM-like 1
AT1G51600.1	1.3e-51	45.76	ZIM-LIKE 2
AT4G24470.3	1.4e-48	51.82	GATA-type zinc finger protein with TIFY domain
AT1G08000.1	2.0e-07	40.00	GATA transcription factor 10
AT5G02810.1	3.4e-07	52.00	pseudo-response regulator 7
Match Name	E-value	Identity	Description
gi|659110314|ref|XP_008455162.1|	7.6e-142	85.67	PREDICTED: GATA transcription factor 24-like isoform X1 [Cucumis melo]
gi|659110318|ref|XP_008455164.1|	7.6e-142	85.67	PREDICTED: GATA transcription factor 24-like isoform X2 [Cucumis melo]
gi|449438218|ref|XP_004136886.1|	2.1e-139	85.02	PREDICTED: GATA transcription factor 24 isoform X1 [Cucumis sativus]
gi|778724486|ref|XP_011658814.1|	2.1e-139	85.02	PREDICTED: GATA transcription factor 24 isoform X2 [Cucumis sativus]
gi|700188515|gb|KGN43748.1|	2.1e-139	85.02	hypothetical protein Csa_7G064580 [Cucumis sativus]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000679	Znf_GATA
IPR010399	Tify_dom
IPR010402	CCT_domain
IPR013088	Znf_NHR/GATA
Vocabulary: Molecular Function
Term	Definition
GO:0003700	transcription factor activity, sequence-specific DNA binding
GO:0008270	zinc ion binding
GO:0043565	sequence-specific DNA binding
GO:0005515	protein binding
Vocabulary: Biological Process
Term	Definition
GO:0006355	regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005667	transcription factor complex
cellular_component	GO:0005634	nucleus
molecular_function	GO:0005515	protein binding
molecular_function	GO:0043565	sequence-specific DNA binding
molecular_function	GO:0003700	transcription factor activity, sequence-specific DNA binding
molecular_function	GO:0008270	zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh11G014550.1	CmoCh11G014550.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000679	Zinc finger, GATA-type	PFAM	PF00320	GATA	coord: 225..260; score: 1.6
IPR000679	Zinc finger, GATA-type	SMART	SM00401	GATA_3	coord: 219..273; score: 4.6
IPR000679	Zinc finger, GATA-type	PROSITE	PS00344	GATA_ZN_FINGER_1	coord: 225..252; scor
IPR000679	Zinc finger, GATA-type	PROFILE	PS50114	GATA_ZN_FINGER_2	coord: 224..276; score: 9
IPR010399	Tify domain	PFAM	PF06200	tify	coord: 90..120; score: 1.2
IPR010399	Tify domain	SMART	SM00979	tify_2	coord: 86..121; score: 5.
IPR010399	Tify domain	PROFILE	PS51320	TIFY	coord: 86..121; score: 13
IPR010402	CCT domain	PFAM	PF06203	CCT	coord: 153..195; score: 2.7
IPR010402	CCT domain	PROFILE	PS51017	CCT	coord: 153..195; score: 13
IPR013088	Zinc finger, NHR/GATA-type	GENE3D	G3DSA:3.30.50.10		coord: 223..266; score: 2.5
None	No IPR available	PANTHER	PTHR10071	TRANSCRIPTION FACTOR GATA GATA BINDING FACTOR	coord: 22..298; score: 1.5E
None	No IPR available	PANTHER	PTHR10071:SF186	SUBFAMILY NOT NAMED	coord: 22..298; score: 1.5E
None	No IPR available	unknown	SSF57716	Glucocorticoid receptor-like (DNA-binding domain)	coord: 222..270; score: 1.95
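Several source databases report overlapping coordinates for the same InterPro term, so it can help to collapse them into one footprint per term. The following is a minimal sketch of that merge, assuming the coordinates are 1-based inclusive protein positions; the names `HITS`, `merge_intervals`, and `footprint_by_term` are illustrative, and the coordinates are copied from the table above (rows without an IPR term are left out):

```python
from collections import defaultdict

# (IPR term, start, end) copied from the InterPro annotation table.
HITS = [
    ("IPR000679", 225, 260), ("IPR000679", 219, 273),
    ("IPR000679", 225, 252), ("IPR000679", 224, 276),
    ("IPR010399", 90, 120), ("IPR010399", 86, 121), ("IPR010399", 86, 121),
    ("IPR010402", 153, 195), ("IPR010402", 153, 195),
    ("IPR013088", 223, 266),
]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous interval: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


def footprint_by_term(hits):
    """Group hits by IPR term and merge each group's coordinates."""
    grouped = defaultdict(list)
    for term, start, end in hits:
        grouped[term].append((start, end))
    return {term: merge_intervals(ivs) for term, ivs in grouped.items()}


footprints = footprint_by_term(HITS)
for term, spans in sorted(footprints.items()):
    print(term, spans)
```

Under these assumptions the merged footprints place the Tify domain (86..121) ahead of the CCT domain (153..195), with the GATA-type zinc finger (219..276) nearest the C-terminus of the 307-residue protein.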

The following gene(s) are paralogous to this gene:

None