Cp4.1LG06g03840 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG06g03840
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: GATA transcription factor-like protein
Location: Cp4.1LG06 : 2895941 .. 2898104 (+)
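
The Location field packs the sequence ID, start, end, and strand into one string. The sketch below shows one way to split it and derive the genomic span. The function names are hypothetical, and the length assumes 1-based inclusive coordinates, which is a common convention for genome browsers but is not stated on this page.

```python
import re

# Pattern for strings shaped like "Cp4.1LG06 : 2895941 .. 2898104 (+)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    m = LOCATION.fullmatch(text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG06 : 2895941 .. 2898104 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 2164 bp on the forward strand; with 0-based half-open coordinates the result would be one base shorter.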
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGATGGCGGTTCCAATAGGCCAATATGCAGAGAATGTTGAGAAAAATCTGAGGTTTCCTCTGCCAATTCATTGTACTTTTTACCTTTTTACAGAAACAGAGTCAATCATGTGATAATTGGTAATAATAATATAATAATAATTGATTACTGATTTCAGATTCACTTAAGGATTAATCAATCTTTGGCTCATCACATCACATGCCTATCTTTCTGCACTAACCAATTACTTTCTTCTCCTTCTTTATTATTATAATCAAATTGATTAACACTGAATAAAGCAGAAGGGAAGAAAAAAAAAAAGAATAAAGCTGCTGCAGCATCCTGATCTACGGACATTAAAAACAGAGCAAAATCAAGCAATCAAGCGGGAATCAGAAATTTTTTGAAATTAGATTAAGTTCACCGAAGTCTATGGCTATCGCTCGAGCCTACCGTTAGCAGATATTGTCCTTTTTGGCTCCCTTTCGGACTTCCTCTCAAGGTTCTTAAAACGTACTGTGAGCAGATATTGTCCTCTTTGAGCTTTCCCTCCAGGTTTTTAAAACGCGTTTGCTAAAGAGAAGTTTCCACACCCTTGCAAAGAATACTTCGTTCTTCTCTCCAACTGATGCGCGGTTAGTTCTAAGTTGCCAGTTTTTGAACCAACCAAGCAAGATTTGAAATTACATTCACCATAGCCTATGACTATCCCAACTGCCAAAGTCTACGGTTAGCATACATTGTCCTTTTTGGGTTTTCCAAAGACTTCCCTTCAAGGTTTTTGAAGTACGTCTATTAGGGAGAGGTTTCCACACTCTTATAAAGAATACTTAGTTCTACTCTCTAATCGATGTGGGATCTCAAGGTTCGTTCTAAGTTCCTGGTTTCTAAACCAACCAAACAAGATTTGAAATTAGATTACATGAACCATAGCCGGTGGCTACCCAGAAAGTTCCTGTAACACTCAAACACAGCACTAGTAGATACTGTCCTCTATGGACTTTTCCTCTCGGGATTTCCCTCAAGATCTTTAAAACACGTCTGCTAGAGAGAGCTAGGAGGTAAAAGAGGACTAAAAAAGAAATGGGTTTTAGAAACCCACAATTGGAAGCAAGCTGAAGCTCCCCACTATTTCCACCCTGCTTTTTCCACCCAAAACCCACACTCCTCCGATACACTTGAAACCACCGCCACCGCCGCCGCCGCCGACCATTTCCTTGTCGAAGACCTCCTGGATTTCTCTAATGATGACCATGTCGTCTTCACCGACGCCACTTCTGACAACCAAACCCCCACCTCAACTGATTCCTCCACTCTCACTTTACTCGAACAACCCAACAATGGACACCACAATTACAGCTTNCACCACAATTATAGCTACCTCGACGCCAATTTCTCCACCGACCTTCCTGTTCCGGTAATCATCAACATTCTTCATTTCTCTTACCAATTTTCTTGCGAGAAATCTCTGTAAATTGATGGGTTTTTTTTGTTCTTTTTCAATTGCTCTGTTTTTTTTTTTTTTTTTTTTTTTTTGTGGGTTTTTTTCCACAGTACGACGACTTAGCAGAGCTCGAATGGCTTTCCAATTTCGTTGAGGATTCTTTCTCCACCGACGATTTGGAGAAGCTGAGTCTGATATCAGGGGTGAATTCCCGGACCGACGACGACGCCGCCGCAAAACCCAGAGAATTTCAAACCGGACACTCCGCCGGTTTCCACCCCGACATGTCGGTACCGGCGAAAGCCGCTCGTAGCAAGAGGTCTCGGGCATGTATATGGAATTCCAGAGTATCCGTACTCTCCCCGACAAATTCCTCTTCCGAAACCGACGTTGTCGTCACGCTCACGGCTAAGAAACCATCCAAGAAGAAAGAAACCCCCGACGACACGTTGTCCCCCAGCAATGGGGAAGGTCGGAAATGTCTCCATTGCGCCACCGACAAAACACCTCAGTGGCGAACCGGTCCATTTGGTCCTAAAACCTTGTGTAACGCTTGTGGGGTTCGGTACAAAT
CCGGCCGCCTCGTACCGGAGTATCGACCAGCTGCAAGCCCGACATTCGTGCTGACAAAACACTCCAATTCTCACCGGAAAGTCATGGAACTCCGGCGACAAAAGGAAATGATGAGAGCACAACAACAACAACAACAACATCTNTCCGGCGACAAAAGGAAATGA

mRNA sequence

ATGGCGATGGCGGTTCCAATAGGCCAATATGCAGAGAATGTTGAGAAAAATCTGAGGTTTCCTCTGCCAATTCATTCTGAAGCTCCCCACTATTTCCACCCTGCTTTTTCCACCCAAAACCCACACTCCTCCGATACACTTGAAACCACCGCCACCGCCGCCGCCGCCGACCATTTCCTTGTCGAAGACCTCCTGGATTTCTCTAATGATGACCATGTCGTCTTCACCGACGCCACTTCTGACAACCAAACCCCCACCTCAACTGATTCCTCCACTCTCACTTTACTCGAACAACCCAACAATGGACACCACAATTACAGCTTNCACCACAATTATAGCTACCTCGACGCCAATTTCTCCACCGACCTTCCTGTTCCGTACGACGACTTAGCAGAGCTCGAATGGCTTTCCAATTTCGTTGAGGATTCTTTCTCCACCGACGATTTGGAGAAGCTGAGTCTGATATCAGGGGTGAATTCCCGGACCGACGACGACGCCGCCGCAAAACCCAGAGAATTTCAAACCGGACACTCCGCCGGTTTCCACCCCGACATGTCGGTACCGGCGAAAGCCGCTCGTAGCAAGAGGTCTCGGGCATGTATATGGAATTCCAGAGTATCCGTACTCTCCCCGACAAATTCCTCTTCCGAAACCGACGTTGTCGTCACGCTCACGGCTAAGAAACCATCCAAGAAGAAAGAAACCCCCGACGACACGTTGTCCCCCAGCAATGGGGAAGGTCGGAAATGTCTCCATTGCGCCACCGACAAAACACCTCAGTGGCGAACCGGTCCATTTGGTCCTAAAACCTTGTGTAACGCTTGTGGGGTTCGGTACAAATCCGGCCGCCTCGTACCGGAGTATCGACCAGCTGCAAGCCCGACATTCGTGCTGACAAAACACTCCAATTCTCACCGGAAAGTCATGGAACTCCGGCGACAAAAGGAAATGATGAGAGCACAACAACAACAACAACAACATCTNTCCGGCGACAAAAGGAAATGA

Coding sequence (CDS)

ATGGCGATGGCGGTTCCAATAGGCCAATATGCAGAGAATGTTGAGAAAAATCTGAGGTTTCCTCTGCCAATTCATTCTGAAGCTCCCCACTATTTCCACCCTGCTTTTTCCACCCAAAACCCACACTCCTCCGATACACTTGAAACCACCGCCACCGCCGCCGCCGCCGACCATTTCCTTGTCGAAGACCTCCTGGATTTCTCTAATGATGACCATGTCGTCTTCACCGACGCCACTTCTGACAACCAAACCCCCACCTCAACTGATTCCTCCACTCTCACTTTACTCGAACAACCCAACAATGGACACCACAATTACAGCTTNCACCACAATTATAGCTACCTCGACGCCAATTTCTCCACCGACCTTCCTGTTCCGTACGACGACTTAGCAGAGCTCGAATGGCTTTCCAATTTCGTTGAGGATTCTTTCTCCACCGACGATTTGGAGAAGCTGAGTCTGATATCAGGGGTGAATTCCCGGACCGACGACGACGCCGCCGCAAAACCCAGAGAATTTCAAACCGGACACTCCGCCGGTTTCCACCCCGACATGTCGGTACCGGCGAAAGCCGCTCGTAGCAAGAGGTCTCGGGCATGTATATGGAATTCCAGAGTATCCGTACTCTCCCCGACAAATTCCTCTTCCGAAACCGACGTTGTCGTCACGCTCACGGCTAAGAAACCATCCAAGAAGAAAGAAACCCCCGACGACACGTTGTCCCCCAGCAATGGGGAAGGTCGGAAATGTCTCCATTGCGCCACCGACAAAACACCTCAGTGGCGAACCGGTCCATTTGGTCCTAAAACCTTGTGTAACGCTTGTGGGGTTCGGTACAAATCCGGCCGCCTCGTACCGGAGTATCGACCAGCTGCAAGCCCGACATTCGTGCTGACAAAACACTCCAATTCTCACCGGAAAGTCATGGAACTCCGGCGACAAAAGGAAATGATGAGAGCACAACAACAACAACAACAACATCTNTCCGGCGACAAAAGGAAATGA

Protein sequence

MAMAVPIGQYAENVEKNLRFPLPIHSEAPHYFHPAFSTQNPHSSDTLETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYSYLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTGHSAGFHPDMSVPAKAARSKRSRACIWNSRVSVLSPTNSSSETDVVVTLTAKKPSKKKETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEMMRAQQQQQQHLSGDKRK
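
The protein sequence follows from the CDS by codon-wise translation. The sketch below illustrates this with the standard genetic code (an assumption; the page does not say which table was used) and reports any codon containing an ambiguous base such as N as X, which matches the X residues seen in the protein above. The `translate` helper is hypothetical and is applied only to short fragments copied from the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are not in the table and become X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGATGGCGGTTCCAATAGGCCAATAT"))  # first 30 nt of the CDS
print(translate("AATTACAGCTTNCACCAC"))              # fragment containing an N
```

The first call reproduces the opening residues MAMAVPIGQY, and the second yields NYSXHH, showing how the unresolved base in TTN propagates to the protein.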
BLAST of Cp4.1LG06g03840 vs. Swiss-Prot
Match: GAT12_ARATH (GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 1.8e-56
Identity = 153/298 (51.34%), Positives = 174/298 (58.39%), Query Frame = 1
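
The percentages in each HSP header are the match counts divided by the alignment length. The sketch below parses one such header line and recomputes the values as a consistency check. The helper names are hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some report renderings emit.

```python
import re

# Matches e.g. "Identity = 153/298 (51.34%)" and "Positives = 174/298 (58.39%)".
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {'identity': (matches, aligned, pct), 'positives': (...)}."""
    stats = {}
    for label, num, den, pct in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats


def percent(matches, aligned):
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100.0 * matches / aligned, 2)


line = "Identity = 153/298 (51.34%), Positives = 174/298 (58.39%), Query Frame = 1"
stats = parse_hsp_stats(line)
for key, (matches, aligned, reported) in stats.items():
    print(key, percent(matches, aligned), reported)
```

For this HSP, 153 of 298 aligned columns are identical (51.34%) and 174 of 298 are identical or conservative substitutions (58.39%).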

Query: 59  FLVEDLL-DFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYSYLDA 118
           F V+DLL DFSNDD            T T TDSS  +  + P+         H       
Sbjct: 14  FAVDDLLVDFSNDDDEENDVVADSTTTTTITDSSNFSAADLPS--------FHGDVQDGT 73

Query: 119 NFSTDLPVPYDDLA-ELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTG 178
           +FS DL +P DDLA ELEWLSN V++S S +D+ KL LISG  SR D  +     E    
Sbjct: 74  SFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKSRPDPKSDTGSPENPNS 133

Query: 179 HSAGFHPDMSVPAKAARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVTLTAKK------ 238
            S  F  D+SVPAKA RSKRSRA  C W SR  +L  T   S       L++++      
Sbjct: 134 SSPIFTTDVSVPAKA-RSKRSRAAACNWASR-GLLKETFYDSPFTGETILSSQQHLSPPT 193

Query: 239 -------PSKKKETPD-------DTLSPSNG--EGRKCLHCATDKTPQWRTGPFGPKTLC 298
                  P  KK+  D       D  SP +G  E R+CLHCATDKTPQWRTGP GPKTLC
Sbjct: 194 SPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERRCLHCATDKTPQWRTGPMGPKTLC 253

Query: 299 NACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEMMRAQQQQQQHLSG 331
           NACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKVMELRRQKEM RA  +   H  G
Sbjct: 254 NACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMSRAHHEFIHHHHG 301

BLAST of Cp4.1LG06g03840 vs. Swiss-Prot
Match: GATA9_ARATH (GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 9.4e-53
Identity = 142/282 (50.35%), Positives = 173/282 (61.35%), Query Frame = 1

Query: 48  ETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYS 107
           E    A   D F+V+DLLDFSNDD  V      D+   T  DSSTL+       G    S
Sbjct: 7   ELFLVAGNPDSFVVDDLLDFSNDDGEV------DDGLNTLPDSSTLS------TGTLTDS 66

Query: 108 XHHNYSYLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGV-NSRTDDDA 167
            + +  + D    +DL +P DD+AELEWLSNFVE+SF+ +D +KL L SG+ N +T    
Sbjct: 67  SNSSSLFTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLKNPQTTGST 126

Query: 168 AA---KPR-EFQTGHSAGFHPDMSVPAKAARSKRSR--ACIWNSRVSVLSPTNSSSETDV 227
                KP  E           +++VPAKA RSKRSR  A  W SR+  L+    S ET+ 
Sbjct: 127 LTHLIKPEPELDHQFIDIDESNVAVPAKA-RSKRSRSAASTWASRLLSLA---DSDETN- 186

Query: 228 VVTLTAKKPSKKKETPD-----DTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNAC 287
                 KK  ++ +  D     D     +G GR+CLHCAT+KTPQWRTGP GPKTLCNAC
Sbjct: 187 -----PKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNAC 246

Query: 288 GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEM 318
           GVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKVMELRRQKEM
Sbjct: 247 GVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM 266

BLAST of Cp4.1LG06g03840 vs. Swiss-Prot
Match: GATA2_ARATH (GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 1.8e-43
Identity = 125/287 (43.55%), Positives = 160/287 (55.75%), Query Frame = 1

Query: 54  AAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNY---SXHH 113
           ++ D   ++DLLDFSN+D  +F+ ++S   T  +T SS+    + P+  HH+    + HH
Sbjct: 7   SSPDLLRIDDLLDFSNED--IFSASSSGGST-AATSSSSFPPPQNPSFHHHHLPSSADHH 66

Query: 114 NYSYLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKP 173
           ++ +       D+ VP DD A LEWLS FV+DSF+                   D  A P
Sbjct: 67  SFLH-------DICVPSDDAAHLEWLSQFVDDSFA-------------------DFPANP 126

Query: 174 REFQTGHSAGFHPDMSVPAKAARSKRSRACIWNSRVSVLSPTNSSSETDVVVTLTAKKPS 233
                G       + S P K  RSKRSRA          SP    SE   + +    KP 
Sbjct: 127 LG---GTMTSVKTETSFPGKP-RSKRSRAPA--PFAGTWSPMPLESEHQQLHSAAKFKPK 186

Query: 234 KK----------KETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYK 293
           K+          +     + +   G  R+C HCA++KTPQWRTGP GPKTLCNACGVR+K
Sbjct: 187 KEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFK 246

Query: 294 SGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEMMRAQQQQQQH 328
           SGRLVPEYRPA+SPTFVLT+HSNSHRKVMELRRQKE+MR  QQ Q H
Sbjct: 247 SGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQVQLH 258

BLAST of Cp4.1LG06g03840 vs. Swiss-Prot
Match: GATA4_ARATH (GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 7.5e-42
Identity = 120/267 (44.94%), Positives = 150/267 (56.18%), Query Frame = 1

Query: 54  AAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYS 113
           ++ D   ++DLLDFSND+  +F    S + T TS+ +S+    E P +     S  +   
Sbjct: 7   SSPDLLRIDDLLDFSNDE--IF----SSSSTVTSSAASSAASSENPFSFP---SSTYTSP 66

Query: 114 YLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREF 173
            L  +F+ DL VP DD A LEWLS FV+DSFS                   D  A P   
Sbjct: 67  TLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS-------------------DFPANPLTM 126

Query: 174 QTGHSAGFHPDMSVPAKAARSKRSRACIWNSRVSVLSPTNSSSETDVVVTLTAKKPSKKK 233
                    P++S   K  RS+RSRA       SV       SE+++  ++   KP K  
Sbjct: 127 TV------RPEISFTGKP-RSRRSRA----PAPSVAGTWAPMSESELCHSVAKPKPKKVY 186

Query: 234 ETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAAS 293
                T   +    R+C HCA++KTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPA+S
Sbjct: 187 NAESVTADGA----RRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASS 230

Query: 294 PTFVLTKHSNSHRKVMELRRQKEMMRA 321
           PTFVLT+HSNSHRKVMELRRQKE   +
Sbjct: 247 PTFVLTQHSNSHRKVMELRRQKEQQES 230

BLAST of Cp4.1LG06g03840 vs. Swiss-Prot
Match: GATA5_ARATH (GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 1.5e-34
Identity = 122/312 (39.10%), Positives = 155/312 (49.68%), Query Frame = 1

Query: 47  LETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNY 106
           + T     + D F V+DLLD SNDD  VF D  +D +   +         E+PN+     
Sbjct: 29  VTTAQNGFSVDDFSVDDLLDLSNDD--VFADEETDLK---AQHEMVRVSSEEPNDDGDAL 88

Query: 107 SXHHNYSYLDANFS---TDLPVPYDDLAELEWLSNFVEDSFST--------DDLEKLSLI 166
               ++S  D   S   ++L +P DDLA LEWLS+FVEDSF+            EK + +
Sbjct: 89  RRSSDFSGCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYSGPNLTGTPTEKPAWL 148

Query: 167 SG-----VNSRTDDDAAAKPREFQTGHSAGFHPDMSVPAKAARSKRSR--ACIWNSRVSV 226
           +G     V + T++     P                VPAK ARSKR+R    +W+   S 
Sbjct: 149 TGDRKHPVTAVTEETCFKSP----------------VPAK-ARSKRNRNGLKVWSLGSSS 208

Query: 227 LSPTNSSSET------------------DVVVTLTAKKPSKKKETPDDTLSPSNGE---- 286
            S  +SS  T                  + VVT + + P  KK       S  +GE    
Sbjct: 209 SSGPSSSGSTSSSSSGPSSPWFSGAELLEPVVT-SERPPFPKKHKKRSAESVFSGELQQL 268

Query: 287 --GRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNS 317
              RKC HC   KTPQWR GP G KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN 
Sbjct: 269 QPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNH 317

BLAST of Cp4.1LG06g03840 vs. TrEMBL
Match: A0A0A0LH13_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G895650 PE=4 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 2.0e-118
Identity = 248/322 (77.02%), Positives = 262/322 (81.37%), Query Frame = 1

Query: 27  EAPHYFHPAFS----TQNPHSSDTLETTATAA---AADHFLVEDLLDFSNDDHVVFTDAT 86
           E P YF P FS    T++  SSD  +T   AA    ADHF+VEDLLDFSNDD VVFTD T
Sbjct: 2   EGPEYFQPGFSSQFSTEDRQSSDANKTNTAAAPPTTADHFIVEDLLDFSNDDDVVFTDGT 61

Query: 87  SDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYSYLDANFSTDLPVPYDDLAELEWLSNF 146
            DNQTPTSTDSSTLTLL+  N+ + N    HNY + DANFSTDL VPYDDLAELEWLSNF
Sbjct: 62  FDNQTPTSTDSSTLTLLDSCNS-YPNTGNAHNYHFADANFSTDLGVPYDDLAELEWLSNF 121

Query: 147 VEDSFSTDDLEKLSLISGVNSRTD--DDAAAKPREFQTG----HSAGFHPDMSV-PAKAA 206
           VEDSFSTDDLEKLSLISG+NSR D  DD A+K REFQTG    HS GF  +MSV PAKAA
Sbjct: 122 VEDSFSTDDLEKLSLISGMNSRADVHDDDASKAREFQTGFNRNHSPGFRHEMSVVPAKAA 181

Query: 207 RSKRSRA--CIWNSRVSVLSPTNSSSETDVVVTLT-----AKKPSKKKETPDDTLSPS-- 266
           RSKRSRA  CIWNSR+SVLSPTNSSSETDVVVTLT     AKK +KKKE PDDT S +  
Sbjct: 182 RSKRSRAAPCIWNSRLSVLSPTNSSSETDVVVTLTPHPNTAKKTTKKKEIPDDTSSAAGN 241

Query: 267 NGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 326
           NGEGRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN
Sbjct: 242 NGEGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 301

BLAST of Cp4.1LG06g03840 vs. TrEMBL
Match: A0A151QPZ2_CAJCA (GATA transcription factor 13 OS=Cajanus cajan GN=KK1_046971 PE=4 SV=1)

HSP 1 Score: 302.0 bits (772), Expect = 9.1e-79
Identity = 173/292 (59.25%), Positives = 210/292 (71.92%), Query Frame = 1

Query: 48  ETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYS 107
           +T  T  ++D F+VEDL DFSN D  V TDAT D+    STDSST+T +E  N+   +  
Sbjct: 19  DTNKTNNSSDPFIVEDLFDFSNHDDAVITDATFDSVPVNSTDSSTVTAVESCNSSSFSDP 78

Query: 108 XHHNYSYLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAA 167
                +  +ANFS DL VPYDD+AELEWLSNFVE+SFS++DL++L LISG+    DD + 
Sbjct: 79  NPATRNLSNANFSGDLCVPYDDIAELEWLSNFVEESFSSEDLQQLQLISGMKGPNDDASE 138

Query: 168 AKPREFQ-TGHSAGFHPDMSVPAKAARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVT- 227
           A+    + T +S  F+ ++SVPA+A RSKRSR   C W SR+ VLSP  SS+E +VV   
Sbjct: 139 ARGFHSEPTRNSPIFNSEVSVPARA-RSKRSRGPPCNWASRLLVLSPATSSTEPEVVAPS 198

Query: 228 --------LTAKKPSKKKETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNAC 287
                   ++AKKP+K      D+   S G+GR+CLHCATDKTPQWRTGP GPKTLCNAC
Sbjct: 199 PTSSLPGPVSAKKPAKASPRKKDSGDGSGGDGRRCLHCATDKTPQWRTGPMGPKTLCNAC 258

Query: 288 GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEMMRAQQQQQQH 328
           GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKV+ELRRQKEM+RAQQ Q QH
Sbjct: 259 GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKEMVRAQQHQHQH 309

BLAST of Cp4.1LG06g03840 vs. TrEMBL
Match: A0A067GJP0_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017390mg PE=4 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 3.5e-78
Identity = 181/340 (53.24%), Positives = 231/340 (67.94%), Query Frame = 1

Query: 27  EAPHYFHPAFSTQ-NPHSSDTLETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTP 86
           E P +F  ++  Q +     +L++  ++   DHF+VE+LLDFSN+D ++   A  D+ T 
Sbjct: 2   EVPEFFQGSYCAQFSAEKHHSLDSNKSSNGGDHFIVEELLDFSNEDAILTDAAAFDDVTA 61

Query: 87  TSTDSSTLTLLEQPNNGHH-----NYSXHHN--YSYLDANFSTDLPVPYDDLAELEWLSN 146
            STDSST+T+++  N+        N+   +N   ++ DA+FS DL VPYDDLAELEWLSN
Sbjct: 62  NSTDSSTVTVVDSCNSSSFSGCGPNFPGENNGCRNFSDAHFSGDLCVPYDDLAELEWLSN 121

Query: 147 FVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTGHSAGFH---------------- 206
            VE+SFS +DL+KL LISG+ +R+D   +++ R+FQ G +  +H                
Sbjct: 122 IVEESFSCEDLQKLQLISGMKARSDH--SSETRQFQPGTNRIYHGSTNTSNNTNANPNNP 181

Query: 207 ---PDMSVPAKAARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVT------LTAKKP-- 266
              P+M+VPAKA RSKRSRA  C W SR+ VLSP  S+SE +++ T      L  KK   
Sbjct: 182 VFNPEMAVPAKA-RSKRSRAAPCSWASRLLVLSPPESTSEPEIIPTGPPPPPLQGKKSVK 241

Query: 267 ---SKKKETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVP 326
              SKKK++ D+     NGEGRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVP
Sbjct: 242 ACGSKKKDSGDE----GNGEGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 301

BLAST of Cp4.1LG06g03840 vs. TrEMBL
Match: V4WC04_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008690mg PE=4 SV=1)

HSP 1 Score: 298.1 bits (762), Expect = 1.3e-77
Identity = 181/340 (53.24%), Positives = 231/340 (67.94%), Query Frame = 1

Query: 27  EAPHYFHPAFSTQ-NPHSSDTLETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTP 86
           E P +F  ++  Q +     +L++  ++   DHF+VE+LLDFSN+D ++   A  D+ T 
Sbjct: 2   EVPEFFQGSYCAQFSAEKHHSLDSNKSSNGGDHFIVEELLDFSNEDAILTDAAAFDDVTA 61

Query: 87  TSTDSSTLTLLEQPNNGHH-----NYSXHHN--YSYLDANFSTDLPVPYDDLAELEWLSN 146
            STDSST+T+++  N+        N+   +N   ++ DA+FS DL VPYDDLAELEWLSN
Sbjct: 62  NSTDSSTVTVVDSCNSSSFSGCGPNFPGENNGCRNFSDAHFSGDLCVPYDDLAELEWLSN 121

Query: 147 FVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTG-------------------HSA 206
            VE+SFS +DL+KL LISG+ +R+D   +++  +FQ G                   ++ 
Sbjct: 122 IVEESFSCEDLQKLQLISGMKARSDH--SSETCQFQPGTNRINHGSTNTSNNTNANPNNP 181

Query: 207 GFHPDMSVPAKAARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVT------LTAKKP-- 266
            F+P+M+VPAKA RSKRSRA  C W SR+ VLSP  S+SE +++ T      L  KK   
Sbjct: 182 VFNPEMAVPAKA-RSKRSRAAPCSWASRLLVLSPPESTSEPEIIPTGLPPPPLQGKKSVK 241

Query: 267 ---SKKKETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVP 326
              SKKK++ D+     NGEGRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVP
Sbjct: 242 ACGSKKKDSGDE----GNGEGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVP 301

BLAST of Cp4.1LG06g03840 vs. TrEMBL
Match: A0A061GK48_THECC (GATA transcription factor 9, putative OS=Theobroma cacao GN=TCM_037205 PE=4 SV=1)

HSP 1 Score: 291.2 bits (744), Expect = 1.6e-75
Identity = 178/310 (57.42%), Positives = 210/310 (67.74%), Query Frame = 1

Query: 54  AAADHFLVEDLLDFSNDDHVVFTDATSDNQTPT--STDSSTLTLLEQPNNGHHNYSXHHN 113
           AA DHF+VEDLLDFSN+D V+ TD T D+      STDSST+T ++  N+   +     N
Sbjct: 22  AAGDHFIVEDLLDFSNEDAVI-TDGTFDSSVAGGHSTDSSTVTAVDSCNSSSLS-GCEPN 81

Query: 114 YS-------YLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDD 173
           +        + D  F+ DL VPYDDLAELEWLSNFVE+SFS++DL+KL LISG+ +R D+
Sbjct: 82  FEGDMGCRGFTDGQFAGDLCVPYDDLAELEWLSNFVEESFSSEDLQKLQLISGMKTRPDE 141

Query: 174 DAAA------------------KPREFQTGHSAGFHPDMSVPAKAARSKRSRACI--WNS 233
            + +                          ++  FHPDMSVPAKA RSKRSRA    W S
Sbjct: 142 SSQSGGFQPVITNQMHHVIENGDTEHGNNNNNPSFHPDMSVPAKA-RSKRSRAAPLNWAS 201

Query: 234 RVSVLSPTNSSSETDVVVTLT-------AKKPSK-KKETPDDTLSPSNGEGRKCLHCATD 293
           R+ VLSPT SSSE D+VV +         KKP K KK+   +    +N +GRKCLHCATD
Sbjct: 202 RLLVLSPTTSSSEPDIVVPVQPPPPNHPGKKPVKTKKKDGGEGGGLANSDGRKCLHCATD 261

Query: 294 KTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKE 327
           KTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKV+ELRRQKE
Sbjct: 262 KTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRRQKE 321

BLAST of Cp4.1LG06g03840 vs. TAIR10
Match: AT5G25830.1 (AT5G25830.1 GATA transcription factor 12)

HSP 1 Score: 221.1 bits (562), Expect = 1.0e-57
Identity = 153/298 (51.34%), Positives = 174/298 (58.39%), Query Frame = 1

Query: 59  FLVEDLL-DFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYSYLDA 118
           F V+DLL DFSNDD            T T TDSS  +  + P+         H       
Sbjct: 14  FAVDDLLVDFSNDDDEENDVVADSTTTTTITDSSNFSAADLPS--------FHGDVQDGT 73

Query: 119 NFSTDLPVPYDDLA-ELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTG 178
           +FS DL +P DDLA ELEWLSN V++S S +D+ KL LISG  SR D  +     E    
Sbjct: 74  SFSGDLCIPSDDLADELEWLSNIVDESLSPEDVHKLELISGFKSRPDPKSDTGSPENPNS 133

Query: 179 HSAGFHPDMSVPAKAARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVTLTAKK------ 238
            S  F  D+SVPAKA RSKRSRA  C W SR  +L  T   S       L++++      
Sbjct: 134 SSPIFTTDVSVPAKA-RSKRSRAAACNWASR-GLLKETFYDSPFTGETILSSQQHLSPPT 193

Query: 239 -------PSKKKETPD-------DTLSPSNG--EGRKCLHCATDKTPQWRTGPFGPKTLC 298
                  P  KK+  D       D  SP +G  E R+CLHCATDKTPQWRTGP GPKTLC
Sbjct: 194 SPPLLMAPLGKKQAVDGGHRRKKDVSSPESGGAEERRCLHCATDKTPQWRTGPMGPKTLC 253

Query: 299 NACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEMMRAQQQQQQHLSG 331
           NACGVRYKSGRLVPEYRPAASPTFVL KHSNSHRKVMELRRQKEM RA  +   H  G
Sbjct: 254 NACGVRYKSGRLVPEYRPAASPTFVLAKHSNSHRKVMELRRQKEMSRAHHEFIHHHHG 301

BLAST of Cp4.1LG06g03840 vs. TAIR10
Match: AT4G32890.1 (AT4G32890.1 GATA transcription factor 9)

HSP 1 Score: 208.8 bits (530), Expect = 5.3e-54
Identity = 142/282 (50.35%), Positives = 173/282 (61.35%), Query Frame = 1

Query: 48  ETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYS 107
           E    A   D F+V+DLLDFSNDD  V      D+   T  DSSTL+       G    S
Sbjct: 7   ELFLVAGNPDSFVVDDLLDFSNDDGEV------DDGLNTLPDSSTLS------TGTLTDS 66

Query: 108 XHHNYSYLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGV-NSRTDDDA 167
            + +  + D    +DL +P DD+AELEWLSNFVE+SF+ +D +KL L SG+ N +T    
Sbjct: 67  SNSSSLFTDGTGFSDLYIPNDDIAELEWLSNFVEESFAGEDQDKLHLFSGLKNPQTTGST 126

Query: 168 AA---KPR-EFQTGHSAGFHPDMSVPAKAARSKRSR--ACIWNSRVSVLSPTNSSSETDV 227
                KP  E           +++VPAKA RSKRSR  A  W SR+  L+    S ET+ 
Sbjct: 127 LTHLIKPEPELDHQFIDIDESNVAVPAKA-RSKRSRSAASTWASRLLSLA---DSDETN- 186

Query: 228 VVTLTAKKPSKKKETPD-----DTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNAC 287
                 KK  ++ +  D     D     +G GR+CLHCAT+KTPQWRTGP GPKTLCNAC
Sbjct: 187 -----PKKKQRRVKEQDFAGDMDVDCGESGGGRRCLHCATEKTPQWRTGPMGPKTLCNAC 246

Query: 288 GVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEM 318
           GVRYKSGRLVPEYRPA+SPTFV+ +HSNSHRKVMELRRQKEM
Sbjct: 247 GVRYKSGRLVPEYRPASSPTFVMARHSNSHRKVMELRRQKEM 266

BLAST of Cp4.1LG06g03840 vs. TAIR10
Match: AT2G45050.1 (AT2G45050.1 GATA transcription factor 2)

HSP 1 Score: 177.9 bits (450), Expect = 1.0e-44
Identity = 125/287 (43.55%), Positives = 160/287 (55.75%), Query Frame = 1

Query: 54  AAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNY---SXHH 113
           ++ D   ++DLLDFSN+D  +F+ ++S   T  +T SS+    + P+  HH+    + HH
Sbjct: 7   SSPDLLRIDDLLDFSNED--IFSASSSGGST-AATSSSSFPPPQNPSFHHHHLPSSADHH 66

Query: 114 NYSYLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKP 173
           ++ +       D+ VP DD A LEWLS FV+DSF+                   D  A P
Sbjct: 67  SFLH-------DICVPSDDAAHLEWLSQFVDDSFA-------------------DFPANP 126

Query: 174 REFQTGHSAGFHPDMSVPAKAARSKRSRACIWNSRVSVLSPTNSSSETDVVVTLTAKKPS 233
                G       + S P K  RSKRSRA          SP    SE   + +    KP 
Sbjct: 127 LG---GTMTSVKTETSFPGKP-RSKRSRAPA--PFAGTWSPMPLESEHQQLHSAAKFKPK 186

Query: 234 KK----------KETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYK 293
           K+          +     + +   G  R+C HCA++KTPQWRTGP GPKTLCNACGVR+K
Sbjct: 187 KEQSGGGGGGGGRHQSSSSETTEGGGMRRCTHCASEKTPQWRTGPLGPKTLCNACGVRFK 246

Query: 294 SGRLVPEYRPAASPTFVLTKHSNSHRKVMELRRQKEMMRAQQQQQQH 328
           SGRLVPEYRPA+SPTFVLT+HSNSHRKVMELRRQKE+MR  QQ Q H
Sbjct: 247 SGRLVPEYRPASSPTFVLTQHSNSHRKVMELRRQKEVMRQPQQVQLH 258

BLAST of Cp4.1LG06g03840 vs. TAIR10
Match: AT3G60530.1 (AT3G60530.1 GATA transcription factor 4)

HSP 1 Score: 172.6 bits (436), Expect = 4.2e-43
Identity = 120/267 (44.94%), Positives = 150/267 (56.18%), Query Frame = 1

Query: 54  AAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYS 113
           ++ D   ++DLLDFSND+  +F    S + T TS+ +S+    E P +     S  +   
Sbjct: 7   SSPDLLRIDDLLDFSNDE--IF----SSSSTVTSSAASSAASSENPFSFP---SSTYTSP 66

Query: 114 YLDANFSTDLPVPYDDLAELEWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREF 173
            L  +F+ DL VP DD A LEWLS FV+DSFS                   D  A P   
Sbjct: 67  TLLTDFTHDLCVPSDDAAHLEWLSRFVDDSFS-------------------DFPANPLTM 126

Query: 174 QTGHSAGFHPDMSVPAKAARSKRSRACIWNSRVSVLSPTNSSSETDVVVTLTAKKPSKKK 233
                    P++S   K  RS+RSRA       SV       SE+++  ++   KP K  
Sbjct: 127 TV------RPEISFTGKP-RSRRSRA----PAPSVAGTWAPMSESELCHSVAKPKPKKVY 186

Query: 234 ETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAAS 293
                T   +    R+C HCA++KTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPA+S
Sbjct: 187 NAESVTADGA----RRCTHCASEKTPQWRTGPLGPKTLCNACGVRYKSGRLVPEYRPASS 230

Query: 294 PTFVLTKHSNSHRKVMELRRQKEMMRA 321
           PTFVLT+HSNSHRKVMELRRQKE   +
Sbjct: 247 PTFVLTQHSNSHRKVMELRRQKEQQES 230

BLAST of Cp4.1LG06g03840 vs. TAIR10
Match: AT5G66320.1 (AT5G66320.1 GATA transcription factor 5)

HSP 1 Score: 148.3 bits (373), Expect = 8.5e-36
Identity = 122/312 (39.10%), Positives = 155/312 (49.68%), Query Frame = 1

Query: 47  LETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPTSTDSSTLTLLEQPNNGHHNY 106
           + T     + D F V+DLLD SNDD  VF D  +D +   +         E+PN+     
Sbjct: 29  VTTAQNGFSVDDFSVDDLLDLSNDD--VFADEETDLK---AQHEMVRVSSEEPNDDGDAL 88

Query: 107 SXHHNYSYLDANFS---TDLPVPYDDLAELEWLSNFVEDSFST--------DDLEKLSLI 166
               ++S  D   S   ++L +P DDLA LEWLS+FVEDSF+            EK + +
Sbjct: 89  RRSSDFSGCDDFGSLPTSELSLPADDLANLEWLSHFVEDSFTEYSGPNLTGTPTEKPAWL 148

Query: 167 SG-----VNSRTDDDAAAKPREFQTGHSAGFHPDMSVPAKAARSKRSR--ACIWNSRVSV 226
           +G     V + T++     P                VPAK ARSKR+R    +W+   S 
Sbjct: 149 TGDRKHPVTAVTEETCFKSP----------------VPAK-ARSKRNRNGLKVWSLGSSS 208

Query: 227 LSPTNSSSET------------------DVVVTLTAKKPSKKKETPDDTLSPSNGE---- 286
            S  +SS  T                  + VVT + + P  KK       S  +GE    
Sbjct: 209 SSGPSSSGSTSSSSSGPSSPWFSGAELLEPVVT-SERPPFPKKHKKRSAESVFSGELQQL 268

Query: 287 --GRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNS 317
              RKC HC   KTPQWR GP G KTLCNACGVRYKSGRL+PEYRPA SPTF    HSN 
Sbjct: 269 QPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNH 317

BLAST of Cp4.1LG06g03840 vs. NCBI nr
Match: gi|778687724|ref|XP_011652614.1| (PREDICTED: GATA transcription factor 9-like [Cucumis sativus])

HSP 1 Score: 433.7 bits (1114), Expect = 2.9e-118
Identity = 248/322 (77.02%), Positives = 262/322 (81.37%), Query Frame = 1

Query: 27  EAPHYFHPAFS----TQNPHSSDTLETTATAA---AADHFLVEDLLDFSNDDHVVFTDAT 86
           E P YF P FS    T++  SSD  +T   AA    ADHF+VEDLLDFSNDD VVFTD T
Sbjct: 2   EGPEYFQPGFSSQFSTEDRQSSDANKTNTAAAPPTTADHFIVEDLLDFSNDDDVVFTDGT 61

Query: 87  SDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYSYLDANFSTDLPVPYDDLAELEWLSNF 146
            DNQTPTSTDSSTLTLL+  N+ + N    HNY + DANFSTDL VPYDDLAELEWLSNF
Sbjct: 62  FDNQTPTSTDSSTLTLLDSCNS-YPNTGNAHNYHFADANFSTDLGVPYDDLAELEWLSNF 121

Query: 147 VEDSFSTDDLEKLSLISGVNSRTD--DDAAAKPREFQTG----HSAGFHPDMSV-PAKAA 206
           VEDSFSTDDLEKLSLISG+NSR D  DD A+K REFQTG    HS GF  +MSV PAKAA
Sbjct: 122 VEDSFSTDDLEKLSLISGMNSRADVHDDDASKAREFQTGFNRNHSPGFRHEMSVVPAKAA 181

Query: 207 RSKRSRA--CIWNSRVSVLSPTNSSSETDVVVTLT-----AKKPSKKKETPDDTLSPS-- 266
           RSKRSRA  CIWNSR+SVLSPTNSSSETDVVVTLT     AKK +KKKE PDDT S +  
Sbjct: 182 RSKRSRAAPCIWNSRLSVLSPTNSSSETDVVVTLTPHPNTAKKTTKKKEIPDDTSSAAGN 241

Query: 267 NGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 326
           NGEGRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN
Sbjct: 242 NGEGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSN 301

BLAST of Cp4.1LG06g03840 vs. NCBI nr
Match: gi|659132192|ref|XP_008466068.1| (PREDICTED: GATA transcription factor 9-like [Cucumis melo])

HSP 1 Score: 428.3 bits (1100), Expect = 1.2e-116
Identity = 245/323 (75.85%), Positives = 262/323 (81.11%), Query Frame = 1

Query: 27  EAPHYFHPAFS----TQNPHSSDTLETTATAA---AADHFLVEDLLDFSNDDHVVFTDAT 86
           E P YF P FS    T++  SSD  +TTA A     ADHF+VEDLLDFSNDD VVFTD  
Sbjct: 2   EGPEYFQPGFSSQFSTEDRQSSDANKTTAAAVPPTTADHFIVEDLLDFSNDDDVVFTDGA 61

Query: 87  SDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHNYSYLDANFSTDLPVPYDDLAELEWLSNF 146
            DNQTPTSTDSS+LTLL+  N+ + N    HNY + DANFSTDL VPYDDLAELEWLSNF
Sbjct: 62  FDNQTPTSTDSSSLTLLDSCNS-YPNTGNAHNYHFADANFSTDLGVPYDDLAELEWLSNF 121

Query: 147 VEDSFSTDDLEKLSLISGVNSRTD--DDAAAKPREFQTG-----HSAGFHPDMSV-PAKA 206
           VEDSFSTDDLEKLSLISG+NS+TD  DD A+K REFQ+G     HS  F  +MSV PAKA
Sbjct: 122 VEDSFSTDDLEKLSLISGMNSQTDVHDDDASKTREFQSGFNPRNHSPAFRHEMSVVPAKA 181

Query: 207 ARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVTLT-----AKKPSKKKETPDDTLSPS- 266
           ARSKRSRA  CIWNSR+SVLSPTNSSSETDVVVTLT     AKK +KKKE PDDT S + 
Sbjct: 182 ARSKRSRAAPCIWNSRLSVLSPTNSSSETDVVVTLTPHPNTAKKTTKKKEIPDDTSSAAG 241

Query: 267 -NGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS 326
            NGEGRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS
Sbjct: 242 NNGEGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHS 301

BLAST of Cp4.1LG06g03840 vs. NCBI nr
Match: gi|1009135304|ref|XP_015884916.1| (PREDICTED: GATA transcription factor 12 [Ziziphus jujuba])

HSP 1 Score: 323.9 bits (829), Expect = 3.2e-85
Identity = 200/336 (59.52%), Positives = 234/336 (69.64%), Query Frame = 1

Query: 27  EAPHYFH----PAFSTQNPHSSDTLETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDN 86
           EAP ++     P F  +  HS+D      TA  ADHF+VEDLLDFSN+D V+ TD   D+
Sbjct: 2   EAPEFYQNSFCPQFVPEKRHSTDN----KTAGGADHFIVEDLLDFSNNDAVI-TDGAFDS 61

Query: 87  QTPTSTDSSTLTLLEQPNNGHHNYSXHHNY-------SYLDANFSTDLPVPYDDLAELEW 146
            T  STDSST+T+++  N+   +     N+       S+ D NFS DL VPYDDLAELEW
Sbjct: 62  VTGNSTDSSTVTVVDSCNSSSFS-GCEPNFVGDIGCRSFTDGNFSGDLCVPYDDLAELEW 121

Query: 147 LSNFVEDSFSTDDLEKLSLISGVN-SRTDDDAAAKPREFQTGHSAG---FHPDMSVPAKA 206
           LSNFVE+SFS+DDL++L LISG+  SRT D+ A+  R FQ   +     F+ +MSVPAKA
Sbjct: 122 LSNFVEESFSSDDLQRLQLISGMKASRTTDEEASDTRHFQPEPNRNAPIFNSEMSVPAKA 181

Query: 207 ARSKRSRA--CIWNSRVSVLSPTN--------SSSETDVVVTL-----------TAKKPS 266
            RSKRSRA  C W SR+ +LSPT         SSSE DVVV+            T K P 
Sbjct: 182 -RSKRSRAAPCNWTSRLLLLSPTTTTTTASTTSSSEADVVVSTPPPPPPNPGKKTVKAPQ 241

Query: 267 KKKETPDDTLSPSNGEGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRP 326
           KKKE+PD   +  +G+GRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRP
Sbjct: 242 KKKESPDS--AAGSGDGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRP 301

BLAST of Cp4.1LG06g03840 vs. NCBI nr
Match: gi|719979656|ref|XP_010249552.1| (PREDICTED: GATA transcription factor 12 [Nelumbo nucifera])

HSP 1 Score: 316.2 bits (809), Expect = 6.7e-83
Identity = 192/322 (59.63%), Positives = 227/322 (70.50%), Query Frame = 1

Query: 27  EAPHYFH--------PAFSTQNPHSSDTLETTATAAAADHFLVEDLLDFSNDDHVVFTDA 86
           EAP +FH        P F+ +  HS             DHF+++DLLDFSN+D V+ TD 
Sbjct: 2   EAPEFFHGGYYRPGNPQFTPEKRHSDPK--------PGDHFIIDDLLDFSNEDAVI-TDG 61

Query: 87  TSDNQTPTSTDSSTLTLLEQPNNGHHNYSXHHN-----YSYLDANFSTDLPVPYDDLAEL 146
           T D  T  STDSST+T+L+  N+       H +      S+ DA FS DL VPYDDLAEL
Sbjct: 62  TFDI-TGNSTDSSTVTVLDSCNSSFSGSDPHFSGDFGCRSFPDAQFSGDLCVPYDDLAEL 121

Query: 147 EWLSNFVEDSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTGHSAG---FHPDMSVPAK 206
           EWLSNFVE+SFS++DL+KL LISG+ +RTD+   ++ REFQ  ++     F P++SVP K
Sbjct: 122 EWLSNFVEESFSSEDLQKLQLISGMKARTDE--VSETREFQPENNRNNPMFRPEISVPGK 181

Query: 207 AARSKRSRA--CIWNSRVSVLSPTNSSSETDVVVTL---TAKKPSKKKETPDDTLSPSNG 266
           A RSKRSRA  C W+SR+ VLSPT SSSE+DV       T K   KK+E+ D+  +  NG
Sbjct: 182 A-RSKRSRAAPCDWSSRLLVLSPTTSSSESDVASNSGKKTTKGAPKKRESSDN--ASGNG 241

Query: 267 EGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSH 326
           EGRKCLHCATDKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSH
Sbjct: 242 EGRKCLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSH 301

Query: 327 RKVMELRRQKEMMRAQQQQQQH 328
           RKVMELRRQKE+ RAQQQQ  H
Sbjct: 302 RKVMELRRQKELQRAQQQQFLH 308

BLAST of Cp4.1LG06g03840 vs. NCBI nr
Match: gi|720035949|ref|XP_010267189.1| (PREDICTED: GATA transcription factor 12-like [Nelumbo nucifera])

HSP 1 Score: 314.7 bits (805), Expect = 1.9e-82
Identity = 186/314 (59.24%), Positives = 224/314 (71.34%), Query Frame = 1

Query: 27  EAPHYFHPAFSTQNPHSSDTLETTATAAAADHFLVEDLLDFSNDDHVVFTDATSDNQTPT 86
           EAP +FH  +           +  A + + DHF+++DLLDFSN+D V+ T+ T D  T  
Sbjct: 2   EAPEFFHGGYCRAGNPQFTPEKRLADSKSGDHFIIDDLLDFSNEDAVI-TEGTFDTITGN 61

Query: 87  STDSSTLTLLEQPNNGHHNYSXHHN-----YSYLDANFSTDLPVPYDDLAELEWLSNFVE 146
           STDSST+T+L+  N+         +      ++ DA FS DL VPYDDLAELEWLSNFVE
Sbjct: 62  STDSSTVTVLDSCNSSFSGSDTQISGDLGCRNFPDAQFSGDLCVPYDDLAELEWLSNFVE 121

Query: 147 DSFSTDDLEKLSLISGVNSRTDDDAAAKPREFQTGHSAG---FHPDMSVPAKAARSKRSR 206
           +SFS++DL+KL LISG+ +RTDD   +  REFQ  ++     F P+MSVPAKA RSKRSR
Sbjct: 122 ESFSSEDLQKLQLISGMKARTDD--VSDTREFQPENNKNNSIFRPEMSVPAKA-RSKRSR 181

Query: 207 ACI--WNSRVSVLSPTNSSSETDVVVTL---TAKKPSKKKETPDDTLSPSNGEGRKCLHC 266
           A    W+SR+ VLSPT SSSE+DV  +    + K   KK+E+ D   +  NGEGRKCLHC
Sbjct: 182 AAPGNWSSRLLVLSPTTSSSESDVATSSGKKSTKGTPKKRESSDG--ASGNGEGRKCLHC 241

Query: 267 ATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVMELRR 326
           ATDKTPQWRTGP GPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKV+ELRR
Sbjct: 242 ATDKTPQWRTGPMGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSHRKVLELRR 301

Query: 327 QKEMMRAQQQQQQH 328
           QKE+ RAQQQQ  H
Sbjct: 302 QKELQRAQQQQFLH 309
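
The percentages in the HSP statistics line follow directly from the match counts over the alignment length. A minimal Python sketch of that arithmetic, assuming the statistics line is available as a plain string (the function name `parse_fractions` is hypothetical, not part of any BLAST tool):

```python
import re

# Statistics line transcribed from the HSP above, with "Positives" spelled in full.
stats_line = "Identity = 186/314 (59.24%), Positives = 224/314 (71.34%), Query Frame = 1"

def parse_fractions(line):
    """Return {label: (count, alignment_length, percent)} for each 'Label = a/b' pair."""
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        count, length = int(num), int(den)
        result[label] = (count, length, round(100 * count / length, 2))
    return result

fractions = parse_fractions(stats_line)
for label, (count, length, pct) in fractions.items():
    print(f"{label}: {count}/{length} = {pct:.2f}%")
```

The "Query Frame = 1" field has no `a/b` fraction, so the pattern skips it.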

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
GAT12_ARATH  1.8e-56   51.34         GATA transcription factor 12 OS=Arabidopsis thaliana GN=GATA12 PE=2 SV=1
GATA9_ARATH  9.4e-53   50.35         GATA transcription factor 9 OS=Arabidopsis thaliana GN=GATA9 PE=2 SV=1
GATA2_ARATH  1.8e-43   43.55         GATA transcription factor 2 OS=Arabidopsis thaliana GN=GATA2 PE=2 SV=1
GATA4_ARATH  7.5e-42   44.94         GATA transcription factor 4 OS=Arabidopsis thaliana GN=GATA4 PE=2 SV=1
GATA5_ARATH  1.5e-34   39.10         GATA transcription factor 5 OS=Arabidopsis thaliana GN=GATA5 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0LH13_CUCSA  2.0e-118  77.02         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G895650 PE=4 SV=1
A0A151QPZ2_CAJCA  9.1e-79   59.25         GATA transcription factor 13 OS=Cajanus cajan GN=KK1_046971 PE=4 SV=1
A0A067GJP0_CITSI  3.5e-78   53.24         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017390mg PE=4 SV=1
V4WC04_9ROSI      1.3e-77   53.24         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008690mg PE=4 SV=1
A0A061GK48_THECC  1.6e-75   57.42         GATA transcription factor 9, putative OS=Theobroma cacao GN=TCM_037205 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G25830.1  1.0e-57   51.34         GATA transcription factor 12
AT4G32890.1  5.3e-54   50.35         GATA transcription factor 9
AT2G45050.1  1.0e-44   43.55         GATA transcription factor 2
AT3G60530.1  4.2e-43   44.94         GATA transcription factor 4
AT5G66320.1  8.5e-36   39.10         GATA transcription factor 5

Match Name                         E-value   Identity (%)  Description
gi|778687724|ref|XP_011652614.1|   2.9e-118  77.02         PREDICTED: GATA transcription factor 9-like [Cucumis sativus]
gi|659132192|ref|XP_008466068.1|   1.2e-116  75.85         PREDICTED: GATA transcription factor 9-like [Cucumis melo]
gi|1009135304|ref|XP_015884916.1|  3.2e-85   59.52         PREDICTED: GATA transcription factor 12 [Ziziphus jujuba]
gi|719979656|ref|XP_010249552.1|   6.7e-83   59.63         PREDICTED: GATA transcription factor 12 [Nelumbo nucifera]
gi|720035949|ref|XP_010267189.1|   1.9e-82   59.24         PREDICTED: GATA transcription factor 12-like [Nelumbo nucifera]
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0043565  sequence-specific DNA binding
GO:0008270  zinc ion binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0045893  positive regulation of transcription, DNA-templated
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR016679   TF_GATA_pln
IPR013088   Znf_NHR/GATA
IPR000679   Znf_GATA
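
The Znf_GATA annotation corresponds to the cysteine-rich block that is fully conserved in the alignments above. As a rough illustration, the Python sketch below searches a query segment for a C-X2-C-X18-C-X2-C spacing. The 18-residue loop is an assumption based on the spacing commonly reported for plant GATA fingers, not something stated in this record, and the regular expression is a simplification rather than the PROSITE or Pfam model.

```python
import re

# Query residues copied from the alignment row that covers the conserved block.
segment = "EGRKCLHCATDKTPQWRTGPFGPKTLCNACGVRYKSGRLVPEYRPAASPTFVLTKHSNSH"

# C-X2-C-X18-C-X2-C: four cysteines with fixed spacing.
# The loop length of 18 is an assumption (typical plant GATA finger).
gata_finger = re.compile(r"C.{2}C.{18}C.{2}C")

match = gata_finger.search(segment)
if match:
    # Offset is 0-based within this segment only, not a protein coordinate.
    print(f"finger-like motif {match.group()} at segment offset {match.start()}")
else:
    print("no C-X2-C-X18-C-X2-C spacing found")
```

The offset is relative to the copied segment; mapping it to protein coordinates would need the full peptide sequence.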
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0030154 cell differentiation
biological_process GO:0045944 positive regulation of transcription from RNA polymerase II promoter
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0003682 chromatin binding
molecular_function GO:0000977 RNA polymerase II regulatory region sequence-specific DNA binding
molecular_function GO:0001085 RNA polymerase II transcription factor binding
molecular_function GO:0001228 transcriptional activator activity, RNA polymerase II transcription regulatory region sequence-specific binding
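
The GO Assignments table lists more accessions than the per-vocabulary term lists earlier on the page. A small Python sketch, with both accession sets typed in by hand from this record, shows which terms appear only in the assignments table:

```python
# Accessions from the per-vocabulary term lists.
summary_terms = {
    "GO:0005634", "GO:0003677", "GO:0043565", "GO:0008270",
    "GO:0003700", "GO:0045893", "GO:0006355",
}

# Accessions from the GO Assignments table.
assigned_terms = {
    "GO:0045893", "GO:0006355", "GO:0030154", "GO:0045944",
    "GO:0005634", "GO:0005667", "GO:0043565", "GO:0003700",
    "GO:0008270", "GO:0003677", "GO:0003682", "GO:0000977",
    "GO:0001085", "GO:0001228",
}

only_assigned = sorted(assigned_terms - summary_terms)
missing_from_assigned = sorted(summary_terms - assigned_terms)

print("only in GO Assignments:", only_assigned)
print("only in term lists:", missing_from_assigned)
```

Every accession in the term lists also appears in the assignments table, so the second list comes out empty.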

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG06g03840.1  Cp4.1LG06g03840.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                    Source   Source Term       Source Description                                 Alignment
IPR000679  Zinc finger, GATA-type             PFAM     PF00320           GATA                                               coord: 250..284; score: 7.8
IPR000679  Zinc finger, GATA-type             SMART    SM00401           GATA_3                                             coord: 244..294; score: 6.0
IPR000679  Zinc finger, GATA-type             PROSITE  PS00344           GATA_ZN_FINGER_1                                   coord: 250..275; scor
IPR000679  Zinc finger, GATA-type             PROFILE  PS50114           GATA_ZN_FINGER_2                                   coord: 244..280; score: 12
IPR013088  Zinc finger, NHR/GATA-type         GENE3D   G3DSA:3.30.50.10  -                                                  coord: 248..282; score: 6.6
IPR016679  Transcription factor, GATA, plant  PIR      PIRSF016992       Txn_fac_GATA_plant                                 coord: 40..334; score: 3.8
None       No IPR available                   PANTHER  PTHR10071         TRANSCRIPTION FACTOR GATA GATA BINDING FACTOR      coord: 31..327; score: 2.8E
None       No IPR available                   PANTHER  PTHR10071:SF202   GATA TRANSCRIPTION FACTOR 12                       coord: 31..327; score: 2.8E
None       No IPR available                   unknown  SSF57716          Glucocorticoid receptor-like (DNA-binding domain)  coord: 245..308; score: 2.14
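
The four IPR000679 signatures place the GATA zinc finger at slightly different coordinates. One way to summarise them is to take the region shared by all signatures and the overall span. The Python sketch below does this with coordinates copied from the table, assuming they are inclusive residue positions on the predicted protein (the helper name `core_and_span` is hypothetical):

```python
# (start, end) coordinates of the IPR000679 signatures, copied from the table.
signatures = {
    "PFAM PF00320": (250, 284),
    "SMART SM00401": (244, 294),
    "PROSITE PS00344": (250, 275),
    "PROFILE PS50114": (244, 280),
}

def core_and_span(intervals):
    """Return (core, span): the region common to all intervals and their overall extent."""
    starts = [start for start, _ in intervals]
    ends = [end for _, end in intervals]
    core = (max(starts), min(ends))
    if core[0] > core[1]:
        core = None  # the intervals share no common position
    span = (min(starts), max(ends))
    return core, span

core, span = core_and_span(list(signatures.values()))
print("shared by all signatures:", core)
print("overall span:", span)
```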

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG06g03840  Cp4.1LG11g05190  Cucurbita pepo (Zucchini)  cpecpeB135
Cp4.1LG06g03840  Cp4.1LG07g02390  Cucurbita pepo (Zucchini)  cpecpeB511
The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG06g03840  Cucurbita pepo (Zucchini)  cpecpeB461
Cp4.1LG06g03840  Cucurbita maxima (Rimu)    cmacpeB509
Cp4.1LG06g03840  Cucurbita moschata (Rifu)  cmocpeB466
Cp4.1LG06g03840  Silver-seed gourd          carcpeB0085
Cp4.1LG06g03840  Silver-seed gourd          carcpeB0949