Csa4G023020 (gene) Cucumber (Chinese Long) v2

Name: Csa4G023020
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Ethylene-responsive transcription factor; contains IPR016177 (DNA-binding, integrase-type)
Location: Chr4 : 2595006 .. 2595755 (-)
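The location line gives a start, an end, and a strand. The following is a minimal Python sketch of how the span length follows from those endpoints, assuming the coordinates are 1-based and inclusive (the page does not state the convention). The helpers `parse_location` and `span_length` are illustrative names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr4 : 2595006 .. 2595755 (-)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("Chr4 : 2595006 .. 2595755 (-)")
print(seqid, strand, span_length(start, end))  # Chr4 - 750
```

Under that assumption the feature spans 750 bases on the minus strand of Chr4.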
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AAAAAAAAAAAACAAAACCCATCTGATCTTAAGACTTAAAAGTTAAATCCTTTAATCTCTCTCTGAAAATGGATTCCTCAGATGAATTCTTAACCATAGAATTCATCACCCAATTTCTATTGGGAGATTTCTCAGATCATCAAACAGACTCCCCCTTCCTCCACCCCATCAAACTGGAAGATTTCTTCTTCGATTCCCCAATTCCGCCGCTACCGCCGCCGCCAGAGATCTCCGGCAACGACACGAAGCCGGGCAAAGTTGTAGACCCATCGACGCTGCCTGATCATCGTCCCGACATGTCGACACAGGCCTGCGGGGCCGAGACGAAAGTCGCCGTGGTAGAGGCGAGTGGCGGAAAGGGGCGGAGGCATTTCCGGGGAGTACGGCGACGGCCATGGGGTAAATTCGCGGCGGAGATTCGTGACCCGACCCGGAAAGGGAGCCGGGTCTGGTTGGGGACTTACGACAGTGACATAGACGCGGCTAAGGCTTATGACTGTGCGGCGTTCAGGCTGAGAGGGAGGAAAGCCATTCTGAATTTTCCATTGGAGGCCGGAGAACCTGACCCGCCGGCGGCGGCTGACCGGAAAAGGGGGAGGGGGCAGAAATGGAGAAATATTCCGAAGGCATTGATGGCAACAAACGAAAAGTGAATGCTAATAATTGATGGGAAATTAAATTGAAAACAATAATCAGATATGAACTATCTTTTGTGATTGGGTTAAGTTAAATGTTTCTTTTCTTTTTTTT

mRNA sequence

ATGGATTCCTCAGATGAATTCTTAACCATAGAATTCATCACCCAATTTCTATTGGGAGATTTCTCAGATCATCAAACAGACTCCCCCTTCCTCCACCCCATCAAACTGGAAGATTTCTTCTTCGATTCCCCAATTCCGCCGCTACCGCCGCCGCCAGAGATCTCCGGCAACGACACGAAGCCGGGCAAAGTTGTAGACCCATCGACGCTGCCTGATCATCGTCCCGACATGTCGACACAGGCCTGCGGGGCCGAGACGAAAGTCGCCGTGGTAGAGGCGAGTGGCGGAAAGGGGCGGAGGCATTTCCGGGGAGTACGGCGACGGCCATGGGGTAAATTCGCGGCGGAGATTCGTGACCCGACCCGGAAAGGGAGCCGGGTCTGGTTGGGGACTTACGACAGTGACATAGACGCGGCTAAGGCTTATGACTGTGCGGCGTTCAGGCTGAGAGGGAGGAAAGCCATTCTGAATTTTCCATTGGAGGCCGGAGAACCTGACCCGCCGGCGGCGGCTGACCGGAAAAGGGGGAGGGGGCAGAAATGGAGAAATATTCCGAAGGCATTGATGGCAACAAACGAAAAGTGA

Coding sequence (CDS)

ATGGATTCCTCAGATGAATTCTTAACCATAGAATTCATCACCCAATTTCTATTGGGAGATTTCTCAGATCATCAAACAGACTCCCCCTTCCTCCACCCCATCAAACTGGAAGATTTCTTCTTCGATTCCCCAATTCCGCCGCTACCGCCGCCGCCAGAGATCTCCGGCAACGACACGAAGCCGGGCAAAGTTGTAGACCCATCGACGCTGCCTGATCATCGTCCCGACATGTCGACACAGGCCTGCGGGGCCGAGACGAAAGTCGCCGTGGTAGAGGCGAGTGGCGGAAAGGGGCGGAGGCATTTCCGGGGAGTACGGCGACGGCCATGGGGTAAATTCGCGGCGGAGATTCGTGACCCGACCCGGAAAGGGAGCCGGGTCTGGTTGGGGACTTACGACAGTGACATAGACGCGGCTAAGGCTTATGACTGTGCGGCGTTCAGGCTGAGAGGGAGGAAAGCCATTCTGAATTTTCCATTGGAGGCCGGAGAACCTGACCCGCCGGCGGCGGCTGACCGGAAAAGGGGGAGGGGGCAGAAATGGAGAAATATTCCGAAGGCATTGATGGCAACAAACGAAAAGTGA

Protein sequence

MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQKWRNIPKALMATNEK*
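The protein sequence is the conceptual translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and, to stay short, translates only the first ten and the last six codons of the CDS shown above. The function `translate` is an illustrative helper, not a database tool.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 18 nucleotides of the CDS listed above.
print(translate("ATGGATTCCTCAGATGAATTCTTAACCATA"))  # MDSSDEFLTI
print(translate("GCAACAAACGAAAAGTGA"))              # ATNEK*
```

Both fragments match the start and end of the listed protein, including the terminal `*` for the TGA stop codon.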
BLAST of Csa4G023020 vs. Swiss-Prot
Match: EF106_ARATH (Ethylene-responsive transcription factor ERF106 OS=Arabidopsis thaliana GN=ERF106 PE=2 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 9.1e-32
Identity = 79/189 (41.80%), Positives = 111/189 (58.73%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDF-------SDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPE 60
           M S +E   +E I   LL D         D   D+ F+  +    +  +  +P   P   
Sbjct: 1   MASFEESSDLEAIQSHLLEDLLVCDGFMGDFDFDASFVSGL----WCIEPHVPKQEPDSP 60

Query: 61  ISGNDTKPGKV--VDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGR---RHFRGVRRR 120
           +   D+   +   V+  +     P++++ +   ET  +V +A   +     RH+RGVRRR
Sbjct: 61  VLDPDSFVNEFLQVEGESSSSSSPELNSSSSTYETDQSVKKAERFEEEVDARHYRGVRRR 120

Query: 121 PWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPP 178
           PWGKFAAEIRDP +KGSR+WLGT++SD+DAA+AYDCAAF+LRGRKA+LNFPL+AG+ + P
Sbjct: 121 PWGKFAAEIRDPAKKGSRIWLGTFESDVDAARAYDCAAFKLRGRKAVLNFPLDAGKYEAP 180
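
The percentages in an HSP header are the identical or positive-scoring positions divided by the alignment length. A small Python sketch that recomputes them from the counts follows; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, and the pattern also accepts the "Postives" spelling that some BLAST renderings use for this label.

```python
import re

# Matches e.g. "Identity = 79/189 (41.80%), Positives = 111/189 (58.73%)".
# "Posi?tives" tolerates the misspelling "Postives" seen in some renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (
        f"{100 * ident / ident_len:.2f}",
        f"{100 * pos / pos_len:.2f}",
    )

line = "Identity = 79/189 (41.80%), Positives = 111/189 (58.73%), Query Frame = 1"
print(hsp_percentages(line))  # ('41.80', '58.73')
```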

BLAST of Csa4G023020 vs. Swiss-Prot
Match: EF107_ARATH (Ethylene-responsive transcription factor ERF107 OS=Arabidopsis thaliana GN=ERF107 PE=2 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 4.5e-31
Identity = 73/159 (45.91%), Positives = 93/159 (58.49%), Query Frame = 1

Query: 36  LEDFFFDSP--------IPPLPPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETK 95
           +EDF FD          + P  P P++  +      V+DP +       M  ++  + + 
Sbjct: 28  IEDFVFDDTAFVSGLWSLEPFNPVPKLEPSSP----VLDPDSYVQEILQMEAESSSSSST 87

Query: 96  VAVVEASGGKGR---------RHFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDA 155
               E      R         RH+RGVRRRPWGKFAAEIRDP +KGSR+WLGT++SDIDA
Sbjct: 88  TTSPEVETVSNRKKTKRFEETRHYRGVRRRPWGKFAAEIRDPAKKGSRIWLGTFESDIDA 147

Query: 156 AKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGR 178
           A+AYD AAF+LRGRKA+LNFPL+AG+ D P  + RKR R
Sbjct: 148 ARAYDYAAFKLRGRKAVLNFPLDAGKYDAPVNSCRKRRR 182

BLAST of Csa4G023020 vs. Swiss-Prot
Match: EF104_ARATH (Ethylene-responsive transcription factor ERF104 OS=Arabidopsis thaliana GN=ERF104 PE=1 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 1.0e-30
Identity = 75/183 (40.98%), Positives = 104/183 (56.83%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK 60
           M +  E L I+FI+Q LL DF   +TD P L   +L +F  ++       P  I+    K
Sbjct: 1   MATKQEALAIDFISQHLLTDFVSMETDHPSLFTNQLHNFHSETG------PRTITNQSPK 60

Query: 61  PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP 120
           P      STL   +P +   +    ++    +    +  RH+RGVRRRPWGK+AAEIRDP
Sbjct: 61  PN-----STLNQRKPPLPNLSV---SRTVSTKTEKEEEERHYRGVRRRPWGKYAAEIRDP 120

Query: 121 TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK 180
            +KG R+WLGTYD+ ++A +AYD AAF+LRGRKAILNFPL+        + +   G G++
Sbjct: 121 NKKGCRIWLGTYDTAVEAGRAYDQAAFQLRGRKAILNFPLDVRVTSETCSGEGVIGLGKR 169

Query: 181 WRN 184
            R+
Sbjct: 181 KRD 169

BLAST of Csa4G023020 vs. Swiss-Prot
Match: EF102_ARATH (Ethylene-responsive transcription factor 5 OS=Arabidopsis thaliana GN=ERF5 PE=2 SV=1)

HSP 1 Score: 119.8 bits (299), Expect = 3.4e-26
Identity = 62/112 (55.36%), Positives = 75/112 (66.96%), Query Frame = 1

Query: 80  QACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAA 139
           Q     TK  V +    + ++H+RGVR+RPWGKFAAEIRDP ++GSRVWLGT+D+ I+AA
Sbjct: 134 QFAAENTKPEVTKPVSEEEKKHYRGVRQRPWGKFAAEIRDPNKRGSRVWLGTFDTAIEAA 193

Query: 140 KAYDCAAFRLRGRKAILNFPLEAGEPDPPA---AADRKRGRGQKWRNIPKAL 189
           +AYD AAFRLRG KAILNFPLE G+  P A      RKR   +K   + K L
Sbjct: 194 RAYDEAAFRLRGSKAILNFPLEVGKWKPRADEGEKKRKRDDDEKVTVVEKVL 245

BLAST of Csa4G023020 vs. Swiss-Prot
Match: EF103_ARATH (Ethylene-responsive transcription factor 6 OS=Arabidopsis thaliana GN=ERF6 PE=2 SV=2)

HSP 1 Score: 116.7 bits (291), Expect = 2.8e-25
Identity = 62/125 (49.60%), Positives = 75/125 (60.00%), Query Frame = 1

Query: 49  PPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRR 108
           PP   +  N   P K+  P+     R      A G       V     + +RH+RGVR R
Sbjct: 89  PPRVTVQSNRKPPLKIAPPN-----RTKWIQFATGNPKPELPVPVVAAEEKRHYRGVRMR 148

Query: 109 PWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPP 168
           PWGKFAAEIRDPTR+G+RVWLGT+++ I+AA+AYD  AFRLRG KAILNFPLE  + +P 
Sbjct: 149 PWGKFAAEIRDPTRRGTRVWLGTFETAIEAARAYDKEAFRLRGSKAILNFPLEVDKWNPR 208

Query: 169 AAADR 174
           A   R
Sbjct: 209 AEDGR 208

BLAST of Csa4G023020 vs. TrEMBL
Match: A0A0A0KXM4_CUCSA (DNA binding protein OS=Cucumis sativus GN=Csa_4G023020 PE=4 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 1.5e-110
Identity = 194/194 (100.00%), Positives = 194/194 (100.00%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK 60
           MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK
Sbjct: 1   MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK 60

Query: 61  PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP 120
           PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP
Sbjct: 61  PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP 120

Query: 121 TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK 180
           TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK
Sbjct: 121 TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK 180

Query: 181 WRNIPKALMATNEK 195
           WRNIPKALMATNEK
Sbjct: 181 WRNIPKALMATNEK 194

BLAST of Csa4G023020 vs. TrEMBL
Match: A0A0D2P300_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G046100 PE=4 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 1.3e-37
Identity = 97/186 (52.15%), Positives = 116/186 (62.37%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDF--SDHQTDSPFLHPIKLEDFFFDS---PIPPLPPP---- 60
           M S +E  T++FI Q LLGDF  +D    S      +L+++       P  P+  P    
Sbjct: 1   MTSVEESTTLDFIHQHLLGDFPSADAFISSLDFGLSQLQNYQVPELYQPHSPVSDPNCGI 60

Query: 61  PEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWG 120
           PEI   D KPG V   S      P +       E KV V E      RRH+RGVRRRPWG
Sbjct: 61  PEIFSYDVKPGVVELES------PSVFISGSKVEQKVLVCEE-----RRHYRGVRRRPWG 120

Query: 121 KFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAA 178
           KFAAEIRDPTRKG RVWLGT+D+D+DAAKAYDCAAF++RG+KAILNFPLEAG+  PPA  
Sbjct: 121 KFAAEIRDPTRKGRRVWLGTFDTDVDAAKAYDCAAFKMRGQKAILNFPLEAGQATPPATT 175

BLAST of Csa4G023020 vs. TrEMBL
Match: A0A061G087_THECC (Ethylene-responsive element binding factor, putative OS=Theobroma cacao GN=TCM_014850 PE=4 SV=1)

HSP 1 Score: 162.2 bits (409), Expect = 6.6e-37
Identity = 99/190 (52.11%), Positives = 119/190 (62.63%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQ---TDSPF----LHPI-----KLEDFFFDSPIP-P 60
           M + +E  T+EFI Q LLGDF+      T   F    L PI      + +   DSPI  P
Sbjct: 1   MATIEESTTLEFIRQHLLGDFASADAFITSLDFGLSQLQPIIKPENPIPELEHDSPISDP 60

Query: 61  LPPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRR 120
               P+I   D KP +VVD  +     P         E K+++ E      RRH+RGVRR
Sbjct: 61  NNQIPDIFSCDVKP-EVVDLES-----PRSIISVYNPEPKLSLCEE-----RRHYRGVRR 120

Query: 121 RPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDP 178
           RPWGKFAAEIRDP+RKGSRVWLGT++SD+DAAKAYDCAAF++RG KAILNFPLEAGE  P
Sbjct: 121 RPWGKFAAEIRDPSRKGSRVWLGTFESDVDAAKAYDCAAFKMRGHKAILNFPLEAGEAGP 179

BLAST of Csa4G023020 vs. TrEMBL
Match: B9R9Y6_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1501800 PE=4 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 2.5e-36
Identity = 96/190 (50.53%), Positives = 118/190 (62.11%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFS---------DHQTDSPFLHPIKLE--DFFFDSPIPPLP 60
           M +S E   +E I+Q+LLGDF          D     P L P+KLE  +    SP P  P
Sbjct: 1   MATSQESSVLELISQYLLGDFPSADIFFCNLDSTLAHPNLRPVKLESDNCPASSPEPKSP 60

Query: 61  PPPEIS-GNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKG-RRHFRGVRR 120
               I   +D KP +VV+P+  P     ++++    ++      A   +  +RH+RGVRR
Sbjct: 61  VSDLIQYAHDAKP-EVVEPT--PPQPLGLASRPINRQSPPPDSNARDDEEEKRHYRGVRR 120

Query: 121 RPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDP 178
           RPWGKFAAEIRDP RKGSRVWLGT+D D+DAAKAYDCAAFR+RGRKAILNFPLEAG  DP
Sbjct: 121 RPWGKFAAEIRDPNRKGSRVWLGTFDRDVDAAKAYDCAAFRMRGRKAILNFPLEAGLADP 180

BLAST of Csa4G023020 vs. TrEMBL
Match: G0Z813_9ROSA (ERFAP2-like protein OS=Pyrus x bretschneideri PE=2 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 3.3e-36
Identity = 96/195 (49.23%), Positives = 118/195 (60.51%), Query Frame = 1

Query: 9   TIEFITQFLLGDFSDHQTDSPFLH------PIKLE--DFFFDSPIPPLPPP----PEISG 68
           T+E I Q+LLGDF+   TDS   H      P+K E        P  P+  P    P  S 
Sbjct: 23  TLELIRQYLLGDFTF--TDSFISHLNFQFQPVKPEYSSLSESGPSSPISNPNHHSPHFST 82

Query: 69  NDTKPGKVVDPSTLPD-------HRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRP 128
           ++TKP  V  PS   +        R + ++Q  G + ++  V   G    RH+RGVRRRP
Sbjct: 83  SETKPEIVNLPSMEAEPLNLSSPKRKNSTSQNPGPKDELQQVSGPGD-ALRHYRGVRRRP 142

Query: 129 WGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPA 185
           WGKFAAEIRDP RKG+RVWLGT+D+D+DAAKAYDCAAF+LRGRKAILNFPLEAG  +PP 
Sbjct: 143 WGKFAAEIRDPARKGTRVWLGTFDTDVDAAKAYDCAAFKLRGRKAILNFPLEAGVSEPPV 202

BLAST of Csa4G023020 vs. TAIR10
Match: AT5G07580.1 (AT5G07580.1 Integrase-type DNA-binding superfamily protein)

HSP 1 Score: 138.3 bits (347), Expect = 5.1e-33
Identity = 79/189 (41.80%), Positives = 111/189 (58.73%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDF-------SDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPE 60
           M S +E   +E I   LL D         D   D+ F+  +    +  +  +P   P   
Sbjct: 68  MASFEESSDLEAIQSHLLEDLLVCDGFMGDFDFDASFVSGL----WCIEPHVPKQEPDSP 127

Query: 61  ISGNDTKPGKV--VDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGR---RHFRGVRRR 120
           +   D+   +   V+  +     P++++ +   ET  +V +A   +     RH+RGVRRR
Sbjct: 128 VLDPDSFVNEFLQVEGESSSSSSPELNSSSSTYETDQSVKKAERFEEEVDARHYRGVRRR 187

Query: 121 PWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPP 178
           PWGKFAAEIRDP +KGSR+WLGT++SD+DAA+AYDCAAF+LRGRKA+LNFPL+AG+ + P
Sbjct: 188 PWGKFAAEIRDPAKKGSRIWLGTFESDVDAARAYDCAAFKLRGRKAVLNFPLDAGKYEAP 247

BLAST of Csa4G023020 vs. TAIR10
Match: AT5G61590.1 (AT5G61590.1 Integrase-type DNA-binding superfamily protein)

HSP 1 Score: 136.0 bits (341), Expect = 2.6e-32
Identity = 73/159 (45.91%), Positives = 93/159 (58.49%), Query Frame = 1

Query: 36  LEDFFFDSP--------IPPLPPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETK 95
           +EDF FD          + P  P P++  +      V+DP +       M  ++  + + 
Sbjct: 28  IEDFVFDDTAFVSGLWSLEPFNPVPKLEPSSP----VLDPDSYVQEILQMEAESSSSSST 87

Query: 96  VAVVEASGGKGR---------RHFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDA 155
               E      R         RH+RGVRRRPWGKFAAEIRDP +KGSR+WLGT++SDIDA
Sbjct: 88  TTSPEVETVSNRKKTKRFEETRHYRGVRRRPWGKFAAEIRDPAKKGSRIWLGTFESDIDA 147

Query: 156 AKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGR 178
           A+AYD AAF+LRGRKA+LNFPL+AG+ D P  + RKR R
Sbjct: 148 ARAYDYAAFKLRGRKAVLNFPLDAGKYDAPVNSCRKRRR 182

BLAST of Csa4G023020 vs. TAIR10
Match: AT5G61600.1 (AT5G61600.1 ethylene response factor 104)

HSP 1 Score: 134.8 bits (338), Expect = 5.7e-32
Identity = 75/183 (40.98%), Positives = 104/183 (56.83%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK 60
           M +  E L I+FI+Q LL DF   +TD P L   +L +F  ++       P  I+    K
Sbjct: 1   MATKQEALAIDFISQHLLTDFVSMETDHPSLFTNQLHNFHSETG------PRTITNQSPK 60

Query: 61  PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP 120
           P      STL   +P +   +    ++    +    +  RH+RGVRRRPWGK+AAEIRDP
Sbjct: 61  PN-----STLNQRKPPLPNLSV---SRTVSTKTEKEEEERHYRGVRRRPWGKYAAEIRDP 120

Query: 121 TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK 180
            +KG R+WLGTYD+ ++A +AYD AAF+LRGRKAILNFPL+        + +   G G++
Sbjct: 121 NKKGCRIWLGTYDTAVEAGRAYDQAAFQLRGRKAILNFPLDVRVTSETCSGEGVIGLGKR 169

Query: 181 WRN 184
            R+
Sbjct: 181 KRD 169

BLAST of Csa4G023020 vs. TAIR10
Match: AT5G47230.1 (AT5G47230.1 ethylene responsive element binding factor 5)

HSP 1 Score: 119.8 bits (299), Expect = 1.9e-27
Identity = 62/112 (55.36%), Positives = 75/112 (66.96%), Query Frame = 1

Query: 80  QACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAA 139
           Q     TK  V +    + ++H+RGVR+RPWGKFAAEIRDP ++GSRVWLGT+D+ I+AA
Sbjct: 134 QFAAENTKPEVTKPVSEEEKKHYRGVRQRPWGKFAAEIRDPNKRGSRVWLGTFDTAIEAA 193

Query: 140 KAYDCAAFRLRGRKAILNFPLEAGEPDPPA---AADRKRGRGQKWRNIPKAL 189
           +AYD AAFRLRG KAILNFPLE G+  P A      RKR   +K   + K L
Sbjct: 194 RAYDEAAFRLRGSKAILNFPLEVGKWKPRADEGEKKRKRDDDEKVTVVEKVL 245

BLAST of Csa4G023020 vs. TAIR10
Match: AT4G17490.1 (AT4G17490.1 ethylene responsive element binding factor 6)

HSP 1 Score: 116.7 bits (291), Expect = 1.6e-26
Identity = 62/125 (49.60%), Positives = 75/125 (60.00%), Query Frame = 1

Query: 49  PPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRR 108
           PP   +  N   P K+  P+     R      A G       V     + +RH+RGVR R
Sbjct: 89  PPRVTVQSNRKPPLKIAPPN-----RTKWIQFATGNPKPELPVPVVAAEEKRHYRGVRMR 148

Query: 109 PWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPP 168
           PWGKFAAEIRDPTR+G+RVWLGT+++ I+AA+AYD  AFRLRG KAILNFPLE  + +P 
Sbjct: 149 PWGKFAAEIRDPTRRGTRVWLGTFETAIEAARAYDKEAFRLRGSKAILNFPLEVDKWNPR 208

Query: 169 AAADR 174
           A   R
Sbjct: 209 AEDGR 208

BLAST of Csa4G023020 vs. NCBI nr
Match: gi|449458407|ref|XP_004146939.1| (PREDICTED: ethylene-responsive transcription factor ERF106 [Cucumis sativus])

HSP 1 Score: 406.8 bits (1044), Expect = 2.2e-110
Identity = 194/194 (100.00%), Positives = 194/194 (100.00%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK 60
           MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK
Sbjct: 1   MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK 60

Query: 61  PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP 120
           PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP
Sbjct: 61  PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP 120

Query: 121 TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK 180
           TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK
Sbjct: 121 TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK 180

Query: 181 WRNIPKALMATNEK 195
           WRNIPKALMATNEK
Sbjct: 181 WRNIPKALMATNEK 194

BLAST of Csa4G023020 vs. NCBI nr
Match: gi|659107812|ref|XP_008453871.1| (PREDICTED: ethylene-responsive transcription factor ERF106 [Cucumis melo])

HSP 1 Score: 361.3 bits (926), Expect = 1.1e-96
Identity = 185/202 (91.58%), Positives = 186/202 (92.08%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQT----DSPFLHPIKLEDFFFDSPIPPLPPPPEISG 60
           MDSSDEFLTIEFITQFLLGDFSDHQT    DSPFLHPIKLEDFFFDSPIPPLPPPPEIS 
Sbjct: 1   MDSSDEFLTIEFITQFLLGDFSDHQTHFPTDSPFLHPIKLEDFFFDSPIPPLPPPPEISD 60

Query: 61  NDTK-PGKVVDPSTLPDHRPDMSTQACGAE--TKVAVVEASGGK-GRRHFRGVRRRPWGK 120
           NDTK PGKVVD ST PD  PDMSTQACGAE   KV+VVEASGGK GRRHFRGVRRRPWGK
Sbjct: 61  NDTKKPGKVVDQSTTPD--PDMSTQACGAELAAKVSVVEASGGKAGRRHFRGVRRRPWGK 120

Query: 121 FAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAAD 180
           FAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPP AAD
Sbjct: 121 FAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPTAAD 180

Query: 181 RKRGRGQKWRNIPKALMATNEK 195
           RKRGRGQKWRNI KALMATNEK
Sbjct: 181 RKRGRGQKWRNISKALMATNEK 200

BLAST of Csa4G023020 vs. NCBI nr
Match: gi|823184047|ref|XP_012489066.1| (PREDICTED: ethylene-responsive transcription factor ERF107-like [Gossypium raimondii])

HSP 1 Score: 164.5 bits (415), Expect = 1.9e-37
Identity = 97/186 (52.15%), Positives = 116/186 (62.37%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDF--SDHQTDSPFLHPIKLEDFFFDS---PIPPLPPP---- 60
           M S +E  T++FI Q LLGDF  +D    S      +L+++       P  P+  P    
Sbjct: 1   MTSVEESTTLDFIHQHLLGDFPSADAFISSLDFGLSQLQNYQVPELYQPHSPVSDPNCGI 60

Query: 61  PEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWG 120
           PEI   D KPG V   S      P +       E KV V E      RRH+RGVRRRPWG
Sbjct: 61  PEIFSYDVKPGVVELES------PSVFISGSKVEQKVLVCEE-----RRHYRGVRRRPWG 120

Query: 121 KFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAA 178
           KFAAEIRDPTRKG RVWLGT+D+D+DAAKAYDCAAF++RG+KAILNFPLEAG+  PPA  
Sbjct: 121 KFAAEIRDPTRKGRRVWLGTFDTDVDAAKAYDCAAFKMRGQKAILNFPLEAGQATPPATT 175

BLAST of Csa4G023020 vs. NCBI nr
Match: gi|590671234|ref|XP_007038277.1| (Ethylene-responsive element binding factor, putative [Theobroma cacao])

HSP 1 Score: 162.2 bits (409), Expect = 9.4e-37
Identity = 99/190 (52.11%), Positives = 119/190 (62.63%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDFSDHQ---TDSPF----LHPI-----KLEDFFFDSPIP-P 60
           M + +E  T+EFI Q LLGDF+      T   F    L PI      + +   DSPI  P
Sbjct: 1   MATIEESTTLEFIRQHLLGDFASADAFITSLDFGLSQLQPIIKPENPIPELEHDSPISDP 60

Query: 61  LPPPPEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRR 120
               P+I   D KP +VVD  +     P         E K+++ E      RRH+RGVRR
Sbjct: 61  NNQIPDIFSCDVKP-EVVDLES-----PRSIISVYNPEPKLSLCEE-----RRHYRGVRR 120

Query: 121 RPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDP 178
           RPWGKFAAEIRDP+RKGSRVWLGT++SD+DAAKAYDCAAF++RG KAILNFPLEAGE  P
Sbjct: 121 RPWGKFAAEIRDPSRKGSRVWLGTFESDVDAAKAYDCAAFKMRGHKAILNFPLEAGEAGP 179

BLAST of Csa4G023020 vs. NCBI nr
Match: gi|747105657|ref|XP_011101099.1| (PREDICTED: ethylene-responsive transcription factor ERF107-like [Sesamum indicum])

HSP 1 Score: 161.8 bits (408), Expect = 1.2e-36
Identity = 95/206 (46.12%), Positives = 116/206 (56.31%), Query Frame = 1

Query: 1   MDSSDEFLTIEFITQFLLGDF---------------SDHQTDSPFLHPIKLEDFFFDSPI 60
           M++SDE LT+EFI   LL DF               SD   D P   P +LE      P 
Sbjct: 1   METSDESLTLEFIRHHLLEDFTSADSFFDNLTTLCFSDVYGDKPIWSPGELESPSSSGPT 60

Query: 61  PPLPPP--------PEISGNDTKPGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGK 120
           P    P        P++   +TKP   +  +   D      +     + +    E   G 
Sbjct: 61  PLPDSPISQYFAFSPDLFEFETKPQISMSTTDTQDDPSFPESPEAPVKAEPEFTELVTGS 120

Query: 121 G------RRHFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRG 178
           G       RH+RGVRRRPWGK+AAEIRDPTRKGSRVWLGTYD+D+DAA+AYDCAAF++RG
Sbjct: 121 GVTLLAAGRHYRGVRRRPWGKYAAEIRDPTRKGSRVWLGTYDTDVDAARAYDCAAFKMRG 180

The following BLAST results are available for this feature:
Database: Swiss-Prot
Match Name    E-value   Identity (%)   Description
EF106_ARATH   9.1e-32   41.80          Ethylene-responsive transcription factor ERF106 OS=Arabidopsis thaliana GN=ERF10...
EF107_ARATH   4.5e-31   45.91          Ethylene-responsive transcription factor ERF107 OS=Arabidopsis thaliana GN=ERF10...
EF104_ARATH   1.0e-30   40.98          Ethylene-responsive transcription factor ERF104 OS=Arabidopsis thaliana GN=ERF10...
EF102_ARATH   3.4e-26   55.36          Ethylene-responsive transcription factor 5 OS=Arabidopsis thaliana GN=ERF5 PE=2 ...
EF103_ARATH   2.8e-25   49.60          Ethylene-responsive transcription factor 6 OS=Arabidopsis thaliana GN=ERF6 PE=2 ...

Database: TrEMBL
Match Name         E-value    Identity (%)   Description
A0A0A0KXM4_CUCSA   1.5e-110   100.00         DNA binding protein OS=Cucumis sativus GN=Csa_4G023020 PE=4 SV=1
A0A0D2P300_GOSRA   1.3e-37    52.15          Uncharacterized protein OS=Gossypium raimondii GN=B456_007G046100 PE=4 SV=1
A0A061G087_THECC   6.6e-37    52.11          Ethylene-responsive element binding factor, putative OS=Theobroma cacao GN=TCM_0...
B9R9Y6_RICCO       2.5e-36    50.53          DNA binding protein, putative OS=Ricinus communis GN=RCOM_1501800 PE=4 SV=1
G0Z813_9ROSA       3.3e-36    49.23          ERFAP2-like protein OS=Pyrus x bretschneideri PE=2 SV=1

Database: TAIR10
Match Name    E-value   Identity (%)   Description
AT5G07580.1   5.1e-33   41.80          Integrase-type DNA-binding superfamily protein
AT5G61590.1   2.6e-32   45.91          Integrase-type DNA-binding superfamily protein
AT5G61600.1   5.7e-32   40.98          ethylene response factor 104
AT5G47230.1   1.9e-27   55.36          ethylene responsive element binding factor 5
AT4G17490.1   1.6e-26   49.60          ethylene responsive element binding factor 6

Database: NCBI nr
Match Name                         E-value    Identity (%)   Description
gi|449458407|ref|XP_004146939.1|   2.2e-110   100.00         PREDICTED: ethylene-responsive transcription factor ERF106 [Cucumis sativus]
gi|659107812|ref|XP_008453871.1|   1.1e-96    91.58          PREDICTED: ethylene-responsive transcription factor ERF106 [Cucumis melo]
gi|823184047|ref|XP_012489066.1|   1.9e-37    52.15          PREDICTED: ethylene-responsive transcription factor ERF107-like [Gossypium raimo...
gi|590671234|ref|XP_007038277.1|   9.4e-37    52.11          Ethylene-responsive element binding factor, putative [Theobroma cacao]
gi|747105657|ref|XP_011101099.1|   1.2e-36    46.12          PREDICTED: ethylene-responsive transcription factor ERF107-like [Sesamum indicum...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001471   AP2/ERF_dom
IPR016177   DNA-bd_dom_sf

Vocabulary: Molecular Function
Term         Definition
GO:0003700   transcription factor activity, sequence-specific DNA binding
GO:0003677   DNA binding

Vocabulary: Biological Process
Term         Definition
GO:0006355   regulation of transcription, DNA-templated

GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006355       regulation of transcription, DNA-templated
cellular_component   GO:0005634       nucleus
cellular_component   GO:0005667       transcription factor complex
molecular_function   GO:0003677       DNA binding
molecular_function   GO:0003700       transcription factor activity, sequence-specific DNA binding

This gene is associated with the following unigenes:
Unigene Name   Analysis Name                         Sequence type in Unigene
CU129029       cucumber EST collection version 3.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa4G023020.1   Csa4G023020.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
CU129029       CU129029      transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term    IPR Description    Source    Source Term         Source Description                                Alignment
IPR001471   AP2/ERF domain     PRINTS    PR00367             ETHRSPELEMNT                                      coord: 141..161, score: 1.2E-12
                                                                                                               coord: 102..113, score: 1.2
IPR001471   AP2/ERF domain     GENE3D    G3DSA:3.30.730.10   -                                                 coord: 100..160, score: 7.5
IPR001471   AP2/ERF domain     PFAM      PF00847             AP2                                               coord: 101..151, score: 2.0
IPR001471   AP2/ERF domain     SMART     SM00380             rav1_2                                            coord: 101..165, score: 3.0
IPR001471   AP2/ERF domain     PROFILE   PS51032             AP2_ERF                                           coord: 101..159, score: 23
IPR016177   DNA-binding domain unknown   SSF54171            DNA-binding domain                                coord: 101..161, score: 2.09
None        No IPR available   PANTHER   PTHR31677           FAMILY NOT NAMED                                  coord: 99..169, score: 5.4
None        No IPR available   PANTHER   PTHR31677:SF24      ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR ERF106   coord: 99..169, score: 5.4
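
The `coord` values in the InterPro table can be used to cut the matched region out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein listed earlier on this page (the page does not state the convention), and uses the PROFILE PS51032 (AP2_ERF) hit at 101..159 as the example. `subsequence` is an illustrative helper.

```python
# Protein sequence of Csa4G023020 as listed on this page, without the
# trailing '*' stop marker.
PROTEIN = (
    "MDSSDEFLTIEFITQFLLGDFSDHQTDSPFLHPIKLEDFFFDSPIPPLPPPPEISGNDTK"
    "PGKVVDPSTLPDHRPDMSTQACGAETKVAVVEASGGKGRRHFRGVRRRPWGKFAAEIRDP"
    "TRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFPLEAGEPDPPAAADRKRGRGQK"
    "WRNIPKALMATNEK"
)

def subsequence(seq, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# PROFILE PS51032 (AP2_ERF) hit, coord: 101..159
domain = subsequence(PROTEIN, 101, 159)
print(len(domain))  # 59
print(domain)       # HFRGVRRRPWGKFAAEIRDPTRKGSRVWLGTYDSDIDAAKAYDCAAFRLRGRKAILNFP
```

Under that assumption the extracted region covers the RPWGKFAAEIRDP stretch that is conserved across the BLAST hits above.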