CmaCh20G000110 (gene) Cucurbita maxima (Rimu)

Name: CmaCh20G000110
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Ethylene-responsive transcription factor
Location: Cma_Chr20 : 31447 .. 32584 (-)
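
As a quick check on the location given above, the span of the feature can be computed from its start and end positions. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common convention for genome annotation, but not stated on this page); the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates listed for CmaCh20G000110 on Cma_Chr20 (minus strand).
gene_length = feature_length(31447, 32584)
print(gene_length)
```

The strand does not affect the length; it only determines which end of the interval is the 5' end of the gene.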
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTCATGCCGCCATTATAAAACCAATGCCCATCACTTCTTCCTTCATCTCAATTCCATTCGTAAACTTTGCAATGCAGATGGAGGGTTCAGTTGAGAAACAACAGCATAAAGAGGTTTCAGTTTCAGGCAGCAAAATTGGGAAATGCAAGGACAGATCTCGTAGACGCAGCTTCGTTGGTGTCAGACAAAGGCCCTCTGGGAAATGGGTGGCAGAGATCAAAGACACAACCCATGATATCAGAATGTGGTTGGGAACCTTCAAGACAGCGGAAGAGGCTGCCACAGCATACGATGAAGCCGCCCGCCTCCTCCGTGGTGCCAATACTCGTACCAACTTCGCAACCCATCTCAAATCCACCTCCGCCCTCTCTTTCAAAATTAGAAATCTCCTCAACCAAAAGATCAGTTTGAAACGAAGCTCCTCCAAAACCAGCCCCACCAAACTCGGTCCAATCACTGAAATTGAGGGTGCTCATTCTGGGCTCTCCTTCTCCACTGCCACAAACCAAGAGATACATAAGACGTTTGATAATGCAGACTGGAGCAGCGCCTGCAGTGGAGAAGTTGAGGCGCTTGGTCTTCGTCAGTTCAGCCATTCGTGGTGGCATCTTCCTCTTGGGTTGAACTTGGAAATGGATGAGCCTCCTGATCCATTACTGAGCCACCAAGATAGACAAGTGGCAGAGATGTCATGTGATTCAGAGCAATTAGATCTAACATCCATACCTATGTGTGCAGTGAATGGAATGAGTGAATATTTAGGAACTACATATGATGCCTTTGATTCTGTGGGTCAGATATTCTGCCCAAGTTGATTATACCCCGCTTTAATTAATTCCCTTTCTACTGCTATTTGAATGCAGATGATAAGGCAAAAGAAAATCAAACAAGCAGGCAGCACCCAATGCGCTCATACATGTACAACTTTTTCATATTATAAAAACACATGTACAACTTGTGATATTGTCTATCACAATATCAATCCATCCACAACTCAAGAAAAGATAGAATTACAGTCGAAACACTGGCGTACATATTCCTGATTTCATACAAAGAGCCACAAAGATAAGTAATGGAAATTTACTCCAAGACAAAACAGAATCGCTTATGGAAATGAAAACGGCAGGCTTTTGA

mRNA sequence

ATGCCTCATGCCGCCATTATAAAACCAATGCCCATCACTTCTTCCTTCATCTCAATTCCATTCGTAAACTTTGCAATGCAGATGGAGGGTTCAGTTGAGAAACAACAGCATAAAGAGGTTTCAGTTTCAGGCAGCAAAATTGGGAAATGCAAGGACAGATCTCGTAGACGCAGCTTCGTTGGTGTCAGACAAAGGCCCTCTGGGAAATGGGTGGCAGAGATCAAAGACACAACCCATGATATCAGAATGTGGTTGGGAACCTTCAAGACAGCGGAAGAGGCTGCCACAGCATACGATGAAGCCGCCCGCCTCCTCCGTGGTGCCAATACTCGTACCAACTTCGCAACCCATCTCAAATCCACCTCCGCCCTCTCTTTCAAAATTAGAAATCTCCTCAACCAAAAGATCAGTTTGAAACGAAGCTCCTCCAAAACCAGCCCCACCAAACTCGGTCCAATCACTGAAATTGAGGGTGCTCATTCTGGGCTCTCCTTCTCCACTGCCACAAACCAAGAGATACATAAGACGTTTGATAATGCAGACTGGAGCAGCGCCTGCAGTGGAGAAGTTGAGGCGCTTGGTCTTCGTCAGTTCAGCCATTCGTGGTGGCATCTTCCTCTTGGGTTGAACTTGGAAATGGATGAGCCTCCTGATCCATTACTGAGCCACCAAGATAGACAAGTGGCAGAGATGTCATGTGATTCAGAGCAATTAGATCTAACATCCATACCTATGTGTGCAGTGAATGGAATGAGTGAATATTTAGGAACTACATATGATGCCTTTGATTCTGTGGATGATAAGGCAAAAGAAAATCAAACAAGCAGGCAGCACCCAATGCGCTCATACATTCGAAACACTGGCGTACATATTCCTGATTTCATACAAAGAGCCACAAAGATAAGTAATGGAAATTTACTCCAAGACAAAACAGAATCGCTTATGGAAATGAAAACGGCAGGCTTTTGA

Coding sequence (CDS)

ATGCCTCATGCCGCCATTATAAAACCAATGCCCATCACTTCTTCCTTCATCTCAATTCCATTCGTAAACTTTGCAATGCAGATGGAGGGTTCAGTTGAGAAACAACAGCATAAAGAGGTTTCAGTTTCAGGCAGCAAAATTGGGAAATGCAAGGACAGATCTCGTAGACGCAGCTTCGTTGGTGTCAGACAAAGGCCCTCTGGGAAATGGGTGGCAGAGATCAAAGACACAACCCATGATATCAGAATGTGGTTGGGAACCTTCAAGACAGCGGAAGAGGCTGCCACAGCATACGATGAAGCCGCCCGCCTCCTCCGTGGTGCCAATACTCGTACCAACTTCGCAACCCATCTCAAATCCACCTCCGCCCTCTCTTTCAAAATTAGAAATCTCCTCAACCAAAAGATCAGTTTGAAACGAAGCTCCTCCAAAACCAGCCCCACCAAACTCGGTCCAATCACTGAAATTGAGGGTGCTCATTCTGGGCTCTCCTTCTCCACTGCCACAAACCAAGAGATACATAAGACGTTTGATAATGCAGACTGGAGCAGCGCCTGCAGTGGAGAAGTTGAGGCGCTTGGTCTTCGTCAGTTCAGCCATTCGTGGTGGCATCTTCCTCTTGGGTTGAACTTGGAAATGGATGAGCCTCCTGATCCATTACTGAGCCACCAAGATAGACAAGTGGCAGAGATGTCATGTGATTCAGAGCAATTAGATCTAACATCCATACCTATGTGTGCAGTGAATGGAATGAGTGAATATTTAGGAACTACATATGATGCCTTTGATTCTGTGGATGATAAGGCAAAAGAAAATCAAACAAGCAGGCAGCACCCAATGCGCTCATACATTCGAAACACTGGCGTACATATTCCTGATTTCATACAAAGAGCCACAAAGATAAGTAATGGAAATTTACTCCAAGACAAAACAGAATCGCTTATGGAAATGAAAACGGCAGGCTTTTGA

Protein sequence

MPHAAIIKPMPITSSFISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRNLLNQKISLKRSSSKTSPTKLGPITEIEGAHSGLSFSTATNQEIHKTFDNADWSSACSGEVEALGLRQFSHSWWHLPLGLNLEMDEPPDPLLSHQDRQVAEMSCDSEQLDLTSIPMCAVNGMSEYLGTTYDAFDSVDDKAKENQTSRQHPMRSYIRNTGVHIPDFIQRATKISNGNLLQDKTESLMEMKTAGF
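
The relationship between the coding sequence and the protein sequence above can be sketched as a codon-by-codon translation. The snippet below assumes the standard genetic code and builds the codon table from the conventional T, C, A, G ordering; `translate` is a hypothetical helper written for this illustration, not part of any pipeline used to produce this record. Only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGCCTCATGCCGCCATTATAAAACCAATG"
print(translate(cds_start))
```

The result can be compared with the first ten residues of the protein sequence listed above.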
BLAST of CmaCh20G000110 vs. Swiss-Prot
Match: RA211_ARATH (Ethylene-responsive transcription factor RAP2-11 OS=Arabidopsis thaliana GN=RAP2-11 PE=2 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 7.0e-29
Identity = 96/231 (41.56%), Positives = 118/231 (51.08%), Query Frame = 1

Query: 49  KCKDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGA 108
           K K +  +  FVGVRQRPSGKWVAEIKDTT  IRMWLGTF+TAEEAA AYDEAA LLRG+
Sbjct: 12  KEKSKGNKTKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFETAEEAARAYDEAACLLRGS 71

Query: 109 NTRTNFATHLKSTSALSFKIRNLLNQKISLKRSSSK----TSPTKLGPITEIEGAHSGLS 168
           NTRTNFA H  + S LS KIRNLL+QK S+K+   +     S      I  I  A S  +
Sbjct: 72  NTRTNFANHFPNNSQLSLKIRNLLHQKQSMKQQQQQQHKPVSSLTDCNINYISTATSLTT 131

Query: 169 FSTATNQEIHKTFDNADWSSACSGEVEALGLRQFSHSWWHLPLGLNLEMDEPPDPLLSHQ 228
            +T T        +     S+  G+ E  GL Q  +SW   PL        P        
Sbjct: 132 TTTTTTTTAIPLNNVYRPDSSVIGQPETEGL-QLPYSW---PLVSGFNHQIPLAQAGGET 191

Query: 229 DRQVAEMSCDSEQLDLTSI------PMCAVNGMSEY---LGTTYDAFDSVD 267
              + +     + L L  I       + A+NG + Y   +   Y  FD  D
Sbjct: 192 HGHLNDHYSTDQHLGLAEIERQISASLYAMNGANSYYDNMNAEYAIFDPTD 238
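
The percentages on an HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them from the `matches/length` fractions; the helper name `hsp_percentages` and the regular expression are assumptions made for this illustration, and the example line is retyped from the first Swiss-Prot hit above.

```python
import re


def hsp_percentages(line: str) -> list[float]:
    """Recompute every 'matches/length' fraction on an HSP line as a percentage."""
    fractions = re.findall(r"(\d+)/(\d+)", line)
    return [round(100.0 * int(m) / int(n), 2) for m, n in fractions]


# Identity and positives counts from the RA211_ARATH alignment.
line = "Identity = 96/231 (41.56%), Positives = 118/231 (51.08%)"
print(hsp_percentages(line))
```

The first value is the percent identity and the second the percent of positive-scoring positions, both relative to the aligned length of 231 columns.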

BLAST of CmaCh20G000110 vs. Swiss-Prot
Match: ERF71_ARATH (Ethylene-responsive transcription factor ERF071 OS=Arabidopsis thaliana GN=ERF071 PE=2 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 7.5e-15
Identity = 47/103 (45.63%), Positives = 63/103 (61.17%), Query Frame = 1

Query: 12  ITSSFISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQRPSGKWV 71
           I S FI     +   Q+     +++ K VSVS  + GK   R R+  + G+RQRP GKW 
Sbjct: 6   IISDFIWSKSESEPSQLGSVSSRKKRKPVSVSEERDGK---RERKNLYRGIRQRPWGKWA 65

Query: 72  AEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNF 115
           AEI+D +  +R+WLGTFKTA+EAA AYD AA  +RG   + NF
Sbjct: 66  AEIRDPSKGVRVWLGTFKTADEAARAYDVAAIKIRGRKAKLNF 105

BLAST of CmaCh20G000110 vs. Swiss-Prot
Match: RA212_ARATH (Ethylene-responsive transcription factor RAP2-12 OS=Arabidopsis thaliana GN=RAP2-12 PE=1 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 2.9e-14
Identity = 62/189 (32.80%), Positives = 95/189 (50.26%), Query Frame = 1

Query: 5   AIIKPMPITSSFISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQ 64
           A +KP   TS+    P    +   EGSV     K+V+       K  +R R+  + G+RQ
Sbjct: 78  ADVKPFVFTST----PKPAVSAAAEGSVFG---KKVTGLDGDAEKSANRKRKNQYRGIRQ 137

Query: 65  RPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSAL 124
           RP GKW AEI+D     R+WLGTFKTAEEAA AYD AAR +RG+  + NF       ++ 
Sbjct: 138 RPWGKWAAEIRDPREGARIWLGTFKTAEEAARAYDAAARRIRGSKAKVNFPEENMKANSQ 197

Query: 125 SFKIRNLLNQKISLKRSSSKTSPTKLGPITEIEGAHSGLSF-STATNQEIHKTFDNADWS 184
              +      K +L++  +K +P     + +    +S +SF +    +E H+  +N +  
Sbjct: 198 KRSV------KANLQKPVAKPNPNPSPALVQ----NSNISFENMCFMEEKHQVSNNNNNQ 249

Query: 185 SACSGEVEA 193
              +  V+A
Sbjct: 258 FGMTNSVDA 249

BLAST of CmaCh20G000110 vs. Swiss-Prot
Match: EF112_ARATH (Ethylene-responsive transcription factor ERF112 OS=Arabidopsis thaliana GN=ERF112 PE=2 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 4.9e-14
Identity = 39/69 (56.52%), Positives = 48/69 (69.57%), Query Frame = 1

Query: 51  KDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANT 110
           K  SR+R++ GVRQRP GKW AEI+D     R+WLGTF TAEEAA AYD+AA   RG   
Sbjct: 62  KSNSRQRNYRGVRQRPWGKWAAEIRDPNKAARVWLGTFDTAEEAALAYDKAAFEFRGHKA 121

Query: 111 RTNFATHLK 120
           + NF  H++
Sbjct: 122 KLNFPEHIR 130

BLAST of CmaCh20G000110 vs. Swiss-Prot
Match: RAP22_ARATH (Ethylene-responsive transcription factor RAP2-2 OS=Arabidopsis thaliana GN=RAP2-2 PE=1 SV=2)

HSP 1 Score: 77.4 bits (189), Expect = 3.2e-13
Identity = 55/164 (33.54%), Positives = 81/164 (49.39%), Query Frame = 1

Query: 7   IKPMPITSSF--ISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQ 66
           +KP   T++   ++  FV+  + + GS   ++  E +    K  K   R R+  + G+RQ
Sbjct: 77  VKPFVFTATTKPVASAFVSTGIYLVGSAYAKKTVESAEQAEKSSK---RKRKNQYRGIRQ 136

Query: 67  RPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSAL 126
           RP GKW AEI+D     R WLGTF TAEEAA AYD AAR +RG   + NF    K+ S +
Sbjct: 137 RPWGKWAAEIRDPRKGSREWLGTFDTAEEAARAYDAAARRIRGTKAKVNFPEE-KNPSVV 196

Query: 127 SFKIRNLLNQKISLKRSSSKTSPTKLGPITEIEGAHSGLSFSTA 169
           S K  +     +    +    S T +   T +   +   SF  +
Sbjct: 197 SQKRPSAKTNNLQKSVAKPNKSVTLVQQPTHLSQQYCNNSFDNS 236

BLAST of CmaCh20G000110 vs. TrEMBL
Match: A0A0A0L1N6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G268100 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 4.0e-47
Identity = 126/239 (52.72%), Positives = 152/239 (63.60%), Query Frame = 1

Query: 28  MEGSVEKQQHKEVSVSGSKIGKC-KDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLG 87
           ME SVEKQ+H++V+     +GKC K RS +RSFVGVRQRPSGKWVAEIKD THDIRMWLG
Sbjct: 1   MESSVEKQEHRDVAFK--LVGKCIKKRSGKRSFVGVRQRPSGKWVAEIKDATHDIRMWLG 60

Query: 88  TFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRNLLNQKISLKRSSSKTS 147
           TF T EEAA AYDEAA LLRG+N RTNF     S+SALSFKIRNLL  KI+LKRSS    
Sbjct: 61  TFNTPEEAARAYDEAACLLRGSNARTNFTLASNSSSALSFKIRNLLIHKITLKRSS---- 120

Query: 148 PTKLGPITEIEGAHSGLSFSTATNQEIHKTFDNADWSSACSGEVEALGLRQFSHSWWHLP 207
                 ITE    H+ ++ S  TNQE+H   +NA WSS C+GE+E +G   F+HS   LP
Sbjct: 121 ------ITE---THAHVAHS-ETNQEMHMFDNNAVWSS-CNGEIE-VGFCHFTHSCCDLP 180

Query: 208 LGLNLEMDEPPDPLLSHQDRQVAEMSCDSEQLDLTSIPMCAVNGMSEYLGTTY-DAFDS 265
           LG N +MDE                             + A+NG++++LG  Y DAFD+
Sbjct: 181 LGFNFDMDE----------------------------SLSALNGITQHLGNAYDDAFDT 193

BLAST of CmaCh20G000110 vs. TrEMBL
Match: A0A0D2VJP1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G244500 PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 4.3e-33
Identity = 109/242 (45.04%), Positives = 141/242 (58.26%), Query Frame = 1

Query: 49  KCKDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGA 108
           K K + RR  FVGVRQRPSGKWVAEIKDTT  IRMWLGTF+TAEEAA AYDEAA LLRG+
Sbjct: 10  KGKFKERRNKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFETAEEAARAYDEAACLLRGS 69

Query: 109 NTRTNFATHLKSTSALSFKIRNLLNQKISLK--RSSSKTSPTKLGPIT----EIEGAHSG 168
           NTRTNFATH+ + S LS KIRNLLN K SL+  ++++  S T    IT     I G++  
Sbjct: 70  NTRTNFATHVPTDSHLSLKIRNLLNHKKSLRQGKNNNSNSTTNSNKITIKASTIVGSNDS 129

Query: 169 LSFSTATNQEIHKT-----FDNADWSSACSGEVEALGL--RQFSHSWW------HLPLGL 228
           ++ S +     + T     FD A +    SG V  LGL   Q   SW        +PL  
Sbjct: 130 INSSISNAGTDNFTCNSLVFDGA-YRPELSGFVGELGLDPSQLGQSWMIPTGFDQIPLSQ 189

Query: 229 NLEMDEPPDPLLSHQDRQVAEMSCDSEQLDL---TSIPMCAVNGMSEYL-GTTYDAFDSV 268
            LE+ +    L    D+++ E     E+L +    S  + A+NG++EYL    +D  D++
Sbjct: 190 GLELPQEVGVLPQGIDQELMEF----ERLKVERQVSATLYAMNGVNEYLQSAAFDPNDAI 246

BLAST of CmaCh20G000110 vs. TrEMBL
Match: A0A059D8K7_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B03605 PE=4 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 5.6e-33
Identity = 105/248 (42.34%), Positives = 139/248 (56.05%), Query Frame = 1

Query: 52  DRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTR 111
           + + +  FVGVRQRPSGKWVAEIKDTT  IRMWLGTF TAEEAA AYDEAA LLRG NTR
Sbjct: 35  NNTAKSKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFDTAEEAARAYDEAACLLRGFNTR 94

Query: 112 TNFATHLKSTSALSFKIRNLLNQKISLKR-------SSSKTSPTKLGPITEIEGAHSGLS 171
           TNF+ HL ++SALS KIR LLNQKIS +R       S+SK++              S +S
Sbjct: 95  TNFSPHLPTSSALSLKIRTLLNQKISSRRDRPMVSHSNSKSTKKPASQTNSATSISSMIS 154

Query: 172 FSTATN---------------QEIHKTFDNA--DWSSACSGEVEALGLRQFSHSWW---- 231
            S+ TN                +  + FD+A     S C   + +  + Q +HSW     
Sbjct: 155 LSSVTNVYTSSDSDCSFSSGMMQDGRIFDDAYRPDLSRCLASLGSTPMSQHNHSWAFTSG 214

Query: 232 --HLPLGLNLEMDEPP--DPLLSHQDRQVAEMSCDSEQLDLTSIPMCAVNGMSEYLGTTY 268
             HLPL   ++ D  P   P  S  D +++E      +  + S  + AVNG++EYL  TY
Sbjct: 215 LDHLPL---VQEDGLPKHGPCASTTDLELSEFERMKVERQI-SASLYAVNGLNEYLENTY 274

BLAST of CmaCh20G000110 vs. TrEMBL
Match: V7CXD8_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G111800g PE=4 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 2.8e-32
Identity = 107/272 (39.34%), Positives = 151/272 (55.51%), Query Frame = 1

Query: 26  MQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRS---FVGVRQRPSGKWVAEIKDTTHDIR 85
           M+++      QH++  ++ +K GK K RSR  +   FVGVRQRPSG+WVAEIKDTT  IR
Sbjct: 1   MEIQFQQPNLQHQKSGIAITKGGKLKGRSRSNNTNKFVGVRQRPSGRWVAEIKDTTQKIR 60

Query: 86  MWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRNLLNQKISLKR-- 145
           MWLGTF+TAEEAA AYDEAA LLRG+NTRTNF TH+   S L+ +IRNLLN K  +K+  
Sbjct: 61  MWLGTFETAEEAARAYDEAACLLRGSNTRTNFITHVSLDSPLASRIRNLLNNKKGMKKQE 120

Query: 146 ------------SSSKTSPTKLGPITE-IEGAHSG-----LSFSTATNQEIHKTFDNADW 205
                       SS+ T+ T +   T  I  +++G      S ST T Q     FD+A +
Sbjct: 121 DANANNAPAPRVSSTSTASTSISTTTSAISNSNNGNGNNDKSLSTVTPQS-SNLFDDA-Y 180

Query: 206 SSACSGEVEALGLRQFSHSWW-------HLPLGLNLEMDEPPDPLLSHQDRQVAEMSCDS 265
               S   E  G  Q S++ W         P+   L++ +  D L    D +++E     
Sbjct: 181 KPDLSNCREDFGSGQQSNASWGFGAVFDRFPIAQILDVPKIDDCLTDTADLELSEFERMK 240

Query: 266 EQLDLTSIPMCAVNGMSEYLGTTYDAFDSVDD 268
            +  + S  + A+NG+ EY+ T  D+ +++ D
Sbjct: 241 VERQI-SASLYAINGVQEYMETVQDSSETLWD 269

BLAST of CmaCh20G000110 vs. TrEMBL
Match: D7SLK5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g00490 PE=4 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 3.1e-31
Identity = 117/267 (43.82%), Positives = 149/267 (55.81%), Query Frame = 1

Query: 35  QQHKEVSVSGSKIGKCKDR------SRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTF 94
           Q H     S  K  K ++R      S+   FVGVRQRPSGKWVAEIK+TT  IRMWLGTF
Sbjct: 6   QNHPNGVSSPCKQTKVRERTATNKCSKSGKFVGVRQRPSGKWVAEIKNTTQKIRMWLGTF 65

Query: 95  KTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRNLLNQKISLKR------SS 154
            TAEEAA AYDEAA LLRG+NTRTNFATH  + S LS KIRNLL++K SLK+      ++
Sbjct: 66  NTAEEAARAYDEAACLLRGSNTRTNFATHAPTNSPLSLKIRNLLHRKKSLKQNHPPPPAA 125

Query: 155 SKTSPTKLGP-ITEIEGAHSGLSFSTATN---QEIHKTFDNA---DWSSACSGEVEALGL 214
             T PT++    +   G    ++ S A+N   Q+I + FD+A   D S+ C GE+  LG 
Sbjct: 126 LTTIPTEMSTGASSTSGNSHSINASHASNVVKQDI-QMFDDAYRPDLSN-CIGELR-LGS 185

Query: 215 RQFSHSWWHLPLGLNLEMDEPPDPLLSHQD----RQVAEMSCDSEQLDLT---------- 268
            QF  S W  P G         D L S QD     + AE+   +  LD+T          
Sbjct: 186 SQFDPS-WVFPAG--------SDWLQSTQDGFEFPKHAELLSRATDLDMTEFGLMKVERH 245

BLAST of CmaCh20G000110 vs. TAIR10
Match: AT5G19790.1 (AT5G19790.1 related to AP2 11)

HSP 1 Score: 129.4 bits (324), Expect = 3.9e-30
Identity = 96/231 (41.56%), Positives = 118/231 (51.08%), Query Frame = 1

Query: 49  KCKDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGA 108
           K K +  +  FVGVRQRPSGKWVAEIKDTT  IRMWLGTF+TAEEAA AYDEAA LLRG+
Sbjct: 12  KEKSKGNKTKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFETAEEAARAYDEAACLLRGS 71

Query: 109 NTRTNFATHLKSTSALSFKIRNLLNQKISLKRSSSK----TSPTKLGPITEIEGAHSGLS 168
           NTRTNFA H  + S LS KIRNLL+QK S+K+   +     S      I  I  A S  +
Sbjct: 72  NTRTNFANHFPNNSQLSLKIRNLLHQKQSMKQQQQQQHKPVSSLTDCNINYISTATSLTT 131

Query: 169 FSTATNQEIHKTFDNADWSSACSGEVEALGLRQFSHSWWHLPLGLNLEMDEPPDPLLSHQ 228
            +T T        +     S+  G+ E  GL Q  +SW   PL        P        
Sbjct: 132 TTTTTTTTAIPLNNVYRPDSSVIGQPETEGL-QLPYSW---PLVSGFNHQIPLAQAGGET 191

Query: 229 DRQVAEMSCDSEQLDLTSI------PMCAVNGMSEY---LGTTYDAFDSVD 267
              + +     + L L  I       + A+NG + Y   +   Y  FD  D
Sbjct: 192 HGHLNDHYSTDQHLGLAEIERQISASLYAMNGANSYYDNMNAEYAIFDPTD 238

BLAST of CmaCh20G000110 vs. TAIR10
Match: AT2G47520.1 (AT2G47520.1 Integrase-type DNA-binding superfamily protein)

HSP 1 Score: 82.8 bits (203), Expect = 4.2e-16
Identity = 47/103 (45.63%), Positives = 63/103 (61.17%), Query Frame = 1

Query: 12  ITSSFISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQRPSGKWV 71
           I S FI     +   Q+     +++ K VSVS  + GK   R R+  + G+RQRP GKW 
Sbjct: 6   IISDFIWSKSESEPSQLGSVSSRKKRKPVSVSEERDGK---RERKNLYRGIRQRPWGKWA 65

Query: 72  AEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNF 115
           AEI+D +  +R+WLGTFKTA+EAA AYD AA  +RG   + NF
Sbjct: 66  AEIRDPSKGVRVWLGTFKTADEAARAYDVAAIKIRGRKAKLNF 105

BLAST of CmaCh20G000110 vs. TAIR10
Match: AT1G53910.1 (AT1G53910.1 related to AP2 12)

HSP 1 Score: 80.9 bits (198), Expect = 1.6e-15
Identity = 62/189 (32.80%), Positives = 95/189 (50.26%), Query Frame = 1

Query: 5   AIIKPMPITSSFISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQ 64
           A +KP   TS+    P    +   EGSV     K+V+       K  +R R+  + G+RQ
Sbjct: 78  ADVKPFVFTST----PKPAVSAAAEGSVFG---KKVTGLDGDAEKSANRKRKNQYRGIRQ 137

Query: 65  RPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSAL 124
           RP GKW AEI+D     R+WLGTFKTAEEAA AYD AAR +RG+  + NF       ++ 
Sbjct: 138 RPWGKWAAEIRDPREGARIWLGTFKTAEEAARAYDAAARRIRGSKAKVNFPEENMKANSQ 197

Query: 125 SFKIRNLLNQKISLKRSSSKTSPTKLGPITEIEGAHSGLSF-STATNQEIHKTFDNADWS 184
              +      K +L++  +K +P     + +    +S +SF +    +E H+  +N +  
Sbjct: 198 KRSV------KANLQKPVAKPNPNPSPALVQ----NSNISFENMCFMEEKHQVSNNNNNQ 249

Query: 185 SACSGEVEA 193
              +  V+A
Sbjct: 258 FGMTNSVDA 249

BLAST of CmaCh20G000110 vs. TAIR10
Match: AT2G33710.2 (AT2G33710.2 Integrase-type DNA-binding superfamily protein)

HSP 1 Score: 80.1 bits (196), Expect = 2.7e-15
Identity = 39/69 (56.52%), Positives = 48/69 (69.57%), Query Frame = 1

Query: 51  KDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANT 110
           K  SR+R++ GVRQRP GKW AEI+D     R+WLGTF TAEEAA AYD+AA   RG   
Sbjct: 62  KSNSRQRNYRGVRQRPWGKWAAEIRDPNKAARVWLGTFDTAEEAALAYDKAAFEFRGHKA 121

Query: 111 RTNFATHLK 120
           + NF  H++
Sbjct: 122 KLNFPEHIR 130

BLAST of CmaCh20G000110 vs. TAIR10
Match: AT3G14230.1 (AT3G14230.1 related to AP2 2)

HSP 1 Score: 77.4 bits (189), Expect = 1.8e-14
Identity = 55/164 (33.54%), Positives = 81/164 (49.39%), Query Frame = 1

Query: 7   IKPMPITSSF--ISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQ 66
           +KP   T++   ++  FV+  + + GS   ++  E +    K  K   R R+  + G+RQ
Sbjct: 77  VKPFVFTATTKPVASAFVSTGIYLVGSAYAKKTVESAEQAEKSSK---RKRKNQYRGIRQ 136

Query: 67  RPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSAL 126
           RP GKW AEI+D     R WLGTF TAEEAA AYD AAR +RG   + NF    K+ S +
Sbjct: 137 RPWGKWAAEIRDPRKGSREWLGTFDTAEEAARAYDAAARRIRGTKAKVNFPEE-KNPSVV 196

Query: 127 SFKIRNLLNQKISLKRSSSKTSPTKLGPITEIEGAHSGLSFSTA 169
           S K  +     +    +    S T +   T +   +   SF  +
Sbjct: 197 SQKRPSAKTNNLQKSVAKPNKSVTLVQQPTHLSQQYCNNSFDNS 236

BLAST of CmaCh20G000110 vs. NCBI nr
Match: gi|449463881|ref|XP_004149659.1| (PREDICTED: ethylene-responsive transcription factor ERF087 [Cucumis sativus])

HSP 1 Score: 196.8 bits (499), Expect = 5.7e-47
Identity = 126/239 (52.72%), Positives = 152/239 (63.60%), Query Frame = 1

Query: 28  MEGSVEKQQHKEVSVSGSKIGKC-KDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLG 87
           ME SVEKQ+H++V+     +GKC K RS +RSFVGVRQRPSGKWVAEIKD THDIRMWLG
Sbjct: 1   MESSVEKQEHRDVAFK--LVGKCIKKRSGKRSFVGVRQRPSGKWVAEIKDATHDIRMWLG 60

Query: 88  TFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRNLLNQKISLKRSSSKTS 147
           TF T EEAA AYDEAA LLRG+N RTNF     S+SALSFKIRNLL  KI+LKRSS    
Sbjct: 61  TFNTPEEAARAYDEAACLLRGSNARTNFTLASNSSSALSFKIRNLLIHKITLKRSS---- 120

Query: 148 PTKLGPITEIEGAHSGLSFSTATNQEIHKTFDNADWSSACSGEVEALGLRQFSHSWWHLP 207
                 ITE    H+ ++ S  TNQE+H   +NA WSS C+GE+E +G   F+HS   LP
Sbjct: 121 ------ITE---THAHVAHS-ETNQEMHMFDNNAVWSS-CNGEIE-VGFCHFTHSCCDLP 180

Query: 208 LGLNLEMDEPPDPLLSHQDRQVAEMSCDSEQLDLTSIPMCAVNGMSEYLGTTY-DAFDS 265
           LG N +MDE                             + A+NG++++LG  Y DAFD+
Sbjct: 181 LGFNFDMDE----------------------------SLSALNGITQHLGNAYDDAFDT 193

BLAST of CmaCh20G000110 vs. NCBI nr
Match: gi|659098050|ref|XP_008449953.1| (PREDICTED: ethylene-responsive transcription factor RAP2-11 [Cucumis melo])

HSP 1 Score: 196.8 bits (499), Expect = 5.7e-47
Identity = 127/239 (53.14%), Positives = 151/239 (63.18%), Query Frame = 1

Query: 28  MEGSVEKQQHKEVSVSGSKIGKC-KDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLG 87
           MEGSVEKQ+ ++V+     +GKC K RS +RSFVGVRQRPSGKWVAEIKD THDIRMWLG
Sbjct: 1   MEGSVEKQERRDVAFK--LVGKCIKKRSGKRSFVGVRQRPSGKWVAEIKDATHDIRMWLG 60

Query: 88  TFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRNLLNQKISLKRSSSKTS 147
           TF TAEEAA AYDEAA LLRG+N RTNF     S+S LSFKIRNLL  KI+LKRSS    
Sbjct: 61  TFNTAEEAARAYDEAACLLRGSNARTNFTLASNSSSTLSFKIRNLLIHKITLKRSS---- 120

Query: 148 PTKLGPITEIEGAHSGLSFSTATNQEIHKTFDNADWSSACSGEVEALGLRQFSHSWWHLP 207
                 ITE    H+ ++ S  TNQE H  FDN    ++C+GE+E +G   F+HS  HLP
Sbjct: 121 ------ITE---THAHVAHS-ETNQETH-MFDNDTVWNSCNGEIE-VGFCHFTHSCCHLP 180

Query: 208 LGLNLEMDEPPDPLLSHQDRQVAEMSCDSEQLDLTSIPMCAVNGMSEYLGTTY-DAFDS 265
           LG N +MDE                             + A NG++E+LG TY DAFD+
Sbjct: 181 LGFNFDMDE----------------------------SLSAPNGITEHLGNTYDDAFDT 193

BLAST of CmaCh20G000110 vs. NCBI nr
Match: gi|823258168|ref|XP_012461789.1| (PREDICTED: ethylene-responsive transcription factor RAP2-11-like [Gossypium raimondii])

HSP 1 Score: 151.0 bits (380), Expect = 3.6e-33
Identity = 119/280 (42.50%), Positives = 152/280 (54.29%), Query Frame = 1

Query: 11  PITSSFISIPFVNFAMQMEGSVEKQQHKEVSVSGSKIGKCKDRSRRRSFVGVRQRPSGKW 70
           P   SF   P   F   ME  V+ QQ            K K + RR  FVGVRQRPSGKW
Sbjct: 56  PHLFSFHPPPNNLFFFPMEHQVQTQQ------------KGKFKERRNKFVGVRQRPSGKW 115

Query: 71  VAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTRTNFATHLKSTSALSFKIRN 130
           VAEIKDTT  IRMWLGTF+TAEEAA AYDEAA LLRG+NTRTNFATH+ + S LS KIRN
Sbjct: 116 VAEIKDTTQKIRMWLGTFETAEEAARAYDEAACLLRGSNTRTNFATHVPTDSHLSLKIRN 175

Query: 131 LLNQKISLK--RSSSKTSPTKLGPIT----EIEGAHSGLSFSTATNQEIHKT-----FDN 190
           LLN K SL+  ++++  S T    IT     I G++  ++ S +     + T     FD 
Sbjct: 176 LLNHKKSLRQGKNNNSNSTTNSNKITIKASTIVGSNDSINSSISNAGTDNFTCNSLVFDG 235

Query: 191 ADWSSACSGEVEALGL--RQFSHSWW------HLPLGLNLEMDEPPDPLLSHQDRQVAEM 250
           A +    SG V  LGL   Q   SW        +PL   LE+ +    L    D+++ E 
Sbjct: 236 A-YRPELSGFVGELGLDPSQLGQSWMIPTGFDQIPLSQGLELPQEVGVLPQGIDQELMEF 295

Query: 251 SCDSEQLDL---TSIPMCAVNGMSEYL-GTTYDAFDSVDD 268
               E+L +    S  + A+NG++EYL    +D  D++ D
Sbjct: 296 ----ERLKVERQVSATLYAMNGVNEYLQSAAFDPNDAIWD 318

BLAST of CmaCh20G000110 vs. NCBI nr
Match: gi|763816522|gb|KJB83374.1| (hypothetical protein B456_013G244500 [Gossypium raimondii])

HSP 1 Score: 150.2 bits (378), Expect = 6.1e-33
Identity = 109/242 (45.04%), Positives = 141/242 (58.26%), Query Frame = 1

Query: 49  KCKDRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGA 108
           K K + RR  FVGVRQRPSGKWVAEIKDTT  IRMWLGTF+TAEEAA AYDEAA LLRG+
Sbjct: 10  KGKFKERRNKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFETAEEAARAYDEAACLLRGS 69

Query: 109 NTRTNFATHLKSTSALSFKIRNLLNQKISLK--RSSSKTSPTKLGPIT----EIEGAHSG 168
           NTRTNFATH+ + S LS KIRNLLN K SL+  ++++  S T    IT     I G++  
Sbjct: 70  NTRTNFATHVPTDSHLSLKIRNLLNHKKSLRQGKNNNSNSTTNSNKITIKASTIVGSNDS 129

Query: 169 LSFSTATNQEIHKT-----FDNADWSSACSGEVEALGL--RQFSHSWW------HLPLGL 228
           ++ S +     + T     FD A +    SG V  LGL   Q   SW        +PL  
Sbjct: 130 INSSISNAGTDNFTCNSLVFDGA-YRPELSGFVGELGLDPSQLGQSWMIPTGFDQIPLSQ 189

Query: 229 NLEMDEPPDPLLSHQDRQVAEMSCDSEQLDL---TSIPMCAVNGMSEYL-GTTYDAFDSV 268
            LE+ +    L    D+++ E     E+L +    S  + A+NG++EYL    +D  D++
Sbjct: 190 GLELPQEVGVLPQGIDQELMEF----ERLKVERQVSATLYAMNGVNEYLQSAAFDPNDAI 246

BLAST of CmaCh20G000110 vs. NCBI nr
Match: gi|702278702|ref|XP_010044941.1| (PREDICTED: ethylene-responsive transcription factor RAP2-11 [Eucalyptus grandis])

HSP 1 Score: 149.8 bits (377), Expect = 8.0e-33
Identity = 105/248 (42.34%), Positives = 139/248 (56.05%), Query Frame = 1

Query: 52  DRSRRRSFVGVRQRPSGKWVAEIKDTTHDIRMWLGTFKTAEEAATAYDEAARLLRGANTR 111
           + + +  FVGVRQRPSGKWVAEIKDTT  IRMWLGTF TAEEAA AYDEAA LLRG NTR
Sbjct: 35  NNTAKSKFVGVRQRPSGKWVAEIKDTTQKIRMWLGTFDTAEEAARAYDEAACLLRGFNTR 94

Query: 112 TNFATHLKSTSALSFKIRNLLNQKISLKR-------SSSKTSPTKLGPITEIEGAHSGLS 171
           TNF+ HL ++SALS KIR LLNQKIS +R       S+SK++              S +S
Sbjct: 95  TNFSPHLPTSSALSLKIRTLLNQKISSRRDRPMVSHSNSKSTKKPASQTNSATSISSMIS 154

Query: 172 FSTATN---------------QEIHKTFDNA--DWSSACSGEVEALGLRQFSHSWW---- 231
            S+ TN                +  + FD+A     S C   + +  + Q +HSW     
Sbjct: 155 LSSVTNVYTSSDSDCSFSSGMMQDGRIFDDAYRPDLSRCLASLGSTPMSQHNHSWAFTSG 214

Query: 232 --HLPLGLNLEMDEPP--DPLLSHQDRQVAEMSCDSEQLDLTSIPMCAVNGMSEYLGTTY 268
             HLPL   ++ D  P   P  S  D +++E      +  + S  + AVNG++EYL  TY
Sbjct: 215 LDHLPL---VQEDGLPKHGPCASTTDLELSEFERMKVERQI-SASLYAVNGLNEYLENTY 274

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
RA211_ARATH  7.0e-29  41.56         Ethylene-responsive transcription factor RAP2-11 OS=Arabidopsis thaliana GN=RAP2...
ERF71_ARATH  7.5e-15  45.63         Ethylene-responsive transcription factor ERF071 OS=Arabidopsis thaliana GN=ERF07...
RA212_ARATH  2.9e-14  32.80         Ethylene-responsive transcription factor RAP2-12 OS=Arabidopsis thaliana GN=RAP2...
EF112_ARATH  4.9e-14  56.52         Ethylene-responsive transcription factor ERF112 OS=Arabidopsis thaliana GN=ERF11...
RAP22_ARATH  3.2e-13  33.54         Ethylene-responsive transcription factor RAP2-2 OS=Arabidopsis thaliana GN=RAP2-...
Match Name        E-value  Identity (%)  Description
A0A0A0L1N6_CUCSA  4.0e-47  52.72         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G268100 PE=4 SV=1
A0A0D2VJP1_GOSRA  4.3e-33  45.04         Uncharacterized protein OS=Gossypium raimondii GN=B456_013G244500 PE=4 SV=1
A0A059D8K7_EUCGR  5.6e-33  42.34         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B03605 PE=4 SV=1
V7CXD8_PHAVU      2.8e-32  39.34         Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G111800g PE=4 SV=1
D7SLK5_VITVI      3.1e-31  43.82         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g00490 PE=4 SV=...
Match Name   E-value  Identity (%)  Description
AT5G19790.1  3.9e-30  41.56         related to AP2 11
AT2G47520.1  4.2e-16  45.63         Integrase-type DNA-binding superfamily protein
AT1G53910.1  1.6e-15  32.80         related to AP2 12
AT2G33710.2  2.7e-15  56.52         Integrase-type DNA-binding superfamily protein
AT3G14230.1  1.8e-14  33.54         related to AP2 2
Match Name                        E-value  Identity (%)  Description
gi|449463881|ref|XP_004149659.1|  5.7e-47  52.72         PREDICTED: ethylene-responsive transcription factor ERF087 [Cucumis sativus]
gi|659098050|ref|XP_008449953.1|  5.7e-47  53.14         PREDICTED: ethylene-responsive transcription factor RAP2-11 [Cucumis melo]
gi|823258168|ref|XP_012461789.1|  3.6e-33  42.50         PREDICTED: ethylene-responsive transcription factor RAP2-11-like [Gossypium raim...
gi|763816522|gb|KJB83374.1|       6.1e-33  45.04         hypothetical protein B456_013G244500 [Gossypium raimondii]
gi|702278702|ref|XP_010044941.1|  8.0e-33  42.34         PREDICTED: ethylene-responsive transcription factor RAP2-11 [Eucalyptus grandis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001471  AP2/ERF_dom
IPR016177  DNA-bd_dom_sf
IPR017392  AP2/ERF-transcript_factor
Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0003677  DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0048528 post-embryonic root development
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0043565 sequence-specific DNA binding
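
For downstream use, the GO assignment rows above can be grouped by category. This is a minimal sketch that assumes each row is laid out as `category accession term name`, separated by whitespace, as in the listing above; the helper name `group_by_category` is illustrative.

```python
from collections import defaultdict

# GO assignment rows copied from the listing above.
GO_ROWS = """\
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0048528 post-embryonic root development
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0043565 sequence-specific DNA binding
"""


def group_by_category(text: str) -> dict[str, list[tuple[str, str]]]:
    """Map each GO category to its (accession, term name) pairs."""
    groups = defaultdict(list)
    for row in text.strip().splitlines():
        # The term name may contain spaces, so split only twice.
        category, accession, name = row.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


for category, terms in group_by_category(GO_ROWS).items():
    print(category, [accession for accession, _ in terms])
```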

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh20G000110.1  CmaCh20G000110.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                        Source   Source Term        Source Description                              Alignment
IPR001471  AP2/ERF domain                         PRINTS   PR00367            ETHRSPELEMNT                                    coord: 59..70 (score: 3.5E-7); coord: 81..97 (score: 3.)
IPR001471  AP2/ERF domain                         GENE3D   G3DSA:3.30.730.10                                                  coord: 58..115 (score: 8.2)
IPR001471  AP2/ERF domain                         PFAM     PF00847            AP2                                             coord: 59..107 (score: 3.1)
IPR001471  AP2/ERF domain                         SMART    SM00380            rav1_2                                          coord: 58..121 (score: 1.6)
IPR001471  AP2/ERF domain                         PROFILE  PS51032            AP2_ERF                                         coord: 58..115 (score: 19)
IPR016177  DNA-binding domain                     unknown  SSF54171           DNA-binding domain                              coord: 59..116 (score: 1.44)
IPR017392  AP2/ERF transcription factor ERF/PTI6  PANTHER  PTHR31194          SHN SHINE , DNA BINDING / TRANSCRIPTION FACTOR  coord: 56..150 (score: 7.0)
None       No IPR available                       PANTHER  PTHR31194:SF5      SUBFAMILY NOT NAMED                             coord: 56..150 (score: 7.0)
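
The `coord: start..end` entries in the alignment column give the position of each domain match on the protein. The sketch below extracts the two numbers and derives the match length, assuming the coordinates are 1-based, inclusive residue positions (an assumption, as the page does not state the convention); `domain_span` is a hypothetical helper.

```python
import re


def domain_span(alignment: str) -> tuple[int, int, int]:
    """Return (start, end, length) for the first 'coord: start..end' entry."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", alignment)
    if match is None:
        raise ValueError(f"no coordinates found in {alignment!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1


# PFAM PF00847 (AP2) match from the table above.
print(domain_span("coord: 59..107"))
```

The scores are deliberately not parsed here, since several of them appear cut off in the table.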