CmaCh08G002180 (gene) Cucurbita maxima (Rimu)

Name: CmaCh08G002180
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: sequence-specific DNA binding transcription factors
Location: Cma_Chr08 : 1230475 .. 1231377 (-)
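
The location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual GFF-style convention but is not stated on this page; the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length_bp = feature_length(1230475, 1231377)
print(length_bp)           # 903
print(length_bp % 3 == 0)  # True: the span is a whole number of codons (301)
```

Under that assumption the span is 903 bp, a multiple of three, which is what one would expect for a gene whose genomic, mRNA, and coding sequences are listed as identical.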
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCACTCCTCCTCCGGCACCACTCTCCTCTACCAAATCCATCTCCACCGTCGCAACCACTGAGAAAGTCCAGCCGATTCCATGGACGCACCAGGAGACCGTCAATCTTATCCAGGCTTACCAAGAGAAATGGTACGCTCTCGAACGAGGTCAGTTGAAGTCCAGCCAGTGGGAGGAAGTCGCTGTCACTGTCGCCGCCCGCTGCGGCTACAGCCACTTCGATCCCTCGAAAACGTCCGTTCAGTGCCGCCACAAGATGGAGAAGCTCCGCCAGCGTTTTCGCTCCGAGAAACACCGTCTCGCCATCGGCACCCAATCTTCCTCCCGTTGGCTCTACTTCGAACTTATGAACAACCTATTACGCGGTCCGCTACCAATCTCCGCTCGTCCGATGTCTTCGATTCCGTTCGATAACGACGAAGAAGACCAAACCGCTGAGAAATCTGATAACTACAACTCCGATTACGAGGAAGAGGAGAAGAATCACAGTAAATCGAAGAGCATTAGCAACATACTCCGTCGGCCGATTGCTTTTGATGGATCTTCCTCACGAAGGAGAAATAGAAATTCTAGCGAAGACGACGAAGACGACGAAGAAGAAGCCGACATTAGGGTTTCCAGGTTTCCGGAGGAGGATCTGGCAGGAGAGGCGGCGGTAGCTGAGGAAGGGAAAGAGATGTGCTCCAAATTAGCGGCAGAGATTCGATTGTTCGCCGATAGATTGGTCGGAATGGAGAGTTTGAAGATGGAGATGATGAAGGAAGCGGAGATGAACAGAATTGCGATGGAGAACAGGCGGATTGAGATGATTTTAGAATCGGAAAAGAAGATTGTGAATTCAATTGCAAAAGCTTTCGGATCTCCTCCCCCCAAGAGGCTGAAGATTGGCCATGATCCTTGA

mRNA sequence

ATGGCCACTCCTCCTCCGGCACCACTCTCCTCTACCAAATCCATCTCCACCGTCGCAACCACTGAGAAAGTCCAGCCGATTCCATGGACGCACCAGGAGACCGTCAATCTTATCCAGGCTTACCAAGAGAAATGGTACGCTCTCGAACGAGGTCAGTTGAAGTCCAGCCAGTGGGAGGAAGTCGCTGTCACTGTCGCCGCCCGCTGCGGCTACAGCCACTTCGATCCCTCGAAAACGTCCGTTCAGTGCCGCCACAAGATGGAGAAGCTCCGCCAGCGTTTTCGCTCCGAGAAACACCGTCTCGCCATCGGCACCCAATCTTCCTCCCGTTGGCTCTACTTCGAACTTATGAACAACCTATTACGCGGTCCGCTACCAATCTCCGCTCGTCCGATGTCTTCGATTCCGTTCGATAACGACGAAGAAGACCAAACCGCTGAGAAATCTGATAACTACAACTCCGATTACGAGGAAGAGGAGAAGAATCACAGTAAATCGAAGAGCATTAGCAACATACTCCGTCGGCCGATTGCTTTTGATGGATCTTCCTCACGAAGGAGAAATAGAAATTCTAGCGAAGACGACGAAGACGACGAAGAAGAAGCCGACATTAGGGTTTCCAGGTTTCCGGAGGAGGATCTGGCAGGAGAGGCGGCGGTAGCTGAGGAAGGGAAAGAGATGTGCTCCAAATTAGCGGCAGAGATTCGATTGTTCGCCGATAGATTGGTCGGAATGGAGAGTTTGAAGATGGAGATGATGAAGGAAGCGGAGATGAACAGAATTGCGATGGAGAACAGGCGGATTGAGATGATTTTAGAATCGGAAAAGAAGATTGTGAATTCAATTGCAAAAGCTTTCGGATCTCCTCCCCCCAAGAGGCTGAAGATTGGCCATGATCCTTGA

Coding sequence (CDS)

ATGGCCACTCCTCCTCCGGCACCACTCTCCTCTACCAAATCCATCTCCACCGTCGCAACCACTGAGAAAGTCCAGCCGATTCCATGGACGCACCAGGAGACCGTCAATCTTATCCAGGCTTACCAAGAGAAATGGTACGCTCTCGAACGAGGTCAGTTGAAGTCCAGCCAGTGGGAGGAAGTCGCTGTCACTGTCGCCGCCCGCTGCGGCTACAGCCACTTCGATCCCTCGAAAACGTCCGTTCAGTGCCGCCACAAGATGGAGAAGCTCCGCCAGCGTTTTCGCTCCGAGAAACACCGTCTCGCCATCGGCACCCAATCTTCCTCCCGTTGGCTCTACTTCGAACTTATGAACAACCTATTACGCGGTCCGCTACCAATCTCCGCTCGTCCGATGTCTTCGATTCCGTTCGATAACGACGAAGAAGACCAAACCGCTGAGAAATCTGATAACTACAACTCCGATTACGAGGAAGAGGAGAAGAATCACAGTAAATCGAAGAGCATTAGCAACATACTCCGTCGGCCGATTGCTTTTGATGGATCTTCCTCACGAAGGAGAAATAGAAATTCTAGCGAAGACGACGAAGACGACGAAGAAGAAGCCGACATTAGGGTTTCCAGGTTTCCGGAGGAGGATCTGGCAGGAGAGGCGGCGGTAGCTGAGGAAGGGAAAGAGATGTGCTCCAAATTAGCGGCAGAGATTCGATTGTTCGCCGATAGATTGGTCGGAATGGAGAGTTTGAAGATGGAGATGATGAAGGAAGCGGAGATGAACAGAATTGCGATGGAGAACAGGCGGATTGAGATGATTTTAGAATCGGAAAAGAAGATTGTGAATTCAATTGCAAAAGCTTTCGGATCTCCTCCCCCCAAGAGGCTGAAGATTGGCCATGATCCTTGA

Protein sequence

MATPPPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLLRGPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNHSKSKSISNILRRPIAFDGSSSRRRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEAAVAEEGKEMCSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKAFGSPPPKRLKIGHDP
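
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 18 nt of the CDS listed above.
print(translate("ATGGCCACTCCTCCTCCGGCACCA"))  # MATPPPAP
print(translate("ATTGGCCATGATCCTTGA"))        # IGHDP
```

The two fragments reproduce the first eight and last five residues of the protein sequence; passing the full CDS string to `translate` would allow the whole protein to be compared the same way.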
BLAST of CmaCh08G002180 vs. Swiss-Prot
Match: ASIL2_ARATH (Trihelix transcription factor ASIL2 OS=Arabidopsis thaliana GN=ASIL2 PE=2 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 1.5e-09
Identity = 32/93 (34.41%), Positives = 58/93 (62.37%), Query Frame = 1

Query: 29  WTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARCGYSHFDPSKTSVQCRHKME 88
           W+   T  LI A+ E++  L RG LK   W+EVA  V++R  Y      KT +QC+++++
Sbjct: 84  WSEAATAVLIDAWGERYLELSRGNLKQKHWKEVAEIVSSREDYGKIP--KTDIQCKNRID 143

Query: 89  KLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLL 122
            ++++++ EK R+A G    SRW++F+ ++ L+
Sbjct: 144 TVKKKYKQEKVRIANG-GGRSRWVFFDKLDRLI 173
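
The identity and positives percentages in an HSP summary line are simply the matched counts divided by the alignment length. The sketch below parses such a line and recomputes both values; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, not part of BLAST itself.

```python
import re

# Matches the HSP summary line format used in this report.
# "Posi?tives" also tolerates the "Postives" spelling seen in some renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute percent identity and percent positives from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 32/93 (34.41%), Positives = 58/93 (62.37%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])  # 34.41 62.37
```

For the ASIL2_ARATH hit, 32 identical and 58 conservative positions over a 93-column alignment give 34.41% and 62.37%, matching the reported values.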

BLAST of CmaCh08G002180 vs. TrEMBL
Match: A0A0A0K6R6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G341240 PE=4 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 2.2e-116
Identity = 242/318 (76.10%), Positives = 261/318 (82.08%), Query Frame = 1

Query: 4   PPPAP-LSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVA 63
           PPP P LSS+KSIST    +K  PIPWTHQET++LI AYQ+KWY+LERGQLKS+QWEEVA
Sbjct: 6   PPPLPSLSSSKSIST----DKPHPIPWTHQETIHLIHAYQDKWYSLERGQLKSNQWEEVA 65

Query: 64  VTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLLR 123
           VTVAARCGYSHFDPSKTSVQCRHKMEKLRQR RSEKHRL+ GTQSSSRWLYF+LMNNLLR
Sbjct: 66  VTVAARCGYSHFDPSKTSVQCRHKMEKLRQRLRSEKHRLSTGTQSSSRWLYFDLMNNLLR 125

Query: 124 GPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNH-SKSKSISNILRRPIAFDG 183
           GPLPISARPMSSIPFDND++D  AEKSDNYNSDYEEEE+N+ SKSKSISNILRRPI    
Sbjct: 126 GPLPISARPMSSIPFDNDQDDHIAEKSDNYNSDYEEEERNNRSKSKSISNILRRPIV--- 185

Query: 184 SSSRRRNRNSSE--------------------DDEDDEEEADIRVSRFPEEDLAGEAAVA 243
               RR RNSSE                    +DE +EEE DIRVSRF EE    E    
Sbjct: 186 ---ARRTRNSSEEEEEEEEEEDNEDEGEEEDNEDEGEEEERDIRVSRFREEYATAE---E 245

Query: 244 EEGKEMCSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNS 300
           EEGKEMCSKLAAEIRLFADRLVGME+ KM+MMKEAEMNRIAMEN+R+EMILESEKKIVNS
Sbjct: 246 EEGKEMCSKLAAEIRLFADRLVGMENWKMDMMKEAEMNRIAMENKRMEMILESEKKIVNS 305

BLAST of CmaCh08G002180 vs. TrEMBL
Match: B9S517_RICCO (Transcription factor, putative OS=Ricinus communis GN=RCOM_1719870 PE=4 SV=1)

HSP 1 Score: 236.9 bits (603), Expect = 3.2e-59
Identity = 152/310 (49.03%), Positives = 202/310 (65.16%), Query Frame = 1

Query: 1   MATP-----PPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKS 60
           MATP     PP P       S   TT+K  P+PWTHQETV+LIQAYQEKWY+L+RGQLK+
Sbjct: 1   MATPSPSPSPPPPAEPPPYSSKPRTTKKPHPVPWTHQETVHLIQAYQEKWYSLKRGQLKA 60

Query: 61  SQWEEVAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFE 120
           +QWEEVA TVAARCGY +   +KT +QCRHKMEKLR+R+R E+ RL++    +  W YF+
Sbjct: 61  NQWEEVAETVAARCGYEYNHLAKTVIQCRHKMEKLRKRYREERRRLSL--NGTCFWQYFD 120

Query: 121 LMNNLLRGPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNHSKSKSISNILRR 180
           LM++L RGPLPISARP++ IP ++D E+   E+ +    + EEE    S+S SI+ IL++
Sbjct: 121 LMDSLERGPLPISARPLTLIPGNDDNEEDDDEEEE---EEEEEEYGYRSRSLSINYILQK 180

Query: 181 PI---AFDGSSSR-------RRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEAAVAEEGK 240
           P     F GS SR       +R R    ++E+ EEE         EED          GK
Sbjct: 181 PTIVNRFAGSDSRLLPAVMNKRKREEIVEEEEQEEE--------EEED---------SGK 240

Query: 241 EMCSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKA 296
            +  +LA EIR F +R+VGME  KM+MMKE E  R+ MEN+RIEMIL+S++KIV+ I+ A
Sbjct: 241 SVELELAGEIRAFTERIVGMERKKMQMMKETERWRMEMENKRIEMILDSQRKIVDMISTA 288

BLAST of CmaCh08G002180 vs. TrEMBL
Match: W9S464_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003045 PE=4 SV=1)

HSP 1 Score: 230.7 bits (587), Expect = 2.3e-57
Identity = 155/303 (51.16%), Positives = 195/303 (64.36%), Query Frame = 1

Query: 14  SISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARCGYSH 73
           S S   T +K QPIPWTH+ETV+LI+AYQ+KWY+L RGQLKS QWEE+AVTVAARCGY  
Sbjct: 2   SSSPEPTGKKPQPIPWTHEETVHLIEAYQQKWYSLNRGQLKSPQWEEIAVTVAARCGYDF 61

Query: 74  FDPS-KTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLLRGPLPISARPM 133
             PS K+++QCRHKMEKLRQRFRS  HRL      SS W YF+LM+ LLRGP PISARPM
Sbjct: 62  SHPSSKSALQCRHKMEKLRQRFRSHSHRLG----PSSPWPYFDLMDRLLRGPFPISARPM 121

Query: 134 SSIPFDNDEEDQTAEKS-----------DNYNSDYEEEEKNHSKSKSISNILRRPIAFDG 193
                D D+EDQ    +            + N D +++E +++KS+SI+ ILR+P   + 
Sbjct: 122 -----DYDDEDQPCHAAAYEPDLDHRDVHHDNDDDDDDESSYTKSRSINYILRQPTIVNR 181

Query: 194 SSSRRRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEA------AVAEEGKEMCS--KLAA 253
            +             DD      RV  F EE  A  A         E+G+E  S  +LA 
Sbjct: 182 FAVGEEKLGIGGRCADDGVH---RV--FWEEPAAKRARNDGCDVEREKGRESESVLRLAK 241

Query: 254 EIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKAFGSPPPKR 297
           EIR F++R+VGMES+KMEMMKE E  RI ME++RIEMI+ S+ KIV+SIA+AFGS  P R
Sbjct: 242 EIRAFSERIVGMESMKMEMMKETERCRIEMESKRIEMIIRSQHKIVDSIARAFGSSSPNR 290

BLAST of CmaCh08G002180 vs. TrEMBL
Match: F6I024_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g06670 PE=4 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 8.8e-57
Identity = 144/308 (46.75%), Positives = 201/308 (65.26%), Query Frame = 1

Query: 1   MATPPPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEE 60
           MATP P P SS    S     +K QP+PW+HQET +LIQAYQEKWY+L+RGQLK+SQWEE
Sbjct: 1   MATPSPPPPSS----SPPNRPKKAQPLPWSHQETTHLIQAYQEKWYSLKRGQLKASQWEE 60

Query: 61  VAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNL 120
           VAVTVAARC Y   +PSK++ QCRHK+EKLR+R+R+EK R+ IG  +SS W YF LM++L
Sbjct: 61  VAVTVAARCNYD--EPSKSATQCRHKIEKLRKRYRAEKQRIVIGAAASSSWPYFHLMDSL 120

Query: 121 LRGPLPISARPMSSIPFD-------NDEEDQTAEKSDNYNSDYEEEEKN--HSKSKSISN 180
            RGPLPISA+PMS + ++       N + +     +D+Y++   +E+     ++S+SI+ 
Sbjct: 121 ERGPLPISAQPMSVLKYEKAYPYSHNCKPNDDNVGNDDYSNSSNDEDSGALKTRSRSINY 180

Query: 181 ILRRPIAFDGSSSRRRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEAAVAEEGKEMCSKL 240
           IL+RP   +  +   +   S+E +E                     A V E GK +  +L
Sbjct: 181 ILQRPAVVNRFAVDPKLNWSTEQEETG-------------------AVVVEGGKGVVWEL 240

Query: 241 AAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKAFGSPPP 300
           A E+R FA+R V ME++KMEMMKE E  R+ ME+RR+EMI+ES++KIV +I +A GS   
Sbjct: 241 AGEMRAFAERFVRMENMKMEMMKETERRRMEMESRRMEMIVESQRKIVETIGRALGS--N 281

BLAST of CmaCh08G002180 vs. TrEMBL
Match: G7K6Q6_MEDTR (Myb/SANT-like DNA-binding domain protein OS=Medicago truncatula GN=MTR_5g017500 PE=4 SV=1)

HSP 1 Score: 224.6 bits (571), Expect = 1.7e-55
Identity = 137/270 (50.74%), Positives = 171/270 (63.33%), Query Frame = 1

Query: 21  TEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARCGYSHFDPSKTS 80
           T+K QPIPWTHQET+NLI+AYQ+KWY+L+RG L+ SQWEEVAV VAARCGY +  PSKT+
Sbjct: 14  TKKPQPIPWTHQETINLIRAYQDKWYSLKRGPLRGSQWEEVAVVVAARCGYDYNHPSKTA 73

Query: 81  VQCRHKMEKLRQRFRSEKHRL-AIGTQSSSR-WLYFELMNNLLRGPLPISARPMSSIPFD 140
           +QCRHKMEKLRQR RSEK RL A  + +SSR W YF LM++L RGPLPIS RP+S     
Sbjct: 74  LQCRHKMEKLRQRHRSEKRRLTATSSVASSRSWQYFRLMDDLERGPLPISVRPLSHNHPI 133

Query: 141 NDEEDQTAEKSDNYNSDYEEEEKNHSKSKSISNILRRPIAFDGSSSRRRNRNSSEDDEDD 200
           +D+ D  A                 ++S+SI NIL                N  + DE D
Sbjct: 134 SDDSDGAA-----------------ARSRSIHNIL----------------NQKQRDETD 193

Query: 201 EEEADIRVSRFPEEDLAGEAAVAEEGKEMCSKLAAEIRLFADRLVGMESLKMEMMKEAEM 260
           EEE D+                      M   L AE+R FA+R++G+E++KMEMMKE E 
Sbjct: 194 EEEEDV----------------------MAKGLTAELRSFAERIIGLENMKMEMMKETER 228

Query: 261 NRIAMENRRIEMILESEKKIVNSIAKAFGS 289
            R+ MEN+RI MILES+ +IV+SI KAFGS
Sbjct: 254 FRLEMENKRIRMILESQWRIVDSIGKAFGS 228

BLAST of CmaCh08G002180 vs. TAIR10
Match: AT3G24860.1 (AT3G24860.1 Homeodomain-like superfamily protein)

HSP 1 Score: 142.9 bits (359), Expect = 3.2e-34
Identity = 112/294 (38.10%), Positives = 163/294 (55.44%), Query Frame = 1

Query: 2   ATPPPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEV 61
           ++PPP       + ST A   K QP+ WT  ET+ LI++Y+EKW+A+ RG LKS+ WEE+
Sbjct: 38  SSPPPHTTVVALAASTSAVARKTQPVLWTQDETLLLIESYKEKWFAIGRGPLKSTHWEEI 97

Query: 62  AVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNL- 121
           AV  ++R G       +TS QCRHK+EK+R+RFRSE+  +       S W ++  M  L 
Sbjct: 98  AVAASSRSGV-----ERTSTQCRHKIEKMRKRFRSERQSMG----PISIWPFYNQMEELD 157

Query: 122 LRGPLPISARPMSSIP------FDNDEEDQTAEKSDNYNSDYEEEEKNHSKSKSISNILR 181
              P PISARP++ +P      + +DEE+   E ++NY  + EEE++  SKS+SI+ ILR
Sbjct: 158 SSNPAPISARPLTRLPPNSNNRYVDDEEEDEEEDNNNYEEE-EEEDERQSKSRSINYILR 217

Query: 182 RPIAFDGSSSRRRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEAAVAEEGKEMCSKLAAE 241
           RP    G+ +R             +E    R S+    D  G     E  ++    +AAE
Sbjct: 218 RP----GTVNRFAGVGGGLLSWGQKE----RSSKRKRNDGDG----GERRRKGMRAVAAE 277

Query: 242 IRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKAFGS 289
           IR FA+R++ ME  K+E  KE    R  ME RRI +I  S+ +++  I  AF S
Sbjct: 278 IRAFAERVMVMEKKKIEFAKETVRLRKEMEIRRINLIQSSQTQLLQFINNAFDS 309

BLAST of CmaCh08G002180 vs. TAIR10
Match: AT2G44730.1 (AT2G44730.1 Alcohol dehydrogenase transcription factor Myb/SANT-like family protein)

HSP 1 Score: 84.0 bits (206), Expect = 1.8e-16
Identity = 59/161 (36.65%), Positives = 85/161 (52.80%), Query Frame = 1

Query: 2   ATPPPAPLSSTKSISTVATTEKVQPIP---WTHQETVNLIQAYQEKWYALERGQLKSSQW 61
           A+  PA  +  KS S    ++  + +P   W+ +ET+ LI AY++KWYAL RG LK++ W
Sbjct: 34  ASTEPASNTDLKSASIPTASKNSRRLPPPCWSLEETIALIDAYRDKWYALNRGNLKANHW 93

Query: 62  EEVAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSE--KHRLAIGTQSSSRWLYFEL 121
           EEVA  V A C        KT+VQCRHKMEKLR+R+R+E  + R     +  S W++F+ 
Sbjct: 94  EEVAEAVGANC--PDVILKKTAVQCRHKMEKLRKRYRTEIQRARSVPVARFISSWVHFKR 153

Query: 122 MNNLLRGPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYE 158
           M  +   P          I   N+  D       NY + Y+
Sbjct: 154 MEAMENRP---------EIKQGNESGDDDDHDDGNYTARYQ 183

BLAST of CmaCh08G002180 vs. TAIR10
Match: AT3G54390.1 (AT3G54390.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 67.8 bits (164), Expect = 1.3e-11
Identity = 38/129 (29.46%), Positives = 68/129 (52.71%), Query Frame = 1

Query: 10  SSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARC 69
           S  K  ++    ++++   W+      L++AY+ KW    R +LK   WE+VA  V++R 
Sbjct: 19  SLKKPSASSVVVDRLKRDEWSEGAVSTLLEAYESKWVLRNRAKLKGQDWEDVAKHVSSRA 78

Query: 70  GYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLLRGPLPISA 129
             +H    KT  QC++K+E +++R+RSE       T   S W  +  +++LLRG  P   
Sbjct: 79  --THTKSPKTQTQCKNKIESMKKRYRSES-----ATADGSSWPLYPRLDHLLRGTQP-QP 138

Query: 130 RPMSSIPFD 139
           +P + +P +
Sbjct: 139 QPQAVLPLN 139

BLAST of CmaCh08G002180 vs. TAIR10
Match: AT3G14180.1 (AT3G14180.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 65.1 bits (157), Expect = 8.5e-11
Identity = 32/93 (34.41%), Positives = 58/93 (62.37%), Query Frame = 1

Query: 29  WTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARCGYSHFDPSKTSVQCRHKME 88
           W+   T  LI A+ E++  L RG LK   W+EVA  V++R  Y      KT +QC+++++
Sbjct: 84  WSEAATAVLIDAWGERYLELSRGNLKQKHWKEVAEIVSSREDYGKIP--KTDIQCKNRID 143

Query: 89  KLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLL 122
            ++++++ EK R+A G    SRW++F+ ++ L+
Sbjct: 144 TVKKKYKQEKVRIANG-GGRSRWVFFDKLDRLI 173

BLAST of CmaCh08G002180 vs. TAIR10
Match: AT3G58630.1 (AT3G58630.1 sequence-specific DNA binding transcription factors)

HSP 1 Score: 64.3 bits (155), Expect = 1.5e-10
Identity = 35/115 (30.43%), Positives = 63/115 (54.78%), Query Frame = 1

Query: 29  WTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAAR-------CGYSHFDPSKTSV 88
           W+ + T  LIQA+  ++  L RG L+   W+EVA  V  R          +   P +T V
Sbjct: 26  WSEEATFTLIQAWGNRYVDLSRGNLRQKHWQEVANAVNDRHYNTGRNVSAAKSQPYRTDV 85

Query: 89  QCRHKMEKLRQRFRSEKHRLAIGTQSS--SRWLYFELMNNLLRGPLPISARPMSS 135
           QC+++++ L+++++ EK R++     +  S W +F  +++LLR   P S+ P S+
Sbjct: 86  QCKNRIDTLKKKYKVEKARVSESNPGAYISPWPFFSALDDLLRESFPTSSNPDST 140

BLAST of CmaCh08G002180 vs. NCBI nr
Match: gi|449443728|ref|XP_004139629.1| (PREDICTED: trihelix transcription factor ASIL1 [Cucumis sativus])

HSP 1 Score: 426.8 bits (1096), Expect = 3.2e-116
Identity = 242/318 (76.10%), Positives = 261/318 (82.08%), Query Frame = 1

Query: 4   PPPAP-LSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVA 63
           PPP P LSS+KSIST    +K  PIPWTHQET++LI AYQ+KWY+LERGQLKS+QWEEVA
Sbjct: 6   PPPLPSLSSSKSIST----DKPHPIPWTHQETIHLIHAYQDKWYSLERGQLKSNQWEEVA 65

Query: 64  VTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLLR 123
           VTVAARCGYSHFDPSKTSVQCRHKMEKLRQR RSEKHRL+ GTQSSSRWLYF+LMNNLLR
Sbjct: 66  VTVAARCGYSHFDPSKTSVQCRHKMEKLRQRLRSEKHRLSTGTQSSSRWLYFDLMNNLLR 125

Query: 124 GPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNH-SKSKSISNILRRPIAFDG 183
           GPLPISARPMSSIPFDND++D  AEKSDNYNSDYEEEE+N+ SKSKSISNILRRPI    
Sbjct: 126 GPLPISARPMSSIPFDNDQDDHIAEKSDNYNSDYEEEERNNRSKSKSISNILRRPIV--- 185

Query: 184 SSSRRRNRNSSE--------------------DDEDDEEEADIRVSRFPEEDLAGEAAVA 243
               RR RNSSE                    +DE +EEE DIRVSRF EE    E    
Sbjct: 186 ---ARRTRNSSEEEEEEEEEEDNEDEGEEEDNEDEGEEEERDIRVSRFREEYATAE---E 245

Query: 244 EEGKEMCSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNS 300
           EEGKEMCSKLAAEIRLFADRLVGME+ KM+MMKEAEMNRIAMEN+R+EMILESEKKIVNS
Sbjct: 246 EEGKEMCSKLAAEIRLFADRLVGMENWKMDMMKEAEMNRIAMENKRMEMILESEKKIVNS 305

BLAST of CmaCh08G002180 vs. NCBI nr
Match: gi|659116312|ref|XP_008458014.1| (PREDICTED: neurofilament medium polypeptide [Cucumis melo])

HSP 1 Score: 412.9 bits (1060), Expect = 4.7e-112
Identity = 232/314 (73.89%), Positives = 258/314 (82.17%), Query Frame = 1

Query: 1   MATPPPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEE 60
           M++P P+PL+S  S  +++T +K  PIPWTHQET++LI AYQ+KWY+LER QLKS+QWEE
Sbjct: 1   MSSPRPSPLASLSSSKSIST-QKPHPIPWTHQETLHLIYAYQDKWYSLERDQLKSNQWEE 60

Query: 61  VAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNL 120
           VAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQR RSEKHRL+ GTQSSS WLYF+LMNNL
Sbjct: 61  VAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRLRSEKHRLSTGTQSSSHWLYFDLMNNL 120

Query: 121 LRGPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNH-SKSKSISNILRRPIAF 180
           LRGPLPISARPMSSIPFDN ++D   EKSDNYNSDYEEEE+N+ SKSKSISNILRR I  
Sbjct: 121 LRGPLPISARPMSSIPFDNHQDDHITEKSDNYNSDYEEEERNNRSKSKSISNILRRSIV- 180

Query: 181 DGSSSRRRNRNSSE--------------DDEDDEEEADIRVSRFPEEDLAGEAAVAEEGK 240
                 RR RNSSE              +DE +EEE DIRVSRFPEE    E    EEGK
Sbjct: 181 -----ARRTRNSSENEEEEEEEEEEEDNEDEGEEEERDIRVSRFPEEYAGAE---EEEGK 240

Query: 241 EMCSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKA 300
           EMCSKLAAEIRLFADRLVGME+ KM+MMKEAEMNRIA+EN+R+EMILESEKKIVNSIAKA
Sbjct: 241 EMCSKLAAEIRLFADRLVGMENWKMDMMKEAEMNRIAIENKRMEMILESEKKIVNSIAKA 300

BLAST of CmaCh08G002180 vs. NCBI nr
Match: gi|1009149299|ref|XP_015892407.1| (PREDICTED: trihelix transcription factor ASIL1 [Ziziphus jujuba])

HSP 1 Score: 252.3 bits (643), Expect = 1.1e-63
Identity = 153/301 (50.83%), Positives = 204/301 (67.77%), Query Frame = 1

Query: 1   MATPPPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEE 60
           MATPPP+  +S    +   T +K QP+PWTHQETV+LIQAYQEKWY+L+RGQLKSSQWEE
Sbjct: 1   MATPPPSASASPSPPAANVTAKKPQPLPWTHQETVHLIQAYQEKWYSLKRGQLKSSQWEE 60

Query: 61  VAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNL 120
           VAVTVAARCGY + +PSKT++QCRHKMEKLRQR+R+EK +    + +SS W YF+LM++L
Sbjct: 61  VAVTVAARCGYDYSEPSKTAIQCRHKMEKLRQRYRAEKKQRLGLSGASSSWQYFDLMDSL 120

Query: 121 LRGPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNHS-KSKSISNILRRPIA- 180
            RGPLPISARPM ++P  + E    A++ ++ + D E++ +NHS KS+SI+ ILRRP   
Sbjct: 121 ERGPLPISARPMVAVPCSHVE----AQEDEDDDEDEEDDVENHSNKSRSINYILRRPAVV 180

Query: 181 --FDGSSSRRRNRNSSEDDEDD---------EEEADIRVSRFPEEDLAGEAAVAEEGKEM 240
             F G     R         +D          E    +     +ED        + G+E+
Sbjct: 181 NRFAGEPKLSREGGGGGGGGNDGNWGFSGSLREPVAKKRKESVDEDYEEVEVEGQRGREL 240

Query: 241 CSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKAFG 289
             KLA EIR FADR +GME++KME++KE E  R+ ME +R++MIL+S+ KIV+SIAKAF 
Sbjct: 241 VLKLAGEIRGFADRFIGMENMKMEILKETERCRMEMETKRMDMILQSQHKIVDSIAKAFE 297

BLAST of CmaCh08G002180 vs. NCBI nr
Match: gi|255560137|ref|XP_002521086.1| (PREDICTED: trihelix transcription factor ASIL1 [Ricinus communis])

HSP 1 Score: 236.9 bits (603), Expect = 4.6e-59
Identity = 152/310 (49.03%), Positives = 202/310 (65.16%), Query Frame = 1

Query: 1   MATP-----PPAPLSSTKSISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKS 60
           MATP     PP P       S   TT+K  P+PWTHQETV+LIQAYQEKWY+L+RGQLK+
Sbjct: 1   MATPSPSPSPPPPAEPPPYSSKPRTTKKPHPVPWTHQETVHLIQAYQEKWYSLKRGQLKA 60

Query: 61  SQWEEVAVTVAARCGYSHFDPSKTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFE 120
           +QWEEVA TVAARCGY +   +KT +QCRHKMEKLR+R+R E+ RL++    +  W YF+
Sbjct: 61  NQWEEVAETVAARCGYEYNHLAKTVIQCRHKMEKLRKRYREERRRLSL--NGTCFWQYFD 120

Query: 121 LMNNLLRGPLPISARPMSSIPFDNDEEDQTAEKSDNYNSDYEEEEKNHSKSKSISNILRR 180
           LM++L RGPLPISARP++ IP ++D E+   E+ +    + EEE    S+S SI+ IL++
Sbjct: 121 LMDSLERGPLPISARPLTLIPGNDDNEEDDDEEEE---EEEEEEYGYRSRSLSINYILQK 180

Query: 181 PI---AFDGSSSR-------RRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEAAVAEEGK 240
           P     F GS SR       +R R    ++E+ EEE         EED          GK
Sbjct: 181 PTIVNRFAGSDSRLLPAVMNKRKREEIVEEEEQEEE--------EEED---------SGK 240

Query: 241 EMCSKLAAEIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKA 296
            +  +LA EIR F +R+VGME  KM+MMKE E  R+ MEN+RIEMIL+S++KIV+ I+ A
Sbjct: 241 SVELELAGEIRAFTERIVGMERKKMQMMKETERWRMEMENKRIEMILDSQRKIVDMISTA 288

BLAST of CmaCh08G002180 vs. NCBI nr
Match: gi|703151227|ref|XP_010110065.1| (hypothetical protein L484_003045 [Morus notabilis])

HSP 1 Score: 230.7 bits (587), Expect = 3.3e-57
Identity = 155/303 (51.16%), Positives = 195/303 (64.36%), Query Frame = 1

Query: 14  SISTVATTEKVQPIPWTHQETVNLIQAYQEKWYALERGQLKSSQWEEVAVTVAARCGYSH 73
           S S   T +K QPIPWTH+ETV+LI+AYQ+KWY+L RGQLKS QWEE+AVTVAARCGY  
Sbjct: 2   SSSPEPTGKKPQPIPWTHEETVHLIEAYQQKWYSLNRGQLKSPQWEEIAVTVAARCGYDF 61

Query: 74  FDPS-KTSVQCRHKMEKLRQRFRSEKHRLAIGTQSSSRWLYFELMNNLLRGPLPISARPM 133
             PS K+++QCRHKMEKLRQRFRS  HRL      SS W YF+LM+ LLRGP PISARPM
Sbjct: 62  SHPSSKSALQCRHKMEKLRQRFRSHSHRLG----PSSPWPYFDLMDRLLRGPFPISARPM 121

Query: 134 SSIPFDNDEEDQTAEKS-----------DNYNSDYEEEEKNHSKSKSISNILRRPIAFDG 193
                D D+EDQ    +            + N D +++E +++KS+SI+ ILR+P   + 
Sbjct: 122 -----DYDDEDQPCHAAAYEPDLDHRDVHHDNDDDDDDESSYTKSRSINYILRQPTIVNR 181

Query: 194 SSSRRRNRNSSEDDEDDEEEADIRVSRFPEEDLAGEA------AVAEEGKEMCS--KLAA 253
            +             DD      RV  F EE  A  A         E+G+E  S  +LA 
Sbjct: 182 FAVGEEKLGIGGRCADDGVH---RV--FWEEPAAKRARNDGCDVEREKGRESESVLRLAK 241

Query: 254 EIRLFADRLVGMESLKMEMMKEAEMNRIAMENRRIEMILESEKKIVNSIAKAFGSPPPKR 297
           EIR F++R+VGMES+KMEMMKE E  RI ME++RIEMI+ S+ KIV+SIA+AFGS  P R
Sbjct: 242 EIRAFSERIVGMESMKMEMMKETERCRIEMESKRIEMIIRSQHKIVDSIARAFGSSSPNR 290

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
ASIL2_ARATH  1.5e-09  34.41     Trihelix transcription factor ASIL2 OS=Arabidopsis thaliana GN=ASIL2 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0K6R6_CUCSA  2.2e-116  76.10     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G341240 PE=4 SV=1
B9S517_RICCO      3.2e-59   49.03     Transcription factor, putative OS=Ricinus communis GN=RCOM_1719870 PE=4 SV=1
W9S464_9ROSA      2.3e-57   51.16     Uncharacterized protein OS=Morus notabilis GN=L484_003045 PE=4 SV=1
F6I024_VITVI      8.8e-57   46.75     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0005g06670 PE=4 SV=...
G7K6Q6_MEDTR      1.7e-55   50.74     Myb/SANT-like DNA-binding domain protein OS=Medicago truncatula GN=MTR_5g017500 ...

Match Name   E-value  Identity  Description
AT3G24860.1  3.2e-34  38.10     Homeodomain-like superfamily protein
AT2G44730.1  1.8e-16  36.65     Alcohol dehydrogenase transcription factor Myb/SANT-like family prot...
AT3G54390.1  1.3e-11  29.46     sequence-specific DNA binding transcription factors
AT3G14180.1  8.5e-11  34.41     sequence-specific DNA binding transcription factors
AT3G58630.1  1.5e-10  30.43     sequence-specific DNA binding transcription factors

Match Name                        E-value   Identity  Description
gi|449443728|ref|XP_004139629.1|  3.2e-116  76.10     PREDICTED: trihelix transcription factor ASIL1 [Cucumis sativus]
gi|659116312|ref|XP_008458014.1|  4.7e-112  73.89     PREDICTED: neurofilament medium polypeptide [Cucumis melo]
gi|1009149299|ref|XP_015892407.1| 1.1e-63   50.83     PREDICTED: trihelix transcription factor ASIL1 [Ziziphus jujuba]
gi|255560137|ref|XP_002521086.1|  4.6e-59   49.03     PREDICTED: trihelix transcription factor ASIL1 [Ricinus communis]
gi|703151227|ref|XP_010110065.1|  3.3e-57   51.16     hypothetical protein L484_003045 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR006578  MADF-dom
IPR017877  Myb-like_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh08G002180.1  CmaCh08G002180.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description   Source   Source Term     Source Description                          Alignment
IPR006578  MADF domain       SMART    SM00595         118neu2                                     coord: 36..125, score: 2.
IPR017877  Myb-like domain   PROFILE  PS50090         MYB_LIKE                                    coord: 21..91, score: 6
None       No IPR available  PANTHER  PTHR31307       FAMILY NOT NAMED                            coord: 5..299, score: 2.3
None       No IPR available  PANTHER  PTHR31307:SF17  GENOMIC DNA, CHROMOSOME 3, TAC CLONE:K7P8   coord: 5..299, score: 2.3
None       No IPR available  PFAM     PF13837         Myb_DNA-bind_4                              coord: 28..119, score: 4.1