CmaCh05G013200 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh05G013200
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: myb family transcription factor EFM
Location: Cma_Chr05: 9980785 .. 9984538 (-)
RNA-Seq Expression: CmaCh05G013200
Synteny: CmaCh05G013200
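
The Location field above packs the sequence ID, coordinates, and strand into one string. A minimal Python sketch of how it could be parsed follows; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF), which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (GFF-style).
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr05: 9980785 .. 9984538 (-)")
print(seqid, strand, span_length(start, end))  # Cma_Chr05 - 3754
```

Under that assumption the gene spans 3754 bp on the minus strand of Cma_Chr05.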
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTCTCTCTCTTTTGCCTTTATTTGCCTTTGTTCTTTGCAAGAACTTACCCTTCTTCTTCTTTATTCCCTCTTCATTGAACAAAAAACCAGACACCCTCTTTAGTTAAGGTTTGTTTGGACATGAAATCTTGAGCTGTTTTCCTTAAATAATCCGAAGTTAAGAGAGGCAAAGAGTATGGATTGCAAGGCTCAGAGGTACTTTATGTTGTTGAAGTCTTTTGTTTGTGACGATGATCAAGTGGAAGCAGCTTCTTGTGTTGAAGAAGATCATTCACATTCCCAAACACAACTTGAAGATTTGCTGTCTTGTCTTGAAGATGAAAGACTCAAAATTGATGCTTTCAAACGTGAACTTCCTCTTTGCATGCACCTTTTAACCAATGGTAATTACCCTTTTGTTTATTCTATTTTCGAGTTAAGTTTTCGAATACGAGAGATGTTCGTGTAGTGTGGCGGGGACGGAGACGGGGACAGGAAGCCTTCTTCTGTCTCTGTCTTCGTCTTCGCTTCTTATTTCAAATCTTATCTATGTGAAATTCATCTATTTTTTGTGGGGACTGGGTCCGGGATGGGAAGTCTTCCCTGTTTCTGTCCATGTCCCCACTACTTATTTCAATTGTTATCTTTGTGAAATTCTTCATATATTTTCGTGGGAATTGGATGAATTCTTAGAAAGGAGTTACCCACATCTTTTTTTTTTCTTTCATAGAGATAATTTTCATTTTTTAATATAATTTCTATTAAATAATGTTAGATATCCTAATAAATTCAACCACATTTCATAAACAAAAATATAATATAATATAATATAATCATCATCTCTGCTTCGTTTAGATCTCTGCTTCGTTTAGCTAACATGGAGAAATCTTTTCCTATTCTCTGCCTATTTTTCGTTTATTCGGGGATTCTCGATCTAGTTCAAGATGAATTCCGTGAGTTAAATGAACATCTCTAGTTTTGATTCTCTTTGCCTATTTTTCGTTTAAACAGAGATTCTCCATCCAGTTCAAGATGAGTCCCGTGAGTTAAATGAACATCTTTAGTTTTGATTCTCTCTATCTATTTTTCGTTTATACGGGGATTCTCGACCAGTTCAAGATGGATTCCGTGAGTTAAATGAACATTTCTAGTTTTGATTCTCTTGCCTAAATTTCATTTAAACGGAGATTCTCCATCTAGTTCAAGATGAGTCCCGTGAGTTAAATGAACATCCTAATTTTGATTCTCTCTGCCTATTTTTCTTCCAAACGGGGATTTCCCATCCAATTCAAGATGGGTCATGTGAATTAAATGAACATCTCTAATTTTGATTCTCTCTGCCTAGTTTTTTTTGTTGAAACGGAGATTCTCCATTTAGTTCAAGATGTGTCTCGGGAGTTGAAAGAACAATCTCAACTGAATTAGGTTTTTTTTTTTTGTTCTGTTTTATAGCTTTGGAGAGTTGTAGACAAGAGCTACAAGCATGGAGAGTAAAAAGCAAAATCCATCAACAAAATGGGGAAAAACCAGTTCTTGAAGAATTCATTCCACTCAAACAATCAACCTCAGAACCCTCACAAACCCTATCAAACATTTCTGATAGGGCAAACTGGATGACGTCAGCCCAACTTTGGAGCCAAACAACAAATGATGAATCAAAACCAAATCTTTCCTCTTCCAAACAACAACCCGTTGACCATCAAATTGGATTCATCCAAAAGCTTCATACAAACTCCAATAATGGTGGAGCTTTTCTTCCCTTCTCAAAAGATACAATTTTGAGCCATACCCACAATTCTACTCCTCCTCAGCTTCCCGATTTGGCCCTCGCCGCCACCGCCGGTGATAATCGGCTAGAGGTTGATGGAAATAAGTGTGGTAATTCCGATCACCACCAGCCGGAAAATGGTAATATGGTCACCGGGAAAGTGAATTGGGGCGGCGAGGGAGAAGTTATTACAAGTAATAATGCGAATAATTATAACAATACTAACAACTCTACAACTCAGAATAGTCACC
GGAAAGCTAGAAGGTACTGGTCGCCGGATTTGCACCGGCGGTTCGTTAATGCTCTTCAAATGCTTGGTGGTTCACAAGGTGAGTGACAAAGGTTTGAGCTTTTTATTCTTTCAATTTGGATATGAAATCTCTTCTTTTTAGGAAGAAATTAAAGTCCCAATGAATGCCAAGTTCGTGTCTTTTTGAGGAAATTAATGATTTTTTTTATGAATTTTGATTGATTCTGGAAGAAAATGAAAAGGGTTTTTTTTTTTTTTTTCCTTTTATTTTTAAAATATAATTTATTACTTATTTTCACCTATTTAATTGGAGTTTACCTTTTTTTTTTTCTTTTCTAGTTGCCACCCCAAAACAAATCAGGGAGCTGATGAAGGTCGAAGGTTTGACCAATGATGAGGTCAAAAGCCATCTACAGGTTTTTTTCACTCTCTCTCAACATGATAATGTTTTTTTTTTTTTTAATTTTTTGGATAACCCCGCCACTACCCTTATCTTTCTTTCGCTATTCTTGTCTCGGGGTTACTTAGCCGATCTCGAACCAACTCTTAGTAAGCTAGGTCAAAGCTCCTAGTATATGTTGGAGCTTCAAAACCGACTAGATTATTCACTTTTAGCATCTCATCTTTTCTAAATAAGCAAAATTCAAACATTTCGAACAGATGGTCTTGTTTGATAAATTTTTTTTTTTGTCCAAATTCATTTGATAATCAATATGTTTTTATTATTTTTCTTTTATTTTATGAAATTCAGTGCATGTTTAATCACATATTTTAATCGTTATTTTCATTTTTTAAAGTAAATATTTGAATTTTTTGGGCAATTTTTTTTTTTTTAAAAAATAACTTTTAAAAACTTTTTTTTTTAGTTTGAAAACTTCCTTTTAGATTTGTAGAAGAGAATATATAGCTTATAAGCTTAATTTTCTGAAACAAAAAGAAAATAAAACAGTTATCAAACGAGACCTTAACGATCCAGATACAATAATACTTTGTTTTTTTAAATTCGATCTCTTGTCCTTAATTTTGAACTCTAAAAAGAAATAAATCTCTATTAGTTTATCGTAATGGAGAATTTAACTTATTAGAAATTCTAAAATCGAACTATGTATCGGTATTAAATCTCATAGAAACCTTCAAAAAATAAAAAATAAATTAACCTAAAGAATAAAAGAATGATTCTTTTTTTTTTTTTTTTATGCACAGAAATACAGGCTGCACACTAGACGACTCAGTCCAAGTCCACAAGCGGCAGGAGCTCCGGCGCCGCAGCTGGTGGTCTTAGGAGGCATCTGGGTTCCACCGGATTACGCCGCGCATGGCGGAGCACCAGCCCTCTACGGAGCACACCCAACAACCTACTGTGGACCGCCGGTGCACCAAGATTTCTATCCGCCGCAACCGCACCACCAGCTTCACCACCAACACATGCTACACCAGCAGCTACAAATGTACAAGGGCGCCGCCACTGCTCAGCCCCAAATCTCCCCAGATTCAGACGGACGAGGCAACGGCGACCGGTCAGAGAGCATTGAGGACGGCGGAAAGTCGGAGAGCGGCAGCTGGAAGGTGGAGAGCGGCGGAGAGGGAAAGGGGAACTTGACGGCGCTAAGGCAAGAGGGCGGAGACGAGAGCAATGGGAGTGAAATCACACTCAAGTTCTGAGTCTAATTAAGCGATTAATTAAGATGTGATTAGGGGTTTAGACTAATTAATTTTAAGAGTTAGAAATATGAATATATTAGAGAGAGAGAGAGT

mRNA sequence

CTCTCTCTCTCTTTTGCCTTTATTTGCCTTTGTTCTTTGCAAGAACTTACCCTTCTTCTTCTTTATTCCCTCTTCATTGAACAAAAAACCAGACACCCTCTTTAGTTAAGGTTTGTTTGGACATGAAATCTTGAGCTGTTTTCCTTAAATAATCCGAAGTTAAGAGAGGCAAAGAGTATGGATTGCAAGGCTCAGAGGTACTTTATGTTGTTGAAGTCTTTTGTTTGTGACGATGATCAAGTGGAAGCAGCTTCTTGTGTTGAAGAAGATCATTCACATTCCCAAACACAACTTGAAGATTTGCTGTCTTGTCTTGAAGATGAAAGACTCAAAATTGATGCTTTCAAACGTGAACTTCCTCTTTGCATGCACCTTTTAACCAATGCTTTGGAGAGTTGTAGACAAGAGCTACAAGCATGGAGAGTAAAAAGCAAAATCCATCAACAAAATGGGGAAAAACCAGTTCTTGAAGAATTCATTCCACTCAAACAATCAACCTCAGAACCCTCACAAACCCTATCAAACATTTCTGATAGGGCAAACTGGATGACGTCAGCCCAACTTTGGAGCCAAACAACAAATGATGAATCAAAACCAAATCTTTCCTCTTCCAAACAACAACCCGTTGACCATCAAATTGGATTCATCCAAAAGCTTCATACAAACTCCAATAATGGTGGAGCTTTTCTTCCCTTCTCAAAAGATACAATTTTGAGCCATACCCACAATTCTACTCCTCCTCAGCTTCCCGATTTGGCCCTCGCCGCCACCGCCGGTGATAATCGGCTAGAGGTTGATGGAAATAAGTGTGGTAATTCCGATCACCACCAGCCGGAAAATGGTAATATGGTCACCGGGAAAGTGAATTGGGGCGGCGAGGGAGAAGTTATTACAAGTAATAATGCGAATAATTATAACAATACTAACAACTCTACAACTCAGAATAGTCACCGGAAAGCTAGAAGGTACTGGTCGCCGGATTTGCACCGGCGGTTCGTTAATGCTCTTCAAATGCTTGGTGGTTCACAAGTTGCCACCCCAAAACAAATCAGGGAGCTGATGAAGGTCGAAGGTTTGACCAATGATGAGGTCAAAAGCCATCTACAGAAATACAGGCTGCACACTAGACGACTCAGTCCAAGTCCACAAGCGGCAGGAGCTCCGGCGCCGCAGCTGGTGGTCTTAGGAGGCATCTGGGTTCCACCGGATTACGCCGCGCATGGCGGAGCACCAGCCCTCTACGGAGCACACCCAACAACCTACTGTGGACCGCCGGTGCACCAAGATTTCTATCCGCCGCAACCGCACCACCAGCTTCACCACCAACACATGCTACACCAGCAGCTACAAATGTACAAGGGCGCCGCCACTGCTCAGCCCCAAATCTCCCCAGATTCAGACGGACGAGGCAACGGCGACCGGTCAGAGAGCATTGAGGACGGCGGAAAGTCGGAGAGCGGCAGCTGGAAGGTGGAGAGCGGCGGAGAGGGAAAGGGGAACTTGACGGCGCTAAGGCAAGAGGGCGGAGACGAGAGCAATGGGAGTGAAATCACACTCAAGTTCTGAGTCTAATTAAGCGATTAATTAAGATGTGATTAGGGGTTTAGACTAATTAATTTTAAGAGTTAGAAATATGAATATATTAGAGAGAGAGAGAGT

Coding sequence (CDS)

ATGGATTGCAAGGCTCAGAGGTACTTTATGTTGTTGAAGTCTTTTGTTTGTGACGATGATCAAGTGGAAGCAGCTTCTTGTGTTGAAGAAGATCATTCACATTCCCAAACACAACTTGAAGATTTGCTGTCTTGTCTTGAAGATGAAAGACTCAAAATTGATGCTTTCAAACGTGAACTTCCTCTTTGCATGCACCTTTTAACCAATGCTTTGGAGAGTTGTAGACAAGAGCTACAAGCATGGAGAGTAAAAAGCAAAATCCATCAACAAAATGGGGAAAAACCAGTTCTTGAAGAATTCATTCCACTCAAACAATCAACCTCAGAACCCTCACAAACCCTATCAAACATTTCTGATAGGGCAAACTGGATGACGTCAGCCCAACTTTGGAGCCAAACAACAAATGATGAATCAAAACCAAATCTTTCCTCTTCCAAACAACAACCCGTTGACCATCAAATTGGATTCATCCAAAAGCTTCATACAAACTCCAATAATGGTGGAGCTTTTCTTCCCTTCTCAAAAGATACAATTTTGAGCCATACCCACAATTCTACTCCTCCTCAGCTTCCCGATTTGGCCCTCGCCGCCACCGCCGGTGATAATCGGCTAGAGGTTGATGGAAATAAGTGTGGTAATTCCGATCACCACCAGCCGGAAAATGGTAATATGGTCACCGGGAAAGTGAATTGGGGCGGCGAGGGAGAAGTTATTACAAGTAATAATGCGAATAATTATAACAATACTAACAACTCTACAACTCAGAATAGTCACCGGAAAGCTAGAAGGTACTGGTCGCCGGATTTGCACCGGCGGTTCGTTAATGCTCTTCAAATGCTTGGTGGTTCACAAGTTGCCACCCCAAAACAAATCAGGGAGCTGATGAAGGTCGAAGGTTTGACCAATGATGAGGTCAAAAGCCATCTACAGAAATACAGGCTGCACACTAGACGACTCAGTCCAAGTCCACAAGCGGCAGGAGCTCCGGCGCCGCAGCTGGTGGTCTTAGGAGGCATCTGGGTTCCACCGGATTACGCCGCGCATGGCGGAGCACCAGCCCTCTACGGAGCACACCCAACAACCTACTGTGGACCGCCGGTGCACCAAGATTTCTATCCGCCGCAACCGCACCACCAGCTTCACCACCAACACATGCTACACCAGCAGCTACAAATGTACAAGGGCGCCGCCACTGCTCAGCCCCAAATCTCCCCAGATTCAGACGGACGAGGCAACGGCGACCGGTCAGAGAGCATTGAGGACGGCGGAAAGTCGGAGAGCGGCAGCTGGAAGGTGGAGAGCGGCGGAGAGGGAAAGGGGAACTTGACGGCGCTAAGGCAAGAGGGCGGAGACGAGAGCAATGGGAGTGAAATCACACTCAAGTTCTGA

Protein sequence

MDCKAQRYFMLLKSFVCDDDQVEAASCVEEDHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAWRVKSKIHQQNGEKPVLEEFIPLKQSTSEPSQTLSNISDRANWMTSAQLWSQTTNDESKPNLSSSKQQPVDHQIGFIQKLHTNSNNGGAFLPFSKDTILSHTHNSTPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANNYNNTNNSTTQNSHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEVKSHLQKYRLHTRRLSPSPQAAGAPAPQLVVLGGIWVPPDYAAHGGAPALYGAHPTTYCGPPVHQDFYPPQPHHQLHHQHMLHQQLQMYKGAATAQPQISPDSDGRGNGDRSESIEDGGKSESGSWKVESGGEGKGNLTALRQEGGDESNGSEITLKF
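
The protein sequence above should correspond to a translation of the coding sequence. The following Python sketch shows one way to check that relationship, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and is not part of the database's tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGATTGCAAGGCTCAGAGG"))  # MDCKAQR
print(translate("CTCAAGTTCTGA"))           # LKF
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check between the two records.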
Homology
BLAST of CmaCh05G013200 vs. ExPASy Swiss-Prot
Match: Q9ZQ85 (Myb family transcription factor EFM OS=Arabidopsis thaliana OX=3702 GN=EFM PE=1 SV=2)

HSP 1 Score: 312.4 bits (799), Expect = 8.7e-84
Identity = 236/491 (48.07%), Positives = 283/491 (57.64%), Query Frame = 0

Query: 1   MDCKAQRYFMLLKSFVCDDDQVEAASCVEEDHSHSQTQLEDLLSCLEDERLKIDAFKREL 60
           +DCK Q Y MLLKSF  D+ Q +  +           +LEDLLS LE ERLKIDAFKREL
Sbjct: 9   LDCKPQSYSMLLKSF-GDNFQSDPTT----------HKLEDLLSRLEQERLKIDAFKREL 68

Query: 61  PLCMHLLTNALESCRQELQAWRVKSKIHQQN-GEKPVLEEFIPLKQSTSEPSQTLSNISD 120
           PLCM LL NA+E  +Q+L+A+R  S  + Q+ G +PVLEEFIPL+   ++P +T +  S 
Sbjct: 69  PLCMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTRPVLEEFIPLR---NQPEKTNNKGS- 128

Query: 121 RANWMTSAQLWSQTTNDESKP-NLSSSKQQPV-DHQIGFIQKL-HTNS---NNGGAFLPF 180
             NWMT+AQLWSQ+   E+KP N+ S+  Q +   +I    KL H ++   N  GAFLPF
Sbjct: 129 --NWMTTAQLWSQS---ETKPKNIDSTTDQSLPKDEINSSPKLGHFDAKQRNGSGAFLPF 188

Query: 181 SKDTILSHTHNSTPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGG 240
           SK+             LP+LAL+                 ++H   ++GN          
Sbjct: 189 SKE-----------QSLPELALSTEV--------KRVSPTNEHTNGQDGN---------- 248

Query: 241 EGEVITSNNANNYNNTNN---------STTQNSHRKARRYWSPDLHRRFVNALQMLGGSQ 300
             +    NN NNYNN NN         STT  S+RKARR WSPDLHRRFV ALQMLGGSQ
Sbjct: 249 --DESMINNDNNYNNNNNNNSNSNGVSSTTSQSNRKARRCWSPDLHRRFVQALQMLGGSQ 308

Query: 301 VATPKQIRELMKVEGLTNDEVKSHLQKYRLHTRRLSPSPQAAGAPAPQLVVLGGIWVPPD 360
           VATPKQIRELMKV+GLTNDEVKSHLQKYRLHTRR SPSPQ +G P P LVVLGGIWVPP+
Sbjct: 309 VATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPSPSPQTSGGPGPHLVVLGGIWVPPE 368

Query: 361 Y-AAHGGAPALY----GAHPTTYCGPP-----VHQDFY-PPQPHHQLHHQHMLHQQLQMY 420
           Y +AHGG P LY      H T   GPP       Q+FY  P P   LHH H      Q +
Sbjct: 369 YTSAHGGTPTLYHHQVHHHHTNTAGPPPPHFCSSQEFYTTPPPPQPLHHHH-----FQTF 428

Query: 421 KGAATAQPQISPDSDGRGNGDRSESIEDGGKSESGSWKVESGGEGKGNLTALRQEGGDES 463
            G+          S G  + D +        +  G      GGE KG L ALR+E  D S
Sbjct: 429 NGS----------SGGTASTDSTHHQVTDSPTVEGKSPESGGGERKG-LAALREECEDHS 432

BLAST of CmaCh05G013200 vs. ExPASy Swiss-Prot
Match: Q5VRW2 (Transcription factor NIGTH1 OS=Oryza sativa subsp. japonica OX=39947 GN=NHO1 PE=1 SV=2)

HSP 1 Score: 260.0 bits (663), Expect = 5.1e-68
Identity = 193/468 (41.24%), Positives = 245/468 (52.35%), Query Frame = 0

Query: 30  EDHSH----------SQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQ 89
           +DH H          +  +L++ LS LE+ERLKIDAFKRELPLCM LL +A+E+ RQ+L+
Sbjct: 11  DDHHHLTAVAAASGQATQKLQEFLSRLEEERLKIDAFKRELPLCMQLLNHAMEAYRQQLE 70

Query: 90  AWRVKSKIHQQNGEKP----VLEEFIPLKQ--------STSEPSQTLSNISDRANWMTSA 149
           A+++ S+             VLEEFIP+K           +  +   S  S++A+WM SA
Sbjct: 71  AYQMGSQHSAAAAAAARAPLVLEEFIPVKNIGIDVVAADKAAAAGGNSVSSEKASWMVSA 130

Query: 150 QLWSQTTNDESKPNLSSSKQQPVDH--------QIGFIQKLHTNSNNGGAFLPFSKDTIL 209
           QLW+   +  +    +   Q P +H            I  L      GGAFLPFSKD  +
Sbjct: 131 QLWNAPASASAADTAAKGPQTPKEHSEHHPLDTSPKLITALDGGGGGGGAFLPFSKDNAM 190

Query: 210 SHTHNSTPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVIT 269
                +    LP+LALA    +   +      G  D     + N           G V  
Sbjct: 191 GDGSAAAAAALPELALA--PAEKAADAITIAAGEVDKKPYAHDN-----------GVVAR 250

Query: 270 SNNANNYNNTNNSTTQNS--------HRKARRYWSPDLHRRFVNALQMLGGSQVATPKQI 329
           S  A N     ++ +           HRKARR WSP+LHRRFVNALQ+LGG+QVATPKQI
Sbjct: 251 SREAQNGGKPPSTPSDGQAVPPPPQPHRKARRCWSPELHRRFVNALQILGGAQVATPKQI 310

Query: 330 RELMKVEGLTNDEVKSHLQKYRLHTRRLSPSPQAAGAPAPQLVVLGGIWVPPDYAAHGGA 389
           RELMKV+GLTNDEVKSHLQKYRLHTRR  PSP    A  PQLVVLGGIWVPP+YA     
Sbjct: 311 RELMKVDGLTNDEVKSHLQKYRLHTRRPMPSPAPPTAATPQLVVLGGIWVPPEYATQAAG 370

Query: 390 PALYGAHPTT---YCGPPVHQDFYPPQPHHQLHH---QHMLH-------------QQL-Q 427
           PA+YGAHP T   Y      Q++Y    HH  HH     ++H             QQL  
Sbjct: 371 PAIYGAHPATQPHYTAAVAAQEYYHHHHHHLQHHPAAAALVHHRAVAPPPPLPPQQQLAP 430

BLAST of CmaCh05G013200 vs. ExPASy Swiss-Prot
Match: Q6Z869 (Transcription factor NIGT1 OS=Oryza sativa subsp. japonica OX=39947 GN=NIGT1 PE=2 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 6.3e-42
Identity = 160/446 (35.87%), Positives = 210/446 (47.09%), Query Frame = 0

Query: 31  DHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAWRVKSKIHQQ 90
           D   ++ +  + L  LE+ER KI  F+RELPLC  L+T  +E  R ++ A   +  +  Q
Sbjct: 7   DRDGARRRCREYLLALEEERRKIQVFQRELPLCFDLVTQTIEGMRSQMDAAGSEETVSDQ 66

Query: 91  NGEKPVLEEFIPLKQSTS-EPSQTLSNISDRA---------------------------- 150
            G  PVLEEFIPLK S S   S+  S  +D A                            
Sbjct: 67  -GPPPVLEEFIPLKPSLSLSSSEEESTHADAAKSGKKEEAETSERHSSPPPPPPEAKKVT 126

Query: 151 -NWMTSAQLWSQTTNDE-SKPNLSSSKQQPVDHQIGFIQKLHTNSNN-GGAFLPFSKDTI 210
            +W+ S QLWSQ    + S P+ + +K  P        + +  N+   GGAF PF K+  
Sbjct: 127 PDWLQSVQLWSQEEPQQPSSPSPTPTKDLP-------CKPVALNARKAGGAFQPFEKE-- 186

Query: 211 LSHTHNSTPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVI 270
                     +LP  +  A A    +   G+K  + D  +    +M T K          
Sbjct: 187 -------KRAELPASSTTAAASSTVVGDSGDKPTDDDTEK----HMETDK---------- 246

Query: 271 TSNNANNYNNTNNSTTQNSHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVE 330
             +N  +  + +       HRK RR W+P+LHRRF+ ALQ LGGS VATPKQIRELMKV+
Sbjct: 247 --DNDKDAKDKDKEGQSQPHRKPRRCWAPELHRRFLQALQQLGGSHVATPKQIRELMKVD 306

Query: 331 GLTNDEVKSHLQKYRLHTRRLSPSPQAAGA------PAPQLVVLGGIWV-PPDYAAHGGA 390
           GLTNDEVKSHLQKYRLHTRR S + Q++ A      PAPQ VV+G IWV PP+YAA   A
Sbjct: 307 GLTNDEVKSHLQKYRLHTRRPSSTGQSSAAAGVPAPPAPQFVVVGSIWVPPPEYAAAAAA 366

Query: 391 P-----ALYGAHPTTYCGP---PVHQDFYPPQPHHQLHHQHMLHQQLQMYKGAATAQPQI 430
                 A  G + +    P   PV       QPH     QH   QQ Q + G        
Sbjct: 367 QQHVQLAAAGNNASGSANPVYAPVAMLPAGLQPHSH-RKQHQQQQQGQRHSG-------- 407

BLAST of CmaCh05G013200 vs. ExPASy Swiss-Prot
Match: Q8VZS3 (Transcription factor HHO2 OS=Arabidopsis thaliana OX=3702 GN=HHO2 PE=1 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 5.3e-41
Identity = 124/341 (36.36%), Positives = 178/341 (52.20%), Query Frame = 0

Query: 28  VEEDHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAWRVKSKI 87
           VE D++    +  + +  LE+E+ KI  F+RELPLC+ L+T A+E+CR+EL      +  
Sbjct: 3   VEMDYAKKMQKCHEYVEALEEEQKKIQVFQRELPLCLELVTQAIEACRKELSG--TTTTT 62

Query: 88  HQQNGEK-------PVLEEFIPLKQSTS---------------EPSQTLSNISDRANWMT 147
            +Q  E+       PV EEFIP+K+ +S               E S  L N + +++W+ 
Sbjct: 63  SEQCSEQTTSVCGGPVFEEFIPIKKISSLCEEVQEEEEEDGEHESSPELVN-NKKSDWLR 122

Query: 148 SAQLWSQTTNDESKPNLSSSKQQPVDHQIGFIQKLHTNSNNGGAFLPFSKDTILSHTHNS 207
           S QLW+ +      P+L+     P + ++    K+       GAF PF K  + +     
Sbjct: 123 SVQLWNHS------PDLN-----PKEERVAKKAKVVEVKPKSGAFQPFQKRVLETDLQ-- 182

Query: 208 TPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANN 267
                P + +A++            C                    GG+ ++I + +   
Sbjct: 183 -----PAVKVASSMPATTTSSTTETC--------------------GGKSDLIKAGDEER 242

Query: 268 YNNTNNSTTQNSHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEV 327
                 S + ++HRK RR WSP+LHRRF+NALQ LGGS VATPKQIR+ MKV+GLTNDEV
Sbjct: 243 RIEQQQSQS-HTHRKQRRCWSPELHRRFLNALQQLGGSHVATPKQIRDHMKVDGLTNDEV 301

Query: 328 KSHLQKYRLHTRRLSPSPQAAGAPA----PQLVVLGGIWVP 343
           KSHLQKYRLHTRR + +  AA +      PQ VV+GGIWVP
Sbjct: 303 KSHLQKYRLHTRRPAATSVAAQSTGNQQQPQFVVVGGIWVP 301

BLAST of CmaCh05G013200 vs. ExPASy Swiss-Prot
Match: Q9FPE8 (Transcription factor HHO3 OS=Arabidopsis thaliana OX=3702 GN=HHO3 PE=1 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 3.9e-36
Identity = 114/338 (33.73%), Positives = 168/338 (49.70%), Query Frame = 0

Query: 31  DHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAWRVKSKIHQQ 90
           D++    +  + +  LE+E+ KI  F+RELPLC+ L+T A+ESCR+EL           +
Sbjct: 10  DYTQKMKRCHEYVEALEEEQKKIQVFQRELPLCLELVTQAIESCRKELSESSEHVGGQSE 69

Query: 91  NGEK-------PVLEEFIPLKQST-----------SEPSQTLSNISD-----RANWMTSA 150
             E+        V EEF+P+K S+           +E ++ ++N ++     +++W+ S 
Sbjct: 70  CSERTTSECGGAVFEEFMPIKWSSASSDETDKDEEAEKTEMMTNENNDGDKKKSDWLRSV 129

Query: 151 QLWSQTTNDESKPNLSSSKQQPVDHQIGFIQKLHTNSNNGGAFLPFSKDTILSHTHNSTP 210
           QLW+Q+      P+   + ++P+  ++           + GAF PF K+       +S P
Sbjct: 130 QLWNQS------PDPQPNNKKPMVIEV---------KRSAGAFQPFQKEK--PKAADSQP 189

Query: 211 PQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANNYN 270
                   + T   +  E  G   G     Q ++                          
Sbjct: 190 LIKAITPTSTTTTSSTAETVGG--GKEFEEQKQS-------------------------- 249

Query: 271 NTNNSTTQNSHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEVKS 330
                   +S+RK RR WSP+LHRRF++ALQ LGGS VATPKQIR+LMKV+GLTNDEVKS
Sbjct: 250 --------HSNRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRDLMKVDGLTNDEVKS 294

Query: 331 HLQKYRLHTRRLSPSPQAAGAPAP---QLVVLGGIWVP 343
           HLQKYRLHTRR +      G   P   Q +V+ GIWVP
Sbjct: 310 HLQKYRLHTRRPATPVVRTGGENPQQRQFMVMEGIWVP 294

BLAST of CmaCh05G013200 vs. TAIR 10
Match: AT2G03500.1 (Homeodomain-like superfamily protein )

HSP 1 Score: 312.4 bits (799), Expect = 6.2e-85
Identity = 236/491 (48.07%), Positives = 283/491 (57.64%), Query Frame = 0

Query: 1   MDCKAQRYFMLLKSFVCDDDQVEAASCVEEDHSHSQTQLEDLLSCLEDERLKIDAFKREL 60
           +DCK Q Y MLLKSF  D+ Q +  +           +LEDLLS LE ERLKIDAFKREL
Sbjct: 9   LDCKPQSYSMLLKSF-GDNFQSDPTT----------HKLEDLLSRLEQERLKIDAFKREL 68

Query: 61  PLCMHLLTNALESCRQELQAWRVKSKIHQQN-GEKPVLEEFIPLKQSTSEPSQTLSNISD 120
           PLCM LL NA+E  +Q+L+A+R  S  + Q+ G +PVLEEFIPL+   ++P +T +  S 
Sbjct: 69  PLCMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTRPVLEEFIPLR---NQPEKTNNKGS- 128

Query: 121 RANWMTSAQLWSQTTNDESKP-NLSSSKQQPV-DHQIGFIQKL-HTNS---NNGGAFLPF 180
             NWMT+AQLWSQ+   E+KP N+ S+  Q +   +I    KL H ++   N  GAFLPF
Sbjct: 129 --NWMTTAQLWSQS---ETKPKNIDSTTDQSLPKDEINSSPKLGHFDAKQRNGSGAFLPF 188

Query: 181 SKDTILSHTHNSTPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGG 240
           SK+             LP+LAL+                 ++H   ++GN          
Sbjct: 189 SKE-----------QSLPELALSTEV--------KRVSPTNEHTNGQDGN---------- 248

Query: 241 EGEVITSNNANNYNNTNN---------STTQNSHRKARRYWSPDLHRRFVNALQMLGGSQ 300
             +    NN NNYNN NN         STT  S+RKARR WSPDLHRRFV ALQMLGGSQ
Sbjct: 249 --DESMINNDNNYNNNNNNNSNSNGVSSTTSQSNRKARRCWSPDLHRRFVQALQMLGGSQ 308

Query: 301 VATPKQIRELMKVEGLTNDEVKSHLQKYRLHTRRLSPSPQAAGAPAPQLVVLGGIWVPPD 360
           VATPKQIRELMKV+GLTNDEVKSHLQKYRLHTRR SPSPQ +G P P LVVLGGIWVPP+
Sbjct: 309 VATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPSPSPQTSGGPGPHLVVLGGIWVPPE 368

Query: 361 Y-AAHGGAPALY----GAHPTTYCGPP-----VHQDFY-PPQPHHQLHHQHMLHQQLQMY 420
           Y +AHGG P LY      H T   GPP       Q+FY  P P   LHH H      Q +
Sbjct: 369 YTSAHGGTPTLYHHQVHHHHTNTAGPPPPHFCSSQEFYTTPPPPQPLHHHH-----FQTF 428

Query: 421 KGAATAQPQISPDSDGRGNGDRSESIEDGGKSESGSWKVESGGEGKGNLTALRQEGGDES 463
            G+          S G  + D +        +  G      GGE KG L ALR+E  D S
Sbjct: 429 NGS----------SGGTASTDSTHHQVTDSPTVEGKSPESGGGERKG-LAALREECEDHS 432

BLAST of CmaCh05G013200 vs. TAIR 10
Match: AT1G68670.1 (myb-like transcription factor family protein )

HSP 1 Score: 170.2 bits (430), Expect = 3.8e-42
Identity = 124/341 (36.36%), Positives = 178/341 (52.20%), Query Frame = 0

Query: 28  VEEDHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAWRVKSKI 87
           VE D++    +  + +  LE+E+ KI  F+RELPLC+ L+T A+E+CR+EL      +  
Sbjct: 3   VEMDYAKKMQKCHEYVEALEEEQKKIQVFQRELPLCLELVTQAIEACRKELSG--TTTTT 62

Query: 88  HQQNGEK-------PVLEEFIPLKQSTS---------------EPSQTLSNISDRANWMT 147
            +Q  E+       PV EEFIP+K+ +S               E S  L N + +++W+ 
Sbjct: 63  SEQCSEQTTSVCGGPVFEEFIPIKKISSLCEEVQEEEEEDGEHESSPELVN-NKKSDWLR 122

Query: 148 SAQLWSQTTNDESKPNLSSSKQQPVDHQIGFIQKLHTNSNNGGAFLPFSKDTILSHTHNS 207
           S QLW+ +      P+L+     P + ++    K+       GAF PF K  + +     
Sbjct: 123 SVQLWNHS------PDLN-----PKEERVAKKAKVVEVKPKSGAFQPFQKRVLETDLQ-- 182

Query: 208 TPPQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANN 267
                P + +A++            C                    GG+ ++I + +   
Sbjct: 183 -----PAVKVASSMPATTTSSTTETC--------------------GGKSDLIKAGDEER 242

Query: 268 YNNTNNSTTQNSHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEV 327
                 S + ++HRK RR WSP+LHRRF+NALQ LGGS VATPKQIR+ MKV+GLTNDEV
Sbjct: 243 RIEQQQSQS-HTHRKQRRCWSPELHRRFLNALQQLGGSHVATPKQIRDHMKVDGLTNDEV 301

Query: 328 KSHLQKYRLHTRRLSPSPQAAGAPA----PQLVVLGGIWVP 343
           KSHLQKYRLHTRR + +  AA +      PQ VV+GGIWVP
Sbjct: 303 KSHLQKYRLHTRRPAATSVAAQSTGNQQQPQFVVVGGIWVP 301

BLAST of CmaCh05G013200 vs. TAIR 10
Match: AT1G25550.1 (myb-like transcription factor family protein )

HSP 1 Score: 154.1 bits (388), Expect = 2.8e-37
Identity = 114/338 (33.73%), Positives = 168/338 (49.70%), Query Frame = 0

Query: 31  DHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAWRVKSKIHQQ 90
           D++    +  + +  LE+E+ KI  F+RELPLC+ L+T A+ESCR+EL           +
Sbjct: 10  DYTQKMKRCHEYVEALEEEQKKIQVFQRELPLCLELVTQAIESCRKELSESSEHVGGQSE 69

Query: 91  NGEK-------PVLEEFIPLKQST-----------SEPSQTLSNISD-----RANWMTSA 150
             E+        V EEF+P+K S+           +E ++ ++N ++     +++W+ S 
Sbjct: 70  CSERTTSECGGAVFEEFMPIKWSSASSDETDKDEEAEKTEMMTNENNDGDKKKSDWLRSV 129

Query: 151 QLWSQTTNDESKPNLSSSKQQPVDHQIGFIQKLHTNSNNGGAFLPFSKDTILSHTHNSTP 210
           QLW+Q+      P+   + ++P+  ++           + GAF PF K+       +S P
Sbjct: 130 QLWNQS------PDPQPNNKKPMVIEV---------KRSAGAFQPFQKEK--PKAADSQP 189

Query: 211 PQLPDLALAATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANNYN 270
                   + T   +  E  G   G     Q ++                          
Sbjct: 190 LIKAITPTSTTTTSSTAETVGG--GKEFEEQKQS-------------------------- 249

Query: 271 NTNNSTTQNSHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEVKS 330
                   +S+RK RR WSP+LHRRF++ALQ LGGS VATPKQIR+LMKV+GLTNDEVKS
Sbjct: 250 --------HSNRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRDLMKVDGLTNDEVKS 294

Query: 331 HLQKYRLHTRRLSPSPQAAGAPAP---QLVVLGGIWVP 343
           HLQKYRLHTRR +      G   P   Q +V+ GIWVP
Sbjct: 310 HLQKYRLHTRRPATPVVRTGGENPQQRQFMVMEGIWVP 294

BLAST of CmaCh05G013200 vs. TAIR 10
Match: AT4G37180.1 (Homeodomain-like superfamily protein )

HSP 1 Score: 137.5 bits (345), Expect = 2.7e-32
Identity = 103/313 (32.91%), Positives = 155/313 (49.52%), Query Frame = 0

Query: 22  VEAASCVEEDHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAW 81
           ++  S ++++HS   ++++  +  LE+ER KID FKRELPLCM LL  A+ + + E    
Sbjct: 29  LDEVSRIKDNHS-KLSEIDGYVGKLEEERNKIDVFKRELPLCMLLLNEAIGALKDEA--- 88

Query: 82  RVKSKIHQQNGEKPVLEEFIPLKQSTSEPSQTLSNISDRANWMTSAQLWSQTTNDESKPN 141
           R    +   NG+   +E   P               +D+ +WM+SAQLW    N + +  
Sbjct: 89  RKGLSLMASNGKFDDVERAKP--------------ETDKKSWMSSAQLWISNPNSQFR-- 148

Query: 142 LSSSKQQPVDHQIGFIQKLHTN-SNNGGAFLPFSKDTILSHTHNSTPPQLPDLALAATAG 201
             S+ ++  D  +        N  N GG F+PF          N  PP  P   L+    
Sbjct: 149 --STNEEEEDRCVSQNPFQTCNYPNQGGVFMPF----------NRPPPPPPPAPLSLMTP 208

Query: 202 DNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANNYNNTNNSTTQNSHRK 261
            + + +D ++   S HH                          + +N  ++ +     ++
Sbjct: 209 TSEMMMDYSRIEQSHHH--------------------------HQFNKPSSQSHHIQKKE 268

Query: 262 ARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEVKSHLQKYRLHTRR-- 321
            RR WS +LHR+FV+AL  LGG QVATPKQIR+LMKV+GLTNDEVKSHLQKYR+H R+  
Sbjct: 269 QRRRWSQELHRKFVDALHRLGGPQVATPKQIRDLMKVDGLTNDEVKSHLQKYRMHIRKHP 283

Query: 322 LSPSPQAAGAPAP 332
           L P+   + +  P
Sbjct: 329 LHPTKTLSSSDQP 283

BLAST of CmaCh05G013200 vs. TAIR 10
Match: AT4G37180.2 (Homeodomain-like superfamily protein )

HSP 1 Score: 131.7 bits (330), Expect = 1.5e-30
Identity = 101/317 (31.86%), Positives = 156/317 (49.21%), Query Frame = 0

Query: 22  VEAASCVEEDHSHSQTQLEDLLSCLEDERLKIDAFKRELPLCMHLLTNALESCRQELQAW 81
           ++  S ++++HS   ++++  +  LE+ER KID FKRELPLCM LL   +      + A 
Sbjct: 29  LDEVSRIKDNHS-KLSEIDGYVGKLEEERNKIDVFKRELPLCMLLLNEEIVFLCVAIGAL 88

Query: 82  RVKSK----IHQQNGEKPVLEEFIPLKQSTSEPSQTLSNISDRANWMTSAQLWSQTTNDE 141
           + +++    +   NG+   +E   P               +D+ +WM+SAQLW    N +
Sbjct: 89  KDEARKGLSLMASNGKFDDVERAKP--------------ETDKKSWMSSAQLWISNPNSQ 148

Query: 142 SKPNLSSSKQQPVDHQIGFIQKLHTN-SNNGGAFLPFSKDTILSHTHNSTPPQLPDLALA 201
            +    S+ ++  D  +        N  N GG F+PF          N  PP  P   L+
Sbjct: 149 FR----STNEEEEDRCVSQNPFQTCNYPNQGGVFMPF----------NRPPPPPPPAPLS 208

Query: 202 ATAGDNRLEVDGNKCGNSDHHQPENGNMVTGKVNWGGEGEVITSNNANNYNNTNNSTTQN 261
                + + +D ++   S HH                          + +N  ++ +   
Sbjct: 209 LMTPTSEMMMDYSRIEQSHHH--------------------------HQFNKPSSQSHHI 268

Query: 262 SHRKARRYWSPDLHRRFVNALQMLGGSQVATPKQIRELMKVEGLTNDEVKSHLQKYRLHT 321
             ++ RR WS +LHR+FV+AL  LGG QVATPKQIR+LMKV+GLTNDEVKSHLQKYR+H 
Sbjct: 269 QKKEQRRRWSQELHRKFVDALHRLGGPQVATPKQIRDLMKVDGLTNDEVKSHLQKYRMHI 290

Query: 322 RR--LSPSPQAAGAPAP 332
           R+  L P+   + +  P
Sbjct: 329 RKHPLHPTKTLSSSDQP 290

The following BLAST results are available for this feature:
BLAST vs. ExPASy Swiss-Prot
Match Name   E-value   Identity (%)  Description
Q9ZQ85       8.7e-84   48.07         Myb family transcription factor EFM OS=Arabidopsis thaliana OX=3702 GN=EFM PE=1 ...
Q5VRW2       5.1e-68   41.24         Transcription factor NIGTH1 OS=Oryza sativa subsp. japonica OX=39947 GN=NHO1 PE=...
Q6Z869       6.3e-42   35.87         Transcription factor NIGT1 OS=Oryza sativa subsp. japonica OX=39947 GN=NIGT1 PE=...
Q8VZS3       5.3e-41   36.36         Transcription factor HHO2 OS=Arabidopsis thaliana OX=3702 GN=HHO2 PE=1 SV=1
Q9FPE8       3.9e-36   33.73         Transcription factor HHO3 OS=Arabidopsis thaliana OX=3702 GN=HHO3 PE=1 SV=1

BLAST vs. TAIR 10
Match Name   E-value   Identity (%)  Description
AT2G03500.1  6.2e-85   48.07         Homeodomain-like superfamily protein
AT1G68670.1  3.8e-42   36.36         myb-like transcription factor family protein
AT1G25550.1  2.8e-37   33.73         myb-like transcription factor family protein
AT4G37180.1  2.7e-32   32.91         Homeodomain-like superfamily protein
AT4G37180.2  1.5e-30   31.86         Homeodomain-like superfamily protein
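
The identity percentages in these summaries come from the count fractions on each HSP's statistics line (for example, 236 identical positions out of 491 aligned columns). The Python sketch below recomputes them from such a line; the helper name `hsp_percentages` is hypothetical, and it simply assumes the first two `n/d` fractions on the line are the identity and positives counts, in that order.

```python
import re

def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the n/d fractions."""
    fractions = re.findall(r"(\d+)/(\d+)", stats_line)
    if len(fractions) < 2:
        raise ValueError("expected identity and positives fractions")
    # Assumption: the first fraction is identity, the second is positives.
    return tuple(round(100 * int(n) / int(d), 2) for n, d in fractions[:2])

line = "Identity = 236/491 (48.07%), Positives = 283/491 (57.64%), Query Frame = 0"
print(hsp_percentages(line))  # (48.07, 57.64)
```

Because the pattern only looks for the fractions, it does not depend on how the field labels are spelled.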
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                            Source       Source Term     Source Description                            Alignment
IPR006447  Myb domain, plants                         TIGRFAM      TIGR01557       TIGR01557                                     coord: 260..315; e-value: 5.9E-26; score: 88.5
None       No IPR available                           GENE3D       1.10.10.60      -                                             coord: 259..317; e-value: 9.8E-26; score: 91.7
None       No IPR available                           MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 240..259
None       No IPR available                           MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 410..427
None       No IPR available                           MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 447..462
None       No IPR available                           MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 373..462
None       No IPR available                           MOBIDB_LITE  mobidb-lite     disorder_prediction                           coord: 393..409
None       No IPR available                           PANTHER      PTHR31003:SF34  MYB-LIKE TRANSCRIPTION FACTOR FAMILY PROTEIN  coord: 1..462
IPR017930  Myb domain                                 PFAM         PF00249         Myb_DNA-binding                               coord: 263..313; e-value: 1.4E-5; score: 25.2
IPR017930  Myb domain                                 PROSITE      PS51294         HTH_MYB                                       coord: 257..317; score: 11.39057
IPR044787  Myb family transcription factor HRS1-like  PANTHER      PTHR31003       MYB FAMILY TRANSCRIPTION FACTOR               coord: 1..462
IPR009057  Homeobox-like domain superfamily           SUPERFAMILY  46689           Homeodomain-like                              coord: 258..318

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh05G013200.1  CmaCh05G013200.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003700      DNA-binding transcription factor activity