CmaCh16G002700 (gene) Cucurbita maxima (Rimu)

Name: CmaCh16G002700
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Transcription factor, putative
Location: Cma_Chr16 : 1270291 .. 1273777 (-)
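
As a minimal sketch, the location string above can be split into chromosome, coordinates, and strand, and the feature span derived from it. This assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cma_Chr16 : 1270291 .. 1273777 (-)")
span = end - start + 1  # assumption: both endpoints are included in the feature
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the feature spans 3487 bp on the minus strand.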
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, three_prime_UTR
TGTTCGTTCGGACCCTTCAAACTTTCCTGTGTTCTTCGAGTGGTTTGGTTTAGATCTGATCCATCTTCGCAAATCTGGCTCTTTCTGGAAATTCTCTTCCGATTTTCTTCTGTAAGATTCTTTCTCATTTTCTAGCAATTTCAGCTCTTGATTCCTGCTTGTTTCTCTCGTGTATTTTTGTTTTCGCGCTTTTCTGATTCTGTTGCCTGGCAAAGGTTTCTGAGAATCTGAAGAGAATTAATCGACATGGAAATAGTTGTTCAGAGCTTTGGGAAGTAGGAAATGCCGAATGATTTTGTCGGTTCTGTTGTTTGCTAGATGGAAATTTGATTTGATTGTCTTGGGAACTAAGCTGAAACGGATCATTAAAATTTCTGTTTTAAAATAAGATCTAATTTCTTCGTTTTCTTGGGGATCAAAAAGTGGCTTGTTTTTCCAAGAAGATTGTCTGAAATTTCTCTTTGAAAATTGAAATATTGTTCTTCTGCTTCAAATTCTCTTTCACATTCTTAAAATTCCTTTTGGATTTTGTATATATGTTCTGTAGACATTGACATATCTCTCTTTTGCAGCTTAGAAAAAGCTAGAAATTTCCTCCCAAATAGTTGGGTATCCTGTTTCTTTCGTCATATGAAATGCTGGCTTTCTTCAGTGTACTGTCTTATGTCAAATATTTGAATGAATGTAGATTGTTGGAGTTCTTTACTTTTTTCTTGTTTAGAGTTTATTTTGTTATTTTCAATCCTTTGTATTTATTGTAATTTTTCATGTTTTTGCAGGGGCACCATTATAAAAGTTGTGGATGTGTTAACTTTACGTATTGAGAGAATAATATGCTTTGGTTGAGCATATCCATTGATGTATTTCGTAAAGTTGTGCATTTGATCGTCTATTGCAGGGTTATTTTGTAGATCACTTTTTGTATAGATAACACTGAATTGTTCCTCCCTGAGCTCCTTTTTTCTATCATCCATGGGTTCCAATCACTTTTTTATCATTTCCTGTAATATCCACATTAGCTGAAAAGCTTCCTGTCCTCAAACACCATTTTGGTTTTGGTCTTCTGTTTTAGTCTTATTTTTTCTGCCCCTTTGTCAAGGAATCTAGATAGCTCATTAAATTTCCAGTTAAATAGTTCTTTCTGCAGTGAATATTCTCCGTTTAAGGTTCAGAAGTAGTTATGGTGTGCCAAGCTGCAACCCAGACAAGATTTCGGGCATTGAAACATGAGAATGGAATTGCAGGAAAACCAACAATTATTGTTAGAGTGATCGCATGCTTTCAACCTTTGCAGAATTGTCAGGTTTGTCAACTCAGTCTCTGCATCGTGGTTTTGCTTGTTGATTAACAATTGAATTATTTGAATCCCTCTGCTTGTTCCCCCTGAACTTGGCTCGTTCACTACCATGATTAAACTTTTGGTTTTTCCTGTTCCAGGCCGAGTATTTTCGACAATTGCTTAAGCCTGTCACGTAGATTGAGTAGCATTTATTTCTCTTAAGCTGGACTCTTGCGAACTTCTTTTTCTCTTTGTTCCTGAAAGCATTTTGCAGATAACTCCCATAGCAAACTAGAGCACAACATTTTGAGGTAGTTCTTGCCACCTGCTTGCTAAGTTTATTCTCCTTATGTTATTTCGGTTGTTTCGAAATAGTTATATTTTGAAATATTGATTAGGTGGTTGTTTTGCGGTATGGTTGGGACTGATAACTCTCGGCTTGATTTTGAGCATTTTGCTTGGCAACTGCACAATCGTGATTCCATGAATGCACCAGTAGAGATCAAGCAACAGGAAAGCTTACAAGGGAATATCAACTGTGGAAATTGCATACTTCCCACATGTGTGGGAGGAATGCAAGGGTTTGCAATTCCACCATTACCCAGTTTTAAAGTGGAACAGCTAAATGTTGTTCAGGAGTCACGTCAATGCTTACCCCCTCATTTCCAGAACTCACTTGGTACTCCAATGCCTTGGCAGAAAGGAAAGGAATCTATGCATT
ATGCTCATGCTGGACCTAGTGGGATGCCTGTGTCAAAATCAAATAATGGCTCCTATCCGAAAGGTTTTCTTATCTTTGATCAATCTGGAAACCAGAAGAGATTGATGTATGCTCCTATGTATCCTCCTGTCTATATTCCCTCAACTTTTGCTGAAACCAAGCGTTGTGGTTGGCTTGAAGAAGAAGGGGCAGCTGGAGAGATTAATTCTGTCAAGTATTTTCCAAATACTCTGTCCAATGAGAATTATGTAGGTGATGGAGAGAGTGAAATGCATGAAAACACGGAAGAAATAGATGCATTACTTTACTCAGACTACGATGGTACTGGTTGTAGTAGCGACGATGAAGTGACTAGCACAGGACATTCCCCAGAGATGATCCATGATCATTGTGAGAAGGAAGAACAATGTCAAGAAACTACTACAGAAGCTGCTAGTTCTAATGGCCCAAGAAAAAGGCAGAGAGTGCATGATGGGGGCTACGTAAAATCACTGCCAATTGCTACGGCTTCTTCTGCAAGAGTTGAATTACAGAAGTTTGTCAATGACGCAGAATCAAGCTGTGGCATGGTCCATGAGGAAGAGGTTGGAACAGATGTTGATTTTTGCCGTTCTTCTTGCAAGAAAGACAGGATCAAGGAGACGTTGAGAGTACTTGAGAGTTTAGTTCCTGATGCTAAGGGAAAAGACCCATTGTTGGTTATTGATGAAGCTATTGATTACTTGAAGTCCTTAAAACATGAAGCTAGCGCTCTAGGTGTGAGCTGCTATTAGATCGCATGCCCCGAATTTAGATGTTGATGGTGTGGTAACTTAAGAATTCTTATTTTGGATTTTGATGGATGCAAGATCAGTTTGTGATGGGTTAGCAAATGGAGGTATAATGTCTGGATAACATTGAACCCAGCCGGTAATGAGTGAATTATTGCCATTTTTATTCGTTGACAAGACATGATTATGTTTGATGCTTTTAAGGGACTAAAGATCAGGGGAAAGTCACAAAGATTTGAACGCATGGGATGTGATAGTGGTGGGTTATGCTGAGAATTTAAAGACCCCACCTTTTCCAGCAATTTGGAGAATCTCCCTGGATTGAGTGCATGCGTGTTAGGTAACTGAGGTCCCACTTTCCACACTTTTTTCTGGATTATGTGCTATGTCTTTACCCTCATTACTCATTTATAGAGATTGTGCTGTAAAAGTGGAGCTTGTCCCTTGTGGTGTTAGGTAGATCTTGCACGGTTCGAGTTTTGATTATGAGAGCACTATGAAACTTTTGAAACTTTTGACTGATATCAGCTTTGGAGATAGATTTTAGTTTTGCTATTAGGTGTTGCGTTGATGTAATAATTGAGGTGTGAATGCTTTTGTGAGTCTATCATGTTACATGTACTGCTTTCTCTATGCTTGTACTTGATCTAAGCCTTGAGAATACTGGCTGTATAATTATATTGTTGATGAATTATACCATACGAGATTGTTAGCCAA

mRNA sequence

TGTTCGTTCGGACCCTTCAAACTTTCCTGTGTTCTTCGAGTGGTTTGGTTTAGATCTGATCCATCTTCGCAAATCTGGCTCTTTCTGGAAATTCTCTTCCGATTTTCTTCTGGGCACCATTATAAAAGTTGTGGATGTGTTAACTTTACGTATTGAGAGAATAATATGCTTTGGTTGAGCATATCCATTGATGTATTTCGTAAAGTTGTGCATTTGATCGTCTATTGCAGGGTTATTTTGTAGATCACTTTTTGTATAGATAACACTGAATTGTTCCTCCCTGAGCTCCTTTTTTCTATCATCCATGGGTTCCAATCACTTTTTTATCATTTCCTGTAATATCCACATTAGCTGAAAAGCTTCCTGTCCTCAAACACCATTTTGGTTTTGGTCTTCTGTTTTAGTCTTATTTTTTCTGCCCCTTTGTCAAGGAATCTAGATAGCTCATTAAATTTCCAGTTAAATAGTTCTTTCTGCAGTGAATATTCTCCGTTTAAGGTTCAGAAGTAGTTATGGTGTGCCAAGCTGCAACCCAGACAAGATTTCGGGCATTGAAACATGAGAATGGAATTGCAGGAAAACCAACAATTATTGTTAGAGTGATCGCATGCTTTCAACCTTTGCAGAATTGTCAGGCCGAGTATTTTCGACAATTGCTTAAGCCTCATTTTGCAGATAACTCCCATAGCAAACTAGAGCACAACATTTTGAGGTGGTTGTTTTGCGGTATGGTTGGGACTGATAACTCTCGGCTTGATTTTGAGCATTTTGCTTGGCAACTGCACAATCGTGATTCCATGAATGCACCAGTAGAGATCAAGCAACAGGAAAGCTTACAAGGGAATATCAACTGTGGAAATTGCATACTTCCCACATGTGTGGGAGGAATGCAAGGGTTTGCAATTCCACCATTACCCAGTTTTAAAGTGGAACAGCTAAATGTTGTTCAGGAGTCACGTCAATGCTTACCCCCTCATTTCCAGAACTCACTTGGTACTCCAATGCCTTGGCAGAAAGGAAAGGAATCTATGCATTATGCTCATGCTGGACCTAGTGGGATGCCTGTGTCAAAATCAAATAATGGCTCCTATCCGAAAGGTTTTCTTATCTTTGATCAATCTGGAAACCAGAAGAGATTGATGTATGCTCCTATGTATCCTCCTGTCTATATTCCCTCAACTTTTGCTGAAACCAAGCGTTGTGGTTGGCTTGAAGAAGAAGGGGCAGCTGGAGAGATTAATTCTGTCAAGTATTTTCCAAATACTCTGTCCAATGAGAATTATGTAGGTGATGGAGAGAGTGAAATGCATGAAAACACGGAAGAAATAGATGCATTACTTTACTCAGACTACGATGGTACTGGTTGTAGTAGCGACGATGAAGTGACTAGCACAGGACATTCCCCAGAGATGATCCATGATCATTGTGAGAAGGAAGAACAATGTCAAGAAACTACTACAGAAGCTGCTAGTTCTAATGGCCCAAGAAAAAGGCAGAGAGTGCATGATGGGGGCTACGTAAAATCACTGCCAATTGCTACGGCTTCTTCTGCAAGAGTTGAATTACAGAAGTTTGTCAATGACGCAGAATCAAGCTGTGGCATGGTCCATGAGGAAGAGGTTGGAACAGATGTTGATTTTTGCCGTTCTTCTTGCAAGAAAGACAGGATCAAGGAGACGTTGAGAGTACTTGAGAGTTTAGTTCCTGATGCTAAGGGAAAAGACCCATTGTTGGTTATTGATGAAGCTATTGATTACTTGAAGTCCTTAAAACATGAAGCTAGCGCTCTAGGTGTGAGCTGCTATTAGATCGCATGCCCCGAATTTAGATGTTGATGGTGTGGTAACTTAAGAATTCTTATTTTGGATTTTGATGGATGCAAGATCAGTTTGTGATGGGTTAGCAAATGGAGGTATAATGTCTGGATAACATTGAACCCAGCCGGTAATGAGTGAATTATTGCCATTTTTATTCGTTGACAAGACATGATTATGTTTGAT
GCTTTTAAGGGACTAAAGATCAGGGGAAAGTCACAAAGATTTGAACGCATGGGATGTGATAGTGGTGGGTTATGCTGAGAATTTAAAGACCCCACCTTTTCCAGCAATTTGGAGAATCTCCCTGGATTGAGTGCATGCGTGTTAGGTAACTGAGGTCCCACTTTCCACACTTTTTTCTGGATTATGTGCTATGTCTTTACCCTCATTACTCATTTATAGAGATTGTGCTGTAAAAGTGGAGCTTGTCCCTTGTGGTGTTAGGTAGATCTTGCACGGTTCGAGTTTTGATTATGAGAGCACTATGAAACTTTTGAAACTTTTGACTGATATCAGCTTTGGAGATAGATTTTAGTTTTGCTATTAGGTGTTGCGTTGATGTAATAATTGAGGTGTGAATGCTTTTGTGAGTCTATCATGTTACATGTACTGCTTTCTCTATGCTTGTACTTGATCTAAGCCTTGAGAATACTGGCTGTATAATTATATTGTTGATGAATTATACCATACGAGATTGTTAGCCAA

Coding sequence (CDS)

ATGGTGTGCCAAGCTGCAACCCAGACAAGATTTCGGGCATTGAAACATGAGAATGGAATTGCAGGAAAACCAACAATTATTGTTAGAGTGATCGCATGCTTTCAACCTTTGCAGAATTGTCAGGCCGAGTATTTTCGACAATTGCTTAAGCCTCATTTTGCAGATAACTCCCATAGCAAACTAGAGCACAACATTTTGAGGTGGTTGTTTTGCGGTATGGTTGGGACTGATAACTCTCGGCTTGATTTTGAGCATTTTGCTTGGCAACTGCACAATCGTGATTCCATGAATGCACCAGTAGAGATCAAGCAACAGGAAAGCTTACAAGGGAATATCAACTGTGGAAATTGCATACTTCCCACATGTGTGGGAGGAATGCAAGGGTTTGCAATTCCACCATTACCCAGTTTTAAAGTGGAACAGCTAAATGTTGTTCAGGAGTCACGTCAATGCTTACCCCCTCATTTCCAGAACTCACTTGGTACTCCAATGCCTTGGCAGAAAGGAAAGGAATCTATGCATTATGCTCATGCTGGACCTAGTGGGATGCCTGTGTCAAAATCAAATAATGGCTCCTATCCGAAAGGTTTTCTTATCTTTGATCAATCTGGAAACCAGAAGAGATTGATGTATGCTCCTATGTATCCTCCTGTCTATATTCCCTCAACTTTTGCTGAAACCAAGCGTTGTGGTTGGCTTGAAGAAGAAGGGGCAGCTGGAGAGATTAATTCTGTCAAGTATTTTCCAAATACTCTGTCCAATGAGAATTATGTAGGTGATGGAGAGAGTGAAATGCATGAAAACACGGAAGAAATAGATGCATTACTTTACTCAGACTACGATGGTACTGGTTGTAGTAGCGACGATGAAGTGACTAGCACAGGACATTCCCCAGAGATGATCCATGATCATTGTGAGAAGGAAGAACAATGTCAAGAAACTACTACAGAAGCTGCTAGTTCTAATGGCCCAAGAAAAAGGCAGAGAGTGCATGATGGGGGCTACGTAAAATCACTGCCAATTGCTACGGCTTCTTCTGCAAGAGTTGAATTACAGAAGTTTGTCAATGACGCAGAATCAAGCTGTGGCATGGTCCATGAGGAAGAGGTTGGAACAGATGTTGATTTTTGCCGTTCTTCTTGCAAGAAAGACAGGATCAAGGAGACGTTGAGAGTACTTGAGAGTTTAGTTCCTGATGCTAAGGGAAAAGACCCATTGTTGGTTATTGATGAAGCTATTGATTACTTGAAGTCCTTAAAACATGAAGCTAGCGCTCTAGGTGTGAGCTGCTATTAG

Protein sequence

MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSKLEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILPTCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGPSGMPVSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINSVKYFPNTLSNENYVGDGESEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMIHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHEASALGVSCY
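
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code (not stated on this page) and, to stay short, translates only the first 30 and last 18 nucleotides of the CDS rather than the full sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGTGCCAAGCTGCAACCCAGACAAGA"  # first 30 nt of the CDS above
cds_end = "GGTGTGAGCTGCTATTAG"                # last 18 nt, ending in the TAG stop
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first ten residues (MVCQAATQTR) and the last five residues (GVSCY) of the listed protein.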
BLAST of CmaCh16G002700 vs. Swiss-Prot
Match: SAC51_ARATH (Transcription factor SAC51 OS=Arabidopsis thaliana GN=SAC51 PE=2 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.8e-27
Identity = 106/302 (35.10%), Positives = 144/302 (47.68%), Query Frame = 1

Query: 131 IPPLPSFKVEQLNVVQESRQCL-PPHFQNSLGTPMPWQKGKE----SMHYAHAGPSGMPV 190
           +P +P  ++ +L   +   +CL PP FQ+ L +      GK      M    A  +    
Sbjct: 29  LPRIPLPELGKLYAAKLQARCLQPPPFQSLLCSHDKESYGKRFSRSDMRSWCAAATTTTT 88

Query: 191 SKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINSV 250
                 S  K  LIFDQSG+Q RL+  P   P+  PS  A         E     E+  +
Sbjct: 89  PLGALESSQKRLLIFDQSGDQTRLLQCPF--PLRFPSHAAA--------EPVKLSELQGI 148

Query: 251 KYFPNTLSNENYVGDG-ESEMHENTEEIDALLYSDYD-GTGCSSDDEVTSTGHSPEMIHD 310
           +        E +  DG ESEMHE+TEEI+ALLYSD D    C SDDEV STGHSP     
Sbjct: 149 EKAFKEDGEEFHKSDGTESEMHEDTEEINALLYSDDDYDDDCESDDEVMSTGHSPYPNEG 208

Query: 311 HCEKEEQCQETTTEAASSNGPRKRQRVHDG-GYVKSLP--IATASSARVELQKFVNDAES 370
            C K E            +GP KRQ++ D    +  L   + T SS ++    F+ D + 
Sbjct: 209 VCNKRE--------LEEIDGPCKRQKLLDKVNNISDLSSLVGTESSTQLNGSSFLKDKKL 268

Query: 371 SCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLK 423
                   +  T         KKD+I+  L++LES+VP AKG + LL++DEAIDYLK LK
Sbjct: 269 PESKTISTKEDTGSGLSNEQSKKDKIRTALKILESVVPGAKGNEALLLLDEAIDYLKLLK 312
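
The percentages in an HSP header are ratios of matching positions to alignment length. As a rough sketch, they can be recomputed from the header line of the hit above. The function name `hsp_percentages` is hypothetical, and the regular expression accepts whatever label precedes each `matches/length` pair.

```python
import re

def hsp_percentages(header):
    """Return {label: percent} recomputed from 'label = matches/length' pairs."""
    return {
        label: round(100 * int(n) / int(total), 2)
        for label, n, total in re.findall(r"(\w+) = (\d+)/(\d+)", header)
    }

stats = hsp_percentages(
    "Identity = 106/302 (35.10%), Positives = 144/302 (47.68%), Query Frame = 1"
)
print(stats)
```

The recomputed values agree with the reported 35.10% identity and 47.68% positives over 302 aligned positions.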

BLAST of CmaCh16G002700 vs. Swiss-Prot
Match: BH143_ARATH (Transcription factor bHLH143 OS=Arabidopsis thaliana GN=BHLH143 PE=2 SV=1)

HSP 1 Score: 124.4 bits (311), Expect = 3.0e-27
Identity = 112/344 (32.56%), Positives = 177/344 (51.45%), Query Frame = 1

Query: 99  PVEIKQQESLQGNINCGNCILPTCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQN 158
           P++ KQQ+ L   +N   C+        +    P +P  ++ ++   +   + L P FQ 
Sbjct: 2   PLDTKQQKWLPLGLNPQACVQDKATEYFR----PGIPFPELGKVYAAEHQFRYLQPPFQA 61

Query: 159 SLGTPMPWQKGKESMHYAHAGPSGMPVSKSNNGSYPKG--------FLIFDQSGNQKRLM 218
            L        GK+               +S+NG+ P+G        F++FDQSG Q RL+
Sbjct: 62  LLSRYDQQSCGKQVSCLN---------GRSSNGAAPEGALKSSRKRFIVFDQSGEQTRLL 121

Query: 219 YAPMYPPVYIPSTFAETKR--CGWLEEEGAAGEINSVKYFPNTLSNENYV-GDGESEMHE 278
                 P+  PS+    +    G L  E    + ++++     L +E++  G+ +SEMHE
Sbjct: 122 QCGF--PLRFPSSMDAERGNILGALHPEKGFSKDHAIQ--EKILQHEDHENGEEDSEMHE 181

Query: 279 NTEEIDALLYSDYDGTG-CSSDDEVTSTGHSPEMIHDHCEKEEQCQETTTE----AASSN 338
           +TEEI+ALLYSD D      SDDEV STGHSP  +     +++ C  TT E     ++ +
Sbjct: 182 DTEEINALLYSDDDDNDDWESDDEVMSTGHSPFTV-----EQQACNITTEELDETESTVD 241

Query: 339 GP-RKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGMVHEEEVGTDVDFCRSSC 398
           GP  KRQ++ D  Y  S P +   + +V+     N  ES+  +  ++E G+ +   +S  
Sbjct: 242 GPLLKRQKLLDHSYRDSSP-SLVGTTKVKGLSDENLPESN--ISSKQETGSGLSDEQS-- 301

Query: 399 KKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHEASA 426
           +KD+I   LR+LES+VP AKGK+ LL++DEAIDYLK LK   ++
Sbjct: 302 RKDKIHTALRILESVVPGAKGKEALLLLDEAIDYLKLLKQSLNS 318

BLAST of CmaCh16G002700 vs. Swiss-Prot
Match: BH145_ARATH (Transcription factor bHLH145 OS=Arabidopsis thaliana GN=BHLH145 PE=2 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 1.7e-17
Identity = 77/241 (31.95%), Positives = 115/241 (47.72%), Query Frame = 1

Query: 185 VSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINS 244
           +SK+      K FL+FDQSG+Q  L+ A       I  +F   K+    + +      N 
Sbjct: 97  ISKAEEQCSQKRFLVFDQSGDQTTLLLASD-----IRKSFETLKQHACPDMKEELQRSNK 156

Query: 245 VKYFPNTLSNENYVGDGESEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMIHDH 304
             +  + +      G+ E ++ E++EE++ALLYS+ +   CS +DEVTS  HSP ++   
Sbjct: 157 DLFVCHGMQ-----GNSEPDLKEDSEELNALLYSEDESGYCSEEDEVTSADHSPSIV--- 216

Query: 305 CEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGM 364
                               R+ Q+   G Y + L  A          + + DAESSCG 
Sbjct: 217 -----------------VSGREDQKTFLGSYGQPLN-AKKRKILETSNESMRDAESSCGS 276

Query: 365 VHEEEVGTDVDFCRSSCK--KDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHE 424
                +        SS K  +++I ET+ +L S+VP  +  DP+LVID AIDYLKSLK E
Sbjct: 277 CDNTRISFLKRSKLSSNKIGEEKIFETVSLLRSVVPGEELVDPILVIDRAIDYLKSLKME 306

BLAST of CmaCh16G002700 vs. TrEMBL
Match: B7SHL0_CUCSA (Putative transcription factor OS=Cucumis sativus GN=Csa_4G113170 PE=2 SV=1)

HSP 1 Score: 567.0 bits (1460), Expect = 2.0e-158
Identity = 281/349 (80.52%), Positives = 302/349 (86.53%), Query Frame = 1

Query: 73  MVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILPTCVGGMQGFAIP 132
           MVGTDNSRLDFEHFAWQLHN +SMNA +E KQQES Q +IN  NCI   C+G MQ FAIP
Sbjct: 1   MVGTDNSRLDFEHFAWQLHNYNSMNASIETKQQESCQTSINHENCIFSKCMGRMQRFAIP 60

Query: 133 PLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGPSGMPVSKSNNGS 192
           PLPSF+VEQLNV+Q SR CL PHFQNS GT + +Q  KESMHYAHAGPSGMPVSKSNNGS
Sbjct: 61  PLPSFEVEQLNVIQGSRHCLSPHFQNSRGTFISYQNEKESMHYAHAGPSGMPVSKSNNGS 120

Query: 193 YPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINSVKYFPNTL 252
           YPKGFLIFDQSGNQKRLMYAPM P VY PS   E K CGWLEE+GA  +INSVKY PNTL
Sbjct: 121 YPKGFLIFDQSGNQKRLMYAPMCP-VYFPSIVTENKCCGWLEEKGAVRDINSVKYSPNTL 180

Query: 253 SNENYVGDGES-EMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMIHDHCEKEEQC 312
           SNENYV DGES EMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMI++HCEKEEQC
Sbjct: 181 SNENYVADGESSEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMINEHCEKEEQC 240

Query: 313 QETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGMVHEEEVG 372
           QETTTE ASS+ PRKRQR+HDGGY+KSLPIAT S ARVE Q + NDAESSCGMVH+EE G
Sbjct: 241 QETTTEVASSDVPRKRQRLHDGGYIKSLPIATGSCARVESQNYANDAESSCGMVHKEEAG 300

Query: 373 TDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLK 421
            D+DFC  SCKKDRI+ETLRVLESLVP AKGKDPLLVIDEAI+Y + LK
Sbjct: 301 ADIDFCYCSCKKDRIEETLRVLESLVPGAKGKDPLLVIDEAINYFEVLK 348

BLAST of CmaCh16G002700 vs. TrEMBL
Match: W9R7U4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_011398 PE=4 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.5e-73
Identity = 195/442 (44.12%), Positives = 250/442 (56.56%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MVCQAA+QTRFRALKHENGIAGKPTIIVRVIACFQPLQ+CQAEYFR LLKP         
Sbjct: 1   MVCQAASQTRFRALKHENGIAGKPTIIVRVIACFQPLQDCQAEYFRHLLKP--------- 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                +  LF  MV   +S L  +  + QL + + M+  +E +QQE L    N   C + 
Sbjct: 61  -----VTLLFGLMVKASDSWLSSQLSSQQLPDLNCMSTLLETRQQECLPLLTNHSTCKVS 120

Query: 121 TCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGP 180
             V  + G   P L + + E ++   E   C  P F   +    P+  GK+S      G 
Sbjct: 121 EPV-MLPGSTSPRLQNLQTEHIDAAHEPLHCFSPDFHALIPATNPYINGKQST--LPYGF 180

Query: 181 SGMPVSKSN-NGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGW--LEEEG 240
           SGM V  +  + S  KGFLIFDQS NQ R++Y  + PP   P         G+  L+  G
Sbjct: 181 SGMVVPNTKFSASCQKGFLIFDQSENQTRMIYNYVCPPTQNPIIANVRIDSGYDVLQMTG 240

Query: 241 AAGEINSVKYFPNTLSNENYVGDGESEMHENTEEIDALLYSDYDGTGC-----SSDDEVT 300
            A +++ +    N +S E   G+ ESEMHE++EEI+ALLYSD DG          DDEVT
Sbjct: 241 NAAKMDRIDPIKN-ISCEASDGNKESEMHEDSEEINALLYSDDDGNDSGDDEYGEDDEVT 300

Query: 301 STGHSPEM-IHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVE- 360
            TGH P M + +  EK E   E T E ASS+GP KRQ++ DGG  KS  + TAS   ++ 
Sbjct: 301 CTGHFPPMPMKEDHEKHEHIGELTEEVASSDGPNKRQKMLDGGCKKSSALYTASVVNLDG 360

Query: 361 LQKFVNDAESSCGMVHEEEVGTDVDFCRSS---CKKDRIKETLRVLESLVPDAKGKDPLL 420
             ++  DA+S C    + + G +   C S     K+D+I E LRVLES++P  KGKDPLL
Sbjct: 361 SHEYDKDAKSCCA---DGQTGVEESDCTSGNMRSKRDKIIEILRVLESIIPGVKGKDPLL 420

Query: 421 VIDEAIDYLKSLKHEASALGVS 430
           VID AIDYL   K +A  LGVS
Sbjct: 421 VIDGAIDYLTITKLKAETLGVS 421

BLAST of CmaCh16G002700 vs. TrEMBL
Match: A0A061E122_THECC (Sequence-specific DNA binding transcription factors,transcription regulators, putative OS=Theobroma cacao GN=TCM_007402 PE=4 SV=1)

HSP 1 Score: 274.6 bits (701), Expect = 2.0e-70
Identity = 187/457 (40.92%), Positives = 255/457 (55.80%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MVCQAA+QTRFRALK+ENGIAGK TI+VRVIACFQP+++CQAEYFR LLKP      H  
Sbjct: 1   MVCQAASQTRFRALKYENGIAGKSTIVVRVIACFQPMEDCQAEYFRHLLKPI----EHCS 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                  W    MV T+NS    +H  WQL     M+  +E +Q E L   IN    +  
Sbjct: 61  YPGGCSSW----MVQTNNSWFFPQHSTWQLPKLSCMSTSLEPRQPERLPACINPSTHMFS 120

Query: 121 TCVGGMQGFAIP-------------------PLPSFKVEQLNVVQESRQCLPPHFQNSLG 180
                M G  +P                    + + K EQ     +  Q L P F  SL 
Sbjct: 121 VS-RSMPGSLVPGINPGIHAVPATMAMPRSADISTLKTEQKYHSDQLLQQLYPCFPTSLP 180

Query: 181 TPMPWQKGKESMHYAHAGPSGMPVSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIP 240
           +   + K ++ M     G SG   +   +G   KG +IFDQSG+Q RL+Y  + PP    
Sbjct: 181 SLGSYLKEQQLM--IAKGYSGRATANVVSGFLQKGLVIFDQSGSQTRLIYGSV-PPTSQY 240

Query: 241 STFAETKRCGWLE-EEGAAGEINSVKYFPNTLS---NENYVGDGESEMHENTEEIDALLY 300
           +T A T+    L+  EG A +++     P TL    +EN++   ESEM E+TEE++ALLY
Sbjct: 241 ATTAVTEPASCLDLHEGQAVKMSPFTPTPPTLQEEFDENHLSVEESEMREDTEELNALLY 300

Query: 301 SDY--DGTGCSSDDEVTSTGHSPEMIHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGY 360
           SD   D      DDEV ST HSP  I  + + E+Q  +   E ASS+GP KRQ++ +GG+
Sbjct: 301 SDEEDDDYHDGDDDEVMSTDHSPFPIKRNYQNEDQVGDVMEEVASSDGPNKRQKLLNGGH 360

Query: 361 VKSLPIATASSARVE-LQKFVNDAESSCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLE 420
            +S  + TA S ++E   ++  DAESS  + H +    D        KKD+I+ TL++LE
Sbjct: 361 KQSSMVDTACSVKLEGSHEYDGDAESSYAIGHNQREEIDSSLRSKQSKKDKIRFTLKILE 420

Query: 421 SLVPDAKGKDPLLVIDEAIDYLKSLKHEASALGVSCY 432
           S++P AKGK+PLLV+DE+I++LKSLK EA +LG+S Y
Sbjct: 421 SIIPGAKGKNPLLVLDESIEHLKSLKLEAKSLGLSHY 445

BLAST of CmaCh16G002700 vs. TrEMBL
Match: A0A0B0MJS2_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_26115 PE=4 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 1.1e-63
Identity = 180/460 (39.13%), Positives = 246/460 (53.48%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MVCQAA+QTRFRALKHENGIAGKPTI+VRVIACFQP+++CQAEYFR LLKP      H  
Sbjct: 1   MVCQAASQTRFRALKHENGIAGKPTIVVRVIACFQPMEDCQAEYFRHLLKP--VTIEHCP 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                  W    MV T+NS +  +H +W+L     M+A +E +Q E L   IN  + IL 
Sbjct: 61  SPGVCSSW----MVKTNNSWVFPQHSSWRLPELSCMSASLEPRQPECLPACINPSSHILS 120

Query: 121 TCVG-------GMQ--------GFAIP---PLPSFKVEQLNVVQESRQCLPPHFQNSLGT 180
             V        GM           A+P    +   K EQ        Q L P F  SL  
Sbjct: 121 VSVSKLGSLVPGMNYGTHVLPANIAMPGSADISVLKAEQKYQPHGLLQQLYPSFPTSL-- 180

Query: 181 PMPWQKGKESMHYAHAGPSGMPVSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPP-VYIP 240
           P       E       G +G   +   +GS+ KG +IFD SG+Q RL+      P  +  
Sbjct: 181 PSRGSFLNEQQFMIANGHTGRAAANFVSGSFQKGLIIFDHSGSQTRLICGSFRSPHQHAA 240

Query: 241 STFAETKRCGWLEEEGAAGEINSVKYFPNTLS---NENYVGDGESEMHENTEEIDALLYS 300
           +   E      + E   A + N++   P  L    +EN +G   SEM E+TEE++ALLYS
Sbjct: 241 TAITELASSLDIHEGLQAVKTNTLIPTPPALQEEYDENRLGVEGSEMREDTEELNALLYS 300

Query: 301 DYDGTGCS------SDDEVTSTGHSPEMIHDHCEKEEQCQETTTEAASSNGPRKRQRVHD 360
           D +   C        DDEV ST HSP  I    + ++   +   + ASS+GP KRQ++ +
Sbjct: 301 DEEEDDCGVGDDDCDDDEVMSTAHSPIGIKRSFQNQDHDNDVIEQVASSDGPNKRQKLLN 360

Query: 361 GGYVKSLPIATASSARVE-LQKFVNDAESSC--GMVHEEEVGTDVDFCRSSCKKDRIKET 420
           GG+ + + +  A S ++E   ++ +DAESS    ++H E+             KD+I+ T
Sbjct: 361 GGHKQLIMVDAACSVKLEGSHEYDSDAESSYRGEILHTEQ-----------SMKDKIRLT 420

Query: 421 LRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHEASALGVS 430
           L++LES++P  KGKDPLLV+DE++DYLKSLK EA  LGV+
Sbjct: 421 LKILESIIPGTKGKDPLLVLDESVDYLKSLKLEAETLGVN 441

BLAST of CmaCh16G002700 vs. TrEMBL
Match: B9S9F0_RICCO (Transcription factor, putative OS=Ricinus communis GN=RCOM_0884580 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 9.0e-63
Identity = 178/435 (40.92%), Positives = 234/435 (53.79%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MV QAA+QTRFRALK+ENGIAGKPTIIVRVIAC+QPLQ+CQA            +NS   
Sbjct: 1   MVFQAASQTRFRALKYENGIAGKPTIIVRVIACYQPLQDCQA------------NNS--- 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                  WLF              H  W+L + + M+  VE  Q   L   ++ G    P
Sbjct: 61  -------WLFP------------PHETWELSDFNCMSTSVEPVQPGCLPAFVSHGT---P 120

Query: 121 TCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGP 180
           T +  M   ++P  PS + +Q    Q   Q   P F   L  P      KES+   + G 
Sbjct: 121 TNMM-MPRISVPTYPSLRTQQSTGAQGLPQSKAPPFHQVL--PAIDSYPKESLPAFNYGF 180

Query: 181 SGMPVSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCG-WLEEEGAA 240
           SG     +      K F+IFDQSGN+ RL+Y+  +P    P+  A     G +L  E  A
Sbjct: 181 SGESALNAVPACQRK-FVIFDQSGNETRLIYSSFFPTGAKPTIAASRPTAGSYLRSEEHA 240

Query: 241 GEINSVKYFPNTL---SNENYVGDGESEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGH 300
            +++ +      L   S+ENY    ESEMHE+TEEIDALLYSD D      DDEV STGH
Sbjct: 241 AKLDGINLIMPKLQEVSDENYFSGEESEMHEDTEEIDALLYSD-DNDDDYDDDEVISTGH 300

Query: 301 SPEMIHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVN 360
           SP +I ++  +  Q +E T E   S+G  KRQ++ DGGY +S    TA S +V +    +
Sbjct: 301 SPSLIRNYGMRG-QVEEITEEVTDSDGQNKRQKLLDGGYKRSSLTDTAGSTKVAMAHGYD 360

Query: 361 --DAESSCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAID 420
             DAESSC +    +     +  +   KKD+I+ TL++LES++P  K KDPLLV+D AID
Sbjct: 361 CDDAESSCAIGQNHKELRLANLGKEQLKKDKIRATLKILESIIPGVKDKDPLLVLDVAID 392

Query: 421 YLKSLKHEASALGVS 430
           YLKSLK  A  LGV+
Sbjct: 421 YLKSLKLSAKTLGVN 392

BLAST of CmaCh16G002700 vs. TAIR10
Match: AT5G64340.1 (AT5G64340.1 sequence-specific DNA binding transcription factors;transcription regulators)

HSP 1 Score: 125.2 bits (313), Expect = 1.0e-28
Identity = 106/302 (35.10%), Positives = 144/302 (47.68%), Query Frame = 1

Query: 131 IPPLPSFKVEQLNVVQESRQCL-PPHFQNSLGTPMPWQKGKE----SMHYAHAGPSGMPV 190
           +P +P  ++ +L   +   +CL PP FQ+ L +      GK      M    A  +    
Sbjct: 29  LPRIPLPELGKLYAAKLQARCLQPPPFQSLLCSHDKESYGKRFSRSDMRSWCAAATTTTT 88

Query: 191 SKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINSV 250
                 S  K  LIFDQSG+Q RL+  P   P+  PS  A         E     E+  +
Sbjct: 89  PLGALESSQKRLLIFDQSGDQTRLLQCPF--PLRFPSHAAA--------EPVKLSELQGI 148

Query: 251 KYFPNTLSNENYVGDG-ESEMHENTEEIDALLYSDYD-GTGCSSDDEVTSTGHSPEMIHD 310
           +        E +  DG ESEMHE+TEEI+ALLYSD D    C SDDEV STGHSP     
Sbjct: 149 EKAFKEDGEEFHKSDGTESEMHEDTEEINALLYSDDDYDDDCESDDEVMSTGHSPYPNEG 208

Query: 311 HCEKEEQCQETTTEAASSNGPRKRQRVHDG-GYVKSLP--IATASSARVELQKFVNDAES 370
            C K E            +GP KRQ++ D    +  L   + T SS ++    F+ D + 
Sbjct: 209 VCNKRE--------LEEIDGPCKRQKLLDKVNNISDLSSLVGTESSTQLNGSSFLKDKKL 268

Query: 371 SCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLK 423
                   +  T         KKD+I+  L++LES+VP AKG + LL++DEAIDYLK LK
Sbjct: 269 PESKTISTKEDTGSGLSNEQSKKDKIRTALKILESVVPGAKGNEALLLLDEAIDYLKLLK 312

BLAST of CmaCh16G002700 vs. TAIR10
Match: AT5G09460.1 (AT5G09460.1 sequence-specific DNA binding transcription factors;transcription regulators)

HSP 1 Score: 124.4 bits (311), Expect = 1.7e-28
Identity = 112/344 (32.56%), Positives = 177/344 (51.45%), Query Frame = 1

Query: 99  PVEIKQQESLQGNINCGNCILPTCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQN 158
           P++ KQQ+ L   +N   C+        +    P +P  ++ ++   +   + L P FQ 
Sbjct: 2   PLDTKQQKWLPLGLNPQACVQDKATEYFR----PGIPFPELGKVYAAEHQFRYLQPPFQA 61

Query: 159 SLGTPMPWQKGKESMHYAHAGPSGMPVSKSNNGSYPKG--------FLIFDQSGNQKRLM 218
            L        GK+               +S+NG+ P+G        F++FDQSG Q RL+
Sbjct: 62  LLSRYDQQSCGKQVSCLN---------GRSSNGAAPEGALKSSRKRFIVFDQSGEQTRLL 121

Query: 219 YAPMYPPVYIPSTFAETKR--CGWLEEEGAAGEINSVKYFPNTLSNENYV-GDGESEMHE 278
                 P+  PS+    +    G L  E    + ++++     L +E++  G+ +SEMHE
Sbjct: 122 QCGF--PLRFPSSMDAERGNILGALHPEKGFSKDHAIQ--EKILQHEDHENGEEDSEMHE 181

Query: 279 NTEEIDALLYSDYDGTG-CSSDDEVTSTGHSPEMIHDHCEKEEQCQETTTE----AASSN 338
           +TEEI+ALLYSD D      SDDEV STGHSP  +     +++ C  TT E     ++ +
Sbjct: 182 DTEEINALLYSDDDDNDDWESDDEVMSTGHSPFTV-----EQQACNITTEELDETESTVD 241

Query: 339 GP-RKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGMVHEEEVGTDVDFCRSSC 398
           GP  KRQ++ D  Y  S P +   + +V+     N  ES+  +  ++E G+ +   +S  
Sbjct: 242 GPLLKRQKLLDHSYRDSSP-SLVGTTKVKGLSDENLPESN--ISSKQETGSGLSDEQS-- 301

Query: 399 KKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHEASA 426
           +KD+I   LR+LES+VP AKGK+ LL++DEAIDYLK LK   ++
Sbjct: 302 RKDKIHTALRILESVVPGAKGKEALLLLDEAIDYLKLLKQSLNS 318

BLAST of CmaCh16G002700 vs. TAIR10
Match: AT5G50010.1 (AT5G50010.1 sequence-specific DNA binding transcription factors;transcription regulators)

HSP 1 Score: 92.0 bits (227), Expect = 9.3e-19
Identity = 77/241 (31.95%), Positives = 115/241 (47.72%), Query Frame = 1

Query: 185 VSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINS 244
           +SK+      K FL+FDQSG+Q  L+ A       I  +F   K+    + +      N 
Sbjct: 97  ISKAEEQCSQKRFLVFDQSGDQTTLLLASD-----IRKSFETLKQHACPDMKEELQRSNK 156

Query: 245 VKYFPNTLSNENYVGDGESEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMIHDH 304
             +  + +      G+ E ++ E++EE++ALLYS+ +   CS +DEVTS  HSP ++   
Sbjct: 157 DLFVCHGMQ-----GNSEPDLKEDSEELNALLYSEDESGYCSEEDEVTSADHSPSIV--- 216

Query: 305 CEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGM 364
                               R+ Q+   G Y + L  A          + + DAESSCG 
Sbjct: 217 -----------------VSGREDQKTFLGSYGQPLN-AKKRKILETSNESMRDAESSCGS 276

Query: 365 VHEEEVGTDVDFCRSSCK--KDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHE 424
                +        SS K  +++I ET+ +L S+VP  +  DP+LVID AIDYLKSLK E
Sbjct: 277 CDNTRISFLKRSKLSSNKIGEEKIFETVSLLRSVVPGEELVDPILVIDRAIDYLKSLKME 306

BLAST of CmaCh16G002700 vs. TAIR10
Match: AT5G50011.1 (AT5G50011.1 conserved peptide upstream open reading frame 37)

HSP 1 Score: 86.7 bits (213), Expect = 3.9e-17
Identity = 41/51 (80.39%), Positives = 45/51 (88.24%), Query Frame = 1

Query: 1  MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKP 52
          MVCQ+A QTRFR LKHE+GI G   I+VRVIACFQPLQ+CQAEYFRQLLKP
Sbjct: 1  MVCQSAGQTRFRTLKHEHGITGN--IVVRVIACFQPLQDCQAEYFRQLLKP 49

BLAST of CmaCh16G002700 vs. TAIR10
Match: AT5G09461.1 (AT5G09461.1 conserved peptide upstream open reading frame 43)

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-14
Identity = 36/52 (69.23%), Positives = 43/52 (82.69%), Query Frame = 1

Query: 1  MVCQAATQTRFRALKHEN-GIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKP 52
          MV Q+A QTRFR  K+EN G + +PTI+VRVIACFQP+ NCQAEYFR +LKP
Sbjct: 1  MVSQSAGQTRFRTFKYENNGDSSRPTIVVRVIACFQPMDNCQAEYFRHILKP 52

BLAST of CmaCh16G002700 vs. NCBI nr
Match: gi|525507252|ref|NP_001267664.1| (transcription factor bHLH143-like [Cucumis sativus])

HSP 1 Score: 567.0 bits (1460), Expect = 2.8e-158
Identity = 281/349 (80.52%), Positives = 302/349 (86.53%), Query Frame = 1

Query: 73  MVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILPTCVGGMQGFAIP 132
           MVGTDNSRLDFEHFAWQLHN +SMNA +E KQQES Q +IN  NCI   C+G MQ FAIP
Sbjct: 1   MVGTDNSRLDFEHFAWQLHNYNSMNASIETKQQESCQTSINHENCIFSKCMGRMQRFAIP 60

Query: 133 PLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGPSGMPVSKSNNGS 192
           PLPSF+VEQLNV+Q SR CL PHFQNS GT + +Q  KESMHYAHAGPSGMPVSKSNNGS
Sbjct: 61  PLPSFEVEQLNVIQGSRHCLSPHFQNSRGTFISYQNEKESMHYAHAGPSGMPVSKSNNGS 120

Query: 193 YPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINSVKYFPNTL 252
           YPKGFLIFDQSGNQKRLMYAPM P VY PS   E K CGWLEE+GA  +INSVKY PNTL
Sbjct: 121 YPKGFLIFDQSGNQKRLMYAPMCP-VYFPSIVTENKCCGWLEEKGAVRDINSVKYSPNTL 180

Query: 253 SNENYVGDGES-EMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMIHDHCEKEEQC 312
           SNENYV DGES EMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMI++HCEKEEQC
Sbjct: 181 SNENYVADGESSEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMINEHCEKEEQC 240

Query: 313 QETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGMVHEEEVG 372
           QETTTE ASS+ PRKRQR+HDGGY+KSLPIAT S ARVE Q + NDAESSCGMVH+EE G
Sbjct: 241 QETTTEVASSDVPRKRQRLHDGGYIKSLPIATGSCARVESQNYANDAESSCGMVHKEEAG 300

Query: 373 TDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLK 421
            D+DFC  SCKKDRI+ETLRVLESLVP AKGKDPLLVIDEAI+Y + LK
Sbjct: 301 ADIDFCYCSCKKDRIEETLRVLESLVPGAKGKDPLLVIDEAINYFEVLK 348

BLAST of CmaCh16G002700 vs. NCBI nr
Match: gi|659125851|ref|XP_008462888.1| (PREDICTED: transcription factor bHLH143-like [Cucumis melo])

HSP 1 Score: 557.4 bits (1435), Expect = 2.2e-155
Identity = 283/360 (78.61%), Positives = 304/360 (84.44%), Query Frame = 1

Query: 73  MVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILPTCVGGMQGFAIP 132
           MVGTD          WQLHN +SMNA +EIKQQES Q NIN  +C+   C+GGMQ FAIP
Sbjct: 1   MVGTDT---------WQLHNYNSMNASIEIKQQESCQTNINHESCMFSKCMGGMQRFAIP 60

Query: 133 PLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGPSGMPVSKSNNGS 192
           PLPSF+VEQLNVVQ SR CL PHFQNSL T + +QK KESM+YAHAGPSGMPVSKS NGS
Sbjct: 61  PLPSFEVEQLNVVQGSRHCLSPHFQNSLVTFISYQKEKESMYYAHAGPSGMPVSKSTNGS 120

Query: 193 YPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAGEINSVKYFPNTL 252
           YPKGFLIFDQSGNQKRLMY PM   V + S  +E KRCGWL E+GA  +INSVKY PNTL
Sbjct: 121 YPKGFLIFDQSGNQKRLMYDPMCL-VSLSSIVSENKRCGWLAEKGAVRDINSVKYSPNTL 180

Query: 253 SNENYVGDGES-EMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMIHDHCEKEEQC 312
           SNENYV D ES EMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMI++HCEKEEQC
Sbjct: 181 SNENYVADEESSEMHENTEEIDALLYSDYDGTGCSSDDEVTSTGHSPEMINEHCEKEEQC 240

Query: 313 QETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVELQKFVNDAESSCGMVHEEEVG 372
           QETTTE ASS  PRK+QRVHDGGY+KSLPIA  S ARVE Q + NDAESSCGMVH+EE G
Sbjct: 241 QETTTEVASSEVPRKKQRVHDGGYIKSLPIAAVSCARVESQNYANDAESSCGMVHKEEAG 300

Query: 373 TDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVIDEAIDYLKSLKHEASALGVSCY 432
           TD+DFC SSCKKDRIKETLRVLESLVP AKGKDPLLVIDEAIDYLKSLKHEA+ALGVSCY
Sbjct: 301 TDIDFCHSSCKKDRIKETLRVLESLVPGAKGKDPLLVIDEAIDYLKSLKHEATALGVSCY 350

BLAST of CmaCh16G002700 vs. NCBI nr
Match: gi|703093100|ref|XP_010094825.1| (hypothetical protein L484_011398 [Morus notabilis])

HSP 1 Score: 285.0 bits (728), Expect = 2.1e-73
Identity = 195/442 (44.12%), Positives = 250/442 (56.56%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MVCQAA+QTRFRALKHENGIAGKPTIIVRVIACFQPLQ+CQAEYFR LLKP         
Sbjct: 1   MVCQAASQTRFRALKHENGIAGKPTIIVRVIACFQPLQDCQAEYFRHLLKP--------- 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                +  LF  MV   +S L  +  + QL + + M+  +E +QQE L    N   C + 
Sbjct: 61  -----VTLLFGLMVKASDSWLSSQLSSQQLPDLNCMSTLLETRQQECLPLLTNHSTCKVS 120

Query: 121 TCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGP 180
             V  + G   P L + + E ++   E   C  P F   +    P+  GK+S      G 
Sbjct: 121 EPV-MLPGSTSPRLQNLQTEHIDAAHEPLHCFSPDFHALIPATNPYINGKQST--LPYGF 180

Query: 181 SGMPVSKSN-NGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGW--LEEEG 240
           SGM V  +  + S  KGFLIFDQS NQ R++Y  + PP   P         G+  L+  G
Sbjct: 181 SGMVVPNTKFSASCQKGFLIFDQSENQTRMIYNYVCPPTQNPIIANVRIDSGYDVLQMTG 240

Query: 241 AAGEINSVKYFPNTLSNENYVGDGESEMHENTEEIDALLYSDYDGTGC-----SSDDEVT 300
            A +++ +    N +S E   G+ ESEMHE++EEI+ALLYSD DG          DDEVT
Sbjct: 241 NAAKMDRIDPIKN-ISCEASDGNKESEMHEDSEEINALLYSDDDGNDSGDDEYGEDDEVT 300

Query: 301 STGHSPEM-IHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVE- 360
            TGH P M + +  EK E   E T E ASS+GP KRQ++ DGG  KS  + TAS   ++ 
Sbjct: 301 CTGHFPPMPMKEDHEKHEHIGELTEEVASSDGPNKRQKMLDGGCKKSSALYTASVVNLDG 360

Query: 361 LQKFVNDAESSCGMVHEEEVGTDVDFCRSS---CKKDRIKETLRVLESLVPDAKGKDPLL 420
             ++  DA+S C    + + G +   C S     K+D+I E LRVLES++P  KGKDPLL
Sbjct: 361 SHEYDKDAKSCCA---DGQTGVEESDCTSGNMRSKRDKIIEILRVLESIIPGVKGKDPLL 420

Query: 421 VIDEAIDYLKSLKHEASALGVS 430
           VID AIDYL   K +A  LGVS
Sbjct: 421 VIDGAIDYLTITKLKAETLGVS 421

BLAST of CmaCh16G002700 vs. NCBI nr
Match: gi|743827484|ref|XP_011023035.1| (PREDICTED: transcription factor bHLH143-like isoform X2 [Populus euphratica])

HSP 1 Score: 277.7 bits (709), Expect = 3.4e-71
Identity = 183/440 (41.59%), Positives = 248/440 (56.36%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MVCQAA+QTRFRALKHENG AGK TIIVRVIAC+QPLQ+CQAE                 
Sbjct: 1   MVCQAASQTRFRALKHENGSAGKLTIIVRVIACYQPLQDCQAEG---------------- 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                  WLF  +              WQ  N + M   ++  Q + L   +N G C+  
Sbjct: 61  ------SWLFPPLS------------TWQSPNFNRMTTSLDPAQLQCLPACMNPGTCMTS 120

Query: 121 TCVGGMQGFAIPPLPSFKVEQLNVVQESRQCLPPHFQNSLGTPMPWQKGKESMHYAHAGP 180
             +  M G A+P +P+F+ +Q N       C+PPHFQN L    P+ K   S+ +++   
Sbjct: 121 ANMS-MPGLAVPSIPNFETQQGNEAYGLPPCVPPHFQNFLPGTNPYVKENLSV-FSYGLG 180

Query: 181 SGMPVSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIPSTFAETKRCGWLEEEGAAG 240
            G+P   +      + F IFDQSGN+KRLMY+     V  P+T       G+L  +  A 
Sbjct: 181 RGVP---NPIVGCQRRFFIFDQSGNEKRLMYSSFGLAVPKPTTADAKPIPGYLNYKEYAS 240

Query: 241 EINSVKYFPNTLSNENYVGDGESEMHENTEEIDALLYS---DYDG---TGCSSDDEVTST 300
           +++ +K   + +S+EN+    E+EMHE+TEEI+ALL S   DYDG      S DDEV ST
Sbjct: 241 KMDQMKLKLHEVSDENHFNGEETEMHEDTEEINALLDSDGDDYDGGSNDDDSDDDEVRST 300

Query: 301 GHSPEMIHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGYVKSLPIATASSARVE---- 360
           GH P +I  H   +EQ +E T E  SS+GP KRQ++ DGGY KS P+ TASS +VE    
Sbjct: 301 GHFPILIKSH-GAQEQVEEITEEVTSSDGPNKRQKLIDGGYKKSSPVKTASSVKVEGFLG 360

Query: 361 -LQKFVNDAESSCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLESLVPDAKGKDPLLVI 420
               + +D ESS  +   ++ G          +KD+I  TL++LES++P AK K+PLLV+
Sbjct: 361 YDNGYDSDMESSYAVGQTQKEGLVSILGSKQFRKDKIHATLKILESIIPGAKNKEPLLVL 400

Query: 421 DEAIDYLKSLKHEASALGVS 430
           DEAI+YLKSLK +A  LGV+
Sbjct: 421 DEAINYLKSLKLKAKTLGVN 400

BLAST of CmaCh16G002700 vs. NCBI nr
Match: gi|590688176|ref|XP_007042873.1| (Sequence-specific DNA binding transcription factors,transcription regulators, putative [Theobroma cacao])

HSP 1 Score: 274.6 bits (701), Expect = 2.9e-70
Identity = 187/457 (40.92%), Positives = 255/457 (55.80%), Query Frame = 1

Query: 1   MVCQAATQTRFRALKHENGIAGKPTIIVRVIACFQPLQNCQAEYFRQLLKPHFADNSHSK 60
           MVCQAA+QTRFRALK+ENGIAGK TI+VRVIACFQP+++CQAEYFR LLKP      H  
Sbjct: 1   MVCQAASQTRFRALKYENGIAGKSTIVVRVIACFQPMEDCQAEYFRHLLKPI----EHCS 60

Query: 61  LEHNILRWLFCGMVGTDNSRLDFEHFAWQLHNRDSMNAPVEIKQQESLQGNINCGNCILP 120
                  W    MV T+NS    +H  WQL     M+  +E +Q E L   IN    +  
Sbjct: 61  YPGGCSSW----MVQTNNSWFFPQHSTWQLPKLSCMSTSLEPRQPERLPACINPSTHMFS 120

Query: 121 TCVGGMQGFAIP-------------------PLPSFKVEQLNVVQESRQCLPPHFQNSLG 180
                M G  +P                    + + K EQ     +  Q L P F  SL 
Sbjct: 121 VS-RSMPGSLVPGINPGIHAVPATMAMPRSADISTLKTEQKYHSDQLLQQLYPCFPTSLP 180

Query: 181 TPMPWQKGKESMHYAHAGPSGMPVSKSNNGSYPKGFLIFDQSGNQKRLMYAPMYPPVYIP 240
           +   + K ++ M     G SG   +   +G   KG +IFDQSG+Q RL+Y  + PP    
Sbjct: 181 SLGSYLKEQQLM--IAKGYSGRATANVVSGFLQKGLVIFDQSGSQTRLIYGSV-PPTSQY 240

Query: 241 STFAETKRCGWLE-EEGAAGEINSVKYFPNTLS---NENYVGDGESEMHENTEEIDALLY 300
           +T A T+    L+  EG A +++     P TL    +EN++   ESEM E+TEE++ALLY
Sbjct: 241 ATTAVTEPASCLDLHEGQAVKMSPFTPTPPTLQEEFDENHLSVEESEMREDTEELNALLY 300

Query: 301 SDY--DGTGCSSDDEVTSTGHSPEMIHDHCEKEEQCQETTTEAASSNGPRKRQRVHDGGY 360
           SD   D      DDEV ST HSP  I  + + E+Q  +   E ASS+GP KRQ++ +GG+
Sbjct: 301 SDEEDDDYHDGDDDEVMSTDHSPFPIKRNYQNEDQVGDVMEEVASSDGPNKRQKLLNGGH 360

Query: 361 VKSLPIATASSARVE-LQKFVNDAESSCGMVHEEEVGTDVDFCRSSCKKDRIKETLRVLE 420
            +S  + TA S ++E   ++  DAESS  + H +    D        KKD+I+ TL++LE
Sbjct: 361 KQSSMVDTACSVKLEGSHEYDGDAESSYAIGHNQREEIDSSLRSKQSKKDKIRFTLKILE 420

Query: 421 SLVPDAKGKDPLLVIDEAIDYLKSLKHEASALGVSCY 432
           S++P AKGK+PLLV+DE+I++LKSLK EA +LG+S Y
Sbjct: 421 SIIPGAKGKNPLLVLDESIEHLKSLKLEAKSLGLSHY 445
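
The percentages in each HSP summary line are plain ratios of the reported counts over the alignment length, so they can be re-derived when checking a record. A minimal sketch in Python, assuming the summary line has the layout shown in the alignments above; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool:

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# The optional "i" tolerates either spelling of the positives label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the record, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 187/457 (40.92%), Positives = 255/457 (55.80%), Query Frame = 1"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

For the three hits shown above, the recomputed values agree with the printed ones (for example 195/442 gives 44.12% identity for the Morus notabilis match).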

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
SAC51_ARATH   1.8e-27   35.10     Transcription factor SAC51 OS=Arabidopsis thaliana GN=SAC51 PE=2 SV=1
BH143_ARATH   3.0e-27   32.56     Transcription factor bHLH143 OS=Arabidopsis thaliana GN=BHLH143 PE=2 SV=1
BH145_ARATH   1.7e-17   31.95     Transcription factor bHLH145 OS=Arabidopsis thaliana GN=BHLH145 PE=2 SV=1
Match Name         E-value    Identity  Description
B7SHL0_CUCSA       2.0e-158   80.52     Putative transcription factor OS=Cucumis sativus GN=Csa_4G113170 PE=2 SV=1
W9R7U4_9ROSA       1.5e-73    44.12     Uncharacterized protein OS=Morus notabilis GN=L484_011398 PE=4 SV=1
A0A061E122_THECC   2.0e-70    40.92     Sequence-specific DNA binding transcription factors,transcription regulators, pu...
A0A0B0MJS2_GOSAR   1.1e-63    39.13     Uncharacterized protein OS=Gossypium arboreum GN=F383_26115 PE=4 SV=1
B9S9F0_RICCO       9.0e-63    40.92     Transcription factor, putative OS=Ricinus communis GN=RCOM_0884580 PE=4 SV=1
Match Name    E-value   Identity  Description
AT5G64340.1   1.0e-28   35.10     sequence-specific DNA binding transcription factors;transcription re...
AT5G09460.1   1.7e-28   32.56     sequence-specific DNA binding transcription factors;transcription re...
AT5G50010.1   9.3e-19   31.95     sequence-specific DNA binding transcription factors;transcription re...
AT5G50011.1   3.9e-17   80.39     conserved peptide upstream open reading frame 37
AT5G09461.1   1.1e-14   69.23     conserved peptide upstream open reading frame 43
Match Name                         E-value    Identity  Description
gi|525507252|ref|NP_001267664.1|   2.8e-158   80.52     transcription factor bHLH143-like [Cucumis sativus]
gi|659125851|ref|XP_008462888.1|   2.2e-155   78.61     PREDICTED: transcription factor bHLH143-like [Cucumis melo]
gi|703093100|ref|XP_010094825.1|   2.1e-73    44.12     hypothetical protein L484_011398 [Morus notabilis]
gi|743827484|ref|XP_011023035.1|   3.4e-71    41.59     PREDICTED: transcription factor bHLH143-like isoform X2 [Populus euphratica]
gi|590688176|ref|XP_007042873.1|   2.9e-70    40.92     Sequence-specific DNA binding transcription factors,transcription regulators, pu...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term   Definition
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh16G002700.1   CmaCh16G002700.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description    Source    Source Term     Source Description                   Alignment
None       No IPR available   PANTHER   PTHR36066       FAMILY NOT NAMED                     coord: 74..431; score: 2.0
None       No IPR available   PANTHER   PTHR36066:SF2   TRANSCRIPTION FACTOR SAC51-RELATED   coord: 74..431; score: 2.0

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                  Block
CmaCh16G002700   CmaCh11G016600   Cucurbita maxima (Rimu)   cmacmaB135
CmaCh16G002700   CmaCh19G004700   Cucurbita maxima (Rimu)   cmacmaB342
CmaCh16G002700   CmaCh04G003980   Cucurbita maxima (Rimu)   cmacmaB351
The following block(s) are covering this gene:
Gene             Organism                    Block
CmaCh16G002700   Cucurbita moschata (Rifu)   cmacmoB338
CmaCh16G002700   Cucurbita pepo (Zucchini)   cmacpeB351
CmaCh16G002700   Silver-seed gourd           carcmaB0526