CmoCh04G009370 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G009370
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: NAC domain-containing protein
Location: Cmo_Chr04 : 4731763 .. 4733618 (-)
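
As a rough illustration of how the location field can be read, the sketch below parses the coordinate string and derives the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chrom : start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognized location: " + text)
    chrom, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cmo_Chr04 : 4731763 .. 4733618 (-)")
print(loc["length"], loc["strand"])
```

The minus strand means the gene sequence shown on this page would be the reverse complement of the reference chromosome over that interval.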
The following sequences are available for this feature:

Gene sequence (with intron)

TTAAAACAAAGAAGGAAGAGAAATAGAGAGAAATAGAGAGAGAGAGAGAGTTTGATTCAATTATAGGTTTCATCACAAATTTTATGGCAATTGCAGCTGCAGCTAGCAGTTCATCCACGATGAGCCACGAAGAAATTAGCAGCTCCAACAACACCCACAACAAAAACGACGACGACAACAACAACTGCAGCGACGATCAACACGAACACGACGTCGTTATGCCAGGCTTCCGCTTCCACCCCACCGAAGAAGAACTAGTTGAGTTCTACCTTCGCCGTAAGGTCGAAGGCAAACGATTTAATGTCGAACTCATTACTTTCCTTGATCTATATCGCTATGACCCATGGGAGCTTCCTGGTATTCTTATATTCTTTTCATTTAACTCCAAACCCTATTCCTCAATTTCTCTTCAATTCATTCAATCTTTTGGATTCTTATTTATGTCTAGCCTTGGCTGCCATTGGCGAGAAGGAGTGGTTCTTCTACGTCCCTCGAGATCGCAAGTACCGAAATGGGGACCGCCCCAATCGAGTTACGACCTCCGGTTACTGGAAGGCCACCGGGGCCGATCGGATGATTCGAACCGAGGATTTTCGGTCCATCGGCCTTAAGAAAACCCTAGTTTTCTACTCTGGAAAGGCTCCAAAGGGTATCCGTACGAGTTGGATTATGAACGAGTATCGGTTGCCACATCATGAGACCGAGCGGTATCAAAAGGTTGGTTCTTATTTTGAGCTTAATCAAATAATTTTGTGTTTGATTAGGGTTTGATTAATTTGGATTAATCTAATCATTAATTTGCAGAGGGGAATTTATGAATTTCTTGTTTGCTTCGTACTGCCATATCACTGATCAGTCTCTGACAATACTGTTTTATGTTTGTTTAAATGTTTTATCACCAACTGCGTGATCCGCTTTAAATTGTCCCCCAAGTTTTCAGAGGGACAGCTTATGAACAATGTTGTTTCCTTGTTTTTGATGTCACACAGACCGAGATTTCGCTATGTCGTGTTTACAAAAGAGCTGGCGTTGAAGATCACCCGTCGCTCCCTCGTTCTCTCCCGTCGAGAGCGTCGTCGTCGTCTTCGAGACCGATAAGCTCGTCGACACCGAAGAATCTTCATCAAACTTCATCGTCATCTACAGACAAGTTTCAGAGATTCGAACCTCAATTCAACCACCATCTTCAAATTGGTGCCACCATTGAAACCACCGCAGCCGACGCCTCCGCTACGAGCTCTTGCGAAGAAGTAACCACCGTCCTCGGCCTCTCGAAACAAAACCCTTTTACCTCAACTCCATTGATCAACATGGCTGCCACTTCTTCTTACACTCCTTTCTTCTCTCCTTCTTCCAATAATTCCCTTGACGACCTACAAAAACTCATTCACTTTCAACAACAACCGCCGCCATCTACTCCCACCACCCTCATAAACTCTCTACCAACGCCGTATTACCAGCCGACGCTACCACCACCGCCGCAGCTACCGGTCGTGTTCCCCGATAGATTGTGGGAGTGGAATCCAATCCCAGATGGAGTCAATCCGTTCAAGTAAATTCTCGACGAGAAGTCTCAAATTTTAACCAGTTACACTATAATCAGGTCGGTAATATTGATACGCTTCTACTTGATTGAAGAATAACGCCTCCTAGCTCAAACTACTTCAATTTCCATTTTCTTTTTTGTTTATTATGCAGCTACATGCGATGAAGCTCATGAACATAAATATATATGACTAATTATAAGTTAGTTTACTATTGTTTCTTGTTGTCAATTATAAGTTAGAAGGGTTTTCATTAGATTTTTAAATAACTAGTTAGAACGTTTTATATTCGTAATAGATCGAAAGGGGATC

mRNA sequence

TTAAAACAAAGAAGGAAGAGAAATAGAGAGAAATAGAGAGAGAGAGAGAGTTTGATTCAATTATAGGTTTCATCACAAATTTTATGGCAATTGCAGCTGCAGCTAGCAGTTCATCCACGATGAGCCACGAAGAAATTAGCAGCTCCAACAACACCCACAACAAAAACGACGACGACAACAACAACTGCAGCGACGATCAACACGAACACGACGTCGTTATGCCAGGCTTCCGCTTCCACCCCACCGAAGAAGAACTAGTTGAGTTCTACCTTCGCCGTAAGGTCGAAGGCAAACGATTTAATGTCGAACTCATTACTTTCCTTGATCTATATCGCTATGACCCATGGGAGCTTCCTGCCTTGGCTGCCATTGGCGAGAAGGAGTGGTTCTTCTACGTCCCTCGAGATCGCAAGTACCGAAATGGGGACCGCCCCAATCGAGTTACGACCTCCGGTTACTGGAAGGCCACCGGGGCCGATCGGATGATTCGAACCGAGGATTTTCGGTCCATCGGCCTTAAGAAAACCCTAGTTTTCTACTCTGGAAAGGCTCCAAAGGGTATCCGTACGAGTTGGATTATGAACGAGTATCGGTTGCCACATCATGAGACCGAGCGGTATCAAAAGACCGAGATTTCGCTATGTCGTGTTTACAAAAGAGCTGGCGTTGAAGATCACCCGTCGCTCCCTCGTTCTCTCCCGTCGAGAGCGTCGTCGTCGTCTTCGAGACCGATAAGCTCGTCGACACCGAAGAATCTTCATCAAACTTCATCGTCATCTACAGACAAGTTTCAGAGATTCGAACCTCAATTCAACCACCATCTTCAAATTGGTGCCACCATTGAAACCACCGCAGCCGACGCCTCCGCTACGAGCTCTTGCGAAGAAGTAACCACCGTCCTCGGCCTCTCGAAACAAAACCCTTTTACCTCAACTCCATTGATCAACATGGCTGCCACTTCTTCTTACACTCCTTTCTTCTCTCCTTCTTCCAATAATTCCCTTGACGACCTACAAAAACTCATTCACTTTCAACAACAACCGCCGCCATCTACTCCCACCACCCTCATAAACTCTCTACCAACGCCGTATTACCAGCCGACGCTACCACCACCGCCGCAGCTACCGGTCGTGTTCCCCGATAGATTGTGGGAGTGGAATCCAATCCCAGATGGAGTCAATCCGTTCAAGTAAATTCTCGACGAGAAGTCTCAAATTTTAACCAGTTACACTATAATCAGGTCGGTAATATTGATACGCTTCTACTTGATTGAAGAATAACGCCTCCTAGCTCAAACTACTTCAATTTCCATTTTCTTTTTTGTTTATTATGCAGCTACATGCGATGAAGCTCATGAACATAAATATATATGACTAATTATAAGTTAGTTTACTATTGTTTCTTGTTGTCAATTATAAGTTAGAAGGGTTTTCATTAGATTTTTAAATAACTAGTTAGAACGTTTTATATTCGTAATAGATCGAAAGGGGATC

Coding sequence (CDS)

ATGGCAATTGCAGCTGCAGCTAGCAGTTCATCCACGATGAGCCACGAAGAAATTAGCAGCTCCAACAACACCCACAACAAAAACGACGACGACAACAACAACTGCAGCGACGATCAACACGAACACGACGTCGTTATGCCAGGCTTCCGCTTCCACCCCACCGAAGAAGAACTAGTTGAGTTCTACCTTCGCCGTAAGGTCGAAGGCAAACGATTTAATGTCGAACTCATTACTTTCCTTGATCTATATCGCTATGACCCATGGGAGCTTCCTGCCTTGGCTGCCATTGGCGAGAAGGAGTGGTTCTTCTACGTCCCTCGAGATCGCAAGTACCGAAATGGGGACCGCCCCAATCGAGTTACGACCTCCGGTTACTGGAAGGCCACCGGGGCCGATCGGATGATTCGAACCGAGGATTTTCGGTCCATCGGCCTTAAGAAAACCCTAGTTTTCTACTCTGGAAAGGCTCCAAAGGGTATCCGTACGAGTTGGATTATGAACGAGTATCGGTTGCCACATCATGAGACCGAGCGGTATCAAAAGACCGAGATTTCGCTATGTCGTGTTTACAAAAGAGCTGGCGTTGAAGATCACCCGTCGCTCCCTCGTTCTCTCCCGTCGAGAGCGTCGTCGTCGTCTTCGAGACCGATAAGCTCGTCGACACCGAAGAATCTTCATCAAACTTCATCGTCATCTACAGACAAGTTTCAGAGATTCGAACCTCAATTCAACCACCATCTTCAAATTGGTGCCACCATTGAAACCACCGCAGCCGACGCCTCCGCTACGAGCTCTTGCGAAGAAGTAACCACCGTCCTCGGCCTCTCGAAACAAAACCCTTTTACCTCAACTCCATTGATCAACATGGCTGCCACTTCTTCTTACACTCCTTTCTTCTCTCCTTCTTCCAATAATTCCCTTGACGACCTACAAAAACTCATTCACTTTCAACAACAACCGCCGCCATCTACTCCCACCACCCTCATAAACTCTCTACCAACGCCGTATTACCAGCCGACGCTACCACCACCGCCGCAGCTACCGGTCGTGTTCCCCGATAGATTGTGGGAGTGGAATCCAATCCCAGATGGAGTCAATCCGTTCAAGTAA
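
The CDS above can be checked against the BLAST query residues by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is an illustrative helper, and only the first 51 bases and the last 15 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCAATTGCAGCTGCAGCTAGCAGTTCATCCACGATGAGCCACGAAGAA"
cds_end = "AATCCGTTCAAGTAA"
print(translate(cds_start))  # MAIAAAASSSSTMSHEE
print(translate(cds_end))    # NPFK
```

The first result agrees with the start of the Query lines in the alignments below (MAIAAAASSSSTMSHE...), and the CDS ends in the stop codon TAA.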
BLAST of CmoCh04G009370 vs. Swiss-Prot
Match: NAC35_ARATH (NAC domain-containing protein 35 OS=Arabidopsis thaliana GN=NAC035 PE=1 SV=2)

HSP 1 Score: 330.1 bits (845), Expect = 3.1e-89
Identity = 188/330 (56.97%), Positives = 227/330 (68.79%), Query Frame = 1
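
The percentages in an HSP summary line are simply the match counts divided by the alignment length. A small sketch of that arithmetic follows; `hsp_percentages` is a hypothetical helper, and its regular expression accepts any word label followed by `matches/length`, so it does not depend on how the label is spelled.

```python
import re

def hsp_percentages(line):
    """Recompute percentages from 'Label = matches/length' fields of an HSP summary line."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 188/330 (56.97%), Positives = 227/330 (68.79%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit, 188 identical residues over 330 aligned columns gives 56.97%, and 227 conservative-or-identical columns gives 68.79%, matching the values reported in the line.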

Query: 1   MAIAAAASSSSTMSHE----EISSSNNTHNKNDDDN--NNCSDDQHEHDVVMPGFRFHPT 60
           MAI ++ +S   MS++    E    +N H    + +  N    D H+HD+VMPGFRFHPT
Sbjct: 1   MAIVSSTTSIIPMSNQVNNNEKGIEDNDHRGGQESHVQNEDEADDHDHDMVMPGFRFHPT 60

Query: 61  EEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNG 120
           EEEL+EFYLRRKVEGKRFNVELITFLDLYRYDPWELPA+AAIGEKEW+FYVPRDRKYRNG
Sbjct: 61  EEELIEFYLRRKVEGKRFNVELITFLDLYRYDPWELPAMAAIGEKEWYFYVPRDRKYRNG 120

Query: 121 DRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHH 180
           DRPNRVTTSGYWKATGADRMIR+E  R IGLKKTLVFYSGKAPKG RTSWIMNEYRLPHH
Sbjct: 121 DRPNRVTTSGYWKATGADRMIRSETSRPIGLKKTLVFYSGKAPKGTRTSWIMNEYRLPHH 180

Query: 181 ETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTD 240
           ETE+YQK EISLCRVYKR GVEDHPS+PRSL +R  + +S   S    +     SSSS  
Sbjct: 181 ETEKYQKAEISLCRVYKRPGVEDHPSVPRSLSTRHHNHNSSTSSRLALRQQQHHSSSSNH 240

Query: 241 KFQRFEPQFN-HHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPFTSTPLINMAATS 300
                    N ++L+  +T  +     + T++       + L+ QN +   P      TS
Sbjct: 241 SDNNLNNNNNINNLEKLSTEYSGDGSTTTTTTNSNSDVTIALANQNIYRPMPY----DTS 300

Query: 301 SYTPFFSPSSNNS------LDDLQKLIHFQ 318
           + T   S  ++        +DDLQ+L+++Q
Sbjct: 301 NNTLIVSTRNHQDDDETAIVDDLQRLVNYQ 326

BLAST of CmoCh04G009370 vs. Swiss-Prot
Match: NAC94_ARATH (Putative NAC domain-containing protein 94 OS=Arabidopsis thaliana GN=ANAC094 PE=3 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 7.7e-48
Identity = 100/202 (49.50%), Positives = 132/202 (65.35%), Query Frame = 1

Query: 28  NDDDNNNCSDDQHEHDVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDP 87
           +D+++NN    +   DVV+PGFRFHPT+EELV FYL+RKV  K    +LI  +D+Y+YDP
Sbjct: 6   DDEESNNV---ERYDDVVLPGFRFHPTDEELVSFYLKRKVLHKSLPFDLIKKVDIYKYDP 65

Query: 88  WELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTED-FRSIGLK 147
           W+LP LAA+GEKEW+FY PRDRKYRN  RPNRVT  G+WKATG DR I + D  R IGLK
Sbjct: 66  WDLPKLAAMGEKEWYFYCPRDRKYRNSTRPNRVTGGGFWKATGTDRPIYSLDSTRCIGLK 125

Query: 148 KTLVFYSGKAPKGIRTSWIMNEYRLP-----HHET------------------ERYQKTE 205
           K+LVFY G+A KG++T W+M+E+RLP     HH +                  E      
Sbjct: 126 KSLVFYRGRAAKGVKTDWMMHEFRLPSLSDSHHSSYPNYNNKKQHLNNNNNSKELPSNDA 185

BLAST of CmoCh04G009370 vs. Swiss-Prot
Match: FEZ_ARATH (Protein FEZ OS=Arabidopsis thaliana GN=FEZ PE=2 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 6.5e-47
Identity = 121/307 (39.41%), Positives = 172/307 (56.03%), Query Frame = 1

Query: 31  DNNNCSDDQHEHDVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWEL 90
           D NN  D + E DV++PGFRFHPT+EELV FYL+RKV+    ++ELI  LD+Y+YDPW+L
Sbjct: 3   DRNNDGDQKME-DVLLPGFRFHPTDEELVSFYLKRKVQHNPLSIELIRQLDIYKYDPWDL 62

Query: 91  PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMI-RTEDFRSIGLKKTL 150
           P  A  GEKEW+FY PRDRKYRN  RPNRVT +G+WKATG DR I  +E  + IGLKK+L
Sbjct: 63  PKFAMTGEKEWYFYCPRDRKYRNSSRPNRVTGAGFWKATGTDRPIYSSEGNKCIGLKKSL 122

Query: 151 VFYSGKAPKGIRTSWIMNEYRL-----PHHETERYQKTEIS------LCRVYKRAGVEDH 210
           VFY G+A KG++T W+M+E+RL     P   ++R+  + +S      +CR++K+      
Sbjct: 123 VFYKGRAAKGVKTDWMMHEFRLPSLSEPSPPSKRFFDSPVSPNDSWAICRIFKKTNTTTL 182

Query: 211 PSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQRFEPQFNHHLQIGATIETTAA 270
            +L  S  S     +S    S+  ++   T   S+DK  +    F  H +   T +T   
Sbjct: 183 RALSHSFVSSLPPETSTDTMSNQKQS--NTYHFSSDKILKPSSHFQFHHENMNTPKT--- 242

Query: 271 DASATSSCEEVTTVLGLSKQNPFTSTPLINMAATSSYTPFFSPSSNNSLDDLQKLIHFQQ 326
              + S+   V T+      +PF+    ++  +    T  F+P S      L  L    Q
Sbjct: 243 ---SNSTTPSVPTI------SPFS---YLDFTSYDKPTNVFNPVSCLDQQYLTNLFLATQ 291

BLAST of CmoCh04G009370 vs. Swiss-Prot
Match: NAC72_ARATH (NAC domain-containing protein 72 OS=Arabidopsis thaliana GN=NAC072 PE=2 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 9.4e-46
Identity = 91/177 (51.41%), Positives = 122/177 (68.93%), Query Frame = 1

Query: 47  PGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVP 106
           PGFRF+PT+EEL+  YL RKV G  F++++I  +DLY++DPW+LP+ A  GEKEW+F+ P
Sbjct: 16  PGFRFYPTDEELLVQYLCRKVAGYHFSLQVIGDIDLYKFDPWDLPSKALFGEKEWYFFSP 75

Query: 107 RDRKYRNGDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIM 166
           RDRKY NG RPNRV  SGYWKATG D++I T D R +G+KK LVFY+GKAPKG +T+WIM
Sbjct: 76  RDRKYPNGSRPNRVAGSGYWKATGTDKII-TADGRRVGIKKALVFYAGKAPKGTKTNWIM 135

Query: 167 NEYRLPHHETER--YQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSST 222
           +EYRL  H       +  +  LCR+YK+       ++      R   S++   SSS+
Sbjct: 136 HEYRLIEHSRSHGSSKLDDWVLCRIYKKTSGSQRQAVTPVQACREEHSTNGSSSSSS 191

BLAST of CmoCh04G009370 vs. Swiss-Prot
Match: NAC55_ARATH (NAC domain-containing protein 55 OS=Arabidopsis thaliana GN=NAC055 PE=2 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 1.8e-44
Identity = 93/183 (50.82%), Positives = 121/183 (66.12%), Query Frame = 1

Query: 47  PGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVP 106
           PGFRF+PT+EEL+  YL RK  G  F+++LI  +DLY++DPW LP+ A  GEKEW+F+ P
Sbjct: 16  PGFRFYPTDEELMVEYLCRKAAGHDFSLQLIAEIDLYKFDPWVLPSKALFGEKEWYFFSP 75

Query: 107 RDRKYRNGDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIM 166
           RDRKY NG RPNRV  SGYWKATG D++I TE  R +G+KK LVFY GKAPKG +T+WIM
Sbjct: 76  RDRKYPNGSRPNRVAGSGYWKATGTDKVISTEG-RRVGIKKALVFYIGKAPKGTKTNWIM 135

Query: 167 NEYRL--PHHETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKN 226
           +EYRL  P       +  +  LCR+YK+       +    + S    S++    SST  +
Sbjct: 136 HEYRLIEPSRRNGSTKLDDWVLCRIYKKQTSAQKQAYNNLMTSGREYSNN---GSSTSSS 194

Query: 227 LHQ 228
            HQ
Sbjct: 196 SHQ 194

BLAST of CmoCh04G009370 vs. TrEMBL
Match: A0A0A0L2S0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G113370 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 5.8e-135
Identity = 288/432 (66.67%), Positives = 310/432 (71.76%), Query Frame = 1

Query: 1   MAIAAAASSSSTMSHEEISSSNNTHNKND--DDNNNCSDDQHEHDVVMPGFRFHPTEEEL 60
           MAIAAAA SSS    +E  SSN T+N N+  +D+++   DQHEHDVVMPGFRFHPTEEEL
Sbjct: 1   MAIAAAARSSSARMRQEEISSNKTNNNNNNCEDHDDIDHDQHEHDVVMPGFRFHPTEEEL 60

Query: 61  VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN 120
           VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN
Sbjct: 61  VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN 120

Query: 121 RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER 180
           RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER
Sbjct: 121 RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER 180

Query: 181 YQKTEISLCRVYKRAGVEDHPSLPRSLPSRAS-----SSSSRPISSSTPKNLHQTSSSST 240
           YQK EISLCRVYKRAGVEDHPSLPRSLPSRAS     SS +  +      N+ QTSSSST
Sbjct: 181 YQKAEISLCRVYKRAGVEDHPSLPRSLPSRASSSRMTSSKNNLLPGGGSVNVVQTSSSST 240

Query: 241 DKF-QRFEPQFN-HHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPFTSTPLINMAA 300
           DKF   FE QF+ H LQIG+ +E TAADASATSSCEEVTTVLGLSKQNPF ++PLINMAA
Sbjct: 241 DKFPTSFESQFHPHQLQIGSGVEATAADASATSSCEEVTTVLGLSKQNPFPTSPLINMAA 300

Query: 301 TSS-YTPFFSPSSNNSL--DDLQKLI-HFQQQPPPSTPTTLINSLPTPYY---------- 360
           TSS   P  + ++ N +  DD Q +I H QQQ          +SL  P Y          
Sbjct: 301 TSSLQIPASASTTPNCMEEDDHQSIILHKQQQQQQQQLLPSSSSLILPTYTSFFSPSSNN 360

Query: 361 ----------------------------------QPTLPPPPQ------LPVVFPDRLWE 370
                                             QPT PPPPQ      LPVVF DRLW+
Sbjct: 361 SLDDLQKLIHYQQQQPPLSASPTTIINSLPSQYYQPTPPPPPQQLALNTLPVVFSDRLWD 420

BLAST of CmoCh04G009370 vs. TrEMBL
Match: W9R289_9ROSA (Putative NAC domain-containing protein 94 OS=Morus notabilis GN=L484_016488 PE=4 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 1.5e-106
Identity = 241/425 (56.71%), Positives = 275/425 (64.71%), Query Frame = 1

Query: 3   IAAAASSSSTMSHEEISSSNNTHNKNDDDNNNCSDDQHEHDVVMPGFRFHPTEEELVEFY 62
           +A AASS+ST++   +S       +N+ +NNN + D HEHD+VMPGFRFHPTEEELVEFY
Sbjct: 1   MAIAASSNSTITTTTMSQ------ENESNNNNKTTDDHEHDMVMPGFRFHPTEEELVEFY 60

Query: 63  LRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTT 122
           LRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTT
Sbjct: 61  LRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTT 120

Query: 123 SGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQKT 182
           SGYWKATGADRMIR+E+FRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLP HETERYQK 
Sbjct: 121 SGYWKATGADRMIRSENFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPQHETERYQKA 180

Query: 183 EISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQRF--- 242
           EISLCRVYKRAGVEDHPSLPRSLPSRASS      S +   N  Q  +++ +K Q F   
Sbjct: 181 EISLCRVYKRAGVEDHPSLPRSLPSRASS------SRAAADNKKQYPNNAMEKLQTFGVH 240

Query: 243 -----EPQFNHHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPF---TSTPLINMAA 302
                 P   H  ++  T ET   D S++S      T  GLSK+  +    ++P+   AA
Sbjct: 241 QVLPPPPPPPHQFEVENTNET---DGSSSSDVAAAGT-KGLSKRKAYRGSIASPIAQPAA 300

Query: 303 TSSY------------------TPFFSPSSN---NSLDDLQKLIHFQQQPPPSTPTTLIN 362
            ++                   T F S SS+   NS+DDL +LIH  QQ P S+ T+  N
Sbjct: 301 ANTSMEEEVAMLSAQQPKQFCPTLFSSGSSSTPPNSIDDLHRLIHNYQQAPSSSTTSSSN 360

Query: 363 S------------------LPTPYYQPTLPPPPQ-----------LPVVFPDRLWEWNPI 367
           +                   P P   P   PP Q           LP  F DRLWEWNP 
Sbjct: 361 ANIIVSHHHHSQQHYFNQFHPMPLPLPVPVPPQQQQVALNPLSNFLPTAFSDRLWEWNPF 409

BLAST of CmoCh04G009370 vs. TrEMBL
Match: A0A068TNX5_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00015084001 PE=4 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 3.3e-106
Identity = 221/364 (60.71%), Positives = 253/364 (69.51%), Query Frame = 1

Query: 31  DNNNCSDDQHEHDVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWEL 90
           +NNN  + +H+HD+VMPGFRFHPTEEEL+EFYLRRKVEGKRFNVELITFLDLYRYDPWEL
Sbjct: 10  ENNNKDEHEHDHDMVMPGFRFHPTEEELIEFYLRRKVEGKRFNVELITFLDLYRYDPWEL 69

Query: 91  PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLV 150
           PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTE+FRSIGLKKTLV
Sbjct: 70  PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTENFRSIGLKKTLV 129

Query: 151 FYSGKAPKGIRTSWIMNEYRLPHHETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRAS 210
           FYSGKAPKGIRTSWIMNEYRLP HETER QK EISLCRVYKRAGVEDHPSLPRSLP+RAS
Sbjct: 130 FYSGKAPKGIRTSWIMNEYRLPQHETERLQKAEISLCRVYKRAGVEDHPSLPRSLPTRAS 189

Query: 211 SSSSRPISSSTPKNLHQTSSSSTDKFQRF--EPQFNHHLQIGATIETTAADASATSSCEE 270
           SS     SSS  K+   T+ +S ++FQ F   PQ     Q+   +  T+      SSC +
Sbjct: 190 SSRG-TTSSSAKKSQEATNHASMERFQAFVGNPQ-----QLDEKLSETSG-----SSCTD 249

Query: 271 VTTVLGLSKQNPFTS----TPLINMAATSSYTP---------FFSPSSNNSLDDLQKLIH 330
           + T LGLSK N F S    T  ++   +++  P          F P+ N +LDDL +L++
Sbjct: 250 IGTSLGLSKHNTFMSLAPMTTTLSQLCSTTLAPDCTTIFAGSSFVPTVNTTLDDLHRLVN 309

Query: 331 FQQQPPPSTPTTLINSLPTPYYQPTLPP-----------PPQLPVVFPDRLWEWNPIPDG 369
           FQQ           N+   P    +L P           P  L   F DRLW+WN I + 
Sbjct: 310 FQQASMSQHQQQYHNNPNHPSQFSSLQPQVQQSLALNMLPGPLQAAFTDRLWDWNSINEA 362

BLAST of CmoCh04G009370 vs. TrEMBL
Match: A0A061GB68_THECC (NAC domain containing protein 35 OS=Theobroma cacao GN=TCM_027945 PE=4 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 5.3e-104
Identity = 240/418 (57.42%), Positives = 273/418 (65.31%), Query Frame = 1

Query: 1   MAIAAAASSSSTMSHEEISSSNNTHNKNDDDNNNCSD--DQHEHDVVMPGFRFHPTEEEL 60
           MAIAAAA+         +S+  N +N NDD NN+ S   D HEHD+VMPGFRFHPTEEEL
Sbjct: 1   MAIAAAAT---------MSNDPNDNNNNDDHNNSSSSSKDDHEHDMVMPGFRFHPTEEEL 60

Query: 61  VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN 120
           VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN
Sbjct: 61  VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN 120

Query: 121 RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER 180
           RVTTSGYWKATGADRMIR E+ RSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLP HETER
Sbjct: 121 RVTTSGYWKATGADRMIRAENSRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPQHETER 180

Query: 181 YQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQR 240
           YQK EISLCRVYKRAGVEDHPSLPR LP+R S+S       S  K  H  +  + ++FQ 
Sbjct: 181 YQKAEISLCRVYKRAGVEDHPSLPRCLPTRPSASLRG--QQSGKKYPHDAAQQAMERFQG 240

Query: 241 FEPQFNHHLQIGATIETTAADASATSSCEEVTTVLGLSKQN------PFTST-------- 300
           F  Q +  ++I    ET      ++SS  +VTT LGLSKQN      P ++T        
Sbjct: 241 FGGQ-SQQMEIEKISETD----GSSSSTSDVTTALGLSKQNVYRPMAPISTTLGLPSGIE 300

Query: 301 ---PLINMA--ATSSYTP-------FFSPSSNNSLDDLQKLIHFQQQPPPSTPTTLINSL 360
                +N +    SS  P         S  S N +DDL +L+ +Q     +T     +  
Sbjct: 301 EEGMFLNQSKQGCSSLVPNCTTVFTVGSSVSPNVVDDLHRLVSYQH----ATMNQQQHYY 360

Query: 361 PTPYYQ-------PTLPP----------PPQLPVVFPDRLWEWNPIPDG----VNPFK 370
              ++Q        TLPP          P  LP+ F DRLWEWNPIP+      NPFK
Sbjct: 361 SDHHHQQQQQSEFSTLPPQSQQLSLNMLPSSLPMAFSDRLWEWNPIPEANREYNNPFK 398

BLAST of CmoCh04G009370 vs. TrEMBL
Match: A0A0D2RJ65_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_011G090000 PE=4 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 2.2e-102
Identity = 234/424 (55.19%), Positives = 270/424 (63.68%), Query Frame = 1

Query: 1   MAIAAAASSSSTMSHEEISSSNNTHNKNDDDNNNCSDDQHEHDVVMPGFRFHPTEEELVE 60
           MAI  AA  S+  +     +  +  N N+  +++ S D+HEHD+VMPGFRFHPTEEELVE
Sbjct: 1   MAIPPAAIMSNDPNDNTNINVVDDRNNNNMSSSSNSKDEHEHDMVMPGFRFHPTEEELVE 60

Query: 61  FYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRV 120
           FYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRV
Sbjct: 61  FYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRV 120

Query: 121 TTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQ 180
           TTSGYWKATGADRMIR E+ RSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQ
Sbjct: 121 TTSGYWKATGADRMIRGENSRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQ 180

Query: 181 KTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQRFE 240
           K EISLCRVYKR GVEDHPSLPR LP+R S+ SSR    S  K   + +  + ++FQ F 
Sbjct: 181 KAEISLCRVYKRPGVEDHPSLPRCLPTRRSAESSRG-QQSEKKYPQEAAQQAMERFQAFG 240

Query: 241 PQFNHHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPFTSTPLINMA---------- 300
                 ++I    ET   D S+++S  +VTT LGLSKQN +   P I+            
Sbjct: 241 GGQPQQMEIEKLTET---DGSSSTSTSDVTTALGLSKQNLYRPMPPISTTLGLPSGMEGE 300

Query: 301 -----------------ATSSYTPFFSPSSNNSLDDLQKLIHFQQQPPPSTPTTLINSLP 360
                            +T+ +    S   +N +DDL +L+ +QQ           N  P
Sbjct: 301 GMFLNQSKQGCCSLLPNSTTLFPVGSSSVPSNVVDDLHRLVSYQQVALNQQQYYNTNH-P 360

Query: 361 TPYYQ--------PTLPP-----PPQL-----------PVVFPDRLWEWNPIPDG----V 370
            P+ Q         TLPP     P QL           P  F DRLWEWNPIP+      
Sbjct: 361 HPHQQQHQPQSEFSTLPPQSQAQPQQLSLNVLPSAIPSPTAFSDRLWEWNPIPEPNREYN 419

BLAST of CmoCh04G009370 vs. TAIR10
Match: AT2G02450.2 (AT2G02450.2 NAC domain containing protein 35)

HSP 1 Score: 330.1 bits (845), Expect = 1.7e-90
Identity = 188/330 (56.97%), Positives = 227/330 (68.79%), Query Frame = 1

Query: 1   MAIAAAASSSSTMSHE----EISSSNNTHNKNDDDN--NNCSDDQHEHDVVMPGFRFHPT 60
           MAI ++ +S   MS++    E    +N H    + +  N    D H+HD+VMPGFRFHPT
Sbjct: 1   MAIVSSTTSIIPMSNQVNNNEKGIEDNDHRGGQESHVQNEDEADDHDHDMVMPGFRFHPT 60

Query: 61  EEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNG 120
           EEEL+EFYLRRKVEGKRFNVELITFLDLYRYDPWELPA+AAIGEKEW+FYVPRDRKYRNG
Sbjct: 61  EEELIEFYLRRKVEGKRFNVELITFLDLYRYDPWELPAMAAIGEKEWYFYVPRDRKYRNG 120

Query: 121 DRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHH 180
           DRPNRVTTSGYWKATGADRMIR+E  R IGLKKTLVFYSGKAPKG RTSWIMNEYRLPHH
Sbjct: 121 DRPNRVTTSGYWKATGADRMIRSETSRPIGLKKTLVFYSGKAPKGTRTSWIMNEYRLPHH 180

Query: 181 ETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTD 240
           ETE+YQK EISLCRVYKR GVEDHPS+PRSL +R  + +S   S    +     SSSS  
Sbjct: 181 ETEKYQKAEISLCRVYKRPGVEDHPSVPRSLSTRHHNHNSSTSSRLALRQQQHHSSSSNH 240

Query: 241 KFQRFEPQFN-HHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPFTSTPLINMAATS 300
                    N ++L+  +T  +     + T++       + L+ QN +   P      TS
Sbjct: 241 SDNNLNNNNNINNLEKLSTEYSGDGSTTTTTTNSNSDVTIALANQNIYRPMPY----DTS 300

Query: 301 SYTPFFSPSSNNS------LDDLQKLIHFQ 318
           + T   S  ++        +DDLQ+L+++Q
Sbjct: 301 NNTLIVSTRNHQDDDETAIVDDLQRLVNYQ 326

BLAST of CmoCh04G009370 vs. TAIR10
Match: AT2G17040.1 (AT2G17040.1 NAC domain containing protein 36)

HSP 1 Score: 196.8 bits (499), Expect = 2.3e-50
Identity = 87/153 (56.86%), Positives = 120/153 (78.43%), Query Frame = 1

Query: 43  DVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWF 102
           D+ +PGFRFHPTEEEL++FYL+  V GKR +VE+I FL++YR+DPW+LP L+ IGE+EW+
Sbjct: 4   DIELPGFRFHPTEEELLDFYLKNMVYGKRSSVEVIGFLNIYRHDPWDLPGLSRIGEREWY 63

Query: 103 FYVPRDRKYRNGDRPNRVTTSGYWKATGADRMI--RTEDFRSIGLKKTLVFYSGKAPKGI 162
           F+VPR+RK+ NG RP+R T  GYWKATG+DR I   +E  R IGLKKTLVFY G+AP G 
Sbjct: 64  FFVPRERKHGNGGRPSRTTEKGYWKATGSDRKIISLSEPKRVIGLKKTLVFYRGRAPGGS 123

Query: 163 RTSWIMNEYRLPHHETERYQKTEISLCRVYKRA 194
           +T W+MNE+R+P + +      ++ LC++Y++A
Sbjct: 124 KTDWVMNEFRMPDNCS---LPKDVVLCKIYRKA 153

BLAST of CmoCh04G009370 vs. TAIR10
Match: AT5G39820.1 (AT5G39820.1 NAC domain containing protein 94)

HSP 1 Score: 192.6 bits (488), Expect = 4.4e-49
Identity = 100/202 (49.50%), Positives = 132/202 (65.35%), Query Frame = 1

Query: 28  NDDDNNNCSDDQHEHDVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDP 87
           +D+++NN    +   DVV+PGFRFHPT+EELV FYL+RKV  K    +LI  +D+Y+YDP
Sbjct: 6   DDEESNNV---ERYDDVVLPGFRFHPTDEELVSFYLKRKVLHKSLPFDLIKKVDIYKYDP 65

Query: 88  WELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTED-FRSIGLK 147
           W+LP LAA+GEKEW+FY PRDRKYRN  RPNRVT  G+WKATG DR I + D  R IGLK
Sbjct: 66  WDLPKLAAMGEKEWYFYCPRDRKYRNSTRPNRVTGGGFWKATGTDRPIYSLDSTRCIGLK 125

Query: 148 KTLVFYSGKAPKGIRTSWIMNEYRLP-----HHET------------------ERYQKTE 205
           K+LVFY G+A KG++T W+M+E+RLP     HH +                  E      
Sbjct: 126 KSLVFYRGRAAKGVKTDWMMHEFRLPSLSDSHHSSYPNYNNKKQHLNNNNNSKELPSNDA 185

BLAST of CmoCh04G009370 vs. TAIR10
Match: AT1G26870.1 (AT1G26870.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein)

HSP 1 Score: 189.5 bits (480), Expect = 3.7e-48
Identity = 121/307 (39.41%), Positives = 172/307 (56.03%), Query Frame = 1

Query: 31  DNNNCSDDQHEHDVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWEL 90
           D NN  D + E DV++PGFRFHPT+EELV FYL+RKV+    ++ELI  LD+Y+YDPW+L
Sbjct: 10  DRNNDGDQKME-DVLLPGFRFHPTDEELVSFYLKRKVQHNPLSIELIRQLDIYKYDPWDL 69

Query: 91  PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMI-RTEDFRSIGLKKTL 150
           P  A  GEKEW+FY PRDRKYRN  RPNRVT +G+WKATG DR I  +E  + IGLKK+L
Sbjct: 70  PKFAMTGEKEWYFYCPRDRKYRNSSRPNRVTGAGFWKATGTDRPIYSSEGNKCIGLKKSL 129

Query: 151 VFYSGKAPKGIRTSWIMNEYRL-----PHHETERYQKTEIS------LCRVYKRAGVEDH 210
           VFY G+A KG++T W+M+E+RL     P   ++R+  + +S      +CR++K+      
Sbjct: 130 VFYKGRAAKGVKTDWMMHEFRLPSLSEPSPPSKRFFDSPVSPNDSWAICRIFKKTNTTTL 189

Query: 211 PSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQRFEPQFNHHLQIGATIETTAA 270
            +L  S  S     +S    S+  ++   T   S+DK  +    F  H +   T +T   
Sbjct: 190 RALSHSFVSSLPPETSTDTMSNQKQS--NTYHFSSDKILKPSSHFQFHHENMNTPKT--- 249

Query: 271 DASATSSCEEVTTVLGLSKQNPFTSTPLINMAATSSYTPFFSPSSNNSLDDLQKLIHFQQ 326
              + S+   V T+      +PF+    ++  +    T  F+P S      L  L    Q
Sbjct: 250 ---SNSTTPSVPTI------SPFS---YLDFTSYDKPTNVFNPVSCLDQQYLTNLFLATQ 298

BLAST of CmoCh04G009370 vs. TAIR10
Match: AT3G15500.1 (AT3G15500.1 NAC domain containing protein 3)

HSP 1 Score: 181.4 bits (459), Expect = 1.0e-45
Identity = 93/183 (50.82%), Positives = 121/183 (66.12%), Query Frame = 1

Query: 47  PGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVP 106
           PGFRF+PT+EEL+  YL RK  G  F+++LI  +DLY++DPW LP+ A  GEKEW+F+ P
Sbjct: 16  PGFRFYPTDEELMVEYLCRKAAGHDFSLQLIAEIDLYKFDPWVLPSKALFGEKEWYFFSP 75

Query: 107 RDRKYRNGDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIM 166
           RDRKY NG RPNRV  SGYWKATG D++I TE  R +G+KK LVFY GKAPKG +T+WIM
Sbjct: 76  RDRKYPNGSRPNRVAGSGYWKATGTDKVISTEG-RRVGIKKALVFYIGKAPKGTKTNWIM 135

Query: 167 NEYRL--PHHETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKN 226
           +EYRL  P       +  +  LCR+YK+       +    + S    S++    SST  +
Sbjct: 136 HEYRLIEPSRRNGSTKLDDWVLCRIYKKQTSAQKQAYNNLMTSGREYSNN---GSSTSSS 194

Query: 227 LHQ 228
            HQ
Sbjct: 196 SHQ 194

BLAST of CmoCh04G009370 vs. NCBI nr
Match: gi|659102563|ref|XP_008452197.1| (PREDICTED: NAC domain-containing protein 55 [Cucumis melo])

HSP 1 Score: 514.2 bits (1323), Expect = 1.9e-142
Identity = 304/438 (69.41%), Positives = 326/438 (74.43%), Query Frame = 1

Query: 1   MAIAAAASSSST-MSHEEISSSNNTHNKNDDDNNNCSD------DQHEHDVVMPGFRFHP 60
           MAIAAAA SSST M  EEISS+   H   DD+NNNC D      DQHEHDVVMPGFRFHP
Sbjct: 1   MAIAAAARSSSTRMRQEEISSNKTNH---DDNNNNCEDHDDIDHDQHEHDVVMPGFRFHP 60

Query: 61  TEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRN 120
           TEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRN
Sbjct: 61  TEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRN 120

Query: 121 GDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPH 180
           GDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPH
Sbjct: 121 GDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPH 180

Query: 181 HETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPK---------- 240
           HETERYQK EISLCRVYKRAGVE+HPSLPRSLPSRASSS  R  +SSTPK          
Sbjct: 181 HETERYQKAEISLCRVYKRAGVENHPSLPRSLPSRASSS--RMTTSSTPKSNLLPGGGSV 240

Query: 241 NLHQTSSSSTDKFQR-FEPQFNHH-LQIGATIETTAADASATSSCEEVTTVLGLSKQNPF 300
           N+ QTSSSSTDKF   FE QF+HH LQIG+ +E TAADASATSSCEEVTTVLGLSKQNPF
Sbjct: 241 NVVQTSSSSTDKFPTSFESQFHHHQLQIGSGVEATAADASATSSCEEVTTVLGLSKQNPF 300

Query: 301 TSTPLINMAATSS-YTPFFSPSSNNSL--DDLQKLI--HFQQQPPPSTPTTLI------- 360
            ++PL+NMAATSS   P  + ++ N +  DD Q +I  + QQQ  PS+ + ++       
Sbjct: 301 PTSPLLNMAATSSLQIPASASTTPNCMEEDDHQSIILHNKQQQLLPSSSSLILPTYTSFF 360

Query: 361 -----NS---------------------------LPTPYYQPTLPPPPQ------LPVVF 370
                NS                           LP+ YYQPT PPPPQ      LPVVF
Sbjct: 361 SPSSNNSLDDLQKLIHYQQQQPPLSASPATIINSLPSQYYQPTPPPPPQQLALNTLPVVF 420

BLAST of CmoCh04G009370 vs. NCBI nr
Match: gi|778676468|ref|XP_011650588.1| (PREDICTED: NAC domain-containing protein 55 [Cucumis sativus])

HSP 1 Score: 488.8 bits (1257), Expect = 8.3e-135
Identity = 288/432 (66.67%), Positives = 310/432 (71.76%), Query Frame = 1

Query: 1   MAIAAAASSSSTMSHEEISSSNNTHNKND--DDNNNCSDDQHEHDVVMPGFRFHPTEEEL 60
           MAIAAAA SSS    +E  SSN T+N N+  +D+++   DQHEHDVVMPGFRFHPTEEEL
Sbjct: 1   MAIAAAARSSSARMRQEEISSNKTNNNNNNCEDHDDIDHDQHEHDVVMPGFRFHPTEEEL 60

Query: 61  VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN 120
           VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN
Sbjct: 61  VEFYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPN 120

Query: 121 RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER 180
           RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER
Sbjct: 121 RVTTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETER 180

Query: 181 YQKTEISLCRVYKRAGVEDHPSLPRSLPSRAS-----SSSSRPISSSTPKNLHQTSSSST 240
           YQK EISLCRVYKRAGVEDHPSLPRSLPSRAS     SS +  +      N+ QTSSSST
Sbjct: 181 YQKAEISLCRVYKRAGVEDHPSLPRSLPSRASSSRMTSSKNNLLPGGGSVNVVQTSSSST 240

Query: 241 DKF-QRFEPQFN-HHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPFTSTPLINMAA 300
           DKF   FE QF+ H LQIG+ +E TAADASATSSCEEVTTVLGLSKQNPF ++PLINMAA
Sbjct: 241 DKFPTSFESQFHPHQLQIGSGVEATAADASATSSCEEVTTVLGLSKQNPFPTSPLINMAA 300

Query: 301 TSS-YTPFFSPSSNNSL--DDLQKLI-HFQQQPPPSTPTTLINSLPTPYY---------- 360
           TSS   P  + ++ N +  DD Q +I H QQQ          +SL  P Y          
Sbjct: 301 TSSLQIPASASTTPNCMEEDDHQSIILHKQQQQQQQQLLPSSSSLILPTYTSFFSPSSNN 360

Query: 361 ----------------------------------QPTLPPPPQ------LPVVFPDRLWE 370
                                             QPT PPPPQ      LPVVF DRLW+
Sbjct: 361 SLDDLQKLIHYQQQQPPLSASPTTIINSLPSQYYQPTPPPPPQQLALNTLPVVFSDRLWD 420

BLAST of CmoCh04G009370 vs. NCBI nr
Match: gi|743935057|ref|XP_011011884.1| (PREDICTED: NAC domain-containing protein 72-like [Populus euphratica])

HSP 1 Score: 399.1 bits (1024), Expect = 8.7e-108
Identity = 243/420 (57.86%), Positives = 279/420 (66.43%), Query Frame = 1

Query: 1   MAIAAAASSSSTMSHEEISSSNNTHNKNDDDNNNCSDDQHEHDVVMPGFRFHPTEEELVE 60
           MAIAA      TMSH   +  NN +N  + ++NN  DD H+HD+VMPGFRFHPTEEELVE
Sbjct: 1   MAIAA------TMSHNTNNEQNNNNNDCNSNSNN-KDDDHDHDMVMPGFRFHPTEEELVE 60

Query: 61  FYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRV 120
           FYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRV
Sbjct: 61  FYLRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRV 120

Query: 121 TTSGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQ 180
           TT+GYWKATGADRMIRTE+ RSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQ
Sbjct: 121 TTTGYWKATGADRMIRTENSRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQ 180

Query: 181 KTEISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQRFE 240
           K EISLCRVYKRAGVEDHPSLPRSLPSRASSS        T  +    S  + ++FQ + 
Sbjct: 181 KVEISLCRVYKRAGVEDHPSLPRSLPSRASSSRG------TQSDKKHQSHITVERFQPYV 240

Query: 241 PQFNHHLQIGATIETTAADASATSSCEEVTTVLGLSKQNP--FTSTPLINMA-------- 300
            Q +  +++    ET A      SS  +VTT LGLSK N   +  TP I+ +        
Sbjct: 241 GQSSQQIEMEKMSETDA------SSSSDVTTALGLSKHNSNAYHPTPPISNSLGLPASVG 300

Query: 301 ----------ATSSYTPFF-------SPSSNNSLDDLQKLI---------HFQQQPP--- 360
                     A+SS  P F       S  S++ +DDL +L+         H+QQQ     
Sbjct: 301 EGMFLNLPKQASSSLIPSFTNLFSVTSSVSSSPIDDLHRLLNYQQASIDHHYQQQQQQQF 360

Query: 361 -----PSTPTTLINSLPTPYYQPTLPP---PPQLPVVFPDRLWEWNPIP----DGVNPFK 370
                P   ++ ++S+ TP     LP    P  LP  FPDR+WEWN +P    D  NPFK
Sbjct: 361 YLLQQPQHQSSQLSSM-TPQTSQQLPLNMLPELLPPTFPDRIWEWNQMPEANRDFNNPFK 400

BLAST of CmoCh04G009370 vs. NCBI nr
Match: gi|703070110|ref|XP_010088702.1| (Putative NAC domain-containing protein 94 [Morus notabilis])

HSP 1 Score: 394.4 bits (1012), Expect = 2.1e-106
Identity = 241/425 (56.71%), Positives = 275/425 (64.71%), Query Frame = 1

Query: 3   IAAAASSSSTMSHEEISSSNNTHNKNDDDNNNCSDDQHEHDVVMPGFRFHPTEEELVEFY 62
           +A AASS+ST++   +S       +N+ +NNN + D HEHD+VMPGFRFHPTEEELVEFY
Sbjct: 1   MAIAASSNSTITTTTMSQ------ENESNNNNKTTDDHEHDMVMPGFRFHPTEEELVEFY 60

Query: 63  LRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTT 122
           LRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTT
Sbjct: 61  LRRKVEGKRFNVELITFLDLYRYDPWELPALAAIGEKEWFFYVPRDRKYRNGDRPNRVTT 120

Query: 123 SGYWKATGADRMIRTEDFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPHHETERYQKT 182
           SGYWKATGADRMIR+E+FRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLP HETERYQK 
Sbjct: 121 SGYWKATGADRMIRSENFRSIGLKKTLVFYSGKAPKGIRTSWIMNEYRLPQHETERYQKA 180

Query: 183 EISLCRVYKRAGVEDHPSLPRSLPSRASSSSSRPISSSTPKNLHQTSSSSTDKFQRF--- 242
           EISLCRVYKRAGVEDHPSLPRSLPSRASS      S +   N  Q  +++ +K Q F   
Sbjct: 181 EISLCRVYKRAGVEDHPSLPRSLPSRASS------SRAAADNKKQYPNNAMEKLQTFGVH 240

Query: 243 -----EPQFNHHLQIGATIETTAADASATSSCEEVTTVLGLSKQNPF---TSTPLINMAA 302
                 P   H  ++  T ET   D S++S      T  GLSK+  +    ++P+   AA
Sbjct: 241 QVLPPPPPPPHQFEVENTNET---DGSSSSDVAAAGT-KGLSKRKAYRGSIASPIAQPAA 300

Query: 303 TSSY------------------TPFFSPSSN---NSLDDLQKLIHFQQQPPPSTPTTLIN 362
            ++                   T F S SS+   NS+DDL +LIH  QQ P S+ T+  N
Sbjct: 301 ANTSMEEEVAMLSAQQPKQFCPTLFSSGSSSTPPNSIDDLHRLIHNYQQAPSSSTTSSSN 360

Query: 363 S------------------LPTPYYQPTLPPPPQ-----------LPVVFPDRLWEWNPI 367
           +                   P P   P   PP Q           LP  F DRLWEWNP 
Sbjct: 361 ANIIVSHHHHSQQHYFNQFHPMPLPLPVPVPPQQQQVALNPLSNFLPTAFSDRLWEWNPF 409

BLAST of CmoCh04G009370 vs. NCBI nr
Match: gi|661899683|emb|CDO97677.1| (unnamed protein product [Coffea canephora])

HSP 1 Score: 393.3 bits (1009), Expect = 4.8e-106
Identity = 221/364 (60.71%), Positives = 253/364 (69.51%), Query Frame = 1

Query: 31  DNNNCSDDQHEHDVVMPGFRFHPTEEELVEFYLRRKVEGKRFNVELITFLDLYRYDPWEL 90
           +NNN  + +H+HD+VMPGFRFHPTEEEL+EFYLRRKVEGKRFNVELITFLDLYRYDPWEL
Sbjct: 10  ENNNKDEHEHDHDMVMPGFRFHPTEEELIEFYLRRKVEGKRFNVELITFLDLYRYDPWEL 69

Query: 91  PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTEDFRSIGLKKTLV 150
           PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTE+FRSIGLKKTLV
Sbjct: 70  PALAAIGEKEWFFYVPRDRKYRNGDRPNRVTTSGYWKATGADRMIRTENFRSIGLKKTLV 129

Query: 151 FYSGKAPKGIRTSWIMNEYRLPHHETERYQKTEISLCRVYKRAGVEDHPSLPRSLPSRAS 210
           FYSGKAPKGIRTSWIMNEYRLP HETER QK EISLCRVYKRAGVEDHPSLPRSLP+RAS
Sbjct: 130 FYSGKAPKGIRTSWIMNEYRLPQHETERLQKAEISLCRVYKRAGVEDHPSLPRSLPTRAS 189

Query: 211 SSSSRPISSSTPKNLHQTSSSSTDKFQRF--EPQFNHHLQIGATIETTAADASATSSCEE 270
           SS     SSS  K+   T+ +S ++FQ F   PQ     Q+   +  T+      SSC +
Sbjct: 190 SSRG-TTSSSAKKSQEATNHASMERFQAFVGNPQ-----QLDEKLSETSG-----SSCTD 249

Query: 271 VTTVLGLSKQNPFTS----TPLINMAATSSYTP---------FFSPSSNNSLDDLQKLIH 330
           + T LGLSK N F S    T  ++   +++  P          F P+ N +LDDL +L++
Sbjct: 250 IGTSLGLSKHNTFMSLAPMTTTLSQLCSTTLAPDCTTIFAGSSFVPTVNTTLDDLHRLVN 309

Query: 331 FQQQPPPSTPTTLINSLPTPYYQPTLPP-----------PPQLPVVFPDRLWEWNPIPDG 369
           FQQ           N+   P    +L P           P  L   F DRLW+WN I + 
Sbjct: 310 FQQASMSQHQQQYHNNPNHPSQFSSLQPQVQQSLALNMLPGPLQAAFTDRLWDWNSINEA 362

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
NAC35_ARATH  3.1e-89  56.97     NAC domain-containing protein 35 OS=Arabidopsis thaliana GN=NAC035 PE=1 SV=2
NAC94_ARATH  7.7e-48  49.50     Putative NAC domain-containing protein 94 OS=Arabidopsis thaliana GN=ANAC094 PE=...
FEZ_ARATH    6.5e-47  39.41     Protein FEZ OS=Arabidopsis thaliana GN=FEZ PE=2 SV=1
NAC72_ARATH  9.4e-46  51.41     NAC domain-containing protein 72 OS=Arabidopsis thaliana GN=NAC072 PE=2 SV=1
NAC55_ARATH  1.8e-44  50.82     NAC domain-containing protein 55 OS=Arabidopsis thaliana GN=NAC055 PE=2 SV=1
Match Name        E-value   Identity  Description
A0A0A0L2S0_CUCSA  5.8e-135  66.67     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G113370 PE=4 SV=1
W9R289_9ROSA      1.5e-106  56.71     Putative NAC domain-containing protein 94 OS=Morus notabilis GN=L484_016488 PE=4...
A0A068TNX5_COFCA  3.3e-106  60.71     Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00015084001 PE=4 SV=1
A0A061GB68_THECC  5.3e-104  57.42     NAC domain containing protein 35 OS=Theobroma cacao GN=TCM_027945 PE=4 SV=1
A0A0D2RJ65_GOSRA  2.2e-102  55.19     Uncharacterized protein OS=Gossypium raimondii GN=B456_011G090000 PE=4 SV=1
Match Name   E-value  Identity  Description
AT2G02450.2  1.7e-90  56.97     NAC domain containing protein 35
AT2G17040.1  2.3e-50  56.86     NAC domain containing protein 36
AT5G39820.1  4.4e-49  49.50     NAC domain containing protein 94
AT1G26870.1  3.7e-48  39.41     NAC (No Apical Meristem) domain transcriptional regulator superfamil...
AT3G15500.1  1.0e-45  50.82     NAC domain containing protein 3
Match Name                        E-value   Identity  Description
gi|659102563|ref|XP_008452197.1|  1.9e-142  69.41     PREDICTED: NAC domain-containing protein 55 [Cucumis melo]
gi|778676468|ref|XP_011650588.1|  8.3e-135  66.67     PREDICTED: NAC domain-containing protein 55 [Cucumis sativus]
gi|743935057|ref|XP_011011884.1|  8.7e-108  57.86     PREDICTED: NAC domain-containing protein 72-like [Populus euphratica]
gi|703070110|ref|XP_010088702.1|  2.1e-106  56.71     Putative NAC domain-containing protein 94 [Morus notabilis]
gi|661899683|emb|CDO97677.1|      4.8e-106  60.71     unnamed protein product [Coffea canephora]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR003441  NAC-dom
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G009370.1  CmoCh04G009370.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description   Source   Source Term     Source Description   Alignment
IPR003441  NAC domain        PFAM     PF02365         NAM                  coord: 47..171, score: 7.1
IPR003441  NAC domain        PROFILE  PS51005         NAC                  coord: 45..192, score: 55
IPR003441  NAC domain        unknown  SSF101941       NAC domain           coord: 43..191, score: 5.36
None       No IPR available  PANTHER  PTHR31719       FAMILY NOT NAMED     coord: 22..269, score: 1.9E
None       No IPR available  PANTHER  PTHR31719:SF24  SUBFAMILY NOT NAMED  coord: 22..269, score: 1.9E
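
The three IPR003441 hits above report slightly different boundaries for the NAC domain. As a rough illustration, the Python sketch below computes the span shared by all three. It assumes the `coord:` values are 1-based, inclusive residue positions on the protein, which the page does not state. The helper names `parse_coord` and `common_span` are hypothetical.

```python
def parse_coord(text):
    """Turn a string such as 'coord: 47..171' into an integer (start, end) pair."""
    start, end = text.replace("coord:", "").strip().split("..")
    return int(start), int(end)


def common_span(coords):
    """Return the interval covered by every hit, or None if they do not all overlap."""
    spans = [parse_coord(c) for c in coords]
    start = max(s for s, _ in spans)  # latest start among the hits
    end = min(e for _, e in spans)    # earliest end among the hits
    return (start, end) if start <= end else None


# Coordinates copied from the PFAM, PROFILE and SSF101941 rows above.
nac_hits = ["coord: 47..171", "coord: 45..192", "coord: 43..191"]
span = common_span(nac_hits)
length = span[1] - span[0] + 1  # inclusive length, under the 1-based assumption
print(span, length)
```

Here the shared span is bounded by the PFAM hit, the narrowest of the three.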