ClCG09G022950 (gene) Watermelon (Charleston Gray)

Name: ClCG09G022950
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: NAC domain-containing protein, putative
Location: CG_Chr09 : 40056510 .. 40058450 (+)
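
The location field can be read as chromosome, start, end and strand. The sketch below parses that string and computes the genomic span, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page). The function names `parse_location` and `span_length` are illustrative only.

```python
import re

def parse_location(text):
    """Split 'chrom : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("CG_Chr09 : 40056510 .. 40058450 (+)")
print(chrom, strand, span_length(start, end))  # CG_Chr09 + 1941
```

Under that assumption the feature spans 1941 bp on the forward strand, introns included.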
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGCCCGAAAATGGACAACAGTTAAGTGTTCCACCAGGCTTTCGATTCCATCCAACAGATGAGGAGCTTCTTTATTACTACCTTAGGAAGAAGGTTTCGTATGAAGCCATTGAGCTTGATGTTATCAGGGAAGTGGATCTAAACAAACTAGAGCCTTGGGATCTCAAAGGTTATGATCTATTGATTATTATTTATAATTGTTATACGTTAGTGTTAAAAGTTTGATATAATTAACTTAATAAACCTACAGAATCGGTTTTTGGTTTCTTCACTGGTGATTTAACAATTAGCTAAAGATTGTGGGTTAGTAGATTCATTTGCATGCACGTACACGTTCTCTTGTAGATACATATATATCTAAATCTAAACAAACAAAAATAATAATATAAAGTTACTTTTTGGTTAATTTTCAGATAAATGTAGAATTGGATCTGGGCATCAAAACGAATGGTATTTCTTTAGCCATAAGGACAAAAAATATCCAACTGGAACTCGAACTAATAGAGCAACCAGTGCTGGATTTTGGAAGGCAACAGGGAGAGACAAAACCATTCACACGGGCAATTGCAATTCCAAAAGGATTGGCATGAGGAAGACACTGGTGTTCTACACAGGTCGTGCTCCTCATGGCCAGAAGACTGATTGGATCATGCATGAATACCGTCTTGAAGATGACAATCCTGAGGTTCAGATATATATTCAAATAATAATAATTACCCCTGTTTCCTTCCCTACATTTAGTTACGTTTTATATGTTATATTTATAGCTAGGCTTCTCATCATAGGCAAAAGTTAGCTCCATCTCAATGTTACACAAGTATGATAATAGATTTGATAATAATGTTAATACCCTGTTGAAAATTTAGCTTTTAAACAATACTTCTTTATATATATACACACACACACACACACATATACATATATCTATTTTCTACTTATATTTTCAAAAAACCAGCCCAAAATTAAAAAATTAAAAAAAGTATATATATTTTTCAACAAAATATTTTTAAAGAATTTAAGTGTTTCCTTTAAGACCATGAAATGAAAATTATAATAGACATAAACTTTAAAAACTAAGAACTAAAAACATACAAAATGATACCTTAATAAAACCTATCTAGTTGATCAAATTAAAAGACCATGTGTTCCAAATAGGGTGGACATAGTTTTAGTACTAATTGACATTCTCTTTTTTTAGGTCAAGTCTCACGGATCTTAGTTACATTAAGCTGGTTCATTTAATAAAAATTTAGATAAATTTCAAAAAGCCAAATACTAAAAAAAGAAATCGTTATAAAGCGGAGCGTTGTTAACCTTTGCCCAATCTAGCTGAAATGGAAGGAAGGAGGTGATCTGATCTGTTGAATTCCAATTTGTGAAATGACTTCTTTTTCTTTCTTTTTTTCTTTAATTCTATAGCTGCACGAAGATGGTTGGGTGGTGTGCAGAGTTTTCAAGAAGAAAAGTCACAAGCCAGAGGTTCCGCAAGAGCAGCAGTTAGATTACTATACACATACGAAGCTAGGTAGCAGTTCTGCTTCGGTCATGGGTGCAGAAATGGGAGAGCCAAAAAACAATAATAATCATATGCAAGAGCCATATAATAATGATTATAGCTTTGATGGTTGCATGCAGCTGCCGCAGCTGTTTAGTCCAGAGTCATCAGTTGCAACACCTATCTCTTTAAACGCCGCCGCCACTGTAGAGTGTCCTCAAAATATCTGGAGGTTAAGTTGCGGGCTCGTGCAACATGAGCGATTGAACACTGATTGGTCATTCTTGAATAGGCTGCTTGCTTTGGATCAACAATCTCGTTCCAAATCTACGCTTTCGGATGACCTTACTATGGGACACTCCAACTCCAGAAAATTTTCATTTCCATTTCCATATCCTTATCCTCTTGCCTCTGGTGCCGACTTTATCAAATTCTCCAAGTAG

mRNA sequence

ATGATGCCCGAAAATGGACAACAGTTAAGTGTTCCACCAGGCTTTCGATTCCATCCAACAGATGAGGAGCTTCTTTATTACTACCTTAGGAAGAAGGTTTCGTATGAAGCCATTGAGCTTGATGTTATCAGGGAAGTGGATCTAAACAAACTAGAGCCTTGGGATCTCAAAGATAAATGTAGAATTGGATCTGGGCATCAAAACGAATGGTATTTCTTTAGCCATAAGGACAAAAAATATCCAACTGGAACTCGAACTAATAGAGCAACCAGTGCTGGATTTTGGAAGGCAACAGGGAGAGACAAAACCATTCACACGGGCAATTGCAATTCCAAAAGGATTGGCATGAGGAAGACACTGGTGTTCTACACAGGTCGTGCTCCTCATGGCCAGAAGACTGATTGGATCATGCATGAATACCGTCTTGAAGATGACAATCCTGAGGTTCAGATATATATTCAAATAATAATAATTACCCCTCTGCACGAAGATGGTTGGGTGGTGTGCAGAGTTTTCAAGAAGAAAAGTCACAAGCCAGAGGTTCCGCAAGAGCAGCAGTTAGATTACTATACACATACGAAGCTAGGTAGCAGTTCTGCTTCGGTCATGGGTGCAGAAATGGGAGAGCCAAAAAACAATAATAATCATATGCAAGAGCCATATAATAATGATTATAGCTTTGATGGTTGCATGCAGCTGCCGCAGCTGTTTAGTCCAGAGTCATCAGTTGCAACACCTATCTCTTTAAACGCCGCCGCCACTGTAGAGTGTCCTCAAAATATCTGGAGGTTAAGTTGCGGGCTCGTGCAACATGAGCGATTGAACACTGATTGGTCATTCTTGAATAGGCTGCTTGCTTTGGATCAACAATCTCGTTCCAAATCTACGCTTTCGGATGACCTTACTATGGGACACTCCAACTCCAGAAAATTTTCATTTCCATTTCCATATCCTTATCCTCTTGCCTCTGGTGCCGACTTTATCAAATTCTCCAAGTAG

Coding sequence (CDS)

ATGATGCCCGAAAATGGACAACAGTTAAGTGTTCCACCAGGCTTTCGATTCCATCCAACAGATGAGGAGCTTCTTTATTACTACCTTAGGAAGAAGGTTTCGTATGAAGCCATTGAGCTTGATGTTATCAGGGAAGTGGATCTAAACAAACTAGAGCCTTGGGATCTCAAAGATAAATGTAGAATTGGATCTGGGCATCAAAACGAATGGTATTTCTTTAGCCATAAGGACAAAAAATATCCAACTGGAACTCGAACTAATAGAGCAACCAGTGCTGGATTTTGGAAGGCAACAGGGAGAGACAAAACCATTCACACGGGCAATTGCAATTCCAAAAGGATTGGCATGAGGAAGACACTGGTGTTCTACACAGGTCGTGCTCCTCATGGCCAGAAGACTGATTGGATCATGCATGAATACCGTCTTGAAGATGACAATCCTGAGGTTCAGATATATATTCAAATAATAATAATTACCCCTCTGCACGAAGATGGTTGGGTGGTGTGCAGAGTTTTCAAGAAGAAAAGTCACAAGCCAGAGGTTCCGCAAGAGCAGCAGTTAGATTACTATACACATACGAAGCTAGGTAGCAGTTCTGCTTCGGTCATGGGTGCAGAAATGGGAGAGCCAAAAAACAATAATAATCATATGCAAGAGCCATATAATAATGATTATAGCTTTGATGGTTGCATGCAGCTGCCGCAGCTGTTTAGTCCAGAGTCATCAGTTGCAACACCTATCTCTTTAAACGCCGCCGCCACTGTAGAGTGTCCTCAAAATATCTGGAGGTTAAGTTGCGGGCTCGTGCAACATGAGCGATTGAACACTGATTGGTCATTCTTGAATAGGCTGCTTGCTTTGGATCAACAATCTCGTTCCAAATCTACGCTTTCGGATGACCTTACTATGGGACACTCCAACTCCAGAAAATTTTCATTTCCATTTCCATATCCTTATCCTCTTGCCTCTGGTGCCGACTTTATCAAATTCTCCAAGTAG

Protein sequence

MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHKPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQLFSPESSVATPISLNAAATVECPQNIWRLSCGLVQHERLNTDWSFLNRLLALDQQSRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK
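
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the snippet below translates the first and last few codons of the CDS listed above and compares them with the ends of the protein sequence. The helper `translate` is a hypothetical name, not part of any tool used to build this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 24 and last 12 nucleotides of the CDS shown above.
print(translate("ATGATGCCCGAAAATGGACAACAG"))  # MMPENGQQ
print(translate("TTCTCCAAGTAG"))              # FSK*
```

Both fragments agree with the start (MMPENGQQ) and end (FSK) of the protein sequence, with the final TAG read as the stop codon.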
BLAST of ClCG09G022950 vs. Swiss-Prot
Match: SMB_ARATH (Protein SOMBRERO OS=Arabidopsis thaliana GN=SMB PE=1 SV=1)

HSP 1 Score: 325.5 bits (833), Expect = 6.9e-88
Identity = 186/379 (49.08%), Positives = 235/379 (62.01%), Query Frame = 1

Query: 6   GQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSG 65
           G QLSVPPGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG
Sbjct: 12  GGQLSVPPGFRFHPTEEELLYYYLKKKVSYEPIDLDVIREVDLNKLEPWELKEKCRIGSG 71

Query: 66  HQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYTG 125
            QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK+IH    +SK+IG+RKTLVFYTG
Sbjct: 72  PQNEWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKSIHLN--SSKKIGLRKTLVFYTG 131

Query: 126 RAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHKPEVPQEQ 185
           RAPHGQKT+WIMHEYRL+D   E+Q            EDGWVVCRVFKKK+H     QEQ
Sbjct: 132 RAPHGQKTEWIMHEYRLDDSENEIQ------------EDGWVVCRVFKKKNHFRGFHQEQ 191

Query: 186 QLDYYTHTKLGSSSASVMGAEMGEPKNNNN-----HMQEPYNNDYSF------------- 245
           + D++ H +  S++         +  +NN+     H  + +++ +               
Sbjct: 192 EQDHHHHHQYISTNNDHDHHHHIDSNSNNHSPLILHPLDHHHHHHHIGRQIHMPLHEFAN 251

Query: 246 ---DGCMQLPQLFSPESSVATPISLNAA---------ATVECPQNIWRLSCGLVQHERLN 305
               G M LPQLFSP+S+ A   +  +A           +EC QN+ RL+     +    
Sbjct: 252 TLSHGSMHLPQLFSPDSAAAAAAAAASAQPFVSPINTTDIECSQNLLRLT----SNNNYG 311

Query: 306 TDWSFLNRLLA---LDQQSRS-----KSTLSDDLT----------MGHSNSRKFSFP--- 333
            DWSFL++LL    ++QQ +      ++    DL+          +G++N    S P   
Sbjct: 312 GDWSFLDKLLTTGNMNQQQQQQVQNHQAKCFGDLSNNDNNDQADHLGNNNGGSSSSPVNQ 371
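
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below parses one such line and recomputes both values as a consistency check; the function name `parse_hsp_stats` is an assumption for illustration, and the regular expression only targets the line layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Extract identity and positive counts from a BLAST HSP summary line."""
    m = re.search(
        r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
        r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)",  # also accepts the 'Postives' misspelling
        line,
    )
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(length),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 186/379 (49.08%), Positives = 235/379 (62.01%), Query Frame = 1"
)
identity_pct = round(100 * stats["identities"] / stats["align_length"], 2)
positive_pct = round(100 * stats["positives"] / stats["align_length"], 2)
print(identity_pct, positive_pct)  # 49.08 62.01
```

The recomputed values match the reported ones for the SMB_ARATH hit: 186 identical and 235 similar positions over an alignment of 379 columns, gaps included.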

BLAST of ClCG09G022950 vs. Swiss-Prot
Match: NAC76_ORYSJ (NAC domain-containing protein 76 OS=Oryza sativa subsp. japonica GN=NAC76 PE=2 SV=2)

HSP 1 Score: 300.4 bits (768), Expect = 2.4e-80
Identity = 160/275 (58.18%), Positives = 186/275 (67.64%), Query Frame = 1

Query: 2   MPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCR 61
           M  +G  LSVPPGFRFHPTDEELLYYYLRKKV+YEAI+LDVIRE+DLNKLEPWDLKD+CR
Sbjct: 1   MHPSGGALSVPPGFRFHPTDEELLYYYLRKKVAYEAIDLDVIREIDLNKLEPWDLKDRCR 60

Query: 62  IGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLV 121
           IG+G QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK I     N+ RIGMRKTLV
Sbjct: 61  IGTGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIFL--ANACRIGMRKTLV 120

Query: 122 FYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHK--- 181
           FY GRAPHG+KTDWIMHEYRL+ DN +VQ            EDGWVVCRVF KKS++   
Sbjct: 121 FYVGRAPHGKKTDWIMHEYRLDQDNVDVQ------------EDGWVVCRVFMKKSYQRGL 180

Query: 182 -----PEVPQEQQLDYYTH----TKLGSSSASVMGAEMGEPKNNNNHMQEP---YNNDYS 241
                  V  +  L ++ H     +L   +A       G   ++++H+ +P   Y++  S
Sbjct: 181 NPADMAAVDDDDLLHHHHHPFPPAQLHGGAAD--HKHDGAGGHHHHHLMQPHHHYDDFPS 240

Query: 242 FDGCMQLPQLFSPESSVATPISLNAAATVECPQNI 262
           FD  MQLPQL S +     P SL    T    Q +
Sbjct: 241 FDPSMQLPQLMSADQPPPPPPSLLPGGTASSLQRL 259

BLAST of ClCG09G022950 vs. Swiss-Prot
Match: NAC43_ARATH (NAC domain-containing protein 43 OS=Arabidopsis thaliana GN=NAC043 PE=2 SV=2)

HSP 1 Score: 265.0 bits (676), Expect = 1.1e-69
Identity = 157/333 (47.15%), Positives = 198/333 (59.46%), Query Frame = 1

Query: 5   NGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 64
           NGQ   VPPGFRFHPT+EELL YYLRKKV+   I+LDVIR+VDLNKLEPWD+++ C+IG+
Sbjct: 11  NGQS-QVPPGFRFHPTEEELLQYYLRKKVNSIEIDLDVIRDVDLNKLEPWDIQEMCKIGT 70

Query: 65  GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYT 124
             QN+WYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK I++   N +RIGMRKTLVFY 
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKIIYS---NGRRIGMRKTLVFYK 130

Query: 125 GRAPHGQKTDWIMHEYRLEDD--NPE-VQIYIQIIIITPLHED-GWVVCRVFKKKSHKPE 184
           GRAPHGQK+DWIMHEYRL+D+  +PE V ++  + II    +D GWVVCR+FKKK+    
Sbjct: 131 GRAPHGQKSDWIMHEYRLDDNIISPEDVTVHEVVSIIGEASQDEGWVVCRIFKKKN---- 190

Query: 185 VPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNH-------------MQEPYNNDYSF 244
                 L    ++ +G +S S  G     PK  ++              M      + + 
Sbjct: 191 ------LHKTLNSPVGGASLSGGG---DTPKTTSSQIFNEDTLDQFLELMGRSCKEELNL 250

Query: 245 DGCMQLPQLFSPESSVATPISLNAAATVECPQNIWRLSCGLVQHERLNTDWSFLNRLLA- 304
           D  M+LP L SP S             V  P     +    V      T W+ L+RL+A 
Sbjct: 251 DPFMKLPNLESPNSQAIN------NCHVSSPDTNHNIHVSNVVDTSFVTSWAALDRLVAS 310

Query: 305 -LDQQSRSKSTLSDDLTMGHSNSRKFSFPFPYP 319
            L+  +    T  ++  +GH +    S   PYP
Sbjct: 311 QLNGPTSYSITAVNESHVGHDHLALPSVRSPYP 320

BLAST of ClCG09G022950 vs. Swiss-Prot
Match: NAC12_ARATH (NAC domain-containing protein 12 OS=Arabidopsis thaliana GN=NAC012 PE=2 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 1.6e-68
Identity = 150/310 (48.39%), Positives = 197/310 (63.55%), Query Frame = 1

Query: 5   NGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 64
           NGQ   VPPGFRFHPT+EELL+YYLRKKV+ + I+LDVIREVDLNKLEPWD++++CRIGS
Sbjct: 11  NGQS-KVPPGFRFHPTEEELLHYYLRKKVNSQKIDLDVIREVDLNKLEPWDIQEECRIGS 70

Query: 65  GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNS-KRIGMRKTLVFY 124
             QN+WYFFSHKDKKYPTGTRTNRAT AGFWKATGRDK I    C+  +RIG+RKTLVFY
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATVAGFWKATGRDKII----CSCVRRIGLRKTLVFY 130

Query: 125 TGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPL--HEDGWVVCRVFKKKSHK--- 184
            GRAPHGQK+DWIMHEYRL DD P    Y  ++   P+  +E+GWVVCRVF+KK+++   
Sbjct: 131 KGRAPHGQKSDWIMHEYRL-DDTPMSNGYADVVTEDPMSYNEEGWVVCRVFRKKNYQKID 190

Query: 185 ----------PEVPQEQQLDYYTHTKLGSSSASVM------GAEMGEPKNNNNHMQEPYN 244
                     P+  +E++   + +T+  +    V+      G+ +  P++        + 
Sbjct: 191 DCPKITLSSLPDDTEEEKGPTFHNTQNVTGLDHVLLYMDRTGSNICMPESQTT---TQHQ 250

Query: 245 NDYSFDGCMQLPQLFSPES------SVATPISLNAAATVECPQNIWRLSCGLVQHERLNT 287
           +D  F   MQLP L +P+S      S  TP  L+ +   E            +    + +
Sbjct: 251 DDVLF---MQLPSLETPKSESPVDQSFLTPSKLDFSPVQE-----------KITERPVCS 297

BLAST of ClCG09G022950 vs. Swiss-Prot
Match: BRN2_ARATH (Protein BEARSKIN2 OS=Arabidopsis thaliana GN=BRN2 PE=2 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 2.1e-68
Identity = 159/361 (44.04%), Positives = 200/361 (55.40%), Query Frame = 1

Query: 11  VPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEW 70
           VPPGFRFHPTDEELL+YYL+KK+SY+  E++VIREVDLNKLEPWDL+++C+IGS  QNEW
Sbjct: 9   VPPGFRFHPTDEELLHYYLKKKISYQKFEMEVIREVDLNKLEPWDLQERCKIGSTPQNEW 68

Query: 71  YFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYTGRAPHG 130
           YFFSHKD+KYPTG+RTNRAT AGFWKATGRDK I     + K+IGMRKTLVFY GRAPHG
Sbjct: 69  YFFSHKDRKYPTGSRTNRATHAGFWKATGRDKCIRN---SYKKIGMRKTLVFYKGRAPHG 128

Query: 131 QKTDWIMHEYRLED-DNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHKPEVPQEQQLDY 190
           QKTDWIMHEYRLED D+P+              EDGWVVCRVF KK            + 
Sbjct: 129 QKTDWIMHEYRLEDADDPQAN----------PSEDGWVVCRVFMKK------------NL 188

Query: 191 YTHTKLGSSSASVMGAEMGEPKNNNN--------HMQEPY-----NNDYSFD------GC 250
           +     GSSS + +     +  NNN+        H   PY     +   +F+        
Sbjct: 189 FKVVNEGSSSINSLDQHNHDASNNNHALQARSFMHRDSPYQLVRNHGAMTFELNKPDLAL 248

Query: 251 MQLPQLFSPESSVATPISLNAAATVE------------CPQNIWRLSCGLV---QHERLN 310
            Q P +F    S+    S   A   E            C   +   +C  V    H++  
Sbjct: 249 HQYPPIFHKPPSLGFDYSSGLARDSESAASEGLQYQQACEPGLDVGTCETVASHNHQQGL 308

Query: 311 TDWSFLNRLLA--LDQQSRSKSTLSDDLTMGHSNSRKFSFPFP--YPYPLASGADFIKFS 333
            +W+ ++RL+   +  +  S+    +D   G++NS     P P      L S  DF  +S
Sbjct: 309 GEWAMMDRLVTCHMGNEDSSRGITYED---GNNNSSSVVQPVPATNQLTLRSEMDFWGYS 341

BLAST of ClCG09G022950 vs. TrEMBL
Match: A0A0A0K475_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G252700 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 4.9e-157
Identity = 288/342 (84.21%), Positives = 295/342 (86.26%), Query Frame = 1

Query: 1   MMPENGQQL-SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60
           MMPEN QQL SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK
Sbjct: 1   MMPENEQQLVSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60

Query: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIH--TGNCNSKRI-GM 120
           CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIH  + N NSKRI GM
Sbjct: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHMSSSNSNSKRIIGM 120

Query: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKS 180
           RKTLVFYTGRAPHGQKTDWIMHEYRLE  NPEVQ            EDGWVVCRVFKKKS
Sbjct: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEHHNPEVQ------------EDGWVVCRVFKKKS 180

Query: 181 HKPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPY-NNDYSFDGCMQLPQ 240
            K EVP+EQQLDYY HTKLG SS S +G EMGEPKNNNNHMQEP+ NNDYSFDGCMQLPQ
Sbjct: 181 QKSEVPEEQQLDYYAHTKLGGSSGSAVGTEMGEPKNNNNHMQEPHNNNDYSFDGCMQLPQ 240

Query: 241 LFSPESSVA---TPISLNAA-ATVECPQNIWRLSCGLVQHERLN-TDWSFLNRLLALDQQ 300
           LFSPESS       ISLNAA A VECPQNIWRLSCG+VQHERLN TDWSFLNRLLALDQQ
Sbjct: 241 LFSPESSTVPTLPAISLNAAGAAVECPQNIWRLSCGVVQHERLNTTDWSFLNRLLALDQQ 300

Query: 301 SRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
           SRSKSTLSD+LT+    SR FSFPFPYPY L SG DFIKFSK
Sbjct: 301 SRSKSTLSDELTI----SRNFSFPFPYPYHLPSGPDFIKFSK 326

BLAST of ClCG09G022950 vs. TrEMBL
Match: A0A0M4FKZ1_MANES (NAC transcription factors 51 OS=Manihot esculenta PE=2 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 6.9e-111
Identity = 227/346 (65.61%), Positives = 256/346 (73.99%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           MM  NGQ L+VPPGFRFHPTDEELLYYYL+KKVSYEAI+LDVIRE+DLNKLEPWDLKDKC
Sbjct: 1   MMAGNGQ-LAVPPGFRFHPTDEELLYYYLKKKVSYEAIDLDVIRELDLNKLEPWDLKDKC 60

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH    NSKRIGMRKTL
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLS--NSKRIGMRKTL 120

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH-KP 180
           VFYTGRAPHGQKTDWIMHEYRL+DDN EVQ            EDGWVVCRVFKKK+  + 
Sbjct: 121 VFYTGRAPHGQKTDWIMHEYRLDDDNSEVQ------------EDGWVVCRVFKKKNQSRG 180

Query: 181 EVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQLFSP 240
            +P   Q D+++H K+ SSSAS+        ++  NHMQ  Y  D SFDG M LPQLFSP
Sbjct: 181 FLPDVAQDDHFSHMKVSSSSASM--------EHKQNHMQALY--DCSFDGSMHLPQLFSP 240

Query: 241 ESSVA----TPISLNAAATVECPQNIWRLS---CGLVQHERLNTDWSFLNRLLA----LD 300
           ES+VA    TP+ LN     EC QN+ RL+   CGLVQ ER+N+DWSFL++LLA    LD
Sbjct: 241 ESAVAPSFITPLPLNTMDINECSQNLLRLTSSGCGLVQPERVNSDWSFLDKLLASHQSLD 300

Query: 301 QQSRSKSTLSDDLT--MGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
           QQS+SK   S  L   MG S+ +KF+  FPY   L    D +KF K
Sbjct: 301 QQSQSKRDPSFHLVDHMGASSHQKFT-TFPY---LGCENDILKFPK 317

BLAST of ClCG09G022950 vs. TrEMBL
Match: A0A061FBQ8_THECC (NAC domain transcriptional regulator superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_033728 PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 9.0e-111
Identity = 225/351 (64.10%), Positives = 258/351 (73.50%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           M+P NGQ LSVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLKDKC
Sbjct: 19  MLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLKDKC 78

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH   CNSK+IGMRKTL
Sbjct: 79  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHL--CNSKKIGMRKTL 138

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH--- 180
           VFYTGRAPHGQKTDWIMHEYRL+DD+ +VQ            EDGWVVCRVFKKK+H   
Sbjct: 139 VFYTGRAPHGQKTDWIMHEYRLDDDDSDVQ------------EDGWVVCRVFKKKNHSRG 198

Query: 181 --KPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQ 240
             +PE  QE+    +TH K  +SSA +        +  +NH+Q  Y  D+SFDG MQLP 
Sbjct: 199 NFQPEFSQEES---FTHIKTVASSAQL--------ETRHNHLQALY--DFSFDGSMQLPH 258

Query: 241 LFSPESSVA----TPISLNAAATVECPQNIWRLS----CGLVQHERLNTDWSFLNRLLA- 300
           LFSPES+VA    +P+SLN +  +EC QN+ RL+    CGLVQ ER N +WSFL++LLA 
Sbjct: 259 LFSPESAVASSFISPVSLN-STDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLAT 318

Query: 301 ----LDQQ-SRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
               +DQQ S+ K T S  + +G S  +   FPF Y   L   AD +KFSK
Sbjct: 319 HHLSVDQQHSQGKCTPSSQVDVGTSTQK---FPFQY---LGCEADILKFSK 334

BLAST of ClCG09G022950 vs. TrEMBL
Match: A0A061FAG5_THECC (NAC domain transcriptional regulator superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_033728 PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 9.0e-111
Identity = 225/351 (64.10%), Positives = 258/351 (73.50%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           M+P NGQ LSVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLKDKC
Sbjct: 1   MLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLKDKC 60

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH   CNSK+IGMRKTL
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHL--CNSKKIGMRKTL 120

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH--- 180
           VFYTGRAPHGQKTDWIMHEYRL+DD+ +VQ            EDGWVVCRVFKKK+H   
Sbjct: 121 VFYTGRAPHGQKTDWIMHEYRLDDDDSDVQ------------EDGWVVCRVFKKKNHSRG 180

Query: 181 --KPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQ 240
             +PE  QE+    +TH K  +SSA +        +  +NH+Q  Y  D+SFDG MQLP 
Sbjct: 181 NFQPEFSQEES---FTHIKTVASSAQL--------ETRHNHLQALY--DFSFDGSMQLPH 240

Query: 241 LFSPESSVA----TPISLNAAATVECPQNIWRLS----CGLVQHERLNTDWSFLNRLLA- 300
           LFSPES+VA    +P+SLN +  +EC QN+ RL+    CGLVQ ER N +WSFL++LLA 
Sbjct: 241 LFSPESAVASSFISPVSLN-STDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLAT 300

Query: 301 ----LDQQ-SRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
               +DQQ S+ K T S  + +G S  +   FPF Y   L   AD +KFSK
Sbjct: 301 HHLSVDQQHSQGKCTPSSQVDVGTSTQK---FPFQY---LGCEADILKFSK 316

BLAST of ClCG09G022950 vs. TrEMBL
Match: B9S679_RICCO (NAC domain-containing protein, putative OS=Ricinus communis GN=RCOM_0674920 PE=4 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 1.9e-108
Identity = 223/345 (64.64%), Positives = 252/345 (73.04%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           MM  NGQ LSVPPGFRFHPTDEELLYYYL+KKVSYEAI+LDVIREVDLNKLEPWDLK+KC
Sbjct: 1   MMAGNGQ-LSVPPGFRFHPTDEELLYYYLKKKVSYEAIDLDVIREVDLNKLEPWDLKEKC 60

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH    NSKRIGMRKTL
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLS--NSKRIGMRKTL 120

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH-KP 180
           VFYTGRAPHGQKTDWIMHEYRL+DDN EVQ            EDGWVVCRVFKKK+  + 
Sbjct: 121 VFYTGRAPHGQKTDWIMHEYRLDDDNSEVQ------------EDGWVVCRVFKKKNQTRG 180

Query: 181 EVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQLFSP 240
            +P+  Q ++++H K G+SS S+      EPK   +HMQ  Y  DY+FDG M LPQLFSP
Sbjct: 181 FLPEAAQEEHFSHMKAGASSVSM------EPK--QHHMQALY--DYNFDGSMHLPQLFSP 240

Query: 241 ES-----SVATPISLNAAATVECPQNIWRL---SCGLVQHERLNTDWSFLNRLLA----L 300
           ES     S  TP+SLN    +EC QN+ RL   SCGLVQ ER + DWSFL++LLA    L
Sbjct: 241 ESAAVPPSFVTPLSLN-TMDIECSQNLLRLTSTSCGLVQPERFHGDWSFLDKLLASHQSL 300

Query: 301 DQQSRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
           D Q +   + S  + MG S   +  FPFPY   L    D ++FSK
Sbjct: 301 DHQGKGNPS-SQVVDMGASVHHQ-KFPFPY---LGCETDIMRFSK 314

BLAST of ClCG09G022950 vs. TAIR10
Match: AT1G79580.1 (AT1G79580.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein)

HSP 1 Score: 325.5 bits (833), Expect = 3.9e-89
Identity = 186/379 (49.08%), Positives = 235/379 (62.01%), Query Frame = 1

Query: 6   GQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSG 65
           G QLSVPPGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG
Sbjct: 12  GGQLSVPPGFRFHPTEEELLYYYLKKKVSYEPIDLDVIREVDLNKLEPWELKEKCRIGSG 71

Query: 66  HQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYTG 125
            QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK+IH    +SK+IG+RKTLVFYTG
Sbjct: 72  PQNEWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKSIHLN--SSKKIGLRKTLVFYTG 131

Query: 126 RAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHKPEVPQEQ 185
           RAPHGQKT+WIMHEYRL+D   E+Q            EDGWVVCRVFKKK+H     QEQ
Sbjct: 132 RAPHGQKTEWIMHEYRLDDSENEIQ------------EDGWVVCRVFKKKNHFRGFHQEQ 191

Query: 186 QLDYYTHTKLGSSSASVMGAEMGEPKNNNN-----HMQEPYNNDYSF------------- 245
           + D++ H +  S++         +  +NN+     H  + +++ +               
Sbjct: 192 EQDHHHHHQYISTNNDHDHHHHIDSNSNNHSPLILHPLDHHHHHHHIGRQIHMPLHEFAN 251

Query: 246 ---DGCMQLPQLFSPESSVATPISLNAA---------ATVECPQNIWRLSCGLVQHERLN 305
               G M LPQLFSP+S+ A   +  +A           +EC QN+ RL+     +    
Sbjct: 252 TLSHGSMHLPQLFSPDSAAAAAAAAASAQPFVSPINTTDIECSQNLLRLT----SNNNYG 311

Query: 306 TDWSFLNRLLA---LDQQSRS-----KSTLSDDLT----------MGHSNSRKFSFP--- 333
            DWSFL++LL    ++QQ +      ++    DL+          +G++N    S P   
Sbjct: 312 GDWSFLDKLLTTGNMNQQQQQQVQNHQAKCFGDLSNNDNNDQADHLGNNNGGSSSSPVNQ 371

BLAST of ClCG09G022950 vs. TAIR10
Match: AT2G46770.1 (AT2G46770.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein)

HSP 1 Score: 265.0 bits (676), Expect = 6.2e-71
Identity = 157/333 (47.15%), Positives = 198/333 (59.46%), Query Frame = 1

Query: 5   NGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 64
           NGQ   VPPGFRFHPT+EELL YYLRKKV+   I+LDVIR+VDLNKLEPWD+++ C+IG+
Sbjct: 11  NGQS-QVPPGFRFHPTEEELLQYYLRKKVNSIEIDLDVIRDVDLNKLEPWDIQEMCKIGT 70

Query: 65  GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYT 124
             QN+WYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK I++   N +RIGMRKTLVFY 
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKIIYS---NGRRIGMRKTLVFYK 130

Query: 125 GRAPHGQKTDWIMHEYRLEDD--NPE-VQIYIQIIIITPLHED-GWVVCRVFKKKSHKPE 184
           GRAPHGQK+DWIMHEYRL+D+  +PE V ++  + II    +D GWVVCR+FKKK+    
Sbjct: 131 GRAPHGQKSDWIMHEYRLDDNIISPEDVTVHEVVSIIGEASQDEGWVVCRIFKKKN---- 190

Query: 185 VPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNH-------------MQEPYNNDYSF 244
                 L    ++ +G +S S  G     PK  ++              M      + + 
Sbjct: 191 ------LHKTLNSPVGGASLSGGG---DTPKTTSSQIFNEDTLDQFLELMGRSCKEELNL 250

Query: 245 DGCMQLPQLFSPESSVATPISLNAAATVECPQNIWRLSCGLVQHERLNTDWSFLNRLLA- 304
           D  M+LP L SP S             V  P     +    V      T W+ L+RL+A 
Sbjct: 251 DPFMKLPNLESPNSQAIN------NCHVSSPDTNHNIHVSNVVDTSFVTSWAALDRLVAS 310

Query: 305 -LDQQSRSKSTLSDDLTMGHSNSRKFSFPFPYP 319
            L+  +    T  ++  +GH +    S   PYP
Sbjct: 311 QLNGPTSYSITAVNESHVGHDHLALPSVRSPYP 320

BLAST of ClCG09G022950 vs. TAIR10
Match: AT1G32770.1 (AT1G32770.1 NAC domain containing protein 12)

HSP 1 Score: 261.2 bits (666), Expect = 9.0e-70
Identity = 150/310 (48.39%), Positives = 197/310 (63.55%), Query Frame = 1

Query: 5   NGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 64
           NGQ   VPPGFRFHPT+EELL+YYLRKKV+ + I+LDVIREVDLNKLEPWD++++CRIGS
Sbjct: 11  NGQS-KVPPGFRFHPTEEELLHYYLRKKVNSQKIDLDVIREVDLNKLEPWDIQEECRIGS 70

Query: 65  GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNS-KRIGMRKTLVFY 124
             QN+WYFFSHKDKKYPTGTRTNRAT AGFWKATGRDK I    C+  +RIG+RKTLVFY
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATVAGFWKATGRDKII----CSCVRRIGLRKTLVFY 130

Query: 125 TGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPL--HEDGWVVCRVFKKKSHK--- 184
            GRAPHGQK+DWIMHEYRL DD P    Y  ++   P+  +E+GWVVCRVF+KK+++   
Sbjct: 131 KGRAPHGQKSDWIMHEYRL-DDTPMSNGYADVVTEDPMSYNEEGWVVCRVFRKKNYQKID 190

Query: 185 ----------PEVPQEQQLDYYTHTKLGSSSASVM------GAEMGEPKNNNNHMQEPYN 244
                     P+  +E++   + +T+  +    V+      G+ +  P++        + 
Sbjct: 191 DCPKITLSSLPDDTEEEKGPTFHNTQNVTGLDHVLLYMDRTGSNICMPESQTT---TQHQ 250

Query: 245 NDYSFDGCMQLPQLFSPES------SVATPISLNAAATVECPQNIWRLSCGLVQHERLNT 287
           +D  F   MQLP L +P+S      S  TP  L+ +   E            +    + +
Sbjct: 251 DDVLF---MQLPSLETPKSESPVDQSFLTPSKLDFSPVQE-----------KITERPVCS 297

BLAST of ClCG09G022950 vs. TAIR10
Match: AT4G10350.1 (AT4G10350.1 NAC domain containing protein 70)

HSP 1 Score: 260.8 bits (665), Expect = 1.2e-69
Identity = 159/361 (44.04%), Positives = 200/361 (55.40%), Query Frame = 1

Query: 11  VPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEW 70
           VPPGFRFHPTDEELL+YYL+KK+SY+  E++VIREVDLNKLEPWDL+++C+IGS  QNEW
Sbjct: 9   VPPGFRFHPTDEELLHYYLKKKISYQKFEMEVIREVDLNKLEPWDLQERCKIGSTPQNEW 68

Query: 71  YFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYTGRAPHG 130
           YFFSHKD+KYPTG+RTNRAT AGFWKATGRDK I     + K+IGMRKTLVFY GRAPHG
Sbjct: 69  YFFSHKDRKYPTGSRTNRATHAGFWKATGRDKCIRN---SYKKIGMRKTLVFYKGRAPHG 128

Query: 131 QKTDWIMHEYRLED-DNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHKPEVPQEQQLDY 190
           QKTDWIMHEYRLED D+P+              EDGWVVCRVF KK            + 
Sbjct: 129 QKTDWIMHEYRLEDADDPQAN----------PSEDGWVVCRVFMKK------------NL 188

Query: 191 YTHTKLGSSSASVMGAEMGEPKNNNN--------HMQEPY-----NNDYSFD------GC 250
           +     GSSS + +     +  NNN+        H   PY     +   +F+        
Sbjct: 189 FKVVNEGSSSINSLDQHNHDASNNNHALQARSFMHRDSPYQLVRNHGAMTFELNKPDLAL 248

Query: 251 MQLPQLFSPESSVATPISLNAAATVE------------CPQNIWRLSCGLV---QHERLN 310
            Q P +F    S+    S   A   E            C   +   +C  V    H++  
Sbjct: 249 HQYPPIFHKPPSLGFDYSSGLARDSESAASEGLQYQQACEPGLDVGTCETVASHNHQQGL 308

Query: 311 TDWSFLNRLLA--LDQQSRSKSTLSDDLTMGHSNSRKFSFPFP--YPYPLASGADFIKFS 333
            +W+ ++RL+   +  +  S+    +D   G++NS     P P      L S  DF  +S
Sbjct: 309 GEWAMMDRLVTCHMGNEDSSRGITYED---GNNNSSSVVQPVPATNQLTLRSEMDFWGYS 341

BLAST of ClCG09G022950 vs. TAIR10
Match: AT4G36160.1 (AT4G36160.1 NAC domain containing protein 76)

HSP 1 Score: 258.5 bits (659), Expect = 5.8e-69
Identity = 155/348 (44.54%), Positives = 194/348 (55.75%), Query Frame = 1

Query: 7   QQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGH 66
           Q  SVPPGFRFHPTDEEL+ YYLRKKV+ + I+LDVIR++DL ++EPWDL++ CRIG   
Sbjct: 6   QSCSVPPGFRFHPTDEELVGYYLRKKVASQKIDLDVIRDIDLYRIEPWDLQESCRIGYEE 65

Query: 67  QNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTLVFYTGR 126
           +NEWYFFSHKDKKYPTGTRTNRAT AGFWKATGRDK ++     SK IGMRKTLVFY GR
Sbjct: 66  RNEWYFFSHKDKKYPTGTRTNRATMAGFWKATGRDKAVYD---KSKLIGMRKTLVFYKGR 125

Query: 127 APHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSHKPEVPQ-EQ 186
           AP+GQKTDWIMHEYRLE D              P  E+GWVVCR FKKK    +    E 
Sbjct: 126 APNGQKTDWIMHEYRLESDEN-----------APPQEEGWVVCRAFKKKPMTGQAKNTET 185

Query: 187 QLDYYTHTKLGSSSASVMGAEMGEPKNN-NNHMQEPYNNDYSF----------------D 246
               Y + +L S   SV      EP N  +   Q  +  D  F                D
Sbjct: 186 WSSSYFYDELPSGVRSVT-----EPLNYVSKQKQNVFAQDLMFKQELEGSDIGLNFIHCD 245

Query: 247 GCMQLPQLFSPESSVA-TPISLNAAATVECPQNIWR-------------LSCGLVQHER- 306
             +QLPQL SP   +   P+SL +  ++E  +NI++             +S G    ++ 
Sbjct: 246 QFIQLPQLESPSLPLTKRPVSLTSITSLEKNKNIYKRHLIEEDVSFNALISSGNKDKKKK 305

Query: 307 ----LNTDWSFLNRLLALDQQSRSKSTL-------SDDLTMGHSNSRK 311
               + TDW  L++ +A    S+             D+  +GH N+ +
Sbjct: 306 KTSVMTTDWRALDKFVASQLMSQEDGVSGFGGHHEEDNNKIGHYNNEE 334

BLAST of ClCG09G022950 vs. NCBI nr
Match: gi|449461154|ref|XP_004148307.1| (PREDICTED: NAC domain-containing protein 76-like [Cucumis sativus])

HSP 1 Score: 562.0 bits (1447), Expect = 7.0e-157
Identity = 288/342 (84.21%), Positives = 295/342 (86.26%), Query Frame = 1

Query: 1   MMPENGQQL-SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60
           MMPEN QQL SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK
Sbjct: 1   MMPENEQQLVSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60

Query: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIH--TGNCNSKRI-GM 120
           CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIH  + N NSKRI GM
Sbjct: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHMSSSNSNSKRIIGM 120

Query: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKS 180
           RKTLVFYTGRAPHGQKTDWIMHEYRLE  NPEVQ            EDGWVVCRVFKKKS
Sbjct: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEHHNPEVQ------------EDGWVVCRVFKKKS 180

Query: 181 HKPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPY-NNDYSFDGCMQLPQ 240
            K EVP+EQQLDYY HTKLG SS S +G EMGEPKNNNNHMQEP+ NNDYSFDGCMQLPQ
Sbjct: 181 QKSEVPEEQQLDYYAHTKLGGSSGSAVGTEMGEPKNNNNHMQEPHNNNDYSFDGCMQLPQ 240

Query: 241 LFSPESSVA---TPISLNAA-ATVECPQNIWRLSCGLVQHERLN-TDWSFLNRLLALDQQ 300
           LFSPESS       ISLNAA A VECPQNIWRLSCG+VQHERLN TDWSFLNRLLALDQQ
Sbjct: 241 LFSPESSTVPTLPAISLNAAGAAVECPQNIWRLSCGVVQHERLNTTDWSFLNRLLALDQQ 300

Query: 301 SRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
           SRSKSTLSD+LT+    SR FSFPFPYPY L SG DFIKFSK
Sbjct: 301 SRSKSTLSDELTI----SRNFSFPFPYPYHLPSGPDFIKFSK 326

BLAST of ClCG09G022950 vs. NCBI nr
Match: gi|659092435|ref|XP_008447060.1| (PREDICTED: NAC domain-containing protein 76 [Cucumis melo])

HSP 1 Score: 560.5 bits (1443), Expect = 2.0e-156
Identity = 287/341 (84.16%), Positives = 294/341 (86.22%), Query Frame = 1

Query: 1   MMPENGQQL-SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60
           MMPEN QQL SVPPGFRFHPTDEELLYYYLRKKVS+EAIELDVIREVDLNKLEPWDLKDK
Sbjct: 1   MMPENEQQLVSVPPGFRFHPTDEELLYYYLRKKVSFEAIELDVIREVDLNKLEPWDLKDK 60

Query: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIH--TGNCNSKRI-GM 120
           CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIH  + N NSKRI GM
Sbjct: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHMSSSNSNSKRIIGM 120

Query: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKS 180
           RKTLVFYTGRAPHGQKTDWIMHEYRLE  +PEVQ            EDGWVVCRVFKKKS
Sbjct: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEHHDPEVQ------------EDGWVVCRVFKKKS 180

Query: 181 HKPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPY-NNDYSFDGCMQLPQ 240
            KPEVP+EQ LDYY HTKLG SS S  G  MGEPKNNNNHMQEP+ NNDYSFDGCMQLPQ
Sbjct: 181 QKPEVPEEQHLDYYAHTKLGGSSGSAEGTGMGEPKNNNNHMQEPHNNNDYSFDGCMQLPQ 240

Query: 241 LFSPESS--VATPISLN-AAATVECPQNIWRLSCGLVQHERLN-TDWSFLNRLLALDQQS 300
           LFSPESS     PISLN AAA VECPQNIWRLSCG+VQHERLN TDWSFLNRLLALDQQS
Sbjct: 241 LFSPESSSVPTLPISLNAAAAAVECPQNIWRLSCGVVQHERLNTTDWSFLNRLLALDQQS 300

Query: 301 RSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
           RSKSTLSDDLT+    SR FSFPFPYPY L SG DFIKFSK
Sbjct: 301 RSKSTLSDDLTI----SRNFSFPFPYPYSLPSGPDFIKFSK 325

BLAST of ClCG09G022950 vs. NCBI nr
Match: gi|925170223|gb|ALC79028.1| (NAC transcription factors 51 [Manihot esculenta])

HSP 1 Score: 408.7 bits (1049), Expect = 9.9e-111
Identity = 227/346 (65.61%), Positives = 256/346 (73.99%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           MM  NGQ L+VPPGFRFHPTDEELLYYYL+KKVSYEAI+LDVIRE+DLNKLEPWDLKDKC
Sbjct: 1   MMAGNGQ-LAVPPGFRFHPTDEELLYYYLKKKVSYEAIDLDVIRELDLNKLEPWDLKDKC 60

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH    NSKRIGMRKTL
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLS--NSKRIGMRKTL 120

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH-KP 180
           VFYTGRAPHGQKTDWIMHEYRL+DDN EVQ            EDGWVVCRVFKKK+  + 
Sbjct: 121 VFYTGRAPHGQKTDWIMHEYRLDDDNSEVQ------------EDGWVVCRVFKKKNQSRG 180

Query: 181 EVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQLFSP 240
            +P   Q D+++H K+ SSSAS+        ++  NHMQ  Y  D SFDG M LPQLFSP
Sbjct: 181 FLPDVAQDDHFSHMKVSSSSASM--------EHKQNHMQALY--DCSFDGSMHLPQLFSP 240

Query: 241 ESSVA----TPISLNAAATVECPQNIWRLS---CGLVQHERLNTDWSFLNRLLA----LD 300
           ES+VA    TP+ LN     EC QN+ RL+   CGLVQ ER+N+DWSFL++LLA    LD
Sbjct: 241 ESAVAPSFITPLPLNTMDINECSQNLLRLTSSGCGLVQPERVNSDWSFLDKLLASHQSLD 300

Query: 301 QQSRSKSTLSDDLT--MGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
           QQS+SK   S  L   MG S+ +KF+  FPY   L    D +KF K
Sbjct: 301 QQSQSKRDPSFHLVDHMGASSHQKFT-TFPY---LGCENDILKFPK 317
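
The percentages in an HSP header are the number of matching alignment columns divided by the alignment length, which includes gap columns (346 here, longer than the 333-residue query). The following is a minimal Python sketch, not part of the original page, that parses such a header and recomputes the percentages. The names `parse_hsp_fractions` and `percent` are illustrative, and the header string is copied from the Manihot esculenta HSP above.

```python
import re

# Header line copied from the Manihot esculenta HSP above.
HSP_HEADER = "Identity = 227/346 (65.61%), Positives = 256/346 (73.99%), Query Frame = 1"

# Matches "<label> = <matches>/<alignment length> (<percent>%)".
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_fractions(line):
    """Return {label: (matches, length, reported_percent)} for one header line."""
    return {
        label: (int(matches), int(length), float(pct))
        for label, matches, length, pct in FRACTION.findall(line)
    }


def percent(matches, length):
    """Share of alignment columns as a percentage, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


stats = parse_hsp_fractions(HSP_HEADER)
for label, (matches, length, reported) in stats.items():
    # Print the recomputed value next to the value reported on the page.
    print(label, percent(matches, length), reported)
```

The same check applies to the Theobroma cacao hits, where 225/351 and 258/351 give the reported 64.10% and 73.50%.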

BLAST of ClCG09G022950 vs. NCBI nr
Match: gi|590613952|ref|XP_007022813.1| (NAC domain transcriptional regulator superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 408.3 bits (1048), Expect = 1.3e-110
Identity = 225/351 (64.10%), Positives = 258/351 (73.50%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           M+P NGQ LSVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLKDKC
Sbjct: 19  MLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLKDKC 78

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH   CNSK+IGMRKTL
Sbjct: 79  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHL--CNSKKIGMRKTL 138

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH--- 180
           VFYTGRAPHGQKTDWIMHEYRL+DD+ +VQ            EDGWVVCRVFKKK+H   
Sbjct: 139 VFYTGRAPHGQKTDWIMHEYRLDDDDSDVQ------------EDGWVVCRVFKKKNHSRG 198

Query: 181 --KPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQ 240
             +PE  QE+    +TH K  +SSA +        +  +NH+Q  Y  D+SFDG MQLP 
Sbjct: 199 NFQPEFSQEES---FTHIKTVASSAQL--------ETRHNHLQALY--DFSFDGSMQLPH 258

Query: 241 LFSPESSVA----TPISLNAAATVECPQNIWRLS----CGLVQHERLNTDWSFLNRLLA- 300
           LFSPES+VA    +P+SLN +  +EC QN+ RL+    CGLVQ ER N +WSFL++LLA 
Sbjct: 259 LFSPESAVASSFISPVSLN-STDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLAT 318

Query: 301 ----LDQQ-SRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
               +DQQ S+ K T S  + +G S  +   FPF Y   L   AD +KFSK
Sbjct: 319 HHLSVDQQHSQGKCTPSSQVDVGTSTQK---FPFQY---LGCEADILKFSK 334

BLAST of ClCG09G022950 vs. NCBI nr
Match: gi|590613955|ref|XP_007022814.1| (NAC domain transcriptional regulator superfamily protein isoform 2 [Theobroma cacao])

HSP 1 Score: 408.3 bits (1048), Expect = 1.3e-110
Identity = 225/351 (64.10%), Positives = 258/351 (73.50%), Query Frame = 1

Query: 1   MMPENGQQLSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 60
           M+P NGQ LSVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLKDKC
Sbjct: 1   MLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLKDKC 60

Query: 61  RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHTGNCNSKRIGMRKTL 120
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH   CNSK+IGMRKTL
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHL--CNSKKIGMRKTL 120

Query: 121 VFYTGRAPHGQKTDWIMHEYRLEDDNPEVQIYIQIIIITPLHEDGWVVCRVFKKKSH--- 180
           VFYTGRAPHGQKTDWIMHEYRL+DD+ +VQ            EDGWVVCRVFKKK+H   
Sbjct: 121 VFYTGRAPHGQKTDWIMHEYRLDDDDSDVQ------------EDGWVVCRVFKKKNHSRG 180

Query: 181 --KPEVPQEQQLDYYTHTKLGSSSASVMGAEMGEPKNNNNHMQEPYNNDYSFDGCMQLPQ 240
             +PE  QE+    +TH K  +SSA +        +  +NH+Q  Y  D+SFDG MQLP 
Sbjct: 181 NFQPEFSQEES---FTHIKTVASSAQL--------ETRHNHLQALY--DFSFDGSMQLPH 240

Query: 241 LFSPESSVA----TPISLNAAATVECPQNIWRLS----CGLVQHERLNTDWSFLNRLLA- 300
           LFSPES+VA    +P+SLN +  +EC QN+ RL+    CGLVQ ER N +WSFL++LLA 
Sbjct: 241 LFSPESAVASSFISPVSLN-STDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLAT 300

Query: 301 ----LDQQ-SRSKSTLSDDLTMGHSNSRKFSFPFPYPYPLASGADFIKFSK 333
               +DQQ S+ K T S  + +G S  +   FPF Y   L   AD +KFSK
Sbjct: 301 HHLSVDQQHSQGKCTPSSQVDVGTSTQK---FPFQY---LGCEADILKFSK 316

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
SMB_ARATH    6.9e-88  49.08     Protein SOMBRERO OS=Arabidopsis thaliana GN=SMB PE=1 SV=1
NAC76_ORYSJ  2.4e-80  58.18     NAC domain-containing protein 76 OS=Oryza sativa subsp. japonica GN=NAC76 PE=2 S...
NAC43_ARATH  1.1e-69  47.15     NAC domain-containing protein 43 OS=Arabidopsis thaliana GN=NAC043 PE=2 SV=2
NAC12_ARATH  1.6e-68  48.39     NAC domain-containing protein 12 OS=Arabidopsis thaliana GN=NAC012 PE=2 SV=1
BRN2_ARATH   2.1e-68  44.04     Protein BEARSKIN2 OS=Arabidopsis thaliana GN=BRN2 PE=2 SV=1

Match Name        E-value   Identity  Description
A0A0A0K475_CUCSA  4.9e-157  84.21     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G252700 PE=4 SV=1
A0A0M4FKZ1_MANES  6.9e-111  65.61     NAC transcription factors 51 OS=Manihot esculenta PE=2 SV=1
A0A061FBQ8_THECC  9.0e-111  64.10     NAC domain transcriptional regulator superfamily protein isoform 1 OS=Theobroma ...
A0A061FAG5_THECC  9.0e-111  64.10     NAC domain transcriptional regulator superfamily protein isoform 2 OS=Theobroma ...
B9S679_RICCO      1.9e-108  64.64     NAC domain-containing protein, putative OS=Ricinus communis GN=RCOM_0674920 PE=4...

Match Name   E-value  Identity  Description
AT1G79580.1  3.9e-89  49.08     NAC (No Apical Meristem) domain transcriptional regulator superfamil...
AT2G46770.1  6.2e-71  47.15     NAC (No Apical Meristem) domain transcriptional regulator superfamil...
AT1G32770.1  9.0e-70  48.39     NAC domain containing protein 12
AT4G10350.1  1.2e-69  44.04     NAC domain containing protein 70
AT4G36160.1  5.8e-69  44.54     NAC domain containing protein 76

Match Name                        E-value   Identity  Description
gi|449461154|ref|XP_004148307.1|  7.0e-157  84.21     PREDICTED: NAC domain-containing protein 76-like [Cucumis sativus]
gi|659092435|ref|XP_008447060.1|  2.0e-156  84.16     PREDICTED: NAC domain-containing protein 76 [Cucumis melo]
gi|925170223|gb|ALC79028.1|       9.9e-111  65.61     NAC transcription factors 51 [Manihot esculenta]
gi|590613952|ref|XP_007022813.1|  1.3e-110  64.10     NAC domain transcriptional regulator superfamily protein isoform 1 [Theobroma ca...
gi|590613955|ref|XP_007022814.1|  1.3e-110  64.10     NAC domain transcriptional regulator superfamily protein isoform 2 [Theobroma ca...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR003441  NAC-dom
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007275 multicellular organism development
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0048829 root cap development
biological_process GO:0044210 'de novo' CTP biosynthetic process
biological_process GO:0000478 endonucleolytic cleavage involved in rRNA processing
biological_process GO:0006541 glutamine metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005829 cytosol
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0003883 CTP synthase activity
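
Each row of the GO assignment table has three whitespace-separated fields: category, term accession, and a term name that may itself contain spaces. As a rough illustration only, the Python sketch below groups the rows by category. The function name `group_go_terms` is hypothetical, and the rows are copied from the table above.

```python
from collections import defaultdict

# Rows copied verbatim from the GO assignment table above.
GO_ROWS = """\
biological_process GO:0007275 multicellular organism development
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0048829 root cap development
biological_process GO:0044210 'de novo' CTP biosynthetic process
biological_process GO:0000478 endonucleolytic cleavage involved in rRNA processing
biological_process GO:0006541 glutamine metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005829 cytosol
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0003883 CTP synthase activity
"""


def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first two runs of whitespace only, so that
        # multi-word term names stay intact in the third field.
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


grouped = group_go_terms(GO_ROWS)
for category, terms in grouped.items():
    print(category, len(terms))
```

Splitting with `maxsplit=2` is the one design choice that matters here, because names such as "regulation of transcription, DNA-templated" would otherwise be broken apart.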

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG09G022950.1  ClCG09G022950.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term     Source Description   Alignment
IPR003441  NAC domain        PFAM     PF02365         NAM                  coord: 12..142, score: 3.2
IPR003441  NAC domain        PROFILE  PS51005         NAC                  coord: 11..174, score: 58
IPR003441  NAC domain        unknown  SSF101941       NAC domain           coord: 5..174, score: 1.96
None       No IPR available  PANTHER  PTHR31989       FAMILY NOT NAMED     coord: 6..289, score: 1.5E
None       No IPR available  PANTHER  PTHR31989:SF16  SUBFAMILY NOT NAMED  coord: 6..289, score: 1.5E