Cp4.1LG15g09030 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG15g09030
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: NAC domain-containing protein, putative
Location: Cp4.1LG15 : 8794874 .. 8796759 (+)
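
As a rough illustration of how the location field can be read, the sketch below splits it into sequence ID, start, end and strand, then computes the genomic span. The names `Location` and `parse_location` are hypothetical helpers, and the span calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a genomic location."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string shaped like 'Cp4.1LG15 : 8794874 .. 8796759 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG15 : 8794874 .. 8796759 (+)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

Under that assumption the feature covers 1886 positions on the plus strand of Cp4.1LG15.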
The following sequences are available for this feature:

Gene sequence (with intron)

ATAATTATTTATCCTTTATTTCTGTGCCCATCTTTGATCGGTTGTCAGTCAATCTTAACCTTTGCGCTAACGTTTCTTTCTCTCTCTCTCTCTATCCCTTTTTATATTCACGTCGTGCCGTTGAAGGCAACTTCTCAATACCAATCTGCAAAGGATAATTACTTTCTCTTCTTCCTCTCCCAGGTTTGCCCTTTCAAGCTTGCATTTAACTATTTGCTCCATTAATGCAAGCTTGTTATAACTGCTTGTTCATCATTTTCTGAATACACAGACAAAACCCCATCAAGGGACGATGATGCCCGAAAATGGCCAACACTTCAGTGTTCCTCCAGGCTTTAGGTTCCATCCAACAGACGAGGAGCTTCTTTATTACTACCTCAGGAAGAAGGTTTCCTACGAGGCCATTGAGCTTGATGTTATCAGAGAAGTGGATCTAAACAAACTGGAGCCTTGGGACCTCAAAGGTTGCCCATTCATTATTCTTCATACTTGCCAATTTATTAAGCTTTGATTTGACGAACAATGANTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTTTTTTTTTTCGGGGGGTGGGTTCAGATAAATGTAGAATTGGATCTGGGCACCAGAACGAGTGGTATTTCTTTAGCCATAAGGACAAGAAATACCCAACTGGAACTCGAACTAATAGAGCCACCAGTGCTGGATTCTGGAAGGCAACCGGGAGAGACAAAGGCATTCACATGGCCAATTCCAAGAGGATTGGCATGAGGAAGACACTCGTGTTCTATACCGGTCGTGCTCCTCATGGACAAAAGACTGATTGGATCATGCATGAATACCGGCTCGAAGATCAGGATCCCGAAATTCAGGTACTAATCGTCCTATTTCCTTCCTTATATTCAGCTATGTGGGCATTTAAATGTATTTTATGATTCAAATCCATATGTCAAAACACAAATAACTGAAGCTGAAGAAAGGGAGAGTTTTTTTTTTTTTTTAACTTATTAACTCACATTTTATTTGATAGACGCAGGAAGATGGGTGGGTGGTGTGCAGGGTTTTCAAGAAGAAAAGTCAGAAGGCAGAGGGCTTGGAGTACCATGCGCATACGAAGGTGGGGGGCAGTAGTTCTGGTTCCGCCGCCTTGATGGGTGCAGAAATAGGAGAGCCGAAAAGGCATAACCATACGCAGCAGCCATATAATGAGTATGGGTTGGATGGGTGCATGCAGCTGCCGCAGCTGTTTAGTCCGGACTCAGCAGTGGCAACACCCGTCTCCATAGAGTGTCCTCAAAATATGTGGAGGCTAAATTGCGGGGGTGTGCAACAGGAGCGGTTGAACACAGATTGGTCATTCTTGAGTAGGCTGCTTGCTTCGGATCACCAATCCCGTACCAAATCTGCGCTCCCAGATCAGCTTAGTGTGGGACACCCCAACTCCACAATGTTTCCATTTCCATTTCCATTTCCATTTCCCTACCCTTATCATCTTCCCTCCGCCGCCGACTCATTCAAATTCTCCAAGTAGGCCTACAGCTTTTCACTTTCTTAGTTTAACATTATGTGCAGTTTATTTAATACCCTACATCACTTCACCTATGCTATTTTTCCATTTTACTGTTCCTAACTCCGCACTTAATAATGGATTGCTTTTTAAACCTCAATTTCACACTCTTTATTAACAAACTATAAGAAGTGAATAATATCATATCGTTATTTCTAGTATATTATACTTTTAATTTACCTTTCTTAAAATATTTTTGAGGTTGCTGAAATGAAAAATATATGAAAATATAATCGAAAGTTTATAATCGAAAGTATATGAAAAAGATACTTTCATAAAATCATGTTTGAGTATGCTCTTACAAAATTCGAATAAATAATTAAGTTGGATCCAAAA

mRNA sequence

ATAATTATTTATCCTTTATTTCTGTGCCCATCTTTGATCGGTTGTCAGTCAATCTTAACCTTTGCGCTAACGTTTCTTTCTCTCTCTCTCTCTATCCCTTTTTATATTCACGTCGTGCCGTTGAAGGCAACTTCTCAATACCAATCTGCAAAGGATAATTACTTTCTCTTCTTCCTCTCCCAGACAAAACCCCATCAAGGGACGATGATGCCCGAAAATGGCCAACACTTCAGTGTTCCTCCAGGCTTTAGGTTCCATCCAACAGACGAGGAGCTTCTTTATTACTACCTCAGGAAGAAGGTTTCCTACGAGGCCATTGAGCTTGATGTTATCAGAGAAGTGGATCTAAACAAACTGGAGCCTTGGGACCTCAAAGATAAATGTAGAATTGGATCTGGGCACCAGAACGAGTGGTATTTCTTTAGCCATAAGGACAAGAAATACCCAACTGGAACTCGAACTAATAGAGCCACCAGTGCTGGATTCTGGAAGGCAACCGGGAGAGACAAAGGCATTCACATGGCCAATTCCAAGAGGATTGGCATGAGGAAGACACTCGTGTTCTATACCGGTCGTGCTCCTCATGGACAAAAGACTGATTGGATCATGCATGAATACCGGCTCGAAGATCAGGATCCCGAAATTCAGACGCAGGAAGATGGGTGGGTGGTGTGCAGGGTTTTCAAGAAGAAAAGTCAGAAGGCAGAGGGCTTGGAGTACCATGCGCATACGAAGGTGGGGGGCAGTAGTTCTGGTTCCGCCGCCTTGATGGGTGCAGAAATAGGAGAGCCGAAAAGGCATAACCATACGCAGCAGCCATATAATGAGTATGGGTTGGATGGGTGCATGCAGCTGCCGCAGCTGTTTAGTCCGGACTCAGCAGTGGCAACACCCGTCTCCATAGAGTGTCCTCAAAATATGTGGAGGCTAAATTGCGGGGGTGTGCAACAGGAGCGGTTGAACACAGATTGGTCATTCTTGAGTAGGCTGCTTGCTTCGGATCACCAATCCCGTACCAAATCTGCGCTCCCAGATCAGCTTAGTGTGGGACACCCCAACTCCACAATGTTTCCATTTCCATTTCCATTTCCATTTCCCTACCCTTATCATCTTCCCTCCGCCGCCGACTCATTCAAATTCTCCAAGTAGGCCTACAGCTTTTCACTTTCTTAGTTTAACATTATGTGCAGTTTATTTAATACCCTACATCACTTCACCTATGCTATTTTTCCATTTTACTGTTCCTAACTCCGCACTTAATAATGGATTGCTTTTTAAACCTCAATTTCACACTCTTTATTAACAAACTATAAGAAGTGAATAATATCATATCGTTATTTCTAGTATATTATACTTTTAATTTACCTTTCTTAAAATATTTTTGAGGTTGCTGAAATGAAAAATATATGAAAATATAATCGAAAGTTTATAATCGAAAGTATATGAAAAAGATACTTTCATAAAATCATGTTTGAGTATGCTCTTACAAAATTCGAATAAATAATTAAGTTGGATCCAAAA

Coding sequence (CDS)

ATAATTATTTATCCTTTATTTCTGTGCCCATCTTTGATCGGTTGTCAGTCAATCTTAACCTTTGCGCTAACGTTTCTTTCTCTCTCTCTCTCTATCCCTTTTTATATTCACGTCGTGCCGTTGAAGGCAACTTCTCAATACCAATCTGCAAAGGATAATTACTTTCTCTTCTTCCTCTCCCAGACAAAACCCCATCAAGGGACGATGATGCCCGAAAATGGCCAACACTTCAGTGTTCCTCCAGGCTTTAGGTTCCATCCAACAGACGAGGAGCTTCTTTATTACTACCTCAGGAAGAAGGTTTCCTACGAGGCCATTGAGCTTGATGTTATCAGAGAAGTGGATCTAAACAAACTGGAGCCTTGGGACCTCAAAGATAAATGTAGAATTGGATCTGGGCACCAGAACGAGTGGTATTTCTTTAGCCATAAGGACAAGAAATACCCAACTGGAACTCGAACTAATAGAGCCACCAGTGCTGGATTCTGGAAGGCAACCGGGAGAGACAAAGGCATTCACATGGCCAATTCCAAGAGGATTGGCATGAGGAAGACACTCGTGTTCTATACCGGTCGTGCTCCTCATGGACAAAAGACTGATTGGATCATGCATGAATACCGGCTCGAAGATCAGGATCCCGAAATTCAGACGCAGGAAGATGGGTGGGTGGTGTGCAGGGTTTTCAAGAAGAAAAGTCAGAAGGCAGAGGGCTTGGAGTACCATGCGCATACGAAGGTGGGGGGCAGTAGTTCTGGTTCCGCCGCCTTGATGGGTGCAGAAATAGGAGAGCCGAAAAGGCATAACCATACGCAGCAGCCATATAATGAGTATGGGTTGGATGGGTGCATGCAGCTGCCGCAGCTGTTTAGTCCGGACTCAGCAGTGGCAACACCCGTCTCCATAGAGTGTCCTCAAAATATGTGGAGGCTAAATTGCGGGGGTGTGCAACAGGAGCGGTTGAACACAGATTGGTCATTCTTGAGTAGGCTGCTTGCTTCGGATCACCAATCCCGTACCAAATCTGCGCTCCCAGATCAGCTTAGTGTGGGACACCCCAACTCCACAATGTTTCCATTTCCATTTCCATTTCCATTTCCCTACCCTTATCATCTTCCCTCCGCCGCCGACTCATTCAAATTCTCCAAGTAG

Protein sequence

IIIYPLFLCPSLIGCQSILTFALTFLSLSLSIPFYIHVVPLKATSQYQSAKDNYFLFFLSQTKPHQGTMMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAEGLEYHAHTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVATPVSIECPQNMWRLNCGGVQQERLNTDWSFLSRLLASDHQSRTKSALPDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK
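
The protein sequence above appears to be a frame-1 translation of the listed coding sequence. The sketch below shows how a few codons can be checked by hand. It assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not part of any tool used to build this record, and only short excerpts copied from the start and end of the coding sequence are translated.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, stop_at_stop: bool = True) -> str:
    """Translate DNA in frame 1; codons with ambiguous bases become 'X'."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*" and stop_at_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the coding sequence listed above.
print(translate("ATAATTATTTATCCTTTATTT"))
# Last 12 bases of the coding sequence, ending in the TAG stop codon.
print(translate("TTCTCCAAGTAG"))
```

The two excerpts translate to `IIIYPLF` and `FSK`, which match the first seven and last three residues of the protein sequence.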
BLAST of Cp4.1LG15g09030 vs. Swiss-Prot
Match: SMB_ARATH (Protein SOMBRERO OS=Arabidopsis thaliana GN=SMB PE=1 SV=1)

HSP 1 Score: 323.2 bits (827), Expect = 3.9e-87
Identity = 188/369 (50.95%), Positives = 226/369 (61.25%), Query Frame = 1

Query: 74  GQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSG 133
           G   SVPPGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG
Sbjct: 12  GGQLSVPPGFRFHPTEEELLYYYLKKKVSYEPIDLDVIREVDLNKLEPWELKEKCRIGSG 71

Query: 134 HQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGRA 193
            QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH+ +SK+IG+RKTLVFYTGRA
Sbjct: 72  PQNEWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKSIHLNSSKKIGLRKTLVFYTGRA 131

Query: 194 PHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKS------QKAEGLEYHAHT--- 253
           PHGQKT+WIMHEYRL+D + EI  QEDGWVVCRVFKKK+      Q+ E   +H H    
Sbjct: 132 PHGQKTEWIMHEYRLDDSENEI--QEDGWVVCRVFKKKNHFRGFHQEQEQDHHHHHQYIS 191

Query: 254 ---------KVGGSSSGSAALMGAEIGEPKRHNH----TQQPYNEYG---LDGCMQLPQL 313
                     +  +S+  + L+   +     H+H       P +E+      G M LPQL
Sbjct: 192 TNNDHDHHHHIDSNSNNHSPLILHPLDHHHHHHHIGRQIHMPLHEFANTLSHGSMHLPQL 251

Query: 314 FSPDSAVA------------TPVS---IECPQNMWRLNCGGVQQERLNTDWSFLSRLLAS 373
           FSPDSA A            +P++   IEC QN+ RL            DWSFL +LL +
Sbjct: 252 FSPDSAAAAAAAAASAQPFVSPINTTDIECSQNLLRL----TSNNNYGGDWSFLDKLLTT 311

Query: 374 ------------DHQSRTKSAL--------PDQLSVGHPNSTMFPFPFPFPFPYPYHLPS 383
                       +HQ++    L         D L   +  S+  P    FPF Y   L +
Sbjct: 312 GNMNQQQQQQVQNHQAKCFGDLSNNDNNDQADHLGNNNGGSSSSPVNQRFPFHY---LGN 371
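
The percentages in each HSP summary line are simply the match counts divided by the alignment length. The sketch below recomputes them from the summary line of the SMB_ARATH hit above. `HSP_STATS` and `recompute_percentages` are hypothetical names, and the regular expression is an assumption about this page's layout rather than a general BLAST parser; it accepts both the "Positives" and "Postives" spellings of the field label.

```python
import re

# Matches lines like:
#   Identity = 188/369 (50.95%), Positives = 226/369 (61.25%), Query Frame = 1
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 188/369 (50.95%), Positives = 226/369 (61.25%), Query Frame = 1"
print(recompute_percentages(line))
```

For this hit, 188 identical and 226 similar positions over 369 aligned columns give 50.95% and 61.25%, in agreement with the reported values.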

BLAST of Cp4.1LG15g09030 vs. Swiss-Prot
Match: NAC76_ORYSJ (NAC domain-containing protein 76 OS=Oryza sativa subsp. japonica GN=NAC76 PE=2 SV=2)

HSP 1 Score: 307.8 bits (787), Expect = 1.7e-82
Identity = 158/249 (63.45%), Positives = 177/249 (71.08%), Query Frame = 1

Query: 70  MPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCR 129
           M  +G   SVPPGFRFHPTDEELLYYYLRKKV+YEAI+LDVIRE+DLNKLEPWDLKD+CR
Sbjct: 1   MHPSGGALSVPPGFRFHPTDEELLYYYLRKKVAYEAIDLDVIREIDLNKLEPWDLKDRCR 60

Query: 130 IGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFY 189
           IG+G QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK I +AN+ RIGMRKTLVFY
Sbjct: 61  IGTGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIFLANACRIGMRKTLVFY 120

Query: 190 TGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKA-------------E 249
            GRAPHG+KTDWIMHEYRL DQD  +  QEDGWVVCRVF KKS +              +
Sbjct: 121 VGRAPHGKKTDWIMHEYRL-DQD-NVDVQEDGWVVCRVFMKKSYQRGLNPADMAAVDDDD 180

Query: 250 GLEYHAHTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEY----GLDGCMQLPQLFSPD 302
            L +H H        G AA    +      H+H  QP++ Y      D  MQLPQL S D
Sbjct: 181 LLHHHHHPFPPAQLHGGAADHKHDGAGGHHHHHLMQPHHHYDDFPSFDPSMQLPQLMSAD 240

BLAST of Cp4.1LG15g09030 vs. Swiss-Prot
Match: NAC43_ARATH (NAC domain-containing protein 43 OS=Arabidopsis thaliana GN=NAC043 PE=2 SV=2)

HSP 1 Score: 272.7 bits (696), Expect = 6.1e-72
Identity = 153/317 (48.26%), Positives = 196/317 (61.83%), Query Frame = 1

Query: 73  NGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 132
           NGQ   VPPGFRFHPT+EELL YYLRKKV+   I+LDVIR+VDLNKLEPWD+++ C+IG+
Sbjct: 11  NGQS-QVPPGFRFHPTEEELLQYYLRKKVNSIEIDLDVIRDVDLNKLEPWDIQEMCKIGT 70

Query: 133 GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGR 192
             QN+WYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK I+ +N +RIGMRKTLVFY GR
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKIIY-SNGRRIGMRKTLVFYKGR 130

Query: 193 APHGQKTDWIMHEYRLEDQ--DPEIQT------------QEDGWVVCRVFKKKSQKAEGL 252
           APHGQK+DWIMHEYRL+D    PE  T            Q++GWVVCR+FKKK+     L
Sbjct: 131 APHGQKSDWIMHEYRLDDNIISPEDVTVHEVVSIIGEASQDEGWVVCRIFKKKN-----L 190

Query: 253 EYHAHTKVGGSS---------SGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLF 312
               ++ VGG+S         + S+ +   +  +       +    E  LD  M+LP L 
Sbjct: 191 HKTLNSPVGGASLSGGGDTPKTTSSQIFNEDTLDQFLELMGRSCKEELNLDPFMKLPNLE 250

Query: 313 SPDSAVATPVSIECPQNMWRLNCGGVQQERLNTDWSFLSRLLASDHQSRTKSALP--DQL 365
           SP+S       +  P     ++   V      T W+ L RL+AS     T  ++   ++ 
Sbjct: 251 SPNSQAINNCHVSSPDTNHNIHVSNVVDTSFVTSWAALDRLVASQLNGPTSYSITAVNES 310

BLAST of Cp4.1LG15g09030 vs. Swiss-Prot
Match: BRN2_ARATH (Protein BEARSKIN2 OS=Arabidopsis thaliana GN=BRN2 PE=2 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 9.7e-70
Identity = 137/227 (60.35%), Positives = 160/227 (70.48%), Query Frame = 1

Query: 79  VPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEW 138
           VPPGFRFHPTDEELL+YYL+KK+SY+  E++VIREVDLNKLEPWDL+++C+IGS  QNEW
Sbjct: 9   VPPGFRFHPTDEELLHYYLKKKISYQKFEMEVIREVDLNKLEPWDLQERCKIGSTPQNEW 68

Query: 139 YFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGRAPHGQK 198
           YFFSHKD+KYPTG+RTNRAT AGFWKATGRDK I  +  K+IGMRKTLVFY GRAPHGQK
Sbjct: 69  YFFSHKDRKYPTGSRTNRATHAGFWKATGRDKCIRNSY-KKIGMRKTLVFYKGRAPHGQK 128

Query: 199 TDWIMHEYRLED-QDPEIQTQEDGWVVCRVFKKK---------SQKAEGLEYHAHTKVGG 258
           TDWIMHEYRLED  DP+    EDGWVVCRVF KK         S     L+ H H     
Sbjct: 129 TDWIMHEYRLEDADDPQANPSEDGWVVCRVFMKKNLFKVVNEGSSSINSLDQHNH----D 188

Query: 259 SSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAV 296
           +S+ + AL      + +   H   PY      G M   +L  PD A+
Sbjct: 189 ASNNNHAL------QARSFMHRDSPYQLVRNHGAMTF-ELNKPDLAL 223

BLAST of Cp4.1LG15g09030 vs. Swiss-Prot
Match: BRN1_ARATH (Protein BEARSKIN1 OS=Arabidopsis thaliana GN=BRN1 PE=2 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.2e-67
Identity = 121/164 (73.78%), Positives = 140/164 (85.37%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           M   NG    VPPGFRFHPTDEELL+YYL+KK+SYE  E++VI+EVDLNK+EPWDL+D+C
Sbjct: 1   MSSSNG---GVPPGFRFHPTDEELLHYYLKKKISYEKFEMEVIKEVDLNKIEPWDLQDRC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           +IGS  QNEWYFFSHKD+KYPTG+RTNRAT +GFWKATGRDK I   + K+IGMRKTLVF
Sbjct: 61  KIGSTPQNEWYFFSHKDRKYPTGSRTNRATHSGFWKATGRDKCIR-NSYKKIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKS 233
           Y GRAPHGQKTDWIMHEYR+ED + +    EDGWVVCRVFKKK+
Sbjct: 121 YKGRAPHGQKTDWIMHEYRIEDTEDD--PCEDGWVVCRVFKKKN 158

BLAST of Cp4.1LG15g09030 vs. TrEMBL
Match: A0A0A0K475_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G252700 PE=4 SV=1)

HSP 1 Score: 470.7 bits (1210), Expect = 1.7e-129
Identity = 247/339 (72.86%), Positives = 271/339 (79.94%), Query Frame = 1

Query: 69  MMPENGQHF-SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 128
           MMPEN Q   SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK
Sbjct: 1   MMPENEQQLVSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60

Query: 129 CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHM----ANSKR-IGM 188
           CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDK IHM    +NSKR IGM
Sbjct: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHMSSSNSNSKRIIGM 120

Query: 189 RKTLVFYTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAE-----G 248
           RKTLVFYTGRAPHGQKTDWIMHEYRLE  +PE+  QEDGWVVCRVFKKKSQK+E      
Sbjct: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEHHNPEV--QEDGWVVCRVFKKKSQKSEVPEEQQ 180

Query: 249 LEYHAHTKVGGSSSGSAALMGAEIGEPK-RHNHTQQPY--NEYGLDGCMQLPQLFSPDSA 308
           L+Y+AHTK+GG SSGSA  +G E+GEPK  +NH Q+P+  N+Y  DGCMQLPQLFSP+S+
Sbjct: 181 LDYYAHTKLGG-SSGSA--VGTEMGEPKNNNNHMQEPHNNNDYSFDGCMQLPQLFSPESS 240

Query: 309 V----------ATPVSIECPQNMWRLNCGGVQQERLN-TDWSFLSRLLASDHQSRTKSAL 368
                      A   ++ECPQN+WRL+CG VQ ERLN TDWSFL+RLLA D QSR+KS L
Sbjct: 241 TVPTLPAISLNAAGAAVECPQNIWRLSCGVVQHERLNTTDWSFLNRLLALDQQSRSKSTL 300

Query: 369 PDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
            D+L++           F FPFPYPYHLPS  D  KFSK
Sbjct: 301 SDELTISR--------NFSFPFPYPYHLPSGPDFIKFSK 326

BLAST of Cp4.1LG15g09030 vs. TrEMBL
Match: A0A061FBQ8_THECC (NAC domain transcriptional regulator superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_033728 PE=4 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 1.4e-102
Identity = 214/343 (62.39%), Positives = 235/343 (68.51%), Query Frame = 1

Query: 66  QGTMMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLK 125
           +  M+P NGQ  SVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLK
Sbjct: 16  ENKMLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLK 75

Query: 126 DKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKT 185
           DKCRIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH+ NSK+IGMRKT
Sbjct: 76  DKCRIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLCNSKKIGMRKT 135

Query: 186 LVFYTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAEGL------- 245
           LVFYTGRAPHGQKTDWIMHEYRL+D D ++  QEDGWVVCRVFKKK+             
Sbjct: 136 LVFYTGRAPHGQKTDWIMHEYRLDDDDSDV--QEDGWVVCRVFKKKNHSRGNFQPEFSQE 195

Query: 246 EYHAHTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVA-- 305
           E   H K   SS+              RHNH Q  Y ++  DG MQLP LFSP+SAVA  
Sbjct: 196 ESFTHIKTVASSAQLET----------RHNHLQALY-DFSFDGSMQLPHLFSPESAVASS 255

Query: 306 --TPVS-----IECPQNMWRL----NCGGVQQERLNTDWSFLSRLLASDH------QSRT 365
             +PVS     IEC QN+ RL     CG VQQER N +WSFL +LLA+ H       S+ 
Sbjct: 256 FISPVSLNSTDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLATHHLSVDQQHSQG 315

Query: 366 KSALPDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
           K     Q+ VG   ST       FPF Y   L   AD  KFSK
Sbjct: 316 KCTPSSQVDVG--TSTQ-----KFPFQY---LGCEADILKFSK 334

BLAST of Cp4.1LG15g09030 vs. TrEMBL
Match: A0A061FAG5_THECC (NAC domain transcriptional regulator superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_033728 PE=4 SV=1)

HSP 1 Score: 380.9 bits (977), Expect = 1.8e-102
Identity = 214/340 (62.94%), Positives = 234/340 (68.82%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           M+P NGQ  SVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLKDKC
Sbjct: 1   MLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLKDKC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH+ NSK+IGMRKTLVF
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLCNSKKIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAEGL-------EYH 248
           YTGRAPHGQKTDWIMHEYRL+D D ++  QEDGWVVCRVFKKK+             E  
Sbjct: 121 YTGRAPHGQKTDWIMHEYRLDDDDSDV--QEDGWVVCRVFKKKNHSRGNFQPEFSQEESF 180

Query: 249 AHTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVA----T 308
            H K   SS+              RHNH Q  Y ++  DG MQLP LFSP+SAVA    +
Sbjct: 181 THIKTVASSAQLET----------RHNHLQALY-DFSFDGSMQLPHLFSPESAVASSFIS 240

Query: 309 PVS-----IECPQNMWRL----NCGGVQQERLNTDWSFLSRLLASDH------QSRTKSA 368
           PVS     IEC QN+ RL     CG VQQER N +WSFL +LLA+ H       S+ K  
Sbjct: 241 PVSLNSTDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLATHHLSVDQQHSQGKCT 300

Query: 369 LPDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
              Q+ VG   ST       FPF Y   L   AD  KFSK
Sbjct: 301 PSSQVDVG--TSTQ-----KFPFQY---LGCEADILKFSK 316

BLAST of Cp4.1LG15g09030 vs. TrEMBL
Match: B9S679_RICCO (NAC domain-containing protein, putative OS=Ricinus communis GN=RCOM_0674920 PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 3.0e-102
Identity = 212/338 (62.72%), Positives = 237/338 (70.12%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           MM  NGQ  SVPPGFRFHPTDEELLYYYL+KKVSYEAI+LDVIREVDLNKLEPWDLK+KC
Sbjct: 1   MMAGNGQ-LSVPPGFRFHPTDEELLYYYLKKKVSYEAIDLDVIREVDLNKLEPWDLKEKC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH++NSKRIGMRKTLVF
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLSNSKRIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQ------KAEGLEYHA 248
           YTGRAPHGQKTDWIMHEYRL+D + E+  QEDGWVVCRVFKKK+Q      +A   E+ +
Sbjct: 121 YTGRAPHGQKTDWIMHEYRLDDDNSEV--QEDGWVVCRVFKKKNQTRGFLPEAAQEEHFS 180

Query: 249 HTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVATP---- 308
           H K G SS            EPK+H H Q  Y +Y  DG M LPQLFSP+SA   P    
Sbjct: 181 HMKAGASSVSM---------EPKQH-HMQALY-DYNFDGSMHLPQLFSPESAAVPPSFVT 240

Query: 309 ------VSIECPQNMWRL---NCGGVQQERLNTDWSFLSRLLASDHQSRTKSALPDQLSV 368
                 + IEC QN+ RL   +CG VQ ER + DWSFL +LLAS HQS       D    
Sbjct: 241 PLSLNTMDIECSQNLLRLTSTSCGLVQPERFHGDWSFLDKLLAS-HQSL------DHQGK 300

Query: 369 GHPNSTMFPFPF-----PFPFPYPYHLPSAADSFKFSK 383
           G+P+S +           FPFPY   L    D  +FSK
Sbjct: 301 GNPSSQVVDMGASVHHQKFPFPY---LGCETDIMRFSK 314

BLAST of Cp4.1LG15g09030 vs. TrEMBL
Match: A0A0M3R863_MANES (NAC transcription factors 73 OS=Manihot esculenta PE=2 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 5.7e-101
Identity = 209/336 (62.20%), Positives = 236/336 (70.24%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           MM  NGQ  SVPPGFRFHPTDEELLYYYL+KKVSYEA++LDVIREVDLNKLEPWDLKDKC
Sbjct: 1   MMGGNGQ-LSVPPGFRFHPTDEELLYYYLKKKVSYEAVDLDVIREVDLNKLEPWDLKDKC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           RIGS  QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH++NS+RIGMRKTLVF
Sbjct: 61  RIGSSPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLSNSQRIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQK----AEGLEYHAHT 248
           YTGRAPHGQKTDWIMHEYRL+D + E+  QEDGWVVCRVFKKK+Q      E ++ H   
Sbjct: 121 YTGRAPHGQKTDWIMHEYRLDDDNSEV--QEDGWVVCRVFKKKNQSRGFFPEAVQEHWSH 180

Query: 249 KVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVA-------- 308
               SSS S           ++ NH Q PY +Y  DG M LPQLFSP+SAVA        
Sbjct: 181 MNASSSSASM---------EQKQNHMQAPY-DYSFDGSMHLPQLFSPESAVAPSFVSPFP 240

Query: 309 -TPVSIECPQNMWRL---NCGGVQQ-ERLNTDWSFLSRLLASDHQSRTKSAL----PDQL 368
              + IEC QN+ +L    CG VQ  ER N+DWSFL +LLAS HQS  + +     P  L
Sbjct: 241 MNSMDIECSQNLLKLTSSGCGIVQPGERFNSDWSFLDKLLAS-HQSLDQHSQSRGNPSSL 300

Query: 369 SVGHPN-STMFPFPFPFPFPYPYHLPSAADSFKFSK 383
            V H   S+   FPFP+       L   +D  KFSK
Sbjct: 301 VVDHVGCSSQQKFPFPY-------LGCESDILKFSK 315

BLAST of Cp4.1LG15g09030 vs. TAIR10
Match: AT1G79580.1 (AT1G79580.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein)

HSP 1 Score: 323.2 bits (827), Expect = 2.2e-88
Identity = 188/369 (50.95%), Positives = 226/369 (61.25%), Query Frame = 1

Query: 74  GQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSG 133
           G   SVPPGFRFHPT+EELLYYYL+KKVSYE I+LDVIREVDLNKLEPW+LK+KCRIGSG
Sbjct: 12  GGQLSVPPGFRFHPTEEELLYYYLKKKVSYEPIDLDVIREVDLNKLEPWELKEKCRIGSG 71

Query: 134 HQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGRA 193
            QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH+ +SK+IG+RKTLVFYTGRA
Sbjct: 72  PQNEWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKSIHLNSSKKIGLRKTLVFYTGRA 131

Query: 194 PHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKS------QKAEGLEYHAHT--- 253
           PHGQKT+WIMHEYRL+D + EI  QEDGWVVCRVFKKK+      Q+ E   +H H    
Sbjct: 132 PHGQKTEWIMHEYRLDDSENEI--QEDGWVVCRVFKKKNHFRGFHQEQEQDHHHHHQYIS 191

Query: 254 ---------KVGGSSSGSAALMGAEIGEPKRHNH----TQQPYNEYG---LDGCMQLPQL 313
                     +  +S+  + L+   +     H+H       P +E+      G M LPQL
Sbjct: 192 TNNDHDHHHHIDSNSNNHSPLILHPLDHHHHHHHIGRQIHMPLHEFANTLSHGSMHLPQL 251

Query: 314 FSPDSAVA------------TPVS---IECPQNMWRLNCGGVQQERLNTDWSFLSRLLAS 373
           FSPDSA A            +P++   IEC QN+ RL            DWSFL +LL +
Sbjct: 252 FSPDSAAAAAAAAASAQPFVSPINTTDIECSQNLLRL----TSNNNYGGDWSFLDKLLTT 311

Query: 374 ------------DHQSRTKSAL--------PDQLSVGHPNSTMFPFPFPFPFPYPYHLPS 383
                       +HQ++    L         D L   +  S+  P    FPF Y   L +
Sbjct: 312 GNMNQQQQQQVQNHQAKCFGDLSNNDNNDQADHLGNNNGGSSSSPVNQRFPFHY---LGN 371

BLAST of Cp4.1LG15g09030 vs. TAIR10
Match: AT2G46770.1 (AT2G46770.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein)

HSP 1 Score: 272.7 bits (696), Expect = 3.4e-73
Identity = 153/317 (48.26%), Positives = 196/317 (61.83%), Query Frame = 1

Query: 73  NGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 132
           NGQ   VPPGFRFHPT+EELL YYLRKKV+   I+LDVIR+VDLNKLEPWD+++ C+IG+
Sbjct: 11  NGQS-QVPPGFRFHPTEEELLQYYLRKKVNSIEIDLDVIRDVDLNKLEPWDIQEMCKIGT 70

Query: 133 GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGR 192
             QN+WYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK I+ +N +RIGMRKTLVFY GR
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATAAGFWKATGRDKIIY-SNGRRIGMRKTLVFYKGR 130

Query: 193 APHGQKTDWIMHEYRLEDQ--DPEIQT------------QEDGWVVCRVFKKKSQKAEGL 252
           APHGQK+DWIMHEYRL+D    PE  T            Q++GWVVCR+FKKK+     L
Sbjct: 131 APHGQKSDWIMHEYRLDDNIISPEDVTVHEVVSIIGEASQDEGWVVCRIFKKKN-----L 190

Query: 253 EYHAHTKVGGSS---------SGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLF 312
               ++ VGG+S         + S+ +   +  +       +    E  LD  M+LP L 
Sbjct: 191 HKTLNSPVGGASLSGGGDTPKTTSSQIFNEDTLDQFLELMGRSCKEELNLDPFMKLPNLE 250

Query: 313 SPDSAVATPVSIECPQNMWRLNCGGVQQERLNTDWSFLSRLLASDHQSRTKSALP--DQL 365
           SP+S       +  P     ++   V      T W+ L RL+AS     T  ++   ++ 
Sbjct: 251 SPNSQAINNCHVSSPDTNHNIHVSNVVDTSFVTSWAALDRLVASQLNGPTSYSITAVNES 310

BLAST of Cp4.1LG15g09030 vs. TAIR10
Match: AT4G10350.1 (AT4G10350.1 NAC domain containing protein 70)

HSP 1 Score: 265.4 bits (677), Expect = 5.5e-71
Identity = 137/227 (60.35%), Positives = 160/227 (70.48%), Query Frame = 1

Query: 79  VPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGSGHQNEW 138
           VPPGFRFHPTDEELL+YYL+KK+SY+  E++VIREVDLNKLEPWDL+++C+IGS  QNEW
Sbjct: 9   VPPGFRFHPTDEELLHYYLKKKISYQKFEMEVIREVDLNKLEPWDLQERCKIGSTPQNEW 68

Query: 139 YFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGRAPHGQK 198
           YFFSHKD+KYPTG+RTNRAT AGFWKATGRDK I  +  K+IGMRKTLVFY GRAPHGQK
Sbjct: 69  YFFSHKDRKYPTGSRTNRATHAGFWKATGRDKCIRNSY-KKIGMRKTLVFYKGRAPHGQK 128

Query: 199 TDWIMHEYRLED-QDPEIQTQEDGWVVCRVFKKK---------SQKAEGLEYHAHTKVGG 258
           TDWIMHEYRLED  DP+    EDGWVVCRVF KK         S     L+ H H     
Sbjct: 129 TDWIMHEYRLEDADDPQANPSEDGWVVCRVFMKKNLFKVVNEGSSSINSLDQHNH----D 188

Query: 259 SSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAV 296
           +S+ + AL      + +   H   PY      G M   +L  PD A+
Sbjct: 189 ASNNNHAL------QARSFMHRDSPYQLVRNHGAMTF-ELNKPDLAL 223

BLAST of Cp4.1LG15g09030 vs. TAIR10
Match: AT1G33280.1 (AT1G33280.1 NAC domain containing protein 15)

HSP 1 Score: 258.5 bits (659), Expect = 6.7e-69
Identity = 121/164 (73.78%), Positives = 140/164 (85.37%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           M   NG    VPPGFRFHPTDEELL+YYL+KK+SYE  E++VI+EVDLNK+EPWDL+D+C
Sbjct: 1   MSSSNG---GVPPGFRFHPTDEELLHYYLKKKISYEKFEMEVIKEVDLNKIEPWDLQDRC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           +IGS  QNEWYFFSHKD+KYPTG+RTNRAT +GFWKATGRDK I   + K+IGMRKTLVF
Sbjct: 61  KIGSTPQNEWYFFSHKDRKYPTGSRTNRATHSGFWKATGRDKCIR-NSYKKIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKS 233
           Y GRAPHGQKTDWIMHEYR+ED + +    EDGWVVCRVFKKK+
Sbjct: 121 YKGRAPHGQKTDWIMHEYRIEDTEDD--PCEDGWVVCRVFKKKN 158

BLAST of Cp4.1LG15g09030 vs. TAIR10
Match: AT1G32770.1 (AT1G32770.1 NAC domain containing protein 12)

HSP 1 Score: 254.2 bits (648), Expect = 1.3e-67
Identity = 148/297 (49.83%), Positives = 186/297 (62.63%), Query Frame = 1

Query: 73  NGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKCRIGS 132
           NGQ   VPPGFRFHPT+EELL+YYLRKKV+ + I+LDVIREVDLNKLEPWD++++CRIGS
Sbjct: 11  NGQS-KVPPGFRFHPTEEELLHYYLRKKVNSQKIDLDVIREVDLNKLEPWDIQEECRIGS 70

Query: 133 GHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVFYTGR 192
             QN+WYFFSHKDKKYPTGTRTNRAT AGFWKATGRDK I  +  +RIG+RKTLVFY GR
Sbjct: 71  TPQNDWYFFSHKDKKYPTGTRTNRATVAGFWKATGRDK-IICSCVRRIGLRKTLVFYKGR 130

Query: 193 APHGQKTDWIMHEYRLED------------QDPEIQTQEDGWVVCRVFKKK--------- 252
           APHGQK+DWIMHEYRL+D            +DP +   E+GWVVCRVF+KK         
Sbjct: 131 APHGQKSDWIMHEYRLDDTPMSNGYADVVTEDP-MSYNEEGWVVCRVFRKKNYQKIDDCP 190

Query: 253 ----------SQKAEGLEYHAHTKVGGSSSGSAAL--MGAEIGEPKRHNHTQQPYNEYGL 312
                     +++ +G  +H    V G       +   G+ I  P+    TQ   +    
Sbjct: 191 KITLSSLPDDTEEEKGPTFHNTQNVTGLDHVLLYMDRTGSNICMPESQTTTQHQDDVL-- 250

Query: 313 DGCMQLPQLFSPDSAVATPVSIECPQNMWRLNCGGVQQE----RLNTDWSFLSRLLA 333
              MQLP L +P S      S   P    +L+   VQ++     + ++W+ L RL+A
Sbjct: 251 --FMQLPSLETPKSESPVDQSFLTPS---KLDFSPVQEKITERPVCSNWASLDRLVA 297

BLAST of Cp4.1LG15g09030 vs. NCBI nr
Match: gi|449461154|ref|XP_004148307.1| (PREDICTED: NAC domain-containing protein 76-like [Cucumis sativus])

HSP 1 Score: 470.7 bits (1210), Expect = 2.4e-129
Identity = 247/339 (72.86%), Positives = 271/339 (79.94%), Query Frame = 1

Query: 69  MMPENGQHF-SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 128
           MMPEN Q   SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK
Sbjct: 1   MMPENEQQLVSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 60

Query: 129 CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHM----ANSKR-IGM 188
           CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDK IHM    +NSKR IGM
Sbjct: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHMSSSNSNSKRIIGM 120

Query: 189 RKTLVFYTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAE-----G 248
           RKTLVFYTGRAPHGQKTDWIMHEYRLE  +PE+  QEDGWVVCRVFKKKSQK+E      
Sbjct: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEHHNPEV--QEDGWVVCRVFKKKSQKSEVPEEQQ 180

Query: 249 LEYHAHTKVGGSSSGSAALMGAEIGEPK-RHNHTQQPY--NEYGLDGCMQLPQLFSPDSA 308
           L+Y+AHTK+GG SSGSA  +G E+GEPK  +NH Q+P+  N+Y  DGCMQLPQLFSP+S+
Sbjct: 181 LDYYAHTKLGG-SSGSA--VGTEMGEPKNNNNHMQEPHNNNDYSFDGCMQLPQLFSPESS 240

Query: 309 V----------ATPVSIECPQNMWRLNCGGVQQERLN-TDWSFLSRLLASDHQSRTKSAL 368
                      A   ++ECPQN+WRL+CG VQ ERLN TDWSFL+RLLA D QSR+KS L
Sbjct: 241 TVPTLPAISLNAAGAAVECPQNIWRLSCGVVQHERLNTTDWSFLNRLLALDQQSRSKSTL 300

Query: 369 PDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
            D+L++           F FPFPYPYHLPS  D  KFSK
Sbjct: 301 SDELTISR--------NFSFPFPYPYHLPSGPDFIKFSK 326

BLAST of Cp4.1LG15g09030 vs. NCBI nr
Match: gi|659092435|ref|XP_008447060.1| (PREDICTED: NAC domain-containing protein 76 [Cucumis melo])

HSP 1 Score: 462.2 bits (1188), Expect = 8.6e-127
Identity = 245/338 (72.49%), Positives = 266/338 (78.70%), Query Frame = 1

Query: 69  MMPENGQHF-SVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDK 128
           MMPEN Q   SVPPGFRFHPTDEELLYYYLRKKVS+EAIELDVIREVDLNKLEPWDLKDK
Sbjct: 1   MMPENEQQLVSVPPGFRFHPTDEELLYYYLRKKVSFEAIELDVIREVDLNKLEPWDLKDK 60

Query: 129 CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHM----ANSKR-IGM 188
           CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDK IHM    +NSKR IGM
Sbjct: 61  CRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKTIHMSSSNSNSKRIIGM 120

Query: 189 RKTLVFYTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAE-----G 248
           RKTLVFYTGRAPHGQKTDWIMHEYRLE  DPE+  QEDGWVVCRVFKKKSQK E      
Sbjct: 121 RKTLVFYTGRAPHGQKTDWIMHEYRLEHHDPEV--QEDGWVVCRVFKKKSQKPEVPEEQH 180

Query: 249 LEYHAHTKVGGSSSGSAALMGAEIGEPK-RHNHTQQPY--NEYGLDGCMQLPQLFSPDSA 308
           L+Y+AHTK+GG SSGSA   G  +GEPK  +NH Q+P+  N+Y  DGCMQLPQLFSP+S+
Sbjct: 181 LDYYAHTKLGG-SSGSAE--GTGMGEPKNNNNHMQEPHNNNDYSFDGCMQLPQLFSPESS 240

Query: 309 V---------ATPVSIECPQNMWRLNCGGVQQERLN-TDWSFLSRLLASDHQSRTKSALP 368
                     A   ++ECPQN+WRL+CG VQ ERLN TDWSFL+RLLA D QSR+KS L 
Sbjct: 241 SVPTLPISLNAAAAAVECPQNIWRLSCGVVQHERLNTTDWSFLNRLLALDQQSRSKSTLS 300

Query: 369 DQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
           D L++           F FPFPYPY LPS  D  KFSK
Sbjct: 301 DDLTISR--------NFSFPFPYPYSLPSGPDFIKFSK 325

BLAST of Cp4.1LG15g09030 vs. NCBI nr
Match: gi|590613952|ref|XP_007022813.1| (NAC domain transcriptional regulator superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 381.3 bits (978), Expect = 1.9e-102
Identity = 214/343 (62.39%), Positives = 235/343 (68.51%), Query Frame = 1

Query: 66  QGTMMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLK 125
           +  M+P NGQ  SVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLK
Sbjct: 16  ENKMLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLK 75

Query: 126 DKCRIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKT 185
           DKCRIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH+ NSK+IGMRKT
Sbjct: 76  DKCRIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLCNSKKIGMRKT 135

Query: 186 LVFYTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAEGL------- 245
           LVFYTGRAPHGQKTDWIMHEYRL+D D ++  QEDGWVVCRVFKKK+             
Sbjct: 136 LVFYTGRAPHGQKTDWIMHEYRLDDDDSDV--QEDGWVVCRVFKKKNHSRGNFQPEFSQE 195

Query: 246 EYHAHTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVA-- 305
           E   H K   SS+              RHNH Q  Y ++  DG MQLP LFSP+SAVA  
Sbjct: 196 ESFTHIKTVASSAQLET----------RHNHLQALY-DFSFDGSMQLPHLFSPESAVASS 255

Query: 306 --TPVS-----IECPQNMWRL----NCGGVQQERLNTDWSFLSRLLASDH------QSRT 365
             +PVS     IEC QN+ RL     CG VQQER N +WSFL +LLA+ H       S+ 
Sbjct: 256 FISPVSLNSTDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLATHHLSVDQQHSQG 315

Query: 366 KSALPDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
           K     Q+ VG   ST       FPF Y   L   AD  KFSK
Sbjct: 316 KCTPSSQVDVG--TSTQ-----KFPFQY---LGCEADILKFSK 334

BLAST of Cp4.1LG15g09030 vs. NCBI nr
Match: gi|590613955|ref|XP_007022814.1| (NAC domain transcriptional regulator superfamily protein isoform 2 [Theobroma cacao])

HSP 1 Score: 380.9 bits (977), Expect = 2.5e-102
Identity = 214/340 (62.94%), Positives = 234/340 (68.82%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           M+P NGQ  SVPPGFRFHPTDEELLYYYLRKKVSYEAI+LDVIREVDLNKLEPWDLKDKC
Sbjct: 1   MLPGNGQ-LSVPPGFRFHPTDEELLYYYLRKKVSYEAIDLDVIREVDLNKLEPWDLKDKC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH+ NSK+IGMRKTLVF
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLCNSKKIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQKAEGL-------EYH 248
           YTGRAPHGQKTDWIMHEYRL+D D ++  QEDGWVVCRVFKKK+             E  
Sbjct: 121 YTGRAPHGQKTDWIMHEYRLDDDDSDV--QEDGWVVCRVFKKKNHSRGNFQPEFSQEESF 180

Query: 249 AHTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVA----T 308
            H K   SS+              RHNH Q  Y ++  DG MQLP LFSP+SAVA    +
Sbjct: 181 THIKTVASSAQLET----------RHNHLQALY-DFSFDGSMQLPHLFSPESAVASSFIS 240

Query: 309 PVS-----IECPQNMWRL----NCGGVQQERLNTDWSFLSRLLASDH------QSRTKSA 368
           PVS     IEC QN+ RL     CG VQQER N +WSFL +LLA+ H       S+ K  
Sbjct: 241 PVSLNSTDIECSQNLLRLTSSGGCGLVQQERYNGEWSFLDKLLATHHLSVDQQHSQGKCT 300

Query: 369 LPDQLSVGHPNSTMFPFPFPFPFPYPYHLPSAADSFKFSK 383
              Q+ VG   ST       FPF Y   L   AD  KFSK
Sbjct: 301 PSSQVDVG--TSTQ-----KFPFQY---LGCEADILKFSK 316

BLAST of Cp4.1LG15g09030 vs. NCBI nr
Match: gi|255560971|ref|XP_002521498.1| (PREDICTED: protein SOMBRERO [Ricinus communis])

HSP 1 Score: 380.2 bits (975), Expect = 4.3e-102
Identity = 212/338 (62.72%), Positives = 237/338 (70.12%), Query Frame = 1

Query: 69  MMPENGQHFSVPPGFRFHPTDEELLYYYLRKKVSYEAIELDVIREVDLNKLEPWDLKDKC 128
           MM  NGQ  SVPPGFRFHPTDEELLYYYL+KKVSYEAI+LDVIREVDLNKLEPWDLK+KC
Sbjct: 1   MMAGNGQ-LSVPPGFRFHPTDEELLYYYLKKKVSYEAIDLDVIREVDLNKLEPWDLKEKC 60

Query: 129 RIGSGHQNEWYFFSHKDKKYPTGTRTNRATSAGFWKATGRDKGIHMANSKRIGMRKTLVF 188
           RIGSG QNEWYFFSHKDKKYPTGTRTNRAT+AGFWKATGRDK IH++NSKRIGMRKTLVF
Sbjct: 61  RIGSGPQNEWYFFSHKDKKYPTGTRTNRATTAGFWKATGRDKAIHLSNSKRIGMRKTLVF 120

Query: 189 YTGRAPHGQKTDWIMHEYRLEDQDPEIQTQEDGWVVCRVFKKKSQ------KAEGLEYHA 248
           YTGRAPHGQKTDWIMHEYRL+D + E+  QEDGWVVCRVFKKK+Q      +A   E+ +
Sbjct: 121 YTGRAPHGQKTDWIMHEYRLDDDNSEV--QEDGWVVCRVFKKKNQTRGFLPEAAQEEHFS 180

Query: 249 HTKVGGSSSGSAALMGAEIGEPKRHNHTQQPYNEYGLDGCMQLPQLFSPDSAVATP---- 308
           H K G SS            EPK+H H Q  Y +Y  DG M LPQLFSP+SA   P    
Sbjct: 181 HMKAGASSVSM---------EPKQH-HMQALY-DYNFDGSMHLPQLFSPESAAVPPSFVT 240

Query: 309 ------VSIECPQNMWRL---NCGGVQQERLNTDWSFLSRLLASDHQSRTKSALPDQLSV 368
                 + IEC QN+ RL   +CG VQ ER + DWSFL +LLAS HQS       D    
Sbjct: 241 PLSLNTMDIECSQNLLRLTSTSCGLVQPERFHGDWSFLDKLLAS-HQSL------DHQGK 300

Query: 369 GHPNSTMFPFPF-----PFPFPYPYHLPSAADSFKFSK 383
           G+P+S +           FPFPY   L    D  +FSK
Sbjct: 301 GNPSSQVVDMGASVHHQKFPFPY---LGCETDIMRFSK 314
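
The percentages in the HSP headers above are the identity and positive-match counts divided by the aligned length. A minimal Python sketch of that arithmetic follows, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The function names `percent` and `count_midline` are illustrative and not part of any BLAST tool.

```python
from typing import Tuple


def percent(count: int, aligned_length: int) -> float:
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


def count_midline(midline: str) -> Tuple[int, int]:
    """Count (identities, positives) in a BLAST midline string.

    Letters mark identical residues, '+' marks a conservative substitution,
    and spaces mark mismatches or gaps. Positives include identities.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative


# Counts taken from the two HSP headers above (Theobroma cacao, Ricinus communis).
theobroma = (percent(214, 340), percent(234, 340))
ricinus = (percent(212, 338), percent(237, 338))

# Ten-column excerpt of a midline from the Ricinus communis alignment.
excerpt = count_midline("NRAT+AGFWK")

print(theobroma, ricinus, excerpt)
```

Applying `count_midline` to every midline row of an HSP and summing the results should reproduce the header counts, provided the rows are copied with their spacing intact.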

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
SMB_ARATH    3.9e-87  50.95         Protein SOMBRERO OS=Arabidopsis thaliana GN=SMB PE=1 SV=1
NAC76_ORYSJ  1.7e-82  63.45         NAC domain-containing protein 76 OS=Oryza sativa subsp. japonica GN=NAC76 PE=2 S...
NAC43_ARATH  6.1e-72  48.26         NAC domain-containing protein 43 OS=Arabidopsis thaliana GN=NAC043 PE=2 SV=2
BRN2_ARATH   9.7e-70  60.35         Protein BEARSKIN2 OS=Arabidopsis thaliana GN=BRN2 PE=2 SV=1
BRN1_ARATH   1.2e-67  73.78         Protein BEARSKIN1 OS=Arabidopsis thaliana GN=BRN1 PE=2 SV=1
Match Name        E-value   Identity (%)  Description
A0A0A0K475_CUCSA  1.7e-129  72.86         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G252700 PE=4 SV=1
A0A061FBQ8_THECC  1.4e-102  62.39         NAC domain transcriptional regulator superfamily protein isoform 1 OS=Theobroma ...
A0A061FAG5_THECC  1.8e-102  62.94         NAC domain transcriptional regulator superfamily protein isoform 2 OS=Theobroma ...
B9S679_RICCO      3.0e-102  62.72         NAC domain-containing protein, putative OS=Ricinus communis GN=RCOM_0674920 PE=4...
A0A0M3R863_MANES  5.7e-101  62.20         NAC transcription factors 73 OS=Manihot esculenta PE=2 SV=1
Match Name   E-value  Identity (%)  Description
AT1G79580.1  2.2e-88  50.95         NAC (No Apical Meristem) domain transcriptional regulator superfamil...
AT2G46770.1  3.4e-73  48.26         NAC (No Apical Meristem) domain transcriptional regulator superfamil...
AT4G10350.1  5.5e-71  60.35         NAC domain containing protein 70
AT1G33280.1  6.7e-69  73.78         NAC domain containing protein 15
AT1G32770.1  1.3e-67  49.83         NAC domain containing protein 12
Match Name                        E-value   Identity (%)  Description
gi|449461154|ref|XP_004148307.1|  2.4e-129  72.86         PREDICTED: NAC domain-containing protein 76-like [Cucumis sativus]
gi|659092435|ref|XP_008447060.1|  8.6e-127  72.49         PREDICTED: NAC domain-containing protein 76 [Cucumis melo]
gi|590613952|ref|XP_007022813.1|  1.9e-102  62.39         NAC domain transcriptional regulator superfamily protein isoform 1 [Theobroma ca...
gi|590613955|ref|XP_007022814.1|  2.5e-102  62.94         NAC domain transcriptional regulator superfamily protein isoform 2 [Theobroma ca...
gi|255560971|ref|XP_002521498.1|  4.3e-102  62.72         PREDICTED: protein SOMBRERO [Ricinus communis]
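
Hits in these tables are compared by E-value, where a smaller value means a more significant match, and the hit with the lowest E-value is not always the one with the highest percent identity. A minimal Python sketch of filtering and ranking follows, using the five `gi|` rows of the last table above transcribed by hand. The function `rank_hits` and the `1e-120` cutoff are illustrative assumptions, not settings used by the database.

```python
# (RefSeq accession, E-value as printed, percent identity), copied from the last table above.
HITS = [
    ("XP_004148307.1", "2.4e-129", 72.86),
    ("XP_008447060.1", "8.6e-127", 72.49),
    ("XP_007022813.1", "1.9e-102", 62.39),
    ("XP_007022814.1", "2.5e-102", 62.94),
    ("XP_002521498.1", "4.3e-102", 62.72),
]


def rank_hits(rows, max_evalue=1e-120):
    """Keep hits with E-value <= max_evalue, sorted from most to least significant.

    The default cutoff is a hypothetical value chosen only for this example.
    """
    kept = [(acc, float(evalue), identity)
            for acc, evalue, identity in rows
            if float(evalue) <= max_evalue]
    return sorted(kept, key=lambda row: row[1])


strong = rank_hits(HITS)                  # only the two cucurbit hits pass 1e-120
all_ranked = rank_hits(HITS, max_evalue=1.0)
best_by_evalue = all_ranked[0][0]
best_by_identity = max(HITS, key=lambda row: row[2])[0]

print([acc for acc, _, _ in strong], best_by_evalue, best_by_identity)
```

Here the same Cucumis sativus entry ranks first by both E-value and identity. In the Arabidopsis tables further up the two orderings differ, which is why the sketch keeps both columns.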
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0048829  root cap development
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term        Definition
IPR003441   NAC-dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0048829      root cap development
biological_process  GO:0044210      'de novo' CTP biosynthetic process
biological_process  GO:0000478      endonucleolytic cleavage involved in rRNA processing
biological_process  GO:0006541      glutamine metabolic process
biological_process  GO:0006206      pyrimidine nucleobase metabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005829      cytosol
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0003883      CTP synthase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG15g09030.1  Cp4.1LG15g09030.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term  Source Description  Alignment
IPR003441  NAC domain        PFAM     PF02365      NAM                 coord: 80..208; score: 5.9
IPR003441  NAC domain        PROFILE  PS51005      NAC                 coord: 79..230; score: 58
IPR003441  NAC domain        unknown  SSF101941    NAC domain          coord: 68..230; score: 4.18
None       No IPR available  PANTHER  PTHR31989    FAMILY NOT NAMED    coord: 61..335; score: 1.0E
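
The `coord:` fields above give where each signature matches along the sequence. A minimal Python sketch follows for turning them into span lengths and finding the region on which the three NAC-domain signatures agree. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the names `span_length` and `common_interval` are illustrative.

```python
from typing import List, Optional, Tuple

# (source term, start, end) transcribed from the "coord:" fields of the InterPro rows above.
SIGNATURES = [
    ("PF02365", 80, 208),
    ("PS51005", 79, 230),
    ("SSF101941", 68, 230),
    ("PTHR31989", 61, 335),
]


def span_length(start: int, end: int) -> int:
    """Length of a match, assuming 1-based inclusive coordinates."""
    return end - start + 1


def common_interval(spans: List[Tuple[int, int]]) -> Optional[Tuple[int, int]]:
    """Return the interval shared by all spans, or None if they do not all overlap."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


lengths = {name: span_length(s, e) for name, s, e in SIGNATURES}

# Region supported by all three IPR003441 (NAC domain) signatures; PANTHER is a family-level hit.
nac_core = common_interval([(s, e) for name, s, e in SIGNATURES if name != "PTHR31989"])

print(lengths, nac_core)
```

Under the inclusive assumption, the shared region coincides with the PFAM match, the narrowest of the three NAC-domain signatures.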

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None