Cla012144 (gene) Watermelon (97103) v1

NameCla012144
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionNAC-domain containing protein (AHRD V1 ***- B5TZH2_CUCMA); contains Interpro domain(s) IPR003441 No apical meristem (NAM) protein
LocationChr4 : 15660560 .. 15662087 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGAACCACCGTCCACCGCCGTCAATTTGCCTCCGGGCTTCAGATTCCACCCCACAGATGAAGAGATCGTCACGTATTACCTCTCTCAGAAGATCGCCAACCCCACCTTCACCGCCACTGCCATCGGTGAAGCTGACTTGAACAAGTGTGAGCCTTGGGATTTGCCTCGTGAGTCCTCGTTTCTCTTCCTTTTATTCTAAATTTTTACTATCAAATTAAAATATACTTTGTTCTATTTTATTTTCTCTACTTTCAATTCTTTAAATTCTTCCAACTTTTTTGAATTAAAATATACTATTACACGATATCTCGAAACTTGATTTATCATAGAATTGAAGTTCTTTCTTTGAAATTTTGAGAAACTTGTTCTTATTTTTGGTTGATTTTACTTTAAAACAACTAAAATTCAAGTTAGGGATTAAAAATAAAAAAAAAAATAATAATAAAAAAAATCCCAAAATATGTAAGGGCAAAATGGGTATTTTGAATTTTTGGTCAGAAGTTTTTTTTTTTGGGTATGATTTTGCAGAGAAGGCTAAGATGGGAGAAAAGGAATGGTACTTCTTTTGCCAGCGGGACCGGAAATATCCGACCGGGATGAGAACGAACCGGGCGACTCAGACCGGTTATTGGAAGGCAACCGGAAAAGACCGAGAGATTTTCAAGGGAAAATCGGTTCTTGCCGGTATGAAGAAAACGTTGGTTTTTTACAAAGGAAGAGCTCCCAGAGGTGAAAAGACCAATTGGGTCATGCATGAATTCCGACTCGAACCCAAATATTCTCACTTTCTTCGTCATGCTAGGCCTGTTAAGGTAATTTACTCTATTTAAATTCTATTTTCATCTCTAAAAATTAGGAAAAAAAATCAATTTATGTGGCTCCATTAAAAATACATGCATGCATTGAAATTTAAAATAAAAAAAATAAAAAAAATAAAAAGTTTGAAATTTTGATGTTGGTAATTGCTTAAATTCTTCCTCATGCATGAATTCGAATTTAATTTACAGGACGATTGGGTTGTTTTTCGGGTTTTTCACAAGAATCCCACTACATCGTCAACGACGCTGGTGGGAAGGATTCAAACTTCTGATTTCTCTTCTTCTCTTCCACCTCTAATGGATCCCTCTGCTAGTTGTATTCCGATCAATGCCGGGTTCGATGACTTCGAAGCCAAGTATAGGCCACCGAAACCATCCGATTATAATTACTTGAAGTACATGTCAATTGGCACAAATGATCACCAACCGGAAGCAACACTTCCATCGTTGGCGACGAGTGCCGCAATGACAATGAACGTTCCGTTTCCATCATCGGTTCCCGACGACGAATTTTTCTCGTTTGATCAGCTGGCCGCTGGTGGGACTACGTCGATAACGCCGCTGCCAACAACGATGGAGCGCAAAATGGAGCAAGTTTCATGGTCGACGATGAGCGGCGTGACGCAGGACATGTCGCCATCGATCGAAAATAGCGGTTACGTGGCGGCCGACATGGCGGATCTGGACCTGTGGGATTACTACTGA

mRNA sequence

ATGGAAGAACCACCGTCCACCGCCGTCAATTTGCCTCCGGGCTTCAGATTCCACCCCACAGATGAAGAGATCGTCACGTATTACCTCTCTCAGAAGATCGCCAACCCCACCTTCACCGCCACTGCCATCGGTGAAGCTGACTTGAACAAGTGTGAGCCTTGGGATTTGCCTCAGAAGGCTAAGATGGGAGAAAAGGAATGGTACTTCTTTTGCCAGCGGGACCGGAAATATCCGACCGGGATGAGAACGAACCGGGCGACTCAGACCGGTTATTGGAAGGCAACCGGAAAAGACCGAGAGATTTTCAAGGGAAAATCGGTTCTTGCCGGTATGAAGAAAACGTTGGTTTTTTACAAAGGAAGAGCTCCCAGAGGTGAAAAGACCAATTGGGTCATGCATGAATTCCGACTCGAACCCAAATATTCTCACTTTCTTCGTCATGCTAGGCCTGTTAAGGACGATTGGGTTGTTTTTCGGGTTTTTCACAAGAATCCCACTACATCGTCAACGACGCTGGTGGGAAGGATTCAAACTTCTGATTTCTCTTCTTCTCTTCCACCTCTAATGGATCCCTCTGCTAGTTGTATTCCGATCAATGCCGGGTTCGATGACTTCGAAGCCAAGTATAGGCCACCGAAACCATCCGATTATAATTACTTGAAGTACATGTCAATTGGCACAAATGATCACCAACCGGAAGCAACACTTCCATCGTTGGCGACGAGTGCCGCAATGACAATGAACGTTCCGTTTCCATCATCGGTTCCCGACGACGAATTTTTCTCGTTTGATCAGCTGGCCGCTGGTGGGACTACGTCGATAACGCCGCTGCCAACAACGATGGAGCGCAAAATGGAGCAAGTTTCATGGTCGACGATGAGCGGCGTGACGCAGGACATGTCGCCATCGATCGAAAATAGCGGTTACGTGGCGGCCGACATGGCGGATCTGGACCTGTGGGATTACTACTGA

Coding sequence (CDS)

ATGGAAGAACCACCGTCCACCGCCGTCAATTTGCCTCCGGGCTTCAGATTCCACCCCACAGATGAAGAGATCGTCACGTATTACCTCTCTCAGAAGATCGCCAACCCCACCTTCACCGCCACTGCCATCGGTGAAGCTGACTTGAACAAGTGTGAGCCTTGGGATTTGCCTCAGAAGGCTAAGATGGGAGAAAAGGAATGGTACTTCTTTTGCCAGCGGGACCGGAAATATCCGACCGGGATGAGAACGAACCGGGCGACTCAGACCGGTTATTGGAAGGCAACCGGAAAAGACCGAGAGATTTTCAAGGGAAAATCGGTTCTTGCCGGTATGAAGAAAACGTTGGTTTTTTACAAAGGAAGAGCTCCCAGAGGTGAAAAGACCAATTGGGTCATGCATGAATTCCGACTCGAACCCAAATATTCTCACTTTCTTCGTCATGCTAGGCCTGTTAAGGACGATTGGGTTGTTTTTCGGGTTTTTCACAAGAATCCCACTACATCGTCAACGACGCTGGTGGGAAGGATTCAAACTTCTGATTTCTCTTCTTCTCTTCCACCTCTAATGGATCCCTCTGCTAGTTGTATTCCGATCAATGCCGGGTTCGATGACTTCGAAGCCAAGTATAGGCCACCGAAACCATCCGATTATAATTACTTGAAGTACATGTCAATTGGCACAAATGATCACCAACCGGAAGCAACACTTCCATCGTTGGCGACGAGTGCCGCAATGACAATGAACGTTCCGTTTCCATCATCGGTTCCCGACGACGAATTTTTCTCGTTTGATCAGCTGGCCGCTGGTGGGACTACGTCGATAACGCCGCTGCCAACAACGATGGAGCGCAAAATGGAGCAAGTTTCATGGTCGACGATGAGCGGCGTGACGCAGGACATGTCGCCATCGATCGAAAATAGCGGTTACGTGGCGGCCGACATGGCGGATCTGGACCTGTGGGATTACTACTGA

Protein sequence

MEEPPSTAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDFSSSLPPLMDPSASCIPINAGFDDFEAKYRPPKPSDYNYLKYMSIGTNDHQPEATLPSLATSAAMTMNVPFPSSVPDDEFFSFDQLAAGGTTSITPLPTTMERKMEQVSWSTMSGVTQDMSPSIENSGYVAADMADLDLWDYY
BLAST of Cla012144 vs. Swiss-Prot
Match: NAC79_ARATH (NAC domain-containing protein 79 OS=Arabidopsis thaliana GN=NAC079 PE=2 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 4.7e-65
Identity = 123/188 (65.43%), Postives = 147/188 (78.19%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEE++T+YL +K+ +  F+A AIGE DLNK EPW+LP KAK+GEKEWY
Sbjct: 15  MDLPPGFRFHPTDEELITHYLHKKVLDLGFSAKAIGEVDLNKAEPWELPYKAKIGEKEWY 74

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFC RDRKYPTG+RTNRATQ GYWKATGKD+EIF+GKS L GMKKTLVFY+GRAP+G+KT
Sbjct: 75  FFCVRDRKYPTGLRTNRATQAGYWKATGKDKEIFRGKS-LVGMKKTLVFYRGRAPKGQKT 134

Query: 129 NWVMHEFRLEPKYS-HFLRHARPVKDDWVVFRVFHKNPTTSS---TTLVGRIQTSDFSSS 188
           NWVMHE+RL+ K S H L   +  K++WV+ RVFHK         +TL+ RI +    SS
Sbjct: 135 NWVMHEYRLDGKLSAHNL--PKTAKNEWVICRVFHKTAGGKKIPISTLI-RIGSYGTGSS 194

Query: 189 LPPLMDPS 193
           LPPL D S
Sbjct: 195 LPPLTDSS 198

BLAST of Cla012144 vs. Swiss-Prot
Match: NAC92_ARATH (NAC domain-containing protein 92 OS=Arabidopsis thaliana GN=NAC92 PE=1 SV=1)

HSP 1 Score: 247.3 bits (630), Expect = 2.3e-64
Identity = 122/190 (64.21%), Postives = 141/190 (74.21%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEE++T+YL  K+ N  F+ATAIGE DLNK EPWDLP KAKMGEKEWY
Sbjct: 18  IDLPPGFRFHPTDEELITHYLKPKVFNTFFSATAIGEVDLNKIEPWDLPWKAKMGEKEWY 77

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFC RDRKYPTG+RTNRAT+ GYWKATGKD+EIFKGKS L GMKKTLVFYKGRAP+G KT
Sbjct: 78  FFCVRDRKYPTGLRTNRATEAGYWKATGKDKEIFKGKS-LVGMKKTLVFYKGRAPKGVKT 137

Query: 129 NWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKN------PTTSSTTLVGRIQTSDFS 188
           NWVMHE+RLE KY       +  K++WV+ RVF K       P +     + R++     
Sbjct: 138 NWVMHEYRLEGKYC-IENLPQTAKNEWVICRVFQKRADGTKVPMSMLDPHINRME----P 197

Query: 189 SSLPPLMDPS 193
           + LP LMD S
Sbjct: 198 AGLPSLMDCS 201

BLAST of Cla012144 vs. Swiss-Prot
Match: NAC59_ARATH (NAC domain-containing protein 59 OS=Arabidopsis thaliana GN=NAC59 PE=1 SV=1)

HSP 1 Score: 245.0 bits (624), Expect = 1.1e-63
Identity = 122/191 (63.87%), Postives = 140/191 (73.30%), Query Frame = 1

Query: 2   EEPPSTAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAK 61
           E   S  ++LPPGFRFHPTDEE++T+YL  K+ N  F+A AIGE DLNK EPWDLP KAK
Sbjct: 15  EVEDSEKIDLPPGFRFHPTDEELITHYLRPKVVNSFFSAIAIGEVDLNKVEPWDLPWKAK 74

Query: 62  MGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGR 121
           +GEKEWYFFC RDRKYPTG+RTNRAT+ GYWKATGKD+EIFKGKS L GMKKTLVFYKGR
Sbjct: 75  LGEKEWYFFCVRDRKYPTGLRTNRATKAGYWKATGKDKEIFKGKS-LVGMKKTLVFYKGR 134

Query: 122 APRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDF 181
           AP+G KTNWVMHE+RLE K++     ++  K++ V+ RVFH    T  T           
Sbjct: 135 APKGVKTNWVMHEYRLEGKFA-IDNLSKTAKNECVISRVFHTR--TDGT-------KEHM 194

Query: 182 SSSLPPLMDPS 193
           S  LPPLMD S
Sbjct: 195 SVGLPPLMDSS 194

BLAST of Cla012144 vs. Swiss-Prot
Match: NC100_ARATH (NAC domain-containing protein 100 OS=Arabidopsis thaliana GN=NAC100 PE=2 SV=1)

HSP 1 Score: 244.6 bits (623), Expect = 1.5e-63
Identity = 119/190 (62.63%), Postives = 145/190 (76.32%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEE++T+YL +K+ + +F+A AIGE DLNK EPW+LP  AKMGEKEWY
Sbjct: 14  MDLPPGFRFHPTDEELITHYLHKKVLDTSFSAKAIGEVDLNKSEPWELPWMAKMGEKEWY 73

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFC RDRKYPTG+RTNRAT+ GYWKATGKD+EI++GKS L GMKKTLVFY+GRAP+G+KT
Sbjct: 74  FFCVRDRKYPTGLRTNRATEAGYWKATGKDKEIYRGKS-LVGMKKTLVFYRGRAPKGQKT 133

Query: 129 NWVMHEFRLEPKYS-HFLRHARPVKDDWVVFRVFHKNP-----TTSSTTLVGRIQTSDFS 188
           NWVMHE+RLE K+S H L   +  K++WV+ RVF K+        SS   +G + T    
Sbjct: 134 NWVMHEYRLEGKFSAHNL--PKTAKNEWVICRVFQKSAGGKKIPISSLIRIGSLGTDFNP 193

Query: 189 SSLPPLMDPS 193
           S LP L D S
Sbjct: 194 SLLPSLTDSS 200

BLAST of Cla012144 vs. Swiss-Prot
Match: NAC98_ARATH (Protein CUP-SHAPED COTYLEDON 2 OS=Arabidopsis thaliana GN=NAC098 PE=1 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 8.2e-62
Identity = 111/161 (68.94%), Postives = 130/161 (80.75%), Query Frame = 1

Query: 11  LPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWYFF 70
           LPPGFRFHPTDEE++T+YL +K+ +  F++ AI E DLNKCEPW LP +AKMGEKEWYFF
Sbjct: 17  LPPGFRFHPTDEELITHYLLRKVLDGCFSSRAIAEVDLNKCEPWQLPGRAKMGEKEWYFF 76

Query: 71  CQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKS-VLAGMKKTLVFYKGRAPRGEKTN 130
             RDRKYPTG+RTNRAT+ GYWKATGKDREIF  K+  L GMKKTLVFYKGRAP+GEK+N
Sbjct: 77  SLRDRKYPTGLRTNRATEAGYWKATGKDREIFSSKTCALVGMKKTLVFYKGRAPKGEKSN 136

Query: 131 WVMHEFRLEPKYS-HFLRHARPVKDDWVVFRVFHKNPTTSS 170
           WVMHE+RLE K+S HF+  +R  KD+WV+ RVF K    S+
Sbjct: 137 WVMHEYRLEGKFSYHFI--SRSSKDEWVISRVFQKTTLAST 175

BLAST of Cla012144 vs. TrEMBL
Match: A0A0A0KWG9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G193250 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 2.0e-131
Identity = 251/335 (74.93%), Postives = 270/335 (80.60%), Query Frame = 1

Query: 1   MEEPP-STAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQK 60
           MEEP   T VNLPPGFRFHPTDEEIVTYYL+QKI +  F A AIGEADLNKCEPWDLPQK
Sbjct: 1   MEEPQLMTTVNLPPGFRFHPTDEEIVTYYLAQKIIDAAFNAAAIGEADLNKCEPWDLPQK 60

Query: 61  AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYK 120
           AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREI+KGKSVLAGMKKTLVFYK
Sbjct: 61  AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIYKGKSVLAGMKKTLVFYK 120

Query: 121 GRAPRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQ-T 180
           GRAP+GEKTNWVMHEFRLEPK++HFLR  RPVKDDWVV RVFHKNPT ++TT + RIQ T
Sbjct: 121 GRAPKGEKTNWVMHEFRLEPKFAHFLRLPRPVKDDWVVCRVFHKNPTMAATTPIRRIQTT 180

Query: 181 SDFSSSLPPLMDPSA-SCIPINA-GFDDFEAKYRPPKPSDYNYLKYMSIG-TND----HQ 240
           SD SSSLPPL+DP A + IPIN+ GFDDFE K RP   SDY + KY+S   TND    HQ
Sbjct: 181 SDLSSSLPPLIDPPATTLIPINSGGFDDFEVKCRPSGQSDYRF-KYISTDTTNDRHHYHQ 240

Query: 241 PEAT--LPSLATSAAMTM-NVPFPSSVPDDEFFSFDQLAAGGTTSITPLPTTMERKMEQV 300
           P AT  LP+  T+AA TM NV +  SVPDD FFSFDQLAA G T     P TME KMEQ 
Sbjct: 241 PAATLPLPAATTTAATTMNNVSYAPSVPDDGFFSFDQLAAVGGTMPLTTPPTMECKMEQT 300

Query: 301 SWSTMSGVTQDMSPSIENSGYVAADMADLDLWDYY 324
           SWS MSGVTQD S SI+NSGY      DLD+WDYY
Sbjct: 301 SWSMMSGVTQDASSSIDNSGY------DLDVWDYY 328

BLAST of Cla012144 vs. TrEMBL
Match: B5TZH2_CUCMA (NAC-domain containing protein OS=Cucurbita maxima GN=NACP1 PE=2 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 3.7e-117
Identity = 220/326 (67.48%), Postives = 250/326 (76.69%), Query Frame = 1

Query: 1   MEEPPSTAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKA 60
           MEEPP  A++LPPGFRFHPTDEEIVTYYL  KI +  FTATAIGEADLNKCEPWDLP KA
Sbjct: 1   MEEPPPNALDLPPGFRFHPTDEEIVTYYLIHKITDAAFTATAIGEADLNKCEPWDLPHKA 60

Query: 61  KMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKG 120
           KMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKD+EI KG++VLAGMKKTLVFYKG
Sbjct: 61  KMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDKEILKGRTVLAGMKKTLVFYKG 120

Query: 121 RAPRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSD 180
           RAP+GEKTNWVMHEFRLEPK+  FL   +P+K DWVV RVFHKN TT++  +V +IQTSD
Sbjct: 121 RAPKGEKTNWVMHEFRLEPKFFQFLGFPKPIKADWVVCRVFHKN-TTNTVGVVKKIQTSD 180

Query: 181 FSSSLPPLMDPSASCIPINAGFDDFEAKYRPPKPSDYNYLKYMSIGTNDHQPEATLPSLA 240
           FSSSLPPL+DP+ +  PI+  FD+ E  +R   P D NY        ND+      P  A
Sbjct: 181 FSSSLPPLIDPTTAHTPISGRFDNGEVNWRLSVPFD-NY-------ANDYHYHR--PFSA 240

Query: 241 TSAAMTMNVPFPSSVPDDEFFSFDQLAAGGTTSI----TPLPTTMERKMEQVSWSTMSGV 300
           T+ A+TM   +PSSVPDDEFFSFDQL  GGT S+    T   TTME K+EQVSWSTMSGV
Sbjct: 241 TNTAVTMISSYPSSVPDDEFFSFDQLDVGGTMSMAAATTTTTTTMECKIEQVSWSTMSGV 300

Query: 301 TQDMSPSIENSGYVAADMADLDLWDY 323
           T ++S SI+N        A L+ WDY
Sbjct: 301 TPEISSSIDNE-------AALEFWDY 308

BLAST of Cla012144 vs. TrEMBL
Match: B9SHJ6_RICCO (NAC domain-containing protein 21/22, putative OS=Ricinus communis GN=RCOM_1122330 PE=4 SV=1)

HSP 1 Score: 302.8 bits (774), Expect = 5.2e-79
Identity = 180/374 (48.13%), Postives = 226/374 (60.43%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEEI+T+YL++K+ N  F+A AIGE DLNKCEPWDLP+KAKMGEKEWY
Sbjct: 14  IDLPPGFRFHPTDEEIITHYLTEKVMNSGFSACAIGEVDLNKCEPWDLPKKAKMGEKEWY 73

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFCQRDRKYPTGMRTNRAT +GYWKATGKD+EI+KGK+ L GMKKTLVFYKGRAP+GEKT
Sbjct: 74  FFCQRDRKYPTGMRTNRATDSGYWKATGKDKEIYKGKNCLVGMKKTLVFYKGRAPKGEKT 133

Query: 129 NWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDFS------ 188
           NWVMHE+RLE K+S++   ++  KDDWVV RVFHK+     T++   ++ + F       
Sbjct: 134 NWVMHEYRLEGKFSYY-NLSKAAKDDWVVCRVFHKSIGIKKTSIQDLLRVNSFGDDFLDY 193

Query: 189 SSLPPLMDPSASCIP---INAGFDDFEAKYRPPKPSDYNYLKYMS--IGTN------DHQ 248
           SSLPPLMDP  S  P    N+G DDF+A     +  D NYL   S  + TN       HQ
Sbjct: 194 SSLPPLMDPPNSSRPSSSFNSGDDDFKA--MTSRTMDGNYLSQFSTTMVTNHNQNYFHHQ 253

Query: 249 PEAT-------------LPSLATSAAMTMNVPFPSSVPDDEFFSFDQL---AAGGTTSIT 308
           P  +             +PS       T N+       +  F + DQ    A  G T I 
Sbjct: 254 PSNSSYQQQPSSIFYPQIPSFTFQT--TPNMSAAGYFQNSTFGANDQTLLRALAGNTRIE 313

Query: 309 PLPTTMERKMEQV----SWSTMS---GVTQDMSPSIENSGYVAAD--------------- 323
           P       K+EQ     S +T+S   G++ D++ + E S  V ++               
Sbjct: 314 PSRQEKLCKVEQFSSNHSMATLSQDTGLSTDVNTAAEISSVVVSEQEIGSNNKVYNDLDQ 373

BLAST of Cla012144 vs. TrEMBL
Match: A0A0M4FBP6_MANES (NAC transcription factors 15 OS=Manihot esculenta PE=2 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 2.4e-76
Identity = 155/312 (49.68%), Postives = 206/312 (66.03%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEEI+T+YL++K+ N  F+A AIGE DLNK EPWDLP+KAKMGEKEWY
Sbjct: 14  IDLPPGFRFHPTDEEIITHYLTEKVMNSCFSACAIGEVDLNKSEPWDLPKKAKMGEKEWY 73

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFCQRDRKYPTGMRTNRAT+ GYWKATGKD+EI+KGK+ L GMKKTLVFY+GRAP+GEKT
Sbjct: 74  FFCQRDRKYPTGMRTNRATEAGYWKATGKDKEIYKGKNCLVGMKKTLVFYRGRAPKGEKT 133

Query: 129 NWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDFS------ 188
           NWVMHE+RLE K+S++    +  KD+WVV RVFHK+     T++   ++ + F       
Sbjct: 134 NWVMHEYRLEGKFSYY-NLPKASKDEWVVCRVFHKSTGIKKTSIQDLLRVNSFGDDFLDY 193

Query: 189 SSLPPLMDPSASCIPINAGFDDFEAKYRPPKPSDYNYLKYMSIGTNDHQP---------E 248
           SSLPPLMDP     P ++ F+D + +++    ++ NYL +    ++   P          
Sbjct: 194 SSLPPLMDPPQYNRPGSSSFNDEDDEFKAMINNNQNYLHHQLPNSSYEAPITSTFYSQIP 253

Query: 249 ATLPSLATSAAMTMNVPFP-SSVPDDEFFSFDQLAAGGTTSITPLPTTMERKMEQVSWST 302
           A+ P        TM+  FP SS   +E      LAA   TS+      +E+     S +T
Sbjct: 254 ASSPLFTFQTTPTMSGYFPSSSFGANEQTILRALAANTETSVQEKHCKVEQFSSNQSVAT 313

BLAST of Cla012144 vs. TrEMBL
Match: A0A0M4FBT5_MANES (NAC transcription factors 78 OS=Manihot esculenta PE=2 SV=1)

HSP 1 Score: 290.8 bits (743), Expect = 2.0e-75
Identity = 147/266 (55.26%), Postives = 186/266 (69.92%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           V+LPPGFRFHP DEEI+T+YL++K+ N  F++ AIGE DLNKCEPWDLP+KAKMGEKEWY
Sbjct: 14  VDLPPGFRFHPADEEIITHYLTEKVMNSCFSSCAIGEVDLNKCEPWDLPKKAKMGEKEWY 73

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFCQRDRKYPTG RTNRAT+ GYWKATGKD+EI+KGK+ L GMKKTLVFY+GRAP+GEKT
Sbjct: 74  FFCQRDRKYPTGTRTNRATEAGYWKATGKDKEIYKGKNCLVGMKKTLVFYRGRAPKGEKT 133

Query: 129 NWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDFS------ 188
           NWVMHE+RLE K+S++    +  KD+WVV RVFHK+     T++   ++ + F       
Sbjct: 134 NWVMHEYRLEGKFSYYTL-PKASKDEWVVCRVFHKSTGIKKTSIQDLLRVNSFGDEFLDY 193

Query: 189 SSLPPLMDPSASCIPINAGF---DDFEAKYRPPKPSDYNYLKYMSIG-TNDHQPEATLPS 248
           SSLPPLMDP  S  P ++ F   DD E K    K    NY+ ++S    N++Q       
Sbjct: 194 SSLPPLMDPPQSSRPGSSSFNDEDDDEFKAITSKSLGGNYMPHLSTTMVNNNQSYLHQQQ 253

Query: 249 LATSAAMTMNVPFPSSVP-DDEFFSF 264
           L  S+  T +  F   +P  + F +F
Sbjct: 254 LPNSSYQTPSSVFYPQIPASNPFLTF 278

BLAST of Cla012144 vs. NCBI nr
Match: gi|659082665|ref|XP_008441966.1| (PREDICTED: NAC domain-containing protein 100-like [Cucumis melo])

HSP 1 Score: 482.3 bits (1240), Expect = 6.8e-133
Identity = 254/335 (75.82%), Postives = 272/335 (81.19%), Query Frame = 1

Query: 1   MEEPPS-TAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQK 60
           MEEP   T VNLPPGFRFHPTDEEIVTYYL+QKI +  FTATAIGEADLNKCEPWDLPQK
Sbjct: 1   MEEPQLITPVNLPPGFRFHPTDEEIVTYYLAQKIVDAAFTATAIGEADLNKCEPWDLPQK 60

Query: 61  AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYK 120
           AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYK
Sbjct: 61  AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYK 120

Query: 121 GRAPRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQ-T 180
           GRAP+GEKTNWVMHEFRLEPK++HFLR  RPVKDDWVV RVFHKNPT ++TT V RIQ T
Sbjct: 121 GRAPKGEKTNWVMHEFRLEPKFAHFLRLPRPVKDDWVVCRVFHKNPTMAATTPVRRIQTT 180

Query: 181 SDFSSSLPPLMDPSASC-IPINA-GFDDFEAKYRPPKPSDYNYLKYMSIGT--ND----H 240
           SDFSSSLPPL+DP A+  IPIN+ G DDFE K RP   SDY  LKY+S  +  ND    H
Sbjct: 181 SDFSSSLPPLIDPPATTHIPINSGGLDDFEVKCRPSGQSDY-CLKYISTDSTNNDRHHYH 240

Query: 241 QPEAT--LPSLATSAAMTMNVPFPSSVPDDEFFSFDQLAAGGTTSITPLPTTMERKMEQV 300
           QP AT  LP   T+AA TMNV +  SVPD+ FFSFDQLAA G T     P TME KMEQ 
Sbjct: 241 QPAATLPLPPATTTAATTMNVSYAPSVPDNGFFSFDQLAAVGGTMPLTTPPTMECKMEQT 300

Query: 301 SWSTMSGVTQDMSPSIENSGYVAADMADLDLWDYY 324
           SWS MSGVTQD+S SI+NSGY      DLD+WDYY
Sbjct: 301 SWSMMSGVTQDVSSSIDNSGY------DLDVWDYY 328

BLAST of Cla012144 vs. NCBI nr
Match: gi|449459638|ref|XP_004147553.1| (PREDICTED: NAC domain-containing protein 100-like [Cucumis sativus])

HSP 1 Score: 476.9 bits (1226), Expect = 2.9e-131
Identity = 251/335 (74.93%), Postives = 270/335 (80.60%), Query Frame = 1

Query: 1   MEEPP-STAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQK 60
           MEEP   T VNLPPGFRFHPTDEEIVTYYL+QKI +  F A AIGEADLNKCEPWDLPQK
Sbjct: 1   MEEPQLMTTVNLPPGFRFHPTDEEIVTYYLAQKIIDAAFNAAAIGEADLNKCEPWDLPQK 60

Query: 61  AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYK 120
           AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREI+KGKSVLAGMKKTLVFYK
Sbjct: 61  AKMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIYKGKSVLAGMKKTLVFYK 120

Query: 121 GRAPRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQ-T 180
           GRAP+GEKTNWVMHEFRLEPK++HFLR  RPVKDDWVV RVFHKNPT ++TT + RIQ T
Sbjct: 121 GRAPKGEKTNWVMHEFRLEPKFAHFLRLPRPVKDDWVVCRVFHKNPTMAATTPIRRIQTT 180

Query: 181 SDFSSSLPPLMDPSA-SCIPINA-GFDDFEAKYRPPKPSDYNYLKYMSIG-TND----HQ 240
           SD SSSLPPL+DP A + IPIN+ GFDDFE K RP   SDY + KY+S   TND    HQ
Sbjct: 181 SDLSSSLPPLIDPPATTLIPINSGGFDDFEVKCRPSGQSDYRF-KYISTDTTNDRHHYHQ 240

Query: 241 PEAT--LPSLATSAAMTM-NVPFPSSVPDDEFFSFDQLAAGGTTSITPLPTTMERKMEQV 300
           P AT  LP+  T+AA TM NV +  SVPDD FFSFDQLAA G T     P TME KMEQ 
Sbjct: 241 PAATLPLPAATTTAATTMNNVSYAPSVPDDGFFSFDQLAAVGGTMPLTTPPTMECKMEQT 300

Query: 301 SWSTMSGVTQDMSPSIENSGYVAADMADLDLWDYY 324
           SWS MSGVTQD S SI+NSGY      DLD+WDYY
Sbjct: 301 SWSMMSGVTQDASSSIDNSGY------DLDVWDYY 328

BLAST of Cla012144 vs. NCBI nr
Match: gi|204600790|gb|ACI01723.1| (NAC-domain containing protein [Cucurbita maxima])

HSP 1 Score: 429.5 bits (1103), Expect = 5.2e-117
Identity = 220/326 (67.48%), Postives = 250/326 (76.69%), Query Frame = 1

Query: 1   MEEPPSTAVNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKA 60
           MEEPP  A++LPPGFRFHPTDEEIVTYYL  KI +  FTATAIGEADLNKCEPWDLP KA
Sbjct: 1   MEEPPPNALDLPPGFRFHPTDEEIVTYYLIHKITDAAFTATAIGEADLNKCEPWDLPHKA 60

Query: 61  KMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKG 120
           KMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKD+EI KG++VLAGMKKTLVFYKG
Sbjct: 61  KMGEKEWYFFCQRDRKYPTGMRTNRATQTGYWKATGKDKEILKGRTVLAGMKKTLVFYKG 120

Query: 121 RAPRGEKTNWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSD 180
           RAP+GEKTNWVMHEFRLEPK+  FL   +P+K DWVV RVFHKN TT++  +V +IQTSD
Sbjct: 121 RAPKGEKTNWVMHEFRLEPKFFQFLGFPKPIKADWVVCRVFHKN-TTNTVGVVKKIQTSD 180

Query: 181 FSSSLPPLMDPSASCIPINAGFDDFEAKYRPPKPSDYNYLKYMSIGTNDHQPEATLPSLA 240
           FSSSLPPL+DP+ +  PI+  FD+ E  +R   P D NY        ND+      P  A
Sbjct: 181 FSSSLPPLIDPTTAHTPISGRFDNGEVNWRLSVPFD-NY-------ANDYHYHR--PFSA 240

Query: 241 TSAAMTMNVPFPSSVPDDEFFSFDQLAAGGTTSI----TPLPTTMERKMEQVSWSTMSGV 300
           T+ A+TM   +PSSVPDDEFFSFDQL  GGT S+    T   TTME K+EQVSWSTMSGV
Sbjct: 241 TNTAVTMISSYPSSVPDDEFFSFDQLDVGGTMSMAAATTTTTTTMECKIEQVSWSTMSGV 300

Query: 301 TQDMSPSIENSGYVAADMADLDLWDY 323
           T ++S SI+N        A L+ WDY
Sbjct: 301 TPEISSSIDNE-------AALEFWDY 308

BLAST of Cla012144 vs. NCBI nr
Match: gi|255568990|ref|XP_002525465.1| (PREDICTED: NAC domain-containing protein 92 [Ricinus communis])

HSP 1 Score: 302.8 bits (774), Expect = 7.4e-79
Identity = 180/374 (48.13%), Postives = 226/374 (60.43%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEEI+T+YL++K+ N  F+A AIGE DLNKCEPWDLP+KAKMGEKEWY
Sbjct: 14  IDLPPGFRFHPTDEEIITHYLTEKVMNSGFSACAIGEVDLNKCEPWDLPKKAKMGEKEWY 73

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFCQRDRKYPTGMRTNRAT +GYWKATGKD+EI+KGK+ L GMKKTLVFYKGRAP+GEKT
Sbjct: 74  FFCQRDRKYPTGMRTNRATDSGYWKATGKDKEIYKGKNCLVGMKKTLVFYKGRAPKGEKT 133

Query: 129 NWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDFS------ 188
           NWVMHE+RLE K+S++   ++  KDDWVV RVFHK+     T++   ++ + F       
Sbjct: 134 NWVMHEYRLEGKFSYY-NLSKAAKDDWVVCRVFHKSIGIKKTSIQDLLRVNSFGDDFLDY 193

Query: 189 SSLPPLMDPSASCIP---INAGFDDFEAKYRPPKPSDYNYLKYMS--IGTN------DHQ 248
           SSLPPLMDP  S  P    N+G DDF+A     +  D NYL   S  + TN       HQ
Sbjct: 194 SSLPPLMDPPNSSRPSSSFNSGDDDFKA--MTSRTMDGNYLSQFSTTMVTNHNQNYFHHQ 253

Query: 249 PEAT-------------LPSLATSAAMTMNVPFPSSVPDDEFFSFDQL---AAGGTTSIT 308
           P  +             +PS       T N+       +  F + DQ    A  G T I 
Sbjct: 254 PSNSSYQQQPSSIFYPQIPSFTFQT--TPNMSAAGYFQNSTFGANDQTLLRALAGNTRIE 313

Query: 309 PLPTTMERKMEQV----SWSTMS---GVTQDMSPSIENSGYVAAD--------------- 323
           P       K+EQ     S +T+S   G++ D++ + E S  V ++               
Sbjct: 314 PSRQEKLCKVEQFSSNHSMATLSQDTGLSTDVNTAAEISSVVVSEQEIGSNNKVYNDLDQ 373

BLAST of Cla012144 vs. NCBI nr
Match: gi|925170151|gb|ALC78992.1| (NAC transcription factors 15 [Manihot esculenta])

HSP 1 Score: 293.9 bits (751), Expect = 3.4e-76
Identity = 155/312 (49.68%), Postives = 206/312 (66.03%), Query Frame = 1

Query: 9   VNLPPGFRFHPTDEEIVTYYLSQKIANPTFTATAIGEADLNKCEPWDLPQKAKMGEKEWY 68
           ++LPPGFRFHPTDEEI+T+YL++K+ N  F+A AIGE DLNK EPWDLP+KAKMGEKEWY
Sbjct: 14  IDLPPGFRFHPTDEEIITHYLTEKVMNSCFSACAIGEVDLNKSEPWDLPKKAKMGEKEWY 73

Query: 69  FFCQRDRKYPTGMRTNRATQTGYWKATGKDREIFKGKSVLAGMKKTLVFYKGRAPRGEKT 128
           FFCQRDRKYPTGMRTNRAT+ GYWKATGKD+EI+KGK+ L GMKKTLVFY+GRAP+GEKT
Sbjct: 74  FFCQRDRKYPTGMRTNRATEAGYWKATGKDKEIYKGKNCLVGMKKTLVFYRGRAPKGEKT 133

Query: 129 NWVMHEFRLEPKYSHFLRHARPVKDDWVVFRVFHKNPTTSSTTLVGRIQTSDFS------ 188
           NWVMHE+RLE K+S++    +  KD+WVV RVFHK+     T++   ++ + F       
Sbjct: 134 NWVMHEYRLEGKFSYY-NLPKASKDEWVVCRVFHKSTGIKKTSIQDLLRVNSFGDDFLDY 193

Query: 189 SSLPPLMDPSASCIPINAGFDDFEAKYRPPKPSDYNYLKYMSIGTNDHQP---------E 248
           SSLPPLMDP     P ++ F+D + +++    ++ NYL +    ++   P          
Sbjct: 194 SSLPPLMDPPQYNRPGSSSFNDEDDEFKAMINNNQNYLHHQLPNSSYEAPITSTFYSQIP 253

Query: 249 ATLPSLATSAAMTMNVPFP-SSVPDDEFFSFDQLAAGGTTSITPLPTTMERKMEQVSWST 302
           A+ P        TM+  FP SS   +E      LAA   TS+      +E+     S +T
Sbjct: 254 ASSPLFTFQTTPTMSGYFPSSSFGANEQTILRALAANTETSVQEKHCKVEQFSSNQSVAT 313

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NAC79_ARATH4.7e-6565.43NAC domain-containing protein 79 OS=Arabidopsis thaliana GN=NAC079 PE=2 SV=1[more]
NAC92_ARATH2.3e-6464.21NAC domain-containing protein 92 OS=Arabidopsis thaliana GN=NAC92 PE=1 SV=1[more]
NAC59_ARATH1.1e-6363.87NAC domain-containing protein 59 OS=Arabidopsis thaliana GN=NAC59 PE=1 SV=1[more]
NC100_ARATH1.5e-6362.63NAC domain-containing protein 100 OS=Arabidopsis thaliana GN=NAC100 PE=2 SV=1[more]
NAC98_ARATH8.2e-6268.94Protein CUP-SHAPED COTYLEDON 2 OS=Arabidopsis thaliana GN=NAC098 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KWG9_CUCSA2.0e-13174.93Uncharacterized protein OS=Cucumis sativus GN=Csa_4G193250 PE=4 SV=1[more]
B5TZH2_CUCMA3.7e-11767.48NAC-domain containing protein OS=Cucurbita maxima GN=NACP1 PE=2 SV=1[more]
B9SHJ6_RICCO5.2e-7948.13NAC domain-containing protein 21/22, putative OS=Ricinus communis GN=RCOM_112233... [more]
A0A0M4FBP6_MANES2.4e-7649.68NAC transcription factors 15 OS=Manihot esculenta PE=2 SV=1[more]
A0A0M4FBT5_MANES2.0e-7555.26NAC transcription factors 78 OS=Manihot esculenta PE=2 SV=1[more]
Match NameE-valueIdentityDescription
gi|659082665|ref|XP_008441966.1|6.8e-13375.82PREDICTED: NAC domain-containing protein 100-like [Cucumis melo][more]
gi|449459638|ref|XP_004147553.1|2.9e-13174.93PREDICTED: NAC domain-containing protein 100-like [Cucumis sativus][more]
gi|204600790|gb|ACI01723.1|5.2e-11767.48NAC-domain containing protein [Cucurbita maxima][more]
gi|255568990|ref|XP_002525465.1|7.4e-7948.13PREDICTED: NAC domain-containing protein 92 [Ricinus communis][more]
gi|925170151|gb|ALC78992.1|3.4e-7649.68NAC transcription factors 15 [Manihot esculenta][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR003441NAC-dom
Vocabulary: Biological Process
TermDefinition
GO:0006355regulation of transcription, DNA-templated
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007275 multicellular organism development
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla012144Cla012144.1mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003441NAC domainPFAMPF02365NAMcoord: 12..137
score: 7.2
IPR003441NAC domainPROFILEPS51005NACcoord: 11..163
score: 57
IPR003441NAC domainunknownSSF101941NAC domaincoord: 8..164
score: 3.27
NoneNo IPR availablePANTHERPTHR31744FAMILY NOT NAMEDcoord: 9..206
score: 1.9E