Cp4.1LG01g21380 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g21380
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: NAC domain protein
Location: Cp4.1LG01 : 18046287 .. 18053687 (+)
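
As a rough illustration of how the Location field can be read, the sketch below computes the genomic span of the feature. It assumes the coordinates are 1-based and inclusive (the GFF convention); the function name `feature_length` is hypothetical and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

# Coordinates copied from the Location field of this record.
gene_length = feature_length(18046287, 18053687)
print(gene_length)  # 7401 under the inclusive-coordinate assumption
```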
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAGCCAACAAGTTTAGGTCGCGAGCCCTCTTACTGCGGAGAGAACGGCGAATGGCCTACTTCGATCTCCTGTTGCAGAAGTGTTCTTCCTTCTCGCAAATCAAGCAACTCCAAGCAAACCTCATCACCAATGGCCATTTTCACTTCTCTTCCTCTCGCACCAAGCTTCTCGAGCTCTGCGCCATCTCCCCCTTCGGCGACCTTTCTCATGCCCTCCATGTTTTCCGCCATATCCACTCCCCTTCCACCAAGGATTGGAACGCCGTCATTCGCGGCACCGCCTTGAGCTCCAATCCCTCAAATGCCATTTTCTGGTACAGAACCATGACCGCGTCAAATGGGCCTCATAGAGTCGACGCTCTCACCTGCTCCTTTGCCCTCAAAGCTTGTGCGCGTGCGTTGGCTCGTTCTGAAGTGATGCAATTGCATTCACAGGTTTTGCGATTTGGGTTCGATGCTGATGTTCTCCTGCAGACTACATTGCTTGATGCGTACGCAAAAGTTGAGGATCTTGATCAGGCCCAGAAGGTGTTCGACGAAATGCCAGAACCAGATATCGCCTCGTGGAATTCTCTGATTGCTGGGTTTGCTCAGGGGGGTCGACCAAGCGATGCTATAGATTTGTTTAAGAGAATGAAGGAGGATGGGAATTTGAGGCCCAATGAAGTAACCGTTCAAGGTGCTCTCTCGGCGTGTTCCCAATTGGGTACTTTAAAAGAAGGTGAGAATGTTCACAAATACATAGCAGAGGAGAATTTAGACACAGTTGTGCAGGTTTGTAATGTTGTTATTGATATGTATGCTAAATGTGGATCTGTGGATAAAGCTTATTGGGTGTTTGAGAATATGAGATGTGAGAAAAGTTTAATCACATGGAATACAATGATAATGGCATTTGCAATGCATGGTGATGGACACAAAGCGTTAGATCTTTTTGAAAAGTTGGGTCGATCTGGTATATATCCTGATGCAATATCATATCTAGCGGTGCTATGCGCCTGTAACCACGCAGGGCTCATAGAGGAAGGACTTAAGCTCTTCAATTCAATGGTGCAAAGGGGTGTGGCGCCAAATATAAAGCATTACGGAGTAGTGGTCGATTTGTTGGGTCGAGCTGGGCGTCTGAAAGAAGCTTATGACATTGTAAGTTCAATGCCTTTTCCTAATATGGTTCTGTGGCAGACTTTGCTTGGTGCTTGCAGGACTTATGGGGATGTAAAAATGGCAGAGATGGCATCTAGGAAGCTAGTAGAGATGGGCTTTATTAGCTGTGGTGATTTTGTTTTGTTGTCGAATGTGTATGCTGCTCGTCGGAGATGGGATGACGTTGGGCGAGTTAGGGATGCCATGAGAAGAAGGGATGTGAAGAAGACGCCTGGATTCAGTTACATAGAAGTGAAAGGTAATATGCACAAATTTCTATATGGCGATCGAAGCCATTCGAGTTGCCGTGAAATTTATGCAAAGCTTGATGAGATCATGTTTAGGATCAAAGCCATTGGATATACAGCTGAAACTGGTAATGTATTGCATGATATTGAGGAGGAGGACAAGGAGAACGTGCTGTGTTATCATAGTGAGAAGCTTGCTGTAGCCTTTGGATTATCTTGTACTGAAGAAGGGACCCCAATCCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTGATTAAGCTAATATCAAAGAGTTATAATCGAGAAATTTTTGTAAGGGACCGAACTCGATTTCACCGGTTTAAAGATGGTTTGTGTTCTTGCAAAGATTATTGGTGATACTTATTGGCGCAAAGGTTCTAAAATAGAGACGTTTTCTTGGTAAAAGTCCTGTATCTCACCGATCATGAGTTGGCTTAGAATAAGCGTCGGGAATGAGTTCAATTCGAGGTGAACACCTACTTAGGATTTAATGTCCTATGAGGATCAAACCGTTGTTCTGTGAGATTAGTTGAGGTACATGACTTTGAACGGATATGATATTCAAAAAGAAAAT
AATGCCCATATGGCAAGTGTAGGAATTATTTAGATATTTCGTTGATTCAAGGTCTAAGAGGCATGGGGCGTGTCGAGACCGAGGAAGAAGGCAGCTGTGGATAATGGAACATAGGAGTCGATACTGAGGAAGAAGGCAGCTGTGGATAATGGAATATAGGAGTCGATACCGAGGAAGAAGGCAGCTGTGGATAATGGAACATAGGAGTCGATACCCAGGAAGAAGGCAGCTGTGAATAATGGAACATAGGAGTCGAGACCAGCAGCTGTGCATAACGGAACATAGGAGTCGAGACCAGCAGCTGTGCATAACGGAACATAGGAGTCGAGACCGAGGAAGAAGGCGGCTGCGAATAACGGAACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGTATAATGGAGATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGAACATAGGAGTCGAGATCGAGTAAGAACGCAGCTGTGAATAATGAAACATAGAACTTGTAACTTGGATCAAGCGCTTTTGGTAGATTCATACAGAAAACGGCAACGAACAACAACAATGCGCTGTGAATCTGGTAAGTCTTTATTCATCGCTTACTTCCCTTTTTCCTATTTAGAAAAAGTAAAAAAAAATTTACCTTCCTAATACAGATACAAATGTCACTGTTAAAAAGTGAAGTGAAAATCTACTTGGTAGGATTCTTCCTAAGGGCTATAGAACTTCTAGACCTTAAGATAAATTTTATTTTTTTATAAAATATTGTTGAAAACAATATATATTTTATTATTATTTTTTAAATTTATAAAATAATTAAAATTCAAAGCAAATAAATACTTGGTCAAATGTTTAAAATGGGATATGTTTGTTTAAGTTTTTTAAAGCATATTTCACGGTGAATCAAAATAATCCTGTGCATCGCGAATCACATGAAGCATTCGTTTGAACGAATCAAACGGTGATCCAAGCACCATCTTTCATCAAGCCATTAATATCCTGCCACCTCAACACTGATCGATCCATTGTCCAGTGTAAGCAATTGACTCAGTCTCACCGGTCGTCGCCGCTCCAAAAAGAAAAAACAATAAATTGATAAAAATATTAAATTATGTACGGATTATTGTATTTTTAAAATTCTTTTCTTTTCTCATTTTTAAATATAATTCCAACAAAATACTATATTTTATAATTTTTTAAATCAAATATTAACATATACCTTAGTCATTAAGGTATAATTTTCTCGTGATTTTATATAATAAGAAAAGTTAATAATTATTTTTGAAATATTGTCATAATCTTAAAATTAGAAGTTTTTCATTATATATATAATTTATGTTGAGATATTAAAAATATAAATTGTTATAGTCACTCTTCTAAAATGTAGTCTCTCATAAAATTTTTAATGAACTAAAAAATAAATCTAAAATCAATAATTAAAATTAAAAAAAAAAAAAAAGAATTATAACACATATAACAAAAAAGATTTACCTAAATAAAATAATAATAATAAAATATCCGATCACTTCAGGAAGTTTCCGCGTATTACCGTAATAATAAGAAGCCCGCCACCACTCCCTTCGCTGCTTCATCCCCATTACTTGGTTTGACCAAATCAGCTCAACTCCGCCATCTCTCTCATTTTTTTTTTCTCCCTGCGACGGTCTTTCAGCCGTCGCTGGTGGTCGGAGTCTGCCGATCTTTCCCAAATTCAATTTTGATTGTGTGGATTCGTCGAGTTTTGGAGCGAATTTTTTCGCCCTCGCTAACGATT
TTGTCATAGGATTAGGAGTTTTGATTATTTATTTATTTATTTTATTGAGGTGTTGTTCGTAAGCGATTTGATGTCAACCGTGTCGTCGTCGATGGGATTGTGCGACGAGGGGGATTGGCCACCAGGGTTTCGGTTTCACCCGACAGATGAGGAGCTGGTTCTTTATTACTTAAAGTTCAAAGTCTGTAGACGCAAATTGAAGCTCGATATCATTCGCGAGATCGATGTTTACAAGTGGGAACCCGGGGAGTTGCCTGGTAAATAAACTTTCCTTCCCGCGCTTAATCTGATCGATTGATGATCTTGTGGTTTATCTTTGACCTATTCTGTGTTTGACTGAATTTGCTGTAGATTGATTTTATTTAACTGTTGAGGATGGATTTTGGGATCTTTCTGTACCTATTTGCGTGATTTTCGTTTGATTAGTTTCTTTGTTTGCTGATTTTGCTTTGCTAGAAGTCAAAGCTTTGTTCAGAAGATCACTCAAGTTCTTCATCAAATTTAAATTGCATCTTTCGGTACATTTTTCCCGTTCGGTTGTTTCTATAAGTCACGTATCTATTCTCATAAATCGTAGTTGTAGCTATTTTTTGTTGCTCCTTTCTGTTTTTCTTATGAATCATTTCTATTCAATGAGTTCTTATTACTGGAAGGAATCATGTAATAGCCGAAGACATAAGCCCCTTGTTTTTGGATTAGGAATTTAATTGATATAGCCAATTCATCATCTATATGGTAACTGATTATCGATAATAATCCTTTTTCCCATAATTTTAGAAAATTGGTTTCCCTCTCTCTGTAAATGCAAGTCTTGTTCTATGTTTTTGGATATTAATGAATATTGCTAGTTTTTCTGATCATGAATAAATGAATGCAGGTCAATCTAAGTTAAAAACTGGTGATAGACAATGGTTCTTTTTCAGTCCTAGGGAACGTAGATATCCAAATGCATCTAGATTAAGTAGAGCTACAAGAAATGGCCACTGGAAGGTCACGGGAAAGGATCGAATAATAAAGTACAATTCACGTAATGTTGGTGTGAAGAAGACCCTGGTGTTCTATCAAGGCCGTGCCCCTAATGGTGAGCGGACTGATTGGGTTATGCATGAGTATACCTTGGATGAAGATGAGCTCAAGAGGTGTAAGAATGTAAAGGTAACAGTGTGTTTAAGCTCTTCTCTTGCTTATTATTTTCTTGCAAGATCTTGTTTTACTAATCTTAAGCCTTTGTTGCAGGATTACTATGCGCTCTACAAACTTTACAAGAAAAGTGGACCTGGTCCTAAAAATGGTGAGCAATATGGAGCACCATTTAAAGAAGAAAATTGGAATGATGGTGAATGCCAAGTTTTAGTTCACTCCGACGCTCAGGAGCCACAAGTGAACATACTTGATGAGGTTACCTCTGTCGATCGTGAAAGAGCGAACGTTCAAATACAGCTTTCATCAGATGATATTGAGGAACTTTTGCAACAATTTACAAATGATCCTGTCCTAGAGTTGCCATCAGTCAGTGGTATTCATCAATTTGACTCGGTCGTGCAGGTTGGCTTCCCTGAACATTGCATTCTGATATTCTGTTCCCATTTCCATGTTAATTTACCTTGGTGAATAGACTGGTCATCACTTTGAGATAAAAATGTTTCTTATTCGATTGATCATATAGCTGTAGGAATTGGGAACATCGGAACGTATGAACTGCATATCTAAACTTCTAAGTGTCTATTGTCCATAATATTTAGAATCTTTGACATATATGACATATATGCTGTATTTCATTGGATATTTTGTCGGCCATTCCTTTATATGCTTTATTTGTTCAGTTTATCAGTTAGCTTATGATTTGTGTACTTTATTTGTCTGTTTAGGTTGATGACAAAGAAGAAACTGCAAGTACTATGATTGATACTTACTCTCAGAATCACATACTTCCTGAAGCCGATAAAGTACTAAACTTAAGTGGCCAGCCAAGTGATTTGCATCCATGCTTCCAATTTACC
CAGGCAGGTCTCTTCCAATTGCAATCCTTTGAGAGCGAGGTATCATCGTCTCCCAAATATCGTGAAGAAGAAGATTTCTTGGAGATCGATGATCTCATTGGTCCTGAGCCAACTCTCGTGGCAAATGTCAACCCTTTGGGAAATTCAGAGTTCGATGGATTGAACGAGTTAGAGCTGTTCCATGATGCAAATATGTTTCTTCGCGACTTGGGACCTGTCGTCCCAGAAACGTTTTTAGATCCATATTTGAGTGCTGACGGTGGTATTCTTGTTGCCAACGATGTGAATGGCCATTTGCAATATGATCAGATGGGCAATGGGTTCTGGGAGAATGAGACAGAAAATTCCTTCAGCTTCCCAGAACCGCATCAACAGTTTGGTACTCAACCAAATTTAGGTATATAATTCAGCTCAAACTTGATATCTGCCCAGTTTTAGCAAACATTAAGCAGTCTACTTTTACTAAGCTGTTTCTTTCTGTTCGATTTTCGCGAAGTTGCTGGCTCGATTAGCTCATCGACACAGGGCTCTTGCTCAGATAAGTGGGGTTTGTTTGATAAGCCATTCTATACTTCAAATGGCAAGAGACCCCTCTTGTGAAATATTGAGCCAGCTTCGACAGATACCTAGAACGAGTATTCTTCTCGTACATTGTGTATACTTCTCGTACATTATCGAATTCACTGATACGAAAGGAGCTATTCTCCAATTATACTTGTCAACCTCATCATAAACTGCCATTTTGGTGCCAATTCCAGGTGCCATTTTATCAATGTGAAGCAGTGGGGCAGTTGCTAATTATGAAACTTTCTGGTCTTGTTTAGGTGTGGGATATGAATCTGTAAGTTCTGCAGCACCAGAAATAAGGGATAACCAAATAGCAAACGGTGGAGGCAGTTCTGCAAGTAAGTTCTCTTCCAATTTGTGGGCCTTTGTAGAGTCGATACCGACCACCCCCGCTTCAGCTTCCGAGAACGTCAACCGCACTTTTCAGAGGATGTCTAGCTTTAGCAGATTGAGACTAAATACCCTGAACACCTTTAACGCCAATGTCGCCATAGGTAATCCCAAAACTAGCGCGAGGAGAACGGGTACGAATAAGGGATTCTTTTTATTTTCAATTCTTGGAGTATTGTGTGCCATTCTATGGGTGTTGTTAGGAGATGTTAGATTACAGGAAAGAGGCATTGCCTCATGAATCTGCTTCATTAAAATTCTGCCATCTTCTTTTATACCTTTATTATATGCATATCTTATAGTTCCAATCTTGGTGGAAACTGAATTTTGTATGTAAATTATAGCTCAGTGATAGATGAAATGAGAGTTATAAGTCTCAATGGTTACGAGGAAATATTAGTGATCGAATTCAATTCATAGGTCTATCTCAATTACAGTTTTTTC

mRNA sequence

CAAAGCCAACAAGTTTAGGTCGCGAGCCCTCTTACTGCGGAGAGAACGGCGAATGGCCTACTTCGATCTCCTGTTGCAGAAGTGTTCTTCCTTCTCGCAAATCAAGCAACTCCAAGCAAACCTCATCACCAATGGCCATTTTCACTTCTCTTCCTCTCGCACCAAGCTTCTCGAGCTCTGCGCCATCTCCCCCTTCGGCGACCTTTCTCATGCCCTCCATGTTTTCCGCCATATCCACTCCCCTTCCACCAAGGATTGGAACGCCGTCATTCGCGGCACCGCCTTGAGCTCCAATCCCTCAAATGCCATTTTCTGGTACAGAACCATGACCGCGTCAAATGGGCCTCATAGAGTCGACGCTCTCACCTGCTCCTTTGCCCTCAAAGCTTGTGCGCGTGCGTTGGCTCGTTCTGAAGTGATGCAATTGCATTCACAGGTTTTGCGATTTGGGTTCGATGCTGATGTTCTCCTGCAGACTACATTGCTTGATGCGTACGCAAAAGTTGAGGATCTTGATCAGGCCCAGAAGGTGTTCGACGAAATGCCAGAACCAGATATCGCCTCGTGGAATTCTCTGATTGCTGGGTTTGCTCAGGGGGGTCGACCAAGCGATGCTATAGATTTGTTTAAGAGAATGAAGGAGGATGGGAATTTGAGGCCCAATGAAGTAACCGTTCAAGGTGCTCTCTCGGCGTGTTCCCAATTGGGTACTTTAAAAGAAGGTGAGAATGTTCACAAATACATAGCAGAGGAGAATTTAGACACAGTTGTGCAGGTTTGTAATGTTGTTATTGATATGTATGCTAAATGTGGATCTGTGGATAAAGCTTATTGGGTGTTTGAGAATATGAGATGTGAGAAAAGTTTAATCACATGGAATACAATGATAATGGCATTTGCAATGCATGGTGATGGACACAAAGCGTTAGATCTTTTTGAAAAGTTGGGTCGATCTGGTATATATCCTGATGCAATATCATATCTAGCGGTGCTATGCGCCTGTAACCACGCAGGGCTCATAGAGGAAGGACTTAAGCTCTTCAATTCAATGGTGCAAAGGGGTGTGGCGCCAAATATAAAGCATTACGGAGTAGTGGTCGATTTGTTGGGTCGAGCTGGGCGTCTGAAAGAAGCTTATGACATTGTAAGTTCAATGCCTTTTCCTAATATGGTTCTGTGGCAGACTTTGCTTGGTGCTTGCAGGACTTATGGGGATGTAAAAATGGCAGAGATGGCATCTAGGAAGCTAGTAGAGATGGGCTTTATTAGCTGTGGTGATTTTGTTTTGTTGTCGAATGTGTATGCTGCTCGTCGGAGATGGGATGACGTTGGGCGAGTTAGGGATGCCATGAGAAGAAGGGATGTGAAGAAGACGCCTGGATTCAGTTACATAGAAGTGAAAGGTAATATGCACAAATTTCTATATGGCGATCGAAGCCATTCGAGTTGCCGTGAAATTTATGCAAAGCTTGATGAGATCATGTTTAGGATCAAAGCCATTGGATATACAGCTGAAACTGGTAATGTATTGCATGATATTGAGGAGGAGGACAAGGAGAACGTGCTGTGTTATCATAGTGAGAAGCTTGCTGTAGCCTTTGGATTATCTTGTACTGAAGAAGGGACCCCAATCCAAGTGATTAAGAATTTAAGGATTTGTGGGGATTGTCATGTTGTGATTAAGCTAATATCAAAGAGTTATAATCGAGAAATTTTTGTAAGGGACCGAACTCGATTTCACCGGTTTAAAGATGGTTTGTGTTCTTGCAAAGATTATTGGTGATACTTATTGGCGCAAAGGTTCTAAAATAGAGACGTTTTCTTGGTAAAAGTCCTGTATCTCACCGATCATGAGTTGGCTTAGAATAAGCGTCGGGAATGAGTTCAATTCGAGGTGAACACCTACTTAGGATTTAATGTCCTATGAGGATCAAACCGTTGTTCTGTGAGATTAGTTGAGGTACATGACTTTGAACGGATATGATATTCAAAAAGAAAAT
AATGCCCATATGGCAAGTGTAGGAATTATTTAGATATTTCGTTGATTCAAGGTCTAAGAGGCATGGGGCGTGTCGAGACCGAGGAAGAAGGCAGCTGTGCATAACGGAACATAGGAGTCGAGACCGAGGAAGAAGGCGGCTGCGAATAACGGAACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGTATAATGGAGATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGGAGTCGAGACCGAGGAAGAAAGCAGCTGCGAATAATGGACATAGAACATAGGAGTCGAGATCGAGTAAGAACGCAGCTGTGAATAATGAAACATAGAACTTGTAACTTGGATCAAGCGCTTTTGGTAGATTCATACAGAAAACGGCAACGAACAACAACAATGCGCTGTGAATCTGGTGTTGTTCGTAAGCGATTTGATGTCAACCGTGTCGTCGTCGATGGGATTGTGCGACGAGGGGGATTGGCCACCAGGGTTTCGGTTTCACCCGACAGATGAGGAGCTGGTTCTTTATTACTTAAAGTTCAAAGTCTGTAGACGCAAATTGAAGCTCGATATCATTCGCGAGATCGATGTTTACAAGTGGGAACCCGGGGAGTTGCCTGGTCAATCTAAGTTAAAAACTGGTGATAGACAATGGTTCTTTTTCAGTCCTAGGGAACGTAGATATCCAAATGCATCTAGATTAAGTAGAGCTACAAGAAATGGCCACTGGAAGGTCACGGGAAAGGATCGAATAATAAAGTACAATTCACGTAATGTTGGTGTGAAGAAGACCCTGGTGTTCTATCAAGGCCGTGCCCCTAATGGTGAGCGGACTGATTGGGTTATGCATGAGTATACCTTGGATGAAGATGAGCTCAAGAGGTGTAAGAATGTAAAGGATTACTATGCGCTCTACAAACTTTACAAGAAAAGTGGACCTGGTCCTAAAAATGGTGAGCAATATGGAGCACCATTTAAAGAAGAAAATTGGAATGATGGTGAATGCCAAGTTTTAGTTCACTCCGACGCTCAGGAGCCACAAGTGAACATACTTGATGAGGTTACCTCTGTCGATCGTGAAAGAGCGAACGTTCAAATACAGCTTTCATCAGATGATATTGAGGAACTTTTGCAACAATTTACAAATGATCCTGTCCTAGAGTTGCCATCAGTCAGTGGTATTCATCAATTTGACTCGGTCGTGCAGGTTGATGACAAAGAAGAAACTGCAAGTACTATGATTGATACTTACTCTCAGAATCACATACTTCCTGAAGCCGATAAAGTACTAAACTTAAGTGGCCAGCCAAGTGATTTGCATCCATGCTTCCAATTTACCCAGGCAGGTCTCTTCCAATTGCAATCCTTTGAGAGCGAGGTATCATCGTCTCCCAAATATCGTGAAGAAGAAGATTTCTTGGAGATCGATGATCTCATTGGTCCTGAGCCAACTCTCGTGGCAAATGTCAACCCTTTGGGAAATTCAGAGTTCGATGGATTGAACGAGTTAGAGCTGTTCCATGATGCAAATATGTTTCTTCGCGACTTGGGACCTGTCGTCCCAGAAACGTTTTTAGATCCATATTTGAGTGCTGACGGTGGTATTCTTGTTGCCAACGATGTGAATGGCCATTTGCAATATGATCAGATGGGCAATGGGTTCTGGGAGAATGAGACAGAAAATTCCTTCAGCTTCCCAGAACCGCATCAACAGTTTGGTACTCAACCAAATTTAGGTGTGGGATATGAATCTGTAAGTTCTGCAGCACCAGAAATAAGGGATAACCAAATAGCAAACGGTGGAGGCAGTTCT
GCAAGTAAGTTCTCTTCCAATTTGTGGGCCTTTGTAGAGTCGATACCGACCACCCCCGCTTCAGCTTCCGAGAACGTCAACCGCACTTTTCAGAGGATGTCTAGCTTTAGCAGATTGAGACTAAATACCCTGAACACCTTTAACGCCAATGTCGCCATAGGTAATCCCAAAACTAGCGCGAGGAGAACGGGTACGAATAAGGGATTCTTTTTATTTTCAATTCTTGGAGTATTGTGTGCCATTCTATGGGTGTTGTTAGGAGATGTTAGATTACAGGAAAGAGGCATTGCCTCATGAATCTGCTTCATTAAAATTCTGCCATCTTCTTTTATACCTTTATTATATGCATATCTTATAGTTCCAATCTTGGTGGAAACTGAATTTTGTATGTAAATTATAGCTCAGTGATAGATGAAATGAGAGTTATAAGTCTCAATGGTTACGAGGAAATATTAGTGATCGAATTCAATTCATAGGTCTATCTCAATTACAGTTTTTTC

Coding sequence (CDS)

ATGTCAACCGTGTCGTCGTCGATGGGATTGTGCGACGAGGGGGATTGGCCACCAGGGTTTCGGTTTCACCCGACAGATGAGGAGCTGGTTCTTTATTACTTAAAGTTCAAAGTCTGTAGACGCAAATTGAAGCTCGATATCATTCGCGAGATCGATGTTTACAAGTGGGAACCCGGGGAGTTGCCTGGTCAATCTAAGTTAAAAACTGGTGATAGACAATGGTTCTTTTTCAGTCCTAGGGAACGTAGATATCCAAATGCATCTAGATTAAGTAGAGCTACAAGAAATGGCCACTGGAAGGTCACGGGAAAGGATCGAATAATAAAGTACAATTCACGTAATGTTGGTGTGAAGAAGACCCTGGTGTTCTATCAAGGCCGTGCCCCTAATGGTGAGCGGACTGATTGGGTTATGCATGAGTATACCTTGGATGAAGATGAGCTCAAGAGGTGTAAGAATGTAAAGGATTACTATGCGCTCTACAAACTTTACAAGAAAAGTGGACCTGGTCCTAAAAATGGTGAGCAATATGGAGCACCATTTAAAGAAGAAAATTGGAATGATGGTGAATGCCAAGTTTTAGTTCACTCCGACGCTCAGGAGCCACAAGTGAACATACTTGATGAGGTTACCTCTGTCGATCGTGAAAGAGCGAACGTTCAAATACAGCTTTCATCAGATGATATTGAGGAACTTTTGCAACAATTTACAAATGATCCTGTCCTAGAGTTGCCATCAGTCAGTGGTATTCATCAATTTGACTCGGTCGTGCAGGTTGATGACAAAGAAGAAACTGCAAGTACTATGATTGATACTTACTCTCAGAATCACATACTTCCTGAAGCCGATAAAGTACTAAACTTAAGTGGCCAGCCAAGTGATTTGCATCCATGCTTCCAATTTACCCAGGCAGGTCTCTTCCAATTGCAATCCTTTGAGAGCGAGGTATCATCGTCTCCCAAATATCGTGAAGAAGAAGATTTCTTGGAGATCGATGATCTCATTGGTCCTGAGCCAACTCTCGTGGCAAATGTCAACCCTTTGGGAAATTCAGAGTTCGATGGATTGAACGAGTTAGAGCTGTTCCATGATGCAAATATGTTTCTTCGCGACTTGGGACCTGTCGTCCCAGAAACGTTTTTAGATCCATATTTGAGTGCTGACGGTGGTATTCTTGTTGCCAACGATGTGAATGGCCATTTGCAATATGATCAGATGGGCAATGGGTTCTGGGAGAATGAGACAGAAAATTCCTTCAGCTTCCCAGAACCGCATCAACAGTTTGGTACTCAACCAAATTTAGGTGTGGGATATGAATCTGTAAGTTCTGCAGCACCAGAAATAAGGGATAACCAAATAGCAAACGGTGGAGGCAGTTCTGCAAGTAAGTTCTCTTCCAATTTGTGGGCCTTTGTAGAGTCGATACCGACCACCCCCGCTTCAGCTTCCGAGAACGTCAACCGCACTTTTCAGAGGATGTCTAGCTTTAGCAGATTGAGACTAAATACCCTGAACACCTTTAACGCCAATGTCGCCATAGGTAATCCCAAAACTAGCGCGAGGAGAACGGGTACGAATAAGGGATTCTTTTTATTTTCAATTCTTGGAGTATTGTGTGCCATTCTATGGGTGTTGTTAGGAGATGTTAGATTACAGGAAAGAGGCATTGCCTCATGA

Protein sequence

MSTVSSSMGLCDEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELPSVSGIHQFDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQLQSFESEVSSSPKYREEEDFLEIDDLIGPEPTLVANVNPLGNSEFDGLNELELFHDANMFLRDLGPVVPETFLDPYLSADGGILVANDVNGHLQYDQMGNGFWENETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEIRDNQIANGGGSSASKFSSNLWAFVESIPTTPASASENVNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSARRTGTNKGFFLFSILGVLCAILWVLLGDVRLQERGIAS
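
The relationship between the CDS and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code; `translate`, `CODON_TABLE` and the other names are hypothetical helpers, and only the first 48 nt of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 48 nt of the CDS listed in this record.
cds_start = "ATGTCAACCGTGTCGTCGTCGATGGGATTGTGCGACGAGGGGGATTGG"
print(translate(cds_start))  # MSTVSSSMGLCDEGDW
```

Passing the full CDS string instead of `cds_start` should reproduce the complete protein sequence, provided the record is internally consistent.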
BLAST of Cp4.1LG01g21380 vs. Swiss-Prot
Match: NAC17_ARATH (NAC domain-containing protein 17 OS=Arabidopsis thaliana GN=NAC017 PE=2 SV=1)

HSP 1 Score: 406.8 bits (1044), Expect = 3.9e-112
Identity = 258/575 (44.87%), Positives = 343/575 (59.65%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV+YYLK K+CR++L++++I  +DVYK +P ELPGQS LKTGDRQWF+F
Sbjct: 18  PGFRFHPTDEELVMYYLKRKICRKRLRVNVIGVVDVYKMDPEELPGQSMLKTGDRQWFYF 77

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           +PR R+YPNA+R +R T NG+WK TGKDR+I+YNSR+VG+KKTLVFY+GRAP+GERTDWV
Sbjct: 78  TPRSRKYPNAARSNRGTENGYWKATGKDRVIEYNSRSVGLKKTLVFYRGRAPSGERTDWV 137

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENW--NDGECQVLV 197
           MHEYT+DEDEL RCKN ++YYALYKL+KKSG GPKNGEQYGAPF+EE W  +D E    +
Sbjct: 138 MHEYTMDEDELGRCKNPQEYYALYKLFKKSGAGPKNGEQYGAPFQEEEWVDDDNEDVNAI 197

Query: 198 HSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELPSVSGIHQFDS 257
                E  V   ++   VD  R    + L  +DI+ELL    N P        G+ Q   
Sbjct: 198 AVAVPEQPVVRYEDARRVDERRLFNPVILQLEDIDELLNGIPNAP--------GVPQ-RC 257

Query: 258 VVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQLQSFE-S 317
           + QV+ +EE  ST+++  S    LP        +GQ        Q+ +   F   S E +
Sbjct: 258 IPQVNSEEELQSTLVNN-SAREFLP--------NGQ--------QYNRPSSF--DSLETA 317

Query: 318 EVSSSPKYREEEDFLEIDDLI-----GPEPTLVANVNPLGNSEFDGLNEL-ELFHDANMF 377
           EV+S+P   E+EDF+E+DDL+     G   T  A      + EFD  NE  +LFHD +M 
Sbjct: 318 EVTSAPLVFEKEDFIEMDDLLLIPEFGASSTEKA-AQFSNHGEFDDFNEFDQLFHDVSMS 377

Query: 378 LRDLGPVVPETFLDPYLSADGGILVANDVNGHLQYDQMGNGFWENETEN----------- 437
           L D+ P+   T  +   S        +D    L Y Q  +   EN+  N           
Sbjct: 378 L-DMEPIDQGTSAN-LSSLSDSANYTSDQKQQLLYQQFQDQTPENQLNNIMDPSTTLNQI 437

Query: 438 ----------SFSFPEPHQQFG--TQPNLGVGYESVS-SAAPEIRDNQIANGGGSSASKF 497
                     +  F +     G    P+ GV  +S + + +   + ++I NGGG++ S+F
Sbjct: 438 TSDIWFEDDQAILFDQQQSFSGAFASPSSGVMPDSTNPTMSVNAQGHEIQNGGGTT-SQF 497

Query: 498 SSNLWAFVESIPTTPASASEN-VNRTFQRMSSFSRLRLNTLNTFNANVAIGNP--KTSAR 557
           SS LWA ++SIP+TPASA E  +NRTF RMSSFSR+R N         A G P   T A+
Sbjct: 498 SSALWALMDSIPSTPASACEGPLNRTFVRMSSFSRMRFN-------GKANGTPVSTTIAK 553

BLAST of Cp4.1LG01g21380 vs. Swiss-Prot
Match: NAC16_ARATH (NAC domain-containing protein 16 OS=Arabidopsis thaliana GN=NAC016 PE=2 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 1.8e-109
Identity = 245/573 (42.76%), Positives = 335/573 (58.46%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV+YYLK K+C +KL+++ I  +DVYK +P ELPG S LKTGDRQWFFF
Sbjct: 18  PGFRFHPTDEELVVYYLKRKICCKKLRVNAIGVVDVYKVDPSELPGLSMLKTGDRQWFFF 77

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           +PR R+YPNA+R SR T  G+WK TGKDR+I+YNSR+VG+KKTLVFY+GRAPNGERTDWV
Sbjct: 78  TPRNRKYPNAARSSRGTATGYWKATGKDRVIEYNSRSVGLKKTLVFYRGRAPNGERTDWV 137

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLVHS 197
           MHEYT+DE+EL RCKN K+YYALYKLYKKSG GPKNGEQYGAPF+EE W D + +     
Sbjct: 138 MHEYTMDEEELGRCKNAKEYYALYKLYKKSGAGPKNGEQYGAPFQEEEWVDSDSEDADSV 197

Query: 198 DAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELPSVSGIHQ----- 257
              +  V   +    VD  +    ++L  +DIE+LL         E+P   G++Q     
Sbjct: 198 AVPDYPVVRYENGPCVDDTKFCNPVKLQLEDIEKLLN--------EIPDAPGVNQRQFDE 257

Query: 258 FDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQLQSF 317
           F  V Q +  E   ST+++  S  +I P  + +   +GQ  +    FQ        L SF
Sbjct: 258 FVGVPQGNSAEVIQSTLLNNSSGEYIDPRTNGMFLPNGQLYNRDSSFQ------SHLNSF 317

Query: 318 ESEVSSSPKY-REEEDFLEIDDLIGPE---PTLVANVNPLGNSEFDGLNEL-ELFHDANM 377
           E+    +P    E+E+++E++DL+ PE    +   +   L + EF  +NE  +LF+D ++
Sbjct: 318 EATSGMAPLLDNEKEEYIEMNDLLIPELGASSTEKSTEFLNHGEFGDVNEYDQLFNDISV 377

Query: 378 FLRDLGPVVPETFLDPYLSADGG------------ILVANDVNGHLQ----YDQMGNGFW 437
           F    G     + L  + +   G                N +N ++      +Q  +  W
Sbjct: 378 F---QGTSTDLSCLSNFTNNTSGQRQQLLYEQFQYQTPENQLNNYMHPSTTLNQFTDNMW 437

Query: 438 ENETENSFSFPEPHQQFG--TQPNLGVGYESVS---SAAPEIRDNQIANGGGSSASKFSS 497
             + + +     P    G  T  + GV  ES++   S  P+ ++ Q  NGGG + S+FSS
Sbjct: 438 FKDDQAALYVQPPQSSSGAFTSQSTGVMPESMNPTMSVNPQYKEGQ--NGGG-TRSQFSS 497

Query: 498 NLWAFVESIPTTPASASEN-VNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSARRTGT 557
            LW  +ESIP+TPASA E  +N+TF RMSSFSR+R      FN         T A++  +
Sbjct: 498 ALWELLESIPSTPASACEGPLNQTFVRMSSFSRIR------FNGTSVTSRKVTVAKKRIS 557

Query: 558 NKGFFLFSILGVLCAILWVLLGDVRLQERGIAS 559
           N+GF L SI+G LCAI WV    V +  R + S
Sbjct: 558 NRGFLLLSIMGALCAIFWVFKATVGVMGRPLLS 564

BLAST of Cp4.1LG01g21380 vs. Swiss-Prot
Match: NAC13_ARATH (NAC domain-containing protein 13 OS=Arabidopsis thaliana GN=NAC13 PE=1 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 6.0e-68
Identity = 141/260 (54.23%), Positives = 179/260 (68.85%), Query Frame = 1

Query: 12  DEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGD 71
           + G   PGFRFHPTDEELV+YYLK K+ R+KL+++ I E DVYK++P ELP ++  KT D
Sbjct: 6   ENGGLAPGFRFHPTDEELVVYYLKRKIRRKKLRVEAIGETDVYKFDPEELPEKALYKTRD 65

Query: 72  RQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNG 131
           RQWFFFS R+R++   SR SRAT  G+WK TGKDR+I  +SR VG KKTLVF++GRAPNG
Sbjct: 66  RQWFFFSLRDRKH--GSRSSRATERGYWKATGKDRVIHCDSRPVGEKKTLVFHRGRAPNG 125

Query: 132 ERTDWVMHEYTLDEDELKRC--KNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDG 191
           ERT+WVMHEYTL ++ELKRC  ++VKD Y LYK+YKKSG GPKNGEQYGAPF EE W + 
Sbjct: 126 ERTNWVMHEYTLHKEELKRCGGEDVKDAYVLYKIYKKSGSGPKNGEQYGAPFIEEEWAED 185

Query: 192 ECQVLVHSDAQEPQVNILDEVTSVDRE---RANVQIQLSSDDIEELLQQFTND--PVLEL 251
           +       D  EP  N L    SVD     +   Q +L  +DIEEL+ Q  +   P L+ 
Sbjct: 186 D-----DDDVDEP-ANQLVVSASVDNSLWGKGLNQSELDDNDIEELMSQVRDQSGPTLQQ 245

Query: 252 PSVSGIHQFDSVVQVDDKEE 265
             VSG++       +++ EE
Sbjct: 246 NGVSGLNSHVDTYNLENLEE 257

BLAST of Cp4.1LG01g21380 vs. Swiss-Prot
Match: NAC78_ARATH (NAC domain-containing protein 78 OS=Arabidopsis thaliana GN=NAC078 PE=2 SV=2)

HSP 1 Score: 222.6 bits (566), Expect = 1.1e-56
Identity = 103/193 (53.37%), Positives = 138/193 (71.50%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV YYLK KVC +  K D I   D+YK EP +LP +SKLK+ D +W+FF
Sbjct: 11  PGFRFHPTDEELVRYYLKRKVCNKPFKFDAISVTDIYKSEPWDLPDKSKLKSRDLEWYFF 70

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           S  +++Y N S+ +RAT  G+WK TGKDR I+  SR VG+KKTLV+++GRAP GERT+WV
Sbjct: 71  SMLDKKYSNGSKTNRATEKGYWKTTGKDREIRNGSRVVGMKKTLVYHKGRAPRGERTNWV 130

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLVHS 197
           MHEY L +++LK+    ++ Y L ++++KSG GPKNGEQYGAP+ EE W +     +   
Sbjct: 131 MHEYRLSDEDLKKAGVPQEAYVLCRIFQKSGTGPKNGEQYGAPYLEEEWEEDGMTYVPAQ 190

Query: 198 DAQEPQVNILDEV 211
           DA    + + D+V
Sbjct: 191 DAFSEGLALNDDV 203

BLAST of Cp4.1LG01g21380 vs. Swiss-Prot
Match: NAC53_ARATH (NAC domain-containing protein 53 OS=Arabidopsis thaliana GN=NAC053 PE=2 SV=1)

HSP 1 Score: 213.4 bits (542), Expect = 6.4e-54
Identity = 98/178 (55.06%), Positives = 129/178 (72.47%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV YYLK K+C +  K D I   DVYK EP +LP +S+LK+ D +W+FF
Sbjct: 11  PGFRFHPTDEELVRYYLKRKICNKPFKFDAISVTDVYKSEPWDLPDKSRLKSRDLEWYFF 70

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           S  +++Y N S+ +RAT  G+WK TGKDR I   S+ VG+KKTLV+++GRAP GERT+WV
Sbjct: 71  SMLDKKYRNGSKTNRATEMGYWKTTGKDREILNGSKVVGMKKTLVYHKGRAPRGERTNWV 130

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLV 196
           MHEY L + +L +    +D + L ++++KSG GPKNGEQYGAPF EE W + +    V
Sbjct: 131 MHEYRLVDQDLDKTGVHQDAFVLCRIFQKSGSGPKNGEQYGAPFVEEEWEEEDDMTFV 188

BLAST of Cp4.1LG01g21380 vs. TrEMBL
Match: A0A0A0K9G3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G052680 PE=4 SV=1)

HSP 1 Score: 817.0 bits (2109), Expect = 1.4e-233
Identity = 428/579 (73.92%), Positives = 469/579 (81.00%), Query Frame = 1

Query: 1   MSTVSSSMGLCDEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGE 60
           MSTVSSS GLCDEGDWPPGFRFHPTDEEL+LYYLKFK+C RKLKLDIIRE DVYKWEP E
Sbjct: 1   MSTVSSSRGLCDEGDWPPGFRFHPTDEELILYYLKFKICGRKLKLDIIRETDVYKWEPDE 60

Query: 61  LPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKT 120
           LPGQSKLKTGDRQWFFFSPRE RYPNASRLSRATR G+WK TGKDRII+ NSRNVGVKKT
Sbjct: 61  LPGQSKLKTGDRQWFFFSPREHRYPNASRLSRATRYGYWKATGKDRIIQCNSRNVGVKKT 120

Query: 121 LVFYQGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180
           LVFY GRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP
Sbjct: 121 LVFYLGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180

Query: 181 FKEENWNDGECQVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDP 240
           F+EE+W D        +   EPQV I+DEV  V  ER N QIQ+SS+DIEE ++Q  NDP
Sbjct: 181 FREEDWVDD-------AGCLEPQVKIVDEVDPVVCERDNGQIQISSEDIEEFMKQMVNDP 240

Query: 241 VLELPSVSGIHQFDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQ 300
           VLELP V+G HQ  S +QVDDKEETASTMID Y+  HILP+ DKV + S QPSDL+  F 
Sbjct: 241 VLELPLVNGYHQLGSALQVDDKEETASTMIDDYTHEHILPQLDKVCHFSVQPSDLNASFD 300

Query: 301 FTQAGLFQLQSFESEVSSSPKYREEE-DFLEIDDLIGPEPTLVANVNPLGN---SEFDGL 360
           FTQ+G+ QLQ FE+EVSS+PK  EEE DFLEI+DL+G EPT VANVNPLGN   SE DGL
Sbjct: 301 FTQSGISQLQPFEAEVSSAPKDCEEEGDFLEINDLVGSEPTPVANVNPLGNIPPSELDGL 360

Query: 361 NELELFHDANMFLRDLGPVVPETFLDPYLSADGGILVANDVNGHLQYD-----QMGNGFW 420
           +EL+LFHDANMFLRDLGP+ PET LDPYL+A   + VA++ NG+ QYD     Q  N FW
Sbjct: 361 SELDLFHDANMFLRDLGPIAPETVLDPYLNALD-VDVADNSNGNWQYDPYQQIQTDNVFW 420

Query: 421 EN-ETENSFSF----------PEPHQQFGTQPNLGVGYESVSSAAPEIRDNQIANGGGSS 480
           +N ETEN+FS           PE H QF TQ NLGVGYESVSS A   R+ Q AN GG S
Sbjct: 421 KNNETENAFSIQSNGHSFNQIPESHGQFVTQSNLGVGYESVSSTAAGTREIQSANDGGGS 480

Query: 481 ASKFSSNLWAFVESIPTTPASASENVNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSA 540
            S FSSNLWAFVESIPTTPASASENVNR F+RMSSFSRLRLNTLNT N NVA+ NP+T A
Sbjct: 481 TSWFSSNLWAFVESIPTTPASASENVNRAFERMSSFSRLRLNTLNTLNTNVAVRNPETGA 540

Query: 541 -RRTGTNKGFFLFSILGVLCAILWVLLGDVRLQERGIAS 559
            RRTG NKGFFLFSILGVLCAILWVL+G+VRL    I+S
Sbjct: 541 RRRTGMNKGFFLFSILGVLCAILWVLIGNVRLSGNFISS 571

BLAST of Cp4.1LG01g21380 vs. TrEMBL
Match: M5W6F5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003410mg PE=4 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 2.1e-144
Identity = 289/563 (51.33%), Positives = 374/563 (66.43%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV+YYLK K+C+++LKL++I E DVYKW+P ELPG S LKTGDRQWFFF
Sbjct: 19  PGFRFHPTDEELVVYYLKRKICKKRLKLNVIAETDVYKWDPEELPGLSLLKTGDRQWFFF 78

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           SPR+R+YPN  R +RATR+G+WK TGKDR I   SR+VG+KKTLV+Y+GRAP+GERTDWV
Sbjct: 79  SPRDRKYPNGGRSNRATRHGYWKATGKDRNITCYSRSVGLKKTLVYYKGRAPSGERTDWV 138

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLVHS 197
           MHEYTLDE+ELKRC+NV++YYALYK+YKKSGPGPKNGEQYGAPF+EE W D E  V+  S
Sbjct: 139 MHEYTLDEEELKRCRNVQEYYALYKVYKKSGPGPKNGEQYGAPFREEEWADDELPVINSS 198

Query: 198 DAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELPSVSGIHQFDSVV 257
             ++  V    +V SVD  + N ++  +  DIEE ++Q  ++ VLELP ++G     ++ 
Sbjct: 199 ADRQIPVKQSVDVISVDPVKVNGEVHSALSDIEEFMKQIVDEAVLELPQMNGYAY--TIP 258

Query: 258 QVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQLQSFE-SEV 317
           Q   +EET ST++D YS+  + PE   V N S    +L   F FT++   Q+Q +E SEV
Sbjct: 259 QAVSEEETQSTVVDLYSREVVCPEPYTVFNPSDHQCNLQASFDFTESDTSQIQRYEASEV 318

Query: 318 SSS--------PKYREEEDFLEIDDLIGPEPTLVANVNPLGNSEF---DGLNELELFHDA 377
           ++S        P    EEDFLE+DDL+GPEPT+    NP+ N +F   DGL+E +L+HDA
Sbjct: 319 TTSAPEIHEQGPPILREEDFLEMDDLLGPEPTISNIENPVDNLQFEGIDGLSEFDLYHDA 378

Query: 378 NMFLRDLGPVVPETFL-DPYLSADGGILV-------------ANDVNGHL--QYDQMGNG 437
            MF  D+GP    T     Y+++ G  +V              N VN  L  +  QM N 
Sbjct: 379 AMFFHDMGPFDQGTVSHQQYMNSLGNNIVDQFEYQLQPNPPAVNQVNHQLNPESTQMNNQ 438

Query: 438 FWENETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEIRDNQIANGGGSSASKFSSNLW 497
            W   TE +    EP+Q+F +    GV YE  S+ A +   NQ  N      S+FSS LW
Sbjct: 439 LW-THTERA----EPNQEFVSYSTSGVVYEP-SNFASQANQNQSGNEAAGGPSQFSSALW 498

Query: 498 AFVESIPTTPASASEN--VNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSARRTGTNK 551
           AFVESIPTTPASASEN  VNR F+RMSSFSRLR+N++   +ANV  G+  + A+R G  +
Sbjct: 499 AFVESIPTTPASASENALVNRAFERMSSFSRLRINSV---SANVTAGS-SSEAKRAGRRR 558

BLAST of Cp4.1LG01g21380 vs. TrEMBL
Match: U5GS92_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s06210g PE=4 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 1.0e-143
Identity = 302/604 (50.00%), Positives = 378/604 (62.58%), Query Frame = 1

Query: 12  DEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGD 71
           D+ +WPPGFRFHPTDEELV+YYLK K+C+++LKL+IIRE+DVYKW+P ELPGQS LKTGD
Sbjct: 12  DDKEWPPGFRFHPTDEELVVYYLKRKICKKRLKLNIIREVDVYKWDPEELPGQSILKTGD 71

Query: 72  RQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNG 131
           RQWFFFSPR+R+YPN +R +RATR G+WK TGKDRI+  NSRNVGVKKTLVFY+GRAPNG
Sbjct: 72  RQWFFFSPRDRKYPNGARTNRATRQGYWKATGKDRIVVCNSRNVGVKKTLVFYRGRAPNG 131

Query: 132 ERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGEC 191
           +RTDWVMHEY+LDE+ELKRC NV+DYYALYK+YKKSG GPKNGE YGAPFKEE+W D E 
Sbjct: 132 DRTDWVMHEYSLDEEELKRCSNVQDYYALYKVYKKSGAGPKNGEHYGAPFKEEDWADDEF 191

Query: 192 QVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPV-LELPSVSGI 251
           Q +      +  V   +EVT VD    + Q++   +D EE+++Q   +P   EL +    
Sbjct: 192 QCVNGMFTPDIPVKKHNEVTLVDNFIQSAQLEPPLNDFEEIIKQIGEEPAHNELQNNDFT 251

Query: 252 HQFDSV-VQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQL 311
           +    V V V  +EE  ST++D   +  +   A + L  SGQ  + H  F F Q+G   L
Sbjct: 252 YLLPQVCVMVTGEEEAQSTLVDPSFREFVCEPAGE-LTTSGQHCNKHTSFNFDQSGTATL 311

Query: 312 QSFES-EVSSSPKYRE-----EEDFLEIDDLIGPEPTLVANVNPLGN---SEFDGLNELE 371
           Q  E+ EV+S   Y +     EEDFLEI+DLI PEP+      P+ N    +FDGL+E +
Sbjct: 312 QLHEAPEVTSGTNYEQAPQLNEEDFLEINDLIDPEPSFSNTEQPVENLQFDDFDGLSEFD 371

Query: 372 LFHDANMFLRDLGPV----VPETFLDPY-----------LSADGGI-----------LVA 431
           L+HDA MFLRD+GPV    V  +++ PY           L  D  I           LVA
Sbjct: 372 LYHDAAMFLRDMGPVDQEAVSHSYMHPYGCDMVNQVGYQLQPDSIINAVDYQLQQSNLVA 431

Query: 432 NDVNGHLQ-----YDQMGNGFW-ENETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEI 491
           N V+  LQ      +QM N  W   +  N  +  E H     QP  GV  ES S+ +   
Sbjct: 432 NQVDCDLQPQFFDAEQMNNQLWVHGQRSNMLAASESHNGNLFQPTPGVVCES-SNNSTRT 491

Query: 492 RDNQIANGGGSSASKFSSNLWAFVESIPTTPASASEN--VNRTFQRMSSFSRLRLNT--- 551
             NQ    G ++   FSS LW FVESIPTTPASASEN  VN+ F+RMSSFSR+++N    
Sbjct: 492 NGNQGGKEGDAADGWFSSALWGFVESIPTTPASASENPLVNKAFERMSSFSRIKMNVKSI 551

Query: 552 ---------LNTFNANVAIGNPKTSARRTGTNKGFFLFSILGVLCAILWVLLGDVRLQER 559
                    +N  + NV   N   S R    NKGF L SI+GVLCAILWV +G  RL  R
Sbjct: 552 NVDAASRIRMNVNSINVVAANGAASVRSASRNKGFVLLSIVGVLCAILWVFVGSGRLLGR 611

BLAST of Cp4.1LG01g21380 vs. TrEMBL
Match: B9GT49_POPTR (No apical meristem family protein OS=Populus trichocarpa GN=POPTR_0002s06210g PE=4 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 3.0e-143
Identity = 302/604 (50.00%), Positives = 380/604 (62.91%), Query Frame = 1

Query: 12  DEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGD 71
           D+ +WPPGFRFHPTDEELV+YYLK K+C+++LKL+IIRE+DVYKW+P ELPGQS LKTGD
Sbjct: 12  DDKEWPPGFRFHPTDEELVVYYLKRKICKKRLKLNIIREVDVYKWDPEELPGQSILKTGD 71

Query: 72  RQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNG 131
           RQWFFFSPR+R+YPN +R +RATR G+WK TGKDRI+  NSRNVGVKKTLVFY+GRAPNG
Sbjct: 72  RQWFFFSPRDRKYPNGARTNRATRQGYWKATGKDRIVVCNSRNVGVKKTLVFYRGRAPNG 131

Query: 132 ERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGEC 191
           +RTDWVMHEY+LDE+ELKRC NV+DYYALYK+YKKSG GPKNGE YGAPFKEE+W D E 
Sbjct: 132 DRTDWVMHEYSLDEEELKRCSNVQDYYALYKVYKKSGAGPKNGEHYGAPFKEEDWADDEF 191

Query: 192 QVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPV-LELPSVSGI 251
           Q +      +  V   +EVT VD    + Q++   +D EE+++Q   +P   EL +    
Sbjct: 192 QCVNGMFTPDIPVKKHNEVTLVDNFIQSAQLEPPLNDFEEIIKQIGEEPAHNELQN---- 251

Query: 252 HQFDSVV-QVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQL 311
           + F  ++ QV  +EE  ST++D   +  +   A + L  SGQ  + H  F F Q+G   L
Sbjct: 252 NDFTYLLPQVTGEEEAQSTLVDPSFREFVCEPAGE-LTTSGQHCNKHTSFNFDQSGTATL 311

Query: 312 QSFES-EVSSSPKYRE-----EEDFLEIDDLIGPEPTLVANVNPLGN---SEFDGLNELE 371
           Q  E+ EV+S   Y +     EEDFLEI+DLI PEP+      P+ N    +FDGL+E +
Sbjct: 312 QLHEAPEVTSGTNYEQAPQLNEEDFLEINDLIDPEPSFSNTEQPVENLQFDDFDGLSEFD 371

Query: 372 LFHDANMFLRDLGPV----VPETFLDPY-----------LSADGGI-----------LVA 431
           L+HDA MFLRD+GPV    V  +++ PY           L  D  I           LVA
Sbjct: 372 LYHDAAMFLRDMGPVDQEAVSHSYMHPYGCDMVNQVGYQLQPDSIINAVDYQLQQSNLVA 431

Query: 432 NDVNGHLQ-----YDQMGNGFW-ENETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEI 491
           N V+  LQ      +QM N  W   +  N  +  E H     QP  GV  ES S+ +   
Sbjct: 432 NQVDCDLQPQFFDAEQMNNQLWVHGQRSNMLAASESHNGNLFQPTPGVVCES-SNNSTRT 491

Query: 492 RDNQIANGGGSSASKFSSNLWAFVESIPTTPASASEN--VNRTFQRMSSFSRLRLNT--- 551
             NQ    G ++   FSS LW FVESIPTTPASASEN  VN+ F+RMSSFSR+++N    
Sbjct: 492 NGNQGGKEGDAADGWFSSALWGFVESIPTTPASASENPLVNKAFERMSSFSRIKMNVKSI 551

Query: 552 ---------LNTFNANVAIGNPKTSARRTGTNKGFFLFSILGVLCAILWVLLGDVRLQER 559
                    +N  + NV   N   S R    NKGF L SI+GVLCAILWV +G  RL  R
Sbjct: 552 NVDAASRIRMNVNSINVVAANGAASVRSASRNKGFVLLSIVGVLCAILWVFVGSGRLLGR 609

BLAST of Cp4.1LG01g21380 vs. TrEMBL
Match: A0A061FKH8_THECC (NAC domain protein, IPR003441, putative isoform 1 OS=Theobroma cacao GN=TCM_036587 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.6e-139
Identity = 300/604 (49.67%), Positives = 371/604 (61.42%), Query Frame = 1

Query: 3   TVSSSMG---LCDEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPG 62
           TV+++ G   L D+  WPPGFRFHPTDEELVLYYLK K+CRRKLKLDIIRE DVYKW+P 
Sbjct: 2   TVTAAAGDSCLGDDQVWPPGFRFHPTDEELVLYYLKRKICRRKLKLDIIRETDVYKWDPE 61

Query: 63  ELPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKK 122
           ELP QS LK+GDRQWFFFSPR+R+YPN +R +RATR G+WK TGKDR I  NSR VGVKK
Sbjct: 62  ELPAQSILKSGDRQWFFFSPRDRKYPNGARSNRATRQGYWKATGKDRTITCNSRVVGVKK 121

Query: 123 TLVFYQGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGA 182
           TLVFY GRAPNG R+DWVMHEYTLDE+ELKRC+N+KDYYALYK+YKKSGPGPKNGEQYGA
Sbjct: 122 TLVFYGGRAPNGVRSDWVMHEYTLDEEELKRCQNMKDYYALYKVYKKSGPGPKNGEQYGA 181

Query: 183 PFKEENWNDGECQVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTND 242
           PFKEE+W D E    V +      V + +E    D   ANVQ+Q + ++IEE ++Q  ++
Sbjct: 182 PFKEEDWVDEE---YVSNPITVTPVKLPNEAIPDDNVNANVQVQSALNEIEEFMRQLADE 241

Query: 243 PVLELPSVSGIHQFDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCF 302
           P L  P     H    VV    +EET ST++D   +  I  E   V+            F
Sbjct: 242 PALPQPQAQPGHALPQVVS---EEETQSTLLDPSPRGVIFHEPIGVVLEQAS-------F 301

Query: 303 QFTQAGLFQLQSFESEVSSSPKYRE-----EEDFLEIDDLIGPEPTLVANVN-PLGNSEF 362
           +F+Q+   QL       S +  + +     EE FLEIDDLIGPE TL +NV  P  N +F
Sbjct: 302 EFSQSPTSQLHEAPEVTSVADHFEQVPQICEEGFLEIDDLIGPE-TLTSNVGKPAENVQF 361

Query: 363 ---DGLNELELFHDANMFLRDLGPV----VPETFLD------------------PYLSAD 422
              DGL+E +LFHDA MFL+D+GP+    VP ++ D                  P L+A 
Sbjct: 362 NELDGLSEFDLFHDAAMFLQDMGPIDQGAVPFSYTDNMINQPQLNAFGANQQLQPQLNAF 421

Query: 423 GGIL------------VANDVNGHLQYDQMGNGFWENETENSFSFPEPHQQFGTQPNLGV 482
           G  +            V ++++  +Q DQ+    W ++  +    P         P  G+
Sbjct: 422 GDNMLNQVDYQLQFQSVGDELDQQIQLDQIHEPLWTHDQSSDVFAPSGSNLGNAAPTSGL 481

Query: 483 GYESVSSAAPEIRDNQIANGGGSSASKFSSNLWAFVESIPTTPASASEN--VNRTFQRMS 542
            Y   +      +D    NGGG  AS FSS LW+FVESIPTTPASASE   VNR  +RMS
Sbjct: 482 IYNGNN------QDQGDKNGGG--ASMFSSALWSFVESIPTTPASASETPLVNRALERMS 541

Query: 543 SFSRLRLNTLNTFNANVAIGNPKTSARRTGTNKGFFLFSILGVLCAILWVLLGDVRLQER 559
           SFSRLRLN  NT    V+  +   +ARR G N+G F  SILG LCAILW   G VR+  R
Sbjct: 542 SFSRLRLNARNTA---VSAVDGAATARRIGGNRGIFFISILGALCAILWFFTGTVRILGR 580

BLAST of Cp4.1LG01g21380 vs. TAIR10
Match: AT1G34190.1 (AT1G34190.1 NAC domain containing protein 17)

HSP 1 Score: 406.8 bits (1044), Expect = 2.2e-113
Identity = 258/575 (44.87%), Positives = 343/575 (59.65%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV+YYLK K+CR++L++++I  +DVYK +P ELPGQS LKTGDRQWF+F
Sbjct: 18  PGFRFHPTDEELVMYYLKRKICRKRLRVNVIGVVDVYKMDPEELPGQSMLKTGDRQWFYF 77

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           +PR R+YPNA+R +R T NG+WK TGKDR+I+YNSR+VG+KKTLVFY+GRAP+GERTDWV
Sbjct: 78  TPRSRKYPNAARSNRGTENGYWKATGKDRVIEYNSRSVGLKKTLVFYRGRAPSGERTDWV 137

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENW--NDGECQVLV 197
           MHEYT+DEDEL RCKN ++YYALYKL+KKSG GPKNGEQYGAPF+EE W  +D E    +
Sbjct: 138 MHEYTMDEDELGRCKNPQEYYALYKLFKKSGAGPKNGEQYGAPFQEEEWVDDDNEDVNAI 197

Query: 198 HSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELPSVSGIHQFDS 257
                E  V   ++   VD  R    + L  +DI+ELL    N P        G+ Q   
Sbjct: 198 AVAVPEQPVVRYEDARRVDERRLFNPVILQLEDIDELLNGIPNAP--------GVPQ-RC 257

Query: 258 VVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQLQSFE-S 317
           + QV+ +EE  ST+++  S    LP        +GQ        Q+ +   F   S E +
Sbjct: 258 IPQVNSEEELQSTLVNN-SAREFLP--------NGQ--------QYNRPSSF--DSLETA 317

Query: 318 EVSSSPKYREEEDFLEIDDLI-----GPEPTLVANVNPLGNSEFDGLNEL-ELFHDANMF 377
           EV+S+P   E+EDF+E+DDL+     G   T  A      + EFD  NE  +LFHD +M 
Sbjct: 318 EVTSAPLVFEKEDFIEMDDLLLIPEFGASSTEKA-AQFSNHGEFDDFNEFDQLFHDVSMS 377

Query: 378 LRDLGPVVPETFLDPYLSADGGILVANDVNGHLQYDQMGNGFWENETEN----------- 437
           L D+ P+   T  +   S        +D    L Y Q  +   EN+  N           
Sbjct: 378 L-DMEPIDQGTSAN-LSSLSDSANYTSDQKQQLLYQQFQDQTPENQLNNIMDPSTTLNQI 437

Query: 438 ----------SFSFPEPHQQFG--TQPNLGVGYESVS-SAAPEIRDNQIANGGGSSASKF 497
                     +  F +     G    P+ GV  +S + + +   + ++I NGGG++ S+F
Sbjct: 438 TSDIWFEDDQAILFDQQQSFSGAFASPSSGVMPDSTNPTMSVNAQGHEIQNGGGTT-SQF 497

Query: 498 SSNLWAFVESIPTTPASASEN-VNRTFQRMSSFSRLRLNTLNTFNANVAIGNP--KTSAR 557
           SS LWA ++SIP+TPASA E  +NRTF RMSSFSR+R N         A G P   T A+
Sbjct: 498 SSALWALMDSIPSTPASACEGPLNRTFVRMSSFSRMRFN-------GKANGTPVSTTIAK 553

BLAST of Cp4.1LG01g21380 vs. TAIR10
Match: AT1G34180.2 (AT1G34180.2 NAC domain containing protein 16)

HSP 1 Score: 389.8 bits (1000), Expect = 2.8e-108
Identity = 245/585 (41.88%), Positives = 335/585 (57.26%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQ------------S 77
           PGFRFHPTDEELV+YYLK K+C +KL+++ I  +DVYK +P ELPG             S
Sbjct: 18  PGFRFHPTDEELVVYYLKRKICCKKLRVNAIGVVDVYKVDPSELPGNFQHLLIDFDSCLS 77

Query: 78  KLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQ 137
            LKTGDRQWFFF+PR R+YPNA+R SR T  G+WK TGKDR+I+YNSR+VG+KKTLVFY+
Sbjct: 78  MLKTGDRQWFFFTPRNRKYPNAARSSRGTATGYWKATGKDRVIEYNSRSVGLKKTLVFYR 137

Query: 138 GRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEEN 197
           GRAPNGERTDWVMHEYT+DE+EL RCKN K+YYALYKLYKKSG GPKNGEQYGAPF+EE 
Sbjct: 138 GRAPNGERTDWVMHEYTMDEEELGRCKNAKEYYALYKLYKKSGAGPKNGEQYGAPFQEEE 197

Query: 198 WNDGECQVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELP 257
           W D + +        +  V   +    VD  +    ++L  +DIE+LL         E+P
Sbjct: 198 WVDSDSEDADSVAVPDYPVVRYENGPCVDDTKFCNPVKLQLEDIEKLLN--------EIP 257

Query: 258 SVSGIHQ-----FDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQ 317
              G++Q     F  V Q +  E   ST+++  S  +I P  + +   +GQ  +    FQ
Sbjct: 258 DAPGVNQRQFDEFVGVPQGNSAEVIQSTLLNNSSGEYIDPRTNGMFLPNGQLYNRDSSFQ 317

Query: 318 FTQAGLFQLQSFESEVSSSPKY-REEEDFLEIDDLIGPE---PTLVANVNPLGNSEFDGL 377
                   L SFE+    +P    E+E+++E++DL+ PE    +   +   L + EF  +
Sbjct: 318 ------SHLNSFEATSGMAPLLDNEKEEYIEMNDLLIPELGASSTEKSTEFLNHGEFGDV 377

Query: 378 NEL-ELFHDANMFLRDLGPVVPETFLDPYLSADGG------------ILVANDVNGHLQ- 437
           NE  +LF+D ++F    G     + L  + +   G                N +N ++  
Sbjct: 378 NEYDQLFNDISVF---QGTSTDLSCLSNFTNNTSGQRQQLLYEQFQYQTPENQLNNYMHP 437

Query: 438 ---YDQMGNGFWENETENSFSFPEPHQQFG--TQPNLGVGYESVS---SAAPEIRDNQIA 497
               +Q  +  W  + + +     P    G  T  + GV  ES++   S  P+ ++ Q  
Sbjct: 438 STTLNQFTDNMWFKDDQAALYVQPPQSSSGAFTSQSTGVMPESMNPTMSVNPQYKEGQ-- 497

Query: 498 NGGGSSASKFSSNLWAFVESIPTTPASASEN-VNRTFQRMSSFSRLRLNTLNTFNANVAI 557
           NGGG + S+FSS LW  +ESIP+TPASA E  +N+TF RMSSFSR+R      FN     
Sbjct: 498 NGGG-TRSQFSSALWELLESIPSTPASACEGPLNQTFVRMSSFSRIR------FNGTSVT 557

Query: 558 GNPKTSARRTGTNKGFFLFSILGVLCAILWVLLGDVRLQERGIAS 559
               T A++  +N+GF L SI+G LCAI WV    V +  R + S
Sbjct: 558 SRKVTVAKKRISNRGFLLLSIMGALCAIFWVFKATVGVMGRPLLS 576

BLAST of Cp4.1LG01g21380 vs. TAIR10
Match: AT1G32870.1 (AT1G32870.1 NAC domain protein 13)

HSP 1 Score: 260.0 bits (663), Expect = 3.4e-69
Identity = 141/260 (54.23%), Positives = 179/260 (68.85%), Query Frame = 1

Query: 12  DEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGD 71
           + G   PGFRFHPTDEELV+YYLK K+ R+KL+++ I E DVYK++P ELP ++  KT D
Sbjct: 6   ENGGLAPGFRFHPTDEELVVYYLKRKIRRKKLRVEAIGETDVYKFDPEELPEKALYKTRD 65

Query: 72  RQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNG 131
           RQWFFFS R+R++   SR SRAT  G+WK TGKDR+I  +SR VG KKTLVF++GRAPNG
Sbjct: 66  RQWFFFSLRDRKH--GSRSSRATERGYWKATGKDRVIHCDSRPVGEKKTLVFHRGRAPNG 125

Query: 132 ERTDWVMHEYTLDEDELKRC--KNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDG 191
           ERT+WVMHEYTL ++ELKRC  ++VKD Y LYK+YKKSG GPKNGEQYGAPF EE W + 
Sbjct: 126 ERTNWVMHEYTLHKEELKRCGGEDVKDAYVLYKIYKKSGSGPKNGEQYGAPFIEEEWAED 185

Query: 192 ECQVLVHSDAQEPQVNILDEVTSVDRE---RANVQIQLSSDDIEELLQQFTND--PVLEL 251
           +       D  EP  N L    SVD     +   Q +L  +DIEEL+ Q  +   P L+ 
Sbjct: 186 D-----DDDVDEP-ANQLVVSASVDNSLWGKGLNQSELDDNDIEELMSQVRDQSGPTLQQ 245

Query: 252 PSVSGIHQFDSVVQVDDKEE 265
             VSG++       +++ EE
Sbjct: 246 NGVSGLNSHVDTYNLENLEE 257

BLAST of Cp4.1LG01g21380 vs. TAIR10
Match: AT5G04410.1 (AT5G04410.1 NAC domain containing protein 2)

HSP 1 Score: 222.6 bits (566), Expect = 5.9e-58
Identity = 103/193 (53.37%), Positives = 138/193 (71.50%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV YYLK KVC +  K D I   D+YK EP +LP +SKLK+ D +W+FF
Sbjct: 11  PGFRFHPTDEELVRYYLKRKVCNKPFKFDAISVTDIYKSEPWDLPDKSKLKSRDLEWYFF 70

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           S  +++Y N S+ +RAT  G+WK TGKDR I+  SR VG+KKTLV+++GRAP GERT+WV
Sbjct: 71  SMLDKKYSNGSKTNRATEKGYWKTTGKDREIRNGSRVVGMKKTLVYHKGRAPRGERTNWV 130

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLVHS 197
           MHEY L +++LK+    ++ Y L ++++KSG GPKNGEQYGAP+ EE W +     +   
Sbjct: 131 MHEYRLSDEDLKKAGVPQEAYVLCRIFQKSGTGPKNGEQYGAPYLEEEWEEDGMTYVPAQ 190

Query: 198 DAQEPQVNILDEV 211
           DA    + + D+V
Sbjct: 191 DAFSEGLALNDDV 203

BLAST of Cp4.1LG01g21380 vs. TAIR10
Match: AT3G10500.1 (AT3G10500.1 NAC domain containing protein 53)

HSP 1 Score: 213.4 bits (542), Expect = 3.6e-55
Identity = 98/178 (55.06%), Positives = 129/178 (72.47%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV YYLK K+C +  K D I   DVYK EP +LP +S+LK+ D +W+FF
Sbjct: 11  PGFRFHPTDEELVRYYLKRKICNKPFKFDAISVTDVYKSEPWDLPDKSRLKSRDLEWYFF 70

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           S  +++Y N S+ +RAT  G+WK TGKDR I   S+ VG+KKTLV+++GRAP GERT+WV
Sbjct: 71  SMLDKKYRNGSKTNRATEMGYWKTTGKDREILNGSKVVGMKKTLVYHKGRAPRGERTNWV 130

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLV 196
           MHEY L + +L +    +D + L ++++KSG GPKNGEQYGAPF EE W + +    V
Sbjct: 131 MHEYRLVDQDLDKTGVHQDAFVLCRIFQKSGSGPKNGEQYGAPFVEEEWEEEDDMTFV 188

BLAST of Cp4.1LG01g21380 vs. NCBI nr
Match: gi|659113672|ref|XP_008456695.1| (PREDICTED: uncharacterized protein LOC103496564 [Cucumis melo])

HSP 1 Score: 841.6 bits (2173), Expect = 7.7e-241
Identity = 437/569 (76.80%), Positives = 477/569 (83.83%), Query Frame = 1

Query: 1   MSTVSSSMGLCDEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGE 60
           MST SSS GLCDEGDWPPGFRFHPTDEEL+LYYLKFK+CRRKLKLDIIRE DVYKWEP E
Sbjct: 1   MSTASSSKGLCDEGDWPPGFRFHPTDEELILYYLKFKICRRKLKLDIIRETDVYKWEPDE 60

Query: 61  LPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKT 120
           LPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATR+G+WK TGKDRII+ NSRNVGVKKT
Sbjct: 61  LPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRSGYWKATGKDRIIQCNSRNVGVKKT 120

Query: 121 LVFYQGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180
           LVFY GRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP
Sbjct: 121 LVFYLGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180

Query: 181 FKEENWNDGECQVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDP 240
           F+EE+W D        +D  EPQV  +DEV SV RER N QIQLSS+DIEEL++Q  NDP
Sbjct: 181 FREEDWVD-------DADCLEPQVKTVDEVDSVVRERDNGQIQLSSEDIEELMKQIVNDP 240

Query: 241 VLELPSVSGIHQFDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQ 300
           VLELPSVSG HQ  S +QVDDKEETASTMID Y+Q HILP+ADKV + S QPSDLH  F 
Sbjct: 241 VLELPSVSGYHQLGSALQVDDKEETASTMIDAYAQEHILPQADKVYHFSVQPSDLHASFD 300

Query: 301 FTQAGLFQLQSFESEVSSSPKYREEE-DFLEIDDLIGPEPTLVANVNPLGN---SEFDGL 360
           FTQ+G+ QLQSFE+EVSS+ K  EEE DFLEI DLIGPEPT VANVNPLGN   SE DGL
Sbjct: 301 FTQSGISQLQSFEAEVSSALKDCEEEGDFLEIYDLIGPEPTPVANVNPLGNIPPSELDGL 360

Query: 361 NELELFHDANMFLRDLGPVVPETFLDPYLSADGGILVANDVNGHLQY-----DQMGNGFW 420
           +EL+LFHDANMFLRDLGP+  ET LDPYL+A   + VAN++NG+LQ+     +Q  +GFW
Sbjct: 361 SELDLFHDANMFLRDLGPITTETVLDPYLNA-LDVDVANNLNGNLQHVSYQQNQTDDGFW 420

Query: 421 E-NETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEIRDNQIANGGGSSASKFSSNLWA 480
           + NETEN+FSF E H QF TQ  LGVG ESVSS A   R+NQ AN GG S S FSSNLWA
Sbjct: 421 KNNETENAFSFHESHGQFVTQSTLGVGCESVSSTAAGTRENQSANDGGGSTSWFSSNLWA 480

Query: 481 FVESIPTTPASASENVNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSA-RRTGTNKGF 540
           FVESIPTTPASASENVNR F+RMSSFSRLRLNTLNT N NVA+ NP+  A RRTG NKGF
Sbjct: 481 FVESIPTTPASASENVNRAFERMSSFSRLRLNTLNTLNTNVAVRNPERGARRRTGMNKGF 540

Query: 541 FLFSILGVLCAILWVLLGDVRLQERGIAS 559
           FLFSILGVLCAILWVL+G+VRL    I+S
Sbjct: 541 FLFSILGVLCAILWVLIGNVRLSGNCISS 561

BLAST of Cp4.1LG01g21380 vs. NCBI nr
Match: gi|449446361|ref|XP_004140940.1| (PREDICTED: uncharacterized protein LOC101217428 [Cucumis sativus])

HSP 1 Score: 817.0 bits (2109), Expect = 2.0e-233
Identity = 428/579 (73.92%), Positives = 469/579 (81.00%), Query Frame = 1

Query: 1   MSTVSSSMGLCDEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGE 60
           MSTVSSS GLCDEGDWPPGFRFHPTDEEL+LYYLKFK+C RKLKLDIIRE DVYKWEP E
Sbjct: 1   MSTVSSSRGLCDEGDWPPGFRFHPTDEELILYYLKFKICGRKLKLDIIRETDVYKWEPDE 60

Query: 61  LPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKT 120
           LPGQSKLKTGDRQWFFFSPRE RYPNASRLSRATR G+WK TGKDRII+ NSRNVGVKKT
Sbjct: 61  LPGQSKLKTGDRQWFFFSPREHRYPNASRLSRATRYGYWKATGKDRIIQCNSRNVGVKKT 120

Query: 121 LVFYQGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180
           LVFY GRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP
Sbjct: 121 LVFYLGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180

Query: 181 FKEENWNDGECQVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDP 240
           F+EE+W D        +   EPQV I+DEV  V  ER N QIQ+SS+DIEE ++Q  NDP
Sbjct: 181 FREEDWVDD-------AGCLEPQVKIVDEVDPVVCERDNGQIQISSEDIEEFMKQMVNDP 240

Query: 241 VLELPSVSGIHQFDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQ 300
           VLELP V+G HQ  S +QVDDKEETASTMID Y+  HILP+ DKV + S QPSDL+  F 
Sbjct: 241 VLELPLVNGYHQLGSALQVDDKEETASTMIDDYTHEHILPQLDKVCHFSVQPSDLNASFD 300

Query: 301 FTQAGLFQLQSFESEVSSSPKYREEE-DFLEIDDLIGPEPTLVANVNPLGN---SEFDGL 360
           FTQ+G+ QLQ FE+EVSS+PK  EEE DFLEI+DL+G EPT VANVNPLGN   SE DGL
Sbjct: 301 FTQSGISQLQPFEAEVSSAPKDCEEEGDFLEINDLVGSEPTPVANVNPLGNIPPSELDGL 360

Query: 361 NELELFHDANMFLRDLGPVVPETFLDPYLSADGGILVANDVNGHLQYD-----QMGNGFW 420
           +EL+LFHDANMFLRDLGP+ PET LDPYL+A   + VA++ NG+ QYD     Q  N FW
Sbjct: 361 SELDLFHDANMFLRDLGPIAPETVLDPYLNALD-VDVADNSNGNWQYDPYQQIQTDNVFW 420

Query: 421 EN-ETENSFSF----------PEPHQQFGTQPNLGVGYESVSSAAPEIRDNQIANGGGSS 480
           +N ETEN+FS           PE H QF TQ NLGVGYESVSS A   R+ Q AN GG S
Sbjct: 421 KNNETENAFSIQSNGHSFNQIPESHGQFVTQSNLGVGYESVSSTAAGTREIQSANDGGGS 480

Query: 481 ASKFSSNLWAFVESIPTTPASASENVNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSA 540
            S FSSNLWAFVESIPTTPASASENVNR F+RMSSFSRLRLNTLNT N NVA+ NP+T A
Sbjct: 481 TSWFSSNLWAFVESIPTTPASASENVNRAFERMSSFSRLRLNTLNTLNTNVAVRNPETGA 540

Query: 541 -RRTGTNKGFFLFSILGVLCAILWVLLGDVRLQERGIAS 559
            RRTG NKGFFLFSILGVLCAILWVL+G+VRL    I+S
Sbjct: 541 RRRTGMNKGFFLFSILGVLCAILWVLIGNVRLSGNFISS 571

BLAST of Cp4.1LG01g21380 vs. NCBI nr
Match: gi|595796617|ref|XP_007201176.1| (hypothetical protein PRUPE_ppa003410mg [Prunus persica])

HSP 1 Score: 520.8 bits (1340), Expect = 3.0e-144
Identity = 289/563 (51.33%), Positives = 374/563 (66.43%), Query Frame = 1

Query: 18  PGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGDRQWFFF 77
           PGFRFHPTDEELV+YYLK K+C+++LKL++I E DVYKW+P ELPG S LKTGDRQWFFF
Sbjct: 19  PGFRFHPTDEELVVYYLKRKICKKRLKLNVIAETDVYKWDPEELPGLSLLKTGDRQWFFF 78

Query: 78  SPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNGERTDWV 137
           SPR+R+YPN  R +RATR+G+WK TGKDR I   SR+VG+KKTLV+Y+GRAP+GERTDWV
Sbjct: 79  SPRDRKYPNGGRSNRATRHGYWKATGKDRNITCYSRSVGLKKTLVYYKGRAPSGERTDWV 138

Query: 138 MHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGECQVLVHS 197
           MHEYTLDE+ELKRC+NV++YYALYK+YKKSGPGPKNGEQYGAPF+EE W D E  V+  S
Sbjct: 139 MHEYTLDEEELKRCRNVQEYYALYKVYKKSGPGPKNGEQYGAPFREEEWADDELPVINSS 198

Query: 198 DAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPVLELPSVSGIHQFDSVV 257
             ++  V    +V SVD  + N ++  +  DIEE ++Q  ++ VLELP ++G     ++ 
Sbjct: 199 ADRQIPVKQSVDVISVDPVKVNGEVHSALSDIEEFMKQIVDEAVLELPQMNGYAY--TIP 258

Query: 258 QVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQLQSFE-SEV 317
           Q   +EET ST++D YS+  + PE   V N S    +L   F FT++   Q+Q +E SEV
Sbjct: 259 QAVSEEETQSTVVDLYSREVVCPEPYTVFNPSDHQCNLQASFDFTESDTSQIQRYEASEV 318

Query: 318 SSS--------PKYREEEDFLEIDDLIGPEPTLVANVNPLGNSEF---DGLNELELFHDA 377
           ++S        P    EEDFLE+DDL+GPEPT+    NP+ N +F   DGL+E +L+HDA
Sbjct: 319 TTSAPEIHEQGPPILREEDFLEMDDLLGPEPTISNIENPVDNLQFEGIDGLSEFDLYHDA 378

Query: 378 NMFLRDLGPVVPETFL-DPYLSADGGILV-------------ANDVNGHL--QYDQMGNG 437
            MF  D+GP    T     Y+++ G  +V              N VN  L  +  QM N 
Sbjct: 379 AMFFHDMGPFDQGTVSHQQYMNSLGNNIVDQFEYQLQPNPPAVNQVNHQLNPESTQMNNQ 438

Query: 438 FWENETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEIRDNQIANGGGSSASKFSSNLW 497
            W   TE +    EP+Q+F +    GV YE  S+ A +   NQ  N      S+FSS LW
Sbjct: 439 LW-THTERA----EPNQEFVSYSTSGVVYEP-SNFASQANQNQSGNEAAGGPSQFSSALW 498

Query: 498 AFVESIPTTPASASEN--VNRTFQRMSSFSRLRLNTLNTFNANVAIGNPKTSARRTGTNK 551
           AFVESIPTTPASASEN  VNR F+RMSSFSRLR+N++   +ANV  G+  + A+R G  +
Sbjct: 499 AFVESIPTTPASASENALVNRAFERMSSFSRLRINSV---SANVTAGS-SSEAKRAGRRR 558

BLAST of Cp4.1LG01g21380 vs. NCBI nr
Match: gi|743840791|ref|XP_011026293.1| (PREDICTED: NAC domain-containing protein 74-like isoform X2 [Populus euphratica])

HSP 1 Score: 518.5 bits (1334), Expect = 1.5e-143
Identity = 304/614 (49.51%), Positives = 379/614 (61.73%), Query Frame = 1

Query: 1   MSTVSSSMGLCDEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGE 60
           M+  + S    D+ +WPPGFRFHPTDEELV+YYLK K+C+++LKL+IIRE+DVYKW+P E
Sbjct: 1   MTVTTDSCFGADDKEWPPGFRFHPTDEELVVYYLKRKICKKRLKLNIIREVDVYKWDPEE 60

Query: 61  LPGQSKLKTGDRQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKT 120
           LPGQS LKTGDRQWFFFSPR+R+YPN +R +RATR G+WK TGKDRI+  NSRNVGVKKT
Sbjct: 61  LPGQSILKTGDRQWFFFSPRDRKYPNGARSNRATRQGYWKATGKDRIVVSNSRNVGVKKT 120

Query: 121 LVFYQGRAPNGERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAP 180
           LVFY+GRAPNGERTDWVMHEY+LDE+ELKRC NV+DYYALYK+YKKSG GPKNGE YGAP
Sbjct: 121 LVFYRGRAPNGERTDWVMHEYSLDEEELKRCSNVQDYYALYKVYKKSGAGPKNGEHYGAP 180

Query: 181 FKEENWNDGECQVLVHSDAQE-PQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTND 240
           FKEE+W D E Q +      E P     +EVT VD    + Q +L  +D EE+++    +
Sbjct: 181 FKEEDWADDESQCVNGMFTPEIPVKQQHNEVTLVDNFIQSAQPELPLNDFEEIIKPIGEE 240

Query: 241 PVLELPSVSGIHQFDSVVQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCF 300
           P       +G      + QV  +EE  ST++D   +  +   A + L  SGQ  + H  F
Sbjct: 241 PAHNELQNNGFTNL--LPQVTGEEEAQSTLVDPSFREFVCEPAGE-LTTSGQHCNKHTSF 300

Query: 301 QFTQAGLFQLQSFES-EVSSSPKYRE-----EEDFLEIDDLIGPEPTLVANVNPLGN--- 360
            F Q+G   LQ  E+ EV+S   Y++     EEDFLEI+DLI PEP+      PL N   
Sbjct: 301 NFDQSGTSTLQLREAPEVTSGTNYKQAPQLNEEDFLEINDLIDPEPSFSNTEQPLENLQF 360

Query: 361 SEFDGLNELELFHDANMFLRDLGPV----VPETFLDPY-----------LSADGGI---- 420
            +FDGL+E +L+HDA MFLRD+GPV    V  +++ PY           L  D  I    
Sbjct: 361 GDFDGLSEFDLYHDAAMFLRDMGPVDQEAVSHSYMHPYGRDMVNQVGCQLQPDSIINVVD 420

Query: 421 -------LVANDVNGHLQ-----YDQMGNGFW-ENETENSFSFPEPHQQFGTQPNLGVGY 480
                  LVAN V+  LQ      +QM N  W   +  N  +  E H     QP  GV  
Sbjct: 421 YQLQQSNLVANQVDCDLQPQFFDAEQMNNQLWVHGQRSNMLAASESHNGNLFQPTPGVVC 480

Query: 481 ESVSSAAPEIRDNQIANGGGSSASKFSSNLWAFVESIPTTPASASEN--VNRTFQRMSSF 540
           ES S+ +    +NQ    G ++   FSS LW FVESIPT PASASEN  VN+ F+RMSS 
Sbjct: 481 ES-SNNSTRTNENQGGKEGDAADGWFSSALWGFVESIPTNPASASENPLVNKAFERMSSL 540

Query: 541 SRLRLNT------------LNTFNANVAIGNPKTSARRTGTNKGFFLFSILGVLCAILWV 559
           SR+R+N             +N  + NV   N   S R    NKGF L SI+GVLCAILWV
Sbjct: 541 SRIRMNVKSINVDAASRIRMNVNSINVVAANGAASVRSASRNKGFVLLSIVGVLCAILWV 600

BLAST of Cp4.1LG01g21380 vs. NCBI nr
Match: gi|566156516|ref|XP_006386290.1| (hypothetical protein POPTR_0002s06210g [Populus trichocarpa])

HSP 1 Score: 518.5 bits (1334), Expect = 1.5e-143
Identity = 302/604 (50.00%), Positives = 378/604 (62.58%), Query Frame = 1

Query: 12  DEGDWPPGFRFHPTDEELVLYYLKFKVCRRKLKLDIIREIDVYKWEPGELPGQSKLKTGD 71
           D+ +WPPGFRFHPTDEELV+YYLK K+C+++LKL+IIRE+DVYKW+P ELPGQS LKTGD
Sbjct: 12  DDKEWPPGFRFHPTDEELVVYYLKRKICKKRLKLNIIREVDVYKWDPEELPGQSILKTGD 71

Query: 72  RQWFFFSPRERRYPNASRLSRATRNGHWKVTGKDRIIKYNSRNVGVKKTLVFYQGRAPNG 131
           RQWFFFSPR+R+YPN +R +RATR G+WK TGKDRI+  NSRNVGVKKTLVFY+GRAPNG
Sbjct: 72  RQWFFFSPRDRKYPNGARTNRATRQGYWKATGKDRIVVCNSRNVGVKKTLVFYRGRAPNG 131

Query: 132 ERTDWVMHEYTLDEDELKRCKNVKDYYALYKLYKKSGPGPKNGEQYGAPFKEENWNDGEC 191
           +RTDWVMHEY+LDE+ELKRC NV+DYYALYK+YKKSG GPKNGE YGAPFKEE+W D E 
Sbjct: 132 DRTDWVMHEYSLDEEELKRCSNVQDYYALYKVYKKSGAGPKNGEHYGAPFKEEDWADDEF 191

Query: 192 QVLVHSDAQEPQVNILDEVTSVDRERANVQIQLSSDDIEELLQQFTNDPV-LELPSVSGI 251
           Q +      +  V   +EVT VD    + Q++   +D EE+++Q   +P   EL +    
Sbjct: 192 QCVNGMFTPDIPVKKHNEVTLVDNFIQSAQLEPPLNDFEEIIKQIGEEPAHNELQNNDFT 251

Query: 252 HQFDSV-VQVDDKEETASTMIDTYSQNHILPEADKVLNLSGQPSDLHPCFQFTQAGLFQL 311
           +    V V V  +EE  ST++D   +  +   A + L  SGQ  + H  F F Q+G   L
Sbjct: 252 YLLPQVCVMVTGEEEAQSTLVDPSFREFVCEPAGE-LTTSGQHCNKHTSFNFDQSGTATL 311

Query: 312 QSFES-EVSSSPKYRE-----EEDFLEIDDLIGPEPTLVANVNPLGN---SEFDGLNELE 371
           Q  E+ EV+S   Y +     EEDFLEI+DLI PEP+      P+ N    +FDGL+E +
Sbjct: 312 QLHEAPEVTSGTNYEQAPQLNEEDFLEINDLIDPEPSFSNTEQPVENLQFDDFDGLSEFD 371

Query: 372 LFHDANMFLRDLGPV----VPETFLDPY-----------LSADGGI-----------LVA 431
           L+HDA MFLRD+GPV    V  +++ PY           L  D  I           LVA
Sbjct: 372 LYHDAAMFLRDMGPVDQEAVSHSYMHPYGCDMVNQVGYQLQPDSIINAVDYQLQQSNLVA 431

Query: 432 NDVNGHLQ-----YDQMGNGFW-ENETENSFSFPEPHQQFGTQPNLGVGYESVSSAAPEI 491
           N V+  LQ      +QM N  W   +  N  +  E H     QP  GV  ES S+ +   
Sbjct: 432 NQVDCDLQPQFFDAEQMNNQLWVHGQRSNMLAASESHNGNLFQPTPGVVCES-SNNSTRT 491

Query: 492 RDNQIANGGGSSASKFSSNLWAFVESIPTTPASASEN--VNRTFQRMSSFSRLRLNT--- 551
             NQ    G ++   FSS LW FVESIPTTPASASEN  VN+ F+RMSSFSR+++N    
Sbjct: 492 NGNQGGKEGDAADGWFSSALWGFVESIPTTPASASENPLVNKAFERMSSFSRIKMNVKSI 551

Query: 552 ---------LNTFNANVAIGNPKTSARRTGTNKGFFLFSILGVLCAILWVLLGDVRLQER 559
                    +N  + NV   N   S R    NKGF L SI+GVLCAILWV +G  RL  R
Sbjct: 552 NVDAASRIRMNVNSINVVAANGAASVRSASRNKGFVLLSIVGVLCAILWVFVGSGRLLGR 611

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NAC17_ARATH3.9e-11244.87NAC domain-containing protein 17 OS=Arabidopsis thaliana GN=NAC017 PE=2 SV=1[more]
NAC16_ARATH1.8e-10942.76NAC domain-containing protein 16 OS=Arabidopsis thaliana GN=NAC016 PE=2 SV=1[more]
NAC13_ARATH6.0e-6854.23NAC domain-containing protein 13 OS=Arabidopsis thaliana GN=NAC13 PE=1 SV=1[more]
NAC78_ARATH1.1e-5653.37NAC domain-containing protein 78 OS=Arabidopsis thaliana GN=NAC078 PE=2 SV=2[more]
NAC53_ARATH6.4e-5455.06NAC domain-containing protein 53 OS=Arabidopsis thaliana GN=NAC053 PE=2 SV=1[more]

Match Name        E-value   Identity  Description
A0A0A0K9G3_CUCSA  1.4e-233  73.92     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G052680 PE=4 SV=1
M5W6F5_PRUPE      2.1e-144  51.33     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003410mg PE=4 SV=1
U5GS92_POPTR      1.0e-143  50.00     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0002s06210g PE=4 SV=1
B9GT49_POPTR      3.0e-143  50.00     No apical meristem family protein OS=Populus trichocarpa GN=POPTR_0002s06210g PE...
A0A061FKH8_THECC  2.6e-139  49.67     NAC domain protein, IPR003441, putative isoform 1 OS=Theobroma cacao GN=TCM_0365...

Match Name   E-value   Identity  Description
AT1G34190.1  2.2e-113  44.87     NAC domain containing protein 17
AT1G34180.2  2.8e-108  41.88     NAC domain containing protein 16
AT1G32870.1  3.4e-69   54.23     NAC domain protein 13
AT5G04410.1  5.9e-58   53.37     NAC domain containing protein 2
AT3G10500.1  3.6e-55   55.06     NAC domain containing protein 53

Match Name                        E-value   Identity  Description
gi|659113672|ref|XP_008456695.1|  7.7e-241  76.80     PREDICTED: uncharacterized protein LOC103496564 [Cucumis melo]
gi|449446361|ref|XP_004140940.1|  2.0e-233  73.92     PREDICTED: uncharacterized protein LOC101217428 [Cucumis sativus]
gi|595796617|ref|XP_007201176.1|  3.0e-144  51.33     hypothetical protein PRUPE_ppa003410mg [Prunus persica]
gi|743840791|ref|XP_011026293.1|  1.5e-143  49.51     PREDICTED: NAC domain-containing protein 74-like isoform X2 [Populus euphratica]
gi|566156516|ref|XP_006386290.1|  1.5e-143  50.00     hypothetical protein POPTR_0002s06210g [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
Vocabulary: INTERPRO
Term       Definition
IPR003441  NAC-dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031124 mRNA 3'-end processing
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003677 DNA binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0000166 nucleotide binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g21380.1  Cp4.1LG01g21380.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term     Source Description         Alignment
IPR003441  NAC domain        PFAM     PF02365         NAM                        coord: 17..143 score: 3.8
IPR003441  NAC domain        PROFILE  PS51005         NAC                        coord: 16..166 score: 53
IPR003441  NAC domain        unknown  SSF101941       NAC domain                 coord: 12..166 score: 8.5
None       No IPR available  unknown  Coil            Coil                       coord: 215..235 scor
None       No IPR available  PANTHER  PTHR31989       FAMILY NOT NAMED           coord: 1..492 score: 1.7E
None       No IPR available  PANTHER  PTHR31989:SF27  F23M19.14 PROTEIN-RELATED  coord: 1..492 score: 1.7E