Cp4.1LG06g05890 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG06g05890
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionNAC domain protein
LocationCp4.1LG06 : 3580976 .. 3601081 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TACGCCCAAATTCGTAAATGTATATATAATCTGCGGCGCCATCGTTGATTTTGGCGACAAAACAATGGCCACTGGCTTTGGCGGCAATGGGAGTAGCAGTTGCAAGTACAAGTGCGACACTGCCGCCAATACTCTGCAATGGATAAAAGCCATCGCCGACTTCATTAGACCTTACTCGTTCCTGATCAATGCTCCCGTCGTCAATTTTTTTAAGGTTTGTTGATTTATTTTTTCATTCGCACAAGTTTTGGAATCTGCTTTGAGTGTAGCTTGTAATTTGATTTTGATTTTTCCTTTTCACTCGTGGATGTTCTTTTATGATTTACATCATGCTACTGCTTCCATTCTCTTTTACAGGATAGACTATGGGAAGCTGTTGATGAGGAATGGATGGAGTGCTTGCGCAAGGAACATGTGAAGAATCTGCTTCTAATTCCTTCTGGAGCTGTCCAGGTGCGTCTGTAATTGGAGAACATTTTTAGACTTGATTTTATTTCAATTAGCTTGTGTTTGAACGTTTGAAGCTTAGTTTTGAAGCTAGATTTGAGCTCACTTGTAAGATTTTTGTCCATAACAAAAGTACGCAGGTAGTATTTGACTGAATTGACAGTTACACTTTCTCAATGATCCATTATAAGCCTTATTTGGATATGTTTAGGACATAATCCTCTATATGCATGTTAATTTAGACATAATTGTCTGATTATCTGGATCAAAGTATAATATCGATCAAAATATTGATTCGTACCTGTCGAGTAATTTAAAAATATTCACGGAACTGAAATTGATCATTATTTGTAATAATAACGTCATTATTTTCAAAGTAGTTATTATCCGTGTTGGTATGTAGTTGGATGATTAATTCTATGGAAGAATGCTGAGACTGTACAATGACATATTGCATTGCATAAATCCAGTTTCAATTTCTGGGAAAATAGATGTGTAATGGATTGAATTTTATTGTTATTATTATTTTATTCTGAATCAGATACCATAAAGATTTCTGACTAACCTTGTGTTTGCTTGGGGAGCATATAAATCCGCTTATCAACATCTATAATTGATTAGTTGGGTTTCAAAATAAGGGTGAACTAAAAAAAAAACTCCCATATTTAGGGGTGTCGTTAACTAACTCCATATGTTGACGTCTTTTTTTTTTGTTCCTTTTAAAAATTGAATTAATTTTTTCATACTGTCAGTAGATTTCTTATTAAATATTGGTACATCCTTATGGGGCATGTGTGATGTCTTTGACTAATTAGATTAAATAGAACTGAAGTTATGAAATGGCAAGAACTCAGAAATGAAAAATGAGTGGCGAAATATGGCTAGAGTAAAAAAGGTTGATGATAGCCTTCTGCGTGTCACTGTATGTTATGATATATTTCTAAAAATGCATCATCGTTGTTGAGAAGTATCAGAGTACGAAAGTTATGAGCATTGGTCACAGGGGAGCGTTTTGTCTGGAAATATTGCTGATAGTTTAATCCATTTTAAGGAATTTATGGAACTATGTGATCTGCTTTTCTTTGTGTTTCTTGTTATGAGTGCATTTACTCTAATGTACTGAAGTCTAAGTTTCTGATTGACAAAATCTTTCAGGAATACTGGCCAGATTCACTAAAGAAATTTATCCGTACTTCAAGATCTCTTGCCTTTGGACGTGAACAAGCAGACTTGCAGATGGTAGTGCTTTTATGTTATACAGTTGATAGTTTCCGATTGATTATATAGATTTGCTGCGTATAGCAAGAAAAAGAAACATAGGCTTGAAAATTGCATAATATCCATTATCCACTTCGACTCATATGTTACTTTCAAGAAGATGAGAGCTCGCTCCACTCAGGAAGTTCTCTTTCATTGAGCTGTCTTCTAAACCTGAGATTCCAGCTATTATCAACCAAATTCCCTTATCCTTGAAACAATGAAGACTCACATTCTGTAATACAAAAGAATGTGCAGATTTTCTGATATCAGTCTCGTAAGGATAGAACAGAAGTATCTGAACATGGCATTTTAAAACGTGTTGATGACCTTATCTTACATAAGATTTCTGTAGATTTACCCCTTTTCCTTTTTTTTTTTGGATAAAAACTGGGAAGCTTCTTCTTGATCTTCACCCCTCAACTTTAGGAGTCTGGTCCCATTCGTATTCTTTTGTTGCTATTAGAATTATTTTTCACTAAGGACATGGATGGAGAGTTATTGACATTTAACCACCCTTTCTTTAAGATAGACTTCCTGTTATTAGTGTGTCTTGTTGGATTGAGACAAAGAGTGGTCCGTGGTTGGAGTATAATTAAGCTACTTTGAGGCCTGTTTGTAACTAACGGTTACTAGGAGTATGGAAAGGGAAGGTGTGAGTGGGTATCTATCCGGCTAATTTTATTTTATTTTATTTTATTTTATTTTTTAATTTTATATAAGAGGGAATTATTCCTTTGGACATTGTACTGGGAGATGGGAGAGCGAGATCTCTTGAATCGCACTTCTCTGTTTCTTTTTAATAATAAAGAACTGTCACTAACGTCATCCCACTTATGCGTGTCAAACAGAATGTTCCAGGTTGATTTTATCCATTTCTTAGTACCACAGTTGTGTGTATTATTGCAGGTTCTACCTGGTTGGTGCATCGCTTCACTTAACACTGTTCTTTCTCAAGGCATGAATCAGAAGAAAAAACATGAAGTAGGTTATTGGGTCTTACATTTCCGAATGGAATACTTTATGCCTTTTCCAATTTACTAAGAGAGAGCTATCCTTGAGTGCGTGTTAAACAAATCATTTAAATTCAAAATCAGAGAGAAACATATCTATCAGACTAAATAAGTTAAAATCTAAATCAGTAACAAATTACTCTTATTTATGAAATAAAGTCCAAATATTCTCATTTTAATGTGAAACCCTCTTTCTTTTCAAAGCTTGACCATGATTTTTAAATTGAGGTTCTAATTCTTATGATCTTGTTTTCCTACGTGGGAGTCGAATTTCCTAGCATTAGTTTTCAATTTGGTGTAAAACTATGACCATTTAAATTGAATATGATGTATTTCATCTTTAACTTTGCATCCGGATATGATTTATGCTTGGTCCATTTTGTATGATACAATCTGTGGGTTTCATGGCTTGTCAAACGCGTTGATTCAATCTACAAATTGTGTTGTGATATATTATAACCAAAACATTTTCTATATGGTTCATTAAGAAATTTTTTAATGCTTCAACAGGTTGAAGTCCTTTCCGCCATTATTAGTTTGATTGCAAGTGACCTGAAAAGTCATGCAATTGTTGATGTTGGTGCTGGGCAGGTAACACTTACTTTCATTACTGTATGCAAGATGGGTAAGTTTGATTCTGCATCCAGCTTCTTTATATTAGACTGTACTAAAAAAAGAGTCATTCCACAGGGATGGGCTTAAACATCTTCCATTATTTTGGATCTATTAGGTTCCTTTCATTTTCTCCTAAAATGTATGGGAAAAAGTACTTCCTCAATTAAATATGTAAATTTCTTTATGAAATCAACTTAGTTGATTTTATGTTGTAGTACTTGAAGAAATGAACTTCACTTCCTTGCCTTTATGATCAATGTCTGAGAGGTAACTGAGTCTAGGATATTCCTGTACATACTTCTTCCACAGCAGCAACAAAATTCTATTCTTTCTAGGCTTCTAATTTAAGCATTTTCTGATAACTCCTTTTCTACATTTTAATTGAATCTCACTCATATTTTTTGAATGATTTATTCCTCTGTAATCTACTTTCTCTGTCGATCTGTTTTTTCTTCCATTATTGTTTCGGTGTATTCCTAAAAGGGTTTATGATCCATATAAATTCTTCTTACAACAAAAATTTCCATGTACTCGTATGCTCTATATAACGTTCTACTATTGCAAGTGCTTTCATACTTATGATCTTTTCTTGTTAGGGTTATTTAGCGCAAGTGCTTTCCTTCCATTACAAGCATTCAGTTCTTGCAATTGATGCTTGTTCTCATCATGGAAATGTGACAAGTGCACGTTCAGAACGGATTAAGAAGTATTACTCAGCACAGATTCGTAAATCTGGGTAAGGGTATAGAAAACGTTTTTAGGCAAACATTTTGTGTTGATGCGAGCAGATAGAATTAAAATGTTTTACTTTCATTTCTCCTATTTTTTTAAATCTCTTGGAAGCTATTTCTCTTATTTGTGTTTTCTTGAACTTGGTCCCATCTTCTATTGGTTATCTTAAGAACTTCTGGTTTCAAGGGAGTGATAATGAGATCAGCGAGAGCATCTACCACCCACACGGCAATTTATGAGGCCAAACTAAGCTCATACCCATTGTATTTATTCTTTTATAAAATGTTCCCTTCTGAAGACTGATATAAGTTATAGAAAAGTGTTATTATGTATCTACAAATCTCAGAGATCCACATTATTAAACTTTCTAGTCCTCTCACAACAACATAGAAATCACTTTTCAATTTTGATGAAACTCAAACCATCTTGGATTGAGAAATGGTTCTCGCATATTTCCATCTCGAAGAAGCCAGTCACTGATTTTGTATGTGGTTTCAGTCATTGTAACTTAGAGGGAAAGACTTCTGGATACTCAGGTGTATCTTAGAGGACTAAATGCAAACCTTTTTTTTTCTCTTGCCCTGCTTAAATGTTCTTAGTTTCTTACTAGACTAAAATAGTATTTTGGTTTCTACTTATTGTTTTTTTTAACTTAATATCTAAAAAATCATTACTCTTTTTCTAATTTCGCTGAACTCTTTACAGATTGGAAACCAACAATTTGAGACTGCCGAAGGCAATGACATTTCATGTATTATCCGTTGATGCATTAAAATCCCTTGCTAACATGTCACTACAAGACGATCATGCTGATAAGACAAGTGTGACTGGCGATGATCTAGAGAAAACCAATCGGCAGGAGTCAAAGGGTTTGTGCCGTTCAGGCAAAGAGCCTTCATTGGTTCTTGCTGGACTTCATGCTTGCGGTGATCTTTCAGTGATAATGCTCAGGTTAGAATTACATGATATTGAATTACTCTTTTTATTAAGATGTGTCAAATATACATTTCTTGAAAATGCAAAGTAGACATAAGTGAAAATTTTGGTGATTTTTGGAGCCGTATGAGTTTTCATTTTTCTAGTACGTTTTAATGTTTTTTTTTTTTTAAATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAACTATCATTCTGAAGAATGAAAGAGGATACAAAGGGATTAGCTAAAGGGAAAACATGAATCTTCTTTTATAAATTTGGACGAACTAAAAAGTTTATGCGGTGGCTGTGAGAAGCTCACTGGTCAGGAACTTTGGTTTCCCATAATATATGAAAATATCACAAAAGATAACGTGTCATGAATTTGGTTCGTCCACCTATATGTCAAATCTCATCGAAGATACAGTGCTTTTAGAAGAAAAAAGGGCAAAGGAGATGGGCAGTAGTTTTTATCATACTGGTGTTATTATCTCTCTCCTTTCATTGCAGGATTTAACAAATGTACTGTATTCACCATATAACCAGGACCTTTGTTGAGTGCAAGGAAGTAAAAGCAGTCATTAATATTGGTTGCTGTTACAACCTACTCTCGGAGTATGGATCTGACTATGAAGGTGTCCAAAATGGATTTCCAATGAGCTTTGGTGTCAAATCTTCTGGCTTGTTTCTGGGAAAAAGTGGCAGAGACCTTGCATGTCAGGTCAGCTTTTGACCTTCTCCACCATCGTACTAAAAAATATTTCATTTTTGTTTGTTTATCTTCTCGAGATAAGTTGGGTTATCGATTGAGTTTTCCTTGAATTACCTGCATGATAAAATTAACATTAGATGGAGTTATAGTTCACATTTACTTGGAAAGGAGACATAAAAGTAGTTTTATGCCCAAAGTATTTTTAGAATTAAACTATATGCAAGGTGTACATAGTTGTTTCATGATTGAAACACATTGTCCCCAGCAGATTTTAAGACTCTTGTTGTATTGTTCTAGTTCTTTAAAATGGACTTTAAGTGAAGTAAAATATAGGTAACTACATTTGGCTTTGATGCCAGTTGTTGCCCGAGGAGGGAGAGCTAGGGCTTTTCGTTGTTGTTTAGCTTATACTTTATTACAATAAATACTTTAAACAGATAAAATCTAATCACAAGTAAAAAAATAAGTGCACTTAGAAAATCAATAGATCTATAATTTTCAACGCTCGATTAAAATTTATAACTTGTTTCTCTCATCAGAGTGCAGAAAGATGGAGGAATTTGGAAAATGAGGGTGGTGTCCATAATTTTGAGTTGCATGCTTTCCGTGCTGCTTTCCAAATGGTAAAAGTTGTGACCAATTGTTTATTATTTCATCATGCTTTATCATGCTTAACTTAGTTCAATTATAATTTGCTTTTAGCTTCAAGCATATTTTGAAAAAAATCCTATTGAATCAGTTTGAAATTTGACCTTGTCTTCTAACATTATTGATTCTTTCCTTCTAAATTCAAATCAATCAAATATCTATTGTTGGAGGTCTTTAAGCCAAGTTACTTCATTAGAAAACTGTTGTTTTTGTCGTAAAAACTCTTGTGAGCTTATGGAAATGGAAACTGACCTATATGTTCTAGTTAGAGTGTAGGGTCCTATAACACAAAAGATCCAACTGAAGTTGCTAATAGATTTGACAATTAAATTTGGCCAACCATAAGTTACACAAAAGATCTCTGAAATAGCATGTAATCTGCAGATCTGTTTGCATGAACTTTATGTTTTTCCTTTTTAAGTGTCTCTTTCTTTCTTTTTTCTTCTGTTAAGGTACTCTATAAATATTATCCAGAAGTTGTAGCAACTTGCCCATCTGTTGGGCGTCAAGGAAAGGCATTGCGTCGTCGAAAGAAAAGGGAGGCTGCATTATCCTCACAGTGTCATGAAGATAAGCTTGAGGCATCACAATCAGGTATGAGATGGTAAAGACTAGAATATTCTAGATTTCTGTTAATCAATGGCATTCTATCACACACCCTAGTATCCTGTGCTGAAATGGAGGATTTTAAGAGAGGGTTAGTGGGATGTGCAGCATAGCTTCAAGGCATCCGTTTGTTTTATCTAGTAGATTGAATAATACGAATGCAATTTTAACTGTTTCCTTAACCAATATTCAAGTAACAATTTTCATCTACTTTGTTACCCTCAAAATATGCTGCAGATCTTATTGGGGGGTTGCCAGACAAGAGCAATGCCTTTTCTCACACCATTTCTGACTATGGGAGTACGCCATGTGAACAGGCCAAATCTGTTGACAAATATCTTCTTTTTGAAAATTTTTGTCAATCTGGATTAAATCGCCTTGGACTTCAATCTTTGCAGGATATGGATTATTATGGAATCTGGATGGACAATGAGCCTTTTGCTGTATGTATACTTACTAAGTTCCAATACTTGTTATATAGGACTGAGTCATATACCAGTTACAATAATATATAAGCTCTAGTCTAAACCAATTTAGTTGCTCCAAAACTAATGATTGTTGTTAATTTGAATCTCTTTCTTCTTAGATGTTAATTTATTATTTGATTTGCTTCAGAATTATGAATGCTTTATCGAGTGTGGTATCTAATTTATTCCAAGGAATAGGACAGTTGTCTTAGTATTTAAGGATTAAATCATACAATTAGAAAGTTAGAACCCAAGGTACCATTAGATGATGTTAGTGTTACAGGGGATTCCGTTCATCTTATTGAAGCACATGAGTTTAGTTGGAGAATGTTGCACTAAGGTCTCAAATATTATTACCGGTTAGATGATGTAGGTGGTTGCTTTGGTGATGGTCATTATGGGTTCGCATTCCAATGCATCCTTGCATCATTCAAAGGCTTTGTTTATGTTAATTCACCAAGTTCCTCAAATTGCTTGGTTTATCTGATAAGTTTTTTTTATCGTTTAAATTGCAATACATTTTAGTTTTATTATGGGTTCTTATATATATCCAGATGTTAATGGCTCTTATATATATATATATATATATCCAAATGTAACTAATAAATCGTATGCAGGAACTTATTGGGCCTTACTGGTCTCTTCGAGCTGCTCTGGGCCCAGTTTTGGAAACATGTATTCTGCTTGACAGATTACTATTTCTCCAGGAGCAAGGTGAATCCCTTGAAGCCATGTTGCTACCTATTTTTGATCCGGATTTATCACCGAGGAATGTGGCTATAATCGCTCGGAAATTTGGTGCAACATAAGATTCATTAGGAGAAAGGTAATACTAGTGAATATGGCTTTTTCTCAATTTTTGTTACTGAATAATTTTCAATGTGAGAAAACGTTGATCCAAGAGAGTTTTATAGTTGTATTCATGACAGAGAATCAAATATTACGCTAAATTTATTTACCTTAACATTATTCGAATGCTTTAGGACTTAAGAGTAGAAATGTTTTGTTCTTTGATTTATACACCAATAAGCACGTATATGATACATTAATTTCTAGAAATGTAAGACACGGACACATTGGGATCATTTTTTAGAAATATATGTATATATGCTAACGGATTTGATACTTCAAAATAGATAGCACTCATAAATTTAACATGAAAAAAAAAAAAAAAAAATATTTGTAATAATTAGTAGCGTCGACTTCTCCCCACAATTTCTTTGGTCTTGCAAATGAATTTTTAAGATTGAATTAAATTTTAGAGTTGATCATAGTAAGATAAGTAATGGGTTGGGCATCGAGCATGTAGTGACGATAGGGTTAAATATATATACTTTTTTCTTTTTAAATTTAGAATCAATTTTCTAGGTTTAAGTGTCCCTACAGGTCTGGATGTGACCGAAATTGAAAGAAAAAATAAATTAGGACACCTTTTGGTATTTATATAGTTCTTAATTAATTTTTAGAATATTTTAAACTTGATTTACAATATAGTTCCTTTAGAATTGTTTGATTTTGTTTGCCAAGTTGACATCTTTTATTCGCATATCTAAACTGTGGGGCTTAAGCAGACCTGATTTTTGAATTGGTCATAGTGCTTACATGTTTATATCTTCATATTTCTATTATAAAAAAAAATTGTAGTCGAAGGCTTTTACAAAGTTAGTTGGATATGGGTTGAAGAGTACCAATAAAATTAACCAGTCACATCTGTCCAAAGGTTGAGTTATAGATTTGGGAACTATTGTCTGGTCCAGTTAAGTGTAACTTATCCATTTCTTTTTCTTTGGAATAATGAATTGGCTACTAGGTCATATTTTGCATTATCCTTTTTCAGTCAATGCTTCATATCTTCTCAGTTGCCTGATAAAGGAGTAGTCTGATTAATGACGATGATACTAACTGAAATTTTAAGTATAGGAACTAATACTAAACAATGTTATAATCCGTTGTATTGGTGTAGCAGGATTTAATGGTTCTGGATTAATGCATAATTCAAGTGCTGAGGATGAATTATTCAAATTCTGCAGAACTTATGGGTTCCCTTTTCTCACTGTTTGTATTTCATTTCAGATTATTCCTTTAACTTTCCAATACTGTCATTTCTGATATCTGTCTTTTATAATGGCTATTTACAGCCTCACAGGCATAATTTATATTGCCCAATGGTATTATTAAAATTCATTGTTAGTAAGGCATATGGCCTCTTTTTTTTTTTTTTAAACACCCCGATTACCCAAATACTAGCAAAAAGGCTTGGAGTTACATNTAGGTGTCAAGGTTAGGATATCATTCTTATTTATCTTCTTTCTTTTTTTGGTTTGTTGCATTAAGACTATGAGAGATTGGATAAGAAGATCAATCAGAAAGAAGGTTTTTTTTTGTATTTTCTAACAGCAAAGTCAATAGTTTCCCAGTGGTAAGATCGTAATTTCACTCGATGCTTTGGATAATTCCGAAGACAGGAAGGAATATGCAATTGAATCCTTTCGGATCTTAGTAGTTGTTTTAGGCCTAATAGTCTTGTCCAGGACCCTGAAAAACCTCTATTTTAGTAACTAGTATGACCTAATCTTCATAGGTTGACCTAATGATCAATCAAAACAATGAAGATAACTGTGACGGTCCAAGCCCACCGTTAGCAAATATTGTCTTTTTTGGATTTTTCCTTTCGGGCTTTCTCTCAAGGTTTTTAAAACGCGTCTGCTAGGGAAAGGTTTCCACTCCCTTATAAAGGGTGTCTCATTCTCCTTCTCACGGGATCTCACAATAATAAAGGAAATTGTTCGATGAAAATAGTTGAAGGTATGTACAGGCTAGTTCATACTCTTACGGACATCCAAAAGATAAAATGTAATTAATATTGCCCTGAAGAATTTTTTGTGGTCAGTATGCCAAACAGGAACGAATGACGATAAGGAGCACAGAAACGCAAACGTCGGCATGAAGTTGACACCATACAAATCGATCGAACTGCAATTCAGGTACGGCTTCTTTAGTTAAACAGAAGGCAGCTCCTCATGGGTGGGGGGGATACATATCTCTATTTTGAATGATAATTTGTCTCTGCGCAACACTAGATTCTTCTTGTTCTTTCGATCAAAGACGTGGTTGGTGGACATTTCAGAAACTGCTGCTCTTTGTCTTTTTATGCATCAAAGAAAAGGTTGCTATCAGTAAAAAGGTACGTCATCTCTGAGTAGAACATAAATATGGTACACTTTGAAATAGTTAGTGAACTTGTCAAAAGGTTGTGATAGTTCATTATTTGTTTGGAATCTTTTAACTTTTGGGAAAGTTGAGCATATCTATAAGGAATAATGTATTCAAAGGTTTGAATACATTTGGATAATGGATGGGATGGATGATATCTCCGTCGTAATCAAAATTATTAAGCATTGNGATGCTTTGGATAATTCCGAAGACAGGAAGGAATATGCAATTGAATCCTTTCGGATCTTAGTAGTTGTTTTAGGCCTAATAGTCTTGTCCAGGACCCTGAAAAACCTCTATTTTAGTAACTAGTATGACCTAATCTTCATAGGTTGACCTAATGATCAATCAAAACAATGAAGATAACTGTGACGGTCCAAGCCCACCGTTAGCAAATATTGTCTTTTTTGGATTTTTCCTTTCGGGCTTTCTCTCAAGGTTTTTAAAACGCGTCTGCTAGGGAAAGGTTTCCACTCCCTTATAAAGGGTGTCTCATTCTCCTTCTCACGGGATCTCACAATAATAAAGGAAATTGTTCGATGAAAATAGTTGAAGGTATGTACAGGCTAGTTCATACTCTTACGGACATCCAAAAGATAAAATGTAATTAATATTGCCCTGAAGAATTTTTTGTGGTCAGTATGCCAAACAGGAACGAATGACGATAAGGAGCACAGAAACGCAAACGTCGGCATGAAGTTGACACCATACAAATCGATCGAACTGCAATTCAGGTACGGCTTCTTTAGTTAAACAGAAGGCAGCTCCTCATGGGTGGGGGGGATACATATCTCTATTTTGAATGATNTGATTTTTAAAATTTTTAATACATTGTTTTATTGATTTTATATTTTTATTCCAAAACTTTTAAATATCTAACTTTTTAGCTTTTGTAGGTTGAAAATTTATTTTAGTCACTATTACTATTATTTCTTGATAATTCACTCCAAAAAAAAAAAAAAAAAAATCCTGAATTTTATGTCTGAGATTTTTTAAATTATAAAATAGTTGGTATATATAGAAACTTCAGTAATTTATTTTTTTTATTAAAGAAAAATTAATCCTATCGTTTTTCTTTATCATCCCGTGCTTCTACTTGGAATTGGCGGCGGAAAAGCTCATTTCTTTTTCCCCCTCAATCTCCCAAACGTCAGAAAACAACCTCCCGAAGAATAAAATTTTATCAAAGGCCAATTAGAAAATACTTTTTAATATTATTTTGAAACCAGCCTCTTTGGGCCCAATATCTTTTCCTCTTTTTTTCTTTTGTTCTAAAATGGTAATTGATACCCTTATCTTGAATTATAATTGGCGTTCTATAGCTAAATTTAATACTATGGGTTTAGATTTTGAATAAGTGAGTATTTAGCTTATCAATTAGTCTAGGATTAAGAAATCTAACTTATTTTGGTATATGGGTTTAGTAGCCCAAGATGATGAAGCCCATCTTTTACTTGTTAGATATGCTTCTTAACAAATTATAATTTCAATGATAAAAAAAGAACATGTTACACAACCTCAAATTTTTGTACCCTGTTTCTTTGCACAACCCAACGTAAAGATTATTGAATGTTAACCTTCCTGTTCGTGTCTCATAAATCCTATAAGTGAACAAAAATACCATCCATTTGGGTAAAGTCTGCCAACATAAGCATAACTCTTTAGTCGTGTCATCCATATTCAACTTTAAAGTAGAACTTATCTAAGAGTAATTGCACGATTCATAAAAGATTTAGGATTTCTTTAGTATTTAACGCCCCCTTCCAAACTTTGAACTTTAAATATTCTAAATCTATTCTTTTAGTATACTTTGAATTATACCAATATAATAAATAGTTAATTGTTGTGTTTTTTTAAGCTGGTTGAAGAATTTGAATTTTTTAAAACTCCTAATTAAAAGTATTTATTTTAATGANCTTTTCCTCTTTTTTTCTTTTGTTCTAAAATGGTAATTGATACCCTTATCTTGAATTATAATTGGCGTTCTATAGCTAAATTTAATACTATGGGTTTAGATTTTGAATAAGTGAGTATTTAGCTTATCAATTAGTCTAGGATTAAGAAATCTAACTTATTTTGGTATATGGGTTTAGTAGCCCAAGATGATGAAGCCCATCTTTTACTTGTTAGATATGCTTCTTAACAAATTATAATTTCAATGATAAAAAAAGAACATGTTACACAACCTCAAATTTTTGTACCCTGTTTCTTTGCACAACCCAACGTAAAGATTATTGAATGTTAACCTTCCTGTTCGTGTCTCATAAATCCTATAAGTGAACAAAAATACCATCCATTTGGGTAAAGTCTGCCAACATAAGCATAACTCTTTAGTCGTGTCATCCATATTCAACTTTAAAGTAGAACTTATCTAAGAGTAATTGCACGATTCATAAATGATTTAAGATTTCTTTAGTATTTAACGCCCCCTCCCAAACTTTGAACTTTAAATATACTAAATCTATTCTTTTAGTATACTTTGAATTATACCATTAATTGTTGTGTTTTTTTAAGCTTGTTGAAGAATATGAATTTTTTAAAACTCCTAATTAAAAGTATTTATTTTAATGAGCTGAACTACGTTCATGATGATGTGTATGGTTGATATTCAAGTCTTTGACATATAGGAAACGTTATTAATGCTTATAACACCATTCTAACATTTTCTATATATACAATTTAACTCGAAGTTCATAATTAATTTTTCTTTTTTGAATGGTACGTTTTTCTTTCGTTATTTATGTGTAGAGTTATAATTAGTTTGGGAGTTTTCTAAAATTAATTTTGGATACTAATGAACATCAATCTATTGGATGTTCTTGCCACCCGCTTTACTAAAAATCAAATTCCATAAATCGTTTCAAGTGGACATTTCAAACTTAGTGAGCTTCTTTTTGTTTAGTGGGGTTGTGGGGCTGTTGTTGGTTGGATCTCTTTCTGTGAGTTGTGTTCTTTCTGGTTTTGTTGAGCTACTCTTTCTATTTATTTTTGTACCGTTTTTGGCTCTTGTTTATTTGTTTCTTGATCTCAAGGTGTGAAGTCTCCTTTGTCATTATTACCTTTGTAAAAGAGGGAAAATAAATAGAAAATACCCAAATTTTGAATGTTGTGGTTGGATTATATGTAATTTATACCATATTAATTCTATGAAAAGTATTGTGTATTATCCAACCACAAAGGAGAGAGGAATCTATAAAATTTGGACCCCCTTTAGAAAAAAATCTTTCTTCCTAAGTATGCATTAAGGTATCAATGAGGGAGGGCTAGAAAAAATTATAGGAAAACTTTTCATATTATGAGTTGATGATTGCAAAGTCCATGAGCTCAAGCCCACTCGGGCTAAGCACACCCCTAAGTAAGGCGTGCCCTCCAGGTGCTCGCAAGTTTGAGTCGTGGAATGTGATCGACTTCTCTAAATTGAAGTTTAGTTCCATATTAGGCTTTTAGGCCGAGCTAAATTGACAAACAACTTATTGTTGTATCCCTCATAAGATCCTAACATTTATGTATGGGGGAGAAAATAATATAAATATGAGGCCAAGACCAACTCTCCATTTAAGACACACAACCTAACTTGTAATTAACTTGACATTTTGTTCTTAGTATTGACTTGAGCACGAAATGTTAGAAGAAAATACCACATAGATGCGAGATCTCTCTTTAATAGGGTGACTCAAATTAAGACTCTTTTAGTCGATCTCAAATCTATTTTGGTCGACTAATTAAGTCTATGTATAAATATTTTGAATTATCACAAAACAAAGTTGAGGGTAATTACTTTCACTTTCTTTGTTCACTCAATTTATTTATAAGTGTCCAATAAACTGACAGGCTTAAAAAATAGATGGTCCATTTAAACAAGTTTTGTTTTTTTCTATCATAGTGTATGAATTGAACCTATATTCATCAATATCCAATCAATTAATTTAAATTAAATTTATTCTCTCAATTAATTTAAATTAAATTAATTGAAAGAATAGTAGATAATTATGAATTCTTAATTTAATTTGTTTCTTTTCTCTCCTAGATTAGATGGAACATATTAAAATTTTACCAAATAGGGAGGAGAGATATTAGAATGGCAGGCATGAAGAGTTAGCAGCCAGAACAAGTTGAACCTAAAAAATGGTTAGATTCAGGTGCCAACTCTTACATAATCCAAATATTTATTAATGTTGACATGGCAATAAATGCCAAATAGATAGATGCAGATTTCCAAATAGGCACTTCCATAGTTCATCAAACCAAACCAGATTCTCTCTCCCATATATTTTTAATTAATTAACTAATCAATCCATCAATTAAAATGTGTGCCCAAAAAGTTGTGTAAAATACATGGCTCATACAGCTTACAAAAATATGTCTATATATATATATATATATTAGATTCCCTGAGATTAATTATTATCCGTTTATTATTTTTATTTTCTTTAGCTTTATACAGCTTACAAAAATATGTCTGTGTATATATATATATATATATATATATATATATATTATATTAGATTCCCTGAGATTAATTATTATCCGTTTATTATTTTTATTTTCTTTAGCTTGATAAATAAATAAAAAATTATGCTTGCTTTTATTTAAATTGCTAGTTCTATAGGATGGAAAATATGGCAAGGATGAAGTTGAAGATGGGAATGAAGAATATATTCTTCGATCTCCGTTCCGTTTAGTTGATAGAAAAAGATTTATCTCTATTCCTTCTCCTATATTCCGCCCCATTAAGGTATGACTCGGGAGCTTTTCTCTGTTATTCTCCGCCCCATCAAAGTAGGTCCCGCAAACTCCATTTTCGTAGAAAAACGAATATCCGTATTCCATACGAAATAAATTAATATCTTCCCTACAACATATACTCAAAATTAAATATTTGTGTATATATATATACATGTATGGGGTAGAGCCACAGCTTTCTCTAAATAGTTTTATAATTAGAATTAACCTCTTGATCATATAGCTTTATACTCTTTTCTTTTTATTTCAATCTCCTTCCCTTCTCTCTCTCCTTCTCTCTCCTTCTCTCTCCTTTGTTTTGCAGAAGAGAAGAAGAAAAGAAAAAATGGAGAATAATATAAGCATGGTGGAGGCCAAACTTCCTCCTGGATTTAGGTTCCATCCAAGAGATGAAGAATTGGTCTGTGATTATCTGATGAAGAAGATTGGCTCTGTTTGTGATTCTTCTTCTCTCTTGATTGAAGTTGACCTCAACCAGTGTGAGCCTTGGGATATTCCAAGTAAGTCTTTCTTTTTTCTTTCTAATTTTTAATTCTTTTGGTGTTCTTATTTTTTTTGCCTTCTTCTTCTTCTTGGTTGATTTTCTGAAACCCATAATCCATTATTTAAAAGATCATACAAATATAAATAAGGTTTAGTTTAATTTAAGGGCCTTGAATTTGTATAGCGTAATTTGATATAAAGTTTATATTTGATTTTTGAAATTAGTATTTATTAAAAAAAAAAAGTTGTATTGTTTGAGAGCTATTTTGAGTCGTTAAATTAATTACTTTGGTGGTTCATAAGCTAGATGTTTTGAAGGCTTTCTATTGGTTTAATTCAATCTAGATTTTATATGATATTTCAATAGTTTTTACTATATGAAAAAAAAAAAATAATAATAATTTTGTTGCTCTTTTTTGCCCTTGTCATTTTTTTTTTAAATTTCTTCCCAATTTCGACCTTTCTTTTGAAGTGATGTTGTGTTGTTCTTCTTTTACAAATGGGCTAAAATGGTCATTTTACCAGATTAACATCAACATCTTACTCAAAAGTCTTTTCCTAAGATAACTATCCTTTATTTTTCTAAATTCCAAGATAATTTATAAAACAATAAAGTCATATATTTTCATCAAAACTTGAAAGAAAAAAAAAAAAAACTTAGTTTGTGTTATCAATATTAATTAATTTTATTTATATGAACATAAAAATTTATATTTACTTGGTTTAATTTTATTTTATTTCAGTTATAGATATAATAATATAATAAGCAAATTTGATGAATAATTCAAAACTTTCCAAATTCATAAGTTTATGCAGCTGAAACCGATACTTTTGGATTTTTTTAAAAAAAATGAAAATGATAATGAAAAAAAGAAAAGGAAATAGATGTGAGAGTGACTGATAGCTACAAGCCAACAAATTCCAACCCCAAATATCATCAAGTTGTTCTTTGTTCCCACCCAATAAGAGTCCTCAAATATGTAGAATTTTAAACCATATTTCTTCAATTAATTATTATTATTATTATTATCTCATTTCTTCCACTAATTTAATTTTGTTTCCTTACCAAATCCAATAAGGGTTGTCACTTTCACTTTTATGTGCCTTTTCACCTTTTAAAATTACTGTATAGGAATCTTTTTAACTTTTTTTTATTAATTTTTTACCTTAATTTATTTTTTATTAAATAGTGATTGATATTGACACTTTAACTGCACACAATTAGATATGGTTCGACCTTGATGACTCATCACCCAAATATTCTTTCCATTCATATATATATATTTTCTCTTATTTGACTATTGTATGTTTTTTTAATAATTATTTCAATAATCTAAGTGGCATTATCGTTTAGGTCAATAGATTTAGGTAGTGGGTAGCAAATTTAGTAGTTAAAAACAATTAAATTCAATACTATTGAAAATAATTAATTTATAGCTCATGATTATAATTATGTTTCAATTATAAAGAATTTATTCAATTATGAAGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNGAACTTCAGGGTCATGATTGTAATTCACTATTTTCAAAATTTTTAGTATAGAAATATAGAAAAAGAAAATCGTGTGTACAAATATTTATTTAGTTAAATCACGAATTTGCTCTATGATTTCAGTATAATTACAATATAACATATCAATTTAACCAAAACCTTTATGGTTTTTTTTTTTTTTTTTTTTTTTTTTNACCAAATTGTTCAATTGTTCATTTGATATGTGAATTTTGTGTAGGAGAGGCATGCGTGGGGGGAAAAGAGTGGTACTTCTTCAGCCAGCGGGACCGTAAGTACGCGACTGGGCTTAGAACAAACCGCGCCACAGCCTCAGGGTACTGGAAGGCCACTGGCAAAGACAGGCCTGTCTTTCACAAGGCCAATCAACTCGTTGGGATGAGAAAGACTCTTGTTTTCTACCAAGGAAGGGCCCCTAAAGGCCGAAAAACCGAGTGGGTTATGCATGAATTTCGGCTTGAGGGTCCACTTTCTCCTCTCAAAGATCCACCTCCTAGGGTATGAACGTTCTCTCTCTATTTCTCTATCTTACCATGCAAGTGTTTGTGAAAATGTCTGTGTGAATTTGTTGAGCTTTTTGTGTTATTGGTTTGATGAATATGTTACCTTGCTTAATCAACTCATTAGTGACGGTTTAGCTAGACTGTTTTTGTTGATTTAACACTCTATCTATCCTCTTGCCTCCAAGTGTTTGTTAAAATGCCTATGGGAGTTTGTTGAGTTTTTTCTTTGTATCATTTTCTTGTTTATTGATGTGAATTCACTTATTGTTATGTTTTCATGAATATGTTACCATGCTTAAACAACTCACTAGTGATGGTTTAGCAATAGCCGCTTGAAGTTTCTTTTTGTTGATTTGAAATGTTTATGATACCAATTTTTCTAACCATTTAGCCATGTTTTGATTGTGATATATTGGTAAAATTCGTGAGGGGTTGAATCTCTCTGATTTAGGTTTGGATTTTGTAACGACCCAAGCCCACCGTTAGCATATATTGTCCTCTGTAGGCTTTTCCTTTCGGACTTCCCCTCAAGGTTTTTAAAACGCACTTGCTAAGGATAGATTTCCACAATCTTATAAACAATGTTTCATTCTTCTCCCCTACCGATGAGGGATCTTACAATTCACCCCCTTCGAGGCCTAGCATCCTCGCTGGCACTCATTTCCTTCTCTAATCGATGTGAGACCTCCCAATCCACCCCTCCTTCGGGGTCCAGCGTCCTTGCTAGCACACTGCCTCGTGTCCACCCCCTTCAGGGCTTAGCCTCCATACTGTCACATTGCCCGGTGCTTGGCTCTGATACCATTTGTAACAGCCCAAGCCCAGCACTAGCAGATATTGTCCTCATTGGCCTTTCCCTTTCGGGCTTCCCCCTCAAAATTTTTAAAACACGTCTGTTAAGGAGAGATTTCCACACTCTTATAAAGAATGTCTCGTTCTCCTCCCCAACTGACGTGGGATCTCACAAATTTTCTCCCTACTTTGTGTTGTAATCGAGAAGGGATTAATATCAGTTTAATCTAGGGAGAATATGAGAGTTAAAATAAAATATCTGTTCATCAAATTCTGAAATTACCCTTCTAACATTTGGCATTTAGTTTCCTAACAGTGACTCATTTTGTTTAATTTTTTTCTCTGCTCTCATTCAACATCTATTGGCTTGACTTAGATATGCATAAAATAAACATTTGATGCAACCTGTATTTTGAGAATTTCCATATTAGTTGGTTTGTTTGTCTGGCACTAAAATCTTTGACTTTACAAATTCAACTTCAAAAATTTTAAATAATGTTGGCAGCTGATGGCGCAATATTTTCCTGTTCATGATAAAGGATTGCGTCTAAGAAGGCCAATTAGTTTACGTTGATAGCAACGTCACGAAGAAGTCTGCACTATTTTAACTGTTGTGGCTTGTGCTTCTTGTGTCCACAGGAGGACTGGGTTCTGTGCAGAATGTTCTGTAAACAGAAAGAAGTGCCCGCCCGACCGAGCACGGGAAGCAGCAGCTCCTACAGCAACGTCGCTGCCTCGTCGTCTCTCCCAGCTTTGATGGACTCATACCTCAGTTTTCACCAAAATCCAAGCAGCTATTTAAATGAGTTTGAGCAAGTGCCCTGCTTCTCCATTTTGTCTCAAAACCAAACCATCCCATCTCTCTCAAACCTCATACAAATGGAGGCAAACACAGGCAACAACATCAAGAACTTCACCACAATGTATGGAGGAATGCCAAATTCAAGCACTTATTCTTCAAATATTGACCCTTTTGCCTGTGACTCATCAGTGCTCAAAGTTGTTCTAAATAACATTACTAAGATGGAAACAAATGGCAACCCCTTCATAGGGCAACCCAGCATGGGAGAAGGCAGCTCTGACAGCTACTTGTCTGAGGTTGGTGATGATATTTCCAGCTTTTGGAACAGATGACCCAGAAGGAAAATGAAGGAGAAATTAGTTTATTGTAAAGAAAAATGGTGATTATTAAGAAGGATTGTAATTTAAAGGGTTTTTTTTTTTAAATTTATTATTTTAATAAAAAGATCCCAAGTGTAAATTTTATTGCATATTTTGGGAACTTTTTAAAGCCTAATTCATGGTTGATGAAATGCAAATCCATGTTTGATTTTTTTTTTTCATAATTATACAAAAAAAAAAAA

mRNA sequence

TACGCCCAAATTCGTAAATGTATATATAATCTGCGGCGCCATCGTTGATTTTGGCGACAAAACAATGGCCACTGGCTTTGGCGGCAATGGGAGTAGCAGTTGCAAGTACAAGTGCGACACTGCCGCCAATACTCTGCAATGGATAAAAGCCATCGCCGACTTCATTAGACCTTACTCGTTCCTGATCAATGCTCCCGTCGTCAATTTTTTTAAGGATAGACTATGGGAAGCTGTTGATGAGGAATGGATGGAGTGCTTGCGCAAGGAACATGTGAAGAATCTGCTTCTAATTCCTTCTGGAGCTGTCCAGGAATACTGGCCAGATTCACTAAAGAAATTTATCCGTACTTCAAGATCTCTTGCCTTTGGACGTGAACAAGCAGACTTGCAGATGGTTCTACCTGGTTGGTGCATCGCTTCACTTAACACTGTTCTTTCTCAAGGCATGAATCAGAAGAAAAAACATGAAGTTGAAGTCCTTTCCGCCATTATTAGTTTGATTGCAAGTGACCTGAAAAGTCATGCAATTGTTGATGTTGGTGCTGGGCAGGGTTATTTAGCGCAAGTGCTTTCCTTCCATTACAAGCATTCAGTTCTTGCAATTGATGCTTGTTCTCATCATGGAAATGTGACAAGTGCACGTTCAGAACGGATTAAGAAGTATTACTCAGCACAGATTCGTAAATCTGGATTGGAAACCAACAATTTGAGACTGCCGAAGGCAATGACATTTCATGTATTATCCGTTGATGCATTAAAATCCCTTGCTAACATGTCACTACAAGACGATCATGCTGATAAGACAAGTGTGACTGGCGATGATCTAGAGAAAACCAATCGGCAGGAGTCAAAGGGTTTGTGCCGTTCAGGCAAAGAGCCTTCATTGGTTCTTGCTGGACTTCATGCTTGCGGTGATCTTTCAGTGATAATGCTCAGGACCTTTGTTGAGTGCAAGGAAGTAAAAGCAGTCATTAATATTGGTTGCTGTTACAACCTACTCTCGGAGTATGGATCTGACTATGAAGGTGTCCAAAATGGATTTCCAATGAGCTTTGGTGTCAAATCTTCTGGCTTGTTTCTGGGAAAAAGTGGCAGAGACCTTGCATGTCAGAGTGCAGAAAGATGGAGGAATTTGGAAAATGAGGGTGGTGTCCATAATTTTGAGTTGCATGCTTTCCGTGCTGCTTTCCAAATGGTACTCTATAAATATTATCCAGAAGTTGTAGCAACTTGCCCATCTGTTGGGCGTCAAGGAAAGGCATTGCGTCGTCGAAAGAAAAGGGAGGCTGCATTATCCTCACAGTGTCATGAAGATAAGCTTGAGGCATCACAATCAGATCTTATTGGGGGGTTGCCAGACAAGAGCAATGCCTTTTCTCACACCATTTCTGACTATGGGAGTACGCCATGTGAACAGGCCAAATCTGTTGACAAATATCTTCTTTTTGAAAATTTTTGTCAATCTGGATTAAATCGCCTTGGACTTCAATCTTTGCAGGATATGGATTATTATGGAATCTGGATGGACAATGAGCCTTTTGCTGAACTTATTGGGCCTTACTGGTCTCTTCGAGCTGCTCTGGGCCCAGTTTTGGAAACATGTATTCTGCTTGACAGATTACTATTTCTCCAGGAGCAAGGTGAATCCCTTGAAGCCATGTTGCTACCTATTTTTGATCCGGATTTATCACCGAGGAATGTGGCTATAATCGCTCGGAAATTTGGTGCAACATAAGATTCATTAGGAGAAAGGATTTAATGGTTCTGGATTAATGCATAATTCAAGTGCTGAGGATGAATTATTCAAATTCTGCAGAACTTATGGGTTCCCTTTTCTCACTGTTTGTATTTCATTTCAGATTATTCCTTTAACTTTCCAATACTGTCATTTCTGATATCTGTCTTTTATAATGGCTATTTACAGCCTCACAGGCATAATTTATATTGCCCAATGGTATTATTAAAATTCATTGTTAGTAAGGCATATGGCCTCTTTTTTTTTTTTTTAAACACCCCGATTACCCAAATACTAGCAAAAAGGCTTGGAGTTACATNTAGGTGTCAAGGTTAGGATATCATTCTTATTTATCTTCTTTCTTTTTTTGGTTTGTTGCATTAAGACTATGAGAGATTGGATAAGAAGATCAATCAGAAAGAAGGTTTTTTTTTGTATTTTCTAACAGCAAAGTCAATAGTTTCCCAGTGGTAAGATCGTAATTTCACTCGATGCTTTGGATAATTCCGAAGACAGGAAGGAATATGCAATTGAATCCTTTCGGATCTTAGTAGTTGTTTTAGGCCTAATAGTCTTGTCCAGGACCCTGAAAAACCTCTATTTTAGTAACTAGTATGACCTAATCTTCATAGGTTGACCTAATGATCAATCAAAACAATGAAGATAACTGTGACGGTCCAAGCCCACCGTTAGCAAATATTGTCTTTTTTGGATTTTTCCTTTCGGGCTTTCTCTCAAGGTTTTTAAAACGCGTCTGCTAGGGAAAGGTTTCCACTCCCTTATAAAGGGTGTCTCATTCTCCTTCTCACGGGATCTCACAATAATAAAGGAAATTGTTCGATGAAAATAGTTGAAGTATGCCAAACAGGAACGAATGACGATAAGGAGCACAGAAACGCAAACGTCGGCATGAAGTTGACACCATACAAATCGATCGAACTGCAATTCAGATTCTTCTTGTTCTTTCGATCAAAGACGTGGTTGGTGGACATTTCAGAAACTGCTGCTCTTTGTCTTTTTATGCATCAAAGAAAAGGTTGCTATCAGTAAAAAGGATGGAAAATATGGCAAGGATGAAGTTGAAGATGGGAATGAAGAATATATTCTTCGATCTCCGTTCCGTTTAGTTGATAGAAAAAGATTTATCTCTATTCCTTCTCCTATATTCCGCCCCATTAAGAAGAGAAGAAGAAAAGAAAAAATGGAGAATAATATAAGCATGGTGGAGGCCAAACTTCCTCCTGGATTTAGGTTCCATCCAAGAGATGAAGAATTGGTCTGTGATTATCTGATGAAGAAGATTGGCTCTGTTTGTGATTCTTCTTCTCTCTTGATTGAAGTTGACCTCAACCAGTGTGAGCCTTGGGATATTCCAAGAGAGGCATGCGTGGGGGGAAAAGAGTGGTACTTCTTCAGCCAGCGGGACCGTAAGTACGCGACTGGGCTTAGAACAAACCGCGCCACAGCCTCAGGGTACTGGAAGGCCACTGGCAAAGACAGGCCTGTCTTTCACAAGGCCAATCAACTCGTTGGGATGAGAAAGACTCTTGTTTTCTACCAAGGAAGGGCCCCTAAAGGCCGAAAAACCGAGTGGGTTATGCATGAATTTCGGCTTGAGGGTCCACTTTCTCCTCTCAAAGATCCACCTCCTAGGGAGGACTGGGTTCTGTGCAGAATGTTCTGTAAACAGAAAGAAGTGCCCGCCCGACCGAGCACGGGAAGCAGCAGCTCCTACAGCAACGTCGCTGCCTCGTCGTCTCTCCCAGCTTTGATGGACTCATACCTCAGTTTTCACCAAAATCCAAGCAGCTATTTAAATGAGTTTGAGCAAGTGCCCTGCTTCTCCATTTTGTCTCAAAACCAAACCATCCCATCTCTCTCAAACCTCATACAAATGGAGGCAAACACAGGCAACAACATCAAGAACTTCACCACAATGTATGGAGGAATGCCAAATTCAAGCACTTATTCTTCAAATATTGACCCTTTTGCCTGTGACTCATCAGTGCTCAAAGTTGTTCTAAATAACATTACTAAGATGGAAACAAATGGCAACCCCTTCATAGGGCAACCCAGCATGGGAGAAGGCAGCTCTGACAGCTACTTGTCTGAGGTTGGTGATGATATTTCCAGCTTTTGGAACAGATGACCCAGAAGGAAAATGAAGGAGAAATTAGTTTATTGTAAAGAAAAATGGTGATTATTAAGAAGGATTGTAATTTAAAGGGTTTTTTTTTTTAAATTTATTATTTTAATAAAAAGATCCCAAGTGTAAATTTTATTGCATATTTTGGGAACTTTTTAAAGCCTAATTCATGGTTGATGAAATGCAAATCCATGTTTGATTTTTTTTTTTCATAATTATACAAAAAAAAAAAA

Coding sequence (CDS)

ATGGAGAATAATATAAGCATGGTGGAGGCCAAACTTCCTCCTGGATTTAGGTTCCATCCAAGAGATGAAGAATTGGTCTGTGATTATCTGATGAAGAAGATTGGCTCTGTTTGTGATTCTTCTTCTCTCTTGATTGAAGTTGACCTCAACCAGTGTGAGCCTTGGGATATTCCAAGAGAGGCATGCGTGGGGGGAAAAGAGTGGTACTTCTTCAGCCAGCGGGACCGTAAGTACGCGACTGGGCTTAGAACAAACCGCGCCACAGCCTCAGGGTACTGGAAGGCCACTGGCAAAGACAGGCCTGTCTTTCACAAGGCCAATCAACTCGTTGGGATGAGAAAGACTCTTGTTTTCTACCAAGGAAGGGCCCCTAAAGGCCGAAAAACCGAGTGGGTTATGCATGAATTTCGGCTTGAGGGTCCACTTTCTCCTCTCAAAGATCCACCTCCTAGGGAGGACTGGGTTCTGTGCAGAATGTTCTGTAAACAGAAAGAAGTGCCCGCCCGACCGAGCACGGGAAGCAGCAGCTCCTACAGCAACGTCGCTGCCTCGTCGTCTCTCCCAGCTTTGATGGACTCATACCTCAGTTTTCACCAAAATCCAAGCAGCTATTTAAATGAGTTTGAGCAAGTGCCCTGCTTCTCCATTTTGTCTCAAAACCAAACCATCCCATCTCTCTCAAACCTCATACAAATGGAGGCAAACACAGGCAACAACATCAAGAACTTCACCACAATGTATGGAGGAATGCCAAATTCAAGCACTTATTCTTCAAATATTGACCCTTTTGCCTGTGACTCATCAGTGCTCAAAGTTGTTCTAAATAACATTACTAAGATGGAAACAAATGGCAACCCCTTCATAGGGCAACCCAGCATGGGAGAAGGCAGCTCTGACAGCTACTTGTCTGAGGTTGGTGATGATATTTCCAGCTTTTGGAACAGATGA

Protein sequence

MENNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKNFTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETNGNPFIGQPSMGEGSSDSYLSEVGDDISSFWNR
BLAST of Cp4.1LG06g05890 vs. Swiss-Prot
Match: NAC22_ARATH (NAC domain-containing protein 21/22 OS=Arabidopsis thaliana GN=NAC021 PE=1 SV=2)

HSP 1 Score: 318.5 bits (815), Expect = 8.0e-86
Identity = 180/324 (55.56%), Postives = 229/324 (70.68%), Query Frame = 1

Query: 2   ENNISMVEAKLPPGFRFHPRDEELVCDYLMKK-IGSVCDSSSLLIEVDLNQCEPWDIPRE 61
           E++ISMVEAKLPPGFRFHP+D+ELVCDYLM++ + +      +LI+VDLN+CEPWDIP+ 
Sbjct: 9   ESSISMVEAKLPPGFRFHPKDDELVCDYLMRRSLHNNHRPPLVLIQVDLNKCEPWDIPKM 68

Query: 62  ACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQ 121
           ACVGGK+WYF+SQRDRKYATGLRTNRATA+GYWKATGKDR +  K  +LVGMRKTLVFYQ
Sbjct: 69  ACVGGKDWYFYSQRDRKYATGLRTNRATATGYWKATGKDRTILRK-GKLVGMRKTLVFYQ 128

Query: 122 GRAPKGRKTEWVMHEFRLEGPLSPLKD--PPPREDWVLCRMFCKQKE-VPARPSTGSSSS 181
           GRAP+GRKT+WVMHEFRL+G   P       P+EDWVLCR+F K  E V  R + GS   
Sbjct: 129 GRAPRGRKTDWVMHEFRLQGSHHPPNHSLSSPKEDWVLCRVFHKNTEGVICRDNMGSC-- 188

Query: 182 YSNVAASSSLPALMDSYLSFHQNPSSYLNE------FEQVPCFSILSQNQTIPSLSNLIQ 241
             +  AS+SLP LMD Y++F Q PSSYL++       E VPCFS LSQNQT+   SNL  
Sbjct: 189 -FDETASASLPPLMDPYINFDQEPSSYLSDDHHYIINEHVPCFSNLSQNQTLN--SNLTN 248

Query: 242 MEANTGNNIKNFTTMYGGMPNSSTYSSNIDPF-ACDSSVLKVVLNNITKMETNGNPFIGQ 301
             +      KN   ++ G   S+T  + +D F + D  VL+ +L+ +TK++ +  P   Q
Sbjct: 249 SVSELKIPCKNPNPLFTGGSASATL-TGLDSFCSSDQMVLRALLSQLTKIDGSLGPKESQ 308

Query: 302 PSMGEGSSDSYLSEVGDDISSFWN 315
            S GEGSS+S L+++G   S+ WN
Sbjct: 309 -SYGEGSSESLLTDIGIP-STVWN 323

BLAST of Cp4.1LG06g05890 vs. Swiss-Prot
Match: NAC98_ARATH (Protein CUP-SHAPED COTYLEDON 2 OS=Arabidopsis thaliana GN=NAC098 PE=1 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 3.4e-52
Identity = 96/153 (62.75%), Postives = 115/153 (75.16%), Query Frame = 1

Query: 12  LPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREACVGGKEWYFF 71
           LPPGFRFHP DEEL+  YL++K+   C SS  + EVDLN+CEPW +P  A +G KEWYFF
Sbjct: 17  LPPGFRFHPTDEELITHYLLRKVLDGCFSSRAIAEVDLNKCEPWQLPGRAKMGEKEWYFF 76

Query: 72  SQRDRKYATGLRTNRATASGYWKATGKDRPVF-HKANQLVGMRKTLVFYQGRAPKGRKTE 131
           S RDRKY TGLRTNRAT +GYWKATGKDR +F  K   LVGM+KTLVFY+GRAPKG K+ 
Sbjct: 77  SLRDRKYPTGLRTNRATEAGYWKATGKDREIFSSKTCALVGMKKTLVFYKGRAPKGEKSN 136

Query: 132 WVMHEFRLEGPLS-PLKDPPPREDWVLCRMFCK 163
           WVMHE+RLEG  S        +++WV+ R+F K
Sbjct: 137 WVMHEYRLEGKFSYHFISRSSKDEWVISRVFQK 169

BLAST of Cp4.1LG06g05890 vs. Swiss-Prot
Match: NAC92_ARATH (NAC domain-containing protein 92 OS=Arabidopsis thaliana GN=NAC92 PE=1 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 5.8e-52
Identity = 109/216 (50.46%), Postives = 138/216 (63.89%), Query Frame = 1

Query: 5   ISMVEAK----LPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPRE 64
           + MVE +    LPPGFRFHP DEEL+  YL  K+ +   S++ + EVDLN+ EPWD+P +
Sbjct: 9   VEMVEDEEHIDLPPGFRFHPTDEELITHYLKPKVFNTFFSATAIGEVDLNKIEPWDLPWK 68

Query: 65  ACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQ 124
           A +G KEWYFF  RDRKY TGLRTNRAT +GYWKATGKD+ +F K   LVGM+KTLVFY+
Sbjct: 69  AKMGEKEWYFFCVRDRKYPTGLRTNRATEAGYWKATGKDKEIF-KGKSLVGMKKTLVFYK 128

Query: 125 GRAPKGRKTEWVMHEFRLEGPLSPLKDP-PPREDWVLCRMFCKQKEVPARPSTGSSSSYS 184
           GRAPKG KT WVMHE+RLEG       P   + +WV+CR+F K+ +    P +     + 
Sbjct: 129 GRAPKGVKTNWVMHEYRLEGKYCIENLPQTAKNEWVICRVFQKRADGTKVPMS-MLDPHI 188

Query: 185 NVAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFS 216
           N    + LP+LMD          S+      V CFS
Sbjct: 189 NRMEPAGLPSLMDC-----SQRDSFTGSSSHVTCFS 217

BLAST of Cp4.1LG06g05890 vs. Swiss-Prot
Match: NAC79_ARATH (NAC domain-containing protein 79 OS=Arabidopsis thaliana GN=NAC079 PE=2 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 7.5e-52
Identity = 121/270 (44.81%), Postives = 164/270 (60.74%), Query Frame = 1

Query: 12  LPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREACVGGKEWYFF 71
           LPPGFRFHP DEEL+  YL KK+  +  S+  + EVDLN+ EPW++P +A +G KEWYFF
Sbjct: 17  LPPGFRFHPTDEELITHYLHKKVLDLGFSAKAIGEVDLNKAEPWELPYKAKIGEKEWYFF 76

Query: 72  SQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEW 131
             RDRKY TGLRTNRAT +GYWKATGKD+ +F +   LVGM+KTLVFY+GRAPKG+KT W
Sbjct: 77  CVRDRKYPTGLRTNRATQAGYWKATGKDKEIF-RGKSLVGMKKTLVFYRGRAPKGQKTNW 136

Query: 132 VMHEFRLEGPLSPLKDP-PPREDWVLCRMFCKQ---KEVPARPSTGSSSSYSNVAASSSL 191
           VMHE+RL+G LS    P   + +WV+CR+F K    K++P      +     +    SSL
Sbjct: 137 VMHEYRLDGKLSAHNLPKTAKNEWVICRVFHKTAGGKKIP----ISTLIRIGSYGTGSSL 196

Query: 192 PALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKNFTTMY 251
           P L DS  S + + +    E   VPCFS  +Q +T  ++ N     + +         + 
Sbjct: 197 PPLTDS--SPYNDKTK--TEPVYVPCFS--NQAETRGTILNCFSNPSLSSIQPDFLQMIP 256

Query: 252 GGMPNSSTYSSNIDP-FACDSSVLKVVLNN 277
              P S   S + +P    + SVL+ ++ N
Sbjct: 257 LYQPQSLNISESSNPVLTQEQSVLQAMMEN 275

BLAST of Cp4.1LG06g05890 vs. Swiss-Prot
Match: NC100_ARATH (NAC domain-containing protein 100 OS=Arabidopsis thaliana GN=NAC100 PE=2 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 1.1e-50
Identity = 108/212 (50.94%), Postives = 133/212 (62.74%), Query Frame = 1

Query: 12  LPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREACVGGKEWYFF 71
           LPPGFRFHP DEEL+  YL KK+     S+  + EVDLN+ EPW++P  A +G KEWYFF
Sbjct: 16  LPPGFRFHPTDEELITHYLHKKVLDTSFSAKAIGEVDLNKSEPWELPWMAKMGEKEWYFF 75

Query: 72  SQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEW 131
             RDRKY TGLRTNRAT +GYWKATGKD+ ++ +   LVGM+KTLVFY+GRAPKG+KT W
Sbjct: 76  CVRDRKYPTGLRTNRATEAGYWKATGKDKEIY-RGKSLVGMKKTLVFYRGRAPKGQKTNW 135

Query: 132 VMHEFRLEGPLSPLKDP-PPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVAASSSLPAL 191
           VMHE+RLEG  S    P   + +WV+CR+F  QK    +    SS        +   P+L
Sbjct: 136 VMHEYRLEGKFSAHNLPKTAKNEWVICRVF--QKSAGGKKIPISSLIRIGSLGTDFNPSL 195

Query: 192 MDSYLSFHQNPSSYLNEFEQVPCFSILSQNQT 223
           + S             E   VPCFS    NQT
Sbjct: 196 LPSLTDSSPYNDKTKTEPVYVPCFS----NQT 220

BLAST of Cp4.1LG06g05890 vs. TrEMBL
Match: A0A0A0LDG6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G523580 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 1.9e-155
Identity = 273/318 (85.85%), Postives = 293/318 (92.14%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGS-VCDSSSLLIEVDLNQCEPWDIPREA 62
           NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGS    SSSLLIEVDLN+CEPWDIPREA
Sbjct: 8   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSNSSSSSSLLIEVDLNKCEPWDIPREA 67

Query: 63  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 122
           CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG
Sbjct: 68  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 127

Query: 123 RAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSN- 182
           RAPKGRKTEWVMHEFRLEGP SP+ DP P+EDWVLCR+FCKQKEV  +PSTGSSS Y++ 
Sbjct: 128 RAPKGRKTEWVMHEFRLEGPFSPITDPSPKEDWVLCRLFCKQKEVTPQPSTGSSSCYNDT 187

Query: 183 VAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNI 242
           + +SSSLPALMDSY+SF QNP+S+LNE+EQVPCFSI S NQTIP+L+NLIQMEANTGNNI
Sbjct: 188 IGSSSSLPALMDSYISFDQNPNSHLNEYEQVPCFSIFSHNQTIPTLTNLIQMEANTGNNI 247

Query: 243 KNFTTMY-GGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETNGNPFIGQPSMGEGSSD 302
           KN +TM+ GGMPNS+T SSNIDPF CDS VLKVVLNNITKMETNG+ FIGQ SMGEGSSD
Sbjct: 248 KNLSTMFGGGMPNSTTCSSNIDPFTCDSKVLKVVLNNITKMETNGSSFIGQTSMGEGSSD 307

Query: 303 SYLSE--VGDDISSFWNR 316
           SYLSE  VGDDI+S WNR
Sbjct: 308 SYLSEVGVGDDIASLWNR 325

BLAST of Cp4.1LG06g05890 vs. TrEMBL
Match: A0A0B2QZ54_GLYSO (NAC domain-containing protein 21/22 OS=Glycine soja GN=glysoja_011392 PE=4 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 1.8e-105
Identity = 207/314 (65.92%), Postives = 234/314 (74.52%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREAC 62
           +NISMVEAKLPPGFRFHPRDEELVCDYLMKK+    + S LLI+VDLN+CEPWDIP  AC
Sbjct: 2   SNISMVEAKLPPGFRFHPRDEELVCDYLMKKVQH--NDSLLLIDVDLNKCEPWDIPETAC 61

Query: 63  VGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGR 122
           VGGKEWYF++QRDRKYATGLRTNRATASGYWKATGKDRP+  K    VGMRKTLVFYQGR
Sbjct: 62  VGGKEWYFYTQRDRKYATGLRTNRATASGYWKATGKDRPILRKGTH-VGMRKTLVFYQGR 121

Query: 123 APKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVA 182
           APKGRKTEWVMHEFR+EGP  P K    +EDWVLCR+F K  EV A+PS G  S Y +  
Sbjct: 122 APKGRKTEWVMHEFRIEGPHGPPKISSSKEDWVLCRVFYKNSEVLAKPSMG--SCYED-T 181

Query: 183 ASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKN 242
            SSSLPALMDSY+SF Q   ++ +EFEQVPCFSI SQNQT P  +++  ME     N  +
Sbjct: 182 GSSSLPALMDSYISFDQT-QTHADEFEQVPCFSIFSQNQTNPIFNHMTTMEPKFPLN--H 241

Query: 243 FTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETN--GNPFIGQPSMGEGSSDS 302
            TT YGG PN       +DP +CD  +LK VLN ITKME N       G PS+GEGSS+S
Sbjct: 242 ATTTYGGAPN---LGYCLDPLSCDRKMLKAVLNQITKMERNPLNQSLKGSPSLGEGSSES 301

Query: 303 YLSEVGDDISSFWN 315
           YLSEVG  +   WN
Sbjct: 302 YLSEVG--MPHVWN 301

BLAST of Cp4.1LG06g05890 vs. TrEMBL
Match: B2ZGS0_SOYBN (NAC domain protein OS=Glycine max GN=NAC34 PE=2 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 4.1e-105
Identity = 206/314 (65.61%), Postives = 234/314 (74.52%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREAC 62
           +NISMVEAKLPPGFRFHPRDEELVCDYLMKK+    + S LLI+VDLN+CEPWDIP  AC
Sbjct: 2   SNISMVEAKLPPGFRFHPRDEELVCDYLMKKVQH--NDSLLLIDVDLNKCEPWDIPETAC 61

Query: 63  VGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGR 122
           VGGKEWYF++QRDRKYATGLRTNRATASGYWKATGKDRP+  K    VGMRKTLVFYQGR
Sbjct: 62  VGGKEWYFYTQRDRKYATGLRTNRATASGYWKATGKDRPILRKGTH-VGMRKTLVFYQGR 121

Query: 123 APKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVA 182
           APKGRKTEWVMHEFR+EGP  P K    +EDWVLCR+F K  EV A+PS G  S Y +  
Sbjct: 122 APKGRKTEWVMHEFRIEGPHGPPKISSSKEDWVLCRVFYKNSEVLAKPSMG--SCYED-T 181

Query: 183 ASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKN 242
            SS+LPALMDSY+SF Q   ++ +EFEQVPCFSI SQNQT P  +++  ME     N  +
Sbjct: 182 GSSTLPALMDSYISFDQT-QTHADEFEQVPCFSIFSQNQTNPIFNHMTTMEPKFPLN--H 241

Query: 243 FTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETN--GNPFIGQPSMGEGSSDS 302
            TT YGG PN       +DP +CD  +LK VLN ITKME N       G PS+GEGSS+S
Sbjct: 242 ATTTYGGAPN---LGYCLDPLSCDRKMLKAVLNQITKMERNPLNQSLKGSPSLGEGSSES 301

Query: 303 YLSEVGDDISSFWN 315
           YLSEVG  +   WN
Sbjct: 302 YLSEVG--MPHVWN 301

BLAST of Cp4.1LG06g05890 vs. TrEMBL
Match: C6TN89_SOYBN (Putative uncharacterized protein OS=Glycine max PE=2 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 4.1e-105
Identity = 206/314 (65.61%), Postives = 234/314 (74.52%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREAC 62
           +NISMVEAKLPPGFRFHPRDEELVCDYLMKK+    + S LLI+VDLN+CEPWDIP  AC
Sbjct: 2   SNISMVEAKLPPGFRFHPRDEELVCDYLMKKVQH--NDSLLLIDVDLNKCEPWDIPETAC 61

Query: 63  VGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGR 122
           VGGKEWYF++QRDRKYATGLRTNRATASGYWKATGKDRP+  K    VGMRKTLVFYQGR
Sbjct: 62  VGGKEWYFYTQRDRKYATGLRTNRATASGYWKATGKDRPILRKGTH-VGMRKTLVFYQGR 121

Query: 123 APKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVA 182
           APKGRKTEWVMHEFR+EGP  P K    +EDWVLCR+F K  EV A+PS G  S Y +  
Sbjct: 122 APKGRKTEWVMHEFRIEGPHGPPKISSSKEDWVLCRVFYKNSEVLAKPSMG--SCYED-T 181

Query: 183 ASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKN 242
            SS+LPALMDSY+SF Q   ++ +EFEQVPCFSI SQNQT P  +++  ME     N  +
Sbjct: 182 GSSTLPALMDSYISFDQT-QTHADEFEQVPCFSIFSQNQTNPIFNHMTTMEPKFPLN--H 241

Query: 243 FTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETN--GNPFIGQPSMGEGSSDS 302
            TT YGG PN       +DP +CD  +LK VLN ITKME N       G PS+GEGSS+S
Sbjct: 242 ATTAYGGAPN---LGYCLDPLSCDRKMLKAVLNQITKMERNPLNQSLKGSPSLGEGSSES 301

Query: 303 YLSEVGDDISSFWN 315
           YLSEVG  +   WN
Sbjct: 302 YLSEVG--MPHVWN 301

BLAST of Cp4.1LG06g05890 vs. TrEMBL
Match: A0A0B2R272_GLYSO (NAC domain-containing protein 21/22 OS=Glycine soja GN=glysoja_025555 PE=4 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 4.5e-104
Identity = 203/314 (64.65%), Postives = 231/314 (73.57%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREAC 62
           +NISMVEAKLPPGFRFHPRDEELVCDYLMKK+    + S L+I VDLN+CEPWDIP  AC
Sbjct: 2   SNISMVEAKLPPGFRFHPRDEELVCDYLMKKVAH--NDSLLMINVDLNKCEPWDIPETAC 61

Query: 63  VGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGR 122
           VGGKEWYF++QRDRKYATGLRTNRATASGYWKATGKDR +  K   LVGMRKTLVFYQGR
Sbjct: 62  VGGKEWYFYTQRDRKYATGLRTNRATASGYWKATGKDRSILRKGT-LVGMRKTLVFYQGR 121

Query: 123 APKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVA 182
           APKG KTEWVMHEFR+EGP  P K    +EDWVLCR+F K +EV A+P  G  S Y +  
Sbjct: 122 APKGNKTEWVMHEFRIEGPHGPPKISSSKEDWVLCRVFYKNREVSAKPRMG--SCYED-T 181

Query: 183 ASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKN 242
            SSSLPALMDSY+SF Q   ++ +EFEQVPCFSI SQNQT P  +++  ME     N  +
Sbjct: 182 GSSSLPALMDSYISFDQT-QTHADEFEQVPCFSIFSQNQTSPIFNHMATMEPKLPAN--H 241

Query: 243 FTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETN--GNPFIGQPSMGEGSSDS 302
            T  YGG PN       +DP +CD  +LK VLN ITKME N       G PS+GEGSS+S
Sbjct: 242 ATNAYGGAPN---LGYCLDPLSCDRKMLKAVLNQITKMERNPLNQSLKGSPSLGEGSSES 301

Query: 303 YLSEVGDDISSFWN 315
           YLSEVG  +   WN
Sbjct: 302 YLSEVG--MPHMWN 301

BLAST of Cp4.1LG06g05890 vs. TAIR10
Match: AT1G56010.2 (AT1G56010.2 NAC domain containing protein 1)

HSP 1 Score: 318.5 bits (815), Expect = 4.5e-87
Identity = 180/324 (55.56%), Postives = 229/324 (70.68%), Query Frame = 1

Query: 2   ENNISMVEAKLPPGFRFHPRDEELVCDYLMKK-IGSVCDSSSLLIEVDLNQCEPWDIPRE 61
           E++ISMVEAKLPPGFRFHP+D+ELVCDYLM++ + +      +LI+VDLN+CEPWDIP+ 
Sbjct: 9   ESSISMVEAKLPPGFRFHPKDDELVCDYLMRRSLHNNHRPPLVLIQVDLNKCEPWDIPKM 68

Query: 62  ACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQ 121
           ACVGGK+WYF+SQRDRKYATGLRTNRATA+GYWKATGKDR +  K  +LVGMRKTLVFYQ
Sbjct: 69  ACVGGKDWYFYSQRDRKYATGLRTNRATATGYWKATGKDRTILRK-GKLVGMRKTLVFYQ 128

Query: 122 GRAPKGRKTEWVMHEFRLEGPLSPLKD--PPPREDWVLCRMFCKQKE-VPARPSTGSSSS 181
           GRAP+GRKT+WVMHEFRL+G   P       P+EDWVLCR+F K  E V  R + GS   
Sbjct: 129 GRAPRGRKTDWVMHEFRLQGSHHPPNHSLSSPKEDWVLCRVFHKNTEGVICRDNMGSC-- 188

Query: 182 YSNVAASSSLPALMDSYLSFHQNPSSYLNE------FEQVPCFSILSQNQTIPSLSNLIQ 241
             +  AS+SLP LMD Y++F Q PSSYL++       E VPCFS LSQNQT+   SNL  
Sbjct: 189 -FDETASASLPPLMDPYINFDQEPSSYLSDDHHYIINEHVPCFSNLSQNQTLN--SNLTN 248

Query: 242 MEANTGNNIKNFTTMYGGMPNSSTYSSNIDPF-ACDSSVLKVVLNNITKMETNGNPFIGQ 301
             +      KN   ++ G   S+T  + +D F + D  VL+ +L+ +TK++ +  P   Q
Sbjct: 249 SVSELKIPCKNPNPLFTGGSASATL-TGLDSFCSSDQMVLRALLSQLTKIDGSLGPKESQ 308

Query: 302 PSMGEGSSDSYLSEVGDDISSFWN 315
            S GEGSS+S L+++G   S+ WN
Sbjct: 309 -SYGEGSSESLLTDIGIP-STVWN 323

BLAST of Cp4.1LG06g05890 vs. TAIR10
Match: AT3G12977.1 (AT3G12977.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein)

HSP 1 Score: 259.2 bits (661), Expect = 3.2e-69
Identity = 151/315 (47.94%), Postives = 195/315 (61.90%), Query Frame = 1

Query: 2   ENNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREA 61
           + +ISMVEA LPPGFRFHPRD+ELVCDYLM++         +LI+VDLN+CEPWDIP+ A
Sbjct: 8   KGSISMVEANLPPGFRFHPRDDELVCDYLMRRTVRSLYQPVVLIDVDLNKCEPWDIPQTA 67

Query: 62  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 121
            VGGKEWYF+SQ+DRKYATG RTNRATA+GYWKATGKDR +  +   LVGMRKTLVFY+G
Sbjct: 68  RVGGKEWYFYSQKDRKYATGYRTNRATATGYWKATGKDRAI-QRNGGLVGMRKTLVFYRG 127

Query: 122 RAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSS-SSYSN 181
           R+PKGRKT+WVMHEFRL+G L         E+WVLCR+F K        S G+     + 
Sbjct: 128 RSPKGRKTDWVMHEFRLQGKLLHHSPNSLEEEWVLCRVFHKN-------SNGADIDDITR 187

Query: 182 VAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFS-ILSQNQTIPSLSNLIQMEANTGNN 241
             + ++  A MDSY++F  +    +N  + VPCFS  LS NQT  + S LI         
Sbjct: 188 SCSDATASAFMDSYINFDHH--HIIN--QHVPCFSNNLSHNQT--NQSGLIS-------- 247

Query: 242 IKNFTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETNGNPFIGQPSMGEGSSD 301
            KN + ++   P              D  +L+ +L+ +TK            S G+GSS+
Sbjct: 248 -KNSSPLFNASP--------------DQMILRTLLSQLTKKVEESQ------SRGDGSSE 278

Query: 302 SYLSEVGDDISSFWN 315
           S L+++G   S  WN
Sbjct: 308 SQLTDIGIP-SHAWN 278

BLAST of Cp4.1LG06g05890 vs. TAIR10
Match: AT3G18400.1 (AT3G18400.1 NAC domain containing protein 58)

HSP 1 Score: 214.5 bits (545), Expect = 9.1e-56
Identity = 119/271 (43.91%), Postives = 168/271 (61.99%), Query Frame = 1

Query: 8   VEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREACVGGKE 67
           +E  LPPGFRFHP DEEL+  YL +K+  +  +   +++VDLN+CEPWD+P +A +G KE
Sbjct: 1   MEENLPPGFRFHPTDEELITHYLCRKVSDIGFTGKAVVDVDLNKCEPWDLPAKASMGEKE 60

Query: 68  WYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGRAPKGR 127
           WYFFSQRDRKY TGLRTNRAT +GYWK TGKD+ ++ ++  LVGM+KTLVFY+GRAPKG 
Sbjct: 61  WYFFSQRDRKYPTGLRTNRATEAGYWKTTGKDKEIY-RSGVLVGMKKTLVFYKGRAPKGE 120

Query: 128 KTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCK----QKEVPARPSTGSSSSYSNVAA 187
           K+ WVMHE+RLE    P  +P  +E+WV+CR+F K    +K    +P +   S  S   A
Sbjct: 121 KSNWVMHEYRLESK-QPF-NPTNKEEWVVCRVFEKSTAAKKAQEQQPQSSQPSFGSPCDA 180

Query: 188 SSSLP---ALMDSYLSFHQNPSS--YLNEFEQVPCFSILSQNQTIPSLSNLIQMEAN-TG 247
           +SS+      +D   + + N S+  Y N   Q    ++ S++ T  +    + M  N   
Sbjct: 181 NSSMANEFEDIDELPNLNSNSSTIDYNNHIHQYSQRNVYSEDNTTSTAG--LNMNMNMAS 240

Query: 248 NNIKNFTTMYGGMPNSSTYSSNIDPFACDSS 269
            N++++TT   G P S   S  +  F   +S
Sbjct: 241 TNLQSWTTSLLGPPLSPINSLLLKAFQIRNS 266

BLAST of Cp4.1LG06g05890 vs. TAIR10
Match: AT4G28530.1 (AT4G28530.1 NAC domain containing protein 74)

HSP 1 Score: 209.9 bits (533), Expect = 2.2e-54
Identity = 103/175 (58.86%), Postives = 121/175 (69.14%), Query Frame = 1

Query: 8   VEAKLPPGFRFHPRDEELVCDYLMKKIGSVCD----------------SSSLLIEVDLNQ 67
           + +KLPPGFRFHP DEELVC YL  KI +  D                 S+ L+E+DL+ 
Sbjct: 6   IGSKLPPGFRFHPSDEELVCHYLCNKIRAKSDHGDVDDDDDDVDEALKGSTDLVEIDLHI 65

Query: 68  CEPWDIPREACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFH-KANQLV 127
           CEPW++P  A +  KEWYFFS RDRKYATG RTNRAT SGYWKATGKDR V   +  QLV
Sbjct: 66  CEPWELPDVAKLNAKEWYFFSFRDRKYATGYRTNRATVSGYWKATGKDRTVMDPRTRQLV 125

Query: 128 GMRKTLVFYQGRAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKE 166
           GMRKTLVFY+ RAP G KT W+MHEFRLE P     + PP+EDWVLCR+F K ++
Sbjct: 126 GMRKTLVFYRNRAPNGIKTTWIMHEFRLECP-----NIPPKEDWVLCRVFNKGRD 175

BLAST of Cp4.1LG06g05890 vs. TAIR10
Match: AT5G18270.1 (AT5G18270.1 Arabidopsis NAC domain containing protein 87)

HSP 1 Score: 208.4 bits (529), Expect = 6.5e-54
Identity = 123/287 (42.86%), Postives = 168/287 (58.54%), Query Frame = 1

Query: 12  LPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREACVGGKEWYFF 71
           LPPGFRFHP DEE++  YL +K+ +   ++  + E DLN+CEPWD+P+ A +G KE+YFF
Sbjct: 21  LPPGFRFHPTDEEIITCYLKEKVLNSRFTAVAMGEADLNKCEPWDLPKRAKMGEKEFYFF 80

Query: 72  SQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEW 131
            QRDRKY TG+RTNRAT SGYWKATGKD+ +F     LVGM+KTLVFY+GRAPKG KT W
Sbjct: 81  CQRDRKYPTGMRTNRATESGYWKATGKDKEIFKGKGCLVGMKKTLVFYRGRAPKGEKTNW 140

Query: 132 VMHEFRLEGPLSPLKDP-PPREDWVLCRMFCKQK---------EVPARPST--GSSSSYS 191
           VMHE+RLEG  S    P   R++WV+CR+F K            +P    T   S  +  
Sbjct: 141 VMHEYRLEGKYSYYNLPKSARDEWVVCRVFHKNNPSTTTQPMTRIPVEDFTRMDSLENID 200

Query: 192 NVAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQ-MEANTGN 251
           ++   SSLP L+D          S++++ EQ P F  +  N     +S+ IQ    N+  
Sbjct: 201 HLLDFSSLPPLID---------PSFMSQTEQ-PNFKPI--NPPTYDISSPIQPHHFNSYQ 260

Query: 252 NIKNFTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETNGN 286
           +I N      G  + STY++N +    + S++ V        + N N
Sbjct: 261 SIFNHQVF--GSASGSTYNNNNEMIKMEQSLVSVSQETCLSSDVNAN 293

BLAST of Cp4.1LG06g05890 vs. NCBI nr
Match: gi|449466123|ref|XP_004150776.1| (PREDICTED: NAC domain-containing protein 21/22-like isoform X1 [Cucumis sativus])

HSP 1 Score: 556.6 bits (1433), Expect = 2.8e-155
Identity = 273/318 (85.85%), Postives = 293/318 (92.14%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGS-VCDSSSLLIEVDLNQCEPWDIPREA 62
           NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGS    SSSLLIEVDLN+CEPWDIPREA
Sbjct: 8   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSNSSSSSSLLIEVDLNKCEPWDIPREA 67

Query: 63  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 122
           CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG
Sbjct: 68  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 127

Query: 123 RAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSN- 182
           RAPKGRKTEWVMHEFRLEGP SP+ DP P+EDWVLCR+FCKQKEV  +PSTGSSS Y++ 
Sbjct: 128 RAPKGRKTEWVMHEFRLEGPFSPITDPSPKEDWVLCRLFCKQKEVTPQPSTGSSSCYNDT 187

Query: 183 VAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNI 242
           + +SSSLPALMDSY+SF QNP+S+LNE+EQVPCFSI S NQTIP+L+NLIQMEANTGNNI
Sbjct: 188 IGSSSSLPALMDSYISFDQNPNSHLNEYEQVPCFSIFSHNQTIPTLTNLIQMEANTGNNI 247

Query: 243 KNFTTMY-GGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETNGNPFIGQPSMGEGSSD 302
           KN +TM+ GGMPNS+T SSNIDPF CDS VLKVVLNNITKMETNG+ FIGQ SMGEGSSD
Sbjct: 248 KNLSTMFGGGMPNSTTCSSNIDPFTCDSKVLKVVLNNITKMETNGSSFIGQTSMGEGSSD 307

Query: 303 SYLSE--VGDDISSFWNR 316
           SYLSE  VGDDI+S WNR
Sbjct: 308 SYLSEVGVGDDIASLWNR 325

BLAST of Cp4.1LG06g05890 vs. NCBI nr
Match: gi|778681513|ref|XP_011651529.1| (PREDICTED: NAC domain-containing protein 21/22-like isoform X2 [Cucumis sativus])

HSP 1 Score: 548.9 bits (1413), Expect = 5.8e-153
Identity = 271/318 (85.22%), Postives = 292/318 (91.82%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGS-VCDSSSLLIEVDLNQCEPWDIPREA 62
           NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGS    SSSLLIEVDLN+CEPWDIP+ A
Sbjct: 8   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSNSSSSSSLLIEVDLNKCEPWDIPK-A 67

Query: 63  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 122
           CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG
Sbjct: 68  CVGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQG 127

Query: 123 RAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSN- 182
           RAPKGRKTEWVMHEFRLEGP SP+ DP P+EDWVLCR+FCKQKEV  +PSTGSSS Y++ 
Sbjct: 128 RAPKGRKTEWVMHEFRLEGPFSPITDPSPKEDWVLCRLFCKQKEVTPQPSTGSSSCYNDT 187

Query: 183 VAASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNI 242
           + +SSSLPALMDSY+SF QNP+S+LNE+EQVPCFSI S NQTIP+L+NLIQMEANTGNNI
Sbjct: 188 IGSSSSLPALMDSYISFDQNPNSHLNEYEQVPCFSIFSHNQTIPTLTNLIQMEANTGNNI 247

Query: 243 KNFTTMY-GGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETNGNPFIGQPSMGEGSSD 302
           KN +TM+ GGMPNS+T SSNIDPF CDS VLKVVLNNITKMETNG+ FIGQ SMGEGSSD
Sbjct: 248 KNLSTMFGGGMPNSTTCSSNIDPFTCDSKVLKVVLNNITKMETNGSSFIGQTSMGEGSSD 307

Query: 303 SYLSE--VGDDISSFWNR 316
           SYLSE  VGDDI+S WNR
Sbjct: 308 SYLSEVGVGDDIASLWNR 324

BLAST of Cp4.1LG06g05890 vs. NCBI nr
Match: gi|659087103|ref|XP_008444276.1| (PREDICTED: NAC domain-containing protein 21/22-like [Cucumis melo])

HSP 1 Score: 484.2 bits (1245), Expect = 1.8e-133
Identity = 241/284 (84.86%), Postives = 262/284 (92.25%), Query Frame = 1

Query: 36  SVCDSSSLLIEVDLNQCEPWDIPREACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKA 95
           S   SSSLLIEVDLN+CEPWDIP+ ACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKA
Sbjct: 20  SSSSSSSLLIEVDLNKCEPWDIPK-ACVGGKEWYFFSQRDRKYATGLRTNRATASGYWKA 79

Query: 96  TGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEWVMHEFRLEGPLSPLKDPPPREDWV 155
           TGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEWVMHEFRLEGP SP+ DP P+EDWV
Sbjct: 80  TGKDRPVFHKANQLVGMRKTLVFYQGRAPKGRKTEWVMHEFRLEGPFSPITDPSPKEDWV 139

Query: 156 LCRMFCKQKEVPARPSTGSSSSYSNV-AASSSLPALMDSYLSFHQNPSSYLNEFEQVPCF 215
           LCR+FCKQKEV  +PSTGSSS Y+++ A+SSSLPALMDSY+SF QNPSS+LNE+EQVPCF
Sbjct: 140 LCRLFCKQKEVTPQPSTGSSSCYNDIIASSSSLPALMDSYISFDQNPSSHLNEYEQVPCF 199

Query: 216 SILSQNQTIPSLSNLIQMEANTGNNIKNFTTMY-GGMPNSSTYSSNIDPFACDSSVLKVV 275
           SI SQNQTIP+L++L+QMEANTGNNIKNF+TM+ GGMPNS+T SSNIDPFACDS VLK V
Sbjct: 200 SIFSQNQTIPTLTSLMQMEANTGNNIKNFSTMFGGGMPNSTTCSSNIDPFACDSKVLK-V 259

Query: 276 LNNITKMETNGNPFIGQPSMGEGSSDSYLSEVG--DDISSFWNR 316
           LNNITKMETNG  FIGQPSMGEGSSDSYLSEVG  DDI+S WNR
Sbjct: 260 LNNITKMETNGTSFIGQPSMGEGSSDSYLSEVGVADDIASLWNR 301

BLAST of Cp4.1LG06g05890 vs. NCBI nr
Match: gi|734390512|gb|KHN26800.1| (NAC domain-containing protein 21/22 [Glycine soja])

HSP 1 Score: 390.6 bits (1002), Expect = 2.6e-105
Identity = 207/314 (65.92%), Postives = 234/314 (74.52%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREAC 62
           +NISMVEAKLPPGFRFHPRDEELVCDYLMKK+    + S LLI+VDLN+CEPWDIP  AC
Sbjct: 2   SNISMVEAKLPPGFRFHPRDEELVCDYLMKKVQH--NDSLLLIDVDLNKCEPWDIPETAC 61

Query: 63  VGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGR 122
           VGGKEWYF++QRDRKYATGLRTNRATASGYWKATGKDRP+  K    VGMRKTLVFYQGR
Sbjct: 62  VGGKEWYFYTQRDRKYATGLRTNRATASGYWKATGKDRPILRKGTH-VGMRKTLVFYQGR 121

Query: 123 APKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVA 182
           APKGRKTEWVMHEFR+EGP  P K    +EDWVLCR+F K  EV A+PS G  S Y +  
Sbjct: 122 APKGRKTEWVMHEFRIEGPHGPPKISSSKEDWVLCRVFYKNSEVLAKPSMG--SCYED-T 181

Query: 183 ASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKN 242
            SSSLPALMDSY+SF Q   ++ +EFEQVPCFSI SQNQT P  +++  ME     N  +
Sbjct: 182 GSSSLPALMDSYISFDQT-QTHADEFEQVPCFSIFSQNQTNPIFNHMTTMEPKFPLN--H 241

Query: 243 FTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETN--GNPFIGQPSMGEGSSDS 302
            TT YGG PN       +DP +CD  +LK VLN ITKME N       G PS+GEGSS+S
Sbjct: 242 ATTTYGGAPN---LGYCLDPLSCDRKMLKAVLNQITKMERNPLNQSLKGSPSLGEGSSES 301

Query: 303 YLSEVGDDISSFWN 315
           YLSEVG  +   WN
Sbjct: 302 YLSEVG--MPHVWN 301

BLAST of Cp4.1LG06g05890 vs. NCBI nr
Match: gi|255647843|gb|ACU24381.1| (unknown [Glycine max])

HSP 1 Score: 389.4 bits (999), Expect = 5.9e-105
Identity = 206/314 (65.61%), Postives = 234/314 (74.52%), Query Frame = 1

Query: 3   NNISMVEAKLPPGFRFHPRDEELVCDYLMKKIGSVCDSSSLLIEVDLNQCEPWDIPREAC 62
           +NISMVEAKLPPGFRFHPRDEELVCDYLMKK+    + S LLI+VDLN+CEPWDIP  AC
Sbjct: 2   SNISMVEAKLPPGFRFHPRDEELVCDYLMKKVQH--NDSLLLIDVDLNKCEPWDIPETAC 61

Query: 63  VGGKEWYFFSQRDRKYATGLRTNRATASGYWKATGKDRPVFHKANQLVGMRKTLVFYQGR 122
           VGGKEWYF++QRDRKYATGLRTNRATASGYWKATGKDRP+  K    VGMRKTLVFYQGR
Sbjct: 62  VGGKEWYFYTQRDRKYATGLRTNRATASGYWKATGKDRPILRKGTH-VGMRKTLVFYQGR 121

Query: 123 APKGRKTEWVMHEFRLEGPLSPLKDPPPREDWVLCRMFCKQKEVPARPSTGSSSSYSNVA 182
           APKGRKTEWVMHEFR+EGP  P K    +EDWVLCR+F K  EV A+PS G  S Y +  
Sbjct: 122 APKGRKTEWVMHEFRIEGPHGPPKISSSKEDWVLCRVFYKNSEVLAKPSMG--SCYED-T 181

Query: 183 ASSSLPALMDSYLSFHQNPSSYLNEFEQVPCFSILSQNQTIPSLSNLIQMEANTGNNIKN 242
            SS+LPALMDSY+SF Q   ++ +EFEQVPCFSI SQNQT P  +++  ME     N  +
Sbjct: 182 GSSTLPALMDSYISFDQT-QTHADEFEQVPCFSIFSQNQTNPIFNHMTTMEPKFPLN--H 241

Query: 243 FTTMYGGMPNSSTYSSNIDPFACDSSVLKVVLNNITKMETN--GNPFIGQPSMGEGSSDS 302
            TT YGG PN       +DP +CD  +LK VLN ITKME N       G PS+GEGSS+S
Sbjct: 242 ATTAYGGAPN---LGYCLDPLSCDRKMLKAVLNQITKMERNPLNQSLKGSPSLGEGSSES 301

Query: 303 YLSEVGDDISSFWN 315
           YLSEVG  +   WN
Sbjct: 302 YLSEVG--MPHVWN 301

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
NAC22_ARATH8.0e-8655.56NAC domain-containing protein 21/22 OS=Arabidopsis thaliana GN=NAC021 PE=1 SV=2[more]
NAC98_ARATH3.4e-5262.75Protein CUP-SHAPED COTYLEDON 2 OS=Arabidopsis thaliana GN=NAC098 PE=1 SV=1[more]
NAC92_ARATH5.8e-5250.46NAC domain-containing protein 92 OS=Arabidopsis thaliana GN=NAC92 PE=1 SV=1[more]
NAC79_ARATH7.5e-5244.81NAC domain-containing protein 79 OS=Arabidopsis thaliana GN=NAC079 PE=2 SV=1[more]
NC100_ARATH1.1e-5050.94NAC domain-containing protein 100 OS=Arabidopsis thaliana GN=NAC100 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LDG6_CUCSA1.9e-15585.85Uncharacterized protein OS=Cucumis sativus GN=Csa_3G523580 PE=4 SV=1[more]
A0A0B2QZ54_GLYSO1.8e-10565.92NAC domain-containing protein 21/22 OS=Glycine soja GN=glysoja_011392 PE=4 SV=1[more]
B2ZGS0_SOYBN4.1e-10565.61NAC domain protein OS=Glycine max GN=NAC34 PE=2 SV=1[more]
C6TN89_SOYBN4.1e-10565.61Putative uncharacterized protein OS=Glycine max PE=2 SV=1[more]
A0A0B2R272_GLYSO4.5e-10464.65NAC domain-containing protein 21/22 OS=Glycine soja GN=glysoja_025555 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G56010.24.5e-8755.56 NAC domain containing protein 1[more]
AT3G12977.13.2e-6947.94 NAC (No Apical Meristem) domain transcriptional regulator superfamil... [more]
AT3G18400.19.1e-5643.91 NAC domain containing protein 58[more]
AT4G28530.12.2e-5458.86 NAC domain containing protein 74[more]
AT5G18270.16.5e-5442.86 Arabidopsis NAC domain containing protein 87[more]
Match NameE-valueIdentityDescription
gi|449466123|ref|XP_004150776.1|2.8e-15585.85PREDICTED: NAC domain-containing protein 21/22-like isoform X1 [Cucumis sativus][more]
gi|778681513|ref|XP_011651529.1|5.8e-15385.22PREDICTED: NAC domain-containing protein 21/22-like isoform X2 [Cucumis sativus][more]
gi|659087103|ref|XP_008444276.1|1.8e-13384.86PREDICTED: NAC domain-containing protein 21/22-like [Cucumis melo][more]
gi|734390512|gb|KHN26800.1|2.6e-10565.92NAC domain-containing protein 21/22 [Glycine soja][more]
gi|255647843|gb|ACU24381.1|5.9e-10565.61unknown [Glycine max][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
Vocabulary: Biological Process
TermDefinition
GO:0006355regulation of transcription, DNA-templated
Vocabulary: INTERPRO
TermDefinition
IPR003441NAC-dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005634 nucleus
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG06g05890.1Cp4.1LG06g05890.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003441NAC domainPFAMPF02365NAMcoord: 13..138
score: 1.0
IPR003441NAC domainPROFILEPS51005NACcoord: 12..162
score: 55
IPR003441NAC domainunknownSSF101941NAC domaincoord: 4..162
score: 1.57
NoneNo IPR availablePANTHERPTHR31744FAMILY NOT NAMEDcoord: 1..315
score: 2.6E
NoneNo IPR availablePANTHERPTHR31744:SF5NAC DOMAIN-CONTAINING PROTEIN 21/22-RELATEDcoord: 1..315
score: 2.6E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG06g05890Cp4.1LG02g02790Cucurbita pepo (Zucchini)cpecpeB468
The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG06g05890Cucumber (Gy14) v2cgybcpeB566
Cp4.1LG06g05890Cucumber (Chinese Long) v3cpecucB0973