CmaCh04G008120 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G008120
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Histone
Location: Cma_Chr04 : 4154842 .. 4168469 (+)
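
The location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the original record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr04 : 4154842 .. 4168469 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 13628 bases on the forward strand of Cma_Chr04.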
   
The following sequences are available for this feature:

Gene sequence (with intron)

TAGGCATGACCGTCGGATCAGAAGACGCATGGCGTCAATCAACGGTCACGATTTGAGATGACAAAATTTAATGGCAGTGCTCAAAAAACACACCTCTCCCACATGTCTCATCACTTCTCCTTCCTTTCTCCGCCATAGCCGCCAAAATTCCTACCGTTTCCGGCCGTTCTCCCGCCTCCGCAATTCGCTTTTGATGGATTTCAAGTCCAAATTCCGTTTCATACGACAGAGTGAAAGCTAAGGAGTTCGAGGAGGAAGAAGGACGGAACAGAAAAAGTGGTTTGATTTATGACACCACAATACAACCAAACCCCACACGAGTCCCAAGCAAAAAGTCCCAACTACTTATCTCTCTCTTTCTGGTGCTCTGTATCGTATAAGCACGTACACGAACTTGCGCGCGCGGCACTCTCTCTTTCTTTCTCTGAAGGAAGCTCACTTTCTTTGTTTCTTTTCCTCTTATACTTCACTTTGCAGTTTCGATTGTTCTGTTTTTGTTCGATGAGAGACGGTGATTGTGGCTACCGGTCTGGTAATTGAGACCTCCGGAATGACCGACAGCAACTCTGTTGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCCGACCGGAAATTTGCTAGGGTCCAAGACGTCCCGGCTTACGGCCGTGTGGACAACCACCACTATTTTCATAAGGTTTTCAAGGCATATATGCGTCTATGGAAGTTCCAGCAGGAATTCCGCGCCAAGCTTGTTGAATCTGGTCTCAACCGCTCTGAAATTGGCGAGATCGCTAGTCGGATAGGTCAGCTTTACTTTGGGCATTATATGAGAACCAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTGTTCTACGAAGCGATTCTTAATCGAAACTATTTTGAGGAATCGAAGAATTCGAGGAAGGATCTGGGAGCGAGGTTCAAACAGCTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGCTTTTGAATCGCACGCACACGGTTCAGGTTCTCGCCGAGAGATTGAAGGCTCTGGTGGACGATAGCAAGGTCGCTTTTCGGGTAACTTCTATTTTGCACAATAATTATAACTGAATTTTGGATTATGTCAAATATTTTCAGTTTGCTGATGGACCTTTTTGACGGCCTGTTTATTTTTGGAGCTCATTTTTTTTTTCACGATCCAGCTTATTTTTCTAATTTTGGCTTTTGATTCAATGCTGTCTAGACCTTGTGTAGCTTAGATTTTTCTGTTGTCTTTCTTTCTTTACCATAATTTTAATTAATATTTTAAAATTCGATTGCCTTTTCTTTGAATTGAAGAGCAAGTTTATGTTTGTTTCCATTCATTTGGTTTCTTCTCAGCTGCCCAACCTAACATAGTAAAGAAACATTTATAATATGCTTGCACTGTTTGTCATGTTTGTGTCCCGGAAATAGTTTATCTGAAAGGGCTCTATCAAGAAATTCGCAACAAATGCGAACAGTGGAGGAGTAATTTTTCCTTTTACTTTTTTTTTTCTTCTTTTTGCTGGCTTGTAAGCAACTTTGAATTGAACTTACCAATATAAGATATTTGGGAGATTTTCTAATTATGTCTGTTGAGATCAGTATGAGGTCTCCATATAATTATTCGAATCACGGTTGATTTGTTGTGTTTAACATTTATCCAGATCTCGTGCTTTAGGTTTTTTTTTTTTTCAAATTAAACATGGTATTGTTCAATCTTTGAAGAATGAAAATGATTTGGACACCCATGCTTGATCCAATAAATGAACTCCTTTTTTATTTCACTTTGTCTGCTTAGAAAGTATCGATATCTTCTTCACTTAGAAAGTGTTACAAGGATATCACTGCAGCTTTTAACTGCATAACTGAGGCACGCTAACTATTGCTTCTAGGGCACTGACTTTAAAGAATGGCGGCTAGTTGTACAAGAAATTTTCTGCTTCATGAAAGTAGCAACAATCTCAATGAATGTCAGACCTCTGCGTTACTCTGCTTCATTTGATTC
CCATCAGTTATCCCTTCCATTTGTAGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTGAGTTAACTAGATATTATTTTTTTGTCTGTCGCAAGAGTAAATATAAATCTTGAATGAGAAGAATGTTTATAGATTTAACTGATTTCATGCACTACTACTTTCAGGTTAAATTTGCAGAAATTACTTTAGACACGTATAGAATGCTACAATGTCTCGAATGGGAGCCTGGTTTCTTCTACCAAAAGCATCCAGTTGAACCAAATGAAAATGGTGCTACCATTGATCATTCTGGGGCTTCTGGTATAATTGATATTAATTTAGCTACCGATATGACTGATCCATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTAAGTTTTTATTTATTCATGAAATTGTTACTGCTTCAGTTGGATGGTAGAGTTTTTAATTTATTCAACTTTTGGCCTTGTTTTTGTTTTCTTAGAGAAGAGAAACAGATTTTATTGATGAATGAAATTAAGGGAGGTAAACCCCCAACCCCCCCNCCCCCCTCCAAAATACAAGCAAAATTACAAGAAAGAAGTCCAATTAGAGAGGGAAATAGAAAGACTATGATTACTTAAGGGGTGTTTACATTTACACCAATAAAAAGTGTTGAACAAGACCAAATCCATTAAACCAACAAAGTTTAAAGAAACATGTCTGAAAACTGAAGCAAGAAGATCGAAAATAGTTTTCCAGTGAATGTTAAGTTGGCCAATTTCTGTATTTTATTTCCTAAGATCCTCTTATCTCAAAATATTTTCTCGTTATTCTTCTGAATTGTTTGTGGGTATATGCCCAATATCTTGTTACATATTATATGATGCCCTGAAGCTCTGTATTCGTTCTGCTTGCACTTATTATCTTGGATTTGCTTCCCATGATCTAATAAGGATAGGAACTTACGCATGCTTTACTGAAGCAAAAGAAAATCAGAAGGTTGATAGCATCGAGAGAGGGATCTTTCTCCACAGTTCTGTACCCAAATATGTACCAACCTTTCCCACTTTCTTCTAATACTAAACGCTACTGAACTCATAATGTGCCTCATAACTACCCATATAACAACTAGCTAAATTCCAATTCCTATAACCACCCACTGTCCAAAATTAACTTGTGCTCAAGGTGATGGATTCAAAATTAAGATATGTCATATCTTCTCATAATGACTTTTGTGTCTTATAGCTATTCAACATCAGAGGGAGAAATCACCACTTTTTTGTGAGAAAATTCTGCTTCGATGGTTGTTGGTGCTGGAGCCTGTGAGGGGAATTATCAACAGCTCCTCTAGTGCGCACTCCATATTTCTCAGTCATCTTCACACTTTTGGAGGTACCCAAGAACAGTGTCTCAGCCAGTGTTTCTCCTCTAGCTTCCCATAGTCAGTATTGCATTCAGCCCCAAACTTTTCTCCTTAGCTATTCCATACTACCTAAAATCTTCATCCTAAACTCCAATGCTTATCACCACCCCCTGCCTATATATTACCTTGTCCTCTAGGTAATAAATAAGAACTAGGACATGCCCATCTCCTCTTAGTGACCATTGTGACTTCAGGTATCCATAGTAAGAGATACAAAGCACCCCTTTTTTTTTGGTATATAAGTCTTCGCTAGTCATACTACTTTAGCCCCTCTAACGTCACTGTTATAATGCTTGACATGGAAAGGTTGATATGATAGCAGTGACCTCTACGTTCAAAGCAAGTATTCCAGCTTCACTCTTGCTTTTCTCTATCATCTTCACAACTTCCGAAGGTAACCTCCCATGCACAGTGCTTAGGCTGCGCATCTTCTCTAGGTTCCCATCTCCATTCGAAGCTTCCTCTTCCCTCTCTATCAAATCCATGTTTGAAAGTGGAATTTGGCCTTTTATTCTATGGTTTTTTTGGGCTTGATGCAGTCAC
ACTCTCAGCCCACTCATGAAATTTAGTGCCCAGAACCTCTCAAGCTCATGTAGTAGCATTCTCAACTATCAACCTTTTTCTTCTTATCGATTGTGTTCTATTCTATTTGTAGCCAATGCCAACAATTTTTCTACTCACTGGTTATCGAAATCTACCTTCAAACCCTAATTTTCTGTTTGCGGTAAAACCACTCCTTTGGAATTTTGTCTACCAAGATTGACCATATCTTCTTCTATGCTCCCACCACTTTCTATTTATGAAAAATAGGCACTAGGTACGGCTTTCATTTGTTCAAGATTGGTTTGATCTTTTCGTGCTACCTTCACTGTTGTTACGGGGCCTTATTGAATTGTGCCACCGTCCTTCAATGTGGATTGTGAGCCCCTTGGTCTTCAAGTTTCTTCATTCCTCACCGAGTGTTCCAAGATCTAGATTTCCATCCACCAATGGATTTTAAAGCTCGAGACACATTAAAGCAGAATAGCACTCTGGGGCTTAAGCGTGGGTTTTTTTTTTTTTTTTGTGATGCACACAGTTGAAAAATGTTGATAAATTTTATGTATATGAAGAAAAATACTACTACAAATAATAAGATGATGAAAACTTTAAGAAAACAGAGATTTTATTTTAGGGGAAAAGGATTATTTTGTAAAAGAGTATTCGTTTGTGCCTTTTCCCTTACAAGTTAGACAACTTCAAAAAATAGTGGAAGTAAAGCGCTGAGCGGCACACCACAAAGTATTGAGGCATGTGCTTCATCAAGGGAGCCTCACTTTTTTAAGCAAGGTGCAGAGACTGTGCCTTGGGCCTAGGCGTGCACCTCAGTGAGCTTTTTAAAATGTCCAAATATTTGCCTACTTCACTAATCTAAAACCAATGTTAATCAACAGTTCCCTCTAATAGACACTATCTATCCCTTATACAATATCGATGTAGTAACCCACCCCTGTACCAATCAAACTCACCAATGACTATGAAATAGTTGATACTCACTATCTACCTTTAGGAGAATTGCAAAAGAAAATTTTTGAGTTTAGTTCTGTACCTAGTTAATTAAAATTATCAAGTAGGTCATTTCTGAACCGAAACAATGTTTTTGTTATGAGAATGCCTACTCTGCATCGTGTTTTTTGCCAGATGGTGTGTGTGGTTGCTCTGCCCGCTTTTTTATTTTTATTTTTTGTTGCATTTGTATGATAAGAACTTTCAATTGAGCTTCTTGGCTTGTGATCAGGTCATGGCTACAGTTTGTGAGGAGCTCCTTCCAGATAGTATCATGCTGATTTATCTATCTGCAGCAGGTTAATGTCCTATAATTGATGATTCAAGTGAACTGATCTCATTTTATATTTTACTATTATTTATTGGCACTCATTGACAAAATGTGCAGGAAAATGTTGTCAAAACAGTGTCAATCAAATGGCAAGTTATGGGGAATCAAGAAAATCCGTGAGAAGTAAAGTCATCACCCAGAACTCACGAGAAAATTGTAATGCTTTGCCTGAATCTTGTAAGAGTGAGAAGCGAGGATCAAGTGACCTATATGATGAGTATTTGTGGTTTGGGCATAGGAGTAATGGAGGTACAGTCCTGCCATACTCTCTCTCGATCGTCTTTTCTGTTGAGTTATGTGGATACTTCATTTCAACATTTTGTTGTTTCCATTTTCATTGATGCCTTAGAATTCTTCTTATTCTTCTTCTCCTTTTTCTCTCTCTAAATTTTCAGGTCCAAACGTTCTATACCCTGGTGATATAATTCCCTTTACACGTAGACCTGTTTTTCTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGGTTTGTCAAAACTTTTACATGTGTTGCCAGGTCATCATGGTTTGCTTCACACTGCTATTTTGAAAATGAAATAGGTGGATATTTCTGGTTTTTCTCCAGTATCATCATATCTGCTCTATATACGTGATTATCACGGCTTTGAGCATGACATGAGATAGTTATGACCATCAGATGCA
TTAAATTGATCTGCTTTCCTTGTATAAAAGGACAGGTGTTGAAGCCAACTATTGCTTCCAACTTTCATGTTGCACTTTCAGTATGAATCTTTTCATGTACGCAAAATTGCCAGTTCACCAGTATCTTACTGACAGTTTCATGTTGATGGACCTGTCTGGTCTAGAGTGAATTTAATTGGTTTTTAGAGATACAATGCTTCAAGAAGTCACTTAAATCCGTCAATTTTTCTATGCTTCAAGAATTTAATTCTATTTGACCAACCACAAGTCTCAGAGGACTATCTCAGAATTCACTTCGTTTAGTATTCCTAAATCTTACATATTAAGTGCGATACCTTGTTCTGTTGTTACTCTCAAGTCAATCTGACTGCTAACGTGGAAACGATATTTGGTTTTCAGGTTCTACATGGTGCAGAGAGAGGAGAGACTGCCGCCATACTTCTATCACCTTTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACGATTCAATCCGGAAGTCAGTTCACTTTTTTCTTGACTGCCCCTCTTCCTGCATTTTTTGAAATGGTTGGCCTGTCCTCGGCCAATATGGATACAGTAAGTTCATTATTACAGTCAAAACTCCTCTTCAAATATTTTCAGGACTTCCGTTTAATCCACAACTTGTTATTATTTATAGGATGTTTACAATGATGCTGAAACCATAGTCTCCTCCGCATTTTCGGAGTGGGAAACAGTTCTTTGTACATCAACTAGCTTGAATATCGTTTGGGCCCAAGTTTTGTCTGATAATTTTTTACGCCGTCTCATTCTCAGGTATTTTCCAGTCAACTTGCTCACGTTATTTATGCAAATATTATATGTACCCAAAGCATCTTTTTTATGGAAGCGTGAATCATGTGGAATGCTCTAGCCATGGAAAGAAGTCACAATTATCTTTCTTTTGTTCCATAATCTGCAGATTTATATTCTGTCGATCCGTGCTATCCTTCTTCAGTACTAAAGAAGATGATGATCTTCCTATTTGTCTGCCTTGTCTCCCCGATTCTGTTGCTTCTAAGTCTGGAGTTGTCTGCTCAGCAATTTGCCGTCTCGCAAAGCACCTTAACGTTGCAGACTTATTTAACTTCCATGAAGTATGATCACGATCTAGATATTGCGAAAACAAATTTGAGTTCATAGTCTTCCTCAAAGGTCAGTTCACAACCGAAACATTACACCATTGACCTCTTGAGAGTTGCAGTAGGACCACAAATTGGTATTTGTTCTGAAGCTAAAATTTATGAAACACTGGTAGTGTGATGGTGGTTGCCTTCCATTTTAATGTTTATGAGCAACCTGTTGTGGGCTTACTGTTTAGTTAGATGAGAATGTGATAATTTATTGTAGGTTTCTATTGAGTCTTGAAGATTCATTTAGTTCCCTCTTATTTTTTGGGCCAATTTTTCTTTCATTTCTGTGTGGAAAAAAAGTGGGGAAAAGAAATTGAAGCAGAGAAATGAAACAAGTTACTTGTAGATGCTGGCAAATGGGGGAGTGTACAATCGCTTCTAGGACGCTTATTACAAAAGTGTCAAATCATGAGAAGTCTGACTTCATCTTTGTAACTTGAGTCTGTTCATTATTGGAGTGTTATAAGAAAGCATTTTCGAAGTTCAGGAAGCTGTTCATGTTGCCTAAGATTAAATCATGCCCAAGTTGGACTCATCTTTAAGTGCTATGAAGTCTTTTTTTTTTATTTTTTTGATTTCTATAGGCATGTTAGATACAAGGGGGAATTGACATTTTTACCATTTCATAAACAAATATGAATTAATTAAAATGACTCTTTTTAGAGTTCAAGTTAAGTTAGATGTCATGCAAATTGAACAGGAATGTCATGAGCAGAGGATATGCATGACACGTGTCCCCTTGAGTGTCCAGCCATAGCGTCAACACTAACCTATCAATAAAGGCAGATCAAAGGGTGGTCGAGACCAAGATCGAGGGGTAGGAGTTGTATTTAGTTTTCGATC
AAAATTCTTCTCCTCTCGTTTGTCTCGTGGCTTAGTCTTTTGTGTTAACTTGGAACTTTTGTGCTACATTGAGTATTTTTTTTTATAGTTGTTTCGCCCCTAACGTCTCTTACAACTAGACCTTTTGGTCACAATTAATTAGTCACAAAATTCGTCATTCATTTTAACAACAAAAGTTTAGAAAGATAGATATATATTTTGAAGGCCAAAATAAAATGAGCAAAAATGTTGGAAATATGTGTTAAGAAAAGAACCCAAATTGGATAAAATCCTGTATGGCGCCCATTTGAAAGTTCTATTTTTTGCAATTCCCGTATATCTCCCAACAGAATAAACAAAACTCCAATTTTCCAATCTTTTATGTTTGACAACTTCAAAATCCAAAGACCCCTTGAAATTGGGTAAAACTTCATATTTAACGGTTTGATTCTATAACCTCTCAAACAGAAGGAAAAAAAAAAAAAAAAAAAAAGATAAATGCCACAGATTTATCGCAATTCCGACCAATCGGAAACGAGAATTGCATTCTCTCGATCCTCGTCTTCCTTCTAAAGGTCACATCCAGGCCGTTCACTCTACTCGAGAAAACACAACTCGCGCCATCATTTACCGTTAGATTCGTACACCGAAAAAACACACGGATATGCTTTCTTAACCGTCCGATCTCAGATTCTATCTTCCAGTGCGACCCTATAAATAAGCCCTAACTGCTCTTGTTGTCAAACTTTTTAGCTTCTTTGACATTGAGAGATCTGCTCAAGAAAGAGAAGAAGCAGCGCCAAGCGACACGAGGTATGCCTGATTTTATTTTTTTGTTTTCTTTCGCCATTTTTTGATTTTTGTTTGCTACTGGGGAATTGTTTAGGGCTTTAGACTTCATTTTAGGGTTTTCTTCTTTAGAGTTGGTGCATCGTTGCTGGATTTTTCTATTTTCGTTGATGATTCATATCCTTGTTCGGCGTTCTCCGCTCTTGATTGTGTTTCCAGACCTAGGCTTGAGAAATTGGAATACTAAGAGTTCCCTTGGGTTTTAGGTTTCTCATACGTTAACAACTTTCCCTGATTTGTGATCTCTTTTAGGTTTCGTGATCTTTGTTTCTGGAAGTTTGTGAGAACATGGGAATCTGGTATTTGTTAGGAAATGGGATTAGCTATCAATTAGATGTGACGAATTAGTTTTTAGCGTTGTGTAATCTTATGTGGGCGCTTAAATTAAAGCATTTGCTTGACCTGAATTTGAGTATGGTACGGTTTAATTGTGAATAAATCGGAAGTGGGGTTTATTTTCCGCAGATGGCTCGTACCAAGCAAACTGCCCGTAAGTCTACTGGAGGAAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGTTTGTTCATTTTTCAGAAGTTCGATTCCGTGGGTGAATTTGGTTGATTACTCAATTCTATGGACGTTCTAATGTGGTTATGTTTTTGGAACTGTAGGCAGCACGTAAGTCCGCCCCAACCACAGGAGGGGTGAAGAAGCCTCACAGATACCGACCTGGTACTGTTGCTCTTCGGTGAGTTCACGTGGCCTTTATCTCCCATGTTTTCCCATCTTGAATTTGCTGTATGTGTTTTGTAACTTTCGATTACGTTTTGATGCGCAGTGAAATTCGCAAGTACCAGAAGAGTACTGAGCTTTTGATTAGGAAGTTGCCGTTCCAGAGGTTGGTTCGTGAAATTGCACAGGACTTCAAGGTATTTTCTTATGTCTGTTAACTTATTTCATCTCCTGCTTGTAGATTTTGCCGAGGTTTGTGTAATTTAGGTCTTGCGTTAAATTCTATTGGTGCAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCTTTGCAAGAGGCCGCCGAGGCCTACCTTGTTGGGTTGTTCGAGGACACCAATCTGTGCGCCATTCATGCCAAGCGTGTTACCATAATGCCCAAAGATATCCAGTTGGCTCGGAGAATCCGAGGTGAACGTGCATAGAAGATGGTGGAT
GGTATTGGCTGCAAGGAGGTGATCAAAGGCCTAATTGATGTGATGATATGGATATGGATGGATGGATGGTTCTTACTTTTGTTTTAAGCAACGTCGGTAGATAGCAATACTAGGAAACTTGTAGTGAATATGTGGTAAAATGGCATAATGCTTTGCTTGTGCATGTTGATGTTTAGCTGTGGATGATTTTGTTTTTAGTTTTTTCAAAAGATGGAGTTGGATTCTGTATTTCATTGAATTGTGTACTTCGGGCATGCAGCCAAACTAATATCGTCGGTTTAACCTATCACATCTAATCTGTAGTTCAGGTTTCTAAGTTCCAGTTCAATTTTTCTATCTGTAGTGCTTTGAACGGATATTGAATTCCTATTGCGACTCCTAGAAAATTTTTCATGCGCTTGGTATACAAACTCTCTATCACGGATCTGGTCCGAAGTTGTCGAGCCATTAATTTTAACAAAGCAACCCTCTATTTGGATTGATAACTAATGAACAACTGTTCTTGTTGTCCTTACATTGTTGCCTCATTCATTCCTCATTATTAATGTGGGACTATCAACAAAGCAACCCTCTAGAATCTCTTACCTTGTACAAGTAGAGATTTACCATGATGATTAACGAAGCAGTAATGTGAGTCTTTATATTAGTATTTGAGGTTCTGTCCAAATCTAGAAAACAAAGTACGTGCGGGGAGGTTGAAGGTGAAGTGTGTGGAGTGCTCACGTAGTTGAGGGGTACAATGCATGTGAGGTTGGTTTATTGTCTCGAGATAGTTATATGAGCTTTGTTCCCATTGTAGGATGAATTATTATAAAATAAATTGTTTATACCTAAAAGAAAACAACCCAATAAAGACTAATTAATAATGCCTTAGAAGAGGTAAATTGGATTAAAGAGAATGATGAGACAAATATAACCAAATCTTCCAACTTCCTTTTCACTTAATTATCCAGAATTAAGTGTCAAGTTCACATCCATGCATATTTCATAATAAAATTTATGAACGAAACATGTTTGAGAGCTTGAGAGTTTTTTCTTTCTCAATAACGTTCTAATTCAATTACTTGTGTCTACTAATGACAATCTTACGTACCATATTATATAAAGAAAGTTTTTATGATTGAACCACTGAAAAGACTGCTCTCATTTGATTGTGATTTTTGGTTCATTCATGTATCTCTTCTTTACTCGAGTGTCACATTCTAACCGAGACTATTTATGAGATTTTATTAAACTATACGACTAAATTCCACAAAATTTATATTTTAATACAACATTGAAACACCTGCTAAATTAAGAAAACATATATTTTAAGGATATCATTATTTATTACTTAAACTAAGAATTAAACAAATAATTAACCAAAAAAAAAAAAAGGCCAATTGACAACGTCAAATCCAATTCCTAATTAACTTCGTCCTTCCTTTGCCTCTCTCTCTCTCTCTCTCTCATTTTATTTTTTATTTTTTTATTTTTTAGAATCGACAAAGATTGAAAGCCCAATAATTAGTTTACTGGAACAACCGCCAACTTCGGATTTTCCTTCATTCTCAAACCTATTTTCTTCTTCCAATCTTTGCTCAAATCTCCAAATTCACAAACCAACACACATATCTTCAATCCCTTTCACTTTTGTAGCTGTTCTTCTTTGAATGGAATTGCGTGTTCTTTTAACTACTTCGATTCTGTAAGTTTCCTTTTCTCTTCCCCTCCTAGTTTTCTCCCATTGACCATTTCTCTTTCCTACAAATTTCCTGTTCTTCGATTTAGATTCTCCATTTCTTTTGATTCTATTGAAATTTTGATTTCTTGAGGAAATGGGTTTTGTTCTGTTTTGTGTTTTGCTGCCAAATTCGCAGAGAAGTTGATCAGAGTTTCTGAGATGCATCAACAAATGGAGAAGACTGATGAGAAAACTCGTGTGGCAGAGGAAGAACAAAATGGTTCATCCGAGCTTAATGATCCGCTTGAAGAGATGAAGAAAATGGCGGACGAGACCA
ATACAAAGCTCAGGTCTGAGTTGGAATCTGTGAAGAGCAAAAGAGATTCTGCCATGGAAAAGGCTAAGGAATTGGAGCTTCAATTGGCAGAGAAGAGTTCTAATATGGCAAAGCAAAAAGAGGAGCTCTCTGTTTTGAAACGTTTCGAATCGCAAACCCAAACAAGAATCCAAGAACTCGAAAAAAAGTACCAAAATTCGAAAGAATCAGAAGAGAAAACCAAGGAATTATTAGCAGAACAAACAAAGCGCCTCGAACAGACAAAGATTTCCCTGGAAGAATCGAAGATAGAGATCCTATCTCTTCATGAAAAACTTGTGAAATTCTCTACCGAAACCCATTTCAACGAACTCCCGACGCACAACATTCCAACAAAAAATGAATTCGAACGCCTAAAATTCGAGCTCCAATCGACAAGACACCAACTTGGTGTCCTAAAAAACGAGCTAAAAGTAACCACAGAGGCTGAGGAGAACAACAAAACAGCCATGGACGATCTAGCAATGGCATTAAAAGAGGTAGCAACAGAAGCCCATCATTTGAAAAGGAAATGCAGCACAACCGAAAAGGAATTACAGAAAACAAAAGAAGAAGCAGATAATTTGAAAGCGACATTAAAAAACACAGAGGAAAAGTACAAATCTTTGCTACAAGAGGCAAGAAGAGAGGCAGATTTGTACAAGAGCACTGTGGATAGACTGAGATTGGAAGCAGAGGAGTCTCTGTTGGCATGGAGTGGAAGAGAAACAAGCTTAGTAGACTGCATAAGAAGAGCCGAAGACGACAGATTCAATGCTCAACAAGAGAATCGCCGTTTGATGGATACTCTGAGGCTAGCAGAATTGAAGAACATGACATCAAAAGAAGAGATAAAGAAATTAAGGGACATTCTAAAACAGGCCTTGAACGAAGCCACAGTGGCCAAGGAAGCTGCAGGAATCGCCATTGAAGAGAATTCACAGCTGAAAGACTCCTTAGCTGAGAAGGAGAACGCATTGGATTTCGTTAGCAGTGAGAATGAGACACTGAAAGTCAACAAAGCTGCAGCATTAGAGGAGATTAAGGAGCTAAAGCAGCTGCTAGAAGCAAGTAAGAGAGGAGAAAGCAATGGAAAAGAAGAAAACAAGGGGAAAGAAGAAAACAAGGGGAAAGAAGAAGGGAAGGAGCAGGTGGAGAAGGAGATAACGAGGTCGAGGCCACCGTTGAGTCCGAGTCCAAGCCTGACTCCACCGCCAGTAGAAAAGGAAGATACATTCGGGAGAAGGCTGGGGAAGGCTTTCAGTTTCAGTTTCTTGGAGCTGAGACTGACATCAGAGAAGAAGAAGGAGGTCGAAGAGGACGAGGGAGAGCCTCAAATGGAGGAGACGCTCAAAGGGTCAATTTTCGACGAGGTGGACTCGCCTGGTTCAGGGAGGGTGCACGAGAGGAAGCGATCATTGTCTCAGTTCGACGGTGATAGGGATATACTGAATGATGAGATTGAGGATCTTGAGCATTTGGAAGAGGGTAATTTGGATGGAGAAGAAGGGGATAGGAATTCAAGGAAGAAGAAGGCATTGATAAGGAGATTTGGGGATCTTTTGATGAGGAGGAGGAGTTTCCAGAAGAAGGAACAATCCCCAGAATGA

mRNA sequence

TAGGCATGACCGTCGGATCAGAAGACGCATGGCGTCAATCAACGGTCACGATTTGAGATGACAAAATTTAATGGCAGTGCTCAAAAAACACACCTCTCCCACATGTCTCATCACTTCTCCTTCCTTTCTCCGCCATAGCCGCCAAAATTCCTACCGTTTCCGGCCGTTCTCCCGCCTCCGCAATTCGCTTTTGATGGATTTCAAGTCCAAATTCCGTTTCATACGACAGAGTGAAAGCTAAGGAGTTCGAGGAGGAAGAAGGACGGAACAGAAAAAGTGGTTTGATTTATGACACCACAATACAACCAAACCCCACACGAGTCCCAAGCAAAAAGTCCCAACTACTTATCTCTCTCTTTCTGGTGCTCTGTATCGTATAAGCACGTACACGAACTTGCGCGCGCGGCACTCTCTCTTTCTTTCTCTGAAGGAAGCTCACTTTCTTTGTTTCTTTTCCTCTTATACTTCACTTTGCAGTTTCGATTGTTCTGTTTTTGTTCGATGAGAGACGGTGATTGTGGCTACCGGTCTGGTAATTGAGACCTCCGGAATGACCGACAGCAACTCTGTTGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCCGACCGGAAATTTGCTAGGGTCCAAGACGTCCCGGCTTACGGCCGTGTGGACAACCACCACTATTTTCATAAGGTTTTCAAGGCATATATGCGTCTATGGAAGTTCCAGCAGGAATTCCGCGCCAAGCTTGTTGAATCTGGTCTCAACCGCTCTGAAATTGGCGAGATCGCTAGTCGGATAGGTCAGCTTTACTTTGGGCATTATATGAGAACCAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTGTTCTACGAAGCGATTCTTAATCGAAACTATTTTGAGGAATCGAAGAATTCGAGGAAGGATCTGGGAGCGAGGTTCAAACAGCTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGCTTTTGAATCGCACGCACACGGTTCAGGTTCTCGCCGAGAGATTGAAGGCTCTGGTGGACGATAGCAAGGTCGCTTTTCGGGGCACTGACTTTAAAGAATGGCGGCTAGTTGTACAAGAAATTTTCTGCTTCATGAAAGTAGCAACAATCTCAATGAATGTCAGACCTCTGCGTTACTCTGCTTCATTTGATTCCCATCAGTTATCCCTTCCATTTGTAGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTTAAATTTGCAGAAATTACTTTAGACACGTATAGAATGCTACAATGTCTCGAATGGGAGCCTGGTTTCTTCTACCAAAAGCATCCAGTTGAACCAAATGAAAATGGTGCTACCATTGATCATTCTGGGGCTTCTGGTATAATTGATATTAATTTAGCTACCGATATGACTGATCCATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTCATGGCTACAGTTTGTGAGGAGCTCCTTCCAGATAGTATCATGCTGATTTATCTATCTGCAGCAGGAAAATGTTGTCAAAACAGTGTCAATCAAATGGCAAGTTATGGGGAATCAAGAAAATCCGTGAGAAGTAAAGTCATCACCCAGAACTCACGAGAAAATTGTAATGCTTTGCCTGAATCTTGTAAGAGTGAGAAGCGAGGATCAAGTGACCTATATGATGAGTATTTGTGGTTTGGGCATAGGAGTAATGGAGGTCCAAACGTTCTATACCCTGGTGATATAATTCCCTTTACACGTAGACCTGTTTTTCTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGTCAATCTGACTGCTAACGTGGAAACGATATTTGGTTTTCAGGTTCTACATGGTGCAGAGAGAGGAGAGACTGCCGCCATACTTCTATCACCTTTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACGATTCAATCCGGAAGTCAGT
TCACTTTTTTCTTGACTGCCCCTCTTCCTGCATTTTTTGAAATGGTTGGCCTGTCCTCGGCCAATATGGATACAGATGTTTACAATGATGCTGAAACCATAGTCTCCTCCGCATTTTCGGAGTGGGAAACAGTTCTTTGTACATCAACTAGCTTGAATATCGTTTGGGCCCAAGTTTTGTCTGATAATTTTTTACGCCGTCTCATTCTCAGATTTATATTCTGTCGATCCGTGCTATCCTTCTTCAGTACTAAAGAAGATGATGATCTTCCTATTTGTCTGCCTTGTCTCCCCGATTCTGTTGCTTCTAAGTCTGGAGTTGTCTGCTCAGCAATTTGCCGTCTCGCAAAGCACCTTAACGTTGCAGACTTATTTAACTTCCATGAAATGGCTCGTACCAAGCAAACTGCCCGTAAGTCTACTGGAGGAAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGCAGCACGTAAGTCCGCCCCAACCACAGGAGGGGTGAAGAAGCCTCACAGATACCGACCTGGTACTGTTGCTCTTCGTGAAATTCGCAAGTACCAGAAGAGTACTGAGCTTTTGATTAGGAAGTTGCCGTTCCAGAGGTTGGTTCGTGAAATTGCACAGGACTTCAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCTTTGCAAGAGGCCGCCGAGGCCTACCTTGTTGGGTTGTTCGAGGACACCAATCTGTGCGCCATTCATGCCAAGCGTGTTACCATAATGCCCAAAGATATCCAGTTGGCTCGGAGAATCCGAGAGAAGTTGATCAGAGTTTCTGAGATGCATCAACAAATGGAGAAGACTGATGAGAAAACTCGTGTGGCAGAGGAAGAACAAAATGGTTCATCCGAGCTTAATGATCCGCTTGAAGAGATGAAGAAAATGGCGGACGAGACCAATACAAAGCTCAGGTCTGAGTTGGAATCTGTGAAGAGCAAAAGAGATTCTGCCATGGAAAAGGCTAAGGAATTGGAGCTTCAATTGGCAGAGAAGAGTTCTAATATGGCAAAGCAAAAAGAGGAGCTCTCTGTTTTGAAACGTTTCGAATCGCAAACCCAAACAAGAATCCAAGAACTCGAAAAAAAGTACCAAAATTCGAAAGAATCAGAAGAGAAAACCAAGGAATTATTAGCAGAACAAACAAAGCGCCTCGAACAGACAAAGATTTCCCTGGAAGAATCGAAGATAGAGATCCTATCTCTTCATGAAAAACTTGTGAAATTCTCTACCGAAACCCATTTCAACGAACTCCCGACGCACAACATTCCAACAAAAAATGAATTCGAACGCCTAAAATTCGAGCTCCAATCGACAAGACACCAACTTGGTGTCCTAAAAAACGAGCTAAAAGTAACCACAGAGGCTGAGGAGAACAACAAAACAGCCATGGACGATCTAGCAATGGCATTAAAAGAGGTAGCAACAGAAGCCCATCATTTGAAAAGGAAATGCAGCACAACCGAAAAGGAATTACAGAAAACAAAAGAAGAAGCAGATAATTTGAAAGCGACATTAAAAAACACAGAGGAAAAGTACAAATCTTTGCTACAAGAGGCAAGAAGAGAGGCAGATTTGTACAAGAGCACTGTGGATAGACTGAGATTGGAAGCAGAGGAGTCTCTGTTGGCATGGAGTGGAAGAGAAACAAGCTTAGTAGACTGCATAAGAAGAGCCGAAGACGACAGATTCAATGCTCAACAAGAGAATCGCCGTTTGATGGATACTCTGAGGCTAGCAGAATTGAAGAACATGACATCAAAAGAAGAGATAAAGAAATTAAGGGACATTCTAAAACAGGCCTTGAACGAAGCCACAGTGGCCAAGGAAGCTGCAGGAATCGCCATTGAAGAGAATTCACAGCTGAAAGACTCCTTAGCTGAGAAGGAGAACGCATTGGATTTCGTTAGCAGTGAGAATGAGACACTGAAAGTCAACAAAGCTGCAGCATTAGAGGAGATTAAGGAGCTAAAGCAG
CTGCTAGAAGCAAGTAAGAGAGGAGAAAGCAATGGAAAAGAAGAAAACAAGGGGAAAGAAGAAAACAAGGGGAAAGAAGAAGGGAAGGAGCAGGTGGAGAAGGAGATAACGAGGTCGAGGCCACCGTTGAGTCCGAGTCCAAGCCTGACTCCACCGCCAGTAGAAAAGGAAGATACATTCGGGAGAAGGCTGGGGAAGGCTTTCAGTTTCAGTTTCTTGGAGCTGAGACTGACATCAGAGAAGAAGAAGGAGGTCGAAGAGGACGAGGGAGAGCCTCAAATGGAGGAGACGCTCAAAGGGTCAATTTTCGACGAGGTGGACTCGCCTGGTTCAGGGAGGGTGCACGAGAGGAAGCGATCATTGTCTCAGTTCGACGGTGATAGGGATATACTGAATGATGAGATTGAGGATCTTGAGCATTTGGAAGAGGGTAATTTGGATGGAGAAGAAGGGGATAGGAATTCAAGGAAGAAGAAGGCATTGATAAGGAGATTTGGGGATCTTTTGATGAGGAGGAGGAGTTTCCAGAAGAAGGAACAATCCCCAGAATGA

Coding sequence (CDS)

ATGACCGACAGCAACTCTGTTGCGAAGACCTTCCGAGCTCTGGTGGAGAGCGCCGACCGGAAATTTGCTAGGGTCCAAGACGTCCCGGCTTACGGCCGTGTGGACAACCACCACTATTTTCATAAGGTTTTCAAGGCATATATGCGTCTATGGAAGTTCCAGCAGGAATTCCGCGCCAAGCTTGTTGAATCTGGTCTCAACCGCTCTGAAATTGGCGAGATCGCTAGTCGGATAGGTCAGCTTTACTTTGGGCATTATATGAGAACCAGTGAGGCCAGGTTCTTGATTGAAGCCTATGTGTTCTACGAAGCGATTCTTAATCGAAACTATTTTGAGGAATCGAAGAATTCGAGGAAGGATCTGGGAGCGAGGTTCAAACAGCTGAGGTTTTACGCGAGGTTTTTGCTGGTTTCGTTGCTTTTGAATCGCACGCACACGGTTCAGGTTCTCGCCGAGAGATTGAAGGCTCTGGTGGACGATAGCAAGGTCGCTTTTCGGGGCACTGACTTTAAAGAATGGCGGCTAGTTGTACAAGAAATTTTCTGCTTCATGAAAGTAGCAACAATCTCAATGAATGTCAGACCTCTGCGTTACTCTGCTTCATTTGATTCCCATCAGTTATCCCTTCCATTTGTAGCTCGTTTCCATGCAAAGAGGGTTCTTAAATTCCGAGATGCTGTTTTGACAAGCTACCACCGAAATGAGGTTAAATTTGCAGAAATTACTTTAGACACGTATAGAATGCTACAATGTCTCGAATGGGAGCCTGGTTTCTTCTACCAAAAGCATCCAGTTGAACCAAATGAAAATGGTGCTACCATTGATCATTCTGGGGCTTCTGGTATAATTGATATTAATTTAGCTACCGATATGACTGATCCATCTTTACCTCCAAATCCAAAGAAAGCTATCCTCTATCGGCCTTCTGTGACTCATTTGATAGCTGTCATGGCTACAGTTTGTGAGGAGCTCCTTCCAGATAGTATCATGCTGATTTATCTATCTGCAGCAGGAAAATGTTGTCAAAACAGTGTCAATCAAATGGCAAGTTATGGGGAATCAAGAAAATCCGTGAGAAGTAAAGTCATCACCCAGAACTCACGAGAAAATTGTAATGCTTTGCCTGAATCTTGTAAGAGTGAGAAGCGAGGATCAAGTGACCTATATGATGAGTATTTGTGGTTTGGGCATAGGAGTAATGGAGGTCCAAACGTTCTATACCCTGGTGATATAATTCCCTTTACACGTAGACCTGTTTTTCTGATAGTTGACAGTAACAACAGCCATGCATTCAAGGCAGTCAATCTGACTGCTAACGTGGAAACGATATTTGGTTTTCAGGTTCTACATGGTGCAGAGAGAGGAGAGACTGCCGCCATACTTCTATCACCTTTGAGGCCTGCATTCAAGAATCCCTTAAATGTTGATACGATTCAATCCGGAAGTCAGTTCACTTTTTTCTTGACTGCCCCTCTTCCTGCATTTTTTGAAATGGTTGGCCTGTCCTCGGCCAATATGGATACAGATGTTTACAATGATGCTGAAACCATAGTCTCCTCCGCATTTTCGGAGTGGGAAACAGTTCTTTGTACATCAACTAGCTTGAATATCGTTTGGGCCCAAGTTTTGTCTGATAATTTTTTACGCCGTCTCATTCTCAGATTTATATTCTGTCGATCCGTGCTATCCTTCTTCAGTACTAAAGAAGATGATGATCTTCCTATTTGTCTGCCTTGTCTCCCCGATTCTGTTGCTTCTAAGTCTGGAGTTGTCTGCTCAGCAATTTGCCGTCTCGCAAAGCACCTTAACGTTGCAGACTTATTTAACTTCCATGAAATGGCTCGTACCAAGCAAACTGCCCGTAAGTCTACTGGAGGAAAGGCTCCCAGGAAGCAGCTGGCCACAAAGGCAGCACGTAAGTCCGCCCCAACCACAGGAGGGGTGAAGAAGCCTCACAGATACCGACCTGGTACTGTTGCTCTTCGTGAAATTCGCAAGTA
CCAGAAGAGTACTGAGCTTTTGATTAGGAAGTTGCCGTTCCAGAGGTTGGTTCGTGAAATTGCACAGGACTTCAAGACTGATCTGCGTTTCCAGAGCCATGCTGTTCTTGCTTTGCAAGAGGCCGCCGAGGCCTACCTTGTTGGGTTGTTCGAGGACACCAATCTGTGCGCCATTCATGCCAAGCGTGTTACCATAATGCCCAAAGATATCCAGTTGGCTCGGAGAATCCGAGAGAAGTTGATCAGAGTTTCTGAGATGCATCAACAAATGGAGAAGACTGATGAGAAAACTCGTGTGGCAGAGGAAGAACAAAATGGTTCATCCGAGCTTAATGATCCGCTTGAAGAGATGAAGAAAATGGCGGACGAGACCAATACAAAGCTCAGGTCTGAGTTGGAATCTGTGAAGAGCAAAAGAGATTCTGCCATGGAAAAGGCTAAGGAATTGGAGCTTCAATTGGCAGAGAAGAGTTCTAATATGGCAAAGCAAAAAGAGGAGCTCTCTGTTTTGAAACGTTTCGAATCGCAAACCCAAACAAGAATCCAAGAACTCGAAAAAAAGTACCAAAATTCGAAAGAATCAGAAGAGAAAACCAAGGAATTATTAGCAGAACAAACAAAGCGCCTCGAACAGACAAAGATTTCCCTGGAAGAATCGAAGATAGAGATCCTATCTCTTCATGAAAAACTTGTGAAATTCTCTACCGAAACCCATTTCAACGAACTCCCGACGCACAACATTCCAACAAAAAATGAATTCGAACGCCTAAAATTCGAGCTCCAATCGACAAGACACCAACTTGGTGTCCTAAAAAACGAGCTAAAAGTAACCACAGAGGCTGAGGAGAACAACAAAACAGCCATGGACGATCTAGCAATGGCATTAAAAGAGGTAGCAACAGAAGCCCATCATTTGAAAAGGAAATGCAGCACAACCGAAAAGGAATTACAGAAAACAAAAGAAGAAGCAGATAATTTGAAAGCGACATTAAAAAACACAGAGGAAAAGTACAAATCTTTGCTACAAGAGGCAAGAAGAGAGGCAGATTTGTACAAGAGCACTGTGGATAGACTGAGATTGGAAGCAGAGGAGTCTCTGTTGGCATGGAGTGGAAGAGAAACAAGCTTAGTAGACTGCATAAGAAGAGCCGAAGACGACAGATTCAATGCTCAACAAGAGAATCGCCGTTTGATGGATACTCTGAGGCTAGCAGAATTGAAGAACATGACATCAAAAGAAGAGATAAAGAAATTAAGGGACATTCTAAAACAGGCCTTGAACGAAGCCACAGTGGCCAAGGAAGCTGCAGGAATCGCCATTGAAGAGAATTCACAGCTGAAAGACTCCTTAGCTGAGAAGGAGAACGCATTGGATTTCGTTAGCAGTGAGAATGAGACACTGAAAGTCAACAAAGCTGCAGCATTAGAGGAGATTAAGGAGCTAAAGCAGCTGCTAGAAGCAAGTAAGAGAGGAGAAAGCAATGGAAAAGAAGAAAACAAGGGGAAAGAAGAAAACAAGGGGAAAGAAGAAGGGAAGGAGCAGGTGGAGAAGGAGATAACGAGGTCGAGGCCACCGTTGAGTCCGAGTCCAAGCCTGACTCCACCGCCAGTAGAAAAGGAAGATACATTCGGGAGAAGGCTGGGGAAGGCTTTCAGTTTCAGTTTCTTGGAGCTGAGACTGACATCAGAGAAGAAGAAGGAGGTCGAAGAGGACGAGGGAGAGCCTCAAATGGAGGAGACGCTCAAAGGGTCAATTTTCGACGAGGTGGACTCGCCTGGTTCAGGGAGGGTGCACGAGAGGAAGCGATCATTGTCTCAGTTCGACGGTGATAGGGATATACTGAATGATGAGATTGAGGATCTTGAGCATTTGGAAGAGGGTAATTTGGATGGAGAAGAAGGGGATAGGAATTCAAGGAAGAAGAAGGCATTGATAAGGAGATTTGGGGATCTTTTGATGAGGAGGAGGAGTTTCCAGAAGAAGGAACAATCCCCAGAAT
GA

Protein sequence

MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAKLVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKDLGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEIFCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDIIPFTRRPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVDTIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLNIVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDDLPICLPCLPDSVASKSGVVCSAICRLAKHLNVADLFNFHEMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIREKLIRVSEMHQQMEKTDEKTRVAEEEQNGSSELNDPLEEMKKMADETNTKLRSELESVKSKRDSAMEKAKELELQLAEKSSNMAKQKEELSVLKRFESQTQTRIQELEKKYQNSKESEEKTKELLAEQTKRLEQTKISLEESKIEILSLHEKLVKFSTETHFNELPTHNIPTKNEFERLKFELQSTRHQLGVLKNELKVTTEAEENNKTAMDDLAMALKEVATEAHHLKRKCSTTEKELQKTKEEADNLKATLKNTEEKYKSLLQEARREADLYKSTVDRLRLEAEESLLAWSGRETSLVDCIRRAEDDRFNAQQENRRLMDTLRLAELKNMTSKEEIKKLRDILKQALNEATVAKEAAGIAIEENSQLKDSLAEKENALDFVSSENETLKVNKAAALEEIKELKQLLEASKRGESNGKEENKGKEENKGKEEGKEQVEKEITRSRPPLSPSPSLTPPPVEKEDTFGRRLGKAFSFSFLELRLTSEKKKEVEEDEGEPQMEETLKGSIFDEVDSPGSGRVHERKRSLSQFDGDRDILNDEIEDLEHLEEGNLDGEEGDRNSRKKKALIRRFGDLLMRRRSFQKKEQSPE
BLAST of CmaCh04G008120 vs. Swiss-Prot
Match: H33_ORYCO (Histone H3.3 OS=Oryza coarctata PE=2 SV=3)

HSP 1 Score: 249.6 bits (636), Expect = 1.9e-64
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. Swiss-Prot
Match: H33_ORYSI (Histone H3.3 OS=Oryza sativa subsp. indica GN=OsI_011536 PE=3 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 1.9e-64
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. Swiss-Prot
Match: H33_PINPS (Histone H3.3 OS=Pinus pinaster PE=2 SV=3)

HSP 1 Score: 249.6 bits (636), Expect = 1.9e-64
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. Swiss-Prot
Match: H33_GOSHI (Histone H3.3 OS=Gossypium hirsutum GN=HIS3 PE=2 SV=3)

HSP 1 Score: 249.6 bits (636), Expect = 1.9e-64
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. Swiss-Prot
Match: H33_TOBAC (Histone H3.3 OS=Nicotiana tabacum GN=H3 PE=1 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 1.9e-64
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. TrEMBL
Match: A0A0A0KU99_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G026910 PE=4 SV=1)

HSP 1 Score: 1082.0 bits (2797), Expect = 0.0e+00
Identity = 544/613 (88.74%), Positives = 569/613 (92.82%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           MTD +  AKTFRA+VE+A+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFRAK
Sbjct: 1   MTDHDCEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKD 120
           LVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNR+YFE SKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEI 180
           LGARFK+LRFYARFLLVSLLLNRT TVQVLAERLKALVDDSK  FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM +AT S NVRPLRYS +FDSH  SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNIATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRS 360
           KKAIL+RPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKS+++
Sbjct: 301 KKAILHRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASVGESRKSLKN 360

Query: 361 KVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDIIPFTRRPVF 420
           KV  QNSRENCNAL ESCKSEK GSSDLYDEYLWFGHR +GGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKSEKPGSSDLYDEYLWFGHRGSGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVDTIQS 480
           LIVDSNNSHAFK               VLHGAERGETAAILLSPLRPAFKNPLNVDTIQS
Sbjct: 421 LIVDSNNSHAFK---------------VLHGAERGETAAILLSPLRPAFKNPLNVDTIQS 480

Query: 481 GSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLNIVWA 540
           GSQFTFFLTAPLPAF EMVGLSSAN+D DVYNDA+TI+SSAFS+WE +LCTSTSLNIVWA
Sbjct: 481 GSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWA 540

Query: 541 QVLSDNFLRRLILRFIFCRSVLSFFSTKEDDDLPICLPCLPDSVASKSGVVCSAICRLAK 600
           QVLSD+FLRRLILRFIFCRSVLSFF+TKEDDDLP+CLPCLPDSV+S SGVV SAI RLAK
Sbjct: 541 QVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAK 598

Query: 601 HLNVADLFNFHEM 614
           HLNVADLFNFHE+
Sbjct: 601 HLNVADLFNFHEV 598
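
The identity and positives percentages reported for each HSP are simply the matched count divided by the alignment length. A minimal sketch of parsing such a statistics line and recomputing the percentages is shown here; the names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool, and the regular expression is an assumption based only on the line format seen in this record.

```python
import re

# Matches lines like "Identity = 544/613 (88.74%), Positives = 569/613 (92.82%)".
# The optional "i" also tolerates the misspelling "Postives".
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def percent(count, total):
    """Return count/total as a percentage rounded to two decimals."""
    return round(100 * count / total, 2)

def parse_hsp_stats(line):
    """Extract identity and positives counts and recompute their percentages."""
    m = STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, length, _, pos, pos_length, _ = m.groups()
    return {
        "identity": percent(int(ident), int(length)),
        "positives": percent(int(pos), int(pos_length)),
        "length": int(length),
    }

stats = parse_hsp_stats(
    "Identity = 544/613 (88.74%), Positives = 569/613 (92.82%), Query Frame = 1"
)
print(stats)
```

For the Cucumis sativus hit, recomputing from the counts reproduces the reported 88.74% identity and 92.82% positives over 613 aligned positions.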

BLAST of CmaCh04G008120 vs. TrEMBL
Match: M5WJ20_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003080mg PE=4 SV=1)

HSP 1 Score: 840.1 bits (2169), Expect = 3.7e-240
Identity = 437/621 (70.37%), Positives = 508/621 (81.80%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           M+D++ V+ TFRALVESADRKFARV+DVPAYGRV N HYFHKVFKAYMRLWK+QQE R+K
Sbjct: 1   MSDNDVVSGTFRALVESADRKFARVRDVPAYGRVHNQHYFHKVFKAYMRLWKYQQEKRSK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNS--- 120
           L+E+GLNR EIGEIASRIGQLYFG YMRTSEARFL+EAYVFYEAIL+R+YFE S +S   
Sbjct: 61  LIEAGLNRWEIGEIASRIGQLYFGQYMRTSEARFLVEAYVFYEAILSRSYFEGSNSSSKA 120

Query: 121 --RKDLGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRL 180
             +KDLG RFK+LRFYARFLLVSL+LNRT  VQ+LAER KALVDD K  +R T+FKEWRL
Sbjct: 121 FGKKDLGVRFKELRFYARFLLVSLILNRTEMVQLLAERFKALVDDCKANYRETNFKEWRL 180

Query: 181 VVQEIFCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNE 240
           VVQEI  FMKV T  MNVRPLRYS  FDSH  SLP+VARFHAKRVLKF+DA+LTSYHRNE
Sbjct: 181 VVQEIVRFMKVDTAFMNVRPLRYSTLFDSHPSSLPYVARFHAKRVLKFQDALLTSYHRNE 240

Query: 241 VKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSG---ASGIIDINLATDM 300
           VKFAE+TLDTYRMLQCLEWEP G FYQK  VE  ENG  IDHSG   ASG+ID+NLA DM
Sbjct: 241 VKFAELTLDTYRMLQCLEWEPSGSFYQKRAVESKENGTFIDHSGASAASGVIDMNLAADM 300

Query: 301 TDPSLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASY 360
           TDP+LPPNP+KA++YRPSVTHLIAVMAT+CEEL  DSIML+YLSA+GK  +N+V Q  + 
Sbjct: 301 TDPTLPPNPRKAVIYRPSVTHLIAVMATICEELPLDSIMLVYLSASGKAGRNNVTQAYNS 360

Query: 361 GESRKSVRSKVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDI 420
           G S KS ++KV         N++PESC + K  SS  YD+YLWFG R NGG + LYPGDI
Sbjct: 361 GGSHKSSKNKVPLPAQN---NSMPESCINNKGESSGYYDQYLWFGPRGNGGLSNLYPGDI 420

Query: 421 IPFTRRPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKN 480
           IPFTRRP+FLI+DS+NSHAFK               VLHGAERGETAA+ LSPLRPAFKN
Sbjct: 421 IPFTRRPLFLIIDSDNSHAFK---------------VLHGAERGETAALFLSPLRPAFKN 480

Query: 481 PLNVDTIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCT 540
           P + D +Q+GSQFTFFLTAPL AF ++VG SS++ + +VYN+AE+I+S+AFSEWE +LCT
Sbjct: 481 PADADLMQNGSQFTFFLTAPLSAFCQLVGFSSSDTEAEVYNNAESILSAAFSEWEVILCT 540

Query: 541 STSLNIVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKSG 600
           STSL++VWAQV+SD FLRRLILRFIFCR+VLSFF   ED +  LPICLP LP SV+  S 
Sbjct: 541 STSLDLVWAQVISDPFLRRLILRFIFCRAVLSFFCPPEDSEQYLPICLPLLPISVSPDSE 600

Query: 601 VVCSAICRLAKHLNVADLFNF 611
           VV SA+ R+AKHL+V+D F+F
Sbjct: 601 VVQSAVHRVAKHLSVSDFFHF 603

BLAST of CmaCh04G008120 vs. TrEMBL
Match: W9S1S4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_019798 PE=4 SV=1)

HSP 1 Score: 812.4 bits (2097), Expect = 8.3e-232
Identity = 422/616 (68.51%), Positives = 494/616 (80.19%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           M++++ V++TFRALVESADRKFARV+DVPAYGRV N HYF KVFKAYMRLWK+QQE RA+
Sbjct: 1   MSENDVVSRTFRALVESADRKFARVRDVPAYGRVQNQHYFQKVFKAYMRLWKYQQENRAR 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKD 120
           LVE+GLNR EIGEIASRIGQLYFG YMRTSEARFL+EA VFYEAILNR YF+ SK   KD
Sbjct: 61  LVEAGLNRWEIGEIASRIGQLYFGQYMRTSEARFLVEACVFYEAILNRGYFQGSKGFGKD 120

Query: 121 LGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEI 180
           LG RFK+LRFYARFLLVS +LNRT  V+ L +R KALVDDSK  FR T+FKEWR VVQEI
Sbjct: 121 LGVRFKELRFYARFLLVSFILNRTEMVKHLLDRFKALVDDSKANFRDTNFKEWRQVVQEI 180

Query: 181 FCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
             FMKV T  MNVRPLRY A+FDSH  SLP+++RFHAK+VLKF+DA+LTSYHRNEVKFAE
Sbjct: 181 VRFMKVDTAFMNVRPLRYCATFDSHPKSLPYLSRFHAKKVLKFQDALLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPN 300
           +TLDT+RMLQCLEWEP G FYQK P+EP ENG+  DHSGASG+ID+NLA DMTDP+LPPN
Sbjct: 241 LTLDTFRMLQCLEWEPTGSFYQKRPIEPKENGSFTDHSGASGLIDMNLAADMTDPTLPPN 300

Query: 301 PKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVR 360
           P+KAILYRPSVTHLIAV+AT+CEEL PD I+L+YLSA+GK  +N V QM   G S+K  R
Sbjct: 301 PRKAILYRPSVTHLIAVLATICEELPPDGIVLVYLSASGKVGRN-VTQMECVGGSQKLPR 360

Query: 361 SKVITQNSRENCNALPESCKSEKRGSSD-LYDEYLWFGHRSNGGPNVLYPGDIIPFTRRP 420
           +K ++ +  E  N L +SC S+K  SS   YD+YLWFG   NGG N LYPGDI+PFTRRP
Sbjct: 361 NKAVSLH--EQNNTLTDSCVSDKGDSSSGFYDKYLWFGPGGNGGLNNLYPGDIVPFTRRP 420

Query: 421 VFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVDTI 480
           +FLI+DS++SHAFKA                   ERGE AA+ LSPL P FKNP + DT 
Sbjct: 421 LFLIIDSDSSHAFKA-------------------ERGEPAALFLSPLLPVFKNPSDADTT 480

Query: 481 QSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLNIV 540
           Q+GSQFTFFLTAPL AF ++VGLS  + +  VYNDAE I+S+AFSEWE +LCTST L++V
Sbjct: 481 QNGSQFTFFLTAPLSAFCQLVGLSYLDTEAGVYNDAEEILSTAFSEWEVILCTSTILDLV 540

Query: 541 WAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKSGVVCSAIC 600
           WAQVLSD FLRRLILRFIFCRSVL FF + ED++  LP CLP LP SV+ +S VV +A+ 
Sbjct: 541 WAQVLSDPFLRRLILRFIFCRSVLYFFCSPEDNEQCLPTCLPSLPGSVSPESDVVRAAVL 594

Query: 601 RLAKHLNVADLFNFHE 613
           RLAKHL VAD F+F +
Sbjct: 601 RLAKHLRVADCFHFDD 594

BLAST of CmaCh04G008120 vs. TrEMBL
Match: B9HF83_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05040g PE=4 SV=1)

HSP 1 Score: 785.8 bits (2028), Expect = 8.3e-224
Identity = 410/624 (65.71%), Positives = 487/624 (78.04%), Query Frame = 1

Query: 3   DSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAKLV 62
           +++ V++TFRALVESADRKF RV+D+P YGR   +HYF KVFKAYMRLWK+QQE R+KLV
Sbjct: 5   ENDVVSQTFRALVESADRKFGRVRDLPLYGRAPQNHYFQKVFKAYMRLWKYQQENRSKLV 64

Query: 63  ESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEE--SKNSRKD 122
           +SGLNR EIGEIASRIGQLYF  YMR+SEARFL+EAYVFYEAIL R YF+   S   + D
Sbjct: 65  DSGLNRWEIGEIASRIGQLYFNQYMRSSEARFLVEAYVFYEAILERKYFDARGSGKPKVD 124

Query: 123 LGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEI 182
           +G RFK+LRFYARFLLV+L+ N+   V++LAER K LVDDS   FR T+FKEW+LVVQEI
Sbjct: 125 VGVRFKELRFYARFLLVALIFNKVDMVRLLAERFKGLVDDSMTKFRETNFKEWKLVVQEI 184

Query: 183 FCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 242
           F FM+V T   NVRPLRY A FDSH  S P++ARFHA++++KFRDA+LTSYHRNEVKFAE
Sbjct: 185 FRFMEVGTAFTNVRPLRYCALFDSHPASRPYLARFHARKIVKFRDALLTSYHRNEVKFAE 244

Query: 243 ITLDTYRMLQCLEWEP-GFFYQK---------HPVEPNENGATIDHSG-ASGIIDINLAT 302
           +TLDTYRM+QCLEWEP G FYQK         HPVE NENG  IDHSG ASG+IDINLA 
Sbjct: 245 LTLDTYRMMQCLEWEPSGSFYQKRPVESVYQKHPVESNENGTVIDHSGAASGLIDINLAA 304

Query: 303 DMTDPSLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMA 362
           D+TDP+LPPNP+KA+LYRPSVTHL+AVMAT+CEEL P++I+LIYLSA+GK   ++++Q  
Sbjct: 305 DLTDPTLPPNPRKAVLYRPSVTHLLAVMATICEELPPETIVLIYLSASGKAAHSNLSQSG 364

Query: 363 SYGESRKSVRSKVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPG 422
           S G SRKS + KV++    E+ ++ PES  + KR SSD YD YLW G R  GG N LYPG
Sbjct: 365 SSGGSRKSSKDKVVSGVYGEDKSSAPESHCNGKRESSDYYDNYLWLGPRGYGGSNALYPG 424

Query: 423 DIIPFTRRPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAF 482
           DIIPFTRRP+FLI+DS++SHAFK                   AERGE AA+LLSPL+PAF
Sbjct: 425 DIIPFTRRPLFLIIDSDSSHAFK-------------------AERGEPAALLLSPLKPAF 484

Query: 483 KNPLNVDTIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVL 542
           KN   VDT   GSQFTFFLTAPL AF +MVGL+SA+ D D YNDAE I+S AFSEWE ++
Sbjct: 485 KNLSAVDTSHCGSQFTFFLTAPLQAFCQMVGLTSADSDMDFYNDAEEILSIAFSEWEVII 544

Query: 543 CTSTSLNIVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD-LPICLPCLPDSVASKS 602
           CTS  L++VWAQVLSD FLRRLILRFIFCRSVLS F   ED+  LPICLP LP+SV+++S
Sbjct: 545 CTSKGLDLVWAQVLSDPFLRRLILRFIFCRSVLSVFCPLEDEQYLPICLPHLPNSVSARS 604

Query: 603 GVVCSAICRLAKHLNVADLFNFHE 613
            VV SA+ RLA HL VAD F F +
Sbjct: 605 EVVQSAVFRLANHLKVADCFQFDD 609

BLAST of CmaCh04G008120 vs. TrEMBL
Match: D7TRL9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0191g00140 PE=4 SV=1)

HSP 1 Score: 785.4 bits (2027), Expect = 1.1e-223
Identity = 413/615 (67.15%), Positives = 487/615 (79.19%), Query Frame = 1

Query: 5   NSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAKLVES 64
           N VA+TFRALVESAD+KFARV+DVP YGR  NH YF KVFKAY RLW++QQE R KLVE+
Sbjct: 4   NVVARTFRALVESADQKFARVRDVPTYGRGPNH-YFQKVFKAYTRLWQYQQENRLKLVEA 63

Query: 65  GLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKDLGAR 124
           GL R EIGEIASRIGQLYFG YMRTSEARFL+E+Y+FYEAILNR YF+ SK S KDLG R
Sbjct: 64  GLQRWEIGEIASRIGQLYFGQYMRTSEARFLLESYIFYEAILNRGYFQGSKGSMKDLGVR 123

Query: 125 FKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGT--DFKEWRLVVQEIFC 184
           FK+LRFYARFLLVSLLLNR+  V+VL ER +ALVDD KV FR T  +F+EW LVVQEI  
Sbjct: 124 FKELRFYARFLLVSLLLNRSEMVRVLVERFRALVDDCKVTFRETKNNFREWTLVVQEIVR 183

Query: 185 FMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEIT 244
           FMKV T  MNVRPLRY   FDS+  S+P+VARFHAK++LKFRDA+L SYHRNEVKFAE+T
Sbjct: 184 FMKVETAFMNVRPLRYCNLFDSYPSSVPYVARFHAKKMLKFRDALLASYHRNEVKFAELT 243

Query: 245 LDTYRMLQCLEWEP-GFFYQKHPVE----PNENGATIDHSGASGIIDINLATDMTDPSLP 304
           LDTYRMLQCLEWEP G FYQK  VE     NENGA  DHSG SG+IDINL  DMTDP+LP
Sbjct: 244 LDTYRMLQCLEWEPSGSFYQKRQVELNEKLNENGALTDHSGTSGLIDINLVVDMTDPTLP 303

Query: 305 PNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKS 364
           PNP+KAILYRPSVTHLIAV+AT+CEEL PDS+ML+YLSA+GK  + +++QM + G +RKS
Sbjct: 304 PNPRKAILYRPSVTHLIAVIATICEELPPDSVMLVYLSASGKSGRGNISQMENSGGTRKS 363

Query: 365 VRSKVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGP-NVLYPGDIIPFTR 424
            + KV+++ S E  +++PE+  + K  SSD ++ +L    + NGG  N LYPGDIIPFTR
Sbjct: 364 SKVKVVSEISHELDSSMPEAPVNNKGHSSDCHENFLCLCPKGNGGSNNNLYPGDIIPFTR 423

Query: 425 RPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVD 484
           RP+FLI+DS+NSHAFK                LHGAERGETAA+LLSPL+PAFK P   D
Sbjct: 424 RPLFLIIDSDNSHAFKD---------------LHGAERGETAALLLSPLKPAFKGPFGFD 483

Query: 485 TIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLN 544
           T Q+GSQFTFFLTAPL AF+E+VGLS ++ D+D YNDA+ I+S+A SEWE +LCTSTSL+
Sbjct: 484 TTQNGSQFTFFLTAPLQAFYELVGLSFSDTDSDAYNDADKILSTALSEWEVILCTSTSLD 543

Query: 545 IVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKSGVVCSA 604
           +VWAQVLSD FLRRLILRFIFCR VLS F   ED +  LP+CLP LP SV+  S  V SA
Sbjct: 544 LVWAQVLSDPFLRRLILRFIFCRCVLSLFCPPEDSEQYLPLCLPQLPSSVSPNSDAVQSA 602

Query: 605 ICRLAKHLNVADLFN 610
           + +LA HL VA+ F+
Sbjct: 604 VLQLANHLGVAECFS 602

BLAST of CmaCh04G008120 vs. TAIR10
Match: AT4G40050.1 (AT4G40050.1 Protein of unknown function (DUF3550/UPF0682))

HSP 1 Score: 683.3 bits (1762), Expect = 2.9e-196
Identity = 363/609 (59.61%), Positives = 463/609 (76.03%), Query Frame = 1

Query: 7   VAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAKLVESGL 66
           V+  FRALVE+ADRKFARV+D+PA+GR  +H YF KVFKAYM+LW +QQ  R+KLVESGL
Sbjct: 6   VSSNFRALVENADRKFARVRDLPAFGRAQSH-YFQKVFKAYMKLWNYQQSHRSKLVESGL 65

Query: 67  NRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKDLGARFK 126
           NR EIGEIASRIGQLYF  YMRTSEARFL+EA+VFYEAIL R+YF+E++   KDLGARFK
Sbjct: 66  NRWEIGEIASRIGQLYFSQYMRTSEARFLLEAFVFYEAILKRSYFDEAQG--KDLGARFK 125

Query: 127 QLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEIFCFMKV 186
           +LRFYARFLLVSL+++R   +  LA++L+ LVD S   FR T+FKEWRLVVQEI  F++ 
Sbjct: 126 ELRFYARFLLVSLIVDRKQMLLHLADKLRLLVDHSISNFRETNFKEWRLVVQEITRFIES 185

Query: 187 ATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDTY 246
            T    +RPLRY A  DS+  S  ++ARFHAK++ KFRDA+L SYHRNEVK+AE+TLDTY
Sbjct: 186 DTNLTYLRPLRYCAMLDSYPASQTYLARFHAKKLFKFRDALLASYHRNEVKYAEVTLDTY 245

Query: 247 RMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPKKAIL 306
           RM+QCLEWEP G FYQK PVE  ENG  +DH+  SG+ID+NLA DM DPSLPPNP+KAIL
Sbjct: 246 RMMQCLEWEPSGSFYQKKPVEAKENGFVVDHTLTSGLIDMNLAADMADPSLPPNPRKAIL 305

Query: 307 YRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRSKVITQ 366
           YRP+V+HL+AV+A +C+EL P+++ML+YLSA+G   + +V Q  +   S ++ +SK++ +
Sbjct: 306 YRPTVSHLLAVLAMICDELSPETVMLLYLSASGGPARENVAQPENSVGSSRTSKSKLLAR 365

Query: 367 NSRENCNALPESCKSEKRGSSDLYDEYLWFGHR-SNGGPNVLYPGDIIPFTRRPVFLIVD 426
            S+E  +   E   + K  S++ Y+ +LW G R  + G N LYPGD+IPFTR+P+FLI+D
Sbjct: 366 ASQEQKSYKSEPHSNGKLMSAEYYENHLWLGPRGGSSGSNNLYPGDLIPFTRKPLFLIID 425

Query: 427 SNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVDT-IQSGSQ 486
           S+ S AFK               VL GAERGE  A+LLSPL+P+F+NP   DT   +GSQ
Sbjct: 426 SDTSRAFK---------------VLGGAERGEPVAMLLSPLKPSFENPSTDDTEALNGSQ 485

Query: 487 FTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLNIVWAQVL 546
           FTFFLTAPL AF +M+GLS+   D ++ ++AE+I+S++FSEWET+L TS  LN+VWAQVL
Sbjct: 486 FTFFLTAPLQAFCQMLGLSNTKPDPELCDEAESILSASFSEWETILLTSKVLNLVWAQVL 545

Query: 547 SDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKSGVVCSAICRLAKH 606
            D FLRRLILRFIFCRSVL+ FS  EDDD  LP C P LP+ ++S S  V S++ RLA+H
Sbjct: 546 PDPFLRRLILRFIFCRSVLTSFSRTEDDDPYLPQCHPNLPELLSSVSKPVQSSVQRLAEH 596

Query: 607 LNVADLFNF 611
           L VA  F+F
Sbjct: 606 LGVAKSFHF 596

BLAST of CmaCh04G008120 vs. TAIR10
Match: AT3G03570.1 (AT3G03570.1 Protein of unknown function (DUF3550/UPF0682))

HSP 1 Score: 474.2 bits (1219), Expect = 2.7e-133
Identity = 272/614 (44.30%), Positives = 384/614 (62.54%), Query Frame = 1

Query: 7   VAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAKLVESGL 66
           +++ + +LV  AD+KF++++D+P Y R    +YF KVFK Y +LWKFQQE R KLVE+GL
Sbjct: 13  LSEVYWSLVNKADKKFSKIRDLPFYERSRYENYFFKVFKVYTQLWKFQQENRQKLVEAGL 72

Query: 67  NRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKDLGARFK 126
            R EIGEIASRI QLY+GHYMRTS+A +L E+YVFYEAIL R YF++     +DL    K
Sbjct: 73  KRWEIGEIASRIAQLYYGHYMRTSDAGYLSESYVFYEAILTREYFKD--GLFQDLNIANK 132

Query: 127 QLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEIFCFMKV 186
           QLRF ARFL+V L+L R   V  L ++ K L+D+ K  F+ TDFKEW++V QEI  F+K 
Sbjct: 133 QLRFLARFLMVCLVLGRREMVHQLVDQFKRLIDECKRTFQETDFKEWKVVAQEIVRFLKS 192

Query: 187 ATISMNVRPLRYSASFDSH-QLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAEITLDT 246
            T  MN+RPLRYS   D +     P      A R L+  DA+L+SY+ NEVK++E+TLD+
Sbjct: 193 DTAFMNIRPLRYSLVLDPNLDAGTP-----RASRSLRLTDAILSSYYCNEVKYSELTLDS 252

Query: 247 YRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNPKKAI 306
           +RMLQCLEWEP G  YQ         GA +  +   G+  IN +  M DP+LPPNP+KA+
Sbjct: 253 FRMLQCLEWEPSGSLYQ-------STGAKMGQNAPVGVARIN-SQSMNDPTLPPNPRKAV 312

Query: 307 LYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESR------KSV 366
           LYRPS+TH +AV+AT+CEEL    I+L+YLSA+GK  Q S + +++   +       +  
Sbjct: 313 LYRPSITHFLAVLATICEELPSHGILLLYLSASGKIGQISSSPLSARSATSVEENILRDF 372

Query: 367 RSKVITQNSRENCNALPESCKSEKRGSSDLYDE--YLWFGHRSNGGPNVLYPGDIIPFTR 426
            S  I Q +  +    P S +S ++ S D       L FG     G + +YP D++PFTR
Sbjct: 373 ESHTIKQETEPSLQITP-SGQSLRQISEDAVSTPCGLSFGSHGLTGSSYIYPSDLVPFTR 432

Query: 427 RPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVD 486
           +P+F+I+DS++S  FK +                GAE+GE AA+LLSP         +  
Sbjct: 433 KPLFIIIDSDSSTVFKNI---------------CGAEKGEPAALLLSPSHTPPLISADFS 492

Query: 487 TIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLN 546
              SGS FT FLT+P+ AF  +  +S+++M+TD++  AE ++SS+ +EW + L TS +L+
Sbjct: 493 RQPSGSLFTIFLTSPVQAFCLLSEISASDMETDIFTKAEKLLSSSMNEWASTLATSDTLH 552

Query: 547 IVWAQVLSDNFLRRLILRFIFCRSVLSFFST--KEDDDLPICLPCLPDSVASKSGVVCSA 606
            VW+Q+L D FLRRL+LRFIFCR+VL+ ++       + P C P LP+S+   +  V SA
Sbjct: 553 PVWSQILKDPFLRRLLLRFIFCRAVLALYTPVFNNKQNQPECCPSLPESLLPTAPAVQSA 595

Query: 607 ICRLAKHLNVADLF 609
           + ++A        F
Sbjct: 613 VFQMANVFGATSKF 595

BLAST of CmaCh04G008120 vs. TAIR10
Match: AT4G40030.2 (AT4G40030.2 Histone superfamily protein)

HSP 1 Score: 250.0 bits (637), Expect = 8.3e-66
Identity = 132/133 (99.25%), Positives = 133/133 (100.00%), Query Frame = 1

Query: 612 EMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKST 671
           +MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKST
Sbjct: 28  KMARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKST 87

Query: 672 ELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVT 731
           ELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVT
Sbjct: 88  ELLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVT 147

Query: 732 IMPKDIQLARRIR 745
           IMPKDIQLARRIR
Sbjct: 148 IMPKDIQLARRIR 160

BLAST of CmaCh04G008120 vs. TAIR10
Match: AT4G40040.1 (AT4G40040.1 Histone superfamily protein)

HSP 1 Score: 249.6 bits (636), Expect = 1.1e-65
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. TAIR10
Match: AT5G10980.1 (AT5G10980.1 Histone superfamily protein)

HSP 1 Score: 249.6 bits (636), Expect = 1.1e-65
Identity = 132/132 (100.00%), Positives = 132/132 (100.00%), Query Frame = 1

Query: 613 MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 672
           MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE
Sbjct: 1   MARTKQTARKSTGGKAPRKQLATKAARKSAPTTGGVKKPHRYRPGTVALREIRKYQKSTE 60

Query: 673 LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 732
           LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI
Sbjct: 61  LLIRKLPFQRLVREIAQDFKTDLRFQSHAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTI 120

Query: 733 MPKDIQLARRIR 745
           MPKDIQLARRIR
Sbjct: 121 MPKDIQLARRIR 132

BLAST of CmaCh04G008120 vs. NCBI nr
Match: gi|449458277|ref|XP_004146874.1| (PREDICTED: protein SCAI [Cucumis sativus])

HSP 1 Score: 1082.0 bits (2797), Expect = 0.0e+00
Identity = 544/613 (88.74%), Positives = 569/613 (92.82%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           MTD +  AKTFRA+VE+A+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFRAK
Sbjct: 1   MTDHDCEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKD 120
           LVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNR+YFE SKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEI 180
           LGARFK+LRFYARFLLVSLLLNRT TVQVLAERLKALVDDSK  FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM +AT S NVRPLRYS +FDSH  SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNIATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRS 360
           KKAIL+RPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMAS GESRKS+++
Sbjct: 301 KKAILHRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASVGESRKSLKN 360

Query: 361 KVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDIIPFTRRPVF 420
           KV  QNSRENCNAL ESCKSEK GSSDLYDEYLWFGHR +GGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKSEKPGSSDLYDEYLWFGHRGSGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVDTIQS 480
           LIVDSNNSHAFK               VLHGAERGETAAILLSPLRPAFKNPLNVDTIQS
Sbjct: 421 LIVDSNNSHAFK---------------VLHGAERGETAAILLSPLRPAFKNPLNVDTIQS 480

Query: 481 GSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLNIVWA 540
           GSQFTFFLTAPLPAF EMVGLSSAN+D DVYNDA+TI+SSAFS+WE +LCTSTSLNIVWA
Sbjct: 481 GSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWA 540

Query: 541 QVLSDNFLRRLILRFIFCRSVLSFFSTKEDDDLPICLPCLPDSVASKSGVVCSAICRLAK 600
           QVLSD+FLRRLILRFIFCRSVLSFF+TKEDDDLP+CLPCLPDSV+S SGVV SAI RLAK
Sbjct: 541 QVLSDHFLRRLILRFIFCRSVLSFFNTKEDDDLPVCLPCLPDSVSSNSGVVSSAIRRLAK 598

Query: 601 HLNVADLFNFHEM 614
           HLNVADLFNFHE+
Sbjct: 601 HLNVADLFNFHEV 598

BLAST of CmaCh04G008120 vs. NCBI nr
Match: gi|659107684|ref|XP_008453803.1| (PREDICTED: protein SCAI isoform X1 [Cucumis melo])

HSP 1 Score: 1079.7 bits (2791), Expect = 0.0e+00
Identity = 545/613 (88.91%), Positives = 569/613 (92.82%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           MTD++S AKTFRA+VE+A+RKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWK+QQEFRAK
Sbjct: 1   MTDNDSEAKTFRAMVENANRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKYQQEFRAK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSRKD 120
           LVESGLNR EIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNR+YFE SKNSRKD
Sbjct: 61  LVESGLNRWEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRSYFEGSKNSRKD 120

Query: 121 LGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQEI 180
           LGARFK+LRFYARFLLVSLLLNRT TVQVLAERLKALVDDSK  FR TDFKEWRLVVQEI
Sbjct: 121 LGARFKELRFYARFLLVSLLLNRTDTVQVLAERLKALVDDSKATFRATDFKEWRLVVQEI 180

Query: 181 FCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKFAE 240
           FCFM VAT S NVRPLRYS +FDSH  SLPFV RFHAKRVLKFRDAVLTSYHRNEVKFAE
Sbjct: 181 FCFMNVATASTNVRPLRYSTAFDSHPPSLPFVGRFHAKRVLKFRDAVLTSYHRNEVKFAE 240

Query: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGATIDHSGASGIIDINLATDMTDPSLPPNP 300
           ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGA IDHSGASGIIDINLATD+TDPSLPPNP
Sbjct: 241 ITLDTYRMLQCLEWEPGFFYQKHPVEPNENGAGIDHSGASGIIDINLATDVTDPSLPPNP 300

Query: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGESRKSVRS 360
           KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQM S+GESRKS+++
Sbjct: 301 KKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQM-SFGESRKSLKN 360

Query: 361 KVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDIIPFTRRPVF 420
           KV  QNSRENCNAL ESCK EK GSSDLYDEYLWFGHR NGGPNVLYPGDIIPFTRRPVF
Sbjct: 361 KVTAQNSRENCNALAESCKLEKPGSSDLYDEYLWFGHRGNGGPNVLYPGDIIPFTRRPVF 420

Query: 421 LIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNPLNVDTIQS 480
           LIVDSNNSHAFK               VLHGAERGETAAILLSPLRPAFKNPLNVDTIQS
Sbjct: 421 LIVDSNNSHAFK---------------VLHGAERGETAAILLSPLRPAFKNPLNVDTIQS 480

Query: 481 GSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTSTSLNIVWA 540
           GSQFTFFLTAPLPAF EMVGLSSAN+D DVYNDA+TI+SSAFS+WE +LCTSTSLNIVWA
Sbjct: 481 GSQFTFFLTAPLPAFCEMVGLSSANLDIDVYNDADTILSSAFSDWEIILCTSTSLNIVWA 540

Query: 541 QVLSDNFLRRLILRFIFCRSVLSFFSTKEDDDLPICLPCLPDSVASKSGVVCSAICRLAK 600
           QVLSD+FLRRLILRFIFCR+VLSFF+TKEDDDLP CLPCLPDSV+S SGVV SAI RLAK
Sbjct: 541 QVLSDHFLRRLILRFIFCRAVLSFFNTKEDDDLPFCLPCLPDSVSSNSGVVSSAIRRLAK 597

Query: 601 HLNVADLFNFHEM 614
           HLNVADLFNFHE+
Sbjct: 601 HLNVADLFNFHEV 597

BLAST of CmaCh04G008120 vs. NCBI nr
Match: gi|595840491|ref|XP_007208029.1| (hypothetical protein PRUPE_ppa003080mg [Prunus persica])

HSP 1 Score: 840.1 bits (2169), Expect = 5.3e-240
Identity = 437/621 (70.37%), Positives = 508/621 (81.80%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           M+D++ V+ TFRALVESADRKFARV+DVPAYGRV N HYFHKVFKAYMRLWK+QQE R+K
Sbjct: 1   MSDNDVVSGTFRALVESADRKFARVRDVPAYGRVHNQHYFHKVFKAYMRLWKYQQEKRSK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNS--- 120
           L+E+GLNR EIGEIASRIGQLYFG YMRTSEARFL+EAYVFYEAIL+R+YFE S +S   
Sbjct: 61  LIEAGLNRWEIGEIASRIGQLYFGQYMRTSEARFLVEAYVFYEAILSRSYFEGSNSSSKA 120

Query: 121 --RKDLGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRL 180
             +KDLG RFK+LRFYARFLLVSL+LNRT  VQ+LAER KALVDD K  +R T+FKEWRL
Sbjct: 121 FGKKDLGVRFKELRFYARFLLVSLILNRTEMVQLLAERFKALVDDCKANYRETNFKEWRL 180

Query: 181 VVQEIFCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNE 240
           VVQEI  FMKV T  MNVRPLRYS  FDSH  SLP+VARFHAKRVLKF+DA+LTSYHRNE
Sbjct: 181 VVQEIVRFMKVDTAFMNVRPLRYSTLFDSHPSSLPYVARFHAKRVLKFQDALLTSYHRNE 240

Query: 241 VKFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSG---ASGIIDINLATDM 300
           VKFAE+TLDTYRMLQCLEWEP G FYQK  VE  ENG  IDHSG   ASG+ID+NLA DM
Sbjct: 241 VKFAELTLDTYRMLQCLEWEPSGSFYQKRAVESKENGTFIDHSGASAASGVIDMNLAADM 300

Query: 301 TDPSLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASY 360
           TDP+LPPNP+KA++YRPSVTHLIAVMAT+CEEL  DSIML+YLSA+GK  +N+V Q  + 
Sbjct: 301 TDPTLPPNPRKAVIYRPSVTHLIAVMATICEELPLDSIMLVYLSASGKAGRNNVTQAYNS 360

Query: 361 GESRKSVRSKVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDI 420
           G S KS ++KV         N++PESC + K  SS  YD+YLWFG R NGG + LYPGDI
Sbjct: 361 GGSHKSSKNKVPLPAQN---NSMPESCINNKGESSGYYDQYLWFGPRGNGGLSNLYPGDI 420

Query: 421 IPFTRRPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKN 480
           IPFTRRP+FLI+DS+NSHAFK               VLHGAERGETAA+ LSPLRPAFKN
Sbjct: 421 IPFTRRPLFLIIDSDNSHAFK---------------VLHGAERGETAALFLSPLRPAFKN 480

Query: 481 PLNVDTIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCT 540
           P + D +Q+GSQFTFFLTAPL AF ++VG SS++ + +VYN+AE+I+S+AFSEWE +LCT
Sbjct: 481 PADADLMQNGSQFTFFLTAPLSAFCQLVGFSSSDTEAEVYNNAESILSAAFSEWEVILCT 540

Query: 541 STSLNIVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKSG 600
           STSL++VWAQV+SD FLRRLILRFIFCR+VLSFF   ED +  LPICLP LP SV+  S 
Sbjct: 541 STSLDLVWAQVISDPFLRRLILRFIFCRAVLSFFCPPEDSEQYLPICLPLLPISVSPDSE 600

Query: 601 VVCSAICRLAKHLNVADLFNF 611
           VV SA+ R+AKHL+V+D F+F
Sbjct: 601 VVQSAVHRVAKHLSVSDFFHF 603

BLAST of CmaCh04G008120 vs. NCBI nr
Match: gi|694379250|ref|XP_009365824.1| (PREDICTED: protein SCAI-like [Pyrus x bretschneideri])

HSP 1 Score: 830.9 bits (2145), Expect = 3.2e-237
Identity = 430/622 (69.13%), Positives = 510/622 (81.99%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           M++++ V +TFRALVE ADRKFARV+DVPAYGR+ N HYFHKVFKAYMRLWK+QQE R+K
Sbjct: 1   MSENDVVTRTFRALVEGADRKFARVRDVPAYGRLPNQHYFHKVFKAYMRLWKYQQENRSK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNSR-- 120
           LV +GLNR EIGEIASRIGQLYFG YMRTSEARFLIEAYVFYEAIL+RNYF+ S  SR  
Sbjct: 61  LVGAGLNRWEIGEIASRIGQLYFGQYMRTSEARFLIEAYVFYEAILSRNYFDGSAGSRGF 120

Query: 121 --KDLGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLV 180
             KD+G RFK+LRFYARFLLVSL+LNRT TVQ+LAER K+LVD+ K  FR T+FKEWRLV
Sbjct: 121 GKKDIGVRFKELRFYARFLLVSLILNRTETVQLLAERFKSLVDNCKANFRDTNFKEWRLV 180

Query: 181 VQEIFCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEV 240
           VQEI  FMKV T  MNVRPLRYS  FDS+  SLP+V RFHAKRVLKF+DA+LTSYHRNEV
Sbjct: 181 VQEIARFMKVDTAFMNVRPLRYSTLFDSNPASLPYVTRFHAKRVLKFQDALLTSYHRNEV 240

Query: 241 KFAEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSG---ASGIIDINLATDMT 300
           KFAE+TLDT+RMLQCLEWEP G FYQK PVEP ENG  I+HSG   ASG+ID+NLA DMT
Sbjct: 241 KFAELTLDTFRMLQCLEWEPSGSFYQKRPVEPKENGVFIEHSGASAASGVIDLNLAADMT 300

Query: 301 DPSLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYG 360
           DPSLPPNP+KA++YRPSVTHLIAVMAT+CEEL  DSIML+YLSA+GK  Q++ +Q+ + G
Sbjct: 301 DPSLPPNPRKAVIYRPSVTHLIAVMATICEELPLDSIMLVYLSASGKVDQSNTSQVYNSG 360

Query: 361 ESRKSVRSKVITQNSRENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGDII 420
            S+KS ++KV  Q+  +    LPES  + K  SS  YD+YLWFG R NGG N LYPGDII
Sbjct: 361 GSQKSSKNKVPLQDQNK---FLPESRINNKGESSGYYDQYLWFGPRGNGGSNNLYPGDII 420

Query: 421 PFTRRPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFKNP 480
           PFTRRP+FL++DS+NSHAFK               VLHGAERGETAA+LLSP+RPAFKNP
Sbjct: 421 PFTRRPLFLVIDSDNSHAFK---------------VLHGAERGETAALLLSPMRPAFKNP 480

Query: 481 LNVDTIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLCTS 540
            + D  Q+GSQFTFFLTAPL AF ++VG SS++ + +VY++AE+++S+AFSEWE +LCT+
Sbjct: 481 ADADLTQNGSQFTFFLTAPLAAFGQLVGFSSSDTEAEVYSNAESLLSAAFSEWEVILCTT 540

Query: 541 TSLNIVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKSGV 600
           TSL++VWAQV+SD FLRRLILRFIFCR+VL FF   ED +  LP+CLP LP+SV+  S V
Sbjct: 541 TSLDLVWAQVISDPFLRRLILRFIFCRAVLYFFCPPEDSEQHLPVCLPLLPNSVSPDSEV 600

Query: 601 VCSAICRLAKHLNVADLFNFHE 613
           V SA+ RLAKHL+V+DLF+  E
Sbjct: 601 VRSAVQRLAKHLSVSDLFHLVE 604

BLAST of CmaCh04G008120 vs. NCBI nr
Match: gi|470103477|ref|XP_004288163.1| (PREDICTED: protein SCAI [Fragaria vesca subsp. vesca])

HSP 1 Score: 829.3 bits (2141), Expect = 9.4e-237
Identity = 434/624 (69.55%), Positives = 499/624 (79.97%), Query Frame = 1

Query: 1   MTDSNSVAKTFRALVESADRKFARVQDVPAYGRVDNHHYFHKVFKAYMRLWKFQQEFRAK 60
           M+D++ V+ TFRALVESADRKFARV+DVPAYGRV N HYFHKVFKAYMRLWK+QQE RAK
Sbjct: 1   MSDTDVVSGTFRALVESADRKFARVRDVPAYGRVHNQHYFHKVFKAYMRLWKYQQENRAK 60

Query: 61  LVESGLNRSEIGEIASRIGQLYFGHYMRTSEARFLIEAYVFYEAILNRNYFEESKNS--R 120
           LVE+GLNR EIGEIASRIGQLYFG YMRTSEARFL EAYVFYEAIL+R YFE       +
Sbjct: 61  LVEAGLNRWEIGEIASRIGQLYFGQYMRTSEARFLAEAYVFYEAILSRRYFEGGAKGLGK 120

Query: 121 KDLGARFKQLRFYARFLLVSLLLNRTHTVQVLAERLKALVDDSKVAFRGTDFKEWRLVVQ 180
           K++G RFK+LRFYARFLLV+L+LNRT TVQ+LAER KALVDDS+  +R T+FKEWRLVVQ
Sbjct: 121 KEIGVRFKELRFYARFLLVALILNRTETVQLLAERFKALVDDSRANYRETNFKEWRLVVQ 180

Query: 181 EIFCFMKVATISMNVRPLRYSASFDSHQLSLPFVARFHAKRVLKFRDAVLTSYHRNEVKF 240
           EI  FMKV T  MNVRPLRYSA FDSH  SLP+VARFHAKRVLKF+DA+LTSYHRNE KF
Sbjct: 181 EIVRFMKVDTAFMNVRPLRYSAMFDSHPASLPYVARFHAKRVLKFQDALLTSYHRNEAKF 240

Query: 241 AEITLDTYRMLQCLEWEP-GFFYQKHPVEPNENGATIDHSGAS---GIIDINLATDMTDP 300
           AE+TLDTYRMLQCLEWEP G FY K PVE  ENG+ I+HSGAS   G+ID+NLA DMTDP
Sbjct: 241 AELTLDTYRMLQCLEWEPSGSFYHKRPVESKENGSFIEHSGASAASGVIDMNLAADMTDP 300

Query: 301 SLPPNPKKAILYRPSVTHLIAVMATVCEELLPDSIMLIYLSAAGKCCQNSVNQMASYGES 360
           SLPPNP+KA++YRPSVTHLIAVMAT+CEEL  DS+ML+YLSA+G   +++VNQ+ + G S
Sbjct: 301 SLPPNPRKAVIYRPSVTHLIAVMATICEELPVDSVMLLYLSASGTAGRSNVNQLYTSGGS 360

Query: 361 RKSVRSKVITQNSR----ENCNALPESCKSEKRGSSDLYDEYLWFGHRSNGGPNVLYPGD 420
           +KS ++KV   N      E+CN   ES        S  YD+YLWFG R NGG N LYPGD
Sbjct: 361 QKSSKNKVPLPNQNTYRAESCNGKAES--------SGYYDKYLWFGPRGNGGENNLYPGD 420

Query: 421 IIPFTRRPVFLIVDSNNSHAFKAVNLTANVETIFGFQVLHGAERGETAAILLSPLRPAFK 480
           IIPFTRRP+FLIVDS+NSHAFK               VLHGAERGETAA+ LSPLRPAFK
Sbjct: 421 IIPFTRRPLFLIVDSDNSHAFK---------------VLHGAERGETAALFLSPLRPAFK 480

Query: 481 NPLNVDTIQSGSQFTFFLTAPLPAFFEMVGLSSANMDTDVYNDAETIVSSAFSEWETVLC 540
           NP + D  Q+GSQFTFFLTAPL AF ++VGLSS++ +TDVYN AE I+S AFS+WE +LC
Sbjct: 481 NPADADLTQNGSQFTFFLTAPLSAFCQLVGLSSSDTETDVYNGAEGILSDAFSKWEIILC 540

Query: 541 TSTSLNIVWAQVLSDNFLRRLILRFIFCRSVLSFFSTKEDDD--LPICLPCLPDSVASKS 600
           TST +++VWAQV+SD FLRRLILRFIFCR+VLS F   ED +  LP+CLP LP SV+  S
Sbjct: 541 TSTKMDLVWAQVMSDPFLRRLILRFIFCRAVLSLFCPPEDSEQYLPVCLPFLPSSVSPDS 600

Query: 601 GVVCSAICRLAKHLNVADLFNFHE 613
            VV S I RLAKHL V D FNF +
Sbjct: 601 EVVQSCISRLAKHLRVDDCFNFDD 601

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
H33_ORYCO1.9e-64100.00Histone H3.3 OS=Oryza coarctata PE=2 SV=3[more]
H33_ORYSI1.9e-64100.00Histone H3.3 OS=Oryza sativa subsp. indica GN=OsI_011536 PE=3 SV=1[more]
H33_PINPS1.9e-64100.00Histone H3.3 OS=Pinus pinaster PE=2 SV=3[more]
H33_GOSHI1.9e-64100.00Histone H3.3 OS=Gossypium hirsutum GN=HIS3 PE=2 SV=3[more]
H33_TOBAC1.9e-64100.00Histone H3.3 OS=Nicotiana tabacum GN=H3 PE=1 SV=1[more]
Match Name        E-value   Identity  Description
A0A0A0KU99_CUCSA  0.0e+00   88.74     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G026910 PE=4 SV=1
M5WJ20_PRUPE      3.7e-240  70.37     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003080mg PE=4 SV=1
W9S1S4_9ROSA      8.3e-232  68.51     Uncharacterized protein OS=Morus notabilis GN=L484_019798 PE=4 SV=1
B9HF83_POPTR      8.3e-224  65.71     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0007s05040g PE=4 SV=1
D7TRL9_VITVI      1.1e-223  67.15     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0191g00140 PE=4 SV=...
Match Name   E-value   Identity  Description
AT4G40050.1  2.9e-196  59.61     Protein of unknown function (DUF3550/UPF0682)
AT3G03570.1  2.7e-133  44.30     Protein of unknown function (DUF3550/UPF0682)
AT4G40030.2  8.3e-66   99.25     Histone superfamily protein
AT4G40040.1  1.1e-65   100.00    Histone superfamily protein
AT5G10980.1  1.1e-65   100.00    Histone superfamily protein
Match Name                        E-value   Identity  Description
gi|449458277|ref|XP_004146874.1|  0.0e+00   88.74     PREDICTED: protein SCAI [Cucumis sativus]
gi|659107684|ref|XP_008453803.1|  0.0e+00   88.91     PREDICTED: protein SCAI isoform X1 [Cucumis melo]
gi|595840491|ref|XP_007208029.1|  5.3e-240  70.37     hypothetical protein PRUPE_ppa003080mg [Prunus persica]
gi|694379250|ref|XP_009365824.1|  3.2e-237  69.13     PREDICTED: protein SCAI-like [Pyrus x bretschneideri]
gi|470103477|ref|XP_004288163.1|  9.4e-237  69.55     PREDICTED: protein SCAI [Fragaria vesca subsp. vesca]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR000164  Histone_H3/CENP-A
IPR007125  Histone_H2A/H2B/H3
IPR009072  Histone-fold
IPR022709  SCAI
Vocabulary: Cellular Component
Term        Definition
GO:0000786  nucleosome
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0046982  protein heterodimerization activity
GO:0003714  transcription corepressor activity
Vocabulary: Biological Process
Term        Definition
GO:0006351  transcription, DNA-templated
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1903507 negative regulation of nucleic acid-templated transcription
biological_process GO:0045892 negative regulation of transcription, DNA-templated
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0000786 nucleosome
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0003677 DNA binding
molecular_function GO:0046982 protein heterodimerization activity
molecular_function GO:0003714 transcription corepressor activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G008120.1  CmaCh04G008120.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description     Source   Source Term       Source Description   Alignment
IPR000164  Histone H3/CENP-A   PRINTS   PR00622           HISTONEH3            coord: 670..687, score: 4.2E-77; coord: 615..629, score: 4.2E-77; coord: 629..643, score: 4.2E-77; coord: 726..747, score: 4.2E-77; coord: 646..667, score: 4.2E-77; coord: 710..726, score: 4.2E-77; coord: 692..710, score: 4.2
IPR000164  Histone H3/CENP-A   SMART    SM00428           h35                  coord: 646..748, score: 2.4
IPR000164  Histone H3/CENP-A   PROSITE  PS00322           HISTONE_H3_1         coord: 627..633, score: ...
IPR000164  Histone H3/CENP-A   PROSITE  PS00959           HISTONE_H3_2         coord: 679..687, score: ...
IPR007125  Histone H2A/H2B/H3  PFAM     PF00125           Histone              coord: 613..744, score: 4.3
IPR009072  Histone-fold        GENE3D   G3DSA:1.10.20.10                       coord: 614..746, score: 3.0
IPR009072  Histone-fold        unknown  SSF47113          Histone-fold         coord: 614..745, score: 2.01
IPR022709  Protein SCAI        PFAM     PF12070           DUF3550              coord: 13..566, score: 7.5E
None       No IPR available    unknown  Coil              Coil                 coord: 841..889, score: -; coord: 1110..1158, score: -; coord: 1047..1067, score: -; coord: 750..770, score: -; coord: 792..833, score: -; coord: 962..1028, score: -; coord: 1271..1291, score: -; coord: 1075..1102, score: ...
None       No IPR available    PANTHER  PTHR21243         FAMILY NOT NAMED     coord: 448..621, score: 0.0; coord: 1..432, score: ...
None       No IPR available    PANTHER  PTHR21243:SF13    SUBFAMILY NOT NAMED  coord: 448..621, score: 0.0; coord: 1..432, score: ...

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh04G008120  CmaCh16G006950  Cucurbita maxima (Rimu)  cmacmaB350