Csor.00g282190 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g282190
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: Cleavage and polyadenylation specificity factor subunit 2
Location: Csor_Chr16: 554541 .. 565648 (-)
RNA-Seq Expression: Csor.00g282190
Synteny: Csor.00g282190
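
The location string above can be split into its parts and used to derive the span of the gene. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text: str):
    """Split a location such as 'Csor_Chr16: 554541 .. 565648 (-)' into
    (chromosome, start, end, strand)."""
    pattern = r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
    match = re.match(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Csor_Chr16: 554541 .. 565648 (-)")

# Assumption: both end points are included in the feature.
length = end - start + 1
print(chrom, strand, length)  # Csor_Chr16 - 11108
```

The `(-)` strand means the gene is read from the reverse complement of the chromosome sequence.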
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGTCTTTGTGGAAACTTCTATATTTGCTGGAACCTGCACCTGTCACTCTCATTGTGACTGCAGTGGCTGTTACATTTGGATCAGCTTTCCGGGCCTTAAATTATGGGAAGGAAATGGAGCGAAACCGTGACTTTTCAGAAGCTTCCATTACCTTAGACAGATCCCAAGCACTAATGATCCCAGTTATGAGTTCTTGCAGTTTGCTTTTGATGTTCTACCTGTTTTCTTCTGTGTCCCAACTTCTTACTGCATTCACAGCAGTTGCTTCGGTTTCATCCCTCTTCTTCTGTTTAGCGCCTTACATGGCCTGTTTAAAGTCTCAGTTTGGATTGGCTGATCCTTACGTATCAAGGTGTTGTTCCAAGTCATTTACACGAATTCAAGGGTTATTGTTGATGGCATGTTTTGGTTTAGTTGCAGCATGGCTTGTTTCTGGGCATTGGATATTGAACAATTTGTTGGGAATTTCAATATGTGTTGCCTTTGTCAGCCATGTACGTCTCCCTAATGTTAAAATATGTGCAATGCTCCTCGTTTGTCTCTTTGTATATGATATTTTCTGGGTCTTCTTCTCTGAGAGATTCTTTGGAGCTAATGTAATGGTATCTGTAGCAACTCAGCAAGCATCGAATCCTGTTCACACAGTTGCTAATAGTCTGAGTCTTCCTGGTCTGCAATTGATAACTAAGAAGCTGGAGTTACCCGTCAAGATAGTTTTTCCAAGGAACTTACTTGGTGGAGTCATTCCAGGAAAAAATGCCACTGATTTCATGATGCTTGGCCTTGGTGATATGGTATGCTATTGTATTTTCCCTGCATTTTCTTTCGTCATATAACTTCCTCCAGCAACAGTTGTTAATATTAAGTTTGTTCATCTTTCACTGGTAAGACTTCATGAAATTAGTCACCTGCCGTTAAAATACAATTACAATGTTAATCAAGATCAGAAAATCCTGCAACTCAACATTATGCTCCATTATATCAATATGGTAATTGTTGTTTCTTGCCTGCATCTGGATACTTGGAGACCTATTTCTGTGTGAATTTCTATCTATAAAATGGAACAATAGACTTTGATAATTGATATAAACAAAAGGAAGGGGAAGACCCTAGCCACCAAGTGTGGATTTCCGTGAAGACCTTCATTCCTGTTCTGTCTTGATCAAACCCACTGTTTTTGTAATGTTCGTTGTTTCACATTGTAGGCAATTCCTGCCATGTTTCTAGCTCTAGTTCTTTGTTTTGACCATCGGAAGAGTAGGGATACAGTCAATCTCTTAGATATGCACGCAAGGGGCCACAAGTACATTTGGTATGCCCTGCCTGGTTATGCCATTGGGCTGGTGACCGCTCTAGCAGCTGGTGTTTTGACTCACTCGCCTCAACCTGCCCTATTGTATCTGGTAATGCTGTTTCATCTTTCTACGTTATTCATATCTGATGTATGCACACATATACACACACACAAGATAAAAAACAATTTCACTGATGTATGAAATTACAAAAAATGATGGATAATCCATGGTGATTACGAAAAGCTTGCCCAAGTTGTTAATGGAGTAGCATAACTATAAGAAGAAAAGATATTGGTAGATTTACATCGAGAGATAGCATGATAGACAACAGAATCATAAAGCTTGTAAAAAAGAACTTCTTTATCCAAAAAAAAAAATCTTCGGTTTTTTTTTTTTCAAAGATTCCAAAGAAAAGCCCTGATGATAAATTTCCACAACAAAGCTTTAGCTTCCCTGAAAGGATGATCCGACAAAACCATGGAAGGAAAAGTCATTTCTACATCGTGTTTTCTACGCCTACAATCTTTTCTACATCATTCATGTATGCATATTCTTATAAGAGCTTAGATTATTGGAAAATGAGTCCTGTTCCTCCATTCTCCTCCACATTTAGTAAATTATCAACTAGTTAGTTAAGTGGCATTTTTAGAGAGAAAGAGGGAGTCCCGAGATCTTTTTACATCCCGAGTCATTGTTCTTCT
AAGTTTTCGCCCTCATAAATTAATACATTGAAACTACCTTAAGCATTGAAATATTTCATGTTGGTTGAAAATTCGAATATCTACATCTACAAATGTAATATTACATTCTCAAATGAATGAGTACTCGATCACTGTTACTTATCTTCTGCTTCTTACTCTTGTATTCTTAATGGTGTCTGCAGGTGCCTTCTACATTAGGACCTGTTATTGCCATATCTTGGATAAGGAAGGATTTTTTGGAGTTATGGGAAGGGCCTTTGCCGAACCCCAATGATAAAGTACGTGAAGTAGAAGTAGTTTGAATCACTCGCCTCTTCTGCGCAATACCCCTCTTTCTTTGTTCATGAAGAGAGCTGTAAAATCATTTGCTGTATCAAATGCCATCCTTTGAAAACATTTGTTAACTGTATGAAATTGCACGGCTTACAAAAGTTCTGTTTAAACGCTCGACTCATTAATGAAAAGGGGGTAAGAATGCATCAACTTTGGTGTGGCTAACCTGGGAAAAAATGATCTCTTGCTTCGATTTCATTTACTCGATTCGATGATGCATCTAGGGACATGGATCCCTTATGTTTATATATTTAGACACTCTATCTTTTGATTTTCATAGGAAGACTCAAAAGAATCGGTAAGGATTTTATTATTCGTTGTCTAGGTTTGTATCTCTTGGAAAGGAGAAAAATTAAGTTGGTCTTCAATCACTCGGATGAAACATACTCATTTAGTTTATCACGATATACTTTTAAGATTATATGGTTCAAAAAATCTTTTTTCAATAGTTTCAAGTTTAAACTAGGCGGCTTAAAAATATTTCATACGAGTGGAGGTAGAATCAAGGTTTAAGACTTATCATGGGTACATATTACGGAACAATTTAAAATCTTATTATTTTCATTTAGCTACTTGAGTTTTTTATAAATATAAATCGAGGTCTTGAACATATTTCTACATCGTCCGGTTCTTAGTTTGAGAGGAATTAAAAAAACAAAAAAAAAAAAACTACCAAAACGACGTCGTATCCTGATAGAAAGAAGGAAACAAAAAAAAGAAAAAAGAAAAAAAAGAAAAAAACCACCCCGCCATTTTCCCTAAAACCTCCTTGCTCCTCCAAAACTAGGGAGTTGCAGATAGTTTGGAGCTTCGTATCAATGGGAACCTCTGTTCAGGTGACGCCTCTTTGCGGCGTGTACAATGAAAATCCTTTATCCTATTTGGTCTCCGTTGACGGCTTCAACTTCCTTATCGACTGTGGTTGGAACGACCACTTCGATCCTGCTCTTCTTCAACCTCTATCCAGGTTTGCTTTCTCAACATTTCTTTGCAATCTGTTTTGTTTGAGCTGAACTCGCAAGTTTCATGTATTGAAGCTGCTGAATGCCATGTGACTGTCGACTGGTACTAGCCGAGTAAGGAAATCAGTGATACCTAGCTGTTTTCGCCTGTTCTGAAACGTGTTAGGTTCCTAAGAAAATCGGTGGAATGGTTCATTAGAATTTCTTCTTACATTTGAATGTAACAATTCTTTTTTGGATTAAAGGTTCGGAATGAATAATCACTCGCTAAAAGGCTTCTCAACAATTGGGCTACAAACTCTTGATTTTTGTTTACTTCATCGCCCTTAATACGTAATTGTGGTTTTTCAGGGTGGCATCGACGATTGATGCAGTTTTGATATCACATCCTGATACACTTCACCTCGGTGCCCTTCCATATGCCATGAAACAACTTGGACTTTCTGCTCCAGTATATTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTTTATAGCGAGGAAGGTAATTTTTTTTTTTCATATAATTTGGTTCCTTAATGTTATGAATTAACCAAATCAAACAAGCTGCTACACTATTTACCTGAACTACTTGCCATCAAGTTTCTTTAAAACTCAAGAACCTAGTTGTATAACCTATTGAATAGATCCTACTAAAGTAAGCTTGTATACTTGCGGTCAAATTTGAAG
TGTTGTGAAGAAAATAGTCCTTTTTGTTGTAGCCTTGTGAGCGTGGTAGACTCCTACAATTATGTGAGTGATTACTGACCAAGCATGTTTTGGTTAGTTTAAATATGATATTTATGCTGTTAACTTATCTAGGTCAATCATTACTCGCCAGTTTATGTGGGCTTGGACCCTCGAAATTTTTGCTGTATGTTGTCTTAAAGAATTTTAATTTCTTTTTTGCTTGTGTGTGGGTGGGGGGTGGGGAGTTATTGTCGGATCCAAGTTATATATTTTGTAATTACATGGTTGCTAATCCGTTGCCGACATTTCATATTAGCATTGGTTATGAGTAGATTTTTTTTGTTTAATAATATTTAAGTCAGAATAATAGTCTCTGCTTCACCAATTTACCATTGATTCCTTATTGTTGTTTTCACTAACTTTTCTTTACTGTCTTTGTGCTGGGTGTTGCCATATGGACACCTCCCCTTTCCCTTATGCAGCAAGTATCAGAGTTTGATCTATTTACGCTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGGCTAACATACTCCCAGAATCATCATCTTTCAGGTGCCTTTTCTTGGTCTTGTTTATTGTTAGATCAAAGCTAATCGATCCCTCTTTCTCTTGGTTGCCTAATGATTGTATATGGTCTATACTGATGACCCACCTAATGTATCTTGGAAAATCATTCAAGTCATTCATCTGAGAAGCACAAACATGTTATAGCATGATATGTTACAATAATGACTGTCCTTCTGGAATTAAAAGAAAAAAAGCGAGGGAACCTCAGGACACGCCCATGACAACACATATTTTTTGAAATACTTCTGTAAAGTATGTTATGATATAGGATTTAAGATATGTGTCTCCTATGGGAAATTAAAGAAGCCAATGACATGCTTCCAAATATGCTACATTTTATAACAAAAACTTATAAAAGCAATGGAAAACCAAATATAAGTATTTTGTCATTATTCAATGTCTTAAAAGAAAAGGAAATTCAGAATAAAATATGTCCTACCATTGATATGCTTAAGTTTTTAATATTTGAACTCCTTATCGCCCCTCCTTTTGTATACTTTTTTTGTTCAAAGCTTTTGTCACTTGTTCACAGAAATTCCTATGCTCTCTTATTTTGTTGTATCCTCATGCTCGAGCTTGTTTCATTGTGTTGAAATAAAAAGAAATTATAAAAACAATTAACAATTGAGTTTGAATGTCCAATTAGCGGTGAAGTGTTAAGCTATCTAACTTATATTATATGAAGCATGGCCTACTTCTTAGACTGAATTGTTCATTCTCCATGGTTGTTCATGTAGCAATGTGATAATTTGTTAAAATTTTATTGATGAAAAAACCATCTTTCTTTGAGAATAAATGAAAGCAATAAAAAGGAGAGCCTCAATACAAAAGGAGCAAAATCAATAGGCTAATAGAAACGACTAAGCTACAAAAAGAAGCTCCAATTTAAAATATATAAATTTCATTTTGCTAGATGAGAAAAAGCTTTCATTAAAAACAATAAAAGAATATGGAGGAAGCCAGAAAGCTTAACAAAACAATGGCAAACTTGAAACCACTCAATATTTTCAACAAAAGAAGTAGATATCTCAAAATAAAGGAAAAGTAAAGTCCACAAGCCAATATTGTACTGTTTTGCACTCAGGAAGAAGGTTAGGACTTACTATTTGTTTACCATTACTTAGGTTGTACATAACTATTTTCCCCTGTTTTCTCTTCTCTGATACTTTATGACAGGCAAAGGAGAGGGAATAGTTATTGCACCTCATGTGGCTGGGCATTTATTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGATTTTAACCACCGCAAGGAAAGGTATGGTAACCGTTGTTACAAGCACCAACACCTGCGTTAAGACTACTCACACAAAAGTGTAATACACAATGCTTATTTAACTTACTATCAGGCATCTGAATGG
AACCATTCTAGAGTCATTTGTGCGACCTGCTGTATTGATAACGGATGCTTATAATGCTCTAAATAATCAGCCTTACAGGCGTCAGAAGGACAAAGAATTTGGAGGTACTTAAAATCCTGCCAGAGATTCCATCTTTTGCTTCATTCCCCCCTCCCATGCTCTTCTTGTAGCATAAAAGTATATTCTAAAAGTCCTTTGTTATGGATGCTTAGTCGTTTTTGCCAAGAATTGTTGATGGAAGTTATCATTTTGATTAACATAGATACTATTCAGAAGACCTTAAGAGCTAATGGAAATGTCTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTAGAATGGGTTAGTTGGTGAAAGACAAAACTTTTGTCTCTTCTCTTTTTTTTGTTTAAATTTTTTTATAATATTTAGTCTCATTGTTATTACTGCAGTACTGGGAAGAGGAAAGTTTAAATTTTCCCATTTTCTTTTTAACTTACGTCGCATCTAGCACAATTGATTATATCAAGAGTTTCCTAGAGTGGATGAGTGATTCAATAGCAAAGTCTTTTGAACACACACGGAACAACGCCTTTCTTCTCAAGTAAGTATTCTTCCAAGTATTACATTTTTTTTTTAATCTTTTGATAGGAAACAGGGGGTAGTTACCGGTCAATCCCATTGCCTTGAGGGTTCAAGTCTTACATATTTCTCACTCAATAGCATGCTTTATTTACCTTTGATTAAAGAAATTGAGATAGACTAAAATAAGTTTTAGTGTAGCAGAGTGCTGTGTGAAGTCATACAAATGGAACTAGGAACTCTTACCAGGCTGTTGTTTTAAATCTTGCAGGCATGTCACCCTTCTAATAAACAAAAGTGAACTTGATAATGCTCCAGATGGACCAAAGGTTGATTGCAACAAATTGTTAATTTTGTTATATGTTTACTTCCTTTTGCTTCAGGTTTACATGTATTGACGTCTATATGGGGTGTCTTATAGGTTGTTCTAGCATCAATGGCTAGTTTGGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAACGGATGCCAAAAATCTCATCCTTTTTTCTGAAAGAGGCCAGGTATTCTTTAGTAACTTGTTTACATGATTAGCAAGCCGTCTTCTAGTGATTTTTCATGTCTTGGTCTATTTATCTTCAGTCCTCGGAACTGAGTAACTGTTCTCCAAAATTGATTTGTTACACCTTGCATAAATGCCGTTCCATCTTTGTTTCCATTTTCTAAACATCTTCCATTTGACTAGAGTTTCTTTAATAGAAAATCAAAACACTTTTAATATTATTCGTAGTTTCTTTATTTGTTGTTCTTGTAGAAGTTCTAGCAAGTCATTGATTCATTGGTAGTCAATTACTCTAGGTGCACTTCGTTTCCTTGTCTTTTTCATTTTCCTCCTCGAAAAATTCCTTTGTGGCTGATGACTTAGTTTTGTTAGGGAGCTTTTTTCCTTTTCGTTCGATTTCCATCGTCCTTTGTTTGATAGGAAAATGACGGATGTCACTATCATTCTATCTTTGATAGAAGAGTTTGATTTTAGGATTGGGAGAAGGAGTGTGTGTACTGGGAGCCCTAACTACCTCTTGCCTATGTATTATTTCATTTTTTCTCAATGAAAAGTTGTTTCTCTTAAAAAATGTATACTCTTGGTGCCATGGGCGTGTCTTGCTATGACTGTGCTTGGAAGATGGAAAAGTAATTTTTTCTCCTTTCTTTCTTTCTCCCCCTTCTGCATATACCTGATCAACTGCATGCTTCTTTGAACTCTTATAATTTCTACACTTATATTGTGAATGTGTAACCCCCATCAGTTGGAAAATGTAGTTTGGAACTTTGGCCCGCATGCTGCAAGCAGATCCACCTCCCAAAGCTGTTAAGGTGACTGTGTCTAAGAGAGTCCCTTTGACTGGAGATGAGCTCGTTGCTTATGAAGAAGAGCAAAACAGGAAAAAGGAAGAAGCT
CTTAAGGCTAGTTTGCTCAAGGAGGAACAATCTAAAGCATCACATGGAACTGATAACGATACTGGTGATCCAATGATCATTGATGCTAGCAGTAATGTAGCACCAGATGGTATGATGATTGATAACTGCAGTTTTCTAGACACGTGCAAACTTTTTTTAGCAAGGAAATTATGAGCATCCAAACTTTTTACTTGAATTGGAAAAGACACTTAAAGTGAAAATAGGATTTGGTTGCATAAATATTAGCAAAATGTTTTCCCAAACTCTATCATGGAAACATAGAACACGAAGTGTACACTGATGTGCCAGGTTTTAAGATGGTTTAGAAATGCTATAAATATCCCCATTGCTTGGGCACCACCAGTTATTTTAGTTGAAGGTGTCACTATCTTTTCTTAGTCCTAAGAATATACATGCATTTCAGTGACACTAGAAATGAGGATCCTAACCTGCAATCATCAAATGGTCTGTGAAAAATTCCCCATACATGCCTATGATTGTGTTTTTTTTTATTGGAGTTGTTTTAATGTACGTTTTAATAATTGTTAGTGGAATGGATTGGTTGCTGATTGTATCTTGCAGTAGTTGGTTCACATGGAGGTGCATACCGAGACATATTTATTGATGGTTTTGTTCCTCCTTCAACAAGCGTTTCTCCAATGTTTCCCTTTTATGAAAACACTTCCGCATGGGATGATTTTGGTGAAGTAATCAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAATCAGCGCCGCATGTAAGATTCTTTGATGTAATTATCTGCATTTAATTCCTGGACACATGTAAGATTGATTGAAATGAAATGTTAAATTGGTTGATAGGTTGTGGGTTGAAATAGAAAGCCTTTCTTGTGGTAAAATTTAGGAAACCGATTTGTTTTATTTATTTATTATTATTATTATTATTTTTTTGAGAATACAAACCATTTTGTTCAATAACATTTGGAATGTAAGATCCTGCTTGGGCTGGTTTTAGGAATAGGATATATATATATATATTTAGTTTTGAGATTTATCAATATTATTAATGTTGATGAGTAACCAACATTGAGAGGAATGAAAAGTAACAGCCAAGGACTGCTATCAAATTCCCTGGGAACAGCCGTGTGAAGATCAAAATGCATCCATGTAGCCAATTCATATAGTTGGGACACACAGTCAAATGTACTTTGATGCCCTTCGTCATTTTTTGTTGTGAATGGATGATTGCTAATTTACTAGAATCATACTTACAGGGTGGTGTGGACGTGGATGGAAAACTAGATGAAACTGCTGCTAACTTGATTCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTAAGTTTATCTCCTGTTAAAGTTCTGTTTAAGGTTTATATTGTCGGCTGAATTCACATGAATGTGTATTTAATGTTATTGTCTCCAGTATATGTATATAAAGTTTATTTTAAGGAAAATATTTGACTTAATAGGAGAAGGTTATAATGAGTTAATAAAATATCTCTCTGATCTGTCTTTGTGTGTGTATGTGTATAATTTTTTTTTTTTTTTTTTTTTTTTCTGATATCACTTTTTTTTTTTTTAAAGTGCTTAATATTTATACTTTCCATTGATCATCTTTGTACCTATGTTCAATTTGATGATAGAATCAGGTTATTTATCTATTGATTGGATCAATGATTTTTTCTGTAATTTCCCTATCCAGGTCCAAGTTAAATGCTCATTGCATTACATGGATTTCGAAGGTCGTTCAGACGGGAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTATGGTGTTCTTTTACAACTGCTCTTGTGTTGTGTATGAGCTTGTGCACTTGTAACTATTGATCGTAGGGTGAAAGTTCAGTTTAGCTAAGTTGGTGTATTCTGAGAAATTTCCTATTAGCCGGGTTTTTAAATTTCCACATCTTGTTGTTGCAATAGCTTTTTGTAGACATTGT
TCAGTGTTACCACAAAAACTACTTTAATCAACGATTCAACCGTTTTGATTCTATTTATTTATTTAAATTGTTGTTTGAAGGTCTTGGTGCATGGAACTGCAGAGGCCACTGAGCATCTTAAACAACATTGCCTTAAAAATGTCTGCCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATTGATGTTACTTCTGATCTGTGTGCATATAAGGTACAGTCGAGACTATTCATTTCCTTTTCAGGTTGTCATTTCAAATATTGAGACTAACAAATCTCCAATTCTTGAATTTCAGGTACAACTTTCAGAGAAGCTGATGAGCAATGTGCTGTTTAAGAAGGTAAAGTTTCTGGAGATCCATGAAATTTTCCTCCAAAATTAAGTTGTATTTTGTATTATTGTAAGCACTCTTCCCTACGACGCCTGACATAATTTTGAAAACAGCTAGGAGATTATGAAATCTCTTGGCTTGATGCTGACGTAGGAAAGACCGAGAATGGAACGTTGTCTTTACTTCCCCTCTCAAAGGCCGCTTTGCCTCATAAATCTGTTCTTGTTGGGGATCTAAAAATGGCTGACTTCAAACAATTTCTTTCCAGCAAGGGAATACAGGTATCTTTGCCTGTGAATCCCTCCCTTCTCAATGGGAACGAATATCTCCTTTTTGACATGTATAATAGACACTAAAGCTACAATAGAGCGTTAAAAGTGAACGATATGATCGTCCTATATGCATTGCAGGTTGAATTTGCTGGGGGTGCTTTGAGATGTGGCGAGTATGTTACCCTACGCAAGGTTTCAGATGCAAGTCAGAAGGTGAGAAATTCTAAAAAAAGGGGGAAATAGATAAGTTGAATATGGCTCTTAATGTTGCTTATCGAATTTGAACCACGTTTCAGCTTCATACCACAAATGAATGCATATTTGCTTGTGATGGTGTTGTTTGGCTAGTACCATTTTCATTGGTTTGCTTCTAAGCATAAACATAGTATATTCAATTTATGCAGGGTGGTGGTTCTGGTACCCAACAAGTTGTCATCGAAGGGCCCTTATGTGAAGATTATTACAAAATTCGGGAGCTTTTGTATTCACAATTTTATTTGCTATAG

mRNA sequence

ATGGAGTCTTTGTGGAAACTTCTATATTTGCTGGAACCTGCACCTGTCACTCTCATTGTGACTGCAGTGGCTGTTACATTTGGATCAGCTTTCCGGGCCTTAAATTATGGGAAGGAAATGGAGCGAAACCGTGACTTTTCAGAAGCTTCCATTACCTTAGACAGATCCCAAGCACTAATGATCCCAGTTATGAGTTCTTGCAGTTTGCTTTTGATGTTCTACCTGTTTTCTTCTGTGTCCCAACTTCTTACTGCATTCACAGCAGTTGCTTCGGTTTCATCCCTCTTCTTCTGTTTAGCGCCTTACATGGCCTGTTTAAAGTCTCAGTTTGGATTGGCTGATCCTTACGTATCAAGGTGTTGTTCCAAGTCATTTACACGAATTCAAGGGTTATTGTTGATGGCATGTTTTGGTTTAGTTGCAGCATGGCTTGTTTCTGGGCATTGGATATTGAACAATTTGTTGGGAATTTCAATATGTGTTGCCTTTGTCAGCCATGTACGTCTCCCTAATGTTAAAATATGTGCAATGCTCCTCGTTTGTCTCTTTGTATATGATATTTTCTGGGTCTTCTTCTCTGAGAGATTCTTTGGAGCTAATGTAATGGTATCTGTAGCAACTCAGCAAGCATCGAATCCTGTTCACACAGTTGCTAATAGTCTGAGTCTTCCTGGTCTGCAATTGATAACTAAGAAGCTGGAGTTACCCGTCAAGATAGTTTTTCCAAGGAACTTACTTGGTGGAGTCATTCCAGGAAAAAATGCCACTGATTTCATGATGCTTGGCCTTGGTGATATGGCAATTCCTGCCATGTTTCTAGCTCTAGTTCTTTGTTTTGACCATCGGAAGAGTAGGGATACAGTCAATCTCTTAGATATGCACGCAAGGGGCCACAAGTACATTTGGTATGCCCTGCCTGGTTATGCCATTGGGCTGGTGACCGCTCTAGCAGCTGGTGTTTTGACTCACTCGCCTCAACCTGCCCTATTGTATCTGGTGCCTTCTACATTAGGACCTGTTATTGCCATATCTTGGATAAGGAAGGATTTTTTGGAGTTATGGGAAGGGCCTTTGCCGAACCCCAATGATAAAGTGACGCCTCTTTGCGGCGTGTACAATGAAAATCCTTTATCCTATTTGGTCTCCGTTGACGGCTTCAACTTCCTTATCGACTGTGGTTGGAACGACCACTTCGATCCTGCTCTTCTTCAACCTCTATCCAGGGTGGCATCGACGATTGATGCAGTTTTGATATCACATCCTGATACACTTCACCTCGGTGCCCTTCCATATGCCATGAAACAACTTGGACTTTCTGCTCCAGTATATTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTTTATAGCGAGGAAGCAAGTATCAGAGTTTGATCTATTTACGCTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGGCTAACATACTCCCAGAATCATCATCTTTCAGGCAAAGGAGAGGGAATAGTTATTGCACCTCATGTGGCTGGGCATTTATTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGATTTTAACCACCGCAAGGAAAGGCATCTGAATGGAACCATTCTAGAGTCATTTGTGCGACCTGCTGTATTGATAACGGATGCTTATAATGCTCTAAATAATCAGCCTTACAGGCGTCAGAAGGACAAAGAATTTGGAGATACTATTCAGAAGACCTTAAGAGCTAATGGAAATGTCTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTAGAATGGTACTGGGAAGAGGAAAGTTTAAATTTTCCCATTTTCTTTTTAACTTACGTCGCATCTAGCACAATTGATTATATCAAGAGTTTCCTAGAGTGGATGAGTGATTCAATAGCAAAGTCTTTTGAACACACACGGAACAACGCCTTTCTTCTCAAGCATGTCACCCTTCTAATAAA
CAAAAGTGAACTTGATAATGCTCCAGATGGACCAAAGGTTGTTCTAGCATCAATGGCTAGTTTGGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAACGGATGCCAAAAATCTCATCCTTTTTTCTGAAAGAGGCCAGTTTGGAACTTTGGCCCGCATGCTGCAAGCAGATCCACCTCCCAAAGCTGTTAAGGTGACTGTGTCTAAGAGAGTCCCTTTGACTGGAGATGAGCTCGTTGCTTATGAAGAAGAGCAAAACAGGAAAAAGGAAGAAGCTCTTAAGGCTAGTTTGCTCAAGGAGGAACAATCTAAAGCATCACATGGAACTGATAACGATACTGGTGATCCAATGATCATTGATGCTAGCAGTAATGTAGCACCAGATGTTGGTTCACATGGAGGTGCATACCGAGACATATTTATTGATGGTTTTGTTCCTCCTTCAACAAGCGTTTCTCCAATGTTTCCCTTTTATGAAAACACTTCCGCATGGGATGATTTTGGTGAAGTAATCAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAATCAGCGCCGCATGGTGGTGTGGACGTGGATGGAAAACTAGATGAAACTGCTGCTAACTTGATTCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTCCAAGTTAAATGCTCATTGCATTACATGGATTTCGAAGGTCGTTCAGACGGGAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTCTTGGTGCATGGAACTGCAGAGGCCACTGAGCATCTTAAACAACATTGCCTTAAAAATGTCTGCCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATTGATGTTACTTCTGATCTGTGTGCATATAAGGTACAACTTTCAGAGAAGCTGATGAGCAATGTGCTGTTTAAGAAGCTAGGAGATTATGAAATCTCTTGGCTTGATGCTGACGTAGGAAAGACCGAGAATGGAACGTTGTCTTTACTTCCCCTCTCAAAGGCCGCTTTGCCTCATAAATCTGTTCTTGTTGGGGATCTAAAAATGGCTGACTTCAAACAATTTCTTTCCAGCAAGGGAATACAGGTTGAATTTGCTGGGGGTGCTTTGAGATGTGGCGAGTATGTTACCCTACGCAAGGTTTCAGATGCAAGTCAGAAGGGTGGTGGTTCTGGTACCCAACAAGTTGTCATCGAAGGGCCCTTATGTGAAGATTATTACAAAATTCGGGAGCTTTTGTATTCACAATTTTATTTGCTATAG

Coding sequence (CDS)

ATGGAGTCTTTGTGGAAACTTCTATATTTGCTGGAACCTGCACCTGTCACTCTCATTGTGACTGCAGTGGCTGTTACATTTGGATCAGCTTTCCGGGCCTTAAATTATGGGAAGGAAATGGAGCGAAACCGTGACTTTTCAGAAGCTTCCATTACCTTAGACAGATCCCAAGCACTAATGATCCCAGTTATGAGTTCTTGCAGTTTGCTTTTGATGTTCTACCTGTTTTCTTCTGTGTCCCAACTTCTTACTGCATTCACAGCAGTTGCTTCGGTTTCATCCCTCTTCTTCTGTTTAGCGCCTTACATGGCCTGTTTAAAGTCTCAGTTTGGATTGGCTGATCCTTACGTATCAAGGTGTTGTTCCAAGTCATTTACACGAATTCAAGGGTTATTGTTGATGGCATGTTTTGGTTTAGTTGCAGCATGGCTTGTTTCTGGGCATTGGATATTGAACAATTTGTTGGGAATTTCAATATGTGTTGCCTTTGTCAGCCATGTACGTCTCCCTAATGTTAAAATATGTGCAATGCTCCTCGTTTGTCTCTTTGTATATGATATTTTCTGGGTCTTCTTCTCTGAGAGATTCTTTGGAGCTAATGTAATGGTATCTGTAGCAACTCAGCAAGCATCGAATCCTGTTCACACAGTTGCTAATAGTCTGAGTCTTCCTGGTCTGCAATTGATAACTAAGAAGCTGGAGTTACCCGTCAAGATAGTTTTTCCAAGGAACTTACTTGGTGGAGTCATTCCAGGAAAAAATGCCACTGATTTCATGATGCTTGGCCTTGGTGATATGGCAATTCCTGCCATGTTTCTAGCTCTAGTTCTTTGTTTTGACCATCGGAAGAGTAGGGATACAGTCAATCTCTTAGATATGCACGCAAGGGGCCACAAGTACATTTGGTATGCCCTGCCTGGTTATGCCATTGGGCTGGTGACCGCTCTAGCAGCTGGTGTTTTGACTCACTCGCCTCAACCTGCCCTATTGTATCTGGTGCCTTCTACATTAGGACCTGTTATTGCCATATCTTGGATAAGGAAGGATTTTTTGGAGTTATGGGAAGGGCCTTTGCCGAACCCCAATGATAAAGTGACGCCTCTTTGCGGCGTGTACAATGAAAATCCTTTATCCTATTTGGTCTCCGTTGACGGCTTCAACTTCCTTATCGACTGTGGTTGGAACGACCACTTCGATCCTGCTCTTCTTCAACCTCTATCCAGGGTGGCATCGACGATTGATGCAGTTTTGATATCACATCCTGATACACTTCACCTCGGTGCCCTTCCATATGCCATGAAACAACTTGGACTTTCTGCTCCAGTATATTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTTTATAGCGAGGAAGCAAGTATCAGAGTTTGATCTATTTACGCTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGGCTAACATACTCCCAGAATCATCATCTTTCAGGCAAAGGAGAGGGAATAGTTATTGCACCTCATGTGGCTGGGCATTTATTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGATTTTAACCACCGCAAGGAAAGGCATCTGAATGGAACCATTCTAGAGTCATTTGTGCGACCTGCTGTATTGATAACGGATGCTTATAATGCTCTAAATAATCAGCCTTACAGGCGTCAGAAGGACAAAGAATTTGGAGATACTATTCAGAAGACCTTAAGAGCTAATGGAAATGTCTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTAGAATGGTACTGGGAAGAGGAAAGTTTAAATTTTCCCATTTTCTTTTTAACTTACGTCGCATCTAGCACAATTGATTATATCAAGAGTTTCCTAGAGTGGATGAGTGATTCAATAGCAAAGTCTTTTGAACACACACGGAACAACGCCTTTCTTCTCAAGCATGTCACCCTTCTAATAAA
CAAAAGTGAACTTGATAATGCTCCAGATGGACCAAAGGTTGTTCTAGCATCAATGGCTAGTTTGGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAACGGATGCCAAAAATCTCATCCTTTTTTCTGAAAGAGGCCAGTTTGGAACTTTGGCCCGCATGCTGCAAGCAGATCCACCTCCCAAAGCTGTTAAGGTGACTGTGTCTAAGAGAGTCCCTTTGACTGGAGATGAGCTCGTTGCTTATGAAGAAGAGCAAAACAGGAAAAAGGAAGAAGCTCTTAAGGCTAGTTTGCTCAAGGAGGAACAATCTAAAGCATCACATGGAACTGATAACGATACTGGTGATCCAATGATCATTGATGCTAGCAGTAATGTAGCACCAGATGTTGGTTCACATGGAGGTGCATACCGAGACATATTTATTGATGGTTTTGTTCCTCCTTCAACAAGCGTTTCTCCAATGTTTCCCTTTTATGAAAACACTTCCGCATGGGATGATTTTGGTGAAGTAATCAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAATCAGCGCCGCATGGTGGTGTGGACGTGGATGGAAAACTAGATGAAACTGCTGCTAACTTGATTCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTCCAAGTTAAATGCTCATTGCATTACATGGATTTCGAAGGTCGTTCAGACGGGAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTCTTGGTGCATGGAACTGCAGAGGCCACTGAGCATCTTAAACAACATTGCCTTAAAAATGTCTGCCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATTGATGTTACTTCTGATCTGTGTGCATATAAGGTACAACTTTCAGAGAAGCTGATGAGCAATGTGCTGTTTAAGAAGCTAGGAGATTATGAAATCTCTTGGCTTGATGCTGACGTAGGAAAGACCGAGAATGGAACGTTGTCTTTACTTCCCCTCTCAAAGGCCGCTTTGCCTCATAAATCTGTTCTTGTTGGGGATCTAAAAATGGCTGACTTCAAACAATTTCTTTCCAGCAAGGGAATACAGGTTGAATTTGCTGGGGGTGCTTTGAGATGTGGCGAGTATGTTACCCTACGCAAGGTTTCAGATGCAAGTCAGAAGGGTGGTGGTTCTGGTACCCAACAAGTTGTCATCGAAGGGCCCTTATGTGAAGATTATTACAAAATTCGGGAGCTTTTGTATTCACAATTTTATTTGCTATAG

Protein sequence

MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALMIPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRCCSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLVCLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIVFPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKYIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPNPNDKVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVINPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCEDYYKIRELLYSQFYLL
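
The protein sequence corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used for illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGGAGTCTTTGTGGAAA"  # first six codons of the CDS above
cds_end = "TTTTATTTGCTATAG"       # last five codons, including the stop

print(translate(cds_start))  # MESLWK
print(translate(cds_end))    # FYLL
```

The two fragments match the beginning (`MESLWK...`) and end (`...FYLL`) of the protein sequence listed above.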
Homology
BLAST of Csor.00g282190 vs. ExPASy Swiss-Prot
Match: Q9LKF9 (Cleavage and polyadenylation specificity factor subunit 2 OS=Arabidopsis thaliana OX=3702 GN=CPSF100 PE=1 SV=2)

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 574/735 (78.10%), Positives = 662/735 (90.07%), Query Frame = 0

Query: 364  KVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDT 423
            +VTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDT
Sbjct: 6    QVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLLSHPDT 65

Query: 424  LHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV 483
            LH+GALPYAMKQLGLSAPVY+TEPV+RLGLLTMYDQF++RKQVS+FDLFTLDDIDSAFQ 
Sbjct: 66   LHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDIDSAFQN 125

Query: 484  VTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNG 543
            V RLTYSQN+HLSGKGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNG
Sbjct: 126  VIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNG 185

Query: 544  TILESFVRPAVLITDAYNAL-NNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLE 603
            T+L+SFVRPAVLITDAY+AL  NQ  R+Q+DKEF DTI K L   GNVLLPVDTAGRVLE
Sbjct: 186  TVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLE 245

Query: 604  LIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHV 663
            L+ ILE +W +   +FPI+FLTYV+SSTIDY+KSFLEWMSDSI+KSFE +R+NAFLL+HV
Sbjct: 246  LLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHV 305

Query: 664  TLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARM 723
            TLLINK++LDNAP GPKVVLASMASLEAG++ +IFVEWA D +NL+LF+E GQFGTLARM
Sbjct: 306  TLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARM 365

Query: 724  LQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSKASHGTDND 783
            LQ+ PPPK VKVT+SKRVPL G+EL+AYEEEQNR K+EEAL+ASL+KEE++KASHG+D++
Sbjct: 366  LQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASHGSDDN 425

Query: 784  TGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVIN 843
            + +PMIID +      +GSHG AY+DI IDGFVPPS+SV+PMFP+Y+NTS WDDFGE+IN
Sbjct: 426  SSEPMIID-TKTTHDVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEWDDFGEIIN 485

Query: 844  PDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDF 903
            PDDYVIKDEDMD+ A H G DVDG+LDE  A+L+LD +PSKV+SNEL V V CSL  MD+
Sbjct: 486  PDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVSCSLVKMDY 545

Query: 904  EGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSD 963
            EGRSDGRSIKS+++HV+PLKLVLVH  AEATEHLKQHCL N+CPHVYAPQIEET+DVTSD
Sbjct: 546  EGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIEETVDVTSD 605

Query: 964  LCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGD 1023
            LCAYKVQLSEKLMSNV+FKKLGD E++W+D++VGKTE    SLLP+  AA PHK VLVGD
Sbjct: 606  LCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASPHKPVLVGD 665

Query: 1024 LKMADFKQFLSSKGIQVEFA-GGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCED 1083
            LK+ADFKQFLSSKG+QVEFA GGALRCGEYVTLRKV    QKGG SG QQ++IEGPLCED
Sbjct: 666  LKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQILIEGPLCED 725

Query: 1084 YYKIRELLYSQFYLL 1096
            YYKIR+ LYSQFYLL
Sbjct: 726  YYKIRDYLYSQFYLL 739

BLAST of Csor.00g282190 vs. ExPASy Swiss-Prot
Match: Q652P4 (Cleavage and polyadenylation specificity factor subunit 2 OS=Oryza sativa subsp. japonica OX=39947 GN=Os09g0569400 PE=2 SV=1)

HSP 1 Score: 1088.2 bits (2813), Expect = 0.0e+00
Identity = 539/735 (73.33%), Positives = 631/735 (85.85%), Query Frame = 0

Query: 364  KVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDT 423
            +VTPL G Y E PL YL++VDGF FL+DCGW D  DP+ LQPL++VA TIDAVL+SH DT
Sbjct: 6    QVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLLSHADT 65

Query: 424  LHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV 483
            +HLGALPYAMK LGLSAPVY+TEPV+RLG+LT+YD FI+R+QVS+FDLFTLDDID+AFQ 
Sbjct: 66   MHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDIDAAFQN 125

Query: 484  VTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNG 543
            V RL YSQNH L+ KGEGIVIAPHVAGH LGGT+WKITKDGEDV+YAVDFNHRKERHLNG
Sbjct: 126  VVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKERHLNG 185

Query: 544  TILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLEL 603
            T L SFVRPAVLITDAYNALNN  Y+RQ+D++F D + K L   G+VLLP+DTAGRVLE+
Sbjct: 186  TALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAGRVLEI 245

Query: 604  IQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVT 663
            + ILE YW +  L +PI+FLT V++ST+DY+KSFLEWM+DSI+KSFEHTR+NAFLLK VT
Sbjct: 246  LLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFLLKCVT 305

Query: 664  LLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARML 723
             +INK EL+   D PKVVLASMASLE G+SHDIFV+ A +AKNL+LF+E+GQFGTLARML
Sbjct: 306  QIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGTLARML 365

Query: 724  QADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDT 783
            Q DPPPKAVKVT+SKR+PL GDEL AYEEEQ R KKEEALKASL KEE+ KAS G++   
Sbjct: 366  QVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLGSNAKA 425

Query: 784  GDPMIIDASSNVAP-DVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVIN 843
             DPM+IDAS++  P + GS  G   DI IDGFVPPS+SV+PMFPF+ENTS WDDFGEVIN
Sbjct: 426  SDPMVIDASTSRKPSNAGSKFGGNVDILIDGFVPPSSSVAPMFPFFENTSEWDDFGEVIN 485

Query: 844  PDDYVIKDEDMDQS-APHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMD 903
            P+DY++K E+MD +  P  G  +D  LDE +A L+LD  PSKV+SNE+TVQVKCSL YMD
Sbjct: 486  PEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVKCSLAYMD 545

Query: 904  FEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTS 963
            FEGRSDGRS+KS+++HVAPLKLVLVHG+AEATEHLK HC KN   HVYAPQIEETIDVTS
Sbjct: 546  FEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIEETIDVTS 605

Query: 964  DLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVG 1023
            DLCAYKVQLSEKLMSNV+ KKLG++EI+W+DA+VGKT++  L+LLP S     HKSVLVG
Sbjct: 606  DLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKTDD-KLTLLPPSSTPAAHKSVLVG 665

Query: 1024 DLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCED 1083
            DLK+ADFKQFL++KG+QVEFAGGALRCGEY+TLRK+ DA QK G +G+QQ+VIEGPLCED
Sbjct: 666  DLKLADFKQFLANKGLQVEFAGGALRCGEYITLRKIGDAGQK-GSTGSQQIVIEGPLCED 725

Query: 1084 YYKIRELLYSQFYLL 1096
            YYKIRELLYSQFYLL
Sbjct: 726  YYKIRELLYSQFYLL 738

BLAST of Csor.00g282190 vs. ExPASy Swiss-Prot
Match: Q93Z32 (Signal peptide peptidase-like 1 OS=Arabidopsis thaliana OX=3702 GN=SPPL1 PE=2 SV=1)

HSP 1 Score: 612.1 bits (1577), Expect = 1.3e-173
Identity = 312/366 (85.25%), Positives = 338/366 (92.35%), Query Frame = 0

Query: 1   MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
           ME+LW LLYLLEPAP TLIVTAV VTF SAFRALNYGKEMERNRDFSEASITLD SQALM
Sbjct: 1   METLWTLLYLLEPAPATLIVTAVTVTFASAFRALNYGKEMERNRDFSEASITLDSSQALM 60

Query: 61  IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
           IPVMSSCSLLLMFYLFSSVSQLLTAFTA+ASVSSLF+ L+PY   +K+Q GL+DP++SRC
Sbjct: 61  IPVMSSCSLLLMFYLFSSVSQLLTAFTAIASVSSLFYWLSPYAVYMKTQLGLSDPFLSRC 120

Query: 121 CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
           CSKSFTRIQGLLL+AC   V AWL+SGHW+LNNLLGISIC+AFVSHVRLPN+KICAMLLV
Sbjct: 121 CSKSFTRIQGLLLVACAMTVVAWLISGHWVLNNLLGISICIAFVSHVRLPNIKICAMLLV 180

Query: 181 CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
           CLFVYDIFWVFFSERFFGANVMV+VATQQASNPVHTVANSL+LPGLQLITKKLELPVKIV
Sbjct: 181 CLFVYDIFWVFFSERFFGANVMVAVATQQASNPVHTVANSLNLPGLQLITKKLELPVKIV 240

Query: 241 FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDM-HARGHK 300
           FPRNLLGGV+PG +A+DFMMLGLGDMAIPAM LALVLCFDHRK+RD VN+ D+  ++GHK
Sbjct: 241 FPRNLLGGVVPGVSASDFMMLGLGDMAIPAMLLALVLCFDHRKTRDVVNIFDLKSSKGHK 300

Query: 301 YIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGP-L 360
           YIWYALPGYAIGLV ALAAGVLTHSPQPALLYLVPSTLGPVI +SW RKD  ELWEGP L
Sbjct: 301 YIWYALPGYAIGLVAALAAGVLTHSPQPALLYLVPSTLGPVIFMSWRRKDLAELWEGPAL 360

Query: 361 PNPNDK 365
            NP +K
Sbjct: 361 SNPIEK 366

BLAST of Csor.00g282190 vs. ExPASy Swiss-Prot
Match: Q7G7C7 (Signal peptide peptidase-like 1 OS=Oryza sativa subsp. japonica OX=39947 GN=SPPL1 PE=2 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 1.9e-153
Identity = 274/365 (75.07%), Positives = 316/365 (86.58%), Query Frame = 0

Query: 1   MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
           MESLWKL YLLEPA + LI+TAV+V + SA RAL++G+EMERN DFSEASITLDRSQALM
Sbjct: 1   MESLWKLSYLLEPASLALILTAVSVAYASASRALDHGREMERNLDFSEASITLDRSQALM 60

Query: 61  IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
           IP+ SSCSLLLMFYLFSSVS L+TAFTAVAS  +LFFCL+PY+ C++S+ G+ DP+VSRC
Sbjct: 61  IPLASSCSLLLMFYLFSSVSHLVTAFTAVASAMALFFCLSPYVNCVRSRLGVGDPFVSRC 120

Query: 121 CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
           CSK FTR+QGLL+  C G V AWLVSGHW+LNNLLGISIC+AFVSHVRLPN+KICA+LLV
Sbjct: 121 CSKPFTRLQGLLVAICVGTVVAWLVSGHWLLNNLLGISICIAFVSHVRLPNIKICALLLV 180

Query: 181 CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
           CLFVYD+FWVFFSERFFGANVMVSVATQ+ASNPVHTVAN LSLPGLQLITKKLELPVK+V
Sbjct: 181 CLFVYDVFWVFFSERFFGANVMVSVATQKASNPVHTVANKLSLPGLQLITKKLELPVKLV 240

Query: 241 FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDM-HARGHK 300
           FPR+L+GG+ PG +  D+MMLGLGDMAIP M LALVL FDHRK +D     DM  ++  K
Sbjct: 241 FPRSLMGGLAPGSSPGDYMMLGLGDMAIPGMLLALVLSFDHRKIKDMSVSQDMPPSKQRK 300

Query: 301 YIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLP 360
           Y+WYAL GY +GLVTALAAG+L+ SPQPALLYLVPSTLGPV+ +SW+R +  ELWEG  P
Sbjct: 301 YVWYALTGYGVGLVTALAAGILSQSPQPALLYLVPSTLGPVMYMSWLRNELWELWEGSRP 360

Query: 361 NPNDK 365
             NDK
Sbjct: 361 IINDK 365

BLAST of Csor.00g282190 vs. ExPASy Swiss-Prot
Match: Q9V3D6 (Probable cleavage and polyadenylation specificity factor subunit 2 OS=Drosophila melanogaster OX=7227 GN=Cpsf100 PE=1 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 5.5e-153
Identity = 302/771 (39.17%), Positives = 451/771 (58.50%), Query Frame = 0

Query: 364  KVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDT 423
            K+  + G  +E+P  Y++ +D    L+DCGW++ FD   ++ L R   T+DAVL+SHPD 
Sbjct: 6    KLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLLSHPDA 65

Query: 424  LHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV 483
             HLGALPY + +LGL+ P+Y+T PV+++G + MYD +++   + +FDLF+LDD+D+AF+ 
Sbjct: 66   YHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVDTAFEK 125

Query: 484  VTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLN 543
            +T+L Y+Q   L  KG GI I P  AGH++GGT+WKI K G ED++YA DFNH+KERHL+
Sbjct: 126  ITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHLS 185

Query: 544  GTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLE 603
            G  L+   RP++LITDAYNA   Q  RR +D++    I +T+R NGNVL+ VDTAGRVLE
Sbjct: 186  GCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLE 245

Query: 604  LIQILEWYWEEES---LNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLL 663
            L  +L+  W+ +    + + +  L  V+ + I++ KS +EWMSD + K+FE  RNN F  
Sbjct: 246  LAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQF 305

Query: 664  KHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTL 723
            KH+ L  + +++   P GPKVVLAS   LE+G++ D+FV+WA++A N I+ + R   GTL
Sbjct: 306  KHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTL 365

Query: 724  A-RMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGT 783
            A  +++   P K +++ V +RV L G EL  Y   Q  K    +    ++EE S  S   
Sbjct: 366  AMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQGEKLNPLIVKPDVEEESSSES--- 425

Query: 784  DNDTGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGE 843
            ++D    +I      V    G H          GF   +     MFP++E     D++GE
Sbjct: 426  EDDIEMSVITGKHDIVVRPEGRHH--------SGFFKSNKRHHVMFPYHEEKVKCDEYGE 485

Query: 844  VINPDDYVIKD--------------EDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVV 903
            +IN DDY I D              E++ +  P  G +          ++ L  KP+K++
Sbjct: 486  IINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDNDVQLLEKPTKLI 545

Query: 904  SNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVC 963
            S   T++V   +  +DFEGRSDG S+  ILS + P +++++HGTAE T+ + +HC +NV 
Sbjct: 546  SQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQVVARHCEQNVG 605

Query: 964  PHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDA------------ 1023
              V+ PQ  E IDVTS++  Y+V+L+E L+S + F+K  D E++W+D             
Sbjct: 606  ARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVDGRLGMRVKAIEAP 665

Query: 1024 -------DVGKTENGTLSLLPLSKAALP-HKSVLVGDLKMADFKQFLSSKGIQVEFAGGA 1083
                   D    E  TL+L  L+   +P H SVL+ +LK++DFKQ L    I  EF+GG 
Sbjct: 666  MDVTVEQDASVQEGKTLTLETLADDEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGV 725

Query: 1084 LRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCEDYYKIRELLYSQFYLL 1096
            L C       +  DA          +V +EG L E+YYKIRELLY Q+ ++
Sbjct: 726  LWCSNGTLALRRVDAG---------KVAMEGCLSEEYYKIRELLYEQYAIV 756

BLAST of Csor.00g282190 vs. NCBI nr
Match: KAG6576721.1 (Cleavage and polyadenylation specificity factor subunit 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 2170 bits (5623), Expect = 0.0
Identity = 1095/1095 (100.00%), Positives = 1095/1095 (100.00%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM
Sbjct: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC
Sbjct: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV
Sbjct: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 300
            FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY
Sbjct: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 300

Query: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN 360
            IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN
Sbjct: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN 360

Query: 361  PNDKVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISH 420
            PNDKVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISH
Sbjct: 361  PNDKVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISH 420

Query: 421  PDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSA 480
            PDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSA
Sbjct: 421  PDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSA 480

Query: 481  FQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERH 540
            FQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERH
Sbjct: 481  FQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERH 540

Query: 541  LNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRV 600
            LNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRV
Sbjct: 541  LNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRV 600

Query: 601  LELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLK 660
            LELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLK
Sbjct: 601  LELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLK 660

Query: 661  HVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLA 720
            HVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLA
Sbjct: 661  HVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLA 720

Query: 721  RMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDN 780
            RMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDN
Sbjct: 721  RMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDN 780

Query: 781  DTGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVI 840
            DTGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVI
Sbjct: 781  DTGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVI 840

Query: 841  NPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMD 900
            NPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMD
Sbjct: 841  NPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMD 900

Query: 901  FEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTS 960
            FEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTS
Sbjct: 901  FEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTS 960

Query: 961  DLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVG 1020
            DLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVG
Sbjct: 961  DLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVG 1020

Query: 1021 DLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCED 1080
            DLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCED
Sbjct: 1021 DLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCED 1080

Query: 1081 YYKIRELLYSQFYLL 1095
            YYKIRELLYSQFYLL
Sbjct: 1081 YYKIRELLYSQFYLL 1095

BLAST of Csor.00g282190 vs. NCBI nr
Match: TYK22961.1 (cleavage and polyadenylation specificity factor subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 2001 bits (5184), Expect = 0.0
Identity = 1022/1114 (91.74%), Positives = 1042/1114 (93.54%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            MESLWKLLYLLEPAP TLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM
Sbjct: 1    MESLWKLLYLLEPAPATLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCL+PYMA LKSQFGLADPYVSRC
Sbjct: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLSPYMAYLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQGLLL+AC GLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVK+CAMLLV
Sbjct: 121  CSKSFTRIQGLLLLACSGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKVCAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 300
            FPRNLLGGVIPGK+ATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLD+H RGHKY
Sbjct: 241  FPRNLLGGVIPGKHATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDIHTRGHKY 300

Query: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN 360
            IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWE     
Sbjct: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEESHTY 360

Query: 361  PND---------------------------------------------------KVTPLC 420
             ND                                                   +VTPLC
Sbjct: 361  QNDDVWFDRGKKKKKRKRHRPAIFPNTFFFFLQNEGSSRELQRAWSFIAMGTSVQVTPLC 420

Query: 421  GVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480
            GVYNENPLSYLVSVD FNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL
Sbjct: 421  GVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480

Query: 481  PYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVVTRLTY 540
            PYAMKQLGLSAPV+STEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV+TRLTY
Sbjct: 481  PYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVITRLTY 540

Query: 541  SQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESF 600
            SQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESF
Sbjct: 541  SQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESF 600

Query: 601  VRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEW 660
            VRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEW
Sbjct: 601  VRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEW 660

Query: 661  YWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVTLLINKS 720
            YWEEESLN+PIFFLTYVASSTIDYIKSFLEWMSD+IAKSFEHTRNNAFLLKHVTLLINKS
Sbjct: 661  YWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKS 720

Query: 721  ELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARMLQADPPP 780
            ELDNAPDGPKVVLASMASLEAGYSHDIFV+WA DAKNL+LFSERGQFGTLARMLQADPPP
Sbjct: 721  ELDNAPDGPKVVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPP 780

Query: 781  KAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIID 840
            KAVKVTVSKRVPLTGDEL+AYEEEQNRKKEEALKASLLKEEQSKASHG DNDTGDPMIID
Sbjct: 781  KAVKVTVSKRVPLTGDELIAYEEEQNRKKEEALKASLLKEEQSKASHGADNDTGDPMIID 840

Query: 841  ASSNVAPDVGS-HGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVINPDDYVIK 900
            ASSN APDVGS HGGAYRDI IDGFVPPSTSV+PMFPFYENTSAWDDFGEVINPDDYVIK
Sbjct: 841  ASSNAAPDVGSSHGGAYRDILIDGFVPPSTSVAPMFPFYENTSAWDDFGEVINPDDYVIK 900

Query: 901  DEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGR 960
            DEDMDQ+A H G DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGR
Sbjct: 901  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGR 960

Query: 961  SIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQ 1020
            SIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQ+EETIDVTSDLCAYKVQ
Sbjct: 961  SIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQVEETIDVTSDLCAYKVQ 1020

Query: 1021 LSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADFK 1062
            LSEKLMSNVLFKKLGDYEI+WLDA+VGKTENGTLSLLPLSKA  PHKSVLVGDLKMADFK
Sbjct: 1021 LSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAPAPHKSVLVGDLKMADFK 1080

BLAST of Csor.00g282190 vs. NCBI nr
Match: KAA0033663.1 (cleavage and polyadenylation specificity factor subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 1996 bits (5172), Expect = 0.0
Identity = 1022/1115 (91.66%), Positives = 1042/1115 (93.45%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            MESLWKLLYLLEPAP TLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM
Sbjct: 1    MESLWKLLYLLEPAPATLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCL+PYMA LKSQFGLADPYVSRC
Sbjct: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLSPYMAYLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQGLLL+AC GLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVK+CAMLLV
Sbjct: 121  CSKSFTRIQGLLLLACSGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKVCAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 300
            FPRNLLGGVIPGK+ATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLD+H RGHKY
Sbjct: 241  FPRNLLGGVIPGKHATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDIHTRGHKY 300

Query: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN 360
            IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWE     
Sbjct: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEESHTY 360

Query: 361  PND---------------------------------------------------KVTPLC 420
             ND                                                   +VTPLC
Sbjct: 361  QNDDVWFDRGKKKKKRKRHRPAIFPNTFFFFLQNEGSSRELQRAWSFIAMGTSVQVTPLC 420

Query: 421  GVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480
            GVYNENPLSYLVSVD FNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL
Sbjct: 421  GVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480

Query: 481  PYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVVTRLTY 540
            PYAMKQLGLSAPV+STEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV+TRLTY
Sbjct: 481  PYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVITRLTY 540

Query: 541  SQNHHLSG-KGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILES 600
            SQNHHLSG KGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILES
Sbjct: 541  SQNHHLSGGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILES 600

Query: 601  FVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE 660
            FVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE
Sbjct: 601  FVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE 660

Query: 661  WYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVTLLINK 720
            WYWEEESLN+PIFFLTYVASSTIDYIKSFLEWMSD+IAKSFEHTRNNAFLLKHVTLLINK
Sbjct: 661  WYWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINK 720

Query: 721  SELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARMLQADPP 780
            SELDNAPDGPKVVLASMASLEAGYSHDIFV+WA DAKNL+LFSERGQFGTLARMLQADPP
Sbjct: 721  SELDNAPDGPKVVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPP 780

Query: 781  PKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMII 840
            PKAVKVTVSKRVPLTGDEL+AYEEEQNRKKEEALKASLLKEEQSKASHG DNDTGDPMII
Sbjct: 781  PKAVKVTVSKRVPLTGDELIAYEEEQNRKKEEALKASLLKEEQSKASHGADNDTGDPMII 840

Query: 841  DASSNVAPDVGS-HGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVINPDDYVI 900
            DASSN APDVGS HGGAYRDI IDGFVPPSTSV+PMFPFYENTSAWDDFGEVINPDDYVI
Sbjct: 841  DASSNAAPDVGSSHGGAYRDILIDGFVPPSTSVAPMFPFYENTSAWDDFGEVINPDDYVI 900

Query: 901  KDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDG 960
            KDEDMDQ+A H G DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDG
Sbjct: 901  KDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDG 960

Query: 961  RSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKV 1020
            RSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQ+EETIDVTSDLCAYKV
Sbjct: 961  RSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQVEETIDVTSDLCAYKV 1020

Query: 1021 QLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADF 1062
            QLSEKLMSNVLFKKLGDYEI+WLDA+VGKTENGTLSLLPLSKA  PHKSVLVGDLKMADF
Sbjct: 1021 QLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAPAPHKSVLVGDLKMADF 1080

BLAST of Csor.00g282190 vs. NCBI nr
Match: RXH95086.1 (hypothetical protein DVH24_024770 [Malus domestica])

HSP 1 Score: 1861 bits (4821), Expect = 0.0
Identity = 932/1104 (84.42%), Positives = 1013/1104 (91.76%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            ME LWKL YLLEPAP+TL+VTAV VTFGSAFRALNYGKEME+NRD SE SITLDRSQALM
Sbjct: 1    MEPLWKLFYLLEPAPITLVVTAVGVTFGSAFRALNYGKEMEKNRDLSETSITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSS SLLLMFYLF+SVSQLLT FTAVASVSSLFFC++PY+A LKSQFGLADPYVSRC
Sbjct: 61   IPVMSSISLLLMFYLFTSVSQLLTVFTAVASVSSLFFCISPYIAYLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQ LLL+ C G V+AWLVSGHWILNNLLGISICVAFVSHVRLPN+KICAMLLV
Sbjct: 121  CSKSFTRIQALLLLLCIGTVSAWLVSGHWILNNLLGISICVAFVSHVRLPNIKICAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSER FGANVMVSVATQQASNPVHTVANSLSLPGLQ++TKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERIFGANVMVSVATQQASNPVHTVANSLSLPGLQMVTKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHA-RGHK 300
            FPRNL+GG IPG  A DFMMLGLGDMAIPAM LALVLCFDHR+S+D VNLLDMH+ +GHK
Sbjct: 241  FPRNLIGGEIPG-GARDFMMLGLGDMAIPAMLLALVLCFDHRRSKDLVNLLDMHSSKGHK 300

Query: 301  YIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLP 360
            YIWYALPGYAIGLVTALAAG+LTHSPQPALLYLVPSTLGP++ ISWIRK+  ELW+GPLP
Sbjct: 301  YIWYALPGYAIGLVTALAAGILTHSPQPALLYLVPSTLGPIVFISWIRKELAELWDGPLP 360

Query: 361  NPNDK-----VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTID 420
            + NDK     VTPLCGVYNENPLSYLVS+DGFN LIDCGWNDHFDP+LLQPLSRVAST+D
Sbjct: 361  SMNDKAHQIEVTPLCGVYNENPLSYLVSIDGFNLLIDCGWNDHFDPSLLQPLSRVASTVD 420

Query: 421  AVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTL 480
            AVL+SHPDTLHLGALPYAMKQLGLSAPV+STEPVYRLGLLTMYDQF++RKQVS+FDLFTL
Sbjct: 421  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFLSRKQVSDFDLFTL 480

Query: 481  DDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFN 540
            DDIDSAFQ  TRLTY+QNHHLSGKGEGIVI+PHV+GHLLGGT+WKITKDGEDVIYAVDFN
Sbjct: 481  DDIDSAFQNFTRLTYAQNHHLSGKGEGIVISPHVSGHLLGGTVWKITKDGEDVIYAVDFN 540

Query: 541  HRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPV 600
            HRKE+HLNG    +FVRPAVLITDAYNALNNQPYRRQKDKEF D I+KTLR++GNVLLPV
Sbjct: 541  HRKEKHLNGINQSAFVRPAVLITDAYNALNNQPYRRQKDKEFTDAIKKTLRSDGNVLLPV 600

Query: 601  DTAGRVLELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRN 660
            DTAGRV+EL+QILE  W EE+LN+PIFFLTYVASSTIDY+KSFLEWMSD+IAKSFE TR 
Sbjct: 601  DTAGRVMELVQILESCWTEENLNYPIFFLTYVASSTIDYVKSFLEWMSDAIAKSFEKTRE 660

Query: 661  NAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERG 720
            N F LK + LL++KSELD+APDGPKVVLASMASLEAG+SHDIFVEWA D KNL+LF+ER 
Sbjct: 661  NVFNLKRIRLLVSKSELDDAPDGPKVVLASMASLEAGFSHDIFVEWANDPKNLVLFTERA 720

Query: 721  QFGTLARMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSK 780
            QFG+LARMLQADPPPKAVKVT+SKRVPL G+EL+AYEEEQNR +KEEALKASL+KEE+SK
Sbjct: 721  QFGSLARMLQADPPPKAVKVTISKRVPLVGEELIAYEEEQNRIRKEEALKASLVKEEESK 780

Query: 781  ASHGTDNDTGDPMIIDASSNVA-PDV-GSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTS 840
            ASHG D +T DPMIIDAS+  + PD  G  G  YRDI IDGF PPSTSV+PMFPFYEN++
Sbjct: 781  ASHGADVNTSDPMIIDASNTHSLPDAAGPQGSGYRDILIDGFTPPSTSVAPMFPFYENST 840

Query: 841  AWDDFGEVINPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQ 900
             WDDFGEVINPDDYVIKDEDMD  A H G D+DGKLDE +A+LILD +PSKVVS ELTVQ
Sbjct: 841  EWDDFGEVINPDDYVIKDEDMDHGAMHVGGDMDGKLDEGSASLILDTRPSKVVSTELTVQ 900

Query: 901  VKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQ 960
            VKCSL YMDFEGRSD RSIKSILSH+APLKLVLVHGTAEATEHLKQHCLK+VCPHVYAPQ
Sbjct: 901  VKCSLIYMDFEGRSDARSIKSILSHMAPLKLVLVHGTAEATEHLKQHCLKHVCPHVYAPQ 960

Query: 961  IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAA 1020
            IEETIDVTSDLCAYKVQLSEKLMSNVLFKK+GDYEI+W+D++ GKTEN  LSL PLS   
Sbjct: 961  IEETIDVTSDLCAYKVQLSEKLMSNVLFKKVGDYEIAWVDSEAGKTENDMLSLQPLSNPP 1020

Query: 1021 LPHKSVLVGDLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQV 1080
             PH+SVLVGDLKMA+FKQFLS KG+Q EFAGG LRCGEYVTLRKV DAS KGGGSGTQQ+
Sbjct: 1021 PPHESVLVGDLKMANFKQFLSDKGVQAEFAGGVLRCGEYVTLRKVGDASHKGGGSGTQQI 1080

Query: 1081 VIEGPLCEDYYKIRELLYSQFYLL 1095
            VIEGPLCEDYYKIRE LYSQFYLL
Sbjct: 1081 VIEGPLCEDYYKIREYLYSQFYLL 1103

BLAST of Csor.00g282190 vs. NCBI nr
Match: VVA11124.1 (PREDICTED: cleavage and polyadenylation [Prunus dulcis])

HSP 1 Score: 1830 bits (4741), Expect = 0.0
Identity = 916/1097 (83.50%), Positives = 997/1097 (90.88%), Query Frame = 0

Query: 20   VTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALMIPVMSSCSLLLMFYLFSSV 79
            V AV VTFGSAFRALNYGKEMERNRD SE SITLDRSQALMIPVMSS SLLLMFYLFSSV
Sbjct: 75   VFAVGVTFGSAFRALNYGKEMERNRDLSETSITLDRSQALMIPVMSSISLLLMFYLFSSV 134

Query: 80   SQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRCCSKSFTRIQGLLLMACFGL 139
            SQLLT FTA+ASVSSLFFCL+PY+A LKSQFG ADPYVSRCCSKSFTRIQGLLL  C G 
Sbjct: 135  SQLLTVFTAIASVSSLFFCLSPYVAYLKSQFGFADPYVSRCCSKSFTRIQGLLLFLCIGT 194

Query: 140  VAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLVCLFVYDIFWVFFSERFFGA 199
            V AWLV+GHW+LNNLLGISIC+AFVSHVRLPN+KICAMLLVCLFVYDIFWVFFSERFFGA
Sbjct: 195  VVAWLVTGHWVLNNLLGISICIAFVSHVRLPNIKICAMLLVCLFVYDIFWVFFSERFFGA 254

Query: 200  NVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIVFPRNLLGGVIPGKNATDFM 259
            NVMVSVATQQASNPVHTVANSLSLPGLQ++TKKLELPVKIVFPRNLLGG+IPG  A DFM
Sbjct: 255  NVMVSVATQQASNPVHTVANSLSLPGLQMVTKKLELPVKIVFPRNLLGGLIPG-GAKDFM 314

Query: 260  MLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHA-RGHKYIWYALPGYAIGLVTALAA 319
            MLGLGDMAIPAM LALVLCFDHR+SRD++NLL+MH+ +GHKYIWYALPGYAIGLVTALAA
Sbjct: 315  MLGLGDMAIPAMLLALVLCFDHRRSRDSINLLEMHSSKGHKYIWYALPGYAIGLVTALAA 374

Query: 320  GVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPNPNDK-------------- 379
            GVLTHSPQPALLYLVPSTLGP++ ISWIRK+  ELW+GPLPN NDK              
Sbjct: 375  GVLTHSPQPALLYLVPSTLGPIVFISWIRKELAELWDGPLPNSNDKAHQIERVTQKKMGT 434

Query: 380  ---VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHP 439
               VTPLCGVYNENPLSYLVS+DGFNFLIDCGWNDHFDP+LL+PLSRVAST+DAVL+SHP
Sbjct: 435  SVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLEPLSRVASTVDAVLLSHP 494

Query: 440  DTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAF 499
            DTLHLGALP+AMKQLGLSA VYSTEPVYRLGLLTMYDQ+++RKQVS+FDLFTLDDIDSAF
Sbjct: 495  DTLHLGALPFAMKQLGLSAVVYSTEPVYRLGLLTMYDQYLSRKQVSDFDLFTLDDIDSAF 554

Query: 500  QVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHL 559
            Q VTRLTY+QNHHLSGKGEGIVI+PHV+GHLLGGT+WKITKDGEDVIYAVDFNHRKE+HL
Sbjct: 555  QNVTRLTYAQNHHLSGKGEGIVISPHVSGHLLGGTVWKITKDGEDVIYAVDFNHRKEKHL 614

Query: 560  NGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVL 619
            NG    SFVRPAVLITDAYNALNNQ YRRQKDKEF DTI+KTLR++GNVLLPVDTAGRVL
Sbjct: 615  NGINQASFVRPAVLITDAYNALNNQAYRRQKDKEFTDTIKKTLRSDGNVLLPVDTAGRVL 674

Query: 620  ELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKH 679
            EL+QILE  W +E+LN+PIFFLTYVASSTIDY+KSFLEWMSDSIAKSFE TR NAF+LK 
Sbjct: 675  ELVQILESCWADENLNYPIFFLTYVASSTIDYVKSFLEWMSDSIAKSFEKTRENAFILKR 734

Query: 680  VTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLAR 739
            +TLL+NKSELDNA DGPKVVLASMASLEAG+SHDIFVEWATD KNL+LF+ER QFGTLAR
Sbjct: 735  ITLLVNKSELDNASDGPKVVLASMASLEAGFSHDIFVEWATDPKNLVLFTERAQFGTLAR 794

Query: 740  MLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSKASHGTDN 799
            MLQADPPPKAVKVT+SKRVPL G+EL+AYEEEQNR +K+E LKASL+KEE+SK++ G D 
Sbjct: 795  MLQADPPPKAVKVTMSKRVPLVGEELIAYEEEQNRIRKDEVLKASLIKEEESKSAQGADV 854

Query: 800  DTGDPMIIDASS--NVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGE 859
             T DPM++DAS+  ++      HGG YRD+ IDGF PPSTS +PMFPFYEN S WDDFGE
Sbjct: 855  STSDPMVVDASNTHSLLDAAVPHGGGYRDMLIDGFTPPSTSAAPMFPFYENNSDWDDFGE 914

Query: 860  VINPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHY 919
            VINPDDYVIKD DMDQ A H G D+DGKLDE +A+LILD +PSKVV+ ELTVQVKCSL Y
Sbjct: 915  VINPDDYVIKDADMDQGAMHVGGDMDGKLDEGSASLILDTRPSKVVATELTVQVKCSLIY 974

Query: 920  MDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDV 979
            MDFEGRSD RSIKSILSH+APLKLVLVHGTAEATEHLKQHCL +VCPHVYAPQIEETIDV
Sbjct: 975  MDFEGRSDARSIKSILSHMAPLKLVLVHGTAEATEHLKQHCLTHVCPHVYAPQIEETIDV 1034

Query: 980  TSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVL 1039
            TSDLCAYKVQLSEKLMSNVLFKKLGDYEI+W+D++ GKTENG LSLLP+S  A PH+SVL
Sbjct: 1035 TSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDSEAGKTENGALSLLPISTPAPPHESVL 1094

Query: 1040 VGDLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLC 1095
            VGDLKMA+FKQFLS  G+QVEFA GALRCGEYVTLRKV DAS KGGGSGTQQ+VIEGPLC
Sbjct: 1095 VGDLKMANFKQFLSDNGVQVEFASGALRCGEYVTLRKVGDASHKGGGSGTQQIVIEGPLC 1154

BLAST of Csor.00g282190 vs. ExPASy TrEMBL
Match: A0A5D3DH28 (Cleavage and polyadenylation specificity factor subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold386G00660 PE=3 SV=1)

HSP 1 Score: 2001 bits (5184), Expect = 0.0
Identity = 1022/1114 (91.74%), Positives = 1042/1114 (93.54%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            MESLWKLLYLLEPAP TLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM
Sbjct: 1    MESLWKLLYLLEPAPATLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCL+PYMA LKSQFGLADPYVSRC
Sbjct: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLSPYMAYLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQGLLL+AC GLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVK+CAMLLV
Sbjct: 121  CSKSFTRIQGLLLLACSGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKVCAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 300
            FPRNLLGGVIPGK+ATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLD+H RGHKY
Sbjct: 241  FPRNLLGGVIPGKHATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDIHTRGHKY 300

Query: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN 360
            IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWE     
Sbjct: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEESHTY 360

Query: 361  PND---------------------------------------------------KVTPLC 420
             ND                                                   +VTPLC
Sbjct: 361  QNDDVWFDRGKKKKKRKRHRPAIFPNTFFFFLQNEGSSRELQRAWSFIAMGTSVQVTPLC 420

Query: 421  GVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480
            GVYNENPLSYLVSVD FNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL
Sbjct: 421  GVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480

Query: 481  PYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVVTRLTY 540
            PYAMKQLGLSAPV+STEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV+TRLTY
Sbjct: 481  PYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVITRLTY 540

Query: 541  SQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESF 600
            SQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESF
Sbjct: 541  SQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESF 600

Query: 601  VRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEW 660
            VRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEW
Sbjct: 601  VRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEW 660

Query: 661  YWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVTLLINKS 720
            YWEEESLN+PIFFLTYVASSTIDYIKSFLEWMSD+IAKSFEHTRNNAFLLKHVTLLINKS
Sbjct: 661  YWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKS 720

Query: 721  ELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARMLQADPPP 780
            ELDNAPDGPKVVLASMASLEAGYSHDIFV+WA DAKNL+LFSERGQFGTLARMLQADPPP
Sbjct: 721  ELDNAPDGPKVVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPP 780

Query: 781  KAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIID 840
            KAVKVTVSKRVPLTGDEL+AYEEEQNRKKEEALKASLLKEEQSKASHG DNDTGDPMIID
Sbjct: 781  KAVKVTVSKRVPLTGDELIAYEEEQNRKKEEALKASLLKEEQSKASHGADNDTGDPMIID 840

Query: 841  ASSNVAPDVGS-HGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVINPDDYVIK 900
            ASSN APDVGS HGGAYRDI IDGFVPPSTSV+PMFPFYENTSAWDDFGEVINPDDYVIK
Sbjct: 841  ASSNAAPDVGSSHGGAYRDILIDGFVPPSTSVAPMFPFYENTSAWDDFGEVINPDDYVIK 900

Query: 901  DEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGR 960
            DEDMDQ+A H G DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGR
Sbjct: 901  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGR 960

Query: 961  SIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQ 1020
            SIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQ+EETIDVTSDLCAYKVQ
Sbjct: 961  SIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQVEETIDVTSDLCAYKVQ 1020

Query: 1021 LSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADFK 1062
            LSEKLMSNVLFKKLGDYEI+WLDA+VGKTENGTLSLLPLSKA  PHKSVLVGDLKMADFK
Sbjct: 1021 LSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAPAPHKSVLVGDLKMADFK 1080

BLAST of Csor.00g282190 vs. ExPASy TrEMBL
Match: A0A5A7ST61 (Cleavage and polyadenylation specificity factor subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold239G001300 PE=3 SV=1)

HSP 1 Score: 1996 bits (5172), Expect = 0.0
Identity = 1022/1115 (91.66%), Positives = 1042/1115 (93.45%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            MESLWKLLYLLEPAP TLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM
Sbjct: 1    MESLWKLLYLLEPAPATLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCL+PYMA LKSQFGLADPYVSRC
Sbjct: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLSPYMAYLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQGLLL+AC GLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVK+CAMLLV
Sbjct: 121  CSKSFTRIQGLLLLACSGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKVCAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 300
            FPRNLLGGVIPGK+ATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLD+H RGHKY
Sbjct: 241  FPRNLLGGVIPGKHATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDIHTRGHKY 300

Query: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPN 360
            IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWE     
Sbjct: 301  IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEESHTY 360

Query: 361  PND---------------------------------------------------KVTPLC 420
             ND                                                   +VTPLC
Sbjct: 361  QNDDVWFDRGKKKKKRKRHRPAIFPNTFFFFLQNEGSSRELQRAWSFIAMGTSVQVTPLC 420

Query: 421  GVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480
            GVYNENPLSYLVSVD FNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL
Sbjct: 421  GVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGAL 480

Query: 481  PYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVVTRLTY 540
            PYAMKQLGLSAPV+STEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV+TRLTY
Sbjct: 481  PYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQVITRLTY 540

Query: 541  SQNHHLSG-KGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILES 600
            SQNHHLSG KGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILES
Sbjct: 541  SQNHHLSGGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILES 600

Query: 601  FVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE 660
            FVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE
Sbjct: 601  FVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE 660

Query: 661  WYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVTLLINK 720
            WYWEEESLN+PIFFLTYVASSTIDYIKSFLEWMSD+IAKSFEHTRNNAFLLKHVTLLINK
Sbjct: 661  WYWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINK 720

Query: 721  SELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARMLQADPP 780
            SELDNAPDGPKVVLASMASLEAGYSHDIFV+WA DAKNL+LFSERGQFGTLARMLQADPP
Sbjct: 721  SELDNAPDGPKVVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPP 780

Query: 781  PKAVKVTVSKRVPLTGDELVAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMII 840
            PKAVKVTVSKRVPLTGDEL+AYEEEQNRKKEEALKASLLKEEQSKASHG DNDTGDPMII
Sbjct: 781  PKAVKVTVSKRVPLTGDELIAYEEEQNRKKEEALKASLLKEEQSKASHGADNDTGDPMII 840

Query: 841  DASSNVAPDVGS-HGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVINPDDYVI 900
            DASSN APDVGS HGGAYRDI IDGFVPPSTSV+PMFPFYENTSAWDDFGEVINPDDYVI
Sbjct: 841  DASSNAAPDVGSSHGGAYRDILIDGFVPPSTSVAPMFPFYENTSAWDDFGEVINPDDYVI 900

Query: 901  KDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDG 960
            KDEDMDQ+A H G DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDG
Sbjct: 901  KDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDG 960

Query: 961  RSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKV 1020
            RSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQ+EETIDVTSDLCAYKV
Sbjct: 961  RSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQVEETIDVTSDLCAYKV 1020

Query: 1021 QLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADF 1062
            QLSEKLMSNVLFKKLGDYEI+WLDA+VGKTENGTLSLLPLSKA  PHKSVLVGDLKMADF
Sbjct: 1021 QLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAPAPHKSVLVGDLKMADF 1080

BLAST of Csor.00g282190 vs. ExPASy TrEMBL
Match: A0A498JJ41 (Cleavage and polyadenylation specificity factor subunit 2 OS=Malus domestica OX=3750 GN=DVH24_024770 PE=3 SV=1)

HSP 1 Score: 1861 bits (4821), Expect = 0.0
Identity = 932/1104 (84.42%), Positives = 1013/1104 (91.76%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            ME LWKL YLLEPAP+TL+VTAV VTFGSAFRALNYGKEME+NRD SE SITLDRSQALM
Sbjct: 1    MEPLWKLFYLLEPAPITLVVTAVGVTFGSAFRALNYGKEMEKNRDLSETSITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSS SLLLMFYLF+SVSQLLT FTAVASVSSLFFC++PY+A LKSQFGLADPYVSRC
Sbjct: 61   IPVMSSISLLLMFYLFTSVSQLLTVFTAVASVSSLFFCISPYIAYLKSQFGLADPYVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTRIQ LLL+ C G V+AWLVSGHWILNNLLGISICVAFVSHVRLPN+KICAMLLV
Sbjct: 121  CSKSFTRIQALLLLLCIGTVSAWLVSGHWILNNLLGISICVAFVSHVRLPNIKICAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSER FGANVMVSVATQQASNPVHTVANSLSLPGLQ++TKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERIFGANVMVSVATQQASNPVHTVANSLSLPGLQMVTKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHA-RGHK 300
            FPRNL+GG IPG  A DFMMLGLGDMAIPAM LALVLCFDHR+S+D VNLLDMH+ +GHK
Sbjct: 241  FPRNLIGGEIPG-GARDFMMLGLGDMAIPAMLLALVLCFDHRRSKDLVNLLDMHSSKGHK 300

Query: 301  YIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLP 360
            YIWYALPGYAIGLVTALAAG+LTHSPQPALLYLVPSTLGP++ ISWIRK+  ELW+GPLP
Sbjct: 301  YIWYALPGYAIGLVTALAAGILTHSPQPALLYLVPSTLGPIVFISWIRKELAELWDGPLP 360

Query: 361  NPNDK-----VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTID 420
            + NDK     VTPLCGVYNENPLSYLVS+DGFN LIDCGWNDHFDP+LLQPLSRVAST+D
Sbjct: 361  SMNDKAHQIEVTPLCGVYNENPLSYLVSIDGFNLLIDCGWNDHFDPSLLQPLSRVASTVD 420

Query: 421  AVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTL 480
            AVL+SHPDTLHLGALPYAMKQLGLSAPV+STEPVYRLGLLTMYDQF++RKQVS+FDLFTL
Sbjct: 421  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFLSRKQVSDFDLFTL 480

Query: 481  DDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFN 540
            DDIDSAFQ  TRLTY+QNHHLSGKGEGIVI+PHV+GHLLGGT+WKITKDGEDVIYAVDFN
Sbjct: 481  DDIDSAFQNFTRLTYAQNHHLSGKGEGIVISPHVSGHLLGGTVWKITKDGEDVIYAVDFN 540

Query: 541  HRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPV 600
            HRKE+HLNG    +FVRPAVLITDAYNALNNQPYRRQKDKEF D I+KTLR++GNVLLPV
Sbjct: 541  HRKEKHLNGINQSAFVRPAVLITDAYNALNNQPYRRQKDKEFTDAIKKTLRSDGNVLLPV 600

Query: 601  DTAGRVLELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRN 660
            DTAGRV+EL+QILE  W EE+LN+PIFFLTYVASSTIDY+KSFLEWMSD+IAKSFE TR 
Sbjct: 601  DTAGRVMELVQILESCWTEENLNYPIFFLTYVASSTIDYVKSFLEWMSDAIAKSFEKTRE 660

Query: 661  NAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERG 720
            N F LK + LL++KSELD+APDGPKVVLASMASLEAG+SHDIFVEWA D KNL+LF+ER 
Sbjct: 661  NVFNLKRIRLLVSKSELDDAPDGPKVVLASMASLEAGFSHDIFVEWANDPKNLVLFTERA 720

Query: 721  QFGTLARMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSK 780
            QFG+LARMLQADPPPKAVKVT+SKRVPL G+EL+AYEEEQNR +KEEALKASL+KEE+SK
Sbjct: 721  QFGSLARMLQADPPPKAVKVTISKRVPLVGEELIAYEEEQNRIRKEEALKASLVKEEESK 780

Query: 781  ASHGTDNDTGDPMIIDASSNVA-PDV-GSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTS 840
            ASHG D +T DPMIIDAS+  + PD  G  G  YRDI IDGF PPSTSV+PMFPFYEN++
Sbjct: 781  ASHGADVNTSDPMIIDASNTHSLPDAAGPQGSGYRDILIDGFTPPSTSVAPMFPFYENST 840

Query: 841  AWDDFGEVINPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQ 900
             WDDFGEVINPDDYVIKDEDMD  A H G D+DGKLDE +A+LILD +PSKVVS ELTVQ
Sbjct: 841  EWDDFGEVINPDDYVIKDEDMDHGAMHVGGDMDGKLDEGSASLILDTRPSKVVSTELTVQ 900

Query: 901  VKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQ 960
            VKCSL YMDFEGRSD RSIKSILSH+APLKLVLVHGTAEATEHLKQHCLK+VCPHVYAPQ
Sbjct: 901  VKCSLIYMDFEGRSDARSIKSILSHMAPLKLVLVHGTAEATEHLKQHCLKHVCPHVYAPQ 960

Query: 961  IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAA 1020
            IEETIDVTSDLCAYKVQLSEKLMSNVLFKK+GDYEI+W+D++ GKTEN  LSL PLS   
Sbjct: 961  IEETIDVTSDLCAYKVQLSEKLMSNVLFKKVGDYEIAWVDSEAGKTENDMLSLQPLSNPP 1020

Query: 1021 LPHKSVLVGDLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQV 1080
             PH+SVLVGDLKMA+FKQFLS KG+Q EFAGG LRCGEYVTLRKV DAS KGGGSGTQQ+
Sbjct: 1021 PPHESVLVGDLKMANFKQFLSDKGVQAEFAGGVLRCGEYVTLRKVGDASHKGGGSGTQQI 1080

Query: 1081 VIEGPLCEDYYKIRELLYSQFYLL 1095
            VIEGPLCEDYYKIRE LYSQFYLL
Sbjct: 1081 VIEGPLCEDYYKIREYLYSQFYLL 1103
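
The percentages in each HSP header are simply the printed counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from a summary line; the function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches HSP summary lines such as
# "Identity = 932/1104 (84.42%), Postives = 1013/1104 (91.76%), Query Frame = 0".
# "Posi?tives" accepts both "Postives" and "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

# Summary line of the Malus domestica hit (A0A498JJ41).
line = "Identity = 932/1104 (84.42%), Postives = 1013/1104 (91.76%), Query Frame = 0"
print(hsp_percentages(line))
```

Recomputing from the counts is a quick consistency check when these values are copied into a spreadsheet or a summary table.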

BLAST of Csor.00g282190 vs. ExPASy TrEMBL
Match: A0A5E4E5T2 (Cleavage and polyadenylation specificity factor subunit 2 OS=Prunus dulcis OX=3755 GN=ALMOND_2B028284 PE=3 SV=1)

HSP 1 Score: 1830 bits (4741), Expect = 0.0
Identity = 916/1097 (83.50%), Positives = 997/1097 (90.88%), Query Frame = 0

Query: 20   VTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALMIPVMSSCSLLLMFYLFSSV 79
            V AV VTFGSAFRALNYGKEMERNRD SE SITLDRSQALMIPVMSS SLLLMFYLFSSV
Sbjct: 75   VFAVGVTFGSAFRALNYGKEMERNRDLSETSITLDRSQALMIPVMSSISLLLMFYLFSSV 134

Query: 80   SQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRCCSKSFTRIQGLLLMACFGL 139
            SQLLT FTA+ASVSSLFFCL+PY+A LKSQFG ADPYVSRCCSKSFTRIQGLLL  C G 
Sbjct: 135  SQLLTVFTAIASVSSLFFCLSPYVAYLKSQFGFADPYVSRCCSKSFTRIQGLLLFLCIGT 194

Query: 140  VAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLVCLFVYDIFWVFFSERFFGA 199
            V AWLV+GHW+LNNLLGISIC+AFVSHVRLPN+KICAMLLVCLFVYDIFWVFFSERFFGA
Sbjct: 195  VVAWLVTGHWVLNNLLGISICIAFVSHVRLPNIKICAMLLVCLFVYDIFWVFFSERFFGA 254

Query: 200  NVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIVFPRNLLGGVIPGKNATDFM 259
            NVMVSVATQQASNPVHTVANSLSLPGLQ++TKKLELPVKIVFPRNLLGG+IPG  A DFM
Sbjct: 255  NVMVSVATQQASNPVHTVANSLSLPGLQMVTKKLELPVKIVFPRNLLGGLIPG-GAKDFM 314

Query: 260  MLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHA-RGHKYIWYALPGYAIGLVTALAA 319
            MLGLGDMAIPAM LALVLCFDHR+SRD++NLL+MH+ +GHKYIWYALPGYAIGLVTALAA
Sbjct: 315  MLGLGDMAIPAMLLALVLCFDHRRSRDSINLLEMHSSKGHKYIWYALPGYAIGLVTALAA 374

Query: 320  GVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLPNPNDK-------------- 379
            GVLTHSPQPALLYLVPSTLGP++ ISWIRK+  ELW+GPLPN NDK              
Sbjct: 375  GVLTHSPQPALLYLVPSTLGPIVFISWIRKELAELWDGPLPNSNDKAHQIERVTQKKMGT 434

Query: 380  ---VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHP 439
               VTPLCGVYNENPLSYLVS+DGFNFLIDCGWNDHFDP+LL+PLSRVAST+DAVL+SHP
Sbjct: 435  SVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLEPLSRVASTVDAVLLSHP 494

Query: 440  DTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAF 499
            DTLHLGALP+AMKQLGLSA VYSTEPVYRLGLLTMYDQ+++RKQVS+FDLFTLDDIDSAF
Sbjct: 495  DTLHLGALPFAMKQLGLSAVVYSTEPVYRLGLLTMYDQYLSRKQVSDFDLFTLDDIDSAF 554

Query: 500  QVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHL 559
            Q VTRLTY+QNHHLSGKGEGIVI+PHV+GHLLGGT+WKITKDGEDVIYAVDFNHRKE+HL
Sbjct: 555  QNVTRLTYAQNHHLSGKGEGIVISPHVSGHLLGGTVWKITKDGEDVIYAVDFNHRKEKHL 614

Query: 560  NGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVL 619
            NG    SFVRPAVLITDAYNALNNQ YRRQKDKEF DTI+KTLR++GNVLLPVDTAGRVL
Sbjct: 615  NGINQASFVRPAVLITDAYNALNNQAYRRQKDKEFTDTIKKTLRSDGNVLLPVDTAGRVL 674

Query: 620  ELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKH 679
            EL+QILE  W +E+LN+PIFFLTYVASSTIDY+KSFLEWMSDSIAKSFE TR NAF+LK 
Sbjct: 675  ELVQILESCWADENLNYPIFFLTYVASSTIDYVKSFLEWMSDSIAKSFEKTRENAFILKR 734

Query: 680  VTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLAR 739
            +TLL+NKSELDNA DGPKVVLASMASLEAG+SHDIFVEWATD KNL+LF+ER QFGTLAR
Sbjct: 735  ITLLVNKSELDNASDGPKVVLASMASLEAGFSHDIFVEWATDPKNLVLFTERAQFGTLAR 794

Query: 740  MLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSKASHGTDN 799
            MLQADPPPKAVKVT+SKRVPL G+EL+AYEEEQNR +K+E LKASL+KEE+SK++ G D 
Sbjct: 795  MLQADPPPKAVKVTMSKRVPLVGEELIAYEEEQNRIRKDEVLKASLIKEEESKSAQGADV 854

Query: 800  DTGDPMIIDASS--NVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGE 859
             T DPM++DAS+  ++      HGG YRD+ IDGF PPSTS +PMFPFYEN S WDDFGE
Sbjct: 855  STSDPMVVDASNTHSLLDAAVPHGGGYRDMLIDGFTPPSTSAAPMFPFYENNSDWDDFGE 914

Query: 860  VINPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHY 919
            VINPDDYVIKD DMDQ A H G D+DGKLDE +A+LILD +PSKVV+ ELTVQVKCSL Y
Sbjct: 915  VINPDDYVIKDADMDQGAMHVGGDMDGKLDEGSASLILDTRPSKVVATELTVQVKCSLIY 974

Query: 920  MDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDV 979
            MDFEGRSD RSIKSILSH+APLKLVLVHGTAEATEHLKQHCL +VCPHVYAPQIEETIDV
Sbjct: 975  MDFEGRSDARSIKSILSHMAPLKLVLVHGTAEATEHLKQHCLTHVCPHVYAPQIEETIDV 1034

Query: 980  TSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVL 1039
            TSDLCAYKVQLSEKLMSNVLFKKLGDYEI+W+D++ GKTENG LSLLP+S  A PH+SVL
Sbjct: 1035 TSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDSEAGKTENGALSLLPISTPAPPHESVL 1094

Query: 1040 VGDLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLC 1095
            VGDLKMA+FKQFLS  G+QVEFA GALRCGEYVTLRKV DAS KGGGSGTQQ+VIEGPLC
Sbjct: 1095 VGDLKMANFKQFLSDNGVQVEFASGALRCGEYVTLRKVGDASHKGGGSGTQQIVIEGPLC 1154

BLAST of Csor.00g282190 vs. ExPASy TrEMBL
Match: A0A5N6L0H3 (Cleavage and polyadenylation specificity factor subunit 2 OS=Carpinus fangiana OX=176857 GN=FH972_025130 PE=3 SV=1)

HSP 1 Score: 1828 bits (4734), Expect = 0.0
Identity = 921/1087 (84.73%), Positives = 1004/1087 (92.36%), Query Frame = 0

Query: 1    MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
            M+ LWKL YLLEPAP+TLIVTAVAVTFGSAFRALNYGKEMERNRD SEASITLDRSQALM
Sbjct: 1    MDPLWKLSYLLEPAPITLIVTAVAVTFGSAFRALNYGKEMERNRDLSEASITLDRSQALM 60

Query: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
            IPVMSSCSLLLMFYLFSSVSQLLTAFTA+ASVSSLFFCL+PY+A +KSQFGL DP+VSRC
Sbjct: 61   IPVMSSCSLLLMFYLFSSVSQLLTAFTAIASVSSLFFCLSPYVAYMKSQFGLTDPFVSRC 120

Query: 121  CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
            CSKSFTR QGLLL+ C G+VAAWLVSGHWILNNLLGISIC+AFVSHVRLPN+KICAMLLV
Sbjct: 121  CSKSFTRTQGLLLLTCSGIVAAWLVSGHWILNNLLGISICIAFVSHVRLPNIKICAMLLV 180

Query: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
            CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV
Sbjct: 181  CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240

Query: 241  FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMH-ARGHK 300
            FPRNLLGGV+PG++A DFMMLGLGDMAIP+M LALVLCFDHRKSRD+VNL+D++ A+GHK
Sbjct: 241  FPRNLLGGVVPGESARDFMMLGLGDMAIPSMLLALVLCFDHRKSRDSVNLIDINSAKGHK 300

Query: 301  YIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGPLP 360
            YIWYAL GYAIGLVTALAAGVLTHSPQPALLYLVPSTLGP+I +S+IRK+ +ELWEG +P
Sbjct: 301  YIWYALTGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPIIVVSYIRKELMELWEGNIP 360

Query: 361  NPNDK-----VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTID 420
            N NDK     VTPLCGVYNENPLSY+VS+DGFNFLIDCGW+DHFDP +LQPLS+VASTID
Sbjct: 361  NLNDKAHQTEVTPLCGVYNENPLSYVVSIDGFNFLIDCGWHDHFDPTILQPLSKVASTID 420

Query: 421  AVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTL 480
            AVL+S+PDTLHLGALPYA+KQLGLSAPVY+TEP+YRLGLLTMYDQ+++RKQVSEFDLFTL
Sbjct: 421  AVLLSYPDTLHLGALPYAVKQLGLSAPVYTTEPIYRLGLLTMYDQYLSRKQVSEFDLFTL 480

Query: 481  DDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFN 540
            DDIDSAFQ VTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVD N
Sbjct: 481  DDIDSAFQKVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVIYAVDLN 540

Query: 541  HRKERHLNGTILESFVRPAVLITDAYNALNNQPYRR-QKDKEFGDTIQKTLRANGNVLLP 600
            HRKERHLNGT+L SFVRPAVLITDAYNALNNQPYRR +K+ EFG+TI+KTL A GNVLLP
Sbjct: 541  HRKERHLNGTVLASFVRPAVLITDAYNALNNQPYRRGEKENEFGETIKKTLGAGGNVLLP 600

Query: 601  VDTAGRVLELIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTR 660
            VDTAGRV ELI ILE YW ++SLN+PIFFLTYVASSTIDY+KSFLEWMSDSI KSFE  R
Sbjct: 601  VDTAGRVFELILILEQYWADKSLNYPIFFLTYVASSTIDYVKSFLEWMSDSIPKSFEQNR 660

Query: 661  NNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSER 720
             N FLLK+VT LINKSELDNAPDGPKVV+ASMASLE G+SHDIFVEWA+DAKNL+LF+ER
Sbjct: 661  ENPFLLKNVTFLINKSELDNAPDGPKVVIASMASLEVGFSHDIFVEWASDAKNLVLFTER 720

Query: 721  GQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQS 780
            GQF TLAR+LQADPPPKAVKV +SKRVPL G+EL+AYEEEQNR KKEE LKA+L+KEE +
Sbjct: 721  GQFATLARILQADPPPKAVKVAMSKRVPLVGEELIAYEEEQNRIKKEETLKATLIKEE-T 780

Query: 781  KASHGTDNDTGDPMIIDASSNVA-PDV-GSHGGAYRDIFIDGFVPPSTSVSPMFPFYENT 840
            KASH  D D  DPM++D S+  A PDV G HGG YRDI IDGFVP STSV+PMFPFY+NT
Sbjct: 781  KASHDADIDASDPMVVDVSNTHALPDVAGPHGGGYRDILIDGFVPSSTSVAPMFPFYDNT 840

Query: 841  SAWDDFGEVINPDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTV 900
              WDDFGEVINPDDYVIKDEDMDQ+  + G   DGK DE +A++ILD KPSKVVSNELTV
Sbjct: 841  FEWDDFGEVINPDDYVIKDEDMDQTGMNVGGYSDGKFDEGSASMILDTKPSKVVSNELTV 900

Query: 901  QVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAP 960
            QVKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCP VYAP
Sbjct: 901  QVKCLLIYMDFEGRSDGRSIKSILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPQVYAP 960

Query: 961  QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKA 1020
             IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEI+W+DA+VGKT++G LSL P S A
Sbjct: 961  HIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTDDGMLSLNPCSTA 1020

Query: 1021 ALPHKSVLVGDLKMADFKQFLSSKGIQVEFAGGALRCGEYVTLRKVSDASQKGGGSGTQQ 1077
            A PHKSVLVGDLKMADFKQFL+SKGIQVEFAGGALRCGEYVT+RKV DASQKGG SG QQ
Sbjct: 1021 APPHKSVLVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTIRKVGDASQKGGASGPQQ 1080

BLAST of Csor.00g282190 vs. TAIR 10
Match: AT5G23880.1 (cleavage and polyadenylation specificity factor 100 )

HSP 1 Score: 1180.2 bits (3052), Expect = 0.0e+00
Identity = 574/735 (78.10%), Positives = 662/735 (90.07%), Query Frame = 0

Query: 364  KVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDT 423
            +VTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDT
Sbjct: 6    QVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLLSHPDT 65

Query: 424  LHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDIDSAFQV 483
            LH+GALPYAMKQLGLSAPVY+TEPV+RLGLLTMYDQF++RKQVS+FDLFTLDDIDSAFQ 
Sbjct: 66   LHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDIDSAFQN 125

Query: 484  VTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNG 543
            V RLTYSQN+HLSGKGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNG
Sbjct: 126  VIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNG 185

Query: 544  TILESFVRPAVLITDAYNAL-NNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLE 603
            T+L+SFVRPAVLITDAY+AL  NQ  R+Q+DKEF DTI K L   GNVLLPVDTAGRVLE
Sbjct: 186  TVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLE 245

Query: 604  LIQILEWYWEEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHV 663
            L+ ILE +W +   +FPI+FLTYV+SSTIDY+KSFLEWMSDSI+KSFE +R+NAFLL+HV
Sbjct: 246  LLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHV 305

Query: 664  TLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARM 723
            TLLINK++LDNAP GPKVVLASMASLEAG++ +IFVEWA D +NL+LF+E GQFGTLARM
Sbjct: 306  TLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARM 365

Query: 724  LQADPPPKAVKVTVSKRVPLTGDELVAYEEEQNR-KKEEALKASLLKEEQSKASHGTDND 783
            LQ+ PPPK VKVT+SKRVPL G+EL+AYEEEQNR K+EEAL+ASL+KEE++KASHG+D++
Sbjct: 366  LQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASHGSDDN 425

Query: 784  TGDPMIIDASSNVAPDVGSHGGAYRDIFIDGFVPPSTSVSPMFPFYENTSAWDDFGEVIN 843
            + +PMIID +      +GSHG AY+DI IDGFVPPS+SV+PMFP+Y+NTS WDDFGE+IN
Sbjct: 426  SSEPMIID-TKTTHDVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEWDDFGEIIN 485

Query: 844  PDDYVIKDEDMDQSAPHGGVDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDF 903
            PDDYVIKDEDMD+ A H G DVDG+LDE  A+L+LD +PSKV+SNEL V V CSL  MD+
Sbjct: 486  PDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVSCSLVKMDY 545

Query: 904  EGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSD 963
            EGRSDGRSIKS+++HV+PLKLVLVH  AEATEHLKQHCL N+CPHVYAPQIEET+DVTSD
Sbjct: 546  EGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIEETVDVTSD 605

Query: 964  LCAYKVQLSEKLMSNVLFKKLGDYEISWLDADVGKTENGTLSLLPLSKAALPHKSVLVGD 1023
            LCAYKVQLSEKLMSNV+FKKLGD E++W+D++VGKTE    SLLP+  AA PHK VLVGD
Sbjct: 606  LCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASPHKPVLVGD 665

Query: 1024 LKMADFKQFLSSKGIQVEFA-GGALRCGEYVTLRKVSDASQKGGGSGTQQVVIEGPLCED 1083
            LK+ADFKQFLSSKG+QVEFA GGALRCGEYVTLRKV    QKGG SG QQ++IEGPLCED
Sbjct: 666  LKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQILIEGPLCED 725

Query: 1084 YYKIRELLYSQFYLL 1096
            YYKIR+ LYSQFYLL
Sbjct: 726  YYKIRDYLYSQFYLL 739

BLAST of Csor.00g282190 vs. TAIR 10
Match: AT4G33410.1 (SIGNAL PEPTIDE PEPTIDASE-LIKE 1 )

HSP 1 Score: 612.1 bits (1577), Expect = 8.9e-175
Identity = 312/366 (85.25%), Positives = 338/366 (92.35%), Query Frame = 0

Query: 1   MESLWKLLYLLEPAPVTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALM 60
           ME+LW LLYLLEPAP TLIVTAV VTF SAFRALNYGKEMERNRDFSEASITLD SQALM
Sbjct: 1   METLWTLLYLLEPAPATLIVTAVTVTFASAFRALNYGKEMERNRDFSEASITLDSSQALM 60

Query: 61  IPVMSSCSLLLMFYLFSSVSQLLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPYVSRC 120
           IPVMSSCSLLLMFYLFSSVSQLLTAFTA+ASVSSLF+ L+PY   +K+Q GL+DP++SRC
Sbjct: 61  IPVMSSCSLLLMFYLFSSVSQLLTAFTAIASVSSLFYWLSPYAVYMKTQLGLSDPFLSRC 120

Query: 121 CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 180
           CSKSFTRIQGLLL+AC   V AWL+SGHW+LNNLLGISIC+AFVSHVRLPN+KICAMLLV
Sbjct: 121 CSKSFTRIQGLLLVACAMTVVAWLISGHWVLNNLLGISICIAFVSHVRLPNIKICAMLLV 180

Query: 181 CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 240
           CLFVYDIFWVFFSERFFGANVMV+VATQQASNPVHTVANSL+LPGLQLITKKLELPVKIV
Sbjct: 181 CLFVYDIFWVFFSERFFGANVMVAVATQQASNPVHTVANSLNLPGLQLITKKLELPVKIV 240

Query: 241 FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDM-HARGHK 300
           FPRNLLGGV+PG +A+DFMMLGLGDMAIPAM LALVLCFDHRK+RD VN+ D+  ++GHK
Sbjct: 241 FPRNLLGGVVPGVSASDFMMLGLGDMAIPAMLLALVLCFDHRKTRDVVNIFDLKSSKGHK 300

Query: 301 YIWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLGPVIAISWIRKDFLELWEGP-L 360
           YIWYALPGYAIGLV ALAAGVLTHSPQPALLYLVPSTLGPVI +SW RKD  ELWEGP L
Sbjct: 301 YIWYALPGYAIGLVAALAAGVLTHSPQPALLYLVPSTLGPVIFMSWRRKDLAELWEGPAL 360

Query: 361 PNPNDK 365
            NP +K
Sbjct: 361 SNPIEK 366

BLAST of Csor.00g282190 vs. TAIR 10
Match: AT2G01730.1 (cleavage and polyadenylation specificity factor 73 kDa subunit-II )

HSP 1 Score: 138.3 bits (347), Expect = 3.8e-32
Identity = 93/357 (26.05%), Positives = 166/357 (46.50%), Query Frame = 0

Query: 380 LVSVDGFNFLIDCGW-------NDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYA 439
           +V+++G   + DCG        N + + +L+       + I  ++I+H    H+GALPY 
Sbjct: 20  VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79

Query: 440 MKQLGLSAPVYSTEPVYRLGLLTMYD-QFIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQ 499
            +  G + P+Y + P   L  L + D + +   +  E +LFT   I +  + V  +   Q
Sbjct: 80  TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELFTTTHIANCMKKVIAIDLKQ 139

Query: 500 NHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVR 559
              +    E + I  + AGH+LG  +         ++Y  D+N   +RHL    ++  ++
Sbjct: 140 TIQVD---EDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR-LQ 199

Query: 560 PAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYW 619
             +LI+++  A   +  +  +++EF   + K +   G  L+P    GR  EL  +L+ YW
Sbjct: 200 LDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDYW 259

Query: 620 EEESLNFPIFFLTYVASSTIDYIKSFLEWMSDSIAKSFEHTRNNAFLLKHVTLLINKSEL 679
           E  ++  PI+F + +      Y K  + W S ++ +  +H  +N F  K+V        L
Sbjct: 260 ERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKE--KHNTHNPFDFKNVKDF--DRSL 319

Query: 680 DNAPDGPKVVLASMASLEAGYSHDIFVEWATDAKNLILFSERGQFGTLARMLQADPP 729
            +AP GP V+ A+   L AG+S ++F  WA    NL+        GT+   L A  P
Sbjct: 320 IHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMAGKP 367

BLAST of Csor.00g282190 vs. TAIR 10
Match: AT2G43070.1 (SIGNAL PEPTIDE PEPTIDASE-LIKE 3 )

HSP 1 Score: 98.6 bits (244), Expect = 3.3e-20
Identity = 90/321 (28.04%), Positives = 145/321 (45.17%), Query Frame = 0

Query: 42  RNRDFSEASITLDRSQALMIPVMSSCSLLLMFYLFSS-VSQLLTAFTAVASVSSLFFCLA 101
           R  D  +  + +  + A+   V +S  LLL+FY  SS    +LT F  +  +  +   + 
Sbjct: 240 RKDDPEKEILDISVTGAVFFIVTASIFLLLLFYFMSSWFVWVLTIFFCIGGMQGMHNIIM 299

Query: 102 PYMACLKSQFGLADPYVSRCCSKSFTRIQGLLLMACFGLVAAWLVSGH----WILNNLLG 161
             +  L+    LA   V      + + +  L+ + C      W +  H    W+  ++LG
Sbjct: 300 AVI--LRKCRHLARKSVKLPLLGTMSVLSLLVNIVCLAFAVFWFIKRHTSYSWVGQDILG 359

Query: 162 ISICVAFVSHVRLPNVKICAMLLVCLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHT 221
           I + +  +  VRLPN+K+  +LL C FVYDIFWVF S   F  +VM+ VA   +S     
Sbjct: 360 ICLMITALQVVRLPNIKVATVLLCCAFVYDIFWVFISPLIFHESVMIVVAQGDSSTGE-- 419

Query: 222 VANSLSLPGLQLITKKLELPVKIVFPR--NLLGGVIPGKNATDFMMLGLGDMAIPAMFLA 281
                             +P+ +  PR  +  GG         + M+G GD+  P + ++
Sbjct: 420 -----------------SIPMLLRIPRFFDPWGG---------YDMIGFGDILFPGLLIS 479

Query: 282 LVLCFDHRKSRDTVNLLDMHARGHKYIWYALPGYAIG-LVTALAAGVLTHSPQPALLYLV 341
               +D  K R   N       G+ ++W  + GY IG L+T L   ++    QPALLY+V
Sbjct: 480 FASRYDKIKKRVISN-------GY-FLWLTI-GYGIGLLLTYLGLYLMDGHGQPALLYIV 521

Query: 342 PSTLGPVIAISWIRKDFLELW 355
           P TLG  + +  +R +  ELW
Sbjct: 540 PCTLGLAVILGLVRGELKELW 521

BLAST of Csor.00g282190 vs. TAIR 10
Match: AT2G03120.1 (signal peptide peptidase )

HSP 1 Score: 97.1 bits (240), Expect = 9.6e-20
Identity = 94/338 (27.81%), Positives = 147/338 (43.49%), Query Frame = 0

Query: 16  VTLIVTAVAVTFGSAFRALNYGKEMERNRDFSEASITLDRSQALMIPVMSSCSLLLMFYL 75
           + +I+TA    +   FR++      E          T+ +  A+  P++ S  LL +F L
Sbjct: 28  LNVILTACITVYVGCFRSVKDTPPTE----------TMSKEHAMRFPLVGSAMLLSLFLL 87

Query: 76  FSSVSQ-----LLTAFTAVASVSSLFFCLAPYMACLKSQFGLADPY----------VSRC 135
           F  +S+     +LTA+  V  + +L   L P +        L +P+            + 
Sbjct: 88  FKFLSKDLVNAVLTAYFFVLGIVALSATLLPAIRRF-----LPNPWNDNLIVWRFPYFKS 147

Query: 136 CSKSFTRIQGLLLMACFGLVAAWLVSGHWILNNLLGISICVAFVSHVRLPNVKICAMLLV 195
               FT+ Q +  +      A +    HW+ NN+LG+S C+  +  + L + K  A+LL 
Sbjct: 148 LEVEFTKSQVVAGIPGTFFCAWYAWKKHWLANNILGLSFCIQGIEMLSLGSFKTGAILLA 207

Query: 196 CLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIV 255
            LF YDIFWVFF+       VMVSVA                        K  + P+K++
Sbjct: 208 GLFFYDIFWVFFTP------VMVSVA------------------------KSFDAPIKLL 267

Query: 256 FPRNLLGGVIPGKNATDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDMHARGHKY 315
           FP         G     + MLGLGD+ IP +F+AL L FD  + R             +Y
Sbjct: 268 FP--------TGDALRPYSMLGLGDIVIPGIFVALALRFDVSRRRQP-----------QY 301

Query: 316 IWYALPGYAIGLVTALAAGVLTHSPQPALLYLVPSTLG 339
              A  GYA+G++  +       + QPALLY+VP+ +G
Sbjct: 328 FTSAFIGYAVGVILTIVVMNWFQAAQPALLYIVPAVIG 301

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9LKF9 | 0.0e+00 | 78.10 | Cleavage and polyadenylation specificity factor subunit 2 OS=Arabidopsis thalian...
Q652P4 | 0.0e+00 | 73.33 | Cleavage and polyadenylation specificity factor subunit 2 OS=Oryza sativa subsp....
Q93Z32 | 1.3e-173 | 85.25 | Signal peptide peptidase-like 1 OS=Arabidopsis thaliana OX=3702 GN=SPPL1 PE=2 SV...
Q7G7C7 | 1.9e-153 | 75.07 | Signal peptide peptidase-like 1 OS=Oryza sativa subsp. japonica OX=39947 GN=SPPL...
Q9V3D6 | 5.5e-153 | 39.17 | Probable cleavage and polyadenylation specificity factor subunit 2 OS=Drosophila...
Match Name | E-value | Identity (%) | Description
KAG6576721.1 | 0.0 | 100.00 | Cleavage and polyadenylation specificity factor subunit 2, partial [Cucurbita ar...
TYK22961.1 | 0.0 | 91.74 | cleavage and polyadenylation specificity factor subunit 2 [Cucumis melo var. mak...
KAA0033663.1 | 0.0 | 91.66 | cleavage and polyadenylation specificity factor subunit 2 [Cucumis melo var. mak...
RXH95086.1 | 0.0 | 84.42 | hypothetical protein DVH24_024770 [Malus domestica]
VVA11124.1 | 0.0 | 83.50 | PREDICTED: cleavage and polyadenylation [Prunus dulcis]
Match Name | E-value | Identity (%) | Description
A0A5D3DH28 | 0.0 | 91.74 | Cleavage and polyadenylation specificity factor subunit 2 OS=Cucumis melo var. m...
A0A5A7ST61 | 0.0 | 91.66 | Cleavage and polyadenylation specificity factor subunit 2 OS=Cucumis melo var. m...
A0A498JJ41 | 0.0 | 84.42 | Cleavage and polyadenylation specificity factor subunit 2 OS=Malus domestica OX=...
A0A5E4E5T2 | 0.0 | 83.50 | Cleavage and polyadenylation specificity factor subunit 2 OS=Prunus dulcis OX=37...
A0A5N6L0H3 | 0.0 | 84.73 | Cleavage and polyadenylation specificity factor subunit 2 OS=Carpinus fangiana O...
Match Name | E-value | Identity (%) | Description
AT5G23880.1 | 0.0e+00 | 78.10 | cleavage and polyadenylation specificity factor 100
AT4G33410.1 | 8.9e-175 | 85.25 | SIGNAL PEPTIDE PEPTIDASE-LIKE 1
AT2G01730.1 | 3.8e-32 | 26.05 | cleavage and polyadenylation specificity factor 73 kDa subunit-II
AT2G43070.1 | 3.3e-20 | 28.04 | SIGNAL PEPTIDE PEPTIDASE-LIKE 3
AT2G03120.1 | 9.6e-20 | 27.81 | signal peptide peptidase
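
When many hits are listed, it can help to separate the strong matches from the weak ones. The Python sketch below does this for the five TAIR 10 hits above. The function name `strong_hits` and both thresholds are hypothetical choices for illustration, not settings used by the database.

```python
# (match name, E-value as printed, percent identity), copied from the TAIR 10 hits above.
tair_hits = [
    ("AT5G23880.1", "0.0e+00", 78.10),
    ("AT4G33410.1", "8.9e-175", 85.25),
    ("AT2G01730.1", "3.8e-32", 26.05),
    ("AT2G43070.1", "3.3e-20", 28.04),
    ("AT2G03120.1", "9.6e-20", 27.81),
]

def strong_hits(hits, max_evalue=1e-50, min_identity=50.0):
    """Keep hits at or below max_evalue and at or above min_identity.

    Both thresholds are illustrative defaults, not values used by the database.
    """
    return [
        name
        for name, evalue, identity in hits
        if float(evalue) <= max_evalue and identity >= min_identity
    ]

print(strong_hits(tair_hits))
```

With these assumed cut-offs only the CPSF100 hit and the SIGNAL PEPTIDE PEPTIDASE-LIKE 1 hit are kept. The three remaining hits have identities below 30%.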
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 750..770
IPR022712 | Beta-Casp domain | SMART | SM01027 | Beta_Casp_2 | coord: 600..723; e-value: 1.7E-25; score: 100.7
IPR022712 | Beta-Casp domain | PFAM | PF10996 | Beta-Casp | coord: 600..723; e-value: 1.1E-18; score: 67.6
IPR006639 | Presenilin/signal peptide peptidase | SMART | SM00730 | psh_8 | coord: 49..347; e-value: 2.9E-45; score: 166.4
IPR001279 | Metallo-beta-lactamase | SMART | SM00849 | Lactamase_B_5a | coord: 375..580; e-value: 0.0072; score: 23.5
IPR001279 | Metallo-beta-lactamase | PFAM | PF16661 | Lactamase_B_6 | coord: 380..555; e-value: 1.2E-53; score: 181.5
IPR036866 | Ribonuclease Z/Hydroxyacylglutathione hydrolase-like | GENE3D | 3.60.15.10 |  | coord: 366..615; e-value: 2.8E-75; score: 254.7
IPR036866 | Ribonuclease Z/Hydroxyacylglutathione hydrolase-like | SUPERFAMILY | 56281 | Metallo-hydrolase/oxidoreductase | coord: 364..958
IPR007369 | Peptidase A22B, signal peptide peptidase | PFAM | PF04258 | Peptidase_A22B | coord: 50..356; e-value: 2.5E-67; score: 227.4
IPR011108 | Zn-dependent metallo-hydrolase, RNA specificity domain | PFAM | PF07521 | RMMBL | coord: 889..940; e-value: 3.3E-11; score: 43.0
IPR025069 | Cleavage and polyadenylation specificity factor 2, C-terminal | PFAM | PF13299 | CPSF100_C | coord: 1000..1092; e-value: 2.2E-18; score: 67.0
IPR027075 | Cleavage and polyadenylation specificity factor subunit 2 | PANTHER | PTHR45922 | CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 2 | coord: 364..1094
IPR035639 | CPSF2, metallo-hydrolase domain | CDD | cd16293 | CPSF2-like_MBL-fold | coord: 366..561; e-value: 2.66476E-106; score: 328.711
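
The `coord:` ranges above overlap heavily, so it can be easier to read them as merged regions of the protein. The following Python sketch merges overlapping spans; the helper name `merge_regions` is an illustrative assumption, and the input tuples are just the source terms and coordinates copied from the listing.

```python
# (source term, start, end) taken from the "coord:" values listed above.
hits = [
    ("Coil", 750, 770), ("SM01027", 600, 723), ("PF10996", 600, 723),
    ("SM00730", 49, 347), ("SM00849", 375, 580), ("PF16661", 380, 555),
    ("3.60.15.10", 366, 615), ("56281", 364, 958), ("PF04258", 50, 356),
    ("PF07521", 889, 940), ("PF13299", 1000, 1092), ("PTHR45922", 364, 1094),
    ("cd16293", 366, 561),
]

def merge_regions(hits):
    """Merge overlapping (start, end) spans into contiguous covered regions."""
    merged = []
    for _, start, end in sorted(hits, key=lambda h: (h[1], h[2])):
        if merged and start <= merged[-1][1]:
            # Span overlaps the current region: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            # Span starts past the current region: open a new one.
            merged.append([start, end])
    return [tuple(region) for region in merged]

print(merge_regions(hits))
```

On these coordinates the spans collapse into two regions, 49..356 (the signal peptide peptidase hits) and 364..1094 (the CPSF2-related hits). This matches where the SIGNAL PEPTIDE PEPTIDASE-LIKE 1 and CPSF100 BLAST alignments fall on the query.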

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Csor.00g282190.m01 | Csor.00g282190.m01 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006398 mRNA 3'-end processing by stem-loop binding and cleavage
biological_process GO:0006378 mRNA polyadenylation
biological_process GO:0098789 pre-mRNA cleavage required for polyadenylation
biological_process GO:0006508 proteolysis
biological_process GO:0006379 mRNA cleavage
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005847 mRNA cleavage and polyadenylation specificity factor complex
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003723 RNA binding
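
For downstream use, the GO rows above can be grouped by category. This is a minimal Python sketch under the assumption that each row is "category, accession, term name" separated by whitespace; `group_go_terms` is a hypothetical helper name.

```python
from collections import defaultdict

# Rows copied from the GO table above: category, accession, term name.
GO_ROWS = """\
biological_process GO:0006398 mRNA 3'-end processing by stem-loop binding and cleavage
biological_process GO:0006378 mRNA polyadenylation
biological_process GO:0098789 pre-mRNA cleavage required for polyadenylation
biological_process GO:0006508 proteolysis
biological_process GO:0006379 mRNA cleavage
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005847 mRNA cleavage and polyadenylation specificity factor complex
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003723 RNA binding
"""

def group_go_terms(text):
    """Group GO rows into {category: [(accession, term name), ...]}."""
    grouped = defaultdict(list)
    for row in text.splitlines():
        if not row.strip():
            continue
        # The term name may contain spaces, so split on the first two gaps only.
        category, accession, name = row.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)

for category, terms in group_go_terms(GO_ROWS).items():
    print(category, len(terms))
```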