Cp4.1LG01g16320 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG01g16320
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptioncleavage and polyadenylation specificity factor subunit 3-I-like
LocationCp4.1LG01: 9981562 .. 9990218 (+)
RNA-Seq ExpressionCp4.1LG01g16320
SyntenyCp4.1LG01g16320
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGATTACTCGCAAGTCACGAGATGCTATCCCCACTGCGTCTAAATTTCATCGCTCTCCTATCCCGCTTTCTACCTTCCGTCTTAGGCACCGCTTCTCTCTAAAACCCATAAAACAAAATACACAGTTCTTGAAACCTTACTAGGGTTTACCTCTCTTTCTTCCCCCATTTCCACTCACCGTAACTTCTCCTTCACTACTTTCATTTCCCCTTTCTTCGAATTTTTTCACATCTGTTATTCGCTCTGATCTACACTCCCCTTTCGCTCTTTCGTTTACAGAAAATTTCAAGAAATACAACTGAATCAATCGTGTATGAATAACTTGGTGCTAGTATATATATATTTAGGATTGTAAATTGAAGCATGAATTTGGATTGTCGTGTTGTTATCGTTCATACTGATTCAAACTTTTCGTTCTTGATATATTGCAGCTGGAAAGATCTTGATCCTGTTTGAGGAAAAATGGCCTCGGTTGGACAGCCGCCCTCCTTCAAGAAACGGGAGTTATCCGCGACGAGAGAGGAGGATCAGCTTATTGTAACTCCTCTTGGGGCTGGCAATGAAGTGGGTCGATCCTGCGTTTATATGTCTTACAAAGGCAAAGTTGTGTTGGTACGTTTTTGAACTCATCCATTTTCTGGGTAGAACATACAGAACTGTAATTGAATAAGTGGTCATGTTTCTTTCCCCGCTTACTTTTCTTGGAGGATAGTTCGATTGTGGAATTCACCCTGCTTACTCTGGCATGGCTGCTTTGCCGTACTTTGATGAGATTGACCCCTCAACAATTGATGTTCTGCTCATTACTCAGTATGTATTTCTTTTTAAAATAGTTATAAAATGCAGCTCTTAGATCGAACTTTGCATTTTAATTCTCTTGTTTACTTCTGCAAAATCAGTGAATGACGGTGGATGGTTTATGCAATTGCATTTGTGAATGGGTGTAGGGTCACTGCTTTGAAATCTTTGCCCCTAAAGATGAAAAGCGTTCCCTTCCTGATCAATTTTTTCTAGGCGCAACTTGTTCAACATGTTGTTTAAAACTCACGTATCCTTATTAAAATCCATGGCTCTTTTAGCATGGAAAATGCAGATACTCTAATCGTCTCAGCTTTCTTTATTTTCTTCCACCTTTTGTTTGTGTTCTTGTAACTAGGATAATTTTCTGCACACTTATTTTGCAGCTTTCATTTGGATCATGCCGCATCACTCCCTTATTTCTTGGAGAAGGTACGTTGTTTTTAATTGAATTGCAGTCTGATTATTGTTTGAGTACATAAACTAATTTAGCATTTAATATGATAGACTACATTCAAAGGACGAGTCTTTATGACTTATGCTACTAAAGCGATCTACAAGTTGTTGCTGTCGGACTTTGTGAAAGTGAGCAAAGTTTCGATTGAAGATATGTTGTTTGACGAGCAGGACATAAATCGTTCCATGGACAAAATTGAGGTAAATGATTTCTTAGAATGGATGTCTATTGGTTTCCTGTTTCGTCGATGTCGATAAAATGGATCAAATTTCAGTATAGGAAATTTTAGTGATTCTACCAAATTCATACTTTCTATTGAATCTGCTAGACGTTTTATTTGGTAACAGTTTATATTTTCCTCCCTCACTATATATGATCTGTCTGTCTCTGCCCTTATCCCCTCTTCTGCTGTAGTTCTTTGTTAACTATTTTCGTTAAAATGTTTGAAATTCAGTTCTTATAGTTTCCAGAACCAAATATAACAGTACTAGAAGTGGTATTTTGTTGTATTCATGCATTTTTTTTTCTGATTGTTCTCTGAAATTAACTGATTTATAAAATATAATTTGATTTAAATGTTATATTTCTAAAATTAACATGTATAAGAGAAGAAACAAATAGAAACAGAACAACAGAGAAGAAACAAATTGAAATGTAATACAGTTCTAAAAATGCTTTCTGGAAAAGTAAAAATGATATCTGTATGATAATCATTTTACCCACTCTAATGCAGATTCGTTATATTGATTACATTGTTATCAACTTGCATAGGTTTCATCTCTATGCTTACAGATTTGCAATGATGAAAACGGAAATTAGTTTCTCTTATTCAATAACAGTGCACAGTAATTGCAGGTCATTGATTTCCATCAAACAGTAGAAGTAAATGGTATTCGGTTTTGGTGTTACACTGCTGGTCATGTGCTTGGTGCTGCCATGTTTATGGTGGATATTGCTGGCGTCCGAGTCCTCTACACTGGAGACTATTCGCGTGAAGAAGATCGACATCTTCGAGCCGCCGAGATGCCTCAATTTTCTCCTGATGTTTGCATAATTGAATCTACATACGGTGTCCAGCTCCATCAACCTCGACATATTCGAGAGAAGCGCTTCACTGATGTTGTACATTCAACCATTTCTCAAGGTGGTCGTGTGCTAATTCCAGCTTTTGCCCTTGGACGTGCCCAGGAACTCCTCCTTATTCTTGATGAGTACTGGGCGAACCATCCCGACCTCCATAATGTTCCCATATATTATGCTTCTCCTCTGGCAAAAAGATGTTTGACTGTATACGAGACGTACACGCTCTCCATGAATGATAGGATCCAAAATGCCAAATCAAACCCCTTTAGATTCAAGTACATATCCCCACTAAAGAGCATTGAAGTTTTCAAAGATGTTGGCCCATCAGTGGTGATGGCCAGCCCTGGTGGACTTCAGAGCGGTTTATCACGACAACTCTTTGACTTGTGGTGTTCGGATAAGAAAAATTCATGTGTGCTTCCTGGTTATGTTGTTGAAGGGACACTGGCTAAGACTATCATCAATGAACCAAAGGAAGTCACCCTCATGAGTGGACTCACAGCTCCTCTTAACATGCAGGTTCATTACATTTCGTTCTCTGCTCATGCTGACTTTGCGCAGACCAGCGCGTTCTTGGAAGAGCTCATGCCGCCCAACATAATTCTCGTGCACGGAGAAGCTAATGAGATGGGGAGGCTCAAACAGAAGCTTATATCCCAGTTTTCTGATCGGAATACAAAGATTCTTACTCCAAAGAATTGTCAGTCTGTTGAAATGTACTTCAACTCTCAGAAGATGGCAAAAACTATTGGAAAATTGGCTGAGAAAACCCCAGAAGTGGGCGAAACTGTCAGCGGTTTACTGGTGAAGAAAGGATTTGCACATCAAATAATGGCACCAGATGATCTACACATCTTCTCTCAGCTATCAACTGCCAACATCAACCAGCGTATTACAATTCCATACTCGAACGCCTTTAATGTGATTGTACGCAGGCTCAAACAGGTATATGAGAGTGTAGAATCTTCAACAGACGAGGAGTCTGGCGTTCCAACAATTCGTGTGCACGATCGTGTGACAGTAAAGCACGAATCAGAGAGGCACATCTCACTTCACTGGACATCAGACCCGTTAAGTGACATGGTATCGGATTCTGTTGTAGCTCTCATCCTGAACATCAACCGCGAGGTACCGAAAGTCATCGTCGAGTCAGAGGCTGTAAAAACGGAAGAAGAGAACAAAAAGAAAGCCGAGAAGGTAATTCATGCCCTCCTTGTTTCACTCTTTGGCAATGTGAAGTTAGGAGGAAATGGGAAGCTGGTGATCAACGTTGATGGGAGTATAGCAGAGCTTGATAAACAGAGTGGGGAGGTAGAAAGTGAAAATGAAGTTCTCAAGGAAAGAGTAAAGACAGCCTTCCGGCGAATCCAATGCGCTGTGAAGCCAATTCCTCCCTCTACATCTTAGCTCGAGCTGAATGTACCTGCTGCTGCTGCAATATGATGAGAGACTATTCCGCCTGGAACAAAAGGATCAAACCCTTCGACACCCCACGCAGGATCTACAGCTTGTACGCTTTCGGTTAATCCATAAGGATCGGATACCCATATTCCCGGACTATTTTCTATATCTGGGCTGTACAAGTAGAACTGGTCTGTGTTCTTGTAGCTGACTACTGACTATCATCCAATTTAGGGACATAAAATGTATTTTACAGGTTCAAAAGTTGGAGAAATCCCATGAATTATTTGAACTTTTTGTTTGTTTGTTTGTTGTTTGTTGTTTGTCCACTTGGATTTTGGCGATTGTCTAAAAAAAGTAAGAGAGAGAGAGAGAGAAAAAGTCGTATGCCATATCATATATGAATGGTGTTGAGCAAATGACAGCAAAGGCAGCAACTCAGAAACTGGAATTATTGATAATTGAAGATTATTTAATGTTGCCTTTCCTTCATCCTTAATCAATCCCTTCCTCCACGCACCTACTACCCAATCCATTAGAGAGATTGGGAGAGAATGTAAAATCATGCCCTAGGCCCGCTTCGGGGAAGAGAGAGAGAGAGAGAGAGAGGACTCGAGGATTTACTTTGCGTCAACTCCTTTCCCCTTCCCATTGCATTTTACAAGATCTCGGTCTACCGATCAGCATTCTCGGTATGCTCATTCTTCCTCTTTTCTTTCATCTTTCATTAATTTTTGCATTTCCGGGAGCTCTTCAATCATTTTCATTGTAATTTGTGAGAAACTTCAACGTTTTTGCGTTTGCTATTTACGCATTCGCAATCTTGGATCGACTCGGGAATGAGCTGCATTTTGCTTTCTCAGATCATTTGTCTCTGATGTTCTAATTTCAGCTATTCTCTTCGTCTCTCTGTCTCTCTGTCTCTCTGATTGCGGATGCAATGCGGGAGGAGAGATTCCAGCGTTGGTATTTTTCCTTTTGAGTTGTTAATTTCTTGAATTTGTGCATTCACAAAGTTTCCTTCTGTTTTCTAAGAAAGTTTTTTGAGGTTCTAACGGACTGCATGTTTGTATTCGACGATCTCATCTCTCCTCTCATTTCGGATTTCCTTTTTCGAAATCAGCCTGAACTTTCCGTCGCAAAATGATGCAGAATCGAGTCGGATCATTCTAGCTTGCTAATTGTCGGGTTGCATGTTTGTCTATCATCAATTTCTGTGCGCACGGTATTTGCGTCACTGGAAACTTGCATCCATGGAATAAACTGTTAGATCGGGATGAGAGGAGGATGAATATCAACTCCTCTTATAGATCTGCCAAACCCTGCGAAACTTTTTCATTAGTTTTCGTGTTCGTTCTGGAGGAAGGAGTGTCTCCTTCCACGCTTTGGTTCAAGAGCGATTCCTGCATGTGACTTCCAAGATTTCAAATTATTCTCTGCTTTGATGTTGACTTCTTTGATTTTGATAGATATTGTCGCTTTATTTTCTAAACATTATCATAAACTTCGTACGCATGGCTGTTCTTGAAAACAATGCACATGCTGCCAGGATTTCACATTAACCCCCTTCAAGTAACCTGTTAAGCTGTTAACAGTAGTTTCCATCCAGTTCTTCGCCCAAATGGCCTATTCCTACATCGATTAATTTTGTTAAAACAGTAGCTGGTGCTGCAAGAATTTTACTCATGATATTGTTTTGTATGCTTTTTGGCAGAAAATTGCGCAGTTATTGGGCAGTCATGGAGAATTCCAAGGTGTTGTCAAACACGAGAAATGTGATTTACTCTGGAAAGCATGCTCTACTTCCTCCCAAGAGTCCATTTCCTAGTGGTTCCTCCCCATATGCTGATTATTTCCCCAGTCCCATTATTGGGTCAAGAGCTGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATCATCTGAAAGTCTTCTAATGGAGGATCAACCTTCTTGGCTCAATGATCTTCTCGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCGAGTGACTCCTTTGCGTACTTGGATGCAGGAAATGTTTTGAATGAAAATTATACGCAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCACAAGATTTTGATTTCCGCAAAGATCCCCATCAAGCTTCTTTCAATATGAAAGCAAGCTCGATCAAACAGAAGAACAGGGCATGGGAATTGCCTCCAGCTACATTGACAACTAACCTGGGTTCCCGGCCTTCTGCCAAAAGTAGCATTCTTCTTGAGAGCTCGAGGTCGTTAAGTACACCACAGGAAGTAAATGGGTTCTCATCAACAACTACTGAAAAGCAGGATTCAGCAGAAACCAGTAGTATGCCTGATCGAAAGTCATCCGAGAGAATAGATGGTCCCCATATTAAGCCAGCTCCGGCTGATACAGATAATAAAAGAGCTAAACAGTAAGTTTTGCATTGCTTGTTATCTACATTATTGTTGTGCAAAAAAAATCCTCTCCCCCTGCACATAATTTTGAATTCTGTTTTGCAATCATGTCTCCCAGATTGTTCCAATCTATTTGTTAGTCTCGTACCATCTTTAGTTGCCTGCTGCTTAAGTTTTCGTGTTGGTTTATGGTTTGCTTTTCCACTTGAATAACTTCAATGTTTTGCTTTGCTAAGGAATTATACATTCAAGAAACCAGGATAGAATTAGAGGGTTCTTTTTAGAAAAGTGTTGGTCTATAATTCTTAAATTTTTCTTGTCGATGCATTCTTATTTCGACCAACCTAGTGAGCCCCTCTTGTCTTATTGTGGATCGTTCGAGTTTTTACCTATGACGCATTCTTAAGTATCACATCTGCTTCTTGATTTAATGATCATAGGAGCACTATGATTAAAATGAAGATCACTAAGTATCAACGTAATCTGAACACAGTGATTACGTTGATAGATTAATATTATCTGCATGCGCTCAAATTGCTAGGGAGTTGATTTATTTTTCTAGATTTTCATTTTCTGTGTCACTTTCTTGATAGATTAATCCACCTCTTCTTCCCCACTCCTAATACTGGGACGTATTCTCAGACAGGAATTGGGAGAGAATCTCCATTATAAAACTTGCTGATATTGGATGGATGTGTGTAGTAATAGATGGATATAACATGCCCTTCTGAAGAAATGCTAAATATAACGTGTGCTTATGTTATGCAGAGATGTACTATGTATGTGATTTAACCCGACTTGTTATGAACACAGGCAATTTGCTCAACGTTCACGTGTACGTAAACTTCAGTACATTGCAGAGCTAGAACGGAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTTAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAGCGATTAGAAAATTTATCTCAGGAGCAGCTTATAAAATACTGTGAGCTTCCATAACTCTCAAACTTGAGTTTTGACAGTCCGTTCTAATGGTTTATTTTACTTTTTTTCCAAAAATGTTGAATTGGTCTTGGATGAAAAATTCTTCTCTCAGTCGGTTTTGAACTGAATTTTTAAAAAGAGAAACATCCTTTAATTTGCTTTACTAATATGACTTGAATGTGACATTATATTTAGTGGAGCATGAAGTGCTGGAGAGGGAGATAGGAAGACTACGAATGTTGTACCAACAGCAACAACAGCCTCAGCCACCACCTTCCAGCCTTAAACGTACCAAGAGCCGAGACCTTGAGACGCAGTTTGCCAAGCTCTCTTTGAGACAGAAGGATGGGCGTTCGGGTCCCGAGTCTATGGCCGGTCCAGTTCAAATCTAGATTTGTAAATCAGTTGGGAGTTGTTGTGCATGGCCTAACGATTTCTCGGATGTACGAAACCATGTTTGTCGATTACTTGTGCAGGATACCCAGGGCTCATCAAAAGTCAGTCTGTACTCAGCATTTGACTGTGGTGGCTCTTGGTAGATGTGAATGAGAATCTGCCAATGCACCTGGCTGTCATGGCTGGAAAATGTCCTCTTTTGGGCTGGTGACTTGCCGGTGGTGATATCTATTTTCTAAGCATGGTCTCCTATGGTTTTTTTTTATCAGGATGTAATGAAAATTCTAGGTGTTTTTCAAGTCCCCCATTCCTCTTGGGGGGCTCTCTCTTTTTTTCCCCTAAATCATATGCAAATGCATCCAGAGTATTAGGTGTTGCGGGCATGGATGGTACTAACAATTACTATTGAACAATATCAATGATCATGCTTGATATCCTCCTGCTTTGTGGGTGGGCCAAAGTGCTAAACATTGAGCCTTTGCCCAATAATTTCATTCTGATGACCCATTATATAGGGGTGTCATTTGACCTGTCATTTTGAATGGTTACCAATAAAGCTTGTTTGTGCATTATGCAAAGTGAAAGCAATGGCTGCCGGCCTTTATATAACTCCAAGTAATCCCCTAAAGGAAATGATATTTGCAATCATAAAAAAAAATGTGGTCCATATAGTACATGGTCATCAGATTTATCACAATCTATAAACTATGCCATGAGAGAACAAAAGTTCATCAAAAGTAGCTGATGAATTTTCATTAGAGAATTGGAATCATATTGGTTAAAGCGCCTTTAGATCCAAACCCCTGAAAAGAAAAACAAGCGTTCATGAGATGAGAATAGGATGAATCTCGTCTCTGTGACGTAAAAGAGCAAACGATAAATAAGAAATTGTGACTAATGAAGACGAGCATGCTCATGCCCTGGCTTAGAATTGGGTTGCTTATTATAGAAGTTGTGGAGTCCATTTTTTGTATTATTAACTGTAACGGCCTAAACCTGCCGCTAGCAAATATTGTCCTCTTTAGGCTCTCTTTCGGGTTTCTCCCTCAATGTTTTGCTAGGGAGA

mRNA sequence

TGATTACTCGCAAGTCACGAGATGCTATCCCCACTGCGTCTAAATTTCATCGCTCTCCTATCCCGCTTTCTACCTTCCGTCTTAGGCACCGCTTCTCTCTAAAACCCATAAAACAAAATACACAGTTCTTGAAACCTTACTAGGGTTTACCTCTCTTTCTTCCCCCATTTCCACTCACCGTAACTTCTCCTTCACTACTTTCATTTCCCCTTTCTTCGAATTTTTTCACATCTGTTATTCGCTCTGATCTACACTCCCCTTTCGCTCTTTCGTTTACAGAAAATTTCAAGAAATACAACTGAATCAATCGTCTGGAAAGATCTTGATCCTGTTTGAGGAAAAATGGCCTCGGTTGGACAGCCGCCCTCCTTCAAGAAACGGGAGTTATCCGCGACGAGAGAGGAGGATCAGCTTATTGTAACTCCTCTTGGGGCTGGCAATGAAGTGGGTCGATCCTGCGTTTATATGTCTTACAAAGGCAAAGTTGTGTTGTTCGATTGTGGAATTCACCCTGCTTACTCTGGCATGGCTGCTTTGCCGTACTTTGATGAGATTGACCCCTCAACAATTGATGTTCTGCTCATTACTCACTTTCATTTGGATCATGCCGCATCACTCCCTTATTTCTTGGAGAAGACTACATTCAAAGGACGAGTCTTTATGACTTATGCTACTAAAGCGATCTACAAGTTGTTGCTGTCGGACTTTGTGAAAGTGAGCAAAGTTTCGATTGAAGATATGTTGTTTGACGAGCAGGACATAAATCGTTCCATGGACAAAATTGAGGTCATTGATTTCCATCAAACAGTAGAAGTAAATGGTATTCGGTTTTGGTGTTACACTGCTGGTCATGTGCTTGGTGCTGCCATGTTTATGGTGGATATTGCTGGCGTCCGAGTCCTCTACACTGGAGACTATTCGCGTGAAGAAGATCGACATCTTCGAGCCGCCGAGATGCCTCAATTTTCTCCTGATGTTTGCATAATTGAATCTACATACGGTGTCCAGCTCCATCAACCTCGACATATTCGAGAGAAGCGCTTCACTGATGTTGTACATTCAACCATTTCTCAAGGTGGTCGTGTGCTAATTCCAGCTTTTGCCCTTGGACGTGCCCAGGAACTCCTCCTTATTCTTGATGAGTACTGGGCGAACCATCCCGACCTCCATAATGTTCCCATATATTATGCTTCTCCTCTGGCAAAAAGATGTTTGACTGTATACGAGACGTACACGCTCTCCATGAATGATAGGATCCAAAATGCCAAATCAAACCCCTTTAGATTCAAGTACATATCCCCACTAAAGAGCATTGAAGTTTTCAAAGATGTTGGCCCATCAGTGGTGATGGCCAGCCCTGGTGGACTTCAGAGCGGTTTATCACGACAACTCTTTGACTTGTGGTGTTCGGATAAGAAAAATTCATGTGTGCTTCCTGGTTATGTTGTTGAAGGGACACTGGCTAAGACTATCATCAATGAACCAAAGGAAGTCACCCTCATGAGTGGACTCACAGCTCCTCTTAACATGCAGGTTCATTACATTTCGTTCTCTGCTCATGCTGACTTTGCGCAGACCAGCGCGTTCTTGGAAGAGCTCATGCCGCCCAACATAATTCTCGTGCACGGAGAAGCTAATGAGATGGGGAGGCTCAAACAGAAGCTTATATCCCAGTTTTCTGATCGGAATACAAAGATTCTTACTCCAAAGAATTGTCAGTCTGTTGAAATGTACTTCAACTCTCAGAAGATGGCAAAAACTATTGGAAAATTGGCTGAGAAAACCCCAGAAGTGGGCGAAACTGTCAGCGGTTTACTGGTGAAGAAAGGATTTGCACATCAAATAATGGCACCAGATGATCTACACATCTTCTCTCAGCTATCAACTGCCAACATCAACCAGCGTATTACAATTCCATACTCGAACGCCTTTAATGTGATTGTACGCAGGCTCAAACAGGTATATGAGAGTGTAGAATCTTCAACAGACGAGGAGTCTGGCGTTCCAACAATTCGTGTGCACGATCGTGTGACAGTAAAGCACGAATCAGAGAGGCACATCTCACTTCACTGGACATCAGACCCGTTAAGTGACATGGTATCGGATTCTGTTGTAGCTCTCATCCTGAACATCAACCGCGAGGTACCGAAAGTCATCGTCGAGTCAGAGGCTGTAAAAACGGAAGAAGAGAACAAAAAGAAAGCCGAGAAGGTAATTCATGCCCTCCTTGTTTCACTCTTTGGCAATGTGAAGTTAGGAGGAAATGGGAAGCTGGTGATCAACGTTGATGGGAGTATAGCAGAGCTTGATAAACAGAGTGGGGAGGTAGAAAGTGAAAATGAAGTTCTCAAGGAAAGACTATTCTCTTCGTCTCTCTGTCTCTCTGTCTCTCTGATTGCGGATGCAATGCGGGAGGAGAGATTCCAGCGTTGAAAATTGCGCAGTTATTGGGCAGTCATGGAGAATTCCAAGGTGTTGTCAAACACGAGAAATGTGATTTACTCTGGAAAGCATGCTCTACTTCCTCCCAAGAGTCCATTTCCTAGTGGTTCCTCCCCATATGCTGATTATTTCCCCAGTCCCATTATTGGGTCAAGAGCTGTGCAGAATCCCAGAGAGGGAAATGTGCACCATCATAGAACATCATCTGAAAGTCTTCTAATGGAGGATCAACCTTCTTGGCTCAATGATCTTCTCGATGAACCTGAAACACCTGTTCAAAGAGGTGGTCATCGACGTTCATCGAGTGACTCCTTTGCGTACTTGGATGCAGGAAATGTTTTGAATGAAAATTATACGCAAGATGACTCCCAATGTAAAAATATGTATTTACCTTCCTGGGCATCACAAGATTTTGATTTCCGCAAAGATCCCCATCAAGCTTCTTTCAATATGAAAGCAAGCTCGATCAAACAGAAGAACAGGGCATGGGAATTGCCTCCAGCTACATTGACAACTAACCTGGGTTCCCGGCCTTCTGCCAAAAGTAGCATTCTTCTTGAGAGCTCGAGGTCGTTAAGTACACCACAGGAAGTAAATGGGTTCTCATCAACAACTACTGAAAAGCAGGATTCAGCAGAAACCAGTAGTATGCCTGATCGAAAGTCATCCGAGAGAATAGATGGTCCCCATATTAAGCCAGCTCCGGCTGATACAGATAATAAAAGAGCTAAACAGCAATTTGCTCAACGTTCACGTGTACGTAAACTTCAGTACATTGCAGAGCTAGAACGGAACGTACAAGCTTTACAAGCAGAAGGTTCTGAAGTTTCTGCCGAGCTTGAATTTCTTAGTCAGCAAAACTTAATTCTTGGCATGGAGAATAAAGCACTCAAGCAGCGATTAGAAAATTTATCTCAGGAGCAGCTTATAAAATACTTGGAGCATGAAGTGCTGGAGAGGGAGATAGGAAGACTACGAATGTTGTACCAACAGCAACAACAGCCTCAGCCACCACCTTCCAGCCTTAAACGTACCAAGAGCCGAGACCTTGAGACGCAGTTTGCCAAGCTCTCTTTGAGACAGAAGGATGGGCGTTCGGGTCCCGAGTCTATGGCCGGTCCAGTTCAAATCTAGATTTGTAAATCAGTTGGGAGTTGTTGTGCATGGCCTAACGATTTCTCGGATGTACGAAACCATGTTTGTCGATTACTTGTGCAGGATACCCAGGGCTCATCAAAAGTCAGTCTGTACTCAGCATTTGACTGTGGTGGCTCTTGGTAGATGTGAATGAGAATCTGCCAATGCACCTGGCTGTCATGGCTGGAAAATGTCCTCTTTTGGGCTGGTGACTTGCCGGTGGTGATATCTATTTTCTAAGCATGGTCTCCTATGGTTTTTTTTTATCAGGATGTAATGAAAATTCTAGGTGTTTTTCAAGTCCCCCATTCCTCTTGGGGGGCTCTCTCTTTTTTTCCCCTAAATCATATGCAAATGCATCCAGAGTATTAGGTGTTGCGGGCATGGATGGTACTAACAATTACTATTGAACAATATCAATGATCATGCTTGATATCCTCCTGCTTTGTGGGTGGGCCAAAGTGCTAAACATTGAGCCTTTGCCCAATAATTTCATTCTGATGACCCATTATATAGGGGTGTCATTTGACCTGTCATTTTGAATGGTTACCAATAAAGCTTGTTTGTGCATTATGCAAAGTGAAAGCAATGGCTGCCGGCCTTTATATAACTCCAAGTAATCCCCTAAAGGAAATGATATTTGCAATCATAAAAAAAAATGTGGTCCATATAGTACATGGTCATCAGATTTATCACAATCTATAAACTATGCCATGAGAGAACAAAAGTTCATCAAAAGTAGCTGATGAATTTTCATTAGAGAATTGGAATCATATTGGTTAAAGCGCCTTTAGATCCAAACCCCTGAAAAGAAAAACAAGCGTTCATGAGATGAGAATAGGATGAATCTCGTCTCTGTGACGTAAAAGAGCAAACGATAAATAAGAAATTGTGACTAATGAAGACGAGCATGCTCATGCCCTGGCTTAGAATTGGGTTGCTTATTATAGAAGTTGTGGAGTCCATTTTTTGTATTATTAACTGTAACGGCCTAAACCTGCCGCTAGCAAATATTGTCCTCTTTAGGCTCTCTTTCGGGTTTCTCCCTCAATGTTTTGCTAGGGAGA

Coding sequence (CDS)

ATGGCCTCGGTTGGACAGCCGCCCTCCTTCAAGAAACGGGAGTTATCCGCGACGAGAGAGGAGGATCAGCTTATTGTAACTCCTCTTGGGGCTGGCAATGAAGTGGGTCGATCCTGCGTTTATATGTCTTACAAAGGCAAAGTTGTGTTGTTCGATTGTGGAATTCACCCTGCTTACTCTGGCATGGCTGCTTTGCCGTACTTTGATGAGATTGACCCCTCAACAATTGATGTTCTGCTCATTACTCACTTTCATTTGGATCATGCCGCATCACTCCCTTATTTCTTGGAGAAGACTACATTCAAAGGACGAGTCTTTATGACTTATGCTACTAAAGCGATCTACAAGTTGTTGCTGTCGGACTTTGTGAAAGTGAGCAAAGTTTCGATTGAAGATATGTTGTTTGACGAGCAGGACATAAATCGTTCCATGGACAAAATTGAGGTCATTGATTTCCATCAAACAGTAGAAGTAAATGGTATTCGGTTTTGGTGTTACACTGCTGGTCATGTGCTTGGTGCTGCCATGTTTATGGTGGATATTGCTGGCGTCCGAGTCCTCTACACTGGAGACTATTCGCGTGAAGAAGATCGACATCTTCGAGCCGCCGAGATGCCTCAATTTTCTCCTGATGTTTGCATAATTGAATCTACATACGGTGTCCAGCTCCATCAACCTCGACATATTCGAGAGAAGCGCTTCACTGATGTTGTACATTCAACCATTTCTCAAGGTGGTCGTGTGCTAATTCCAGCTTTTGCCCTTGGACGTGCCCAGGAACTCCTCCTTATTCTTGATGAGTACTGGGCGAACCATCCCGACCTCCATAATGTTCCCATATATTATGCTTCTCCTCTGGCAAAAAGATGTTTGACTGTATACGAGACGTACACGCTCTCCATGAATGATAGGATCCAAAATGCCAAATCAAACCCCTTTAGATTCAAGTACATATCCCCACTAAAGAGCATTGAAGTTTTCAAAGATGTTGGCCCATCAGTGGTGATGGCCAGCCCTGGTGGACTTCAGAGCGGTTTATCACGACAACTCTTTGACTTGTGGTGTTCGGATAAGAAAAATTCATGTGTGCTTCCTGGTTATGTTGTTGAAGGGACACTGGCTAAGACTATCATCAATGAACCAAAGGAAGTCACCCTCATGAGTGGACTCACAGCTCCTCTTAACATGCAGGTTCATTACATTTCGTTCTCTGCTCATGCTGACTTTGCGCAGACCAGCGCGTTCTTGGAAGAGCTCATGCCGCCCAACATAATTCTCGTGCACGGAGAAGCTAATGAGATGGGGAGGCTCAAACAGAAGCTTATATCCCAGTTTTCTGATCGGAATACAAAGATTCTTACTCCAAAGAATTGTCAGTCTGTTGAAATGTACTTCAACTCTCAGAAGATGGCAAAAACTATTGGAAAATTGGCTGAGAAAACCCCAGAAGTGGGCGAAACTGTCAGCGGTTTACTGGTGAAGAAAGGATTTGCACATCAAATAATGGCACCAGATGATCTACACATCTTCTCTCAGCTATCAACTGCCAACATCAACCAGCGTATTACAATTCCATACTCGAACGCCTTTAATGTGATTGTACGCAGGCTCAAACAGGTATATGAGAGTGTAGAATCTTCAACAGACGAGGAGTCTGGCGTTCCAACAATTCGTGTGCACGATCGTGTGACAGTAAAGCACGAATCAGAGAGGCACATCTCACTTCACTGGACATCAGACCCGTTAAGTGACATGGTATCGGATTCTGTTGTAGCTCTCATCCTGAACATCAACCGCGAGGTACCGAAAGTCATCGTCGAGTCAGAGGCTGTAAAAACGGAAGAAGAGAACAAAAAGAAAGCCGAGAAGGTAATTCATGCCCTCCTTGTTTCACTCTTTGGCAATGTGAAGTTAGGAGGAAATGGGAAGCTGGTGATCAACGTTGATGGGAGTATAGCAGAGCTTGATAAACAGAGTGGGGAGGTAGAAAGTGAAAATGAAGTTCTCAAGGAAAGACTATTCTCTTCGTCTCTCTGTCTCTCTGTCTCTCTGATTGCGGATGCAATGCGGGAGGAGAGATTCCAGCGTTGA

Protein sequence

MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESENEVLKERLFSSSLCLSVSLIADAMREERFQR
Homology
BLAST of Cp4.1LG01g16320 vs. ExPASy Swiss-Prot
Match: Q9C952 (Cleavage and polyadenylation specificity factor subunit 3-I OS=Arabidopsis thaliana OX=3702 GN=CPSF73-I PE=1 SV=1)

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 548/668 (82.04%), Postives = 625/668 (93.56%), Query Frame = 0

Query: 9   SFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYF 68
           S K+RE   +R+ DQLIVTPLGAG+EVGRSCVYMS++GK +LFDCGIHPAYSGMAALPYF
Sbjct: 7   SLKRREQPISRDGDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYF 66

Query: 69  DEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKV 128
           DEIDPS+IDVLLITHFH+DHAASLPYFLEKTTF GRVFMT+ATKAIYKLLL+D+VKVSKV
Sbjct: 67  DEIDPSSIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKV 126

Query: 129 SIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLY 188
           S+EDMLFDEQDIN+SMDKIEVIDFHQTVEVNGI+FWCYTAGHVLGAAMFMVDIAGVR+LY
Sbjct: 127 SVEDMLFDEQDINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILY 186

Query: 189 TGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRV 248
           TGDYSREEDRHLRAAE+PQFSPD+CIIEST GVQLHQ RHIREKRFTDV+HST++QGGRV
Sbjct: 187 TGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRV 246

Query: 249 LIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQN- 308
           LIPAFALGRAQELLLILDEYWANHPDLHN+PIYYASPLAK+C+ VY+TY LSMNDRI+N 
Sbjct: 247 LIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQ 306

Query: 309 -AKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPG 368
            A SNPF FK+ISPL SI+ F DVGPSVVMA+PGGLQSGLSRQLFD WCSDKKN+C++PG
Sbjct: 307 FANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPG 366

Query: 369 YVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIIL 428
           Y+VEGTLAKTIINEPKEVTLM+GLTAPLNMQVHYISFSAHAD+AQTS FL+ELMPPNIIL
Sbjct: 367 YMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL 426

Query: 429 VHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGE 488
           VHGEANEM RLKQKL+++F D NTKI+TPKNC+SVEMYFNS+K+AKTIG+LAEKTP+VG+
Sbjct: 427 VHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGD 486

Query: 489 TVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVES 548
           TVSG+LVKKGF +QIMAPD+LH+FSQLSTA + QRITIP+  AF VI  RL++++ESVE 
Sbjct: 487 TVSGILVKKGFTYQIMAPDELHVFSQLSTATVTQRITIPFVGAFGVIKHRLEKIFESVEF 546

Query: 549 STDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIV 608
           STDEESG+P ++VH+RVTVK ESE+HISL W+SDP+SDMVSDS+VALILNI+REVPK+++
Sbjct: 547 STDEESGLPALKVHERVTVKQESEKHISLQWSSDPISDMVSDSIVALILNISREVPKIVM 606

Query: 609 ESE-AVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESE 668
           E E AVK+EEEN KK EKVI+ALLVSLFG+VKLG NGKLVI VDG++A+LDK+SGEVESE
Sbjct: 607 EEEDAVKSEEENGKKVEKVIYALLVSLFGDVKLGENGKLVIRVDGNVAQLDKESGEVESE 666

Query: 669 NEVLKERL 674
           +  LKER+
Sbjct: 667 HSGLKERV 674

BLAST of Cp4.1LG01g16320 vs. ExPASy Swiss-Prot
Match: Q9UKF6 (Cleavage and polyadenylation specificity factor subunit 3 OS=Homo sapiens OX=9606 GN=CPSF3 PE=1 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 7.9e-198
Identity = 344/655 (52.52%), Postives = 467/655 (71.30%), Query Frame = 0

Query: 20  EEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDEIDPSTIDVL 79
           E DQL++ PLGAG EVGRSC+ + +KG+ ++ DCGIHP   GM ALPY D IDP+ ID+L
Sbjct: 8   ESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLL 67

Query: 80  LITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQD 139
           LI+HFHLDH  +LP+FL+KT+FKGR FMT+ATKAIY+ LLSD+VKVS +S +DML+ E D
Sbjct: 68  LISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETD 127

Query: 140 INRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRH 199
           +  SMDKIE I+FH+  EV GI+FWCY AGHVLGAAMFM++IAGV++LYTGD+SR+EDRH
Sbjct: 128 LEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRH 187

Query: 200 LRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQ 259
           L AAE+P   PD+ IIESTYG  +H+ R  RE RF + VH  +++GGR LIP FALGRAQ
Sbjct: 188 LMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQ 247

Query: 260 ELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRI--QNAKSNPFRFKY 319
           ELLLILDEYW NHP+LH++PIYYAS LAK+C+ VY+TY  +MND+I  Q   +NPF FK+
Sbjct: 248 ELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKH 307

Query: 320 ISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTI 379
           IS LKS++ F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  ++ GY VEGTLAK I
Sbjct: 308 ISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHI 367

Query: 380 INEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRL 439
           ++EP+E+T MSG   PL M V YISFSAH D+ QTS F+  L PP++ILVHGE NEM RL
Sbjct: 368 MSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARL 427

Query: 440 KQKLISQFSDR---NTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVK 499
           K  LI ++ D    + ++  P+N ++V + F  +K+AK +G LA+K PE G+ VSG+LVK
Sbjct: 428 KAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVK 487

Query: 500 KGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGV 559
           + F + I++P DL  ++ L+ + + Q   IPY+  FN++  +L+++   VE    +E   
Sbjct: 488 RNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLCYQLQKLTGDVEELEIQEK-- 547

Query: 560 PTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNI--NREVPKVIVESEAVK 619
           P ++V   +TV  E    + L W ++P +DM +D+V  +IL +  N ++ K  V+  + K
Sbjct: 548 PALKVFKNITVIQEPGM-VVLEWLANPSNDMYADTVTTVILEVQSNPKIRKGAVQKVSKK 607

Query: 620 TEEENKKKAEKVIHALLVSLFGN--VKLGGNGKLVINVDGSIAELDKQSGEVESE 666
            E     K    +  +L  +FG   V +  +  L + VDG  A L+ ++  VE E
Sbjct: 608 LEMHVYSKR---LEIMLQDIFGEDCVSVKDDSILSVTVDGKTANLNLETRTVECE 656

BLAST of Cp4.1LG01g16320 vs. ExPASy Swiss-Prot
Match: P79101 (Cleavage and polyadenylation specificity factor subunit 3 OS=Bos taurus OX=9913 GN=CPSF3 PE=1 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 1.8e-197
Identity = 343/655 (52.37%), Postives = 466/655 (71.15%), Query Frame = 0

Query: 20  EEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDEIDPSTIDVL 79
           E DQL++ PLGAG EVGRSC+ + +KG+ ++ DCGIHP   GM ALPY D IDP+ ID+L
Sbjct: 8   ESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLL 67

Query: 80  LITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQD 139
           LI+HFHLDH  +LP+FL+KT+FKGR FMT+ATKAIY+ LLSD+VKVS +S +DML+ E D
Sbjct: 68  LISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETD 127

Query: 140 INRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRH 199
           +  SMDKIE I+FH+  EV GI+FWCY AGHVLGAAMFM++IAGV++LYTGD+SR+EDRH
Sbjct: 128 LEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRH 187

Query: 200 LRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQ 259
           L AAE+P   PD+ IIESTYG  +H+ R  RE RF + VH  +++GGR LIP FALGRAQ
Sbjct: 188 LMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQ 247

Query: 260 ELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRI--QNAKSNPFRFKY 319
           ELLLILDEYW NHP+LH++PIYYAS LAK+C+ VY+TY  +MND+I  Q   +NPF FK+
Sbjct: 248 ELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKH 307

Query: 320 ISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTI 379
           IS LKS++ F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  ++ GY VEGTLAK I
Sbjct: 308 ISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHI 367

Query: 380 INEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRL 439
           ++EP+E+T MSG   PL M V YISFSAH D+ QTS F+  L PP++ILVHGE NEM RL
Sbjct: 368 MSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARL 427

Query: 440 KQKLISQFSDR---NTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVK 499
           K  LI ++ D    + ++  P+N ++V + F  +K+AK +G LA+K PE G+ VSG+LVK
Sbjct: 428 KAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVK 487

Query: 500 KGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGV 559
           + F + I++P DL  ++ L+ + + Q   IPY+  FN++  +L+++   VE    +E   
Sbjct: 488 RNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGDVEELEIQEK-- 547

Query: 560 PTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNI--NREVPKVIVESEAVK 619
           P ++V   +TV  E    + L W ++P +DM +D+V  +IL +  N ++ K  V+  + K
Sbjct: 548 PALKVFKNITVIQEPGM-VVLEWLANPSNDMYADTVTTVILEVQSNPKIRKGAVQKVSKK 607

Query: 620 TEEENKKKAEKVIHALLVSLFGN--VKLGGNGKLVINVDGSIAELDKQSGEVESE 666
            E     K    +  +L  +FG   V +     L + VDG  A ++ ++  VE E
Sbjct: 608 LEMHVYSKR---LEIMLQDIFGEDCVSVKDGSILSVTVDGKTANINLETRTVECE 656

BLAST of Cp4.1LG01g16320 vs. ExPASy Swiss-Prot
Match: Q86A79 (Cleavage and polyadenylation specificity factor subunit 3 OS=Dictyostelium discoideum OX=44689 GN=cpsf3 PE=3 SV=1)

HSP 1 Score: 687.6 bits (1773), Expect = 1.5e-196
Identity = 354/737 (48.03%), Postives = 493/737 (66.89%), Query Frame = 0

Query: 12  KRELSATREEDQLI-VTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDE 71
           KR L    E+D ++ +TP+G+G+EVGRSCV + YKGK V+FDCG+HPAYSG+ +LP+FD 
Sbjct: 23  KRPLKGGTEDDDILEITPIGSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDS 82

Query: 72  I--DPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKV 131
           I  D   ID+LL++HFHLDHAA++PYF+ KT FKGRVFMT+ TKAIY +LLSD+VKVS +
Sbjct: 83  IESDIPDIDLLLVSHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSDYVKVSNI 142

Query: 132 S-IEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVL 191
           +  +DMLFD+ D++RS++KIE + + Q VE NGI+  C+ AGHVLGAAMFM++IAGV++L
Sbjct: 143 TRDDDMLFDKSDLDRSLEKIEKVRYRQKVEHNGIKVTCFNAGHVLGAAMFMIEIAGVKIL 202

Query: 192 YTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGR 251
           YTGD+SR+EDRHL  AE P    DV IIESTYGVQ+H+PR  REKRFT  VH  + + G+
Sbjct: 203 YTGDFSRQEDRHLMGAETPPVKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGK 262

Query: 252 VLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRI-- 311
            LIP FALGRAQELLLILDEYW  +P LH+VPIYYAS LAK+C+ VY TY   MNDR+  
Sbjct: 263 CLIPVFALGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRA 322

Query: 312 QNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLP 371
           Q   SNPF FK+I  +K IE F D GP V MASPG LQSGLSRQLF+ WCSDK+N  V+P
Sbjct: 323 QFDVSNPFEFKHIKNIKGIESFDDRGPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIP 382

Query: 372 GYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNII 431
           GY VEGTLAK I++EP E+T +  +  PLN+ V Y+SFSAH+DF QTS F++E+ PP+++
Sbjct: 383 GYSVEGTLAKHIMSEPAEITRLDNVNVPLNLTVSYVSFSAHSDFLQTSEFIQEIQPPHVV 442

Query: 432 LVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVG 491
           LVHG+ANEM RL+Q L+++F   N  +LTPKN  SV + F  +K+AKT+G +    P+  
Sbjct: 443 LVHGDANEMSRLRQSLVAKFKTIN--VLTPKNAMSVALEFRPEKVAKTLGSIITNPPKQN 502

Query: 492 ETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVE 551
           + + G+LV K F H I++  D+H ++ L T  I Q++T+P++  +++++  L+Q+YE + 
Sbjct: 503 DIIQGILVTKDFTHHILSASDIHNYTNLKTNIIKQKLTLPFAQTYHILISTLEQIYEQII 562

Query: 552 SSTDEESG----VPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNI---- 611
            ST+   G     PTI +++ + + +     I L W S+ ++DM+ DS++ALI  I    
Sbjct: 563 ESTESTGGGGNEKPTITIYNEIKLIYNIGVSIILEWNSNTVNDMICDSIIALISQIELNP 622

Query: 612 ---------------NREVPKVIVESEAVKTEEE------------------------NK 671
                            E+ K  +E E  K +E+                        NK
Sbjct: 623 LSIKVRNPNFNNIDEKEEITKDDIEKEKEKEKEQQDGDDDDDDEIQIKVVSRKSRKLSNK 682

Query: 672 KKAEKVIHALLVSLFGNVKLGGNGKLVI--NVDGSIAELDKQSGEVESENEVLKERLFSS 692
                 +  LL   +GN K+  N  L++  N+D   A +  ++  VES +++LK+++ +S
Sbjct: 683 LNTITEVKLLLEQQYGNFKVDENDPLILHFNLDNQKAIIHLETLTVESLDQILKQKIENS 742

BLAST of Cp4.1LG01g16320 vs. ExPASy Swiss-Prot
Match: Q9QXK7 (Cleavage and polyadenylation specificity factor subunit 3 OS=Mus musculus OX=10090 GN=Cpsf3 PE=1 SV=2)

HSP 1 Score: 687.6 bits (1773), Expect = 1.5e-196
Identity = 341/655 (52.06%), Postives = 466/655 (71.15%), Query Frame = 0

Query: 20  EEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDEIDPSTIDVL 79
           E DQL++ PLGAG EVGRSC+ + +KG+ ++ DCGIHP   GM ALPY D IDP+ ID+L
Sbjct: 8   ESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLL 67

Query: 80  LITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQD 139
           LI+HFHLDH  +LP+FL+KT+FKGR FMT+ATKAIY+ LLSD+VKVS +S +DML+ E D
Sbjct: 68  LISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETD 127

Query: 140 INRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRH 199
           +  SMDKIE I+FH+  EV GI+FWCY AGHVLGAAMFM++IAGV++LYTGD+SR+EDRH
Sbjct: 128 LEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRH 187

Query: 200 LRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQ 259
           L AAE+P   PD+ IIESTYG  +H+ R  RE RF + VH  +++GGR LIP FALGRAQ
Sbjct: 188 LMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQ 247

Query: 260 ELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRI--QNAKSNPFRFKY 319
           ELLLILDEYW NHP+LH++PIYYAS LAK+C+ VY+TY  +MND+I  Q   +NPF FK+
Sbjct: 248 ELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKH 307

Query: 320 ISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTI 379
           IS LKS++ F D+GPSVVMASPG +Q+GLSR+LF+ WC+DK+N  ++ GY VEGTLAK I
Sbjct: 308 ISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHI 367

Query: 380 INEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRL 439
           ++EP+E+T MSG   PL M V YISFSAH D+ QTS F+  L PP++ILVHGE NEM RL
Sbjct: 368 MSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARL 427

Query: 440 KQKLISQFSDR---NTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVK 499
           K  LI ++ D    + ++  P+N ++V + F  +K+AK +G LA+K PE G+ VSG+LVK
Sbjct: 428 KAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVK 487

Query: 500 KGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGV 559
           + F + I++P DL  ++ L+ + + Q   IPY+  F ++  +L+++   VE    +E   
Sbjct: 488 RNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEK-- 547

Query: 560 PTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNI--NREVPKVIVESEAVK 619
           P ++V   +TV  E    + L W ++P +DM +D+V  +IL +  N ++ K  V+  + K
Sbjct: 548 PALKVFKSITVVQEPGM-VVLEWLANPSNDMYADTVTTVILEVQSNPKIRKGAVQKVSKK 607

Query: 620 TEEENKKKAEKVIHALLVSLFGN--VKLGGNGKLVINVDGSIAELDKQSGEVESE 666
            E     K    +  +L  +FG   V +  +  L + VDG  A ++ ++  VE E
Sbjct: 608 LEMHVYSKR---LEVMLQDIFGEDCVSVKDDSVLSVTVDGKTANINLETRAVECE 656

BLAST of Cp4.1LG01g16320 vs. NCBI nr
Match: XP_023524148.1 (cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita pepo subsp. pepo] >XP_023524157.1 cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1329 bits (3440), Expect = 0.0
Identity = 672/673 (99.85%), Postives = 673/673 (100.00%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS
Sbjct: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG
Sbjct: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENEVLKER+
Sbjct: 661 EVESENEVLKERV 673

BLAST of Cp4.1LG01g16320 vs. NCBI nr
Match: XP_022956704.1 (cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita moschata] >XP_022956705.1 cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita moschata] >XP_022956706.1 cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita moschata] >KAG6601616.1 Cleavage and polyadenylation specificity factor subunit 3-I, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1326 bits (3432), Expect = 0.0
Identity = 670/673 (99.55%), Postives = 673/673 (100.00%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPSFKKRELSATREED+LIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS
Sbjct: 1   MASVGQPPSFKKRELSATREEDKLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEEN+KKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG
Sbjct: 601 VPKVIVESEAVKTEEENEKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENEVLKER+
Sbjct: 661 EVESENEVLKERV 673

BLAST of Cp4.1LG01g16320 vs. NCBI nr
Match: XP_022997543.1 (cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita maxima] >XP_022997550.1 cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita maxima] >XP_022997558.1 cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita maxima])

HSP 1 Score: 1319 bits (3414), Expect = 0.0
Identity = 667/673 (99.11%), Postives = 670/673 (99.55%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPSFKKRELSATREED+LIVTPLGAG EVGRSCVYMSYKGKVVLFDCGIHPAYS
Sbjct: 1   MASVGQPPSFKKRELSATREEDKLIVTPLGAGTEVGRSCVYMSYKGKVVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHP+LHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPTI VHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTILVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEENKKKAEKV HALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG
Sbjct: 601 VPKVIVESEAVKTEEENKKKAEKVTHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENEVLKER+
Sbjct: 661 EVESENEVLKERV 673

BLAST of Cp4.1LG01g16320 vs. NCBI nr
Match: KAG7032374.1 (Cleavage and polyadenylation specificity factor subunit 3-I [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1305 bits (3378), Expect = 0.0
Identity = 669/708 (94.49%), Postives = 672/708 (94.92%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPSFKKRELSATREED+LIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS
Sbjct: 1   MASVGQPPSFKKRELSATREEDKLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITH-----------------------------------FH 120
           GMAALPYFDEIDPSTIDVLLIT                                    FH
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITQYGHCFEIFAPKDEKRSLPDHIFLGTTCSTFCLKLTFH 120

Query: 121 LDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQDINRSMD 180
           LDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQDINRSMD
Sbjct: 121 LDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKVSIEDMLFDEQDINRSMD 180

Query: 181 KIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAEM 240
           KIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAEM
Sbjct: 181 KIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAEM 240

Query: 241 PQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQELLLIL 300
           PQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQELLLIL
Sbjct: 241 PQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQELLLIL 300

Query: 301 DEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQNAKSNPFRFKYISPLKSIE 360
           DEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQNAKSNPFRFKYISPLKSIE
Sbjct: 301 DEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQNAKSNPFRFKYISPLKSIE 360

Query: 361 VFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTIINEPKEVT 420
           VFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTIINEPKEVT
Sbjct: 361 VFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLAKTIINEPKEVT 420

Query: 421 LMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRLKQKLISQF 480
           LMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRLKQKLISQF
Sbjct: 421 LMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRLKQKLISQF 480

Query: 481 SDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVKKGFAHQIMAPD 540
           SDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVKKGFAHQIMAPD
Sbjct: 481 SDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGETVSGLLVKKGFAHQIMAPD 540

Query: 541 DLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGVPTIRVHDRVTV 600
           DLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGVPTIRVHDRVTV
Sbjct: 541 DLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVESSTDEESGVPTIRVHDRVTV 600

Query: 601 KHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIVESEAVKTEEENKKKAEKVI 660
           KHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIVESEAVKTEEEN+KKAEKVI
Sbjct: 601 KHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIVESEAVKTEEENEKKAEKVI 660

Query: 661 HALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESENEVLKERL 673
           HALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESENEVLKER+
Sbjct: 661 HALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESENEVLKERV 708

BLAST of Cp4.1LG01g16320 vs. NCBI nr
Match: XP_038877886.1 (cleavage and polyadenylation specificity factor subunit 3-I [Benincasa hispida])

HSP 1 Score: 1292 bits (3344), Expect = 0.0
Identity = 648/673 (96.29%), Postives = 663/673 (98.51%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPS KKRE SATREEDQLI+TPLGAGNEVGRSCVYMSYKGK+VLFDCGIHPAYS
Sbjct: 1   MASVGQPPSLKKREASATREEDQLIITPLGAGNEVGRSCVYMSYKGKIVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVS+EDML+DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSVEDMLYDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIP FALGRAQELLLILDEYWANHP+LHN+PIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPVFALGRAQELLLILDEYWANHPELHNIPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFD+WCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGL APLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLMAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKL+SQF+DRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLMSQFADRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEVGETVSGLLVKKGFA+QIMAP+DLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEVGETVSGLLVKKGFAYQIMAPEDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEES VP IRVHDRVTVKHESERH+SLHW SDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESEVPMIRVHDRVTVKHESERHVSLHWISDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEEN KKAEKVIHALLVSLFG+VKLG NGKLVINVDG+IAELDKQSG
Sbjct: 601 VPKVIVESEAVKTEEENNKKAEKVIHALLVSLFGDVKLGENGKLVINVDGNIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENE LKER+
Sbjct: 661 EVESENEALKERV 673

BLAST of Cp4.1LG01g16320 vs. ExPASy TrEMBL
Match: A0A6J1GZU7 (cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita moschata OX=3662 GN=LOC111458343 PE=4 SV=1)

HSP 1 Score: 1326 bits (3432), Expect = 0.0
Identity = 670/673 (99.55%), Postives = 673/673 (100.00%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPSFKKRELSATREED+LIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS
Sbjct: 1   MASVGQPPSFKKRELSATREEDKLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEEN+KKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG
Sbjct: 601 VPKVIVESEAVKTEEENEKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENEVLKER+
Sbjct: 661 EVESENEVLKERV 673

BLAST of Cp4.1LG01g16320 vs. ExPASy TrEMBL
Match: A0A6J1KE99 (cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita maxima OX=3661 GN=LOC111492433 PE=4 SV=1)

HSP 1 Score: 1319 bits (3414), Expect = 0.0
Identity = 667/673 (99.11%), Postives = 670/673 (99.55%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVGQPPSFKKRELSATREED+LIVTPLGAG EVGRSCVYMSYKGKVVLFDCGIHPAYS
Sbjct: 1   MASVGQPPSFKKRELSATREEDKLIVTPLGAGTEVGRSCVYMSYKGKVVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHP+LHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPTI VHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTILVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEENKKKAEKV HALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG
Sbjct: 601 VPKVIVESEAVKTEEENKKKAEKVTHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENEVLKER+
Sbjct: 661 EVESENEVLKERV 673

BLAST of Cp4.1LG01g16320 vs. ExPASy TrEMBL
Match: A0A6J1G1N4 (cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita moschata OX=3662 GN=LOC111449842 PE=4 SV=1)

HSP 1 Score: 1287 bits (3330), Expect = 0.0
Identity = 646/673 (95.99%), Postives = 663/673 (98.51%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVG  PS KKRE SATREEDQLI+TPLGAGNEVGRSCVYMSYKGK+VLFDCGIHPAYS
Sbjct: 1   MASVGPSPSLKKRESSATREEDQLIITPLGAGNEVGRSCVYMSYKGKIVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDML+DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLYDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHP+LHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGL APLNMQVHYISFSAHAD+AQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLMAPLNMQVHYISFSAHADYAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKL+S+F+DRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAE 
Sbjct: 421 PPNIILVHGEANEMGRLKQKLMSKFADRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEN 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEV ETVSGLLVKKGFA+QIMAPDDLHIFSQLSTANI QRITIPY+NAFNVIVRRLKQV
Sbjct: 481 TPEVDETVSGLLVKKGFAYQIMAPDDLHIFSQLSTANIYQRITIPYTNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPT+RVHDRVTVKHESERH+SLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTVRVHDRVTVKHESERHVSLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEEN+KKAEKVIHALLVSLFG+VKLG NGKLV+NVDGSIAE+DKQSG
Sbjct: 601 VPKVIVESEAVKTEEENEKKAEKVIHALLVSLFGDVKLGENGKLVVNVDGSIAEVDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENE LKER+
Sbjct: 661 EVESENEALKERV 673

BLAST of Cp4.1LG01g16320 vs. ExPASy TrEMBL
Match: A0A6J1HTF4 (cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita maxima OX=3661 GN=LOC111466435 PE=4 SV=1)

HSP 1 Score: 1284 bits (3322), Expect = 0.0
Identity = 645/673 (95.84%), Postives = 662/673 (98.37%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASVG  PS KKRE SATREEDQLIVTPLGAGNEVGRSCVYMSYKGK+VLFDCGIHPAYS
Sbjct: 1   MASVGPSPSLKKRESSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKIVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLL+THFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVSIEDML+DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSIEDMLYDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHP+LHNVPIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNVPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           SCVLPGYVVEGTLAKTIINEPKEVTLMSGL APLNMQVHYISFSAHAD+AQTSAFLEELM
Sbjct: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLMAPLNMQVHYISFSAHADYAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKL+S+F+DRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAE 
Sbjct: 421 PPNIILVHGEANEMGRLKQKLMSKFADRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEN 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPEV ETVSGLLVKKGFA+QIMAPDDLHIFSQLSTANI QRITIPY+NAFNVIVRRLKQV
Sbjct: 481 TPEVDETVSGLLVKKGFAYQIMAPDDLHIFSQLSTANIYQRITIPYTNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVESSTDEESGVPT+ VHDRVTVKHESERH+SLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESSTDEESGVPTVCVHDRVTVKHESERHVSLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEAVKTEEEN+KKAEKVIHALLVSLFG+VKLG NGKLV+NVDGSIAE+DKQSG
Sbjct: 601 VPKVIVESEAVKTEEENEKKAEKVIHALLVSLFGDVKLGENGKLVVNVDGSIAEVDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENE LKER+
Sbjct: 661 EVESENEALKERV 673

BLAST of Cp4.1LG01g16320 vs. ExPASy TrEMBL
Match: A0A6J1DE61 (cleavage and polyadenylation specificity factor subunit 3-I OS=Momordica charantia OX=3673 GN=LOC111019708 PE=4 SV=1)

HSP 1 Score: 1283 bits (3319), Expect = 0.0
Identity = 644/673 (95.69%), Postives = 661/673 (98.22%), Query Frame = 0

Query: 1   MASVGQPPSFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYS 60
           MASV QPPS KKRE SATREEDQLI+TPLGAGNEVGRSCVYMSYKGK+VLFDCGIHPAYS
Sbjct: 1   MASVVQPPSLKKRESSATREEDQLIITPLGAGNEVGRSCVYMSYKGKIVLFDCGIHPAYS 60

Query: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120
           GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS
Sbjct: 61  GMAALPYFDEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLS 120

Query: 121 DFVKVSKVSIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180
           DFVKVSKVS+EDML+DEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD
Sbjct: 121 DFVKVSKVSVEDMLYDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVD 180

Query: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240
           IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS
Sbjct: 181 IAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHS 240

Query: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLS 300
           TISQGGRVLIPAFALGRAQELLLILDEYWANHP+LHN+PIYYASPLAKRCLTVYETYTLS
Sbjct: 241 TISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNIPIYYASPLAKRCLTVYETYTLS 300

Query: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKN 360
           MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFD+WCSDKKN
Sbjct: 301 MNDRIQNAKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKKN 360

Query: 361 SCVLPGYVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELM 420
           +CVLPGYVVEGTLAKTIINEPKEVTLM+GL APLNMQVHYISFSAHADFAQTSAFLEELM
Sbjct: 361 ACVLPGYVVEGTLAKTIINEPKEVTLMNGLMAPLNMQVHYISFSAHADFAQTSAFLEELM 420

Query: 421 PPNIILVHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEK 480
           PPNIILVHGEANEMGRLKQKLISQF+DRNTKILTPKNCQSVEMYFNSQKMAKTIG+LAEK
Sbjct: 421 PPNIILVHGEANEMGRLKQKLISQFADRNTKILTPKNCQSVEMYFNSQKMAKTIGRLAEK 480

Query: 481 TPEVGETVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540
           TPE GETVSGLLVKKGFA+QIMA DDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV
Sbjct: 481 TPEAGETVSGLLVKKGFAYQIMASDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQV 540

Query: 541 YESVESSTDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINRE 600
           YESVES TDEESGVP I VHDRVTVKHESE+H+SLHWTSDPLSDMVSDSVVALILNINRE
Sbjct: 541 YESVESLTDEESGVPAICVHDRVTVKHESEKHVSLHWTSDPLSDMVSDSVVALILNINRE 600

Query: 601 VPKVIVESEAVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSG 660
           VPKVIVESEA KTEEEN+KKAEKVIHALLVSLFG+VKLG NGKLVINVDGSIAELDKQSG
Sbjct: 601 VPKVIVESEAAKTEEENEKKAEKVIHALLVSLFGDVKLGENGKLVINVDGSIAELDKQSG 660

Query: 661 EVESENEVLKERL 673
           EVESENE LKER+
Sbjct: 661 EVESENEGLKERV 673

BLAST of Cp4.1LG01g16320 vs. TAIR 10
Match: AT1G61010.1 (cleavage and polyadenylation specificity factor 73-I )

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 548/668 (82.04%), Postives = 625/668 (93.56%), Query Frame = 0

Query: 9   SFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYF 68
           S K+RE   +R+ DQLIVTPLGAG+EVGRSCVYMS++GK +LFDCGIHPAYSGMAALPYF
Sbjct: 7   SLKRREQPISRDGDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYF 66

Query: 69  DEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKV 128
           DEIDPS+IDVLLITHFH+DHAASLPYFLEKTTF GRVFMT+ATKAIYKLLL+D+VKVSKV
Sbjct: 67  DEIDPSSIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKV 126

Query: 129 SIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLY 188
           S+EDMLFDEQDIN+SMDKIEVIDFHQTVEVNGI+FWCYTAGHVLGAAMFMVDIAGVR+LY
Sbjct: 127 SVEDMLFDEQDINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILY 186

Query: 189 TGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRV 248
           TGDYSREEDRHLRAAE+PQFSPD+CIIEST GVQLHQ RHIREKRFTDV+HST++QGGRV
Sbjct: 187 TGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRV 246

Query: 249 LIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQN- 308
           LIPAFALGRAQELLLILDEYWANHPDLHN+PIYYASPLAK+C+ VY+TY LSMNDRI+N 
Sbjct: 247 LIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQ 306

Query: 309 -AKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPG 368
            A SNPF FK+ISPL SI+ F DVGPSVVMA+PGGLQSGLSRQLFD WCSDKKN+C++PG
Sbjct: 307 FANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPG 366

Query: 369 YVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIIL 428
           Y+VEGTLAKTIINEPKEVTLM+GLTAPLNMQVHYISFSAHAD+AQTS FL+ELMPPNIIL
Sbjct: 367 YMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL 426

Query: 429 VHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGE 488
           VHGEANEM RLKQKL+++F D NTKI+TPKNC+SVEMYFNS+K+AKTIG+LAEKTP+VG+
Sbjct: 427 VHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGD 486

Query: 489 TVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVES 548
           TVSG+LVKKGF +QIMAPD+LH+FSQLSTA + QRITIP+  AF VI  RL++++ESVE 
Sbjct: 487 TVSGILVKKGFTYQIMAPDELHVFSQLSTATVTQRITIPFVGAFGVIKHRLEKIFESVEF 546

Query: 549 STDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIV 608
           STDEESG+P ++VH+RVTVK ESE+HISL W+SDP+SDMVSDS+VALILNI+REVPK+++
Sbjct: 547 STDEESGLPALKVHERVTVKQESEKHISLQWSSDPISDMVSDSIVALILNISREVPKIVM 606

Query: 609 ESE-AVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESE 668
           E E AVK+EEEN KK EKVI+ALLVSLFG+VKLG NGKLVI VDG++A+LDK+SGEVESE
Sbjct: 607 EEEDAVKSEEENGKKVEKVIYALLVSLFGDVKLGENGKLVIRVDGNVAQLDKESGEVESE 666

Query: 669 NEVLKERL 674
           +  LKER+
Sbjct: 667 HSGLKERV 674

BLAST of Cp4.1LG01g16320 vs. TAIR 10
Match: AT1G61010.2 (cleavage and polyadenylation specificity factor 73-I )

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 548/668 (82.04%), Postives = 625/668 (93.56%), Query Frame = 0

Query: 9   SFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYF 68
           S K+RE   +R+ DQLIVTPLGAG+EVGRSCVYMS++GK +LFDCGIHPAYSGMAALPYF
Sbjct: 7   SLKRREQPISRDGDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYF 66

Query: 69  DEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKV 128
           DEIDPS+IDVLLITHFH+DHAASLPYFLEKTTF GRVFMT+ATKAIYKLLL+D+VKVSKV
Sbjct: 67  DEIDPSSIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKV 126

Query: 129 SIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLY 188
           S+EDMLFDEQDIN+SMDKIEVIDFHQTVEVNGI+FWCYTAGHVLGAAMFMVDIAGVR+LY
Sbjct: 127 SVEDMLFDEQDINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILY 186

Query: 189 TGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRV 248
           TGDYSREEDRHLRAAE+PQFSPD+CIIEST GVQLHQ RHIREKRFTDV+HST++QGGRV
Sbjct: 187 TGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRV 246

Query: 249 LIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQN- 308
           LIPAFALGRAQELLLILDEYWANHPDLHN+PIYYASPLAK+C+ VY+TY LSMNDRI+N 
Sbjct: 247 LIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQ 306

Query: 309 -AKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPG 368
            A SNPF FK+ISPL SI+ F DVGPSVVMA+PGGLQSGLSRQLFD WCSDKKN+C++PG
Sbjct: 307 FANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPG 366

Query: 369 YVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIIL 428
           Y+VEGTLAKTIINEPKEVTLM+GLTAPLNMQVHYISFSAHAD+AQTS FL+ELMPPNIIL
Sbjct: 367 YMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL 426

Query: 429 VHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGE 488
           VHGEANEM RLKQKL+++F D NTKI+TPKNC+SVEMYFNS+K+AKTIG+LAEKTP+VG+
Sbjct: 427 VHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGD 486

Query: 489 TVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVES 548
           TVSG+LVKKGF +QIMAPD+LH+FSQLSTA + QRITIP+  AF VI  RL++++ESVE 
Sbjct: 487 TVSGILVKKGFTYQIMAPDELHVFSQLSTATVTQRITIPFVGAFGVIKHRLEKIFESVEF 546

Query: 549 STDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIV 608
           STDEESG+P ++VH+RVTVK ESE+HISL W+SDP+SDMVSDS+VALILNI+REVPK+++
Sbjct: 547 STDEESGLPALKVHERVTVKQESEKHISLQWSSDPISDMVSDSIVALILNISREVPKIVM 606

Query: 609 ESE-AVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESE 668
           E E AVK+EEEN KK EKVI+ALLVSLFG+VKLG NGKLVI VDG++A+LDK+SGEVESE
Sbjct: 607 EEEDAVKSEEENGKKVEKVIYALLVSLFGDVKLGENGKLVIRVDGNVAQLDKESGEVESE 666

Query: 669 NEVLKERL 674
           +  LKER+
Sbjct: 667 HSGLKERV 674

BLAST of Cp4.1LG01g16320 vs. TAIR 10
Match: AT1G61010.3 (cleavage and polyadenylation specificity factor 73-I )

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 548/668 (82.04%), Postives = 625/668 (93.56%), Query Frame = 0

Query: 9   SFKKRELSATREEDQLIVTPLGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYF 68
           S K+RE   +R+ DQLIVTPLGAG+EVGRSCVYMS++GK +LFDCGIHPAYSGMAALPYF
Sbjct: 7   SLKRREQPISRDGDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYF 66

Query: 69  DEIDPSTIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVSKV 128
           DEIDPS+IDVLLITHFH+DHAASLPYFLEKTTF GRVFMT+ATKAIYKLLL+D+VKVSKV
Sbjct: 67  DEIDPSSIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKV 126

Query: 129 SIEDMLFDEQDINRSMDKIEVIDFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRVLY 188
           S+EDMLFDEQDIN+SMDKIEVIDFHQTVEVNGI+FWCYTAGHVLGAAMFMVDIAGVR+LY
Sbjct: 127 SVEDMLFDEQDINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILY 186

Query: 189 TGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRV 248
           TGDYSREEDRHLRAAE+PQFSPD+CIIEST GVQLHQ RHIREKRFTDV+HST++QGGRV
Sbjct: 187 TGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRV 246

Query: 249 LIPAFALGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRIQN- 308
           LIPAFALGRAQELLLILDEYWANHPDLHN+PIYYASPLAK+C+ VY+TY LSMNDRI+N 
Sbjct: 247 LIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQ 306

Query: 309 -AKSNPFRFKYISPLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPG 368
            A SNPF FK+ISPL SI+ F DVGPSVVMA+PGGLQSGLSRQLFD WCSDKKN+C++PG
Sbjct: 307 FANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPG 366

Query: 369 YVVEGTLAKTIINEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIIL 428
           Y+VEGTLAKTIINEPKEVTLM+GLTAPLNMQVHYISFSAHAD+AQTS FL+ELMPPNIIL
Sbjct: 367 YMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL 426

Query: 429 VHGEANEMGRLKQKLISQFSDRNTKILTPKNCQSVEMYFNSQKMAKTIGKLAEKTPEVGE 488
           VHGEANEM RLKQKL+++F D NTKI+TPKNC+SVEMYFNS+K+AKTIG+LAEKTP+VG+
Sbjct: 427 VHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGD 486

Query: 489 TVSGLLVKKGFAHQIMAPDDLHIFSQLSTANINQRITIPYSNAFNVIVRRLKQVYESVES 548
           TVSG+LVKKGF +QIMAPD+LH+FSQLSTA + QRITIP+  AF VI  RL++++ESVE 
Sbjct: 487 TVSGILVKKGFTYQIMAPDELHVFSQLSTATVTQRITIPFVGAFGVIKHRLEKIFESVEF 546

Query: 549 STDEESGVPTIRVHDRVTVKHESERHISLHWTSDPLSDMVSDSVVALILNINREVPKVIV 608
           STDEESG+P ++VH+RVTVK ESE+HISL W+SDP+SDMVSDS+VALILNI+REVPK+++
Sbjct: 547 STDEESGLPALKVHERVTVKQESEKHISLQWSSDPISDMVSDSIVALILNISREVPKIVM 606

Query: 609 ESE-AVKTEEENKKKAEKVIHALLVSLFGNVKLGGNGKLVINVDGSIAELDKQSGEVESE 668
           E E AVK+EEEN KK EKVI+ALLVSLFG+VKLG NGKLVI VDG++A+LDK+SGEVESE
Sbjct: 607 EEEDAVKSEEENGKKVEKVIYALLVSLFGDVKLGENGKLVIRVDGNVAQLDKESGEVESE 666

Query: 669 NEVLKERL 674
           +  LKER+
Sbjct: 667 HSGLKERV 674

BLAST of Cp4.1LG01g16320 vs. TAIR 10
Match: AT2G01730.1 (cleavage and polyadenylation specificity factor 73 kDa subunit-II )

HSP 1 Score: 287.3 bits (734), Expect = 3.2e-77
Identity = 160/426 (37.56%), Postives = 233/426 (54.69%), Query Frame = 0

Query: 29  LGAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDEIDPS-----TIDVLLITH 88
           LGAG E+G+SCV ++  GK ++FDCG+H         P F  I  S      I  ++ITH
Sbjct: 8   LGAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITH 67

Query: 89  FHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLSDFVKVS-KVSIEDMLFDEQDINR 148
           FH+DH  +LPYF E   + G ++M+Y TKA+  L+L D+ +V      E+ LF    I  
Sbjct: 68  FHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELFTTTHIAN 127

Query: 149 SMDKIEVIDFHQTVEVN-GIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLR 208
            M K+  ID  QT++V+  ++   Y AGHVLGA M    +    ++YTGDY+   DRHL 
Sbjct: 128 CMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLG 187

Query: 209 AAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGGRVLIPAFALGRAQEL 268
           AA++ +   D+ I ESTY   +   ++ RE+ F   VH  ++ GG+ LIP+FALGRAQEL
Sbjct: 188 AAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQEL 247

Query: 269 LLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYT--LSMNDRIQNAKSNPFRFKYIS 328
            ++LD+YW        VPIY++S L  +    Y+      S N + ++   NPF FK + 
Sbjct: 248 CMLLDDYWERMN--IKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFDFKNVK 307

Query: 329 PLKSIEVFKDVGPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYVVEGTLA-KTII 388
                 +    GP V+ A+PG L +G S ++F  W     N   LPGY V GT+  K + 
Sbjct: 308 DFDR-SLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMA 367

Query: 389 NEPKEVTLMSGLTAPLNMQVHYISFSAHADFAQTSAFLEELMPPNIILVHGEANEMGRLK 445
            +P  V L +G    +  +VH ++FS H D        + L P N++LVHGE   M  LK
Sbjct: 368 GKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPSMMILK 427

BLAST of Cp4.1LG01g16320 vs. TAIR 10
Match: AT5G23880.1 (cleavage and polyadenylation specificity factor 100 )

HSP 1 Score: 141.0 bits (354), Expect = 3.7e-33
Identity = 105/373 (28.15%), Postives = 185/373 (49.60%), Query Frame = 0

Query: 26  VTPL-GAGNEVGRSCVYMSYKGKVVLFDCGIHPAYSGMAALPYFDEIDPSTIDVLLITHF 85
           VTPL G  NE   S + +S  G   L DCG +  +      P       STID +L++H 
Sbjct: 7   VTPLCGVYNENPLSYL-VSIDGFNFLIDCGWNDLFDTSLLEPL--SRVASTIDAVLLSHP 66

Query: 86  HLDHAASLPYFLEKTTFKGRVFMTYATKAIYKL----LLSDFVKVSKVSIEDMLFDEQDI 145
              H  +LPY +++      V   YAT+ +++L    +   F+   +VS  D LF   DI
Sbjct: 67  DTLHIGALPYAMKQLGLSAPV---YATEPVHRLGLLTMYDQFLSRKQVSDFD-LFTLDDI 126

Query: 146 NRSMDKIEVIDFHQTVEVN----GIRFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREE 205
           + +   +  + + Q   ++    GI    + AGH+LG +++ +   G  V+Y  DY+  +
Sbjct: 127 DSAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRK 186

Query: 206 DRHLRAAEMPQF-SPDVCIIESTYGVQLHQ-PRHIREKRFTDVVHSTISQGGRVLIPAFA 265
           +RHL    +  F  P V I ++ + +  +Q  R  R+K F D +   +  GG VL+P   
Sbjct: 187 ERHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDT 246

Query: 266 LGRAQELLLILDEYWANHPDLHNVPIYYASPLAKRCLTVYETYTLSMNDRI----QNAKS 325
            GR  ELLLIL+++W+      + PIY+ + ++   +   +++   M+D I    + ++ 
Sbjct: 247 AGRVLELLLILEQHWSQRG--FSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRD 306

Query: 326 NPFRFKYISPLKSIEVFKDV--GPSVVMASPGGLQSGLSRQLFDLWCSDKKNSCVLPGYV 382
           N F  ++++ L +     +   GP VV+AS   L++G +R++F  W +D +N  +     
Sbjct: 307 NAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETG 366

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9C9520.0e+0082.04Cleavage and polyadenylation specificity factor subunit 3-I OS=Arabidopsis thali... [more]
Q9UKF67.9e-19852.52Cleavage and polyadenylation specificity factor subunit 3 OS=Homo sapiens OX=960... [more]
P791011.8e-19752.37Cleavage and polyadenylation specificity factor subunit 3 OS=Bos taurus OX=9913 ... [more]
Q86A791.5e-19648.03Cleavage and polyadenylation specificity factor subunit 3 OS=Dictyostelium disco... [more]
Q9QXK71.5e-19652.06Cleavage and polyadenylation specificity factor subunit 3 OS=Mus musculus OX=100... [more]
Match NameE-valueIdentityDescription
XP_023524148.10.099.85cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita pepo... [more]
XP_022956704.10.099.55cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita mosc... [more]
XP_022997543.10.099.11cleavage and polyadenylation specificity factor subunit 3-I-like [Cucurbita maxi... [more]
KAG7032374.10.094.49Cleavage and polyadenylation specificity factor subunit 3-I [Cucurbita argyrospe... [more]
XP_038877886.10.096.29cleavage and polyadenylation specificity factor subunit 3-I [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1GZU70.099.55cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita mo... [more]
A0A6J1KE990.099.11cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita ma... [more]
A0A6J1G1N40.095.99cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita mo... [more]
A0A6J1HTF40.095.84cleavage and polyadenylation specificity factor subunit 3-I-like OS=Cucurbita ma... [more]
A0A6J1DE610.095.69cleavage and polyadenylation specificity factor subunit 3-I OS=Momordica charant... [more]
Match NameE-valueIdentityDescription
AT1G61010.10.0e+0082.04cleavage and polyadenylation specificity factor 73-I [more]
AT1G61010.20.0e+0082.04cleavage and polyadenylation specificity factor 73-I [more]
AT1G61010.30.0e+0082.04cleavage and polyadenylation specificity factor 73-I [more]
AT2G01730.13.2e-7737.56cleavage and polyadenylation specificity factor 73 kDa subunit-II [more]
AT5G23880.13.7e-3328.15cleavage and polyadenylation specificity factor 100 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 652..672
NoneNo IPR availableGENE3D3.40.50.10890coord: 226..405
e-value: 4.4E-151
score: 505.0
NoneNo IPR availablePANTHERPTHR11203CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR FAMILY MEMBERcoord: 7..677
NoneNo IPR availablePANTHERPTHR11203:SF48SUBFAMILY NOT NAMEDcoord: 7..677
NoneNo IPR availableCDDcd16292CPSF3-like_MBL-foldcoord: 24..217
e-value: 8.62966E-143
score: 413.134
IPR022712Beta-Casp domainSMARTSM01027Beta_Casp_2coord: 258..377
e-value: 2.1E-37
score: 140.3
IPR022712Beta-Casp domainPFAMPF10996Beta-Caspcoord: 258..377
e-value: 7.3E-26
score: 90.7
IPR001279Metallo-beta-lactamaseSMARTSM00849Lactamase_B_5acoord: 36..224
e-value: 2.8E-13
score: 60.1
IPR001279Metallo-beta-lactamasePFAMPF00753Lactamase_Bcoord: 34..200
e-value: 5.2E-22
score: 78.8
IPR021718Pre-mRNA 3'-end-processing endonuclease polyadenylation factor C-termSMARTSM01098CPSF73_100_C_2coord: 484..687
e-value: 3.3E-34
score: 129.6
IPR021718Pre-mRNA 3'-end-processing endonuclease polyadenylation factor C-termPFAMPF11718CPSF73-100_Ccoord: 487..673
e-value: 2.1E-41
score: 141.9
IPR036866Ribonuclease Z/Hydroxyacylglutathione hydrolase-likeGENE3D3.60.15.10coord: 29..434
e-value: 4.4E-151
score: 505.0
IPR036866Ribonuclease Z/Hydroxyacylglutathione hydrolase-likeSUPERFAMILY56281Metallo-hydrolase/oxidoreductasecoord: 22..465
IPR011108Zn-dependent metallo-hydrolase, RNA specificity domainPFAMPF07521RMMBLcoord: 391..445
e-value: 3.7E-15
score: 55.6

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g16320.1Cp4.1LG01g16320.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006397 mRNA processing
cellular_component GO:0005634 nucleus