Cla97C03G053860 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C03G053860
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: polyadenylation and cleavage factor homolog 4 isoform X1
Location: Cla97Chr03: 3054945 .. 3067383 (-)
RNA-Seq Expression: Cla97C03G053860
Synteny: Cla97C03G053860
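
The Location field packs chromosome, start, end and strand into one string. A minimal sketch of pulling those parts out and computing the span is given here; the function names are illustrative only, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split 'chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end, strand = parse_location("Cla97Chr03: 3054945 .. 3067383 (-)")
print(chrom, strand, span_length(start, end))
```

The (-) strand marker indicates that the feature is annotated on the reverse strand of Cla97Chr03.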
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGCGAAGGAGTATTCGATTTTGGGGTTCTAAAAACGTGAACCCGTGCCAATCTGGAGGGGATCTGGGCTGTTTGGAACAAGAAGCGGCGGCTGGTTTAGCGTCGGTCGACGGTGTCTGCAGCGAAGCTAGGACTGGGCGGCGACTGCAGCGCACGAGTTTGGGAGTGCTGCGTGAATGATTGACTGGTAGTTCTTCTTGAAGACGGCGACGAAGTGCAGCGGACGGAAGGTAAGAAACTGATCTGAGTTGTATGCGTGAATGATTGACTGGACTGGTTTGTACGCAGAGGAGAAAACTGAAAAGCTTGTGGGTGCAAAAGGGCTGAGAATTTCTTTTCTTTCTTTTCTTTCTTTTAGGTCTCTGATACCATGCGAATGGGATAATTTTCTATTGAATAATCTGTATATCTTTACATAGGTGAATGAATAGGAGACTAGAATTGACAGCTCGTATACATTTACGTTAACAATAATATTAGCTGGAAATTGAGAGTGTACACTGGCGAAGAATTATACTAAGCCACACCGATCTCTCTCTAAGCCGCTCTCTTATTGTTTCTCTCTCTAGCTTCTTCTATGGTGGGAAGAAACCCTAATTCAGTTCTCTTCCCTCCTTGGCGGGATAAAATTCTGAACTCAATCGATACACTTTGCATTTCATGACCCCTTTCATGGAATCGGAAAAGCTCTTAATTTCACGAGGAAACCCTAGAAATTCAGCATATCCATCCGACCGCCAACTCCCCACCACCAGCGGCAGGACTATGCCCAATGAGTTGCCACAAAAGCCTCCCCCTTCTATAGCTCACCGGTTTAGAGCTCAGCTAAAGCAGCGGGATGACGAATTCAGGGTTTCTGGCCATGATATTGTGCCCCCTCCCACTGCTGAGGATATCGTTCAATTGTACGACCTCATGTTGTCCGAGCTCACCTTTAATTCGAAGCCCATCATTACGGATCTCACGGTTCTTGCTGATGAGCAGAGAGAACATGGGAAGGGCATTGCTGACTTAATTTGTGCACGTATTCTCGAGGTTTGTTTCTGTAATTATGTTTTCCTCTAATTTGAGTTGAGAATACTCTTTTGAACATTTCAATTTCTGGTAGTATATGAAGGAAGTATATTCTGATATTTTCAAATGGACTGTTTTTCTCTCTCTCTTTTTTTTTGGGTTATCCGTCTATCACGTATCTCTTGTTTTAGATTTGATGGGGATTTGGGGTATAAAATTTAGGCTTTTGTCTGCTTTGATGATTATATTTTCTGAGACCTTGCGGATGTTCCAGTTCATCATTTCAAATTTTCGTGCAGGTTCCGGTTGAGCAAAAACTTCCTTCATTATATTTATTGGATAGCATTGTTAAGAATGTTGGGCATGAATACATCAGTTATTTCTCGTCTCGTTTACCTGAGGTATGTAAATTTTGGCTTCTGGTGCACACTCTTGTTTCTTATATTTTCAATAATCAAACCAATATTAGTTTGAATTTGATGAGGTTTTTATCAATTCTTTTGTTCTATCTTTTGGTGCTAAGGTGTTTTGCGAGGCTTACAGGCAAGTTCATCCTAATTTGCATAATGCAATGCGCCACCTCTTTGGGACATGGGCAACTGTGTTTCCACCATCCATCATTCGGAAGATTGAAGCTCAACTTTCTCTGCTAACAGCACAAGAGTCGTCAAGTTTGACATCCTCAAGGGCTTCTGAATCTCCTCGGCCAACTCATGGCATTCATGTCAATCCAAAATACTTGCGTCAACTGGAACACTCAGTGGTGGATAAAGTGAGCATCTATCTCTTTTTCTACTTAGAATACAATATTGGGTTTGCTTCTGTCAATTTGAATATTTATCATGCAACATCCCTTCTTGTCCAAAAAAATTCATCTTGCATCCTTTTGTTGGCTGTTTGTGATTCCCCCCCCACCCTCTCATTTTGTGAATGCAACCAACCCAAGTGAATGAATGCCTTTTTTTCTTTCTTCTTTTTAATGTATTA
ATTGGATTAGTCCATCCCAGCCATCTATTATGTATGTTGGGGCCTGGGGTTGTGTTTGGTTTAGATGCGTACAGAAGGTGAAAAAGATGATATTATGAGGATTTTTATCTGTTCATTGGTTTTGCTTTTTCAGACAGAGATTGAATATTTTGCTAGGCTTTGCTAAAAAATGAAGTCCTTTAGAGTTTAGCTCCTTAACACATTACTTGTAAACGATGACTATGGAGGCAAACATTGAAATTAAGTGCCTAGCACAAAGGGATGAAAATTTGGGTCTTTTCTATATGAATATATGTGATGTTGAGATAACAGAGGAAAAAAGTTCTTTTATATAAACGATGTTAGCCTAGTTAAATTTAGCAAACTCATTGTTGGTGGGTATTAATACCTCTACTGGTGAAGATGTAGCCACAGACAGAAGTGACATTCAAGGTTTGGTTCTTAGTCTCTTTCTTAAGTGAGGTTATCCCTTGGGTGGAACCCTACTAGATCTTCTAGCTTTGGGATCCTATTATGGAAAAAATTTCGTTGAAGCTGATGAAAGTGAAGCTTTGATTCCTAGTTCTGTAGTTATAGGCAACTTTCGCCTAATTTGTACTTCAAAATCTTCTATCTATGCTAGGAGTTTTTAGGACTCTTGCAAAGGTGACTCCAAAGATGGAATAAGTCATTAAGATTTAAGATTTCTTTTGAGAAGACTCGATGTACGATGCTTATGTTTCTATTTGCGGTGAGGAAGCCCTACATTTTATTTAAGAATATTTGAGTGTCCAAGTTAGCTTGTGCACATCTCAATTAATATTATATTACAGTACCCATCATATTTGGTTGTCAAAGAAACTCATAGGGTCTTAAATCTTAGGTAGGTGGCTTCTAGGACGTGAGTTTATCTCCTCTAAGCTCATTTATTTTCATTAGTCTGACCCATGGGGGATGATTTTTTAGGAAATTTTGCGATGTAATTCTCTACACAATTGACCATAGGATTACATATTTTTCAGACAACACTCTTTATTCAAGCTGGGAGATGGAGTTTCTGTGAGGTTCATGGGTATGCTAATTTCATATTTTGTGATTCTTTTGAGAGGTTGTGTGGTTTGGTTAATTTTGTCGTGGTTGTTTCCTTCGGTATACACTTTGAGGGTGGGGAGGAGTGGGCTCCCTTTCAGCAAAACTTGATAATGCTATTCATTGACCATCTCAAGACAGAAATATGAGGTATGAAATGGAAGCTTGATCACTTGCGACTTTCTTTTTACAGCAGGGAGACTATTATTCTTCAAGATTGTGGTATGTAAATACAAATGATTTTATATAGAGAATGCATCCAATCCTCCAACCTTTATAGAGAATACATCCAAATTGTTGTCTCAGGACTGGTTGGTGCTTTCTGTGCCAACAAATATATGAATTGCTGAGAACTGTGTTTTCTCTTGGTCTCCTACTCTCCTTGGGATATTCTATCCAATACCTCTGTGTTGCTGAATAACACCTAGGAGTTTATTTGTAAGATGGTGTGCAACTTTCTCTTGTTTCATGGCCTTGCAGTACTATTGTTTCTTCTTTTTTTGAAGATTTTGTTGAGAGGAACCTTAGTATTCCAACGACAAGGAAATTTCTAACCCTCAATTTGGAACCAAAAACTTCAAGAAATGATTTTTAAATTCTGTAATGAGTGTTTTAAAAAGCCCTCTTAGGTGAGCTCCCGGGTGTTGGGTACAAGTCTGGCACAAAAGAAAATGAGGATTAGGCGCTAAGAAGATAAGGATTAGGCATGCACCTTTTTTAATTATTTAAAAAAATAATAGTTATTAGGGTTTCTCCTTCATTAGTTTAAAAAAATAAAATTTACTAAGCTTAAATACATAATTATTTGTGTTTAGGGTTTTTACTTTTTTGTCTTATTTTGGCGGGTTGTCTCGTGAGATTAGTCAAGGTGTGCGTAAGCTAGTTCGGACACTCATAAAAAAAAAAAAATTGTCTATTCTCCCTTCAATTAT
GCTTTCTCTTCTTTATGTTGTACTATATATAGTGCCCCTCAAAAATAAAAAGTTCGTGCTTTATTTATTTATTTATTATTATTTTTTTTAATACCTTGCTTTTAAGCCCCATAAGAGAGATTATTAAACTTTAGAAAACCTTGCTTGTGATTTGAAATTTAAGAAATGATTTGAAATTGTTTATTTTGTTTGGAATGATAAATATTTGAAAGGATGTATTTAGAATTGTTTATTTTGTTTGGAATGATAAATATTTGAAAGGATGTATTTAGAATTTTGTGTTTGGATGGTAACCATAAAAGAAAAGGAAGTTAAAAGTTTAATAACATTGGATGTATTATGGTTTTTTAATTGAAATTTTACTATGACAAGGTTAAATAGTCGGTAAGTTACCTCCAAATCCTATATAATCTTGCAAAATCCATCCCCATTTTAGTTAGGATTATAATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATATATATATATATATATATTGTTAGTGAAAAAATAGATTTCAAGTCATTCTCTTTCTTGTGGGTTCTGCAGTCATTTGAAATCTCTCAAAATCATGAGGTTCCTAACAGGGCATAAAGGAAAATTATGAGATTCTATATCTTCATTTTTTCTAATTGGTCTTTTCTTTTTCAATTACTTTGTTAGTTTCATATTTAAGTGATTGTAGTTAATGGGAGAGCCCTTTTATAATTCGTCTGCGTGTTTTGGTTGTTTGCCTCTTTTTGTATATTTATATATTTATATACCATTTAAATTTGTTTTATGTCAGATAATAGGGGGAAGATAAAATCAAGTTCCTCAGATTTTGATGATTTCGTGCAGCTGATTATGTTCAAATATGAAATTCCTGTTCTATGTCGTGTGCCATCTATATATCTTTAAATGTGCACATGAAATACATACACAACACACACACATCTCCCAACATAATTCCCTGACATTAATTCAATGCAATTAGGACTGTTAATTTTCTGTTTTCTTTTGAATATCTTGCATATATGGTAAAGAAATAACATACAAATATGTAACTAGCTTTTAGCTTTGCAGCATATCCAAGATGCAAGAGGGGCCTCAGCTCTAAAAGTTCATGATAAAAAGCTTGCTCCCGGATACGAAGAGTATGATTACGATCATGCAGATGTTCTTGAACATGGTGGAGGTCAAGCATTCCATCCAATGGGAAGCATTGGCCATGATTCTTTTGCTCTTGGAACAAATAAAGCAAATATAAAGCTAGCGAAATCATCTCTGTCTTCAAGAATTGGACACAGTAGACCTCTACAATCAGCTGGTGATGAACTTGAAGCAGTTAGAGCCTCACCCTCGCAGAATGTATATGATTATGAAGGTTCTAGAATGATTGATAGAATTGAGGATACTAATAAATGGAGAAGAAAACAATATCCTGACGATAATCTGAATGGACTTGAAAGTACTTCATATAATATTAGAAATGGACATGCACTTGAGGGACCAAGAGCTTTAATTGAAGCATATGGAAGTGATAAAGGAAAGGGTTATTTAAATGACAATCCACCTCAGGCTGAACATTTTTCTATCAATGGTATAGACAACAAGGTGACTCCAGTAACATGGCAGAACACTGAAGAAGAAGAGTTTGATTGGGAAGATATGAGCCCCACATTAGCCGATAGAGGCAGAAATAATGATATGTTGAAGCCACCTGTCCTGCCTTCAAGATTTAGGACAAGAATAGGATTTGAAAGATCAAATGCTATGTCTATAGAGCCTGGAATGAGAAGCAGTTGGTCTAGTCAGGTTCAGCTACCTACTATTGATTCCTCCATGGTTATTGAAGATGTGGTCCAATCAACACCTGTATGTTTCCTGAACTTGTTTACTCTGTTATCTCTCTATTGCCATTATTATCATTTGCTTCTGGCATTCTAGTCTTGGTGTTTATCAAATGGTTGTAAAGCATCTTTGTTCATGTTATGTTGTTTAAATTTGTGACCAA
GCATTACTCTACAGTTTTGTTTTTTTTTTGGGATTCCTATCATACACTAGTAGAATAAACCTGTTAGATCCCACAGCGTGTAAAAGCTTTGCAATTGGTGATATGCACCCTTTTCTTCCACTAATCCTGTATTTTTTGCTTATTGGCATCACTTATTGAGAGACGTGTAAGACCCATATCTGTGTTCAGATTTGATAGCACGCATTCTGTTATTATTCTGTTTTTGTTGAGTTAGTATGTAGTAGGATCTCTGCATTCCACTGGGTAATTAATAGTATGGGTTTTCATTTTCCTATTTTCAGGATATTTGGAATATGCACAATCACATTTCTCAGACATCCCAGAACCTCATGAACAATAAAGGAGCAGGAAGAAATTTCCAGATGCCTTTGTTGGGGAGAGGCATGGCTTCATCTGGTGGTGAGAAAATGTTTCCTTTTGCAGACAAGCTTTTGACCAATGATGCTTTACATAGGCCCCCAACCATTGCTTCGAGATTGGGTTCTTCTGGTCTTGACTCTAGCATGGAGTCACAATCAATTGTACAATCTATGGGCCCAAGGCATCCTCTGAATCTTTCTAACTCTTGCCCACCCTCTAGACCTCCAATTTTTCCTGTACCAAGACACAACAAGAGTCAGTTTGAGTCTTTAAATGGTAGTAATTCTCTCATCAATCGTGCAAATAGGTCTTTTTTGCCTGAGCAGCAAATGAATAACATGAGAAATAAGGAGCCAAGTCTTACAAGTAAGTTGCCACAAGTTGGCAATCAACATACTGGGCATATTCCTTTAACTCGGGGAAACCAATTGCAGCCCATCCCTTTAAAACCGCAATTTCTACCATCTCAGGACATGCAGGAAAATTTAAGTGCATCAGCAGTACCTCCAGCATTACCGCACTTAATGGCACCATCTTTGAGTCAAGGATACATTTCACAAGCACATCGCCCTGCTATTAGTGAGTGTTTGTCAAGTTCTGCCCCTATTGGGCAATGGAATTTGCCTGTTCACAATAGCCCCAGTAACCCTTTGCATTTACAAGGGGGGCCACTGCCACCTCTTCCACCTGGGCCTCATCCTACTTCTGTTCCGTCAATACCTCTCTCTCAAAAGGCAGGATCTCTTGTTCCTGGTCAGCAACCAGGAACTGCATTTCCTGGCCTGATAAGTTCTCTCATGGCCCACGGTTTAATCTCATTGAACAATCAAGCTTCTGTACAGGTATATATCTGGGTAATATCCTTCTTAATAGCTTTAGTTTGGGCATTTAATTTTTTCTACTGTTATATTTTATCCACTAAGAGAGTTAAATGTTTAGGATTCTGTTGGGTTAGAATTCAATCCAGATGTACTCAAGGTGCGACATGAATCTGCAATAACTGCTCTATATGCTGATCTTCCTAGACAATGCATGACCTGTGGCCTTCGATTCAAGACCCAGGAAGAGCATAGTAATCATATGGATTGGCATGTCACTAAAAACCGTATGTCAAAAAGTAGGAAGCAGAAGCCTTCTCGCAAGTGGTTTGTAAGTATAAGCATGTGGCTTAGTGGTGCAGAGGCTCTAGGAACGGAGGCAGTTCCAGGATTTTTGCCTGCTGAGGTCATTGTAGAGAAAAAAGATGATGAAGAACTGGCTGTTCCCGCTGACGAGGATCAGAAGACGTGTGCATTATGTGGAGAACCTTTTGAGGATTTTTACAGTGATGAAACAGAGGAGTGGATGTATCGGGGCGCTGTCTACATGAATGCACCTGATGGACAAACAGCCGGCATGGATAGATCTCAGTTAGGGCCCATAGTGCATGCTAAATGCAGGACCGAAACTAATGTTGTTCCCTCCGAAAGTTTTGACCAGGATGAACAAGGGGTATGCTTATATCTTTTTTGTTTTTCTCCCCTAGACCCATGTCACCTGTGCGTTCCTTGTATTTCGACCTCTAATTTGGTTGCTTTGCTAATGTATACCACGTGCTTTGTTAGCTGCAAGATTTTATG
TGCTTTTTAAAACTCATTTTTAATATTTTGACTGCAGGGAGTAAGTGAAGAGGGTAATCGAAGAAAAAGATTGCGGAGCTAGCCTAGATGGATTTCTATACTCTCGCTGTAGCTTTATACAAGTTTTTCCACATGATCGCTTGTTCTCACAGTAGTGTTAGCTAGAATTTACTTGGATTATGCTTGAATCATTGTATATTTAAAAAATGGAAATTTTCATGCGCAATATAAATATTGCAATCTAAAAGAGGCCAACCCTCGTCTTCTTTGCTGTGGAATTTCAATTTTAGATCTAAATCCTTTTTTATCTTTCAAGGGTTAGTATGTACGTTGGTTTGGAAATATCTTTTGTAGGTTTTAATATAGATCAATAATTTGAGTGGGAGTTGTAGCTGTAGCCGCTTGTTACATCTATGCTGTGTTTGACTTGACATCTGCAGGGCATTGAATGGAGCTTGCAGTTGCGGCTTATCAGATCTTGCTCAAACGGAAAGCCAAGATTCACAGGATGGACCATCCACCGCCCCTTTTTTGTTTCTTTAATTAATTTTTGCTTTGTTTTTTCTTTGTTTTTATGGGTTCAAATTGTTATGAAGTTTGGATCGAGTTTGTACTGTGACAATTCATGAACAATACAGTATGTTGAAATGTTCTAATTTGCAAATATTGCGAGGACTTGGTTTTGTTTAATGACCGTATGGACTGAGTTCTAATTTTTTATTTTTTTTATTTTTTTTATTTTTTTTATTTTAATCATTGCATTCTTTTATGCTGAAGATCCCAATTTTGGTTTGGGGGTAGGAAGTTTCGTTTGGGAGTAGCTCTAAGAAATGATGTTGAATTTCTATATTTATATTATAAAGGACCGTTGCAGAATAGCAATTTTTTTCTATTTACCAATCTTATAATCTTAAATTTAAATTTAAATTTTCCTTTTTCCTTTATTTACTTTCTTTTTTCCTTTAATTTACAAGAGTGAGTTTATATCTATACGTTTTTATATATTTGTTCAAATATACGTTTATATTCCATAAATTTGGGGTATATTAATTAATATAAGGTATATATTATTGTTTATATTTCATAAATATGATTGAAAAAAAATATTTAGTATACATGCTTGATATATATGTATGGATAGATAAATAAAAGAAAAGGATATGAGAAAAAATTGTTCAAAATGTTGAATATTTCTCTATTAATCAATAATGCAGATAATATATATAATATAGTGTGTTAACATTTTCAACCTTCATTCATTGTTCTAGAAAAAAAAAAAAATCAGCAACTTAGAAAAATAATATAAATTAAATTACACAAATAGACATGTATATATTGAGTTACATGAGGATATAATTAAAAGGAAAAGAAAACCATCCATCATTATTATAGCCTAATACCAAAATAGTATACATTTAAGGAATATCAAAACATGAAAAATATAATATCTAATTATATCGTTAAAATATAAAAATCAGCAACTTAGGGGTAAATTTGGAATAAAAAATTTACAAGATTCTAGATTTTTAGACATGCTAAAGCTGAAATTTTTATTCAAAATTTTCATTTCTAAAATATATAATTAATTTAAAAATACATATAAAGCATCAATAAATATTTATATTTAAAGTTTACACACAAATTTGTTAATAAAAATACCTTTGTTGTCGCAAAAATATGTAAGTTTCTTCAAAATGTTGAATTTTCGATTTTTTTTAAAGTTATTTCATAGTATTTTATAGTCATTCTTTAGAGTAATTTAACATTAATAAAATCACATTTATCATGTAAGAAATCAAACAAACTGGTATGTAAAAACACTTTTGAGAAAAAATTGTTAGACACATGAATGTGTTTATTAAAAAAAAAAAAAGTTTAATCAAAAGTGTTAAAATAAAAGTGACTTGACTAGAGACGTCACATGCACAAATTTGTGAAATGATTGCTTAAATTTTAAAAAGAAAAATTGAAATGCATGATCCAAAGATAAACCCTAAACC
TCATCTCCTAAATCTCACAAAAGTATATAAAATAAAGTGCAATCTATAGTTTATTAAGAATATATATGTAAAATTATTGAATAAAAAAGTTTTCAAACAAAAATAAACAAAGACATATGAACTCAGACATATAAACTCCACATAGATATGAAGTCTAGATATATGAACTTAACTAAGCGTGCTAAGCAGCTTTGGATAGACTCCAAAAACTAGATCTAAATTTTGCTACACCTACAAATTCTTTTTGCATTGTGCTATATTTGTTAATACTTTTGATCTAAGGGTGTGTTTGGGGTAAGAAGATGAGTTGAGATATGAACACCTACAAAAGTATGTAAGTTACGTCAAAATGTTGAATTTCAGTTTTTTAATTAACTAATAGTATTTTATAGTTGTTATTTATAGTAACTTAACATTAATAAAATCACTTTTATCATGTAAGAAATCAAACAAACTTATATGTAAAAACACTTTTGAGAAAAATTGTCAAATGCATGAGTTATTTATTTTATTTTAAAAGGTTTTTATCAAAAGTGTTAAAATAAAAATAACTTAACTAGAAATGTCACATGCATAAATTTGTAAAATGATTGGTTAAATTAAAAAAAAAAAATTGAAATGCATGATCCAAACGTAAAACCCTAAACTCCATTTTCCAAAACTCCCAACAATACACATGTGCTTAAAAACAATAAAATAAAGTGCAATCCATAGTTCATATATATATATATATATATATATATATATATGTATATGTATATGTATGTATGTATATATAGAATTATTGAATAAAAAGTTTTCCAAACAAAAATAACATCATAAAACCATACTATTTAGGTAAAAAAAAAAAAAAAATTGCACCAGATACCTTTGACATATGAACTTAGCACCCCAAACACCATGATCTTCACAGACATATGAACAACTCAACGCATCAAATAGTCCCTAATTGTTATATTTGCAACTACCCTCGAATCTACACAATATTCTCGTCAATTTTTTTTTAACGCTTTGAATTAGAATGGTGATTTATGCCCCTTTAATTTAGAGGAAGGAAGAAAGGAGAATGGAAAAGAAAGAATGCGAAGAAGGAAAAAAATCCACGTCCACTAAGAAGGAGGGGATCTATGTCAGGAAGAAGGAAAGAAAGGGAAATTTGGTGGTACTATGATAGTACCTATTACTCCCATTAATAATTTGACATGTCATCAAATGATATATTCATTTATTTCACAGACTCTACACAATTCTTAAAATTGACTTTTGTGGAATAGAGATAGTATGAAAAAAATTTCTCTACTACCATTAATAATTTAATTATGTGTCAAATTATTAATAGGAGTAGAAGTATGTACTACCGTAGTACTAGAAATTATTTCAATAAAAGAAAAAAGAGAGATAAGAGATGAGACTTGAGGGGTGGAGCGAAATGAAACATAACCTAAAATAAGTAAAACAATTTGTCTAACAATTTAAAAATAATTCTTGTGAGCAATTAGCGATAAAATCATGCTTCTATTACTTAACTAACAGTAAAAAGAATTGTTTTGAATTTTTTTTTCTAAACCGGTTAAAGCGTTACTTTTAGAATTTCAAGGATGAAAAAGATAATGAACCAAACTCAAACTTACCCTCCAATTAGATCCAAGAAAGTTCAAGAAATGATTCTAAATCCAATGATTTGAAATTCGAGAAATGATTTGAAATTAATTAATTAATTATTTTGTTTGGAAAAAAAAAATTTGAAATGAATGTATTTAAAATTTTGTACTTGAATGGTAACAAAAAAAAAAAAAAAAAGAAAATTAAAAACTTACTGTTATAGATTTTTAAGTATGATATGTTAATTGAAAACTATAATATGACAATGTCAGTTTGAGTAATTTATTTGTAGTTTTTTTTTTAAGTAAAAACAAGATTTGAATTATTTTAAAATTTAAAATCATTCTCTTATCTGCAATAACCAACAACTAAATTTATTTGAGTAATCTTTTTCTAGT
CTTTTTAAGTAAAAATAAGATTTGAATTATTTTAAATTTTAAAATCATTCTCTTCCTTGCAATAACAAACAACCAAATTCATTTCACATCCTATATGAATTGAAATCCCTTCAAGATACCATTAATGGAACTTAGATGACGAAAAAGTGAATTTTGCCATTAAAAAAGAAAAAGAAAAAAGAAAAAAAAGAAAAAAGAAAAATCAAGTCTTAGAACCGAGTCGCCAGCCCCTTCACGCCATCTCCTTCTCTCTCCCTGGAATCTGCCCATATCGACATCTTTCTCCTACGTGTTCAGCAACCCAGTCGACGTCCAGCCAATCGCCGTCGAGAGTCGCCACCGCCGGACCTCCCCTCCTCTGGACTTTTATACTAAGCTTCTTGAGGCTGCTCATTTCACTTCGATTTCATTCATCTTGAGCAAGTTCGGAATTCTTTGA

mRNA sequence

GGCGAAGGAGTATTCGATTTTGGGGTTCTAAAAACGTGAACCCGTGCCAATCTGGAGGGGATCTGGGCTGTTTGGAACAAGAAGCGGCGGCTGGTTTAGCGTCGGTCGACGGTGTCTGCAGCGAAGCTAGGACTGGGCGGCGACTGCAGCGCACGAGTTTGGGAGTGCTGCGTGAATGATTGACTGGTAGTTCTTCTTGAAGACGGCGACGAAGTGCAGCGGACGGAAGGTCTCTGATACCATGCGAATGGGATAATTTTCTATTGAATAATCTGTATATCTTTACATAGGTGAATGAATAGGAGACTAGAATTGACAGCTCGTATACATTTACGTTAACAATAATATTAGCTGGAAATTGAGAGTGTACACTGGCGAAGAATTATACTAAGCCACACCGATCTCTCTCTAAGCCGCTCTCTTATTGTTTCTCTCTCTAGCTTCTTCTATGGTGGGAAGAAACCCTAATTCAGTTCTCTTCCCTCCTTGGCGGGATAAAATTCTGAACTCAATCGATACACTTTGCATTTCATGACCCCTTTCATGGAATCGGAAAAGCTCTTAATTTCACGAGGAAACCCTAGAAATTCAGCATATCCATCCGACCGCCAACTCCCCACCACCAGCGGCAGGACTATGCCCAATGAGTTGCCACAAAAGCCTCCCCCTTCTATAGCTCACCGGTTTAGAGCTCAGCTAAAGCAGCGGGATGACGAATTCAGGGTTTCTGGCCATGATATTGTGCCCCCTCCCACTGCTGAGGATATCGTTCAATTGTACGACCTCATGTTGTCCGAGCTCACCTTTAATTCGAAGCCCATCATTACGGATCTCACGGTTCTTGCTGATGAGCAGAGAGAACATGGGAAGGGCATTGCTGACTTAATTTGTGCACGTATTCTCGAGGTTCCGGTTGAGCAAAAACTTCCTTCATTATATTTATTGGATAGCATTGTTAAGAATGTTGGGCATGAATACATCAGTTATTTCTCGTCTCGTTTACCTGAGGTGTTTTGCGAGGCTTACAGGCAAGTTCATCCTAATTTGCATAATGCAATGCGCCACCTCTTTGGGACATGGGCAACTGTGTTTCCACCATCCATCATTCGGAAGATTGAAGCTCAACTTTCTCTGCTAACAGCACAAGAGTCGTCAAGTTTGACATCCTCAAGGGCTTCTGAATCTCCTCGGCCAACTCATGGCATTCATGTCAATCCAAAATACTTGCGTCAACTGGAACACTCAGTGGTGGATAAACATATCCAAGATGCAAGAGGGGCCTCAGCTCTAAAAGTTCATGATAAAAAGCTTGCTCCCGGATACGAAGAGTATGATTACGATCATGCAGATGTTCTTGAACATGGTGGAGGTCAAGCATTCCATCCAATGGGAAGCATTGGCCATGATTCTTTTGCTCTTGGAACAAATAAAGCAAATATAAAGCTAGCGAAATCATCTCTGTCTTCAAGAATTGGACACAGTAGACCTCTACAATCAGCTGGTGATGAACTTGAAGCAGTTAGAGCCTCACCCTCGCAGAATGTATATGATTATGAAGGTTCTAGAATGATTGATAGAATTGAGGATACTAATAAATGGAGAAGAAAACAATATCCTGACGATAATCTGAATGGACTTGAAAGTACTTCATATAATATTAGAAATGGACATGCACTTGAGGGACCAAGAGCTTTAATTGAAGCATATGGAAGTGATAAAGGAAAGGGTTATTTAAATGACAATCCACCTCAGGCTGAACATTTTTCTATCAATGGTATAGACAACAAGGTGACTCCAGTAACATGGCAGAACACTGAAGAAGAAGAGTTTGATTGGGAAGATATGAGCCCCACATTAGCCGATAGAGGCAGAAATAATGATATGTTGAAGCCACCTGTCCTGCCTTCAAGATTTAGGACAAGAATAGGATTTGAAAGATCAAATGCTATGTCTATAGAGCCTGGAATGAGAAGCAGTTGGTCTAGTCAGGTTCAGCTACC
TACTATTGATTCCTCCATGGTTATTGAAGATGTGGTCCAATCAACACCTGATATTTGGAATATGCACAATCACATTTCTCAGACATCCCAGAACCTCATGAACAATAAAGGAGCAGGAAGAAATTTCCAGATGCCTTTGTTGGGGAGAGGCATGGCTTCATCTGGTGGTGAGAAAATGTTTCCTTTTGCAGACAAGCTTTTGACCAATGATGCTTTACATAGGCCCCCAACCATTGCTTCGAGATTGGGTTCTTCTGGTCTTGACTCTAGCATGGAGTCACAATCAATTGTACAATCTATGGGCCCAAGGCATCCTCTGAATCTTTCTAACTCTTGCCCACCCTCTAGACCTCCAATTTTTCCTGTACCAAGACACAACAAGAGTCAGTTTGAGTCTTTAAATGGTAGTAATTCTCTCATCAATCGTGCAAATAGGTCTTTTTTGCCTGAGCAGCAAATGAATAACATGAGAAATAAGGAGCCAAGTCTTACAAGTAAGTTGCCACAAGTTGGCAATCAACATACTGGGCATATTCCTTTAACTCGGGGAAACCAATTGCAGCCCATCCCTTTAAAACCGCAATTTCTACCATCTCAGGACATGCAGGAAAATTTAAGTGCATCAGCAGTACCTCCAGCATTACCGCACTTAATGGCACCATCTTTGAGTCAAGGATACATTTCACAAGCACATCGCCCTGCTATTAGTGAGTGTTTGTCAAGTTCTGCCCCTATTGGGCAATGGAATTTGCCTGTTCACAATAGCCCCAGTAACCCTTTGCATTTACAAGGGGGGCCACTGCCACCTCTTCCACCTGGGCCTCATCCTACTTCTGTTCCGTCAATACCTCTCTCTCAAAAGGCAGGATCTCTTGTTCCTGGTCAGCAACCAGGAACTGCATTTCCTGGCCTGATAAGTTCTCTCATGGCCCACGGTTTAATCTCATTGAACAATCAAGCTTCTGTACAGGATTCTGTTGGGTTAGAATTCAATCCAGATGTACTCAAGGTGCGACATGAATCTGCAATAACTGCTCTATATGCTGATCTTCCTAGACAATGCATGACCTGTGGCCTTCGATTCAAGACCCAGGAAGAGCATAGTAATCATATGGATTGGCATGTCACTAAAAACCGTATGTCAAAAAGTAGGAAGCAGAAGCCTTCTCGCAAGTGGTTTGTAAGTATAAGCATGTGGCTTAGTGGTGCAGAGGCTCTAGGAACGGAGGCAGTTCCAGGATTTTTGCCTGCTGAGGTCATTGTAGAGAAAAAAGATGATGAAGAACTGGCTGTTCCCGCTGACGAGGATCAGAAGACGTGTGCATTATGTGGAGAACCTTTTGAGGATTTTTACAGTGATGAAACAGAGGAGTGGATGTATCGGGGCGCTGTCTACATGAATGCACCTGATGGACAAACAGCCGGCATGGATAGATCTCAGTTAGGGCCCATAGTGCATGCTAAATGCAGGACCGAAACTAATGTTGTTCCCTCCGAAAGTTTTGACCAGGATGAACAAGGGGGCATTGAATGGAGCTTGCAGTTGCGGCTTATCAGATCTTGCTCAAACGGAAAGCCAAGATTCACAGGATGGACCATCCACCGCCCCTTTTTTGTTTCTTTAATTAATTTTTGCTTTGTTTTTTCTTTGTTTTTATGGGTTCAAATTGTTATGAACAACCCAGTCGACGTCCAGCCAATCGCCGTCGAGAGTCGCCACCGCCGGACCTCCCCTCCTCTGGACTTTTATACTAAGCTTCTTGAGGCTGCTCATTTCACTTCGATTTCATTCATCTTGAGCAAGTTCGGAATTCTTTGA

Coding sequence (CDS)

ATGACCCCTTTCATGGAATCGGAAAAGCTCTTAATTTCACGAGGAAACCCTAGAAATTCAGCATATCCATCCGACCGCCAACTCCCCACCACCAGCGGCAGGACTATGCCCAATGAGTTGCCACAAAAGCCTCCCCCTTCTATAGCTCACCGGTTTAGAGCTCAGCTAAAGCAGCGGGATGACGAATTCAGGGTTTCTGGCCATGATATTGTGCCCCCTCCCACTGCTGAGGATATCGTTCAATTGTACGACCTCATGTTGTCCGAGCTCACCTTTAATTCGAAGCCCATCATTACGGATCTCACGGTTCTTGCTGATGAGCAGAGAGAACATGGGAAGGGCATTGCTGACTTAATTTGTGCACGTATTCTCGAGGTTCCGGTTGAGCAAAAACTTCCTTCATTATATTTATTGGATAGCATTGTTAAGAATGTTGGGCATGAATACATCAGTTATTTCTCGTCTCGTTTACCTGAGGTGTTTTGCGAGGCTTACAGGCAAGTTCATCCTAATTTGCATAATGCAATGCGCCACCTCTTTGGGACATGGGCAACTGTGTTTCCACCATCCATCATTCGGAAGATTGAAGCTCAACTTTCTCTGCTAACAGCACAAGAGTCGTCAAGTTTGACATCCTCAAGGGCTTCTGAATCTCCTCGGCCAACTCATGGCATTCATGTCAATCCAAAATACTTGCGTCAACTGGAACACTCAGTGGTGGATAAACATATCCAAGATGCAAGAGGGGCCTCAGCTCTAAAAGTTCATGATAAAAAGCTTGCTCCCGGATACGAAGAGTATGATTACGATCATGCAGATGTTCTTGAACATGGTGGAGGTCAAGCATTCCATCCAATGGGAAGCATTGGCCATGATTCTTTTGCTCTTGGAACAAATAAAGCAAATATAAAGCTAGCGAAATCATCTCTGTCTTCAAGAATTGGACACAGTAGACCTCTACAATCAGCTGGTGATGAACTTGAAGCAGTTAGAGCCTCACCCTCGCAGAATGTATATGATTATGAAGGTTCTAGAATGATTGATAGAATTGAGGATACTAATAAATGGAGAAGAAAACAATATCCTGACGATAATCTGAATGGACTTGAAAGTACTTCATATAATATTAGAAATGGACATGCACTTGAGGGACCAAGAGCTTTAATTGAAGCATATGGAAGTGATAAAGGAAAGGGTTATTTAAATGACAATCCACCTCAGGCTGAACATTTTTCTATCAATGGTATAGACAACAAGGTGACTCCAGTAACATGGCAGAACACTGAAGAAGAAGAGTTTGATTGGGAAGATATGAGCCCCACATTAGCCGATAGAGGCAGAAATAATGATATGTTGAAGCCACCTGTCCTGCCTTCAAGATTTAGGACAAGAATAGGATTTGAAAGATCAAATGCTATGTCTATAGAGCCTGGAATGAGAAGCAGTTGGTCTAGTCAGGTTCAGCTACCTACTATTGATTCCTCCATGGTTATTGAAGATGTGGTCCAATCAACACCTGATATTTGGAATATGCACAATCACATTTCTCAGACATCCCAGAACCTCATGAACAATAAAGGAGCAGGAAGAAATTTCCAGATGCCTTTGTTGGGGAGAGGCATGGCTTCATCTGGTGGTGAGAAAATGTTTCCTTTTGCAGACAAGCTTTTGACCAATGATGCTTTACATAGGCCCCCAACCATTGCTTCGAGATTGGGTTCTTCTGGTCTTGACTCTAGCATGGAGTCACAATCAATTGTACAATCTATGGGCCCAAGGCATCCTCTGAATCTTTCTAACTCTTGCCCACCCTCTAGACCTCCAATTTTTCCTGTACCAAGACACAACAAGAGTCAGTTTGAGTCTTTAAATGGTAGTAATTCTCTCATCAATCGTGCAAATAGGTCTTTTTTGCCTGAGCAGCAAATGAATAACATGAGAAATAAGGAGCCAAGTCTTACAAGTAAGTTGCCACAAGTTGGCAATCAACATACTGGGCA
TATTCCTTTAACTCGGGGAAACCAATTGCAGCCCATCCCTTTAAAACCGCAATTTCTACCATCTCAGGACATGCAGGAAAATTTAAGTGCATCAGCAGTACCTCCAGCATTACCGCACTTAATGGCACCATCTTTGAGTCAAGGATACATTTCACAAGCACATCGCCCTGCTATTAGTGAGTGTTTGTCAAGTTCTGCCCCTATTGGGCAATGGAATTTGCCTGTTCACAATAGCCCCAGTAACCCTTTGCATTTACAAGGGGGGCCACTGCCACCTCTTCCACCTGGGCCTCATCCTACTTCTGTTCCGTCAATACCTCTCTCTCAAAAGGCAGGATCTCTTGTTCCTGGTCAGCAACCAGGAACTGCATTTCCTGGCCTGATAAGTTCTCTCATGGCCCACGGTTTAATCTCATTGAACAATCAAGCTTCTGTACAGGATTCTGTTGGGTTAGAATTCAATCCAGATGTACTCAAGGTGCGACATGAATCTGCAATAACTGCTCTATATGCTGATCTTCCTAGACAATGCATGACCTGTGGCCTTCGATTCAAGACCCAGGAAGAGCATAGTAATCATATGGATTGGCATGTCACTAAAAACCGTATGTCAAAAAGTAGGAAGCAGAAGCCTTCTCGCAAGTGGTTTGTAAGTATAAGCATGTGGCTTAGTGGTGCAGAGGCTCTAGGAACGGAGGCAGTTCCAGGATTTTTGCCTGCTGAGGTCATTGTAGAGAAAAAAGATGATGAAGAACTGGCTGTTCCCGCTGACGAGGATCAGAAGACGTGTGCATTATGTGGAGAACCTTTTGAGGATTTTTACAGTGATGAAACAGAGGAGTGGATGTATCGGGGCGCTGTCTACATGAATGCACCTGATGGACAAACAGCCGGCATGGATAGATCTCAGTTAGGGCCCATAGTGCATGCTAAATGCAGGACCGAAACTAATGTTGTTCCCTCCGAAAGTTTTGACCAGGATGAACAAGGGGGCATTGAATGGAGCTTGCAGTTGCGGCTTATCAGATCTTGCTCAAACGGAAAGCCAAGATTCACAGGATGGACCATCCACCGCCCCTTTTTTGTTTCTTTAATTAATTTTTGCTTTGTTTTTTCTTTGTTTTTATGGGTTCAAATTGTTATGAACAACCCAGTCGACGTCCAGCCAATCGCCGTCGAGAGTCGCCACCGCCGGACCTCCCCTCCTCTGGACTTTTATACTAAGCTTCTTGAGGCTGCTCATTTCACTTCGATTTCATTCATCTTGAGCAAGTTCGGAATTCTTTGA

Protein sequence

MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRDDEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNKANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQYPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKVTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGMRSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGRGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQAHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAGSLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGIEWSLQLRLIRSCSNGKPRFTGWTIHRPFFVSLINFCFVFSLFLWVQIVMNNPVDVQPIAVESRHRRTSPPLDFYTKLLEAAHFTSISFILSKFGIL
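
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the helper name translate is illustrative, and the two short inputs are the first and last codons of the CDS as listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGACCCCTTTCATGGAATCGGAAAAG"))  # first nine codons of the CDS
print(translate("GGAATTCTTTGA"))                 # last three codons plus stop
```

Passing the full CDS string, with its line breaks removed, to the same function should reproduce the protein sequence listed above.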
Homology
BLAST of Cla97C03G053860 vs. NCBI nr
Match: XP_038894060.1 (polyadenylation and cleavage factor homolog 4 isoform X3 [Benincasa hispida])

HSP 1 Score: 1899.8 bits (4920), Expect = 0.0e+00
Identity = 947/999 (94.79%), Positives = 964/999 (96.50%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPVEQKLPSLYLLDSIVKNVG EYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVEQKLPSLYLLDSIVKNVGQEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLS LTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
            DK I DARG SALKVHDKKLA GYEEYDYDHA+VLEHGG QAFH + S+ HDSFALGTNK
Sbjct: 241  DKQIHDARGVSALKVHDKKLASGYEEYDYDHAEVLEHGGAQAFH-LRSMAHDSFALGTNK 300

Query: 301  ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
            ANIKLAKSS SSRIGH+RPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ
Sbjct: 301  ANIKLAKSSPSSRIGHNRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420
            YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV
Sbjct: 361  YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420

Query: 421  TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGMR 480
            TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRF TR GFERSNAMSIEPGMR
Sbjct: 421  TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPSVPPSRFVTRTGFERSNAMSIEPGMR 480

Query: 481  SSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGRG 540
            S+WSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQ PLLGRG
Sbjct: 481  SNWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQTPLLGRG 540

Query: 541  MASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSN 600
            +A SGGEKM PFADKLLTNDALHRP TIASRLGSSGLDSSME QSIVQSMGPRHPLNL N
Sbjct: 541  IALSGGEKMSPFADKLLTNDALHRPTTIASRLGSSGLDSSMELQSIVQSMGPRHPLNLPN 600

Query: 601  SCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQV 660
            SCPPSRPPIFPVPRHNKS FESLNG NS INRANRSFLPEQQMNNMRNKE SLT+KLPQV
Sbjct: 601  SCPPSRPPIFPVPRHNKSPFESLNGGNSFINRANRSFLPEQQMNNMRNKELSLTTKLPQV 660

Query: 661  GNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQA 720
            GNQHTGHIPLTRGNQLQ IPLKPQFLPSQDMQ+NLSAS VPPALPHLMAPSLSQGYISQ 
Sbjct: 661  GNQHTGHIPLTRGNQLQAIPLKPQFLPSQDMQDNLSASVVPPALPHLMAPSLSQGYISQG 720

Query: 721  HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAGS 780
            HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTS+P+IP+ QKAGS
Sbjct: 721  HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSIPTIPIPQKAGS 780

Query: 781  LVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADL 840
            LVPGQ+PGT F GLISSLMA GLISLNNQ SVQDSVGLEFNPDVLKVRHESAITALYADL
Sbjct: 781  LVPGQRPGTEFSGLISSLMAQGLISLNNQPSVQDSVGLEFNPDVLKVRHESAITALYADL 840

Query: 841  PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900
            PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA
Sbjct: 841  PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900

Query: 901  VPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960
            VPGFLP EVIVEKKDDEELAVPAD+DQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD
Sbjct: 901  VPGFLPPEVIVEKKDDEELAVPADDDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960

Query: 961  GQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGI 1000
            GQTAGMDRSQLGPIVHAKCRTETNVV SESF+Q+EQGG+
Sbjct: 961  GQTAGMDRSQLGPIVHAKCRTETNVVTSESFEQEEQGGV 998
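
The Identity and Positives figures in the HSP header can be related to the middle line of each alignment block: a letter marks an identical residue, a "+" marks a conservative substitution, and a blank marks a mismatch or gap. A minimal sketch of that counting follows, using the last block of this alignment as input; the function names are illustrative and are not part of any BLAST tool.

```python
def midline_counts(query_row, midline):
    """Count identities and positives in one alignment block."""
    # Pad in case trailing blanks were trimmed from the midline.
    midline = midline.ljust(len(query_row))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(part, total):
    """Format a fraction as a percentage with two decimals."""
    return f"{100 * part / total:.2f}"


query = "GQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGI"
mid = "GQTAGMDRSQLGPIVHAKCRTETNVV SESF+Q+EQGG+"
print(midline_counts(query, mid))
print(percent(947, 999), percent(964, 999))
```

Summing the per-block counts over the whole alignment gives the numerators reported in the header; the denominator is the number of alignment columns, including gap columns.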

BLAST of Cla97C03G053860 vs. NCBI nr
Match: XP_038894058.1 (polyadenylation and cleavage factor homolog 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 1895.9 bits (4910), Expect = 0.0e+00
Identity = 946/997 (94.88%), Positives = 962/997 (96.49%), Query Frame = 0

Query: 1   MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
           MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1   MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61  DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
           DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61  DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121 ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
           ARILEVPVEQKLPSLYLLDSIVKNVG EYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121 ARILEVPVEQKLPSLYLLDSIVKNVGQEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181 GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
           GTWATVFPPSIIRKIEAQLS LTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181 GTWATVFPPSIIRKIEAQLSQLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241 DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
           DK I DARG SALKVHDKKLA GYEEYDYDHA+VLEHGG QAFH + S+ HDSFALGTNK
Sbjct: 241 DKQIHDARGVSALKVHDKKLASGYEEYDYDHAEVLEHGGAQAFH-LRSMAHDSFALGTNK 300

Query: 301 ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
           ANIKLAKSS SSRIGH+RPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ
Sbjct: 301 ANIKLAKSSPSSRIGHNRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360

Query: 361 YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420
           YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV
Sbjct: 361 YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420

Query: 421 TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGMR 480
           TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRF TR GFERSNAMSIEPGMR
Sbjct: 421 TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPSVPPSRFVTRTGFERSNAMSIEPGMR 480

Query: 481 SSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGRG 540
           S+WSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQ PLLGRG
Sbjct: 481 SNWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQTPLLGRG 540

Query: 541 MASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSN 600
           +A SGGEKM PFADKLLTNDALHRP TIASRLGSSGLDSSME QSIVQSMGPRHPLNL N
Sbjct: 541 IALSGGEKMSPFADKLLTNDALHRPTTIASRLGSSGLDSSMELQSIVQSMGPRHPLNLPN 600

Query: 601 SCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQV 660
           SCPPSRPPIFPVPRHNKS FESLNG NS INRANRSFLPEQQMNNMRNKE SLT+KLPQV
Sbjct: 601 SCPPSRPPIFPVPRHNKSPFESLNGGNSFINRANRSFLPEQQMNNMRNKELSLTTKLPQV 660

Query: 661 GNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQA 720
           GNQHTGHIPLTRGNQLQ IPLKPQFLPSQDMQ+NLSAS VPPALPHLMAPSLSQGYISQ 
Sbjct: 661 GNQHTGHIPLTRGNQLQAIPLKPQFLPSQDMQDNLSASVVPPALPHLMAPSLSQGYISQG 720

Query: 721 HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAGS 780
           HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTS+P+IP+ QKAGS
Sbjct: 721 HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSIPTIPIPQKAGS 780

Query: 781 LVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADL 840
           LVPGQ+PGT F GLISSLMA GLISLNNQ SVQDSVGLEFNPDVLKVRHESAITALYADL
Sbjct: 781 LVPGQRPGTEFSGLISSLMAQGLISLNNQPSVQDSVGLEFNPDVLKVRHESAITALYADL 840

Query: 841 PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900
           PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA
Sbjct: 841 PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900

Query: 901 VPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960
           VPGFLP EVIVEKKDDEELAVPAD+DQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD
Sbjct: 901 VPGFLPPEVIVEKKDDEELAVPADDDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960

Query: 961 GQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQG 998
           GQTAGMDRSQLGPIVHAKCRTETNVV SESF+Q+EQG
Sbjct: 961 GQTAGMDRSQLGPIVHAKCRTETNVVTSESFEQEEQG 996

BLAST of Cla97C03G053860 vs. NCBI nr
Match: XP_038894059.1 (polyadenylation and cleavage factor homolog 4 isoform X2 [Benincasa hispida])

HSP 1 Score: 1891.7 bits (4899), Expect = 0.0e+00
Identity = 946/997 (94.88%), Positives = 962/997 (96.49%), Query Frame = 0

Query: 1   MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
           MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1   MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61  DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
           DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61  DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121 ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
           ARILEVPVEQKLPSLYLLDSIVKNVG EYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121 ARILEVPVEQKLPSLYLLDSIVKNVGQEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181 GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
           GTWATVFPPSIIRKIEAQLS LTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181 GTWATVFPPSIIRKIEAQLSQLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241 DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
           DK I DARG SALKVHDKKLA GYEEYDYDHA+VLEHGG QAFH + S+ HDSFALGTNK
Sbjct: 241 DK-IHDARGVSALKVHDKKLASGYEEYDYDHAEVLEHGGAQAFH-LRSMAHDSFALGTNK 300

Query: 301 ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
           ANIKLAKSS SSRIGH+RPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ
Sbjct: 301 ANIKLAKSSPSSRIGHNRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360

Query: 361 YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420
           YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV
Sbjct: 361 YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420

Query: 421 TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGMR 480
           TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRF TR GFERSNAMSIEPGMR
Sbjct: 421 TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPSVPPSRFVTRTGFERSNAMSIEPGMR 480

Query: 481 SSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGRG 540
           S+WSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQ PLLGRG
Sbjct: 481 SNWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQTPLLGRG 540

Query: 541 MASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSN 600
           +A SGGEKM PFADKLLTNDALHRP TIASRLGSSGLDSSME QSIVQSMGPRHPLNL N
Sbjct: 541 IALSGGEKMSPFADKLLTNDALHRPTTIASRLGSSGLDSSMELQSIVQSMGPRHPLNLPN 600

Query: 601 SCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQV 660
           SCPPSRPPIFPVPRHNKS FESLNG NS INRANRSFLPEQQMNNMRNKE SLT+KLPQV
Sbjct: 601 SCPPSRPPIFPVPRHNKSPFESLNGGNSFINRANRSFLPEQQMNNMRNKELSLTTKLPQV 660

Query: 661 GNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQA 720
           GNQHTGHIPLTRGNQLQ IPLKPQFLPSQDMQ+NLSAS VPPALPHLMAPSLSQGYISQ 
Sbjct: 661 GNQHTGHIPLTRGNQLQAIPLKPQFLPSQDMQDNLSASVVPPALPHLMAPSLSQGYISQG 720

Query: 721 HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAGS 780
           HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTS+P+IP+ QKAGS
Sbjct: 721 HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSIPTIPIPQKAGS 780

Query: 781 LVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADL 840
           LVPGQ+PGT F GLISSLMA GLISLNNQ SVQDSVGLEFNPDVLKVRHESAITALYADL
Sbjct: 781 LVPGQRPGTEFSGLISSLMAQGLISLNNQPSVQDSVGLEFNPDVLKVRHESAITALYADL 840

Query: 841 PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900
           PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA
Sbjct: 841 PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900

Query: 901 VPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960
           VPGFLP EVIVEKKDDEELAVPAD+DQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD
Sbjct: 901 VPGFLPPEVIVEKKDDEELAVPADDDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960

Query: 961 GQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQG 998
           GQTAGMDRSQLGPIVHAKCRTETNVV SESF+Q+EQG
Sbjct: 961 GQTAGMDRSQLGPIVHAKCRTETNVVTSESFEQEEQG 995

BLAST of Cla97C03G053860 vs. NCBI nr
Match: XP_008462986.1 (PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X2 [Cucumis melo])

HSP 1 Score: 1844.3 bits (4776), Expect = 0.0e+00
Identity = 917/1011 (90.70%), Positives = 952/1011 (94.16%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            MT FMESEKLLISRGNPRNSAYPSDR +PTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPV+QKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLS LTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
            DKH QD+RG SA+KVHDKKLA GYEEYDYDHAD LEHGG Q FH MGS+GHDSF+LGTNK
Sbjct: 241  DKHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
            AN+KLAKSSLSSRIGH RPLQS GDELE+VRASPSQNVYDYEGS+++DR EDTNKWRRKQ
Sbjct: 301  ANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLEST-SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNK 420
            YPDDN+NGLE+T SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+GIDNK
Sbjct: 361  YPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNK 420

Query: 421  VTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGM 480
             TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRFRTR GFERSNAM IEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGR 540
            RS+WSSQVQLP IDSS+VIEDVV STPDIW MHNHISQTSQNLMNNKG GRNFQMP+LGR
Sbjct: 481  RSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGR 540

Query: 541  GMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            G+ SSGGEKM P+ DKLLTNDALHRP  IASRLGSSGLDS+MESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQ 660
            NSCPPSRPP+FPVPRHN SQFESLNGSNS +N ANR+FLPEQQMNN+RNKE SLT+K PQ
Sbjct: 601  NSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQ +PLKPQFLPSQDMQ+N S SAVPP LPHL+APSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQ 720

Query: 721  AHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAG 780
             HRPA SE LSSSAPIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS P+IP+SQK  
Sbjct: 721  GHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQK-- 780

Query: 781  SLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840
              VPGQQPGTA  GLISSLMA GLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD
Sbjct: 781  --VPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840

Query: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900
            LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE
Sbjct: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900

Query: 901  AVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960
            AVPGFLPAEV+VEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP
Sbjct: 901  AVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960

Query: 961  DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGIEWSLQLRLIRS 1011
            DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDE G  E   + + +RS
Sbjct: 961  DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1007

BLAST of Cla97C03G053860 vs. NCBI nr
Match: XP_008462960.1 (PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis melo] >XP_008462968.1 PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis melo])

HSP 1 Score: 1838.2 bits (4760), Expect = 0.0e+00
Identity = 917/1016 (90.26%), Positives = 952/1016 (93.70%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            MT FMESEKLLISRGNPRNSAYPSDR +PTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPV+QKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLS LTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DK-----HIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFA 300
            DK     H QD+RG SA+KVHDKKLA GYEEYDYDHAD LEHGG Q FH MGS+GHDSF+
Sbjct: 241  DKLLALQHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFS 300

Query: 301  LGTNKANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNK 360
            LGTNKAN+KLAKSSLSSRIGH RPLQS GDELE+VRASPSQNVYDYEGS+++DR EDTNK
Sbjct: 301  LGTNKANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNK 360

Query: 361  WRRKQYPDDNLNGLEST-SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIN 420
            WRRKQYPDDN+NGLE+T SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+
Sbjct: 361  WRRKQYPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIS 420

Query: 421  GIDNKVTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMS 480
            GIDNK TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRFRTR GFERSNAM 
Sbjct: 421  GIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMP 480

Query: 481  IEPGMRSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQM 540
            IEPGMRS+WSSQVQLP IDSS+VIEDVV STPDIW MHNHISQTSQNLMNNKG GRNFQM
Sbjct: 481  IEPGMRSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQM 540

Query: 541  PLLGRGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRH 600
            P+LGRG+ SSGGEKM P+ DKLLTNDALHRP  IASRLGSSGLDS+MESQSIVQSMGPRH
Sbjct: 541  PMLGRGITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRH 600

Query: 601  PLNLSNSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLT 660
            PLNLSNSCPPSRPP+FPVPRHN SQFESLNGSNS +N ANR+FLPEQQMNN+RNKE SLT
Sbjct: 601  PLNLSNSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLT 660

Query: 661  SKLPQVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQ 720
            +K PQVGNQHTGHIPLTRGNQLQ +PLKPQFLPSQDMQ+N S SAVPP LPHL+APSLSQ
Sbjct: 661  TKSPQVGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQ 720

Query: 721  GYISQAHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPL 780
            GYISQ HRPA SE LSSSAPIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS P+IP+
Sbjct: 721  GYISQGHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI 780

Query: 781  SQKAGSLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAIT 840
            SQK    VPGQQPGTA  GLISSLMA GLISLNNQASVQDSVGLEFNPDVLKVRHESAIT
Sbjct: 781  SQK----VPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAIT 840

Query: 841  ALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAE 900
            ALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAE
Sbjct: 841  ALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAE 900

Query: 901  ALGTEAVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAV 960
            ALGTEAVPGFLPAEV+VEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAV
Sbjct: 901  ALGTEAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAV 960

Query: 961  YMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGIEWSLQLRLIRS 1011
            YMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDE G  E   + + +RS
Sbjct: 961  YMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1012

BLAST of Cla97C03G053860 vs. ExPASy Swiss-Prot
Match: Q0WPF2 (Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN=PCFS4 PE=1 SV=1)

HSP 1 Score: 641.3 bits (1653), Expect = 1.9e-182
Identity = 439/987 (44.48%), Positives = 555/987 (56.23%), Query Frame = 0

Query: 5   MESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQK--PPPSIAHRFRAQLKQRDDE 64
           M+SEK+L    NPR  +      + +TS + M  ELPQK  PPPS+  RF+A L QR+DE
Sbjct: 1   MDSEKIL----NPRLVS------INSTSRKGMSVELPQKPPPPPSLLDRFKALLNQREDE 60

Query: 65  FRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICAR 124
           F   G + V PP+ ++IVQLY+++L ELTFNSKPIITDLT++A EQREHG+GIA+ IC R
Sbjct: 61  F--GGGEEVLPPSMDEIVQLYEVVLGELTFNSKPIITDLTIIAGEQREHGEGIANAICTR 120

Query: 125 ILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGT 184
           ILE PVEQKLPSLYLLDSIVKN+G +Y  YFSSRLPEVFC AYRQ HP+LH +MRHLFGT
Sbjct: 121 ILEAPVEQKLPSLYLLDSIVKNIGRDYGRYFSSRLPEVFCLAYRQAHPSLHPSMRHLFGT 180

Query: 185 WATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDK 244
           W++VFPP ++RKI+ QL L +A   SS+    ASE  +PT GIHVNPKYLR+LE S  + 
Sbjct: 181 WSSVFPPPVLRKIDMQLQLSSAANQSSV---GASEPSQPTRGIHVNPKYLRRLEPSAAEN 240

Query: 245 HIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNKAN 304
           +++     S+ +V+ +    GY +++    D LE     +  P      D F   +N   
Sbjct: 241 NLRGIN--SSARVYGQNSLGGYNDFE----DQLESPSSLSSTP------DGFTRRSNDG- 300

Query: 305 IKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQYP 364
                                        A+PS   ++Y   R   R ++  +WRRK+  
Sbjct: 301 -----------------------------ANPSNQAFNYGMGRATSRDDEHMEWRRKE-- 360

Query: 365 DDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNK-VT 424
                       N+  G+  E PRALI+AYG D  K    + P +     +NG+ +K VT
Sbjct: 361 ------------NLGQGNDHERPRALIDAYGVDTSKHVTINKPIR----DMNGMHSKMVT 420

Query: 425 PVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP--PVLPS-RFRTRIGFERSNAMSIEPG 484
           P  WQNTEEEEFDWEDMSPTL DR R  + L+   P L S R R R+G   ++   ++  
Sbjct: 421 P--WQNTEEEEFDWEDMSPTL-DRSRAGEFLRSSVPALGSVRARPRVG--NTSDFHLDSD 480

Query: 485 MRSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLG 544
           +++  S Q++                  + W++  +   TS  +  +  AG++ ++    
Sbjct: 481 IKNGVSHQLR------------------ENWSLSQNYPHTSNRV--DTRAGKDLKVLASS 540

Query: 545 RGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNL 604
            G+ SS  E   P  D +           + SR G +  D +    S   + GP      
Sbjct: 541 VGLVSSNSEFGAPPFDSI---------QDVNSRFGRALPDGTWPHLS---ARGP------ 600

Query: 605 SNSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLP 664
            NS         PVP            S  L + AN    P   M+N    +P L     
Sbjct: 601 -NS--------LPVP------------SAHLHHLAN----PGNAMSNRLQGKP-LYRPEN 660

Query: 665 QVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYIS 724
           QV   H     +T+ NQ     +   +LP        S+SA+ P     +   +S GY  
Sbjct: 661 QVSQSHLN--DMTQQNQ-----MLVNYLP--------SSSAMAPRPMQSLLTHVSHGY-- 720

Query: 725 QAHRPAISECLSSSAPIGQWNLPVHNSPSNP-LHLQGGPLPPLPPGPHPTSVPSIPLSQK 784
                                 P H S   P L +QGG         HP S  S  LSQ 
Sbjct: 721 ----------------------PPHGSTIRPSLSIQGG------EAMHPLS--SGVLSQI 780

Query: 785 AGSLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALY 844
             S    Q PG AF GLI SLMA GLISLNNQ + Q  +GLEF+ D+LK+R+ESAI+ALY
Sbjct: 781 GAS---NQPPGGAFSGLIGSLMAQGLISLNNQPAGQGPLGLEFDADMLKIRNESAISALY 793

Query: 845 ADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALG 904
            DLPRQC TCGLRFK QEEHS HMDWHVTKNRMSK+ KQ PSRKWFVS SMWLSGAEALG
Sbjct: 841 GDLPRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHKQNPSRKWFVSASMWLSGAEALG 793

Query: 905 TEAVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMN 964
            EAVPGFLP E   EKKDDE++AVPADEDQ +CALCGEPFEDFYSDETEEWMY+GAVYMN
Sbjct: 901 AEAVPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGEPFEDFYSDETEEWMYKGAVYMN 793

Query: 965 APDGQTAGMDRSQLGPIVHAKCRTETN 985
           AP+  T  MD+SQLGPIVHAKCR E+N
Sbjct: 961 APEESTTDMDKSQLGPIVHAKCRPESN 793

BLAST of Cla97C03G053860 vs. ExPASy Swiss-Prot
Match: Q9C710 (Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN=PCFS1 PE=1 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 2.4e-39
Identity = 101/209 (48.33%), Positives = 123/209 (58.85%), Query Frame = 0

Query: 792 PGLISSLMAHGLISLNN-------QASVQDS--VGLEF-NPDVLKVRHESAITALYADLP 851
           P ++S  +   L  LNN       +AS  DS  VGL F NP  L VRHES I +LY+D+P
Sbjct: 194 PIVLSKELTDLLSLLNNEKEKKTLEASNSDSLPVGLSFDNPSSLNVRHESVIKSLYSDMP 253

Query: 852 RQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKS-----RKQKPSRKWFVSISMWLSGAEAL 911
           RQC +CGLRFK QEEHS HMDWHV KNR  K+     ++ K SR W  S S+WL  A   
Sbjct: 254 RQCSSCGLRFKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAATGG 313

Query: 912 GTEAVPGFLPAEVIVEKKDDEE---LAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGA 971
            T  V  F   E+  +K  DEE   L VPADEDQK CALC EPFE+F+S E ++WMY+ A
Sbjct: 314 ETVEVASF-GGEMQKKKGKDEEPKQLMVPADEDQKNCALCVEPFEEFFSHEDDDWMYKDA 373

Query: 972 VYMNAPDGQTAGMDRSQLGPIVHAKCRTE 983
           VY+            ++ G IVH KC  E
Sbjct: 374 VYL------------TKNGRIVHVKCMPE 389

BLAST of Cla97C03G053860 vs. ExPASy Swiss-Prot
Match: Q9FIX8 (Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN=PCFS5 PE=1 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 1.2e-38
Identity = 96/209 (45.93%), Positives = 122/209 (58.37%), Query Frame = 0

Query: 792 PGLISSLMAHGLISLNN-------QASVQDS--VGLEF-NPDVLKVRHESAITALYADLP 851
           P ++S  +   L  LNN       +AS  DS  VGL F NP  L VRHES I +LY+D+P
Sbjct: 187 PIVLSKELTDLLSLLNNEKEKKTSEASNNDSLPVGLSFDNPSSLNVRHESVIKSLYSDMP 246

Query: 852 RQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKS-----RKQKPSRKWFVSISMWLSGAEAL 911
           RQC +CG+RFK QEEHS HMDWHV KNR  K+     ++ K SR W  S S+WL      
Sbjct: 247 RQCTSCGVRFKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAPTGG 306

Query: 912 GTEAVPGFLPAEVIVEKKDDE---ELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGA 971
           GT  V  F   E+  + + D+   +  VPADEDQK CALC EPFE+F+S E ++WMY+ A
Sbjct: 307 GTVEVASFGGGEMQKKNEKDQVQKQHMVPADEDQKNCALCVEPFEEFFSHEADDWMYKDA 366

Query: 972 VYMNAPDGQTAGMDRSQLGPIVHAKCRTE 983
           VY+            ++ G IVH KC  E
Sbjct: 367 VYL------------TKNGRIVHVKCMPE 383

BLAST of Cla97C03G053860 vs. ExPASy Swiss-Prot
Match: O94913 (Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens OX=9606 GN=PCF11 PE=1 SV=3)

HSP 1 Score: 98.2 bits (243), Expect = 6.1e-19
Identity = 57/158 (36.08%), Positives = 83/158 (52.53%), Query Frame = 0

Query: 77  EDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVEQKLPSLY 136
           ED  + Y   L +LTFNSKP I  LT+LA+E     K I  LI A+  + P  +KLP +Y
Sbjct: 16  EDACRDYQSSLEDLTFNSKPHINMLTILAEENLPFAKEIVSLIEAQTAKAPSSEKLPVMY 75

Query: 137 LLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIRKIE 196
           L+DSIVKNVG EY++ F+  L   F   + +V  N   ++  L  TW  +FP   +  ++
Sbjct: 76  LMDSIVKNVGREYLTAFTKNLVATFICVFEKVDENTRKSLFKLRSTWDEIFPLKKLYALD 135

Query: 197 AQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQ 235
            +++ L             +     T  IHVNPK+L +
Sbjct: 136 VRVNSLDPAWPIKPLPPNVN-----TSSIHVNPKFLNK 168

BLAST of Cla97C03G053860 vs. ExPASy Swiss-Prot
Match: Q10237 (Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPAC4G9.04c PE=4 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 2.6e-14
Identity = 59/159 (37.11%), Positives = 75/159 (47.17%), Query Frame = 0

Query: 78  DIVQL-YDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVEQKLPSLY 137
           D+V+L Y   L +LTFNSKPII  LT +A E   +   I + I   I + P   KLP+LY
Sbjct: 2   DLVELDYLSALEDLTFNSKPIIHTLTYIAQENEPYAISIVNAIEKHIQKCPPNCKLPALY 61

Query: 138 LLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTW----------ATV 197
           LLDSI KN+G  Y  +F   L   F  AY  V P L   +  L  TW            V
Sbjct: 62  LLDSISKNLGAPYTYFFGLHLFSTFMSAYTVVEPRLRLKLDQLLATWKQRPPNSSSLEPV 121

Query: 198 FPPSIIRKIEAQL----SLLTAQESSSLTSSRASESPRP 222
           F P +  KIE  L    S +   +S  L ++  S    P
Sbjct: 122 FSPIVTAKIENALLKYKSTILRHQSPLLANTSISSFSAP 160

BLAST of Cla97C03G053860 vs. ExPASy TrEMBL
Match: A0A1S3CI66 (polyadenylation and cleavage factor homolog 4 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501218 PE=4 SV=1)

HSP 1 Score: 1844.3 bits (4776), Expect = 0.0e+00
Identity = 917/1011 (90.70%), Positives = 952/1011 (94.16%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            MT FMESEKLLISRGNPRNSAYPSDR +PTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPV+QKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLS LTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
            DKH QD+RG SA+KVHDKKLA GYEEYDYDHAD LEHGG Q FH MGS+GHDSF+LGTNK
Sbjct: 241  DKHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
            AN+KLAKSSLSSRIGH RPLQS GDELE+VRASPSQNVYDYEGS+++DR EDTNKWRRKQ
Sbjct: 301  ANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLEST-SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNK 420
            YPDDN+NGLE+T SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+GIDNK
Sbjct: 361  YPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNK 420

Query: 421  VTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGM 480
             TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRFRTR GFERSNAM IEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGR 540
            RS+WSSQVQLP IDSS+VIEDVV STPDIW MHNHISQTSQNLMNNKG GRNFQMP+LGR
Sbjct: 481  RSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGR 540

Query: 541  GMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            G+ SSGGEKM P+ DKLLTNDALHRP  IASRLGSSGLDS+MESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQ 660
            NSCPPSRPP+FPVPRHN SQFESLNGSNS +N ANR+FLPEQQMNN+RNKE SLT+K PQ
Sbjct: 601  NSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQ +PLKPQFLPSQDMQ+N S SAVPP LPHL+APSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQ 720

Query: 721  AHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAG 780
             HRPA SE LSSSAPIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS P+IP+SQK  
Sbjct: 721  GHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQK-- 780

Query: 781  SLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840
              VPGQQPGTA  GLISSLMA GLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD
Sbjct: 781  --VPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840

Query: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900
            LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE
Sbjct: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900

Query: 901  AVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960
            AVPGFLPAEV+VEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP
Sbjct: 901  AVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960

Query: 961  DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGIEWSLQLRLIRS 1011
            DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDE G  E   + + +RS
Sbjct: 961  DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1007

BLAST of Cla97C03G053860 vs. ExPASy TrEMBL
Match: A0A1S3CJP9 (polyadenylation and cleavage factor homolog 4 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501218 PE=4 SV=1)

HSP 1 Score: 1838.2 bits (4760), Expect = 0.0e+00
Identity = 917/1016 (90.26%), Positives = 952/1016 (93.70%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            MT FMESEKLLISRGNPRNSAYPSDR +PTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHD+VPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPV+QKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLS LTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DK-----HIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFA 300
            DK     H QD+RG SA+KVHDKKLA GYEEYDYDHAD LEHGG Q FH MGS+GHDSF+
Sbjct: 241  DKLLALQHTQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFS 300

Query: 301  LGTNKANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNK 360
            LGTNKAN+KLAKSSLSSRIGH RPLQS GDELE+VRASPSQNVYDYEGS+++DR EDTNK
Sbjct: 301  LGTNKANVKLAKSSLSSRIGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNK 360

Query: 361  WRRKQYPDDNLNGLEST-SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIN 420
            WRRKQYPDDN+NGLE+T SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+
Sbjct: 361  WRRKQYPDDNMNGLENTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIS 420

Query: 421  GIDNKVTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMS 480
            GIDNK TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP V PSRFRTR GFERSNAM 
Sbjct: 421  GIDNKATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMP 480

Query: 481  IEPGMRSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQM 540
            IEPGMRS+WSSQVQLP IDSS+VIEDVV STPDIW MHNHISQTSQNLMNNKG GRNFQM
Sbjct: 481  IEPGMRSNWSSQVQLPGIDSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQM 540

Query: 541  PLLGRGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRH 600
            P+LGRG+ SSGGEKM P+ DKLLTNDALHRP  IASRLGSSGLDS+MESQSIVQSMGPRH
Sbjct: 541  PMLGRGITSSGGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRH 600

Query: 601  PLNLSNSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLT 660
            PLNLSNSCPPSRPP+FPVPRHN SQFESLNGSNS +N ANR+FLPEQQMNN+RNKE SLT
Sbjct: 601  PLNLSNSCPPSRPPVFPVPRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLT 660

Query: 661  SKLPQVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQ 720
            +K PQVGNQHTGHIPLTRGNQLQ +PLKPQFLPSQDMQ+N S SAVPP LPHL+APSLSQ
Sbjct: 661  TKSPQVGNQHTGHIPLTRGNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQ 720

Query: 721  GYISQAHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPL 780
            GYISQ HRPA SE LSSSAPIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS P+IP+
Sbjct: 721  GYISQGHRPANSEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPI 780

Query: 781  SQKAGSLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAIT 840
            SQK    VPGQQPGTA  GLISSLMA GLISLNNQASVQDSVGLEFNPDVLKVRHESAIT
Sbjct: 781  SQK----VPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAIT 840

Query: 841  ALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAE 900
            ALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAE
Sbjct: 841  ALYADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAE 900

Query: 901  ALGTEAVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAV 960
            ALGTEAVPGFLPAEV+VEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAV
Sbjct: 901  ALGTEAVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAV 960

Query: 961  YMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGIEWSLQLRLIRS 1011
            YMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDE G  E   + + +RS
Sbjct: 961  YMNAPDGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEGGVSEDGNRRKRLRS 1012

BLAST of Cla97C03G053860 vs. ExPASy TrEMBL
Match: A0A0A0LVG0 (CID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G109350 PE=4 SV=1)

HSP 1 Score: 1827.8 bits (4733), Expect = 0.0e+00
Identity = 917/1011 (90.70%), Positives = 944/1011 (93.37%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            MT FMESEKLLISRGNPRNS YPSDR +PTTSGRTMPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MTRFMESEKLLISRGNPRNSVYPSDRPIPTTSGRTMPNELPQKPAPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSGHD+VP PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC
Sbjct: 61   DEFRVSGHDVVPLPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            ARILEVPV+QKLPSLYLLDSIVKNVGHEYISYF+SRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  ARILEVPVDQKLPSLYLLDSIVKNVGHEYISYFASRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTWATVFPPSIIRKIEAQLS LTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVV
Sbjct: 181  GTWATVFPPSIIRKIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240

Query: 241  DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
            DKH QD+RG SA+KVHDKKLA GYEEYDYDHAD LEHGG Q FH MGS+GHDSF+LGTNK
Sbjct: 241  DKHSQDSRGTSAIKVHDKKLASGYEEYDYDHADALEHGGPQGFHSMGSMGHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
            ANIKLAKSSLSSRIG  RPLQS GDE E VRASPSQNVYDYEGS+MIDR EDTNKWRRKQ
Sbjct: 301  ANIKLAKSSLSSRIGPHRPLQSVGDEHETVRASPSQNVYDYEGSKMIDRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLEST-SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNK 420
            YPDDNLNGLEST SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSIN IDNK
Sbjct: 361  YPDDNLNGLESTSSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINVIDNK 420

Query: 421  VTPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGM 480
             TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPV PSRFRTR GFERSNAM IEPGM
Sbjct: 421  ATPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVPPSRFRTRSGFERSNAMPIEPGM 480

Query: 481  RSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGR 540
            RS+WSS V+LP IDSS+VIEDVV STPD WNMHNHISQTSQNLMNNKG GRNFQMP+LGR
Sbjct: 481  RSNWSSPVRLPGIDSSIVIEDVVHSTPDNWNMHNHISQTSQNLMNNKGQGRNFQMPMLGR 540

Query: 541  GMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600
            G+ SS GEKM P+ DKLLTNDALHRP  IASRLGSSGLDSSMESQSIVQSMGPRHPLNLS
Sbjct: 541  GITSSVGEKMSPYGDKLLTNDALHRPTNIASRLGSSGLDSSMESQSIVQSMGPRHPLNLS 600

Query: 601  NSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQ 660
            NSCPPSRPPIFPVPRHN SQFESLNGSNS +N ANR+FLPEQQMNN+RNKE SLT+K PQ
Sbjct: 601  NSCPPSRPPIFPVPRHNASQFESLNGSNSFMNCANRTFLPEQQMNNLRNKELSLTTKSPQ 660

Query: 661  VGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQ 720
            VGNQHTGHIPLTRGNQLQ +PLKPQFLPSQDMQ+N S SAVPP LPHLMAPSLSQGYISQ
Sbjct: 661  VGNQHTGHIPLTRGNQLQGMPLKPQFLPSQDMQDNFSGSAVPPVLPHLMAPSLSQGYISQ 720

Query: 721  AHRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAG 780
             HRPAISE LSSSAPIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS P+IP+SQK  
Sbjct: 721  GHRPAISEGLSSSAPIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQK-- 780

Query: 781  SLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840
              VPGQQPGTA  GLISSLMA GLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD
Sbjct: 781  --VPGQQPGTAISGLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYAD 840

Query: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900
            LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE
Sbjct: 841  LPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTE 900

Query: 901  AVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960
            AVPGFLPAEV+VEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP
Sbjct: 901  AVPGFLPAEVVVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAP 960

Query: 961  DGQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGIEWSLQLRLIRS 1011
            DGQTAGMD SQLGPIVHAKCRTETNVVPSESFDQDE G  E   + + +RS
Sbjct: 961  DGQTAGMDISQLGPIVHAKCRTETNVVPSESFDQDEGGVSEEGNRRKRLRS 1007

BLAST of Cla97C03G053860 vs. ExPASy TrEMBL
Match: A0A5A7UC46 (Polyadenylation and cleavage factor-like protein 4 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold609G001150 PE=4 SV=1)

HSP 1 Score: 1819.7 bits (4712), Expect = 0.0e+00
Identity = 901/983 (91.66%), Positives = 933/983 (94.91%), Query Frame = 0

Query: 14   RGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRDDEFRVSGHDIVPP 73
            RGNPRNSAYPSDR +PTTSGRTMPNELPQKPPPSIAHRFRAQLKQRDDEFRVSGHD+VPP
Sbjct: 160  RGNPRNSAYPSDRPIPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRDDEFRVSGHDVVPP 219

Query: 74   PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVEQKLP 133
            PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPV+QKLP
Sbjct: 220  PTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICARILEVPVDQKLP 279

Query: 134  SLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR 193
            SLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR
Sbjct: 280  SLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTWATVFPPSIIR 339

Query: 194  KIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHIQDARGASAL 253
            KIEAQLS LTAQESS LTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKH QD+RG SA+
Sbjct: 340  KIEAQLSQLTAQESSGLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDKHTQDSRGTSAI 399

Query: 254  KVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNKANIKLAKSSLSSR 313
            KVHDKKLA GYEEYDYDHAD LEHGG Q FH MGS+GHDSF+LGTNKAN+KLAKSSLSSR
Sbjct: 400  KVHDKKLASGYEEYDYDHADALEHGGAQEFHSMGSMGHDSFSLGTNKANVKLAKSSLSSR 459

Query: 314  IGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQYPDDNLNGLEST- 373
            IGH RPLQS GDELE+VRASPSQNVYDYEGS+++DR EDTNKWRRKQYPDDN+NGLE+T 
Sbjct: 460  IGHHRPLQSLGDELESVRASPSQNVYDYEGSKILDRNEDTNKWRRKQYPDDNMNGLENTS 519

Query: 374  SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKVTPVTWQNTEEEE 433
            SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI+GIDNK TPVTWQNTEEEE
Sbjct: 520  SYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSISGIDNKATPVTWQNTEEEE 579

Query: 434  FDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGMRSSWSSQVQLPTI 493
            FDWEDMSPTLADRGRNNDMLKP V PSRFRTR GFERSNAM IEPGMRS+WSSQVQLP I
Sbjct: 580  FDWEDMSPTLADRGRNNDMLKPTVPPSRFRTRSGFERSNAMPIEPGMRSNWSSQVQLPGI 639

Query: 494  DSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGRGMASSGGEKMFPF 553
            DSS+VIEDVV STPDIW MHNHISQTSQNLMNNKG GRNFQMP+LGRG+ SSGGEKM P+
Sbjct: 640  DSSIVIEDVVHSTPDIWKMHNHISQTSQNLMNNKGPGRNFQMPMLGRGITSSGGEKMSPY 699

Query: 554  ADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSNSCPPSRPPIFPV 613
             DKLLTNDALHRP  IASRLGSSGLDS+MESQSIVQSMGPRHPLNLSNSCPPSRPP+FPV
Sbjct: 700  GDKLLTNDALHRPTNIASRLGSSGLDSNMESQSIVQSMGPRHPLNLSNSCPPSRPPVFPV 759

Query: 614  PRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQVGNQHTGHIPLTR 673
            PRHN SQFESLNGSNS +N ANR+FLPEQQMNN+RNKE SLT+K PQVGNQHTGHIPLTR
Sbjct: 760  PRHNTSQFESLNGSNSFMNSANRTFLPEQQMNNLRNKELSLTTKSPQVGNQHTGHIPLTR 819

Query: 674  GNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQAHRPAISECLSSS 733
            GNQLQ +PLKPQFLPSQDMQ+N S SAVPP LPHL+APSLSQGYISQ HRPA SE LSSS
Sbjct: 820  GNQLQSMPLKPQFLPSQDMQDNFSGSAVPPVLPHLIAPSLSQGYISQGHRPANSEGLSSS 879

Query: 734  APIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAGSLVPGQQPGTAFP 793
            APIGQWNL VHNS SNPLHLQGGPLPPLPPGPHPTS P+IP+SQK    VPGQQPGTA  
Sbjct: 880  APIGQWNLSVHNSSSNPLHLQGGPLPPLPPGPHPTSGPTIPISQK----VPGQQPGTAIS 939

Query: 794  GLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFK 853
            GLISSLMA GLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFK
Sbjct: 940  GLISSLMARGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFK 999

Query: 854  TQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVIVE 913
            TQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEV+VE
Sbjct: 1000 TQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVVVE 1059

Query: 914  KKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLG 973
            KKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLG
Sbjct: 1060 KKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLG 1119

Query: 974  PIVHAKCRTETNVVPSESFDQDE 996
            PIVHAKCRTETNVVPSESFDQDE
Sbjct: 1120 PIVHAKCRTETNVVPSESFDQDE 1138
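
The percentages in each HSP header are the identity and positive counts divided by the alignment length. The sketch below parses such a header line and recomputes both values for the A0A5A7UC46 hit above. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool; the pattern accepts both the "Postives" spelling used on this page and the standard "Positives".

```python
import re

# Hypothetical pattern for the HSP header lines shown on this page.
# "Posi?tives" matches both "Postives" (as printed here) and "Positives".
HSP_HEADER = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP header line."""
    match = HSP_HEADER.search(line)
    if match is None:
        raise ValueError("not an HSP identity line: %r" % line)
    identical, length, _, positive, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        # Percentages are recomputed from the counts, rounded to 2 decimals.
        "identity_pct": round(100.0 * identical / length, 2),
        "positive_pct": round(100.0 * positive / pos_length, 2),
    }


header = "Identity = 901/983 (91.66%), Postives = 933/983 (94.91%), Query Frame = 0"
stats = parse_hsp_stats(header)
print(stats["identity_pct"], stats["positive_pct"])
```

The recomputed values can be compared with the percentages printed in the header as a quick consistency check on a parsed record.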

BLAST of Cla97C03G053860 vs. ExPASy TrEMBL
Match: A0A6J1EZ18 (polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111440036 PE=4 SV=1)

HSP 1 Score: 1711.0 bits (4430), Expect = 0.0e+00
Identity = 866/1011 (85.66%), Positives = 916/1011 (90.60%), Query Frame = 0

Query: 1    MTPFMESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQKPPPSIAHRFRAQLKQRD 60
            M PFMESEKLLISRGNPR  AY SDR LPTT+GR MPNELPQKP PSIAHRFRAQLKQRD
Sbjct: 1    MNPFMESEKLLISRGNPRTLAYTSDRPLPTTTGRAMPNELPQKPSPSIAHRFRAQLKQRD 60

Query: 61   DEFRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLIC 120
            DEFRVSG D+ P PT EDIVQLY+LMLSELTFNSKPIITDLTVLA+EQREHGKGIADLIC
Sbjct: 61   DEFRVSGLDVAPLPTTEDIVQLYELMLSELTFNSKPIITDLTVLAEEQREHGKGIADLIC 120

Query: 121  ARILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180
            +RILEVPV+QKLPSLYLLDSIVKNVGHEYI+YFSSRLPEVFCEAYRQVHPNLHNAMRHLF
Sbjct: 121  SRILEVPVDQKLPSLYLLDSIVKNVGHEYINYFSSRLPEVFCEAYRQVHPNLHNAMRHLF 180

Query: 181  GTWATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVV 240
            GTW+TVFPPSI+RKIEA+LS LT QE+S+LTSSRASESPRPTHGIHVNPKYLRQLEHSV 
Sbjct: 181  GTWSTVFPPSILRKIEARLSQLTTQETSALTSSRASESPRPTHGIHVNPKYLRQLEHSVG 240

Query: 241  DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 300
            DKHI DARGAS LKVHDKKLAPGYEEYDYDHAD LEHGG QAF+ MGS+ HDSF+LGTNK
Sbjct: 241  DKHIPDARGASTLKVHDKKLAPGYEEYDYDHADGLEHGGSQAFNSMGSMSHDSFSLGTNK 300

Query: 301  ANIKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQ 360
            ANIKLAKSSLSSRIGH+RPLQS GDELEAVRASPSQNVYDYEG RMI+R EDTNKWRRKQ
Sbjct: 301  ANIKLAKSSLSSRIGHNRPLQSVGDELEAVRASPSQNVYDYEGFRMINRNEDTNKWRRKQ 360

Query: 361  YPDDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNKV 420
            YPDDNLNGLESTS+NIRNG ALEGPRALIEAYGSDKGKGYLNDNPPQAEHFS+NGIDNK+
Sbjct: 361  YPDDNLNGLESTSFNIRNGCALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSMNGIDNKM 420

Query: 421  TPVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKPPVLPSRFRTRIGFERSNAMSIEPGMR 480
            TPVTWQNTEEEEFDWEDMSPTLADRGR+NDMLKPPV PSRFRTR+GF+RSNAMSIEPGMR
Sbjct: 421  TPVTWQNTEEEEFDWEDMSPTLADRGRSNDMLKPPVPPSRFRTRLGFDRSNAMSIEPGMR 480

Query: 481  SSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLGRG 540
            S+ S Q                    D W+MH+H+SQTSQNLM+ KG G NFQ+PLLGRG
Sbjct: 481  SNSSHQ--------------------DAWSMHSHLSQTSQNLMSTKGTGGNFQIPLLGRG 540

Query: 541  MASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNLSN 600
            +ASSGGEKM PF DKL TNDALHR PT+ASRLGSS LDSSMESQS+VQSMG RHP+NLS+
Sbjct: 541  IASSGGEKMSPFVDKLPTNDALHR-PTVASRLGSSALDSSMESQSVVQSMGQRHPVNLSD 600

Query: 601  SCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLPQV 660
            SCPPSRPP F VP HNKSQFESLNGSN+ INRANRSFLPEQQMNN+RNKE S T+K PQV
Sbjct: 601  SCPPSRPP-FHVPGHNKSQFESLNGSNAFINRANRSFLPEQQMNNVRNKELSHTTKSPQV 660

Query: 661  GNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYISQA 720
            GNQH G I LT+GNQLQ IPLKPQFLPSQDM ++ SASAVPP LPHLMAPSLSQGY SQ 
Sbjct: 661  GNQHGGRILLTQGNQLQTIPLKPQFLPSQDMHDSFSASAVPPVLPHLMAPSLSQGYSSQG 720

Query: 721  HRPAISECLSSSAPIGQWNLPVHNSPSNPLHLQGGPLPPLPPGPHPTSVPSIPLSQKAGS 780
             RP ISECLSSS PIGQWNLPVHNSPSNPLHLQ GPLPPLP GPHPT      +SQ AGS
Sbjct: 721  LRPGISECLSSSVPIGQWNLPVHNSPSNPLHLQ-GPLPPLPAGPHPT------ISQNAGS 780

Query: 781  LVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALYADL 840
            LVPGQQPGTAF GLISSLMA GLISLNN+ASVQDSVG+EFNPDVLKVRH+SAITALYADL
Sbjct: 781  LVPGQQPGTAFSGLISSLMAQGLISLNNKASVQDSVGVEFNPDVLKVRHDSAITALYADL 840

Query: 841  PRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEA 900
            PRQCMTCGLRFKTQEEHSNHMDWHVT+NRMSKSRKQKPSRKWFVS SMWLSGAEALGTEA
Sbjct: 841  PRQCMTCGLRFKTQEEHSNHMDWHVTRNRMSKSRKQKPSRKWFVSTSMWLSGAEALGTEA 900

Query: 901  VPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPD 960
            VPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPF+DFYSDETEEWMYRGAVYMNAPD
Sbjct: 901  VPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFDDFYSDETEEWMYRGAVYMNAPD 960

Query: 961  GQTAGMDRSQLGPIVHAKCRTETNVVPSESFDQDEQGGI-EWSLQLRLIRS 1011
            GQTAGMDRSQLGPIVHAKCRTE+NVVPSESFDQDEQ G+ E   Q + +RS
Sbjct: 961  GQTAGMDRSQLGPIVHAKCRTESNVVPSESFDQDEQRGVSEEGSQRKRLRS 982

BLAST of Cla97C03G053860 vs. TAIR 10
Match: AT4G04885.1 (PCF11P-similar protein 4 )

HSP 1 Score: 641.3 bits (1653), Expect = 1.4e-183
Identity = 439/987 (44.48%), Positives = 555/987 (56.23%), Query Frame = 0

Query: 5   MESEKLLISRGNPRNSAYPSDRQLPTTSGRTMPNELPQK--PPPSIAHRFRAQLKQRDDE 64
           M+SEK+L    NPR  +      + +TS + M  ELPQK  PPPS+  RF+A L QR+DE
Sbjct: 1   MDSEKIL----NPRLVS------INSTSRKGMSVELPQKPPPPPSLLDRFKALLNQREDE 60

Query: 65  FRVSGHDIVPPPTAEDIVQLYDLMLSELTFNSKPIITDLTVLADEQREHGKGIADLICAR 124
           F   G + V PP+ ++IVQLY+++L ELTFNSKPIITDLT++A EQREHG+GIA+ IC R
Sbjct: 61  F--GGGEEVLPPSMDEIVQLYEVVLGELTFNSKPIITDLTIIAGEQREHGEGIANAICTR 120

Query: 125 ILEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGT 184
           ILE PVEQKLPSLYLLDSIVKN+G +Y  YFSSRLPEVFC AYRQ HP+LH +MRHLFGT
Sbjct: 121 ILEAPVEQKLPSLYLLDSIVKNIGRDYGRYFSSRLPEVFCLAYRQAHPSLHPSMRHLFGT 180

Query: 185 WATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESPRPTHGIHVNPKYLRQLEHSVVDK 244
           W++VFPP ++RKI+ QL L +A   SS+    ASE  +PT GIHVNPKYLR+LE S  + 
Sbjct: 181 WSSVFPPPVLRKIDMQLQLSSAANQSSV---GASEPSQPTRGIHVNPKYLRRLEPSAAEN 240

Query: 245 HIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNKAN 304
           +++     S+ +V+ +    GY +++    D LE     +  P      D F   +N   
Sbjct: 241 NLRGIN--SSARVYGQNSLGGYNDFE----DQLESPSSLSSTP------DGFTRRSNDG- 300

Query: 305 IKLAKSSLSSRIGHSRPLQSAGDELEAVRASPSQNVYDYEGSRMIDRIEDTNKWRRKQYP 364
                                        A+PS   ++Y   R   R ++  +WRRK+  
Sbjct: 301 -----------------------------ANPSNQAFNYGMGRATSRDDEHMEWRRKE-- 360

Query: 365 DDNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSINGIDNK-VT 424
                       N+  G+  E PRALI+AYG D  K    + P +     +NG+ +K VT
Sbjct: 361 ------------NLGQGNDHERPRALIDAYGVDTSKHVTINKPIR----DMNGMHSKMVT 420

Query: 425 PVTWQNTEEEEFDWEDMSPTLADRGRNNDMLKP--PVLPS-RFRTRIGFERSNAMSIEPG 484
           P  WQNTEEEEFDWEDMSPTL DR R  + L+   P L S R R R+G   ++   ++  
Sbjct: 421 P--WQNTEEEEFDWEDMSPTL-DRSRAGEFLRSSVPALGSVRARPRVG--NTSDFHLDSD 480

Query: 485 MRSSWSSQVQLPTIDSSMVIEDVVQSTPDIWNMHNHISQTSQNLMNNKGAGRNFQMPLLG 544
           +++  S Q++                  + W++  +   TS  +  +  AG++ ++    
Sbjct: 481 IKNGVSHQLR------------------ENWSLSQNYPHTSNRV--DTRAGKDLKVLASS 540

Query: 545 RGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSMESQSIVQSMGPRHPLNL 604
            G+ SS  E   P  D +           + SR G +  D +    S   + GP      
Sbjct: 541 VGLVSSNSEFGAPPFDSI---------QDVNSRFGRALPDGTWPHLS---ARGP------ 600

Query: 605 SNSCPPSRPPIFPVPRHNKSQFESLNGSNSLINRANRSFLPEQQMNNMRNKEPSLTSKLP 664
            NS         PVP            S  L + AN    P   M+N    +P L     
Sbjct: 601 -NS--------LPVP------------SAHLHHLAN----PGNAMSNRLQGKP-LYRPEN 660

Query: 665 QVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASAVPPALPHLMAPSLSQGYIS 724
           QV   H     +T+ NQ     +   +LP        S+SA+ P     +   +S GY  
Sbjct: 661 QVSQSHLN--DMTQQNQ-----MLVNYLP--------SSSAMAPRPMQSLLTHVSHGY-- 720

Query: 725 QAHRPAISECLSSSAPIGQWNLPVHNSPSNP-LHLQGGPLPPLPPGPHPTSVPSIPLSQK 784
                                 P H S   P L +QGG         HP S  S  LSQ 
Sbjct: 721 ----------------------PPHGSTIRPSLSIQGG------EAMHPLS--SGVLSQI 780

Query: 785 AGSLVPGQQPGTAFPGLISSLMAHGLISLNNQASVQDSVGLEFNPDVLKVRHESAITALY 844
             S    Q PG AF GLI SLMA GLISLNNQ + Q  +GLEF+ D+LK+R+ESAI+ALY
Sbjct: 781 GAS---NQPPGGAFSGLIGSLMAQGLISLNNQPAGQGPLGLEFDADMLKIRNESAISALY 793

Query: 845 ADLPRQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALG 904
            DLPRQC TCGLRFK QEEHS HMDWHVTKNRMSK+ KQ PSRKWFVS SMWLSGAEALG
Sbjct: 841 GDLPRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHKQNPSRKWFVSASMWLSGAEALG 793

Query: 905 TEAVPGFLPAEVIVEKKDDEELAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMN 964
            EAVPGFLP E   EKKDDE++AVPADEDQ +CALCGEPFEDFYSDETEEWMY+GAVYMN
Sbjct: 901 AEAVPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGEPFEDFYSDETEEWMYKGAVYMN 793

Query: 965 APDGQTAGMDRSQLGPIVHAKCRTETN 985
           AP+  T  MD+SQLGPIVHAKCR E+N
Sbjct: 961 APEESTTDMDKSQLGPIVHAKCRPESN 793

BLAST of Cla97C03G053860 vs. TAIR 10
Match: AT2G36480.1 (ENTH/VHS family protein )

HSP 1 Score: 206.8 bits (525), Expect = 8.6e-53
Identity = 241/902 (26.72%), Positives = 369/902 (40.91%), Query Frame = 0

Query: 124 LEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTW 183
           ++VP +QKLP+LYLLDSIVKN+G +YI YF +RLPEVF +AYRQV P +H+ MRHLFGTW
Sbjct: 1   MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 184 ATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESP---RPTHGIHVNPKYLRQLEHSVV 243
             VF P  ++ IE +L      + S+   S A   P   RP H IHVNPKYL +      
Sbjct: 61  KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLER------ 120

Query: 244 DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 303
            + +Q +     +     + AP         +D LE         + SI      +G  K
Sbjct: 121 -QRLQQSGRTKGMVTDVPETAPNLTR----DSDRLER--------VSSIASGGSWVGPAK 180

Query: 304 A-NIKLAKSSLSSRIGHSRPLQSAGDELEAVRASP--SQNVYDYEGSRMI-DRIEDTNKW 363
             NI+  +  L S   + + ++S   E +     P  S++V    GSR+  D  E     
Sbjct: 181 VNNIRRPQRDLLSEPLYEKDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCEKQWYG 240

Query: 364 RRKQYPD---DNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI 423
              + PD   D  +GL S S   R  +        +E+ G  +  G   D          
Sbjct: 241 ATNRDPDLISDQRDGLHSKS---RTSNYATARVENLESSGPSRNIGVPYD---------- 300

Query: 424 NGIDNKVTPVTWQNTEEEEFDWEDMSPTLADR-----GRNNDMLKPPVLPSRFRTRIGFE 483
                     +W+N+EEEEF W DM   L++         N++  P             +
Sbjct: 301 ----------SWKNSEEEEFMW-DMHSRLSETDVATINPKNELHAPDESERLESENHLLK 360

Query: 484 RSNAMSIEP-----GMRSSWSSQVQLPTIDSSMVIEDV-VQSTPDIWNMHNHISQTSQNL 543
           R    +++P        +S+SS+ + P+             ST     +       S  +
Sbjct: 361 RPRFSALDPRFDPANSTNSYSSEQKDPSSIGHWAFSSTNATSTATRKGIQPQPRVASSGI 420

Query: 544 MNNKGAGRNFQMPLLGRGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSME 603
           + + G+G + Q PL       S  ++     D    +    R P  ASR  +        
Sbjct: 421 LPSSGSGSDRQSPL-----HDSTSKQNVTKQDVRRAHSLPQRDPR-ASRFPA-------- 480

Query: 604 SQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNKSQFESLNG---SNSLINRANRSFLP 663
            Q++ +    R P + S     +   +      +KS  E+  G   ++    + N S L 
Sbjct: 481 KQNVPRDDSVRLPSSSSQFKNTNMRELPVEIFDSKSAAENAPGLTLASEATGQPNMSDLL 540

Query: 664 EQQMNNMRNKEPSLTSKLPQVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASA 723
           E  M     K   L++       +   H  +  G    P   KP+ LP     +NL    
Sbjct: 541 EAVM-----KSGILSNNSTCGAIKEESHDEVNPGALTLPAASKPKTLPISLATDNL---- 600

Query: 724 VPPALPHLMAPSLSQGYISQAHRPAISECLSSSAPIGQWNLPVHNSPSNPLH-------- 783
                       L++  + Q+  P +S   S +           +  S+PL         
Sbjct: 601 ------------LARLKVEQSSAPLVSCAASLTGITSVQTSKEKSKASDPLSCLLSSLVS 660

Query: 784 --LQGGPLPPLPPGPHPTSVPSIPLSQKAG---SLVPGQ-QPGTAFPGLISSLMAHGLI- 843
             L       LP  P  T   S   S  +    S+VP   QP     G  ++    GL  
Sbjct: 661 KGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQPSVLVKGPSTAPKVKGLAA 720

Query: 844 -SLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDW 903
            S  +++  +D +GL+F  D ++  H S I++L+ DLP  C +C +R K +EE   HM+ 
Sbjct: 721 PSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEELDRHMEL 780

Query: 904 HVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVIVEKKDDEELAVPA 963
           H  K ++  S      R WF  +  W++   A   E  P +       E   ++  AV A
Sbjct: 781 H-DKKKLELSGTNSKCRVWFPKVDNWIA---AKAGELEPEYEEVLSEPESAIEDCQAVAA 815

Query: 964 DEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLGPIVHAKCRTET 986
           DE Q  C LCGE FED++S E  +WM++GA Y+  P   +        GPIVH  C T +
Sbjct: 841 DETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSEAS-----GPIVHTGCLTTS 815

BLAST of Cla97C03G053860 vs. TAIR 10
Match: AT2G36480.2 (ENTH/VHS family protein )

HSP 1 Score: 206.8 bits (525), Expect = 8.6e-53
Identity = 241/902 (26.72%), Positives = 369/902 (40.91%), Query Frame = 0

Query: 124 LEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTW 183
           ++VP +QKLP+LYLLDSIVKN+G +YI YF +RLPEVF +AYRQV P +H+ MRHLFGTW
Sbjct: 1   MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 184 ATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESP---RPTHGIHVNPKYLRQLEHSVV 243
             VF P  ++ IE +L      + S+   S A   P   RP H IHVNPKYL +      
Sbjct: 61  KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLER------ 120

Query: 244 DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 303
            + +Q +     +     + AP         +D LE         + SI      +G  K
Sbjct: 121 -QRLQQSGRTKGMVTDVPETAPNLTR----DSDRLER--------VSSIASGGSWVGPAK 180

Query: 304 A-NIKLAKSSLSSRIGHSRPLQSAGDELEAVRASP--SQNVYDYEGSRMI-DRIEDTNKW 363
             NI+  +  L S   + + ++S   E +     P  S++V    GSR+  D  E     
Sbjct: 181 VNNIRRPQRDLLSEPLYEKDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCEKQWYG 240

Query: 364 RRKQYPD---DNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI 423
              + PD   D  +GL S S   R  +        +E+ G  +  G   D          
Sbjct: 241 ATNRDPDLISDQRDGLHSKS---RTSNYATARVENLESSGPSRNIGVPYD---------- 300

Query: 424 NGIDNKVTPVTWQNTEEEEFDWEDMSPTLADR-----GRNNDMLKPPVLPSRFRTRIGFE 483
                     +W+N+EEEEF W DM   L++         N++  P             +
Sbjct: 301 ----------SWKNSEEEEFMW-DMHSRLSETDVATINPKNELHAPDESERLESENHLLK 360

Query: 484 RSNAMSIEP-----GMRSSWSSQVQLPTIDSSMVIEDV-VQSTPDIWNMHNHISQTSQNL 543
           R    +++P        +S+SS+ + P+             ST     +       S  +
Sbjct: 361 RPRFSALDPRFDPANSTNSYSSEQKDPSSIGHWAFSSTNATSTATRKGIQPQPRVASSGI 420

Query: 544 MNNKGAGRNFQMPLLGRGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSME 603
           + + G+G + Q PL       S  ++     D    +    R P  ASR  +        
Sbjct: 421 LPSSGSGSDRQSPL-----HDSTSKQNVTKQDVRRAHSLPQRDPR-ASRFPA-------- 480

Query: 604 SQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNKSQFESLNG---SNSLINRANRSFLP 663
            Q++ +    R P + S     +   +      +KS  E+  G   ++    + N S L 
Sbjct: 481 KQNVPRDDSVRLPSSSSQFKNTNMRELPVEIFDSKSAAENAPGLTLASEATGQPNMSDLL 540

Query: 664 EQQMNNMRNKEPSLTSKLPQVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASA 723
           E  M     K   L++       +   H  +  G    P   KP+ LP     +NL    
Sbjct: 541 EAVM-----KSGILSNNSTCGAIKEESHDEVNPGALTLPAASKPKTLPISLATDNL---- 600

Query: 724 VPPALPHLMAPSLSQGYISQAHRPAISECLSSSAPIGQWNLPVHNSPSNPLH-------- 783
                       L++  + Q+  P +S   S +           +  S+PL         
Sbjct: 601 ------------LARLKVEQSSAPLVSCAASLTGITSVQTSKEKSKASDPLSCLLSSLVS 660

Query: 784 --LQGGPLPPLPPGPHPTSVPSIPLSQKAG---SLVPGQ-QPGTAFPGLISSLMAHGLI- 843
             L       LP  P  T   S   S  +    S+VP   QP     G  ++    GL  
Sbjct: 661 KGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQPSVLVKGPSTAPKVKGLAA 720

Query: 844 -SLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDW 903
            S  +++  +D +GL+F  D ++  H S I++L+ DLP  C +C +R K +EE   HM+ 
Sbjct: 721 PSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEELDRHMEL 780

Query: 904 HVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVIVEKKDDEELAVPA 963
           H  K ++  S      R WF  +  W++   A   E  P +       E   ++  AV A
Sbjct: 781 H-DKKKLELSGTNSKCRVWFPKVDNWIA---AKAGELEPEYEEVLSEPESAIEDCQAVAA 815

Query: 964 DEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLGPIVHAKCRTET 986
           DE Q  C LCGE FED++S E  +WM++GA Y+  P   +        GPIVH  C T +
Sbjct: 841 DETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSEAS-----GPIVHTGCLTTS 815

BLAST of Cla97C03G053860 vs. TAIR 10
Match: AT2G36480.3 (ENTH/VHS family protein )

HSP 1 Score: 206.8 bits (525), Expect = 8.6e-53
Identity = 241/902 (26.72%), Positives = 369/902 (40.91%), Query Frame = 0

Query: 124 LEVPVEQKLPSLYLLDSIVKNVGHEYISYFSSRLPEVFCEAYRQVHPNLHNAMRHLFGTW 183
           ++VP +QKLP+LYLLDSIVKN+G +YI YF +RLPEVF +AYRQV P +H+ MRHLFGTW
Sbjct: 1   MQVPSDQKLPTLYLLDSIVKNIGRDYIKYFGARLPEVFVKAYRQVDPPMHSNMRHLFGTW 60

Query: 184 ATVFPPSIIRKIEAQLSLLTAQESSSLTSSRASESP---RPTHGIHVNPKYLRQLEHSVV 243
             VF P  ++ IE +L      + S+   S A   P   RP H IHVNPKYL +      
Sbjct: 61  KGVFHPQTLQLIEKELGFNAKSDGSAAVVSTARAEPQSQRPPHSIHVNPKYLER------ 120

Query: 244 DKHIQDARGASALKVHDKKLAPGYEEYDYDHADVLEHGGGQAFHPMGSIGHDSFALGTNK 303
            + +Q +     +     + AP         +D LE         + SI      +G  K
Sbjct: 121 -QRLQQSGRTKGMVTDVPETAPNLTR----DSDRLER--------VSSIASGGSWVGPAK 180

Query: 304 A-NIKLAKSSLSSRIGHSRPLQSAGDELEAVRASP--SQNVYDYEGSRMI-DRIEDTNKW 363
             NI+  +  L S   + + ++S   E +     P  S++V    GSR+  D  E     
Sbjct: 181 VNNIRRPQRDLLSEPLYEKDIESIAGEYDYASDLPHNSRSVIKNVGSRITDDGCEKQWYG 240

Query: 364 RRKQYPD---DNLNGLESTSYNIRNGHALEGPRALIEAYGSDKGKGYLNDNPPQAEHFSI 423
              + PD   D  +GL S S   R  +        +E+ G  +  G   D          
Sbjct: 241 ATNRDPDLISDQRDGLHSKS---RTSNYATARVENLESSGPSRNIGVPYD---------- 300

Query: 424 NGIDNKVTPVTWQNTEEEEFDWEDMSPTLADR-----GRNNDMLKPPVLPSRFRTRIGFE 483
                     +W+N+EEEEF W DM   L++         N++  P             +
Sbjct: 301 ----------SWKNSEEEEFMW-DMHSRLSETDVATINPKNELHAPDESERLESENHLLK 360

Query: 484 RSNAMSIEP-----GMRSSWSSQVQLPTIDSSMVIEDV-VQSTPDIWNMHNHISQTSQNL 543
           R    +++P        +S+SS+ + P+             ST     +       S  +
Sbjct: 361 RPRFSALDPRFDPANSTNSYSSEQKDPSSIGHWAFSSTNATSTATRKGIQPQPRVASSGI 420

Query: 544 MNNKGAGRNFQMPLLGRGMASSGGEKMFPFADKLLTNDALHRPPTIASRLGSSGLDSSME 603
           + + G+G + Q PL       S  ++     D    +    R P  ASR  +        
Sbjct: 421 LPSSGSGSDRQSPL-----HDSTSKQNVTKQDVRRAHSLPQRDPR-ASRFPA-------- 480

Query: 604 SQSIVQSMGPRHPLNLSNSCPPSRPPIFPVPRHNKSQFESLNG---SNSLINRANRSFLP 663
            Q++ +    R P + S     +   +      +KS  E+  G   ++    + N S L 
Sbjct: 481 KQNVPRDDSVRLPSSSSQFKNTNMRELPVEIFDSKSAAENAPGLTLASEATGQPNMSDLL 540

Query: 664 EQQMNNMRNKEPSLTSKLPQVGNQHTGHIPLTRGNQLQPIPLKPQFLPSQDMQENLSASA 723
           E  M     K   L++       +   H  +  G    P   KP+ LP     +NL    
Sbjct: 541 EAVM-----KSGILSNNSTCGAIKEESHDEVNPGALTLPAASKPKTLPISLATDNL---- 600

Query: 724 VPPALPHLMAPSLSQGYISQAHRPAISECLSSSAPIGQWNLPVHNSPSNPLH-------- 783
                       L++  + Q+  P +S   S +           +  S+PL         
Sbjct: 601 ------------LARLKVEQSSAPLVSCAASLTGITSVQTSKEKSKASDPLSCLLSSLVS 660

Query: 784 --LQGGPLPPLPPGPHPTSVPSIPLSQKAG---SLVPGQ-QPGTAFPGLISSLMAHGLI- 843
             L       LP  P  T   S   S  +    S+VP   QP     G  ++    GL  
Sbjct: 661 KGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQPSVLVKGPSTAPKVKGLAA 720

Query: 844 -SLNNQASVQDSVGLEFNPDVLKVRHESAITALYADLPRQCMTCGLRFKTQEEHSNHMDW 903
            S  +++  +D +GL+F  D ++  H S I++L+ DLP  C +C +R K +EE   HM+ 
Sbjct: 721 PSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLCTSCSVRLKQKEELDRHMEL 780

Query: 904 HVTKNRMSKSRKQKPSRKWFVSISMWLSGAEALGTEAVPGFLPAEVIVEKKDDEELAVPA 963
           H  K ++  S      R WF  +  W++   A   E  P +       E   ++  AV A
Sbjct: 781 H-DKKKLELSGTNSKCRVWFPKVDNWIA---AKAGELEPEYEEVLSEPESAIEDCQAVAA 815

Query: 964 DEDQKTCALCGEPFEDFYSDETEEWMYRGAVYMNAPDGQTAGMDRSQLGPIVHAKCRTET 986
           DE Q  C LCGE FED++S E  +WM++GA Y+  P   +        GPIVH  C T +
Sbjct: 841 DETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSEAS-----GPIVHTGCLTTS 815

BLAST of Cla97C03G053860 vs. TAIR 10
Match: AT1G66500.1 (Pre-mRNA cleavage complex II )

HSP 1 Score: 166.0 bits (419), Expect = 1.7e-40
Identity = 101/209 (48.33%), Positives = 123/209 (58.85%), Query Frame = 0

Query: 792 PGLISSLMAHGLISLNN-------QASVQDS--VGLEF-NPDVLKVRHESAITALYADLP 851
           P ++S  +   L  LNN       +AS  DS  VGL F NP  L VRHES I +LY+D+P
Sbjct: 194 PIVLSKELTDLLSLLNNEKEKKTLEASNSDSLPVGLSFDNPSSLNVRHESVIKSLYSDMP 253

Query: 852 RQCMTCGLRFKTQEEHSNHMDWHVTKNRMSKS-----RKQKPSRKWFVSISMWLSGAEAL 911
           RQC +CGLRFK QEEHS HMDWHV KNR  K+     ++ K SR W  S S+WL  A   
Sbjct: 254 RQCSSCGLRFKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAATGG 313

Query: 912 GTEAVPGFLPAEVIVEKKDDEE---LAVPADEDQKTCALCGEPFEDFYSDETEEWMYRGA 971
            T  V  F   E+  +K  DEE   L VPADEDQK CALC EPFE+F+S E ++WMY+ A
Sbjct: 314 ETVEVASF-GGEMQKKKGKDEEPKQLMVPADEDQKNCALCVEPFEEFFSHEDDDWMYKDA 373

Query: 972 VYMNAPDGQTAGMDRSQLGPIVHAKCRTE 983
           VY+            ++ G IVH KC  E
Sbjct: 374 VYL------------TKNGRIVHVKCMPE 389

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038894060.1  0.0e+00   94.79     polyadenylation and cleavage factor homolog 4 isoform X3 [Benincasa hispida]
XP_038894058.1  0.0e+00   94.88     polyadenylation and cleavage factor homolog 4 isoform X1 [Benincasa hispida]
XP_038894059.1  0.0e+00   94.88     polyadenylation and cleavage factor homolog 4 isoform X2 [Benincasa hispida]
XP_008462986.1  0.0e+00   90.70     PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X2 [Cucumis mel...
XP_008462960.1  0.0e+00   90.26     PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Cucumis mel...

Match Name      E-value   Identity  Description
Q0WPF2          1.9e-182  44.48     Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN...
Q9C710          2.4e-39   48.33     Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN...
Q9FIX8          1.2e-38   45.93     Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN...
O94913          6.1e-19   36.08     Pre-mRNA cleavage complex 2 protein Pcf11 OS=Homo sapiens OX=9606 GN=PCF11 PE=1 ...
Q10237          2.6e-14   37.11     Uncharacterized protein C4G9.04c OS=Schizosaccharomyces pombe (strain 972 / ATCC...

Match Name      E-value   Identity  Description
A0A1S3CI66      0.0e+00   90.70     polyadenylation and cleavage factor homolog 4 isoform X2 OS=Cucumis melo OX=3656...
A0A1S3CJP9      0.0e+00   90.26     polyadenylation and cleavage factor homolog 4 isoform X1 OS=Cucumis melo OX=3656...
A0A0A0LVG0      0.0e+00   90.70     CID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G109350 PE=4 SV...
A0A5A7UC46      0.0e+00   91.66     Polyadenylation and cleavage factor-like protein 4 isoform X2 OS=Cucumis melo va...
A0A6J1EZ18      0.0e+00   85.66     polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucurbita mosch...

Match Name      E-value   Identity  Description
AT4G04885.1     1.4e-183  44.48     PCF11P-similar protein 4
AT2G36480.1     8.6e-53   26.72     ENTH/VHS family protein
AT2G36480.2     8.6e-53   26.72     ENTH/VHS family protein
AT2G36480.3     8.6e-53   26.72     ENTH/VHS family protein
AT1G66500.1     1.7e-40   48.33     Pre-mRNA cleavage complex II
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description        Source       Source Term     Source Description                             Alignment
IPR006569  CID domain             SMART        SM00582         558neu5                                        coord: 78..200; e-value: 1.8E-41; score: 153.8
IPR006569  CID domain             PFAM         PF04818         CID                                            coord: 87..194; e-value: 2.2E-11; score: 44.0
IPR006569  CID domain             PROSITE      PS51391         CID                                            coord: 75..203; score: 37.097122
IPR008942  ENTH/VHS               GENE3D       1.25.40.90      -                                              coord: 73..200; e-value: 8.4E-42; score: 144.3
IPR008942  ENTH/VHS               SUPERFAMILY  48464           ENTH/VHS domain                                coord: 75..200
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 752..769
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 1..47
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 746..769
None       No IPR available       MOBIDB_LITE  mobidb-lite     disorder_prediction                            coord: 13..38
None       No IPR available       PANTHER      PTHR15921:SF12  POLYADENYLATION AND CLEAVAGE FACTOR HOMOLOG 4  coord: 42..985
None       No IPR available       CDD          cd16982         CID_Pcf11                                      coord: 80..199; e-value: 3.9776E-54; score: 182.38
IPR045154  Protein PCF11-like     PANTHER      PTHR15921       PRE-MRNA CLEAVAGE COMPLEX II                   coord: 42..985
IPR013087  Zinc finger C2H2-type  PROSITE      PS00028         ZINC_FINGER_C2H2_1                             coord: 844..864
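
Several sources report the CID domain at slightly different coordinates (SMART 78..200, PFAM 87..194, PROSITE 75..203, CDD 80..199). A minimal sketch of how such overlapping hits could be collapsed into non-redundant regions follows; the helper `merge_intervals` is a hypothetical name and the coordinates are copied from the table above, with the C2H2 zinc finger hit added to show a separate region.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# Coordinates taken from the InterPro annotation table for this gene.
hits = [
    (78, 200),   # SMART SM00582
    (87, 194),   # PFAM PF04818
    (75, 203),   # PROSITE PS51391
    (80, 199),   # CDD cd16982
    (844, 864),  # PROSITE PS00028 (zinc finger C2H2)
]

for start, end in merge_intervals(hits):
    print("%d..%d (%d residues)" % (start, end, end - start + 1))
```

Merging on coordinates alone ignores which signature produced each hit, so it is only a rough summary of the annotated regions.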

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C03G053860.2  Cla97C03G053860.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006379      mRNA cleavage
biological_process  GO:0006378      mRNA polyadenylation
biological_process  GO:0009911      positive regulation of flower development
biological_process  GO:0006369      termination of RNA polymerase II transcription
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0005849      mRNA cleavage factor complex
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0000993      RNA polymerase II complex binding