HG10011827 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10011827
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCleavage and polyadenylation specificity factor subunit 1
LocationChr01: 13228796 .. 13262854 (-)
RNA-Seq ExpressionHG10011827
SyntenyHG10011827
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTTGGGACTTTAAGAACATTAATATGGATAAAGAAGATTCAGAACGAGACTACTACTGTAGGTGGTTCTGCAAGGTTGAGGCTCTTCAACCTCAGCCAAAGGGAAGACAACCATGCCAAGACGAAACAGAACAAATGATTGAAGCCCTGGAAAATGGAATAGGCGAATCAGATCCCCATGACTCAAAATCAAAAGAACCAGGAGTTTTTAGTTTACAAGTTGAAGCAAAGAACAGAAAGAAAGATGCTGGAAGTAAGGCCGAGGTCAAGGTTAGGGTCTACGCCAAGGTTGAGGTCGGGGTTGAGGTCGAGGTTGAGGCCAGGGTCGAGACCGAGGCTGAGGTTGGGGCCGAAGAGATGGACAATAGAAGGAATGGGCAAATCAGGTCCCCGTTATCTGAAACTAAAATAACCAAGAGTTTTAAAGCTACAAGTTGAAACAATAGACAGAAAAGAAGATGTCGGAACCAAGGCTGAGGCTGAGGCCGAGCCTCCAACAATAAGCTTGCCAGAGACGAACTTGCACAGTTGGATCGAGGAAAGATGAGGTAAAGCAAAAACAGAGTAAGAGACCAACTAAGGGAACATGACACAAGGCAAAAAGGCGAAGGATAGAGGAACAAGATCCGCAATGGTTCAGCAACGGTGCTCAGCCCCAAGGCTGAGGCCGGACCTTGTTAAACACATAATGAGGCATGCTTTATGCATGCCTCGACCCAAAGTAACATTCTAGGCTTGAGATATGGCCTAATGGTGACGCTCGACCTCAGCCAAAGGCTGAGGCCGACCTTAGCCAAATACGAAGATGGACACGCTCCTTTGCCCAGAAGGTTATCTACTAATTCTAAAGCAACTTGGAAACCCTAGAGAGTACCCTATATCAACTTTCTCTATAAATACGTCATAATGGCCTCATGCAAAGACCATACCCAGGAGATAAACAACAATATTGCTTCTTTCATCAAAAGTAAAAGATGCACAAAACGACTACATGGGTAAAACATTGGAAGAATAATAGCATAAGAAAGGAAGTAAGAGAAGATCAATAAAGCTTCCCAGTTGTCTTGATTGGCAGCATCGTGGTCAGCCTCAACCCAATTATCAACGGGCTGAAAGGCTTAAGGATTATGGCACAGAGTTCCAAAATCCCAAGTTAGATAGTCGAAGTTCATCTCAAGGTATTCAATGTTAGAGAAGAGAGCATCAACCTTGTCAAGTTTGACTTTAAAGAAGGAGATGTTGGCTCGTGGTGACTCAAATTCAGCTTGGGAATAAAATAAAAGTCACTTGAGACGGGCAATATTGGCCTGGGCTTTAGTTAACTCTTCGCATAGGACGTAGGATTCCGTCAGATAGATGATACAGTTAGCCATGCTTATTGTTGGCATAGCAATTCAGGAGAAGGACTAGGACAAACCAAAGAAGCAGAAGGTTACTTAGGTGGTAAAAAAAAAAAATACCGGCCGACCATGCCCGCGTGGACCAAGGCCAACCGTACCGAAAACAGGAGGTAATTTTATCTTGGAGCCAAGGAGAAAGACGACGACTCATTGGAGGATAGAATCAGATCGTTGCCAAATGCTTAGCAAGTGCATGCTTTCCGCATGTTCTGGTGCAAAAAATCATGTTAGAACTGGAATATAGTCAAAAGAAGATACTCGACCTCAGCCAAGGCCAAGGTCAACCCTAGTCAAATGCCTAACAGGGCGTGCTTTACGCAATCCACACCCAAGAAATCATTTTAAGCCTAAAATATGGCCAAAGAAGGACGCTTGGCCTTGGCTAAGGTCACAGCCGACCCAAGCCAAACGTGAAAGTTTTAAATCTTTTCCATAACTCATTTTAAGTCATTTAAAATTTTCTTTAAATTACACTTTTCGTGTTTTTGAAAAATTAAAAATTAATATTTTATTTCAAAAAGTTTCCTAATCATTTTTTTTTCTAATAACTTGTTTCATGATCAACCAAGTGAATGGGTGGATTATGAATCCCTATGCTAGATCTTTAAATTAAATTAAGTTCATTTCAATTTTCGCTTTACTTTTATGCACACTAAATTCAAATCTTTACATTTCTTGCAAACACAACACACAAATCCCCTCGTGGCAACCCTTGAATCCAAGAAAAAGCTAAAATCATCCTAAGTTCCCTGAGGAGACTACTCGATCTTGCCACTTGTACTATGCCTTTGTAGTATAGATAGTATCGGTCGGACTTTTATAAATTTATTTAATTGGTGGGAAGAGGAATTGTGACGCCCTCTCTCCAGTCATATCGCTGATAAACTTAATAATGATGCACTCTTTTATAATGGTGATACACTCGACTAATGTATTACTGATACACTTGATTTATATTCTGTTGATAAACTTGAAATGATACACTTGACTGAGTTGATAAACTTTATAATGATACACTTAGGACATGTTTGAGAGTGATTTAAAATGGTTAAAATCACTTTTGTCATCTTCTTCTTTGCAGCAACATACTCCTCCCCACACACATCTCTCTTTCACCAACCAACACTCCTTCACACCATCTGCCGTCGCCCACTGCTCTCACCTCCCCGTCGCTTACATTCGTCAACCGCCGCACGTCGGGTTGCCGATCACCATTTTTCTCTCTCTTTATCTTTCTTTCCCCCAACATCTCTTTCACCAACTACCACAGTGGCGGATCCAGAACTTCAAGTCACGGGCACAAATGTGTATATAAAGTTAAAAGTTTAACCCATAAACTTCTAGATTTTTGTTTATATATAATTCCTAAACTTTAAAAAATGTTTAATAGATCCTTGAATTCTCGATTTTTGTATTTAATAGATTAATAAAATATCAATTATGCATTCAATAGGTCTGTTATATATTTGAAAATATTAAAAATTAAACGTTATATTAGACATTAAATTTAAATTTTTGTTTAACAAGTACTATTAGACATCTTTTTAAAATTAATTTATCTATTAAGTATAAAATTTAAAACTTAAAGTTTAATTAGACACAAACTATTTCTTTTCTCTTATGAGATTTATTAGATATAAAATTGAAGTTTATAGATTTATTAGACATTTTAAGAGCTAGAGTCAAAATTGAAGTAATATTAATTATAAAATTATTATCAATTTTATAAAATTATTCCCTTAAATTAAGCAAAAATTATTCAATCAATTTTTATAAAATAATTTTATAAAGATTAAAAAATATTTGTATATTTAAATATATATGCCAGAAAATTAAAATAGAATTAAGTTTTTCGAACATGTTTTGAGTGTATTGAAAATCAAAATATTATATATTTCGTGAATAATACATGTATAGAGAAATATTTAATATGTTGTTCAAAATTAAAATAAAAAGTGAACTTCTCAAGATTTGATCCCACGACCTGATGCCAAAAAGATGAAAATAAACACTTTACCACACCACTAGGCCCAATCTTACTCATTGAAATTAGAAAAACGAAAATTATATAAAATATAATTAAGACATTTTAATATCCAAGGAAAACCAATATACAAATTTTAATTTGAGGGGTACACCCCTTCATAGATCCGCCTTTGACCATTGTTCTACATCATCTGCTGTCAGGTCGCCATAGTTCTACTTATTTGAAATTTTTGCCATATTTACAAAATATTATATGAGTTACACACTTATTAGCATATTCAAATTGTCACCCATTGCAATTTTTTTTTTTTTTTTTAAAAAAAACCTAAATGAAATTAAAAAAGCAAAATTAGAGTCCCTCCCTCCGTCTCCCGCCAAACCAAACCTCCCCAAAGCCCTAGTCTTGTGGTCTCTTCTTCTCTAGCCTCTGCTCCTCCACATTCCGTTTCCAAAGTCCTCGAACTCTCGACTCCATAACTTTCTCTCAAGCTCCGCATTGTCTCTCATTTTATTTCTGAACATTCATTTTTCACTCTCGTAGGATGAGTTTTGCTGCTTATAGAATGATGCACTGGCCTACCGGCATCGAAAACTGCGATTCAGGCTTTATCACCCATTCTCGCGCCGACTTCGTCCCCGGTGTCACATCTCACACCGACGACCTTGACTCCGACTGGCAGCCCCGCCGAGAAATTGGTCCAGTTCCCAATCTCGTTGTCACCGCCGGCAATGTCCTCGAGGTATATGTTGTTAGGGTTCAAGAAGAAGGTGGAAGAGAATCAAGAAGTTCAGGAGAAGTCAGACGCGGTGGCATTATGGATGGAGTCTCTGGGGCCTCGCTCGAGCTTGTTTGCCACTACAGGTATGCTCCAAAGTTTTGTGAATGTACTAATTGATTTTCATGTTACCATGTTCAATCTTATACACCATGGAGCAGCTTTTTGCGAGCTCCTCAATTTTGGGTCTTTTTGTTGCGTCTCATTTTCTACTTCTCAACAAAAATTAATGTGACTAGTTATCTTGAGCAAAGGAGGTTCGTTGATTTATATTTGGTAAGCACTTGCTATTTTTGTTCAGTGATAATTTACTATTCTTGCGTTTTGCTTTGCAGGTTGCATGGTAATGTTGAGTCCATGGCGATTTTGTCTAGTAGAGGAGGTGATGGTTCCAAGAAGAGAGATTCGATTATATTAGTCTTTCAAGAAGCAAAAATTTCAGTGCTAGAGTTTGATGATTCTATCCATAGTCTCCGTACAAGGTGGGTGGAGGTGTTAAAGGAACGAGATATTTGTTTGCCTTTTTGATTGTTCTCTACAATCATTTAGTCATTCACATGTGCAGGTTTTTGAACTTTCAAACAAAGATAATGTGCATAATCAGAAATAGTTACATTTACATGGGATGTGTAAATTTTAAGCTTTTGGTATTTTCTTTTGGCTATATGTGTCATGTGTATCTACTAAAACTCTTTCTCATTGTTTGATCTATGATTAACATAATTGGGGACTAAGTCCCTAATTGTCGAATTCATATTTGTCACAATTCTTGTGATTTTTCCTTTATTTTTTTTGGAGAGAAAAACATAATTGGTGCAAGTAGAAAATCGTTCATTCCAGATGTGATATCAGTGACGAGTTTTATTTATTATTTTTTGAAAGATAAGATACGAAGGCAATATAATTATTTGTTGGCGTTGAATGTACATTGCTGGAGCTTTTTTGTTCTTTCAAGATTCCTATTAGAATCTATGAAAACTTACATTTAGTATGGAATATGGTTCTTTAATCCGATATGCTATGCGATATAAATGTTATTCTCTTATCATCCTTTTGGTTTTAAACTTCTTAATTGTCTTGTTTATTTTCTCAAAATGGTTCAACATTATTAGTCAACTAGACCATCTTCTATTGAAGTTATATATATATATAAAATTCATGGAACATAAACAAAGCTCCAGGAAGTTACTAAAATCCTATTATATATTTAGTATCTAAGAACTCAAAGTGTAAATCATTCCAAATGCACATGATCCTAGTCTTTCTCCACTCCCTCGAATGTTGTGGATTTACGCTGCCTACTAGCTCGATTGACCAACAATTTTCTTGGGCTCTAATCTCATTATGTGACTTAAATATCAAGACAACATACTTGTTTGTTATGCATGTAATTATCCTGTCCCTCCAAATAATTTCCTAATTGTTTCCTTCCGTCCTCCACCAGATAGATGTACAAAGGTACTATTCTTGGAAAGAACTTGTGCAAGATGATAGCCTGGTTATTTATTATTTTAGTTGCCTAAATTTATTAGGCATTTGACAATTACTATTGAGTGGAAATTTTCGTTTGTAGCTCAATGCATTGCTTTGAGGGCCCTCAATGGCTTCATTTGAAAAGAGGTCGGGAATCATTTGCAAGAGGTCCAGTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGAGGAGTTCTTGTTTATGGTTTGCAAATGATAATACTTAAGGCTTCTCAGGTGTTTTTTGCCTTTCAAACCTTTTTATTTGGTTGTGCATGACAATTTAATGCCATGCTTTGTTAGTACAATAAATATGTTAAAATTGACTCGTCATTCCCAATGTTTGCACGTCGAGTGAGATGAGATATGCCCAGCATGTTTAGGTAATGCACTAGAACTGGACGTAGTAATTTTTAGTGTTGATGGTAGTCAAACTCATACTAATGGAGGTGACAAATATGAAGAAAATGTTTAGAAAACTTGTTTGACTATACTTTTGACTCTTTTTTTTTTTTTTTTTTTTTTGTAAACCTTAATGGACATTGTTGATTGCCATTATATTTTCAGTTGACTATTGGGAAGAAAAATAAAATTGTTGTATTTTTTTTCTCACATATCCAAAACATATACAAATTCCTCTTATTTAGAAAAAAAGACCATCAACCAATAAGGAAAAGGAATCATCAACAAATGAAATCTGTTCTTAATCAATCTAACTTTGTAAGAGTTGGTAGACAAAAGGAGGATGCTGAATCTGTTCCATCTTGATAAAGCCTTATTGACATGGAAGGCAAGTACCTGACAAAACGATACTCCTATGTTCTAACTAATAAAGGATGGTGTACCATCGGTAACTTTATTCATAAGTTTGAAAGGTGAGATCAACCATCAATGAAAAAATTTCCATCATTCCTTCCTATGGTGGATGGATTAAAATTTGAAACATGTTTGTTTTCTTAGGCTCCTTGCAATGGTCTTTTTCTTTTTTGCTTGCATGCATGTATCTTCATTTATCTAAATGAAAGTTTGGTTTCTCATGATTTGTTTGTTAGGCTGGTTCTAGTTTGGTTGTGGACGATGAAGCTTTTGGTAACACTGGTGCAATTTCTGCTCGAGTTGAATCATCATACCTAATTAACCTAAGGGATTTGGATGTGAAGCATGTAAAGGATTTTGTATTTGTACATGGTGAGTGTGTATAGAAAATTCCTTGACTTTCTATCATACTTATCCTGCTTTATTCTTACCCCTCATATTTGATTGATTTATTTATCTATTTATTTTTATCTATTATTATATTGTCTCGACTCTTTTTCCTTTAGCATTTTAATTATAGGGACCATTAGGAGAATTTAAGAATTTTGACGATGTTGATCATTCTTCATCGATATGCATTGTTCATAAAGTTTTTCTTTGGAAGCATATATATTTTATTTGGGATAAATGGTGTATCAAGAGGCCTTAATTTTGCATTATGCTCATGGCCTACCTTTTCGAATTAATAAGAGGATCTTATAATTACTTTCACACCCAACAAGAGTTGTCTCATTAAAAGCAGGGAGCATTGGAAATCAAATGATTAGAAGACGCAATCAAAAGCAGGAGGGATGTTGATTTTAATGGAGAATGCTCATGTAGTCTTGAGCAAGAGACTGGCATGAGTACTCTGTACTTCCTTCAAGGGCAAGAGTGAACCACCCATTTTCCTCTACTCCATACACAATAGTTATGACCTTCCATAAAGGACCTTCTTCATTCACGAATCTCCAAAGCCATTTCTTCAAAAAAGCATTATTCCATTGTTTGAAGGTCGTATTCCCAAATCCCACTATAAATTAGGGAGGGAAGTCCACACCCAATTAGCAAGACGAGTGCAATTAGAGTTTTTTTTTGAGTGATTAGGACTTTGTGAACCCTTAGGGTCTTAGGTGTAGAGCACCTTGTTGGAGGTTTTGATCTTCTACTTTGGTTTTGGTGTATCACCTTTGGTCAGGACCTGTTATCACGGCTAAAATCTGCCGCCGCCGCCGCCATCGTCGTCGTTAAGGTCTGTTACCGGAGTATTTTTTCATCACCGCCCAACGTAGCAAAAAAGCCTACTTGCCGATCTATTGGAGCCCAAATTTTGGTCACAAAATCTCGAGCACATGAGGCACACGTGCTGTCGAAAATCGCCGCCCTTCATCACACGTGCTGGTGCATGAGCGTGCGCGACTTGTTGGTTTTTTACCTTGTCGCCTGGGTTTGTATCGGTTTAGGTTTTCGATGTGTCTGTGTGTTCAAATCCAACAAATCCTGCTCAGTTGTAAAGAAATTTGGACAAGTAGTTGATTTTTCTTTCTTGGTTCAGATATGTTTTTTGACGGTACAGTTTTGTTTTTGAAGCCTTTAGTAAATCCTTGGTTCTTACTTTTTTCAACTGTTCATGTGGATATTGTGGTCATAATTTGCTTGAGTTCGAAGTTGTTTTGTAAAAGAGGTCGTAATGATGCCTAATGTGGTTCAATCGAGGGCTCCATTCATCCATGGTGTATCTAAGAAGCTATCTGTTAATTCCTGCAGTTGAGATCGATGTATTTCAACAGTACCAGAAATCATCTAAAGTATTTTCTTCATCCATGCCCACTCCTATTACAACCATTGCTGGTTCAGGTAACATGAATTGTTGTCTTTTGTCCTCGTCTATCAAATGGGTCATAGATTTCGGTGATACCGATCATATGACAGGTAATCCCAAATTGTTTTCCACGATTTTGTCATCTACATCCCTCCCAATTTAAAGGCTTTGGCACCTTATGGCGATCCTGGATCTGTGGTTGTGTCTCTAGTGCTAATTTCTCCATTATCATTAATGGCCGGCCTCGAGGTAAGATCATTCCATCTTGGGGTATTCACTAAGGTGATCCTTTGTCTCCGTTTCTTTTCATCTTGGTTGCGGATTGTTTGAGTCGTTTAATGGAGCGTAGTCTATCATTGCAACCCACCCAATTGGTTCTTCTGCCTTTGTCCTTAGCCACCTTCAGTTTGCTGATGATACTCTTCTGTTCTCCACGGTTGATCGGTCGGCTTTGCATCGCCTTTTTGAGGTTATTCACATTTTTGAACAAGCTTCTGGTTTAGCTATTAATCTCTCCAAGAGCGAGCTGGTTGGGATTAATATTTCGGACACTGATTTAGCATGGATGGTTTCTACCTTTGGGTGTAAACAAGGTCATTAGCCTACAACTTATCTTGGGCTTCCTTTGGGTGGTAACTCACTTGTCGCTAGTTTCTGAAACCCTATTATTGAGAGAATACAAAGTAAGCTTCATAATTGGAAATATGCATACGTCTCTGGGGGGGGGGGGGGGGCGACACACTCTTGTTCAGGCTACTCTGTCTAGCCTTCCCACATATTATTTATCTTTATATTATGCTCCTGCTGAGATTATCAAGCGTCTTGATAAATTGGTTCGAGATTTCTTTTGGGAGGGCTCTTGTGGTAATGACGGGATGCATAATGTGAATTGGGAGTTAACTTAGCGTCTTAAATTAATGGGTGGTCTTGGTATTGGTAATTTTCGACTTCGAAATTATGCTCTTTTGGCAAAATGGATTTGGAGGTTTATACGTGAAAAAGATGCTTTATGGAGACGCTTGATTGTTGCTAAGTATTACTCTCCCTCTTTTACTTGGCCGTCTTCTATTCGTGTTGCCACAAAAGCCCTTTGGAGATATATCTGCCAAACCATTGACCTAGTGGCTAGCCGCACATAGTTTAGACTTGGAACATGTACCACCATATCATTCTGGAAGGATCATTGGCTTAGCTGTGGTGCTCTCTCTATTGCTTTCCCCCGTTTATTTCGCCTTGCTCGTTTTCCTAACATTAGAGTGGCTGATGTTTGGATTGCGGACTCAGATGCATGGACCTTCACCTTCGCCGCAATCTTAATGACCTTTAAGTTCTTGAGTGGGTTTCCTTGTCTCAATATTTACTACTTGTTCGTCTTCGCCATACCCCCGACACTTGGATTTGAGCTCCTTCCTCATCTTTCTCTGTTAAATCACCGATGGAGGATCTGGTTGGTTCTGCTGAACCCCGTGTTAAAGGTTTATATTCTGCTATTTGGTTTGATCACTTTCCTAAGAAAATTAAGATTTTCTTATGGGAGCTTAGTTTGGATGTTATTAATACCGTTGATCGTCTCCAACGCCGTATGCAATATATGTCTATATCTCCATCTTGGTGTCTTATGTGCCAACGCCACTTTGAATCAGCCGTTCATCTGTTTTTGCACTGTTCCTTCGCTGCTCGTTTCTGGCATGTTGTCTTGGCCACTTTTGGTTGGTCTTTGGTCAGCTCCAATAACATGTTTGATATTCTGGCTTCTCTTCGGTGGGACATCCTTTCTTTGGAGACAAAAGGGTGCTTTGGTTGGCTATTTTGCGTGCTTACTTTTGGACTTTATGGGGTGAACACAATAAACGTCTTTTTAGCGACTCTTTTTCTGATTTGATCATTTTATGGATCTGGTTTTATCTACTGCTTTTTATTGGTACAAAAATAAGCATCCTTTTTAGCACTTTAGCCTTTCTTATTTAGTTTCCAATTGGAAATATTTTCTGTAACACCTTTTGGTGCTTTGGGGTTTCCCCATTATTTCATTTTATCAATGAAATGTTTCTTCTCTAAAAAAATCAATCATCGGGTGCTGATACTCCATCTCAAAATGGGGTTGCTGAATGAAAGAACAAACATCTTTTTGAAACTGTAAGAGCATTATCCTTTCAAATACATGTTCCAAAGCATTTTTGGGTTGATGTAATTTCCTCGACTTGTTTTCTTATTAATCGAATGCCTTCGTCTATTCTTCAAGGTCAGATTCCTTATCATGTTTTCTTTCCAACCAAGCCATTGTTTCCTATTGAGCCAAAAATATTTGGTTGTACTTGTTTTGTTCGAGATATTCGTCCTAATTTGACAAAATTAGATCCTAAGTCTTTGAAGTGCATTTGAAGTGCATTTTCTTAGGCTATTCTCGTGTTTAAAAAGGATAGAAATGCTATGTCTAGGTTTGAATAGTTATCTTATATCACGGGATGTTACCTTTTTTGAGGACAAGCCTTTCACTACATCTTTGCCTAGTATGAGTCAGGGGGAGGATGATGATCTTTTTGTTTATACATTCATGCCTTCGAAACCCTCCCTTGATTTGTCTCTTTCAGAGTCTGTCCCTAATCGTCCACCTATAACTAAAGTTTACTCTAGAAAGCAACAACCTCTAGGTGAATGTTCTGTCCCAGAAGACTCTTTGTCGTCAGATCCAGGACCAAGTGATGAGCTTCCTATTGCCCTTCGTAAAGGTAAACGTTCTTGCACTTATCCTATTGATTCTTGTGTCTTTTATGGCCACCTATCATTTGTTACATGTTCTTTTGTTAAATCTCTTGATTATGTCTCGATCCCTAAGACCGTTCATGAAGCTGTCTCATTCTGGTTGGCGCACTGTGATGATTGAAGAGATGAATGCCTTGGATGATAATGGTAGATGAGATTTAGTATCTCTTCCTGCACGAAAGAAGACTATTGGGTGCAAACCTGATGGATCGGTAGCTCGTTAAAAGCGCGCCTTGTGGCCAAAGGCTATGCTCAAACATATGGAATTGACTATTTTGATACATTTTCTCCTGTTGCTAAGCTCACTTTTGTTAGATTATTCATTTCTATGGCAGCATTTCAAGGTTGGCCTTTGCATCAGCTTGCCAATAAAAAGGCCTTCCTCCATGGTGATCTTCAGGAGGAAGTGTATATGCTCACCACCTAAGTTTGTTGCTCAGGGGGAGAATGATAAAGTGTATTGCCTTCGCAAATCTTTGTATAGGTTGAAACAGAGTTTGCATGCATGGTTTGGAAAATTTAGTTAGGCACTTGAGCTATTTGGAATGAAGAAAAGAAAATCGGATCACTCAGTCTTCTATCGACGATCTATGAATAGTATTATTCTACTTGTTGTATATGTTGATGATATTGTCATAACTGAAAGTGATACATCAGATATAGTCTCTCAAATCTTTCCTCCAAAGTCAATTTCATACTAAAGACTTGGGAGTGCTAAAGTACTTTTTGGGAATTGAAGTAATGAGAAGCAAGAAGAGTATTCTTATGACGCAAAGAAAATATGTGCTTGATTTACTATCTGAGACTGGAAAATTGAGAGCCAAGCCATGCAGTACTCCGATGATGCCTAGTTTGCAACTTTTAAAAGATGGAGAACCATTCAAAGATCCTAAGAGATATAGAAAATTTGTTGGAAAGTTGAATTTCCTTATAGTAACACGACCAGACATAGCTTATTTGGTAAGCATTGTGGGTCAGTTTATCACATCACTCACAGTGGATCATTGGGCTGCAATACAACAAATTCTATGTTATTTGAAGGCTGCACTTGGATGTGGGATCTTATATAAAGACCATGATCATATGAAGATTGAATGTTTTTTAGATGCCGATTGGGCAGGATCAAAAGAAGACAGAAGATCAACTTTTGGCTATTGTACTTTTGTTCGAGGCATTTTGGTTTCGTGGAAAAGTAAGAAGCAAAAGGTAGTCTCACCATTGAGTGTTGAATCAGAATATAGAGCTATGACACAATCGGTTTGTGAAATAATGTGGTTACATCAACTCTTGATTGAGTTGAGCTTCAATACTACAGTTCCTACTAAGTTGTGGTGTGACAATCAAGTTGCCCTTCATATTGCATCTAACCCAATATTTCATGAGCATACTAATCATATTGAAATTGATTGTCATTTTGTTCGGGAAAAAATACAAGGACTGGTGTCTACTGGATATGTGAAGACTGGAGAACAATTAGGAGATATTTTCATGAAAGGTTTAAATAGAATTCGAATAGAGTACCTATGTAACAAGTTGGGCATGATTAACATATATGCTCCAGCTTGAGGGGGAGTGTTATGTATATATGTAATTATAGGAGCTTATTAGTCTTTTATCCTTTATTCTAATTTACTATTCTTAGTTAAGGGTTTCCATTTATGTCTATATATATTGAACTCTAATGTATAGAATGATTGGGGTGTTCTCATCAAACAATATCAAAGACATCCAGAAATTCAAGACCAAGGCCTTCCAAATCAAAATACCAAGCACGTGTCCTGCAAAACATAAACCCAAAAGAAAAAGTCATCTTCCAACAAGTTTGCACCAAGACCAATTTAACAAACTTCCATATCTTCGATATAAGTTTTACTGCTAAAAGAATGAATCATGATTTTATTGTTTCTGCTTAACAAACTTCCATATCTTCAACTTTCAAACATTTAATATTGGTGTATACAAATTCTTGGAGAGTTCTTGATTTGAATCACGATTTTATTGGAATGAATGCTACCTCATTACAACTGTTAGGTGTATTATGGAGCCCCTATAGTACAAACTTATGTTAGGAGGTTTAATAGTTGAAATGAACTGTAATACTGTCTTTATAGTTCTTGCCCATGTATTAAGGGTTATCTTTTGTGTTAAGTATGAACTGAGTGTTCTAGACTCCTTGAGTGGTATCCCTGTGTATTTCTTCTCCCTTTTAGGGCTCCTTGTGAAGTTTGTAAGAGTTTGGAGAAGATGATGCGTGATTTCCTCTTGGAAGGGTTTAAAGAAGGTAAAGGAAAAGGTTTGCATTTGGTGAGGTGGGAGATTATCAAGAGACCGATGTCTATAGGGGGTCTTGGAATTGGGAATCTTAGGGTGCGTAACAAAGCTTCGCTAGCCAAATGACTTTGGTGTTTCTCTCAAAGCCTAATTCTCTGTGGCACAAGATCATTGTTAGTAAAGTGTAAACACGATCCCATCCTTTCGAGTGGTTGTCTAAGGAGGTCAAGGGCACTTACCGGAATTTGTGGAAAGATATTGCGAAAGAGCTCCCTTCTTTTGTTCATCTTGTCCGTTGTATGGTTGGGGAGGGGTGTAATACGTACTTTTGGGAAGATCACTGGGTGGGGGAGAGACCTCTTTGTGGTTTTTTTCCTCGTTTGTATTAGTTGTTGGCTCTTAAAAATCATTTTGTTTCTAACTTCCTCGTGTGGTCTGGGAACTCTTGTTCTTTCTCCTTTGGGTTCTATCGCTCTCTTTCCGATAGGGAAGCGACAGAAGTGGTGGGTCTTCTTTCCTTATGTGAGAGTTATTCCTTTAGAAGAGGGAGATGAGATGTTAGATTTTGGAGCCTTAGTCCTTTGGAAGGGTTCTCGTGCAAGTCTTTCTTTCATTATATGGTTGATCCTTCTCCCTTATGGGTGTCGGTTTTTTTGGTTCTTTGGAGGATTAAGATCCCAAAGGAAGGTGAGGTTCTTCACTTGGCAGGTGCTCCACGGTCGTGCTAGCACGATGGATAGGCTCGTGAGGAAGATACCCTCGCTTGTTGGGCCTTTTTGTTGTATTCTTTGTCGGAAAGCAAAGGAAGACCTTGACCATATTCTTTGGAGCTATGAGTATGTGAGTCGTGTTTGGGACTCTTTCTTCTAGACGTTTGGTTTGACGACTGTGCGACGCAAAGATGTCAGTGACTTGATCCTCCTCAATCTACCTTTTGGGAGGAAAGGTAGTTTTTTATAGAGTGTGGGGGTGGCAGCGATCTTGTGGGTTTTGTGGGGCGAGCAGAATAGGAAGTGTTTAGGGGGTTGGATAGGGATCCTTTTGATGTTTGGTCCCTTGTTTGTTTTCATGTCTCTTGTTGGGCTTTATTTTTGAAGGATTTTTGTAATTATTCTATAGGCTTTATTGTGCACAGTTGGAGTCCCTTGTAGAGGGAGCTCCCATTTTTTTTTTTTTTTAACTTGACTTTTTGTATGCCCTTGTATTCCTTCATTTTTTCTCAATGAAAGTAGTTTTCATTTAAAAAAAAAAACCCGAGTGTTCTGGACTAGAGAAAACCACACTCTCGAATTGATTGAATCATATGTATTTGTATCTTTTGTGCTTAATATTACAAATCAAATCTCCTAGTGAAGTTCTCACAACAGCTTAGGGTGATAAGGAAATAAGTGGCAATGATAAGAGCTAGAGCTAAATGTTCTTACACATGGTGAAGTGTTGCAACCTTTCAAGTTTTGAAAGCCACATTTTCACCCAAAAGATATAATGAATCTGAAAATCTAGATTTATTCTCATCTTTAATACATTATGTACTTATTGATTTAATGTATTTGCACTTATTGAAGATTCTTGTATGAACTTGAAGGGTTGGGGTTGAACCTAGACAACATAGATAAAAGGTTGTGTAATTTCATGGTCACCTTTCATTTAGTATAAATAGTGTTTTGTATCTTTTACATTTGAAGTTGAATTTGAATTTGAATGAATTTATCATGGAGTTAAATCTCTCTCAAGGTGTTATGCTTTCTTTGCATTCTTGTGTTTAGAATGTATGCTTAAAACTTTCAATGTAGTTTGATTAATTAGATTGTGGATAGTTTGAAATCTTGTAGATAGGATCAAGTTCTCTTTTCTTCTTGATATTTGACCATTTTTTCAAGCCAAATTTAGTGAACTTTCTTTAGATTGTGTATCGATCCGTTATTGGGGGTCGTTCTTATTATTGTTGTGAGGGTTCACCTTGAATAAAGGGAGTTTTTAAACGTAACTGAGATAATTTCCTTTAGCATAGGGACCCTATTTAAAGGCTACAAACCGGCCTTTTGAACATGGTAAATAAGAATTACCATGAGGTAAACTTTAAAACATTGAAAGGGAAAATACAAAAGAACTTATATAACAACCAAAGAAAACCGGCTGAACAAAATGACTAACATAATTAAATAGAAAGCCGAAAACCCCTACATCAAATATCTATGCCCATCAAACAAAACCCATGATCCAAAGGGACCGTTATATTCTCCCCCATATCATTATGCTTATTTAGCTATAGAAATCTGATGGATGTGTCTGCTGGCAAATTAAAATTCGACATTTTTTCATATTCACGTTGCTTTTTTATTAAAAAAGAGCTTAAATAAGAGAACAAGCTTTTGTCTCAATTCATTTTATCAATGAAAGAGACTCGTTTCCTTTTAAAAAAAAATAAAAATAATAAGAGAACAAGCTTGATTTAAGAACTTGTTGACCAAAAGTCTCATACAAGAATCCCCCAAATCTCTTGTATTCCATGAGGTTTTGGTGCAGAGTTTGTATTTACATTTTCAAGTGTGGGGTTCTGTTTTTTGGTTAGAGTTGAGGATGGTGGTTTTCATACATTCTCACATTGTAACTTGTCGCTTGTTTAATGAAAGGTTTAATTTTTTCTTTTTTAGATTAGGATTTTTTTTTTTTAAAATTTTTTTAAATGTAGTTTGGTTTTTCATTAAAAGATTAACATTTGAATGACTTTTTCTTGTATGATGTTTCTGACTGAATATTTCTCCCCATTTTAGGTTATATTGAACCTGTGATGGTGATCCTTCATGAGCAGGAGCTTACTTGGGCTGGCCGTGTATCTTGGAAGCATCACACGTGTATGATTTCTGCGCTAAGTATTAGCACAACCTTGAAGCAGCATCCTCTAATATGGTCTGCCAACGTAAGTATAAAGTTGGAAGCTCAAAAACTTCACCAACTTAAACCGTATAGATATATCATTTTTATCAAATAACCTATTGCATTTAGACCTTTTGATTAATCGGCATATTTGTTGAAATTACCCTTGGTTTTATTTAGTGTTGAAATTACAAATATAGCATTTTTATCAAATGATATTTTAAAGCACATAAGAAATAGTAACGGATTTCTTAAAACAAGAAGAAATTTGCTGCTGCTTTGCATCCTCGACATCAGTACCACAAGTTGAAGACTTTTGCCAATTAATCTTGAGCCTAGAAAATTGCTCAAAAACTTCCATGGTTTTGATTAAAATCTCTAACATCAAATCCTTGTATTTACAATATAACAACGTTGCATCCACAAATTAAAGAATTGAAACTTGAGTGTTATCCACCCAAAAACTACTGCAAAATTTTGTCTCCCTAAAAAAACTTCTCCTCGAAAAAACCTTAGTTGCCTGAAAAATCTAGTTGCCTTCCTTCTATCCCTCGACCATGGCCGCATCTCAATCCAACTTCCGATCATCATTGTCAGCCGTAGACGCCTCCAATCACCTATTGCCGCCATCCTTGGCTGTCGATCTGTAAGGGACCTAGTTAGAGATATATGGGATTATTATTGGGATAATTAGTATGATGGGGTGTAAAGGGTATAAGGGTAATTAGGTAGGAAGTTAGTTATTAATGATTGCTATAAATAGAAGGAGTGGGAGTGGTAGGAGGGTGTGAAGAATTGTGGGATTTCCTAATTGGGAATTTGGGAGAGGATTCTCAGCCCTCTAGAATGTGCTGAGGTATATTGTAATTTCTTTATTGATATTGCAATATAATTCTCTTTTAGTTTTCTTGTTTGTTCTTTGTGTTCTTGAGTTCTTTTGTTAGGAAGAAATCCTAACACGATCCTTGTTGTCAACTTCTTTTCTTTCTCCTTTTAGGTCGACATCTTCCTCTTTTCTTCATTCTTTTAGTTGGCGTTCCTCTTCATCCCTCTCCACCATAGACTCCACCAACGATTGCAGACATCTCAGCCACCTTCCCCCGACGCCCTGGTCATTTCTCTATCATCAGTCACCGTGCCCTAAACCCTAGATTCCCTCCCTTTTTTCTCACTGATTACCCTCCCTTGCCCATCCCCATATTTTTTTGCCAGCATTCTATCTCTCCTTCATCACAATTGTCTTTTGCCATTACTGCCACATTTGATAGCCGCCTCTGCGTGCAACTCACTGATTAATGCCATTTTTGTGGGTTTTTCATTCTATTTTTTCATTTCTAGTGCAATTTCAGTGTGGTTTTTTCAAGTTACGATGGAAGTTAAGAGTTGTTGCATTTAGAATTCCTATTTTTTGTATTTTGAAATGAGGAGAATGGGTTTTTTTTGTGGAAGATACGAGCTACAAACATAAGATAAGCCTTTTCTCTCCCTGTTACGTTGGTTTGAAGCTAGTTTGGTTGAGCTTTTGCATCTCCGTGAGCAGTCATTTTTTCAATAAAAGGATTGTGGAGCTTCAAGAGTCATCCAATTATCTAAGTTTAGATCTTCTTATGAGTATGCTATTTGGCCTAGTACTAGGGAAGAATGAATATCCACGTCTATTTACTTCTGTCGCTTGCTTAAAGAGGGTAGACTTACCGAGTTTCAAAATCTTTTAAGCCTGATTTCGAGTAGAAGGGTGGTTGGGAGTCTAGATGGGCGTATTTGGTCCTTGGAAGCTTCTGGCATTTTCTCGGTTAAGTCTCTTTCTAAGCACTTGACTACACCTTATCCTTTGGATAAGTTGATTTATAAAGCATTATGGAAGTCAAAGAGTCCAGACGTGTTAATATTCTCGTATGGATAATGATTTTTGGGTCATTGAATGCTGTTTGGTTATTCAAAGGAAGCTTCCTTCTCACAGCTTGTCTCCCTCTATTTATCCTGTGTATGGCTGAAAGTGAAGACTTGCAACACCTTTTATTTGGGTGCTGTTTTTCTAGAATATGCTGGTGGCATTTGTACTTTTTTTTTTTCTTGGATGAGAAACAATTTCATTGATTGAATGAAATAATCCAAAGAGGTACAAAAGGGGCCCATCATAGGATTACAGAAAATGTCTCCAATTTGCAACAAGGAAATTTAAACTAAAATTGTTAAAGGGATGTGTATCTTTACACCAAAAAAAGTATGGAAGATTGATAGATTTAAGATTTCCTTAGAGGATCTAACTTTGCCTTTGAACAGCCGAGCGATTGTTCTCCTTCTCCAATATCAGTTGGGTATTTGTTGAGGAGTTTAAAATTTGTTGAGGAGTTTAAAACAAATGTGGTTCAAATTTTAGTTGGTTCAGCTCTAAAGTCTTGCCTCAACTTATTTGCTGGGTGAAATTTGGTTTGAAAGAAATCATGATAAATATCTTGCTTGGATGGATCATTTTGATGTTACTCGGATTAGTGCTTCCTCCTAGTGCACTCTTTCTAAGTCTTTTGTGGATTTTCCCATTCAAGACCTATGCTTAAATTGGAATGCCATCAATTTTCCAGCTTAATGTTTATGTTGCTTTTTTGTATTTGCTTGTTGAACTTCGTTACTTTATGTTCTTTGTTATCATTGTATTTCGTTTTAGTATCTTGTATTTTGAGCATTGTCTCTTTTCACCCCTTTAATGAAAAAAAGTTATTTTCTTTTCGAAAAAAAAGGAGGAAAACGACAACTTTTTGGGGACTTACATCTAGAATTGAACACTGATCCAATAAGTCTCATTCTTATATGGAATTCACTAGGAGTTACGGAGGATTGATTAGCATTAAGAATTTACCTTCGAGTTATTGGAAGAAATCAGTCTTTGAAGCCATTGGGCAACATTTTGGTGGTTTGGTTAGCATCATTTCACAAACTCTTAATTGTTTAGATTGTTCCAATGCAGTGATGGAGGTGCAAAACAACATTTGTGGTTCTATTCTTGTCGAGATTGTTATTAAGGATCCAAATATTTGAGATTTCTCTCTTCGGTTTGGTTGTTTGACGATGTTGTTCTTGGAAGCTCCAAGTTTGAAGTCATTAAGTCATGGTGTTCTATTTGCCAAAGACTTTTCAAACTCTGTTGATTTGGTCAAGTTAGGCAAGTTATGGAAGATGAAGAGTTTAATTTGGATGATGATGATCTACCTCTAGAATTGAATACTTTATGTTTAAATGATACCTTTGACAATCAAGGGCCAAGATCAGTTTGTTGCATGATGTTAACTCTATTTCAGTTGTTGGAGTTGAAGTGAATGATCATTTAGATTTTGAAGAAGCTCTACAAGCTCGAGTTGATCACCCTCTCTCTCATTAAAATGAGATTTCTCGTTCATTCAAGAAATCTTGCTCCCGTTCAGTTGTGGTTTGAAGATCTCACTTTCGGTCTTCAAGGACTTTTTCCTCAATTGCTCACTAGTCTTTAGAGGACATTGGATAAAGTCTAAGGCTTTGGGCTTCTTTTCGAGTGTTGTTTTGTTAGGTTTTAAGAGTTTGCTTTTTCGGAGTGTTCTTTTCATTGTATCTCAATAGTATTATCTACATTTGTTTTCCTTTGTTTTTCTCTTCTTTGGAAGGTTATTGTATCTTTGAGCATTGGTCTCTTTTCATTGTTTCTATGAAAAGTTTTGTTTCCGTTTCAGAAAAAATAAATGACGGTTTCAAAAATTGTTTTTAAAATATATATTTAGAAAAGCGAGGCATGCTTTTTTCCCCCTTTTGTTTGGGCTTCGAGTTTTTGTTTTTGGGGTTTTGGTTGGGCTATCTATGTTCGACTTTTGGAAGTTGTTGTTACTTTTATTGTAATTTTTGATTTTATCATTAATTTTTTTTGTTACCTTGTTAAAGAAAGCAATATTTTGATTTTGTCTTTATAAGGCTATGGCGTTCCTCCTCTAAGCCTAGCTCCCTTTGGTGTAGGATTATTGTTAGTAAACATGGGTCATCCTTTCGATTGGCTGTCCAAGGGGGTTAAAGGTACTCACCAGAACTTGTGGAAGAATATTTCGAAAGAGCCTCTTTCTTTTGTTCATTTGGTCCCTTGTGTGGTGGGGGAGGCGAGGGATACATATTTTTAGGAAGATCATTGGCTGGGGGAGAGACCTCTTTGTTTATTGTTCCCTCGTTTGTATCATTTGTTTGCTCTAAATAATCATTTTGTTGTTGACTTTCTTGTGTGGTATGAGAGCTCTTTTTCCTTTTCTTTCGGATTCTGTCTCGCTCTTTCCAATAGGGAAGCAATGGAAGTGGTCATTTTTCTTTCTTTACTTGAGGGTCACCCCTTCAGGCGTAACCATGTAGGAGAAGAGATGTGAGGATTTGGAGTCTTGATCCTTTGGAGGGGTTCTCGTGTAAGTCTTTCATAATTTGGTTGATCCTTCTTCCCTAGGTTTGTCGATTTTTTTAGTGCTTTGGAGGATTAAGATTCCTAGGAAAGTGAGGTTCTTTACTTGGCAAGTTTTACACGGTCGTGCTAACATGATGGATCGGCTTAAGAGGAAGTTGCCCTCGGTTGTTGGTCCTTTCTGTTGTATTCTTTGTCCAAAGGTGGAGGAAGATCTGGACCATATTATTTGGTATTGTGGGTTGGCCAGTTTAGTATGGGATTTTTCTATCAGACGTTTGACATGATGGTTGCTCGTCACAGAGATGTTTGTGCGATGATTGGGGAGTCCCTTCTCAACCCGCAATAGTAGGGTTCTAGAGGTATGGAGAGGGACCCTAGTGAGATTTGGTCTCTTGTTCATTTTCATATTTCTTTGTGGGCTTCGATTTCGAAGACTTTTTTTTTGTAACTATTCTATAGATGTTATTTTACATGGTTGAAGTCCCTTTTTATAGAGGAAGCTCCATTCTTTTGTGGCTGGTTTTTTGTATGCCCTTGTGTTCTTTCATTTTTTCAATGAAAGTAGTAGTTTTCATCAAAAAGAAAAAAAAGAAAAGAGAAGAAAAGAAGATATTATTATATAAATTGAGACTTACTAAAGACCCTCCACATTTTTCTTTGAATGATGATTCTCATTAGATTGTTGGAGGCAACTTTTCTCCGTTTTTTTTAATCTTCATTGGGCTTCAAGTGTTGTTTTAAAATCCATTAATAACCAGATTTTAGTGGGTTTTGTCTTAAATTTCTTCTTCTTATGGGGAATTTTGAGTTTTTTGCTTTATTTCCTCCTATATTTTGAGTATTAGTCTCATTTCATTAAATCAAATGGAAATTTTGTTTTTCTTTAATATATATATATATATATATTTGTAGATATATATAAATTAGGACTTACATGTTATGCTGCAGTGAGATGACACGTGTGCTGCATTGTATCAGGGGAAAAGCTTTGCCCTCCTTTGATCATATTGCTATGGTCTATCTCTTCTTGACAAAATTGTCCATCCGTTCCCTTCACCATTTTTTGAGTTTTCACTAAATTCTTTGTCTAATTGAAGTGAGCTAAAGCAAGGGTTAATCTTGTGGCCTGTAACTATTCAGAATAAGTGGAAATAGTGGTACCTGCGCAAAAAGTTTCATAAAATGATTAATGTTTGATCAAGATGGTCAATGTTCATTTGCAACTTGACCACTTCCATGGTATTGGTGGGGTGAGGGGGTCTTAATTACTTAGACGTTACTATTAATCTTTCTTTTCTTAGAACTCTTTCAATTGACTAATTTTCTTTCGATTGCTTTTTCCTTTTTCCGTTTTATGCTGTAGAACCTCCCTCATGATGCTTACAAGCTACTTGCGGTGCCATCACCAATTGGTGGTGTACTTGTCGTCAGTGCAAATAGTATTTACTATCACAGTCAGGTAGCATTCATTTTCTGTTCATGTACTTTTAATGAACAAAGTATGCCTTCTGCCAAAAAAAAAAAGAAAAAAAGAAAAAAGAAAAAAAGAAAAAAGAAAAAAGAGCTGAAGTAATTTTTTGTACGAGTGGCCTCGTCCTATATGGCTTTCTCCACTTTACATTTTGGTTACTTACTCTGGAATTTTGTCATACTTTTGTCAATCTAATGTGCTGGTTCTCTATTTCAGTCAGCTTCATGCATGTTGGCTTTGAATAATTATGCTGTTTCTGCCGATAGCAGGTTGTGTTTCTCTCTCATTTTCATTTTTTGTTTATTTATCTGCTTTTTTCATTTATATTTACGTGACCATTTCCAACCTCTCCTCCTCCTTGAAATTGGAAAGAAACATGTTATCTGTCTAATATCGTTTATCAGTTTTTATTTTGGCATTCATGCTGGGACATCATGTAAATCAATATTTTTTTTTTGAATAATATGTAAATCACATGCCTATATAAGATAACAAGTAGAATGTTTTCATAAGGGGAAAAAAAAAAGATAACAAGTAGAATGTAAATCAATTTTGGCATTCAGTTCTGTAATGCTGGATAATGTATCCGTTGAACTGAACATTATTTTTCTTTTTTCTTGAGTGGTTGAGATATTTAGGGTTGTTGAGTTTTTTTTCTGTACCCTTTTGGTTGTGGTCCTTGGAAATTGATATCCCAAACTGGACCTCATTTCCTCCATTTTGTTGATTTTTACGTTAGGAATGACATCCACATTCATTTTATGAAGGATTGCAATTCATTTTATGAAGGATTGTTGTTGTGCAGTGCCCTTCGTTTGGTTTCTTTCACCTGGTTATCTCATGGACTTTATCTTCTTGTTCTTCCATTGTTTCCAGATTCCTTTCTTCTTCTTTTTCTTTTTCTTTCTTTTAAAAGATACTGAATCCATTAATGAGTTAATGCCTCAAATACAAGGCGAAGGCCCATATAAATAGGGGAAACCGGAAGATAGGCTTAACTAACAAAGATAACTAACATCTTATCAAAAACTAAAAAAGGGAAAAAAAACCTTTTTGGTCCCTAAGTTTTGAGTCTAGTTTCCATTTGGTCCCTAAGTTTCAAAATGTCATATATGAAGTCCTTAAGTTTTGAGTTTGATTTCAATTTAGTCCCTTAGGTTTCAAAATATTTGTTGTGGTTTTTCTCCCATCTTTCCTTTATTGATAGAAGCTGGGCTCTTTTGTTCTTGGAGTTTCAGGCTGTTTTCTGGGGTATTTGGTTAGAAAAAAATAGTTTTCAGGAGTACAAGGAGGTTTGCAGTGGTGTGCAATATAATGCCTTTTTGTGGGCCATTTCATCTTTTTTTTTTTTTGGTATAGCCTTTTTCATTTCTTGGTTTTTTTTGTCGCAAACTTATGGTTTCATCATTATAATTCAGTCAAGACATGCCTAGATCAAATTTCAATGTGGAATTGGATGCTGCCAGTGCTACATGGTTGGTAAATGATGTGGCCTTGCTGTCAACCAAGACTGGGGAGCTATTATTGCTGGCACTTGTCTATGATGGACGGTGAGTGCTTCCAGTTTTCTTGGTGCCCATATGTTTCCAATTTCACAGCATCATCATCATAATTGTCAGCTTCTCTCCTTCGAACATTTTTAGGATGAAATAATACTCTGTTGATTTGCAGGGTTGTGCAGAGACTTGATCTTTCAAAGTCTAAAGCTTCAGTACTTACATCGGTTAGTTAGATTTGTTGTAATTTTAAGCCAGGAAACTTTGATGCTACTTTGTCCTACCTTTTTAGTTGCTAGTTTGATTAAAAAACGAGGTTTATGTATAAGTATATAAATTCTGTTTTATTATTGATAGGGAACATGAAAGAATATTGAAATTGATGTTTATATATGTATATCTTGTTGGAATTATATGTAGGGCATTGCATCAATTGGAAATTCATTATTTTTTCTGGGCAGTCGATTGGGAGATAGTTTACTTGTGCAGTTTAGTTGTGGAGTGGGATCTTCAGGATTGGCATCCAGTTTAAAGGACGAGGTTTGTTGGGCTCCATAACTTTACTTTTGCTAATTACTTTGGACTTGCTTTTGTTGCTTACTTTGGACACTTCTAGTTAATGTTGTTTCCTTGTATTTTGGTGAGTCTTTGTTGAAGTGTTGATATGATAAATATTTTTTTTTAAAGGAAACAATACTCTTAATTGAAATAATGGAAAGAGGCTAATGTGTTCAAAATACAAGAAACTAAAATAAAGAAAAAAACCAACAAGATAGAATAAGATTAAAAAGAAAAAAATAAAATGATTCCAATTTAAACAAATATCTTGAATAGTATAATCTGCAAAGAACTTAGAAAGAGAGCACCAGGAATAGGATGACACTTTGAGGCAAGCAGAACTCAATCTATCCATCCAAGGAAGTGATTTATCTTGAAATGTATGTTGATTCCTTTCAAACCTAATTTCAGATAGAATAACTTTGATAACATTAGAATTTAGACCATAAAAGTTGGGATCTGTTCACCAAAGACAGCCTAACCAGAAGCTGTGATATTTTTCTTACATGAATTCAAAGATCCACTGGATATTAAACTAGAAGAAGTGTCACTAGGACTGTCTTAAAGTAAGAGCCCTAAAAAATGTGTGTTGCAAGTCCTAATTTGCAGCCAAACAAAGAGGACAAATAAAAGGAGATAAATAAAGCGTTGATGTAACTAAATTAACCATAACCCATCGGCTGAAGTTTTTGGGGCAATCAGTGATTTTACATGGTATTAGAGTAGGACGTTTTGTGTTCAAACCTTTGTAATTTCATTTCCTTCCACAATATTAGAGCAGGAGGTTTTGTGTTCCACCCCTTCATCTTCTCTGATTGTGGTTCAAATTGACCGTTTTTTTGCAGCGATCAGTATCTTGGTTTCATTGCTGGAGTTCATGGGCTCCATCCTTTGGTCGATGGTACAACTTTTTTTGACGCTTCTCCCAATGTTTTTGATGAGTTGCAGAAAAGCCTCCTTGGACCGCACCCATTAGTCGAATGTAGGTTTGGAACACAGGTACCTACCCTTGGGTTATCTTCTGACCCATCTTCTTCCTCCCTTCCTATCCACCCTCGATCTCTCCCCACTGGCCTTCTTATCCTTTAACACGTCTAGTTACCAAACCCATGATGCCCAACCATTCTGGGCCCAATCTTTCACTCGATCCACATGACACCCCTTTTTAGCCATATTCGATCCTTTGCCCCTTGCCCGAATCGCCAATATCTGACTCGAAACCCTCCTCTCCTTTACCTGACTCATCAAAAGTTAATCTCTCTCTGCACAAGCTTTCTGTCTCCATCCACAATAAAAAAAATATCTTATTTCATCCTTAGGACCAAATTTACCTCTTCAGTATTCCTCTTGCTCGAATTTTTCCGATCATGATGACCTTCCTATCAGCCCCCTTTTGCTTTCTACAACCCCGCGCACACCTTTCTCCAATCTCCCTCTGCCCTCTCCGGACTTCAGCCTTCAAAACACTATGGTTGGGATGACTCTATTATCAGATCGTGATTTAGACCCTGAAGAAGACATCGAGGTTGAAGCTTCCATCGACCCTATACACCACCTCCAAATTGTGGTCCTTTGGCTTATGGAACACAAATTGTGCATTATGGCTATTCCCTCAATCTCTACCTCCAAGAAAAGTAAACCCACCCAGAAATAGTCTATAGGCATAAACGAATTAAAAGGCCTCTTTTCCTCCATCAACTATGAAAAGTCCACTCCTAGATTAGAGCAAGGTGGGTTAGCCTCGTCTGAATGATTATTCTCTCCTGGCATGTTCAAGGTATAAGCTTGGGACAAAAAGGCACTCGTGAAAATTCTCATCCTTAAGCACATTCCTACAATGATTATACTTCAAGAAACCGAACTTGAAGAGGTGGATAGAGCCACCATTAAATCTCTGTGGAGTTCAAGACACATCAGTTGGTCTGCTTTAGAGGCCTTCAATACTTCAGGTGGCATGATAATCATGTGGAATGATCCGTCGTTATCATCAGGGGAGGTTATAAGAGGTCTCTTCTCTCTCACTTGGCTTTCTGGTATATATGGCCCTTTTTGTGCAAAAGACAAGCCTCTTTTTTGGGAATTAATAGATCTTCAATGTCTTTGCTCAGTCAATTGGATTTTTGGAGGTGACTTCAATGTATCTTGTTGGTTGCATGAGAAGTCTAGTGGTAGGCCTCAATCTTCAAGTATGAAGGCTTTCAATAGTTTCGTTGATGCCGCTGATCTCCTGGACATTCCCCTTTAAAATGGCTTATACACTTGATCTGGTTAGCGACATTACCCTATCTTCACCCTTATTGATCGGTTTTTGCTCACAGATAGTATCCTACAAAAATTTGCCTATACCACATCTAAGAGACTAGAGAGATCTACATTTGACCATTTTCCTATTCTTTTATCCATTGGCGTGGAAAATTGCCTTCGTTTTGAGAACATGTGGTTACAACATGGCGATTTCTTCCAGATTGTTGATTATTGGTGGAAGAACACTCCTCATCGAGGTTGGCTGGGTCATCGATTAATTTTGAAACTCACGGGTTTGAAGGAGGTCCTTAAAGTATGGAATCTACCTTTGGCTGCATCACCTCACAAAAGAACCAAATTGGAAGAATCTAATGACGGTCTTCCAATACATACGATGCTGTGGCGGGGTAGCCTTAAATCTTAGCTTCTCGTTATTGTTGCTAAGGAGGAAACCCTTTGGGTGCAAAAATGTAAGTCTATATGGCTCAATGAAGGTGATTTGAATACTCGCTTTTTTCATAAAGTGGTGGCTACTAAAAGAAGGCAAAATACCATCCTTGAAATTATCTCTGCCAATGGGTCAAGTATCCTCACTGAATCTGGTATTGAATCAAAGTTCCTATGTTTTTTTACTTGAATCTACACAGAAATCTTGTCTTCAAGCTCTGCCCCTTTAGATTGGACCACCTTACCAGCCCATCAATCTCAACAATTGGAATTACCCTTCGATGAACAGGAAATCTAGCATACAGTTTGCTCTCTTGGTAGAAACAAGTCTCCTAGGCCTTGATGGCTCTTCGCAGAAATAAGTTCCCTGGCTTGATGGTCTCACGACTGAATTTTATAAAAAGTCTTGGCACATCCTTAAGAACGATACATTGAGGTTTACCAGATTTTTTTTTCTTTCGAGAAAGGCATTGTTAATGCTAATTAAAATGAGTCTTTTATCTGTCTTATTCCAAAGAGATCCAATGCCTATAGAGTTGGAGATCTTAGATCGATTAGCCTCAAAACTGGAGTTTACAAAATTCTTGCTAGAGTTCTATTTGATAGAATCAAAGGGGTTGTTCCATCCATCATTTCTGAATACCAATCTACCTTTGTGGAGGGCTATAAATCCTTGATGCATCTCTGGTTGCGAATGAGCCCATTGATGATTGGAAGAGGAAGAACAAAAAAAGCATCGTGATCAAATTGGACATTGAAAAGGCTTTCAATACCGTTGATTGGGCCTTCCTTAATTCAATTCTTGGGGTTAGAGGCTTTGGAAATCTCTAGAAAAAATGGATCATGGGCTTTTTATCTACAACAAATGTTTTGATCATTATTAACGGGCGACCTCGTGGTAAAATTCGTGCTACTAGAGGTATTAGGCAACGCGACTAATGTAAGAAGGATAAGGGTTAATTGGGTAATTAGTTAATATTCTAGGTTAATTCCCTTTTTACCCCCACTTGTAATAAAGCTCTATAAATAGGAATCTTCCCCTCTTGTATCAAACACATGGTTCATTCTAATAAAAGATCCACAATATTAATTCTTGGAGAGAATTCTCTTTGTTATATCTTTAGGCTACATCAGCGACCCACGGTCTTCTTTCCTCTTCAACATTATTTTCGATACTCTTAGCAGACTTCTGACCCAAGTTGAGTTGGAAAGGAAAATACAGGGTTTTAACGTGGGCAAAGACGGTCTATCCATCAACCACCTACAATTTGTAGATGACACTATCCTTTTTTCCACATACAACACTACGTACATAAAGGGGATGTTTGACCCATCAACTTCATTAGTTGGTATTAAACAACTCAACTTCAATAGTGCTTACTATTGAAGTTTGCACATTTATCGAAGAACTTCATCTTCCTCCCTTTCTCTATTTTGCACATTTATCGAGAAACTCCAACTTACAACTTTGCAGCCCAAACAAAGGCTTTCTAACTCCAATATATTAACTCCAATCTTCATAACTCAACTCAGTACCCCAAGCACCCCCCAAAGAATTTATTTGACATCATCAGGGATTTTCAATCCTGTTTGGGTGTTTCCATCAATCACCAAAAATTTGAAATTATGGGCATTAATCTCCATCCGGTTGAAACACTGAGCATGACGGATTTATATGGATGTAAGGTTGGTTCTTTGCCTAACTTCTACCTTAGCCTCCCTTTGAATGGTAAACCGAGAACCCTTTCGTTTTGGGCTCCTATTGTTGAGAAGTTCGAGCGGAGAATCAAAGCTTGGGCCTTATCCTTCATATCCGAGGAGGTAGTTAATTCTCATTCAAGCCATGTTGAGTAATCTCCCTACTTATTACCTCTCCCTCTTTAAATTGCCTCGTAAAGTGGCACACATGATTGAAAAGCTTTATAGAGATCTCCTGTGGAAAGGGAAAGAGGATGGCTCCAATTTGCACCTTGTTCGTTGGGATAAAATTTCCTCTCCTCTCTGTCATGGTGGTTTGGGGTTTACCAATCTTAAACTGTGGAATTAAGCCCTCCTTGCCAAATGGATTTGGAGATTTAGAGTGGAAAAACATGCCTTGGGGGGAAGGCTCATTGCGGCCAAATTTGGCTCCTCCAAATGTGATAATAAAGCTGGTACATGCTCGCTGACTACTGCCAAAGGTCCATGGAAGGCCATATCTATCCTTCAACCATTGGTGAATCAACATGTTGCAGTGAAATTAGGCAACATGCGTTTTGTTTCTTTTTGGCATGATTCTTGGGTGAACTCTAACCCCTTGCACACAACCTTCCCTAACTTATTTGCTCTCTCCCTTGCTAAGGAGGATACGGTCTTTGATTCATGGAAGGAGGCTAAGTGGTGTTGGGACCTCAATCTCCGTAGGATCCTCAACAAGTTGGAAATTTGTGAATGGACTGATTTATCTCATGTGCTAACAAACTACCAACTACTTCAAACAGATGATAAATCTTCTAGAACCATTTGGGATCTTCTCCACCAAATCTCTGATTGCTTTCAAGCTCAAGATCAACTCTAAAGTATCCGTGGGTCCTTTCAAATCTATATGGTCTGATCACTTGCCTAAAAAGATTAAATTCTTTTTGTGGGAGCTATCACACAAGGCTGTTCTTACCCATGACAAGCTTCAATGCTGTTTGCTTTCTTTGGCCGTTTCCCCCCGTTGGTGCACTCTTTACAAGAAGAACAATTAGACTCAAAGCCACCTTTTCATCTTCTTCCCATTCTCTGACAGATATTGGAACGACATCCTCTATGCTTTCAATTGGTTTGTGGTCTTTCCATCTGATTTCAACAGTTTTATAGGAGGGATACCCTTTCAGAGGAAGAAAGAGCTTCTTATGGATGCATCTTATCAAAGTTTTTTTATGGTCAATGTGGATTGAAGGAAACAAGCGTATTTTCAACGACAAAGCTCAACATTTTGACCTTTTTTGAGATCACGTTATCTTCATCGGTCTCTCTTGATGTAAATTGTCTAAATACTTCCGTTCATATGGTTTTGATTCTTTTTTGTCGAATTGGAGATGTCTTTTGTAACTCCTTTGGCTCGAGACTTCCTCCCCCTCCCCTTTTTAGTAATTTCATTCATCAATGAATTGTTTCGTATAAAAAAAGAATTACAATAACCCATCGACTTAAGCGTTTGGGTCAATCGGTGATTAACGATTCTTACCATTCATGTTTTCCAACTCAATTACTTGTTTTCCATTTGACTTTATGTTCCCTTTTCATTGTTATTTTTATTATTTTTAATGGCAAAGGTTCTGTTAATTTGCTGGTTGTGTGTGCTTTTTGATCGGAGAAGATTCATTCTATTATTAATAAATATCAGCATATGACCTTTCATTGATAAAGAATGGAGTTGAATAATTTCATTTATTACATATAAAATTTATATTCCACGAACTTATGTTCTGATTCTTCTGCTAATTTGTGGACTGTCGAAGCTGGTTAAGGAAATTGTCCCATTAGGATGTTATTTTTTATTCCCTCCAACAACTTAAAGTCTTACCTGCATAAGTTGAATTGATAGGAGATAATTAAGTTTTATTTCATTCATCTTTCATTGTGAACAAGGAAGTTCATTTACTGATAAATTTTGGAGTTGAATAAGTTCATTTATTATTCATTTGAAGTTTTCTTTCCGTGAACGTTACATGTTGAGATTAGCTTTATCCTTACAAATGGCAAGTTGTCTTCAATGCCATGCATACTAATATTTTGTGCACTTCATAGGTTGGAGATATTGAAGTTGATGCTCATACAGCCAAGCGAATGCGTATGTCATCTTCCGATGCTCTACTAGATATGGTTGGAGGAGATGAGCTATCGTTGTATGGTTCAGCTCCAAATAATACGGAATCTGCTCAGGTTTTATAGAATACACTATGTGAACAAACTTTCAATCTCTCTTAACCTTTCTCTTATTTCAATTATCTTTTACCTATATGCATTGGTCTCTTTTCATCATTTCAATGAAAAGTTTTGTTTCCTTTTTTAAAATTTTTTTTTTTTTATATGGATAAATTGAAAAGAAACTAAGAGTTTCGATAAAAGCCAAAAAATGAAAGAGAGAGAAGAAATCAAGGACTACTGGTTGGTAACAAATGGATTTGTAAAATACATCTAGACTTGTTAAAATTTTAATAGTTTTAACTCTATGCCTATGCATATGAAATGCCTAGAATTTATAGTTTTGGGAGGATAAGTGGTGGGAGGATAAGACCCCACAGTTCCTTGTTTCCTAGTTATATTAGTTGTTCTCGTCAAATTATCACTCAGATGCTTTAATTCTTTCCCATTATGAGAGTTTGCCTTCTTCTCTGGGTTTCCACTATCTATTGACCAAGAGAGAAACAACAGATGTTTTGGACTTGCCAAAGGTTTCTCTTGTAATTTTTGTTTCAGTATTTGGTTTGTTCCTCCCATCGGTCGTTTCCATTTTTCCCCAAGTGTGGAAGATTTAAGTTTTTGCATGAAAGGGTTAATATCTTCACTCATCGAGAGAGGCTCTCTTCGAGTGATGTTTGATCTCGTGTTGGATTCTAAGTTTATTTTTGGCTTTTTGTGACCAAGCCCTTTTGTAATTATGCTATTAGTCTTATTTTACTTCATTGGAGCCCTTTGGTATAGTTCGATTCCCCTTTGTGGGCTTCTATTTGTTGTAGGCCCAAGCATTTATTTTTTCTCTTAATGAAAGTATGGTTTTTTCATTAGAGAAAAACATAGTTGTTGATACAGGGGTATTCTCGAGCTTTTGCCTAAGCTTTGAGATGTTCTTTTAGGACAGGTTTCATCCGTTGTACATGAGTTCCAACCGGCCGCCTCCTATCAGAGGAACTACCAGAATGAATAACCGCCCTAAATGAAAATTAGGAGGGAATGTAGTTGCTAGTAAATGTCTTTCTTTGGTGAGCCTTAATGGTATTAAAAATAAGAGCATAGAGGATTCAAAACAAGCAAATGCATATGTGATAGGTAAATGGGAGGAGGTTGACATCCCTTTTTGGGTCCATTCGTCAATTCTATCTTATAATTATTAGGGAAATTTTCACAGATAGAAAAATTTTCAAAGTATTTACAGAAAATAGCAAAAAAAATACTGATAGACATTGATAGACTTCTATCAGCATCTATCAGTGATAGACTTTTATAATTTCTATTACTGATAGACCCTGATAGACTTCTATCAGCGTCTATCGTAACTATCTAAAAAATTTTCTATTTTGTGTAAATAGTTTTCCTTGTTTTTCTATTTTTAAAAATCCCCCTAATTATTATTTGTGAGTGTTAGATGATATAATATTAAATTTGCCTTCACTCATCAGCTTAAACTTTTGGGTCAATTGGTGATTGTGATTTAACATGGTATTAGAGCAGGGGGTCCAGGGAGGTCCTGTGTTCAAATCCCTGCTTTATTCTTTCTGCTCCAATTAAAATTGATTTTCACTTGTTGGGTATTCTTCATATTTCAAGCTCAGAAGTGAGGGGGAGTGTTAGATAATATAATATTAAATTTACCTTCACCCATCGGCTTAAGCTTTTGGGTCAATTGATGATTTAACCGTGAGTGATTACCAACAACTGGTGCTCAACTCTATAATTTTTTTGTTATTGTGTTGGTTAGGCCACATTTTTTAATTTTTTTTTTTATTTTTATGTGTGTGTTTTTAAAAATAAATGAACCTCCAATAGTCCCTTTCTCTAATAAGCTTCTCTAAACATCAATCAACATAACAGTAAAAGCAAGACAGGATCACCATTGTGGCATTATCTTAACTTGTGGATGTTTTTATTTTTCATCTTGGATTACCATTGAGAAAATTTGAGAGCTTAGTACAGTGAACGCATCCCTTTGTCTGATTAATCCATCTTTGCAAGAACCGATTATCTCCAAAACAGCTAGAAAAGAATCTCCCATCCAAGCTATCATAGGCCTTTCTCAAGGACGTTTCCACTCCATGTTCTGGTTAACTCATTTTCCTAGTCTTAAATTATTTGTGTGGCATCTCTTACAAGAGAGACAAAACACGTGTAGTACATTTTAGATGTGGCTGAAACAGTCTAGATAACATCATGCTTGAGATTGTACCTATAAATTCAGATTCTCTTATCCAATTTCTAGAAGTTTGTTTTCTTTGAACATTTTGTATCTCTTCACTGTTTCAATTCAGTGATAAGTTTTTATATTGTTAAAATTATGTGTGTTAGTTTCTCTAGTGAATCATTACCAAAAAAATAAATGGACATTATTTTGCTCAGAATATTGTAGTCTTATTTGAATTCTTGCTGCTTGGCTTTCTGACATCAGGTCCTTTTCATGCACGCAGAAAAGTTTCTCTTTTGCTGTTAGAGATTCATTGATCAATATTGGGCCTCTGAAGGATTTCTCCTACGGTTTAAGAATTAATGCAGATCCTAATGCGACTGGAATTGCCAAACAAAGCAATTATGAACTCGTTAGTAACCTAATAACCATGTGAGTTTTAGTATTTAGTTAAAGTAGATAATTTCTCTTCTTTGATAATTTGTAGGTTTGTTGTTCGGGTCATGGTAAAAATGGGGCATTATGCATTCTTCGCCAGTCAATTCGCCCTGAAATGATTACTGAGGTATATTATTGTGATATACAGAAGTTGAATTGTACGTGGTTCCTAACCAATAATAAATCTATTCTTATAAAAGATTAATAATTAATTTATGCACTGATTAATCTGTATAGCCCAACTGACAGTATTGGTTTGCGAAGAAGCATTAAGGATGCTCTCTTCTCTCCCCTTTCACATTCCAACCCCGCCAATTCTATTGCCAGTCAACCCCTAGTTTATTTATTTTTTATTTTAAAATCATTTTCTCTTTTCAGACTCTAACTTCATTTGATTCCTTCATATAAGACTTTTCTCTTCATTGGTTGTTTCTTCAGAATCCTCTCTTTGTTTCGTCTTAGTTTGGTGCTCTCTCGGTGATATCATGGAAGTTAAGAGCTGTTGTGTTTGGAATACTTGCTTGTGTATTCGTTAAGGAAAGAATGGTCTCTTCCTTGAAGATATGACCAATAAGAATAGAATCTTCCTTTCTTTACCTCTTTGTCGTTGGTTTGAAGCAGTTTTGGTGGAATTACTGCATCCTCCTGAAAGTTCGAGATGATTTTGGGACTGTTTGGTTGTCTAAGTTTTATGCTTCTCATGGTTGGTTCTTTGAATGTGCTGTTTGA

mRNA sequence

ATGAGTTGGGACTTTAAGAACATTAATATGGATAAAGAAGATTCAGAACGAGACTACTACTGTAGGTGGTTCTGCAAGGTTGAGGCTCTTCAACCTCAGCCAAAGGGAAGACAACCATGCCAAGACGAAACAGAACAAATGATTGAAGCCCTGGAAAATGGAATAGGCGAATCAGATCCCCATGACTCAAAATCAAAAGAACCAGGAGTTTTTAGTTTACAAGTTGAAGCAAAGAACAGAAAGAAAGATGCTGGAAGTAAGGCCGAGGTCAAGGTTAGGGTCTACGCCAAGGTTGAGGTCGGGGTTGAGGTCGAGGTTGAGGCCAGGGTCGAGACCGAGGCTGAGGTTGGGGCCGAAGAGATGGACAATAGAAGGAATGGGCAAATCAGCAATTCAGGAGAAGGACTAGGACAAACCAAAGAAGCAGAAGGTTACTTAGGTGGTAAAAAAAAAAAATACCGGCCGACCATGCCCGCGTGGACCAAGGCCAACCGTACCGAAAACAGGAGGATGAGTTTTGCTGCTTATAGAATGATGCACTGGCCTACCGGCATCGAAAACTGCGATTCAGGCTTTATCACCCATTCTCGCGCCGACTTCGTCCCCGGTGTCACATCTCACACCGACGACCTTGACTCCGACTGGCAGCCCCGCCGAGAAATTGGTCCAGTTCCCAATCTCGTTGTCACCGCCGGCAATGTCCTCGAGGTATATGTTGTTAGGGTTCAAGAAGAAGGTGGAAGAGAATCAAGAAGTTCAGGAGAAGTCAGACGCGGTGGCATTATGGATGGAGTCTCTGGGGCCTCGCTCGAGCTTGTTTGCCACTACAGGTTGCATGGTAATGTTGAGTCCATGGCGATTTTGTCTAGTAGAGGAGGTGATGGTTCCAAGAAGAGAGATTCGATTATATTAGTCTTTCAAGAAGCAAAAATTTCAGTGCTAGAGTTTGATGATTCTATCCATAGTCTCCGTACAAGCTCAATGCATTGCTTTGAGGGCCCTCAATGGCTTCATTTGAAAAGAGGTCGGGAATCATTTGCAAGAGGTCCAGTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGAGGAGTTCTTGTTTATGGTTTGCAAATGATAATACTTAAGGCTTCTCAGGCTGGTTCTAGTTTGGTTGTGGACGATGAAGCTTTTGGTAACACTGGTGCAATTTCTGCTCGAGTTGAATCATCATACCTAATTAACCTAAGGGATTTGGATGTGAAGCATGTAAAGGATTTTGTATTTGTACATGGTTATATTGAACCTGTGATGGTGATCCTTCATGAGCAGGAGCTTACTTGGGCTGGCCGTGTATCTTGGAAGCATCACACGTGTATGATTTCTGCGCTAAGTATTAGCACAACCTTGAAGCAGCATCCTCTAATATGGTCTGCCAACAACCTCCCTCATGATGCTTACAAGCTACTTGCGGTGCCATCACCAATTGGTGGTGTACTTGTCGTCAGTGCAAATAGTATTTACTATCACAGTCAGTCAGCTTCATGCATGTTGGCTTTGAATAATTATGCTGTTTCTGCCGATAGCAGTCAAGACATGCCTAGATCAAATTTCAATGTGGAATTGGATGCTGCCAGTGCTACATGGTTGGTAAATGATGTGGCCTTGCTGTCAACCAAGACTGGGGAGCTATTATTGCTGGCACTTGTCTATGATGGACGGGTTGTGCAGAGACTTGATCTTTCAAAGTCTAAAGCTTCAGTACTTACATCGGGCATTGCATCAATTGGAAATTCATTATTTTTTCTGGGCAGTCGATTGGGAGATAGTTTACTTGTGCAGTTTAGTTGTGGAGTGGGATCTTCAGGATTGGCATCCAGTTTAAAGGACGAGGTTGGAGATATTGAAGTTGATGCTCATACAGCCAAGCGAATGCGTATGTCATCTTCCGATGCTCTACTAGATATGGTTGGAGGAGATGAGCTATCGTTGTATGGTTCAGCTCCAAATAATACGGAATCTGCTCAGAAAAGTTTCTCTTTTGCTGTTAGAGATTCATTGATCAATATTGGGCCTCTGAAGGATTTCTCCTACGGTTTAAGAATTAATGCAGATCCTAATGCGACTGGAATTGCCAAACAAAGCAATTATGAACTCGTTTGTTGTTCGGGTCATGGTAAAAATGGGGCATTATGCATTCTTCGCCAGTCAATTCGCCCTGAAATGATTACTGAGTTTTGGTGGAATTACTGCATCCTCCTGAAAGTTCGAGATGATTTTGGGACTGTTTGGTTGTCTAAGTTTTATGCTTCTCATGGTTGGTTCTTTGAATGTGCTGTTTGA

Coding sequence (CDS)

ATGAGTTGGGACTTTAAGAACATTAATATGGATAAAGAAGATTCAGAACGAGACTACTACTGTAGGTGGTTCTGCAAGGTTGAGGCTCTTCAACCTCAGCCAAAGGGAAGACAACCATGCCAAGACGAAACAGAACAAATGATTGAAGCCCTGGAAAATGGAATAGGCGAATCAGATCCCCATGACTCAAAATCAAAAGAACCAGGAGTTTTTAGTTTACAAGTTGAAGCAAAGAACAGAAAGAAAGATGCTGGAAGTAAGGCCGAGGTCAAGGTTAGGGTCTACGCCAAGGTTGAGGTCGGGGTTGAGGTCGAGGTTGAGGCCAGGGTCGAGACCGAGGCTGAGGTTGGGGCCGAAGAGATGGACAATAGAAGGAATGGGCAAATCAGCAATTCAGGAGAAGGACTAGGACAAACCAAAGAAGCAGAAGGTTACTTAGGTGGTAAAAAAAAAAAATACCGGCCGACCATGCCCGCGTGGACCAAGGCCAACCGTACCGAAAACAGGAGGATGAGTTTTGCTGCTTATAGAATGATGCACTGGCCTACCGGCATCGAAAACTGCGATTCAGGCTTTATCACCCATTCTCGCGCCGACTTCGTCCCCGGTGTCACATCTCACACCGACGACCTTGACTCCGACTGGCAGCCCCGCCGAGAAATTGGTCCAGTTCCCAATCTCGTTGTCACCGCCGGCAATGTCCTCGAGGTATATGTTGTTAGGGTTCAAGAAGAAGGTGGAAGAGAATCAAGAAGTTCAGGAGAAGTCAGACGCGGTGGCATTATGGATGGAGTCTCTGGGGCCTCGCTCGAGCTTGTTTGCCACTACAGGTTGCATGGTAATGTTGAGTCCATGGCGATTTTGTCTAGTAGAGGAGGTGATGGTTCCAAGAAGAGAGATTCGATTATATTAGTCTTTCAAGAAGCAAAAATTTCAGTGCTAGAGTTTGATGATTCTATCCATAGTCTCCGTACAAGCTCAATGCATTGCTTTGAGGGCCCTCAATGGCTTCATTTGAAAAGAGGTCGGGAATCATTTGCAAGAGGTCCAGTGGTAAAGGTTGATCCTCAAGGCAGGTGTGGAGGAGTTCTTGTTTATGGTTTGCAAATGATAATACTTAAGGCTTCTCAGGCTGGTTCTAGTTTGGTTGTGGACGATGAAGCTTTTGGTAACACTGGTGCAATTTCTGCTCGAGTTGAATCATCATACCTAATTAACCTAAGGGATTTGGATGTGAAGCATGTAAAGGATTTTGTATTTGTACATGGTTATATTGAACCTGTGATGGTGATCCTTCATGAGCAGGAGCTTACTTGGGCTGGCCGTGTATCTTGGAAGCATCACACGTGTATGATTTCTGCGCTAAGTATTAGCACAACCTTGAAGCAGCATCCTCTAATATGGTCTGCCAACAACCTCCCTCATGATGCTTACAAGCTACTTGCGGTGCCATCACCAATTGGTGGTGTACTTGTCGTCAGTGCAAATAGTATTTACTATCACAGTCAGTCAGCTTCATGCATGTTGGCTTTGAATAATTATGCTGTTTCTGCCGATAGCAGTCAAGACATGCCTAGATCAAATTTCAATGTGGAATTGGATGCTGCCAGTGCTACATGGTTGGTAAATGATGTGGCCTTGCTGTCAACCAAGACTGGGGAGCTATTATTGCTGGCACTTGTCTATGATGGACGGGTTGTGCAGAGACTTGATCTTTCAAAGTCTAAAGCTTCAGTACTTACATCGGGCATTGCATCAATTGGAAATTCATTATTTTTTCTGGGCAGTCGATTGGGAGATAGTTTACTTGTGCAGTTTAGTTGTGGAGTGGGATCTTCAGGATTGGCATCCAGTTTAAAGGACGAGGTTGGAGATATTGAAGTTGATGCTCATACAGCCAAGCGAATGCGTATGTCATCTTCCGATGCTCTACTAGATATGGTTGGAGGAGATGAGCTATCGTTGTATGGTTCAGCTCCAAATAATACGGAATCTGCTCAGAAAAGTTTCTCTTTTGCTGTTAGAGATTCATTGATCAATATTGGGCCTCTGAAGGATTTCTCCTACGGTTTAAGAATTAATGCAGATCCTAATGCGACTGGAATTGCCAAACAAAGCAATTATGAACTCGTTTGTTGTTCGGGTCATGGTAAAAATGGGGCATTATGCATTCTTCGCCAGTCAATTCGCCCTGAAATGATTACTGAGTTTTGGTGGAATTACTGCATCCTCCTGAAAGTTCGAGATGATTTTGGGACTGTTTGGTTGTCTAAGTTTTATGCTTCTCATGGTTGGTTCTTTGAATGTGCTGTTTGA

Protein sequence

MSWDFKNINMDKEDSERDYYCRWFCKVEALQPQPKGRQPCQDETEQMIEALENGIGESDPHDSKSKEPGVFSLQVEAKNRKKDAGSKAEVKVRVYAKVEVGVEVEVEARVETEAEVGAEEMDNRRNGQISNSGEGLGQTKEAEGYLGGKKKKYRPTMPAWTKANRTENRRMSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVTAGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGGDELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEFWWNYCILLKVRDDFGTVWLSKFYASHGWFFECAV
Homology
BLAST of HG10011827 vs. NCBI nr
Match: XP_038887722.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 555/566 (98.06%), Postives = 558/566 (98.59%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDSGFITHS ADFVPGV SH DDLDSDWQPRREIGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSGFITHSPADFVPGVASHADDLDSDWQPRREIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRVQEEGGRES+SSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVQEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDS HSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLVVSANSI+YHSQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAH AKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHIAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. NCBI nr
Match: KAA0049896.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 548/566 (96.82%), Postives = 556/566 (98.23%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDS FITHSRADFVP VTSH+DDLDSDW PRR+IGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAFITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRV EEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVLEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLV+SANSI+Y+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVL SGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLASGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLAS+LKDE GDIEVDAHTAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSA NNTESAQK+FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAANNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. NCBI nr
Match: XP_008441850.1 (PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucumis melo])

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 548/566 (96.82%), Postives = 556/566 (98.23%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDS FITHSRADFVP VTSH+DDLDSDW PRR+IGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAFITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRV EEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVLEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLV+SANSI+Y+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVL SGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLASGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLAS+LKDE GDIEVDAHTAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSA NNTESAQK+FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAANNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. NCBI nr
Match: XP_011648998.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucumis sativus] >KGN61267.1 hypothetical protein Csa_006291 [Cucumis sativus])

HSP 1 Score: 1081.6 bits (2796), Expect = 0.0e+00
Identity = 543/566 (95.94%), Postives = 555/566 (98.06%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDS +ITHSRADFVP VTSH+DDLDSDW PRR+IGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRV EEGGRES+SSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDS HSLRTSSMHCF+GPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           +NLPHDAYKLLAVPSPIGGVLV+SANSI+Y+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 SNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLAS+LKDE GDIEVDAHTAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSA NNTESAQK FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. NCBI nr
Match: XP_023520837.1 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520838.1 cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520839.1 cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1057.4 bits (2733), Expect = 6.1e-305
Identity = 537/566 (94.88%), Postives = 548/566 (96.82%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMH PTGIENCDSGFITHSRADFVP VTSH DDL+SDW PRREIGPVPNLVVT
Sbjct: 1   MSFAAYRMMHSPTGIENCDSGFITHSRADFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRVQE+GG+ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM ILSS
Sbjct: 61  AGNVLEVYVVRVQEDGGKESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMVILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEA GN GA SARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGNVGASSARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLVVSANSI+YHSQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA ATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361 VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDA T+KRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTSKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYG   NNTESAQK+FSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481 DELSLYG-VSNNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSG+GKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGNGKNGALCILRQSIRPEMITE 565

BLAST of HG10011827 vs. ExPASy Swiss-Prot
Match: Q9FGR0 (Cleavage and polyadenylation specificity factor subunit 1 OS=Arabidopsis thaliana OX=3702 GN=CPSF160 PE=1 SV=2)

HSP 1 Score: 869.0 bits (2244), Expect = 4.0e-251
Identity = 428/571 (74.96%), Postives = 496/571 (86.87%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADF---VPGVTSHTDDLDSDW-QPRREIGPVPN 230
           MSFAAY+MMHWPTG+ENC SG+ITHS +D    +P V+ H DD++++W  P+R IGP+PN
Sbjct: 1   MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVH-DDIEAEWPNPKRGIGPLPN 60

Query: 231 LVVTAGNVLEVYVVRVQEEGG-RESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM 290
           +V+TA N+LEVY+VR QEEG  +E R+    +RGG+MDGV G SLELVCHYRLHGNVES+
Sbjct: 61  VVITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESI 120

Query: 291 AILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRES 350
           A+L   GG+ SK RDSIIL F++AKISVLEFDDSIHSLR +SMHCFEGP WLHLKRGRES
Sbjct: 121 AVLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRES 180

Query: 351 FARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLI 410
           F RGP+VKVDPQGRCGGVLVYGLQMIILK SQ GS LV DD+AF + G +SARVESSY+I
Sbjct: 181 FPRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYII 240

Query: 411 NLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHP 470
           NLRDL++KHVKDFVF+HGYIEPV+VIL E+E TWAGRVSWKHHTC++SALSI++TLKQHP
Sbjct: 241 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 300

Query: 471 LIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMP 530
           +IWSA NLPHDAYKLLAVPSPIGGVLV+ AN+I+YHSQSASC LALNNYA SADSSQ++P
Sbjct: 301 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 360

Query: 531 RSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIA 590
            SNF+VELDAA  TW+ NDVALLSTK+GELLLL L+YDGR VQRLDLSKSKASVL S I 
Sbjct: 361 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 420

Query: 591 SIGNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALL 650
           S+GNSLFFLGSRLGDSLLVQFSC  G +     L+DE  DIE + H AKR+RM +SD   
Sbjct: 421 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQ 480

Query: 651 DMVGGDELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQ 710
           D +G +ELSL+GS PNN++SAQKSFSFAVRDSL+N+GP+KDF+YGLRINAD NATG++KQ
Sbjct: 481 DTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQ 540

Query: 711 SNYELVCCSGHGKNGALCILRQSIRPEMITE 737
           SNYELVCCSGHGKNGALC+LRQSIRPEMITE
Sbjct: 541 SNYELVCCSGHGKNGALCVLRQSIRPEMITE 569

BLAST of HG10011827 vs. ExPASy Swiss-Prot
Match: Q7XWP1 (Probable cleavage and polyadenylation specificity factor subunit 1 OS=Oryza sativa subsp. japonica OX=39947 GN=Os04g0252200 PE=3 SV=2)

HSP 1 Score: 708.4 bits (1827), Expect = 9.0e-203
Identity = 367/580 (63.28%), Postives = 437/580 (75.34%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFV---------PGVTSHTDDLDSDWQPRREI 230
           MS+AAY+MMHWPTG+++C +GF+THS +D           PG     D   +  +PRR +
Sbjct: 1   MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRR-L 60

Query: 231 GPVPNLVVTAGNVLEVYVVRVQ---EEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRL 290
           GP PNLVV A NVLEVY VR +   E+GG  ++ S     G ++DG+SGA LELVC+YRL
Sbjct: 61  GPSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSS--SGAVLDGISGARLELVCYYRL 120

Query: 291 HGNVESMAILSSRGGDGSK-KRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWL 350
           HGN+ESM +LS    DG++ +R +I L F++AKI+ LEFDD+IH LRTSSMHCFEGP+W 
Sbjct: 121 HGNIESMTVLS----DGAENRRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 180

Query: 351 HLKRGRESFARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISA 410
           HLKRGRESFA GPV+K DP GRCG  L YGLQMIILKA+Q G SLV +DE      + + 
Sbjct: 181 HLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTAV 240

Query: 411 RVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSI 470
            +ESSYLI+LR LD+ HVKDF FVHGYIEPV+VILHEQE TWAGR+  KHHTCMISA SI
Sbjct: 241 CIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSI 300

Query: 471 STTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVS 530
           S TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ ANSI+YHSQS SC L LNN++  
Sbjct: 301 SMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSH 360

Query: 531 ADSSQDMPRSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKA 590
            D S ++ +SNF VELDAA ATWL ND+ + STK GE+LLL +VYDGRVVQRLDL KSKA
Sbjct: 361 PDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKA 420

Query: 591 SVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMR 650
           SVL+S + SIGNS FFLGSRLGDSLLVQFS     S L     +   DIE D   +KR++
Sbjct: 421 SVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLK 480

Query: 651 MSSSDALLDMVGGDELSLYG-SAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINAD 710
              SD L D+   +ELS     APN+ ESAQK  S+ VRD+LIN+GPLKDFSYGLR NAD
Sbjct: 481 RIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANAD 540

Query: 711 PNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITE 737
           PNA G AKQSNYELVCCSGHGKNG+L +L+QSIRP++ITE
Sbjct: 541 PNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITE 572

BLAST of HG10011827 vs. ExPASy Swiss-Prot
Match: Q10570 (Cleavage and polyadenylation specificity factor subunit 1 OS=Homo sapiens OX=9606 GN=CPSF1 PE=1 SV=2)

HSP 1 Score: 282.0 bits (720), Expect = 2.1e-74
Identity = 190/521 (36.47%), Postives = 288/521 (55.28%), Query Frame = 0

Query: 226 NLVVTAGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM 285
           NLVV   + L VY +    E   ++  S E +            LEL   +   GNV SM
Sbjct: 29  NLVVAGTSQLYVYRLNRDAEALTKNDRSTEGK-------AHREKLELAASFSFFGNVMSM 88

Query: 286 AILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRES 345
           A +   G     KRD+++L F++AK+SV+E+D   H L+T S+H FE P+   L+ G   
Sbjct: 89  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 148

Query: 346 FARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLI 405
               P V+VDP GRC  +LVYG ++++L   +   SL  + E     G  S+ +  SY+I
Sbjct: 149 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRR--ESLAEEHEGLVGEGQRSSFL-PSYII 208

Query: 406 NLRDLDVK--HVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQ 465
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 209 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 268

Query: 466 HPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCM-LALNNYAVSADSSQ 525
           HP+IWS  +LP D  + LAVP PIGGV+V + NS+ Y +QS     +ALN+      +  
Sbjct: 269 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 328

Query: 526 DMPRSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDG-RVVQRLDLSKSKASVLT 585
              +    + LD A AT++  D  ++S K GE+ +L L+ DG R V+     K+ ASVLT
Sbjct: 329 LRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 388

Query: 586 SGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRM----R 645
           + + ++     FLGSRLG+SLL++++  +      +S   E  D E      KR+     
Sbjct: 389 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEP--PASAVREAADKEEPPSKKKRVDATAG 448

Query: 646 MSSSDALLDMVGGDELSLYGS-APNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINAD 705
            S++   +     DE+ +YGS A + T+ A  ++SF V DS++NIGP  + + G      
Sbjct: 449 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGEPAFLS 508

Query: 706 PNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEF 738
                 + + + E+V CSGHGKNGAL +L++SIRP+++T F
Sbjct: 509 EEFQN-SPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTF 527

BLAST of HG10011827 vs. ExPASy Swiss-Prot
Match: Q9EPU4 (Cleavage and polyadenylation specificity factor subunit 1 OS=Mus musculus OX=10090 GN=Cpsf1 PE=1 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 2.1e-74
Identity = 196/579 (33.85%), Postives = 305/579 (52.68%), Query Frame = 0

Query: 173 FAAYRMMHWPTGIE-NCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVTA 232
           +A Y+  H PTG+E      F  +S  + V   TS                         
Sbjct: 2   YAVYKQAHPPTGLEFTMYCNFFNNSERNLVVAGTS------------------------- 61

Query: 233 GNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSSR 292
               ++YV R+  +    +++ G        +      LELV  +   GNV SMA +   
Sbjct: 62  ----QLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSMASVQLA 121

Query: 293 GGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGPV 352
           G     KRD+++L F++AK+SV+E+D   H L+T S+H FE P+   L+ G       P 
Sbjct: 122 GA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPR 181

Query: 353 VKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDLD 412
           V+VDP GRC  +L+YG ++++L   +   SL  + E     G  S+ +  SY+I++R LD
Sbjct: 182 VRVDPDGRCAAMLIYGTRLVVLPFRR--ESLAEEHEGLMGEGQRSSFL-PSYIIDVRALD 241

Query: 413 VK--HVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 472
            K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS
Sbjct: 242 EKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWS 301

Query: 473 ANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCM-LALNNYAVSADSSQDMPRSN 532
             +LP D  + LAVP PIGGV++ + NS+ Y +QS     +ALN+      +     +  
Sbjct: 302 LTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEG 361

Query: 533 FNVELDAASATWLVNDVALLSTKTGELLLLALVYDG-RVVQRLDLSKSKASVLTSGIASI 592
             + LD A A ++  D  ++S K GE+ +L L+ DG R V+     K+ ASVLT+ + ++
Sbjct: 362 VRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 421

Query: 593 GNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDM 652
                FLGSRLG+SLL++++  +     ASS++ E  D E      KR+     +  +  
Sbjct: 422 EPGYLFLGSRLGNSLLLKYTEKL-QEPPASSVR-EAADKEEPPSKKKRV-----EPAVGW 481

Query: 653 VGG--------DELSLYGS-APNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPN 712
            GG        DE+ +YGS A + T+ A  ++SF V DS++NIGP  + + G        
Sbjct: 482 TGGKTVPQDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEE 526

Query: 713 ATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEF 738
               + + + E+V CSG+GKNGAL +L++SIRP+++T F
Sbjct: 542 FQN-SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 526

BLAST of HG10011827 vs. ExPASy Swiss-Prot
Match: Q10569 (Cleavage and polyadenylation specificity factor subunit 1 OS=Bos taurus OX=9913 GN=CPSF1 PE=1 SV=1)

HSP 1 Score: 281.6 bits (719), Expect = 2.7e-74
Identity = 194/574 (33.80%), Postives = 302/574 (52.61%), Query Frame = 0

Query: 173 FAAYRMMHWPTGIE-NCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVTA 232
           +A Y+  H PTG+E +    F  +S  + V   TS                         
Sbjct: 2   YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTS------------------------- 61

Query: 233 GNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSSR 292
               ++YV R+  +   E+ +  +    G         LELV  +   GNV SMA +   
Sbjct: 62  ----QLYVYRLNRDS--EAPTKNDRSTDGKAHREHREKLELVASFSFFGNVMSMASVQLA 121

Query: 293 GGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGPV 352
           G     KRD+++L F++AK+SV+E+D   H L+T S+H FE P+   L+ G       P 
Sbjct: 122 GA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPR 181

Query: 353 VKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDLD 412
           V+VDP GRC  +L+YG ++++L   +   SL  + E     G  S+ +  SY+I++R LD
Sbjct: 182 VRVDPDGRCAAMLIYGTRLVVLPFRR--ESLAEEHEGLVGEGQRSSFL-PSYIIDVRALD 241

Query: 413 VK--HVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 472
            K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS
Sbjct: 242 EKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWS 301

Query: 473 ANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCM-LALNNYAVSADSSQDMPRSN 532
             +LP D  + LAVP PIGGV++ + NS+ Y +QS     +ALN+      +     +  
Sbjct: 302 LTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEG 361

Query: 533 FNVELDAASATWLVNDVALLSTKTGELLLLALVYDG-RVVQRLDLSKSKASVLTSGIASI 592
             + LD A A ++  D  ++S K GE+ +L L+ DG R V+     K+ ASVLT+ + ++
Sbjct: 362 VRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 421

Query: 593 GNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMS---SSDAL 652
                FLGSRLG+SLL++++  +      +S   E  D E      KR+  +   S    
Sbjct: 422 EPGYLFLGSRLGNSLLLKYTEKLQEP--PASTAREAADKEEPPSKKKRVDATTGWSGSKS 481

Query: 653 LDMVGGDELSLYGS-APNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIA 712
           +     DE+ +YGS A + T+ A  ++SF V DS++NIGP  + + G            +
Sbjct: 482 VPQDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQN-S 529

Query: 713 KQSNYELVCCSGHGKNGALCILRQSIRPEMITEF 738
            + + E+V CSG+GKNGAL +L++SIRP+++T F
Sbjct: 542 PEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 529

BLAST of HG10011827 vs. ExPASy TrEMBL
Match: A0A5A7U6U4 (Cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold13G00480 PE=4 SV=1)

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 548/566 (96.82%), Postives = 556/566 (98.23%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDS FITHSRADFVP VTSH+DDLDSDW PRR+IGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAFITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRV EEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVLEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLV+SANSI+Y+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVL SGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLASGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLAS+LKDE GDIEVDAHTAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSA NNTESAQK+FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAANNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. ExPASy TrEMBL
Match: A0A1S3B4E8 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485885 PE=4 SV=1)

HSP 1 Score: 1088.6 bits (2814), Expect = 0.0e+00
Identity = 548/566 (96.82%), Postives = 556/566 (98.23%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDS FITHSRADFVP VTSH+DDLDSDW PRR+IGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAFITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRV EEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVLEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLV+SANSI+Y+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVL SGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLASGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLAS+LKDE GDIEVDAHTAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSA NNTESAQK+FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAANNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. ExPASy TrEMBL
Match: A0A0A0LKI9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074280 PE=4 SV=1)

HSP 1 Score: 1081.6 bits (2796), Expect = 0.0e+00
Identity = 543/566 (95.94%), Postives = 555/566 (98.06%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMHWPTGIENCDS +ITHSRADFVP VTSH+DDLDSDW PRR+IGPVPNLVVT
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTSHSDDLDSDWHPRRDIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRV EEGGRES+SSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS
Sbjct: 61  AGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVFQEAKISVLEFDDS HSLRTSSMHCF+GPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEAFGNTGAISARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           +NLPHDAYKLLAVPSPIGGVLV+SANSI+Y+SQSASCMLALNNYAVSADSSQDMPRSNFN
Sbjct: 301 SNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA+ATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS
Sbjct: 361 VELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLAS+LKDE GDIEVDAHTAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEGGDIEVDAHTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYGSA NNTESAQK FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL
Sbjct: 481 DELSLYGSAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSGHGKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGHGKNGALCILRQSIRPEMITE 566

BLAST of HG10011827 vs. ExPASy TrEMBL
Match: A0A6J1I7X9 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472174 PE=4 SV=1)

HSP 1 Score: 1056.2 bits (2730), Expect = 6.5e-305
Identity = 537/566 (94.88%), Postives = 547/566 (96.64%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMH PTGIENCDSGFITHSRADFVP VTSH DDL+SDW PRREIGPVPNLVVT
Sbjct: 1   MSFAAYRMMHCPTGIENCDSGFITHSRADFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRVQEEGG+ESRSSGEVRRGGIMDG+SGASLELVCHYRLHGNVESM ILSS
Sbjct: 61  AGNVLEVYVVRVQEEGGKESRSSGEVRRGGIMDGLSGASLELVCHYRLHGNVESMVILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEA G  GA SARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGKIGASSARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLVVSANSI+YHSQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVVSANSIHYHSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA ATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361 VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDA TAKRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTAKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYG   NNTESAQK+FSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481 DELSLYG-VSNNTESAQKNFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSG+GKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGNGKNGALCILRQSIRPEMITE 565

BLAST of HG10011827 vs. ExPASy TrEMBL
Match: A0A6J1EY93 (cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439584 PE=4 SV=1)

HSP 1 Score: 1053.5 bits (2723), Expect = 4.2e-304
Identity = 536/566 (94.70%), Postives = 547/566 (96.64%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADFVPGVTSHTDDLDSDWQPRREIGPVPNLVVT 230
           MSFAAYRMMH PTGIENCDSGFITHSR+DFVP VTSH DDL+SDW PRREIGPVPNLVVT
Sbjct: 1   MSFAAYRMMHCPTGIENCDSGFITHSRSDFVPRVTSHADDLESDWPPRREIGPVPNLVVT 60

Query: 231 AGNVLEVYVVRVQEEGGRESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILSS 290
           AGNVLEVYVVRVQE+GG+ESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM ILSS
Sbjct: 61  AGNVLEVYVVRVQEDGGKESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMVILSS 120

Query: 291 RGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 350
           RGGDGSKKRDSIILVF+EAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP
Sbjct: 121 RGGDGSKKRDSIILVFKEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRESFARGP 180

Query: 351 VVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLINLRDL 410
           VVKVDPQGRCGGVLVYGLQMIILKASQAGS LVVDDEA GN GA SARVESSYLINLRDL
Sbjct: 181 VVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEACGNVGASSARVESSYLINLRDL 240

Query: 411 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 470
           DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV+WKHHTCMISALSISTTLKQHPLIWSA
Sbjct: 241 DVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVAWKHHTCMISALSISTTLKQHPLIWSA 300

Query: 471 NNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMPRSNFN 530
           NNLPHDAYKLLAVPSPIGGVLVVSANSI+Y SQSASCMLALNNYAVS DSSQDMPRSNFN
Sbjct: 301 NNLPHDAYKLLAVPSPIGGVLVVSANSIHYLSQSASCMLALNNYAVSPDSSQDMPRSNFN 360

Query: 531 VELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGNS 590
           VELDAA ATWL+NDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG+S
Sbjct: 361 VELDAAHATWLLNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGSS 420

Query: 591 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALLDMVGG 650
           LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDA T+KRMR SSSDAL DMVGG
Sbjct: 421 LFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAPTSKRMRRSSSDALQDMVGG 480

Query: 651 DELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYEL 710
           DELSLYG   NNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINAD NATGIAKQSNYEL
Sbjct: 481 DELSLYG-VSNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADANATGIAKQSNYEL 540

Query: 711 VCCSGHGKNGALCILRQSIRPEMITE 737
           VCCSG+GKNGALCILRQSIRPEMITE
Sbjct: 541 VCCSGNGKNGALCILRQSIRPEMITE 565

BLAST of HG10011827 vs. TAIR 10
Match: AT5G51660.1 (cleavage and polyadenylation specificity factor 160 )

HSP 1 Score: 869.0 bits (2244), Expect = 2.8e-252
Identity = 428/571 (74.96%), Postives = 496/571 (86.87%), Query Frame = 0

Query: 171 MSFAAYRMMHWPTGIENCDSGFITHSRADF---VPGVTSHTDDLDSDW-QPRREIGPVPN 230
           MSFAAY+MMHWPTG+ENC SG+ITHS +D    +P V+ H DD++++W  P+R IGP+PN
Sbjct: 1   MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVH-DDIEAEWPNPKRGIGPLPN 60

Query: 231 LVVTAGNVLEVYVVRVQEEGG-RESRSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESM 290
           +V+TA N+LEVY+VR QEEG  +E R+    +RGG+MDGV G SLELVCHYRLHGNVES+
Sbjct: 61  VVITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESI 120

Query: 291 AILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWLHLKRGRES 350
           A+L   GG+ SK RDSIIL F++AKISVLEFDDSIHSLR +SMHCFEGP WLHLKRGRES
Sbjct: 121 AVLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRES 180

Query: 351 FARGPVVKVDPQGRCGGVLVYGLQMIILKASQAGSSLVVDDEAFGNTGAISARVESSYLI 410
           F RGP+VKVDPQGRCGGVLVYGLQMIILK SQ GS LV DD+AF + G +SARVESSY+I
Sbjct: 181 FPRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYII 240

Query: 411 NLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALSISTTLKQHP 470
           NLRDL++KHVKDFVF+HGYIEPV+VIL E+E TWAGRVSWKHHTC++SALSI++TLKQHP
Sbjct: 241 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 300

Query: 471 LIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNNYAVSADSSQDMP 530
           +IWSA NLPHDAYKLLAVPSPIGGVLV+ AN+I+YHSQSASC LALNNYA SADSSQ++P
Sbjct: 301 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 360

Query: 531 RSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIA 590
            SNF+VELDAA  TW+ NDVALLSTK+GELLLL L+YDGR VQRLDLSKSKASVL S I 
Sbjct: 361 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 420

Query: 591 SIGNSLFFLGSRLGDSLLVQFSCGVGSSGLASSLKDEVGDIEVDAHTAKRMRMSSSDALL 650
           S+GNSLFFLGSRLGDSLLVQFSC  G +     L+DE  DIE + H AKR+RM +SD   
Sbjct: 421 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQ 480

Query: 651 DMVGGDELSLYGSAPNNTESAQKSFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQ 710
           D +G +ELSL+GS PNN++SAQKSFSFAVRDSL+N+GP+KDF+YGLRINAD NATG++KQ
Sbjct: 481 DTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQ 540

Query: 711 SNYELVCCSGHGKNGALCILRQSIRPEMITE 737
           SNYELVCCSGHGKNGALC+LRQSIRPEMITE
Sbjct: 541 SNYELVCCSGHGKNGALCVLRQSIRPEMITE 569

BLAST of HG10011827 vs. TAIR 10
Match: AT4G21100.1 (damaged DNA binding protein 1B )

HSP 1 Score: 65.9 bits (159), Expect = 1.7e-10
Identity = 78/351 (22.22%), Postives = 142/351 (40.46%), Query Frame = 0

Query: 265 VSGASLELVCHYRLHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLR 324
           +S   L+ +    L+G + +M +    G    + +D + +  +  K  VL++D     L 
Sbjct: 45  LSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWDYESSELI 104

Query: 325 TSSMHCFEGPQWLHLKRGRESFARGPVVKVDPQGRCGGVLVY-GLQMIILKASQAGSSLV 384
           T +M           + GR +   G +  +DP  R  G+ +Y GL  +I           
Sbjct: 105 TRAMGDVSD------RIGRPT-DNGQIGIIDPDCRVIGLHLYDGLFKVI----------- 164

Query: 385 VDDEAFGNTGAISARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRV 444
                F N G    +++ ++ I L +L V  +K   F++G  +P + +L++         
Sbjct: 165 ----PFDNKG----QLKEAFNIRLEELQVLDIK---FLYGCTKPTIAVLYQDNKD----- 224

Query: 445 SWKHHTCMISALSISTTLKQHPLIWSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQ 504
               H              + P  WS NNL + A  L+ VPSP+ GVL++   +I Y S 
Sbjct: 225 --ARHVKTYEVSLKDKNFVEGP--WSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSA 284

Query: 505 SASCMLALNNYAVSADSSQDMPRSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYD 564
           +A   + +      A    D+  S +                 LL    G + LL + ++
Sbjct: 285 NAFKAIPIRPSITKAYGRVDLDGSRY-----------------LLGDHAGLIHLLVITHE 336

Query: 565 GRVVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSG 615
              V  L +     + + S I+ + N++ F+GS  GDS L++ +    + G
Sbjct: 345 KEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKG 336

BLAST of HG10011827 vs. TAIR 10
Match: AT4G05420.1 (damaged DNA binding protein 1A )

HSP 1 Score: 58.5 bits (140), Expect = 2.7e-08
Identity = 75/341 (21.99%), Postives = 139/341 (40.76%), Query Frame = 0

Query: 278 LHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSIHSLRTSSMHCFEGPQWL 337
           ++G + ++ +    G    + +D + +  +  K  VL++D     L T +M         
Sbjct: 58  IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSD---- 117

Query: 338 HLKRGRESFARGPVVKVDPQGRCGGVLVY-GLQMIILKASQAGSSLVVDDEAFGNTGAIS 397
             + GR +   G +  +DP  R  G+ +Y GL  +I                F N G   
Sbjct: 118 --RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVI---------------PFDNKG--- 177

Query: 398 ARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMISALS 457
            +++ ++ I L +L V  +K   F+ G  +P + +L++           +H        +
Sbjct: 178 -QLKEAFNIRLEELQVLDIK---FLFGCAKPTIAVLYQD------NKDARH------VKT 237

Query: 458 ISTTLKQHPLI---WSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSASCMLALNN 517
              +LK    +   WS N+L + A  L+ VP P+ GVL++   +I Y S SA   + +  
Sbjct: 238 YEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIRP 297

Query: 518 YAVSADSSQDMPRSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLS 577
               A    D+  S +                 LL    G + LL + ++   V  L + 
Sbjct: 298 SITKAYGRVDVDGSRY-----------------LLGDHAGMIHLLVITHEKEKVTGLKIE 336

Query: 578 KSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSG 615
               + + S I+ + N++ F+GS  GDS LV+ +    + G
Sbjct: 358 LLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKG 336

BLAST of HG10011827 vs. TAIR 10
Match: AT4G05420.2 (damaged DNA binding protein 1A )

HSP 1 Score: 53.1 bits (126), Expect = 1.1e-06
Identity = 54/229 (23.58%), Postives = 97/229 (42.36%), Query Frame = 0

Query: 389 FGNTGAISARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHH 448
           F N G    +++ ++ I L +L V  +K   F+ G  +P + +L++           +H 
Sbjct: 123 FDNKG----QLKEAFNIRLEELQVLDIK---FLFGCAKPTIAVLYQD------NKDARH- 182

Query: 449 TCMISALSISTTLKQHPLI---WSANNLPHDAYKLLAVPSPIGGVLVVSANSIYYHSQSA 508
                  +   +LK    +   WS N+L + A  L+ VP P+ GVL++   +I Y S SA
Sbjct: 183 -----VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIGEETIVYCSASA 242

Query: 509 SCMLALNNYAVSADSSQDMPRSNFNVELDAASATWLVNDVALLSTKTGELLLLALVYDGR 568
              + +      A    D+  S +                 LL    G + LL + ++  
Sbjct: 243 FKAIPIRPSITKAYGRVDVDGSRY-----------------LLGDHAGMIHLLVITHEKE 302

Query: 569 VVQRLDLSKSKASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSG 615
            V  L +     + + S I+ + N++ F+GS  GDS LV+ +    + G
Sbjct: 303 KVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKG 315

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038887722.10.0e+0098.06cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Benincasa ... [more]
KAA0049896.10.0e+0096.82cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucumis me... [more]
XP_008441850.10.0e+0096.82PREDICTED: cleavage and polyadenylation specificity factor subunit 1 isoform X1 ... [more]
XP_011648998.10.0e+0095.94cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucumis sa... [more]
XP_023520837.16.1e-30594.88cleavage and polyadenylation specificity factor subunit 1 isoform X1 [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q9FGR04.0e-25174.96Cleavage and polyadenylation specificity factor subunit 1 OS=Arabidopsis thalian... [more]
Q7XWP19.0e-20363.28Probable cleavage and polyadenylation specificity factor subunit 1 OS=Oryza sati... [more]
Q105702.1e-7436.47Cleavage and polyadenylation specificity factor subunit 1 OS=Homo sapiens OX=960... [more]
Q9EPU42.1e-7433.85Cleavage and polyadenylation specificity factor subunit 1 OS=Mus musculus OX=100... [more]
Q105692.7e-7433.80Cleavage and polyadenylation specificity factor subunit 1 OS=Bos taurus OX=9913 ... [more]
Match NameE-valueIdentityDescription
A0A5A7U6U40.0e+0096.82Cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucumis ... [more]
A0A1S3B4E80.0e+0096.82cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucumis ... [more]
A0A0A0LKI90.0e+0095.94Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G074280 PE=4 SV=1[more]
A0A6J1I7X96.5e-30594.88cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbit... [more]
A0A6J1EY934.2e-30494.70cleavage and polyadenylation specificity factor subunit 1 isoform X1 OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT5G51660.12.8e-25274.96cleavage and polyadenylation specificity factor 160 [more]
AT4G21100.11.7e-1022.22damaged DNA binding protein 1B [more]
AT4G05420.12.7e-0821.99damaged DNA binding protein 1A [more]
AT4G05420.21.1e-0623.58damaged DNA binding protein 1A [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 223..612
e-value: 7.9E-71
score: 240.7
IPR018846Cleavage/polyadenylation specificity factor, A subunit, N-terminalPFAMPF10433MMS1_Ncoord: 300..729
e-value: 4.3E-19
score: 68.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 116..140
NoneNo IPR availablePANTHERPTHR10644DNA REPAIR/RNA PROCESSING CPSF FAMILYcoord: 172..735
NoneNo IPR availablePANTHERPTHR10644:SF2CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1coord: 172..735

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10011827.1HG10011827.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006378 mRNA polyadenylation
cellular_component GO:0005634 nucleus
molecular_function GO:0003684 damaged DNA binding
molecular_function GO:0005515 protein binding