Cp4.1LG10g12240 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG10g12240
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionDNA polymerase I
LocationCp4.1LG10 : 9191676 .. 9215326 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TACCCTCAGGGTGGGCGAAGATTGTAAACGAGGTTATATCCCCGAATTGGGCGGTGATCGAGGAGGCGCTTTGGTGGTTCTCATGGCCTGCCACCATCTTCACACTGCGACTGCTAGTGCGTCGCACATTTGCAGAAATTTTTTGAGATACATTTTCACCGCCAAATTTGCTTCTCCTCTTCGTTTCTCTTCTTCTTCTTCTTCTTCTTTGAGGATACAGTCTCCTGGTCACCATTGTTTTCCGTCTTTCTCCGTTCTGCTATCCCCGAAGGTAACTCAGTTTCTGCTTATTGGTACTTATTGTTCTGAATTGATTCTAGCATATTTACCGAGAAGTTTAGTTGCAAAAAGTAGGAGGCTCATTAATGGTGGAGTGTTATTGTGTCGATGTGAAATGGGCGAGATATATTATGGTAGCAAGTGTAATTCGATTAATCAAAGAGCAGAATGTGGTTGATGGCTCTTATTGCAATAGTTTTTCTTGTCATCTCGACATGAGAAATGGATACTATTTTGATCTTAGTTGTGATCATGGCGTCTAATTTTAGCGTTTGGTTAAATTCAAAGCAATATACAACTTTTTGCCTCTGTTTTCTGCTGCAGTTTTGACTTCTTCTAAGGATTTTTCTCGGTTTCTCAGGGTTACTGTAGTTCATCTAGAAGTGTAAATGCTGCGATTAACGTAGATAGCAATGCAACTTATCATGGTAGTCCGGCATCTTCTAGTAGCCAGCAAATGCTGCAAGTCCAAGATTCATTATCGAATTCACCCACATGCAAAGAAGAAACTGAAATTGATAGTCCTTCAGATGCGAGGGTCATGCTCATTGATGGCACATCAATCATTTATAGAGCATACCACAAGCTTTTGGGTATGTTAGATTTGTCATCAGAATGTTTTAGTTTGACTTAGTTTGTTATCCTAACTTATATTTTCAATCCGTATTTCCACTGGAGATGGTTTATGTGCATAGAGGATAATTGATAGCAGAAAAGTGGCACACATAAACCCGTTGGTAAAGAAGTAGACTTTAAATTCTTAACATTTTTATTCTTTCATAAGAAACGCGGACAAACTTCATTAATAGAGAAAAAGATATACAAAAATGGAGACGATATATCCCCAAATGCCAAGGGGATTATAAAAAGGATTTCCAATTAGCATTAATTTATATCAAATTAGAACTACAAACATGAGGGAGCAAAGAACGCCATTTTGGTGAAAACAATCTAACTTGCTCCCTACATTCATCACGAGAGGATTCACTGTCTTGAAAAGTCCTCTTGTTCCTTTCATACTAAAGTCTCCGTAAAACCACTACCACTGAATTGAACCACAACGCTCTAGCTTTGCCCCCAAAAAGAAGTCTCATAAATGAGCTGAAAAAAAGCCCCCGAACACCCCTTGGGGAGCACCCAACTTCAGTTAAAACTTAAAAAAGACTACACCAACCAAAAGCAACATACGTTTTTGCTATATTTACTTCTTTGATTTATTTGTTTGTTTGGATTTCTGTTTTGGAGCAGCAAAGCTGCATCATGGCCATTTATCACATGCGGATGGCAATGGAGATTGGGTGCTAACAATATTTACTGCCATGTCACTTGTAAGTAACATTCTTGTTAGGTTCTTGTTTTCAAATGTTATTTTACCTGTCCTTTTTCCAACAATGATATACATTCTACTGATAGTGCCTCAAACCCTAATGCTATTCTAAATGTTCTCCTCATCTTTGTTGTTTTTTGCCATGAAGTAATTTCTGTTAAAATATTACCATAAAATATAGGAATATCAATTTTCAGCTTGTTGGGTATTGGCATAATGCATATGCCGTGTTTGACTAGCCATGGGAACAAAAGAGAAACATATGAGGGATCATCAACATGTAGGAGCTCGGGCCTATCCAAATCATCCTCAATAATTATATCATAGGGTGCCTCTATGGTATGAAAGGGTGGTGGGTCGAGCTCGGGTTTAAAAGTTTTGGGGTAGTGTCTGTGATAACTAGTGAGAAAGACGCTGTATTAGAGGGAAGAGGGGGAGGCTATTCAAAGAAGGTGGATAGGTATGGTAGTGAGTCAATTGGAGAAAGGTGATGTCGTGGGGAGCCTTCATCAAATGAGTGCAAGGGGATGATAATATGTTTTCTTAGTCTAAAAGAGGTTTATCGGTGGCTGTCGAGAACTGATAAAGATGATGAAGGAAAAAGAAGTGATGAGGTGGGTTACAATTCCCTTCCTAGGCCCGTTTGACATTGTCTTTCGGTTTCCAGTTCATTTCCTCTAACCTTACCATCTCTTCAATCTTACATTTCAACAAGACACTTGTTAGGCCTACCATCATTAGGCAATACTTTAGAAGCGGTAGAAAAGGGAATGTACCCTCATCAAGTTAGTATCTCTAAAGGCCTTTTGTTCCAACCTTTCTAAAGCTTCCTTTAGTGAGCGGAGTTTTCCCATGAATTTGAAACCTTCCCACTTTCAGGAACATTTGTTGGCCACCAAGAAACTTAAAAAGGAGTGATAGGAGATCAACATATTTTCATACCTTAGAGGACAGGGTCCCTACTTTGGCTCCCATTGATCCAGAGAAATCGGCCAATGACTCGATGTGTCCCTGGGACCTAGAGATCTTTTGGCACCAAAGAAAGAATCCATCCAAACTCTAGATAACAAGGCTCTGGACCACCCTCAAAAGACAGTTCTATTTTCTCCGAAGTGCAACTATTTCCTTCCGTAGATTTCATCCTTCCTTCACCAATTTGACTAAAGAGGGTAGCTCCATAGATGTCCTATGTTGTTTGACCCTTCGAGGAACTTGAGATTTACTCAAAAGAATTGTAGCAAGTGTCATAAGAAATCTTCCAACCCTTGCACGCAATACTTTTGGGGCACCCATCAAGGGGTTCAAAAGCAATCATACATCTGCCTCTTTGATAACTCATGTTAGACATTTGAACAAATATCTATTCGAAGAACTTCTTTGCTTTTGATCTCACGATGATGAAAGAAAAAAATAGCTTACCCACATGAATATCCCCAAATGAACATTCACATTTGAAACTTTTGATATGAACCAAGACATCTCAATCATGCAGTATATACTTCAAGGAAGATGTGAAAAAAATGATCAAATTACCACCAGATAAAATCAAGTTGCAATTCATGACATATTAAAGAAGAATCAAGTTTCAATTGTTCTTGTATGATTACAAATTTTGGGTGGATGATAATTAAGAGTTATGGTGTTTTATTAATATATTTTAAGGCACTTTATTTACTTTAAGGTTACATTTATATAGTGGCGAATTATGGTCATAAAGTATACATCAATAAGGTTGTTAGATCAAATTCTCAAGAGTTGCAATCATTTGAAATGAAATCAAATTGATTTGGGGTTTCTGCATTTGTGAAATAAAGGTTCGCATATCCACGCGCCTTATGAGAGTTCTTAGATTCAATTGACTAGAGTTTCCATATCATTTGGTATCATTTGTGTTTCATTTGTGTTTCTTATCAATTTGGTTTTTATTTATTTTTATTCTTTTAGCCCATAATCGATTTAGAGTTTAAAAACTAATCTAGAATGAGAGCTTCATTTATTATGTAAATGAGTGTAAACAGAGTGCCCTAAAATTTCTTGAGTGTATACATGTGAGAGAGTGCTGTGAGGTCTTTTGTATTTTCTGAATTTAATTTTGCAACGAAGTTTCTCAAATAGATTAGATTTGAGGACAAATCCTCTTGAAGAAGTGAACAATGATATGAACCAATGCATCTCAATCATGCAATAGCAATATGTACTTCAAGGAAGATGTTAAAAAAATGTTGTTCAAATTATCACAAGATAAAATCAAGTTGCAATTCATGACACATTAAAGAAGAATCAAGCTTTAATTATTCTTGTATGGTTACAAATTTTGGGTGGATGATAAATTAGAGTTATGATATTTTATTAATACATTTTAAGACACTTCATTTACCTTAATGTTACATTTACATAGTGGCTAATTATGGTCATAAAGTATACATGATGGCTAATTATGGTCATAAAGTATGAATGTTACAACTCTTGAAATGCCTCATGGAAGGTGAATGGTTTTATTTTGAGACTATATAACGCCATGTATCTTGTCTTTGTAAAGTAGACTTGGAAAATGTAATAAAAATAGCATTTGTGTTTCTTTTAGCCAAAAGCTAAGTTTCTTTGTAATTTTATGTAGTTTATGTTGAATATTTAGAATCATTCAAGCTTGACATGATCAATCTTTGATCTTGCTTGTGGAGTAATTCAAATCTCAAATAAGTGTTCTTGCCTTGAGATATTCGATCAACAAGGTAATTTGGATCTTACTTTCCCTTGGAGTGGTTCTTGGATATTTAGATCAAAAAGGTTTTACTCTTCAATAAGGTTGTTAGATCAAATTCCCAAGAGTTGCAATCATTTGGAATGAAATCAAATTGATTTGGGTGTTTTTGCATTTGTGAATTAAGGGTTTGGATATCCAGGCACCTTACGAGGGTTCTTAGACTCAACTAGCTAGGGTTTCCATATCATACAATATTGTTCAATTAGCACCTTCGTCAATTCATCTTAGAGGTTGAACACATCTCTTCCTTTCAATCTTCACTGAGCATGTATACACCTCACCAGCCCTAACCTCTTCCCGCTGGCTCTCATCCTTCAGTCCACTAACACATTATGAAAGAGATGAGGAGAAGAAAGAGAGAGAGTGTGTGTGCGTGTGTGCGCGCGTGTCAATCCAAAAGTGTTTTCAACCCATGGTTATATTGACAAAAGTCTTTTGTAATTATAGTGTAATTGTACATCCTTCACTTGTGCTTCTCAATGTCTTGCTGAAACATTTTCTGTTTCTAAGCAAAAAAAGACGGGTGGTTTTACAAATTTTGAAAGATGGAGATATTTCTTGTGATAATTGCTATATCTATTATATATCTTTTCTGCAAGAATTTGTGATTTTTCTGCAGGTAAAGTTCAAGTTTTCTGTTTAGTTTAGTGGTTCATGTGCTTACTCATTATTATTATTCTCTTGCAGATTGTTGATGTTCTGGAGTTTATGCCTTCTCATGTGGCGGTAGCTCTTCTTTTTCCCTCACTGCTCTCTTTAATTATGTTATCTGCAAACATTAACCAATTAAGATTCTTTTAAATATAAGTTTTTTGGTATTTGAAAATTGAGTTCGATTTTCATTTCATTCATAATTTTTTAAAAATTTTGTTTCATCAATCCTTCATTTTTTTTTTTTACTTCTAATTATCTGTTAGCTTCTTTGGCTAAAAATGAAGTGATTTTTTCTAAAATTTTCTTAGGATGATGTGACAATATTTAACCAAAACTCCAGAATTTTTTATGTGGAAAAAAATATATAAAAATATGCTCTCCCGTACGTCATCCGAACATCCACTCTACCCACACCCGCACCGTCGGATTCTCTTGGTTTCCTTTGACTCTAGGCCTCTACCTCTCCCACTCTGACTTCCTATCGGATTTGTTTTTTTTTTTCTTTCCAGATTTGGGGTTGAGGATGGGCACTGGTAGGGAGGGAGAAATAAAGGAGGTGGTGGGGAGAGAGAAATAAAAGAGGTGGTTGGTTGAAGAGAGTGTTTTAAAATATTTTTTTGGTCACATCAACAAAATAAAATTATAAAATAGCTAATTATTGTAAGGAAATATAATACAAAACCATGGCATCTTTATTGTCAAGGAAACTAACAAAAAATCAACAACAGGTACCAAAATGGAGCATCATTGAAACCTAAATGGCAAAAACGCAAGATTTTGAAACCTAGAGACAAAATGGATATTAAACTCTAACCGCGGGGATCAAAAGAGCATTTATCCCTATAATATGTTAGTTAGACTAATAAGTGATGATGTCATTTGATTTGTTGATAATTAGACATGTACGTGTACAGAATCATTCATCCGTCTTTTTAGGGTCATCTCTTATTTTTAGCATAACTAATCAATATCCAGACATTTGCAGGATTTGATGTTTAGCTTGTGCCTCCCATATTTATTGTCTTAACTATGAATGCAATTGCAATATTCACTTATACGTTCTCTATGTTGTGCAGGTTGTGTTTGATCATGATGGTAAGATCTAGGATTCGAAATTATCACATTACTGATTTCTACAAGAACGTGTATGTGCTTGTATTCCATTACTTTTCTTGGATCTGCAGCTTTCATAATTATCTACTTACATGTACTTTACCTGGTGATGGGAAGGATACCAATGTTTAGTGGGTGACAGTTCGGATACTAGTGTCTAGTGGATTATGGGTGATGGGAGAGATATTTTTTTTTTCTCCGAGGTCAGGGTGTTGGAGGGTAGACCCCTTTGCACTTGTTTCTCTCGTTTATATCACTTTTCCTCGATTAAACCACTCAGTGGCGGATATGTTGCCCTTTTCTCAGATATTATGTTCTTCGTACATGCTGGGATTATGATGTTTTTGTTAGCAGGAAAACGTCAGATTTTATCTCCATTCTCTCATTGATTGGAAGTTTTGTTGTAGTCTGGGGAAAAAATATGTTTGTATTTTGAGCCCTTGTTCTTCAGTTGGCTTCTCTTGCAAGTCTTTTTTTTGTGGCCTGGTCAATTGCTCTCTCTCGAGTGGTGTCTTTGCTTCACTGTGGAAGGTGAAAATGTTAAAGAAGGTTCAACTTTTTGGGTGGTAGGTTCTTCATGTATAAGTTAACACCTTGGATCAAGTTCCGAGAAGGTACAAATAGGTTAACGCACTGGATCAAGTTTTGAGAAGGTTGTGCAATTTGATTGGGTCGTTCTGTTGTTTACTTTGTAGGTAGGTGGATGAGGACCTTGACCATATACTCTGACACTGTGACATTGCGCAGGAGATTTGGAGTTATTTTTTTTAGGAGTTTGGCTTAAGGTATCTAGACATAACGCCTGTAAAGAGATGCTTGAGGAGCTCTCTCTTTCCCTCATTTTCATGAGAAAGGAAGGTTATTGTGGTAGGCTAGGGTGTGTGCGATTTTGTGGGATATTTGGGGGGAGAGGAATAGTCTTTAGAGAGTTCGAGAGACCTTGGAGAGGGATGTTTGGACCCTTATATGGTTCAACATCTCTATGTGAGTTTTGATAGCTAAGCTGTTACGTAAATACCCGTGGCTCTTATTTTACTTGATTGGAGGCCATTTCTTTAGTTTGCCTCCGTCTTTTGTGGGGGCCATTTTTATGGTGGCCTTTATTTTTGTATGTCGTCGTATTTTTTCTTTATTTTTTTCTCAAAGAAAGTTCGGTTGTTTATTAAAAAAGAAAACATACGAGAAAATGTTTGGAATATTATTAAGTGGACTTTAGAAGTATTATTAGGATAATACATATATTTTTTCTGAACAAGAAACATAACTTTTTATTGAATAAACGAAAGAGATTGAGGCTCAAGATACATTTTTTTCATTGAACAAATGAAAAGAAACTTCCAAAACCTTAAATTTTAGATTGTAGTGCACGCTTATTTTGGAGTTGCACTACTAAGGCGGACTTTCATTGAGAAGAAAAGAAGAATACAGAAGTGTAACCTAGTGAAGAGAATGCTAACTAAATACAGAACATTATAGAAACCCAACCCAAAGGGATACACTAAACCTAGTCAAGGCCTAAACTCTTTCAAAAACTCTCCACGCCCCTAAAAACTCATCTCTAGCCAATATTTCATAAGACTGCAAACAAGTGGCCTAAAGGAGTATGAAATAGCACCTCCTTCTATCATAAAAAAACGTACTTTTAGGACTGATCGACTAAAATGAAGACTGCTTCAAAGTTTCCAACCCTTGGGTGGTGGATTATAATTGATAAGATCAAGATTGAGGATTCTTGTTGTAGAGGGGATGCAAGCATGAAGGAAATGATGTCAGTCTAACGTGTGATCTTGTAACTGCTCCACCACTTGGGAGGCTAAAGAGGAGACTGGGTATTAAAGTGGTGTGAAGTTAGTGTAGAGTGAGAATCTATTGGTGGATTTTCCTTTCTACTGATTGTAAATATTGATCGGAAATATTTATTATCCATATTTATGTATAATTCCTTCATATTTTTTTAGACGATCTCACTAAATATAAGAGGACCTTGTATATTACCTTTACAAATATTGAAGAAATAAAATATTCTTTTCTCCATAAAAATCGAAACTTATTTATGTGAGAGTCAAGCTTTTGACCATCGATAATTTTCTTTTTCATCCTTCATTCTTTAATTGTAACTCAATGCTGACAGAAATAGTACATGGAGGTGCCTTAGTTGTTTTTGGTTGACTAGCTATCGTTCTATCATCTCCTTAAATCCTTTCAGTCATTTAAATTGTACTTTGGATGTTGGATAAAAATGATGTGACAGAATAAAAAAAGAGTTGAATAGTTCAGAAACATGACTAAATTCCATCGCATTTTGATCAATATCCCAAGCCTTTTTCTTTTCTTTTCTTGGATTGTGTTAGTAGAACTGCATTTGGAGTTTTCTCAATTATAATCTCCGTCGTTTACCTTTTTTTCCCTTCAACAATTTGGAATCATTCAGTTGAACTTCATTCTAGAATTTAAATATTGTGGATTCTTAGACAAATTAGTTGGTAGTTATAAGTGATTTCTATTTAATTTCAAGGCTGGCCTTGTGCATTTTGATGACATAGTAATGAGTACTTGGCGATTTTATACTTTCTGTAGTAAAGTTTCAATTGGTATCTGTCTATCAAAATGGGAGTTTCTTGGCATTTAGTGTCAATTCCCTTCCCAATTGTGGTGGGTAGCGTTGAGGAATTTCACAATTCCAAGAACAAACTAATCCTTTTGGTAAAAACAAACCTTTTCTTTCAGGACATGCATATGGTCATACTTGTTATTCATCCAATGAAAATTTCATGGCAAAAGGTATGCTCTTCTTCTTTGAACGATTGAGTTGATAGTGACTTTTTTTTTTCTCTTTTAAAACAAGAGTAAAATGATATTAACATGTTAAAATTAAGGAAATTAATATTTGTTGGTTACACGTTTTAATCTAGACGACTTTTATTTGAATCCTATTCTAACTGAAGTATTTTGGTGTGTATAATGGCTTTTGTATGGGAGTAATCTAAGTTTTTTTCGTTCATTCTTTGAAAACAAAAACTAGACTTTTCTTTTTCATTAATTTTTTTAAAGCTTCAATAAAATTGTCGAGTTTTGGCTTTTCTGAGAACTATTTATAGTTTATTTTGTTCACCCCTCGCCATGTAATTTTGTGATCTTATATTATAACATTTTTTTAAAAAGGAAACTTGTAGTCTTGAGATCTCAAAAGTTTCTTTTTAGTATTGCTGAAGCGGTGCCTTCTTCTTCTTATTGGGTTAAAAAAGACCAAGAAGTGGTGAGTTGAATTTCAACGAGCTAATTGGTTTTCCTAGCTTATTTGCTCATAACAGTTGGAAGGAAATTAAGCATACTTTGGAAGTTCATTTTAAGGACTCTGTATTGATTAATCCCTTCATGATTTAGAAGACTTTACTAAATTTCAAGGGTGAAAACTCCTTCAAGGCTCTTGAGTCTGAAGGAAAATGGAGACCATAGTGATTTTCATTTGCTTATTGAAAAACGGTCTTGAGAAAAGAACAGTCATCTGACTTTTATTGAAGGTTATGGTGGATGGATCTCCATCAAAAACTTGCCTCTAACCTGTTGGGATAATAGCATGTTCGAAGCTATCGACCATAACTTTGGTGGTCTTGAGAGTATTTCTTCACAAAATCTTTATATGTTAGACTGCACTAAGGCTCATATAGAAGTCAAGAGGGATTTGTGTGGTTTTCTTCTAGCTATAATAGAATTCAAGGATAAAAAAGAGGGAATACGTTTCTTAGATTTGGTGATGTTACTGTCATAGATCCTCAAAATATCATTCATTAACGGAGCTCTCTCCTTGAAAGACTTCTCCAATTCTTTGGACTTGCACCGTTTACATTAAATCATGGAAGATGAAACATTGACATCGTTTACACATTCCTTCCTCCCTCTTTAGTCCCCAGCAAATTTGAATCTCTTATTGAAGCTTGTGGTCTTGAATTATAGAAAATTCCTCCTAAAATATCTTAAAGATTAGTTTTTAATTAAGTTTCAAGAGAGCAGCTCTTTTTGGTCTTTACTTTCTCTTTAAGTAGTTTGAGTTTTGGGGCTTTTGTTCTTAGTATCTTAGTTGGTTTGTTAGATCCTTTTTTGAGGACAATTTTTGCCAACTTTTTCTTATATTGGATCTTTTGGTTACTTGTTGTTGAGTTTTCTTTAGAGCTGAGCTTTGGTTTGATGGGTCCTTTTTGTTTTTGGGTTGTTCTCTTCGTTTGAATCTACTTTAAGATATTGTTTTGTTTTTCTGGTTCACCCTTTGAGATTTTTGTATCCTTTGAACATTTTCATTTTCATTATATAAAATAAGTTGTTTCTAGTTAATAACAAAGAATACATAGAATTTTCGTCGATAATTAAAAATATATAGTAACTGCTCTTAGTTGATGAAAAGTTACTGTGTATCCTCAAAAGTTTGTATCTTTGATTTTTTTTTTTTTCTTTTCCACTAATCAATGAAAAGTTCATATCTAGTTAAAAAAATAACATATTACAGTCGAATAAAGTAAATAACTTTCATTGGAAAAACATATATTGCCCAACCAAAAAATACAAATACAACATACCCGATTAGCCAAAACCAAATGATGAATAAAGCCTATACGTAAACCATAAACAACCATTACATATTCTACCTAAACCTAACAACAACAGAATGCTTGGATACATCTCCAAGGAGCTTGAAAAAGGAAAAGATGAGCTATACAAAGATGCTAAACAAGCATCAAAACTGTAGGAAGGAAAAAATGAGCTATACAAAGATGCTAAACAACCATCAAAACTATAGAAATGAATTCCCCTTCCATATCGATTTCTATGAAACATGAGTTGTTGGAAAATAGCTTTTATCTTCACAGCTCCATAGACGCTTCAACACAAAGCCCAAAAGAAAAATCATCTTAAACTATCTGAATCAAGAAAGCCCATGCAAACAAAATGAAAAGAACCCAAATTAGATGAAGGCAAGGGAAGGGTCGATATTGAATTCTAGTATTAAACCTCGATCTTCATACATGTGTTATGCCTTGCAAAAGCTAGTATTAGAGTGGGCATGGGTAGCACACTTGAAGATAGAGGGAGGCCAGGCGGGAGATGTGCAGGAAGTTGGTGTGAGTTGTCATGGACGTACTTTCTTGCCATTGAGCTTGCTTTGATGATAACTTCAATTAGTTGAAGCAATGGCATATCAAGCTATTGTTAGTCTTGACCACAATCTTGGCCCTTAGAGATATCGTCCTTCAGTCCCCTAGACAGTGGACCACAATTAACACCTCTTTCTCAAAAGTGACATACCTTTTCTCAGCGTCATTCAATTTTTGACATTCGTTTGTGATGAGATATGTCTTGAGGAGTATGCCACCAAAGACAAAATTTGATGGATCTCCTCTTCAAAAAAATTTGGCTAGGTTGGCAATCCTAAGCACCTGCTGTGAAAATCCATCGAAAGTGGCTTGATATTTTAGAATCCAACTCCAATTAGGGTTCTTCTTTAGCATCTCTGTCAAGGGCCCCCCTGTTTAGAACATTCTTTAGTGAACAAGCAGTAGTAATTGATAATCTAGGAAAGAGTGCAGTTTTGTGATGCAAGTTAGCACCTTTCAGCCATGGATGAGTTCAATGTACCAATTTGCCCATATTCGATTGGGTGGCCCAGAAAGCTGATCACTCTTATGCAAAGGAACATTTCCCCCTTTCCTCATACAACTGATTTTTTCTCAACTTCTCAAAGATGAGTTATAGGTGAATATGACATTCTTCTATAGTCGAGCTATGGACACTGTGTCTTTCAGGTAGACCAGCAAACTTCACAAGATACTCATGGAACACTTGGTTCATTATGTGTATAAGGTGGCCGATGCATTTTTAAGGCCATAAGGAATGTAAAGAAACTCACAAGCCCCGTACTACGTGAGACGCATTGTCTTGCAATTAGTCTCCTTCACCATACGGACTTGGTAGTAACTTACTCTCATGTCCAGAGGTTCAAGAAATACTTTGCCACATGATGTTGGTCTCATAAGTCATTGATGAACTTTTTTAGAGTATGGTGTTCTATGCATGGGCCAACACTTCCATTCTACTTCCTTTGGAAGAGGACTAGGGCTCCACTATGACCTTTTGTCGATTGGATGAACCGAACATTCAATAATTTATCGAGTTGTTTTTGAGTTCAACTAACTTAGGAAGGACTATGTAGTACATGTTCTTCGTGGGTAGTTTCTCCCCTGGCAACAACTCAATCTCATGATCTATGTGTCTATGTGGAGATGCAATCTTTGTTAGGCTATCAAGCATGCACCGTAATACTCTTCTAACACTTGTTGGCTTTCTTCAAGGGAAATTTTTAGAACTCTCATCTTTAATCACGAGAATGGCCGTGAATGTTTGTTTGTCATGAGCAAACCCTATGTTTTCTCAACTCAAATTTGAGTTGGGTCTTCAGAGAGGATTCAATATTTAATGGTCTTCAAACATTATTTGTTTCTGATATGCCCATCATGCCAAATTCTTATGGTTTAATGCTTACCAGGACATTGTTTCATATATCTAGTTGGAAAGAAAAAAATGGGGTTTTGAAGAGAAGTATACAGTTCGGTTGGATTGTTTTGAGCTCATCGAGATAGTCGATGGTATTCTGTCTAAATTCTTTCATGGTCATTCTTCTAGTGACATTGTAGTTTTAGTTGTTTCCTACTACTCTCTTTATTTATTTTTTTTTCTNTTTTTTTTTGTATTTATTTTCTGCAATTTACATTTATGCTTTAATTTTATTTTTTGAGTTTACGCTTTGTAGTCTTTGAATTCTTGTACTCTCTTATAGCTTGAGAAGGTTTTGAAGGCTTTCAGATAAGAAAATACATTTAAGCAAGCTCTGTTTTTTATTTTTCATGGCACAACAAATTAATTGGAGAATATAGTTATTTTTTGTTATTTAAGAAACCATATTTACATTTAAATCTACTTGTATTAATGGTCTATTAGTTCAAACTAGTTATCAACTAAATTGCACAATCTTCTGTGTCTATGTAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCCACACCTGATACCATAGTCCAGGGGCTCCAATACTTAAAGGCATCCATAAAGTCCATGTCCGTAAAGGTGATTGAGGTAATAAATTGTTGGAATTTCTACTATTTATGAATGTTTGAGTGTAATTATTGATCATCAAATATTTGATCTTTATTGCCTTAATTATTTTTCATATTTGTATTAGGTTTTTTCTATATTTGTAATAGGTCTCTCCTATACAAGGAAACGTAAATTCCACATCATACACAATTAAAATGTTTCTTAGTCCCTCAATTAAGATAAATAAATGATTTTTTCGTAGTTTTTTTTATTAGCCTACCAGATTTTTAGAGAGAGAGGGAGGAAGAGACTAGAGAGAAAAAAATATATGTTTTTATTAGACTACTTCCAGATTATTTGCTTTGCATATAAGGATGTAGTAGTTACATTCAGCTTTTTATTAGCAGCTCGATGAAATTTGTGTACTTCTTTCCTACAATATTTACTATCAATAACCTGTTCTAATCTTCTTCTAGAATTAGTCATGTCTTTTGTTTGTTTGATTATTTAGCCTGTTATTCCCTGGTGGTTGGCTTTAATTAGACTCAATGATAGGTACCTGGAGTGGAGGCTGATGATGTGATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTGTAAGGTGACACACTGAATAGATATTTCAGAATGACATAGGGGTCCTTTTTTAATTTATTTTGCATGCTTGCCAAGATTATGTTCTTTTATGGGGATATATTTAATCATCCACTACTTGATCTTATTGAGTGTCAAACAGGTTCGTGTTGTCTCCCCTGACAAAGACTTCTTCCAGATTATATCTCCTTCATTACGTCTTTTACGAATAGCTCCACGTGGGTTTGAGTAAGAACATATGGATTACTTGTTTACTCATTAAAAATGAATTCCTCTAACTTAATAAAAGAAATAGTTTTTTCCTGATAAATTTGCAGTCATTTTCTCTATGGACTCTGAGGGTAGAATAGTTATATCTTATACTTTATGCATTAATATTGTTCTCCTTATCGTAGCTGAGGGATTGGCATATGATTGTTGTCCAAAATAGAACACCTGCATATGAAATTTTTTCTCCGTATTAAAGGACCACAATTAGGCATGCATTTTTTTATATCTGGCTTTTTTGTTCAATATTTAGACTAGAAAACTAAAAATATCTGTTTTTCTTTATTATTACTTTTTAATTGTTACTGGGAAGATGATCGATAATCTAATCCGTTCTTTTGAAATTTTATTATTGAACCTGCATAGGATCATAGAGTTTATTCTTGAAAAATGAGATGTAGGTGGGTTCCATGAGAGTCTGGGGAAGGAAATCCCTTCATTGTGAGGGTTGAACAAGAGCAGTTTGTTTTTCGAACTAGTGAATTGGGAAAGGTGGAGATTGAGGAAATTTACAAAGAAAATATGTTAAAGATTTCGTTGAACCTTAGGTTGTAATTGGGTTCAGTTACTTGGGTAAGGATCTATTTTGATGTTGGTGGGTTGCATCTGAAGCATTAATCTAGATTCAGCTGGTTTCAAACCCTGGGAAAATTGGTGTGTCATAGGTGGTGGAGTGTCCTTAGTGGACAAGGTCTTCGATTATCATCCACGAAAGAGAAGAAAGATAAGGTAGATACTGGTTTTACCACGAATCTAGAGTGCTCCAAATCAGGGGGACAATCGTTCTGCGAATTCTTCCTCCTATTCTGCTTCTTCATTCACTTATTTGGTGAAGGAAGGAAGTAATGGATTGGAAGAATATATGGTTACTTAGAGGAGGAGATGTAGTTCTCTTTTGAGGCTCCTAAGTGTCCTGGTAGTATGGAGCATGCAGCTGATCCGTGGTTGATGTTCAAATAGGAACTTGGAAGGAAGTTCGAGGAAGTACTAAAAGTGATGCAATTTTGTGGCCACATGGCTTATTATTTTTGGCGAGGATCATTGTCTTAGTTTGAAACTTTTAGAGTTAGAGTTCGGCGGGGGAGATCTTGTGTATTATATGGTGAGTCCAGGGGAGATCTTGTGTATTATATGGTGGTGAGTCCAGGGGAGATCTTGTGTATTATTTGCTGATTCCAGGGGAGATCTTGTGTACTATATGGTGAGTCCATTATCGTGGAACACTGAAACTATAGCTTAGGGTAGGTGAATGGTCTTCAATAGAGGTTAGGTGAAGGTTCTTGTGTGGAAGACTGGATGGATGGAGGATGTGATTAGCTTGGCACAGGGTGTAAGAGTAGGTGCATCTTCTTCATCTTTGGGCTTTCAACGAAGTTTTCTTGTTGATCTTTTCTCCTGGGCTATTGCATGGCGCTGCTTTGGAGCTTAGCAGGAGTGTTTTTTGCAGCTATGGCTCGTTAAGTTTCCAAGATCTTCTTCGACATTTTTGTTCTTCAATATTTTATATGTCTCTTGGCACTTATATTTGGAGACATTTACTCCCTCTGTGGGTTTGATCCTAACATTTCATTATCTTTTCATCAATCAATGAAACTAAGTTTGTTTTTTGTTTTAAAAAAAATTAAAAAAATGAAGATATATACAGAATTATTGAAAAAGAGAGTGCTTAGAGAGATGGGTGAATTTGATCCTAGCATTTCACCATCTTTTCATCAATCAATGAAAAAAAAGTTTAAAAAAATGAAAACAATGAATTTATATATAATTATTGAAAGAGAGATTACTTAGAGAGGGGTAAATGTTTATAAAGATAAAGAAGAGACTCCGGGAGAAGGGATGCTTAGAGAGAGCGAGGATGTTTAAATCATGAGTTCAAATTTTTTTGGCTTTTTTTGTTTTTTTGTTTTTTCAGTGTGGCTTCAATTTTTGGATTTGATTTTATGGTACCTTCTTTGAAACTTCGACATTTACTAAGCTTCTCTGCCCATTTTTAGGATGGTTTCTTTTGGGCTGGAGGATTTTGCCGAAAAATATGGAGTTCTGGAACCTTCTCAGTTCGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGGTGATTTATGAATCTCTGTCTAAAAACCTCCATAACTTCGTTTTGTACCAATTACAACGCAATTGCAACTTTGATGATTTGTAAGTGTCTTGGCGCCACACTTGGTGACTTTGAACTGAAGTACTTTTCCATGCTTCAGGTCTAATCCACGGGACAAAAATTACTCTAGGGGCAAAAATATCATTGTAACCGAATGTAATATTGTTACCAACATAAACATGCTGAATTTGGAAATTTTAGCAGAAGAAACTAAGGTCCTGTTTCCTTATTAATGAGTGTTTTTCTTTTTGCTGTTATTTATTAGGAATTTTCTTGAATCAAAATTAAAACCTATTGTACTAATGCCTAGAAGCTGTCCGAAAGCGGGAATAGAATCGAAATATAATACTGTATAATTTTAAACTAACTTGCTACAACAATTTTTTTTTTTTTTTTTTCTCCCTTCAGAAGAGACAATTTCATTGATGATTGAAATTTACAAAAGGGATATAATATCATGGTGTTTACAAAAGACCTTTCCAATTTGTAATGAGGGTGACATAACTATAGGAAGTAAAAATATTAGAACTTTAACCAAGATATAGCTTGGTAACAACATTGTTGAAAAGTTGTGTGTAGGTCTGTGTCATTTCTCTGATTTTTTTTCTTTCCATAGATTCCAAAAGAAAGTCATGATGAGGTTTTTCCATAGTAGGGCTTTTGCATTCTTGAAAGGGTGGTACGTTAATGCAATATCCAAAAGATCCTTTACCTCCCTAGGAAATGTGACATGCCATCCGAATATATTGAAATCGTTGTCTAGAAATTATGAGCGTATGTGCATTTCATAAACAAGTGACTTTGTGATTCGTTTTTTTCTTGCATATTGGACACCAATTTGGAGCAAGAGTGGTGTAAGACACTCTTTTCAGAAGATTTTCACTTGTGCTAATGGCTTTATGCGCTATTTCCCAAAGGAAAAACTTCACCTTTTTAGGCTGATGTCCTTTCCATACTCTCTTTGCTAGGGTGGGATTTATTGCTTCTACTTTTTCCCCCATGTCCATCATCAAGGATTTTGTAGAAAAGACCCTATCAGCGTTGAGGAGCCAAGTCAGTGAGTCTGCTTTGTTTGACAAGACCATAGAGGCAAGGTCAAGAGTTAATTCGGCCCATTCCGTTGCTTCATTATCCTTTGAATTCCTACCAAGCTTCAGGTCCCAGAATTTGTTGACAACATTCCATGTTTCTTTGATCGTGACTTTTTTGCTATGAGAGACTGTTAAAAAACGGATACCTCAAGGCTAGCGTGTTGTTTTCAATCATGGGTCGGTCCAAAATGATCTGCTTCTTCCATCACCCACCTTATGGCAAGTTCGGTTGGTGATGAGGTTTTGATGTTTCTTTATGTACTTTCAAGGCCCTTTTGTAGAAGATGGAGGAGTGATTTTTGTTTGATGTAGGAGTATATTTGACCTTTATAAGATTTCTCCACAAGGCCTTTTCTTCATAATGATATCTACATATCCATTTGGCGAGGAGAGCTTTATTCTTCTTCTTTATCGAGAGAAGACCAAGGCCTCCCTTTTTTGTTGGGAGGTTTATAATATTTCATCCAACAAGGTGTAGATCATCTTTCCAGAAACTTTTCCATAAGTAGTTTCTAAAAAATCTTTCTATATGACTTGTATTAGGGTGAGCCTTCCACCTTTTAAAGTATAACATGATGACCATGTTGACAACCTTCTTTCTCTTCAACAGAATAATCTTCCAGGGTGAGTCTACAACAGCTTGTTTTTACTTCTCCAAGTAATGCCGATTATAGATCTCTAGTCAGTTAATTCTCTATCCAGCTAGCCTAAGTGTAGAGTTTTACCAGTCTATTTGATGGCTTTTTGGATAATAGCCCATGATCTTGAGTTCCTTTCAAGTATCTCAAAATTTTGTTTATAACTCTAAGATGATGGTCATATACTGACTAACAAAACTCACAGAATATGCGATGTTTGGTCTGGTATTTGAAAGATAAATTAACTTTCCCATGATTCTTTGATACATGCCCTTGTCAACTGGAATAACTTCTTCATTTTGGTGTAGAACTAAATTCAGGTCTATGGGTGTTACTGCAGGTTTACACCCAAGATTTCTCATCTCTTTCAACAAGTTTAGGACATTTTCTTTGAGATTTTTTAAAAAATCTCAAAGGAAACGAAATTTTTTTGATTGAAGAATGAAAAGAGGCTAGTACTCAAAAGATACAAAGCTCCACAGAGGAATGAAAATAAATAGCTAGGTTAGGCTGGAAAGGATTATGTGCAATATTGATCGCACAACTTTGACGGCACGACTTCAACGTCAAGGCACGACTTCAACCGTAGCTATGATGGCTCCAAAATTTGGTCAAGGTGGAAGATTGGTAGCCATGAAGGCTCTAAATAAATTTAGTCAAGTGGGAAGATCGGTACCTATGAACACTCCGAATAAATTTGGTCAATATGAAAGATCGGAAGCTATGAAAGACTATGCCAGTTAATCACACCTCGTAGAAAAGAATGACTCAGATACCCGGTTGCAAGTGTAAAATTATCTTGCTGCTTTTTTATACTTAATTTTCAAAGTGGTACACCTTTCTTTTTATAATAAAGAAAGGCTAACTAAAGTACCTAGATACTACCTAGAAAATAGGTAACAATGTAGAAAAATAAATCCCTAACAATATAGACATTATTAAGGCTAGGATTTAAATAGATATAATTAATGTCATTCCTTCAACAAAAAACATAAACCAACAATAAAAAAGAATTATAATTCCTCCCAATAAGTTTCGTATCAAATCTGGAGTTCCCGTGTTCAAACGTCTATAATGTCATTTTCTTTCCATGTAATGTTGACAATTATTTGTTAAATTTTCTACTAATTTTCAAGCCCATAAGTGAGAGAGTTGTTAGAATATGAATGGAGTATTAGGAATGTGAATTCACAAAGGCCCCCATTGGCAAGGGCTTGGGGTCCAAGGTTCACTCAGATACCGAGCCTTGGGCACCGGGTGCCCCTTGGTATAAGGGAGCAAAGCTCCGACTCCTGGTTATCGGAAAAAAATATGAACTCACACGTGAGTAGTAGTAAGAATATAAATAGAATATTAAGAATGTGATTAATAATTAAATTTCCTATAACCCAACAACTTAGCCAAGTTTATAACCTTGTACTAGTTAAGATTAGGCAAAGTTGCCCAAGTAACCGAGGACTGAAAATGTAGCAGTAGATTGCCTTTCTATCATTACTAGTTTGTGAAGTCTTTAATTGACTAGAATGGTGCTTGATGTGCCCTAAATGCAGGAGTCGATGGAATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGTAAGTAGGTTTAAGAGACTATTGCCTTCCCGGAGAATTGTGGACTACAATTTATTCAATCACATGTATGCTTACCGTCTGATATTTTATTGTTATTGAGAATAATGGTAAAGGTGAAAACCTGCTTTCTACGGCGGACTAATCTATTCAAGGAAATAATAAATCCTATTAATCAGGAGATATTTTTGAAAGAAAAATTGTAAGTCAGGAGTTCTTAAGTATTGAAGTTATAAATGACTATAAATCTTCAACTACATTTCAAATAAGCGTGAACATATAAGATGAACAATAAAATATTTTGGCAATTTGATTTAGATTTCTCCTGGCATCTTTTTGAAACTGCTGTTCAACTTGTTATCTTATGTTCTACACATTTTTTTTTCTTTCTTCTTTTTGATTTGATTGGAAAAAAAAAAAAAAAAAAAAAATTCATGTGAAAAACATATAAAAGAATGGAAATGGAGGAGAAATCTTCTAACTAAAACATAAAAAATATTTAAAACACTTCCTGCGAATGCTTTGAATAAATTTTGTTATTTTTTTTTCTTTTTAATTTTTGGAAGTTAGTCTTCCATTGTAATTTTTCATTTTATCCGTGGAAACTTGTGTTTCCTTGTTAATTAAGACGCTTTCTGCAAAAGTAATCCTAAAATATAAAGTTTTTTTTTTTTTCATTTTGTGACCACAAAAGGTTAATTAGAACTATATTGTACGGTATAAAATGAGAATGCCTTGGTACTATACTTTTGTGTTCCTTGATTATGTTTCTTATCTTAATCCTTTATTAACTTGTATGTTGTGCCTGAAATATGATGGTTTGCATTTACTTATTTGCAAATTCACTTGTCAGGCACATTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATAAGGAAGGTATGTCTATCAGAGTATAATAACTGGCTTCTCAACAAAAAAATGTATGAAAACACTTGTATTGTTTTCAGTGTATTTGATGCGGTAAGCATATACGATTAATGAATTTTGTTAATTTAGTCATGTTGTTCGGTGCTGGATTTGATTGCATTCAGCCTTTAAGATTTCTGGGTTGAAGTCAGACTTGTAAAACAATTCTGATAATTTATTGTTTATATACAATAATATATAATCAATTGTTTATTTACTATTCGACTAAGATTGAGAAGCAATTCTAGGAGTAGAAAACAAAGCAAAGAAGTTCTAGTAAACAAACTTATTTTTTAAATACAAAAACAAAATTATAATCTAATGGTCTTAAATAATCTCAGTTATATTTGAAGTCAGTTTGCTATATTTGTTGTTCTTTTATTCCTTTTTCTTTCTAGTTTCTGTGTAATCAGATTTGTTTATGGAAACTAGCTGGTACTAAGCTCCTTATTTCACCCTTAATGACTGAAAATTTTATTTGAAATTGGTCCTGAACTAATCCTTCACTCCTAAGATTATTAAAATATTTTTTGTATCTTGTCCCGAACAAATTATCATCCCTATTAAGATTCCAACTTAAAGGGAAAATATGAAATCTCTGGAAAATTAGTTTTCTTGATAAATCCCTATTTATTATCTCTATCTACTTCAGTAAGCATGTTTTGGGTAGAGATGCTTTTGTTAATGTAACAAAATGAAAAGGAAATTATAATGATGGACTTGTTTTACAATGATTTTAAAGGTTTAATTGTTTGTGAAAATATATGCATTGTGAAATTAAGAACGCATGCTCGGGAAATGAGCAGAAGAACGACATGGATATGAGCATGAGATACCAGTAAATTTTTCTTTTATACAGATATGTCATACTTTGTGATGCTTGCATTTACAAGATAATGCGAAAGGTGGACATATACAATACTCTAAGAACTGATCCATCAGTTACTGCAAATACAACAAATACTTGATGCTTGGTCTTTATTTTCATTCCTTTTTCTTGTAGATGTTGGTAACAAATGCTGAACAAGCTATCTTGAGCAAGGACCTGGTAATTTCACTTCAGTTTTATTAGGTTACTTGTGTTCTTGTGAAGGCAGAATTTTTTTTCCGCATATACTATATTTGGTTGATATTTTGGGGTAATTTTCTTTAGAATCATAAGGACTAGTTAACTCTTTTCTCCTTGATACTCTTTATGGTCTTAAACAATAATCAGTATTTTGAAAAGCATGCCTCGTGTCACCGTCACAAGGCGTGTTGCCCATGGTCGCATGCCCGTTACATGCTGAATCCCTTGCCTTGTACTACTTTATCGCTTTTCTCTACTGCTTTCATGTTATATAATTTATTTCTTGCTTTTTCTAAGAGTATGACATTTCGGCAAAATCATGTCTACGCCGGGTAGACATGAAATGTCCCAAAATGTATTATAGGGGATGAGACTAGTTGTCAAATTTTCCCTACCCGAAAAGTTATATACTGTTTAAGCTGTTGTTGTTGCCTTTTGCTTGAATGCTATATAACACCGTTTTCAATCTCACTCAATCTTGCTTTTAAACCCTCTTTCAAAATTTTTTTGCCCCGCTTTCTAAAACCTTCTTAAATCGGGGCTAGAGGCTTGAGTATACACGTTGCCTGGCCGTAGGCGACGCGATTCGAAAGTTGGCTTGACTCACTCAGTGGTAAGAGTGCGTGCCGTAAAAACGCCAAGTCCGCTGCCAAGAAAAAGGCGATCTCGCAAAGAGGCCCTCAACTTCTGTTTACTTCTTCATGACATTTTGTGAATAAATTAAGAAATATGTTCCAAAAACTGTTATTTTCCTAAACAAATATTCATTTAATCCCTTGGAATGGTCGACTGGTTAGGCAGGGGAATGCAATTGTAAGTAGCATGGATCCATTGTTTCTTCTTATTACATAAATGCAAAAACCTGTACTTGACTTAAATGTAAATATTGGCATCCACCATGTTATTTTAACTTGTTGCTTCATCCAAGTCATTTATGCTGCAGGCAATCTTGCGATCTGATCTTCCGCTCTATATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGGTTTGTGCTATTATTTGTAGGACATGGCACTTATTCGAATTTTACTGTTCAATCTGGTGATGCATTGCTAGAAATTATTTGAAACTTTAGGAAGGACTTAATTTGACCCACTAATGGGGTCTCAATTGGGCAAAATACTGATTTGAGAAATGAGACATTTAAGAGTAAGATTCTCATTAAAGTGTTGATATAATCAAATGTAACGCCCAAGCCCACTACTAGCAGATATTACAGATATTGTCCTCTTTGGTCTTCCCTTCAAGGTTTTTAAAACGTGTCTGCTAGGGAGAGGTTTCCACACCCTTCTCCTCCCCAACCGATGTGGGATCTTACAATCCACCCCCTCCGAGCCCAGCGTCCTTGCTGGCTCAACGCCTCGTATCCACCCCCTTTCGGGGTTCAACCTCCTCGTCGTGACCTCAACATGCACTGCTGTGCATGGTGTAATGGTATTCATATATTCAGCTATTTGTAATACGTGTGTTCAGCTTTCAAATGCTAGAAGGTGTTTAGAAGCAAGACAACTGTGAATTTTGACAGAAATTTTTGGTTGTACAGGATAATGGGGAGAAATTCACAAGCCTCTTAACTGCTATTGGTGCATATGCAGAAGGATTCTCAGCTGATCCAATTATGAGGAGAGTACTAAACTTATGGAAGAAGCTTGAAAAAAGTTAGTGATCGATGTCAATTTGCTCAGCCTCTTGTCCACTTTGTTGCATGTAGATTTGGTTTTGTTGTATAATTAGAGATGTTAGCAGAGGTTAGAAACCAAATTAGTCTCTGTTTATGAATACCCCCTTGTTCTTCGTAGCCTTTGAGCAAAGAAAATGAGAAACACAGGAAATTTCAGCTTTACATGTGATGGAGTATCCCTTGCTGTGAATCACTGAAAACTTGCTTCTTCTCCACTTGGAGCATATTGATTCTTTAGAAGATGGAGGTATTTTAAGCTTTTAAATTCATTCAAAATTTAATATTTACAATATTGAAAGATGAGACTTTTA

mRNA sequence

TACCCTCAGGGTGGGCGAAGATTGTAAACGAGGTTATATCCCCGAATTGGGCGGTGATCGAGGAGGCGCTTTGGTGGTTCTCATGGCCTGCCACCATCTTCACACTGCGACTGCTAGTGCGTCGCACATTTGCAGAAATTTTTTGAGATACATTTTCACCGCCAAATTTGCTTCTCCTCTTCGTTTCTCTTCTTCTTCTTCTTCTTCTTTGAGGATACAGTCTCCTGGTCACCATTGTTTTCCGTCTTTCTCCGTTCTGCTATCCCCGAAGGGTTACTGTAGTTCATCTAGAAGTGTAAATGCTGCGATTAACGTAGATAGCAATGCAACTTATCATGGTAGTCCGGCATCTTCTAGTAGCCAGCAAATGCTGCAAGTCCAAGATTCATTATCGAATTCACCCACATGCAAAGAAGAAACTGAAATTGATAGTCCTTCAGATGCGAGGGTCATGCTCATTGATGGCACATCAATCATTTATAGAGCATACCACAAGCTTTTGGCAAAGCTGCATCATGGCCATTTATCACATGCGGATGGCAATGGAGATTGGGTGCTAACAATATTTACTGCCATGTCACTTATTGTTGATGTTCTGGAGTTTATGCCTTCTCATGTGGCGGTTGTGTTTGATCATGATGGACATGCATATGGTCATACTTGTTATTCATCCAATGAAAATTTCATGGCAAAAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCCACACCTGATACCATAGTCCAGGGGCTCCAATACTTAAAGGCATCCATAAAGTCCATGTCCGTAAAGGTGATTGAGGTACCTGGAGTGGAGGCTGATGATGTGATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTGTAAGGTTCGTGTTGTCTCCCCTGACAAAGACTTCTTCCAGATTATATCTCCTTCATTACGTCTTTTACGAATAGCTCCACGTGGGTTTGAGATGGTTTCTTTTGGGCTGGAGGATTTTGCCGAAAAATATGGAGTTCTGGAACCTTCTCAGTTCGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGGAGTCGATGGAATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGCACATTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATAAGGAAGATGTTGGTAACAAATGCTGAACAAGCTATCTTGAGCAAGGACCTGGCAATCTTGCGATCTGATCTTCCGCTCTATATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGGATAATGGGGAGAAATTCACAAGCCTCTTAACTGCTATTGGTGCATATGCAGAAGGATTCTCAGCTGATCCAATTATGAGGAGAGTACTAAACTTATGGAAGAAGCTTGAAAAAAGTTAGTGATCGATGTCAATTTGCTCAGCCTCTTGTCCACTTTGTTGCATGTAGATTTGGTTTTGTTGTATAATTAGAGATGTTAGCAGAGGTTAGAAACCAAATTAGTCTCTGTTTATGAATACCCCCTTGTTCTTCGTAGCCTTTGAGCAAAGAAAATGAGAAACACAGGAAATTTCAGCTTTACATGTGATGGAGTATCCCTTGCTGTGAATCACTGAAAACTTGCTTCTTCTCCACTTGGAGCATATTGATTCTTTAGAAGATGGAGGTATTTTAAGCTTTTAAATTCATTCAAAATTTAATATTTACAATATTGAAAGATGAGACTTTTA

Coding sequence (CDS)

ATGGCCTGCCACCATCTTCACACTGCGACTGCTAGTGCGTCGCACATTTGCAGAAATTTTTTGAGATACATTTTCACCGCCAAATTTGCTTCTCCTCTTCGTTTCTCTTCTTCTTCTTCTTCTTCTTTGAGGATACAGTCTCCTGGTCACCATTGTTTTCCGTCTTTCTCCGTTCTGCTATCCCCGAAGGGTTACTGTAGTTCATCTAGAAGTGTAAATGCTGCGATTAACGTAGATAGCAATGCAACTTATCATGGTAGTCCGGCATCTTCTAGTAGCCAGCAAATGCTGCAAGTCCAAGATTCATTATCGAATTCACCCACATGCAAAGAAGAAACTGAAATTGATAGTCCTTCAGATGCGAGGGTCATGCTCATTGATGGCACATCAATCATTTATAGAGCATACCACAAGCTTTTGGCAAAGCTGCATCATGGCCATTTATCACATGCGGATGGCAATGGAGATTGGGTGCTAACAATATTTACTGCCATGTCACTTATTGTTGATGTTCTGGAGTTTATGCCTTCTCATGTGGCGGTTGTGTTTGATCATGATGGACATGCATATGGTCATACTTGTTATTCATCCAATGAAAATTTCATGGCAAAAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCCACACCTGATACCATAGTCCAGGGGCTCCAATACTTAAAGGCATCCATAAAGTCCATGTCCGTAAAGGTGATTGAGGTACCTGGAGTGGAGGCTGATGATGTGATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTGTAAGGTTCGTGTTGTCTCCCCTGACAAAGACTTCTTCCAGATTATATCTCCTTCATTACGTCTTTTACGAATAGCTCCACGTGGGTTTGAGATGGTTTCTTTTGGGCTGGAGGATTTTGCCGAAAAATATGGAGTTCTGGAACCTTCTCAGTTCGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGGAGTCGATGGAATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGCACATTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATAAGGAAGATGTTGGTAACAAATGCTGAACAAGCTATCTTGAGCAAGGACCTGGCAATCTTGCGATCTGATCTTCCGCTCTATATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGGATAATGGGGAGAAATTCACAAGCCTCTTAACTGCTATTGGTGCATATGCAGAAGGATTCTCAGCTGATCCAATTATGAGGAGAGTACTAAACTTATGGAAGAAGCTTGAAAAAAGTTAG

Protein sequence

MACHHLHTATASASHICRNFLRYIFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLLSPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLEKS
BLAST of Cp4.1LG10g12240 vs. Swiss-Prot
Match: DPO1_THEFI (DNA polymerase I, thermostable OS=Thermus filiformis GN=polA PE=1 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 8.0e-39
Identity = 99/275 (36.00%), Postives = 155/275 (56.36%), Query Frame = 1

Query: 122 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAV 181
           RV+L+DG  + YR ++ L         S     G+ V  ++     ++  L+     V V
Sbjct: 13  RVLLVDGHHLAYRTFYAL---------SLTTSRGEPVQMVYGFARSLLKALKEDGQAVVV 72

Query: 182 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 241
           VFD                  AK  +FRH  Y AYK+ R PTP+   + L  +K  +  +
Sbjct: 73  VFD------------------AKAPSFRHEAYEAYKAGRAPTPEDFPRQLALVKRLVDLL 132

Query: 242 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 301
            +  +E PG EADDV+GTLA ++   G +VR+++ D+DFFQ++S  + +L   P G  + 
Sbjct: 133 GLVRLEAPGYEADDVLGTLAKKAEREGMEVRILTGDRDFFQLLSEKVSVL--LPDGTLVT 192

Query: 302 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 361
               +D  EKYGV  P ++VD  +L GD+SDNIPGV GIG   A++L+  +G++ENLL++
Sbjct: 193 P---KDVQEKYGV-PPERWVDFRALTGDRSDNIPGVAGIGEKTALRLLAEWGSVENLLKN 252

Query: 362 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPL 397
           +D+V+ + +R+ +  + E   LS DLA +R+DLPL
Sbjct: 253 LDRVKPDSLRRKIEAHLEDLHLSLDLARIRTDLPL 254

BLAST of Cp4.1LG10g12240 vs. Swiss-Prot
Match: DPO1_GEOSE (DNA polymerase I OS=Geobacillus stearothermophilus GN=polA PE=1 SV=2)

HSP 1 Score: 161.8 bits (408), Expect = 1.8e-38
Identity = 104/306 (33.99%), Postives = 165/306 (53.92%), Query Frame = 1

Query: 122 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAV 181
           +++LIDG S+ YRA+  L   LH+    H +    + + +   ++      E  P+H+ V
Sbjct: 4   KLVLIDGNSVAYRAFFAL-PLLHNDKGIHTNAVYGFTMMLNKILA------EEQPTHILV 63

Query: 182 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 241
            FD                  A  +TFRH  +  YK  R  TP  + +    L+  +K+ 
Sbjct: 64  AFD------------------AGKTTFRHETFQDYKGGRQQTPPELSEQFPLLRELLKAY 123

Query: 242 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGF-EM 301
            +   E+   EADD+IGT+A R+   G  V+V+S D+D  Q+ SP + +  I  +G  ++
Sbjct: 124 RIPAYELDHYEADDIIGTMAARAEREGFAVKVISGDRDLTQLASPQVTV-EITKKGITDI 183

Query: 302 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 361
            S+  E   EKYG L P Q VD+  L+GDKSDNIPGV GIG   AV+L+ +FGT+EN+L 
Sbjct: 184 ESYTPETVVEKYG-LTPEQIVDLKGLMGDKSDNIPGVPGIGEKTAVKLLKQFGTVENVLA 243

Query: 362 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 421
            +D+++ E++++ L    + A+LSK LA +  D P   V  T  D+++K   ++ EK  +
Sbjct: 244 SIDEIKGEKLKENLRQYRDLALLSKQLAAICRDAP---VELTLDDIVYK--GEDREKVVA 277

Query: 422 LLTAIG 427
           L   +G
Sbjct: 304 LFQELG 277

BLAST of Cp4.1LG10g12240 vs. Swiss-Prot
Match: DPO1_BACCA (DNA polymerase I OS=Bacillus caldotenax GN=polA PE=1 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 4.0e-38
Identity = 103/306 (33.66%), Postives = 167/306 (54.58%), Query Frame = 1

Query: 122 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAV 181
           +++LIDG+S+ YRA+  L   LH+    H +    + + +   ++      E  P+H+ V
Sbjct: 4   KLVLIDGSSVAYRAFFAL-PLLHNDKGIHTNAVYGFTMMLNKILA------EEEPTHMLV 63

Query: 182 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 241
            FD                  A  +TFRH  +  YK  R  TP  + +    L+  +++ 
Sbjct: 64  AFD------------------AGKTTFRHEAFQEYKGGRQQTPPELSEQFPLLRELLRAY 123

Query: 242 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGF-EM 301
            +   E+   EADD+IGTLA R+   G +V+V+S D+D  Q+ SP + +  I  +G  ++
Sbjct: 124 RIPAYELENYEADDIIGTLAARAEQEGFEVKVISGDRDLTQLASPHVTV-DITKKGITDI 183

Query: 302 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 361
             +  E   EKYG L P Q VD+  L+GDKSDNIPGV GIG   AV+L+ +FGT+EN+L 
Sbjct: 184 EPYTPEAVREKYG-LTPEQIVDLKGLMGDKSDNIPGVPGIGEKTAVKLLRQFGTVENVLA 243

Query: 362 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 421
            +D+++ E++++ L  + E A+LSK LA +R D P   V  +  D+ ++   ++ EK  +
Sbjct: 244 SIDEIKGEKLKETLRQHREMALLSKKLAAIRRDAP---VELSLDDIAYQ--GEDREKVVA 277

Query: 422 LLTAIG 427
           L   +G
Sbjct: 304 LFKELG 277

BLAST of Cp4.1LG10g12240 vs. Swiss-Prot
Match: DPO1T_THET8 (DNA polymerase I, thermostable OS=Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579) GN=polA PE=3 SV=2)

HSP 1 Score: 155.6 bits (392), Expect = 1.3e-36
Identity = 100/276 (36.23%), Postives = 152/276 (55.07%), Query Frame = 1

Query: 122 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFT-AMSLIVDVLEFMPSHVA 181
           RV+L+DG  + YR +  L               G+ V  ++  A SL+  + E     V 
Sbjct: 13  RVLLVDGHHLAYRTFFALKGL--------TTSRGEPVQAVYGFAKSLLKALKEDGYKAVF 72

Query: 182 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 241
           VVFD                  AK  +FRH  Y AYK+ R PTP+   + L  +K  +  
Sbjct: 73  VVFD------------------AKAPSFRHEAYEAYKAGRAPTPEDFPRQLALIKELVDL 132

Query: 242 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 301
           +    +EVPG EADDV+ TLA ++   G +VR+++ D+D +Q++S  + +L   P G  +
Sbjct: 133 LGFTRLEVPGYEADDVLATLAKKAEKEGYEVRILTADRDLYQLVSDRVAVLH--PEGHLI 192

Query: 302 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 361
                E   EKYG L P Q+VD  +LVGD SDN+PGV GIG   A++L+  +G+LENLL+
Sbjct: 193 TP---EWLWEKYG-LRPEQWVDFRALVGDPSDNLPGVKGIGEKTALKLLKEWGSLENLLK 252

Query: 362 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPL 397
           ++D+V+ E +R+ +  + E   LS +L+ +R+DLPL
Sbjct: 253 NLDRVKPENVREKIKAHLEDLRLSLELSRVRTDLPL 256

BLAST of Cp4.1LG10g12240 vs. Swiss-Prot
Match: DPO1F_THETH (DNA polymerase I, thermostable OS=Thermus thermophilus GN=polA PE=1 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 1.7e-36
Identity = 100/275 (36.36%), Postives = 146/275 (53.09%), Query Frame = 1

Query: 122 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAV 181
           RV+L+DG  + YR +  L               G+ V  ++     ++  L+     V V
Sbjct: 12  RVLLVDGHHLAYRTFFALKGL--------TTSRGEPVQAVYGFAKSLLKALKEDGDVVVV 71

Query: 182 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 241
           VFD                  AK  +FRH  Y AYK+ R PTP+   + L  +K  +  +
Sbjct: 72  VFD------------------AKAPSFRHEAYEAYKAGRAPTPEDFPRQLALIKELVDLL 131

Query: 242 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 301
            +  +EVPG EADDV+ TLA R+   G +VR+++ D+D +Q++S  + +L   P G+ + 
Sbjct: 132 GLVRLEVPGFEADDVLATLAKRAEKEGYEVRILTADRDLYQLLSERIAILH--PEGYLIT 191

Query: 302 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 361
              L    EKYG L P Q+VD  +L GD SDNIPGV GIG   A +LI  +G+LENL QH
Sbjct: 192 PAWL---YEKYG-LRPEQWVDYRALAGDPSDNIPGVKGIGEKTAQRLIREWGSLENLFQH 251

Query: 362 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPL 397
           +DQV+   +R+ L    E   LS+ L+ + +DLPL
Sbjct: 252 LDQVKPS-LREKLQAGMEALALSRKLSQVHTDLPL 253

BLAST of Cp4.1LG10g12240 vs. TrEMBL
Match: D7T2D6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0094g00430 PE=4 SV=1)

HSP 1 Score: 567.0 bits (1460), Expect = 2.0e-158
Identity = 289/392 (73.72%), Postives = 335/392 (85.46%), Query Frame = 1

Query: 59  LLSPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSP 118
           +LS KG C+ S S++++I+  ++   +G+   SS  +    Q +  +S   KE     S 
Sbjct: 49  ILSRKGCCTLSNSLDSSIHEVAHTISYGNTTISSKSERKLCQGAFVDSVDHKERKMDISS 108

Query: 119 SDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSH 178
           S+ RVMLIDGTSIIYRAY+KLLAKLHHG+LSHADGNGDWVLTIF A+SLIVDVL+F+PSH
Sbjct: 109 SNGRVMLIDGTSIIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFAALSLIVDVLDFIPSH 168

Query: 179 VAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASI 238
           VAVVFDH+G  +GHT  SS E+ MAKG  FRHT YP+YKSNRPPTPDTIVQGLQYLKASI
Sbjct: 169 VAVVFDHNGIPFGHTSISSKESIMAKGLNFRHTLYPSYKSNRPPTPDTIVQGLQYLKASI 228

Query: 239 KSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGF 298
           K+MS+KVIEVPGVEADDVIGTL++RSV AG KVRVVSPDKDFFQI+SPSLRLLRIAPRGF
Sbjct: 229 KAMSIKVIEVPGVEADDVIGTLSVRSVDAGYKVRVVSPDKDFFQILSPSLRLLRIAPRGF 288

Query: 299 EMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENL 358
           EM SFG+EDFA++YG LEPSQFVDV+SLVGDKSDNIPGV+GIGNV+AVQLIT+FGTLENL
Sbjct: 289 EMTSFGMEDFAKRYGNLEPSQFVDVISLVGDKSDNIPGVEGIGNVHAVQLITKFGTLENL 348

Query: 359 LQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKF 418
           LQ VDQV++ERIRK L++ A+QA+LSK+LA+LR DLP YMVPFTT DL+F KPEDNGEKF
Sbjct: 349 LQCVDQVQEERIRKALISGADQAVLSKNLALLRCDLPFYMVPFTTEDLIFTKPEDNGEKF 408

Query: 419 TSLLTAIGAYAEGFSADPIMRRVLNLWKKLEK 451
           TSLL AI AYAEGFSADPI+RR   LWKKLEK
Sbjct: 409 TSLLNAISAYAEGFSADPIIRRAFYLWKKLEK 440

BLAST of Cp4.1LG10g12240 vs. TrEMBL
Match: A0A061EJA7_THECC (5\'-3\' exonuclease family protein isoform 1 OS=Theobroma cacao GN=TCM_019883 PE=4 SV=1)

HSP 1 Score: 557.4 bits (1435), Expect = 1.6e-155
Identity = 289/402 (71.89%), Postives = 333/402 (82.84%), Query Frame = 1

Query: 53  FPSFSVLLSP-----KGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSP 112
           F  F V+  P     KGYCS S ++N       +AT HG+   SS ++ L  Q++  ++ 
Sbjct: 38  FKKFYVIRPPPCQTIKGYCSLSYTLNTLPGA-RHATSHGNAVISSKKEQLLHQEAALDTS 97

Query: 113 TCKEETEIDSPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSL 172
             +E     + S+ RVMLIDGTS+IYRAY+KLLAKLHHG+LSHADGNGDWVLTIFTA+SL
Sbjct: 98  NLQERVVNANYSNNRVMLIDGTSVIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFTALSL 157

Query: 173 IVDVLEFMPSHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTI 232
           I+DVLEF+PSHVAVVFDHDG  +GHT  SS EN MAKG  FRHT YP+YKSNRPPTPDTI
Sbjct: 158 IIDVLEFVPSHVAVVFDHDGIPFGHTSISSKENVMAKGLNFRHTLYPSYKSNRPPTPDTI 217

Query: 233 VQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPS 292
           VQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA RSV AG KVRVVSPDKDFFQI+SPS
Sbjct: 218 VQGLQYLKASIKAMSIKVIEVPGVEADDVIGTLAARSVDAGFKVRVVSPDKDFFQILSPS 277

Query: 293 LRLLRIAPRGFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQ 352
           LRLLRIAPRG+EMVSFGLEDF+++YG L+PSQFVD+++L+GD+ DNIPGVDGIGNV+AVQ
Sbjct: 278 LRLLRIAPRGYEMVSFGLEDFSKRYGDLKPSQFVDMVALMGDRCDNIPGVDGIGNVHAVQ 337

Query: 353 LITRFGTLENLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLL 412
           LI++FGTLENLLQ VDQVE + IRK L  NA+QA+LSK+LA+LR DLP YM PF T DL 
Sbjct: 338 LISKFGTLENLLQCVDQVEVDHIRKALKGNADQALLSKNLAMLRCDLPFYMAPFATTDLT 397

Query: 413 FKKPEDNGEKFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLE 450
           FKKPEDNGEKFTSLLTAI AYAEGFSADPI+RR   LWKKLE
Sbjct: 398 FKKPEDNGEKFTSLLTAISAYAEGFSADPIIRRAFYLWKKLE 438

BLAST of Cp4.1LG10g12240 vs. TrEMBL
Match: A0A059DB23_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A00006 PE=4 SV=1)

HSP 1 Score: 555.8 bits (1431), Expect = 4.7e-155
Identity = 286/390 (73.33%), Postives = 323/390 (82.82%), Query Frame = 1

Query: 68  SSRSVNAAINVDSNATYHGSPASSSSQQMLQV-------QDSLSNSPTCKEETEIDSPSD 127
           SSR  +  +   S +T HG    +S   ++         QD+L +     E      PS 
Sbjct: 43  SSRKGHYVLAKCSFSTLHGVATETSRNSVIPSKTSPSVGQDALLDQVNQGEREAKADPSG 102

Query: 128 ARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVA 187
            RVMLIDGTS+IYRAY+KLLA+LHHGHL HADGNGDWVLTI TA+SLI+DVLEF PSHVA
Sbjct: 103 GRVMLIDGTSVIYRAYYKLLARLHHGHLPHADGNGDWVLTIVTALSLIIDVLEFGPSHVA 162

Query: 188 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 247
           VVFDHDG  +GHT   S E+FMAKG  FRHT YP YKSNRPPTPDTIVQGLQYLKASIK+
Sbjct: 163 VVFDHDGIPFGHTFNQSKESFMAKGLNFRHTLYPTYKSNRPPTPDTIVQGLQYLKASIKA 222

Query: 248 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 307
           MS+KVIEVPGVEADDVIGTLA+RSV AG KVRVVSPDKDFFQI+SPSLRLLRIAPRG +M
Sbjct: 223 MSIKVIEVPGVEADDVIGTLAVRSVEAGYKVRVVSPDKDFFQILSPSLRLLRIAPRGLDM 282

Query: 308 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 367
           VSFG+EDFA+KYG L+PSQFVDV+SLVGDK DNIPGV+GIGNV+AVQLIT+FGTLENLLQ
Sbjct: 283 VSFGMEDFAKKYGALDPSQFVDVVSLVGDKCDNIPGVEGIGNVHAVQLITKFGTLENLLQ 342

Query: 368 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 427
            VDQVE+ERIRK L+  A+ AILSK+LA++R+DLP YMVPFTT DL FKKPED+GEKFTS
Sbjct: 343 CVDQVEEERIRKALIAQADNAILSKNLALIRTDLPFYMVPFTTEDLAFKKPEDDGEKFTS 402

Query: 428 LLTAIGAYAEGFSADPIMRRVLNLWKKLEK 451
           LL AIGAYAEGFSADPI+RR LNLWKKLE+
Sbjct: 403 LLKAIGAYAEGFSADPIIRRALNLWKKLER 432

BLAST of Cp4.1LG10g12240 vs. TrEMBL
Match: A0A0D2NBI5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1)

HSP 1 Score: 548.1 bits (1411), Expect = 9.9e-153
Identity = 278/387 (71.83%), Postives = 324/387 (83.72%), Query Frame = 1

Query: 63  KGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDAR 122
           K YCS S ++++ +  D +   HG+   SS ++ +  Q++       +E       S+ R
Sbjct: 53  KQYCSLSGNLSSTVPGD-HPIPHGNAVISSKKEQIFHQEAALGRANLQETVVNAKSSNGR 112

Query: 123 VMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAVV 182
           VMLIDGTS+IYRAY+KLLAKLHHG+LSHADGNGDWVLTIFTA+SLI+DVLEF+PSHVAVV
Sbjct: 113 VMLIDGTSVIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFTALSLIIDVLEFVPSHVAVV 172

Query: 183 FDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMS 242
           FDHDG  +GHT  SS EN M KG  FRHT +P+YKSNRPPTPDTIVQGLQYLKASIK+MS
Sbjct: 173 FDHDGIPFGHTSISSKENVMGKGLNFRHTLFPSYKSNRPPTPDTIVQGLQYLKASIKAMS 232

Query: 243 VKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVS 302
           +KVIEVPGVEADDVIGTLA RSV  G KVRVVSPDKDFFQI+ PSLRLLRIAPRG+EMVS
Sbjct: 233 IKVIEVPGVEADDVIGTLAARSVDEGFKVRVVSPDKDFFQILCPSLRLLRIAPRGYEMVS 292

Query: 303 FGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQHV 362
           FG+EDF+++YG L+PSQFVDV+SLVGD+ DNIPGVDGIGNV+AVQLIT+FGTLENLL+ V
Sbjct: 293 FGMEDFSKRYGDLKPSQFVDVVSLVGDRCDNIPGVDGIGNVHAVQLITKFGTLENLLKCV 352

Query: 363 DQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSLL 422
           D+VE + IRK L+ NA+QA+LSK+LA+LR DLP YMVPF+TRDL F KPEDNGEKFTSLL
Sbjct: 353 DEVEVDHIRKALIANADQAVLSKNLAMLRCDLPFYMVPFSTRDLTFNKPEDNGEKFTSLL 412

Query: 423 TAIGAYAEGFSADPIMRRVLNLWKKLE 450
            AI AYAEGFSADPI+RR   LWKKLE
Sbjct: 413 NAISAYAEGFSADPIIRRAFYLWKKLE 438

BLAST of Cp4.1LG10g12240 vs. TrEMBL
Match: A0A0D2RDM6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 2.2e-152
Identity = 277/385 (71.95%), Postives = 323/385 (83.90%), Query Frame = 1

Query: 65  YCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDARVM 124
           YCS S ++++ +  D +   HG+   SS ++ +  Q++       +E       S+ RVM
Sbjct: 54  YCSLSGNLSSTVPGD-HPIPHGNAVISSKKEQIFHQEAALGRANLQETVVNAKSSNGRVM 113

Query: 125 LIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAVVFD 184
           LIDGTS+IYRAY+KLLAKLHHG+LSHADGNGDWVLTIFTA+SLI+DVLEF+PSHVAVVFD
Sbjct: 114 LIDGTSVIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFTALSLIIDVLEFVPSHVAVVFD 173

Query: 185 HDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVK 244
           HDG  +GHT  SS EN M KG  FRHT +P+YKSNRPPTPDTIVQGLQYLKASIK+MS+K
Sbjct: 174 HDGIPFGHTSISSKENVMGKGLNFRHTLFPSYKSNRPPTPDTIVQGLQYLKASIKAMSIK 233

Query: 245 VIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVSFG 304
           VIEVPGVEADDVIGTLA RSV  G KVRVVSPDKDFFQI+ PSLRLLRIAPRG+EMVSFG
Sbjct: 234 VIEVPGVEADDVIGTLAARSVDEGFKVRVVSPDKDFFQILCPSLRLLRIAPRGYEMVSFG 293

Query: 305 LEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQHVDQ 364
           +EDF+++YG L+PSQFVDV+SLVGD+ DNIPGVDGIGNV+AVQLIT+FGTLENLL+ VD+
Sbjct: 294 MEDFSKRYGDLKPSQFVDVVSLVGDRCDNIPGVDGIGNVHAVQLITKFGTLENLLKCVDE 353

Query: 365 VEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSLLTA 424
           VE + IRK L+ NA+QA+LSK+LA+LR DLP YMVPF+TRDL F KPEDNGEKFTSLL A
Sbjct: 354 VEVDHIRKALIANADQAVLSKNLAMLRCDLPFYMVPFSTRDLTFNKPEDNGEKFTSLLNA 413

Query: 425 IGAYAEGFSADPIMRRVLNLWKKLE 450
           I AYAEGFSADPI+RR   LWKKLE
Sbjct: 414 ISAYAEGFSADPIIRRAFYLWKKLE 437

BLAST of Cp4.1LG10g12240 vs. TAIR10
Match: AT3G52050.3 (AT3G52050.3 5'-3' exonuclease family protein)

HSP 1 Score: 513.8 bits (1322), Expect = 1.0e-145
Identity = 267/395 (67.59%), Postives = 319/395 (80.76%), Query Frame = 1

Query: 57  SVLLSPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEE--TE 116
           S+  S K YCSS      A++  SN    GS  +S S+    V       P   EE    
Sbjct: 59  SLARSAKYYCSS-----VAVSEFSNEAASGSTLTSISED---VTPQSIKYPFKSEERVAS 118

Query: 117 IDSPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEF 176
             + S+ RVMLIDGTSIIYRAY+KLLA+L+HGHL+HADGN DWVLTIF+++SL++DVL+F
Sbjct: 119 TAASSNGRVMLIDGTSIIYRAYYKLLARLNHGHLAHADGNADWVLTIFSSLSLLIDVLKF 178

Query: 177 MPSHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYL 236
           +PSHVAVVFDHDG  YG T  SS     AKG  FRHT YPAYKSNRPPTPDTIVQGLQYL
Sbjct: 179 LPSHVAVVFDHDGVPYGTTSNSSTGYRSAKGMNFRHTLYPAYKSNRPPTPDTIVQGLQYL 238

Query: 237 KASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIA 296
           KASIK+MS+KVIEVPGVEADDVIGTLA+RS++AG KVRVVSPDKDFFQI+SPSLRLLR+ 
Sbjct: 239 KASIKAMSIKVIEVPGVEADDVIGTLAMRSISAGFKVRVVSPDKDFFQILSPSLRLLRLT 298

Query: 297 PRGFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGT 356
           PRG EM SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIPGVDGIGNV+AV+LI+RFGT
Sbjct: 299 PRGSEMASFGMEDFAKKFGNLEPAQFVDIIALAGDKSDNIPGVDGIGNVHAVELISRFGT 358

Query: 357 LENLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDN 416
           LENLLQ VD++++ +I++ L+ +A+QAILSK LA+LRSDLP Y+VPF T+DL FKKPEDN
Sbjct: 359 LENLLQSVDEIKEGKIKESLIASADQAILSKKLALLRSDLPDYIVPFDTKDLTFKKPEDN 418

Query: 417 GEKFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLE 450
           GEK +SLL AI  YAEGFSADP++RR   LW+KLE
Sbjct: 419 GEKLSSLLIAIADYAEGFSADPVIRRAFRLWEKLE 445

BLAST of Cp4.1LG10g12240 vs. TAIR10
Match: AT1G34380.2 (AT1G34380.2 5'-3' exonuclease family protein)

HSP 1 Score: 68.6 bits (166), Expect = 1.2e-11
Identity = 80/340 (23.53%), Postives = 144/340 (42.35%), Query Frame = 1

Query: 24  IFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLLSPKGYCSSSRSVNAAINVDSNAT 83
           + T  F  P    S S+ S+    P              K   SSS S ++  +V+   T
Sbjct: 1   MITVGFIQPNSLFSFSTKSIDKTQPSR-----------TKWVSSSSSSFSSHSSVE---T 60

Query: 84  YHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDARVMLIDGTSIIYRAYHKLLAKL 143
           +H     + + Q+LQ      N+   +++ +       RV  +D + + Y          
Sbjct: 61  FH----RTGNVQVLQKDVLCGNNEEIRKKNK-------RVFFLDVSPLCYE--------- 120

Query: 144 HHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVAVVFDHDGHAYGHTCYSSNENFMA 203
             G+   +   G W+   F+ +SL   V       +AV+   +G+        S   + A
Sbjct: 121 --GNKPSSQAFGHWISLFFSQVSLTDPV-------IAVIDGEEGNQRRRELLPS---YKA 180

Query: 204 KGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALR 263
              +  H RY    S RP          Q++   ++  +V V+ + G EADDV+ TL  +
Sbjct: 181 HRKSPNHGRY----SKRPH---------QFVDEVLRKCNVPVVRIEGHEADDVVATLMEQ 240

Query: 264 SVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVSFGLEDFAEKYGVLEPSQFVDV 323
           +V  G +  + SPDKDF Q+IS +++++           + L+ +  +Y   +P   +  
Sbjct: 241 AVQRGYRAVIASPDKDFKQLISENVQIVIPLADLRRWSFYTLKHYHAQYN-CDPQSDLSF 280

Query: 324 MSLVGDKSDNIPG----VDGIGNVNAVQLITRFGTLENLL 360
             ++GD+ D +PG    V   G   A++L+ + G+LE+LL
Sbjct: 301 RCIMGDEVDGVPGIQHMVPAFGRKTAMKLVRKHGSLESLL 280

BLAST of Cp4.1LG10g12240 vs. NCBI nr
Match: gi|449453197|ref|XP_004144345.1| (PREDICTED: uncharacterized protein LOC101222649 isoform X1 [Cucumis sativus])

HSP 1 Score: 743.8 bits (1919), Expect = 1.8e-211
Identity = 384/451 (85.14%), Postives = 410/451 (90.91%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLRYIFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLL 60
           MA HHLHTATASASHICRNFL +IFT+KF  P RFS+SSS   RI     H FPSFS+LL
Sbjct: 19  MASHHLHTATASASHICRNFLGFIFTSKFPHPFRFSTSSS---RI-----HSFPSFSLLL 78

Query: 61  SPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSD 120
           SPKGYCSSS S+N+A  +D+  TYHGS AS+  Q M+Q QDSLSN  T KE+T ID+P+D
Sbjct: 79  SPKGYCSSSGSINSANTMDTVPTYHGSSASTRCQPMVQFQDSLSNPLTFKEDTGIDNPAD 138

Query: 121 ARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVA 180
           ARVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTA+SLIVDVLE MPSHVA
Sbjct: 139 ARVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEIMPSHVA 198

Query: 181 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 240
           VVFDHDGH YGHT  SSNENFM+KGSTFRHT YPAYKSNR PTPDT+VQGLQYLKASIKS
Sbjct: 199 VVFDHDGHPYGHTYISSNENFMSKGSTFRHTIYPAYKSNRAPTPDTVVQGLQYLKASIKS 258

Query: 241 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 300
           MS+KVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EM
Sbjct: 259 MSIKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEM 318

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
           VSFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ
Sbjct: 319 VSFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 378

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 420
           HVDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTS
Sbjct: 379 HVDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTS 438

Query: 421 LLTAIGAYAEGFSADPIMRRVLNLWKKLEKS 452
           LLTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LLTAIGAYAERFSADPIIRRVLYLWKKLDKN 461

BLAST of Cp4.1LG10g12240 vs. NCBI nr
Match: gi|659069411|ref|XP_008449676.1| (PREDICTED: uncharacterized protein LOC103491482 isoform X1 [Cucumis melo])

HSP 1 Score: 736.9 bits (1901), Expect = 2.1e-209
Identity = 383/451 (84.92%), Postives = 412/451 (91.35%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLRYIFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLL 60
           MA HHLHTATASASHICRNFL ++FT+KF  P RFS+SSS   RI     H FPS S+LL
Sbjct: 19  MASHHLHTATASASHICRNFLGFVFTSKFPVPFRFSTSSS---RI-----HSFPS-SLLL 78

Query: 61  SPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSD 120
           SPKGYCSSS S+N++  +D+ ATYHGS AS+  Q M+Q QDSLSNS T KE+T ID+P+D
Sbjct: 79  SPKGYCSSSGSINSSNIIDTIATYHGSSASTRRQSMVQFQDSLSNSLTFKEDTGIDNPAD 138

Query: 121 ARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVA 180
           ARVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTA+SLIVDVLEFMPSHVA
Sbjct: 139 ARVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVA 198

Query: 181 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 240
           VVFDHDG++YGHT  SSNENF++KGSTFRHT YPAYKSNR P PDTIVQGLQYLKASIKS
Sbjct: 199 VVFDHDGYSYGHTYISSNENFVSKGSTFRHTIYPAYKSNRAPVPDTIVQGLQYLKASIKS 258

Query: 241 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 300
           MSVKVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EM
Sbjct: 259 MSVKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEM 318

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
           VSFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ
Sbjct: 319 VSFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 378

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 420
           HVDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTS
Sbjct: 379 HVDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTS 438

Query: 421 LLTAIGAYAEGFSADPIMRRVLNLWKKLEKS 452
           LLTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LLTAIGAYAERFSADPIIRRVLYLWKKLDKN 460

BLAST of Cp4.1LG10g12240 vs. NCBI nr
Match: gi|778694790|ref|XP_011653865.1| (PREDICTED: uncharacterized protein LOC101222649 isoform X2 [Cucumis sativus])

HSP 1 Score: 703.4 bits (1814), Expect = 2.6e-199
Identity = 370/451 (82.04%), Postives = 395/451 (87.58%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLRYIFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLL 60
           MA HHLHTATASASHICRNFL +IFT+KF  P RFS+SSS   RI     H FPSFS+LL
Sbjct: 19  MASHHLHTATASASHICRNFLGFIFTSKFPHPFRFSTSSS---RI-----HSFPSFSLLL 78

Query: 61  SPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSD 120
           SPKGYCSSS S+N+A  +D+  TYHGS AS+  Q M+Q QDSLSN  T KE+T ID+P+D
Sbjct: 79  SPKGYCSSSGSINSANTMDTVPTYHGSSASTRCQPMVQFQDSLSNPLTFKEDTGIDNPAD 138

Query: 121 ARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVA 180
           ARVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTA+SLIVDVLE MPSHVA
Sbjct: 139 ARVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEIMPSHVA 198

Query: 181 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 240
           VVFDHDG                  STFRHT YPAYKSNR PTPDT+VQGLQYLKASIKS
Sbjct: 199 VVFDHDG------------------STFRHTIYPAYKSNRAPTPDTVVQGLQYLKASIKS 258

Query: 241 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 300
           MS+KVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EM
Sbjct: 259 MSIKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEM 318

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
           VSFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ
Sbjct: 319 VSFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 378

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 420
           HVDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTS
Sbjct: 379 HVDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTS 438

Query: 421 LLTAIGAYAEGFSADPIMRRVLNLWKKLEKS 452
           LLTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LLTAIGAYAERFSADPIIRRVLYLWKKLDKN 443

BLAST of Cp4.1LG10g12240 vs. NCBI nr
Match: gi|659069413|ref|XP_008449686.1| (PREDICTED: uncharacterized protein LOC103491482 isoform X2 [Cucumis melo])

HSP 1 Score: 699.1 bits (1803), Expect = 5.0e-198
Identity = 371/451 (82.26%), Postives = 396/451 (87.80%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLRYIFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLL 60
           MA HHLHTATASASHICRNFL ++FT+KF  P RFS+SSS   RI     H FPS S+LL
Sbjct: 19  MASHHLHTATASASHICRNFLGFVFTSKFPVPFRFSTSSS---RI-----HSFPS-SLLL 78

Query: 61  SPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSD 120
           SPKGYCSSS S+N++  +D+ ATYHGS AS+  Q M+Q QDSLSNS T KE+T ID+P+D
Sbjct: 79  SPKGYCSSSGSINSSNIIDTIATYHGSSASTRRQSMVQFQDSLSNSLTFKEDTGIDNPAD 138

Query: 121 ARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMPSHVA 180
           ARVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTA+SLIVDVLEFMPSHVA
Sbjct: 139 ARVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVA 198

Query: 181 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 240
           VVFDHDG                  STFRHT YPAYKSNR P PDTIVQGLQYLKASIKS
Sbjct: 199 VVFDHDG------------------STFRHTIYPAYKSNRAPVPDTIVQGLQYLKASIKS 258

Query: 241 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 300
           MSVKVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EM
Sbjct: 259 MSVKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEM 318

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
           VSFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ
Sbjct: 319 VSFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 378

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 420
           HVDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTS
Sbjct: 379 HVDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTS 438

Query: 421 LLTAIGAYAEGFSADPIMRRVLNLWKKLEKS 452
           LLTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LLTAIGAYAERFSADPIIRRVLYLWKKLDKN 442

BLAST of Cp4.1LG10g12240 vs. NCBI nr
Match: gi|694384553|ref|XP_009368167.1| (PREDICTED: uncharacterized protein LOC103957700 isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 582.0 bits (1499), Expect = 8.8e-163
Identity = 318/453 (70.20%), Postives = 362/453 (79.91%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLRYIFTAKFASPLRFSSSSSSSLRIQSPGHHCFPSFSVLL 60
           MAC H H+     +     +    FT +F    R  SS      IQSP       FS L+
Sbjct: 1   MACCHSHSWLLHNTQSLSLWKWRSFTLRFVGTTRRRSS------IQSPNSL---RFSPLI 60

Query: 61  SPKGYCSS-SRSVNAAI-NVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCK--EETEID 120
           S KGYC++ +RS N+A   V S +    + AS S  + +  +D L  S   K  E+T   
Sbjct: 61  SHKGYCNTFNRSFNSASPGVVSESGDAAAAASYSKSEGVVSRDMLLASTLFKREEKTVNS 120

Query: 121 SPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTAMSLIVDVLEFMP 180
           +PSD RVMLIDGTSIIYRAY+KLLAKLHHGHLSHADGNGDWVLTIF+A+SLI+DVL F+P
Sbjct: 121 NPSDGRVMLIDGTSIIYRAYYKLLAKLHHGHLSHADGNGDWVLTIFSALSLIIDVLMFIP 180

Query: 181 SHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKA 240
           SHVAVVFDHDG ++G TC SSNE+F  KG  FRHT YPAYKSNRPPTPDTIVQGLQYLKA
Sbjct: 181 SHVAVVFDHDGVSFGQTCNSSNESFKGKGLNFRHTLYPAYKSNRPPTPDTIVQGLQYLKA 240

Query: 241 SIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPR 300
           SIK+MS+KVIEVPGVEADDVIGTLA+RSV +G KVRVVSPDKDFFQI+SPSLRLLRIAPR
Sbjct: 241 SIKAMSIKVIEVPGVEADDVIGTLAVRSVDSGYKVRVVSPDKDFFQILSPSLRLLRIAPR 300

Query: 301 GFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLE 360
           GF+MVSFG+EDFAEKYG L+PSQFVDV+SLVGDKSDNIPGV GIGNV+AVQLIT+FGTLE
Sbjct: 301 GFDMVSFGMEDFAEKYGSLQPSQFVDVISLVGDKSDNIPGVHGIGNVHAVQLITKFGTLE 360

Query: 361 NLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGE 420
           NLLQ VDQVE+ERIRK L+ NA+QA+LSK+LA+LRSDLPLYMVPF T+DL F+KPEDNGE
Sbjct: 361 NLLQCVDQVEEERIRKALIENADQALLSKNLALLRSDLPLYMVPFATKDLTFQKPEDNGE 420

Query: 421 KFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLE 450
           KFTSLLTAI AYAEGFSADPI+RR   LW KLE
Sbjct: 421 KFTSLLTAISAYAEGFSADPIIRRAFYLWNKLE 444

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
DPO1_THEFI8.0e-3936.00DNA polymerase I, thermostable OS=Thermus filiformis GN=polA PE=1 SV=1[more]
DPO1_GEOSE1.8e-3833.99DNA polymerase I OS=Geobacillus stearothermophilus GN=polA PE=1 SV=2[more]
DPO1_BACCA4.0e-3833.66DNA polymerase I OS=Bacillus caldotenax GN=polA PE=1 SV=1[more]
DPO1T_THET81.3e-3636.23DNA polymerase I, thermostable OS=Thermus thermophilus (strain HB8 / ATCC 27634 ... [more]
DPO1F_THETH1.7e-3636.36DNA polymerase I, thermostable OS=Thermus thermophilus GN=polA PE=1 SV=1[more]
Match NameE-valueIdentityDescription
D7T2D6_VITVI2.0e-15873.72Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0094g00430 PE=4 SV=... [more]
A0A061EJA7_THECC1.6e-15571.895\'-3\' exonuclease family protein isoform 1 OS=Theobroma cacao GN=TCM_019883 PE... [more]
A0A059DB23_EUCGR4.7e-15573.33Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A00006 PE=4 SV=1[more]
A0A0D2NBI5_GOSRA9.9e-15371.83Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1[more]
A0A0D2RDM6_GOSRA2.2e-15271.95Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G52050.31.0e-14567.59 5'-3' exonuclease family protein[more]
AT1G34380.21.2e-1123.53 5'-3' exonuclease family protein[more]
Match NameE-valueIdentityDescription
gi|449453197|ref|XP_004144345.1|1.8e-21185.14PREDICTED: uncharacterized protein LOC101222649 isoform X1 [Cucumis sativus][more]
gi|659069411|ref|XP_008449676.1|2.1e-20984.92PREDICTED: uncharacterized protein LOC103491482 isoform X1 [Cucumis melo][more]
gi|778694790|ref|XP_011653865.1|2.6e-19982.04PREDICTED: uncharacterized protein LOC101222649 isoform X2 [Cucumis sativus][more]
gi|659069413|ref|XP_008449686.1|5.0e-19882.26PREDICTED: uncharacterized protein LOC103491482 isoform X2 [Cucumis melo][more]
gi|694384553|ref|XP_009368167.1|8.8e-16370.20PREDICTED: uncharacterized protein LOC103957700 isoform X1 [Pyrus x bretschneide... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003824catalytic activity
GO:0003677DNA binding
Vocabulary: INTERPRO
TermDefinition
IPR0200465-3_exonucl_a-hlix_arch_N
IPR020045DNA_polI_H3TH
IPR008918HhH2
IPR0024215-3_exonuclease_N
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008152 metabolic process
biological_process GO:0015979 photosynthesis
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0006261 DNA-dependent DNA replication
cellular_component GO:0005575 cellular_component
cellular_component GO:0031361 integral component of thylakoid membrane
cellular_component GO:0042575 DNA polymerase complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0004527 exonuclease activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0009055 electron carrier activity
molecular_function GO:0020037 heme binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0003887 DNA-directed DNA polymerase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG10g12240.1Cp4.1LG10g12240.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR0024215'-3' exonuclease, N-terminalSMARTSM0047553exo3coord: 121..409
score: 6.5E
IPR008918Helix-hairpin-helix motif, class 2SMARTSM00279HhH_4coord: 317..352
score: 1.1
IPR0200455'-3' exonuclease, C-terminal domainPFAMPF013675_3_exonuccoord: 316..411
score: 2.2
IPR0200455'-3' exonuclease, C-terminal domainunknownSSF478075' to 3' exonuclease, C-terminal subdomaincoord: 315..404
score: 2.54
IPR0200465'-3' exonuclease, alpha-helical arch, N-terminalPFAMPF027395_3_exonuc_Ncoord: 122..313
score: 2.3
NoneNo IPR availableGENE3DG3DSA:1.10.150.20coord: 318..391
score: 1.4
NoneNo IPR availablePANTHERPTHR10133DNA POLYMERASE Icoord: 92..187
score: 2.8E-226coord: 205..450
score: 2.8E
NoneNo IPR availablePANTHERPTHR10133:SF225'-3' EXONUCLEASE FAMILY PROTEINcoord: 205..450
score: 2.8E-226coord: 92..187
score: 2.8E