CmaCh03G000030 (gene) Cucurbita maxima (Rimu)

Name: CmaCh03G000030
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Flap endonuclease Xni
Location: Cma_Chr03:83189..111562 (-)
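
The genomic span implied by the location above can be checked with a short calculation. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this), and `feature_length` is a hypothetical helper name, not part of any database tool.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


# Coordinates taken from the Location field; the (-) strand does not
# change the length, only the orientation of the gene on Cma_Chr03.
gene_length = feature_length(83189, 111562)
print(gene_length)  # prints 28374
```

Under a 0-based, half-open convention the same numbers would give 28373 bp, so the convention should be confirmed before comparing against the sequence length.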
The following sequences are available for this feature:

Gene sequence (with intron)

CAGGGTGGGCGAAGATCGTAAACAAGGTTATATCCGAATCGGGCAGCGGTCGAGGAGGCGCTTTGGTGGTTCTCATGGCTTGCCACCATCTTCACACTGCGACTGCTAGTGCATCGCACATTTGCAGAAATTTTTTGGGATACATTTTCACCTCCAAATTTGCTTCTCCTCTTCGTTTCTCTTCTTCTTCTTCTTCTTTGAGGATACAGTCTCCGGGTCACCATTGTTTTCCGTCTTTCTCCGTTCTGCTATCCCCGAAGGTAACTCAGGTTCTGCTTATTGGTACTTATTGTTCTGAATTGATTCTAGCATATTTACCGAGAAATTTAGTTGAAAAAAGTAGGAGACTCATTAATGGCGGAGTGTTATTGTGTCGATGTGAAATGGGCGAGATATATTATGGTAGCAAGTGTAATTCGATCAATCAAAGAGCAGAATGTGGTTGATGGCTCTTATTGCAATAGTTTTTCTTGTCATCTCGACATGAGAAATGGATACTATTTAGATCTTAGTTGTGATCATGGCGTCTAATTTTAGCGTTTGGTTAAATTCAAAGCAATATACAACTTTTTGCCTCTGTTTTCTGCTGCAGTTTTGACTTCTTCTAAGGATTTTTCTCGGTTTCTCAGGGTTACTGTAGTTCATCTAGAAGTGTAAATGCTGCGATTAACGTAGATAGCAATGCAACTTATCATGGTAGTCCGGCATCTTCTAGTAGCCAGCAAATGCTGCAAGTCCAAGATTCATTATCGAATTCACCCACATGCAAAGAAGAAACTGAAATTGATAGTCCTTCAGATGCGAGGGTCATGCTCATAGATGGCACATCAATCATTTATAGAGCATACCACAAGCTTTTGGGTATGTTAGATTTGTCATTAGAATGTTTTAGTTTGACTTAGTTTGTTATCCTAACTTATATTTTCAATCCGTATTTCCACTGGAGATGGTTTATGTGCATAGAGGATAATTGATAGCAGAAAAGTGGCACACATAAACCCGTTGGTAAAGAAGTAGACTTTAAATTCTTAACATTTTTCTTCTTTAATAAGAAACGCGGACAAACTTCATTAATAGAGAAAAAGATGTACAAAAATGGAGACGAGATATCCCCAAATGCCAAGGGGATTATAAAAAGGATTTCCAATTAGCATTAATTTATATCAAATTAGAACTACAAACATGAGGGAGCAAAGAACGCCATTTTGGGGAAAACAATCTAACTTGCTCCCTACATTCATCACGAGAGGATTCACTGTCTTGAAAAGTCCTCTTGTTTCTTTCATACTAAAGTCTCCATAAAACCACTACCACTGAATTGAACCACAACGCTCTAGCTTTCCCCCCAAAAAGAAGTCTCATAAATGAGCTGAGAAAAAGCCCCCGAACACCCCTTGGGGAGCACCCAACTTCAGTTAAAACTTAGAAAAGACTACACCAACCAAAAACAACATACGTTTTTGCTATATTTACTTCTTTGATTTATTTGTTTGTTTGGATTTCTGTTTTGGAGCAGCAAAGCTGCATCATGGCCATTTATCACATGCGGATGGCAATGGAGATTGGGTGCTAACAATATTTACTGCCTTGTCACTTGTAAGTAACATTCTTGTTAGGTTCTTGTTTTCAAATGTTATTTTACCTGTCCTTTTTCCAACAATGATATACATTCTACTGATAGTGCCTCAAACCCTAATGCTATTCTAAATGTTCTCCTCATCTTTGTTGTTTTTTTTGCCATGAAGTAATTTCTGTTAAAATATTACAATAAAATATTGGAATATCAATTTTCAGCTTGTTGGGTATTGGCATAATGCATATGCCGTGTTTGACTAGCCATGGGAACAAAAGAGAAACATATGAGGGATCATCAACATGTAGGAGCTCGGGCATATCCAAATCATCCTCAATAATTATATCACAGGGTGCCTCTATGGTATGAAAGGGTGGTGGGTCGAGCTCGGGTTTAAAAGTTTTGGGGTATTGTCCGTGATAACTAG
TGAGAAAGACGCTGTATTAGAGGGAAGAGGGGGAGGCTATTCAAAGAAGGTGGATAGGTATGGTAGTGAGTCAATTGGAGAAAGGTGATGTCGTGGGGAGCCTTCATCAAATGAGTGCAAGGGGATGATAATATGTTTTCTTAGTCTAAAAGAGGTTTATCGGTGGCTGTTGAGAACTGATAAAGATGATGAAGGAAAAAGAAGTGATGAGGTGGGTTACAATTCCCTTCCTAGGCCCGTTTGACTTTGTCTTTCGGTTTCCAGTTCATTTCCTCTAACCTTACCATCTCTTCAATCTTACATTTCAACAAGACACTTGTTAGGCCTACCATCATTAGGCAATACTTTAGAAGCGGTAGATAAGGGAATGTACCCTCAACAAGTTAGTATATACGTTATCTTTATGTTATTTTTGTAGTTTGCTTTTGTGCATTGGGAGGAGGGAGAGTTTATCAGCCCTCTCGAATAGGCTACGGTATTTTGTATTGCCTTCGGGCTTGACTTCTTTTGTTTATCAATACAAGTAAGGTGAGACCTTACAACACGCTCCTCTCTTAACCCCTTAGTCTAAGAACCCTCCTTGTCTAACCATTCCAACAATTTGTTCCTCTCCAAAAGTTCTTGCTTTTTTATCCATATATCTCTAAAGGCCTTTATTCCAACCTTTCTAAAGCTTCCTTTAGTGAGCGGAGTTTTCCCATGAATTTGAAACCTTCCCACTTTCAGGAACATTTGTTGGCCACCAAGAAACTTAAAAAGGAGTGATAGGAGAGAAACATATTTTCATACCTTAGAGGACAGGGTCCCTACTTTGGCTCCCATTGATCCAGAGAAATCGGCCAATGACTCGATTTGTCCCTGGGACCTAGAGATTTTTTGGCACCAAAGAAAGAATCCATCCAAACTCTAGATAACAAGGCTCTGGACCACCCTCAAAAGACAGTTCTATTTTCTCCGAAGTGCAACTATTTCCTTCCGTAGATTTCATCCTTCCTTCACCAATTTGACTAAAGAGGGTAGCTCCATAGATGTCCTATGTTGTTTGACCCTTCGAGAAACTTGAGATTTACTTAAAAGAATTGTAGCAAGTGTCATTAGAAATCTTCCAACCCTTGCACTCAATACTTTTGGGGCACCCATCAAGGGGTTCAAAAGCAATCATACATCTGCCTCTTCGATAACTCATGTTAGACATTTGAACAAATATCTATTCGAAGAACTTCTTTGCTTTTGATCTCACGATGATGAAAGAAAAAAATCGCTTACCCACATGAATATCCCCAAATGAACATTCACATTTGAAACTTTTGATATGAACCAAGACATCTCAATCATGCAGTATATACTTCAAGGAAGATGTGAAAAAAATGATCAAATTACCACAAGATAAAATCAAGTTGCAATTCATGACATATTAAAGAAGAATCAAGTTTCAATTGTTCTTGTATGATTACAAATTTTGGGTGGATGATAATTAAGAGTTATGGTGTTTTATTAATATATTTTAAGGCACTTTATTTACTTTAAGGTTACATTTATATAGTGGCTAATTATGGTCATAAAATATACATCAATAAGGTTGTTAGATCAAATTCTCCAGAGTTGCAATCATTTGAAATGAAATCAAATTGATTTGGGGGTTTCTGCATTTGTGAAATAAAGGTTCGCATATCCAGGCGCCTTATGAGGGTTCTTAGATTCAATTGACTAGAGTTTCCATATCATTTGGTATCAAAGCTTTAGGCTAAGAACAAATATATGTTTCATTTGCGTTTCTTATCAATTTGGTTTTTATTTATTTTTATTTTTTTAGCCCATAATCGATTTAGAGTTTAAAAACTAATCTTAGCTCTACTGAAAAATTTGATAGGTGAGAATGAGAGCTTCATTTATGTAAATGAGTGTAAACAGAGTGCCCTAAAATTTCTTGAGTGTATACATGTGAGAGAGTGCTGTGAGGTCTTTTGTATTTTCTGAATTTAATTTTGCAACGAAGTTTC
TCAAATAGATTTGAGGACAAATCCTCTTGAAGAAGTGAACAATGATATGAACCAATGCATCTCAATCATGCAATGCAATATGTACTTCAAGGAAGATGTTAAAAAAATGTTGTTCATATTATCACAAGATAAAATCAAGTTGCAATTCACGACACATTAAAGAAGAATCAAGCTTTAATTATTCTTGTATGGTTACAAATTTTGGGTGGATGATAAATTAGAGTTATGATATTTTATTAATACATTTTAAGACACTTTATTTACCTTAATGTTACATTTACATAGTGGCTAATTATGGTCATAAAGTATGAATGTTACAACTCTTGAAATGCCTCATGGAAGGTGAATGGTTTTATTTTGAGACTATATAATGCCATGCATCTTGTCTTTGTAAAGTAGATTTGGAAAATGTAATAAAAATAGCATTTGTGTTTCTTTTAGCCAAAAGCTAAGTTTCTTTGTAATTTTATTTAGTTTATGTTGAATATTTAGAATCATTCAAGCTTGACATGATCAATCTTGCTTGTGGAGTAATTCAAATCTCAAATAAGTGTTCTTGCCTTAAGATATTCGATCAACAAGGTAATTTGGATCTTACTTTCCCTTGGAGTGGTTCTTGGATATTTAGATCAAAAAGGTTTTACTCTTCAATAAGGTTGTTAGATCAAATTCCCAAGAGTTGCAATCATTTGGAATGAAATCAAATTGATTTGGGTGTTTTTGCATTTGTGAATTAAGGGTTTGGATGTCCAGGCACCTTACGAGGGTTCTTAGACTCAACTAGCTAGGGTTTCCATATCATACAATATTGTTCAATTAGCACCCTCGTCAATTCATCTTAGAGGTTGAACACATCTCTTCCTTTCAATCTTCACTGAACATGTATACACCTCACCAGCCCTAACCTCTTCCCGCTGGCTCTCATCCTTCAGTCCACTAACACATTATGAAAGAGATGAGGAGAAGAAAGAGAGTGTGTGTGTGTGCGCGCATGTCAATCCAAAAGTGTTTTCAACCCATGGTTATATTGACAAAAGTCTTTTGTAATTATAGTGTAATTGTACATCCTTCACTTGTGCTTCTCAATGTCTTGCTGAAACATTTTCTGTTTCTAAGCAAAAAAAGACGGGTGGTTTTACAAATTTTGAAAGATGGAGATATTTCTTGTGATAATTGATATATCTATTATATATCTTTTCTGCAAGAATTTGTGATTTTTCTGCAGGTAAAGTTAAAGTTTTCTGTTTAGTTTAGTGGTTCATGTGCTTACTCATTATTATTATTCTCTTGCAGATTGTTGATGTTCTGGAGTTTATGCCTTCTCATGTGGCGGTAGCTCTTCTTTTTTCCTCACTGCTCTCTTTAATTATGTTATCTGCAAACATTAACCAATTAAGATTCTTTAAATATAAGTTTTTTGGTATTTGAAAATTGAGTTTGATTTTCATTTCATTCATAATTTTTTTAAAATTTTGTTTCATCAATCCTTCATTTTTTTTTTTACTTCTAATTATCTGTTAGCTTCTTTGGCTAAAAATGAAGTGATTTTTTCTAAAATTTTCTTAGGATGATGTGACAATATTTAACCAAAACTCCAGAATTTTTTATGTGGAAAAAAATATATAACAATATGCTCTCCCGTACATCATCCGAACATCCACTCTACCCGCACCCGCACCCGCACCCGCACCGTCGGATTCTCTTGGTTTCCTTTGACTCTAGGCCTCTACCTCTCCCACTCTGACTTCATATCGGATTTGTTTTTTTTTTCTTTCCAGATTTGGGGTTGAGGATGGGCACTGGTAGGGAGGGAAAAATAAAGGAGGTGGTGGGGAGGGAGAAATAAAAGAGGTGGTTGGTTGAAGAGAGTGTTTTAAAATATTTTTTTGGTCACATCAACAAAATAAAATTATAAAATAGCTAATTATTGTAAGGAAATATATACAGAACCATGGCATCTTTATTGTCAAGGAAACTAACAAAAAATCAACAACAGGTAC
CAAAATGGAGCATCATTGAAACCTAAATGGCAAAAACGTAAGATTTTGGAACCTAGAGACAAAATGGATATTAAACTCTAACCGCGGGGATCAAAAGAGCATTTATCCCTATAATATGTTAGTTAGACTAATAAGTGATGATGTCATTTGATTTGTTGATAATTAGACATGTACGTGTACAGAATCATTCATCCGTCTTTTTAGGATCATCTCTTATTTTTAGCATAACTAATCAATATCCAGACATTTACAGGATTTGACGTTTAACTTGTGCCTCCCATATTTATTGTCTTAACTATGAATGCAATTGCAATATTCACTTATACGTTCTCTATGTTGTGCAGGTTGTGTTTGATCATGATGGTAAGATCTAGGATTCGAAATTATCACATTACTGATTTCTACAAGAACGTGTATGTGCTTGTATTCCATTACTTTTCTTGGATCTGCAGCTTTCATAATTATCTACTTACATGTACTTTACCTGGTGATGGGAAGGATACCAATGTTCAGTGGGTGACAGTTCGGATACTAGTGTCTAGTGGATTATGGGTGATGGGAGAGATATTTTTTTTTTTTCTCAGAGGTCAGGGTGTTGGAGGGTAGACCCCTTTGCACTTGTTTCTCCCGTTTATATCACTTTTCCTCGATTAAACCACTCAGTGGTGGATATGTTGCCCTTTTCTCAGATATGTTCTTCGTACATGCTGGGATTATGATGTTTTTGTCTAACAGGAAAACGTCAGATTTTATCTCCATTCTCTCATTGATTGGGAAGTTTTGTTGTAGTCTGGGGAGAAAATATGTTTGTTTTTTGAGCCCTTGTTCTTCTGTTGGCTTCTCTTGCAAGTCTTTTTTTTGTGGCCTGGTCAATTGCTCTCTCTCGAGTGGTGTCTTTGCTTCACTGTGGAAGGTGAAAATGTTAAAGAAGGTTCAACTTTTTGGGTGGTAGGTTCTTTATGTATAAGTTGACACCTTGGATCAAGTTCCGAGAAGGTACAAATAGGTTAACGCACTGGATCAAGTTTTGAGAAGGTTGTGCAATTTGATTGGGTCGTTTTGTTGTTTACTTTGTAGGTAGGTGGATGAGGACCTTGACCATATAATCTGACGCTGTGACATTGCGCCCGAGATTTGGAGTTGTTTTTTTTAGGAGTTTGGCTTAAGCTATCTAGACATAACGCCTATAAAGAGATGCTTGAGGAGCTCTCTCTTTCCCTCATTTTTGTGAGAAAGGAAGGCTACTGTGGTAGGCTAGGGTGTGTGCGGTTTTGTGGGATATTGGGGGGGGGGGGGGGGGGGGNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGGAGAGGAATAGAGTCTTTAGAGAGTTCGAGAGACCTTGGAGAGGGATGTTTGGACCCTTATATGTGAGTTTTGATAGCTAAGCTGTTATGTAAATACCCGTTGGCTCTTATTTTACTTGATTTATTGAGAGGATATTTTGGAGATATATTAGTTAGAATATATCTATATTTTATTTATTTACAAGAATTAGTTAGATATATTATTTTTTTCTATTTTTAGGTATTAGTTAATAATTTGTATCTATTTAAACGTGGTAAACCTGAATGAAGATAATACACTTTTCCAGTTCTATTTTCTATTTCTATTTCTTAACATGGTATCAGAGCATCGATCGTAAATATCAAAATTTCCTTTATGGCGGGCTCAGCCAAATCCAGCTTTAAAATTTCGGATGTTGATTTAACACATCCTTATTATATGCATCACTCTGATCAGCCAGGATATTCACTTGTTCCAATCAAATTAAATGGAGCAAATTACCAATCTTGGAGTAAATCAGTTATGCATGCTCTTATTGCCAAGAAGAAAATTGGCTTCATTGATGGCACAATTGAGGAACCGCCCCAAGATGCAAATTCAACCGAATTCGAACTCTGGAATCAGTGCAACAGTATGATAATATCTTGGTTAACTCATTCTGTTGAAGCAGATATCCTAAAAGCAT
TATTCACGCCAAGACAGCTCATCAAGTGTGGGTTGATCTTCGCGATCAATTCTCACAAAAGAATGTTCCAGCAATTTTTCAAATACAAAACTCGATAGCAACGATGTCACAAGGAACCATGGCACTGTCAACATATTTCACCAAGCTCAAAGCACTCTGGGATGAACTGGAAGCGTACCGCACACCATTTACGTGTAATCAACGTCAAATACATATTGATCAACGCGAAGAAGACAAGTTGATGCAATTGCTCATGGGGCTCAACGAGTCTTATAAAACGGTGAGATCTAACATATTGATGATGTCTCCATTACCTAATGTGAGGCAAGCTTATTCATTACTTGTACAAGAAGAGATGCAGCGACAGGTAACTTCTGAACCTGCTGAGAATTTCTCGATTGCATCAGCAGTGCAGAAGAAAACAACATATTCAAAATTCGCCAAGGACAAATCATGTGAACACTGCAATAAAAGTGGTCATACAATCAATGAGTGTCGAATTCTTAAGTTTCATTGTAAGTTTTGTGATAAAAGGGGACATACAGAAGATCGGTGTCGACAGAAAAATAATTCTACAAGGACAAGACAAGTCAATCAACACAATCATCGTGGATACCGATCATCTGCAAATATGGCCGATGTTTCACAGTTGAATACAGGAGAACAGTCACCTAATTCCATTCCAAATTTTTCTTCTGAGCAATTACGACAGATAGCACAAGCCTTATCTGCAATCAATCAAAACCCTTCTGGTAATTCTGACAATCACATCAATGTTGCAGGTTTGTTTCCCATATCTACATTATCTGTTAACTCTGCGAGTTCTAATTCATGGATTCTCGATAGTGGAGCTACGGATCATATAGTATCAAAATCTTCTGTTATGACTGATCCAAAGGCTGCCATCATGTCTACAATAAATTTGCCTAATGGAGAGACAGCACGTGTGTCACATACTGACAATATTTCCCTTAGCCCTAACCTTAAGTTAAACAACGTTTTATGTGTGCCTTCATTCAATTTAAACCTAATGTCGATCAACAAACTTACCAATAACTTGAAATGTTATGTCACCTTCTATCCTGATTCTTGTGTTATGCAGGACTTGGCTACGGGGAAGATGATTGGCTCGGGTAAACAATTTGGAGGTCTCTATCATATTTCTTCATCTCCAATCAAATCTTCAGCTCATCAAGTATCTCAGTCATCTGATTTGTGGCATTTACGTCTAGGTCATTCTTCTTTTTCTCGTTTTAAATTTCTAGCTAATCAATTGCATCTTAATAATGCGAGTTATTCTCATAATTGTAGTATCTGCCCGTTAGCAAAACAAACTAGGTTGTCTTTCCCAAGAAGTTCAATAACAACCCATTCTGCCTTTGATCTGATACATTGTGATGTTTGGGGACCACATAAAATTCCTACCCATTCTGGTTTGCGTTTTTTTCTCACTATTGTTGATGATTTTACTCGATGTACTTGGGTTTTTCTAATGCAACATAAGTCAGAAGTACATCATTTGTTAATGAACTTTGTTAAATTTGTTCAAACTCAATTTCATACTACTATCAAGATAGTTCGATCAGACAATGGGACTGAGTTCCTATCTTTGCAACCATTCTTTACTTCTTGTGGTATTGAATTTCAGCGCACTTGTGTCTATACTCCACAACAAAATGGAGTCGTAGAACGTAAGCATCGCCATATCCTAAATGTAGCTAGGTCTCTTCTTTTTCAGTCACAGGTTCCACTTAATTTTTGGGGAGAGTGCGTTTTAACGGCTGTTTATCTTATAAATAGAACGCCATCACCAGTGTTATCTAACAAGACACCCTTTGAAGCACTCTACAAACGATCACCTACATTTCATCATCTTAAAGTTTTTGGTTGTAAATGTTATGCAACTATAGTACATCCTAAGCAAAAATTTGAACCTCGGGCAATTCCTTGTGTTTTCGTAGGATATCCTTGTGGTCATAAAGGTTACAAATTGTATGACA
TGCAATCTCACAAATTCTTTATCAGCCGTGATGTTAAATTTTGTGAAGATGATTTTCCTTTTTCATCAGCTTCACAAACTTCGACATTAGCTCCTTCGACTCCTGTTGTTCCACTTCATGATCCATCCTACTTAAGCACCCATTCTCCACCTTCCATTCCTCCTTCACCTCCTATCCCTTCACCTACTACTTCGTCGTCTCCTCCACCTTCTCCAGATTTGCCCACTGATTCCAATCCTATCCCGCCTGATACATCAGCTCCACTCCGACGTTCTACTCGCACTAAACAGCCTCCAGCTTGGCATAAGAATTATGAGATGTCTTCTGGAGCCAATCATTTAACCTCTAGCTCAAGTCCCGGCACTGGCACCAGGTATCCCCTTCATCATTACCTTTCATTCTCTCGTTTTTCTCCTACTCAACGTGCTTTTCTAGCTCTTATTACATCCCAGACAGAACCTAAAACCTATGACGAGGCAGTTGGCGACCCTTTATGGCAGCAGGCTATGAATGATGAAATTGCAGCTTTGGAACGTAATCATACATGGTCTCTCGTTCCTCTACCACCTGGTCATAAAGCTATTGGTTGTCGCTGGGTGTACAAAATTAAATACAACTCTGATGGTTCTGTTGAACGTTATAAAGCTCGACTTGTAGCAAAGGGATACACTCAGGTTGAAGGTATTGATTACACAGAAACATTTTCCCCTACAGCAAAACTTACTACACTTCATTGCTTACTCACTGTTGCTGCTGCTCGAAAATGGTTCACCCATCAGTTGGATGTTCAAAATGCCTTTCTCCATGGTAATCTAGACGAGGAAGTTTATATGTCTTTACCACCAGGTCTTCGCCGACAGGGGGAGAATACAGTATGTCGGCTCCATAAATCTCTTTATGGATTAAAATAGGCTTCTCGCAATTGGTTCTCCATATTTTCTACAACTATACAAAATGCAGGCTACACTCAGTCCAAAGCAGACTACTCTTTGTTTACCAAGAGTAAAGGTACTTCTTTCACTGCAGTTCTAATCTATGTTGATGATATTCTGTTGACAGACAATGATCTCAAAGAAATGCAATATCTCAAGACTAGTTTACTCCAGAAATTTCTTATCAAAGATTTAGGAAATTTGAAATATTTTCTAGGCATTGAATTTTCTCGGTCTAGAAAAGGAATTTTTATGTCTCAAAGGAAGTATGCTCTAGACATCCTTCAAGACACAGGCCTTACACGAGCACGTCCAGACAAATTTCCTATGGAGCAAAATCTGAAACTTTCTTTAACTGAAGGAGAGAAATTGAACGATCCAAGTAAATACAGACGGTTGATTGGCAGATTAATATATTTGACTGTCACTAGGCCTGACATAGCTTATTCAGTTCGTATGCTTAGCCAATTTATGCATGAACCAAGAAAACCACATTGGGAGGCAGCTCTTCGAGTTCTGAGGTACATCAAAGGCACTCCTGGTCAAGGACTCCTACTGCCATCTGAAAACAATTTAAGATTACAAGCATATTGCGATTCTGACTGGGGTGGTTGTCGAACTTCCAGACGGTCTATTTCTGGATTCTGCATTTTCCTTGGAAATTCAATTATTTCTTGGAAGTCTAAAAAGCAGACTAATGTGTCCAGATCATCAGCAGAAGCCGAGTATCGAGCTATGACAAATACTTGTTTAGAGTTAACTTGGTTAAGATACATTCTTCAAGACTTGAATGTTCCACTGTCCGAACCAGCATTATTATATTGTGATAATCAAGCAGCATTACATATAGCAGCCAATCCAGTTTTTCACGAACGTACGAAACACATTGAAATAGATTGTCATATAGTTCGAGAAAAGTTACAAGCTGGAATCATCAAACCATGTTATATTTCGACCAAAATGCAATTGGCAGATGTTTTTACTAAAGCTCTGGGAAGACAACAATTTGACCTTTTGAAGGACAAGTTGGGTGTGATCGACATACACTCTCCAACTTGAGGGGGAGTA
TTGAGAGGATATTTTGGAGATATATTAGTTAGAATATATCTGAGATATTTAGGGAGTTGTTAGTATAAATTTGATTAATCGTATATTTTATTTATTTACAAGATTTAGTTAGATATATTATTTTTTTCTATTTTTAGGTATTAGTTAATAATTTGTATCTATTTAAACGTGGTAAACCTGAATGAAGATAATACACTTTTCCAGTTCTATTTTCTATTTCTATTTCTTAACATGATTGGAGGCCATTTCTTTAGTTTGCCTCCATCTTTTGTGGGGGCCATTTTTATGGTGGCCTTTATTTTTGTATGTCGTCGTATTTTTTCTTCTTTATTTTTTCTCAAAGAAAGTTTGGTTGTTTATTAAAAAAGAAAACATACGAGAAAATGTTTGGAATATTATTAAGTGGACTTTAGAAGTATTATTAGGATAATACATATATTTTTTCTAAACAAGAAATATAACTTTTTATTGAATAAACGAAAGAGATTGAGGCTCAAGAAACATTTTTTTCATTGAACAGATGAAAAGAAACTTCCAAAACCTTAAATTTTAGATTGTAGTGCACGCTTATTTTGGAGTTGCACTACTAAGGCGGACTTTCATTGAGAAGAAAAGAAGAATACAGAAGTGTAAGCTAGTGAAGAGAATGCTAACTAAATACAGAACATTATAGAAATCCAATCCAAAGGGATACACTAAACTTAGTCAAGGCCTAAACTCTTTCAAAAACTCTCCACGCCCTAAAAACTCTTCTCTCTAGCCAATATTTCATAAGACTGCAAACAAATGGCCTAAAGGAGTATGAAATAGCACCTCCTTCTATCATAAAAAAACGTACTTTTAGGACTGATCAACTAAAATGAAGACTGCTTCAAAGTTTCCAACCCTTGGGTGGTGGATGATAATTGATAAGATCAAGATTGAGGATTCTTGTTGTAGAGGGGATGCAAGCATGAAGGCAATGATGTCAGTCTAACGTGTGATCTTGTAACTGCTCCACCACTTGGGAGGCTAAATAGGAGACTGGGTATTAAAATGGTGTGAAGTTAGTGTAGAGTGAGAATCTATTGGTGGATTTTCCTTTCTACTGATTGTAAATATTGATCGGAAATATTTATTATCCATATTTATGTATAATTCCTTCATATTTTTTTAGACGATCTCACTAAATATAAGAGGACCTTGTATTACCTTTACAAATATTGAAGAAATAAAATATTCTTTTCTCCATAAAAATCAGAACTTATTTATGTGAGAGTCAAGCTTTTGACCATCAAGAATTTTCTTTTTCATCCTTCATTCTTTAATTGTTACTCAATGCTGACAGAAATAAAATAGTACATGGAGGTGCCTTTAGTCATTTAAATTGTACTTTGGATGTTGGATAAAAATGATGTGACAGAATAAAAAAAGAGTTGAATAGTTCAGAAAGATGACTAAATTCCAACGCATTTTGATCAATAGACAAATTAGTTGGTAGTTATAAGTGATTTCTATTTAATTTCAAGGCTGGCCTTGTACATTTTGATGACATAGTAATGAGTACTTGGCGATTTTATACTTTCTGTAGTAAAGTTTCAATTGGTATCTGTCTATCAAAATGGGAGTTTCTTGGCATTTAGTGTCAATTCCCTTCCCAATTGTGGTGGGTAGCGTTGAGGAATTTCACAATTCCAAGAACAAACTAATCCTTTTGGTAAAAACAAACCTTTTCTTTCAGGACATGCATATGGTCATACTTGTTATTCATCCAATGAAAATTTCATGGCAAAAGGTATGTTCTTCTTCTTTGAACGATAGAGTTGATAGTGACTTTTTTTTTTTCTCTTTTAAAACAAGAGTAAAAAGATATTAACATGTTAAAATTAAGGAAATTAATATTTGTTGGTTACACATTTTAATCTAGACGACTTTTATTTGAATCCTATTCTAACTGAAGTATTTTGGTGTGTATAATGGCTTTTGTATGGGAGTAATCTAAGTTGTTTTTCGTTCATTCT
TTGAAAACAAAAACTAGACTTTTTTTTTTCATTAATTTTTTTAAAGCTTCAATAAAATTGTCGAGTTTTGGCTTTTCTGAGAACTATTTATAGAGTTTATTTTGTTCACCCCTCATCATGTAATTTTGTGATCTTATATTATAACATTTTTTTAAAAAGGAAACTTGTAGTCTTGAGATTTCAAAAGTTTCTTTTTAGTATTGCTGAAGCGGTGCCTTCTTCTTATTGGGTTAAAAAAGACCAAGAAGTGGTGAGTTGAATTTCAACGAGCTAATTGTTGTTCCTAGCTTATTTGCTCACAACAGTTGGAAGGAAATTAAGCATACTTTGGAAGTTCATTTAAGGACTCTGTATTGATTAATCCCTTCATGGTTTAGAAGACATTACTAAATTTCAAGGGTGAAAACTCCTTCAAGGCTCTAGAGTCTGAAGGAAAATGGAGACCATAGTGATTTTCATTTGCTTATTGAAAAATGGTCTTGAGAAAAGAACAGTCATCTGAATTTTGTTGAAGGTTATGGTGGATGGATCTCCATCGAAAACTTGCCTCTAACCTGTTGGGATAATAGCATATTCGAAGCTACGACCATAACTTTGGTGGTCTTGAGAGTATTTATTCACAAAATCTTTATATGTTAGACTGCACAAAGGCTCATATAGAAGTCAAGAGGGATTTGTGTGGTTTTCTTCTAGCTATAATAGAATTCAAGGATAAAAAAGAGGGAATACGTTTCTTAGATTTGGTGATGTTACTGTCATAGATCCTCAAAATATCATCCATTAACGGAGCTCTCTTCTTGAAAGACTTCTCCAATTCTTTGGACTTGCACCGTTTACATTAAATCATGGAAGATGAAACATTGACATCGTTTACACACTCCTTCCTCCCTCTTTAGTCCCCAGCAAATTTCAATCTCTTATTGAAGCTTGTGGTCTTGAATTATAGAAAATTCCTCCAATATCTTAAAGATTAGTTTTTAATTAAGTTTCAAGAGAGCTCTTTTTGGTCTTTACTTTCTCTTTAAGTAGTTTGAGTTTTGGGGCTTTTGTTCTTAGTATCTTAGTTGGTTTGTTAGATCCTTTTTTGAGGACAATTTTTGCCAACCTTTTCTTATATTGGATCTTTTGGTTACTTGTTGTTGAGTTTTCTTTAGAGCTGAGCTTTGGTTTGATGGGTCCTTTTTGTTTCTGGGTTGTTCTCTTCGTTTGAATCTACTTTAAGATATTGTTTTGTTTTTCTGGTTCACCCTTTGAGATTTTGTATCCTTTGAACATTTTCATTTTCATTATATAAAATAAGTTGTTTCTGGTTAATAACAAAGAATACATAGAATTTTCGTCGATAATTAAAAATATATAGTAACTGCTCTTAGTTGATGAAAAGTTACTGTGTATCCTCAAAAGTTTGTATCTTTGAATTTTTTTTTTCTTTTCCACTAATCAACGAAAAGTTCATATCTAGTTAAAAAAATAAAATATTTCAGTCGAATAAAGTAAATAACTTTCATTGGAAAAACATATATTGCCCAACCAAAAAATACAAATACAACATACCCGATTAGCCAAAACCAAATGATGAATAAAGCCTATACGTAAACCATAAACAACCATTACATATTCTACCTAAACCTAACAACAACAGAATGCTTGGATACATCTCCAAGGTGCTTGAAAAAGGAAAAAATGAGCTATACAAAGATGCTAAACAACCATCAAAACTATAGAAATGAATTCCCTTTCCATATCGATTTCTATGAAACATGAGTTGTTGGAAAATAGCTTTTATCTTCACAGCTCCATAGACGCTTCAACACAAAGCCCAAAAGAAAAATCATCTTAAACTATCTGAATCAAGAAAGCCCATGCAAACAAAGGGAAAAGAACCCAAATTAGATGAAGGCAAGGGAAGGGTCGATATTGAATTCTAGTATTAAACCTCGATCTTCATACATGTGGTATGCCTTGCAAAAGCTAGTGTTAGAATGGGCATGGGTAGCACA
CTTGAAGATAGAGGGAGGCCAGGCGGGAGATGTGCAGGAAGTTGGAGTGAGTTGTCATGGACGTACTTTCTTGCCATTGAGCTTGCTTTGATGATAACTTCAATTAGTTGAAGCAATGGCATATCAAGCTATTGTTAGTCTTGACCACAATCTTGGCCCCTAGAGATATCGTCCTTCAGTCCCCTAGACAGTGGACCACAATTAACACCTCTTTCTCAAAAGTGACATACCTTTTCTCAGCGTCATTCAATTTTTGACATTCGTTTGTGATGAGATATGTCTTGAGGAGTATGCCACCAAAGACAAAATCTGATGGATCTCCTCTTCAAAAAAATTTGGCTAGGTTGGCAATCCTAAGCACCTTCTGTGAAAATCCATCGAAAGTGGCTTGATATTTTAGAATCCAACTCCAATTAGGGTTCTTCTTTAGCATCTCTGTCAAGGCCCCCCCCTGTTTTGAACATTCTTTAGTGAACAAGCAGTAGTAATTGATAATCTAGGAAAGAGTGCAGTTTTGTGATGCAAGTTAGCACCTTTCAGCCATGGATGAGTTCAATTTACCAATTTGCCCATATTCGATCGGGTGGTCCAGAAAGCTGATCACTCTTATGCAAAGGAACATTTCCCCCTTTCCTCATACAACTGATTTTTTCTCAACTTCTCAAAGATGAGTTATAGGTGAATATGACAATCTTCTATAGGCGAGCTATGGACACTATGTCTTTCAGTTAGACCAGCAAACTTCACAAGATACTCGTGAAACACTTGGTTCATTATGTGTATAAGGTGGCCGATGCATTTTTAAAGCCATAAGGAATGTAAAGAAACTCACAAGCCCCGTACTACATGAGACGCGTTGTCTTGCAATTAGTCTCCTTCACCATACGGATTTGGTAGTAATTTACTCTCATGTCCAGAGGTTCAAGAAATACTTTGCCCCATGATGTTGGTCTCATAAGTCATTGATGAACTTTTTTAGAGTATGGTGTTCTATGCATGGGCCAACACTTCCATTCTACTTCCTTTGGAAGAGGACTAGGGCTCCACTATGACCTTTTGTCGATTGGATGAACCGAACATTCAATAATTTATCGAGTTGTTTTTGAGTTCAACTAACTTAGGAAGGACTATGCAGTACATGTTCTTCGTGGGTAGTTTCTCCCCTGGCAACAACTCAATCTCATGATCTATTTGTCTATGTGGAGATGCAATCTTTGTTAGGCTATCAAGCATGCACCGTAATATTCTTCTAACACTTGTTGGCTTTCTTCAAGGAAAATCTTTAGAACTCTCATCTTTAATCACGAGAATGGCCGTGAATGTTTGTTTTTCATGAGCAAACCTTATGTTTTCTCAACTCAAATTTGAGTTGTTGGGTCTTCAGAGAGGATTCAATATTTAATGGTCTTCAAACATTATGTGTTTCTGATATGCCCATCAGGCCAAATTCTTATGGTTTAATGCTTACCAGGACATTGTTTCATATATCTAGTTGGAAAGAAAAAAATGGGGTTTTGAAGAGAAGTATACAGTTCGGTTGGATTGTTTTGAGCTCATCGAGATAGTCGATGGTATTCTGTCTAAATTCTTTCATGGTCATTCTTCTAGTGACATTGTAGTAATAATTGNAGAACTTATAGGATTCCAGAATTCTTGGGGTTTTTTTTTTTTTTTTTTTTTTTCCTTTTTGTATTTATTTTCTGCAATTTACATTTATGCTTTAATTTTTTTTTTTTTTGAGTTTACGCTTTGTAGTCTTTGAATTCTTGTACTCTCTTATAGCTTGAGAAGGTTTTGAAGGCTTTCAAATAAGAAAATACATTTAAGTAAGCTCTGTTTTTTTTTTTTTCATGGCACAACAAATTAAGTGGAGAATTTAGTTATTTTTTGTTATTTAAGAAACCATATTTACATTTAAATCTACTTGTATTAATGGTCTATTAGTTCAAACTAGTTATCAACTAAATTGCACTATCTTCTGTGTCTATGTAGGATCCACT
TTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCTACACCTGATACCATAGTCCAGGGGCTCCAATACTTAAAGGCATCCATAAAGTCCATGTCCGTAAAGGTGATTGAGGTAATAAATTGTTGGAATTTCTACTATTTATGAATGTTTGAGTGTAATTATTGATCATCAAATATTTGATCTTTATTGTCTTAATTATTTTCCATATTCGTATTAGGTTTTTTCTATATTTGTAATAGGTCTCTCCTATACAAGGAAACGTAAATTCCACATCATACACAATTAAAATGTTTCTTAGTCCCTCAATTAAGATAAATAAATGAATTTTTCGTAGCTTTTTTAGTTAGCCTACCAGATTTTTAGAGAGAGAGGGATGAAGAGACTAGAGAGAAAAAATATATGTTTTTTTTAGATTACTTCCAGATTATTTGCTTTGCATATAAGGATGTAGTAGTTACATTCAGCTTTTTACTAGCAGCTCGATGAAATTTGTGTACTTCTTTCCTACAATATGTACAATCAATAACCTGTTCTAATCTTCTTCTAGAATTAGTCATGTCTTTTGTTTGTTTGATTATTTAGCCTGTTATTCTCTGGTGGTTGGCTTTAATTAGACTCAATGATAGGTACCTGGAGTGGAGGCTGATGATGTGATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTGTAAGGTGACGCACTGAATAGATATTTCAGAATGACATAGGGGTCCTTTTTTAATTTATTTTGCATGCTTGCCAAGATTATGTTCTTTTATGGGGATATATTTAATCATCCACTACTTGATCTTATTGAGTGTCAAACAGGTTCGTGTTGTCTCCCCTGACAAAGACTTCTTCCAGATTATATCTCCTTCATTACGTCTTTTACGAATAGCTCCACGTGGGTTTGAGTAAGAACATATGGATTACTTGCTTACTCATTAAAAATGAATTCTCTAACTTAATAAAAGAAATAGTTTTTTCCTGATAAATTTGCAGTCATTTTCTCTATGGACTCTGAGGGTAGAATAGTTATATCTTATACTTTATGCCATTAATAATGTTCTCCTTATCGTAGCTGAGGGATTGGCATATGATTGTTGTCCAAAATAGAACACCTGCATATGAAATTTTTTTCTCCGTATTAAAGGACCACAATTAGGCATGCATTTTTTTATATCTGGCCTTTTTTGTTCAATATTTAGACTAGAAAACTAAAAAATATCTGTTTTTCTTTATTATTACTTTTTAATTTTTACTGGGAAGATGATCGATAATCTAATCCGTTCTTTTGAAATTTTATTATTGAACCTGCATAGGATCATAGAGTTTATTCTTGAAAAATGAGATGTAGGTGGGTTCCACGAGAGTCTGGGGAAGGAAATCCCTTCATGTGAGGGTTGAACAAGAGCAGTTTGTTTTTCGAACTAGTGAATTGGGAAAGGTGGAGATTGAGGAAATTTACAAAGAAAATATGTTAAATATTTCGTTGAACCTTAGGTTGTAATTGGGTTCAGTTACTTGGGTAAGGATCTATTTTTATGTTGGTGGGTTGCATCTGAAGCATTAATCTAGATTCAGCTGGTTTCAAACCCTGGGAAAATTGGTGTGTCATAGGTGGTGGAGTGTCCTTAGTGGACAAGGTCTTCGATTATCATCCACGAAAGAGAAGAAAGATAAGGTAGATACTGATTTTACCACGAATCTACAGTGCTCCAAATCAGGGGGACAATCGTTCTGCGAATTCTTCCTCCTATTCTGCATCTTCATTCACTTATTTGGTGAAGGAAGGAAGCAATGGATTGGAAGAATATATGGTTACTTAGAGGAGGAGATGTAGTTCTCTTTTGAGGCTCCTAAGTGTCCTGGTAGTATGGAGCATGCAGCTGATCCGTGGTTGATGTTCAAATAGGAACTTGGAAGGAAGTTCGAGGAAGTACTAAAAGTGATGCAATTTTGTGGCCACATGGCTTCTTATTTTTGGCG
AGGATCATTGTCTTAGTTTGAAACTTTTAGAGTTAGAGTTCGGCGGGGGAGATCTTGTTTATTATATGGTGAGTCCAGGGGAGATCTTGTGTATTATATGGTGGTGAGTCCAGGGGAAATCTTGTGTATTATATGCTCTGATTCCAGGGGAGATCTTGTGTACTATATGGTGAGTCCGTTATCGTGGAACACTGAAACTATAGCTTAGGGTAGGTGAATGGTCTTCAATAGAGGTTGGGTGAAGGTTCTTGTGTGGAAGACTGGATGGATGGAGGATGTGATTAGCTTGGCACAGGGTGTAAGAGTAGGTGTATCTTCTTCATCTTTGGGCTTTCAACGAAGTTTTCTTGTTGATCTTTTCTCCTGGGCTATAGCATGGTGCTGCTTTGGAGCTTAGCAGGAGTGTTTTTTGCAGCTATGGCTCGTTAAGTTTCCAAGATCTTCTTCGACATTTTTGTTCTTCAATATTTTATATATTTCTTGGCACTTATATTTGGAGACATTTACTCCCTCTGTGGGTTTGATCCTAACATTTCACTATCTTTTCATCAATCAATGAAACTATATCTTTTCATCAATCAATGAAACTAAGTTTGTTTTTTGTTTTAAAAAAATTAAAAAAATGAAGATATATACAGAATTATTGAAAAAGAGAGTGCGTAGAGATGGGTGAATTTGATCCTAGCATTTCACTATCTTTTCATCAATCAATGAAAAAAAGTTTAAAAAATGAAAAGGAAGTTATATATAATTATTGAAAGAGAGATTACTTAGAGAGTGGTGAATGTTTAAAAAGATAAAGAAGAGACTCCGGGAGAAGGGATGCTTAGAGAGAGCGAGGATGTTTAAATCATATGGTTCAAATTTTTTTGGCTTTTTTTGTTTTTTTATTTTTTCAGTGTGGCTTCAATTTTTGGATTTGATTTTATGGTACCTTCTTTGAAACTTCGACATTTACTAAGCTTCTCTGCCCATTTCTAGGATGGTTTCTTTTGGGCTGGAGGATTTTGCCGAAAAATATGGAGTTCTGGAACCTTCTCAGTTCGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGGTGATTTATGAATCTCTGTCTAAAAACCACCATAACTTTGTTTTGTACCAATTACAACGCAATTGCAACTTTGATGATTTGTGAGTGTCTTGGCGCCACACTTGGTGTTCATTGAACTCGTTTGGACTTGACTTTGAACTGAAGTACTTTTCCATGCTTCAGGTCTAATCCACGGGACAAAAATTACTCTAGGGGCAAAAATATCATTGTAACCGAATGTAATATTGTTACCAACATAAACATGCTGAATTTGGAAATTTTAGCAGAAGAAACTAAGGTCCTGTTTCCTTATTAATGAGTGTTTTTCTTTTTGCTGTTATTTATTAGGAATTTTCTTGAATCAAAATTAAAACATATCATACTAATTCCTAGAAGCTGTCCGAAAGGGGAATAGAATCAAAATATAATATTGTATAATTTTGAACTAACTTGCTACAACAAAATTTATTTATTTATTTTTATTTTTTTATATTTTAATAAGAGACAACTTCATTGATGATTGAAATTTACAAAAGGAATATAATATCATGATGTTTACAAAAAAACCTTCCCAATTTGTAATGAGGGTGACTAGTAAAAATATTAGAACTTTAACCAAGATATAGCTTGGTAACAACATTGTTGTGTGTAGGTCTTTGTCATTTCTATAAATATTCTTTGATTTCTTTCTTTCCCTAGATTCTAAAAGAAAGTCATGATGAGGTTTTTCCATAGTAGAGCTTTTGCATTCTTGAAAGGGTGGTACGTTAATGCAATATCCAAAAGATCCTTTACCTCCCTAGGAAATGTGAGATACCATCTGAATATATTGAAAATCGTTGTCTAGAAATTATGAGCGTATGTGCATTTCATAAACAAGTGACTTTGTGATTCGTTTTTTTCCTTGCATATTGGACACTAATTTGGAGC
AAGAGTGGTGTAAGACACTCTTTTCAGAAGATTTTCACTTGTGCTAATGGCTTTATGCACTATTTCCCAAAGGAAAAACTTCACCTTTTTAGGCTGATGTCCTTTCCATATTCTCTTTGCTAGTGTGAGATTTATTGCTTCTACTTTTTCCCCCACGTCCATCATCAAGAATTTTGTAGAAAAGACCCCGTCAGCGCTGGGGAGCCAAGTTAGTGAGTCTGCTTTGTTTGACAATGCCACAGAGGCAAGGTCAAGAGTTAATTCGGCCCATTCCATTGCTTCATTATCCTTTGAATTCCTACCAAGCTTCAGGTCCTAGAATTTGTTGACAACATTCCACGTTTCTTTGATCGTGACTTTTTTGCTATGAGAGACTTTTAAAAAACGGATACCTCAAGGCTAGCGTGGTGTTTTCAATCCATGGGTCGGTCCAAAATGATCTGCTTCTTCCATCACCCACCTTATGGCAAGTTCGGTTGGTGATGAGGTTTTGATGTTTCTTTATGTACTTCGAAGGCCCTTTTGTAGAAGATGGAGGGGTGATTTTTGTTTGATGGAGGAGTATATTTGACCTTTATAAGATTTCTCCACAAGGCCTTCTCTTCATAATGATATCTCCAATATCCATTTGGCGAGGAGAGCTTTATTCTTCTTCTTTATCGAGAGAAGACCAAGGCCTCCCTTTTTTGTTGGGAGGTTTATAATATTTCATCCAACAAGGTGTAGATCATCTTTCCATAAGTAGTTTCTGAATAATCTTTCTATATGACTTGTATTAGGGTGAGTCTTCCACCTTTTGAAGTATAACATGATGACCATGTTGACAACCTTCTTTCTATTCGACAGAATAATCTTCGAGGGTGAGTCTACAACAACTTGTTTTTAATTCTCCAAGTGACCAAGTTACCTTAAACATAGGACTAATGCCGATTATAGATCTCTAGTCAGTTAATTCTCTATCCAGCTAGCCTAAGTGTAGAGTTTTACCAGTCTATTTGATGGCTTTTTGGATAATAGCCCATGATCTTGAGTTCCTTTCAAGTATCTCAAAATTTTGTTTATAGCTCTAAGATGATGGTCATATACTGACTAACAAAACTCACAGAATATGCGATGTTTGGTCTGGTATTTGAAAGATAAATTAACTTTCCCATGATTCTTTGATACATGCCCTTGTCAACTGGAATAACTTCTTCATTTTGGTGTAGAACTAAATTCAGGTCTATGGGTGTTACTGCAGGTTTACACCCAAGATTTCTCGTCTCTTTCAATAAGTTTAGGACATTTTCTTTGAGATTTTTTAAAAATCTCAAAGGAAACGAAATTTTTTTGATTGAAGAATGAAAAGAGGCTAGTACTCAAAAGATACAAAGCTCCACAGAGGAATGAAAATAAATAGCTAGGTTAGGCTGGAAAGGAATATGTGCAATATTGATCGCACAACTTCGACGACACGACTTCAACGTCAAGGCATGACTTCAACCGTAGCTATGATGACTCCAAAATTTGGTCAAGGTGGAAGATCGGTAGCCATGAAAGCTCTAAATAAATTTAGTCAAGTGGGAAGATCGGTACCTATGAACACTCTGAATAAATTTGGTCAATATGAAAGATTGGAAGCTATGAAAGACTATGCCAGTTAATCACAGCTCCTAGAATAGAATGACTCAGAAAATAATTTTCAAAGTGGTACACCTTTCTTTTTATAATAAAGAAAGGCTAACTAAAGTACCTAGATACTACCTAGAAAATAGGTAACAATGTAGAAAATAAATCCCTAACAATATAGACATAATTAAGGCTAGGATTTAAATAGATATAATTAATGTCATTCCTTCAACAAAAAACATAAACCAACAATAAAAAAGAAATAATTCCTCCCAATAAGTTTCGGATCAAATCTGGAGTTCCCGAGTTCAAACGTCTATAATGTCATTTTCTTTCCATGTAATGTTGATAATTATTTGTTAAATTTTCTACTAATTTTCAAGCCCATAAGT
GAGAGTTGTTAAAATATAAATGGAGTATTAGGAATGTGAATTCACAAAGGCCCCCATTGGCAAGGGCTTGGGGTCCAAGGTTTAAGCCTTTATGTGAGTTTAATACCAAAACTCTTGATGTCACTTAGATCCGAGCCTTGGGCACCGGGTGCCCCTTGGTATAAGGGAGCAAAGCTCCGACTCCTGGTTATCGGAAAAAAAATATGAACTCACACGTGAGTAGTAGTAAGAATATAAATAGAATATTAAGAATGTGATTAATAATTAAATTTCCTATAACCCAACAACTTAGCCAAGTTTATAACCTTGTACTAGTTAAGATTAGGCAAAGTTGCCCAAGTAACCGAGGACTGAAAATGTAGCAGTAGATTGCCTTTCTATCATTACTAGTTTGTAAAGTCTTTAATTGACTAGATGGTGCTTGATGTGCCCTAAATGCAGGAGTCGATGGAATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGTAAGTAGGTTTAAGAGACTATTGCCTTCCCGGAGAATTGTGGACTACAATTTATTCAATCACATGTATGCTTACCGTCTGATATTTTATTGTTATTGAGAATAATGGTAAAGGTGAAAACCTGCTTTCTACGGCGGACTAATCTATTCAAGGAAATAATAAATCCTATTAATCAGGAGATATTTTTGAAAGAAAAATTGTAAGTCAGGAGTTCTCAAGTATTGAAGTTATAAATGACTATAAATCTTCAACTACATTTCAAATAAGCGTGAACATATAAGATGAACAATAAAATATTTTGGCAATTTGATTTAGATTTCTCCTGGCATCTTTTTGAAACTGCTGTTCAACTTGTTATCTTATGTTCTAGACATTTTTTTTTCTTTCTTCTTTTTGATTTGATAGGAAAAAAAAAAAAAGGAAAAAACATGTGAAAAACATATAAAAGAATGGAAATGGAGGAGAAATCTTCTAATTAAAACATAAAAAATATTTAAAACACTTCCCGCGAAGGCTTTGAATGAATTTTGTATTTTTTTTTTTTTTTAATTTTTGGAAGCAAGTCTTCCATTGTAATTGTTCATTTTATCCGTGGAAACTTGTGTTTCCTTGTTAATTAAGACGCTTTCTGCAAAAGTAATCCTAAAATATAAAGTTTTTTTTTTTCATTTTGTGACCACAAAAGGTTAATTAGAACTATATTTTACGGTATAAAATGAGAATGCCTTGGTACTATACTTTTGTGTTCCTTGATTATGTTTCTTATCTTAATCCTTTATTAACATGTATGTTGTGCCTGAAATATGATGGTTTGCATTTACTTATTTGCAAATTCACTTGTCAGGCACATTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATAAGGAAGGTATGTCTATCAGAGTATAATAACTGGCTTCTCAACCAAAAAATGTATGAAAACACTTGTATTGTTTTCAGTGTATTTGATGCGGTAAGCATATACAATTAATGAATTTTGTTAATTTAGTCATGTTGTTCGGTGCTGGATTTGATTGCATTCAGCCTTTAAGATTTCTGGGTTGAAGTCAGACTTGTAAAACAATTCTGATAATTTAATCAATTGTTTATTTACTATTCGACTAAGATTGAGAAGCAATTCTAGGAGTAGAAAACAAAGCATAGAAGTTCTAGTAAACAAACTTATTTTTTAAATACAAAAACAAAATTATAATCTAATGGTCTTAAATAATCTCAGTTATATTTGAAGTCAGTTTGCTATATTTGTTGTTCTTTTATTCCTTTTTCTTTCTAGTTTCTGTGTAATCAGATTTGTTTATGGAAACTAGCTGGTACTAAGCTCCTTGTTTTCACCCTTAATGACTGAAAATTTTATTTGAAATTGGTCCTGAACTAATTCTTCACTCCTAAGATTATTAAAATATTTTTTGTATCTTGTCCCGAACAAATTATCATCCCTATTAAGATTCCAACTTAAAGGGAAAATATGAAAAC
TCTGGAAAATTAGTTTTCTTGATAAATCCCTGTTTATTATCTCAATCTACTTCAGTAAGCATGTTTTGGGTGTGAGATGCTTTTGTTAATGTAACAAAATGAAAAAGAAATTATAATGATGGACTTGTTTTACAATGATTTTAAAGGTTTAATTGTTTGTGAAAATATATGCATTGTGAAACTAAGAACGCATGCTCGGGAAATGAGCAGAAGAATGACATGGATATGAGCACAGAGACACCAATAAACTTTTTTTTTATACAGATATGTCATACCTTGTGATTCTTGCAATAAATGCGAAAGGTGGACATATACAATACTCTAAGAACTGATCCATCAGTTATTGCAAATACAACAAATACTTGATGCTTTGTCTTTATTTTCGTTCCTTTTTCTTGTAGATGTTGGTAACAAATGCTGAACAAGCTATCTTGAGCAAGGACCTGGTAATTTCACTTCAGTTTTATTAGGTTACTTGTGTTCTTGTGAAAGCAGAATTTTTTTTCCGCATATACTATATTTGGTTGATATTTTGGGGTAATTTTCTTTTTCTTTAGAATCGTAAGGACTAGTTAACTCTTTTCTCCTTGATACTCTTTATGGTCTTAATCAATAATCAGTATTTTGAAAAGCATGCCTCGTGTCACCGTCACAAGGCGTGTTGCCCATGGCCGCATGCTCGTTACATGCTGAATCCCTTGCCTTGTACTACTTTATCGCTTTTCTCTATTGCTTTCATGTTATATAATTTATTTCTTGCTTTTTCTAAGAGTAAGACATTTCAGCAAAATGTATTATAGGGGATGAGACTAGTTGTCATATTTTCCCTACCCGAAAAGTTATATACTGTTTAAGCTGTTGTTGTTGCCTTTTGCTTGAATGCTATATAACACCGTTTTCAATCTCTCACTCAATCTTGCTTTTAAACTCTCTTTCAAAATTTCTCTGCCCCGCTTTCTAAAATCGAGGCTAGAGGCTTGAGTATACGTTGCCTAGCCGTAGGCGAAGCGATCCGAAAGTTGGCTTGACTCACTCAGTGGTAAGAGTGCGTGCTGCAAAAACGGCAAGTCCGCTGCCAAGAAAAAGGCGATTGTGACACCTCGCATAGAGCCCTCAACTTCTGTTTACTTCTTCATGACATTTTGTGAATAAATTAAGAAATATGTTCCAAAAACTGTTATTTTTCTAAACAAATATTCATTTAATCCCTTGGAATGGTCGACTGGTTAGGCAGGGGAATGCAATTCTAAGTAGCATGGATCCATTGTTTTTTCTTATTACATAAATGCAAAAACCTGTACTTGACTTAATGTAAATATTGGCATCCACCATGTTATTTTAACTTGTTGCTTCATCCAAATCATTTATGCTGCAGGCAATCTTGCGATCTGATCTTCCGCTCTATATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGGTTTGTGCTATTAATTGTAGGACATGGCACTTCTTCGAATTTTACTGTTCAATCTGGTGATGCATTGCTAGAAATTATTTGAAACTTTAGGAAGGACTTAATTTGACCCACTAATGGGGTCTCAATTCGGCAAAATACTGATTTGAGAAATGAGACATTTAAGAGTAAGATTCTCATTAAAGTGTATAATCAAATGTAACGCCCCAAGCTCACTACTAGCTGATATTGCAGATATTGTTCTCTTTGGGCCCCTTAAGGTTTTTAAAACGTGTCTGCTAGGGTAAGGTTTCCACACCCTTATAAGGAATGTTTCGTTCTCCTCCCCAACCGATGTGGAATCTTACAATCTACCCCCTCCGAACTCAGTGTCCTTGCTGGCTCAACGCCTCGTATCCCCACCCCCTTTCGGGGTTCAGCCTCCTCGTCGTGACCTCAACATGCACTGCTGTGCATGGTGTAATGGGATTCATATATTCAGCTATTTGTAATACGTGTGTAAAGCTTTCAAATGCTAGAAGGTGTTTAGAAGCAAGACAACTCTGATTTTTGA
CAGAAATTTTTGGTTGTACAGGATAATGGGGAGAAATTCACAAGCCTCTTAACTGCTATTGGTGCATATGCAGAAGGATTCTCAGCTGATCCAATTATGAGGAGAGTACTAAACTTATGGAAGAAGCTTGAAAAAAGTTAGTGATCGATGTCAATTTGCTCAGCCTCTTGTCCACTTTGTTGCATGTAAATTTGGTTTTGTTGTATAAGTGGAGATGTTAGCAGAGGTTAGAAACCAAATTAGTCTCTGTTTATGAATACCCCCTTGTTCTTCGTTGCCTTTGAGCAAAGAAAATGAGAAACACAGGAAATTTCAGCTTTACATGTGATGGGGTATCCCTTGCTGTGAATCACTGAATACTGGCTGCTTCTCCA

mRNA sequence

CAGGGTGGGCGAAGATCGTAAACAAGGTTATATCCGAATCGGGCAGCGGTCGAGGAGGCGCTTTGGTGGTTCTCATGGCTTGCCACCATCTTCACACTGCGACTGCTAGTGCATCGCACATTTGCAGAAATTTTTTGGGATACATTTTCACCTCCAAATTTGCTTCTCCTCTTCGTTTCTCTTCTTCTTCTTCTTCTTTGAGGATACAGTCTCCGGGTCACCATTGTTTTCCGTCTTTCTCCGTTCTGCTATCCCCGAAGGGTTACTGTAGTTCATCTAGAAGTGTAAATGCTGCGATTAACGTAGATAGCAATGCAACTTATCATGGTAGTCCGGCATCTTCTAGTAGCCAGCAAATGCTGCAAGTCCAAGATTCATTATCGAATTCACCCACATGCAAAGAAGAAACTGAAATTGATAGTCCTTCAGATGCGAGGGTCATGCTCATAGATGGCACATCAATCATTTATAGAGCATACCACAAGCTTTTGGCAAAGCTGCATCATGGCCATTTATCACATGCGGATGGCAATGGAGATTGGGTGCTAACAATATTTACTGCCTTGTCACTTATTGTTGATGTTCTGGAGTTTATGCCTTCTCATGTGGCGGTTGTGTTTGATCATGATGGACATGCATATGGTCATACTTGTTATTCATCCAATGAAAATTTCATGGCAAAAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCTACACCTGATACCATAGTCCAGGGGCTCCAATACTTAAAGGCATCCATAAAGTCCATGTCCGTAAAGGTGATTGAGGTACCTGGAGTGGAGGCTGATGATGTGATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTGTAAGGTTCGTGTTGTCTCCCCTGACAAAGACTTCTTCCAGATTATATCTCCTTCATTACGTCTTTTACGAATAGCTCCACGTGGGTTTGAGATGGTTTCTTTTGGGCTGGAGGATTTTGCCGAAAAATATGGAGTTCTGGAACCTTCTCAGTTCGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGGAGTCGATGGAATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGCACATTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATAAGGAAGATGTTGGTAACAAATGCTGAACAAGCTATCTTGAGCAAGGACCTGGCAATCTTGCGATCTGATCTTCCGCTCTATATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGGATAATGGGGAGAAATTCACAAGCCTCTTAACTGCTATTGGTGCATATGCAGAAGGATTCTCAGCTGATCCAATTATGAGGAGAGTACTAAACTTATGGAAGAAGCTTGAAAAAAGTTAGTGATCGATGTCAATTTGCTCAGCCTCTTGTCCACTTTGTTGCATGTAAATTTGGTTTTGTTGTATAAGTGGAGATGTTAGCAGAGGTTAGAAACCAAATTAGTCTCTGTTTATGAATACCCCCTTGTTCTTCGTTGCCTTTGAGCAAAGAAAATGAGAAACACAGGAAATTTCAGCTTTACATGTGATGGGGTATCCCTTGCTGTGAATCACTGAATACTGGCTGCTTCTCCA

Coding sequence (CDS)

ATGGCTTGCCACCATCTTCACACTGCGACTGCTAGTGCATCGCACATTTGCAGAAATTTTTTGGGATACATTTTCACCTCCAAATTTGCTTCTCCTCTTCGTTTCTCTTCTTCTTCTTCTTCTTTGAGGATACAGTCTCCGGGTCACCATTGTTTTCCGTCTTTCTCCGTTCTGCTATCCCCGAAGGGTTACTGTAGTTCATCTAGAAGTGTAAATGCTGCGATTAACGTAGATAGCAATGCAACTTATCATGGTAGTCCGGCATCTTCTAGTAGCCAGCAAATGCTGCAAGTCCAAGATTCATTATCGAATTCACCCACATGCAAAGAAGAAACTGAAATTGATAGTCCTTCAGATGCGAGGGTCATGCTCATAGATGGCACATCAATCATTTATAGAGCATACCACAAGCTTTTGGCAAAGCTGCATCATGGCCATTTATCACATGCGGATGGCAATGGAGATTGGGTGCTAACAATATTTACTGCCTTGTCACTTATTGTTGATGTTCTGGAGTTTATGCCTTCTCATGTGGCGGTTGTGTTTGATCATGATGGACATGCATATGGTCATACTTGTTATTCATCCAATGAAAATTTCATGGCAAAAGGATCCACTTTTCGTCATACACGTTACCCTGCATACAAGAGTAACAGGCCACCTACACCTGATACCATAGTCCAGGGGCTCCAATACTTAAAGGCATCCATAAAGTCCATGTCCGTAAAGGTGATTGAGGTACCTGGAGTGGAGGCTGATGATGTGATTGGCACATTGGCTTTGAGAAGTGTTGCTGCTGGGTGTAAGGTTCGTGTTGTCTCCCCTGACAAAGACTTCTTCCAGATTATATCTCCTTCATTACGTCTTTTACGAATAGCTCCACGTGGGTTTGAGATGGTTTCTTTTGGGCTGGAGGATTTTGCCGAAAAATATGGAGTTCTGGAACCTTCTCAGTTCGTTGATGTGATGTCTTTAGTTGGTGACAAATCTGATAATATTCCAGGAGTCGATGGAATTGGAAATGTCAATGCTGTGCAACTTATCACTAGATTCGGCACATTAGAAAATTTGTTGCAACATGTTGATCAAGTGGAAGATGAACGTATAAGGAAGATGTTGGTAACAAATGCTGAACAAGCTATCTTGAGCAAGGACCTGGCAATCTTGCGATCTGATCTTCCGCTCTATATGGTACCATTTACCACCAGAGATCTTTTATTCAAGAAACCGGAGGATAATGGGGAGAAATTCACAAGCCTCTTAACTGCTATTGGTGCATATGCAGAAGGATTCTCAGCTGATCCAATTATGAGGAGAGTACTAAACTTATGGAAGAAGCTTGAAAAAAGTTAG

Protein sequence

MACHHLHTATASASHICRNFLGYIFTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVLLSPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLEKS
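
The relationship between the CDS and the protein sequence above can be illustrated by translating codons with the standard genetic code. The sketch below is not the database's own pipeline; `translate` and the table names are hypothetical, and it assumes the standard nuclear code with the reading frame starting at the first base.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest), '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with N or other symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGCTTGCCACCATCTTCACACT"))  # prints MACHHLHT
# Last six codons of the CDS, ending in the TAG stop codon.
print(translate("AAGCTTGAAAAAAGTTAG"))        # prints KLEKS
```

Both fragments agree with the start (MACHHLHT...) and end (...KLEKS) of the protein sequence shown above.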
BLAST of CmaCh03G000030 vs. Swiss-Prot
Match: DPO1_THEFI (DNA polymerase I, thermostable OS=Thermus filiformis GN=polA PE=1 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 8.0e-39
Identity = 99/275 (36.00%), Positives = 155/275 (56.36%), Query Frame = 1

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           RV+L+DG  + YR ++ L         S     G+ V  ++     ++  L+     V V
Sbjct: 13  RVLLVDGHHLAYRTFYAL---------SLTTSRGEPVQMVYGFARSLLKALKEDGQAVVV 72

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
           VFD                  AK  +FRH  Y AYK+ R PTP+   + L  +K  +  +
Sbjct: 73  VFD------------------AKAPSFRHEAYEAYKAGRAPTPEDFPRQLALVKRLVDLL 132

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 300
            +  +E PG EADDV+GTLA ++   G +VR+++ D+DFFQ++S  + +L   P G  + 
Sbjct: 133 GLVRLEAPGYEADDVLGTLAKKAEREGMEVRILTGDRDFFQLLSEKVSVL--LPDGTLVT 192

Query: 301 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 360
               +D  EKYGV  P ++VD  +L GD+SDNIPGV GIG   A++L+  +G++ENLL++
Sbjct: 193 P---KDVQEKYGV-PPERWVDFRALTGDRSDNIPGVAGIGEKTALRLLAEWGSVENLLKN 252

Query: 361 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPL 396
           +D+V+ + +R+ +  + E   LS DLA +R+DLPL
Sbjct: 253 LDRVKPDSLRRKIEAHLEDLHLSLDLARIRTDLPL 254
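
The percentages in an HSP header are the listed counts divided by the alignment length. The sketch below parses such a header line and recomputes them; the regular expression and the names `parse_hsp_stats` and `percent` are assumptions made for illustration and only cover the header layout shown on this page.

```python
import re

# Matches fragments such as "Identity = 99/275 (36.00%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_stats(line):
    """Return {label: (count, alignment_length, reported_percent)}."""
    stats = {}
    for label, count, length, pct in HSP_STAT.findall(line):
        stats[label] = (int(count), int(length), float(pct))
    return stats


def percent(count, length):
    """Recompute a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    header = "Identity = 99/275 (36.00%), Positives = 155/275 (56.36%), Query Frame = 1"
    for label, (count, length, reported) in parse_hsp_stats(header).items():
        print(label, reported, percent(count, length))
```

For the Thermus filiformis hit, 99/275 gives 36.00% and 155/275 gives 56.36%, in agreement with the reported values.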

BLAST of CmaCh03G000030 vs. Swiss-Prot
Match: DPO1_GEOSE (DNA polymerase I OS=Geobacillus stearothermophilus GN=polA PE=1 SV=2)

HSP 1 Score: 162.5 bits (410), Expect = 1.0e-38
Identity = 105/306 (34.31%), Positives = 165/306 (53.92%), Query Frame = 1

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           +++LIDG S+ YRA+  L   LH+    H +    + + +   L+      E  P+H+ V
Sbjct: 4   KLVLIDGNSVAYRAFFAL-PLLHNDKGIHTNAVYGFTMMLNKILA------EEQPTHILV 63

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
            FD                  A  +TFRH  +  YK  R  TP  + +    L+  +K+ 
Sbjct: 64  AFD------------------AGKTTFRHETFQDYKGGRQQTPPELSEQFPLLRELLKAY 123

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGF-EM 300
            +   E+   EADD+IGT+A R+   G  V+V+S D+D  Q+ SP + +  I  +G  ++
Sbjct: 124 RIPAYELDHYEADDIIGTMAARAEREGFAVKVISGDRDLTQLASPQVTV-EITKKGITDI 183

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
            S+  E   EKYG L P Q VD+  L+GDKSDNIPGV GIG   AV+L+ +FGT+EN+L 
Sbjct: 184 ESYTPETVVEKYG-LTPEQIVDLKGLMGDKSDNIPGVPGIGEKTAVKLLKQFGTVENVLA 243

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 420
            +D+++ E++++ L    + A+LSK LA +  D P   V  T  D+++K   ++ EK  +
Sbjct: 244 SIDEIKGEKLKENLRQYRDLALLSKQLAAICRDAP---VELTLDDIVYK--GEDREKVVA 277

Query: 421 LLTAIG 426
           L   +G
Sbjct: 304 LFQELG 277

BLAST of CmaCh03G000030 vs. Swiss-Prot
Match: DPO1_BACCA (DNA polymerase I OS=Bacillus caldotenax GN=polA PE=1 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 2.3e-38
Identity = 104/306 (33.99%), Positives = 167/306 (54.58%), Query Frame = 1

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           +++LIDG+S+ YRA+  L   LH+    H +    + + +   L+      E  P+H+ V
Sbjct: 4   KLVLIDGSSVAYRAFFAL-PLLHNDKGIHTNAVYGFTMMLNKILA------EEEPTHMLV 63

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
            FD                  A  +TFRH  +  YK  R  TP  + +    L+  +++ 
Sbjct: 64  AFD------------------AGKTTFRHEAFQEYKGGRQQTPPELSEQFPLLRELLRAY 123

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGF-EM 300
            +   E+   EADD+IGTLA R+   G +V+V+S D+D  Q+ SP + +  I  +G  ++
Sbjct: 124 RIPAYELENYEADDIIGTLAARAEQEGFEVKVISGDRDLTQLASPHVTV-DITKKGITDI 183

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
             +  E   EKYG L P Q VD+  L+GDKSDNIPGV GIG   AV+L+ +FGT+EN+L 
Sbjct: 184 EPYTPEAVREKYG-LTPEQIVDLKGLMGDKSDNIPGVPGIGEKTAVKLLRQFGTVENVLA 243

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 420
            +D+++ E++++ L  + E A+LSK LA +R D P   V  +  D+ ++   ++ EK  +
Sbjct: 244 SIDEIKGEKLKETLRQHREMALLSKKLAAIRRDAP---VELSLDDIAYQ--GEDREKVVA 277

Query: 421 LLTAIG 426
           L   +G
Sbjct: 304 LFKELG 277

BLAST of CmaCh03G000030 vs. Swiss-Prot
Match: DPO1F_THETH (DNA polymerase I, thermostable OS=Thermus thermophilus GN=polA PE=1 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 1.3e-36
Identity = 100/275 (36.36%), Positives = 146/275 (53.09%), Query Frame = 1

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           RV+L+DG  + YR +  L               G+ V  ++     ++  L+     V V
Sbjct: 12  RVLLVDGHHLAYRTFFALKGL--------TTSRGEPVQAVYGFAKSLLKALKEDGDVVVV 71

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
           VFD                  AK  +FRH  Y AYK+ R PTP+   + L  +K  +  +
Sbjct: 72  VFD------------------AKAPSFRHEAYEAYKAGRAPTPEDFPRQLALIKELVDLL 131

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 300
            +  +EVPG EADDV+ TLA R+   G +VR+++ D+D +Q++S  + +L   P G+ + 
Sbjct: 132 GLVRLEVPGFEADDVLATLAKRAEKEGYEVRILTADRDLYQLLSERIAILH--PEGYLIT 191

Query: 301 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 360
              L    EKYG L P Q+VD  +L GD SDNIPGV GIG   A +LI  +G+LENL QH
Sbjct: 192 PAWL---YEKYG-LRPEQWVDYRALAGDPSDNIPGVKGIGEKTAQRLIREWGSLENLFQH 251

Query: 361 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPL 396
           +DQV+   +R+ L    E   LS+ L+ + +DLPL
Sbjct: 252 LDQVKPS-LREKLQAGMEALALSRKLSQVHTDLPL 253

BLAST of CmaCh03G000030 vs. Swiss-Prot
Match: DPO1T_THET8 (DNA polymerase I, thermostable OS=Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579) GN=polA PE=3 SV=2)

HSP 1 Score: 155.2 bits (391), Expect = 1.7e-36
Identity = 100/276 (36.23%), Positives = 152/276 (55.07%), Query Frame = 1

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFT-ALSLIVDVLEFMPSHVA 180
           RV+L+DG  + YR +  L               G+ V  ++  A SL+  + E     V 
Sbjct: 13  RVLLVDGHHLAYRTFFALKGL--------TTSRGEPVQAVYGFAKSLLKALKEDGYKAVF 72

Query: 181 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 240
           VVFD                  AK  +FRH  Y AYK+ R PTP+   + L  +K  +  
Sbjct: 73  VVFD------------------AKAPSFRHEAYEAYKAGRAPTPEDFPRQLALIKELVDL 132

Query: 241 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 300
           +    +EVPG EADDV+ TLA ++   G +VR+++ D+D +Q++S  + +L   P G  +
Sbjct: 133 LGFTRLEVPGYEADDVLATLAKKAEKEGYEVRILTADRDLYQLVSDRVAVLH--PEGHLI 192

Query: 301 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 360
                E   EKYG L P Q+VD  +LVGD SDN+PGV GIG   A++L+  +G+LENLL+
Sbjct: 193 TP---EWLWEKYG-LRPEQWVDFRALVGDPSDNLPGVKGIGEKTALKLLKEWGSLENLLK 252

Query: 361 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPL 396
           ++D+V+ E +R+ +  + E   LS +L+ +R+DLPL
Sbjct: 253 NLDRVKPENVREKIKAHLEDLRLSLELSRVRTDLPL 256

BLAST of CmaCh03G000030 vs. TrEMBL
Match: D7T2D6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0094g00430 PE=4 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 4.6e-158
Identity = 304/451 (67.41%), Positives = 360/451 (79.82%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLGYI--FTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVL 60
           MAC+       S+ H  R   G +  + S F+   +  ++S  L+ ++  H   PS   +
Sbjct: 1   MACYR------SSHHHIRFLWGNLNCWRSSFSRTQKIGNNSCCLQRRNLIHS--PS---I 60

Query: 61  LSPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPS 120
           LS KG C+ S S++++I+  ++   +G+   SS  +    Q +  +S   KE     S S
Sbjct: 61  LSRKGCCTLSNSLDSSIHEVAHTISYGNTTISSKSERKLCQGAFVDSVDHKERKMDISSS 120

Query: 121 DARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHV 180
           + RVMLIDGTSIIYRAY+KLLAKLHHG+LSHADGNGDWVLTIF ALSLIVDVL+F+PSHV
Sbjct: 121 NGRVMLIDGTSIIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFAALSLIVDVLDFIPSHV 180

Query: 181 AVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIK 240
           AVVFDH+G  +GHT  SS E+ MAKG  FRHT YP+YKSNRPPTPDTIVQGLQYLKASIK
Sbjct: 181 AVVFDHNGIPFGHTSISSKESIMAKGLNFRHTLYPSYKSNRPPTPDTIVQGLQYLKASIK 240

Query: 241 SMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFE 300
           +MS+KVIEVPGVEADDVIGTL++RSV AG KVRVVSPDKDFFQI+SPSLRLLRIAPRGFE
Sbjct: 241 AMSIKVIEVPGVEADDVIGTLSVRSVDAGYKVRVVSPDKDFFQILSPSLRLLRIAPRGFE 300

Query: 301 MVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLL 360
           M SFG+EDFA++YG LEPSQFVDV+SLVGDKSDNIPGV+GIGNV+AVQLIT+FGTLENLL
Sbjct: 301 MTSFGMEDFAKRYGNLEPSQFVDVISLVGDKSDNIPGVEGIGNVHAVQLITKFGTLENLL 360

Query: 361 QHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFT 420
           Q VDQV++ERIRK L++ A+QA+LSK+LA+LR DLP YMVPFTT DL+F KPEDNGEKFT
Sbjct: 361 QCVDQVQEERIRKALISGADQAVLSKNLALLRCDLPFYMVPFTTEDLIFTKPEDNGEKFT 420

Query: 421 SLLTAIGAYAEGFSADPIMRRVLNLWKKLEK 450
           SLL AI AYAEGFSADPI+RR   LWKKLEK
Sbjct: 421 SLLNAISAYAEGFSADPIIRRAFYLWKKLEK 440

BLAST of CmaCh03G000030 vs. TrEMBL
Match: A0A061EJA7_THECC (5'-3' exonuclease family protein isoform 1 OS=Theobroma cacao GN=TCM_019883 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 7.3e-156
Identity = 290/402 (72.14%), Positives = 333/402 (82.84%), Query Frame = 1

Query: 52  FPSFSVLLSP-----KGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSP 111
           F  F V+  P     KGYCS S ++N       +AT HG+   SS ++ L  Q++  ++ 
Sbjct: 38  FKKFYVIRPPPCQTIKGYCSLSYTLNTLPGA-RHATSHGNAVISSKKEQLLHQEAALDTS 97

Query: 112 TCKEETEIDSPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSL 171
             +E     + S+ RVMLIDGTS+IYRAY+KLLAKLHHG+LSHADGNGDWVLTIFTALSL
Sbjct: 98  NLQERVVNANYSNNRVMLIDGTSVIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFTALSL 157

Query: 172 IVDVLEFMPSHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTI 231
           I+DVLEF+PSHVAVVFDHDG  +GHT  SS EN MAKG  FRHT YP+YKSNRPPTPDTI
Sbjct: 158 IIDVLEFVPSHVAVVFDHDGIPFGHTSISSKENVMAKGLNFRHTLYPSYKSNRPPTPDTI 217

Query: 232 VQGLQYLKASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPS 291
           VQGLQYLKASIK+MS+KVIEVPGVEADDVIGTLA RSV AG KVRVVSPDKDFFQI+SPS
Sbjct: 218 VQGLQYLKASIKAMSIKVIEVPGVEADDVIGTLAARSVDAGFKVRVVSPDKDFFQILSPS 277

Query: 292 LRLLRIAPRGFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQ 351
           LRLLRIAPRG+EMVSFGLEDF+++YG L+PSQFVD+++L+GD+ DNIPGVDGIGNV+AVQ
Sbjct: 278 LRLLRIAPRGYEMVSFGLEDFSKRYGDLKPSQFVDMVALMGDRCDNIPGVDGIGNVHAVQ 337

Query: 352 LITRFGTLENLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLL 411
           LI++FGTLENLLQ VDQVE + IRK L  NA+QA+LSK+LA+LR DLP YM PF T DL 
Sbjct: 338 LISKFGTLENLLQCVDQVEVDHIRKALKGNADQALLSKNLAMLRCDLPFYMAPFATTDLT 397

Query: 412 FKKPEDNGEKFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLE 449
           FKKPEDNGEKFTSLLTAI AYAEGFSADPI+RR   LWKKLE
Sbjct: 398 FKKPEDNGEKFTSLLTAISAYAEGFSADPIIRRAFYLWKKLE 438

BLAST of CmaCh03G000030 vs. TrEMBL
Match: A0A059DB23_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A00006 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 2.8e-155
Identity = 287/390 (73.59%), Positives = 323/390 (82.82%), Query Frame = 1

Query: 67  SSRSVNAAINVDSNATYHGSPASSSSQQMLQV-------QDSLSNSPTCKEETEIDSPSD 126
           SSR  +  +   S +T HG    +S   ++         QD+L +     E      PS 
Sbjct: 43  SSRKGHYVLAKCSFSTLHGVATETSRNSVIPSKTSPSVGQDALLDQVNQGEREAKADPSG 102

Query: 127 ARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVA 186
            RVMLIDGTS+IYRAY+KLLA+LHHGHL HADGNGDWVLTI TALSLI+DVLEF PSHVA
Sbjct: 103 GRVMLIDGTSVIYRAYYKLLARLHHGHLPHADGNGDWVLTIVTALSLIIDVLEFGPSHVA 162

Query: 187 VVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKS 246
           VVFDHDG  +GHT   S E+FMAKG  FRHT YP YKSNRPPTPDTIVQGLQYLKASIK+
Sbjct: 163 VVFDHDGIPFGHTFNQSKESFMAKGLNFRHTLYPTYKSNRPPTPDTIVQGLQYLKASIKA 222

Query: 247 MSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEM 306
           MS+KVIEVPGVEADDVIGTLA+RSV AG KVRVVSPDKDFFQI+SPSLRLLRIAPRG +M
Sbjct: 223 MSIKVIEVPGVEADDVIGTLAVRSVEAGYKVRVVSPDKDFFQILSPSLRLLRIAPRGLDM 282

Query: 307 VSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQ 366
           VSFG+EDFA+KYG L+PSQFVDV+SLVGDK DNIPGV+GIGNV+AVQLIT+FGTLENLLQ
Sbjct: 283 VSFGMEDFAKKYGALDPSQFVDVVSLVGDKCDNIPGVEGIGNVHAVQLITKFGTLENLLQ 342

Query: 367 HVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTS 426
            VDQVE+ERIRK L+  A+ AILSK+LA++R+DLP YMVPFTT DL FKKPED+GEKFTS
Sbjct: 343 CVDQVEEERIRKALIAQADNAILSKNLALIRTDLPFYMVPFTTEDLAFKKPEDDGEKFTS 402

Query: 427 LLTAIGAYAEGFSADPIMRRVLNLWKKLEK 450
           LL AIGAYAEGFSADPI+RR LNLWKKLE+
Sbjct: 403 LLKAIGAYAEGFSADPIIRRALNLWKKLER 432

BLAST of CmaCh03G000030 vs. TrEMBL
Match: A0A0D2NBI5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 5.8e-153
Identity = 279/387 (72.09%), Positives = 324/387 (83.72%), Query Frame = 1

Query: 62  KGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDAR 121
           K YCS S ++++ +  D +   HG+   SS ++ +  Q++       +E       S+ R
Sbjct: 53  KQYCSLSGNLSSTVPGD-HPIPHGNAVISSKKEQIFHQEAALGRANLQETVVNAKSSNGR 112

Query: 122 VMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAVV 181
           VMLIDGTS+IYRAY+KLLAKLHHG+LSHADGNGDWVLTIFTALSLI+DVLEF+PSHVAVV
Sbjct: 113 VMLIDGTSVIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFTALSLIIDVLEFVPSHVAVV 172

Query: 182 FDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMS 241
           FDHDG  +GHT  SS EN M KG  FRHT +P+YKSNRPPTPDTIVQGLQYLKASIK+MS
Sbjct: 173 FDHDGIPFGHTSISSKENVMGKGLNFRHTLFPSYKSNRPPTPDTIVQGLQYLKASIKAMS 232

Query: 242 VKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVS 301
           +KVIEVPGVEADDVIGTLA RSV  G KVRVVSPDKDFFQI+ PSLRLLRIAPRG+EMVS
Sbjct: 233 IKVIEVPGVEADDVIGTLAARSVDEGFKVRVVSPDKDFFQILCPSLRLLRIAPRGYEMVS 292

Query: 302 FGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQHV 361
           FG+EDF+++YG L+PSQFVDV+SLVGD+ DNIPGVDGIGNV+AVQLIT+FGTLENLL+ V
Sbjct: 293 FGMEDFSKRYGDLKPSQFVDVVSLVGDRCDNIPGVDGIGNVHAVQLITKFGTLENLLKCV 352

Query: 362 DQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSLL 421
           D+VE + IRK L+ NA+QA+LSK+LA+LR DLP YMVPF+TRDL F KPEDNGEKFTSLL
Sbjct: 353 DEVEVDHIRKALIANADQAVLSKNLAMLRCDLPFYMVPFSTRDLTFNKPEDNGEKFTSLL 412

Query: 422 TAIGAYAEGFSADPIMRRVLNLWKKLE 449
            AI AYAEGFSADPI+RR   LWKKLE
Sbjct: 413 NAISAYAEGFSADPIIRRAFYLWKKLE 438

BLAST of CmaCh03G000030 vs. TrEMBL
Match: A0A0D2RDM6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1)

HSP 1 Score: 547.7 bits (1410), Expect = 1.3e-152
Identity = 278/385 (72.21%), Positives = 323/385 (83.90%), Query Frame = 1

Query: 64  YCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDARVM 123
           YCS S ++++ +  D +   HG+   SS ++ +  Q++       +E       S+ RVM
Sbjct: 54  YCSLSGNLSSTVPGD-HPIPHGNAVISSKKEQIFHQEAALGRANLQETVVNAKSSNGRVM 113

Query: 124 LIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAVVFD 183
           LIDGTS+IYRAY+KLLAKLHHG+LSHADGNGDWVLTIFTALSLI+DVLEF+PSHVAVVFD
Sbjct: 114 LIDGTSVIYRAYYKLLAKLHHGYLSHADGNGDWVLTIFTALSLIIDVLEFVPSHVAVVFD 173

Query: 184 HDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMSVK 243
           HDG  +GHT  SS EN M KG  FRHT +P+YKSNRPPTPDTIVQGLQYLKASIK+MS+K
Sbjct: 174 HDGIPFGHTSISSKENVMGKGLNFRHTLFPSYKSNRPPTPDTIVQGLQYLKASIKAMSIK 233

Query: 244 VIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVSFG 303
           VIEVPGVEADDVIGTLA RSV  G KVRVVSPDKDFFQI+ PSLRLLRIAPRG+EMVSFG
Sbjct: 234 VIEVPGVEADDVIGTLAARSVDEGFKVRVVSPDKDFFQILCPSLRLLRIAPRGYEMVSFG 293

Query: 304 LEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQHVDQ 363
           +EDF+++YG L+PSQFVDV+SLVGD+ DNIPGVDGIGNV+AVQLIT+FGTLENLL+ VD+
Sbjct: 294 MEDFSKRYGDLKPSQFVDVVSLVGDRCDNIPGVDGIGNVHAVQLITKFGTLENLLKCVDE 353

Query: 364 VEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSLLTA 423
           VE + IRK L+ NA+QA+LSK+LA+LR DLP YMVPF+TRDL F KPEDNGEKFTSLL A
Sbjct: 354 VEVDHIRKALIANADQAVLSKNLAMLRCDLPFYMVPFSTRDLTFNKPEDNGEKFTSLLNA 413

Query: 424 IGAYAEGFSADPIMRRVLNLWKKLE 449
           I AYAEGFSADPI+RR   LWKKLE
Sbjct: 414 ISAYAEGFSADPIIRRAFYLWKKLE 437

BLAST of CmaCh03G000030 vs. TAIR10
Match: AT3G52050.3 (AT3G52050.3 5'-3' exonuclease family protein)

HSP 1 Score: 514.6 bits (1324), Expect = 6.1e-146
Identity = 268/395 (67.85%), Positives = 319/395 (80.76%), Query Frame = 1

Query: 56  SVLLSPKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEE--TE 115
           S+  S K YCSS      A++  SN    GS  +S S+    V       P   EE    
Sbjct: 59  SLARSAKYYCSS-----VAVSEFSNEAASGSTLTSISED---VTPQSIKYPFKSEERVAS 118

Query: 116 IDSPSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEF 175
             + S+ RVMLIDGTSIIYRAY+KLLA+L+HGHL+HADGN DWVLTIF++LSL++DVL+F
Sbjct: 119 TAASSNGRVMLIDGTSIIYRAYYKLLARLNHGHLAHADGNADWVLTIFSSLSLLIDVLKF 178

Query: 176 MPSHVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYL 235
           +PSHVAVVFDHDG  YG T  SS     AKG  FRHT YPAYKSNRPPTPDTIVQGLQYL
Sbjct: 179 LPSHVAVVFDHDGVPYGTTSNSSTGYRSAKGMNFRHTLYPAYKSNRPPTPDTIVQGLQYL 238

Query: 236 KASIKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIA 295
           KASIK+MS+KVIEVPGVEADDVIGTLA+RS++AG KVRVVSPDKDFFQI+SPSLRLLR+ 
Sbjct: 239 KASIKAMSIKVIEVPGVEADDVIGTLAMRSISAGFKVRVVSPDKDFFQILSPSLRLLRLT 298

Query: 296 PRGFEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGT 355
           PRG EM SFG+EDFA+K+G LEP+QFVD+++L GDKSDNIPGVDGIGNV+AV+LI+RFGT
Sbjct: 299 PRGSEMASFGMEDFAKKFGNLEPAQFVDIIALAGDKSDNIPGVDGIGNVHAVELISRFGT 358

Query: 356 LENLLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDN 415
           LENLLQ VD++++ +I++ L+ +A+QAILSK LA+LRSDLP Y+VPF T+DL FKKPEDN
Sbjct: 359 LENLLQSVDEIKEGKIKESLIASADQAILSKKLALLRSDLPDYIVPFDTKDLTFKKPEDN 418

Query: 416 GEKFTSLLTAIGAYAEGFSADPIMRRVLNLWKKLE 449
           GEK +SLL AI  YAEGFSADP++RR   LW+KLE
Sbjct: 419 GEKLSSLLIAIADYAEGFSADPVIRRAFRLWEKLE 445

BLAST of CmaCh03G000030 vs. TAIR10
Match: AT1G34380.2 (AT1G34380.2 5'-3' exonuclease family protein)

HSP 1 Score: 70.1 bits (170), Expect = 4.0e-12
Identity = 73/301 (24.25%), Positives = 134/301 (44.52%), Query Frame = 1

Query: 62  KGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDAR 121
           K   SSS S ++  +V+   T+H     + + Q+LQ      N+   +++ +       R
Sbjct: 29  KWVSSSSSSFSSHSSVE---TFH----RTGNVQVLQKDVLCGNNEEIRKKNK-------R 88

Query: 122 VMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAVV 181
           V  +D + + Y            G+   +   G W+   F+ +SL   V       +AV+
Sbjct: 89  VFFLDVSPLCYE-----------GNKPSSQAFGHWISLFFSQVSLTDPV-------IAVI 148

Query: 182 FDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSMS 241
              +G+        S   + A   +  H RY    S RP          Q++   ++  +
Sbjct: 149 DGEEGNQRRRELLPS---YKAHRKSPNHGRY----SKRPH---------QFVDEVLRKCN 208

Query: 242 VKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMVS 301
           V V+ + G EADDV+ TL  ++V  G +  + SPDKDF Q+IS +++++           
Sbjct: 209 VPVVRIEGHEADDVVATLMEQAVQRGYRAVIASPDKDFKQLISENVQIVIPLADLRRWSF 268

Query: 302 FGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPG----VDGIGNVNAVQLITRFGTLENL 359
           + L+ +  +Y   +P   +    ++GD+ D +PG    V   G   A++L+ + G+LE+L
Sbjct: 269 YTLKHYHAQYN-CDPQSDLSFRCIMGDEVDGVPGIQHMVPAFGRKTAMKLVRKHGSLESL 280

BLAST of CmaCh03G000030 vs. NCBI nr
Match: gi|449453197|ref|XP_004144345.1| (PREDICTED: uncharacterized protein LOC101222649 isoform X1 [Cucumis sativus])

HSP 1 Score: 750.4 bits (1936), Expect = 1.9e-213
Identity = 385/450 (85.56%), Positives = 410/450 (91.11%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLGYIFTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVLLS 60
           MA HHLHTATASASHICRNFLG+IFTSKF  P RFS+SSS +       H FPSFS+LLS
Sbjct: 19  MASHHLHTATASASHICRNFLGFIFTSKFPHPFRFSTSSSRI-------HSFPSFSLLLS 78

Query: 61  PKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDA 120
           PKGYCSSS S+N+A  +D+  TYHGS AS+  Q M+Q QDSLSN  T KE+T ID+P+DA
Sbjct: 79  PKGYCSSSGSINSANTMDTVPTYHGSSASTRCQPMVQFQDSLSNPLTFKEDTGIDNPADA 138

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           RVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLE MPSHVAV
Sbjct: 139 RVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEIMPSHVAV 198

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
           VFDHDGH YGHT  SSNENFM+KGSTFRHT YPAYKSNR PTPDT+VQGLQYLKASIKSM
Sbjct: 199 VFDHDGHPYGHTYISSNENFMSKGSTFRHTIYPAYKSNRAPTPDTVVQGLQYLKASIKSM 258

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 300
           S+KVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EMV
Sbjct: 259 SIKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEMV 318

Query: 301 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 360
           SFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH
Sbjct: 319 SFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 378

Query: 361 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSL 420
           VDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTSL
Sbjct: 379 VDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTSL 438

Query: 421 LTAIGAYAEGFSADPIMRRVLNLWKKLEKS 451
           LTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LTAIGAYAERFSADPIIRRVLYLWKKLDKN 461

BLAST of CmaCh03G000030 vs. NCBI nr
Match: gi|659069411|ref|XP_008449676.1| (PREDICTED: uncharacterized protein LOC103491482 isoform X1 [Cucumis melo])

HSP 1 Score: 743.4 bits (1918), Expect = 2.3e-211
Identity = 384/450 (85.33%), Positives = 412/450 (91.56%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLGYIFTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVLLS 60
           MA HHLHTATASASHICRNFLG++FTSKF  P RFS+SSS +       H FPS S+LLS
Sbjct: 19  MASHHLHTATASASHICRNFLGFVFTSKFPVPFRFSTSSSRI-------HSFPS-SLLLS 78

Query: 61  PKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDA 120
           PKGYCSSS S+N++  +D+ ATYHGS AS+  Q M+Q QDSLSNS T KE+T ID+P+DA
Sbjct: 79  PKGYCSSSGSINSSNIIDTIATYHGSSASTRRQSMVQFQDSLSNSLTFKEDTGIDNPADA 138

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           RVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV
Sbjct: 139 RVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 198

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
           VFDHDG++YGHT  SSNENF++KGSTFRHT YPAYKSNR P PDTIVQGLQYLKASIKSM
Sbjct: 199 VFDHDGYSYGHTYISSNENFVSKGSTFRHTIYPAYKSNRAPVPDTIVQGLQYLKASIKSM 258

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 300
           SVKVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EMV
Sbjct: 259 SVKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEMV 318

Query: 301 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 360
           SFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH
Sbjct: 319 SFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 378

Query: 361 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSL 420
           VDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTSL
Sbjct: 379 VDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTSL 438

Query: 421 LTAIGAYAEGFSADPIMRRVLNLWKKLEKS 451
           LTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LTAIGAYAERFSADPIIRRVLYLWKKLDKN 460

BLAST of CmaCh03G000030 vs. NCBI nr
Match: gi|778694790|ref|XP_011653865.1| (PREDICTED: uncharacterized protein LOC101222649 isoform X2 [Cucumis sativus])

HSP 1 Score: 709.5 bits (1830), Expect = 3.7e-201
Identity = 371/450 (82.44%), Positives = 395/450 (87.78%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLGYIFTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVLLS 60
           MA HHLHTATASASHICRNFLG+IFTSKF  P RFS+SSS +       H FPSFS+LLS
Sbjct: 19  MASHHLHTATASASHICRNFLGFIFTSKFPHPFRFSTSSSRI-------HSFPSFSLLLS 78

Query: 61  PKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDA 120
           PKGYCSSS S+N+A  +D+  TYHGS AS+  Q M+Q QDSLSN  T KE+T ID+P+DA
Sbjct: 79  PKGYCSSSGSINSANTMDTVPTYHGSSASTRCQPMVQFQDSLSNPLTFKEDTGIDNPADA 138

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           RVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLE MPSHVAV
Sbjct: 139 RVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEIMPSHVAV 198

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
           VFDHDG                  STFRHT YPAYKSNR PTPDT+VQGLQYLKASIKSM
Sbjct: 199 VFDHDG------------------STFRHTIYPAYKSNRAPTPDTVVQGLQYLKASIKSM 258

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 300
           S+KVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EMV
Sbjct: 259 SIKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEMV 318

Query: 301 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 360
           SFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH
Sbjct: 319 SFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 378

Query: 361 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSL 420
           VDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTSL
Sbjct: 379 VDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTSL 438

Query: 421 LTAIGAYAEGFSADPIMRRVLNLWKKLEKS 451
           LTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LTAIGAYAERFSADPIIRRVLYLWKKLDKN 443

BLAST of CmaCh03G000030 vs. NCBI nr
Match: gi|659069413|ref|XP_008449686.1| (PREDICTED: uncharacterized protein LOC103491482 isoform X2 [Cucumis melo])

HSP 1 Score: 705.7 bits (1820), Expect = 5.3e-200
Identity = 372/450 (82.67%), Positives = 396/450 (88.00%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLGYIFTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVLLS 60
           MA HHLHTATASASHICRNFLG++FTSKF  P RFS+SSS +       H FPS S+LLS
Sbjct: 19  MASHHLHTATASASHICRNFLGFVFTSKFPVPFRFSTSSSRI-------HSFPS-SLLLS 78

Query: 61  PKGYCSSSRSVNAAINVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCKEETEIDSPSDA 120
           PKGYCSSS S+N++  +D+ ATYHGS AS+  Q M+Q QDSLSNS T KE+T ID+P+DA
Sbjct: 79  PKGYCSSSGSINSSNIIDTIATYHGSSASTRRQSMVQFQDSLSNSLTFKEDTGIDNPADA 138

Query: 121 RVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 180
           RVMLIDGTSII+RAY+KLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV
Sbjct: 139 RVMLIDGTSIIFRAYYKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPSHVAV 198

Query: 181 VFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKASIKSM 240
           VFDHDG                  STFRHT YPAYKSNR P PDTIVQGLQYLKASIKSM
Sbjct: 199 VFDHDG------------------STFRHTIYPAYKSNRAPVPDTIVQGLQYLKASIKSM 258

Query: 241 SVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRGFEMV 300
           SVKVIEVPGVEADDVIGTLALRSVA GCKVRVVSPDKDFFQI+SPSLRLLRIA RG EMV
Sbjct: 259 SVKVIEVPGVEADDVIGTLALRSVAVGCKVRVVSPDKDFFQILSPSLRLLRIASRGIEMV 318

Query: 301 SFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 360
           SFGLEDFA+K+GVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH
Sbjct: 319 SFGLEDFADKFGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLENLLQH 378

Query: 361 VDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEKFTSL 420
           VDQVEDERI+KMLVTNAEQAILSKDLA LRSDLP YMVPFTTRDLLFKKPEDNGEKFTSL
Sbjct: 379 VDQVEDERIKKMLVTNAEQAILSKDLATLRSDLPFYMVPFTTRDLLFKKPEDNGEKFTSL 438

Query: 421 LTAIGAYAEGFSADPIMRRVLNLWKKLEKS 451
           LTAIGAYAE FSADPI+RRVL LWKKL+K+
Sbjct: 439 LTAIGAYAERFSADPIIRRVLYLWKKLDKN 442

BLAST of CmaCh03G000030 vs. NCBI nr
Match: gi|694384553|ref|XP_009368167.1| (PREDICTED: uncharacterized protein LOC103957700 isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 583.6 bits (1503), Expect = 3.0e-163
Identity = 319/452 (70.58%), Positives = 362/452 (80.09%), Query Frame = 1

Query: 1   MACHHLHTATASASHICRNFLGYIFTSKFASPLRFSSSSSSLRIQSPGHHCFPSFSVLLS 60
           MAC H H+     +     +    FT +F    R  SS     IQSP       FS L+S
Sbjct: 1   MACCHSHSWLLHNTQSLSLWKWRSFTLRFVGTTRRRSS-----IQSPNSL---RFSPLIS 60

Query: 61  PKGYCSS-SRSVNAAI-NVDSNATYHGSPASSSSQQMLQVQDSLSNSPTCK--EETEIDS 120
            KGYC++ +RS N+A   V S +    + AS S  + +  +D L  S   K  E+T   +
Sbjct: 61  HKGYCNTFNRSFNSASPGVVSESGDAAAAASYSKSEGVVSRDMLLASTLFKREEKTVNSN 120

Query: 121 PSDARVMLIDGTSIIYRAYHKLLAKLHHGHLSHADGNGDWVLTIFTALSLIVDVLEFMPS 180
           PSD RVMLIDGTSIIYRAY+KLLAKLHHGHLSHADGNGDWVLTIF+ALSLI+DVL F+PS
Sbjct: 121 PSDGRVMLIDGTSIIYRAYYKLLAKLHHGHLSHADGNGDWVLTIFSALSLIIDVLMFIPS 180

Query: 181 HVAVVFDHDGHAYGHTCYSSNENFMAKGSTFRHTRYPAYKSNRPPTPDTIVQGLQYLKAS 240
           HVAVVFDHDG ++G TC SSNE+F  KG  FRHT YPAYKSNRPPTPDTIVQGLQYLKAS
Sbjct: 181 HVAVVFDHDGVSFGQTCNSSNESFKGKGLNFRHTLYPAYKSNRPPTPDTIVQGLQYLKAS 240

Query: 241 IKSMSVKVIEVPGVEADDVIGTLALRSVAAGCKVRVVSPDKDFFQIISPSLRLLRIAPRG 300
           IK+MS+KVIEVPGVEADDVIGTLA+RSV +G KVRVVSPDKDFFQI+SPSLRLLRIAPRG
Sbjct: 241 IKAMSIKVIEVPGVEADDVIGTLAVRSVDSGYKVRVVSPDKDFFQILSPSLRLLRIAPRG 300

Query: 301 FEMVSFGLEDFAEKYGVLEPSQFVDVMSLVGDKSDNIPGVDGIGNVNAVQLITRFGTLEN 360
           F+MVSFG+EDFAEKYG L+PSQFVDV+SLVGDKSDNIPGV GIGNV+AVQLIT+FGTLEN
Sbjct: 301 FDMVSFGMEDFAEKYGSLQPSQFVDVISLVGDKSDNIPGVHGIGNVHAVQLITKFGTLEN 360

Query: 361 LLQHVDQVEDERIRKMLVTNAEQAILSKDLAILRSDLPLYMVPFTTRDLLFKKPEDNGEK 420
           LLQ VDQVE+ERIRK L+ NA+QA+LSK+LA+LRSDLPLYMVPF T+DL F+KPEDNGEK
Sbjct: 361 LLQCVDQVEEERIRKALIENADQALLSKNLALLRSDLPLYMVPFATKDLTFQKPEDNGEK 420

Query: 421 FTSLLTAIGAYAEGFSADPIMRRVLNLWKKLE 449
           FTSLLTAI AYAEGFSADPI+RR   LW KLE
Sbjct: 421 FTSLLTAISAYAEGFSADPIIRRAFYLWNKLE 444

The following BLAST results are available for this feature:
BLAST vs. Swiss-Prot
Match Name    E-value    Identity  Description
DPO1_THEFI    8.0e-39    36.00     DNA polymerase I, thermostable OS=Thermus filiformis GN=polA PE=1 SV=1
DPO1_GEOSE    1.0e-38    34.31     DNA polymerase I OS=Geobacillus stearothermophilus GN=polA PE=1 SV=2
DPO1_BACCA    2.3e-38    33.99     DNA polymerase I OS=Bacillus caldotenax GN=polA PE=1 SV=1
DPO1F_THETH   1.3e-36    36.36     DNA polymerase I, thermostable OS=Thermus thermophilus GN=polA PE=1 SV=1
DPO1T_THET8   1.7e-36    36.23     DNA polymerase I, thermostable OS=Thermus thermophilus (strain HB8 / ATCC 27634 ...

BLAST vs. TrEMBL
Match Name         E-value     Identity  Description
D7T2D6_VITVI       4.6e-158    67.41     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0094g00430 PE=4 SV=...
A0A061EJA7_THECC   7.3e-156    72.14     5'-3' exonuclease family protein isoform 1 OS=Theobroma cacao GN=TCM_019883 PE...
A0A059DB23_EUCGR   2.8e-155    73.59     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A00006 PE=4 SV=1
A0A0D2NBI5_GOSRA   5.8e-153    72.09     Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1
A0A0D2RDM6_GOSRA   1.3e-152    72.21     Uncharacterized protein OS=Gossypium raimondii GN=B456_005G123800 PE=4 SV=1

BLAST vs. TAIR10
Match Name    E-value     Identity  Description
AT3G52050.3   6.1e-146    67.85     5'-3' exonuclease family protein
AT1G34380.2   4.0e-12     24.25     5'-3' exonuclease family protein

BLAST vs. NCBI nr
Match Name                         E-value     Identity  Description
gi|449453197|ref|XP_004144345.1|   1.9e-213    85.56     PREDICTED: uncharacterized protein LOC101222649 isoform X1 [Cucumis sativus]
gi|659069411|ref|XP_008449676.1|   2.3e-211    85.33     PREDICTED: uncharacterized protein LOC103491482 isoform X1 [Cucumis melo]
gi|778694790|ref|XP_011653865.1|   3.7e-201    82.44     PREDICTED: uncharacterized protein LOC101222649 isoform X2 [Cucumis sativus]
gi|659069413|ref|XP_008449686.1|   5.3e-200    82.67     PREDICTED: uncharacterized protein LOC103491482 isoform X2 [Cucumis melo]
gi|694384553|ref|XP_009368167.1|   3.0e-163    70.58     PREDICTED: uncharacterized protein LOC103957700 isoform X1 [Pyrus x bretschneide...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002421   5-3_exonuclease_N
IPR008918   HhH2
IPR020045   DNA_polI_H3TH
IPR020046   5-3_exonucl_a-hlix_arch_N
Vocabulary: Molecular Function
Term         Definition
GO:0003677   DNA binding
GO:0003824   catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008152 metabolic process
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0006261 DNA-dependent DNA replication
cellular_component GO:0005575 cellular_component
cellular_component GO:0042575 DNA polymerase complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0004527 exonuclease activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003887 DNA-directed DNA polymerase activity
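
The GO rows above have a regular "category, accession, name" layout, so they can be grouped by category with a few lines of code. This is a minimal sketch; `group_go_terms` is a hypothetical helper and assumes whitespace-separated rows exactly as listed here.

```python
from collections import defaultdict

# GO rows copied from the table on this page.
GO_ROWS = """\
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008152 metabolic process
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0006261 DNA-dependent DNA replication
cellular_component GO:0005575 cellular_component
cellular_component GO:0042575 DNA polymerase complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0004527 exonuclease activity
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003887 DNA-directed DNA polymerase activity
"""


def group_go_terms(text):
    """Group 'category accession name' rows into {category: [(accession, name)]}."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Split on the first two whitespace runs; the rest is the term name.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, len(terms))
```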

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh03G000030.1   CmaCh03G000030.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                                    Source   Source Term        Source Description                          Alignment
IPR002421  5'-3' exonuclease, N-terminal                      SMART    SM00475            53exo3                                      coord: 120..408, score: 1.0E
IPR008918  Helix-hairpin-helix motif, class 2                 SMART    SM00279            HhH_4                                       coord: 316..351, score: 1.1
IPR020045  5'-3' exonuclease, C-terminal domain               PFAM     PF01367            5_3_exonuc                                  coord: 315..410, score: 2.2
IPR020045  5'-3' exonuclease, C-terminal domain               unknown  SSF47807           5' to 3' exonuclease, C-terminal subdomain  coord: 314..403, score: 2.54
IPR020046  5'-3' exonuclease, alpha-helical arch, N-terminal  PFAM     PF02739            5_3_exonuc_N                                coord: 121..312, score: 2.8
None       No IPR available                                   GENE3D   G3DSA:1.10.150.20  -                                           coord: 317..390, score: 1.4
None       No IPR available                                   PANTHER  PTHR10133          DNA POLYMERASE I                            coord: 91..186, score: 6.5E-227; coord: 204..449, score: 6.5E
None       No IPR available                                   PANTHER  PTHR10133:SF22     5'-3' EXONUCLEASE FAMILY PROTEIN            coord: 91..186, score: 6.5E-227; coord: 204..449, score: 6.5E
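
The domain coordinates above can be compared directly to see how the predicted regions sit along the protein. The sketch below computes span lengths and pairwise overlaps for a few of the listed hits. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the helper names `span_length` and `overlap` are hypothetical.

```python
# (source term, start, end) copied from the InterPro table on this page.
# Assumption: coordinates are 1-based and inclusive.
HITS = [
    ("SM00475", 120, 408),
    ("SM00279", 316, 351),
    ("PF01367", 315, 410),
    ("SSF47807", 314, 403),
    ("PF02739", 121, 312),
    ("G3DSA:1.10.150.20", 317, 390),
]


def span_length(start, end):
    """Number of residues covered by an inclusive interval."""
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two inclusive (start, end) intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


if __name__ == "__main__":
    for name, start, end in HITS:
        print(name, span_length(start, end))
    # N-terminal (PF02739) and C-terminal (PF01367) Pfam domains.
    print(overlap((121, 312), (315, 410)))
```

Under that assumption the two Pfam domains (121..312 and 315..410) do not overlap, while the SMART helix-hairpin-helix hit (316..351) lies entirely inside the C-terminal domain.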

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism               Block
CmaCh03G000030   Melon (DHL92) v3.5.1   cmameB616