Cla022227 (gene) Watermelon (97103) v1

Name: Cla022227
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: ATP binding / ATP-dependent helicase / DNA binding / DNA-directed DNA polymerase / helicase / nucleic a (AHRD V1 *-** D7G0G4_ECTSI); contains InterPro domain(s) IPR001098 DNA-directed DNA polymerase, family A
Location: Chr8 : 21767851 .. 21789345 (-)
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAGAGTTTGCAAGATAAGGCTTCCGAGTGGAGCGGAGTGGAGCGTGAAGATGCCTTCGCCATCGACGAAGTCAATTTGTTTCAGAAGTTAGGTCTCCAGACGTTTGTTAATCTATCGACCAAATTCTATAACAAGTCTGTTTCTGTTCATCAATGCAGCATGCGGCATGCGCGAAGTAATTTGAGTTTTGTGTTATTTTCTTATTTTCTCGGGAGGAAGTAAAATTGCGGCTGATTCCGAATCGGATTTTGTTATTTTTTTTTTTTCTCTATTTCGAGTAGACATGGAAATGACTTCTAATTGACGACTGTTTGATAAACAAGACGATAGATGTGTCCAAGACCAGTTGATGAAATGTTATTAATGCTTCAGCTGTATTTCGTCAGGATCGAACTGATTGTAATTTGTACTGTTGGTTACTTGTGTTTTATTGTTCCATCAAAGTGAATACTAGGGTATATGACGACGAGGAGGAGTGGTTCCGATCAATTTTTGGGAATTCGAAGAAAGAGGATGCAATTCAAAATCAGTACGAGTTCTTCGTGCAAAGAATGGGAGGTCCGCCTCTATATTCTCAAAGAAAAGGCAAGTTCTTCATTTTTTTTTCTCGCTTGTAGTTTTGCAGTATTGATTGAAAGCATAAACTGAAAGGACTCGAGGTTTAAATTTGATGTTTTGTTCCTTTCCCAAATATATTGTATAATGACAAATTCTCTTGATTTCTACCACGTCTCCTTCCTTAAACACTTGCCTCCAAAAATGACAAATTCTCTTTGTTTTTGGTGACTAAGAATTCACAAGTTTCAAATGTGTTTGTAAGAGAGATGGAAACTATATAAGAAAATAGGAAACAGTCGTAAATCTCAAAAGCAAAAAACTTAAAACTAAATTCCACATTGTTATTTTCTGAATCCTCATTTCACCCATAAGTTTGGGGAATTAGCTTGATCTATGAGACTGTAAAATATATGAAATAATAGATAAAACACCATTTTACAGGTTAGGAATCTTGACCTCCCACCCTGATATAAGCTATAAAAAACTTAGCTGCTTAAGCAAGACCAAATCTAACCATTTCATTCATTTCAACAAGTATTGTACGGTTTTGGTTGTGTAATTGAGCACTCAAATGGTTTATTGAAACAGTGTTCTAACGTTGTCTCACTCTCACTCTGATCACGATTGCTGTTTTTTCCCCTAATATTTAAATATGAGAATCTGTCTTTAATCCTTCTTTGCCTGAGTCCGACCCCTCATTTGCGTTGAGGCAATCAATGAATTTGTGAGAGCCCTAGTTAGAGATTACTAGCCAGTAAGTTTGTTAGGGCTTTTAATTATAAAAAGAGGGAATGAGTTGGGAGCAAAGAGTGGATCATCTTGGTAGTGATTTCAATTGTGGGAAATCGGGGAGAGAGATGGCCCAAAAGGCTATGATATATTGTAGCTTCATCAATGATATTGCAATATATAATTCTATCCTTTAGTGTTATTTGTGTTTTTGTTAAGTGGTATCCTAATAAAATTATGTCTTCAACATGTTCAAGACTCTTTACCAAACCATTTATCACTGAGACCTCAATATGCCACTGTTGAGTGTTGATGCATTACCACAAGAAATTCGGTTATGTCATAAAGTGTGTGGCCACTCTGCAGTTTGGTTAGTAGGTTTTTTTCTTTTTCCTCTCCCTCTCTCTGTCTAAAGGAATACATAATTAATATTATTATTATTATTATTATTTATTTATTTATTTTTAATTACTAATATTCATTAAAATCGAGAAAGAACGAACCAACAAACAAACATCAATTTCTCTCGAGGACGAGGATAAGACAAAACGAGAGAAGAACTCTCTCCAAAAGGAAGAAGACCAAGAAGAAGAAGCCAAAGCTGCTAGATTATCGGTAGCTTTGTTCTCCTCTCTAGGAATTTTACAAAATCTAGAAACATTTACGGAAGGAGCAAGAAAGAGAGTCTCCTCGACAAAAAAAAAAAAAAA
AAAAAAAAACCTCAAATAAGTTAGAATTCTCAAGATTCAAAAGTTTGATCACTTCGGTTGAATCTGATTCATTTCCATCGAGGATTCTGCCCATAACAAGAAACTCTGATGTGTTTTAAACTATAGGGCAGCCATTCCAATTTAGTCGTGTGATGAAAGGTATACTTGAGAGTACGTTTGTCGTGTTTGAAAACATTTCTTAGTACAACAATTGTGGGGATGAGGGAATCGAACGTCCCACCTCTAGTAAAAATCAAACAACAATTAAAATACTTTGTAAGAAAATACTTGTGTTGCGTTCAGGGCTTATAACTCTAAGTGAAATGTTAAAGTGATTTAATACCAAACACTCTTTATCACAATGCATATAATTCAAGAACTTTCGTTGAAAAACTGTGTATTCTGGATTTCTCGCAATCTCACCCTAATTTAGCATACCCCATCGACCTAATGGTGGTTTAACAATTAATTTGATATCAAAGTCTAAGGAAGTCTTTGTAGACTTGATACTGTGCAATTTCGTGGGTACATTGTAATCTATTTGCAACATGCTTGTTATGATCGTTACAAAAGTTGGTGACATTTCAGGCCATCCAGCTCTTATAGGTCGACACCGACCATTCCCGGTCACACATAGAGCGGCAGAGAGATGGTTACATCACATGCAACTAGCGTTAGACGAGACTCCAGAAATAGATGCAGATTCAAAAGTCAGAATGACAAACTTCTTCAAGCATGTATTATTCTCTTGGTCCTTCTACCTTCATTCATTGACATTCTCTAATCTACAACATTTTGCTTTTATATATATTTATATTGTGATAAGAAACCATGCTTTCATTGAGAAAAAAATGGATGTTTGAATAGACAAGCTCGTAAAAAGATCCAACTAAAACAAAAATGGGTCCCAATATAAGGGAATAATACATAAATAAAGGGCAAATGCAAAAAAAAAAAAAAAAAAAAAAAAAAGTACGCCAAAGCCTGAAATGTAGACATGAAATCTAACTTGCGACCAAACCTCTTTAAGGAGCTCTTAGCCTCACTGAAAATTCTATTGTTTATCATGAGTCAAATGCCTTATAAAACAGCACAGAATCCCGTCTTCCATAGAAAATTACCTTTTATCTTCAGAAAGGCAAATTGAGAAGAAACCTTTTCGCCGTTGAGCTACCAATGAGCTACACTCTTTGTTTTCATGTCTTGAGTTTCCATTATTGGGTATTCTTATTTTCTGGTGGTTTGCAGACACACAACCTTCTTTCTCGTGGCTGGAGATCAAATGAAGAATCAAAACCTGCAAACTCAATGCAAGCACGGTATTCAGCAATCTGATGCCCCATAGACATTCTAGGACAGAAACATAACTTTCCATACAAACATATAATTTTATCTTACATGCTTATAAGGCTGTTGGATCTAGGTGATTCTATATGTATTTTGTCCAATTGACTTTGACTCTTGAAATGAAGATAGGGCTCTCCTCAACTTGACTTGAAATGAACTCAACTTATTTGAAATGGTTACACTCATTTTGTTATATGTTTACAATCATCCTTAAACGTGCCTTAATCATTCAAAATCAATTTGGTTTGTGTTAAAATAGTGTTTAAAGTGTAAAATAAAATTTAATTGATTTGGAATGATTGAATGCAACATTTGAGAGTGATGTAGAAAATTCATATGGAAGAACAACCTTGAGAGGAGGCTAAAGCTATAATGGTACGAGCAGAACTGTTCACCTCTTGATGGATGTGACAAAAATATGCAACATTAATAACGTTTGTGACAAAAATGTGCAACATTAATAACGTTTCTTAACAAAAAGAATGTTCTCAACGAGGATCCAGACATCAACTAAAGAAAACTCCTCTATATCGAGAAGTTTGACGACTTCCAGGTTCCACATAATGGGAGGAGGATGTGCTCAACGATGATCCCAATATAATAAGAATTGTTCTCAACGAAGATCCCAATACAACATTCAAGAGTGATCGTGA
ACTATAGTTCTAAGACTCCAAGGTAATCCAAGTGTGGAAAATGGTTCACATTAATGTTCGACAATATAGTTCACATTCTTTATAACAAGGTATTCATTTTGTTCGGTTACTAAAGAAAAAGAAAATATGCATATTTTATGATGTCTGTTCACGTTTGCTTAAAGTTGGTAGTGCTACTTAGTGGATTATAAATTTCTTAATTGAAAATTGTTAATATGAAACTTATAATACATTGATGTTAGGAATGAAATTACAAATACTTTTTTTAGAAAACAAAATTGCAAGTAGCATATTGAGTTGCTTTTACCTAAAAAAACAGGGGAAAAAACAGCTTTTTAGTTCCATTATAAAATTATTTTGTGTAGCATATAGCTCTTATTTTATTTTTATAGATTTTAGATATTGGAAAATTTGTCCTAGATAATAAATTAAAGAAACTCATACTTCATAACAAAAAAAATTAACTTATAAGTGTAAATAGCATTTCAAATATTTTTAAGCTCTAGAGTCATTAAACCAAATTGATTAATATAAGTTGAATACCTATGTGCACCTATGAAGATGAACATACCTTAATTGCTTAAAACACAATCTACCTCTTCGAATTGGAAATTTTCACCTTTTATTTTTATGATAACTTTTGAGATGGTAACATGTCTCATTCTATAAACTATCTTTATTTCGAAATCCCTTTTTCTTTTTAATATAAGATGCATATGTGGAGATTTAAACTACTGATATTATACTTGATTATTAGTTGAGCTTGATTATTTTAATATAACATTACATTTTTAAGTTTTTGGTTCAGAATATATGCATTAACTATTTTTTTAAAGAAATGCATTAGCCTATTTTTTTTAATTATTGTAGGAAAACTTTATTCTAGTTAAGCTATGTTAATTTTGACAATACATGAATCTATTGCTTGAGCAATGTTTAAGCTTTGCATGTCTAATGTTTAAAAAAAGTAGAATGTACGTGTCATTGCAATTTGAAATACAACAATACTTGAGTGTAAGCTTTGTTGCCCACATTTACATGTATCTACTTCAATTACAAAGAGAAAAAGTATATAACATATATATTGAAAAAAACATCAGCACATAAATAAATTACAAAAAAGATAGAGTTAATCCAAGATGTACCTAAATTTTGTTTTAGTTAATAGAAATACTTACATGGATTTTGGGCGGCCATAATCTTGTGTTTGTGCAACCCACCTCGATCGCATAAAAAAAATTAGAAGTATTTAAAAAAAAAAAAAGAAAGAGGATCTAACCCAATGTTCTCCATCTTTTAACAGTTAAAAACTAAAAAAAAAAAATTAACACATGCCACTCTTTTATTGGAACAACACGAAAGATGGGTTGGACAGTTGGATAGCGTAGACAAAGATTGATAAGAGTAAAAAATGGACTTAAATATTTTTCTAAATAACTGCAGTAAGAGTAACCCCCATGTACTAATTAATTCTACCATTGTTCTGCACTGTGCGCAACATTGCGTACTATCGGCTTTGGCGGCAGGCCTGAGAATTTTAAAACTTCATTCGAATTCCATTACTTTTTGGCGCCGAGCGAGACGCTGCCGAGATCGTAGTTGGAGTTTCGCCAAGGTTCTCTCCATTGCCCATGAATGGCGTCCGGCTCTCCTCCTTCCCGTATCGACCAGGTCTTCTTCCTTTTCAATCTGTATAGAAAATACCCTATTGACTTAACTTTTACTATTTCTTAATGTTTTCCTTAAAGCATGTCGAATCTTCGTGGCAAAAAATGGCTGTTCATGTTCTTCTTGCCGATTCAGATGGATTTTTTTAGGTTTTATTTTCCTTCTTTCCCGTCTTCGTTGTTTTCTGGGATTTTACTGGGAAAAAGGCATGGGGAAGGTTATTGTGATATGTTTGTGCTTTGGATATTATGAATGGGGGAAGTATCTACTTGGCTTCTGAGAAAATTAATAAGAGGACCAAGAAATTCAATTTTCTTTTTGTTGTTTTTAAT
TTAGATTAATTCGTGGTTTATTACAATCTTTTTGTCAATTCTTGATTGGATGGGATAATGTTTGTGTTCTGAATATAATAATTCGATCATTTGGACATATTATTCATTCCGATTTGAATTGATGGGTGGATATTTAAGTGTTCCTTCGTGTACTGCGGGCAGTTTTATGCTTCAAAGAAAAGGAAACCTCTTACTTCCAGTCTGAAGTCTGGGAGTTACGACAAGGATGGAAAAAAGTCACTTGAAGGGTCGCCTGGTGCTAAAGGTACGTTGGACAATTACCTAGTGACCTCACAGGACCATGGCAACTCTGATATCCCATCGCATTCAGTTCGGGAAAACTTGTCTGAGCAAGACCTAGTAAAGAGAAACCTATTGTTGAAAATTAATAGCTCCTCTAGAAATGAACACGAGGAACCCACTTTGTCTAGAGGGTGTGACACTTCTGCAGCAACTGAAGGAATCAAGAAAAGAACTCTGGAGGACTCGTACGAGACCAGAAGTTCAACAGTTAAATTGATGGCAGGTGACGGGGGTGTCACACCCTGCACGGAGAAACCAGAGCTTAAACAGTTTGCAGCTGATTTCTTGTCTCTGTACTGCAGGTATATGTAAACTCTAGTTCGTTAAGACAACAACGTCCTTTTTTTTTTTTCATTTATTTATTTATTTTTTTTAATTTTTAATTTTAATTTTTAAATGTTCGACTAAAATTATAAATGTCATGGCCATATTTGTAGCAATGAGTTGCATACAACTGTTAGTTCGCCGGGAGAGCAAAAAGTGACTTTTCTCAAGCGGCACTCCAGTCCATCTCTACTAGAGGGGGAGGCTAAATTACCAAAGAAGATACATTCTATTGCGGGTCCATCAAATGCCAAAGGTGAACCTGATTCCTCAAATGCATTGAGTGTTGGAAACAAGCAGTCCAATTTTGTTGTTGAAACTGGGGTATCTGTTACCGCTTCTTCTGAAAAGTTGTGATTCGTTTGACCTTCTTTCATAGATTTATATAATTACTTGAGAAATCTGATATCATTTATCTGCATGACTTCTATGGCAGGATACTGACAGTCATCCTCCCGTTGTGCTTAAGGCATGTCTGCAGAAATGCAATAAAGCACCTAGATCACCTTATTGTTTGACTGAATGCAAAACACCAGGCTTGTCAACTGCCAATACATGTTTTCAGGAGACTCCCAAGTCTGGAAGCTCGACATTTTCCCCTGGAGAAGCTTTTTGGAAAGAAGCAATCGTGTTTGCAGATGGTTTATGCGCTCCAAGCATTGACCTTACCAATTGTGATGCTGAAGGAGCTAATGTTGCAGAGAGTCAGAGTCATACGAAGAAACTTCCTATACCAGGGGAACCTGCTCAGAAAAGGTTAAAAGGACAGTTTGGTGGAGGTAGTGGTGGAGTCCGGCTTGGGGAACCTGGTGCTTCCATGGTTTCATTGAGGAGTGAATTAAAAGAGTTAAATAGAGAAGTGTCTTCACTACCTGTTAAGCATTTTGACTTCTCGGCTGATGATAAAAACTTGGATGGAAGTACATTACCTTATTGTGCTTCAAATGAATCTGAAGTTAATGCATATGACCTTAATGAGCAGTCTGATTGTTGTTATACTAATGATAGTCTACCAAACCATAATGATAAGACCCGCGACAGTGATTCTCTTACGAAAGAGAAGATACATGAAACAAATGTAACTTCCTCTGTTCCGGTAGTGACTGAAGTGAAATTAAACATATTTAGTCCCTCTGATAGTATCACATCTGACACAGCGGTTCATGAACTTAGGGCTTCTACTGTTCATGATTTTAAAGAGGAAACAACACCTTCAAGTTCAGTAAGACATAAAGATTGGCTGGATCTAAGTTGCTGGCTGCCTCCTGAAATTTGCAGCATTTATAAAGAGAAAGGAATAACAAAACTGCATCCTTGGCAGGTAGCAAAGTATTACAAAACTTTCTAATTGCTTCCCTCGCAAGTGTCTCAAG
ATATTTGGAGTACATGGATGTTCTTCTTTTGTACTTCGTAAAATATATCATTTTACTCCTGAAACACATTTAATTTTTAGTTGGTTCAATGAAAATATTACCATTTTGATAGATGGTACTTTTTGCCTTATGACAATGTTGGTTGAGTTACTTTGTTTAATTCTCTACTGTAATGCTATTTAAAAAAAAAATAAAATCTTTACTATAATGCTTGATGGCTGAAAGACATTCAATATTCTACTTCACTTTTATAGGTCGAATGTCTTAAGGTAGACGGTGTCTTGCAGAGAAGAAATCTTGTTTATTGTGCATCTACCAGGTACTGTAATTTAGTGATGACTTTACCCCTTCGCTAGAGTGGGTTACTATTTTGATACAAAAAATTCTAGCATCCAAGAACAGCGGAAAAGCTGTTGCTTGCTAGCTTGTAGCTTAGTGGTCTGTTTAATTTTGCATAGAAACGATTTGCTTTTTGGCAGTTATTTTAAGAAATAGTTTTCTTATTTTCAAAACAAAAAATTTTGTGAGAATATCTTGCATAGCTTGTTCAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCTGGGGGGGAGGGGGGATTGCAAACCAATGATGCAAGGAGGCTCTTCTAGATCTCCCCTCGCATGGAAATGAAGCTTTTGTAGCGAGCTAGTTTCTTTGAAATTTTTGGTTTGGTAGGAATATATAAGATTGGCCTTTAAGGTTGGAGCACTACTCATTTCAATATCTTCTTTTGGGTTTCAATAAATATAAATTCTTGTAGGATCATCCTTTATTGAATCACTAACAATATGTGTCACTTCTATGACCCCTTAGCGCTTTGGTCTTTTAATTTTTTGTATCCTTTATTAAATCCTCGATAGCTGGCTGCCCATGCCCTTTCAATATCTTCTTTTGGGTTTCAATATTTGGGTTGAAGGGAAAAAAAGAATTTCTAGGGGTAGGGATCTTTGAGCAAAGTCTATTTTCATTTTTCAACGAAAAGTGTTGTTTCTTTTTCCAAAAAAAAAGATGTTGAATCAATGTTTACGAGACCAGCTTAACTATTAAACAACAGACTAACAAGTAGCAATTGATCCCCTAACTTACAAAATAACGAACAAGCGGATCCTAACAACAACCAAGTTGGGCAAGTGAAAACAGAAAACAAGTAGTTTTAAAGGACTTGTCATTGAGCTGCATCTTCTTGGGAGTAGGTACAAGCCTCAGCCCACTATGATGCCTTTTCCGCTGCCTTCCTGTTGTCAAACTTTTCTTTTTATCTAGTTCACTTTGTTGGTGTTTTACTCTTCCTAATGAAAGTTTAGTTTTCTCATTAAGAAAGTACTTTATTGTTAAATACTTTTATATGATGAGGTAATTATAATTGAAGTACTTATAATAACAAATGCTAAAAGAAACACTTCCAGACGCACCCTATGTGTTGCTTATTTTGTCTGTATCTTCTGCAGTGCTGGAAAAAGTTTTGTTGCAGAGATCTTAATGTTAAGGCGGGTGATTTCTACCGGAAAGATGGCACTTCTTGTACTTCCATATGTATCAATTTGTGCAGAAAAGGTGATCTATGCCTTTCAGATTGAAGAAACTATGCAGATACAATGATCTTGTCAATAGTTATATTCTTAAATTTGATCATTTTATATTTTGTTTGCAATTTCTAATTGTTCTAGGCCGCACATCTTGATGTTCTTCTTGAACCTCTGGATAAGCATGTGCGTAGTTATTATGGAAATCAAGGTGGTGGAACGCTTCCTAAGGATACTTCTGTGGCTGTTTGCACAATTGAGAAGGCAAACTCTTTGATAAACAGATTGTTGGAGGAGGGTCGTTTGTCAGAAATTGGAATTATTGTGATCGATGAATTGCACATGGTAACTAAGAAAATCAATATGGAATTACTTTCTTTATAAAAAATTTGAAACGTGAGGTGGAAATAGAAGA
AGACTGTTCTTGTGGTGTAGCATGCTTTTGGCAATTTCTTTGGTGGATTTTTAAAAAATATTTTTATTCTTACCTTCTGACCTGTAAAGAAGTGATGATTGATATGATAGCCTGAATGTTTCATGAGTACATCAGTTTCCTATTTTCTTGAAGAGGCCAATTTGTAGATAAGCTATACATTAAACCTTTCCCAGTCAGGGTATTCACCTGGATGATTCTTGTTTTATTCATAGGTTGGAGATCAGACGAGGGGTTATCTTTTGGAACTACTTTTGACTAAACTTCGTTATGCTGCCGGTGAAGGTAATTTAGATTCTAGCAGTGGTGAGAGTTCTGGGACGAGCAGTGGTAAGTCAGACCCTGCTCATGGTATTCAAATAGTTGGCATGAGTGCAACCATGCCAAATGTGGCAGCTGTTGCAGATTGGCTTCAGGTTAGTGAATTAATAACCCTTTGAAAAGTTAATGGGAAGTATGATTGTTGTTAGAGGAAGCATATTTGCTATTTTTCCATAGTTTATGTTCGTGTAGATGTTGGTTGTTCAATTGAGAAACATTGCGTGACTTCTTATCAGGATGTGTCTGGTAGTGCTTTTGATAAAAAAAACCTTGTATTACAAGGACTGAAAGCCTTATTTTATTGTTTGGTTTTAAACTTCTATTACAATGAATTATTTTTTTTAATCTATGAAGTTCTTTGACTAATATAAGTTTTTTCCCCTTTAAAATAAGAATTGAAAAATGGGTAAAAGGGATGCAACTAAAAATAACAAAAAGATTGCCTAAAAAACAGATATGATATAAATAAGAGAAAGTGCTTTTAAGAAATTAATGAGAGAAAGCATTTGCAGCAATTAATTGAATGATGTAAGAGGGAGAAAATGTTTACAAAGAGAGATTGGCAAAAGAGCAGTAGAGAGAGATTGTAGGAGAAATTTTTTTGAGAAAAGTTAATGTGTAAAAGTATGGATGTAAAAAGAAACTAGAGGGAGATGGTATATGAGGTAGACTATGGAGAGATTAGTACTAAATAAGTAAAGAAAAAGGTGAAAAGCCCCTCATATTAATGATAAGTCGCTATTATCCATTTGATTCCATATTTTCTTTCTAATATTATGTTTTTTTTTTATGAGAAACAAAAGATATTATGTTTATCCTTGCATTTAAAAAGTCTTATACACATGTGAAAATTATGCATTAATGGTTTTATAATGGAATCCTAATACAGGGTCTAAATTGATTCACATATGAATGTTTAAGGTATTTGATTGATATAGTTATTTAGTTTGGGGTTTTAATTGATTCAATCTCCTAAGTTTAGGGCTGTAACTTGATTTTTCCCATAAAATATTTTCAAATTTTACTGTAAATAGCTCTTTACACTTCATATACAAAATTTCAGTTCTTATTAAGTGGTTTGGGGATTGAGTACTTTAGTTTTACCTTTCGAGAAATCTATAGGCTTGGTCCTCGCATCTTTGATGTCAGACTACTCCATTGTTACTTGGAAAATTTGCATGCAGGCAGCCTTGTACCAGACTGATTTTCGACCTGTTCCTTTAGAGGAGTACATTAAAGTTGGGAATACCATTTATAATAGAAGTTTGGACATTGTTAGAACAATCTCAAAAACAGCTAATCTTGGCGGTAGGGATCCAGATCACATTGTGGAATTATGTAATGAGGTATGATTTCTTTTGATTTGTTAAGTCGTACTTGTTCCTAAATTTATATGGAATAATAAACTTTTTAATTGTATTTTAGGTAGTTGAAGAGGGTCACTCAGTATTGATCTTTTGCTCCAGTCGGAAAGGATGTGAATCAACAGCAAAACACGTGTCAAAATTCCTCAAGAAGTTTTCTGTCAAAATTCACAATGAGAACAGTGAGTTTACAGACATTTTTTCGGCAGTTGATGCACTGCGAAGATGTCCTTCTGGACTGGATCCTGTATTAGAGGAAACCTTTCCGTCTGGTGTTGCCTACCATCATGCTGGCCT
TACTGTATATTCATCATCTCTTATTCTTACTAGTAGATCATATTCTTGAGCATCTTTTAATTATTTATTACTTCTCAATAGGTAGAGGAGAGAGAGGTTGTTGAAACTTGCTACCGCAAGGGTCTTTTGCGTGTTTTAACTGCTACATCTACCTTAGCTGCTGGAGTTAACCTGCCAGCTCGAAGGGTCATTTTCCGACAACCCAAGATTGGGCGAGACTTTATTGATGGTGCAAGGTACAGACAGATGGCTGGTCGGGCTGGCCGGACTGGGATTGATACCAAGGGGGAGAGTGTAAGTTTCCATTCTTAACTGAAGTGCCAGTTTTGTATTATTATGCTGATTTATGAACATTTATCAATTGACCTACTTTACCCTACATCTTGTTCTTGTTGTTGCTTTGTTTTGTTTTGTTATTTATTTATTTTATTCGTTGTATTGAAACATTTACCTTTTCTTGGATTCTTCTTTTTCCTGACATTTTTCTCCATTTCCATCTGAGCAGGTACTCATTTGCAGACCAGAAGAGATTAAAAGAATTAATGAACTTCTTAACGAGAGCTGTCCACCTTTGCAATCATGTTTGTCTGAAGATAAGAATGGAATGACTCATGCAATTTTAGAAGTTGTGGCTGGTGGGATTGTTCAAACTGCAACTGATATTCATCGATATGTAAGGTGTACTCTTTTGAATTCTACAAAACCATTTCAAGATGTGGTTAAATCAGCACAGGAATCTCTTCGGTGGTTATGCCATGGAAAATTTCTTGAGTGGAATGGGGATACCAAGTTGTATAGTACCACACCTCTTGGACGTGCATCGTTTGGAAGCTCTCTTAGTCCAGAAGAATCACTTGTAATAAACCTCTCCAATAACTCCACCACCATTACTTATTGTGAAAGCATGTGTTAAGAAAATATCATTTACTGTTAAGTTTTCATTTGGCTGATAGTACGTAGGATAAATGGTAATTAAGTGGACTGAAAGCTTTCTGACTGATTATTTTATTTTGAATCTCTCCTTAGATTGTTTTGGATGATCTTTCAAGGGCCCGAGAAGGATTTGTGCTTGCATCTGATTTACATTTGGTGTACCTAGTCACACCGATCAATGTTGATGTTGAGCCAGATTGGGAGCTATATTATGAACGGTTTATGGGTTTGCCTTCTCTTGACCAGGTAAGGTTGGGTACTTTGCTAGTTTAGATCCCTCTCGAATGATCAGAAATTTACAACTAGTCTTGTTTTTGGATCAGACTAGGTTTTTAATCCTCTCTACAATTAATTTTGAAGATATTAGTTATTTAAGTGCAAAGAAGTAACAATTAATATATTAGTCCACATTTCAGCAACATTACCACTACCAGAAAATTTAGATGTGGTGCTAAATTTAGATCTCTCTTCTTTTGTTATTTTTGTGAACTGGAGCTGTTAGTTTTTTGTTTTCTGGAAATGAGAGAAGACTTCTTGTTCTCTCCTTAAATAGTTTTGGACCAAGTGATGTTTAGTGTCCCGAAATTTGATAAACATTTAAGAATTACTCTCTTTGGTGCTTATTCCAGTTATATCAACGCTTTCCTCCTTTCATCTCTCTTCAAATACACCCTCCATTCTTTTCCGTAGATTCTCCTCTACTGTATGCAGAATGGATGAGTCTTCGCACCTGAAATTTATTTTGCCAACTATGAAATCTGCCAACTTCCTTCCATTTCTTCCAATCAATTCTCCATTTAGTTTTTAATGCAATTATTCCCTCGTAGGTGGAACTGCTCTAATACAACCTGTGTTGCTTCCTTTTGGTCAGCATTGGCAATGAAATTTCAATGGTCAACAGCCAAAAGAATTTCAATTTAGGAGTATTCTTCAACAAAATTCCACATGGATATTGCTCACTCTCTCTCTTTCTTTCCATTTTATTTAGCTAATTTACGGGCCAGCCAACTCTTTTCTCTCCCCAGTCAATGGTTTGTTAATCCTCTCACTTTGACTTTGGTACAGGG
GTAAATTCTTTCTATATTCTCATATTCACAGTCCCTTTCTTCCCCTGTTTTCAATATGTAAAATGAGTTAAGGTCGAGGGTCTATCGTAATTCACTAGAGGTTCTTATTTCTAAATAGCTCCAAGGAGGCTCCATGTAAGATGGATTAAGGGAGTTGTTGGTAAAGCTCACTGTCTACTTGTTATTTAGAACAAATAGTAAATATGAAATTAAATGCACTGCAGTAGCTTTTTAGACCTCTTGAGATATCACTTCTTTATTTCTCAACGCAAATACTATCATAAAATGGAATACTTACACAGCAACATACATGTCCAGAAACGTAGGGTGTATGTTTCTTCCAAGTTGAGCATTGCTGTCTGCTTATTTATTTATTCCATTTTAATACATTTCCTTGTCTATGTCGTTAGGCATTAATAATGTTGAGTGTCCGTAAGATAATATGCCAGGAATTTGTGTATTTTGAATTTCACTGTCGTCAAACAATCCCACCACGCTGCAGTCTCTTCGTTACATTACTAGCAAATTAACTTTGTCAAACTCTTTCTCTCTTATTTTCTAATACTTATCTTCTTCTAATAAATATGTAAAATGAAAATGCAGTCTGTTGGAAATCGAGTTGGAGTTACGGAACCATTTTTAATGCGTATGGCGCATGGTGCACCGATTCGACGTGCAAATATCTCAAGAAATGGTGTCGTGGGCTTTCCCTCTTATTTTCTAATACTTATCTTCTTCTAATAAATATGTAAAATGAAAATGCAGTCTGTTGGAAATCGAGTTGGAGTTACGGAACCATTTTTAATGCGTATGGCGCATGGTGCACCGATTCGACGTGCAAATATCTCAAGAAATGGTGTCGTGGGTTTACGTACCAAGCGTGATGAACATGGGTGCATGTATGATGACAGGCCTTCAGAGGAGCAAACTATTCGAGTGTGTAAACGATTTTACGTGGCCCTCATCTTGTCAAGACTTGTTCAGGTGTGTGGATGTGTTGCTGTTAATCTTACATTTATTTCTACATGATTGGTAACTTTAATGTTGTCTTGTGCCCCTACATTATAGAAATAAAAGACACGAATACCTTTTTTCTGAATTACCTTACATGTATACATAAACAATAGAATTCTTTTATTTGCTCTTCCTCTCAACAGGTGAGCAAATAAATTACATTCCTTCTTTGTGGTTATTCATCTTTCAATTTAGCCCATTAAATGAACAGCTTTTGAAAGATTTCCACCTAATTTAGTTCATGTTGGTTCTGAAGGAAACTCCCATTCCTGAAGTTTGTGAAGCTTTTAAAGTTGCCAGAGGGATGGTGCAAGCATTGCAAGAGAGTGCTGGAAGGTTTGCATCTATGGTCTCCGTATTTTGCGAGAGGCTCGGATGGCACGATCTGGAAGGTTTGGTAGCCAAGTTCCAAAACCGTGTTTCATTTGGAGTTAGAGCTGAGATTGTAGAACTTACTACTATTCCATACGTTAAGGTCAGGCACTGTCCTACATTTATTTGGTTTTATGTTCCCTTGTATATTCTAATCAACAGAAGTGTGAATGTTATTTACTTTGAGCAATTTGTTATCAGGGTTCTCGAGCCAGAGCACTCTATAAAGCTGGTTTGCGGACACCTTTAGCAATTGCAGAAGCCTCTGATGCTGAATTAGTTAAAGCTCTTTTTGAGTCTGCATCATGGACTGCAGAAGGTGAGCTTAACAAGACATGCTTATTTGTTTGTGCTGATTCTGGTCAGCAGGTAAAATTATCCTCGATTTTTTTTAAAAAAAGAAACAACATTTTTCATACCTGATACTCTAATTATTTCGCTTTTAGTTGTAGTGATAATTTGTCGAGTCATTAGGATTGCAGAGTGTGGATTCATTCTCTATTCTTTTACTTTTCATTTATTTTCAGATTCAAATGACTATATTCTTAGAGATTAAATGATTTCTGGCTCTACTGTTATAATCGTAACTGTCTTCCTCTAGTTTCGAATTCTAA
TATTGAGGTTTGTATAGAGAGTACAGCACAAAAACGGATGCATGTTGGAATAGCTAGGAAGATTAAGCATGGTGCTCGTAAAGTTGTTCTTGATAAAGCTGAAGAGGCAAGGATTGCTGCATTTTCAGCTTTTAAATCATTGGGGTTCACTGTGCCACAAATTTCTCGTCCCTTGTCAGCAAGTGCAGATGGAAATATTACAGCACAAGTGGCTGCAAGTATTCCATCTGAAATCGATACTTTAAACAGAGTTGTTAGCACACGACAAATGGAGCATGCTTTAACAAAGTCATGTTTTGGAGGAACTTCCAGTTCTGAAAAAGTAGGTGGCAAGAATCTGAGTGAAACAGGAACAATTTCTGTTGAAGTAAAACCACCCAATTTTGGCGTTAATCCTCTGGTGAATGTTGAAGGGTCTGCAATCCAGGAGTCAAATACCGTTGTTGAATGTGCGGGAAAGGTAGATGTTACAATCTCTAATCACATGGAAAGAATTGCTCAGAGGGAACAACATAGTAGTGTCTTGCATCCTCCAAAAAGAGACAGTTCTTCCATGAAAGGTCCTATCCATGCAGCTAATACATCTGGCGGATTTGAATCTTTCTTGGATTTGTGGGATGCTAGCCAGGAATTTTTTTTTGATCTCTATTACACCAAACGGTCTGAAGTGAACTCTGTTGTCCCCTTTGAATTACATGGAATAGCCATCTGTTGGGAAAATTCTCCGGTGTATTATGTGAACCTCCCAAAGGATTTGTTGGGGCCCAAGAGTGGAAAAGGTCTTTATCCGGATGACAGGACATCTGGTGACCAGGTAGATGTTTCACAATATGAACATTGGTTTGAGATGATAGAAACGAGATGGAAAAAGATCAATAAAATTTTTGCGAAGAAAAATGTTAGAAAGTTTGCATGGAATTTGAAAATTCAGGTTCAGGTACTTAAATGTCCGGGAGTTTCCATCCAGAAATTGGGCTTCCTGAATTCTGCTCGACGTAATATGGGTCTTAAACTTGTAGACGGTTCATACTTAGTGTTGTCGAGAGTTCACATAAGCAATGTAATTGATATGTGCATTGTTGCTTGGATTCTTTGGCCAGATGACGAGAGAAATTCAACCCCGAACCTGGAGAAGGTATGGCAGAATGAACTCAACTTATGCAAAACATGACTGAAAAATAAAATTTTCTTATTAGGTGTCATTTTTCGACCAAGATAACACATGATTTTGAAGGATTTTCTGCTACGATGAATTCTTTAATATTTTATGACATCTAATCTTCAACTGAAATTGTGATGATGATGAAATAGGAAGTCAAGAAAAGATTATCTGGTGAAGCTGCTTCTGCTGCTAACCGGAGTGGCCAGTGGAAGAATCAGATGAGAAGAGTAGCCCATAATGGTTGCTGTCGGCGTGTTGCACAGACACGAGCCCTATGTTCTGTTCTCTGGAAGCTAATAATTTCTGAAAAGCTCTTGGAAGCTCTCAACAATATAGAGATTCCATTGGTAAATTTTGCCTGTGCATTTCCCTGAGGGGGAAGATGACATTAGGACAAGAGTTTATTCTTCTAATGCCAATAAAAATAGTAAACAGTAGCTGTTAGTGGGTTGTTCGCTATATACATGATGCAATTCTGATCTAATTGTCGTAAGAAGAAAAACAATCAGATTAGGTTCATATTTGCTTTTGTTGCTTTTTAGCCAGATTTATATGATACTTGAGATGAAATTTTCTTCTTTCACAAAGCAGCTGAATAATATCTATAACTTCAGCCTAAATATATTAACGATTAGTTGAAGCATTTGAACTTGTTTAACCAATGGAAACTTTTGTAGGTAAGTATTCTTGCTGATATGGAGACCTGGGGTATAGGTGTTGACATGGAGGGATGCATTCGGGCCCGTAATTTACTGGGAAAAAAACTCAAGTGCCTTGAGAAGGAAGCTTATAGGCTAGCTGGCATGAGCTTCTCCCTGTACGCAGCAGCAGATATTGC
TAATGTTCTATATGGACATTTGAAGCTCTCTATTCCAGAGGGGTTCAACAAAGGCAAACAACATCCAAGTACTGATAAACATTGTTTGGACTTGCTGAGGTAATAAATTGATTCTAATAATTTTTATGCATGCATGTGAAATTGATAATTATAATAAATGCAAGGATGCAGGATATGAAACAAAGAACTTCCTGATGTTCGTCAACCTCTCCATTTTTCCTTCTAACTCTAACTGAGTTAACAGGAAAAGTTCTCCAATATTGTGGAGTCTTCAAATCTTATTTCCCCTTAACAATACCAACATAAGAGGCTCTAGGCTTAGGATCCGAATTCTGAATTCGTTGTCCTTGAATGGTGGGCTCCATCAGATTTCTTTCATCATATTTCGGCAAGTAATCTTATATCATATCCCAGAAAGTTGACCATCCTCTTTTGACGAGACCAGCTGGAATGAGTTGATCTTTCTTCCACCTGTAGTTGGTCAAATAAAACCACACATTCCATTATCCAACTTGAAGGGATGCGTTTTTTATGAATACAAACAAGTTCATTCTCATATTTTTCTTGCTTAAAGAAATTTGAATAAAGGGGAAATCACATCATTTTCACCATAGCATTTTAAAAACCATCTAAGTTGAGACAATTAAGGGGAGGGAAGCATATTCCTTTTCTCCATTTTTTCATTGTAATAAAAGTTTCTTTGAACCAAATACATAAATATTTATTTTCAACACTGCAGCCGCTCTAGTCCATACTTCAAATTAATATTAAACGCCGATAAGAGGTTGACAAGTATCTGGATTGCACTTGATTTGAATCCAAATGCTGAAGCCTTGAATCTTAAAAAACTAGCAGAAATTGAATCTCTAAGAGAAGTGAGGAGCATTTTCCATTCCATCATACAACATGTCTAAGTGCACGTTATTTATTTATTTACTGGATATGAAACATGAATCACTGTAACGAAGAATTTAATAAAGATCTTGTATTTCAGGAATGAACACCCTATTGTTCCAGTCATTAAAGAGCACCGGACATTGGCGAAGCTCTTTAACTGTACTTTGGGATCCATTTGCTCGTTAGCTAAGCTATCTGCAAGGACACAGAAATACACGCTACATGGTCATTGGCTCCAAACATCCACAGCAACTGGTCGGCTTTCCATGGAGGAGCCTAACCTTCAGGTATGGGGTTTCATTACCTCTTTTCTATATTTTGTTGATGAAATTAAAGATGAAACCACATGCTCAAGTTTGGATGAACTCTTGGAGTGCAGTGCGTTGAGCATGCGGTGGATTTCAAAATGAATGAAGATGATGTTGATCATTGTAAAATTAATGCTCGTGATTTCTTCATCTCTACTCAGGTATTTTCATTAAATTTTGTTTACTAACCTCTTCTTAAGAATTTATGGTGGAGAGTTTAGCCTCATTTAAGATTTCTTTCATCAATAAAATTTATTGTTTCTCTCTCAAAAAATTTCTTGAGCATTATGGATCTCTTTTTGGGGTCCATATTCAAAAAGACTCTCTTTCTTTTTGTTCTTTTTGGATCTTTAATAGTGTGTAAGACTAGTGTAGTTAATATGAATAATGTATAAAATACAATTTACTTAATGTTTGTGCATCAGTGCATGAAATGGTGAACTGATGATTGCGAATTCTGTCTGAAGGATTCAGAATTTTATATATGACAAGGTGATGTCAGTGAATGCTAGTAGCTCTGACGACTTGGATTAGATAACTTTAACAGTGTGTAGGAGTAGATTTTGGAGTAGTCTGTGTTTGTTTGACTTAGTGATAAGTGTCAAACATCTTTTTTTGGTTCAGTAGTTTCATTCATAAAAGAAGTTATGAGCATTCAACCATTACCACTATTCTCCGGAGGTTGGCATTTTTGCTAATGTCTTGTTGTCTGAAAGCAAGATGATTGTTTTTCCTTTTAGTTGGTTAAAGAATCAAAGATTCTATTCTGGGTGCAAACTTGGTTGAGTTGTGATT
TTTAATATACATTGTAGGAAAATTGGCTGCTCGTATCAGCAGATTATTCTCAGATAGAGTTACGGCTGATGGCACATTTTTCAAAAGACTCCTCACTGATTGAACTCCTCAGTAAACCTCATGGGGATGTGTTTACCATGATTGCTGCTAGATGGACAGGGAAGACAGAAGACTCTATTGGACCTCATGAGCGAGATCAAACCAAAAGGTTGGTATATGGAATCCTTTATGGAATGGGCGCCAAAACACTTGCATTACAGTTGGAATGTAGTAAAGATGAGGCTGTAGAGAAAATTCGAAGTTTCAAGAGTTCTTTCCCTGGCGTGGCTTCCTGGCTTCATGAGGCGGTCACATTTTGCCGTCAGAAGGGGTAAAAACTTCTTGTTCTCTTTGGCGTTTTTCTATTTTCATTTGTTTAGATAAATGCTCATTCTAAGTCCTAGGAGGTTACCTGGAGATGGGTCCTAGATATATGCACTTGTAAATTTGTATATGAGAGTGTATACTGAAACAATATATACATCTGAACTTGTTTGGATTTCCTTTCACTTTCGCGCATCATATTTTATCGTAATTTTTTTCAGGTACGTTGAAACTCTTAAAGGAAGAAGACGCTTTTTGTCAAAAATTAATTCTCCAAATAGCAAGGAAAAATCAAAAGCACAGCGACAAGCTGTGAATTCAATTTGTCAGGTATTTCCACCTTCCTTTTGTGTGATTACTATTATTCTGTTCTGGGGTGGCTTGAAGGAACCTCATCCTCTCTCCCTTTCTTGATTTTAAGTGTTCTGTAATCATGTGCCAGTAGTTTTTCTTTTACTAGGGTTCAGCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCATGTGCCAGTATTTTTTCTTTTACTGGGGTTCAGCAGCTGACATAATTAAAGTGGCTATGATCAACATTTACTCCGTCATTGGAACGGATGCCCCAGATCCTACAGGATTACCTGCAGCTAACACTAATATATTGAGGGGCCACTGCCGAATTGTATTACAGGTTCATATGCAGTTTCCTTTTTCAAACTGCATAACTTTTACAGTTGCCTTGTTCTTTTCTGTCTATTCCACTTCATGTAGAATCATTTCAGTAATTGCGAAGATTTTTACTCAGGTGCATGATGAGTTAGTGCTAGAAGTGGACCCTTCCGTCGTCAAGGAGGCAGCAGCTTTGTTACAGAAGAGCATGGAAAATGCTGCCTCCCTTCTGGGTAAATATTCAGCATTTACATTTTTCCTTTTTCTACGTTCAAAATGTTTATCACAAGATCTGTAAATGCGATTCTATTTCAGTTCCATTGCAAGTCAAACTGAAAGTTGGACGAACATGGGGTTCTTTGGAGCCCTTCTTGCATGATAGCTTCAAGATTGAAGTTCTTGTACCTGGATCTTGA

mRNA sequence

ATGCAGAGTTTGCAAGATAAGGCTTCCGAGTGGAGCGGAGTGGAGCGTGAAGATGCCTTCGCCATCGACGAAGTCAATTTGTTTCAGAAGTTAGGTCTCCAGACGTTTGTTAATCTATCGACCAAATTCTATAACAAGTCTGTTTCTGTTCATCAATGCAGCATGCGGCATGCGCGAAGCCATCCAGCTCTTATAGGTCGACACCGACCATTCCCGGTCACACATAGAGCGGCAGAGAGATGGTTACATCACATGCAACTAGCGTTAGACGAGACTCCAGAAATAGATGCAGATTCAAAATTTTATGCTTCAAAGAAAAGGAAACCTCTTACTTCCAGTCTGAAGTCTGGGAGTTACGACAAGGATGGAAAAAAGTCACTTGAAGGGTCGCCTGGTGCTAAAGGTACGTTGGACAATTACCTAGTGACCTCACAGGACCATGGCAACTCTGATATCCCATCGCATTCAGTTCGGGAAAACTTGTCTGAGCAAGACCTAGTAAAGAGAAACCTATTGTTGAAAATTAATAGCTCCTCTAGAAATGAACACGAGGAACCCACTTTGTCTAGAGGGTGTGACACTTCTGCAGCAACTGAAGGAATCAAGAAAAGAACTCTGGAGGACTCGTACGAGACCAGAAGTTCAACAGTTAAATTGATGGCAGGTGACGGGGGTGTCACACCCTGCACGGAGAAACCAGAGCTTAAACAGTTTGCAGCTGATTTCTTGTCTCTGTACTGCAGCAATGAGTTGCATACAACTGTTAGTTCGCCGGGAGAGCAAAAAGTGACTTTTCTCAAGCGGCACTCCAGTCCATCTCTACTAGAGGGGGAGGCTAAATTACCAAAGAAGATACATTCTATTGCGGGTCCATCAAATGCCAAAGGTGAACCTGATTCCTCAAATGCATTGAGTGTTGGAAACAAGCAGTCCAATTTTGTTGTTGAAACTGGGGATACTGACAGTCATCCTCCCGTTGTGCTTAAGGCATGTCTGCAGAAATGCAATAAAGCACCTAGATCACCTTATTGTTTGACTGAATGCAAAACACCAGGCTTGTCAACTGCCAATACATGTTTTCAGGAGACTCCCAAGTCTGGAAGCTCGACATTTTCCCCTGGAGAAGCTTTTTGGAAAGAAGCAATCGTGTTTGCAGATGGTTTATGCGCTCCAAGCATTGACCTTACCAATTGTGATGCTGAAGGAGCTAATGTTGCAGAGAGTCAGAGTCATACGAAGAAACTTCCTATACCAGGGGAACCTGCTCAGAAAAGGTTAAAAGGACAGTTTGGTGGAGGTAGTGGTGGAGTCCGGCTTGGGGAACCTGGTGCTTCCATGGTTTCATTGAGGAGTGAATTAAAAGAGTTAAATAGAGAAGTGTCTTCACTACCTGTTAAGCATTTTGACTTCTCGGCTGATGATAAAAACTTGGATGGAAGTACATTACCTTATTGTGCTTCAAATGAATCTGAAGTTAATGCATATGACCTTAATGAGCAGTCTGATTGTTGTTATACTAATGATAGTCTACCAAACCATAATGATAAGACCCGCGACAGTGATTCTCTTACGAAAGAGAAGATACATGAAACAAATGTAACTTCCTCTGTTCCGGTAGTGACTGAAGTGAAATTAAACATATTTAGTCCCTCTGATAGTATCACATCTGACACAGCGGTTCATGAACTTAGGGCTTCTACTGTTCATGATTTTAAAGAGGAAACAACACCTTCAAGTTCAGTAAGACATAAAGATTGGCTGGATCTAAGTTGCTGGCTGCCTCCTGAAATTTGCAGCATTTATAAAGAGAAAGGAATAACAAAACTGCATCCTTGGCAGGTCGAATGTCTTAAGGTAGACGGTGTCTTGCAGAGAAGAAATCTTGTTTATTGTGCATCTACCAGTGCTGGAAAAAGTTTTGTTGCAGAGATCTTAATGTTAAGGCGGGTGATTTCTACCGGAAAGATGGCACTTCTTGTACTTCCATATGTATCAATTTG
TGCAGAAAAGGCCGCACATCTTGATGTTCTTCTTGAACCTCTGGATAAGCATGTGCGTAGTTATTATGGAAATCAAGGTGGTGGAACGCTTCCTAAGGATACTTCTGTGGCTGTTTGCACAATTGAGAAGGCAAACTCTTTGATAAACAGATTGTTGGAGGAGGGTCGTTTGTCAGAAATTGGAATTATTGTGATCGATGAATTGCACATGGTTGGAGATCAGACGAGGGGTTATCTTTTGGAACTACTTTTGACTAAACTTCGTTATGCTGCCGGTGAAGGTAATTTAGATTCTAGCAGTGGTGAGAGTTCTGGGACGAGCAGTGGTAAGTCAGACCCTGCTCATGGTATTCAAATAGTTGGCATGAGTGCAACCATGCCAAATGTGGCAGCTGTTGCAGATTGGCTTCAGGCAGCCTTGTACCAGACTGATTTTCGACCTGTTCCTTTAGAGGAGTACATTAAAGTTGGGAATACCATTTATAATAGAAGTTTGGACATTGTTAGAACAATCTCAAAAACAGCTAATCTTGGCGGTAGGGATCCAGATCACATTGTGGAATTATGTAATGAGGTAGTTGAAGAGGGTCACTCAGTATTGATCTTTTGCTCCAGTCGGAAAGGATGTGAATCAACAGCAAAACACGTGTCAAAATTCCTCAAGAAGTTTTCTGTCAAAATTCACAATGAGAACAGTGAGTTTACAGACATTTTTTCGGCAGTTGATGCACTGCGAAGATGTCCTTCTGGACTGGATCCTGTATTAGAGGAAACCTTTCCGTCTGGTGTTGCCTACCATCATGCTGGCCTTACTGTAGAGGAGAGAGAGGTTGTTGAAACTTGCTACCGCAAGGGTCTTTTGCGTGTTTTAACTGCTACATCTACCTTAGCTGCTGGAGTTAACCTGCCAGCTCGAAGGGTCATTTTCCGACAACCCAAGATTGGGCGAGACTTTATTGATGGTGCAAGGTACAGACAGATGGCTGGTCGGGCTGGCCGGACTGGGATTGATACCAAGGGGGAGAGTGTACTCATTTGCAGACCAGAAGAGATTAAAAGAATTAATGAACTTCTTAACGAGAGCTGTCCACCTTTGCAATCATGTTTGTCTGAAGATAAGAATGGAATGACTCATGCAATTTTAGAAGTTGTGGCTGGTGGGATTGTTCAAACTGCAACTGATATTCATCGATATGTAAGGTGTACTCTTTTGAATTCTACAAAACCATTTCAAGATGTGGTTAAATCAGCACAGGAATCTCTTCGGTGGTTATGCCATGGAAAATTTCTTGAGTGGAATGGGGATACCAAGTTGTATAGTACCACACCTCTTGGACGTGCATCGTTTGGAAGCTCTCTTAGTCCAGAAGAATCACTTATTGTTTTGGATGATCTTTCAAGGGCCCGAGAAGGATTTGTGCTTGCATCTGATTTACATTTGGTGTACCTAGTCACACCGATCAATGTTGATGTTGAGCCAGATTGGGAGCTATATTATGAACGGTTTATGGGTTTGCCTTCTCTTGACCAGTCTGTTGGAAATCGAGTTGGAGTTACGGAACCATTTTTAATGCGTATGGCGCATGGTGCACCGATTCGACGTGCAAATATCTCAAGAAATGGTGTCTCTGTTGGAAATCGAGTTGGAGTTACGGAACCATTTTTAATGCGTATGGCGCATGGTGCACCGATTCGACGTGCAAATATCTCAAGAAATGGTGTCGTGGGTTTACGTACCAAGCGTGATGAACATGGGTGCATGTATGATGACAGGCCTTCAGAGGAGCAAACTATTCGAGTGTGTAAACGATTTTACGTGGCCCTCATCTTGTCAAGACTTGTTCAGGAAACTCCCATTCCTGAAGTTTGTGAAGCTTTTAAAGTTGCCAGAGGGATGGTGCAAGCATTGCAAGAGAGTGCTGGAAGGTTTGCATCTATGGTCTCCGTATTTTGCGAGAGGCTCGGATGGCACGATCTGGAAGGTTTGGTAGCCAAGTTCC
AAAACCGTGTTTCATTTGGAGTTAGAGCTGAGATTGTAGAACTTACTACTATTCCATACGTTAAGGGTTCTCGAGCCAGAGCACTCTATAAAGCTGGTTTGCGGACACCTTTAGCAATTGCAGAAGCCTCTGATGCTGAATTAGTTAAAGCTCTTTTTGAGTCTGCATCATGGACTGCAGAAGGTGAGCTTAACAAGACATGCTTATTTGTTTGTGCTGATTCTGGTCAGCAGGTTTGTATAGAGAGTACAGCACAAAAACGGATGCATGTTGGAATAGCTAGGAAGATTAAGCATGGTGCTCGTAAAGTTGTTCTTGATAAAGCTGAAGAGGCAAGGATTGCTGCATTTTCAGCTTTTAAATCATTGGGGTTCACTGTGCCACAAATTTCTCGTCCCTTGTCAGCAAGTGCAGATGGAAATATTACAGCACAAGTGGCTGCAAGTATTCCATCTGAAATCGATACTTTAAACAGAGTTGTTAGCACACGACAAATGGAGCATGCTTTAACAAAGTCATGTTTTGGAGGAACTTCCAGTTCTGAAAAAGTAGGTGGCAAGAATCTGAGTGAAACAGGAACAATTTCTGTTGAAGTAAAACCACCCAATTTTGGCGTTAATCCTCTGGTGAATGTTGAAGGGTCTGCAATCCAGGAGTCAAATACCGTTGTTGAATGTGCGGGAAAGGTAGATGTTACAATCTCTAATCACATGGAAAGAATTGCTCAGAGGGAACAACATAGTAGTGTCTTGCATCCTCCAAAAAGAGACAGTTCTTCCATGAAAGGTCCTATCCATGCAGCTAATACATCTGGCGGATTTGAATCTTTCTTGGATTTGTGGGATGCTAGCCAGGAATTTTTTTTTGATCTCTATTACACCAAACGGTCTGAAGTGAACTCTGTTGTCCCCTTTGAATTACATGGAATAGCCATCTGTTGGGAAAATTCTCCGGTGTATTATGTGAACCTCCCAAAGGATTTGTTGGGGCCCAAGAGTGGAAAAGGTCTTTATCCGGATGACAGGACATCTGGTGACCAGGTTCAGGTACTTAAATGTCCGGGAGTTTCCATCCAGAAATTGGGCTTCCTGAATTCTGCTCGACGTAATATGGGTCTTAAACTTGTAGACGGTTCATACTTAGTGTTGTCGAGAGTTCACATAAGCAATGTAATTGATATGTGCATTGTTGCTTGGATTCTTTGGCCAGATGACGAGAGAAATTCAACCCCGAACCTGGAGAAGGAAGTCAAGAAAAGATTATCTGGTGAAGCTGCTTCTGCTGCTAACCGGAGTGGCCAGTGGAAGAATCAGATGAGAAGAGTAGCCCATAATGGTTGCTGTCGGCGTGTTGCACAGACACGAGCCCTATGTTCTGTTCTCTGGAAGCTAATAATTTCTGAAAAGCTCTTGGAAGCTCTCAACAATATAGAGATTCCATTGGTAAGTATTCTTGCTGATATGGAGACCTGGGGTATAGGTGTTGACATGGAGGGATGCATTCGGGCCCGTAATTTACTGGGAAAAAAACTCAAGTGCCTTGAGAAGGAAGCTTATAGGCTAGCTGGCATGAGCTTCTCCCTGTACGCAGCAGCAGATATTGCTAATGTTCTATATGGACATTTGAAGCTCTCTATTCCAGAGGGGTTCAACAAAGGCAAACAACATCCAAGTACTGATAAACATTGTTTGGACTTGCTGAGGAATGAACACCCTATTGTTCCAGTCATTAAAGAGCACCGGACATTGGCGAAGCTCTTTAACTGTACTTTGGGATCCATTTGCTCGTTAGCTAAGCTATCTGCAAGGACACAGAAATACACGCTACATGGTCATTGGCTCCAAACATCCACAGCAACTGGTCGGCTTTCCATGGAGGAGCCTAACCTTCAGTGCGTTGAGCATGCGGTGGATTTCAAAATGAATGAAGATGATGTTGATCATTGTAAAATTAATGCTCGTGATTTCTTCATCTCTACTCAGGAAAATTGGCTGCTCGTA
TCAGCAGATTATTCTCAGATAGAGTTACGGCTGATGGCACATTTTTCAAAAGACTCCTCACTGATTGAACTCCTCAGTAAACCTCATGGGGATGTGTTTACCATGATTGCTGCTAGATGGACAGGGAAGACAGAAGACTCTATTGGACCTCATGAGCGAGATCAAACCAAAAGGTTGGTATATGGAATCCTTTATGGAATGGGCGCCAAAACACTTGCATTACAGTTGGAATGTAGTAAAGATGAGGCTGTAGAGAAAATTCGAAGTTTCAAGAGTTCTTTCCCTGGCGTGGCTTCCTGGCTTCATGAGGCGGTCACATTTTGCCGTCAGAAGGGGTACGTTGAAACTCTTAAAGGAAGAAGACGCTTTTTGTCAAAAATTAATTCTCCAAATAGCAAGGAAAAATCAAAAGCACAGCGACAAGCTGTGAATTCAATTTGTCAGTATTTTTTCTTTTACTGGGGTTCAGCAGCTGACATAATTAAAGTGGCTATGATCAACATTTACTCCGTCATTGGAACGGATGCCCCAGATCCTACAGGATTACCTGCAGCTAACACTAATATATTGAGGGGCCACTGCCGAATTGTATTACAGGTGCATGATGAGTTAGTGCTAGAAGTGGACCCTTCCGTCGTCAAGGAGGCAGCAGCTTTGTTACAGAAGAGCATGGAAAATGCTGCCTCCCTTCTGGTTCCATTGCAAGTCAAACTGAAAGTTGGACGAACATGGGGTTCTTTGGAGCCCTTCTTGCATGATAGCTTCAAGATTGAAGTTCTTGTACCTGGATCTTGA

Coding sequence (CDS)

ATGCAGAGTTTGCAAGATAAGGCTTCCGAGTGGAGCGGAGTGGAGCGTGAAGATGCCTTCGCCATCGACGAAGTCAATTTGTTTCAGAAGTTAGGTCTCCAGACGTTTGTTAATCTATCGACCAAATTCTATAACAAGTCTGTTTCTGTTCATCAATGCAGCATGCGGCATGCGCGAAGCCATCCAGCTCTTATAGGTCGACACCGACCATTCCCGGTCACACATAGAGCGGCAGAGAGATGGTTACATCACATGCAACTAGCGTTAGACGAGACTCCAGAAATAGATGCAGATTCAAAATTTTATGCTTCAAAGAAAAGGAAACCTCTTACTTCCAGTCTGAAGTCTGGGAGTTACGACAAGGATGGAAAAAAGTCACTTGAAGGGTCGCCTGGTGCTAAAGGTACGTTGGACAATTACCTAGTGACCTCACAGGACCATGGCAACTCTGATATCCCATCGCATTCAGTTCGGGAAAACTTGTCTGAGCAAGACCTAGTAAAGAGAAACCTATTGTTGAAAATTAATAGCTCCTCTAGAAATGAACACGAGGAACCCACTTTGTCTAGAGGGTGTGACACTTCTGCAGCAACTGAAGGAATCAAGAAAAGAACTCTGGAGGACTCGTACGAGACCAGAAGTTCAACAGTTAAATTGATGGCAGGTGACGGGGGTGTCACACCCTGCACGGAGAAACCAGAGCTTAAACAGTTTGCAGCTGATTTCTTGTCTCTGTACTGCAGCAATGAGTTGCATACAACTGTTAGTTCGCCGGGAGAGCAAAAAGTGACTTTTCTCAAGCGGCACTCCAGTCCATCTCTACTAGAGGGGGAGGCTAAATTACCAAAGAAGATACATTCTATTGCGGGTCCATCAAATGCCAAAGGTGAACCTGATTCCTCAAATGCATTGAGTGTTGGAAACAAGCAGTCCAATTTTGTTGTTGAAACTGGGGATACTGACAGTCATCCTCCCGTTGTGCTTAAGGCATGTCTGCAGAAATGCAATAAAGCACCTAGATCACCTTATTGTTTGACTGAATGCAAAACACCAGGCTTGTCAACTGCCAATACATGTTTTCAGGAGACTCCCAAGTCTGGAAGCTCGACATTTTCCCCTGGAGAAGCTTTTTGGAAAGAAGCAATCGTGTTTGCAGATGGTTTATGCGCTCCAAGCATTGACCTTACCAATTGTGATGCTGAAGGAGCTAATGTTGCAGAGAGTCAGAGTCATACGAAGAAACTTCCTATACCAGGGGAACCTGCTCAGAAAAGGTTAAAAGGACAGTTTGGTGGAGGTAGTGGTGGAGTCCGGCTTGGGGAACCTGGTGCTTCCATGGTTTCATTGAGGAGTGAATTAAAAGAGTTAAATAGAGAAGTGTCTTCACTACCTGTTAAGCATTTTGACTTCTCGGCTGATGATAAAAACTTGGATGGAAGTACATTACCTTATTGTGCTTCAAATGAATCTGAAGTTAATGCATATGACCTTAATGAGCAGTCTGATTGTTGTTATACTAATGATAGTCTACCAAACCATAATGATAAGACCCGCGACAGTGATTCTCTTACGAAAGAGAAGATACATGAAACAAATGTAACTTCCTCTGTTCCGGTAGTGACTGAAGTGAAATTAAACATATTTAGTCCCTCTGATAGTATCACATCTGACACAGCGGTTCATGAACTTAGGGCTTCTACTGTTCATGATTTTAAAGAGGAAACAACACCTTCAAGTTCAGTAAGACATAAAGATTGGCTGGATCTAAGTTGCTGGCTGCCTCCTGAAATTTGCAGCATTTATAAAGAGAAAGGAATAACAAAACTGCATCCTTGGCAGGTCGAATGTCTTAAGGTAGACGGTGTCTTGCAGAGAAGAAATCTTGTTTATTGTGCATCTACCAGTGCTGGAAAAAGTTTTGTTGCAGAGATCTTAATGTTAAGGCGGGTGATTTCTACCGGAAAGATGGCACTTCTTGTACTTCCATATGTATCAATTTG
TGCAGAAAAGGCCGCACATCTTGATGTTCTTCTTGAACCTCTGGATAAGCATGTGCGTAGTTATTATGGAAATCAAGGTGGTGGAACGCTTCCTAAGGATACTTCTGTGGCTGTTTGCACAATTGAGAAGGCAAACTCTTTGATAAACAGATTGTTGGAGGAGGGTCGTTTGTCAGAAATTGGAATTATTGTGATCGATGAATTGCACATGGTTGGAGATCAGACGAGGGGTTATCTTTTGGAACTACTTTTGACTAAACTTCGTTATGCTGCCGGTGAAGGTAATTTAGATTCTAGCAGTGGTGAGAGTTCTGGGACGAGCAGTGGTAAGTCAGACCCTGCTCATGGTATTCAAATAGTTGGCATGAGTGCAACCATGCCAAATGTGGCAGCTGTTGCAGATTGGCTTCAGGCAGCCTTGTACCAGACTGATTTTCGACCTGTTCCTTTAGAGGAGTACATTAAAGTTGGGAATACCATTTATAATAGAAGTTTGGACATTGTTAGAACAATCTCAAAAACAGCTAATCTTGGCGGTAGGGATCCAGATCACATTGTGGAATTATGTAATGAGGTAGTTGAAGAGGGTCACTCAGTATTGATCTTTTGCTCCAGTCGGAAAGGATGTGAATCAACAGCAAAACACGTGTCAAAATTCCTCAAGAAGTTTTCTGTCAAAATTCACAATGAGAACAGTGAGTTTACAGACATTTTTTCGGCAGTTGATGCACTGCGAAGATGTCCTTCTGGACTGGATCCTGTATTAGAGGAAACCTTTCCGTCTGGTGTTGCCTACCATCATGCTGGCCTTACTGTAGAGGAGAGAGAGGTTGTTGAAACTTGCTACCGCAAGGGTCTTTTGCGTGTTTTAACTGCTACATCTACCTTAGCTGCTGGAGTTAACCTGCCAGCTCGAAGGGTCATTTTCCGACAACCCAAGATTGGGCGAGACTTTATTGATGGTGCAAGGTACAGACAGATGGCTGGTCGGGCTGGCCGGACTGGGATTGATACCAAGGGGGAGAGTGTACTCATTTGCAGACCAGAAGAGATTAAAAGAATTAATGAACTTCTTAACGAGAGCTGTCCACCTTTGCAATCATGTTTGTCTGAAGATAAGAATGGAATGACTCATGCAATTTTAGAAGTTGTGGCTGGTGGGATTGTTCAAACTGCAACTGATATTCATCGATATGTAAGGTGTACTCTTTTGAATTCTACAAAACCATTTCAAGATGTGGTTAAATCAGCACAGGAATCTCTTCGGTGGTTATGCCATGGAAAATTTCTTGAGTGGAATGGGGATACCAAGTTGTATAGTACCACACCTCTTGGACGTGCATCGTTTGGAAGCTCTCTTAGTCCAGAAGAATCACTTATTGTTTTGGATGATCTTTCAAGGGCCCGAGAAGGATTTGTGCTTGCATCTGATTTACATTTGGTGTACCTAGTCACACCGATCAATGTTGATGTTGAGCCAGATTGGGAGCTATATTATGAACGGTTTATGGGTTTGCCTTCTCTTGACCAGTCTGTTGGAAATCGAGTTGGAGTTACGGAACCATTTTTAATGCGTATGGCGCATGGTGCACCGATTCGACGTGCAAATATCTCAAGAAATGGTGTCTCTGTTGGAAATCGAGTTGGAGTTACGGAACCATTTTTAATGCGTATGGCGCATGGTGCACCGATTCGACGTGCAAATATCTCAAGAAATGGTGTCGTGGGTTTACGTACCAAGCGTGATGAACATGGGTGCATGTATGATGACAGGCCTTCAGAGGAGCAAACTATTCGAGTGTGTAAACGATTTTACGTGGCCCTCATCTTGTCAAGACTTGTTCAGGAAACTCCCATTCCTGAAGTTTGTGAAGCTTTTAAAGTTGCCAGAGGGATGGTGCAAGCATTGCAAGAGAGTGCTGGAAGGTTTGCATCTATGGTCTCCGTATTTTGCGAGAGGCTCGGATGGCACGATCTGGAAGGTTTGGTAGCCAAGTTCC
AAAACCGTGTTTCATTTGGAGTTAGAGCTGAGATTGTAGAACTTACTACTATTCCATACGTTAAGGGTTCTCGAGCCAGAGCACTCTATAAAGCTGGTTTGCGGACACCTTTAGCAATTGCAGAAGCCTCTGATGCTGAATTAGTTAAAGCTCTTTTTGAGTCTGCATCATGGACTGCAGAAGGTGAGCTTAACAAGACATGCTTATTTGTTTGTGCTGATTCTGGTCAGCAGGTTTGTATAGAGAGTACAGCACAAAAACGGATGCATGTTGGAATAGCTAGGAAGATTAAGCATGGTGCTCGTAAAGTTGTTCTTGATAAAGCTGAAGAGGCAAGGATTGCTGCATTTTCAGCTTTTAAATCATTGGGGTTCACTGTGCCACAAATTTCTCGTCCCTTGTCAGCAAGTGCAGATGGAAATATTACAGCACAAGTGGCTGCAAGTATTCCATCTGAAATCGATACTTTAAACAGAGTTGTTAGCACACGACAAATGGAGCATGCTTTAACAAAGTCATGTTTTGGAGGAACTTCCAGTTCTGAAAAAGTAGGTGGCAAGAATCTGAGTGAAACAGGAACAATTTCTGTTGAAGTAAAACCACCCAATTTTGGCGTTAATCCTCTGGTGAATGTTGAAGGGTCTGCAATCCAGGAGTCAAATACCGTTGTTGAATGTGCGGGAAAGGTAGATGTTACAATCTCTAATCACATGGAAAGAATTGCTCAGAGGGAACAACATAGTAGTGTCTTGCATCCTCCAAAAAGAGACAGTTCTTCCATGAAAGGTCCTATCCATGCAGCTAATACATCTGGCGGATTTGAATCTTTCTTGGATTTGTGGGATGCTAGCCAGGAATTTTTTTTTGATCTCTATTACACCAAACGGTCTGAAGTGAACTCTGTTGTCCCCTTTGAATTACATGGAATAGCCATCTGTTGGGAAAATTCTCCGGTGTATTATGTGAACCTCCCAAAGGATTTGTTGGGGCCCAAGAGTGGAAAAGGTCTTTATCCGGATGACAGGACATCTGGTGACCAGGTTCAGGTACTTAAATGTCCGGGAGTTTCCATCCAGAAATTGGGCTTCCTGAATTCTGCTCGACGTAATATGGGTCTTAAACTTGTAGACGGTTCATACTTAGTGTTGTCGAGAGTTCACATAAGCAATGTAATTGATATGTGCATTGTTGCTTGGATTCTTTGGCCAGATGACGAGAGAAATTCAACCCCGAACCTGGAGAAGGAAGTCAAGAAAAGATTATCTGGTGAAGCTGCTTCTGCTGCTAACCGGAGTGGCCAGTGGAAGAATCAGATGAGAAGAGTAGCCCATAATGGTTGCTGTCGGCGTGTTGCACAGACACGAGCCCTATGTTCTGTTCTCTGGAAGCTAATAATTTCTGAAAAGCTCTTGGAAGCTCTCAACAATATAGAGATTCCATTGGTAAGTATTCTTGCTGATATGGAGACCTGGGGTATAGGTGTTGACATGGAGGGATGCATTCGGGCCCGTAATTTACTGGGAAAAAAACTCAAGTGCCTTGAGAAGGAAGCTTATAGGCTAGCTGGCATGAGCTTCTCCCTGTACGCAGCAGCAGATATTGCTAATGTTCTATATGGACATTTGAAGCTCTCTATTCCAGAGGGGTTCAACAAAGGCAAACAACATCCAAGTACTGATAAACATTGTTTGGACTTGCTGAGGAATGAACACCCTATTGTTCCAGTCATTAAAGAGCACCGGACATTGGCGAAGCTCTTTAACTGTACTTTGGGATCCATTTGCTCGTTAGCTAAGCTATCTGCAAGGACACAGAAATACACGCTACATGGTCATTGGCTCCAAACATCCACAGCAACTGGTCGGCTTTCCATGGAGGAGCCTAACCTTCAGTGCGTTGAGCATGCGGTGGATTTCAAAATGAATGAAGATGATGTTGATCATTGTAAAATTAATGCTCGTGATTTCTTCATCTCTACTCAGGAAAATTGGCTGCTCGTA
TCAGCAGATTATTCTCAGATAGAGTTACGGCTGATGGCACATTTTTCAAAAGACTCCTCACTGATTGAACTCCTCAGTAAACCTCATGGGGATGTGTTTACCATGATTGCTGCTAGATGGACAGGGAAGACAGAAGACTCTATTGGACCTCATGAGCGAGATCAAACCAAAAGGTTGGTATATGGAATCCTTTATGGAATGGGCGCCAAAACACTTGCATTACAGTTGGAATGTAGTAAAGATGAGGCTGTAGAGAAAATTCGAAGTTTCAAGAGTTCTTTCCCTGGCGTGGCTTCCTGGCTTCATGAGGCGGTCACATTTTGCCGTCAGAAGGGGTACGTTGAAACTCTTAAAGGAAGAAGACGCTTTTTGTCAAAAATTAATTCTCCAAATAGCAAGGAAAAATCAAAAGCACAGCGACAAGCTGTGAATTCAATTTGTCAGTATTTTTTCTTTTACTGGGGTTCAGCAGCTGACATAATTAAAGTGGCTATGATCAACATTTACTCCGTCATTGGAACGGATGCCCCAGATCCTACAGGATTACCTGCAGCTAACACTAATATATTGAGGGGCCACTGCCGAATTGTATTACAGGTGCATGATGAGTTAGTGCTAGAAGTGGACCCTTCCGTCGTCAAGGAGGCAGCAGCTTTGTTACAGAAGAGCATGGAAAATGCTGCCTCCCTTCTGGTTCCATTGCAAGTCAAACTGAAAGTTGGACGAACATGGGGTTCTTTGGAGCCCTTCTTGCATGATAGCTTCAAGATTGAAGTTCTTGTACCTGGATCTTGA

Protein sequence

MQSLQDKASEWSGVEREDAFAIDEVNLFQKLGLQTFVNLSTKFYNKSVSVHQCSMRHARSHPALIGRHRPFPVTHRAAERWLHHMQLALDETPEIDADSKFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRENLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKLMAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEAKLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKAPRSPYCLTECKTPGLSTANTCFQETPKSGSSTFSPGEAFWKEAIVFADGLCAPSIDLTNCDAEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKELNREVSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTNDSLPNHNDKTRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHELRASTVHDFKEETTPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIESTAQKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNITAQVAASIPSEIDTLNRVVSTRQMEHALTKSCFGGTSSSEKVGGKNLSETGTISVEVKPPNFGVNPLVNVEGSAIQESNTVVECAGKVDVTISNHMERIAQREQHSSVLHPPKRDSSSMKGPIHAANTSGGFESFLDLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDLLGPKSGKGLYPDDRTSGDQVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEKEVKKRLSGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDDVDHCKINARDFFISTQENWLLV
SADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMINIYSVIGTDAPDPTGLPAANTNILRGHCRIVLQVHDELVLEVDPSVVKEAAALLQKSMENAASLLVPLQVKLKVGRTWGSLEPFLHDSFKIEVLVPGS

BLAST of Cla022227 vs. Swiss-Prot
Match: TEB_ARATH (Helicase and polymerase-containing protein TEBICHI OS=Arabidopsis thaliana GN=TEB PE=2 SV=1)

HSP 1 Score: 2390.9 bits (6195), Expect = 0.0e+00
Identity = 1315/2239 (58.73%), Positives = 1575/2239 (70.34%), Query Frame = 1

Query: 95   IDADS------KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHG 154
            +D+DS      +FY SKKRK  + +LKSG  +K+ K + E SPG KGTLD+YL  S D  
Sbjct: 1    MDSDSSKSRIDQFYVSKKRKHQSPNLKSGRNEKNVKVTGERSPGDKGTLDSYLKASLDDK 60

Query: 155  NSDIPSHSVRENLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLED 214
            ++       R     Q+   R L L++++SS  ++  P L +    +   E + +   +D
Sbjct: 61   STTNSGLQAR-----QEAFTRKLDLEVSASSVGQNIHPCLPKPVSFATFKECLGQNGSQD 120

Query: 215  SYETRSSTVKLMAGDGGVTPCT-EKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLK 274
             ++      +  A DG +     +  EL+ FA  FLSLYCS  + + V SP  QK   LK
Sbjct: 121  LHK-EGVAAETHATDGLLCANQKDNSELRDFATSFLSLYCSG-VQSVVGSPPHQKENELK 180

Query: 275  RHSSPSLLEGEAKLPKK-------IHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDT 334
            R SS S L  + ++  K       I S+   +N  G    S A +  N+       T   
Sbjct: 181  RRSSSSSLAQDIQISHKRRCESENIPSLDDLTNPLGSKPESLARNGNNRDKPVSDPTKKM 240

Query: 335  DSHPPVVLKACLQKCNKAPRSPYCLTECKTPGLSTANTCFQETPKSG--SSTFSPGEAFW 394
             S+  V +   L+KC+KAP S   LTE  TPG S   +C   TPKSG  SS FSPGEAFW
Sbjct: 241  PSNESVEIPMGLRKCSKAPESSAHLTEFHTPG-SAIKSCPVGTPKSGCGSSMFSPGEAFW 300

Query: 395  KEAIVFADGLCAPSIDLTNCDAEGANVAESQ----SHTKKLPIPGEPAQKRLKGQFGGGS 454
             EAI  ADGL  P   + N  +  A V +      S +KK     E  ++ L        
Sbjct: 301  NEAIQVADGLTIP---IENFGSVEAKVRDQHVTILSCSKKTDKCTEKLERSLD------L 360

Query: 455  GGVRLGEPGASMVS--LRSELKELNREVSSLPVKHFDFSADDKNLDGSTLPYCASNESEV 514
              +R+ +  A   S  +    ++ N+EV  LPVK+ +    DKN++G     CAS +   
Sbjct: 361  DEIRVKDKDAIGFSKVVEKHGRDFNKEVYQLPVKNLELLFQDKNINGGIQERCASFDQNN 420

Query: 515  NAYDLNEQSDCCYTNDSLPNHNDKTRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSD 574
                 +  S+  +        N    + D     +  +  +    P     K+ +   + 
Sbjct: 421  ITLGSSRISESAFVG------NKGCENLDIANNAQADKGLIGKMYPEPEGKKVLLCEENR 480

Query: 575  SITSDTAVHELRASTVHDFKEET-TPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHP 634
             + S + +  +R        EE+ TPSSS R+ D L LS WLP E+CS+Y +KGI+KL+P
Sbjct: 481  GVRSVSMISNMRKPVGSSESEESHTPSSSHRNYDGLSLSTWLPSEVCSVYNKKGISKLYP 540

Query: 635  WQVECLKVDGVLQRRNLVYCASTSAGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKA 694
            WQVECL+VDGVLQ+RNLVYCASTSAGKSFVAE+LMLRRVI TGKMALLVLPYVSICAEKA
Sbjct: 541  WQVECLQVDGVLQKRNLVYCASTSAGKSFVAEVLMLRRVIRTGKMALLVLPYVSICAEKA 600

Query: 695  AHLDVLLEPLDKHVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIV 754
             HL+VLLEPL KHVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSE+GIIV
Sbjct: 601  EHLEVLLEPLGKHVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSELGIIV 660

Query: 755  IDELHMVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSA 814
            IDELHMVGDQ RGYLLEL+LTKLRYAAGEG+ +SSSGESSGTSSGK+DPAHG+QIVGMSA
Sbjct: 661  IDELHMVGDQHRGYLLELMLTKLRYAAGEGSSESSSGESSGTSSGKADPAHGLQIVGMSA 720

Query: 815  TMPNVAAVADWLQAALYQTDFRPVPLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDH 874
            TMPNV AVADWLQAALYQT+FRPVPLEEYIKVG+TIYN+ +++VRTI K A++GG+DPDH
Sbjct: 721  TMPNVGAVADWLQAALYQTEFRPVPLEEYIKVGSTIYNKKMEVVRTIPKAADMGGKDPDH 780

Query: 875  IVELCNEVVEEGHSVLIFCSSRKGCESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDAL 934
            IVELCNEVV+EG+SVLIFCSSRKGCESTA+H+SK +K   V +  ENSEF DI SA+DAL
Sbjct: 781  IVELCNEVVQEGNSVLIFCSSRKGCESTARHISKLIKNVPVNVDGENSEFMDIRSAIDAL 840

Query: 935  RRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPA 994
            RR PSG+DPVLEET PSGVAYHHAGLTVEERE+VETCYRKGL+RVLTATSTLAAGVNLPA
Sbjct: 841  RRSPSGVDPVLEETLPSGVAYHHAGLTVEEREIVETCYRKGLVRVLTATSTLAAGVNLPA 900

Query: 995  RRVIFRQPKIGRDFIDGARYRQMAGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPP 1054
            RRVIFRQP IGRDFIDG RY+QM+GRAGRTGIDTKG+SVLIC+P E+KRI  LLNE+CPP
Sbjct: 901  RRVIFRQPMIGRDFIDGTRYKQMSGRAGRTGIDTKGDSVLICKPGELKRIMALLNETCPP 960

Query: 1055 LQSCLSEDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWL 1114
            LQSCLSEDKNGMTHAILEVVAGGIVQTA DIHRYVRCTLLNSTKPFQDVVKSAQ+SLRWL
Sbjct: 961  LQSCLSEDKNGMTHAILEVVAGGIVQTAKDIHRYVRCTLLNSTKPFQDVVKSAQDSLRWL 1020

Query: 1115 CHGKFLEWNGDTKLYSTTPLGRASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLV 1174
            CH KFLEWN +TKLY+TTPLGR SFGSSL PEESLIVLDDL RAREG V+ASDLHLVYLV
Sbjct: 1021 CHRKFLEWNEETKLYTTTPLGRGSFGSSLCPEESLIVLDDLLRAREGLVMASDLHLVYLV 1080

Query: 1175 TPINVDVEPDWELYYERFMGLPSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSV 1234
            TPINV VEP+WELYYERFM L  L+QSVGNRVGV EPFLMRMAHGA +R  N        
Sbjct: 1081 TPINVGVEPNWELYYERFMELSPLEQSVGNRVGVVEPFLMRMAHGATVRTLN-------- 1140

Query: 1235 GNRVGVTEPFLMRMAHGAPIRRANISRNGVVGLRTKRDE-HGCMYDDRPSEEQTIRVCKR 1294
                                R  ++ +N    LR + D  HG       S+EQ +RVCKR
Sbjct: 1141 --------------------RPQDVKKN----LRGEYDSRHGSTSMKMLSDEQMLRVCKR 1200

Query: 1295 FYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLV 1354
            F+VALILS+LVQE  + EVCEAFKVARGMVQALQE+AGRF+SMVSVFCERLGWHDLEGLV
Sbjct: 1201 FFVALILSKLVQEASVTEVCEAFKVARGMVQALQENAGRFSSMVSVFCERLGWHDLEGLV 1260

Query: 1355 AKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLAIAEASDAELVKALFESAS 1414
            AKFQNRVSFGVRAEIVELT+IPY+KGSRARALYKAGLRT  AIAEAS  E+VKALFES++
Sbjct: 1261 AKFQNRVSFGVRAEIVELTSIPYIKGSRARALYKAGLRTSQAIAEASIPEIVKALFESSA 1320

Query: 1415 WTAEGELNKTCLFVCADSGQQVCIESTAQKRMHVGIARKIKHGARKVVLDKAEEARIAAF 1474
            W AEG                     T Q+R+H+G+A+KIK+GARK+VL+KAEEAR AAF
Sbjct: 1321 WAAEG---------------------TGQRRIHLGLAKKIKNGARKIVLEKAEEARAAAF 1380

Query: 1475 SAFKSLGFTVPQISRPLSASADGNITAQVAASIPSEIDTLNRVVSTRQMEHALTKSCFGG 1534
            SAFKSLG  V ++S+PL  +   ++  Q          ++      + +E  +    F  
Sbjct: 1381 SAFKSLGLDVNELSKPLPLAPASSLNGQETTERDISRGSVGPDGLQQSIEGHMECENFDM 1440

Query: 1535 TSSSEK----VGGKNLSETGTISVEVKPPNF-GVNPLVNVEG----SAIQESNTVVECAG 1594
             +  EK    +G   L  +  I++  + PNF  +   V   G    S +      +    
Sbjct: 1441 DNHREKPSEVLGDATLGVSSEINLTSRLPNFRPIGTAVGTNGPSAVSILSSDTFPIPVYD 1500

Query: 1595 KVDVTISNHMERIAQREQHSSVLHPPKRDSSSMKGPIHAANTSGGFESFLDLWDASQEFF 1654
              ++   +++E+   R  H  +     +D +  KGP+ A N SGGF+SFL+LW ++ EFF
Sbjct: 1501 NREIKPKDNVEQHLTRNDHIPL--SSNKDGTGEKGPVTAGNISGGFDSFLELWGSAGEFF 1560

Query: 1655 FDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDLLGPKS-GKGLYPDDRTSGD- 1714
            FDL+Y K  ++NS + +E+HGIAICW  SPVYYVNL KDL   +   K    +D   G  
Sbjct: 1561 FDLHYNKLQDLNSRISYEIHGIAICWNCSPVYYVNLNKDLPNLECVEKQKLIEDAVIGKS 1620

Query: 1715 -------------------------------------QVQVLKCPGVSIQKLGFLNSARR 1774
                                                 Q+QVLK P +SIQ+   LN    
Sbjct: 1621 EVLASHNMLDVIKSRWNKISKIMGNVNTRKFTWNLKVQIQVLKSPAISIQRCTRLN-LPE 1680

Query: 1775 NMGLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEKEVKKRLSGEAASAA 1834
             +  +LVDGS+L++  +H S+ IDM IV WILWPD+ER+S PN++KEVKKRLS EAA AA
Sbjct: 1681 GIRDELVDGSWLMMPPLHTSHTIDMSIVIWILWPDEERHSNPNIDKEVKKRLSPEAAEAA 1740

Query: 1835 NRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALNNIEIPLVSILADME 1894
            NRSG+W+NQ+RRVAHNGCCRRVAQTRALCS LWK+++SE+LL+AL  IE+PLV++LADME
Sbjct: 1741 NRSGRWRNQIRRVAHNGCCRRVAQTRALCSALWKILVSEELLQALTTIEMPLVNVLADME 1800

Query: 1895 TWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANVLYGHLKLSIPEG 1954
             WGIG+D+EGC+RARN+L  KL+ LEK+A+ LAGM+FSL+  ADIANVL+G LKL IPE 
Sbjct: 1801 LWGIGIDIEGCLRARNILRDKLRSLEKKAFELAGMTFSLHNPADIANVLFGQLKLPIPEN 1860

Query: 1955 FNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLSARTQKYTL 2014
             +KGK HPSTDKHCLDLLRNEHP+VP+IKEHRTLAKL NCTLGSICSLAKL   TQ+YTL
Sbjct: 1861 QSKGKLHPSTDKHCLDLLRNEHPVVPIIKEHRTLAKLLNCTLGSICSLAKLRLSTQRYTL 1920

Query: 2015 HGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMNED------DVDHCKINARDFFISTQEN 2074
            HG WLQTSTATGRLS+EEPNLQ VEH V+FK++++      D D  KINARDFF+ TQEN
Sbjct: 1921 HGRWLQTSTATGRLSIEEPNLQSVEHEVEFKLDKNGRDVSSDADRYKINARDFFVPTQEN 1980

Query: 2075 WLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDSIGPHERDQT 2134
            WLL++ADYSQIELRLMAHFS+DSSLI  LS+P GDVFTMIAA+WTGK EDS+ PH+RDQT
Sbjct: 1981 WLLLTADYSQIELRLMAHFSRDSSLISKLSQPEGDVFTMIAAKWTGKAEDSVSPHDRDQT 2040

Query: 2135 KRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTFCRQKGYVET 2194
            KRL+YGILYGMGA  LA QLEC+ DEA EKIRSFKSSFP V SWL+E ++FC++KGY++T
Sbjct: 2041 KRLIYGILYGMGANRLAEQLECTSDEAKEKIRSFKSSFPAVTSWLNETISFCQEKGYIQT 2100

Query: 2195 LKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMINIYSVIGTDA 2254
            LKGRRRFLSKI   N+KEKSKAQRQAVNS+CQ      GSAADIIK+AMINIYS I  D 
Sbjct: 2101 LKGRRRFLSKIKFGNAKEKSKAQRQAVNSMCQ------GSAADIIKIAMINIYSAIAEDV 2154

BLAST of Cla022227 vs. Swiss-Prot
Match: DPOLQ_HUMAN (DNA polymerase theta OS=Homo sapiens GN=POLQ PE=1 SV=2)

HSP 1 Score: 557.4 bits (1435), Expect = 7.3e-157
Identity = 334/844 (39.57%), Positives = 475/844 (56.28%), Query Frame = 1

Query: 574  ETTPSSSVRHKDWLDLSCW-LPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCA 633
            E  P+     +D L L+ W LP  +   Y   G+ K+  WQ ECL +  VL+ +NLVY A
Sbjct: 56   ECKPTVPDYERDKLLLANWGLPKAVLEKYHSFGVKKMFEWQAECLLLGQVLEGKNLVYSA 115

Query: 634  STSAGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQ 693
             TSAGK+ VAE+L+L+RV+   K AL +LP+VS+  EK  +L  L + +   V  Y G+ 
Sbjct: 116  PTSAGKTLVAELLILKRVLEMRKKALFILPFVSVAKEKKYYLQSLFQEVGIKVDGYMGST 175

Query: 694  GGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLT 753
                      +AVCTIE+AN LINRL+EE ++  +G++V+DELHM+GD  RGYLLELLLT
Sbjct: 176  SPSRHFSSLDIAVCTIERANGLINRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLT 235

Query: 754  KLRYAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDF 813
            K+ Y   +    S+S ++   SS     ++ +QIVGMSAT+PN+  VA WL A LY TDF
Sbjct: 236  KICYITRK----SASCQADLASS----LSNAVQIVGMSATLPNLELVASWLNAELYHTDF 295

Query: 814  RPVPLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSS 873
            RPVPL E +KVGN+IY+ S+ +VR       + G D DH+V LC E + + HSVL+FC S
Sbjct: 296  RPVPLLESVKVGNSIYDSSMKLVREFEPMLQVKG-DEDHVVSLCYETICDNHSVLLFCPS 355

Query: 874  RKGCESTAKHVSKFLKKFSVKIHNENS-------------EFTDIFSAVDALRRCPSGLD 933
            +K CE  A  +++        +H++               E  ++   +D LRR PSGLD
Sbjct: 356  KKWCEKLADIIAREF----YNLHHQAEGLVKPSECPPVILEQKELLEVMDQLRRLPSGLD 415

Query: 934  PVLEETFPSGVAYHHAGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQP 993
             VL++T P GVA+HHAGLT EER+++E  +R+GL+RVL ATSTL++GVNLPARRVI R P
Sbjct: 416  SVLQKTVPWGVAFHHAGLTFEERDIIEGAFRQGLIRVLAATSTLSSGVNLPARRVIIRTP 475

Query: 994  KIGRDFIDGARYRQMAGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLS-- 1053
              G   +D   Y+QM GRAGR G+DT GES+LIC+  E  +   LL  S  P++SCL   
Sbjct: 476  IFGGRPLDILTYKQMVGRAGRKGVDTVGESILICKNSEKSKGIALLQGSLKPVRSCLQRR 535

Query: 1054 ---EDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLN-STKPFQDVVKSAQESLR---- 1113
               E    M  AILE++ GG+  T+ D+H Y  CT L  S K  +  ++  QES++    
Sbjct: 536  EGEEVTGSMIRAILEIIVGGVASTSQDMHTYAACTFLAASMKEGKQGIQRNQESVQLGAI 595

Query: 1114 -----WLCHGKFLEWNG-----DTKLYSTTPLGRASFGSSLSPEESLIVLDDLSRAREGF 1173
                 WL   +F++        + K+Y  T LG A+  SSLSP ++L +  DL RA +GF
Sbjct: 596  EACVMWLLENEFIQSTEASDGTEGKVYHPTHLGSATLSSSLSPADTLDIFADLQRAMKGF 655

Query: 1174 VLASDLHLVYLVTPINVDVEPDWEL--YYERFMGLPSLDQSVGNRVGVTEPFLMRMAHGA 1233
            VL +DLH++YLVTP+      DW    +Y  F     L  S+                  
Sbjct: 656  VLENDLHILYLVTPMF----EDWTTIDWYRFFCLWEKLPTSMKR---------------- 715

Query: 1234 PIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVVGLRTKRDEHGCMYDD 1293
                         V   VGV E FL R   G  + R            T+R         
Sbjct: 716  -------------VAELVGVEEGFLARCVKGKVVAR------------TER--------- 775

Query: 1294 RPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFASMVSVF 1353
               + + + + KRF+ +L+L  L+ E P+ E+ + +   RG +Q+LQ+SA  +A M++VF
Sbjct: 776  ---QHRQMAIHKRFFTSLVLLDLISEVPLREINQKYGCNRGQIQSLQQSAAVYAGMITVF 829

Query: 1354 CERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLAIAEAS 1382
              RLGWH++E L+++FQ R++FG++ E+ +L  +  +   RAR LY +G  T   +A A+
Sbjct: 836  SNRLGWHNMELLLSQFQKRLTFGIQRELCDLVRVSLLNAQRARVLYASGFHTVADLARAN 829


HSP 2 Score: 232.6 bits (592), Expect = 4.1e-59
Identity = 211/718 (29.39%), Positives = 320/718 (44.57%), Query Frame = 1

Query: 1642 GIAICWENSPVYYVNLPKDLLGPKSGKGLYPDDRTSGDQVQVLKCPGVSIQKLGFLNSAR 1701
            G+A+CW     YY +L K+    +    L P    S D    LK       ++ +L S  
Sbjct: 1902 GLAVCWGGRDAYYFSLQKEQKHSEISASLVPP---SLDPSLTLK------DRMWYLQSCL 1961

Query: 1702 RNMGLK--------LVDGSYLVLSRVHIS---NVIDMCIVAWILWPDDERNSTPNLEKEV 1761
            R    K         +    ++L    IS   +  D  +  W+L PD +    P L   V
Sbjct: 1962 RKESDKECSVVIYDFIQSYKILLLSCGISLEQSYEDPKVACWLLDPDSQE---PTLHSIV 2021

Query: 1762 KKRLSGEA----ASAANRSGQWKNQMRRVAHNGCCRRVAQTRAL---CSVLWKLIISEKL 1821
               L  E         ++  Q         H+G  R   ++  +    + L  L+  E L
Sbjct: 2022 TSFLPHELPLLEGMETSQGIQSLGLNAGSEHSGRYRASVESILIFNSMNQLNSLLQKENL 2081

Query: 1822 LEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYA 1881
             +    +E+P    LA +E  GIG     C   ++++  KL  +E +AY+LAG SFS  +
Sbjct: 2082 QDVFRKVEMPSQYCLALLELNGIGFSTAECESQKHIMQAKLDAIETQAYQLAGHSFSFTS 2141

Query: 1882 AADIANVLYGHLKL----------------SIPEGFNKGKQ-----HPSTDKHCLDLLRN 1941
            + DIA VL+  LKL                S   G + G++       ST K  L+ L+ 
Sbjct: 2142 SDDIAEVLFLELKLPPNREMKNQGSKKTLGSTRRGIDNGRKLRLGRQFSTSKDVLNKLKA 2201

Query: 1942 EHPIVPVIKEHRTLAKLFNCTLGSICSLAKLSARTQKYTLHGHWL---------QTSTAT 2001
             HP+  +I E R +            ++ K+    Q+      +L         Q+ TAT
Sbjct: 2202 LHPLPGLILEWRRITN----------AITKVVFPLQREKCLNPFLGMERIYPVSQSHTAT 2261

Query: 2002 GRLSMEEPNLQCVEHAVDFKM----NEDDVDHC-----------------KINAR----- 2061
            GR++  EPN+Q V    + KM     E                        +N R     
Sbjct: 2262 GRITFTEPNIQNVPRDFEIKMPTLVGESPPSQAVGKGLLPMGRGKYKKGFSVNPRCQAQM 2321

Query: 2062 ---------DFFISTQENWL------LVSADYSQIELRLMAHFSKDSSLIELLSKPHGDV 2121
                      F IS +  ++      +++ADYSQ+ELR++AH S D  LI++L+    DV
Sbjct: 2322 EERAADRGMPFSISMRHAFVPFPGGSILAADYSQLELRILAHLSHDRRLIQVLNTG-ADV 2381

Query: 2122 FTMIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKS 2181
            F  IAA W     +S+G   R Q K++ YGI+YGMGAK+L  Q+   +++A   I SFKS
Sbjct: 2382 FRSIAAEWKMIEPESVGDDLRQQAKQICYGIIYGMGAKSLGEQMGIKENDAACYIDSFKS 2441

Query: 2182 SFPGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFF 2241
             + G+  ++ E V  C++ G+V+T+ GRRR+L  I   N   K+ A+RQA+N+I Q    
Sbjct: 2442 RYTGINQFMTETVKNCKRDGFVQTILGRRRYLPGIKDNNPYRKAHAERQAINTIVQ---- 2501

Query: 2242 YWGSAADIIKVAMINIYSVI--------------GTDAPDPTGLPAANTNILRG-HCRI- 2251
              GSAADI+K+A +NI   +              G    D TGL  +    L+G  C I 
Sbjct: 2502 --GSAADIVKIATVNIQKQLETFHSTFKSHGHREGMLQSDQTGL--SRKRKLQGMFCPIR 2561

BLAST of Cla022227 vs. Swiss-Prot
Match: DPOLQ_MOUSE (DNA polymerase theta OS=Mus musculus GN=Polq PE=1 SV=2)

HSP 1 Score: 551.6 bits (1420), Expect = 4.0e-155
Identity = 332/832 (39.90%), Positives = 469/832 (56.37%), Query Frame = 1

Query: 585  DWLDLSCW-LPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGKSFVAE 644
            D L L+ W LP  +   Y   G+ K+  WQ ECL +  VL+ +NLVY A TSAGK+ VAE
Sbjct: 66   DQLLLANWGLPKAVLEKYHSFGVRKMFEWQAECLLLGHVLEGKNLVYSAPTSAGKTLVAE 125

Query: 645  ILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLPKDTSV 704
            +L+L+RV+ T K AL +LP+VS+  EK  +L  L + +   V  Y G+           +
Sbjct: 126  LLILKRVLETRKKALFILPFVSVAKEKKCYLQSLFQEVGLKVDGYMGSTSPTGQFSSLDI 185

Query: 705  AVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAAGEGNL 764
            AVCTIE+AN L+NRL+EE ++  +G++V+DELHM+GD  RGYLLELLLTK+ Y   +   
Sbjct: 186  AVCTIERANGLVNRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKICYVTRKSA- 245

Query: 765  DSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLEEYIKV 824
             S   ES+ T S      + +QIVGMSAT+PN+  VA WL A LY TDFRPVPL E IK+
Sbjct: 246  -SHQAESASTLS------NAVQIVGMSATLPNLQLVASWLNAELYHTDFRPVPLLESIKI 305

Query: 825  GNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCESTAKHV 884
            GN+IY+ S+ +VR       + G D DHIV LC E +++ HSVLIFC S+K CE  A  +
Sbjct: 306  GNSIYDSSMKLVREFQPLLQVKG-DEDHIVSLCYETIQDNHSVLIFCPSKKWCEKVADII 365

Query: 885  SKFLKKFSVKIHN--ENSEFTDIF-------SAVDALRRCPSGLDPVLEETFPSGVAYHH 944
            ++       +     ++SEF  +          +D L+R PSGLD VL+ T P GVA+HH
Sbjct: 366  AREFYNLHHQPEGLVKSSEFPPVILDQKSLLEVMDQLKRSPSGLDSVLKNTVPWGVAFHH 425

Query: 945  AGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQM 1004
            AGLT EER+++E  +R+G +RVL ATSTL++GVNLPARRVI R P      +D   Y+QM
Sbjct: 426  AGLTFEERDIIEGAFRQGFIRVLAATSTLSSGVNLPARRVIIRTPIFSGQPLDILTYKQM 485

Query: 1005 AGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLS---EDKNGMTHAILEVV 1064
             GRAGR G+DT GES+L+C+  E  +   LL  S  P+ SCL    E    M  AILE++
Sbjct: 486  VGRAGRKGVDTMGESILVCKNSEKSKGIALLQGSLEPVHSCLQRQGEVTASMIRAILEII 545

Query: 1065 AGGIVQTATDIHRYVRCTLLNST--KPFQDVVKSAQES--------LRWLCHGKFLEWN- 1124
             GG+  T+ D+  Y  CT L +   +  Q + ++  ++        + WL   +F++   
Sbjct: 546  VGGVASTSQDMQTYAACTFLAAAIQEGKQGMQRNQDDAQLGAIDACVTWLLENEFIQVAE 605

Query: 1125 -GDT---KLYSTTPLGRASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINV 1184
             GD    K+Y  T LG A+  SSLSP ++L +  DL RA +GFVL +DLH+VYLVTP+  
Sbjct: 606  PGDGTGGKVYHPTHLGSATLSSSLSPTDTLDIFADLQRAMKGFVLENDLHIVYLVTPV-- 665

Query: 1185 DVEPDWELYYERFMGLPSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVG 1244
                        F    S+D      +    P  M+                  V   VG
Sbjct: 666  ------------FEDWISIDWYRFFCLWEKLPTSMKR-----------------VAELVG 725

Query: 1245 VTEPFLMRMAHGAPIRRANISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALI 1304
            V E FL R   G  + R            T+R            + + + + KRF+ +L+
Sbjct: 726  VEEGFLARCVKGKVVAR------------TER------------QHRQMAIHKRFFTSLV 785

Query: 1305 LSRLVQETPIPEVCEAFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNR 1364
            L  L+ E P+ ++ + +   RG +Q+LQ+SA  +A M++VF  RLGWH++E L+++FQ R
Sbjct: 786  LLDLISEIPLKDINQKYGCNRGQIQSLQQSAAVYAGMITVFSNRLGWHNMELLLSQFQKR 833

Query: 1365 VSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLAIAEASDAELVKALFES 1389
            ++FG++ E+ +L  +  +   RAR LY +G  T   +A A  AE+  AL  S
Sbjct: 846  LTFGIQRELCDLIRVSLLNAQRARFLYASGFLTVADLARADSAEVEVALKNS 833


HSP 2 Score: 179.5 bits (454), Expect = 4.2e-43
Identity = 108/273 (39.56%), Positives = 157/273 (57.51%), Query Frame = 1

Query: 1998 LLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDSIGPHERDQTK 2057
            L+++ADYSQ+ELR++AH S+D  LI++L+    DVF  IAA W     D++G   R   K
Sbjct: 2279 LILAADYSQLELRILAHLSRDCRLIQVLNTG-ADVFRSIAAEWKMIEPDAVGDDLRQHAK 2338

Query: 2058 RLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTFCRQKGYVETL 2117
            ++ YGI+YGMGAK+L  Q+   +++A   I SFKS + G+  ++ + V  CR+ G+VET+
Sbjct: 2339 QICYGIIYGMGAKSLGEQMGIKENDAASYIDSFKSRYKGINHFMRDTVKNCRKNGFVETI 2398

Query: 2118 KGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMINIYSVIGTDAP 2177
             GRRR+L  I   N   K+ A+RQA+N+  Q      GSAADI+K+A +NI   + T   
Sbjct: 2399 LGRRRYLPGIKDDNPYHKAHAERQAINTTVQ------GSAADIVKIATVNIQKQLETFRS 2458

Query: 2178 --------------DPTGLPAANTNILRG-HCRI-----VLQVHDELVLEVDPSVVKEAA 2237
                          D TGL       L+G  C +     +LQ+HDEL+ EV    V + A
Sbjct: 2459 TFKSHGHRESMLQNDRTGLLPKRK--LKGMFCPMRGGFFILQLHDELLYEVAEEDVVQVA 2518

Query: 2238 ALLQKSMENAASLLVPLQVKLKVGRTWGSLEPF 2251
             +++  ME A  L V L+VK+K+G +WG L+ F
Sbjct: 2519 QIVKNEMECAIKLSVKLKVKVKIGASWGELKDF 2542


HSP 3 Score: 80.5 bits (197), Expect = 2.6e-13
Identity = 93/365 (25.48%), Positives = 149/365 (40.82%), Query Frame = 1

Query: 1642 GIAICWENSPVYYVNLPKDLLGPKSGKGLYPDDRTSGDQVQV-LKCPGVSIQKLGFLNSA 1701
            G+A+CW     YY++L K+    +    L P    +   V+  ++C    +QK    +  
Sbjct: 1857 GLAVCWGAKDAYYLSLQKEQKQSEISPSLAPPPLDATLTVKERMECLQSCLQKKS--DRE 1916

Query: 1702 RRNMGLKLVDGSYLVLSRVHIS---NVIDMCIVAWILWPDDERNSTPNLEKEVKKRLSGE 1761
            R  +    +    ++L    IS   +  D  +  W+L PD +    P L   V   L  E
Sbjct: 1917 RSVVTYDFIQTYKVLLLSCGISLEPSYEDPKVACWLLDPDSKE---PTLHSIVTSFLPHE 1976

Query: 1762 AASAANRSG----QWKNQMRRVAHNGCCRRVAQTRAL---CSVLWKLIISEKLLEALNNI 1821
             A           Q         H+G  R   ++  +    + L  L+  E L +    +
Sbjct: 1977 LALLEGMETGPGIQSLGLNVNTEHSGRYRASVESVLIFNSMNQLNSLLQKENLHDIFCKV 2036

Query: 1822 EIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANV 1881
            E+P    LA +E  GIG     C   ++++  KL  +E +AY+LAG SFS  +A DIA V
Sbjct: 2037 EMPSQYCLALLELNGIGFSTAECESQKHVMQAKLDAIETQAYQLAGHSFSFTSADDIAQV 2096

Query: 1882 LYGHLKL----------------SIPEGFNKGK-----QHPSTDKHCLDLLRNEHPIVPV 1941
            L+  LKL                S   G   G+     +  ST K  L+ L+  HP+  +
Sbjct: 2097 LFLELKLPPNGEMKTQGSKKTLGSTRRGNESGRRMRLGRQFSTSKDILNKLKGLHPLPGL 2156

Query: 1942 IKEHRTLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHA 1975
            I E R ++      +  +     L+   +   ++    Q+ TATGR++  EPN+Q V   
Sbjct: 2157 ILEWRRISNAITKVVFPLQREKHLNPLLRMERIY-PVSQSHTATGRITFTEPNIQNVPRD 2215

BLAST of Cla022227 vs. Swiss-Prot
Match: DPOLQ_DROME (DNA polymerase theta OS=Drosophila melanogaster GN=mus308 PE=1 SV=1)

HSP 1 Score: 496.5 bits (1277), Expect = 1.5e-138
Identity = 335/942 (35.56%), Positives = 488/942 (51.80%), Query Frame = 1

Query: 486  ASNESEVNAYDLNEQSDCCYTNDSLPNHNDKTRDSDSLTKEKIHETNVTSSVPVVTEVKL 545
            A  ++ ++  +L+E S  C   D           S+ L ++ +H  +V +      E+  
Sbjct: 118  AGADAVLDQPNLDENSFLCPAQDE--------EASEQLKEDILHSHSVLAKQEFYQEI-- 177

Query: 546  NIFSPSDSITSDTAVHELRASTVHDFKEETTPSSSVRHKDWLDL---SCW-LPPEICSIY 605
               S      S  + ++LR S       E  P       D   L   S W LP  I + Y
Sbjct: 178  ---SQVTQNLSSMSPNQLRVSPNSSRIREAMPERPAMPLDLNTLRSISAWNLPMSIQAEY 237

Query: 606  KEKGITKLHPWQVECLKVDGVL-QRRNLVYCASTSAGKSFVAEILMLRRVISTGKMALLV 665
            K+KG+  +  WQVECL    +L +  NLVY A TSAGK+ V+EILML+ V+  GK  LL+
Sbjct: 238  KKKGVVDMFDWQVECLSKPRLLFEHCNLVYSAPTSAGKTLVSEILMLKTVLERGKKVLLI 297

Query: 666  LPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLP---KDTSVAVCTIEKANSLINR 725
            LP++S+  EK  ++  LL P    V  +YG   G T P   +   VA+CTIEKANS++N+
Sbjct: 298  LPFISVVREKMFYMQDLLTPAGYRVEGFYG---GYTPPGGFESLHVAICTIEKANSIVNK 357

Query: 726  LLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGK 785
            L+E+G+L  IG++V+DE+H++ D+ RGY+LELLL K+ Y +    L              
Sbjct: 358  LMEQGKLETIGMVVVDEVHLISDKGRGYILELLLAKILYMSRRNGLQ------------- 417

Query: 786  SDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLEEYIKVGNTIYNRSLDIVRT 845
                  IQ++ MSAT+ NV  +  WL A LY T++RPV L+E IKVG  IY+  L +VR 
Sbjct: 418  ------IQVITMSATLENVQLLQSWLDAELYITNYRPVALKEMIKVGTVIYDHRLKLVRD 477

Query: 846  ISKTANLGG---RDPDHIVELCNEVVEEGHSVLIFCSSRKGCESTAKHVSKFLKKFSVKI 905
            ++K   L      D D +  LC E + EG SV++FC S+  CE+ A  ++  +    V+I
Sbjct: 478  VAKQKVLLKGLENDSDDVALLCIETLLEGCSVIVFCPSKDWCENLAVQLATAIH---VQI 537

Query: 906  HNE---------NSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVV 965
             +E         N     I      LR  P+GLD V+ +      A+HHAGLT EER+++
Sbjct: 538  KSETVLGQRLRTNLNPRAIAEVKQQLRDIPTGLDGVMSKAITYACAFHHAGLTTEERDII 597

Query: 966  ETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRAGRTGIDT 1025
            E  ++ G L+VL ATSTL++GVNLPARRV+ R P  G   +    YRQM GRAGR G DT
Sbjct: 598  EASFKAGALKVLVATSTLSSGVNLPARRVLIRSPLFGGKQMSSLTYRQMIGRAGRMGKDT 657

Query: 1026 KGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTH---AILEVVAGGIVQTATDI 1085
             GES+LIC     +   +L+     P+ SCL  D +G TH   A+LEV++ G+  T  DI
Sbjct: 658  LGESILICNEINARMGRDLVVSELQPITSCL--DMDGSTHLKRALLEVISSGVANTKEDI 717

Query: 1086 HRYVRCTLLNSTKPFQDVVKSAQE---------SLRWLCHGKFLEWNG----DTKLYSTT 1145
              +V CTLL++ K F    K   E         +L +L   +F+        +T +Y  T
Sbjct: 718  DFFVNCTLLSAQKAFHAKEKPPDEESDANYINDALDFLVEYEFVRLQRNEERETAVYVAT 777

Query: 1146 PLGRASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERF 1205
             LG A   SS+ P + LI+  +L ++R  FVL S+LH VYLVTP +V  +          
Sbjct: 778  RLGAACLASSMPPTDGLILFAELQKSRRSFVLESELHAVYLVTPYSVCYQ---------- 837

Query: 1206 MGLPSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGA 1265
              L  +D  +   V + E         +P+++         VG  VGV + FL +   G 
Sbjct: 838  --LQDIDWLL--YVHMWEKL------SSPMKK---------VGELVGVRDAFLYKALRG- 897

Query: 1266 PIRRANISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEV 1325
                           +TK D             + +++ KRFY+AL L  LV ETPI  V
Sbjct: 898  ---------------QTKLDY------------KQMQIHKRFYIALALEELVNETPINVV 957

Query: 1326 CEAFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELT 1385
               +K  RGM+Q+LQ+ A  FA +V+ FC  L W  L  +V++F++R+ FG+  ++++L 
Sbjct: 958  VHKYKCHRGMLQSLQQMASTFAGIVTAFCNSLQWSTLALIVSQFKDRLFFGIHRDLIDLM 962

Query: 1386 TIPYVKGSRARALYKAGLRTPLAIAEASDAELVKALFESASW 1392
             IP +   RARAL+ AG+ + + +A A   EL K L+ S S+
Sbjct: 1018 RIPDLSQKRARALFDAGITSLVELAGADPVELEKVLYNSISF 962


HSP 2 Score: 223.8 bits (569), Expect = 1.9e-56
Identity = 155/446 (34.75%), Positives = 232/446 (52.02%), Query Frame = 1

Query: 1803 LLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLY 1862
            LL+  ++IE+P+   L  ME  G     +   +    +   +K +E + Y   G  F+L 
Sbjct: 1646 LLKFFHDIEMPIQLTLCQMELVGFPAQKQRLQQLYQRMVAVMKKVETKIYEQHGSRFNLG 1705

Query: 1863 AAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNC 1922
            ++  +A VL  H          K K   +T +  L+ L +  PI  +I  +R L+ L   
Sbjct: 1706 SSQAVAKVLGLH---------RKAKGRVTTSRQVLEKLNS--PISHLILGYRKLSGLL-- 1765

Query: 1923 TLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDDVDHC 1982
                  S+  L    Q   +HG  + T TATGR+SM EPNLQ V      ++  D V   
Sbjct: 1766 ----AKSIQPLMECCQADRIHGQSI-TYTATGRISMTEPNLQNVAKEFSIQVGSDVVH-- 1825

Query: 1983 KINARDFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTG 2042
             I+ R  F+ T E+  L+SAD+ Q+E+R++AH S+D +L+E++ K   D+F  IAA W  
Sbjct: 1826 -ISCRSPFMPTDESRCLLSADFCQLEMRILAHMSQDKALLEVM-KSSQDLFIAIAAHWNK 1885

Query: 2043 KTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLH 2102
              E  +    R+ TK++ YGI+YGMG ++LA  L CS+ EA      F  ++ G+  +  
Sbjct: 1886 IEESEVTQDLRNSTKQVCYGIVYGMGMRSLAESLNCSEQEARMISDQFHQAYKGIRDYTT 1945

Query: 2103 EAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIK 2162
              V F R KG+VET+ GRRR+L  INS     K++A+RQAVNS  Q      GSAADI K
Sbjct: 1946 RVVNFARSKGFVETITGRRRYLENINSDVEHLKNQAERQAVNSTIQ------GSAADIAK 2005

Query: 2163 VAMINIYSVIGTDAPDPTGLPAANTNILRGHCRIVLQVHDELVLEVDPSVVKEAAALLQK 2222
             A++ +   I         L   + ++      +V+ +HDEL+ EV     K+ A +L  
Sbjct: 2006 NAILKMEKNIERYREK---LALGDNSV-----DLVMHLHDELIFEVPTGKAKKIAKVLSL 2055

Query: 2223 SMENAASLLVPLQVKLKVGRTWGSLE 2249
            +MEN   L VPL+VKL++GR+WG  +
Sbjct: 2066 TMENCVKLSVPLKVKLRIGRSWGEFK 2055

BLAST of Cla022227 vs. Swiss-Prot
Match: HELQ_HUMAN (Helicase POLQ-like OS=Homo sapiens GN=HELQ PE=1 SV=2)

HSP 1 Score: 385.6 bits (989), Expect = 3.8e-105
Identity = 275/818 (33.62%), Positives = 421/818 (51.47%), Query Frame = 1

Query: 433  GSGGVRLGEPGA--------SMVSLRSELKELNREVSSLPVKHFDFSADDKNLDGSTLPY 492
            G  GV + EPGA        S       L+  + ++    +K  D+ +   N     LP+
Sbjct: 179  GYEGVTI-EPGADLLYDVPSSQAIYFENLQNSSNDLGDHSMKERDWKSSSHNTVNEELPH 238

Query: 493  CASNESEVNAYDLNEQSDCCYTNDSLPNHNDKTRDSDSLTKEKIHETNVTSSVPVVTEVK 552
                    N  +  +Q+D           + K R S  + + K  + ++ +++    + +
Sbjct: 239  --------NCIEQPQQND---------ESSSKVRTSSDMNRRKSIKDHLKNAMTGNAKAQ 298

Query: 553  LNIFSPSDSITSDTAVHELRAS--TVHDFKEETTPSSSVRHKDWLDLSCWLPPEICSIYK 612
              IFS S  +       E+  +  TV     +  P  S            LP ++  +Y 
Sbjct: 299  TPIFSRSKQLKDTLLSEEINVAKKTVESSSNDLGPFYS------------LPSKVRDLYA 358

Query: 613  E-KGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGKSFVAEILMLRRVISTGKMALLVL 672
            + KGI KL+ WQ  CL ++ V +R+NL+Y   TS GK+ VAEILML+ ++   K  L++L
Sbjct: 359  QFKGIEKLYEWQHTCLTLNSVQERKNLIYSLPTSGGKTLVAEILMLQELLCCRKDVLMIL 418

Query: 673  PYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLPK---DTSVAVCTIEKANSLINRL 732
            PYV+I  EK + L      L   V  Y G++G     K     S+ + TIEK +SL+N L
Sbjct: 419  PYVAIVQEKISGLSSFGIELGFFVEEYAGSKGRFPPTKRREKKSLYIATIEKGHSLVNSL 478

Query: 733  LEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGKS 792
            +E GR+  +G++V+DELHM+G+ +RG  LE+ L K+ Y             +S T+    
Sbjct: 479  IETGRIDSLGLVVVDELHMIGEGSRGATLEMTLAKILY-------------TSKTT---- 538

Query: 793  DPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLEEYIKVGNTIY--NRSLDIVR 852
                  QI+GMSAT+ NV  +  +LQA  Y + FRPV L+EY+K+ +TIY  +   +   
Sbjct: 539  ------QIIGMSATLNNVEDLQKFLQAEYYTSQFRPVELKEYLKINDTIYEVDSKAENGM 598

Query: 853  TISKTAN------LGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCESTAKHVSKFLKKF 912
            T S+  N      L   DPDH+V L  EV+   +S L+FC S+K CE+ A+ + KFL K 
Sbjct: 599  TFSRLLNYKYSDTLKKMDPDHLVALVTEVIPN-YSCLVFCPSKKNCENVAEMICKFLSKE 658

Query: 913  SVKIHNENSEFTDIFSAVDALRRCPSG-LDPVLEETFPSGVAYHHAGLTVEEREVVETCY 972
             +K H E  +       +  L+   +G L PVL+ T P GVAYHH+GLT +ER+++E  Y
Sbjct: 659  YLK-HKEKEKC----EVIKNLKNIGNGNLCPVLKRTIPFGVAYHHSGLTSDERKLLEEAY 718

Query: 973  RKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRAGRTGIDTKGES 1032
              G+L + T TSTLAAGVNLPARRVI R P + ++F+   +Y+QM GRAGR GIDT GES
Sbjct: 719  STGVLCLFTCTSTLAAGVNLPARRVILRAPYVAKEFLKRNQYKQMIGRAGRAGIDTIGES 778

Query: 1033 VLICRPEEIKRINELLNESCPPLQSCLS----EDKNGMTHAILEVVAGGIVQTATDIHRY 1092
            +LI + ++ +++ EL+ +   PL++C S    E   G+    L ++   I     DI+ +
Sbjct: 779  ILILQEKDKQQVLELITK---PLENCYSHLVQEFTKGIQTLFLSLIGLKIATNLDDIYHF 838

Query: 1093 VRCTLLNSTKPFQDVVKS----AQESLRWLCHGKFLE----WNGDTKL---YSTTPLGRA 1152
            +  T     +      KS      ESLR+L     L+    +  + ++   +  T LGRA
Sbjct: 839  MNGTFFGVQQKVLLKEKSLWEITVESLRYLTEKGLLQKDTIYKSEEEVQYNFHITKLGRA 898

Query: 1153 SFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINV--DVEPDWELYYERFMGL 1211
            SF  ++      I+  DL +  EG VL S LHL+YL TP ++     PDW +Y+ +F  L
Sbjct: 899  SFKGTIDLAYCDILYRDLKKGLEGLVLESLLHLIYLTTPYDLVSQCNPDWMIYFRQFSQL 933


HSP 2 Score: 64.3 bits (155), Expect = 1.9e-08
Identity = 38/120 (31.67%), Positives = 64/120 (53.33%), Query Frame = 1

Query: 1267 VCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFASMVSVFCERL-GWHD 1326
            V  R Y++ +L  L++ET I  V E F + RG +Q L      F+S V  FCE L  +  
Sbjct: 931  VVNRLYLSFVLYTLLKETNIWTVSEKFNMPRGYIQNLLTGTASFSSCVLHFCEELEEFWV 990

Query: 1327 LEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLAIAEASDAELVKAL 1386
               L+ +   ++++ V+AE++ L  +  V   RA+ LY AG ++ + +A A+   LV+ +
Sbjct: 991  YRALLVELTKKLTYCVKAELIPLMEVTGVLEGRAKQLYSAGYKSLMHLANANPEVLVRTI 1050

BLAST of Cla022227 vs. TrEMBL
Match: A0A0A0LS46_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G375760 PE=4 SV=1)

HSP 1 Score: 3724.1 bits (9656), Expect = 0.0e+00
Identity = 1922/2210 (86.97%), Positives = 1984/2210 (89.77%), Query Frame = 1

Query: 100  KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
            +FYASKKRKPLT SLKSGSYDK+GKK+LEGSPGAKGTLDNYLV SQDHG+SD PSHSVRE
Sbjct: 12   QFYASKKRKPLTPSLKSGSYDKNGKKALEGSPGAKGTLDNYLVISQDHGSSDNPSHSVRE 71

Query: 160  NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
            NLS Q+LVKRNLLLKINSS RNEH E T SRGCD        KKRTLEDS+ETRSSTVK 
Sbjct: 72   NLSAQNLVKRNLLLKINSSFRNEHGETTSSRGCD--------KKRTLEDSFETRSSTVKS 131

Query: 220  MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
             A D G+TPCTEKPELKQFAADFLSLYCSNEL TTVSSP EQKVTFLKRHSSPS LEGEA
Sbjct: 132  TASDCGITPCTEKPELKQFAADFLSLYCSNELQTTVSSPVEQKVTFLKRHSSPSHLEGEA 191

Query: 280  KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKAP 339
            KLPKK+HSI GPSNA+ EPDSSNALS GNK+SNFVVETGDT SH P VLKAC+QKCN+AP
Sbjct: 192  KLPKKMHSIVGPSNAESEPDSSNALSEGNKESNFVVETGDTVSHHPAVLKACMQKCNQAP 251

Query: 340  RSPYCLTECKTPGLSTANTCFQETPKSGSSTFSPGEAFWKEAIVFADGLCAPSIDLTNCD 399
             SPYCLTECKTPGLST  T  ++TPKSGSSTFSPGEAFWKEAIV ADGL APSI L NCD
Sbjct: 252  TSPYCLTECKTPGLSTGTTFIRQTPKSGSSTFSPGEAFWKEAIVLADGLRAPSIALINCD 311

Query: 400  AEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKELNRE 459
            AE AN+ ESQS+TKKLPIP EPAQKRLKGQFGGGSGGVRLGEPGAS   LRS+LKEL+R 
Sbjct: 312  AEEANLVESQSNTKKLPIPEEPAQKRLKGQFGGGSGGVRLGEPGAS---LRSDLKELDRV 371

Query: 460  VSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTND-SLPNHNDKTR 519
            VSSLPVKHFDFSADDKNLD ST P CASNES+VNAYDLNEQSD CYT   SLP HNDKTR
Sbjct: 372  VSSLPVKHFDFSADDKNLDDSTSPCCASNESKVNAYDLNEQSDRCYTTHISLPKHNDKTR 431

Query: 520  DSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHELRASTVHDFKEETTPS 579
            DSDSLTKEKI ET VTSSVPVV EVKLNIFSPSDSITSDTA HELRAST+HD ++ETTPS
Sbjct: 432  DSDSLTKEKIQETIVTSSVPVVNEVKLNIFSPSDSITSDTAAHELRASTIHDSRDETTPS 491

Query: 580  SSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK 639
            SS RHKDWLDLSCWLPPEI SIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK
Sbjct: 492  SSTRHKDWLDLSCWLPPEISSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK 551

Query: 640  SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLP 699
            SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLE L KHVRSYYGNQGGGTLP
Sbjct: 552  SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLESLGKHVRSYYGNQGGGTLP 611

Query: 700  KDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA 759
            KDTSVAVCTIEKANSLINRLLEE RLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA
Sbjct: 612  KDTSVAVCTIEKANSLINRLLEECRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA 671

Query: 760  GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLE 819
            GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALY TDFRPVPLE
Sbjct: 672  GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYHTDFRPVPLE 731

Query: 820  EYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCES 879
            EYIKVGNTIYN+SLDIVRTISKTANLGGRDPDHIVELCNEVVE+GHSVLIFCSSRKGCES
Sbjct: 732  EYIKVGNTIYNKSLDIVRTISKTANLGGRDPDHIVELCNEVVEDGHSVLIFCSSRKGCES 791

Query: 880  TAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLT 939
            TAKHVSKFLKKFSVKI N+NSEFTDIFSA+DALRRCPSGLDPVLEETFPSGVAYHHAGLT
Sbjct: 792  TAKHVSKFLKKFSVKIQNDNSEFTDIFSAIDALRRCPSGLDPVLEETFPSGVAYHHAGLT 851

Query: 940  VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRA 999
            VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDGARYRQMAGRA
Sbjct: 852  VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGARYRQMAGRA 911

Query: 1000 GRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT 1059
            GRTGIDTKGESVLICRPEE+KRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT
Sbjct: 912  GRTGIDTKGESVLICRPEEVKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT 971

Query: 1060 ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS 1119
            ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS
Sbjct: 972  ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS 1031

Query: 1120 SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSLDQS 1179
            SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYE FMGL SLDQS
Sbjct: 1032 SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYELFMGLSSLDQS 1091

Query: 1180 VGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISR 1239
            VGNRVG TEPFLMRMAHGAP+RRANISRNGV+                            
Sbjct: 1092 VGNRVGATEPFLMRMAHGAPVRRANISRNGVA---------------------------- 1151

Query: 1240 NGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARG 1299
                GLRTKRDEH  +Y DRPSEEQTIRVCKRFYVALILSRLVQETPIPEVC+AFKVARG
Sbjct: 1152 ----GLRTKRDEHVGVYGDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCDAFKVARG 1211

Query: 1300 MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR 1359
            MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR
Sbjct: 1212 MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR 1271

Query: 1360 ARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIESTA 1419
            ARALYKAGLRTPLAIAEASDAELVKAL ESASWT E                    ESTA
Sbjct: 1272 ARALYKAGLRTPLAIAEASDAELVKALSESASWTTE--------------------ESTA 1331

Query: 1420 QKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNITAQ 1479
            QKRMHVG+ARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQIS PLSASADGNITAQ
Sbjct: 1332 QKRMHVGLARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISHPLSASADGNITAQ 1391

Query: 1480 VAASIPSEIDTLNRVVSTRQMEHALTKSCFGGTSSSEKVGGKNLSETGTISVEVKPPNFG 1539
            VA             V T+QME  LT SC GGTSSSEKV GKN S+TG IS++VK  N G
Sbjct: 1392 VA-------------VGTQQMERVLTLSCVGGTSSSEKVVGKNPSQTGAISIDVKQSNSG 1451

Query: 1540 VNPLVNVEGSAIQESNTVVECAGKVDVTISNHMERI----AQREQHSS-VLHPPKRDSSS 1599
            VNP VN EGSAIQ+SNTV ECAGKVDV IS+H+ERI    AQREQHSS VLH  KRD SS
Sbjct: 1452 VNPPVNAEGSAIQDSNTVGECAGKVDVAISSHLERITDKDAQREQHSSKVLHSLKRDGSS 1511

Query: 1600 MKGPIHAANTSGGFESFLDLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVY 1659
            MKGPI AA+TSGGFESFL+LWDASQEF+FDLYYTKRSEVNSVVPFELHGIAICWE SPVY
Sbjct: 1512 MKGPIQAASTSGGFESFLNLWDASQEFYFDLYYTKRSEVNSVVPFELHGIAICWEKSPVY 1571

Query: 1660 YVNLPKDLLGPKSGKGLYPDDRTSGD---------------------------------- 1719
            YVN+PKDLLGPKSGKGL PDD  SGD                                  
Sbjct: 1572 YVNIPKDLLGPKSGKGLCPDDSISGDQVDVSQNEHWFEMIEMRWKKINEIFTKKNVRKFA 1631

Query: 1720 -----QVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWIL 1779
                 QVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSY+VLSRVH+SNVIDMCIVAWIL
Sbjct: 1632 WNLKVQVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYIVLSRVHMSNVIDMCIVAWIL 1691

Query: 1780 WPDDERNSTPNLEKEVKKRLSGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL 1839
            WPDDERNST NLEKEVKKRLSGEAA+AANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL
Sbjct: 1692 WPDDERNSTLNLEKEVKKRLSGEAAAAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL 1751

Query: 1840 WKLIISEKLLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRL 1899
            WKLIISEKLL+ALNNIEIPLV ILADMETWGIGVDMEGCIRARNLLGKKL+CLEKEAYRL
Sbjct: 1752 WKLIISEKLLDALNNIEIPLVGILADMETWGIGVDMEGCIRARNLLGKKLRCLEKEAYRL 1811

Query: 1900 AGMSFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR 1959
            AGM+FSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR
Sbjct: 1812 AGMTFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR 1871

Query: 1960 TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKM 2019
            TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAV+FKM
Sbjct: 1872 TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVEFKM 1931

Query: 2020 NEDDVDHCKINARDFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFT 2079
            NEDDVDHCKINARDFFISTQENWLL+SADYSQIELRLMAHFSKDS LIELLS PHGDVFT
Sbjct: 1932 NEDDVDHCKINARDFFISTQENWLLLSADYSQIELRLMAHFSKDSLLIELLSIPHGDVFT 1991

Query: 2080 MIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSF 2139
            MIAARWTGKTEDSIG HERDQTKRLVYGILYGMGAK+LALQLECS+DEAVEKI+SFKSSF
Sbjct: 1992 MIAARWTGKTEDSIGSHERDQTKRLVYGILYGMGAKSLALQLECSRDEAVEKIQSFKSSF 2051

Query: 2140 PGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYW 2199
            PGVASWLHEAV FCRQKGYVETLKGRRRFLSKINSP SKEKSKAQRQAVNSICQ      
Sbjct: 2052 PGVASWLHEAVAFCRQKGYVETLKGRRRFLSKINSPISKEKSKAQRQAVNSICQ------ 2111

Query: 2200 GSAADIIKVAMINIYSVIGTDAPDPTGLPAANTNILRGHCRIVLQVHDELVLEVDPSVVK 2259
            GSAADIIK+AMI++YSVIGTDAPD T LPAAN+NILRGHCRIVLQVHDELVLEVDPS VK
Sbjct: 2112 GSAADIIKLAMIHVYSVIGTDAPDLTVLPAANSNILRGHCRIVLQVHDELVLEVDPSFVK 2139

Query: 2260 EAAALLQKSMENAASLLVPLQVKLKVGRTWGSLEPFLHDSFKIEVLVPGS 2265
            EAA+LLQKSMENAASLLVPLQVKLKVGRTWGSLE FL D+F+IE L PGS
Sbjct: 2172 EAASLLQKSMENAASLLVPLQVKLKVGRTWGSLETFLPDNFQIEALAPGS 2139

BLAST of Cla022227 vs. TrEMBL
Match: A0A067GWC2_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000107mg PE=4 SV=1)

HSP 1 Score: 2600.1 bits (6738), Expect = 0.0e+00
Identity = 1419/2248 (63.12%), Positives = 1663/2248 (73.98%), Query Frame = 1

Query: 100  KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
            +FYASKKRK  + S+KSG  +KD K ++E SP AKGTLDNYL  SQD G++     S + 
Sbjct: 12   QFYASKKRKSRSPSVKSGRAEKDAKITVEVSPSAKGTLDNYLKNSQDDGHT-----SKQS 71

Query: 160  NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
             LS  ++VKRNL L+I+  S++E  +  LS      A  + I + + ++     +S V  
Sbjct: 72   LLSRHEVVKRNLSLEIDKYSKDEKNQALLSDQAQPQATQKVISRCSSKEG----NSEVGC 131

Query: 220  MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
               DG      E  ELKQF  DFLSLYCS E+H++ SSP E K+   KRHSSPSLL GE 
Sbjct: 132  HMKDGSAH-IPESLELKQFPTDFLSLYCS-EIHSSASSPSEAKLKDHKRHSSPSLLGGED 191

Query: 280  -KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGD---------TDSHPPVVLK 339
             K+ KK + ++    +  +   SNA ++   QS F+V+TG+         TDS+  ++L+
Sbjct: 192  NKIAKKKYCVSNLLQSGEQTTCSNAKNIEETQSGFIVKTGNLVPNSSQRVTDSNASLLLQ 251

Query: 340  ACLQKCNKAPRSPYCLTECKTPGLSTANTCFQETPKS--GSSTFSPGEAFWKEAIVFADG 399
            A L+KC+K+ +S    T C TP  S   T  +ETPKS  G+S FSPGEAFW EAI  ADG
Sbjct: 252  ASLRKCDKSSKSTLNTTACYTPEPSIVKTYVRETPKSTCGNSIFSPGEAFWNEAIEIADG 311

Query: 400  LCAPSIDLTNCDAEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMV 459
              A +    +  AEG   ++SQ+              + K     G   V+  + G S+ 
Sbjct: 312  FFAHTDIGPSQIAEGIADSKSQNEINNSYNLRNKNYNKSKEMLNEGDSKVQHIKAGGSLK 371

Query: 460  SLRSELKELNREVSSLPVKHFDFSADDKNLDGSTLPYC-ASNESEVNAYDLNEQSDC-CY 519
             +  ++ +  +E+S LP+KH DF  +DKNL G T P C A++ SE   +     S+    
Sbjct: 372  QMGKDVIDSVKELSPLPIKHLDFLFEDKNLKG-TKPGCGAADTSEAMMFRDGVVSEKGSV 431

Query: 520  TNDSLPNHNDKTRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPS-DSITSDTAVHELR 579
            T+ S      K    ++   E I +     SV +V E KL+I S   DSITSD+  + ++
Sbjct: 432  THKSCQKIKFKCHHDNTSRTEGISDVQEKDSVLIVHERKLDISSQGIDSITSDSPTNVIK 491

Query: 580  ASTVHDFKEET-TPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVL 639
                ++  +E  TPSSS   KD LDLS WLP EICSIYK++GI+KL+PWQVECL VDGVL
Sbjct: 492  KPVGNEKSDEAGTPSSSGMLKDCLDLSSWLPSEICSIYKKRGISKLYPWQVECLHVDGVL 551

Query: 640  QRRNLVYCASTSAGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDK 699
            QRRNLVYCASTSAGKSFVAEILMLRR+ISTGKMALLVLPYVSICAEKA HL+VLLEPL +
Sbjct: 552  QRRNLVYCASTSAGKSFVAEILMLRRLISTGKMALLVLPYVSICAEKAEHLEVLLEPLGR 611

Query: 700  HVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTR 759
            HVRSYYGNQGGG+LPKDTSVAVCTIEKANSL+NR+LEEGRLSEIGIIVIDELHMV DQ R
Sbjct: 612  HVRSYYGNQGGGSLPKDTSVAVCTIEKANSLVNRMLEEGRLSEIGIIVIDELHMVADQNR 671

Query: 760  GYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWL 819
            GYLLELLLTKLRYAAGEG  DSSSGE+SGTSSGK+DPAHG+QIVGMSATMPNVAAVADWL
Sbjct: 672  GYLLELLLTKLRYAAGEGTSDSSSGENSGTSSGKADPAHGLQIVGMSATMPNVAAVADWL 731

Query: 820  QAALYQTDFRPVPLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEG 879
            QAALY+T+FRPVPLEEYIKVGN IY++ +D+VRTI   ANLGG+DPDHIVELC+EVV+EG
Sbjct: 732  QAALYETNFRPVPLEEYIKVGNAIYSKKMDVVRTILTAANLGGKDPDHIVELCDEVVQEG 791

Query: 880  HSVLIFCSSRKGCESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLE 939
            HSVLIFCSSRKGCESTA+HVSKFLKKFS+ +H+ +SEF DI SA+DALRRCP+GLDPVLE
Sbjct: 792  HSVLIFCSSRKGCESTARHVSKFLKKFSINVHSSDSEFIDITSAIDALRRCPAGLDPVLE 851

Query: 940  ETFPSGVAYHHAGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGR 999
            ET PSGVAYHHAGLTVEEREVVETCYRKGL+RVLTATSTLAAGVNLPARRVIFRQP+IGR
Sbjct: 852  ETLPSGVAYHHAGLTVEEREVVETCYRKGLVRVLTATSTLAAGVNLPARRVIFRQPRIGR 911

Query: 1000 DFIDGARYRQMAGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGM 1059
            DFIDG RYRQMAGRAGRTGIDTKGES+LIC+PEE+K+I  LLNESCPPL SCLSEDKNGM
Sbjct: 912  DFIDGTRYRQMAGRAGRTGIDTKGESMLICKPEEVKKIMGLLNESCPPLHSCLSEDKNGM 971

Query: 1060 THAILEVVAGGIVQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDT 1119
            THAILEVVAGGIVQTA DIHRYVRCTLLNSTKPFQDVVKSAQ+SLRWLCH KFLEWN DT
Sbjct: 972  THAILEVVAGGIVQTAEDIHRYVRCTLLNSTKPFQDVVKSAQDSLRWLCHRKFLEWNEDT 1031

Query: 1120 KLYSTTPLGRASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWE 1179
            KLYSTTPLGRA+FGSSL PEESLIVLDDLSRAREGFVLASDLHLVYL TPINV+VEPDWE
Sbjct: 1032 KLYSTTPLGRAAFGSSLCPEESLIVLDDLSRAREGFVLASDLHLVYLSTPINVEVEPDWE 1091

Query: 1180 LYYERFMGLPSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLM 1239
            LYYERF+ L +LDQSVGN+VGV+EP+LMRMAHGAP+R ++  R+                
Sbjct: 1092 LYYERFLELSALDQSVGNQVGVSEPYLMRMAHGAPMRISSKLRDST-------------- 1151

Query: 1240 RMAHGAPIRRANISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQE 1299
            +  HG    R  I+ N ++                 S+ QT+RVCKRFYVALILSRLVQE
Sbjct: 1152 KGLHGKLEYRLGITSNNML-----------------SDAQTLRVCKRFYVALILSRLVQE 1211

Query: 1300 TPIPEVCEAFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRA 1359
            TP+ EVCE FKVARGMVQALQE+AGRFASMVSVFCERLGW+DLEGL+AKFQNRVSFGVRA
Sbjct: 1212 TPVLEVCETFKVARGMVQALQENAGRFASMVSVFCERLGWYDLEGLIAKFQNRVSFGVRA 1271

Query: 1360 EIVELTTIPYVKGSRARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLF 1419
            EIVELTTIPYVKGSRARALYKAGLRTPLAIAEAS +E+VKALFES+SW AE         
Sbjct: 1272 EIVELTTIPYVKGSRARALYKAGLRTPLAIAEASISEIVKALFESSSWIAE--------- 1331

Query: 1420 VCADSGQQVCIESTAQKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQI 1479
                          AQ+R+ +G+A+KIK+GARK+VL+KAEEARIAAFSAFKSLG  VPQ 
Sbjct: 1332 --------------AQRRVQLGVAKKIKNGARKIVLEKAEEARIAAFSAFKSLGLNVPQF 1391

Query: 1480 SRPLSASADGNITA-QVAASIPSEIDTLNRVVSTRQMEHALTKSCFGGTSSSEKV----G 1539
            SRP+ ++A  N T  + AA+     D  +  +    MEH+  K        S+KV     
Sbjct: 1392 SRPILSTATENSTGEEEAATTAPRNDKSSSFIFPVPMEHS-DKPSLEANQISKKVDLESA 1451

Query: 1540 GKNLSET----------GTISVEVKPPNFGVNPLV------------NVEGSAIQESNTV 1599
            G+ L ET          G    E++      NP V            NV  S I+  +T 
Sbjct: 1452 GEKLLETSDNELSALVEGGSITELQQKFDAENPPVPFVGPGTGGVEFNVNASEIKIPDTT 1511

Query: 1600 --VECAGKVDVTISNHME---RIAQREQHSSVLHPPKRDSSSMKGPIHAANTSGGFESFL 1659
              V+       TI+++ +    +  R      L    +D +  KGPI+A N SGGF+ FL
Sbjct: 1512 LSVQLGKNAIGTITSNRDLDLEVQDRPNRDPCL--VNKDRACNKGPINAINASGGFDCFL 1571

Query: 1660 DLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDL---------- 1719
            D W+A+ EF+FD++Y K SE NS V FE+HG+A+CWENSPVYYVNLPKDL          
Sbjct: 1572 DRWEATHEFYFDIHYDKHSEANSGVLFEIHGLAVCWENSPVYYVNLPKDLWSDHRRKDRF 1631

Query: 1720 --LGPKSGKGLYPDDRTS---------GD----------------QVQVLKCPGVSIQKL 1779
               G      L P+ +           G+                Q+QVLK   VSIQ+ 
Sbjct: 1632 LIYGSSDKNVLTPEHQLEMIKQRWKRIGEIMEKRDVRKFTWNMKVQIQVLKHAAVSIQRF 1691

Query: 1780 GFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEKEVKKRL 1839
            G LN    ++GL+ V  S+L+LS VH+ + IDMCIV+WILWPDDER+S PNLEKEVKKRL
Sbjct: 1692 GGLNLVGTSLGLENVGSSFLLLSPVHLKDGIDMCIVSWILWPDDERSSNPNLEKEVKKRL 1751

Query: 1840 SGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALNNIEIPL 1899
            S EAA+AANRSG+WKNQMRR AHNGCCRRVAQTRALCSVLWKL++SE+L+EAL NIEIPL
Sbjct: 1752 SSEAAAAANRSGRWKNQMRRAAHNGCCRRVAQTRALCSVLWKLLVSEELIEALLNIEIPL 1811

Query: 1900 VSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANVLYGH 1959
            V++LADME WGIGVDMEGC++ARNLL KKL+ LEK+AY LAGM FSLY AADIANVLYGH
Sbjct: 1812 VNVLADMELWGIGVDMEGCLQARNLLQKKLRYLEKKAYTLAGMKFSLYTAADIANVLYGH 1871

Query: 1960 LKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLS 2019
            LKL IPEG NKGKQHPSTDKHCLDLLR+EHPIVPVIKEHRTLAKL NCTLGSICSLA++S
Sbjct: 1872 LKLPIPEGHNKGKQHPSTDKHCLDLLRHEHPIVPVIKEHRTLAKLLNCTLGSICSLARIS 1931

Query: 2020 ARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDD-------VDHCKINAR 2079
              TQKYTLHGHWLQTSTATGRLSMEEPNLQCVEH V+FKM+ +D       VDHCKINAR
Sbjct: 1932 MSTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHMVEFKMSNEDIYGGNAEVDHCKINAR 1991

Query: 2080 DFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDS 2139
            DFFI +QENW+L++ADYSQIELRLMAHFSKD +LI LLSKPHGDVFTMIAARWTG++EDS
Sbjct: 1992 DFFIPSQENWILLAADYSQIELRLMAHFSKDPALIGLLSKPHGDVFTMIAARWTGRSEDS 2051

Query: 2140 IGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTF 2199
            +G  ERDQTKRL+YGILYGMG  TL+ QL CS +EA EKI+SFKSSFPGVASWLH AV+ 
Sbjct: 2052 VGSQERDQTKRLIYGILYGMGPNTLSEQLNCSSNEAKEKIKSFKSSFPGVASWLHVAVSS 2111

Query: 2200 CRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMIN 2256
            C QKGYVE+LKGR+RFLSKI   N+KEKSKAQRQAVNSICQ      GSAADIIK+AMIN
Sbjct: 2112 CHQKGYVESLKGRKRFLSKIKFGNNKEKSKAQRQAVNSICQ------GSAADIIKIAMIN 2171

BLAST of Cla022227 vs. TrEMBL
Match: M5Y7D4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020963mg PE=4 SV=1)

HSP 1 Score: 2584.7 bits (6698), Expect = 0.0e+00
Identity = 1398/2234 (62.58%), Positives = 1652/2234 (73.95%), Query Frame = 1

Query: 99   SKFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVR 158
            ++F+ASKKRKPL+  LKSG  +KD K  +EGSP AKGTLDNYL+ SQ++     PS+ V 
Sbjct: 10   NQFFASKKRKPLSPVLKSGRNEKDVKVKVEGSPSAKGTLDNYLLASQENNIISEPSYKVC 69

Query: 159  ENLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVK 218
            ++L++QD V+RNL  +I++S ++E ++  LS    + A       +       T+   VK
Sbjct: 70   DSLAQQDQVRRNLTSEIDNSLKDEFKQLPLSSQLHSEANDVSQANQKETSRQLTKVGDVK 129

Query: 219  LMAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGE 278
                    T   ++ ELK FAADFLSLYCS +L    SS  E KV   KR +SPSLL+ E
Sbjct: 130  EYPA---FTEGEDRAELKDFAADFLSLYCS-DLQPNESSLSEMKVNDHKRQASPSLLDRE 189

Query: 279  AKLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKA 338
             K  KK H I   S+ + E   S+  S    QS+ V + G T  +  + L+  L+ C+  
Sbjct: 190  DKTFKKRHCITNQSHVEHETSYSSEKSSEAVQSDSVDKNGVTIVNELLELQPTLKACSNT 249

Query: 339  PRSPYCLTECKTPGLSTANTCFQETPKS--GSSTFSPGEAFWKEAIVFADGLCAPSIDLT 398
             +    + EC TPG  T  T  +ETPKS  GSS+FSPGEAFW +AI  ADGLCA +  + 
Sbjct: 250  AKLSLDMFECCTPGSLTRKTSVRETPKSTRGSSSFSPGEAFWDDAIQLADGLCAQAAGVI 309

Query: 399  NCDAEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKEL 458
            +  A+G   ++S  + +     G+  +   +G+        R+G+ G +   +    K+L
Sbjct: 310  SV-ADGQYRSKSSCNLRNARCDGKSKEILDEGE--------RMGK-GGNTGPMGKHRKDL 369

Query: 459  NREVSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTNDSLPNHNDK 518
            ++EVS LPVKHFDFS +DKNLD S   +  +   +  A+   EQS+    +     +   
Sbjct: 370  DKEVSPLPVKHFDFSCEDKNLDKSVPHHLDAYNLKSVAHVGGEQSESSLIDPRGLRNPMM 429

Query: 519  TRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHEL-RASTVHDFKEET 578
             R + S   +       T+SV  VT +KL++      +TS + V E+ + +  H+  E +
Sbjct: 430  IRCNKSQENQVTFRDQYTNSVNAVTNMKLDL--TGKDMTSYSPVDEVVKLTGNHESDEAS 489

Query: 579  TPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTS 638
            TPSS V  KD LDL+ WLPPEICS+Y++KGI+KL+PWQV+CL+V+GVLQRRNLVYCASTS
Sbjct: 490  TPSSFVPLKDHLDLNSWLPPEICSLYRKKGISKLYPWQVDCLQVEGVLQRRNLVYCASTS 549

Query: 639  AGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGG 698
            AGKSFVAEILMLRRV+S+G MA+LVLPYVSICAEKA HLDVLLEPL K VRSYYGNQGGG
Sbjct: 550  AGKSFVAEILMLRRVLSSGTMAILVLPYVSICAEKAEHLDVLLEPLGKRVRSYYGNQGGG 609

Query: 699  TLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLR 758
            TLPKDTSVAVCTIEKAN LINRLLEEGRLSEIGIIVIDELHMVGD +RGYLLELLLTKLR
Sbjct: 610  TLPKDTSVAVCTIEKANFLINRLLEEGRLSEIGIIVIDELHMVGDPSRGYLLELLLTKLR 669

Query: 759  YAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPV 818
            YAAGEGN +SSSGESSG SS K+DPAHG+QIVGMSATMPNVAAVADWLQAALYQT+FRPV
Sbjct: 670  YAAGEGNSESSSGESSGMSSCKADPAHGLQIVGMSATMPNVAAVADWLQAALYQTEFRPV 729

Query: 819  PLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKG 878
            PLEEYIKVGNT+YN+ ++IV+TI K  +L G+DPDH+VELCNEVV+EG SVLIFCSSRKG
Sbjct: 730  PLEEYIKVGNTLYNKKMEIVKTIPKATDLSGKDPDHVVELCNEVVQEGLSVLIFCSSRKG 789

Query: 879  CESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHA 938
            CESTA+HVS+FLKKFSV I + +S+F D+  A+DALRRCP+GLDPVLEET P+GVAYHHA
Sbjct: 790  CESTARHVSRFLKKFSVNIRSNDSQFKDVTLAIDALRRCPAGLDPVLEETLPAGVAYHHA 849

Query: 939  GLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMA 998
            GLTVEERE+VETCYR+GL+RVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDG RYRQMA
Sbjct: 850  GLTVEEREIVETCYRRGLVRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGTRYRQMA 909

Query: 999  GRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGI 1058
            GRAGRTGIDTKGESVLIC+PEEIKRI  ++NESC PL+SCLSED NGMTHAILEVVAGG+
Sbjct: 910  GRAGRTGIDTKGESVLICKPEEIKRIMGIINESCLPLRSCLSEDMNGMTHAILEVVAGGM 969

Query: 1059 VQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRAS 1118
            VQTA DIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCH KF+EWN DTKLYSTTPLGRA+
Sbjct: 970  VQTANDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHRKFVEWNDDTKLYSTTPLGRAA 1029

Query: 1119 FGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSL 1178
            FGSSL PEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVD+EPDWELYYERFM L +L
Sbjct: 1030 FGSSLCPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDMEPDWELYYERFMELSAL 1089

Query: 1179 DQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRAN 1238
            DQSVGNRVGVTEPFLMRMAHGAP+R +N  R                M+  HG    R  
Sbjct: 1090 DQSVGNRVGVTEPFLMRMAHGAPMRSSNRFRE--------------NMKAVHGKYENRPG 1149

Query: 1239 ISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKV 1298
            I+ N V+                  ++Q +RVCKRFYVALILSRLVQE  I EVCEAFKV
Sbjct: 1150 ITNNTVL-----------------QDDQILRVCKRFYVALILSRLVQEAAITEVCEAFKV 1209

Query: 1299 ARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVK 1358
            ARGMVQALQE+AGRFASMV++FCERLGWHDLEGLV KFQNRVSFGVRAEIVELTTIPYVK
Sbjct: 1210 ARGMVQALQENAGRFASMVTMFCERLGWHDLEGLVCKFQNRVSFGVRAEIVELTTIPYVK 1269

Query: 1359 GSRARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIE 1418
            GSRAR+LYKAGLRTPLAIAEAS AE+VKALFES+SWT +                    E
Sbjct: 1270 GSRARSLYKAGLRTPLAIAEASVAEIVKALFESSSWTEQ--------------------E 1329

Query: 1419 STAQKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNI 1478
             +AQ+R+H+G+A+KIK+GA K+VL+KAEEAR+AAFSAFK+LG  VPQ  RP+ +S  G+ 
Sbjct: 1330 GSAQRRIHLGVAKKIKNGAHKIVLEKAEEARVAAFSAFKALGLDVPQFYRPVFSSGGGSP 1389

Query: 1479 TAQVAASIPSEIDTLNRVVSTRQMEHALTKSCFG-------GTSSSEK------VGGKNL 1538
            + Q A +   +  T +  +  R+ EHA   S  G          S EK      +GG   
Sbjct: 1390 SMQGAGNSSGDNSTSSFPIVERK-EHAAKPSLEGRVLSGKVALESREKLTKTSDIGGVAS 1449

Query: 1539 SE---TGTISVEVKPPNFGVNPLVNVEGSAI--QESNTVVECAGKVDVTISNHMERIAQR 1598
            +E   TG + ++  P N      V ++GSA    E     +     D+T    ++ +  R
Sbjct: 1450 AEVYSTGVMQIKFGPDN----STVPIQGSAALGDELKAAFDQNKNADLTDHVQLQSLGDR 1509

Query: 1599 EQHSSV--------------LHPPKRDSSSMKGPIHAANTSGGFESFLDLWDASQEFFFD 1658
             + S                L P  + ++  KGPIHA NT GGF+SFLDLW+ + EF+FD
Sbjct: 1510 NRVSDESFDLEKQERCKRVNLSPGFKGNACDKGPIHAINTLGGFDSFLDLWETTSEFYFD 1569

Query: 1659 LYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDLLGPKSGKGLYPDDRTSGD---- 1718
            ++Y KRSE+NSV PFE+HGIAICWENSPVYYVN+PKDLL   + K        SG+    
Sbjct: 1570 IHYNKRSELNSVAPFEIHGIAICWENSPVYYVNIPKDLLWSDNSKNECLHLNGSGNRSNV 1629

Query: 1719 -----------------------------------QVQVLKCPGVSIQKLGFLNSARRNM 1778
                                               Q+Q LK P V  Q+ G  N A ++ 
Sbjct: 1630 LPLDDMLEMARRRWKRIGEIMRKRGVRKFAWKLKIQIQALKSPAVHAQRFGCQNIAGKST 1689

Query: 1779 GLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEKEVKKRLSGEAASAANR 1838
              +++D S L+L  VHI + IDMCIVAWILWPD+ER+S PNLEKEVKKRLS EAA+AANR
Sbjct: 1690 CFEIIDSSLLLLPPVHIKDGIDMCIVAWILWPDEERSSNPNLEKEVKKRLSSEAAAAANR 1749

Query: 1839 SGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALNNIEIPLVSILADMETW 1898
            +G+WKNQMRR AHNGCCRRVAQ RALCSVLWKL++SE L EAL NIEIPLV+ILADME W
Sbjct: 1750 NGRWKNQMRRAAHNGCCRRVAQIRALCSVLWKLLVSEGLTEALVNIEIPLVNILADMELW 1809

Query: 1899 GIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANVLYGHLKLSIPEGFN 1958
            G+G+DMEGC++AR +LGKKL+ LEKEAY+LAGM+FSLY AADIANVLYGHLKL IPEG N
Sbjct: 1810 GVGLDMEGCLQARKVLGKKLRQLEKEAYKLAGMTFSLYTAADIANVLYGHLKLPIPEGRN 1869

Query: 1959 KGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLSARTQKYTLHG 2018
            KGKQHPSTDKHCLDLLR+EHPI+PVIKEHRTLAKL NCTLGSICSL +LS +TQKYTLHG
Sbjct: 1870 KGKQHPSTDKHCLDLLRDEHPIIPVIKEHRTLAKLLNCTLGSICSLGRLSVKTQKYTLHG 1929

Query: 2019 HWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDD------VDHCKINARDFFISTQENWL 2078
            HWLQTSTATGRLSMEEPNLQCVEH VDFK+ +D+      VD+  INARD+FI TQ+NWL
Sbjct: 1930 HWLQTSTATGRLSMEEPNLQCVEHMVDFKIRKDEKGSETNVDYYNINARDYFIPTQDNWL 1989

Query: 2079 LVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDSIGPHERDQTKR 2138
            L++ADYSQIELRLMAHFSKDS LIE LSKP GDVFTMIAARWTG +EDS+  + RDQTKR
Sbjct: 1990 LLTADYSQIELRLMAHFSKDSVLIEPLSKPEGDVFTMIAARWTGISEDSVSSYVRDQTKR 2049

Query: 2139 LVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTFCRQKGYVETLK 2198
            LVYGILYGMGA +LA QL+CS +EA EKI++FKSSFPGVASWL+EAV  CR+KGY+ETLK
Sbjct: 2050 LVYGILYGMGANSLAEQLDCSPEEASEKIQNFKSSFPGVASWLNEAVADCRKKGYIETLK 2109

Query: 2199 GRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMINIYSVI--GTDA 2251
            GR+RFLSKI   NSKEKSKAQRQAVNSICQ      GSAADIIK+AMINIYSVI  G + 
Sbjct: 2110 GRKRFLSKIKFGNSKEKSKAQRQAVNSICQ------GSAADIIKIAMINIYSVIVGGAER 2165

BLAST of Cla022227 vs. TrEMBL
Match: A0A0D2U7X3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G218600 PE=4 SV=1)

HSP 1 Score: 2471.4 bits (6404), Expect = 0.0e+00
Identity = 1363/2220 (61.40%), Positives = 1601/2220 (72.12%), Query Frame = 1

Query: 100  KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
            KF+ASKKRK  +  LK+G  +K+ K ++E SP AKGTL++Y+ TSQD+     PS + R 
Sbjct: 13   KFFASKKRKTQSPGLKTGRLEKNEKTTVECSPSAKGTLNSYIRTSQDNEIVH-PSCTTRG 72

Query: 160  NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
                +D +K NL  +I+ S ++E+E   L     + A  E  K  ++  S    ++    
Sbjct: 73   ----KDPIKMNLASEIDKSFKHENEHSLLLAETKSQAFEETHKGISMGLSEAGNAAFGDH 132

Query: 220  MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
              G        E PELK+FA DFLSLYCS E+   V SP E KV  LKRH  PS+L  E 
Sbjct: 133  AEG----AQIGENPELKKFATDFLSLYCS-EVPVNVDSPSETKVNNLKRHGGPSMLSEED 192

Query: 280  KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGD--TDSHPPVVLKACLQKCNK 339
            K  KK H IA           S   ++  +   F+         S  P  L+A L+KCN 
Sbjct: 193  KRFKKRHLIAQQIQTVDIAVCSTNTNLEYETEEFLCNPSQDVNTSSNPFELQAGLRKCNT 252

Query: 340  APRSPYCLTECKTPGLSTANTCFQETPKS--GSSTFSPGEAFWKEAIVFADGLCAPSIDL 399
            A +S     EC TPG S    C   TP+S  GSS FSPGEAFW EAI  ADGL + S  L
Sbjct: 253  ATKSVLHTMECHTPGSSVIKGCSHRTPQSMRGSSMFSPGEAFWNEAIEIADGLFSQSDIL 312

Query: 400  TNCDAEGANVAESQSHTKKLPIPGEP-AQKRLKGQFGGGSGGVRLGEPGASMVSLRSELK 459
            +   AEG N  ESQ   K     G      + K         V+L    AS+ S   + K
Sbjct: 313  SARVAEGINNPESQYEVKNTGNLGNTNVGYKSKEISDECESRVKLQGISASLESAVKQKK 372

Query: 460  ELNREVSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTNDSLPNHN 519
            E+++EVS LPVKH DFS +DKNLDG     C   E +      +++++    N  LP   
Sbjct: 373  EIDKEVSLLPVKHLDFSFEDKNLDGGI---CHVLEKD------SQEAEGSIINHILPPTV 432

Query: 520  DKTRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSP-SDSITSDTAVHELRASTVHDF-K 579
            +K  D   L K +       +S+ VV +V++N+ S  +DSITS +  +  + S   D   
Sbjct: 433  NKLIDHAELQKTEEGGKLEQASIHVVPKVEVNLSSQDNDSITSMSPANAAKKSIGTDEGN 492

Query: 580  EETTPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCA 639
            E +TP SSV  KD L +S WLP EIC IYK+KGI +L+PWQV+CL+VDGVLQRRNLVYCA
Sbjct: 493  ESSTPLSSVALKDKLSISSWLPLEICKIYKKKGIEQLYPWQVDCLQVDGVLQRRNLVYCA 552

Query: 640  STSAGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQ 699
            STSAGKSFVAEILMLRR+I T K ALLVLPYVSIC EKA HL+VLLEPL K VRSYYGNQ
Sbjct: 553  STSAGKSFVAEILMLRRLILTRKAALLVLPYVSICVEKAEHLEVLLEPLGKQVRSYYGNQ 612

Query: 700  GGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLT 759
            GGGTLPKDTSVAVCTIEKANSL+NRLLEEGRLSEIGIIVIDELHMVGDQ+RGYLLELLLT
Sbjct: 613  GGGTLPKDTSVAVCTIEKANSLVNRLLEEGRLSEIGIIVIDELHMVGDQSRGYLLELLLT 672

Query: 760  KLRYAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDF 819
            KLRYAAGEG  +SSSGESSG+SSGK+DPAHG+QIVGMSATMPNV AVADWLQAALYQT+F
Sbjct: 673  KLRYAAGEGTPESSSGESSGSSSGKADPAHGLQIVGMSATMPNVEAVADWLQAALYQTNF 732

Query: 820  RPVPLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSS 879
            RPVPLEE+IKVGNTIY+++LD+VRTI K  +LGG+DPDH+VELCNEVV+EG SVLIFCS+
Sbjct: 733  RPVPLEEFIKVGNTIYDKNLDLVRTIPKAVDLGGKDPDHVVELCNEVVQEGQSVLIFCST 792

Query: 880  RKGCESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAY 939
            RKGCESTAKHV+KFLKKFSV  H +NSEF DI SA+DALRRCP+GLDPVLEET PSGVAY
Sbjct: 793  RKGCESTAKHVAKFLKKFSVTAHGDNSEFIDITSAIDALRRCPAGLDPVLEETLPSGVAY 852

Query: 940  HHAGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYR 999
            HHAGLTVEEREV+ETCYR+G +RVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDG RY+
Sbjct: 853  HHAGLTVEEREVIETCYRRGFVRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGTRYK 912

Query: 1000 QMAGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVA 1059
            QMAGRAGRTGIDTKGESVLIC+ EEIKRI  LLNESCPPLQSCLSEDKNGMTHAILEVVA
Sbjct: 913  QMAGRAGRTGIDTKGESVLICKTEEIKRIKGLLNESCPPLQSCLSEDKNGMTHAILEVVA 972

Query: 1060 GGIVQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLG 1119
            GG+VQTA DI+RYVRCTLLNSTKPFQ+VVKSAQESLRWLCH KFLEWN +TKLY TTPLG
Sbjct: 973  GGMVQTANDINRYVRCTLLNSTKPFQEVVKSAQESLRWLCHRKFLEWNDETKLYGTTPLG 1032

Query: 1120 RASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGL 1179
            RA+FGSSL PEESLIVLDDLSRAREGFVLASDLHLVYLVTPINV+VEPDWELYYERFM L
Sbjct: 1033 RAAFGSSLCPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVEVEPDWELYYERFMEL 1092

Query: 1180 PSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIR 1239
             +L+QSVG RVGVTEPFLMRMAHG                                 PI 
Sbjct: 1093 SALEQSVGYRVGVTEPFLMRMAHG--------------------------------VPIS 1152

Query: 1240 RANISRNGVVGLRTK-RDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCE 1299
            ++N  R+ +  L  +  ++ G       S+EQT+RVCKRFYVALILSRLVQE P+ EVCE
Sbjct: 1153 KSNGLRDSLKRLPAQFGNQPGINNSTMLSDEQTLRVCKRFYVALILSRLVQEAPVGEVCE 1212

Query: 1300 AFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTI 1359
            AF+VA+GMVQALQE+AGRFASMVSVFCERLGWHDLE LVAKFQNRVSFGVRAEIVELTTI
Sbjct: 1213 AFRVAKGMVQALQENAGRFASMVSVFCERLGWHDLEDLVAKFQNRVSFGVRAEIVELTTI 1272

Query: 1360 PYVKGSRARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQ 1419
            PYVKGSRARALYKAGLRTPLAIAEAS  E+VKALFES+SW A+                 
Sbjct: 1273 PYVKGSRARALYKAGLRTPLAIAEASIPEIVKALFESSSWVAQ----------------- 1332

Query: 1420 VCIESTAQKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPL--SA 1479
               ES AQ+RM  G+A+KIK+GARK+VLDKAEEAR AAFSAFKSLG++VPQ SRPL  S 
Sbjct: 1333 ---ESLAQRRMQFGVAKKIKNGARKIVLDKAEEARAAAFSAFKSLGYSVPQFSRPLILSG 1392

Query: 1480 SADGNITAQVAASIPSEIDTLN-RVVSTRQMEHALTKSCFGGTSSSE------KVGGKNL 1539
            S  G   A   A   S  + +    + T  M    T       SS        K    NL
Sbjct: 1393 SPGGEEAASTGAGDGSPCNVIGVEQIHTSAMPLMETGKNLEKVSSPNEGIMLTKASADNL 1452

Query: 1540 SETGTISVEVK-PPNFGVNPLVNVEGSAIQESNTVVECAGKVDV-TISNHMERIAQREQH 1599
              +  ++++     N G+     V G  +   N VVE    + + T+S ++++   +++ 
Sbjct: 1453 VASAEVNIDTTLQSNLGLENPAAVTGDKV---NAVVEQGRSIKMATVSEYLDQ-GMQDRL 1512

Query: 1600 SSVLHPPKRDSSSMKGPIHAANTSGGFESFLDLWDASQEFFFDLYYTKRSEVNSVVPFEL 1659
            +  L     DS+  KGP++A N  GGF+SFL+LW+ + EF FD+++ +RSE NSV PFE+
Sbjct: 1513 NEDLSVGNADSACGKGPLNAVNAPGGFDSFLELWETAPEFCFDVHFNRRSEANSVAPFEI 1572

Query: 1660 HGIAICWENSPVYYVNLPKDLLGPKSGKGLYPDDRTSGDQV------QVLKCPGVSIQKL 1719
            HGIAICWENSPVYYV LPKDLL   + K  +     S  +        +L+   +  +++
Sbjct: 1573 HGIAICWENSPVYYVKLPKDLLWLDNRKNNFLSTSASSGKCNSLPPEHMLEMAKLRWKRI 1632

Query: 1720 GFL---------------------------------NSARRNMGLKLVDGSYLVLSRVHI 1779
            G +                                 +   ++MGL+++D S L+L  V I
Sbjct: 1633 GDIMGKNGVHKLTWNLKVQIQVLKSSAISIQRFSGMHLGGKDMGLEIIDNSCLLLPPVLI 1692

Query: 1780 SNVIDMCIVAWILWPDDERNSTPNLEKEVKKRLSGEAASAANRSGQWKNQMRRVAHNGCC 1839
            ++  DMCI AWILWPD+ER+S PNLE EVKKRLS EAA+AAN+SG+WKNQMRR +HNGCC
Sbjct: 1693 NDGFDMCIAAWILWPDEERSSRPNLENEVKKRLSSEAAAAANQSGRWKNQMRRASHNGCC 1752

Query: 1840 RRVAQTRALCSVLWKLIISEKLLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLG 1899
             RVAQTRAL S  WKL+ISEKL++  + IE PLV +LA+ME WGIG++MEGC+ ARNLLG
Sbjct: 1753 HRVAQTRALYSAFWKLLISEKLIDVFSYIETPLVRVLAEMELWGIGINMEGCLWARNLLG 1812

Query: 1900 KKLKCLEKEAYRLAGMSFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLR 1959
            +KL+ LEKEAY+LAGM FSL  AADIANVLYGHLKL +PEG NKGKQHPSTDKHCLDLLR
Sbjct: 1813 EKLRYLEKEAYKLAGMKFSLSTAADIANVLYGHLKLPVPEGRNKGKQHPSTDKHCLDLLR 1872

Query: 1960 NEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEP 2019
            +EHPIVPVIKEHRTLAKL NCTLGSICSLA+LS  T KYTLHG WLQTSTATGRLSMEEP
Sbjct: 1873 DEHPIVPVIKEHRTLAKLLNCTLGSICSLARLSRSTNKYTLHGRWLQTSTATGRLSMEEP 1932

Query: 2020 NLQCVEHAVDFKMNED------DVDHCKINARDFFISTQENWLLVSADYSQIELRLMAHF 2079
            NLQCVEH V+F +++D      + DH KIN RDFFI TQ+NWLL++ADYSQIELRLMAHF
Sbjct: 1933 NLQCVEHMVEFSLSKDKNGSDANTDHYKINVRDFFIPTQDNWLLLTADYSQIELRLMAHF 1992

Query: 2080 SKDSSLIELLSKPHGDVFTMIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQ 2139
            S DS+LI+LLSKP GDVFTM++A WTG+ EDS+  +ERDQTKRL+YGILYGMGA TLA Q
Sbjct: 1993 SNDSALIKLLSKPQGDVFTMMSALWTGRAEDSVSSNERDQTKRLIYGILYGMGADTLAEQ 2052

Query: 2140 LECSKDEAVEKIRSFKSSFPGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEK 2199
            L C+ DEA EKI+SFKSSFP VASWL EAV  CRQKGY+ETLKGR+RFLSKI   NS+EK
Sbjct: 2053 LNCTPDEAKEKIKSFKSSFPDVASWLREAVASCRQKGYIETLKGRKRFLSKIKIGNSEEK 2112

Query: 2200 SKAQRQAVNSICQYFFFYWGSAADIIKVAMINIYSVI--GTDAPDPTGLPAANTNILRGH 2254
            SKAQRQAVNSICQ      GSAADIIK+AMI ++SVI  G D+ +         ++L+G 
Sbjct: 2113 SKAQRQAVNSICQ------GSAADIIKIAMIKLHSVIVEGVDSLESGSSILTKFHMLKGR 2151

BLAST of Cla022227 vs. TrEMBL
Match: A0A068VD06_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00009041001 PE=4 SV=1)

HSP 1 Score: 2405.6 bits (6233), Expect = 0.0e+00
Identity = 1328/2264 (58.66%), Positives = 1597/2264 (70.54%), Query Frame = 1

Query: 87   LALDETPEIDADSKFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQD 146
            +A  ++P    D  F + KKRK ++ S+KS    KD K ++EGSPG KG+LDN+LV S++
Sbjct: 1    MASGDSPRARIDQFFASKKKRKAISPSVKSKKVGKDAKIAVEGSPGTKGSLDNFLVGSEE 60

Query: 147  HGNSDIPSHSVRENLSEQDLVKRNLLLKINSSSRNEHEEPTL-----SRGCDTSAATEGI 206
            + NS  P+ +  E+  ++  +KRNL L+I+ SS++E ++  L     ++G D     + +
Sbjct: 61   NKNS--PNRAASESPVKRVPIKRNLTLEISLSSKDEKKDALLPMEVRAQGLDLFGYAQRV 120

Query: 207  KKRTLEDSYETRSSTVKLMAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQ 266
               T  D   + +   K +  +       E PELK+FA +FLSLYCS       S P E 
Sbjct: 121  NSETSNDFGGSVAGASKEVP-ENATAGEAENPELKRFATNFLSLYCS------ASVPSET 180

Query: 267  KVTFLKRHSSPSLLEGEAKLPKKIHS--------IAGPSNAKGEPDSSNALSV-----GN 326
             V  +KRH SPS L+ E +  K+ H         + G     G+  S    S      GN
Sbjct: 181  NVHAIKRHGSPSALDSEDRSSKRRHCNINMSQLHVEGEGICSGDVHSKPLQSAIIDESGN 240

Query: 327  KQSNFVVET--GDTDSHPPVVLKACLQKCNKAPRSPYCLTECKTPGLSTANTCFQETPKS 386
              S    E   GD ++ P   LK C+        +      C TPG         ETPKS
Sbjct: 241  AVSKCSTEVKLGDNETVPGTSLKRCVNASLTIDAAG-----CITPGSLNGKLGRHETPKS 300

Query: 387  G--SSTFSPGEAFWKEAIVFADGLCAPSIDLTNCDA-EGANVAESQSHTKKLPIPGEPAQ 446
            G  SS FSPGE FWKEAI  ADGL  P  +L +  A E  ++   +  +    +P     
Sbjct: 301  GRGSSIFSPGETFWKEAIQVADGLLIPKDNLHSQFALESEHLKPDKETSMANNLPDGGCG 360

Query: 447  KRLKGQFGGGSGGVRLGEPGASMVSLRSELKELNREVSSLPVKHFDFSA-DDKNLDGSTL 506
             +L      G      G   + +  +    K+L +EVS LPVKHFDFS  +DKN+D  T 
Sbjct: 361  NKLNNLLYAGVARDSNGGINSVVGPVSRHSKDLVKEVSPLPVKHFDFSKIEDKNMDEETP 420

Query: 507  PYCASNESEVNAYDLNEQSDCCYTNDSLPN--HNDKTRDSDSLTKEKIHETNVTSSVPVV 566
             Y   +   +      +   C   N       HN   +++ + T+  +       S    
Sbjct: 421  SYVNLSSQHIIK---GKTPGCVSQNQEYKQICHNLSLQNNAAHTECDLLGVQDMISKYDA 480

Query: 567  TEVKLNIFSPSDS---ITSDTAVHELRASTVHDFKEETTPSSSVRHKDWLDLSCWLPPEI 626
            TE KLNI++   S    T D  +++L       F ++ +PSS +  +D LDL+ WLP E+
Sbjct: 481  TENKLNIWAQDHSDMFTTKDRRLNDLTPKG--GFNQDDSPSSFLPLEDRLDLNNWLPSEL 540

Query: 627  CSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGKSFVAEILMLRRVISTGKMA 686
            CSIYK++G++KL+PWQV+CL+VDGVLQ RNLVY ASTSAGKSFVAEILMLRR++STGKMA
Sbjct: 541  CSIYKKRGMSKLYPWQVDCLQVDGVLQNRNLVYSASTSAGKSFVAEILMLRRILSTGKMA 600

Query: 687  LLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINR 746
             LVLPYVSICAEKA HL+VLLEPL K VRSYYGNQGGGTLPKDTSVAVCTIEKANSLINR
Sbjct: 601  FLVLPYVSICAEKAEHLEVLLEPLGKQVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINR 660

Query: 747  LLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGK 806
            LLEEGRLSE+GIIVIDELHMVGDQ RGYLLELLLTKLRYAAGEG+ +SSSGESSGT S K
Sbjct: 661  LLEEGRLSELGIIVIDELHMVGDQHRGYLLELLLTKLRYAAGEGSAESSSGESSGTGSSK 720

Query: 807  SDPAHGIQIVGMSATMPNVAAVADWLQ-AALYQTDFRPVPLEEYIKVGNTIYNRSLDIVR 866
            +DP  G+QIVGMSAT+PNVAAVADWLQ AALY+TDFRPVPLEEYIKVG TIYN+ ++IVR
Sbjct: 721  ADPVRGLQIVGMSATLPNVAAVADWLQQAALYETDFRPVPLEEYIKVGYTIYNKEMNIVR 780

Query: 867  TISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCESTAKHVSKFLKKFSVKIHN 926
            TI K A++GG+DPDHIVELCNE+V+EGHSVLIFCSSRKGCESTA+HV+K+LKKFSV   N
Sbjct: 781  TIPKIADIGGKDPDHIVELCNEIVQEGHSVLIFCSSRKGCESTARHVAKYLKKFSVSPQN 840

Query: 927  ENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVVETCYRKGLLRV 986
              +E  D+  A+DALRR P+GLDPVLEET P+GVAYHHAGLTVEERE VETCYRKG +RV
Sbjct: 841  GQNELMDLEFAIDALRRSPAGLDPVLEETLPAGVAYHHAGLTVEERETVETCYRKGFVRV 900

Query: 987  LTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRAGRTGIDTKGESVLICRPE 1046
            LTATSTLAAGVNLPARRVIFRQP+IGRDFIDG RYRQMAGRAGRTGIDTKGESVLIC+PE
Sbjct: 901  LTATSTLAAGVNLPARRVIFRQPRIGRDFIDGTRYRQMAGRAGRTGIDTKGESVLICKPE 960

Query: 1047 EIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLNSTKP 1106
            E KRI  +LNE CP L SCLSEDKNGMTHAILEVVAGGIVQTA DIHRYVRCTLLNSTKP
Sbjct: 961  ETKRILGILNEGCPALYSCLSEDKNGMTHAILEVVAGGIVQTANDIHRYVRCTLLNSTKP 1020

Query: 1107 FQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGSSLSPEESLIVLDDLSRAR 1166
            F DVV+SAQ+SLRWLCH KFLEW+ DTKLY+TTPLGRASFGSSLSPEES+IVLDDL+RAR
Sbjct: 1021 FGDVVRSAQDSLRWLCHKKFLEWSEDTKLYTTTPLGRASFGSSLSPEESMIVLDDLTRAR 1080

Query: 1167 EGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSLDQSVGNRVGVTEPFLMRMAHG 1226
            +GFVLASDLHLVYLVTP NVDVEPDWELYYERFM L +LD+SVGNRVGV EPFLMRMAHG
Sbjct: 1081 DGFVLASDLHLVYLVTPTNVDVEPDWELYYERFMELSALDKSVGNRVGVQEPFLMRMAHG 1140

Query: 1227 APIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVVGLRTKRDEHGCMYD 1286
            AP+R +N                            R  N S+    GL+ K +       
Sbjct: 1141 APLRTSN----------------------------RLKNTSK----GLQAKPNCIAMWNS 1200

Query: 1287 DRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFASMVSV 1346
               S+EQ +RV +RFYVALILS LVQE P+ EVC  FKVARGMVQALQ++AGRFASMVSV
Sbjct: 1201 AMLSDEQMLRVSRRFYVALILSTLVQEVPVAEVCAVFKVARGMVQALQDNAGRFASMVSV 1260

Query: 1347 FCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLAIAEA 1406
            FCERLGWHDL  LVAKFQNRVSFGV+AEIVELTTIPYVKGSRARALYKAGLRTP  IAEA
Sbjct: 1261 FCERLGWHDLADLVAKFQNRVSFGVKAEIVELTTIPYVKGSRARALYKAGLRTPQTIAEA 1320

Query: 1407 SDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIESTAQKRMHVGIARKIKHGARK 1466
            S  E+ KALFES+SW A+G                     TAQ R+ +G+A+KIK+GAR+
Sbjct: 1321 SIPEIAKALFESSSWAAQG---------------------TAQWRIQLGVAKKIKNGARR 1380

Query: 1467 VVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNITAQVA--ASIPSEIDTLNRVV 1526
            +VL+KAEEARIAAFSAFKSLG  VP +SRPL + A GN   + A  +S+     +L  + 
Sbjct: 1381 IVLEKAEEARIAAFSAFKSLGLEVPPLSRPLLSIAAGNAPQKEASSSSVEESTSSLGGLK 1440

Query: 1527 STRQM-----------EHALTKSCFGGTSSSEKVGGKNLSETGTISVEVKPPNFGVNPLV 1586
               Q            E  L ++ F G +S+    G+ +++  T SV ++ PN     + 
Sbjct: 1441 HNEQTDNITGFVSKAHEQKLARTSFTGVNSAGAKQGEVVADK-TASV-MEGPN--APYMH 1500

Query: 1587 NVEGSAIQESNTVVEC---------AGKVDVTISNHMERIAQREQHSSVLHPPKRDSSSM 1646
            N     +  +NT + C         +G VD      ++   +++Q+    H   ++    
Sbjct: 1501 NSTSDYVDNANTSLSCQLSSIRHGRSGYVD-----KIDNFGEQQQNRGTPHTASKERVLD 1560

Query: 1647 KGPIHAANTSGGFESFLDLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVYY 1706
            KGPI+A+N  GGF++FL+ WD SQEF+ D+++ +RSEVNS V FE+HG+AICWENSPVYY
Sbjct: 1561 KGPINASNIPGGFDTFLNWWDNSQEFYLDVHFNRRSEVNSTVLFEIHGMAICWENSPVYY 1620

Query: 1707 VNLPKDLL---GPKSGKGLYPDDRTSGDQV------------------------------ 1766
            V++PKDLL     K+ K L      +G+ V                              
Sbjct: 1621 VSIPKDLLLFNSRKTDKMLSNISGDNGNAVPPMDQFDLAKSRWQRIGKIIGKKDVRKFTW 1680

Query: 1767 ------QVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWILW 1826
                  QVL+ P VSI +LG LNSA +++GL+L+D SY VLS +H+ N ID+ I AWILW
Sbjct: 1681 NSKVQIQVLRYPAVSIHRLGNLNSAVKSVGLELIDDSYFVLSPLHVQNFIDLSIAAWILW 1740

Query: 1827 PDDERNSTPNLEKEVKKRLSGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLW 1886
            PD+E++S PNLEKE+KKRLS EAA+AA+R+G+WKNQMRR AHNGCCRRVAQ RAL SVLW
Sbjct: 1741 PDEEKSSNPNLEKEIKKRLSCEAAAAASRNGRWKNQMRRAAHNGCCRRVAQIRALSSVLW 1800

Query: 1887 KLIISEKLLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLA 1946
            KL+ISE+L+EA  +IEIPLV++LADME WGIGVDMEGC+RARN+LGKKLK LEKEA++LA
Sbjct: 1801 KLLISEELVEAFLSIEIPLVNVLADMELWGIGVDMEGCLRARNILGKKLKYLEKEAHQLA 1860

Query: 1947 GMSFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRT 2006
            GMSFSLY AADIANVLY HLK+ IPEG NKGK HPSTDK CLDLLRNEHPI+ VIKEHRT
Sbjct: 1861 GMSFSLYMAADIANVLYEHLKIPIPEGHNKGKYHPSTDKRCLDLLRNEHPIISVIKEHRT 1920

Query: 2007 LAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMN 2066
             AKL NCTLGSICSL+KLSARTQ+YTLHGHWLQTSTATGRLSMEEPNLQCVEH VDFKMN
Sbjct: 1921 FAKLLNCTLGSICSLSKLSARTQRYTLHGHWLQTSTATGRLSMEEPNLQCVEHVVDFKMN 1980

Query: 2067 EDDVD-------HCKINARDFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKP 2126
              D+D       + K+NAR+FF++TQ++W L++ADYSQIELRLMAHFSKD SL+ELL+K 
Sbjct: 1981 RIDLDGKELVDEYHKVNAREFFVATQDDWYLLTADYSQIELRLMAHFSKDPSLVELLNKR 2040

Query: 2127 HGDVFTMIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIR 2186
              DVF+MIAA+WTGK E S+   ERDQTKRLVYG+LYGMGA +LA QL C+ DEA E+I 
Sbjct: 2041 DSDVFSMIAAKWTGKVESSVSSQERDQTKRLVYGMLYGMGANSLAEQLNCTSDEAAERIC 2100

Query: 2187 SFKSSFPGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQ 2246
             FK+SFPGVA+WL E VT CRQKGYV+TLKGR+RFL+KI   NSKEKSKA RQAVNSICQ
Sbjct: 2101 CFKTSFPGVATWLQEVVTSCRQKGYVKTLKGRKRFLAKIKFGNSKEKSKAHRQAVNSICQ 2160

Query: 2247 YFFFYWGSAADIIKVAMINIYSVIGTDAPDPTG--LPAANTNILRGHCRIVLQVHDELVL 2251
                  GSAADIIK+AMIN++SV+  DA         A   ++L+G CRI+LQ  + L++
Sbjct: 2161 ------GSAADIIKIAMINLHSVVAEDADTSCSSCALAEKFHMLKGRCRILLQASNRLLM 2177

BLAST of Cla022227 vs. NCBI nr
Match: gi|659088547|ref|XP_008445039.1| (PREDICTED: DNA polymerase theta isoform X1 [Cucumis melo])

HSP 1 Score: 3725.6 bits (9660), Expect = 0.0e+00
Identity = 1925/2210 (87.10%), Positives = 1985/2210 (89.82%), Query Frame = 1

Query: 100  KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
            +FYASKKRKPLT SLKSGSYDKDGK++LEGSP AKGTLDNYLV SQD GNSD PSHSVRE
Sbjct: 12   QFYASKKRKPLTPSLKSGSYDKDGKRALEGSPSAKGTLDNYLVVSQDRGNSDNPSHSVRE 71

Query: 160  NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
            NLS QDLVKRNLLL+INSSS NEH E T SRGCD        KK+T+EDS ETRSSTVK 
Sbjct: 72   NLSGQDLVKRNLLLRINSSSINEHGETT-SRGCD--------KKKTMEDSLETRSSTVKS 131

Query: 220  MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
            MA D GV PCTEKPELKQFAADFLSLYCSNEL TTVSSP EQKVT LKRHSSPS LE EA
Sbjct: 132  MASDWGVAPCTEKPELKQFAADFLSLYCSNELQTTVSSPVEQKVTSLKRHSSPSHLEEEA 191

Query: 280  KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKAP 339
            KLPKK+HSI  PSNA+GEPDSSNALS GNK+SNFVVETGDTDSH P VLKAC+QKCN+AP
Sbjct: 192  KLPKKMHSIVDPSNAEGEPDSSNALSEGNKESNFVVETGDTDSHHPAVLKACMQKCNQAP 251

Query: 340  RSPYCLTECKTPGLSTANTCFQETPKSGSSTFSPGEAFWKEAIVFADGLCAPSIDLTNCD 399
             SP+CLTECKTPGLSTA T  ++TPKSGSSTFSPGEAFWKEAIVFADGLCAPSI LTNCD
Sbjct: 252  ISPHCLTECKTPGLSTATTFIRQTPKSGSSTFSPGEAFWKEAIVFADGLCAPSIALTNCD 311

Query: 400  AEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKELNRE 459
             E AN+ ESQS+TKKLPIP EPAQKRLKGQFG GSGGVRLGEPGAS+VSLRS+LKEL+R 
Sbjct: 312  GEEANLVESQSNTKKLPIPEEPAQKRLKGQFGVGSGGVRLGEPGASIVSLRSDLKELDRV 371

Query: 460  VSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTND-SLPNHNDKTR 519
             SSLPVKHFDFSADDKNLD +T P CASNES+VNAYDLNEQSD CYT   SLP HNDKTR
Sbjct: 372  ASSLPVKHFDFSADDKNLDENTSPCCASNESKVNAYDLNEQSDRCYTTHVSLPKHNDKTR 431

Query: 520  DSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHELRASTVHDFKEETTPS 579
            DSDSLTKEKI ET VTSSVPVVTEVKLNIFSPSDSITSDTA HELRAST+H  ++E TPS
Sbjct: 432  DSDSLTKEKIQETKVTSSVPVVTEVKLNIFSPSDSITSDTATHELRASTIHGSRDEMTPS 491

Query: 580  SSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK 639
            SS RHKDWLDL+CWLPPEI SIYKEKGITKLH WQVECLKVDGVLQRRNLVYCASTSAGK
Sbjct: 492  SSTRHKDWLDLTCWLPPEISSIYKEKGITKLHRWQVECLKVDGVLQRRNLVYCASTSAGK 551

Query: 640  SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLP 699
            SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPL KHVRSYYGNQGGGTLP
Sbjct: 552  SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLGKHVRSYYGNQGGGTLP 611

Query: 700  KDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA 759
            KDTSVA+CTIEKANSLINRLLEE RLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA
Sbjct: 612  KDTSVAICTIEKANSLINRLLEECRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA 671

Query: 760  GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLE 819
            GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALY TDFRPVPLE
Sbjct: 672  GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYHTDFRPVPLE 731

Query: 820  EYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCES 879
            EYIKVGNTIYN+SLDIVRTISKTANLGGRDPDHIVELCNEVVE+G+SVLIFCSSRKGCES
Sbjct: 732  EYIKVGNTIYNKSLDIVRTISKTANLGGRDPDHIVELCNEVVEDGNSVLIFCSSRKGCES 791

Query: 880  TAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLT 939
            TAKHVSKFLKKFSVKI NENSEFTDIFSA+DALRRCPSGLDPVLEETFPSGVAYHHAGLT
Sbjct: 792  TAKHVSKFLKKFSVKIQNENSEFTDIFSAIDALRRCPSGLDPVLEETFPSGVAYHHAGLT 851

Query: 940  VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRA 999
            VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDGARYRQMAGRA
Sbjct: 852  VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGARYRQMAGRA 911

Query: 1000 GRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT 1059
            GRTGIDTKGESVLICRPEE+KRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT
Sbjct: 912  GRTGIDTKGESVLICRPEEVKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT 971

Query: 1060 ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS 1119
            ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCH KFLEWNGDTKLYSTTPLGRASFGS
Sbjct: 972  ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHRKFLEWNGDTKLYSTTPLGRASFGS 1031

Query: 1120 SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSLDQS 1179
            SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGL SLDQS
Sbjct: 1032 SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLSSLDQS 1091

Query: 1180 VGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISR 1239
            VGNRVGVTEPFLMRMAHGAP+RRANISRNGV+                            
Sbjct: 1092 VGNRVGVTEPFLMRMAHGAPVRRANISRNGVA---------------------------- 1151

Query: 1240 NGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARG 1299
                G RTKRDEH  MY DRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARG
Sbjct: 1152 ----GSRTKRDEHMGMYGDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARG 1211

Query: 1300 MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR 1359
            MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR
Sbjct: 1212 MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR 1271

Query: 1360 ARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIESTA 1419
            ARALYKAGLRTPLAIAEASDAELVKALFESASWT E                    ES A
Sbjct: 1272 ARALYKAGLRTPLAIAEASDAELVKALFESASWTTE--------------------ESIA 1331

Query: 1420 QKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNITAQ 1479
            QKRMHVG+ARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQIS PLSAS DGNITAQ
Sbjct: 1332 QKRMHVGLARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISHPLSASVDGNITAQ 1391

Query: 1480 VAASIPSEIDTLNRVVSTRQMEHALTKSCFGGTSSSEKVGGKNLSETGTISVEVKPPNFG 1539
            VA             VSTRQMEH LT S  GGTSSSEKV GKN SETG +SV+VK  N G
Sbjct: 1392 VA-------------VSTRQMEHVLTLSSVGGTSSSEKVVGKNPSETGAMSVDVKVSNSG 1451

Query: 1540 VNPLVNVEGSAIQESNTVVECAGKVDVTISNHMERI----AQREQHS-SVLHPPKRDSSS 1599
            VNP VNVEGSAIQ+SNTVVECAGKVDV IS+H+ERI    AQREQHS  VLH  KRD SS
Sbjct: 1452 VNPPVNVEGSAIQDSNTVVECAGKVDVAISSHVERITDKDAQREQHSGKVLHSLKRDDSS 1511

Query: 1600 MKGPIHAANTSGGFESFLDLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVY 1659
            MKGPI AA TSGGFESFL+LWDASQEF+FDLYYTKRSEVNSVVPFELHGIAICWENSPVY
Sbjct: 1512 MKGPIQAATTSGGFESFLELWDASQEFYFDLYYTKRSEVNSVVPFELHGIAICWENSPVY 1571

Query: 1660 YVNLPKDLLGPKSGKGLYPDDRTSGD---------------------------------- 1719
            YVN+PKDLLGPKSGKGL PDD  SGD                                  
Sbjct: 1572 YVNIPKDLLGPKSGKGLCPDDSMSGDRVDVSQNEHWFEMIEMRWKKINEIFTKKNVRKFA 1631

Query: 1720 -----QVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWIL 1779
                 QVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLS VHISNVIDMCIVAWIL
Sbjct: 1632 WNLKVQVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLSGVHISNVIDMCIVAWIL 1691

Query: 1780 WPDDERNSTPNLEKEVKKRLSGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL 1839
            WPDDERNST NLEKEVKKRLSGEAA+AANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL
Sbjct: 1692 WPDDERNSTLNLEKEVKKRLSGEAAAAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL 1751

Query: 1840 WKLIISEKLLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRL 1899
            WKLIISEKLLEALNNIEIPLV ILADMETWGIGVDMEGCIRARNLLGKKL+CLEKEAYRL
Sbjct: 1752 WKLIISEKLLEALNNIEIPLVGILADMETWGIGVDMEGCIRARNLLGKKLRCLEKEAYRL 1811

Query: 1900 AGMSFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR 1959
            AGM+FSLYAAADIANVLYGHLKLSIPE FNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR
Sbjct: 1812 AGMTFSLYAAADIANVLYGHLKLSIPEEFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR 1871

Query: 1960 TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKM 2019
            TLAKLFNCTLGSICSLA+LSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAV+FKM
Sbjct: 1872 TLAKLFNCTLGSICSLARLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVEFKM 1931

Query: 2020 NEDDVDHCKINARDFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFT 2079
            NEDDVDHCKINARDFFISTQENWLL+SADYSQIELRLMAHFSKDS LIELLS+ HGDVFT
Sbjct: 1932 NEDDVDHCKINARDFFISTQENWLLLSADYSQIELRLMAHFSKDSLLIELLSRSHGDVFT 1991

Query: 2080 MIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSF 2139
            MIAARWTGKTEDSIG HERDQTKRLVYGILYGMGAK+LALQLECS+DEAVEKI+SFKSSF
Sbjct: 1992 MIAARWTGKTEDSIGSHERDQTKRLVYGILYGMGAKSLALQLECSRDEAVEKIQSFKSSF 2051

Query: 2140 PGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYW 2199
            PGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQ      
Sbjct: 2052 PGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQ------ 2111

Query: 2200 GSAADIIKVAMINIYSVIGTDAPDPTGLPAANTNILRGHCRIVLQVHDELVLEVDPSVVK 2259
            GSAADIIK+AMI++YSVIGTDAPD T LPAAN+NILRGHCRIVLQVHDELVLEVDPSVVK
Sbjct: 2112 GSAADIIKLAMIHVYSVIGTDAPDLTVLPAANSNILRGHCRIVLQVHDELVLEVDPSVVK 2141

Query: 2260 EAAALLQKSMENAASLLVPLQVKLKVGRTWGSLEPFLHDSFKIEVLVPGS 2265
            EAA+LLQKSMENAASLLVPLQVKLKVGRTWGSLE FL D+F+IE L PGS
Sbjct: 2172 EAASLLQKSMENAASLLVPLQVKLKVGRTWGSLETFLPDNFQIEALAPGS 2141

BLAST of Cla022227 vs. NCBI nr
Match: gi|778672103|ref|XP_011649741.1| (PREDICTED: helicase and polymerase-containing protein TEBICHI [Cucumis sativus])

HSP 1 Score: 3724.1 bits (9656), Expect = 0.0e+00
Identity = 1922/2210 (86.97%), Positives = 1984/2210 (89.77%), Query Frame = 1

Query: 100  KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
            +FYASKKRKPLT SLKSGSYDK+GKK+LEGSPGAKGTLDNYLV SQDHG+SD PSHSVRE
Sbjct: 12   QFYASKKRKPLTPSLKSGSYDKNGKKALEGSPGAKGTLDNYLVISQDHGSSDNPSHSVRE 71

Query: 160  NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
            NLS Q+LVKRNLLLKINSS RNEH E T SRGCD        KKRTLEDS+ETRSSTVK 
Sbjct: 72   NLSAQNLVKRNLLLKINSSFRNEHGETTSSRGCD--------KKRTLEDSFETRSSTVKS 131

Query: 220  MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
             A D G+TPCTEKPELKQFAADFLSLYCSNEL TTVSSP EQKVTFLKRHSSPS LEGEA
Sbjct: 132  TASDCGITPCTEKPELKQFAADFLSLYCSNELQTTVSSPVEQKVTFLKRHSSPSHLEGEA 191

Query: 280  KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKAP 339
            KLPKK+HSI GPSNA+ EPDSSNALS GNK+SNFVVETGDT SH P VLKAC+QKCN+AP
Sbjct: 192  KLPKKMHSIVGPSNAESEPDSSNALSEGNKESNFVVETGDTVSHHPAVLKACMQKCNQAP 251

Query: 340  RSPYCLTECKTPGLSTANTCFQETPKSGSSTFSPGEAFWKEAIVFADGLCAPSIDLTNCD 399
             SPYCLTECKTPGLST  T  ++TPKSGSSTFSPGEAFWKEAIV ADGL APSI L NCD
Sbjct: 252  TSPYCLTECKTPGLSTGTTFIRQTPKSGSSTFSPGEAFWKEAIVLADGLRAPSIALINCD 311

Query: 400  AEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKELNRE 459
            AE AN+ ESQS+TKKLPIP EPAQKRLKGQFGGGSGGVRLGEPGAS   LRS+LKEL+R 
Sbjct: 312  AEEANLVESQSNTKKLPIPEEPAQKRLKGQFGGGSGGVRLGEPGAS---LRSDLKELDRV 371

Query: 460  VSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTND-SLPNHNDKTR 519
            VSSLPVKHFDFSADDKNLD ST P CASNES+VNAYDLNEQSD CYT   SLP HNDKTR
Sbjct: 372  VSSLPVKHFDFSADDKNLDDSTSPCCASNESKVNAYDLNEQSDRCYTTHISLPKHNDKTR 431

Query: 520  DSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHELRASTVHDFKEETTPS 579
            DSDSLTKEKI ET VTSSVPVV EVKLNIFSPSDSITSDTA HELRAST+HD ++ETTPS
Sbjct: 432  DSDSLTKEKIQETIVTSSVPVVNEVKLNIFSPSDSITSDTAAHELRASTIHDSRDETTPS 491

Query: 580  SSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK 639
            SS RHKDWLDLSCWLPPEI SIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK
Sbjct: 492  SSTRHKDWLDLSCWLPPEISSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK 551

Query: 640  SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGGTLP 699
            SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLE L KHVRSYYGNQGGGTLP
Sbjct: 552  SFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLESLGKHVRSYYGNQGGGTLP 611

Query: 700  KDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA 759
            KDTSVAVCTIEKANSLINRLLEE RLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA
Sbjct: 612  KDTSVAVCTIEKANSLINRLLEECRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLRYAA 671

Query: 760  GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLE 819
            GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALY TDFRPVPLE
Sbjct: 672  GEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYHTDFRPVPLE 731

Query: 820  EYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCES 879
            EYIKVGNTIYN+SLDIVRTISKTANLGGRDPDHIVELCNEVVE+GHSVLIFCSSRKGCES
Sbjct: 732  EYIKVGNTIYNKSLDIVRTISKTANLGGRDPDHIVELCNEVVEDGHSVLIFCSSRKGCES 791

Query: 880  TAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLT 939
            TAKHVSKFLKKFSVKI N+NSEFTDIFSA+DALRRCPSGLDPVLEETFPSGVAYHHAGLT
Sbjct: 792  TAKHVSKFLKKFSVKIQNDNSEFTDIFSAIDALRRCPSGLDPVLEETFPSGVAYHHAGLT 851

Query: 940  VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRA 999
            VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDGARYRQMAGRA
Sbjct: 852  VEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGARYRQMAGRA 911

Query: 1000 GRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT 1059
            GRTGIDTKGESVLICRPEE+KRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT
Sbjct: 912  GRTGIDTKGESVLICRPEEVKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQT 971

Query: 1060 ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS 1119
            ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS
Sbjct: 972  ATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGS 1031

Query: 1120 SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSLDQS 1179
            SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYE FMGL SLDQS
Sbjct: 1032 SLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYELFMGLSSLDQS 1091

Query: 1180 VGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISR 1239
            VGNRVG TEPFLMRMAHGAP+RRANISRNGV+                            
Sbjct: 1092 VGNRVGATEPFLMRMAHGAPVRRANISRNGVA---------------------------- 1151

Query: 1240 NGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARG 1299
                GLRTKRDEH  +Y DRPSEEQTIRVCKRFYVALILSRLVQETPIPEVC+AFKVARG
Sbjct: 1152 ----GLRTKRDEHVGVYGDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCDAFKVARG 1211

Query: 1300 MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR 1359
            MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR
Sbjct: 1212 MVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSR 1271

Query: 1360 ARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIESTA 1419
            ARALYKAGLRTPLAIAEASDAELVKAL ESASWT E                    ESTA
Sbjct: 1272 ARALYKAGLRTPLAIAEASDAELVKALSESASWTTE--------------------ESTA 1331

Query: 1420 QKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNITAQ 1479
            QKRMHVG+ARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQIS PLSASADGNITAQ
Sbjct: 1332 QKRMHVGLARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISHPLSASADGNITAQ 1391

Query: 1480 VAASIPSEIDTLNRVVSTRQMEHALTKSCFGGTSSSEKVGGKNLSETGTISVEVKPPNFG 1539
            VA             V T+QME  LT SC GGTSSSEKV GKN S+TG IS++VK  N G
Sbjct: 1392 VA-------------VGTQQMERVLTLSCVGGTSSSEKVVGKNPSQTGAISIDVKQSNSG 1451

Query: 1540 VNPLVNVEGSAIQESNTVVECAGKVDVTISNHMERI----AQREQHSS-VLHPPKRDSSS 1599
            VNP VN EGSAIQ+SNTV ECAGKVDV IS+H+ERI    AQREQHSS VLH  KRD SS
Sbjct: 1452 VNPPVNAEGSAIQDSNTVGECAGKVDVAISSHLERITDKDAQREQHSSKVLHSLKRDGSS 1511

Query: 1600 MKGPIHAANTSGGFESFLDLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVY 1659
            MKGPI AA+TSGGFESFL+LWDASQEF+FDLYYTKRSEVNSVVPFELHGIAICWE SPVY
Sbjct: 1512 MKGPIQAASTSGGFESFLNLWDASQEFYFDLYYTKRSEVNSVVPFELHGIAICWEKSPVY 1571

Query: 1660 YVNLPKDLLGPKSGKGLYPDDRTSGD---------------------------------- 1719
            YVN+PKDLLGPKSGKGL PDD  SGD                                  
Sbjct: 1572 YVNIPKDLLGPKSGKGLCPDDSISGDQVDVSQNEHWFEMIEMRWKKINEIFTKKNVRKFA 1631

Query: 1720 -----QVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWIL 1779
                 QVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSY+VLSRVH+SNVIDMCIVAWIL
Sbjct: 1632 WNLKVQVQVLKCPGVSIQKLGFLNSARRNMGLKLVDGSYIVLSRVHMSNVIDMCIVAWIL 1691

Query: 1780 WPDDERNSTPNLEKEVKKRLSGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL 1839
            WPDDERNST NLEKEVKKRLSGEAA+AANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL
Sbjct: 1692 WPDDERNSTLNLEKEVKKRLSGEAAAAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVL 1751

Query: 1840 WKLIISEKLLEALNNIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRL 1899
            WKLIISEKLL+ALNNIEIPLV ILADMETWGIGVDMEGCIRARNLLGKKL+CLEKEAYRL
Sbjct: 1752 WKLIISEKLLDALNNIEIPLVGILADMETWGIGVDMEGCIRARNLLGKKLRCLEKEAYRL 1811

Query: 1900 AGMSFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR 1959
            AGM+FSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR
Sbjct: 1812 AGMTFSLYAAADIANVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHR 1871

Query: 1960 TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKM 2019
            TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAV+FKM
Sbjct: 1872 TLAKLFNCTLGSICSLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVEFKM 1931

Query: 2020 NEDDVDHCKINARDFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFT 2079
            NEDDVDHCKINARDFFISTQENWLL+SADYSQIELRLMAHFSKDS LIELLS PHGDVFT
Sbjct: 1932 NEDDVDHCKINARDFFISTQENWLLLSADYSQIELRLMAHFSKDSLLIELLSIPHGDVFT 1991

Query: 2080 MIAARWTGKTEDSIGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSF 2139
            MIAARWTGKTEDSIG HERDQTKRLVYGILYGMGAK+LALQLECS+DEAVEKI+SFKSSF
Sbjct: 1992 MIAARWTGKTEDSIGSHERDQTKRLVYGILYGMGAKSLALQLECSRDEAVEKIQSFKSSF 2051

Query: 2140 PGVASWLHEAVTFCRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYW 2199
            PGVASWLHEAV FCRQKGYVETLKGRRRFLSKINSP SKEKSKAQRQAVNSICQ      
Sbjct: 2052 PGVASWLHEAVAFCRQKGYVETLKGRRRFLSKINSPISKEKSKAQRQAVNSICQ------ 2111

Query: 2200 GSAADIIKVAMINIYSVIGTDAPDPTGLPAANTNILRGHCRIVLQVHDELVLEVDPSVVK 2259
            GSAADIIK+AMI++YSVIGTDAPD T LPAAN+NILRGHCRIVLQVHDELVLEVDPS VK
Sbjct: 2112 GSAADIIKLAMIHVYSVIGTDAPDLTVLPAANSNILRGHCRIVLQVHDELVLEVDPSFVK 2139

Query: 2260 EAAALLQKSMENAASLLVPLQVKLKVGRTWGSLEPFLHDSFKIEVLVPGS 2265
            EAA+LLQKSMENAASLLVPLQVKLKVGRTWGSLE FL D+F+IE L PGS
Sbjct: 2172 EAASLLQKSMENAASLLVPLQVKLKVGRTWGSLETFLPDNFQIEALAPGS 2139

BLAST of Cla022227 vs. NCBI nr
Match: gi|659088551|ref|XP_008445041.1| (PREDICTED: DNA polymerase theta isoform X2 [Cucumis melo])

HSP 1 Score: 2658.2 bits (6889), Expect = 0.0e+00
Identity = 1378/1596 (86.34%), Positives = 1422/1596 (89.10%), Query Frame = 1

Query: 714  LINRLLEEGRLSEIGIIVIDEL-HMVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSG 773
            ++ R++  G+++ + +  +      VGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSG
Sbjct: 550  MLRRVISTGKMALLVLPYVSICAEKVGDQTRGYLLELLLTKLRYAAGEGNLDSSSGESSG 609

Query: 774  TSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPVPLEEYIKVGNTIYNRSL 833
            TSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALY TDFRPVPLEEYIKVGNTIYN+SL
Sbjct: 610  TSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYHTDFRPVPLEEYIKVGNTIYNKSL 669

Query: 834  DIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKGCESTAKHVSKFLKKFSV 893
            DIVRTISKTANLGGRDPDHIVELCNEVVE+G+SVLIFCSSRKGCESTAKHVSKFLKKFSV
Sbjct: 670  DIVRTISKTANLGGRDPDHIVELCNEVVEDGNSVLIFCSSRKGCESTAKHVSKFLKKFSV 729

Query: 894  KIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVVETCYRKG 953
            KI NENSEFTDIFSA+DALRRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVVETCYRKG
Sbjct: 730  KIQNENSEFTDIFSAIDALRRCPSGLDPVLEETFPSGVAYHHAGLTVEEREVVETCYRKG 789

Query: 954  LLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMAGRAGRTGIDTKGESVLI 1013
            LLRVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDGARYRQMAGRAGRTGIDTKGESVLI
Sbjct: 790  LLRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGARYRQMAGRAGRTGIDTKGESVLI 849

Query: 1014 CRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLN 1073
            CRPEE+KRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLN
Sbjct: 850  CRPEEVKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGIVQTATDIHRYVRCTLLN 909

Query: 1074 STKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRASFGSSLSPEESLIVLDDL 1133
            STKPFQDVVKSAQESLRWLCH KFLEWNGDTKLYSTTPLGRASFGSSLSPEESLIVLDDL
Sbjct: 910  STKPFQDVVKSAQESLRWLCHRKFLEWNGDTKLYSTTPLGRASFGSSLSPEESLIVLDDL 969

Query: 1134 SRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSLDQSVGNRVGVTEPFLMR 1193
            SRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGL SLDQSVGNRVGVTEPFLMR
Sbjct: 970  SRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLSSLDQSVGNRVGVTEPFLMR 1029

Query: 1194 MAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVVGLRTKRDEHG 1253
            MAHGAP+RRANISRNGV+                                G RTKRDEH 
Sbjct: 1030 MAHGAPVRRANISRNGVA--------------------------------GSRTKRDEHM 1089

Query: 1254 CMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFAS 1313
             MY DRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFAS
Sbjct: 1090 GMYGDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKVARGMVQALQESAGRFAS 1149

Query: 1314 MVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLA 1373
            MVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLA
Sbjct: 1150 MVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVKGSRARALYKAGLRTPLA 1209

Query: 1374 IAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIESTAQKRMHVGIARKIKH 1433
            IAEASDAELVKALFESASWT E                    ES AQKRMHVG+ARKIKH
Sbjct: 1210 IAEASDAELVKALFESASWTTE--------------------ESIAQKRMHVGLARKIKH 1269

Query: 1434 GARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNITAQVAASIPSEIDTLNR 1493
            GARKVVLDKAEEARIAAFSAFKSLGFTVPQIS PLSAS DGNITAQVA            
Sbjct: 1270 GARKVVLDKAEEARIAAFSAFKSLGFTVPQISHPLSASVDGNITAQVA------------ 1329

Query: 1494 VVSTRQMEHALTKSCFGGTSSSEKVGGKNLSETGTISVEVKPPNFGVNPLVNVEGSAIQE 1553
             VSTRQMEH LT S  GGTSSSEKV GKN SETG +SV+VK  N GVNP VNVEGSAIQ+
Sbjct: 1330 -VSTRQMEHVLTLSSVGGTSSSEKVVGKNPSETGAMSVDVKVSNSGVNPPVNVEGSAIQD 1389

Query: 1554 SNTVVECAGKVDVTISNHMERI----AQREQHS-SVLHPPKRDSSSMKGPIHAANTSGGF 1613
            SNTVVECAGKVDV IS+H+ERI    AQREQHS  VLH  KRD SSMKGPI AA TSGGF
Sbjct: 1390 SNTVVECAGKVDVAISSHVERITDKDAQREQHSGKVLHSLKRDDSSMKGPIQAATTSGGF 1449

Query: 1614 ESFLDLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDLLGPKSG 1673
            ESFL+LWDASQEF+FDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVN+PKDLLGPKSG
Sbjct: 1450 ESFLELWDASQEFYFDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVNIPKDLLGPKSG 1509

Query: 1674 KGLYPDDRTSGD---------------------------------------QVQVLKCPG 1733
            KGL PDD  SGD                                       QVQVLKCPG
Sbjct: 1510 KGLCPDDSMSGDRVDVSQNEHWFEMIEMRWKKINEIFTKKNVRKFAWNLKVQVQVLKCPG 1569

Query: 1734 VSIQKLGFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEK 1793
            VSIQKLGFLNSARRNMGLKLVDGSYLVLS VHISNVIDMCIVAWILWPDDERNST NLEK
Sbjct: 1570 VSIQKLGFLNSARRNMGLKLVDGSYLVLSGVHISNVIDMCIVAWILWPDDERNSTLNLEK 1629

Query: 1794 EVKKRLSGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALN 1853
            EVKKRLSGEAA+AANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALN
Sbjct: 1630 EVKKRLSGEAAAAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALN 1689

Query: 1854 NIEIPLVSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIA 1913
            NIEIPLV ILADMETWGIGVDMEGCIRARNLLGKKL+CLEKEAYRLAGM+FSLYAAADIA
Sbjct: 1690 NIEIPLVGILADMETWGIGVDMEGCIRARNLLGKKLRCLEKEAYRLAGMTFSLYAAADIA 1749

Query: 1914 NVLYGHLKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSIC 1973
            NVLYGHLKLSIPE FNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSIC
Sbjct: 1750 NVLYGHLKLSIPEEFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSIC 1809

Query: 1974 SLAKLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDDVDHCKINARD 2033
            SLA+LSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAV+FKMNEDDVDHCKINARD
Sbjct: 1810 SLARLSARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVEFKMNEDDVDHCKINARD 1869

Query: 2034 FFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDSI 2093
            FFISTQENWLL+SADYSQIELRLMAHFSKDS LIELLS+ HGDVFTMIAARWTGKTEDSI
Sbjct: 1870 FFISTQENWLLLSADYSQIELRLMAHFSKDSLLIELLSRSHGDVFTMIAARWTGKTEDSI 1929

Query: 2094 GPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTFC 2153
            G HERDQTKRLVYGILYGMGAK+LALQLECS+DEAVEKI+SFKSSFPGVASWLHEAVTFC
Sbjct: 1930 GSHERDQTKRLVYGILYGMGAKSLALQLECSRDEAVEKIQSFKSSFPGVASWLHEAVTFC 1989

Query: 2154 RQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMINI 2213
            RQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQ      GSAADIIK+AMI++
Sbjct: 1990 RQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQ------GSAADIIKLAMIHV 2049

Query: 2214 YSVIGTDAPDPTGLPAANTNILRGHCRIVLQVHDELVLEVDPSVVKEAAALLQKSMENAA 2265
            YSVIGTDAPD T LPAAN+NILRGHCRIVLQVHDELVLEVDPSVVKEAA+LLQKSMENAA
Sbjct: 2050 YSVIGTDAPDLTVLPAANSNILRGHCRIVLQVHDELVLEVDPSVVKEAASLLQKSMENAA 2074

BLAST of Cla022227 vs. NCBI nr
Match: gi|659088551|ref|XP_008445041.1| (PREDICTED: DNA polymerase theta isoform X2 [Cucumis melo])

HSP 1 Score: 948.3 bits (2450), Expect = 2.4e-272
Identity = 485/572 (84.79%), Positives = 510/572 (89.16%), Query Frame = 1

Query: 100 KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
           +FYASKKRKPLT SLKSGSYDKDGK++LEGSP AKGTLDNYLV SQD GNSD PSHSVRE
Sbjct: 12  QFYASKKRKPLTPSLKSGSYDKDGKRALEGSPSAKGTLDNYLVVSQDRGNSDNPSHSVRE 71

Query: 160 NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
           NLS QDLVKRNLLL+INSSS NEH E T SRGCD        KK+T+EDS ETRSSTVK 
Sbjct: 72  NLSGQDLVKRNLLLRINSSSINEHGETT-SRGCD--------KKKTMEDSLETRSSTVKS 131

Query: 220 MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
           MA D GV PCTEKPELKQFAADFLSLYCSNEL TTVSSP EQKVT LKRHSSPS LE EA
Sbjct: 132 MASDWGVAPCTEKPELKQFAADFLSLYCSNELQTTVSSPVEQKVTSLKRHSSPSHLEEEA 191

Query: 280 KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKAP 339
           KLPKK+HSI  PSNA+GEPDSSNALS GNK+SNFVVETGDTDSH P VLKAC+QKCN+AP
Sbjct: 192 KLPKKMHSIVDPSNAEGEPDSSNALSEGNKESNFVVETGDTDSHHPAVLKACMQKCNQAP 251

Query: 340 RSPYCLTECKTPGLSTANTCFQETPKSGSSTFSPGEAFWKEAIVFADGLCAPSIDLTNCD 399
            SP+CLTECKTPGLSTA T  ++TPKSGSSTFSPGEAFWKEAIVFADGLCAPSI LTNCD
Sbjct: 252 ISPHCLTECKTPGLSTATTFIRQTPKSGSSTFSPGEAFWKEAIVFADGLCAPSIALTNCD 311

Query: 400 AEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKELNRE 459
            E AN+ ESQS+TKKLPIP EPAQKRLKGQFG GSGGVRLGEPGAS+VSLRS+LKEL+R 
Sbjct: 312 GEEANLVESQSNTKKLPIPEEPAQKRLKGQFGVGSGGVRLGEPGASIVSLRSDLKELDRV 371

Query: 460 VSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTND-SLPNHNDKTR 519
            SSLPVKHFDFSADDKNLD +T P CASNES+VNAYDLNEQSD CYT   SLP HNDKTR
Sbjct: 372 ASSLPVKHFDFSADDKNLDENTSPCCASNESKVNAYDLNEQSDRCYTTHVSLPKHNDKTR 431

Query: 520 DSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHELRASTVHDFKEETTPS 579
           DSDSLTKEKI ET VTSSVPVVTEVKLNIFSPSDSITSDTA HELRAST+H  ++E TPS
Sbjct: 432 DSDSLTKEKIQETKVTSSVPVVTEVKLNIFSPSDSITSDTATHELRASTIHGSRDEMTPS 491

Query: 580 SSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTSAGK 639
           SS RHKDWLDL+CWLPPEI SIYKEKGITKLH WQVECLKVDGVLQRRNLVYCASTSAGK
Sbjct: 492 SSTRHKDWLDLTCWLPPEISSIYKEKGITKLHRWQVECLKVDGVLQRRNLVYCASTSAGK 551

Query: 640 SFVAEILMLRRVISTGKMALLVLPYVSICAEK 671
           SFVAEILMLRRVISTGKMALLVLPYVSICAEK
Sbjct: 552 SFVAEILMLRRVISTGKMALLVLPYVSICAEK 574


HSP 2 Score: 2600.1 bits (6738), Expect = 0.0e+00
Identity = 1419/2248 (63.12%), Positives = 1663/2248 (73.98%), Query Frame = 1

Query: 100  KFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVRE 159
            +FYASKKRK  + S+KSG  +KD K ++E SP AKGTLDNYL  SQD G++     S + 
Sbjct: 12   QFYASKKRKSRSPSVKSGRAEKDAKITVEVSPSAKGTLDNYLKNSQDDGHT-----SKQS 71

Query: 160  NLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVKL 219
             LS  ++VKRNL L+I+  S++E  +  LS      A  + I + + ++     +S V  
Sbjct: 72   LLSRHEVVKRNLSLEIDKYSKDEKNQALLSDQAQPQATQKVISRCSSKEG----NSEVGC 131

Query: 220  MAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGEA 279
               DG      E  ELKQF  DFLSLYCS E+H++ SSP E K+   KRHSSPSLL GE 
Sbjct: 132  HMKDGSAH-IPESLELKQFPTDFLSLYCS-EIHSSASSPSEAKLKDHKRHSSPSLLGGED 191

Query: 280  -KLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGD---------TDSHPPVVLK 339
             K+ KK + ++    +  +   SNA ++   QS F+V+TG+         TDS+  ++L+
Sbjct: 192  NKIAKKKYCVSNLLQSGEQTTCSNAKNIEETQSGFIVKTGNLVPNSSQRVTDSNASLLLQ 251

Query: 340  ACLQKCNKAPRSPYCLTECKTPGLSTANTCFQETPKS--GSSTFSPGEAFWKEAIVFADG 399
            A L+KC+K+ +S    T C TP  S   T  +ETPKS  G+S FSPGEAFW EAI  ADG
Sbjct: 252  ASLRKCDKSSKSTLNTTACYTPEPSIVKTYVRETPKSTCGNSIFSPGEAFWNEAIEIADG 311

Query: 400  LCAPSIDLTNCDAEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMV 459
              A +    +  AEG   ++SQ+              + K     G   V+  + G S+ 
Sbjct: 312  FFAHTDIGPSQIAEGIADSKSQNEINNSYNLRNKNYNKSKEMLNEGDSKVQHIKAGGSLK 371

Query: 460  SLRSELKELNREVSSLPVKHFDFSADDKNLDGSTLPYC-ASNESEVNAYDLNEQSDC-CY 519
             +  ++ +  +E+S LP+KH DF  +DKNL G T P C A++ SE   +     S+    
Sbjct: 372  QMGKDVIDSVKELSPLPIKHLDFLFEDKNLKG-TKPGCGAADTSEAMMFRDGVVSEKGSV 431

Query: 520  TNDSLPNHNDKTRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPS-DSITSDTAVHELR 579
            T+ S      K    ++   E I +     SV +V E KL+I S   DSITSD+  + ++
Sbjct: 432  THKSCQKIKFKCHHDNTSRTEGISDVQEKDSVLIVHERKLDISSQGIDSITSDSPTNVIK 491

Query: 580  ASTVHDFKEET-TPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVL 639
                ++  +E  TPSSS   KD LDLS WLP EICSIYK++GI+KL+PWQVECL VDGVL
Sbjct: 492  KPVGNEKSDEAGTPSSSGMLKDCLDLSSWLPSEICSIYKKRGISKLYPWQVECLHVDGVL 551

Query: 640  QRRNLVYCASTSAGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDK 699
            QRRNLVYCASTSAGKSFVAEILMLRR+ISTGKMALLVLPYVSICAEKA HL+VLLEPL +
Sbjct: 552  QRRNLVYCASTSAGKSFVAEILMLRRLISTGKMALLVLPYVSICAEKAEHLEVLLEPLGR 611

Query: 700  HVRSYYGNQGGGTLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTR 759
            HVRSYYGNQGGG+LPKDTSVAVCTIEKANSL+NR+LEEGRLSEIGIIVIDELHMV DQ R
Sbjct: 612  HVRSYYGNQGGGSLPKDTSVAVCTIEKANSLVNRMLEEGRLSEIGIIVIDELHMVADQNR 671

Query: 760  GYLLELLLTKLRYAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWL 819
            GYLLELLLTKLRYAAGEG  DSSSGE+SGTSSGK+DPAHG+QIVGMSATMPNVAAVADWL
Sbjct: 672  GYLLELLLTKLRYAAGEGTSDSSSGENSGTSSGKADPAHGLQIVGMSATMPNVAAVADWL 731

Query: 820  QAALYQTDFRPVPLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEG 879
            QAALY+T+FRPVPLEEYIKVGN IY++ +D+VRTI   ANLGG+DPDHIVELC+EVV+EG
Sbjct: 732  QAALYETNFRPVPLEEYIKVGNAIYSKKMDVVRTILTAANLGGKDPDHIVELCDEVVQEG 791

Query: 880  HSVLIFCSSRKGCESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLE 939
            HSVLIFCSSRKGCESTA+HVSKFLKKFS+ +H+ +SEF DI SA+DALRRCP+GLDPVLE
Sbjct: 792  HSVLIFCSSRKGCESTARHVSKFLKKFSINVHSSDSEFIDITSAIDALRRCPAGLDPVLE 851

Query: 940  ETFPSGVAYHHAGLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGR 999
            ET PSGVAYHHAGLTVEEREVVETCYRKGL+RVLTATSTLAAGVNLPARRVIFRQP+IGR
Sbjct: 852  ETLPSGVAYHHAGLTVEEREVVETCYRKGLVRVLTATSTLAAGVNLPARRVIFRQPRIGR 911

Query: 1000 DFIDGARYRQMAGRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGM 1059
            DFIDG RYRQMAGRAGRTGIDTKGES+LIC+PEE+K+I  LLNESCPPL SCLSEDKNGM
Sbjct: 912  DFIDGTRYRQMAGRAGRTGIDTKGESMLICKPEEVKKIMGLLNESCPPLHSCLSEDKNGM 971

Query: 1060 THAILEVVAGGIVQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDT 1119
            THAILEVVAGGIVQTA DIHRYVRCTLLNSTKPFQDVVKSAQ+SLRWLCH KFLEWN DT
Sbjct: 972  THAILEVVAGGIVQTAEDIHRYVRCTLLNSTKPFQDVVKSAQDSLRWLCHRKFLEWNEDT 1031

Query: 1120 KLYSTTPLGRASFGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWE 1179
            KLYSTTPLGRA+FGSSL PEESLIVLDDLSRAREGFVLASDLHLVYL TPINV+VEPDWE
Sbjct: 1032 KLYSTTPLGRAAFGSSLCPEESLIVLDDLSRAREGFVLASDLHLVYLSTPINVEVEPDWE 1091

Query: 1180 LYYERFMGLPSLDQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLM 1239
            LYYERF+ L +LDQSVGN+VGV+EP+LMRMAHGAP+R ++  R+                
Sbjct: 1092 LYYERFLELSALDQSVGNQVGVSEPYLMRMAHGAPMRISSKLRDST-------------- 1151

Query: 1240 RMAHGAPIRRANISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQE 1299
            +  HG    R  I+ N ++                 S+ QT+RVCKRFYVALILSRLVQE
Sbjct: 1152 KGLHGKLEYRLGITSNNML-----------------SDAQTLRVCKRFYVALILSRLVQE 1211

Query: 1300 TPIPEVCEAFKVARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRA 1359
            TP+ EVCE FKVARGMVQALQE+AGRFASMVSVFCERLGW+DLEGL+AKFQNRVSFGVRA
Sbjct: 1212 TPVLEVCETFKVARGMVQALQENAGRFASMVSVFCERLGWYDLEGLIAKFQNRVSFGVRA 1271

Query: 1360 EIVELTTIPYVKGSRARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLF 1419
            EIVELTTIPYVKGSRARALYKAGLRTPLAIAEAS +E+VKALFES+SW AE         
Sbjct: 1272 EIVELTTIPYVKGSRARALYKAGLRTPLAIAEASISEIVKALFESSSWIAE--------- 1331

Query: 1420 VCADSGQQVCIESTAQKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQI 1479
                          AQ+R+ +G+A+KIK+GARK+VL+KAEEARIAAFSAFKSLG  VPQ 
Sbjct: 1332 --------------AQRRVQLGVAKKIKNGARKIVLEKAEEARIAAFSAFKSLGLNVPQF 1391

Query: 1480 SRPLSASADGNITA-QVAASIPSEIDTLNRVVSTRQMEHALTKSCFGGTSSSEKV----G 1539
            SRP+ ++A  N T  + AA+     D  +  +    MEH+  K        S+KV     
Sbjct: 1392 SRPILSTATENSTGEEEAATTAPRNDKSSSFIFPVPMEHS-DKPSLEANQISKKVDLESA 1451

Query: 1540 GKNLSET----------GTISVEVKPPNFGVNPLV------------NVEGSAIQESNTV 1599
            G+ L ET          G    E++      NP V            NV  S I+  +T 
Sbjct: 1452 GEKLLETSDNELSALVEGGSITELQQKFDAENPPVPFVGPGTGGVEFNVNASEIKIPDTT 1511

Query: 1600 --VECAGKVDVTISNHME---RIAQREQHSSVLHPPKRDSSSMKGPIHAANTSGGFESFL 1659
              V+       TI+++ +    +  R      L    +D +  KGPI+A N SGGF+ FL
Sbjct: 1512 LSVQLGKNAIGTITSNRDLDLEVQDRPNRDPCL--VNKDRACNKGPINAINASGGFDCFL 1571

Query: 1660 DLWDASQEFFFDLYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDL---------- 1719
            D W+A+ EF+FD++Y K SE NS V FE+HG+A+CWENSPVYYVNLPKDL          
Sbjct: 1572 DRWEATHEFYFDIHYDKHSEANSGVLFEIHGLAVCWENSPVYYVNLPKDLWSDHRRKDRF 1631

Query: 1720 --LGPKSGKGLYPDDRTS---------GD----------------QVQVLKCPGVSIQKL 1779
               G      L P+ +           G+                Q+QVLK   VSIQ+ 
Sbjct: 1632 LIYGSSDKNVLTPEHQLEMIKQRWKRIGEIMEKRDVRKFTWNMKVQIQVLKHAAVSIQRF 1691

Query: 1780 GFLNSARRNMGLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEKEVKKRL 1839
            G LN    ++GL+ V  S+L+LS VH+ + IDMCIV+WILWPDDER+S PNLEKEVKKRL
Sbjct: 1692 GGLNLVGTSLGLENVGSSFLLLSPVHLKDGIDMCIVSWILWPDDERSSNPNLEKEVKKRL 1751

Query: 1840 SGEAASAANRSGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALNNIEIPL 1899
            S EAA+AANRSG+WKNQMRR AHNGCCRRVAQTRALCSVLWKL++SE+L+EAL NIEIPL
Sbjct: 1752 SSEAAAAANRSGRWKNQMRRAAHNGCCRRVAQTRALCSVLWKLLVSEELIEALLNIEIPL 1811

Query: 1900 VSILADMETWGIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANVLYGH 1959
            V++LADME WGIGVDMEGC++ARNLL KKL+ LEK+AY LAGM FSLY AADIANVLYGH
Sbjct: 1812 VNVLADMELWGIGVDMEGCLQARNLLQKKLRYLEKKAYTLAGMKFSLYTAADIANVLYGH 1871

Query: 1960 LKLSIPEGFNKGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLS 2019
            LKL IPEG NKGKQHPSTDKHCLDLLR+EHPIVPVIKEHRTLAKL NCTLGSICSLA++S
Sbjct: 1872 LKLPIPEGHNKGKQHPSTDKHCLDLLRHEHPIVPVIKEHRTLAKLLNCTLGSICSLARIS 1931

Query: 2020 ARTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDD-------VDHCKINAR 2079
              TQKYTLHGHWLQTSTATGRLSMEEPNLQCVEH V+FKM+ +D       VDHCKINAR
Sbjct: 1932 MSTQKYTLHGHWLQTSTATGRLSMEEPNLQCVEHMVEFKMSNEDIYGGNAEVDHCKINAR 1991

Query: 2080 DFFISTQENWLLVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDS 2139
            DFFI +QENW+L++ADYSQIELRLMAHFSKD +LI LLSKPHGDVFTMIAARWTG++EDS
Sbjct: 1992 DFFIPSQENWILLAADYSQIELRLMAHFSKDPALIGLLSKPHGDVFTMIAARWTGRSEDS 2051

Query: 2140 IGPHERDQTKRLVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTF 2199
            +G  ERDQTKRL+YGILYGMG  TL+ QL CS +EA EKI+SFKSSFPGVASWLH AV+ 
Sbjct: 2052 VGSQERDQTKRLIYGILYGMGPNTLSEQLNCSSNEAKEKIKSFKSSFPGVASWLHVAVSS 2111

Query: 2200 CRQKGYVETLKGRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMIN 2256
            C QKGYVE+LKGR+RFLSKI   N+KEKSKAQRQAVNSICQ      GSAADIIK+AMIN
Sbjct: 2112 CHQKGYVESLKGRKRFLSKIKFGNNKEKSKAQRQAVNSICQ------GSAADIIKIAMIN 2171

BLAST of Cla022227 vs. NCBI nr
Match: gi|596293486|ref|XP_007226676.1| (hypothetical protein PRUPE_ppa020963mg [Prunus persica])

HSP 1 Score: 2584.7 bits (6698), Expect = 0.0e+00
Identity = 1398/2234 (62.58%), Positives = 1652/2234 (73.95%), Query Frame = 1

Query: 99   SKFYASKKRKPLTSSLKSGSYDKDGKKSLEGSPGAKGTLDNYLVTSQDHGNSDIPSHSVR 158
            ++F+ASKKRKPL+  LKSG  +KD K  +EGSP AKGTLDNYL+ SQ++     PS+ V 
Sbjct: 10   NQFFASKKRKPLSPVLKSGRNEKDVKVKVEGSPSAKGTLDNYLLASQENNIISEPSYKVC 69

Query: 159  ENLSEQDLVKRNLLLKINSSSRNEHEEPTLSRGCDTSAATEGIKKRTLEDSYETRSSTVK 218
            ++L++QD V+RNL  +I++S ++E ++  LS    + A       +       T+   VK
Sbjct: 70   DSLAQQDQVRRNLTSEIDNSLKDEFKQLPLSSQLHSEANDVSQANQKETSRQLTKVGDVK 129

Query: 219  LMAGDGGVTPCTEKPELKQFAADFLSLYCSNELHTTVSSPGEQKVTFLKRHSSPSLLEGE 278
                    T   ++ ELK FAADFLSLYCS +L    SS  E KV   KR +SPSLL+ E
Sbjct: 130  EYPA---FTEGEDRAELKDFAADFLSLYCS-DLQPNESSLSEMKVNDHKRQASPSLLDRE 189

Query: 279  AKLPKKIHSIAGPSNAKGEPDSSNALSVGNKQSNFVVETGDTDSHPPVVLKACLQKCNKA 338
             K  KK H I   S+ + E   S+  S    QS+ V + G T  +  + L+  L+ C+  
Sbjct: 190  DKTFKKRHCITNQSHVEHETSYSSEKSSEAVQSDSVDKNGVTIVNELLELQPTLKACSNT 249

Query: 339  PRSPYCLTECKTPGLSTANTCFQETPKS--GSSTFSPGEAFWKEAIVFADGLCAPSIDLT 398
             +    + EC TPG  T  T  +ETPKS  GSS+FSPGEAFW +AI  ADGLCA +  + 
Sbjct: 250  AKLSLDMFECCTPGSLTRKTSVRETPKSTRGSSSFSPGEAFWDDAIQLADGLCAQAAGVI 309

Query: 399  NCDAEGANVAESQSHTKKLPIPGEPAQKRLKGQFGGGSGGVRLGEPGASMVSLRSELKEL 458
            +  A+G   ++S  + +     G+  +   +G+        R+G+ G +   +    K+L
Sbjct: 310  SV-ADGQYRSKSSCNLRNARCDGKSKEILDEGE--------RMGK-GGNTGPMGKHRKDL 369

Query: 459  NREVSSLPVKHFDFSADDKNLDGSTLPYCASNESEVNAYDLNEQSDCCYTNDSLPNHNDK 518
            ++EVS LPVKHFDFS +DKNLD S   +  +   +  A+   EQS+    +     +   
Sbjct: 370  DKEVSPLPVKHFDFSCEDKNLDKSVPHHLDAYNLKSVAHVGGEQSESSLIDPRGLRNPMM 429

Query: 519  TRDSDSLTKEKIHETNVTSSVPVVTEVKLNIFSPSDSITSDTAVHEL-RASTVHDFKEET 578
             R + S   +       T+SV  VT +KL++      +TS + V E+ + +  H+  E +
Sbjct: 430  IRCNKSQENQVTFRDQYTNSVNAVTNMKLDL--TGKDMTSYSPVDEVVKLTGNHESDEAS 489

Query: 579  TPSSSVRHKDWLDLSCWLPPEICSIYKEKGITKLHPWQVECLKVDGVLQRRNLVYCASTS 638
            TPSS V  KD LDL+ WLPPEICS+Y++KGI+KL+PWQV+CL+V+GVLQRRNLVYCASTS
Sbjct: 490  TPSSFVPLKDHLDLNSWLPPEICSLYRKKGISKLYPWQVDCLQVEGVLQRRNLVYCASTS 549

Query: 639  AGKSFVAEILMLRRVISTGKMALLVLPYVSICAEKAAHLDVLLEPLDKHVRSYYGNQGGG 698
            AGKSFVAEILMLRRV+S+G MA+LVLPYVSICAEKA HLDVLLEPL K VRSYYGNQGGG
Sbjct: 550  AGKSFVAEILMLRRVLSSGTMAILVLPYVSICAEKAEHLDVLLEPLGKRVRSYYGNQGGG 609

Query: 699  TLPKDTSVAVCTIEKANSLINRLLEEGRLSEIGIIVIDELHMVGDQTRGYLLELLLTKLR 758
            TLPKDTSVAVCTIEKAN LINRLLEEGRLSEIGIIVIDELHMVGD +RGYLLELLLTKLR
Sbjct: 610  TLPKDTSVAVCTIEKANFLINRLLEEGRLSEIGIIVIDELHMVGDPSRGYLLELLLTKLR 669

Query: 759  YAAGEGNLDSSSGESSGTSSGKSDPAHGIQIVGMSATMPNVAAVADWLQAALYQTDFRPV 818
            YAAGEGN +SSSGESSG SS K+DPAHG+QIVGMSATMPNVAAVADWLQAALYQT+FRPV
Sbjct: 670  YAAGEGNSESSSGESSGMSSCKADPAHGLQIVGMSATMPNVAAVADWLQAALYQTEFRPV 729

Query: 819  PLEEYIKVGNTIYNRSLDIVRTISKTANLGGRDPDHIVELCNEVVEEGHSVLIFCSSRKG 878
            PLEEYIKVGNT+YN+ ++IV+TI K  +L G+DPDH+VELCNEVV+EG SVLIFCSSRKG
Sbjct: 730  PLEEYIKVGNTLYNKKMEIVKTIPKATDLSGKDPDHVVELCNEVVQEGLSVLIFCSSRKG 789

Query: 879  CESTAKHVSKFLKKFSVKIHNENSEFTDIFSAVDALRRCPSGLDPVLEETFPSGVAYHHA 938
            CESTA+HVS+FLKKFSV I + +S+F D+  A+DALRRCP+GLDPVLEET P+GVAYHHA
Sbjct: 790  CESTARHVSRFLKKFSVNIRSNDSQFKDVTLAIDALRRCPAGLDPVLEETLPAGVAYHHA 849

Query: 939  GLTVEEREVVETCYRKGLLRVLTATSTLAAGVNLPARRVIFRQPKIGRDFIDGARYRQMA 998
            GLTVEERE+VETCYR+GL+RVLTATSTLAAGVNLPARRVIFRQP+IGRDFIDG RYRQMA
Sbjct: 850  GLTVEEREIVETCYRRGLVRVLTATSTLAAGVNLPARRVIFRQPRIGRDFIDGTRYRQMA 909

Query: 999  GRAGRTGIDTKGESVLICRPEEIKRINELLNESCPPLQSCLSEDKNGMTHAILEVVAGGI 1058
            GRAGRTGIDTKGESVLIC+PEEIKRI  ++NESC PL+SCLSED NGMTHAILEVVAGG+
Sbjct: 910  GRAGRTGIDTKGESVLICKPEEIKRIMGIINESCLPLRSCLSEDMNGMTHAILEVVAGGM 969

Query: 1059 VQTATDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHGKFLEWNGDTKLYSTTPLGRAS 1118
            VQTA DIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCH KF+EWN DTKLYSTTPLGRA+
Sbjct: 970  VQTANDIHRYVRCTLLNSTKPFQDVVKSAQESLRWLCHRKFVEWNDDTKLYSTTPLGRAA 1029

Query: 1119 FGSSLSPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDVEPDWELYYERFMGLPSL 1178
            FGSSL PEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVD+EPDWELYYERFM L +L
Sbjct: 1030 FGSSLCPEESLIVLDDLSRAREGFVLASDLHLVYLVTPINVDMEPDWELYYERFMELSAL 1089

Query: 1179 DQSVGNRVGVTEPFLMRMAHGAPIRRANISRNGVSVGNRVGVTEPFLMRMAHGAPIRRAN 1238
            DQSVGNRVGVTEPFLMRMAHGAP+R +N  R                M+  HG    R  
Sbjct: 1090 DQSVGNRVGVTEPFLMRMAHGAPMRSSNRFRE--------------NMKAVHGKYENRPG 1149

Query: 1239 ISRNGVVGLRTKRDEHGCMYDDRPSEEQTIRVCKRFYVALILSRLVQETPIPEVCEAFKV 1298
            I+ N V+                  ++Q +RVCKRFYVALILSRLVQE  I EVCEAFKV
Sbjct: 1150 ITNNTVL-----------------QDDQILRVCKRFYVALILSRLVQEAAITEVCEAFKV 1209

Query: 1299 ARGMVQALQESAGRFASMVSVFCERLGWHDLEGLVAKFQNRVSFGVRAEIVELTTIPYVK 1358
            ARGMVQALQE+AGRFASMV++FCERLGWHDLEGLV KFQNRVSFGVRAEIVELTTIPYVK
Sbjct: 1210 ARGMVQALQENAGRFASMVTMFCERLGWHDLEGLVCKFQNRVSFGVRAEIVELTTIPYVK 1269

Query: 1359 GSRARALYKAGLRTPLAIAEASDAELVKALFESASWTAEGELNKTCLFVCADSGQQVCIE 1418
            GSRAR+LYKAGLRTPLAIAEAS AE+VKALFES+SWT +                    E
Sbjct: 1270 GSRARSLYKAGLRTPLAIAEASVAEIVKALFESSSWTEQ--------------------E 1329

Query: 1419 STAQKRMHVGIARKIKHGARKVVLDKAEEARIAAFSAFKSLGFTVPQISRPLSASADGNI 1478
             +AQ+R+H+G+A+KIK+GA K+VL+KAEEAR+AAFSAFK+LG  VPQ  RP+ +S  G+ 
Sbjct: 1330 GSAQRRIHLGVAKKIKNGAHKIVLEKAEEARVAAFSAFKALGLDVPQFYRPVFSSGGGSP 1389

Query: 1479 TAQVAASIPSEIDTLNRVVSTRQMEHALTKSCFG-------GTSSSEK------VGGKNL 1538
            + Q A +   +  T +  +  R+ EHA   S  G          S EK      +GG   
Sbjct: 1390 SMQGAGNSSGDNSTSSFPIVERK-EHAAKPSLEGRVLSGKVALESREKLTKTSDIGGVAS 1449

Query: 1539 SE---TGTISVEVKPPNFGVNPLVNVEGSAI--QESNTVVECAGKVDVTISNHMERIAQR 1598
            +E   TG + ++  P N      V ++GSA    E     +     D+T    ++ +  R
Sbjct: 1450 AEVYSTGVMQIKFGPDN----STVPIQGSAALGDELKAAFDQNKNADLTDHVQLQSLGDR 1509

Query: 1599 EQHSSV--------------LHPPKRDSSSMKGPIHAANTSGGFESFLDLWDASQEFFFD 1658
             + S                L P  + ++  KGPIHA NT GGF+SFLDLW+ + EF+FD
Sbjct: 1510 NRVSDESFDLEKQERCKRVNLSPGFKGNACDKGPIHAINTLGGFDSFLDLWETTSEFYFD 1569

Query: 1659 LYYTKRSEVNSVVPFELHGIAICWENSPVYYVNLPKDLLGPKSGKGLYPDDRTSGD---- 1718
            ++Y KRSE+NSV PFE+HGIAICWENSPVYYVN+PKDLL   + K        SG+    
Sbjct: 1570 IHYNKRSELNSVAPFEIHGIAICWENSPVYYVNIPKDLLWSDNSKNECLHLNGSGNRSNV 1629

Query: 1719 -----------------------------------QVQVLKCPGVSIQKLGFLNSARRNM 1778
                                               Q+Q LK P V  Q+ G  N A ++ 
Sbjct: 1630 LPLDDMLEMARRRWKRIGEIMRKRGVRKFAWKLKIQIQALKSPAVHAQRFGCQNIAGKST 1689

Query: 1779 GLKLVDGSYLVLSRVHISNVIDMCIVAWILWPDDERNSTPNLEKEVKKRLSGEAASAANR 1838
              +++D S L+L  VHI + IDMCIVAWILWPD+ER+S PNLEKEVKKRLS EAA+AANR
Sbjct: 1690 CFEIIDSSLLLLPPVHIKDGIDMCIVAWILWPDEERSSNPNLEKEVKKRLSSEAAAAANR 1749

Query: 1839 SGQWKNQMRRVAHNGCCRRVAQTRALCSVLWKLIISEKLLEALNNIEIPLVSILADMETW 1898
            +G+WKNQMRR AHNGCCRRVAQ RALCSVLWKL++SE L EAL NIEIPLV+ILADME W
Sbjct: 1750 NGRWKNQMRRAAHNGCCRRVAQIRALCSVLWKLLVSEGLTEALVNIEIPLVNILADMELW 1809

Query: 1899 GIGVDMEGCIRARNLLGKKLKCLEKEAYRLAGMSFSLYAAADIANVLYGHLKLSIPEGFN 1958
            G+G+DMEGC++AR +LGKKL+ LEKEAY+LAGM+FSLY AADIANVLYGHLKL IPEG N
Sbjct: 1810 GVGLDMEGCLQARKVLGKKLRQLEKEAYKLAGMTFSLYTAADIANVLYGHLKLPIPEGRN 1869

Query: 1959 KGKQHPSTDKHCLDLLRNEHPIVPVIKEHRTLAKLFNCTLGSICSLAKLSARTQKYTLHG 2018
            KGKQHPSTDKHCLDLLR+EHPI+PVIKEHRTLAKL NCTLGSICSL +LS +TQKYTLHG
Sbjct: 1870 KGKQHPSTDKHCLDLLRDEHPIIPVIKEHRTLAKLLNCTLGSICSLGRLSVKTQKYTLHG 1929

Query: 2019 HWLQTSTATGRLSMEEPNLQCVEHAVDFKMNEDD------VDHCKINARDFFISTQENWL 2078
            HWLQTSTATGRLSMEEPNLQCVEH VDFK+ +D+      VD+  INARD+FI TQ+NWL
Sbjct: 1930 HWLQTSTATGRLSMEEPNLQCVEHMVDFKIRKDEKGSETNVDYYNINARDYFIPTQDNWL 1989

Query: 2079 LVSADYSQIELRLMAHFSKDSSLIELLSKPHGDVFTMIAARWTGKTEDSIGPHERDQTKR 2138
            L++ADYSQIELRLMAHFSKDS LIE LSKP GDVFTMIAARWTG +EDS+  + RDQTKR
Sbjct: 1990 LLTADYSQIELRLMAHFSKDSVLIEPLSKPEGDVFTMIAARWTGISEDSVSSYVRDQTKR 2049

Query: 2139 LVYGILYGMGAKTLALQLECSKDEAVEKIRSFKSSFPGVASWLHEAVTFCRQKGYVETLK 2198
            LVYGILYGMGA +LA QL+CS +EA EKI++FKSSFPGVASWL+EAV  CR+KGY+ETLK
Sbjct: 2050 LVYGILYGMGANSLAEQLDCSPEEASEKIQNFKSSFPGVASWLNEAVADCRKKGYIETLK 2109

Query: 2199 GRRRFLSKINSPNSKEKSKAQRQAVNSICQYFFFYWGSAADIIKVAMINIYSVI--GTDA 2251
            GR+RFLSKI   NSKEKSKAQRQAVNSICQ      GSAADIIK+AMINIYSVI  G + 
Sbjct: 2110 GRKRFLSKIKFGNSKEKSKAQRQAVNSICQ------GSAADIIKIAMINIYSVIVGGAER 2165
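
The percentages reported in each HSP summary line are the matched column count divided by the alignment length. The following is a minimal Python sketch for re-deriving them, assuming the `name = matched/length (percent%)` layout used on this page; `HSP_FIELD` and `check_hsp_line` are hypothetical names introduced here for illustration and are not part of any BLAST tool.

```python
import re

# Hypothetical pattern for fields such as "Identity = 1378/1596 (86.34%)".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Return {field: (recomputed percent, reported percent)} for one summary line."""
    results = {}
    for name, matched, length, reported in HSP_FIELD.findall(line):
        computed = round(100 * int(matched) / int(length), 2)
        results[name] = (computed, float(reported))
    return results

# Summary line taken from the first Cucumis melo isoform X2 HSP on this page.
line = "Identity = 1378/1596 (86.34%), Positives = 1422/1596 (89.10%), Query Frame = 1"
for name, (computed, reported) in check_hsp_line(line).items():
    print(name, computed, reported)
```

The "Query Frame" field is skipped because it has no `matched/length` pair. The same check can be run on any of the other summary lines to confirm that a reported percentage matches its counts.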

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
TEB_ARATH	0.0e+00	58.73	Helicase and polymerase-containing protein TEBICHI OS=Arabidopsis thaliana GN=TE...
DPOLQ_HUMAN	7.3e-157	39.57	DNA polymerase theta OS=Homo sapiens GN=POLQ PE=1 SV=2
DPOLQ_MOUSE	4.0e-155	39.90	DNA polymerase theta OS=Mus musculus GN=Polq PE=1 SV=2
DPOLQ_DROME	1.5e-138	35.56	DNA polymerase theta OS=Drosophila melanogaster GN=mus308 PE=1 SV=1
HELQ_HUMAN	3.8e-105	33.62	Helicase POLQ-like OS=Homo sapiens GN=HELQ PE=1 SV=2
Match Name	E-value	Identity	Description
A0A0A0LS46_CUCSA	0.0e+00	86.97	Uncharacterized protein OS=Cucumis sativus GN=Csa_2G375760 PE=4 SV=1
A0A067GWC2_CITSI	0.0e+00	63.12	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000107mg PE=4 SV=1
M5Y7D4_PRUPE	0.0e+00	62.58	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020963mg PE=4 SV=1
A0A0D2U7X3_GOSRA	0.0e+00	61.40	Uncharacterized protein OS=Gossypium raimondii GN=B456_013G218600 PE=4 SV=1
A0A068VD06_COFCA	0.0e+00	58.66	Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00009041001 PE=4 SV=1
Match Name	E-value	Identity	Description
gi|659088547|ref|XP_008445039.1|	0.0e+00	87.10	PREDICTED: DNA polymerase theta isoform X1 [Cucumis melo]
gi|778672103|ref|XP_011649741.1|	0.0e+00	86.97	PREDICTED: helicase and polymerase-containing protein TEBICHI [Cucumis sativus]
gi|659088551|ref|XP_008445041.1|	0.0e+00	86.34	PREDICTED: DNA polymerase theta isoform X2 [Cucumis melo]
gi|659088551|ref|XP_008445041.1|	2.4e-272	84.79	PREDICTED: DNA polymerase theta isoform X2 [Cucumis melo]
gi|596293486|ref|XP_007226676.1|	0.0e+00	62.58	hypothetical protein PRUPE_ppa020963mg [Prunus persica]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR001098	DNA-dir_DNA_pol_A_palm_dom
IPR001486	Hemoglobin_trunc
IPR001650	Helicase_C
IPR002298	DNA_polymerase_A
IPR009050	Globin-like_sf
IPR011545	DEAD/DEAH_box_helicase_dom
IPR012292	Globin/Proto
IPR012337	RNaseH-like_sf
IPR014001	Helicase_ATP-bd
IPR027417	P-loop_NTPase
Vocabulary: Molecular Function
Term	Definition
GO:0003677	DNA binding
GO:0003887	DNA-directed DNA polymerase activity
GO:0019825	oxygen binding
GO:0003676	nucleic acid binding
GO:0005524	ATP binding
GO:0020037	heme binding
Vocabulary: Biological Process
Term	Definition
GO:0006260	DNA replication
GO:0006261	DNA-dependent DNA replication
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0015671 oxygen transport
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0008150 biological_process
biological_process GO:0010468 regulation of gene expression
biological_process GO:1902749 regulation of cell cycle G2/M phase transition
biological_process GO:2000011 regulation of adaxial/abaxial pattern formation
biological_process GO:0009640 photomorphogenesis
biological_process GO:0009933 meristem structural organization
biological_process GO:1990067 intrachromosomal DNA recombination
biological_process GO:0051301 cell division
biological_process GO:0006261 DNA-dependent DNA replication
biological_process GO:0009733 response to auxin
biological_process GO:0006310 DNA recombination
biological_process GO:0007275 multicellular organism development
biological_process GO:0050789 regulation of biological process
biological_process GO:0044763 single-organism cellular process
biological_process GO:0006260 DNA replication
cellular_component GO:0042575 DNA polymerase complex
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004386 helicase activity
molecular_function GO:0005524 ATP binding
molecular_function GO:0003677 DNA binding
molecular_function GO:0003887 DNA-directed DNA polymerase activity
molecular_function GO:0020037 heme binding
molecular_function GO:0005344 oxygen transporter activity
molecular_function GO:0005506 iron ion binding
molecular_function GO:0019825 oxygen binding
molecular_function GO:0008409 5'-3' exonuclease activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla022227	Cla022227.1	mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001098	DNA-directed DNA polymerase, family A, palm domain	PFAM	PF00476	DNA_pol_A	coord: 1836..2245, score: 4.2E
IPR001098	DNA-directed DNA polymerase, family A, palm domain	SMART	SM00482	polaultra3	coord: 1983..2212, score: 1.7
IPR001486	Truncated hemoglobin	PFAM	PF01152	Bac_globin	coord: 59..98, score: 2.
IPR001650	Helicase, C-terminal	PFAM	PF00271	Helicase_C	coord: 851..1002, score: 1.
IPR001650	Helicase, C-terminal	SMART	SM00490	helicmild6	coord: 920..1002, score: 6.3
IPR001650	Helicase, C-terminal	PROFILE	PS51194	HELICASE_CTER	coord: 851..1043, score: 14
IPR002298	DNA polymerase A	PRINTS	PR00868	DNAPOLI	coord: 2090..2101, score: 1.1E-36; coord: 1998..2021, score: 1.1E-36; coord: 2195..2208, score: 1.1E-36; coord: 1942..1964, score: 1.1E-36; coord: 2053..2078, score: 1.1E-36; coord: 2112..2123, score: 1.1
IPR009050	Globin-like	unknown	SSF46458	Globin-like	coord: 27..105, score: 9.8
IPR011545	DEAD/DEAH box helicase domain	PFAM	PF00270	DEAD	coord: 622..795, score: 1.6
IPR012292	Globin/Protoglobin	GENE3D	G3DSA:1.10.490.10		coord: 55..98, score: 3.
IPR012337	Ribonuclease H-like domain	GENE3D	G3DSA:3.30.420.10		coord: 1602..1822, score: 2.
IPR014001	Helicase superfamily 1/2, ATP-binding domain	SMART	SM00487	ultradead3	coord: 605..820, score: 4.0
IPR014001	Helicase superfamily 1/2, ATP-binding domain	PROFILE	PS51192	HELICASE_ATP_BIND_1	coord: 619..811, score: 16
IPR027417	P-loop containing nucleoside triphosphate hydrolase	GENE3D	G3DSA:3.40.50.300		coord: 929..1025, score: 2.2E-23; coord: 850..892, score: 2.2E-23; coord: 593..810, score: 1.0
IPR027417	P-loop containing nucleoside triphosphate hydrolase	unknown	SSF52540	P-loop containing nucleoside triphosphate hydrolases	coord: 622..796, score: 5.88E-43; coord: 829..885, score: 5.88E-43; coord: 926..1016, score: 5.88
None	No IPR available	GENE3D	G3DSA:1.10.150.20		coord: 2007..2148, score: 5.1E-40; coord: 1338..1383, score: 1.
None	No IPR available	GENE3D	G3DSA:1.20.1060.10		coord: 1825..1952, score: 9.9
None	No IPR available	GENE3D	G3DSA:3.30.70.370		coord: 1953..2006, score: 6.4E-36; coord: 2158..2246, score: 6.4
None	No IPR available	PANTHER	PTHR10133	DNA POLYMERASE I	coord: 1607..2253, score: 1.3E
None	No IPR available	PANTHER	PTHR10133:SF35	SUBFAMILY NOT NAMED	coord: 1607..2253, score: 1.3E
None	No IPR available	unknown	SSF158702	Sec63 N-terminal domain-like	coord: 1254..1380, score: 1.18
None	No IPR available	unknown	SSF56672	DNA/RNA polymerases	coord: 1803..2247, score: 1.78E
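
The `coord:` ranges in the Alignment column can be turned into an ordered domain layout along the protein. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption, since the page does not say so); `COORD` and `domain_spans` are hypothetical names, and the sample rows are the four PFAM hits listed in the table.

```python
import re

# Hypothetical pattern for "coord: start..end" as printed in the Alignment column.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def domain_spans(rows):
    """Return (start, end, length, name) tuples sorted by start position."""
    spans = []
    for name, alignment in rows:
        for start, end in COORD.findall(alignment):
            start, end = int(start), int(end)
            # Length assumes 1-based inclusive coordinates.
            spans.append((start, end, end - start + 1, name))
    return sorted(spans)

# PFAM rows copied from the InterPro annotation table.
rows = [
    ("PF00476 DNA_pol_A", "coord: 1836..2245"),
    ("PF01152 Bac_globin", "coord: 59..98"),
    ("PF00271 Helicase_C", "coord: 851..1002"),
    ("PF00270 DEAD", "coord: 622..795"),
]
for start, end, length, name in domain_spans(rows):
    print(f"{start:>5}..{end:<5} {length:>4} aa  {name}")
```

Sorted this way, the PFAM hits run from the globin-like region near the start of the protein, through the helicase domains, to the family A polymerase domain at the end.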

The following gene(s) are paralogous to this gene:

None