ClCG03G010040 (gene) Watermelon (Charleston Gray)

Name: ClCG03G010040
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: ATP-dependent zinc metalloprotease FtsH
Location: CG_Chr03 : 16915743 .. 16950996 (+)
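
The location field above can be split into a sequence ID, start, end, and strand, which also gives the genomic span of the feature. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(text):
    """Parse a 'SEQID : START .. END (STRAND)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("CG_Chr03 : 16915743 .. 16950996 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 35,254 bases on the plus strand of CG_Chr03; if the coordinates were half-open instead, the length would be one less.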
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCAAATAATCTGTGTAAAATTTAGAAAGGGAACAACTGAGAAGAAGCCTTGTGTTGAGCCTACAGTGTTGTGGTCATTGTTCCTAAAAGAAGAAAGTCTAAAGGTTACATGTCGTGGTGCTCTATTTTGGCAAATGGAGTTGGAGCCAAAAGAGATGATCAAAATGGAAAAGAAGAACAGTGCTTCCTTCTGGGATTTTATTTATTTTCAGCTTTCTACTGGTGCCTCAAATCTAAGGGGTTTAATTTGTGTTTTTATTATTATTTCTTTGTGGTGATTTCTCCATCATGTCTACAATACGAGAGTGCTTCTGAATTTCTTATGGCTTACTTTTGTAATTTCATTATTAATGATTTCTTATTTAAACAAACATGAGATTGTAAGTCTCTCAATATAGGAATTTCAGATAAGAGAGGTGTCAACAAATAACAGACTGACTGTGCCTTGTTTTGCGTTATCAATCAGATTTCCTTCCATGTTTGAAGACCACAAGTTCAGTTGCAGTGTTTGTTAAACAATTTTAATGTCTCCTCATGGTTTAGCATCTTATTGCAGGGGGTCCTAATTGTTGGTGAGAGGGGAACAGGAAAGACATCTCTTGCACTGGCCATAGCTGCAGAAGCAAAAGTACCAGTTGTTACAGTTGAAGCCCAAGAATTAGAGCCTGGACTATGGGTTGGACAAAGTGCATCCAATGTCCGGGAATTATTCCAAACTGCAAGAGATTTGGTTGTTCTCTTGAATCTTTATTGTTGAAATATGCGTACCTAGACTTTCACATATTGGTAGATATATTATTGCTTTTATTTGTTTGATTATTAACAATTTCATTAACAATTTCATTAGTAAATGAATGTACAATAGGTAAGAATAAAACTCCAAGCCTAGAGTGTTGTAGAAGCTCTTCAATTAGCAATAGTAGAAGAAATAAAGCTATTGTAAAATAGCTTTAGCAGAATAACATCAGAAGGAACATAAAAAAAAAAAAAAAAAATCCCCCAGCTTTCTCATCATATTATTCTCAAGGTTATAAAATATTCTTCTATATCCTAGCAATCAAAATCTGCAGAAGGATTGTGCGAGCAACCACTACCCGAAAGGCCCTAGCCTTGCCCTAGAACTTTCTATTAAGATATCAGGCAGGTATATGAGAAGAATGAACAATTACCAAGGAGATTAAAAATGCTCTCTTATTGGTCAGCAAAACTGGAAAAATAGCAATTTAAAAGTTCTCTTGGAGTGCATTTTGCTCATAGTGGGACAAGGTTTCCCTGCTATTCCTTAATTTGGTTCTTTCGGGAATGTCGGGCAGCCTCTTCCCTATCTTTTAGTGCAGCTATTCTCTAGAGAGGAAAGCTAAAGTCCTTTGGGTTAATGCAACTAGAGCCATTGATGTGGTACGTTTGGCTTGAGAAGGATAGAAAAATCTTCAATGAATAGGAGAAGGATTTGACCTTGGGGCTCTTTATGTCAATAGGGCCTTTGTTTTTGTGCTTTGTAAATTCCTAACATCGTTTTTCTTCCTTTAAACAATCTACCCTATTCTCAGTCAAGTTTCAATATGCAACTTATGCATGAAATGATTGGAGTTAAAAGAAGAGGTAGAAAAAGAAGGGGAAGACAAGGAGGCACTGGCCACTTGGGATCCCACCATCACTGTATTGTCAAAGGTTGATATTGTCGCAACTTCATCGGCAATGCTTGTGATTGCTGTCGATTCGTCAAGCGTTGCAGAGATTGAAGGTTTGGGGAACCCCTCCGTCATCACCTCATTAGATTCGACCGTTCATGCCTCACTGGAGAAGAAAAAATTGTATGTAACGATGAAGAAGAAAAGGAGTGTTGGGCAGCAGCTATTCACGATGAAGAAGAACGGAAGTTAGGGACAAGGGCAGTTTTGGTATTTTCTTCCTACCTAATGGACACTTTTACCACATCGGGCCTAATGGGCCACACACGGCAAAATTTCAAGTTTTATTTTAAAGATGATGGCTC
AAGAATTAATTTTCATTTCAAGTTGAAGAAATTTTCATGGAGTATTTAACTTAATTCCAATTTGGCTTTGGGAAATTTTACACCAAGAATTTTCGGTTGGAGTCTATCACACTCAACGAGCTGAGAGAAGGAAGCTTCACTTTATCACCTCAAACTTGAGGATGAGAATATTCACTTTGTCACTTGGATTAATTTTATATTTTTACCTTCATGTTCGTTTTGAAACTCGAGGACAAGTTTTTTTTGGAGGGGTAGAATGATGTAGTAGGATTAGGTAAATTAGGTTTTATGGTCAAATCCTTAATTGATTAGGATTGGGATTAGTTTCCATGATTGATTAGAATTAGGATTAGTTTCCTTGGTTTATTAGGATTAGGATTACTTTCTTTGATTAATTAAGACTATGATTAGTTTGCTTTCCAATTCTCTATAAATAGAGAGATTGTCTTCTTGTTTTGATAACTTTTGATTCATTTGAATAAAATTCTCTCTTGATATTCATTAGATCACTTCAAAACCAAAGTGTTGATCAATCTGTTTGTAAAGCTTTGATTAAAGTTGTCAATGGGACTTAACCAATTTTATTGACTCTCCTAGAAAATGGACGACTTTTGAGGATTTTCAGTTGCTTTATGAAAAATGGAATAGCGAGATTCGGTCTTCTAAGATTTATAAAGGGATATGGGGGTTGAATTTTAATTAAAAATCTACCCTTAGACTACTAGTGTCGTGAGACCTTTTAGTCTTTGGGACATATTTTGGAGGTCTTGTTAGCATAGCATCTGACACTCTCAATATGATCATTTGTTCGGTGGCTCGAATTCAAGTCCAAAGCAATCTTTATAGTTTTATGCTTGCCATAATTGAAATTAATAATTAAATTAGAGGAAATATTTTCCTTCATTTGGGTGACATTGAAGTCTTAGATCCTCCAAATATAGTTTCTGATGCTTTATTTGTGAGTGACTTTCACATTCCAGTTGATATTTTTCATTTAAAACATGTTATGGAAGATGAAGGTATCAATCCTATAGTTCAGAACCTTGAAGCAGACTTTTATGAATTAATTTAGATTTTCCCCTCAAGCTCAAGGGATCCCTTCGAATCTTCTAGGATTGTGGAAAAATCTAAAGAGGTTAACCTATTAGCAAAAAGTTCTAGGTTGTCACGAGCGAAAGAGCTTGGGATGGAAGGAAATGTGGCACCAGACTTGTCTTTGCACACAAATGGTGGATTTGAGGAAGGAAACATGGTTGTTTCAAATCATTCTTTTGAGAATGTGGCACCAGTCTCTTCTTTGAACTGGAGCCCTACCCTTTCTTTTATACCAGAAAGCTTGGTTGTCCTGGTATTCCTGGAAAGTTAGAGATCAATACAGGTACTTTGGAGGAAGCCTGTACTCGGGTCATCACTTCATCATCAAAGCTAATTCAACCTCCTCCTCCTCCTCCAGATCAGTTCACTTATACAGAGTTCATTCTGCTAGGATCAAATGTTGTGTTTACTAAAGGAGTTTCAATCTCTTTCATCACCAAGCATCTACCTCCACTCAATCTATTCATTTAGGGGATTCCGATTTTGTATTAGAGGTTAGTATGAGCAGTGAGGAGATCTTATTGTTGAATATTCTCTCACGGAAGCTTTTGATAATCTTTTCTCAATTGATGATAGTAAGGTTTCAGATAAGGCTTTACTTTCAGTAGCTCCAGAGGATGAGGTTTAAGTCCCTTCTCAATCCCTTCCAAATTTCATTCCTTAATTTAGAAGTGTGGCTTTCAGATGGTAGAAATCAAATCTCAGTCACTGCAAATTAAGATTTAATTCAGGTTTTGGGTGGTGGTTGCAGTTTTGGCTTTGTTTCCCCTTTCTTCACTCCAAGTGGATTTTTGATTGCTCGTTTAAGACGTAAAACTAAGGCCTGTTTTCTTCTTCTTTGGGTTTGTGTTTTTGTCAATCATGCTTAGCGGGCTCTTCGTTATTATCAAAGTTCAACCCTTGCA
AGGTATTGCAGTAGAAGCTTTTCTTCTTTTTGTTTTGGTCTTCTCTGAGTTTAAGTCTTCTTGACTATGAGTTTCTCTTGTGTTATGTGTTTAGTTGGCTTGTTCTACGTTTTCTCCAGAATTTTAGTCTTTGATGTAATTTGGTTATAGCCTATTGGCATATTTGAAGTTTTAACTTTGTTTCCTCTTATGTACTTTGAGCATTTGACTCTTTTATTGTATCAATGAATAGTTTTGTTTCCTTTACCAAAAAATAAATAAATAAATAAATAAATGCAGCTTATGCATCCAATCTTACTTGGTTTGATTCTTTCTCGTTATATTATTGTGAATCCTTTTTAACCTGCAATATTTTCCTCGGATTTTATGCTAGAAGAGATTAGATTGTTAGTTTTTGCAATCTTCCTTGTCTAACTTTAGTTCTAGAGGTATCTCTAGGCACCTGTGATCATATTTGTGGAGGATTTTGACCTGTTTGCTGGAGTTCGTGGCAAGTTTATTCATACCAAAGAACAGGATCACGAGGCTTTCATTAACCAACTTCTCGTGGAGCTTGATGGGTAAGCGTAAATTTCTTTTAGTTATTTTGACCATCCAGTGTTCTAGTTCTCTTCCTCTGGATGTCTTGAAACATAATAAAACCTATCCATGAGATAACTGGCACTTATGTATTACAACAAAATGGAAGTATAATATATAGGGGAAATGAAATGAAGCAAAGTCATTCTAGAAAATTCTATTTGTAGAGAGGAAAGAAATTGGTTCAGCTTGTGGGTTTTTTTTTTTAATTCTTTTATTATTATTTTTTTAATTTTTATCTACAGGAACACTTTAACCCTATACAGTCACAACCTCAAAAATGAGACGCACAATCCTGTGGTTTTTAATGCAGGAGGCATGTGGGGTGGGAAGTTTGGACGCATAGGAACCTTAGGATATCCAAATGTAAATCTAAAGGTGGGAAGGGGTGTGAGATTTTAGCGAAAAGGTTGGCTTCATGTTGTTTTGGTGTGCTGTCCTTAAGGATTTTAATTCTTAAAATTCTCTTGGTCTTTGTACCAAGCAGAGCCCCTTCTCAAATTTCAACAAGAGAGATTCTCAGGTCATTCAGTTTCTTTTTGTTAAAAGTAGACATAACTTTTCATTCATTAAATGAAAAGAGACTAATGCTAAAAAATACATTAAAATTCAAGAGGAGAGAGGAATTGAGAAAAAAAAGACATACTATTGAGCTAAGAAGCACTGGCACTAACACACGACATGAATACCGACACGGCGACATGCCAATTTCCAAAAAAGTAAAACACGACACGCTAGGACACGTTATATTTTTTCTTTACAAAAATAAATATATGTTGTGCATATGTTTATTGTAATTAATTAGTTTAACACAACAAAACTACAGAATAGAATCAAACAAACCTTTACATATTTTAAATAAACAAGCCTTCTAAAGGTCTTCTAATTTTAAATAAAAAAATGTAAAGAAAAAAAAGGAAAGATTTAAGAAAAGGAAAAAACAAACTAGAGAAAAAGAAAAATAAATAAAAAACAGAAATAGATTTATTATTAACATTTATTCAGTACTACACGCCAACATTTATTGCCCACACCAAACCCTTTACGAAAACTGTCATCTCTGTACTTTATGTCTGAAACTTGCCAGCCTCCTTCTGACCTTTCCCATTCTATTCCATTGGCTGTCACATCAGAAGCCCCCCCTTTTGCAAGAAACTGATTTACCTAACAAAAGAGAGATGGGCTGGGCTTCGCTTTAAATGTCCGATTTTATTGGTTTTTTTCTTCAGAAAATTGCAGCTTGCCGTGTCCTGGAGGTGTCTCCAGCCATGTCCCACCATGTCAGAAATAGAAAATTTATTAATAAAATAATTGACATGCCAGCTAGCGTGTCGGTGTCAGACACGTGTTTGACACCAACACTAGGCCATTTAGGAGTGTCCGTGCTTTATAGCTATTGAGATACAAAGAAGAGAACATAC
AAAGAAGCAAACATTTAAAACCCAACAAACTAACTACAGAAAAAATACCAAAACTAGAGAACACATAAACTTTAAAGCCAAAGTCAATTGAAGACCATTGAGCAGACAAAGAAAGACGTCCTTTTAGACCAAAAGTGTATGGATCAAACCACAAAGGAAACTAACTTCTAAACTGTCGCTTCCAAGCTTGCAACTACTTGGAAAAAATGCAACTTCTTAAGAGTCGCAAAACCGCGAGAATTCCAGACTTAAATTGAGAGTTCAAAAATAAAATCTTAGAAAGGTTTGATAAATGTCGGCCAATTGTAGTAAATATCCTAAATGGAATGCACTTCAAACTTCTTGGAAAGATAGCACCATGAAGAAGATTTTAACCGAGCTGAGTCAAATATGGATGAATAGTATTGTGAAGACTGCTGAAAGATGTGATGATTCCTTTCCATCCAAACCTCATAAAGCAAGCTTTAACTGCATTGACCCATAGCAACTAAGCATGAGAGTTCCATCTTACGCCACAAAGGAGTTGAAGAACATTATCCTTCCATGAATTGGAGAAAACCCAACATATATTAAAGGCTGGAAATAACAGGAACCAACAATCCTGAGAGAATCCACATTCAAAGAGGATGTGGCTAATAGTTTCCCCTACCTTTAACACAATGAAGACATGATGCATAAGCGTAGTATCTGGAAGTTTTGTCTGAAGGACATTAGCTGAATTAAGGCTTCCATTAAGCATGATCCACACTAGAATATTTGTTTATTTTGGGCTCTTAGTCTTCCAAATTGCATTAAAAGTGCTGCTGTCCAGATTAGATGAAGAAACCAAGTGCTTTGTTAAGGATTTAACCAAGAAGTTACCAGAAGGTTCAAGTGACCATAGGCGAGCATCTTCATTGTTGTTTAGAGAAATATTAGATAAAGACTGCATCAGATTTTGAAATTCTAAAATCTCCCCATCCTTAAAAAGTCTTCAGAGAGTAATATCCCAAGAAGAAGTGGAAGAATCCCAATATGAACGAATCAATTCCTTTTTATTAGAAGCAATAGCAAAGAGCCGTGGAAAGATATCCTTCAAAATAGAACCATTGAGCCAAGGGTCCTTCCAAAAGAAAATTCTCCTTCTGTTTCCTACTTTGAAATGTGCAAGGCCTTCAATCTTCTTCCATGATTTAGATATACTTATCCAAGGACTTCTAAGACTTGAATGCCCATTTCCAAGAGTATGCCAAACAGAATCATCAGTACCATCGATGCTAAGAATTACCGCTCACCAAAGAATCATGTTCCTGAGTAAATCCCTAACCCCATTTATCTAGCAAAGGCAAGTTGCTTTGAGTGAAGTCCCCAATTACAAGATTTCCTTGATCCTTTGCCTCATAAACCATGCTCCATCTTACAAGATGATTCAGTTTGCTGCCTTCGGAACCTTCTGTGAGACCCCCAATTTGAAATACATATTATAGGAAGGGCAAGAAGGGTATTTCAAAGGGAGAGAAGGATAATGGGACATGAATGGAAAATGGTTAAGAAGTGGGGACCGCCGGGGGACCTTGAGGTAAGTTTTAGGAAAGTGGTTTGCGAATGAGGTAGGGCATTGAGGAATTTGTGCATTCTGAAGCTGACGGCTGAAGGAGGAGGAAATTCCAGCTCTCTTGAATTGTGCTGGGCTCGTTCTTATTTTCTTTATTTCTTCTTCTACTTTGTTCTTAGTTCATATTGTGACCTGAGCATCTTGATCTTTAATGTATTCTGTCATATTGGATAACCCAAACAGAAGACAACCATCTCTGTTTGGTTTGGCTGGGCTCTTGTTAGGTCTATATTAAGGATGGGAACCTAACACCTTCCCAAAAGAAGCTTCAAGTCATGCGTTCTAGCTTGAAACTAACTGAAGAAGATTGGAATATTGGCTAAAACAGAAGTGCACAATGCTAAATGGCCACCTCTGGAAAGTGTGAAGCGTCCAACTGTCGATTGTCATCTGAAACTTGGCTATG
ATAGGTTGCCAAAACCCTCTTCTTTTGGGATAGACACCAAGAGGTAAACCAAATTAAATAAAAGGAAGAGATTCCACCGTTCAATTTGATCTTGAAGAGTAATGAAGAAACTCCGAACCATCTACATTTAACCCGACTAAGGCTGATTTTTCCCAATTTATCTTTTGGCCCTAGCACCATTGATCTAGCCTAAATGCTAAAAAGGAGAGAAATCTCCAAGAAACCAACAAAGTCTTTATTATGAATGAAAAGATGATCTAATACTAAGAGGGCTTTCTCTATTTATAGAGAATTACAAGAGTTCTAAAAGAATAACTAAAAGAGGATGAAACTAATTTGGTAATTAAAAGGTAACCAATATTAACTTGATTTTACCAAAATATCCTATCCTATTACATCATATTCTATCCCTTTCAAAAGAAACTTGTCCTCGAGTTTTAAAACAAAAATGAAGGAACAAGTAAAAAAGTAAACCGTTCGAACTCGATACTAATAACAACCAATTTTTTTGAGTCAAGCCAATAAATGGAACATTTATCTTTCCTGTGCAAATTTTTTGTAAGATCTTCACCAGTTTTTTTTTTTTTTCTTTTTGAAAAGGAAACGATCTCTTTCATTGATATTATGTAAAGAGATAAAATCTCATGGTACAATAAAGATTATAAGATTTTAGCTGAATAGATACAAAAGAAACAATTACAACCAAAAGAACTAAAGCTGAAGAAGAACAAAAACATGCCAGATCTTCAACAGTGACCTTTGGATGGTAGGATTCTCAATTGGTGGAAATATATTCTTAAAATATTTGGAGCCTTCATTGAAAGTCTGAAAATTTTTAATCTTTAATTATGGGTTGAGAATGGAAAATTGAGTGGTCTTTGGGAGATGGTTCACGCTGGTCAGTCCACATGGGTCATAGGTTGAAAAACCCAAAATTTTATTAAGAGGGTATTGGATTTTCTTGGTTGGTTCAGGGAAAAAAATGGGTGGAAGAATTGTAAAAACATGTATGGAACCGGGTCTGGGGGTTGGTCATTTGGGTCGGGTCAGAAATGGCAATTAGGAGAAGATTTTTACGAAAACATTCACCTCGTCTTCGTCTTCTTCTCTTTTGTTTCTGGCTGCAAATGGGTTGTAAACAACATTCCAATGCTTTTCTTTGTCAACCACACGATTCCCTTCTTCCTCCTTCTCCACGAACGACGTGTCTTTAATCTTCTTAGTCAGATCTTTTGGTTCGACTTCTTCTACACTGCCGGCGAAGTACCCAGAATTTTCCAATTTGACATATGCAATCGAATTTTCCGACTCCACTTCGTCTTCTTTTGGTTTAGCCACGTCTACTTCATCAGTGAAAAACCCAAAATTTTTCGATTTCACGTCATTGACAAAAAAACCCGAGATTTTTCGATTCGAGATATTTCATCTTCCACGTTCCTTTGCAATTGATTGTAACATCGCCGATTTCACCTTCTTCGTCTAGCATTGGCAATTTGGTTGCTCCCTTTTTTTGGTTGTCGACGATTTTCACTTCAACGTTTAGAGCGCCGATCACGTTTTCTTCCATGTTTGAAAATTTGTGTGTGTTTCTCAATTTTTCCTCAATTTTCCACGTTGATTTCTCTTCCGAATTAGGTTGCGCAAGCTCCAATTTGAAAATAATTTCGTTAGATTCTTCCTCAAAAGGAATTTCTTGAATTAATCCCTTTTAAGTTCTTGCCAATGTTCCATGTTTTGAGTCATCTCTCCCAAAAAATGTTGGATTTTCTGGACTTCCTTCTTCATTTCTTTCAATACTTTGGGATTAATGTACAAATTGGGTGTTCTTGAGTAAGCTTCTTCATCATAATTGTTGTAGGACCGATTCTTGAACTTTCTCGGACATGTTTTGAACTTTCTCGGTCGTTCTTCATAGCATGAATAATAATTTATGTCCTTATGCATTAGTTCTATAAAATTCCATTATTGGATGATTCTTCCAATTTCTTCTCTACCTTTTT
CTGAATTTCAAGGGTTCCCCCTAGTTAGGTGTGTTGTTCTGTTCGGTAATAAGCACAACTCACGAAACTTCCTCTTGTGACAAGGGTATTTCAGACTTCCACTGGCAGCAGTCAGCTTGGCTCAAACCAGATGCTTTGATACCAAATTGATGTAATCTAATGCTAAAAAGGAGAGAAATCTCCCAGAAATCAATAGTCTTTATTACGAATGAAAAGATGATCTAATACAAGGGGCTTTCTCTCTTTATAGAGAATTACAAGAGTTGTAAAAGGATAACTAAAGGAGGTTTTAACTAATTTGATAATTAAAATGTAATTAATCAAGGATTAACCAAGATTAACTTGACTTTACCAAAATATCCCCATCCTATTACATCAACCACTGAAAGAAATCAAAAGTTTTTAAGAGAACCTCGAACATTTTCTCATTGTACGTACAAAAAAAATTAATGTGTCATCAGCAAATTGCAACAAAGGGACAAGGATGTTGTCTTTTCCCACCATAAAACCATCGTACACTCTTCTGGAATAAGCATTTTCACCCCAAATATTGAAGAAGATGTCTCTAATGCAACAGTTTTTCACATCTATGAAGGCAAAGCTAGATCTGAAACGAGCATAAGATTCCCAACCAAAACCGAGAAAATAACAAGGAAAATTGAAAAACAATCAGAATCTGGTAATGAACGTTGATGAAGGGCAGTGGCGAAGACAATGAATGACAGCAAACAGATAAAGGCAAGAAAAAGAAATTGCGAAAGAGGACAGTGATCAAGGACAAGAATTGAAATCGCAAATCCGAGAAGCTGCCATGAAGACAAGAAACTGAAATCAAGGACGAGAACAGAGACAAAAAGGACTGGCAGCTGTGCAACTATGTTCTGTGACAACAACGCAACTATGCCCGACTACAAAGGCGGCTAGGGATAGGTTGATTGGCAGTAGCCAGGGTTGAAACTGAAAGTTTCCTTTCTTATCAAAAGGAAATTCCCAGCTACTTCCTCTTCCACATTATTCTTGATCTTCCATCCTTGGCAAATTGACATGCTCTTCTGAAGTCTGAATCTATCATAGGAAAAATTCTGTAACACCACAAAAGAACATACAGCTCCACATTTTCCTATCCTTTCATATTACTTTTTCTATCTGTTCATATTATGAAAGGAAGCCAATTTTTTGTGTTCACTTTAATTAGTACTGGATAACAAAGCCACAAACATGAAAAAAACCATCTTATAGCATTTCAAACAATTTGAAGAAGTGGAGAATGTGGATGGTGAAATGGCATACTAATGACAATAAAAGTAAAACGAAGGGCTGAAAGATAAATATAGAACAAGAGAGATTCTGAGGTCATTCAATGAAATGTTTCTAATCAAAAGAGAGAAAAAACCACTTATGCCTTCCATTGCAAACTACAAGTTGCATACGTAAGCATTGTCTGCACAATCTTTTGGAAAAGGTGACGTTTGTAATGGTTCATCTTCGAAAGTTAAGATTGCCTATAGGTACTCGCGACTAGTGTTATCAAACTTGTACCATACATGAGTTGTATTGTACTAGTTTGCTCAAGAGATTCAGATTGTGCCTCTGAATTGAAAATGAGAGGAAACATACAATGCAAGGATCTAGTTAAAAAATATCAGTAAAAAGAAAAGAAAAAACATGCTGGCGGTTAGCATGGACAAAACAACCTTAGAAGGAGATGCATGTTGGCGATTAGTATGGATGAAAAAAAGAACATGTCAAGAAAACAAGAGGAAATGTTTGAAGAATGAAAGGAAAAGAAAAGTGAAGTGTTGAAGAAGAAAGAATAGAAAATGAAAGGGAAAAAAAAAAGGAAAACGAAAAGCCTCGAGTCCTAACAAACATTGAAGTCTGAGTATGGAATAAAAAAAGGAAAACTAAATACTATCTAAACTAAAGTCCTTTCTCTCATACACGTGCCATTGGGTTAAATGGGTTAGTGTTGAGTTCTTGACCATTTATTTACTTTTATT
TGTTCATTATTGTTAATTCATGTTTTCTTTCTTAGAGTATTTTGAGCTTTGAAGTTAGTTGACATTTTCATACTTATTTGACCTTATTTAATGGCTCTTTATTTCTCTATTTTAGGCTTCAATTATATTAATATTTTAATGTTATTACCAAATTGAAATGTCTGATACGCGATACATTACGATATACAATAACCTAAATTTGTCTGTTTTCAATTCGATTTTAATAAAGAATTCTCCTAGATTTAGTGCTACACTAGTACAGGCCTCATTATTTCCTCTTACAGCATGTTGAGCATCTATGTATAGGTTTGAGAAACAAGATGGAGTAGTTTTGATGGCTACCACTCGGAATTTGAAGCAAATTGATGAGGCTCTACAGCGGCCTGGGCGGATGGATCGAGTATTTCATCTTCAAAGGCCAACTCAATCCGAAAGAGAGAAAATACTACAAATTGCTGCGGAAGGATCCATGGATGAAGAGCTCATTAATTATGTAGATTGGAAAAAGGTAATTCGTATCCATCCTTCTTCTGTCCGTAATATGGCAAGTGTTTGCACTGAAGCCTTATATGGCGCCTCACACTAGTAAACTTTAAGGACGATTGTTGCTGATAATGAGGGTCATGGAACTGGTTAATCGCTTGCCTACGATGTTTGCTGATTCAGGACATACTTTTTGATGGTTTGATTTTAGTGGTTTCTTGTTCGAAATTTGATCTCAGTCATTGATATATAGTTTTATGTAGTATATTTAGGATTAATCTTGTAATTTTAGTTTGAGTTTGTTAATCGTGGTTTTTACTTCAGTTTAGTTGCCAATTGGTTTGTTGATTATTTTGTTAATAACTGGTTTACTTAGGGGGGTTAAGAGGGTCACCTCTGATAAACATTTCAGCTTCTTAACGAAAAGTTTCACAATTTCTTACCACAAAAAACAGTGAAACATGTAGACGGTTGATCGATGAGTTCCTTCTCCACTCCCTTAAAAGAAAAGGGTCATTTTTGTGGCAAGTTAGGTCAAGGTGTGTGCGATTTTGTGGGGCGTTTTGGAGTGAGAGGAACAATAGAGCTTTTAGAGGGACTGGACGTTCTTCGAATGATATTTGGTCTTTTATTAGATTTAATTTAAGTTTCATTGTGGGCGTTGGTGGTGAAACTTTTTTGTAATTATCCGTTAGGTCTTATTGTACTTGATTGGACACCGTTTTCATAGTTGCTCTCCCCTTTTTGCGGACATTTCTTGTTTTTGTCCTTGTTGCGGACATTTCTTGTCAACACTTGTCTATAAAAAATTGTTAAATTAGTATCTAACACTTGTCTACTTTTTTAAAAAGAATGAAACAAAACTTTTCATTATATGACAGAATAAAAGAAGACAAACCCTACTACAAGAAAACATAAGAACTATGTTGGAAAAACATAGATCTTGAATAGTTCTGCAAGAAGCTTGGAAAGAGAACATCATGAAGAATTAGTCCAGGTGGCTTCAAAAAGACCGAACCAATTAAGAGACTTGTCTTGAAAGACACACCAATTTCTTTCAAACGAAATTTATGCAAGACTAGTTTTAACCACATTTGACCGAAAAAATTGAGATTTTTTCAAAGAATTTAAACAATGTCAACCTTAAATGAATCACCAAAGACCCATTGTATGTGAAAAATGGAGAAGAACAACAACACTAAAAAAAGAGATGTTGAAGTCCCATTATCAGCCCTACAAAGCAGACGCATAGAAGGAGACAAACAATGAGATGGAAGCTTCCTTTGGAGATGAGCAGTTCAACAACCAAAATATCGTTATCAATATGGGAATATTAACATACTAAAAACTTTATCAATATGAGAATATTGCCAAAAGATCATGCTCAACATCTTTTCTTTCTTCATTATATATGGGATTTTGCCGCAAGTTGTTTTCAACCTTCAAGCTTCAGTGGGTGTTTTTTTGACATTTTAAAGGAAAATGTCATTCAGATCCTTGTGGGTCCTTCCTTGAAG
CCTCAATCTCAGTTGTTGTGGACCTATGGTCTAATGTGGTTGAAGTCATCCTATCCGAAATTTGGTTTGAAAAGAATCAATGTATTTTTTAAGATACGTTTCTACCAAGGTCTAATGTGCCCATAAAGTGTCCAAGAAGTTGGAGTGTCCAACTCCAACTTGCACCGGACATGGATAATTCTTAGGTTCTATCATTCATTTTGTATTTTTTAATAACTTACGAATGGGAGTTTTCAACTTTGTTGACTGCAGTAACTCTTTAAATATCAATGAAAAGTTTGGTTCTTTATGAAAAAATAAATAAATAAAATAGAATTGAAAGATTTCCTTTCTGGAAAAACTGCACTTTCAATACCCAAAAAACCAACTGGATTCTAAGATTTTGGTTGCTGGGAAAACATGTACTCTTTAAGGTTGTTGAAATGGGAAACTGCGCTTTTTCCTGAAAAGAAATTGCAGGTTTTTTCTTCTCAACCCTCTTGGGAGGTTGTTTTCCAGTATCTATCAGATAAAAGTTGTTTCATGTAATAAAATAGAGATTAATTTAAAAAACATGCTTGTGAATGTGTATAAAGGCCCTAATAGTCAAAATATATACAACAATAAGATCCTTTGAAAATTTTCTCTATTTTCCTTCTTTATTTCTTGATATCCATGAGTTTACACGCTTTGACTAATCTCATGGGATTGTTTTCCTTCTTAATTACACTTTAATTTATTTATTTTTGGTCACTTATTGGGATATTTTGCTTTTAGGTTGCCGAGAAAACATCCCTTTTACGACCGGTGGAACTAAAACTTGTCCCACTTGCCTTGGAAGGAAGTGCTTTTCGGAGCAAATTCCTTGACACTGATGAACTGATGGGCTACTCTAGCTTGTTTGCTGTAAGAATTACTGCTTTCTTCATTAAAATAATATTTGTTATTATTATTGTTATTGCCAGAGACAAATTTATTAATTTTTCTTGCTCACTAGTGTGTTTTATGTTTTAAAATCATTATTGTCTTCGGGTATCTCATAAAATGATGTAATAGCAAGCTAATCTCGCTAAATCCTTGATTAAGTACATTTTAATTACCAAATTAGGTACTTAGGCTACATCAATAATGAAAGAGAGAAATCTCCAAGATCCCAAACTAAAGTCTTTTATTATGAATGAAAAGATGAGATACAATGGGAGGAACCCTCTATTTATAGAGAATTATAAGTGTCTTAAAGGATAACTAAAAAAGGTTAAAAACCTAATTTGGTAATTAAAATGTACTTATCAAGGATTTACCAAGATTAGCTTGAAATGGCTAAATTATACCAAACCTATTACATCATAAAAGAAAAAAAAAACCATAGTTGGGTTTGTCTCATTGTGTGTCACGTCATTGTGGACAAAACTTTCTTTATTTTTTGATTTTAGGAAAGAAAAGGTAATGGTTCTGTTGGAAACTCCATGGGGAAATGTTTATTTGGAGGGCAGCTTCTGCTTTTTCCCCATTTTGATAAGTCTTCCTCGATTTCCCTGGGTTTTTGTCCCCACCCCCGCTTACTAATCGGGAGGAGTTGAGGCTGTCCGTTGCCTTTCTATTACTTCTGACTAGGTGACTCGTCATGGGAGAAAGGATTCTAGACTTTGGATACCAAGAGGAGTTGGGGTGTTCGGCTTGAAATGAATAATAGCTTTTTTAGGGGGTTGAGAGAACCATGTTGAGCTTTGGGAGGCGATTAGATTTAATTTATGAGTGTCTATTAATCAAGTTTTCTTTTATTATTTTTATTTTTTGTCCTAATTTCATCTTCCACTGGAAAAGAAATATGTAGAAAGATGTGTATATCCTATGAGTTCCAACACCACGTTGATGTTAATAAAAAAATTATGCCTTGTTTGTGTTGTTTACTCTTCTCCCATTATGGTGTTCTTTTTGGTTTTGTTGTTCCATTTGACTCTAGGTGATGCTTGCTTTTGTCCTAGTATTTGCTTTGTGGTTGGTTGATTTGGTTAGTCGTT
TTGTTTGTAGCTTATTTTGAGGTTCTTTCTTCTTCTTCTTCTTCTTCTTTTTCTTTTTTTTTTTTTTTTTTTTTTTTGGGTGAACAATAAGAGTCCAAATCATTTGTTGGTTAATTGCTCTTCTTCGTAAGTTCTTGGGGGAATTTCTGGCTGATCTTTTGCCATTGTTCATGATGCCTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCTTTTTGTAGTTTTTTTTTCCTTTATTTTTCTAGTTTTCGTTGGTTCTTTTTGGCCTTTTTCTAGTTTTCCTTTTCGGGAGTTCGTATCCTTGAATTTTTTCCTATTTTTCATATATCAATTAAAAGTTTATTTATTGTTATTAAAAAAAAAAAAAAAAAAAGAGAGAGAGAAAAGAAAGAAAGAAGAAAAGAGAGAAAAAGAAAAACCAGTATGCCTTGTAATTGAGTTGTAAATGTTACATGCTGCCGCTTTTTGAACCATTATTGTGAAGGGGAAGAAAAGAAGAATCAGAAGGTGTATGCAATTTTTTCATTCTTTTAGTTGGTATATATGACTTCCCAACTCTATGCTTAACTTTATGTAGATAATCACTTCATGCAATTTTTCTTTCTTGTTTGTTTGAGGTATGGTATGATTTCTTTTTCTAGATACTTTTTCCTGATTGATCTACGGAAAATTTTTGTTATACAGACTTTCAGTGGTATTGTTCCCAAGTGGGTGCAGAAAACCAGAACAGTCAAAAAACTAAACAAAATGTTGGTGAATCATCTGGGATTAACACTGTCTAAAGAAGATCTTCAAAATGTGGTTGATCTGATGGAACCATACGGTCAAATAAGCAATGGAATTGAACTCCTTAATCCTCCTCTTGATGTAAGAAAGTACTTCACATTGGGTTTTAAGTTTTTAGCTGCGTTAGTTCATTATCGTCAACTTAAAGTTAAGATATAGTATTTTTTATACATCCAGTTGCAATTCATTACTTGACTTCTACAAATTTTCAAGTACTTAAGAATCACTTTTTGATGCATAAAAGGATGAAGAAACATTAATGGTATACAGTTAAACTATGACAAATTGAGAGTGAATAATTAAGGGTATAGAATTGAGGCCAAGAATAATACTGTCATTTTTTATTTTTCTCATTACAATCTAAGTTAAAAAGAGGAATGCTTTCTTTTCCATACATATTTTAATTCAGATGATAGTTATGGTTGAAAAACTTATTTATTTATTTATTTATAAACGAGAAATGAGATATTATTGCTAAAAAACATTGGATTAAATTATAAGTCTAGTCCCGGGACTTTAGGATTATGTCTAATTGGTCATTGAACTCTATAATGTCTATTTTGTTCCTAAATATTAAATTTTATGTTTAATAGGTCCCTAAAGTTAGAAAAATGTATACTAGTTCCATGAACTTTCAACATTGCATTTAATAGGTCTATAAACTTTTAAGAATATGTCTAATAGGTCTGTGAAGATTCAATTTTATGTTTAATAGGTCCTTCAAAAGACTAGTCAAAGTACCCTGTCTATACACCTTTCAAGCTCATGTAACATATGGACCAATGTATTAGACACCTTTGAAGATATCTCTCGATGCTGGGAATCTTTCATTTTCCCTTTCTATATGTCTTGATTCCCATGTCTCTTTACTGGGATAATTTTTGTTGATT
ATTTGGGGATTATTTTCCTTATTGTTAAATATTTGATCTTTGTATTTATTGTAAAGATATTTACTATATTTATCTCCTTCCTTATTTGTAATAGGTTTTCTTTTATATAAGAAAACCCTTGTTTAACAAAGTATATAGAGAGAAATAAAATATTTCAACATGGTATCAGAGCCTACCTCTTGAAACCCTAATTTTTTCTGAAAAAGAAAAAGAAACCCTAATCTAACCTAATCTGCTGCTGCCGCCGCCGACTCACCGCCGCTGCCGCCGCCGCCAACACCCCAGACATTCGCCGGACACCTTACCTGAGTCACCAGACAACTCCTAGCACCAGTCGTCCGTGGATCTCGTCGTTCGTGAAGCTCAAACGCTGAACAAGCCGTAGCTAGCTGACGCCGACGCCGACGCCGACGACGCCGACACCGCCTGGGTTCCCCTCCGCTCCGCCGGTCGATCAGGTCTGCACAAGTCTTCGGGTGATTTCCGACCAGTTCCGGCGACTCTCCGGTGGACCTTCATTCTGCCGCTGAGTTTTTTGGTTGGCCAAAATCATTTTTTTTCTGTCTTTTTCCTTTTACAGATTTGTTATTTTTTGGGTTTTTTCTCTCTTCAATATATCCGAAACTAAGGTATCTACCGCCAAAGTCAACGACAATCGAACCCATCCCAACAACCTCACGGTCCAAATCACCACCATTCGACTTAACGGGGAAAAATTTCTTCGTTGGTCCTAGAGTGTTCGGATGTATATTCGTGGTCAAGGTAAGATAGGGTACCTCACAGGAGAAAAAATCGCTCCTAGTCCAGATGACCCCTTATTCATTGTGTGGGATGCGGAAAACTCCATGGTTATGACGTGGCTTGTCAACTCTATGGTAGAAAACATTAGCAGTAACTACATGTGCTACACTACGGCCAAGGAATTATGGGACAGTGTGACCCAAATGTACTCTGATTTGGGGAACCAATCACAAGTGTTCGAGCTAAACCTTAAGTTGGGTGATATGCGACAAGGAGGCAATTTAGTTACACAATATTTTCACTCTCTGAAAAGGATATGGCAAGAACTTGATCTGTTTGAGACGTATGAGTGGAAATCCACAGACGACCAAAAACATTATCGGAAAACTGTTGATGATGCACATTTACAAATTTCTTGCTGGCCTCAGTGTTGAGTTTGATGAGGTTAGAGGCCGGATACTTGGGAAAAATATTCTTCAAAATCTTAATGATGTTTTTTCTAAAGTTCACAGGGAAGAAAGCCGCAGGAATGTTATGATTGGGAAAAAGGCAGTTGACTCAGTGGACAGTTCTGCACTAGTGACTGAAAGTACTGCGATGAAAGCTTCTGATCAATCCAACAAAACTCATGACAAGCCCCGTGTATGGTGTGATCATTGCAACAAACCCCCTCATACGAGGGAAACTTGTTGGAAACTACATGGCAAACCTTCAAATTGGAAGAGCTCGAAACAATTTGAGAGAAATTCCCATCAGCATGCCTCCAATGCAAATGTTGTTGATTTCAGTCCACTCAAAGAGCAAATTGATCAAATTCTGAAGCTGCTAAAATCCAATTCTACAGGTAATCCTATTGTTTCCTTGGTACAAACAGGTAATTACCCTCAAGCTCTCTCGTGTCTAAACTCCTCTCCGTGGATCATTGATTTCGGAACTACTGATCACATGACTAGTTTCTTGTGTTTATTTGATTCATACTCCCCTGTTTATAGTAAAGAAAAAGTCCGTATTGCCGATGGTAGTTTTACATCTATTGCAGGCAAAGGAACAATTTCCCTAAGTACAAAACTCATACTACGTTTTGTTCTTCATGTTCCTCAATTAGCTTGTAATTTACTTTCTGTGAGCAAAATATCTAAGGATGCTAACCGTCGTGTTATTTTTCGTGAAACCCATTGTCTCTTTCAGGATCAGGACTCGGGGGAGACGATTGGACGTGCTAGGATGATTGATGGTCTCTATTATTTTGATGAAGTTTC
AACTAGTCATGAAAAGATTCAAGGCTTGAGTAGTGTCAGTTCTCTTCCTGTTCAAGAAACTATTATGCTTCCTGGTTTATTTAAAGGAATTGATTGTTCTATGTTTCAATGTGAAGACTGCATTTTCGCCAAACATCATCGATCTACGTTTTTACCCAAATCCTATAAACCCTCATCACCCTTTTACTTAATTCATACTGATGTTTGGGGGCCATCTAAGGTTTTGACTAAAAATGGCAAGCGCTGGTTTTGTTACTTTTATCGATGATCACACCCGTTTAACTTGGCTTTACTTAATCACAAAAAAGTCGGATGTAAAAGAGGTCTTTGTTCGTTTTCATAAAATGATTGAGATTCAATTTCAAACTAAAATTCGCATTCTTCACTCTGATAATGGGACTGAATTTTTTAACGAACCACTAACCACCTTTTTACATGACAAGGACATCATTCACCAAGCGACATGTCGCGATACCCCTCAGCAAAATGTTGTTGCTGAACGGAAAAATCGACACTTGCTTGAAATTGCTCGTGCCCTCATGTTTTCAATGCATGTTCCAAAATATCTTTGGGGGGATGCAGTCCTAACAGCTGCTTGCCTAATCAATAGAATACCTACTAAAGTGTTGAATTTTAAAACCCCTCTACAACGCCTAAAAGTTTTTTTTCCTACTGTCTGATTGTTCCTCAGAGTTACCTTTAAAAATTTTTGGGTGTACTGCTTATGTTCATCGAACCCTTCTTTCCCAATCCAAATTGGACCCTCGGGCTATTAAATGTGTTTTTGTAGGCTACATTCCTTTTAAAAAGGCATACAAATGTTTTGACCCCCTCACTAACAAGTATTTTGAAAGTATGGATGTGTCCTTTGTGGAAAATCAATCGTTTTTTAACCCAACTTCTCTTCAGGGGGAGTCATCATCTCTACTTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAATCCTGAAATTATGAGCTCTAGTCCTTCGATCTCGAGTATGGAAAACTCTTCGACAGGGGGAGAAACACTACAAACAGATCTGACAGGTCGAGTTCTTAAACTTAAGTTTTATACTAGAAGAAACAGAACTCAAAGGGATAGAAATCAGACAGTCGAACTAACACAGGACCAATCTGATACTCTAGTAAATGGTCCTGAAAATTCGGGTATGTCTCTTAGTCCTTCCTCTCATAATACATTGCCTAATGTCTCTGATCTTGATATTCCAATTGCCCAGAGAAAAGGTACCCACCAATGTACAAAATATCTCATTGCGAACTATCTCTCCTATCATAGATTGTTTGATAATCATAAAGCTTTCACATCCAAAATAACCAACCTATTTGTTCCAAGGAATATACAGGAAGCTCTAAATGATTCGAATTGGAAATTAGCAGTGATGGAAGAGATGAATGCGCTGAAACAAAGTGGTACTTGAGACATAGTTGATCTACCAGAAGACAAGAAAGTAGTGGGATGTAAGTGGGTTTTCACGATAAAATGTAATGCGGATGGTAGTATCGAAAGGTACAAGGCCAGGCTAGTGGCTAAGGGATTCACCCAGACTTATGGAATTGATTGTCTAGAGACTTTTGCCCCTGCAGCTAAAATTAACTCGATTAGAATTTTGCTCTCTGTTGTAGTGAATTTTGATTGGCCACTGTATCAACTAGATGTTAAAAATGCGTTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGACCTACCGCTTGGGTTTGAAGCCGACCTTGGATTGAACAAGGTATGTAAATTAAAAAAATCACTATATGGCCTTAAATAGTCTCCTAGAGCTTGGTTTGAACGTTTTGGAAAGGCAGTCACTAGCCATGGATTCAGCCAAACTCAAGCCGATCACACTATGTTCTACAAACATACGGAAAAAAACAAAGTTGTTGTTCTGATAGTGTATGTCGATGATATTATTCTTACAGGCAATGATGAAACAGAAATGTCTATTGTAAA
AGAAAAATTAGCGAAAGATTTCAAGATCAAAGATCTAGGATCCCTAAAGTACTTTCTTGGCATGGAGTTTGCTAGGTCTAAAAGTGGTATTCTTGTCAATCAAAGAAAGTATATCCTTGATCTACTCAAAGAGACAGGTTTACTCGGTTGTCGAATTGCAGAAACTCCTATTGAGCAGAACTTGAAATTGGAAGCTGCAACAGAAGAAGAGGTAAAAGAAAAGGGGAAGTACCAAAGACTCGTGGGAAGACTAATATACCTCTGTCACACATGTCTCGACCATTGCCTTTGCAGTAAGTATGGTAAGTCAGTTCATGCATGCCCCTGGACCAGCTCACTTTGAAGCTTTAGAATCTGATGCAGGAGTAATATGCATTCCCTACCTCCCAACAACAGAACAAATTGCAGATGTATTAACTAAAGGGCTTCCCAAGTTGCAGTTCAACAAGTTAACAGACAAGTTGGCCATGAGTGATATCTTCAAACTAGCTTGAGGGGGAGTGTTGATTATTTGGGGATTATTTTCCTTATTGTTAAATATTTGATCTTTGTATTTATTGTAAAGATATTTACTATATTTATCTCCTTCCTTATTTGTAATAGGTTTTCTTTTATTTAAGAAAACCCTTGTTTAACAAAGTATGTAGAGAGAAATAAAATATTTCAACAATTTTTTTTCTTCTGAATTTTATATTTATTCTAAGGATTTTTGTTTTGTTTGCTTATTTTAATGTATTTTGAGCATTAGTCTAATTTCATTTCCTTAATGAAAAGTTTATATCATTTTTTAGAAAAATGACACCTTTTTTTAGTTTGCCTTTTTTTATTGGTTTTATTTTTGTATTCCCTTATATTCTTATTTTTAAGTTCGGTTATTCATAAAAAAGAACTAGTATAATTGATGTATTATTATTTTCTGGGATAATTGTTTTAAATGGCAAAATTGCTGAAAATATTTCCAAATATAGCAAAATATCATTGTCTATCAGTGATAGACACTGATGATAGATGTTTGACAGTGACACTTTGCTATATTTGTAAATAAGTTGGCTCATTTTGCTATATTTGATTACAACCCTTACCTTTTTTTTCATAAATTTATTGAAAGAAAAGAGGTCTTTGTGTATTTTTTTTTTAAAGTCTTCAAGTATAATGTAAACTTTATTAAACCTCTTCGTTTGAGTTTTCTAATTTGTTATAGGGAGTGTTATAAAAAGCCCTCCTAGTCGTGTCTAGACTCAAGGTGCAGGTCTTCAAGGCGAGGCTTACAAACAAGGCGTGCGCCTTTTGTGAAGCCCCAAGCTCAAAGCCTTGCGCCTTGGGACTTTTTTATTATTTTTTAAAAAATATTAATAATTAGGGTCCATTTTGTAACCACTTCGTTTTTTGTTTTCGTTTTTAAAAATTTAGCCTATAAACACTACTTTCACCTCCAAATTTCTTCATTTGTTGCCTAGTTTTTACTAATGGCTTAAAGAACTAAGCCAAAATTTGAAAACTTAAAAAGTAGCTTTTAAAATTTGTTTTTGTTTTTGGAATTTGACTAAGAATTCAACCCTTGTACTTAAGAAACATGCAAATCATAAGAAACGCCCTAATTTGCTGACGCCATATCCCAAGTCTTTTGCCCGAAAGAAGACTTCCTTTCTCAATTCATGGACTACTTATTGTCAACTCACATTTGTTGGAGGAATGTTGTGTAAAAGTTCAAATTCCCAAGCTCTTTCCTGAAAGGTTAGCATCTTATACCCCTATTTCTTCAACTTTTGTTCAATCAAATTTTAAAATTTCCGACTCAAATGTGAATTTTGTGCGTTGGATTCCCTTTACTTCCTCGAGCATAGCAAACCAGAGGAGGTAGGATTCTAATGGAAAATTCATAGTAAGTGCAAGTAGCGAGGAATTTGTAATTCCCGAATGCGAGTATATCCCAGATGGTTTTGGAAGAGATATTTCTAATCTATACTTAGAAGAAACCATCATAAGTTGGCCTAGTGGTA
GTGGGAACATAAAAAAAAAAAAAAGGCCAAAGAGCTAAAGGGTCATATAAGTTCAATCCATGGTGGTCACCTACCTAGGATTTAATATACTATGGGTTTCCTTGACACCCAAATGTTGTAGGGTCAGGCAGTTTGTCCCATGAGATTAGTCGAGGTGCGCGTAAGCTGACTTGGACACTCATGGATATCAAAAGAAAAAAAAAAAATCTATATCTAGAAGAGGAAGTACAAAAATCTTGACCAGCTGGTGGTTGTTTAACTCCCCGTATCAAAAAACCAGATATTCCACAAAGCTTGAAATCAATAATTGAAGATTGTGATATAATTCTTGGTTGAGCAGTCCCCTGTTGTGCTTCATAGTATTCAATAGTTGATTTGGTTCGTAACTCTATAATGAAGATCTTATCGTGGGATATTAGAGGCCTTGAGAATCATCTAAACACTTGGCACTTAAAAGATGTCTGAAGAAATTGAAGATTTGGTGTTAATACAGGAGATTAAAAGTGAAGCTTTCGATCTTTACATTATTAAAGCAATATGAGCAATATGGAGTTCCAAAGATGTTGGCTGTGCATTTGTTGAATCTTATCGCGAATCTGGTGGATTGCTGACAATGTGGGATAAAAGCAAAGTCTTAGTCATTGAAATTCTTCAAGGGGGCTACTCCTTGTCAGTCAAATGCTTGACCATTTGTAGAAAAACTTGTTGGATCACGAATATTTACGGTCTGACTGACTATAAAGAGCGGAAATTTCTTTGACTGGAATTATTTTCTTTATCAGCCAAATGCTTGACCATTTAGCTTGTTTGATTGGCACACGGTTGGAAAAATTAATTTAAGCCTCCGAAGTCCTTGGATTAGCATTTCAAGAACTTGGTTGAAAGTGGAGTCCTTGGCTGTTTTTAAGTTGGGAAATGGGAGTAGAGTTGCCTTCTGGACTGATCCTTGGGTGGATAAAGCTACCTTGCAAACTTTATTTCCGAGGCTATTAGAATCACCCTTAAGCCTAATGGCTCGCTCTTGGAGCATTGGGACTCTAACCACTCTTCTCGGTCAATTCTCTTTAGAAGAATTTTAAAAGAGGAGGAAATAGTCGACTTTCAAACACTTTTACATATTTTATCAAGGCAGAAGGTTTCTGCTAGCCCCGATAGAAGAGTTTGGTCCATTGAAGCAAATGGTTTATTTTCGGTAAAGTCCCTAGTTACACATCTTTCAAGTGCTTCCCCCCAGATTTAAAATTGAAGAAAGCGATTTGGCACTCTAACAGTCCTCAGAGGGTGAATATTTCAATATGGATTATGATGTTTGGGCATTTAAATTGTGCTTCAGTCCTACAAAGAAAACTGCCATTGCATTGCCTATCGCCCAGCATTTGTCCTCTCTGTTTGGCTGGAAAGGAAGATTCACGGCATCTATTTTTCGAGTGTACCTGTGCAGAAAAATGTTGGCATAGTTTATTCTCCTATTTTAATTTAAACTGGGTGTTTGTAGCTCCTTTGGTAATAATGTTTTGCAGATTTTGGCGGGTCCGAAGTTGAAACCAGCCCCTAAATTATTATGGAAAAACACGGTCAAAGCTTTACATATTATGGTTTGAGAGAAACTAAAGAGTTTTCCATGACAATTTTACTCCTTGGTCTACCCGTCTGGAGATGGCACAAGTTAATGCTTCTTCCTGGTGCTCTTTATCAAAAGCCTTTGCTGCATACTCCATTCAAGATTTGTGCCAAAACTGGACGGCCTTCATCTTTCAAGCACCTTAGATCTTAGTTCTGTTATTTAGTGATTCGTGCTCTTTGTTCTGTTCTATTGGTGTAGTTATTTTGGTTTTGATCCATTGTATTTGGGTTTGCTCTTTGCTTTCATGTACTTTTATTGGTTGTTTATAATATTGCGCTTAGTGTGGGGATGATGAGAGTGCTAAGGAGGTGTCAACCTAGTTGAGATGTTCGAGTACACTCCCTAATCAAGAGACTTATTTCCTTTTAAAAAAAAA
AGAAACATGCAAAAAACGAAATGGTTACCAAATGGGGCATAGGATTTTTCTTTCTTTATTAACTAAAAAATTCAAATTTACTAAGCCGAAACACAAAATTTCTTGTGTTTAGGGGTCTTTTTTTTCTTCATATTTCCACTTCAACTATGTTTTCTCTTCTTACTGTTAGTTAAAGACTATTGTGCTTTATTGCAAACAGTGGTCAATAAAGCACAATAGAAAGTAAAGCTACCTTCCCTTATCAACCATATTGGATGTCGAAGTGTATTCTATGTGTTGAATCCAAGGATGTACTTGTTTGAGGTGTCCAATATATATACGGTGATGTTTGACGGTTCCTTCAAATTTCTTGGAATATTGATAGTTATGTACTGGAGACATTGTCACTTTTAACCTGTTACCAATAAGCACGTACAACATAATCTTGTAAGCATTATTTCTGCCATTTTAATGTGAACTAATTAATTGAGCTCTATTCTCTAGTGGACAAGGGAGACGAAGTTTCCACATGCTGTTTGGGCAGCTGGTCGTGGTCTTATTGCCCTTCTATTACCCAATTTTGATGTTGTGGATAATCTGTGGCTTGAGCCATTATCTTGGCAGGTTAGACATTTTCTTATCCAAAAGTGCTAACAAGTTCTTTTGCATAAAAAAATATAGTTTTCAATGGTAATAAAGAGATTTGTGGTATTGTGCTAGTATGGTTACATTAATCATACATTTTTGGGTTAAACACCCTTGTCTTAATTTGTAATACAGTTATTGCTTCTTATTATTTCTTAGTTACATTTAGGGGGAAAGGAAGGATAGTGTTTAGAGGTAGGGACAAGGACCCTTGTGAAGTTTGGTTTTGGTGAGATTTCATGTGTCCCTTTGAGCTTCTGTTTTGAAGCTTTTTTGTAACTATTCCTTAAGGAAACATTTTACTTAGCTGGAACCCTTCTTTCAGTTGGAGTGTTTTGGTGGACTGGATTTTTTTTTATGTCGTTGTATTATTTCATTTTTTCTTAATAAAAGCAGTTGTTATCATCAAAAAAATTATTTCTTGGTTACATTAATTAATAAATGTATCAATGTTCAGAAGATTGCTATAAAAGGACCAACAATATGATATCCCCACCTGAAATTTACCATACCAACTAATATGTAACAAAGAAGGATTGGTCAATCTATTAGTATAAGTTAGCAGCATTTATGATGACCTCAGTTGGTCTACAGTTCATTTGATTGTGTGAAAACTAAATGTATTCAATTTTTTAAAAATTTGTATCAACTTAAACTTTTTCTATATGAATTACGTAGAATATTAATAAAATAATAAAAAGATGAAGTAGAAATTACCAGTAGCCTCGGAATCTTTTCAAGTTCCCCTTGTTTATCCCGATATGGTTAAAGTTAAAGTTTTTTTTTTTTTTTTTTAATGGAAATCAAGAGGGCTCAAGGTCATCTGTAAGAAGGCTGCCTGGATTTTTCAAAATGATGACTATTCTGAGTTGGGTCCCAATTGGAGTCCTCAGAATGTCCTATGTGTTAGTTGATGATTGAAGTTTATCTTCTTATCTTCTCTGGAAGTTTTTTTGTTTGAATTGTTGGTGGTAGCTGTTAGATACCAACCAACCGTCGTTCTAAGTGTCTAACAAGAACACAACCACGGATTCCAGCCTAGAAATTGAACGCCTCTCCTCGTTAGACAATCTTCGAATCCCCTCTCATGAAAGCCATTGTGAGGTTCGGCCAATCTAATCCCATATAGCCCATCGTGTGGAGCCAAGACATCCCCAAGATGACATCAATCCTCCCCAGATCAAGAGGTAAGAAATTCTCTAACAAAGTCACCTTGGGTAGCTCGATGATTATCTCCTTGCAGACTCCTTTTCCTTGTAACGTCAAGCCATCCCCAATCACAATTCCATAGCTTGTTGTGTTGGTTACCGGAAGCTTCAACTCCTCGACTAACTGCTGGTGAACGAAATTGTGGATGGCACCACAATCGATTAATA
CGACCACCTCACGTTCCCCGATTTTTCCTCTCAATTTCATCGTTCTTGGCGTCGAAAATCCTAGAACCGAACAGAGAGCGAGTTCCACCTTTTCTTCGGCCTCTCCTGTGACTTCTTCCCCGTTTGACTCTTCAATTTCGTTGACGACCTGGTTCTCTCCGGCGATTAGCAATCAGAGTTCCCTCTTCTCTTGGGTTTTACATTGATGCTTGGGCGAGTACTTTTCATCACACTTAAAACAGAGCCCTTTTTCCCTTTTTTCACGAACTTCCACATCGCTCAGCCTTCGGGTCACTCCCTCTCTTCTTGGAGCTCTGTTTTTGTCGAGTATTTGAATCTTGGTTGTATTCGGTCGGTAATAATTCCCTGAACCCGTCTTATCCAACCCACCCGATTTGCCTCCCCCCTTCGTCGTTCTCGAGCCGTCACCACTTTCGATCTCTTTGAGTGCACACATCTCATCGTCTAAGGCCTGAGCTTCGCGCATTGCTGCATCTAACCCAATCGGATTTCTACTCTCAACCTTCGCCTTCAGCTTAGGATCAAGTCCATTGATGAATGTTCCCTCAAGAACTGATTCGGATATGTCTGGAATCGGGGCCGAAAAACATTCGAAGAGCTTCCGATAATCCAAATATGTTCCTTCTTGTTTGATCGCCAGGAATCGTCTTGTCAAACTTCCCTCCTTTGAACTCCTGAACCTCGCAAACATTCTTTTCTTCAACTCCGCCCACGACTCTATCGGACTTTGGTTATGGCTCCAGCGGTACCAATCTACCGAATCATGATCGAAACCAATAACGGCCACCTTTATCTTCTCCTCCTTTGTTAATTCATGGATTTCGAAGAATTGTTTGGCCCGATAAACCCAAGAATCAGGGTGCTGTCCTTTGAATATCGGCATTTCAAGGCGTTTATACTTACCTTTGTCACCACCTCCCTTAGTAATTGACGTTTTACCCTCCTCCATTTCTGCCGTCTTACCCTTCCTTTGCGATCCTTCGGTTACCATCGACCCTTCTCTTCCCCGACTCAACGTAACCTCTCTCATTTCTCCCATCAACTGATCTACGTTCTTGCACATAGCAAAAACCAATTCCTTCAATCCCGAAATGGCGCGATCAAGTGTCTCTAATTTCTCCTCGGTCTCCTTCATCTTCTGCCTGCGTGTTCTCACAAGGATGTTTTGGGCTTTGATACCAATTTGTTAGATACCAACCAACCCTCGTTCTAAGTATCTAACAAGGACACAACCACGGATTCCAGCCCAGAAATTCAACTATTATTTATCAACAATCACTCCTCAATGAGAGTACTACAATCAGCAAAAGATAAGAAAGAACTACAAAACGGAAATAGCAACAGATCAGAAAATAAGAAGAAAAAAAATTCAGATAGGCATATTCGAGAGTGCCTTTCCTCTCCAGCTCTTCAGCTTCTAATCTTCCATACCCAAACCTTCCTCCTCAATGCCCTTCATAAACCCCTCACCCGATGGTCCCCCACCCCAGCTTCTGTCCTCTGCCTCCACTAGCTTAACGGCCACGTTCCCAGCCGAATTTGCTTCTCCACTAACTTGGTTATTTCCACTTTTACCCTTCCTCTTATATGTGTGAATAATAGGTCTAATGGTAGCTGGTCCTTCTGTGGTTTTGAAGCTTCGTCAGTTACTTAGTTGGCAGTCTTTAATTTTCAGTCTCAATATTTGTGGTTTGCTGTGGTGGAAGCCTCCTTTCTCATCGTTTTGGGTTTGCTCGAGCTGAACTATTATTGTATTGTTATTTTTCTAGGCTTTGTTCTGATTATTCATCTTGAGTTTTGTTCAATTGAAATGTTTTAGTTTTTCTTCATAGCTTTTTGGTTTTAGCCTTGTATTGGCAAGTTTGAAGCTTTTTATTTTTGTTTCTCTTGTGTAATTTGAGCATTAGACTCATTCATTATTTCAATGACAAGTCTTGTTTCCTTTAAAAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAG
GTAATATGCAAAGGACNNNNNNNNGAAAAATTTAAAGCTTGTGGAAGATCGATTATCTAGAATACATTAGACAAAAGTAATACTAAATTTCAAAGAGAGGGAGAGCGCTGATAGCTTTATAGTTGCAAATTCAAAGCATGGTATGGCTACCATCCAGCTAGAATATTAAAATATTCGATGAACCTGACAGTAAATGGGGCTGGAGATCAATTTTCTCATCAGTTTCATTCCAATCAATTGGCCCAAGCACCTACAATTATTATTATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAGGTAATATGCAAAGGACAGAAAGGAAAAATTTAAAGCTTGTGGAAGATCGATTATCTAGAATACATTAGACAAAAGTAATACTAAATTTCAAAGAGAGGGAGCGCGCTGATAGCTTTATAGTTGCAAATTCAAAGCATGGGATGGCTACCATCCAGCTAGAATATTAAAATATTCGATGAACCTGACAGTAAATGGCGCTGGAGATCAATTTTCTCATCAGGTTCATTCCAATCAATTGGCCCAAGCACCTACAAGTATTATTATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGAAACATCGGGCAAATGTGTGCTATCTTGGAAAATTGTGAGCTTTGAAGGCACCATGATGATCTTATTTCGTTTAAACTTGTGACTCTTTTGTAGGGTATTGGATGTACAAAGATCAGTAAGCGGAGAAATGAAGGATCCATTAATGGGAATTCAGAATCAAGATCGTACCTAGAAAAGAAGCTGGTCTTTTGCTTTGGTTCATATATTGCAGCTCAAATGCTACTTCCTTTTGGAGAAGAAAATTTCCTATCTTCGTCTGAGTTAAAGCAGGCGCAAGAGGTTAGTTTCACAGAGTTCTCTGGGAATGTTTGTAAATCATCTTATATGCTACCTAACTTTTATTCACAACACCCGGTACCAAATCTGATCTGTGGTAGATTGCAACGAAAATGGTGATTCAATATGGCTGGGGGCCAGATGATAGTCCAGCAATTTACAGCCGAAACAATACGGTATACTTTCTTTCATATCCCAAACTTATTGTGTGATTCTTCCATTATTGTTGAATAGATCGAGTTCTCTCCAGGAACTTAATGAGTTTCAGATATCGGATCTGTCAATTACAGTAACTGCACATGTTGGATATTAGGTTTATGCCATCTAATTATCAGTAGTGGCAGGGAGGGGGAGTTCTTAGGTTTAACTTAAGAACTCATGGACTTTTAGTACTATTATTATGATGCAGAAGAGAAGAATCAATGCAAACTTTATTCATAGAAAATCTGAATGCTTAAAAGCAAATAATCCATTCAACCATAACCATAATATCCCAAACAACTTGTAATATTCAACCCAACTTGCCTAATGGGCAATGACCTCATACTAAATGCCCAAAAACATTAAAGAAGAATCCTTGAATTCAACAGTTGTGAGGATTGAATCATTGACCTCATGGTAATTCGAACCTTATTCACTGAGTTATACTCAAATTGACTTATCCATACTATACATCTTTTAAAGTTTCAACATCATATTCCTTTTTGTTACACGTAATATTGTTACTTTTCTCTTCCAAAGTCAACTCTATCATCGTTTTTGGTTATCTGCTTAAAGAACTTCTCATTGTTATTTATTATCCGGTCAAGCGGCTTTATGTGGCTCTCATTTAGTGATGTCATACCTTTGGCAATGACATAGGTTACTTCTTTGAGTATGGGCGATAAATATGAATATGAAGTGGCAGCTAAAGTTGAGAAGGTATATGTTCAATTGTTCCTCATATTTACAAATAATAGGCTGAGTTAATTTTGGTTCCTACATGGGGAAAAATATACTTAATTGGCTTAGGATTATTTAATTAACCTCGCTCACTTTGCTTTTTTAGATATATGATTTAGCGTATTGTAGAGCGAAGGAGATGTTGGCAAAAA
ATCGTCAAGTTCTTGAAAAGTTTGTTGAAGAATTACTAGAGTGTGAAATTCTTACTGGGAAGGTAAGCCCTCAATCTTTTAATAATTATTTGAATCCTATTCGATATATTCTCTGAATTGGGTTGTTCTGAATGACACTGATGACATTTTATTGTAATTATGCTTATGACTAGCCTTCCCCTTGGCAGGTTTTGGAGAGATTAATTGAAACCAATGGAGGAATTAGGGAGAAAGAGCCTTTTTATCTTTCTGAATACTATGGTAGAGAGGTATATTAAATTATGTTGTTTTCAGTCTATTTATGATTGTAACTCTCTTTTTTCTTCAGTGAATTGCTTTTGAAATTAAATTTATCTTTTTAGGATTTACTGCTCGTAAATTCACAACTCTATCATGCTCGATTCTTTTGCTTTTGAAATATCTTTAAGTGACAGGACACCTTGATTTTTATGATTGTTGCTGCTAAAACACTGATTGCTAAGTGTCAACACCTTACATAATATACTCTGCAATTTTTGGTATCAAACACCGTAAACATAGTTAACCATTATGGGTTGGTCTAGTTGTAGTGGGAATATAAAAAAAAGGCCAAAGGGTTTTTATATTCCCACTACAACTAGACCAACCCATAATGGTTAACTATGTTAAGACTGCTATATATTTTTAAAATTTTTTTTAAATAAATAATAGTAACCATTATTTTTTAACTTCACTCAAAATTTAAGAGTTTAATGTTGATTTTGGTTTCTAACCAAATGAACCAATACTACATAGAAAACAGAGATTCTAGGCCAATGCAATTATAGAGCAACATGATTGAGAAAAGCTTAAAGAACTAAACGGTATTCGGTAGTACGTGGATGAACTGGACATGAAAAAAATCCATCAGTTAGGTGTACCAATTGTAGCCAACACAAATTCTACAGCCAAGAGTACTCTGGTTTGGACAAGTCCTAGGTTATTTTATCTCATAAACATTTTGAAGCACATTAGTGAGGACAGACATGTTGAACGAATTTGTGTGGGTTTGTTAGAATAGCCTTTAATTTCCAAAGAAGCCTCAAGATGTTGCATGACATATTTAGGTCATTGGGGATAGCGATAATGTTTGGACCACTAGGTGTTGAATCCTGGGTTTGAACTCGTGGCCTAAAACATTACCTACCCTTTTTTTTGAGAGGGTAGCAAAAACTTCACTGTTGAATGAAATTGAAAGCTTACCACCGCACCGGTGAGTTACAAGAAAGAAGTCCAAGTGCTTAGTAAAGTAGAGTGACTATAGTCTTTAAAGGGGTGTGAACATTTGCACCAATATAAAACATTAAAAAGAGTAAGGTCAAAGAAGTCCTTGAAGGGAGTCCTATACCATACATCCTGCTGCCTTGGGACGACTGCACGTTACACGTTACATCATAGCAGCACAACAAAATGAAGGCGGAAACCCGGTTGGGCTTTCCTGCGCACCTGGCATCCTGCAAGAAAAGGCCCCTGACTCTCGAACAAAACAAGGGATTGAGCACCGCGAAAGCCGTTGACGATAATACGCCTTCCTCTTTCATTTGCATTTCCATATTGAAAGTCAAGGCGTTAGCGCATCCGTTTTCTTGCTTGTTAGTGCTTTACTAATAACATAGAAAGAGTCAAGCCAAAACCAACTCGTTGTTGACTATTTGTCCATCTAAGAAGCACGACACAGATATGGCCTATGGGAACACGTCAATTCCTAAAAATGTAGGATGCAGACATGTTGAGAGGACACATGTTTATTTGTTTCTTTTTCCTAAAAAAAAATTGTTCTAATATTAACCAATCAGATCCAAAATACTTTATTTATTTTTATTTTTATTTATTTATTTATTTACTAACTATAAAAAATAATCAAAAAAAATCATGAATGAAAGTAACTGCAAATTAATAATTAAGAAAAGGCAATTGAATTTAGTTCAAAACTTAAAAGAGGCCTGTTTTTATAGTTTGCTGAGATGTGTGCTTTATTGTGG
GATATTTGGGGGAGAGGAATAACAAATTGTTTAGAGGTGTGGAGAAGGACTATTGTGATGTTTAGTCCTCGGTGAGCTTCCACGTTTCTCTTTGAGCTTCGGTTTCAAAGACTTTTAATTATTCTCTAAGTAGTATCTTACTTAGTTGGAAACCCTTTCCTTAAAGGGGTTTTGTGGGCTTATTTTTTGTCACCCTTGTATTCTTTCATTTTTTTTTCCGTGTTTCTATTAAAAGAAAAAAAAAGAAAGAAAATAAAAAAAATACCATTTTCATGTTATCTTCACATAGTATCTATTTAGTAAAATTGCCGCACAGTTTCTTTACCAAGAAAAACAATTAGGAATATTGTCATATAATTATATCTTTAGAGTTGATTCACTATTCCCTGCTGAAATTTTCCTGTTTCTTTTTCTTCTTCATCGCCTATTGATGAATTTGTTCTTGCATAAGCCTCAAGAGACTTATAAGAGAGTTTAATCTGCTGGCATTTCAGAAGAAATGTGCATGTACACAGTATCCTTGTCATGGTGGTTAGAAGTATGCAGTATTCTTTGATGTTGTGGATTTTGATGAAAATTTTAAAGTCTGTGGGTGTCTGCCAAAACTCTTTGTATGTTTACATCATTCACGAGATGTACTAAATTGTTTTCGTTTGATTGTAGCCATTGACTGGTGGATTTCTTGAAAGCGCGAACTCATCGGGAACTGCTCTTTTAAGTTAAGCAACATTGAACAAGAGGAAATGAGTGAAGTTTGGAACTAAATCATATTACTAGAAAAAATGTGAGTTGCTTTTATGTTTTTTATTGTCATTTTACATCTTATGGTTAATTTGAGTAGAGATACAATCTAGTTTAGATGTGTTGAGGATGGAGATACGATGTGATAAAAATGGCGTAGATATTATGTAATTTGGTGTACAGAACAAGGTTTTTGGGCGAGAAGAGTCGCTATGTTTCCTTAGCCACAGGATTTTGTAACACATTGAATTCAAATATCTATAAATGATGTTTGACATATCAACGAAATAATTTATGAATGAATAATTCATCATTCTTGTTAGCACTCATTTCTACGAGGAGAAACATTCTATTTTCTCTTTTGCTCCATTCAATTTCCCTTTACTCCTTTTATTGATTATTTATAGGGTTATATAATATAATTTTATATTTGTTTTTTTAAGAACTTTTGTACCATTATGGTATTTTTATTTTTAATTGAATTAATAGCTTCCCAAATATTTAAGATTTTAG

mRNA sequence

ATGGATCAAATAATCTGTGTAAAATTTAGAAAGGGAACAACTGAGAAGAAGCCTTGTGTTGAGCCTACAGTGTTGTGGTCATTGTTCCTAAAAGAAGAAAGTCTAAAGGTTACATGTCGTGGTGCTCTATTTTGGCAAATGGAGTTGGAGCCAAAAGAGATGATCAAAATGGAAAAGAAGAACAGTGCTTCCTTCTGGGATTTTATTTATTTTCAGCTTTCTACTGGTGCCTCAAATCTAAGGGGTTTAATTTGTGGGGTCCTAATTGTTGGTGAGAGGGGAACAGGAAAGACATCTCTTGCACTGGCCATAGCTGCAGAAGCAAAAGTACCAGTTGTTACAGTTGAAGCCCAAGAATTAGAGCCTGGACTATGGGTTGGACAAAGTGCATCCAATGTCCGGGAATTATTCCAAACTGCAAGAGATTTGGCACCTGTGATCATATTTGTGGAGGATTTTGACCTGTTTGCTGGAGTTCGTGGCAAGTTTATTCATACCAAAGAACAGGATCACGAGGCTTTCATTAACCAACTTCTCGTGGAGCTTGATGGGTTTGAGAAACAAGATGGAGTAGTTTTGATGGCTACCACTCGGAATTTGAAGCAAATTGATGAGGCTCTACAGCGGCCTGGGCGGATGGATCGAGTATTTCATCTTCAAAGGCCAACTCAATCCGAAAGAGAGAAAATACTACAAATTGCTGCGGAAGGATCCATGGATGAAGAGCTCATTAATTATGTAGATTGGAAAAAGGTTGCCGAGAAAACATCCCTTTTACGACCGGTGGAACTAAAACTTGTCCCACTTGCCTTGGAAGGAAGTGCTTTTCGGAGCAAATTCCTTGACACTGATGAACTGATGGGCTACTCTAGCTTGTTTGCTACTTTCAGTGGTATTGTTCCCAAGTGGGTGCAGAAAACCAGAACAGTCAAAAAACTAAACAAAATGTTGGTGAATCATCTGGGATTAACACTGTCTAAAGAAGATCTTCAAAATGTGGTTGATCTGATGGAACCATACGGTCAAATAAGCAATGGAATTGAACTCCTTAATCCTCCTCTTGATTGGACAAGGGAGACGAAGTTTCCACATGCTGTTTGGGCAGCTGGTCGTGGTCTTATTGCCCTTCTATTACCCAATTTTGATGTTGTGGATAATCTGTGGCTTGAGCCATTATCTTGGCAGGGTATTGGATGTACAAAGATCAGTAAGCGGAGAAATGAAGGATCCATTAATGGGAATTCAGAATCAAGATCGTACCTAGAAAAGAAGCTGGTCTTTTGCTTTGGTTCATATATTGCAGCTCAAATGCTACTTCCTTTTGGAGAAGAAAATTTCCTATCTTCGTCTGAGTTAAAGCAGGCGCAAGAGATTGCAACGAAAATGGTGATTCAATATGGCTGGGGGCCAGATGATAGTCCAGCAATTTACAGCCGAAACAATACGGTTACTTCTTTGAGTATGGGCGATAAATATGAATATGAAGTGGCAGCTAAAGTTGAGAAGATATATGATTTAGCGTATTGTAGAGCGAAGGAGATGTTGGCAAAAAATCGTCAAGTTCTTGAAAAGTTTGTTGAAGAATTACTAGAGTGTGAAATTCTTACTGGGAAGGTTTTGGAGAGATTAATTGAAACCAATGGAGGAATTAGGGAGAAAGAGCCTTTTTATCTTTCTGAATACTATGGTAGAGAGCCATTGACTGGTGGATTTCTTGAAAGCGCGAACTCATCGGGAACTGCTCTTTTAAAGATACAATCTAGTTTAGATGTGTTGAGGATGGAGATACGATGTGATAAAAATGGCGTAGATATTATCACTCATTTCTACGAGGAGAAACATTCTATTTTCTCTTTTGCTCCATTCAATTTCCCTTTACTCCTTTTATTGATTATTTATAGGAACTTTTGTACCATTATGCTTCCCAAATATTTAAGATTTTAG

Coding sequence (CDS)

ATGGATCAAATAATCTGTGTAAAATTTAGAAAGGGAACAACTGAGAAGAAGCCTTGTGTTGAGCCTACAGTGTTGTGGTCATTGTTCCTAAAAGAAGAAAGTCTAAAGGTTACATGTCGTGGTGCTCTATTTTGGCAAATGGAGTTGGAGCCAAAAGAGATGATCAAAATGGAAAAGAAGAACAGTGCTTCCTTCTGGGATTTTATTTATTTTCAGCTTTCTACTGGTGCCTCAAATCTAAGGGGTTTAATTTGTGGGGTCCTAATTGTTGGTGAGAGGGGAACAGGAAAGACATCTCTTGCACTGGCCATAGCTGCAGAAGCAAAAGTACCAGTTGTTACAGTTGAAGCCCAAGAATTAGAGCCTGGACTATGGGTTGGACAAAGTGCATCCAATGTCCGGGAATTATTCCAAACTGCAAGAGATTTGGCACCTGTGATCATATTTGTGGAGGATTTTGACCTGTTTGCTGGAGTTCGTGGCAAGTTTATTCATACCAAAGAACAGGATCACGAGGCTTTCATTAACCAACTTCTCGTGGAGCTTGATGGGTTTGAGAAACAAGATGGAGTAGTTTTGATGGCTACCACTCGGAATTTGAAGCAAATTGATGAGGCTCTACAGCGGCCTGGGCGGATGGATCGAGTATTTCATCTTCAAAGGCCAACTCAATCCGAAAGAGAGAAAATACTACAAATTGCTGCGGAAGGATCCATGGATGAAGAGCTCATTAATTATGTAGATTGGAAAAAGGTTGCCGAGAAAACATCCCTTTTACGACCGGTGGAACTAAAACTTGTCCCACTTGCCTTGGAAGGAAGTGCTTTTCGGAGCAAATTCCTTGACACTGATGAACTGATGGGCTACTCTAGCTTGTTTGCTACTTTCAGTGGTATTGTTCCCAAGTGGGTGCAGAAAACCAGAACAGTCAAAAAACTAAACAAAATGTTGGTGAATCATCTGGGATTAACACTGTCTAAAGAAGATCTTCAAAATGTGGTTGATCTGATGGAACCATACGGTCAAATAAGCAATGGAATTGAACTCCTTAATCCTCCTCTTGATTGGACAAGGGAGACGAAGTTTCCACATGCTGTTTGGGCAGCTGGTCGTGGTCTTATTGCCCTTCTATTACCCAATTTTGATGTTGTGGATAATCTGTGGCTTGAGCCATTATCTTGGCAGGGTATTGGATGTACAAAGATCAGTAAGCGGAGAAATGAAGGATCCATTAATGGGAATTCAGAATCAAGATCGTACCTAGAAAAGAAGCTGGTCTTTTGCTTTGGTTCATATATTGCAGCTCAAATGCTACTTCCTTTTGGAGAAGAAAATTTCCTATCTTCGTCTGAGTTAAAGCAGGCGCAAGAGATTGCAACGAAAATGGTGATTCAATATGGCTGGGGGCCAGATGATAGTCCAGCAATTTACAGCCGAAACAATACGGTTACTTCTTTGAGTATGGGCGATAAATATGAATATGAAGTGGCAGCTAAAGTTGAGAAGATATATGATTTAGCGTATTGTAGAGCGAAGGAGATGTTGGCAAAAAATCGTCAAGTTCTTGAAAAGTTTGTTGAAGAATTACTAGAGTGTGAAATTCTTACTGGGAAGGTTTTGGAGAGATTAATTGAAACCAATGGAGGAATTAGGGAGAAAGAGCCTTTTTATCTTTCTGAATACTATGGTAGAGAGCCATTGACTGGTGGATTTCTTGAAAGCGCGAACTCATCGGGAACTGCTCTTTTAAAGATACAATCTAGTTTAGATGTGTTGAGGATGGAGATACGATGTGATAAAAATGGCGTAGATATTATCACTCATTTCTACGAGGAGAAACATTCTATTTTCTCTTTTGCTCCATTCAATTTCCCTTTACTCCTTTTATTGATTATTTATAGGAACTTTTGTACCATTATGCTTCCCAAATATTTAAGATTTTAG

Protein sequence

MDQIICVKFRKGTTEKKPCVEPTVLWSLFLKEESLKVTCRGALFWQMELEPKEMIKMEKKNSASFWDFIYFQLSTGASNLRGLICGVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAPVIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDEALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELKLVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTLSKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVDNLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEENFLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGREPLTGGFLESANSSGTALLKIQSSLDVLRMEIRCDKNGVDIITHFYEEKHSIFSFAPFNFPLLLLLIIYRNFCTIMLPKYLRF
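
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1). The function name `translate` and the table layout are illustrative choices and are not part of the database record; only the first 33 nucleotides of the CDS are used as input.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order:
# first base, then second base, then third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing anything other than A/C/G/T are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 11 codons of the CDS listed above.
cds_start = "ATGGATCAAATAATCTGTGTAAAATTTAGAAAG"
print(translate(cds_start))
```

Run on the full CDS, the same function should give the complete protein; ambiguous codons are returned as `X` rather than guessed.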
BLAST of ClCG03G010040 vs. Swiss-Prot
Match: FTSI5_ARATH (Probable inactive ATP-dependent zinc metalloprotease FTSHI 5, chloroplastic OS=Arabidopsis thaliana GN=FTSHI5 PE=2 SV=1)

HSP 1 Score: 770.0 bits (1987), Expect = 2.1e-221
Identity = 385/498 (77.31%), Positives = 435/498 (87.35%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV VEAQELE GLWVGQSA+NVRELFQTARDLAP
Sbjct: 819  GVLIVGERGTGKTSLALAIAAEARVPVVNVEAQELEAGLWVGQSAANVRELFQTARDLAP 878

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKF+HTK+QDHE+FINQLLVELDGFEKQDGVVLMATTRN KQIDE
Sbjct: 879  VIIFVEDFDLFAGVRGKFVHTKQQDHESFINQLLVELDGFEKQDGVVLMATTRNHKQIDE 938

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            AL+RPGRMDRVFHLQ PT+ ERE+IL  AAE +MD EL++ VDW+KV+EKT+LLRP+ELK
Sbjct: 939  ALRRPGRMDRVFHLQSPTEMERERILHNAAEETMDRELVDLVDWRKVSEKTTLLRPIELK 998

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALE SAFRSKFLDTDEL+ Y S FATFS IVP W++KT+  K + KMLVNHLGL L
Sbjct: 999  LVPMALESSAFRSKFLDTDELLSYVSWFATFSHIVPPWLRKTKVAKTMGKMLVNHLGLNL 1058

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +K+DL+NVVDLMEPYGQISNGIELLNP +DWTRETKFPHAVWAAGR LI LL+PNFDVV+
Sbjct: 1059 TKDDLENVVDLMEPYGQISNGIELLNPTVDWTRETKFPHAVWAAGRALITLLIPNFDVVE 1118

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SW+GIGCTKI+K  + GS  GN+ESRSYLEKKLVFCFGS+IA+QMLLP G+EN
Sbjct: 1119 NLWLEPSSWEGIGCTKITKVTSGGSAIGNTESRSYLEKKLVFCFGSHIASQMLLPPGDEN 1178

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSE+ +AQEIAT+MV+QYGWGPDDSPA+Y   N V++LSMG+ +EYE+A KVEKIYD
Sbjct: 1179 FLSSSEITKAQEIATRMVLQYGWGPDDSPAVYYATNAVSALSMGNNHEYEMAGKVEKIYD 1238

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AK ML KNR+VLEK  EELLE EILT K LER++  NGGIREKEPF+LS     E
Sbjct: 1239 LAYEKAKGMLLKNRRVLEKITEELLEFEILTHKDLERIVHENGGIREKEPFFLSGTNYNE 1298

Query: 566  PLTGGFLESANSSGTALL 584
             L+  FL+  +   TALL
Sbjct: 1299 ALSRSFLDVGDPPETALL 1316
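
The Identity and Positives percentages in the HSP summary above are simple ratios of matching columns to the alignment length. The sketch below reproduces them from the reported counts; the helper name `hsp_percent` is hypothetical and is not part of any BLAST tool.

```python
def hsp_percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the FTSI5_ARATH HSP summary line.
identity = hsp_percent(385, 498)
positives = hsp_percent(435, 498)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The denominator is the number of aligned columns (498 here), not the full length of either protein.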

BLAST of ClCG03G010040 vs. Swiss-Prot
Match: FTSH_HAEIN (ATP-dependent zinc metalloprotease FtsH OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=ftsH PE=3 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 5.3e-36
Identity = 134/477 (28.09%), Positives = 217/477 (45.49%), Query Frame = 1

Query: 79  NLRGLIC-GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELF 138
           NL G I  G+L+VG  GTGKT LA AIA EAKVP  T+   +    ++VG  AS VR++F
Sbjct: 178 NLGGKIPKGILMVGPPGTGKTLLARAIAGEAKVPFFTISGSDFVE-MFVGVGASRVRDMF 237

Query: 139 QTARDLAPVIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATT 198
           + A+  AP +IF+++ D     RG  +     + E  +NQ+LVE+DGF   DGV+++A T
Sbjct: 238 EQAKKNAPCLIFIDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFSGNDGVIVIAAT 297

Query: 199 RNLKQIDEALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTS 258
                +D AL RPGR DR   +  P    RE+IL++                     K S
Sbjct: 298 NRPDVLDPALTRPGRFDRQVVVGLPDVKGREQILKVH------------------MRKVS 357

Query: 259 LLRPVELKLVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKML 318
           + + V+   +     G +      D   L+  ++LFA                 ++NK  
Sbjct: 358 VAQDVDAMTLARGTPGYS----GADLANLVNEAALFAA----------------RVNKRT 417

Query: 319 VNHLGLTLSKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALL 378
           V  L    +K+             +I+ G E     +  T + K   A   AG  ++  L
Sbjct: 418 VTMLEFEKAKD-------------KINMGPE--RRTMIMTDKQKESTAYHEAGHAIVGYL 477

Query: 379 LPNFDVVDNLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQM 438
           +P  D V  + + P   + +G T      ++ SI     S+  LE KL   +   +A  +
Sbjct: 478 VPEHDPVHKVTIIPRG-RALGVTFFLPEGDQISI-----SQKQLESKLSTLYAGRLAEDL 537

Query: 439 LLPFGEENFL--SSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLS-------- 498
           +  +GEEN    +S+++K A  IA  MV Q+G+     P +Y+ +     L         
Sbjct: 538 I--YGEENISTGASNDIKVATNIARNMVTQWGFSEKLGPILYTEDEGEVFLGRSMAKAKH 592

Query: 499 MGDKYEYEVAAKVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLI 545
           M D+  + +  +V  I +  Y RA+E+L  N  +L    + L++ E +  + +++L+
Sbjct: 598 MSDETAHSIDEEVRAIVNRNYARAREILIDNMDILHAMKDALVKYETIEEEQIKQLM 592

BLAST of ClCG03G010040 vs. Swiss-Prot
Match: FTSH3_SYMTH (ATP-dependent zinc metalloprotease FtsH 3 OS=Symbiobacterium thermophilum (strain T / IAM 14863) GN=ftsH3 PE=3 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.5e-33
Identity = 130/468 (27.78%), Positives = 207/468 (44.23%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           GVL+ G  GTGKT LA A+A EA VP  ++   +    ++VG  AS VR+LF+ A+  +P
Sbjct: 192 GVLLYGPPGTGKTLLAKAVAGEAGVPFFSISGSDFVE-MFVGVGASRVRDLFEQAKKNSP 251

Query: 146 VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            I+F+++ D     RG        + E  +NQLLVE+DGF   +G++++A T     +D 
Sbjct: 252 CIVFIDEIDAVGRQRGAGYGGGHDEREQTLNQLLVEMDGFSANEGIIIIAATNRPDVLDP 311

Query: 206 ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
           AL RPGR DR   + RP    R  I Q+ A+G   E                     ++ 
Sbjct: 312 ALLRPGRFDRQIVIDRPDLKGRLAIFQVHAKGKPLEP--------------------DVD 371

Query: 266 LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
           L  LA     F     D   LM  ++L A             R  KK+            
Sbjct: 372 LEVLAKRTPGFTGA--DIANLMNEAALLAA-----------RRRKKKI------------ 431

Query: 326 SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
           S +D+++ +D +     ++ G E  +  +  + + K   A   AG  ++  +LP+ D + 
Sbjct: 432 SMQDVEDAIDRV-----LAGGPEKKSRVI--SEKEKRVTAYHEAGHAVVGHMLPHMDPLH 491

Query: 386 NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            + + P   + +G T      +  +I     S+S +  ++    G   A +  + FGE  
Sbjct: 492 KITIIPRG-RAMGYTLFLPVEDRYNI-----SKSEILDRMTMALGGRAAEE--ITFGEIT 551

Query: 446 FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMG----DKYEYEVAA--- 505
             +  ++++  + A +MV ++G      P  Y        L+        Y  EVA    
Sbjct: 552 SGAQDDIERTTQWARRMVTEWGMSEKLGPLTYGMKQDEVFLARDMTRLRNYSEEVAGLID 598

Query: 506 -KVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIE 546
            +V K   +AY RA ++L ++R  LEK  E LLE E L GK L+ L+E
Sbjct: 612 EEVRKFVHMAYQRAIDILTEHRDALEKVSEVLLEKETLEGKELQDLLE 598

BLAST of ClCG03G010040 vs. Swiss-Prot
Match: FTSH4_SORC5 (ATP-dependent zinc metalloprotease FtsH 4 OS=Sorangium cellulosum (strain So ce56) GN=ftsH4 PE=3 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 2.7e-32
Identity = 127/471 (26.96%), Positives = 210/471 (44.59%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           GVL++G  GTGKT LA AIA EA VP  ++   +    ++VG  AS VR+LF+  +  AP
Sbjct: 200 GVLMMGPPGTGKTLLARAIAGEAGVPFFSISGSDFVE-MFVGVGASRVRDLFEQGKKHAP 259

Query: 146 VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            IIF+++ D     RG  +     + E  +NQLLVE+DGFE  +GV+++A T     +D 
Sbjct: 260 CIIFIDEIDAVGRHRGAGLGGGHDEREQTLNQLLVEMDGFESNEGVIIVAATNRPDVLDP 319

Query: 206 ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
           A+ RPGR DR   + RP    RE                             +LR V  K
Sbjct: 320 AILRPGRFDRRIVVNRPDVRGRE----------------------------GILR-VHTK 379

Query: 266 LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            VPL  +              +    L     G V   ++           LVN   L  
Sbjct: 380 KVPLGPD--------------VDMEILARGTPGFVGADIEN----------LVNEAALLA 439

Query: 326 SKED--LQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNF-D 385
           +++D  + ++VD      ++  G E  +  +  + E K   A   AG  L+A LL  F D
Sbjct: 440 ARQDKDVVSMVDFEMAKDKVLMGAERRSMVI--SDEEKRTTAYHEAGHALVAKLLEKFSD 499

Query: 386 VVDNLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFG 445
            V  + + P     +G T+   + +  S+     SR + + +L    G  +A +++  FG
Sbjct: 500 PVHKVTIIPRG-PALGLTQQLPKEDRLSM-----SRDFAKARLSVLMGGRVAEEIV--FG 559

Query: 446 EENFLSSSELKQAQEIATKMVIQYG---------WGPDDSPAIYSRNNTVTSLSMGDKYE 505
           +    + +++KQA  +A +MV ++G         +G D+      R+ T       +   
Sbjct: 560 QFTTGAGNDIKQASNLARRMVTEFGMSDVIGPISYGADEESVFLGRDFTSRRRDYSETIA 606

Query: 506 YEVAAKVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLI 545
            ++  +V +    A+  A+++L  NR++LE+    LLE E L  + ++ ++
Sbjct: 620 NQIDDEVRRFILDAHAEARQLLTDNREILERLATALLERETLDAEEVDAIV 606

BLAST of ClCG03G010040 vs. Swiss-Prot
Match: FTSH_BUCAI (ATP-dependent zinc metalloprotease FtsH OS=Buchnera aphidicola subsp. Acyrthosiphon pisum (strain APS) GN=ftsH PE=3 SV=2)

HSP 1 Score: 140.6 bits (353), Expect = 6.1e-32
Identity = 123/470 (26.17%), Positives = 215/470 (45.74%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           G+L+VG  GTGKT LA AIA EAKVP  T+   +    ++VG  AS VR++F+ +R  AP
Sbjct: 187 GILMVGPPGTGKTLLAKAIAGEAKVPFFTISGSDFVE-MFVGVGASRVRDMFEHSRKSAP 246

Query: 146 VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            IIF+++ D     RG  +     + E  +NQ+LVE+DGF+  +G++L+A T     +D 
Sbjct: 247 CIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFDGNEGIILIAATNRPDVLDP 306

Query: 206 ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
           AL RPGR DR   +  P    RE+IL+              V  +KV        P+   
Sbjct: 307 ALLRPGRFDRQVIVALPDIRGREQILK--------------VHMRKV--------PLSKD 366

Query: 266 LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
           + P+ +          D   L+  ++LFA                 +L+K +V+ L    
Sbjct: 367 VDPMIIARGTPGFSGADLANLVNEAALFAA----------------RLDKRVVSMLEFER 426

Query: 326 SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
           +K+ +          G     + +     D+ +E+   H    AG  +I  L+P+ D   
Sbjct: 427 AKDKMM--------MGSERRSMVM----SDFQKESTAYH---EAGHVIIGRLVPDHDPAH 486

Query: 386 NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            + + P   + +G T      +  SI     SR  LE ++   +G  +A +++  +G +N
Sbjct: 487 KVTIIPRG-RALGVTFFLPESDTLSI-----SRQKLESQISTLYGGRLAEEII--YGAKN 546

Query: 446 FLSS--SELKQAQEIATKMVIQYGWGPDDSPAIYSR--------NNTVTSLSMGDKYEYE 505
             +   +++K A  +A  MV Q+G+     P +Y+          +   +  M D+    
Sbjct: 547 VSTGAYNDIKIATSLAKNMVTQWGFSEKLGPLLYAEEEGEIFLGRSVAKAKHMSDETARI 594

Query: 506 VAAKVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIE 546
           +  +V+ + ++ Y RA+ +L +N  +L    E L++ E +    ++ L++
Sbjct: 607 IDEEVKLLIEINYSRARNILNENIDILHAMKEALIKYETIDAFQIDDLMK 594

BLAST of ClCG03G010040 vs. TrEMBL
Match: W9RHH7_9ROSA (ATP-dependent zinc metalloprotease FtsH OS=Morus notabilis GN=L484_024479 PE=4 SV=1)

HSP 1 Score: 844.7 bits (2181), Expect = 7.3e-242
Identity = 418/498 (83.94%), Positives = 459/498 (92.17%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEAKVPVV V+AQELE GLWVGQSASNVRELFQTARDLAP
Sbjct: 802  GVLIVGERGTGKTSLALAIAAEAKVPVVEVKAQELEAGLWVGQSASNVRELFQTARDLAP 861

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VI+FVEDFDLFAGVRG +IHTK QDHE+FINQLLVELDGFEKQDGVVLMATTRNL+Q+DE
Sbjct: 862  VILFVEDFDLFAGVRGTYIHTKNQDHESFINQLLVELDGFEKQDGVVLMATTRNLQQVDE 921

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            ALQRPGRMDR+FHLQRPTQ+EREKILQIAA+ +MD ELI++VDWKKVAEKT+LLRP+ELK
Sbjct: 922  ALQRPGRMDRIFHLQRPTQAEREKILQIAAKETMDNELIDFVDWKKVAEKTALLRPIELK 981

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKFLD DELM Y   FATFSG +P W++KT+ VKKL+KMLVNHLGLTL
Sbjct: 982  LVPVALEGSAFRSKFLDMDELMSYCGWFATFSGFIPGWLRKTKIVKKLSKMLVNHLGLTL 1041

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +KEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1042 TKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1101

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEPLSWQGIGCTKI+K RNEGS+NGNSESRSYLEKKLVFCFGS++AAQMLLPFGEEN
Sbjct: 1102 NLWLEPLSWQGIGCTKITKARNEGSVNGNSESRSYLEKKLVFCFGSHVAAQMLLPFGEEN 1161

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSELKQAQEIAT+MVIQYGWGPDDSPAIY  +N  T+LSMG+ YEYE+A KVEK+YD
Sbjct: 1162 FLSSSELKQAQEIATRMVIQYGWGPDDSPAIYYHSNAATALSMGNNYEYEMATKVEKMYD 1221

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNRQ+LEK  EELLE EILTGK LER++E +GGI E EPF+LS  Y  E
Sbjct: 1222 LAYFKAKEMLQKNRQILEKIAEELLEFEILTGKDLERMLEDHGGIGETEPFFLSGVYDME 1281

Query: 566  PLTGGFLESANSSGTALL 584
            PL+  FLE+ N++ T LL
Sbjct: 1282 PLSSCFLENGNATATTLL 1299

BLAST of ClCG03G010040 vs. TrEMBL
Match: A0A067JUT9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23568 PE=4 SV=1)

HSP 1 Score: 837.4 bits (2162), Expect = 1.2e-239
Identity = 415/498 (83.33%), Positives = 461/498 (92.57%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV V AQ+LE GLWVGQSASNVRELFQTARDLAP
Sbjct: 796  GVLIVGERGTGKTSLALAIAAEARVPVVKVAAQQLEAGLWVGQSASNVRELFQTARDLAP 855

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTK+QDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE
Sbjct: 856  VIIFVEDFDLFAGVRGKFIHTKKQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 915

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            AL+RPGRMDRVF+LQ+PTQ+EREKIL  AA+ +MDE LI++VDWKKVAEKT+LLRPVELK
Sbjct: 916  ALRRPGRMDRVFYLQQPTQTEREKILLNAAKATMDENLIDFVDWKKVAEKTALLRPVELK 975

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKF+DTDELM Y S FATFS I+PKWV+KT+  +K+++MLVNHLGL L
Sbjct: 976  LVPVALEGSAFRSKFVDTDELMSYCSWFATFSAIIPKWVRKTKIARKMSRMLVNHLGLEL 1035

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +KEDLQ+VVDLMEPYGQISNGI+LLNPP+DWTRETKFPHAVWAAGRGLI LLLPNFDVVD
Sbjct: 1036 AKEDLQSVVDLMEPYGQISNGIDLLNPPIDWTRETKFPHAVWAAGRGLITLLLPNFDVVD 1095

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SWQGIGCTKISK RNEGS+NGN ESRSYLEKKLVFCFGSY+++Q+LLPFGEEN
Sbjct: 1096 NLWLEPCSWQGIGCTKISKARNEGSLNGNVESRSYLEKKLVFCFGSYVSSQLLLPFGEEN 1155

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSEL+QAQEIAT+MVIQYGWGPDDSPAIY  +N VTSLSMG+ +EY++AAKVEK+YD
Sbjct: 1156 FLSSSELRQAQEIATRMVIQYGWGPDDSPAIYYTSNAVTSLSMGNNHEYDIAAKVEKMYD 1215

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNR+VLEK VEELLE EILTGK LER+IE NGGIREKEPF+LSE   RE
Sbjct: 1216 LAYLKAKEMLQKNRRVLEKIVEELLEFEILTGKDLERIIENNGGIREKEPFFLSEANYRE 1275

Query: 566  PLTGGFLESANSSGTALL 584
            P++  FL++ N  G ALL
Sbjct: 1276 PVSSSFLDTGNGPGPALL 1293

BLAST of ClCG03G010040 vs. TrEMBL
Match: A0A059B763_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H04217 PE=4 SV=1)

HSP 1 Score: 836.6 bits (2160), Expect = 2.0e-239
Identity = 411/498 (82.53%), Positives = 465/498 (93.37%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV VEAQ+LE GLWVGQSASNVRELFQTARDLAP
Sbjct: 631  GVLIVGERGTGKTSLALAIAAEARVPVVKVEAQQLEAGLWVGQSASNVRELFQTARDLAP 690

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTK+QDHEAFINQLLVELDGFEKQDGVVLMATTR+LKQIDE
Sbjct: 691  VIIFVEDFDLFAGVRGKFIHTKKQDHEAFINQLLVELDGFEKQDGVVLMATTRSLKQIDE 750

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            ALQRPGRMDRVF+LQRPTQ+EREKILQIAA+ +MD+ELI+ VDW+KVAEKT+LLRP+ELK
Sbjct: 751  ALQRPGRMDRVFNLQRPTQAEREKILQIAAKETMDDELIDLVDWRKVAEKTALLRPIELK 810

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKF+D DELM Y S FATFS +VPKW+++T+ VK++++MLVNHLGLTL
Sbjct: 811  LVPVALEGSAFRSKFVDVDELMSYCSWFATFSNMVPKWIRQTKVVKQISRMLVNHLGLTL 870

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            ++ED+QNVVDLMEPYGQI+NG+ELLNPPLDWT ETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 871  TEEDMQNVVDLMEPYGQINNGVELLNPPLDWTEETKFPHAVWAAGRGLIALLLPNFDVVD 930

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SWQGIGCTKI+K R+EGS+N NSESRSYLEKKLVFCFGSY+A+Q+LLPFGEEN
Sbjct: 931  NLWLEPSSWQGIGCTKITKARSEGSVNANSESRSYLEKKLVFCFGSYVASQLLLPFGEEN 990

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSELKQAQEIAT+MVIQYGWGPDDSPAIY  +N VT+LSMG+K+EYE+AAKVEK+YD
Sbjct: 991  FLSSSELKQAQEIATRMVIQYGWGPDDSPAIYYHSNAVTALSMGNKHEYEIAAKVEKMYD 1050

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNR+VLEK V+ELLE EILTGK LER +E NGG+REKEPF L + +  +
Sbjct: 1051 LAYYKAKEMLQKNRRVLEKIVDELLEFEILTGKDLERTLEENGGMREKEPFSLVQLFNGQ 1110

Query: 566  PLTGGFLESANSSGTALL 584
            P++  FL+  N+SGTALL
Sbjct: 1111 PVSSSFLDDGNASGTALL 1128

BLAST of ClCG03G010040 vs. TrEMBL
Match: A0A061F1B0_THECC (Metalloprotease m41 ftsh, putative isoform 2 OS=Theobroma cacao GN=TCM_026140 PE=4 SV=1)

HSP 1 Score: 833.9 bits (2153), Expect = 1.3e-238
Identity = 415/500 (83.00%), Positives = 461/500 (92.20%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV VEAQ+LE GLWVGQSASNVRELFQTARDLAP
Sbjct: 801  GVLIVGERGTGKTSLALAIAAEARVPVVNVEAQQLEAGLWVGQSASNVRELFQTARDLAP 860

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTK+QDHEAFINQLLVELDGFEKQDGVVLMATTRN+KQIDE
Sbjct: 861  VIIFVEDFDLFAGVRGKFIHTKKQDHEAFINQLLVELDGFEKQDGVVLMATTRNIKQIDE 920

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            AL+RPGRMDRVFHLQRPTQ+EREKIL+IAA+ +MDEELI+ VDWKKVAEKT+LLRP+ELK
Sbjct: 921  ALRRPGRMDRVFHLQRPTQAEREKILRIAAKETMDEELIDLVDWKKVAEKTALLRPIELK 980

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKFLDTDELM Y S FATFSG+VPKWV+ T+ VK+++KMLVNHLGL L
Sbjct: 981  LVPVALEGSAFRSKFLDTDELMSYCSWFATFSGMVPKWVRSTKIVKQVSKMLVNHLGLKL 1040

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            ++EDLQNVVDLMEPYGQISNGIE LNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1041 TQEDLQNVVDLMEPYGQISNGIEFLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1100

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SW+GIGCTKI+K  NEGS+  N+ESRSYLEKKLVFCFGS+IAAQ+LLPFGEEN
Sbjct: 1101 NLWLEPCSWEGIGCTKITKASNEGSMYANAESRSYLEKKLVFCFGSHIAAQLLLPFGEEN 1160

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLS+SELKQAQEIAT+MVIQYGWGPDDSPAIY  +N VT+LSMG+ +E+E+A KVEKIYD
Sbjct: 1161 FLSASELKQAQEIATRMVIQYGWGPDDSPAIYYSSNAVTALSMGNNHEFEMATKVEKIYD 1220

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNRQVLEK VEELLE EILTGK LER++  NGG+REKEPF+LS+   RE
Sbjct: 1221 LAYQKAKEMLKKNRQVLEKIVEELLEFEILTGKDLERILHENGGLREKEPFFLSQVDYRE 1280

Query: 566  PLTGGFLESANSSGTALLKI 586
            PL+  FL+  ++S T  L +
Sbjct: 1281 PLSSSFLDEGSASETTFLDV 1300

BLAST of ClCG03G010040 vs. TrEMBL
Match: A0A061F0H2_THECC (Metalloprotease m41 ftsh, putative isoform 1 OS=Theobroma cacao GN=TCM_026140 PE=4 SV=1)

HSP 1 Score: 833.9 bits (2153), Expect = 1.3e-238
Identity = 415/500 (83.00%), Positives = 461/500 (92.20%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV VEAQ+LE GLWVGQSASNVRELFQTARDLAP
Sbjct: 808  GVLIVGERGTGKTSLALAIAAEARVPVVNVEAQQLEAGLWVGQSASNVRELFQTARDLAP 867

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTK+QDHEAFINQLLVELDGFEKQDGVVLMATTRN+KQIDE
Sbjct: 868  VIIFVEDFDLFAGVRGKFIHTKKQDHEAFINQLLVELDGFEKQDGVVLMATTRNIKQIDE 927

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            AL+RPGRMDRVFHLQRPTQ+EREKIL+IAA+ +MDEELI+ VDWKKVAEKT+LLRP+ELK
Sbjct: 928  ALRRPGRMDRVFHLQRPTQAEREKILRIAAKETMDEELIDLVDWKKVAEKTALLRPIELK 987

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKFLDTDELM Y S FATFSG+VPKWV+ T+ VK+++KMLVNHLGL L
Sbjct: 988  LVPVALEGSAFRSKFLDTDELMSYCSWFATFSGMVPKWVRSTKIVKQVSKMLVNHLGLKL 1047

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            ++EDLQNVVDLMEPYGQISNGIE LNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1048 TQEDLQNVVDLMEPYGQISNGIEFLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1107

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SW+GIGCTKI+K  NEGS+  N+ESRSYLEKKLVFCFGS+IAAQ+LLPFGEEN
Sbjct: 1108 NLWLEPCSWEGIGCTKITKASNEGSMYANAESRSYLEKKLVFCFGSHIAAQLLLPFGEEN 1167

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLS+SELKQAQEIAT+MVIQYGWGPDDSPAIY  +N VT+LSMG+ +E+E+A KVEKIYD
Sbjct: 1168 FLSASELKQAQEIATRMVIQYGWGPDDSPAIYYSSNAVTALSMGNNHEFEMATKVEKIYD 1227

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNRQVLEK VEELLE EILTGK LER++  NGG+REKEPF+LS+   RE
Sbjct: 1228 LAYQKAKEMLKKNRQVLEKIVEELLEFEILTGKDLERILHENGGLREKEPFFLSQVDYRE 1287

Query: 566  PLTGGFLESANSSGTALLKI 586
            PL+  FL+  ++S T  L +
Sbjct: 1288 PLSSSFLDEGSASETTFLDV 1307

BLAST of ClCG03G010040 vs. TAIR10
Match: AT3G04340.1 (AT3G04340.1 FtsH extracellular protease family)

HSP 1 Score: 770.0 bits (1987), Expect = 1.2e-222
Identity = 385/498 (77.31%), Positives = 435/498 (87.35%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV VEAQELE GLWVGQSA+NVRELFQTARDLAP
Sbjct: 819  GVLIVGERGTGKTSLALAIAAEARVPVVNVEAQELEAGLWVGQSAANVRELFQTARDLAP 878

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKF+HTK+QDHE+FINQLLVELDGFEKQDGVVLMATTRN KQIDE
Sbjct: 879  VIIFVEDFDLFAGVRGKFVHTKQQDHESFINQLLVELDGFEKQDGVVLMATTRNHKQIDE 938

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            AL+RPGRMDRVFHLQ PT+ ERE+IL  AAE +MD EL++ VDW+KV+EKT+LLRP+ELK
Sbjct: 939  ALRRPGRMDRVFHLQSPTEMERERILHNAAEETMDRELVDLVDWRKVSEKTTLLRPIELK 998

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALE SAFRSKFLDTDEL+ Y S FATFS IVP W++KT+  K + KMLVNHLGL L
Sbjct: 999  LVPMALESSAFRSKFLDTDELLSYVSWFATFSHIVPPWLRKTKVAKTMGKMLVNHLGLNL 1058

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +K+DL+NVVDLMEPYGQISNGIELLNP +DWTRETKFPHAVWAAGR LI LL+PNFDVV+
Sbjct: 1059 TKDDLENVVDLMEPYGQISNGIELLNPTVDWTRETKFPHAVWAAGRALITLLIPNFDVVE 1118

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SW+GIGCTKI+K  + GS  GN+ESRSYLEKKLVFCFGS+IA+QMLLP G+EN
Sbjct: 1119 NLWLEPSSWEGIGCTKITKVTSGGSAIGNTESRSYLEKKLVFCFGSHIASQMLLPPGDEN 1178

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSE+ +AQEIAT+MV+QYGWGPDDSPA+Y   N V++LSMG+ +EYE+A KVEKIYD
Sbjct: 1179 FLSSSEITKAQEIATRMVLQYGWGPDDSPAVYYATNAVSALSMGNNHEYEMAGKVEKIYD 1238

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AK ML KNR+VLEK  EELLE EILT K LER++  NGGIREKEPF+LS     E
Sbjct: 1239 LAYEKAKGMLLKNRRVLEKITEELLEFEILTHKDLERIVHENGGIREKEPFFLSGTNYNE 1298

Query: 566  PLTGGFLESANSSGTALL 584
             L+  FL+  +   TALL
Sbjct: 1299 ALSRSFLDVGDPPETALL 1316

BLAST of ClCG03G010040 vs. TAIR10
Match: AT5G15250.2 (AT5G15250.2 FTSH protease 6)

HSP 1 Score: 129.8 bits (325), Expect = 6.1e-30
Identity = 134/493 (27.18%), Positives = 216/493 (43.81%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           GVL+ G  GTGKT LA AIA EA VP  ++   E    ++VG  AS  R+LF  A+  +P
Sbjct: 257 GVLLTGPPGTGKTLLAKAIAGEAGVPFFSLSGSEFIE-MFVGVGASRARDLFNKAKANSP 316

Query: 146 VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            I+F+++ D    +RG  I     + E  +NQ+L E+DGF    GV+++A T   + +D 
Sbjct: 317 CIVFIDEIDAVGRMRGTGIGGGNDEREQTLNQILTEMDGFAGNTGVIVIAATNRPEILDS 376

Query: 206 ALQRPGRMDR--VFHLQRPTQSEREKILQI----AAEGSMD----EELINYVDWKKVAEK 265
           AL RPGR DR   + + +P +S R  I+       + G  D    EE++      K  +K
Sbjct: 377 ALLRPGRFDRQVCWLILKPNKSNRFGIMSTCFKQVSVGLPDIRGREEILKVHSRSKKLDK 436

Query: 266 TSLLRPVELKLVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNK 325
                  ++ L  +A+    F     D   LM  +++ A   G                 
Sbjct: 437 -------DVSLSVIAMRTPGFSG--ADLANLMNEAAILAGRRG----------------- 496

Query: 326 MLVNHLGLTLSKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIA 385
              + + LT   + +  +V  ME       G ++++       ++K   A    G  + A
Sbjct: 497 --KDKITLTEIDDSIDRIVAGME-------GTKMID------GKSKAIVAYHEVGHAICA 556

Query: 386 LLLPNFDVVDNLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAA 445
            L    D V  + L P   Q  G T      +   +     S+  L  ++V   G   A 
Sbjct: 557 TLTEGHDPVQKVTLVPRG-QARGLTWFLPGEDPTLV-----SKQQLFARIVGGLGGRAAE 616

Query: 446 QMLLPFGEENFLSSSELKQAQEIATKMVIQYG------WGPDDSPAIYSRNNTVTSL--- 505
            ++    E    ++ +L+Q  EIA +MV  +G      W   D PA+   +  +  L   
Sbjct: 617 DVIFGEPEITTGAAGDLQQVTEIARQMVTMFGMSEIGPWALTD-PAVKQNDVVLRMLARN 676

Query: 506 SMGDKYEYEVAAKVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTG--------K 552
           SM +K   ++ + V+KI   AY  AK+ +  NR+ ++K V+ LLE E LTG        +
Sbjct: 677 SMSEKLAEDIDSCVKKIIGDAYEVAKKHVRNNREAIDKLVDVLLEKETLTGDEFRAILSE 700

BLAST of ClCG03G010040 vs. TAIR10
Match: AT1G50250.1 (AT1G50250.1 FTSH protease 1)

HSP 1 Score: 124.4 bits (311), Expect = 2.6e-28
Identity = 74/171 (43.27%), Positives = 97/171 (56.73%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           G L+VG  GTGKT LA A+A EA VP  +  A E    L+VG  AS VR+LF+ A+  AP
Sbjct: 297 GCLLVGPPGTGKTLLARAVAGEAGVPFFSCAASEFVE-LFVGVGASRVRDLFEKAKSKAP 356

Query: 146 VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            I+F+++ D     RG  +     + E  INQLL E+DGF    GV+++A T     +D 
Sbjct: 357 CIVFIDEIDAVGRQRGAGMGGGNDEREQTINQLLTEMDGFSGNSGVIVLAATNRPDVLDS 416

Query: 206 ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKT 257
           AL RPGR DR   + RP  + R KILQ+ + G   + L   VD+ KVA +T
Sbjct: 417 ALLRPGRFDRQVTVDRPDVAGRVKILQVHSRG---KALGKDVDFDKVARRT 463

BLAST of ClCG03G010040 vs. TAIR10
Match: AT5G42270.1 (AT5G42270.1 FtsH extracellular protease family)

HSP 1 Score: 121.7 bits (304), Expect = 1.7e-27
Identity = 71/171 (41.52%), Positives = 98/171 (57.31%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           G L+VG  GTGKT LA A+A EA VP  +  A E    L+VG  AS VR+LF+ A+  AP
Sbjct: 285 GCLLVGPPGTGKTLLARAVAGEAGVPFFSCAASEFVE-LFVGVGASRVRDLFEKAKSKAP 344

Query: 146 VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            I+F+++ D     RG  +     + E  INQLL E+DGF    GV+++A T     +D 
Sbjct: 345 CIVFIDEIDAVGRQRGAGMGGGNDEREQTINQLLTEMDGFSGNSGVIVLAATNRPDVLDS 404

Query: 206 ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKT 257
           AL RPGR DR   + RP  + R +IL++ + G   + +   VD++KVA +T
Sbjct: 405 ALLRPGRFDRQVTVDRPDVAGRVQILKVHSRG---KAIGKDVDYEKVARRT 451


HSP 2 Score: 68.2 bits (165), Expect = 2.2e-11
Identity = 59/234 (25.21%), Positives = 108/234 (46.15%), Query Frame = 1

Query: 317 LVNHLGLTLSKEDLQNVV--DLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLI 376
           L+N   +  ++ +L+ +   ++ +   +I  G E  N  +  + E K   A   AG  L+
Sbjct: 462 LMNEAAILAARRELKEISKDEISDALERIIAGPEKKNAVV--SEEKKRLVAYHEAGHALV 521

Query: 377 ALLLPNFDVVDNLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIA 436
             L+P +D V  + + P    G G T  +   +E  +     SRSYLE ++    G  +A
Sbjct: 522 GALMPEYDPVAKISIIPRGQAG-GLTFFAP--SEERLESGLYSRSYLENQMAVALGGRVA 581

Query: 437 AQMLLPFGEENFLS--SSELKQAQEIATKMVIQYGWGPDDSPAIY--SRNNTVTSLSMGD 496
            +++  FG+EN  +  S++  Q   +A +MV ++G+           +  N     SM  
Sbjct: 582 EEVI--FGDENVTTGASNDFMQVSRVARQMVERFGFSKKIGQVAVGGAGGNPFLGQSMSS 641

Query: 497 KYEYEVA------AKVEKIYDLAYCRAKEMLAKNRQVLEKFVEELLECEILTGK 539
           + +Y +A      A+V ++ + AY RAKE++     +L K  + L+E E + G+
Sbjct: 642 QKDYSMATADVVDAEVRELVEKAYVRAKEIITTQIDILHKLAQLLIEKETVDGE 688

BLAST of ClCG03G010040 vs. TAIR10
Match: AT3G47060.1 (AT3G47060.1 FTSH protease 7)

HSP 1 Score: 121.3 bits (303), Expect = 2.2e-27
Identity = 67/149 (44.97%), Positives = 92/149 (61.74%), Query Frame = 1

Query: 86  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
           GVL+VG  GTGKT LA A+A EA+VP ++  A E    L+VG  AS VR+LF  A+  AP
Sbjct: 360 GVLLVGLPGTGKTLLAKAVAGEAEVPFISCSASEFVE-LYVGMGASRVRDLFARAKKEAP 419

Query: 146 VIIFVEDFDLFAGVR-GKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQID 205
            IIF+++ D  A  R GKF      + E  +NQLL E+DGF+    V+++  T     +D
Sbjct: 420 SIIFIDEIDAVAKSRDGKFRMGSNDEREQTLNQLLTEMDGFDSNSAVIVLGATNRADVLD 479

Query: 206 EALQRPGRMDRVFHLQRPTQSEREKILQI 234
            AL+RPGR DRV  ++ P +  RE IL++
Sbjct: 480 PALRRPGRFDRVVTVETPDKIGRESILRV 507
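
The percentages on each HSP summary line can be recomputed from the raw counts as a sanity check. The following is a minimal Python sketch and is not part of the database output. The function name `parse_hsp_stats` is hypothetical, and the regular expression assumes the summary-line layout used in these reports; it also tolerates the "Postives" spelling that some report generators emit.

```python
import re

# Assumed layout: "Identity = 67/149 (44.97%), Positives = 92/149 (61.74%), ..."
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed values: count / alignment length, as a percentage.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 67/149 (44.97%), Positives = 92/149 (61.74%), Query Frame = 1"
    )
    print(stats["identity_pct"], stats["positives_pct"])
```

For the AT3G47060.1 hit, 67 identical residues over 149 aligned positions gives 44.97%, which matches the reported value.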

BLAST of ClCG03G010040 vs. NCBI nr
Match: gi|659086122|ref|XP_008443775.1| (PREDICTED: uncharacterized protein LOC103487285 [Cucumis melo])

HSP 1 Score: 932.2 bits (2408), Expect = 5.0e-268
Identity = 471/498 (94.58%), Positives = 482/498 (96.79%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP
Sbjct: 821  GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 880

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLK+IDE
Sbjct: 881  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKKIDE 940

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            ALQRPGRMDRVFHLQ+PTQSEREKILQIAAEGSMDEEL+NYVDWKKVAEKT+LLRP+EL+
Sbjct: 941  ALQRPGRMDRVFHLQKPTQSEREKILQIAAEGSMDEELVNYVDWKKVAEKTALLRPMELQ 1000

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVPLALEGSAFRSK LD DELMGY S FATF  IVP+WVQKTRTVKKLNKMLVNHLGLTL
Sbjct: 1001 LVPLALEGSAFRSKILDADELMGYCSWFATFRDIVPEWVQKTRTVKKLNKMLVNHLGLTL 1060

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            SKEDLQ+VVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1061 SKEDLQSVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1120

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEPLSWQGIGCTKISKRR+EGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN
Sbjct: 1121 NLWLEPLSWQGIGCTKISKRRDEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 1180

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSELKQAQEIAT+MVIQYGWGPDDSPAIY RNN V  LSMGD YEYEVAAKVEKIYD
Sbjct: 1181 FLSSSELKQAQEIATRMVIQYGWGPDDSPAIYCRNNAVGFLSMGDSYEYEVAAKVEKIYD 1240

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAYCRAKEML KNRQVLEKFVEELLE EILTGKVLERLIETNGGIREKEPF+LSEYY RE
Sbjct: 1241 LAYCRAKEMLGKNRQVLEKFVEELLEFEILTGKVLERLIETNGGIREKEPFFLSEYYDRE 1300

Query: 566  PLTGGFLESANSSGTALL 584
            PLTGGFLES NSS TALL
Sbjct: 1301 PLTGGFLESTNSSRTALL 1318

BLAST of ClCG03G010040 vs. NCBI nr
Match: gi|449449669|ref|XP_004142587.1| (PREDICTED: uncharacterized protein LOC101207174 [Cucumis sativus])

HSP 1 Score: 912.1 bits (2356), Expect = 5.3e-262
Identity = 463/497 (93.16%), Positives = 476/497 (95.77%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGE GTGKTSLALAIAAEAKVPVVTV+AQELEPGLWVGQSASNVRELFQTARDLAP
Sbjct: 826  GVLIVGESGTGKTSLALAIAAEAKVPVVTVKAQELEPGLWVGQSASNVRELFQTARDLAP 885

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQID+
Sbjct: 886  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDD 945

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            ALQRPGRMDRVFHLQ PTQ EREKILQIAAE  MDEELINYVDWKKVAEKT+LLRPVELK
Sbjct: 946  ALQRPGRMDRVFHLQSPTQYEREKILQIAAEEFMDEELINYVDWKKVAEKTALLRPVELK 1005

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
             VPLALE SAFRSKFLDTDEL+ Y S FATFSG+VP+WVQKTR VKKLNKMLVNHLGLTL
Sbjct: 1006 RVPLALEASAFRSKFLDTDELISYCSWFATFSGVVPEWVQKTRIVKKLNKMLVNHLGLTL 1065

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1066 SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1125

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEPLSWQGIGCTKISKRR++GSINGNSESRSYLEKKLVFCFGSYIAA+MLLPFGEEN
Sbjct: 1126 NLWLEPLSWQGIGCTKISKRRDKGSINGNSESRSYLEKKLVFCFGSYIAAKMLLPFGEEN 1185

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSS ELKQAQEIAT+MV+QYGWGPDDSPAIYSRNN V+ LSMGD  EYEVAAKVEKIYD
Sbjct: 1186 FLSSYELKQAQEIATRMVLQYGWGPDDSPAIYSRNNAVSFLSMGDNCEYEVAAKVEKIYD 1245

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY RAKEML KNRQVLEKFVEELLE EILTGKVLERLIETNGGIREKEPF+LSEYY RE
Sbjct: 1246 LAYSRAKEMLGKNRQVLEKFVEELLEFEILTGKVLERLIETNGGIREKEPFFLSEYYDRE 1305

Query: 566  PLTGGFLESANSSGTAL 583
            PLTGGFLESANSS TAL
Sbjct: 1306 PLTGGFLESANSSRTAL 1322

BLAST of ClCG03G010040 vs. NCBI nr
Match: gi|703120875|ref|XP_010102198.1| (ATP-dependent zinc metalloprotease FtsH [Morus notabilis])

HSP 1 Score: 844.7 bits (2181), Expect = 1.0e-241
Identity = 418/498 (83.94%), Positives = 459/498 (92.17%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEAKVPVV V+AQELE GLWVGQSASNVRELFQTARDLAP
Sbjct: 802  GVLIVGERGTGKTSLALAIAAEAKVPVVEVKAQELEAGLWVGQSASNVRELFQTARDLAP 861

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VI+FVEDFDLFAGVRG +IHTK QDHE+FINQLLVELDGFEKQDGVVLMATTRNL+Q+DE
Sbjct: 862  VILFVEDFDLFAGVRGTYIHTKNQDHESFINQLLVELDGFEKQDGVVLMATTRNLQQVDE 921

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            ALQRPGRMDR+FHLQRPTQ+EREKILQIAA+ +MD ELI++VDWKKVAEKT+LLRP+ELK
Sbjct: 922  ALQRPGRMDRIFHLQRPTQAEREKILQIAAKETMDNELIDFVDWKKVAEKTALLRPIELK 981

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKFLD DELM Y   FATFSG +P W++KT+ VKKL+KMLVNHLGLTL
Sbjct: 982  LVPVALEGSAFRSKFLDMDELMSYCGWFATFSGFIPGWLRKTKIVKKLSKMLVNHLGLTL 1041

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +KEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1042 TKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1101

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEPLSWQGIGCTKI+K RNEGS+NGNSESRSYLEKKLVFCFGS++AAQMLLPFGEEN
Sbjct: 1102 NLWLEPLSWQGIGCTKITKARNEGSVNGNSESRSYLEKKLVFCFGSHVAAQMLLPFGEEN 1161

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSELKQAQEIAT+MVIQYGWGPDDSPAIY  +N  T+LSMG+ YEYE+A KVEK+YD
Sbjct: 1162 FLSSSELKQAQEIATRMVIQYGWGPDDSPAIYYHSNAATALSMGNNYEYEMATKVEKMYD 1221

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNRQ+LEK  EELLE EILTGK LER++E +GGI E EPF+LS  Y  E
Sbjct: 1222 LAYFKAKEMLQKNRQILEKIAEELLEFEILTGKDLERMLEDHGGIGETEPFFLSGVYDME 1281

Query: 566  PLTGGFLESANSSGTALL 584
            PL+  FLE+ N++ T LL
Sbjct: 1282 PLSSCFLENGNATATTLL 1299

BLAST of ClCG03G010040 vs. NCBI nr
Match: gi|1009117928|ref|XP_015875583.1| (PREDICTED: probable inactive ATP-dependent zinc metalloprotease FTSHI 5, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 843.2 bits (2177), Expect = 3.0e-241
Identity = 415/503 (82.50%), Positives = 463/503 (92.05%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEAKVPVV V+AQELE GLWVGQSASN+RELFQTARDLAP
Sbjct: 809  GVLIVGERGTGKTSLALAIAAEAKVPVVQVKAQELEAGLWVGQSASNIRELFQTARDLAP 868

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGK+IHTK+QDHEAFINQLLVELDGFEKQDGVVLMAT RNLKQIDE
Sbjct: 869  VIIFVEDFDLFAGVRGKYIHTKKQDHEAFINQLLVELDGFEKQDGVVLMATARNLKQIDE 928

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            ALQRPGRMDRVFHLQRPTQ ERE IL+++A+ +MD +LI++VDWKKVAEKT+LLRP ELK
Sbjct: 929  ALQRPGRMDRVFHLQRPTQVERENILRMSAKATMDNDLIDFVDWKKVAEKTALLRPTELK 988

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEG+AFRSKFLDTDELM Y   FATFSG++PKWV++T   KKL+ ++VNHLGLTL
Sbjct: 989  LVPVALEGAAFRSKFLDTDELMSYCGWFATFSGVIPKWVRRTNIAKKLSSIVVNHLGLTL 1048

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +KEDL NVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD
Sbjct: 1049 TKEDLNNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 1108

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEPLSWQGIGC+KI+K +NEGS+NGNSESRSYLEKKLVFCFGS+IA+QMLLPFGEEN
Sbjct: 1109 NLWLEPLSWQGIGCSKITKAKNEGSMNGNSESRSYLEKKLVFCFGSHIASQMLLPFGEEN 1168

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            +LSSSELKQAQEIAT+MVIQYGWGPDDSPAIY  +N +T+LSMG+ +EYE+A+KVEKIYD
Sbjct: 1169 YLSSSELKQAQEIATRMVIQYGWGPDDSPAIYYHSNAITALSMGNNHEYEIASKVEKIYD 1228

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAYC+AKEML KNRQVLEK VEELLE EILTGK LER++  NGGI EKEPF+LS  + +E
Sbjct: 1229 LAYCKAKEMLLKNRQVLEKIVEELLEFEILTGKDLERILIDNGGIGEKEPFFLSRIHEKE 1288

Query: 566  PLTGGFLESANSSGTALLKIQSS 589
            PL+  FLE+ N+SG  LL   +S
Sbjct: 1289 PLSSSFLETGNASGATLLSEAAS 1311

BLAST of ClCG03G010040 vs. NCBI nr
Match: gi|802759535|ref|XP_012089378.1| (PREDICTED: uncharacterized protein LOC105647765 isoform X2 [Jatropha curcas])

HSP 1 Score: 837.4 bits (2162), Expect = 1.7e-239
Identity = 415/498 (83.33%), Positives = 461/498 (92.57%), Query Frame = 1

Query: 86   GVLIVGERGTGKTSLALAIAAEAKVPVVTVEAQELEPGLWVGQSASNVRELFQTARDLAP 145
            GVLIVGERGTGKTSLALAIAAEA+VPVV V AQ+LE GLWVGQSASNVRELFQTARDLAP
Sbjct: 796  GVLIVGERGTGKTSLALAIAAEARVPVVKVAAQQLEAGLWVGQSASNVRELFQTARDLAP 855

Query: 146  VIIFVEDFDLFAGVRGKFIHTKEQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 205
            VIIFVEDFDLFAGVRGKFIHTK+QDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE
Sbjct: 856  VIIFVEDFDLFAGVRGKFIHTKKQDHEAFINQLLVELDGFEKQDGVVLMATTRNLKQIDE 915

Query: 206  ALQRPGRMDRVFHLQRPTQSEREKILQIAAEGSMDEELINYVDWKKVAEKTSLLRPVELK 265
            AL+RPGRMDRVF+LQ+PTQ+EREKIL  AA+ +MDE LI++VDWKKVAEKT+LLRPVELK
Sbjct: 916  ALRRPGRMDRVFYLQQPTQTEREKILLNAAKATMDENLIDFVDWKKVAEKTALLRPVELK 975

Query: 266  LVPLALEGSAFRSKFLDTDELMGYSSLFATFSGIVPKWVQKTRTVKKLNKMLVNHLGLTL 325
            LVP+ALEGSAFRSKF+DTDELM Y S FATFS I+PKWV+KT+  +K+++MLVNHLGL L
Sbjct: 976  LVPVALEGSAFRSKFVDTDELMSYCSWFATFSAIIPKWVRKTKIARKMSRMLVNHLGLEL 1035

Query: 326  SKEDLQNVVDLMEPYGQISNGIELLNPPLDWTRETKFPHAVWAAGRGLIALLLPNFDVVD 385
            +KEDLQ+VVDLMEPYGQISNGI+LLNPP+DWTRETKFPHAVWAAGRGLI LLLPNFDVVD
Sbjct: 1036 AKEDLQSVVDLMEPYGQISNGIDLLNPPIDWTRETKFPHAVWAAGRGLITLLLPNFDVVD 1095

Query: 386  NLWLEPLSWQGIGCTKISKRRNEGSINGNSESRSYLEKKLVFCFGSYIAAQMLLPFGEEN 445
            NLWLEP SWQGIGCTKISK RNEGS+NGN ESRSYLEKKLVFCFGSY+++Q+LLPFGEEN
Sbjct: 1096 NLWLEPCSWQGIGCTKISKARNEGSLNGNVESRSYLEKKLVFCFGSYVSSQLLLPFGEEN 1155

Query: 446  FLSSSELKQAQEIATKMVIQYGWGPDDSPAIYSRNNTVTSLSMGDKYEYEVAAKVEKIYD 505
            FLSSSEL+QAQEIAT+MVIQYGWGPDDSPAIY  +N VTSLSMG+ +EY++AAKVEK+YD
Sbjct: 1156 FLSSSELRQAQEIATRMVIQYGWGPDDSPAIYYTSNAVTSLSMGNNHEYDIAAKVEKMYD 1215

Query: 506  LAYCRAKEMLAKNRQVLEKFVEELLECEILTGKVLERLIETNGGIREKEPFYLSEYYGRE 565
            LAY +AKEML KNR+VLEK VEELLE EILTGK LER+IE NGGIREKEPF+LSE   RE
Sbjct: 1216 LAYLKAKEMLQKNRRVLEKIVEELLEFEILTGKDLERIIENNGGIREKEPFFLSEANYRE 1275

Query: 566  PLTGGFLESANSSGTALL 584
            P++  FL++ N  G ALL
Sbjct: 1276 PVSSSFLDTGNGPGPALL 1293

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
FTSI5_ARATH   2.1e-221   77.31         Probable inactive ATP-dependent zinc metalloprotease FTSHI 5, chloroplastic OS=A...
FTSH_HAEIN    5.3e-36    28.09         ATP-dependent zinc metalloprotease FtsH OS=Haemophilus influenzae (strain ATCC 5...
FTSH3_SYMTH   1.5e-33    27.78         ATP-dependent zinc metalloprotease FtsH 3 OS=Symbiobacterium thermophilum (strai...
FTSH4_SORC5   2.7e-32    26.96         ATP-dependent zinc metalloprotease FtsH 4 OS=Sorangium cellulosum (strain So ce5...
FTSH_BUCAI    6.1e-32    26.17         ATP-dependent zinc metalloprotease FtsH OS=Buchnera aphidicola subsp. Acyrthosip...

Match Name         E-value    Identity (%)  Description
W9RHH7_9ROSA       7.3e-242   83.94         ATP-dependent zinc metalloprotease FtsH OS=Morus notabilis GN=L484_024479 PE=4 S...
A0A067JUT9_JATCU   1.2e-239   83.33         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23568 PE=4 SV=1
A0A059B763_EUCGR   2.0e-239   82.53         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_H04217 PE=4 SV=1
A0A061F1B0_THECC   1.3e-238   83.00         Metalloprotease m41 ftsh, putative isoform 2 OS=Theobroma cacao GN=TCM_026140 PE...
A0A061F0H2_THECC   1.3e-238   83.00         Metalloprotease m41 ftsh, putative isoform 1 OS=Theobroma cacao GN=TCM_026140 PE...

Match Name    E-value    Identity (%)  Description
AT3G04340.1   1.2e-222   77.31         FtsH extracellular protease family
AT5G15250.2   6.1e-30    27.18         FTSH protease 6
AT1G50250.1   2.6e-28    43.27         FTSH protease 1
AT5G42270.1   1.7e-27    41.52         FtsH extracellular protease family
AT3G47060.1   2.2e-27    44.97         FTSH protease 7

Match Name                           E-value    Identity (%)  Description
gi|659086122|ref|XP_008443775.1|     5.0e-268   94.58         PREDICTED: uncharacterized protein LOC103487285 [Cucumis melo]
gi|449449669|ref|XP_004142587.1|     5.3e-262   93.16         PREDICTED: uncharacterized protein LOC101207174 [Cucumis sativus]
gi|703120875|ref|XP_010102198.1|     1.0e-241   83.94         ATP-dependent zinc metalloprotease FtsH [Morus notabilis]
gi|1009117928|ref|XP_015875583.1|    3.0e-241   82.50         PREDICTED: probable inactive ATP-dependent zinc metalloprotease FTSHI 5, chlorop...
gi|802759535|ref|XP_012089378.1|     1.7e-239   83.33         PREDICTED: uncharacterized protein LOC105647765 isoform X2 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR000642   Peptidase_M41
IPR003593   AAA+_ATPase
IPR003959   ATPase_AAA_core
IPR027417   P-loop_NTPase
Vocabulary: Molecular Function
Term        Definition
GO:0004222  metalloendopeptidase activity
GO:0005524  ATP binding
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0042981 regulation of apoptotic process
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0009506 plasmodesma
molecular_function GO:0005524 ATP binding
molecular_function GO:0004222 metalloendopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG03G010040.1  ClCG03G010040.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                                      Source   Source Term        Source Description                                    Alignment
IPR000642  Peptidase M41                                        PFAM     PF01434            Peptidase_M41                                         coord: 415..542, score: 5.0
IPR003593  AAA+ ATPase domain                                   SMART    SM00382            AAA_5                                                 coord: 83..223, score: 6.2
IPR003959  ATPase, AAA-type, core                               PFAM     PF00004            AAA                                                   coord: 87..220, score: 2.4
IPR027417  P-loop containing nucleoside triphosphate hydrolase  GENE3D   G3DSA:3.40.50.300                                                        coord: 80..256, score: 5.3
IPR027417  P-loop containing nucleoside triphosphate hydrolase  unknown  SSF52540           P-loop containing nucleoside triphosphate hydrolases  coord: 83..238, score: 4.73
None       No IPR available                                     PANTHER  PTHR23076          METALLOPROTEASE M41 FTSH                              coord: 317..641, score: 7.1E-301; coord: 86..251, score: 7.1E
None       No IPR available                                     PANTHER  PTHR23076:SF58     FTSH EXTRACELLULAR PROTEASE FAMILY PROTEIN            coord: 86..251, score: 7.1E-301; coord: 317..641, score: 7.1E
None       No IPR available                                     unknown  SSF140990          FtsH protease domain-like                             coord: 357..554, score: 1.57
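
The `coord: start..end` entries in the alignment column can be turned into domain lengths with a few lines of code. This is a minimal Python sketch rather than anything produced by the annotation pipeline. The function name `domain_spans` is hypothetical, and the code assumes the coordinates are 1-based, inclusive positions, which the page itself does not state.

```python
import re

# Matches entries such as "coord: 83..223"; the two groups are start and end.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(text):
    """Return (start, end, length) for every 'coord: a..b' entry in text.

    Length assumes inclusive coordinates, i.e. end - start + 1.
    """
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # SMART AAA+ ATPase domain entry taken from the table above.
    print(domain_spans("coord: 83..223"))
```

Under the inclusive-coordinate assumption, the SMART AAA+ ATPase entry (83..223) spans 141 positions.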

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene           Organism               Block
ClCG03G010040  Watermelon (97103) v2  wcgwmbB186
ClCG03G010040  Watermelon (97103) v1  wcgwmB225