BhiUN159G30 (gene) Wax gourd

NameBhiUN159G30
Typegene
OrganismBenincasa hispida (Wax gourd)
DescriptionATP-dependent clp protease ATP-binding subunit clpx, putative
LocationContig159 : 683553 .. 704642 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTCCACGATTCAGAGTCAAGAACACCGAATTCATCAAGCTATTCACATCCATGAGAACTATCACCACCACTGACTCGCCGTTCCACCGCCGTTTCATAACTCCCCTTCCGTCTAAACCCTTTCTACAACCCCATGCCGCCGCCCGTTCCGCCGGATTGTTCCTTTGGTTTTCTCAGAGACGCCACAGTTGGAAAGGGGCCTCCGACTACGATTACATTAGGGCTGATGTCAACTGTCCAAGGTGCTCCAAGCAAATGCCTCTCATCTTCTCTAATCGCCCCTTGTCCATTACGGCTCGGGAGACTGGCGTCTATCAAGCCCTCAATCTCTGCCCCAATTGCAAAACTGCATTCTATTTTAGACCCTTCAAGTTGGTTCCTTTGCAGGGTACTTTTATTGAGATTGGTAGGGTTAAGGGCGCCAGGGACTTGGATGACCATGTGGGCAATGAATCGAGCCAACATACGAAAGGAGATCTTGGCGCCAATTGTACTGATGATTCTGCCTTGCCTCCACGGTGGAATGGTAGCTGTGGCGGTGGCGGTGGCGACGGTAACTTGGGTTCGGTGGAGAGTAACGGTGTTCAAAAGGGCGACGGCAGTTTAGAAATGCAGTTGCTAACTCCCAAGGAAATTTCTGAGGCGCTGGACAAGTTCATCGTTGGACAAGAAAAGGCCAAGAAGGTTGGTTTCGGGTGATATGACCATATACTGTGCTTCTTATGACATTTTTCACAAGCATTTTGTGCACATTCTACATTTGTGGCGTATTTTTCATAATTGAGGAATTGTGCGCTTCTCTTTTATGTTTCGGTTAGAAACTAAGAGTTCAAGAACGAAACTACAATTGAGGGACTACTATCTCTCCATGAGCATGGAGTTATAAAAAGTCTCCAATTAGCATTGACGAAGAAAGGAGTGATTTTAAAAAATATATTCTATTGGAAGAATGTTCCATGAGGAAGTTAGAAAACACCACCCTCAAGAAGGAACAATGAAAATTTCCGTGAAGACAGACGCACAACAAAGTTAAGGAAAAGGAGTCCCTTGATGAAGGTAGAATATACACAACAGAAGCAGATTTGAAGGATCCTAGAGAATGGATGGGAATAGGAGATGAAGAAGGGACTCACATAACCAAAAATCCTCCCAAAAAATGAGCACCTAAGCCATCCCCAATGGAGAATTTTACATTTAGTAAAAAAGAATTAAAACTAGAAGAAATGACCTTCCCAAGGATTCGTGCATTTGTCCTTAGCCCTAGAAAATGAGATCCATTAAGTGGGAAGAACCCCATATTTGCTTGCAATGATATGATGGCAGAAAGTCTCAGGCTCTAGGGAAAAACAATATAACCACTTAGCTAGGATGGCCTCATTATGAACTCTCAAATTACAAATACCCAAACTGCCCAACTCTTATCAGTTTCATAACAATTTTCCATCTGGTCAAGTTAAAAGCTTGGTGTTTCTCGTTCTCCACTCCCTCCTAAAGAAAGTTCCTCAGGATCTTCTTCCAACCTACTATGAATAGTAAAGAAGATCATAACCAATTATAGGAAATAGGTAGGAATCCCACCCAATGTTGACTGGCTGAATCAGAGTTCCTACCGCCCTAGAAAAGATTACCTTCTTAGATAATTGCTTTTGTTCTTCTCAACCATTGGGTTATAGAAAGATAGATAGAGTGTGGCGTTTGAACGTAATAATGGTGATAAGGGAATTCTGGAGCTGCCATTTGCTGCTTGATCAGGATTTGAAGAACCACCCGTTACTATAAGCTCTATACTTGCTGGTATAACTTAAGAGTTTTATTCTCTAGGCCTTTTTTCTTATGATGCTGTGGCTCCCTCTGTCCAGAGGCAAGGTGATGTAGGACATCATATGTAATTTATTTGAAAAGGAAACAAAGCTTTTCATTGAATAATAAAATTGTAATTTATTTGAAAAGGAAACAAAGCTTTTCAATGAGATAATAAAATGAAGGACATTCTATGTAATCTTGGGTAGTTCTTGACTTTCTTATAATTCTTGTTCAAGATTTTGTTTTCCAGATGGATCTTTTCATTTAAACCATAATAGAAGCAAATGAATCTGGATAGTCAATACCATATGTTTGAGTGAAGCCTTTAGCTACTAGTCTGACCTTATATCTCTTAATTTACAGCCTAACGGTTTCTTATCCTTTGGTCACTCTACTATTTCCCTAATACCAATTATGGTAAACACCCAATTACAGCCTACCAGTCTCTTATCTTTTTGTAACTCTACTATTTCTCCTCCATAACTACTTATTTCCAATTTGGATTTTCTAAAGTCTTATATATATTCTCCTTGGAAGAGCCAAGTCATTTATCCTAATTCCTAGATGTGAAGTCTTTATGACTATTTGCCAATTTTTTATATGGGATATAGGTTGCCATATGGTATTTTGTACAACTTCAAATGCCTTTTCTAATTACTATGGGAGCATCAACATGAGACATAATTGACATAATACTTTAGGGGAGATAAATAATAATGTTTAGAGACGTCAAGAAGGATCCTAGTGATGTTTGGCCGTCGGTGAAGTTCCATGTTTCTCTTTGGACTTTGATTTTGGGACTTTTTGCAATTACTCTATATGCAACATTTTACTTAGTTGGAACCCCTTCCTTTAGTGGGGTTTTTGTGGGCCCGTTTTTCTGTATGCCCTAATATTCTTTTAGTTTTCATTTTTCATTGAAAGCAGTTGTTTGTATATATATATTGCATCTGTAATCTCTTTTACTTGAAAAGTATCCTAAGAAAATGCATTTAATAGCCCTATGATCAAGTTAGGTCGAAAGTGGTTGGTTATGTGGACATAAGTATTACATCAAAATTTTTTGAGGAAAATCAAACTATACGTGGGATGTTGGAAAAACATGTTTTCACTACAAAAATAAGATTAATGCTTTTCTTTTCCCTCCTTAGTCTCTTTGTATTGTATTTGTTTTGGTTCTTTTCGTTTCTTATTTCATTGTACCTTGAGCATTAGCCTCTTTTCATTAATTTAATGAAAAGTTTCCTTAATAAAAAAATTTACCTTCCTGCCACCTATTCTGTACACTGGGATGAAAGGATGGGGGTTTAATCCAGGAAGAAGTTTAGTGTTATTTAGAATTGTCTTTTGGTCCATTGATTTTACCACGAATCTATCATTTTGATTGACTTAATGTTGACATTTCAGGTGCTTTCTGTGGCAGTATACAACCACTATAAGAGGATATATCACGCTTCATTGCAGAAAACGTTAGTTTGCAAGAGTTGTCTAGTTGTCATTTTGCAACCCTTTTCGAAATATAATTTAAATATGCTTAATATTGCTTACTGATGCATGGAGTTCACTGTGATAGTCTATATTTAAAGGAAGAAAATAGTATTATTTTTGAGGGAAAGATATTTTTCATGTTAATGATTGTGATTGGCTTTGAGTTGGTGTCTACATTTGTATTTAGATCAGGACAAGGATCGCTGGGTATTGAATTGGAGAATGATGATAATGAAATTGTGGAATTGGAAAAGAGTAATGTGTTGCTAATGGGTCCTACAGGCTCAGGTACGATTGATTTTCTTCTGCATTTCTTTCTGTGTCCAAGAGCCTTAATTTTGTTGGAGGTTTTTCATTAACCATTTATGGAATCTCTAAGGATAGGGAAGACGTTACTTGCAAAAACCCTTGCTCGTGTTGTGAATGTGCCTTTCACCATAGCTGATGCTACTGCCTTAACTCAGGCAAGTGCATTTTCGGGATAGTTATTTTTTCTTTCCTTGTTAGTGTTATGTTTTTTTCCCTTTCTAAACACTCTCTCTCTCTCTCTCTCTATATATTATATATATAATATATAAACAACACAATTCTGTAAGGCCCCTAGTTAATTGTGATATTAGGATTATTAGTAGAATATTAATATAGTTATTAGGAGGCATATTAGTAATTGCCTATTAGGGTTTGTTAGGACTTTTTAGTTATAAATAGAGGGAATGGGTTGAGAGGAAGGTATGAAAAATTTTGTGAGATTTCCTTTGGGAAATTTGGGAGAGGATTCTCCACCCTCTAGAATGTGCTGAGGTATATTGTAATTTCTTTATAGATATTGCAATATAATTCTACTTTTAGTGTTCTTTGTTTTTTCTTTGTGTTCTTGAGTGTCCTTGTTAGGTGGGTATCCTAACAAATTGGTATTAGAGTTGTAATCATCTTGGGAAGAATGCACAGTAAAACCAAGAGAGAAATGGAGGATAAGATCGAGAGCAATTCGAAAAGTAGTCTAGAATCGAGGAAATGCTTGTTTGATATGGCAAAGTCCGTCGAATGAGTAGATTAAACAGTGCATATGCTTGCAAACAAGAAGGAAAACAAAGGATCGGCGTTCGTAACCGGATCCGCAAGATCATTGAAGGATAAGGAGAGCGATGAAGGAGAGAACTCTCGATATTGGACAAACGAAGGGGGAGTTGATCAGAGTAAATACAAACAATTGGAGATGCTGGTTTTTTCAGGCGATCATCCGGATTTCTGGATCTATTAAGCGGAACATTTTTTTTCGAAATTTATGAACTATTAAATGAAGAGAAGATCAAGGTTTCGGTTAGTACATTTGCACCTGATGCAGGAGATTGGTTTTGTTGGTCACACACCCGGAGTCCGATTCGAGGCTTTGAGCGTGGGGCAAGTTGAAGCAGCGGATGTTTGCATGATTCCCTCCTACTTAGGAAAGTACTCTTCTCTCCCGACTTCTATAAATTAAGCAAGACAAAACGTATGAAGATTATCACAAGAAATTCGAGACCTTAAGAAAAACAGGAAGGGCCGGTGACGGGTGCGAAATCTGAAACTTCCAATAAGCCTGTGGAAGGATCTTTACAAACGTTGAAAGGTATCGAGGTAGAAGGGATGGTGTTTTTTCCAGCAAGCTTCGTTGAGAAAAAAGTGTGTGAGGGTGAGGGTGAGGGTGGGGTTTGCATTAAATGGCAGGAAAAACAAATTGGAGGCTGGGAGAAAAAAGTGGCTAGGGTAAACACCATTATGGATGAGGACCACTAACCATTACCATGTGGAAAGTCATTGGCCCAGGGAGTTGCTTAATGGGCTGAATCCAAATGAAGGAATCGAAGCCCAATTTGAAGGCTAAAGAGAAAGCTGGTGGTCTGGGTTTACTACTTGTTCATTAAATAATTTTTTTTTTTAAAGAAAAAAGATGATTAATCAATTAAAGACTAATCTGAGGAGAAAAAAGAGATGCAACGTAAGTTATTTGTTTTTTTAAGCTGATCTTCAGTTCAAGCTTTTTTGAGCTTTTTTTTTTTTTTTGGGTTTTTTTTCTCGTTCCATATGATATTTCTAAAATCTTTATTATCAGAATTCAATCTCAATTGGAGGGTGTTTGCCTCCCTCTCAAGTGATGGTTTTTAGTCTCTTCAGCAGCCAACAGTTTTATTTGATTTTTCAGACCGTTTGTTTAAGTTTTTGCCCCAGTGTTTGTTCTTTTTGGGGAGTGTAATGCTTTGTGCTTTTCATTTATAAGCTAGTCTTAGTTTGTTTTTGGATGTGATGAGGGCGCTAAGGGGGTGTCAACCTAGTTGAGATGCCCGGGTGCGCTAGCTGATCTGTTTGGTATTCGGTTGTTATTTAGCTCATTGTATAACTCTCTTGTACTTTGGGTTTTTGTTCGCTATTCTCTTACATCAGTGAATTGTTTCCTTTTAAAAAAAAAACATCTTTATGGAAGCATGTGAGGTTCCTAGTTAGGGGTATGATGTTAGTAAGGGCATAGAGTTAATTAGCTAGGGATGCTTTGGTTATAAATAATAGGTGTTAGGGGCTTCTGAAGGGTGTGAAGAATTTTGGTAGTGTCCAACTGTCCATTGTGAAACTTGGGAGAGAGATAACCCCGGTTATTATATATTTATATTGTAATTTCTTTATTGATATAGCAATGTTGTTATGTTTAGTGTTCTCTTGTATTTCTGTGTTTTGATGCCACAGAATAATTTTTAGAAATATTTAATGATTGGACCAATCTGTCTTGATATAGTTCCATTGATTTTTGTTTTTTTTGTTAAACTACCATTTTGGTCTCTATACTTTGAAGTTTGTTCAATTTTAGTTCTGTACTTTCAATAAATCTTAAATTTAGTTTCCAATGCTACTTTATTTTTTGGTTATCTAATAGCATTTTTATTATAAATTTTGAAAATATATTCATACATTTATTAGTTCAAGGTTTTAATCGATATTTTTCCAAAGTTCAGGGGTATAAATTGATATTTTAAAAAAAATAATATAATGAGAAGTTAATAATACTTTCACAATTAGTCCCACCTATCATACAAATTAAAAGTTATTGATCAGTACAATTTTCACATCTCACCCAATGACATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCAACTCCTGGTCAGTTCGGTTTTTTGATAAAAAAATATATGCTATTCCAATCAGTAACCGGTAAGCTCTAGATCTTTTCTCTTGTTTCTATATCAGGAATTTTGAGCCTGAATCANAATATAATGAGAAGTTAATAATACTTTCACAATTAGTCCCACCTATCATACAAATTAAAAGTTATTGATCAGTACAATTTTCACATCTCACCCAAAAAAAAAAAAAAAAAAAAAGGAAAAAAAAAAAAAGAAAAAAGAACTTTCAATATAAGCCTGATTTTTGATAAAAAAATATATGCTATTCCAATCAGTAACCGGTAAGCTCTAGATCTTTTCTCTTGTTTCTATATCAGGAATTTTGAGCCTGAATCATCTCTACATATTTTATTGATTCGAAATAATTCAAGTTCTTTTTACCAAGCAGCACTTTTGTCAGTTGGCTCAGTAGATTTTAATTCTCACTCTCCTTTTCAGGCAGGTTATGTTGGAGAAGATGTGGAATCAATATTATACAAGCTCCTTCTGGTATTAATTTATTTGTTTCCCCTCCTGTCTCAATTACTTTCTAAGTCGTTTTGAAGGTATTTTGACTCTTGAGTTCAAAATCTGTGTTTTGACCAAGATATGCTGAGAGTCATGGCTTTGGAATCACTGGATCATCCTCATATAATGTTACCATTTATTTTGTGTGTAACTATTACTAAGTTTTTAAAAATCCTTAAATCTAAGTTTTGATTCAAATCGTTCAACCACCTTTTTTTGAGTCCAAATGGTCCTGCGTAGATGGACTCTGAGGTTCAAAAGAGGGCACAGCTGTGGTACAGTTTGTTTATTTAATTTTATATTCAAGAACTATTAAGCTTTTGAAGCGTTAAAAAATAGTAACACTGTCTGTTTATTTCTTTGTTTTTTTTTAAGAAACAAACACTTTTCAATGATATAATTAATAGAGACTAATGCTCAAAGGTAAAAAAACTCTGAAGGGGAAGCAAGAACATTAAAATGATAATATCAGAACATTAAACCAGACGTCTAGTCATGATGAATTATAAGGGAAACATAAAAGCAATTCAATTAAGACTGATATCTTGAATAGAGTAGCCAGTGAAGCCTTTAGAAAGAGAGCAGTAGAGCACCATGAAGATGCCTTTAGATGAGCTGAACTAAAACGATCATCCTAAGGAAGACCTGTTATGGAAAACCCTTCAGTCACGCTTAAACCATAATTCGGATAACAAAGCTTTCACTGCATTAGACCACTCTCTATAGTAAATTTGCCCCTGATGAAGTATGTTTCAATTTTTTACCATATATAGAACAAAAACGAGTTCAAATTTTCTCAACTTTGTATGAACTAGAAATACTAGCAAAAGAAACAAAGAATAATTCATAAAGCAATTGTAGACTTGGGATATGTTCCTTGCATGTGGAATAAAAGACAGTTTTCATTATTTATTTACCATTGCAACATTAAGAATTTACTTAGAAATGAAAAGTAGGTTGGTGTAGCTGATTGAAAAACCTCAAGGAACTATTTTGTGATAAATTTCCTCTGGCTAATGGATCCGAGTTTGTGTCCGTAGGGGTGTTCAAGAGGAGTGATTTTAGACCCATCAGCTTAGTCACATCTCTCTATAAGATAATATTTGAAGTTCTTGCTAAAAGACTTAAGAAAGTCCTCCCTCTTATTGTGCATTACTTGCAAGCAGCCTTTGTTGAGGGGCATCAAATTCTGGATGCAATTTTGACCGCGTCTGAATCCTTGGATGATTGGCAGTCACTTGGGAAAAGAGGGGCTCTCCTAAAGCTTAATTTGGAAAAGGCTTATGATAAGGTTGACTAGTCTTTCCTCGATTCGATTTTGAAGTTAAAGGGCTTTAGTAAAAGGTGGAGGAAATGGATTTGGGGTTGCCTCTCATCGACCAACTTCTCTATCCTTATTAATGGAAGGCCGCGAGGCAAGATCTTTGCCAAACGAGGTTTACACCAGGGTGATCCCCTTGCTCCCTTTCTTTTCACAATTGTGGGAGACGCCCTTATTCGTATGATTTACTTTTGCAATGAAAGAAAGCTCATCATAGGGTTCTCCTTTGGCGTTATGACCTTTGACTTGACTCATCTCCAGTACGCTGATGATACCTGACAGAGCATTTGCCGTCTCTAGTAAACCTCAACCAATCAAATAATATTTATAATCATGAACTAACAACTTATACTAACTAGTAGCTTAAGTGGTAAATCTGAGTCGAACCATAGGGAAGCACGAAAGATTAATCCTTCAAAACTCATTTTTAGTTAAAGTGGTAAACAGAGGGGGGGTTTGGTGTGGATTGGATGATCGAAATGCAGATTAATTTCTGCGAAAAGTAAATGATATAAGTGTAATTAATCAAGATGTCCTAGTTTAGAGGAATGGAGTCACCAAATGCAATTTAGATCACTGAATTATTATCTTATGACTTAACGTTTCTTGCCTATGCCTAACCAAGTTAATTTGAATCCCAATTAACCGTCCTTATTATCTAATTAGCTAACATGAGGAAACGAAACTGCTCTAAACACATTAGCTAATTGTATTAAGCTTTAGGGAGATGCTAGGATGAACTAATTAATTTCCAATAATTGATGGGCTGGTCCCCCCACACCAAGTAGCGCTCTAGATAGGCCGAAGCGATGTGAGCCCCAACACTGGAATAGTTTTAGTAAACTGTTTGGTTCATAGTACTTCAAGTTAAAGCTAGATTATCTCTTTCCCCGTCTAGTTAACTCGTTCAATATGATTTTCACATTAAGAATGATTAAAGCATTACTTAGTGCGGTTGACTTCAAAATTAAGCAAGTTAATCTATTCTCTTGCTTCCTATGGTTCATTGCCTCAACTAAGGTTAAACCTAAGCATGGAATTACATAAACAAGTACTGGTTGTCAATCAAATACATAGAAATGCAATTCACAGTCAGTAAATTCAGAGTGAGAATCCATAAACAAGATATATATATCATAATATAAAGATTCTCCCATAATTGCAATGGTTTCTCCAAATTCTCTAAACTAAGAAAACCCTAGAACTTACCTTCTCATGGAGATGAAGAAGACAAATCCTGAAATCTCCTTGATTATAATTACAAAAAGAACATCCAAAGAATAAAGTGGAATAGAGAAGAGAGGGGAAAGGGGCTGGAAGCCCATTGGGTGATTCTTCTGCTCTAGAATCGAAGAACAATATATAAGCTAGTAGTGCAACTTTGCAACGCTCAATGCGCATTGCAACACTCCAATCGGAGCGTAGACCTCTGGCAATGCAATGCTCTTTGCTACGCTTTTGGGCTGGTCGTGCGCGCGCGTTTCTTTGAAGTGCCCGTCAAACTTCGGAGCGTCATACCGCTCTAAGATAGCGTTGTGATGCTGTGTGAAATGATAGCTACTGCCTTTGAGCGTAGATCTATGCTTGGTAGAGGGTTGCAATGCTCGATCTAGGGGCATTTTGGTAATTTATCTCCTTTGAAGTCCGGATGCTCAATTTAGCTCCAAATTGCTCCAAATTAACCTTGAATGACTTGTAATGGTCGACTCTATCTCCTAGGCGCTCGAAACCCTGTAAAAACATTAGTTGAACACATTACACATAGTAATGACCTCGAATTAAGAAGAGGAAGATAGCATACATCAACACCCTTTTTTGCTCTTTGGGAGAATGAAATCTTTCGACTTGGTGGATGGTTGTCAACTTGTTCCTATTGGTTTCGGGTTTGTCTCTTAATGTCTTTAAGACCTCTCTTATCGGTTCAAACCATGATAATGAGGAAGTGAATCTCTATGTTGCAAAGTTTGGTTGTAAAGTTGAGTTTCTCCCGTTCACGTATTTGGGTTTTCCTATGGTCGGGAAACACAAAGCGAAGGAGGCTTGGATTGCTCTTGAAGAGAGGTTTAAAATTAAGGTTGATAAATGGAGGGGGATCCTCCCTCTCTAAGGGTGGTAGACTGCTCGCCCAATTGGTGCTTAACTCTCTCTCTCGTGTTACTTCTTCTCTCTTATGCAAGCTTCGATCGTTACTGTCAATCGCTTGGAAAAGATTATCAGAGACTTCATATGGACGAGGGGTCCCTGCAATCCTAAGTCCCACCTTGTCAACTAGAGTGTGGCGTTTTTCCCCATCGACTTTGGTGGTCTTGGAATTGGCTCCCATCATCAGAAGAACACTGCCTTCTTATGAAATGGTTGTGCAGGTTTTGCCATGAAGAAGACACTTGAGATGTGTTATTGGGACCATATATGGGGTTGATGCTCTTGGCTGGGCTTTTAAAATGCCTAAGAAGGGGAGGAAATCTAGACTGTGGGCCAACATTTGAAAGATAATTCTTTAACTTCACAGAGTTTATCGTGGGAGATGGGAGATTGGTTGGGAAGACAGGTGGTGCGACTCTCAATCCTTTACTCTGAAATTTCCTGATACTTATACTTTATCTCTAGAAAAGGATACTGTGATTATGACTTGTTGGTGTGCCTTTAGCCAATCCTGGAACTTGGGTCTCAGATGACCAACTTTTGATAGAGAGTTGGGTGATTTGGTGGCAGTTTTGAAAACCTTAAACTCTTGGACGCCTCAAGATTCTCATGATTGTCTCAAGTAGAAGTTAGATGCCTTTGGAAAGTTTACTACTGAAGCTACTTTTTTGATTCTCATTGAGAGAACCTCTGCCCTTCCTTTAATGTCTCTTGTATGGAGGCTCAAAATCCCTAGAAATGTTAAATTCTTCCCGTGGTCCTTTGCTTATAGAAGCTTAAACACCCAGGAGAAACTTTGAAGGAAGCTTAAAAATTCTTCCCTTTGCCCTTCCATTTGTTGTCTTTGCCTATAGTTGAGGAGTCCCGGACCATTTGTTCCTTCACTGTGACTTTGCCACTATGAGGTGGGCTCCTCTGGTTAGAACGTTTGGGCTGGAGTGCTGCCTACCTAGTAGGGTAGACGATTAGATGGTGAATAGGTTGGACTGCTTTGCGTTCAGTGGCAGAGGTAAAATCATTTGGAGATGTGCTACTCATTCGCTCCTTTGGTGCATTTGGAAGGAAAGAAATAACAGAATTTTTTAAGATAAGTTTTCTCATTTTGATTCTTTTTGGATGGTTGTGCAACACACAAACCCTTGGTGGTAACGACTCGACTCCCTATGACTTGTGCTAGATCGTTACTATACGCATGCGTACTTCTCAAGACGACATTTTCCATGAATGGGCTAAAACTAAATAATTGTTATAAAATAAATAACTTTAATTAAAACTAAACTATTGAAAATCTTTAATCAGGATACCCTTAATGAAATAAAGCATCAATAGAAAATTTTATCTAAAACAACCATCAGTATAGAAACTGAAAGTTTTCTCAATTTAATTACAAAACTAAGATTTTAAAACATGAAAACATGGAAACAGGAGTGGAAGCATATGACTAGTCCCAGTGGCACGATCACGGATTTCTCCTATCATTCGTTGGTGTGCCTTTACCCTTACCTGAAAAACATAACATAATAAAGAATGAGTATAAAATACTTAGTAAGTAACCTCACTACCGGAGTCAAACTATGCACTTATATCCAATAAATGCAACGTACTGGTGGGACATGCATATAATAAACTAAGTCTTGGATGGCCACCTAGTGAGCGATTTAACTCAACCTGACATGCATATGGTGGCCCGAAGGTTCACAAACATAGTAGGGCACCTGTCGGCACCCGATAACATAGTAGGAAACTTGTTAGTTTCTAGTAATATAGTAGGAAACCCGTAGGCTTCTGGTAACATAGTAAGAAACCCGTAGGCTTTCGATAACATAGTAGGACACCTGTCGGTTTCTGGTAACATAGTAGTATTAGAAGAGTAAATCGTCACTTTTATCTATAGACATATCATGCTTTCCGTAAATTTCACAACATAAATCATAGTACATGTCTCATGGTCCTATACATAATCATATCAGTAGCATACTTAAATCAACATGCTCATAAAACAGTTCTTGGCATATCTTGCTTCTCAATTCATCCTGTCAAAACATAGAATAAAGATTAGTAACTCAACTTTAATTTATATTATGAAAAATTATTTTGTAAAATTGTTCAATAAAAGAACTATAAATATATGTATTTCATAAATAAAGGAAATACTTTTAGACTTCATAATATGTTCAATTCAACTTGATTCAATCCAAACTAATTTTAGGTGAACTCATGAACCAATCCAACCCATACAATTGGTGAGGTTTCATGTTTCTTTATGGGTTTTGGTTTCGAAGACTATTTGTAATTAGTTTTTAGGTAACATTTTACTTAGTTGGAACCCATTTAATGGAGTTTTTGTGAGTTTGTTTTTTTGTGTGCCCTTGTATTCTTTCACTTTTTTTTTTAATGAAAGCCTTGTTTTAAATATATAAAAAAAACCACCAACCCATACAATTTTGATTTAATTGGTCTTTCAAAAGGATAAAAAAAGATTTAATTGAACCCGAACTAAACATACAGGTTGGGTTATTAAGTTTTTCAGGTCATTGGGTTTTTTGAACCACACCTAATAATTTGTTTTTGTCACATACTTAGCTTTACCATTTTAGGAAGTATGTGAAGTCTAAGGTCGAAACTTTTCCGTATTTTTGTTAAATATTTTCATGTTTTCATTATTTGTCCTTTTACCTTTGCTCGAAATACACGGTGCTTTAATTGTACAGGATGCGGAGTTCAATGTAGAAGCAGCTCAACGTGGGATAGTATATATTGATGAGGTTGATAAGATAACTAAGAAGGTATAGCATGGTCTAAGATCTATTGGCAACTTTATCGTATTTGTCATAACTTGCTTTATTTCTTATTAGAGTGAGAGCATAAACTCTGGCAGAGATGTATCTGGAGAGGGGGTCCAGCAGGCACTTCTGAAAATCCTTGAAGGAACCGTGGGTATTCAATTTGACAGTAATATTTTTCATTTTTGTTGTTGTGCGTTATTTCTTATTTATTTATTTGTTTAACTTGTCTTGCATTGCTTCATCAGTAACGACCCAAACTTTCGAGTACTTTGACAAGGGCCGTTATTGATAAGCACACATTTACATTTAAAACATTTTGACCATTGAAGCAAAATTCGTTAAAACAATAAAGAAAACTTCAAAATCAATTTATAAAACTTAATAAAAGCAAATAAGCAAAACATTTATTCGGGGTCCCCTAAAATCATTTACAAAGAAATCAATTAGAATAATAAAATAAAATTTTCTGTCCAACATTAAAACTTTCAAATACCTAAAAAAAACATAGATTGAGCGGAAGCGATTCCTTTGGTCCTCTTATGGCTCTGTCCGGGTCCCGTCTGTCGCTCGTCAAGCTGTTCTTGTCATTACTTGAAAACATAAAGGAAGAAAGGATGAGTATAACAATATCTAGTAAGTTATTCGCTATTGAGCCTTGTAGGTAACCTGTTAACTAGTTTATCTAGGCATCGGTGTACATCCATATCATATCAGTCGTAGTGATCCCGAAGAATTGCATATTAGTCATAAATACGATCCGGAAGAAACACACATTAGTTGTGTTGTGTATCCCGAAGGTACACAAATCATTTGTAAACATGAATCCGGAGGAACATGCATCAGTCATAGTGTGTAACCTGAAGGTACATGAATCAGTCATAAATATGAACTCGGAGGAACATGCATCAGTTATAGTGTATGTTATAATTCTATAGTTAGGGTTTTCCATTCTTGTAAATATTTTGTGATATTTTCTTTTTCTATGTTTTGTACACTTGTATATATTCATATCTTTGGTGAATGAATAGAGTATTCTGATTCTTTCTCAAACCTAAGAGTTTTACATGGTATCAAAGCAACCTTAACAAACTTAGGGTTTTGTGAACCTTAGGGTTTGAGAATTTAGAACCAGTTTGTGATACGTTTAGGGTCTTGTGGAGAGGTGAATTTACACTTTTGCTCACAGTCGCCGTCTTTGGTTCGTTCGCCGTCATTAGTCGCCACCGTCGTGGGAAAAAAAAGGCTATTTTTTCAACACCACCCAGTCCATCGAGGAGCCCAGCTGTTGATCTACAAAACCCCAAAGTTCGGATCCCATCTAACCAGCACATGGGGCTCACGTGCCGCTTGTTCCAACTTGCCATCCGTCCATGCGCCGGTGCGTGTAGGCGCGTGCGCCATCGATTTTGGTTCGCCTTCTGTGGGTTTCCTTAGTGTTGTTGACTCTTTTTGGTGGTGTGTGTGTTTGGGATATATAGATATACTCACTGCTTGTAAAGACATCTCTTCGGTAAATTAAGGTTTGATTCTCTGCTGTCTGGTGTCTACTTTTTGAGGTAATGGCTGATAAGAAACCAGTAATGTCTGAGATTGTTCCTATGGTGTCAAAAGTAACTGAACACAAGTTGAATGGATCCAACTATTATTCATGGAGATCAAATCCAGGAGGTGTTTGCTATTATTGTCATAAACCTGGCCATACGAAACGTGAATGCAGAAGGTTGCTGAACAAGAGGTCAGAGGATGCCATCACCCTCTGCACATGTCGCCTCTACTCCTGATAATCTTGACAAGTCAATTACGATTTTGCAGAGGAGTTTGCTAAATTTCAGCAGTATCAAGAGTCATTGAGGGCATCATCTTCTACTCCCATTACGGCCATCGCAAGAGACAACTAACATTTCTAAATACCTTCTTTCCTCCACGTCAAAATGGGTCATTGACTCTGGCGCTACAGACCATATGACAGGTAACCCTAAGTTTATTCTCTAACCTTTATACATCTATCTCTTCACCTAATGTCACTATAGCTGATGGAACCTCCACTCTTGTTTTAAGATCAGGAATCGTCCATCTCACTGAATCAATTTCTCTGTCATCAATTTTAAATTTACCACATTTCTCGTTTAATTTGATTTTGATCAGTAAACTCACTCGTAATCTTCATTGTTGTGTCTCATTTTTTCTTGGTTATTGCTTATTTCAGGATCTTATGACGAAACAGACTATTGGTAAAGGGTGGGATTTGGAGGTTTACATCTTTGAACCACAAACACCTACGGCCATTACATGCTCTAGCATGTCTTCTCCCTTTGAAGAGCATTGTTGTTGGGTCATCCGTCTATTTCTGTGTTGAAGAATCTTCGTCCACAACTTCAACATTTGTCTTCTTTAGATTGTGAGTCATGTCAGTTTGCTAAATTTCATCGTTTGATTTTTATCCCCGAGCCAATAAACGAGCTAGTGCTCCTTTTGAATTAGTCCGTCTATGTTTGGGTCCATGTCCTGTTAAGTCCAAAGGAGGGTTTAGATATTTTGTTACATTTGTCCGATGACTATTCTCGTGTTACGTGGTTATATTTAATGAAAAGTCGTTCTGAGTTACTTTCTCATTTTCGTAACTTCCATGCTGAAATTCAAACTCAGTTTAGTGGTGCTCTTAAAATCCTACGGAGTGATAATGCTAAGGTATATTTCTCTCATGCACTCAAATCTTATTTAGATGTCCATGACATTCTTTATCAATCTTCTTGTGTCGATACTCCATCTCAAAATGGGGTTGCAGAACGAAAGAATCGTCATCTCCTTGAGACAACTAGAGCTTTGATGTTTCAGATGCATGTTCCAAAATCCTATTGGGTTGATGCTGTTTCCACAACTTGTTTCTTAATAAATCGCATGCCCTCTTCCATTCTTAAGGGTGAGATACCTTTTTGTACTTTATGTCTCAAGCAACATTTGTTTCCCATTCCACCCAAAATATTTGGTTGTACCTGCTTTGTTCGGGATGTATGGCCTCAGCATACCAAGCTGGATCCAAAGTCCTTAAAATGTATTTTCCTTGGTTATTCCCGTGTCTAAAAGGGGTATCGGTGTTATTGTCCTAGTCTAAATAAATATTTCGTCTCAATTTATTTATTCTACGATATATTTATATACTTTTCATCAAGTAAAATGAAAAGGTTAGCTAGAACAGGTCAAAGGGTTTATGGCCAGGAGGCTACGAAAAAGAATCGAAAGATGGAAGCTGTAAAGTGATGTTGTTACAAAATGATTTATTGACCGTGCTTTATTTGGAGGTTTAGACAACATCAAGGATATTTACGTTTATGGAGTGCTTTGAGTAAAGACTCCTTAAATTAATTCCTTTTAGATTTTGCTCACTTGTGCTACATTAGCATTCTCCAAGAGGGTGCTGGAAAGATTTTTAAATTGACCTTAGCCGATTCTCTTTCATCTATGATGATCATCCATCCCAATGTTCCAAAATTATTGTTTTTCTTCCTTCATCAACAAAAATGCTAGTATAATTTTTCATACGGTTTCAGAATTTGGAAACATTTCATTTCCACCAAGGACTTCTCCAGCAGAAGGCAGAACTATAAAGATTCAACTGAAAATCCATCATGTTCCAGTATTCCTAGGGAGCTTTGGTCCAGCATGAGGTTCCATGTTTCTCTTTGATCTTGAGTTTTGAAGACCTTTTGTAATTATTTGGCAACTTCTTACTTAGTTGGGAACCGTTTCTCTAATGAGGTTTTCCAATGGGCTTGATTTTTTTGTACCTCGTATTCTTTCATTTTTTTTCTCAATGAAAGCAGTTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTAGAAAAAAAAAATCCATTAGGTTCAAGTGATTTCTAAGTTTATGATAGGTACCAACATTCAGCAGATTCACGATACCAATCATTCTTTTTCTTCCGTGTTCAGATCTAAATAATGTTCTTTAAATTGTCTGACTAGCGGCCTTGAAGTGCTTCAAGGCATCTGCCTATGCTAGGTAAAAAGAAGCTTATGCCTCTATGGATAGAAGTGTCTGCCTTTGAGGTGTGAAAGGCCTAAGCGTCAGTGAAAAGTGAGGAAAAAAATATTAATTACAGCAGATTAGGGGATGCTGGCGGTGGTGAGATCCCATTTAATGGGGTTTTCAAGGGAAAAAAATTGCCTTGCCGACTTAAAACAACCTGACACAATTTAGGGGACAACTTGAGTCATCTGTTTTACTCCCGTGAGTACCGCAGAACTTCTTGGATTTTAGCAGTGAACTGTGGTATATTCATATTACTGACTCCGAAGTCTTTCTTTGGGCTCTTTCTCAGATGTTTTATGTAGTTGTAGTTCATTTAGAATACTTTGTATTCTTCTTGGATCTTGCGGTTCTTTGTCTTATTATCATCCACTGTATATTTGCCTCTTTATTAATGAAATCTGGCTTCCTACTTTCAAAAAGAAAAAAGAAAAACAAAAATCATTCCGAACTTCTCATTCTACTTCTAAACCATTTACTTTATTTCAGGTAGTGGATGTTTCAGACAAAGGTGCTCGAAAGCATCCTGGAGATGATACGATTCAGGTGGAGTTGAACTGCAATTTAATATTGCTTTTATTGGTTTTCTTCCTTGATATATTTATAATGTGCAATCACATTCGAATGTTAGCTCTACTTTCAGCTACCTTATTAGGTACTTAACATGAAAATTCATTTGCTTTCTCACTCAATTGACGAGAAAGGTCTAGAAGTGATGAGCCATCATTTTAAAATATTAAGTTATTAACTTCTTGAATTCACATAGTCCAAAACCTTTTCTAGTTGTTTGACATTGACAACTTGTGGGCGATCAGTCACTGTAAAGGCTCAAGCTTGACCTAGGACCCAACCCATGGATTTGACCAATTGGATCGAACTTTGGATCAGGTTGGGTGGCCCATTGGATAATTGATTTTTTAACTATTTTTTGAAAAATATTTTTCTATATTTCTTAAAGTTGAAATTTAAAAAAATATAGGGAAAAAAAATCTATTTGTGTTCTTAAACTTCGGGGTTGTATCACTTAAAACCACAAATTAGTATTATAGCAATAAAAATTCTAAACTTCATAAGTGAACCAAGTTAGACCCCTTGTTAGTATTTCATTTTTAAACTGTTGATACTTAATTTTCATGAGCATTAGTCTCTTTTTCATTTCCTTAATGAAAAATCTCGTATCCTCTTCGAAACGTGTATAGAACTTTCTTTAAATGTTGGAATTAATATATTTATTATCTTTTTTGAGATAGGGATTAGTGTATTCATAGAGAAAAATACTGAAATTAAATATTGGATAAACTTTCCGTTCTAAAATTATCATTGCATAATTTTCACTAATGTGAGTGATTTTTCATCTTCTCATCTTTTATGAATGGCAGATGGATACAAAAAACATTCTCTTCATATGTGGTGGTGCATTTGTTGGCTTGGAGAAGTGCATCTCTGACAGGTATAGCATTGCTTCGTTCTAAATGTCAATAACTGATGAATTTTTGTTTTTACCCTTTTAATAATCCATGTATGGAAACTGAATATCAACGCTCTTTCTACAGGCAGCATGATTCTTCAATTGGCTTCGGGGCTCCAGTTCGTGCCAGCATGAGAACCGGTGGATTGACTGAAGATTTAGTGACATCTTCGATGCTCGAATCTGTGAGACCTTAAACCAAAAGTTAAGGCCCGTTTGGTAGCCATTTTGTTTTTGGTTTTTTGTTTTTGAAAATTAAGCTATGGACATTGCTTCCACCTCCAAATTTCTTCCTTTGTTATCGACTAAAAATAGTTTAAAAAATAAAGCCAAAATTTGAAAACTAAAAAAAAAGTAGCTTTTAAAAACTTATTTTTGTTTTTGAAATTTGGCTAAAAATTCAGCCATTGTACTTAAAAAAAAGATGCAAATCATTGTAAAAAATCGGGAGGAAATAGATTTAATTTTCAAAAACCAAAAATAAAAAATGAGATAGTTTCCAAACGGGCCTAAATTGTTGCACTACGAAGTATTTTTGTTATTATATCTTTGGCATTTTCTGAACTGAGGCTTCATTTACTTCATTTTTTCCCTTCCATTTCACAATTTGTCTTCATTCAACTACTGAAATGGTCTTTTAACTTTAGGTAGAAAGCAGTGATCTTGTTACGTATGGTCTCATACCTGAATTTGTTGGACGATTTCCAATTCTTGTGAGTTTGTCAGCTTTGGATGAGGATGAACTTGTCCAGGTTTGTCTAGTCTTTTGGGTCATATTTACTTCGACCATTGAATTCTAGCAACGGCCATATTTCTTTATTCTGTTTTTGCCACTTAACTCTGGAAGATATAACAGATTTCTTTATCCTTATTCCTCTTTAATAGGTTCTTACAAAACCAAAAAATGCGATTGGGAAACAGTACAAGAAGATGTTTAGAATGAATGATGTAAGTACTTTTTTAAATATAATTAATGAACCAGGCATTTATTACATATGTGAAATGTTAACCTAAGTTGCTTCTTCTCCAATTTATCAATAAAAGAAGTTAAATTGCAAATTTGGTCCTATAGTTTGATGGAAGTTAGAATTTAGTCTACATGGTATTGTGATTTAGAATTTAGTCATTATGGTTTGATAAAACCTCACAAATAATCTTTAATTTTCTCCAAACGATAAGGACCAAATTCTTACTTCAAACCATAGGTACCAAATTTGCAATTTAATTAAAAAAAAAAAAAAAAAAAAAAAAAAACTCAAAGAACTTGTAAATCCAATAATCTTCTTATGTGTTTACTCCAAGTCCTCATATGATTTTGCAGGTTGAATTACATTTCACCGATCGTGCTTTAAGAATGATTGCGAGGAAAGCAATGAAAAAAAACACAGGTGCTAGAGGATTAAGATCCATTTTAGAAAATATTCTGACAGAAGCTATGTTTGAGGTATCACATTTTGGAGTGCAATGTGTTTGTTTATTCTTCATGGAATTTTTATCTTCCTAATTTCAAAATCATATTGGTTCTGAACATATGCTTTTTTCTTTTCCAAATATTTATAAAATGTTAACTCGTTTGTAAGTTTGAATACTGGAAACTGCTGCGCCAATTAAATTCATTATATTCAGGTTCCAGAAACTAACAGCATAAAGGCTGTTCTGGTTGATGAAGAAGCTGTTGGATCAGTGGACGCTCCAGGCTGTGGAGCGAAAGTTCTTTGTGATGTCGATGAATTAAAGCAATATTCAAAAGGTGAAATAGTGAGAAATCTGAAGGTATATTCTCATGATGCCCACTTGAACCCTTTGTCGTCCCATTTTTCTGTTGATCCTTTGCTTCAGTTTGAATGTTATGGATATGTAACATTGGCAAAGATAACGTACTTACCATTTTGTGCCATCATTGACATTTACGTAAATCCTCTCATCGATTCATCTTCTAGAAGGGTTCAAAGCTTATAAAAAAAAAGTGTCCCAAGCATGTCGTGGACGTGCTGGATACCTTTTTTCCCTAGTATTAAACGTTTATCCAGCATGTGCCTGAGCATGTCCATGTTAGAAACCAATGATATGTTAGAATAGATTGTTCGTGGTACTGAATGAATGATATAATTTTTGTGAGACTGAAGTGAATGGTTATTTCATTAATACAATGAAATGTTTTGTTTGCCGTTTAAAAAAAAAAAGAGACATGAATTTTCAGTTCTCCTCATCAAATTTTTTTGTGGCTATGAAGTTTGACATACTATCTTTGCCCTGATGTACAGGGAAAGGGTGTTTTGGCAGACGAGGGTCGGTTCTCAAATGGAGTCGAGTTTCCATCTGTAACTATGAGATTGTGA

mRNA sequence

ATGATTCCACGATTCAGAGTCAAGAACACCGAATTCATCAAGCTATTCACATCCATGAGAACTATCACCACCACTGACTCGCCGTTCCACCGCCGTTTCATAACTCCCCTTCCGTCTAAACCCTTTCTACAACCCCATGCCGCCGCCCGTTCCGCCGGATTGTTCCTTTGGTTTTCTCAGAGACGCCACAGTTGGAAAGGGGCCTCCGACTACGATTACATTAGGGCTGATGTCAACTGTCCAAGGTGCTCCAAGCAAATGCCTCTCATCTTCTCTAATCGCCCCTTGTCCATTACGGCTCGGGAGACTGGCGTCTATCAAGCCCTCAATCTCTGCCCCAATTGCAAAACTGCATTCTATTTTAGACCCTTCAAGTTGGTTCCTTTGCAGGGTACTTTTATTGAGATTGGTAGGGTTAAGGGCGCCAGGGACTTGGATGACCATGTGGGCAATGAATCGAGCCAACATACGAAAGGAGATCTTGGCGCCAATTGTACTGATGATTCTGCCTTGCCTCCACGGTGGAATGGTAGCTGTGGCGGTGGCGGTGGCGACGGTAACTTGGGTTCGGTGGAGAGTAACGGTGTTCAAAAGGGCGACGGCAGTTTAGAAATGCAGTTGCTAACTCCCAAGGAAATTTCTGAGGCGCTGGACAAGTTCATCGTTGGACAAGAAAAGGCCAAGAAGGTGCTTTCTGTGGCAGTATACAACCACTATAAGAGGATATATCACGCTTCATTGCAGAAAACATCAGGACAAGGATCGCTGGGTATTGAATTGGAGAATGATGATAATGAAATTGTGGAATTGGAAAAGAGTAATGTGTTGCTAATGGGTCCTACAGGCTCAGGGAAGACGTTACTTGCAAAAACCCTTGCTCGTGTTGTGAATGTGCCTTTCACCATAGCTGATGCTACTGCCTTAACTCAGGCAAGTTATGTTGGAGAAGATGTGGAATCAATATTATACAAGCTCCTTCTGGATGCGGAGTTCAATGTAGAAGCAGCTCAACGTGGGATAGTATATATTGATGAGGTTGATAAGATAACTAAGAAGAGTGAGAGCATAAACTCTGGCAGAGATGTATCTGGAGAGGGGGTCCAGCAGGCACTTCTGAAAATCCTTGAAGGAACCGTAGTGGATGTTTCAGACAAAGGTGCTCGAAAGCATCCTGGAGATGATACGATTCAGATGGATACAAAAAACATTCTCTTCATATGTGGTGGTGCATTTGTTGGCTTGGAGAAGTGCATCTCTGACAGGCAGCATGATTCTTCAATTGGCTTCGGGGCTCCAGTTCGTGCCAGCATGAGAACCGGTGGATTGACTGAAGATTTAGTGACATCTTCGATGCTCGAATCTGTAGAAAGCAGTGATCTTGTTACGTATGGTCTCATACCTGAATTTGTTGGACGATTTCCAATTCTTGTGAGTTTGTCAGCTTTGGATGAGGATGAACTTGTCCAGGTTCTTACAAAACCAAAAAATGCGATTGGGAAACAGTACAAGAAGATGTTTAGAATGAATGATGTTGAATTACATTTCACCGATCGTGCTTTAAGAATGATTGCGAGGAAAGCAATGAAAAAAAACACAGGTGCTAGAGGATTAAGATCCATTTTAGAAAATATTCTGACAGAAGCTATGTTTGAGGTTCCAGAAACTAACAGCATAAAGGCTGTTCTGGTTGATGAAGAAGCTGTTGGATCAGTGGACGCTCCAGGCTGTGGAGCGAAAGTTCTTTGTGATGTCGATGAATTAAAGCAATATTCAAAAGGTGAAATAGTGAGAAATCTGAAGGGAAAGGGTGTTTTGGCAGACGAGGGTCGGTTCTCAAATGGAGTCGAGTTTCCATCTGTAACTATGAGATTGTGA

Coding sequence (CDS)

ATGATTCCACGATTCAGAGTCAAGAACACCGAATTCATCAAGCTATTCACATCCATGAGAACTATCACCACCACTGACTCGCCGTTCCACCGCCGTTTCATAACTCCCCTTCCGTCTAAACCCTTTCTACAACCCCATGCCGCCGCCCGTTCCGCCGGATTGTTCCTTTGGTTTTCTCAGAGACGCCACAGTTGGAAAGGGGCCTCCGACTACGATTACATTAGGGCTGATGTCAACTGTCCAAGGTGCTCCAAGCAAATGCCTCTCATCTTCTCTAATCGCCCCTTGTCCATTACGGCTCGGGAGACTGGCGTCTATCAAGCCCTCAATCTCTGCCCCAATTGCAAAACTGCATTCTATTTTAGACCCTTCAAGTTGGTTCCTTTGCAGGGTACTTTTATTGAGATTGGTAGGGTTAAGGGCGCCAGGGACTTGGATGACCATGTGGGCAATGAATCGAGCCAACATACGAAAGGAGATCTTGGCGCCAATTGTACTGATGATTCTGCCTTGCCTCCACGGTGGAATGGTAGCTGTGGCGGTGGCGGTGGCGACGGTAACTTGGGTTCGGTGGAGAGTAACGGTGTTCAAAAGGGCGACGGCAGTTTAGAAATGCAGTTGCTAACTCCCAAGGAAATTTCTGAGGCGCTGGACAAGTTCATCGTTGGACAAGAAAAGGCCAAGAAGGTGCTTTCTGTGGCAGTATACAACCACTATAAGAGGATATATCACGCTTCATTGCAGAAAACATCAGGACAAGGATCGCTGGGTATTGAATTGGAGAATGATGATAATGAAATTGTGGAATTGGAAAAGAGTAATGTGTTGCTAATGGGTCCTACAGGCTCAGGGAAGACGTTACTTGCAAAAACCCTTGCTCGTGTTGTGAATGTGCCTTTCACCATAGCTGATGCTACTGCCTTAACTCAGGCAAGTTATGTTGGAGAAGATGTGGAATCAATATTATACAAGCTCCTTCTGGATGCGGAGTTCAATGTAGAAGCAGCTCAACGTGGGATAGTATATATTGATGAGGTTGATAAGATAACTAAGAAGAGTGAGAGCATAAACTCTGGCAGAGATGTATCTGGAGAGGGGGTCCAGCAGGCACTTCTGAAAATCCTTGAAGGAACCGTAGTGGATGTTTCAGACAAAGGTGCTCGAAAGCATCCTGGAGATGATACGATTCAGATGGATACAAAAAACATTCTCTTCATATGTGGTGGTGCATTTGTTGGCTTGGAGAAGTGCATCTCTGACAGGCAGCATGATTCTTCAATTGGCTTCGGGGCTCCAGTTCGTGCCAGCATGAGAACCGGTGGATTGACTGAAGATTTAGTGACATCTTCGATGCTCGAATCTGTAGAAAGCAGTGATCTTGTTACGTATGGTCTCATACCTGAATTTGTTGGACGATTTCCAATTCTTGTGAGTTTGTCAGCTTTGGATGAGGATGAACTTGTCCAGGTTCTTACAAAACCAAAAAATGCGATTGGGAAACAGTACAAGAAGATGTTTAGAATGAATGATGTTGAATTACATTTCACCGATCGTGCTTTAAGAATGATTGCGAGGAAAGCAATGAAAAAAAACACAGGTGCTAGAGGATTAAGATCCATTTTAGAAAATATTCTGACAGAAGCTATGTTTGAGGTTCCAGAAACTAACAGCATAAAGGCTGTTCTGGTTGATGAAGAAGCTGTTGGATCAGTGGACGCTCCAGGCTGTGGAGCGAAAGTTCTTTGTGATGTCGATGAATTAAAGCAATATTCAAAAGGTGAAATAGTGAGAAATCTGAAGGGAAAGGGTGTTTTGGCAGACGAGGGTCGGTTCTCAAATGGAGTCGAGTTTCCATCTGTAACTATGAGATTGTGA

Protein sequence

MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQRRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFYFRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCGGGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLKGKGVLADEGRFSNGVEFPSVTMRL
BLAST of BhiUN159G30 vs. TAIR10
Match: AT5G49840.1 (ATP-dependent Clp protease)

HSP 1 Score: 650.6 bits (1677), Expect = 9.9e-187
Identity = 362/595 (60.84%), Postives = 430/595 (72.27%), Query Frame = 0

Query: 13  IKLFTSMRTITTTDSPFHR-RFITPLPSKPFLQPHAAARSAGLFLWFSQRRHSWKGA--- 72
           I  F S +TIT++       RF+  + S P + P     S  L    S  R  W      
Sbjct: 7   ISRFVSRKTITSSSLLSRSFRFLLSVDSPPHI-PLLRPSSNTLIPSSSFSRRIWDSCXXX 66

Query: 73  ------SDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFYFR 132
                   YD+IR+DVNCPRCS QM +IFSNRPLS+TARE G+YQA+N C  CKTAFYFR
Sbjct: 67  XXXXXXXXYDHIRSDVNCPRCSAQMHVIFSNRPLSLTAREPGIYQAVNFCSQCKTAFYFR 126

Query: 133 PFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCGGG 192
           PFKL PLQG+FIE+G+VKG                +         D              
Sbjct: 127 PFKLSPLQGSFIELGKVKGTDXXXXXXXXXXXSFPRNWKIQGLRSDEXXXXXXXXXXXXX 186

Query: 193 GGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRI 252
           GGD    SV             ++L TPKEI + LD+F++GQEKAKKVLSVAVYNHYKRI
Sbjct: 187 GGDKEKQSV-------------IKLPTPKEICQGLDEFVIGQEKAKKVLSVAVYNHYKRI 246

Query: 253 YHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTI 312
           YHAS +K S   S  I++E+D+ + VEL+KSNVLL+GPTGSGKTLLAKTLAR+VNVPF I
Sbjct: 247 YHASRKKGSASESYNIDMEDDNIDHVELDKSNVLLLGPTGSGKTLLAKTLARIVNVPFAI 306

Query: 313 ADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDV 372
           ADAT+LTQASYVGEDVESILYKL ++A  NVE AQRGIVYIDEVDK+T KS S N GRDV
Sbjct: 307 ADATSLTQASYVGEDVESILYKLYVEAGCNVEEAQRGIVYIDEVDKMTMKSHSSNGGRDV 366

Query: 373 SGEGVQQALLKILEGTVVDV--SDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 432
           SGEGVQQ+LLK+LEGTVV V   +KG R+ P  D+IQMDTK+ILFICGGAF+ LEK +S+
Sbjct: 367 SGEGVQQSLLKLLEGTVVSVPIPEKGLRRDPRGDSIQMDTKDILFICGGAFIDLEKTVSE 426

Query: 433 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 492
           RQHD+SIGFGA VR +M T GL+   VTSS+LES++S DLV YGLIPEFVGR PILVSLS
Sbjct: 427 RQHDASIGFGASVRTNMSTSGLSSAAVTSSLLESLQSEDLVAYGLIPEFVGRLPILVSLS 486

Query: 493 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 552
           AL+ED+LVQVLT+PK+A+GKQYKK+FRMN+V+L FT+ A R+IARKAM KNTGARGLRSI
Sbjct: 487 ALNEDQLVQVLTEPKSALGKQYKKLFRMNNVQLQFTEGATRLIARKAMSKNTGARGLRSI 546

Query: 553 LENILTEAMFEVPE-----TNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQY 591
           LE+ILTEAMFEVP+     + SIKAVLVDEEAVGSV +PGCGAK+L   + L+Q+
Sbjct: 547 LESILTEAMFEVPDSITEGSQSIKAVLVDEEAVGSVGSPGCGAKILKGDNVLQQF 587

BLAST of BhiUN159G30 vs. TAIR10
Match: AT1G33360.1 (ATP-dependent Clp protease)

HSP 1 Score: 567.0 bits (1460), Expect = 1.4e-161
Identity = 324/562 (57.65%), Postives = 393/562 (69.93%), Query Frame = 0

Query: 74  IRADVNCPRCSKQMPLIFSNRPL-------------SITARETGVYQALNLCPNCKTAFY 133
           +RA+ NCPRCSKQM L+FSNR               S  A +   +Q++N CP CKTA+ 
Sbjct: 66  LRAEPNCPRCSKQMDLLFSNRQFPSSNLLQRPDDSDSSGAGDKTNFQSVNFCPTCKTAYG 125

Query: 134 FRPFKLVPLQGTFIEIGRVKG------------------ARDLDDHVG-------NESSQ 193
           F P  + PLQGTFIEIGRV+                       D + G         S  
Sbjct: 126 FNPRGVSPLQGTFIEIGRVQSPTXXXXXXXXXXXXXXXXXXSKDPNQGFNYRNKLRSSFW 185

Query: 194 HTKGDLGANCTDDSALPPRWN----GSCGGGGGDGNLGSVES-------NGVQK-GDGSL 253
            T    GA   +D + PP  +                 +V++       N V + G   L
Sbjct: 186 DTLRSYGAEPPEDWSPPPPHSPLNXXXXXXXXXXXXXXAVDTSPLPDAVNDVSRWGGAGL 245

Query: 254 EMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTSGQGSLGIELEND 313
                TPKEI + LDKF++GQ +AKKVLSVAVYNHYKRIYH S++K    GS    +++D
Sbjct: 246 GRDFPTPKEICKWLDKFVIGQSRAKKVLSVAVYNHYKRIYHTSMKK----GSAAQPIDDD 305

Query: 314 DNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDVESILY 373
           DN  VEL+KSNVLLMGPTGSGKTLLAKTLAR+VNVPF IADAT LTQA YVG+DVESIL+
Sbjct: 306 DN--VELDKSNVLLMGPTGSGKTLLAKTLARLVNVPFVIADATTLTQAGYVGDDVESILH 365

Query: 374 KLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILEGTVVDVS 433
           KLL  AEFNV+AAQ+GIVYIDEVDKITKK+ES+N  RDVSGEGVQQALLK+LEGT+V+V 
Sbjct: 366 KLLTVAEFNVQAAQQGIVYIDEVDKITKKAESLNISRDVSGEGVQQALLKLLEGTIVNVP 425

Query: 434 DKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRASMRTGGLT 493
            KGARKHP  D IQ+DTK+ILFICGGAFV LEK I DR+ DSSIGFGAPVRA+M T G+T
Sbjct: 426 GKGARKHPRGDHIQIDTKDILFICGGAFVDLEKTIVDRRQDSSIGFGAPVRANMATSGVT 485

Query: 494 EDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKNAIGKQYK 553
              +TSS+LESVES+DL  YGLIPEFVGRFPILVSLSAL ED+L++VL +PKNA+GKQYK
Sbjct: 486 SGAITSSLLESVESADLTAYGLIPEFVGRFPILVSLSALTEDQLIRVLVEPKNALGKQYK 545

Query: 554 KMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPETNS----IK 582
           K+F MN+V+LHFT++AL +I+++AM KNTGARGLR++LE+ILTEAMFE+P+       I 
Sbjct: 546 KLFSMNNVKLHFTEKALEIISKQAMVKNTGARGLRALLESILTEAMFEIPDDKKGDERID 605

BLAST of BhiUN159G30 vs. TAIR10
Match: AT5G53350.1 (CLP protease regulatory subunit X)

HSP 1 Score: 528.9 bits (1361), Expect = 4.3e-150
Identity = 276/426 (64.79%), Postives = 338/426 (79.34%), Query Frame = 0

Query: 199 GDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTSGQGSLGI 258
           G  +L     TPKEI + L+KF++GQE+AKKVLSVAVYNHYKRIYH S QK S   +   
Sbjct: 150 GGSNLGSDFPTPKEICKGLNKFVIGQERAKKVLSVAVYNHYKRIYHESSQKRSAGETDST 209

Query: 259 ELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDV 318
             +  D+++VELEKSN+LLMGPTGSGKTLLAKTLAR VNVPF IADAT LTQA YVGEDV
Sbjct: 210 AAKPADDDMVELEKSNILLMGPTGSGKTLLAKTLARFVNVPFVIADATTLTQAGYVGEDV 269

Query: 319 ESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILEGT 378
           ESILYKLL  A++NV AAQ+GIVYIDEVDKITKK+ES+N  RDVSGEGVQQALLK+LEGT
Sbjct: 270 ESILYKLLTVADYNVAAAQQGIVYIDEVDKITKKAESLNISRDVSGEGVQQALLKMLEGT 329

Query: 379 VVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRASMR 438
           +V+V +KGARKHP  D IQ+DTK+ILFICGGAFV +EK IS+R+HDSSIGFGAPVRA+MR
Sbjct: 330 IVNVPEKGARKHPRGDNIQIDTKDILFICGGAFVDIEKTISERRHDSSIGFGAPVRANMR 389

Query: 439 TGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKNAI 498
            GG+T   V S+++E+VESSDL+ YGLIPEFVGRFP+LVSLSAL E++L+QVLT+PKNA+
Sbjct: 390 AGGVTNAAVASNLMETVESSDLIAYGLIPEFVGRFPVLVSLSALTENQLMQVLTEPKNAL 449

Query: 499 GKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPE---- 558
           GKQYKKM++MN V+LHFT+ ALR+IARKA+ KNTGARGLR++LE+IL ++M+E+P+    
Sbjct: 450 GKQYKKMYQMNSVKLHFTESALRLIARKAITKNTGARGLRALLESILMDSMYEIPDEGTG 509

Query: 559 TNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLKGKGVLADEGRFSNG 618
           ++ I+AV+VDEEAV      G GAK+L     L +Y      ++         +G     
Sbjct: 510 SDMIEAVVVDEEAVEGEGRRGSGAKILRGKGALARYLSETNSKDSPQTTKEGSDGETEVE 569

Query: 619 VEFPSV 621
            E PSV
Sbjct: 570 AEIPSV 575

BLAST of BhiUN159G30 vs. TAIR10
Match: AT5G64580.1 (AAA-type ATPase family protein)

HSP 1 Score: 48.9 bits (115), Expect = 1.3e-05
Identity = 37/103 (35.92%), Postives = 53/103 (51.46%), Query Frame = 0

Query: 275 VLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDVESILYKLLLDAEFNVE 334
           VLL GP G+GKTLLAK +A    +PF  A+ T   +  +VG     +    + D   +  
Sbjct: 352 VLLHGPPGTGKTLLAKAIAGEAGLPFFAANGTDFVE-MFVG-----VAASRVKDLFASSR 411

Query: 335 AAQRGIVYIDEVDKITKKSESINSGRDVSGEGV--QQALLKIL 376
           +    I++IDE+D I  K      G D+ G G   +Q LL+IL
Sbjct: 412 SYAPSIIFIDEIDAIGSK----RGGPDIGGGGAEREQGLLQIL 444

BLAST of BhiUN159G30 vs. TAIR10
Match: AT1G09100.1 (26S proteasome AAA-ATPase subunit RPT5B)

HSP 1 Score: 45.1 bits (105), Expect = 1.9e-04
Identity = 37/103 (35.92%), Postives = 56/103 (54.37%), Query Frame = 0

Query: 275 VLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDVESILYKLLLDAEFNVE 334
           VLL GP G+GKTL+A+  A   N  F       L Q  ++G+       KL+ DA    +
Sbjct: 207 VLLYGPPGTGKTLMARACAAQTNATFLKLAGPQLVQ-MFIGDGA-----KLVRDAFLLAK 266

Query: 335 AAQRGIVYIDEVDKI-TKKSESINSGRDVSGE-GVQQALLKIL 376
                I++IDE+D I TK+ +S     +VSG+  VQ+ +L++L
Sbjct: 267 EKSPCIIFIDEIDAIGTKRFDS-----EVSGDREVQRTMLELL 298

BLAST of BhiUN159G30 vs. Swiss-Prot
Match: sp|F4K7F6|CLPX2_ARATH (CLP protease regulatory subunit CLPX2, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=CLPX2 PE=3 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 1.8e-185
Identity = 362/595 (60.84%), Postives = 430/595 (72.27%), Query Frame = 0

Query: 13  IKLFTSMRTITTTDSPFHR-RFITPLPSKPFLQPHAAARSAGLFLWFSQRRHSWKGA--- 72
           I  F S +TIT++       RF+  + S P + P     S  L    S  R  W      
Sbjct: 7   ISRFVSRKTITSSSLLSRSFRFLLSVDSPPHI-PLLRPSSNTLIPSSSFSRRIWDSCXXX 66

Query: 73  ------SDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFYFR 132
                   YD+IR+DVNCPRCS QM +IFSNRPLS+TARE G+YQA+N C  CKTAFYFR
Sbjct: 67  XXXXXXXXYDHIRSDVNCPRCSAQMHVIFSNRPLSLTAREPGIYQAVNFCSQCKTAFYFR 126

Query: 133 PFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCGGG 192
           PFKL PLQG+FIE+G+VKG                +         D              
Sbjct: 127 PFKLSPLQGSFIELGKVKGTDXXXXXXXXXXXSFPRNWKIQGLRSDEXXXXXXXXXXXXX 186

Query: 193 GGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRI 252
           GGD    SV             ++L TPKEI + LD+F++GQEKAKKVLSVAVYNHYKRI
Sbjct: 187 GGDKEKQSV-------------IKLPTPKEICQGLDEFVIGQEKAKKVLSVAVYNHYKRI 246

Query: 253 YHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTI 312
           YHAS +K S   S  I++E+D+ + VEL+KSNVLL+GPTGSGKTLLAKTLAR+VNVPF I
Sbjct: 247 YHASRKKGSASESYNIDMEDDNIDHVELDKSNVLLLGPTGSGKTLLAKTLARIVNVPFAI 306

Query: 313 ADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDV 372
           ADAT+LTQASYVGEDVESILYKL ++A  NVE AQRGIVYIDEVDK+T KS S N GRDV
Sbjct: 307 ADATSLTQASYVGEDVESILYKLYVEAGCNVEEAQRGIVYIDEVDKMTMKSHSSNGGRDV 366

Query: 373 SGEGVQQALLKILEGTVVDV--SDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 432
           SGEGVQQ+LLK+LEGTVV V   +KG R+ P  D+IQMDTK+ILFICGGAF+ LEK +S+
Sbjct: 367 SGEGVQQSLLKLLEGTVVSVPIPEKGLRRDPRGDSIQMDTKDILFICGGAFIDLEKTVSE 426

Query: 433 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 492
           RQHD+SIGFGA VR +M T GL+   VTSS+LES++S DLV YGLIPEFVGR PILVSLS
Sbjct: 427 RQHDASIGFGASVRTNMSTSGLSSAAVTSSLLESLQSEDLVAYGLIPEFVGRLPILVSLS 486

Query: 493 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 552
           AL+ED+LVQVLT+PK+A+GKQYKK+FRMN+V+L FT+ A R+IARKAM KNTGARGLRSI
Sbjct: 487 ALNEDQLVQVLTEPKSALGKQYKKLFRMNNVQLQFTEGATRLIARKAMSKNTGARGLRSI 546

Query: 553 LENILTEAMFEVPE-----TNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQY 591
           LE+ILTEAMFEVP+     + SIKAVLVDEEAVGSV +PGCGAK+L   + L+Q+
Sbjct: 547 LESILTEAMFEVPDSITEGSQSIKAVLVDEEAVGSVGSPGCGAKILKGDNVLQQF 587

BLAST of BhiUN159G30 vs. Swiss-Prot
Match: sp|Q66GN9|CLPX3_ARATH (CLP protease regulatory subunit CLPX3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=CLPX3 PE=2 SV=1)

HSP 1 Score: 567.0 bits (1460), Expect = 2.6e-160
Identity = 324/562 (57.65%), Postives = 393/562 (69.93%), Query Frame = 0

Query: 74  IRADVNCPRCSKQMPLIFSNRPL-------------SITARETGVYQALNLCPNCKTAFY 133
           +RA+ NCPRCSKQM L+FSNR               S  A +   +Q++N CP CKTA+ 
Sbjct: 66  LRAEPNCPRCSKQMDLLFSNRQFPSSNLLQRPDDSDSSGAGDKTNFQSVNFCPTCKTAYG 125

Query: 134 FRPFKLVPLQGTFIEIGRVKG------------------ARDLDDHVG-------NESSQ 193
           F P  + PLQGTFIEIGRV+                       D + G         S  
Sbjct: 126 FNPRGVSPLQGTFIEIGRVQSPTXXXXXXXXXXXXXXXXXXSKDPNQGFNYRNKLRSSFW 185

Query: 194 HTKGDLGANCTDDSALPPRWN----GSCGGGGGDGNLGSVES-------NGVQK-GDGSL 253
            T    GA   +D + PP  +                 +V++       N V + G   L
Sbjct: 186 DTLRSYGAEPPEDWSPPPPHSPLNXXXXXXXXXXXXXXAVDTSPLPDAVNDVSRWGGAGL 245

Query: 254 EMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTSGQGSLGIELEND 313
                TPKEI + LDKF++GQ +AKKVLSVAVYNHYKRIYH S++K    GS    +++D
Sbjct: 246 GRDFPTPKEICKWLDKFVIGQSRAKKVLSVAVYNHYKRIYHTSMKK----GSAAQPIDDD 305

Query: 314 DNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDVESILY 373
           DN  VEL+KSNVLLMGPTGSGKTLLAKTLAR+VNVPF IADAT LTQA YVG+DVESIL+
Sbjct: 306 DN--VELDKSNVLLMGPTGSGKTLLAKTLARLVNVPFVIADATTLTQAGYVGDDVESILH 365

Query: 374 KLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILEGTVVDVS 433
           KLL  AEFNV+AAQ+GIVYIDEVDKITKK+ES+N  RDVSGEGVQQALLK+LEGT+V+V 
Sbjct: 366 KLLTVAEFNVQAAQQGIVYIDEVDKITKKAESLNISRDVSGEGVQQALLKLLEGTIVNVP 425

Query: 434 DKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRASMRTGGLT 493
            KGARKHP  D IQ+DTK+ILFICGGAFV LEK I DR+ DSSIGFGAPVRA+M T G+T
Sbjct: 426 GKGARKHPRGDHIQIDTKDILFICGGAFVDLEKTIVDRRQDSSIGFGAPVRANMATSGVT 485

Query: 494 EDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKNAIGKQYK 553
              +TSS+LESVES+DL  YGLIPEFVGRFPILVSLSAL ED+L++VL +PKNA+GKQYK
Sbjct: 486 SGAITSSLLESVESADLTAYGLIPEFVGRFPILVSLSALTEDQLIRVLVEPKNALGKQYK 545

Query: 554 KMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPETNS----IK 582
           K+F MN+V+LHFT++AL +I+++AM KNTGARGLR++LE+ILTEAMFE+P+       I 
Sbjct: 546 KLFSMNNVKLHFTEKALEIISKQAMVKNTGARGLRALLESILTEAMFEIPDDKKGDERID 605

BLAST of BhiUN159G30 vs. Swiss-Prot
Match: sp|Q9FK07|CLPX1_ARATH (CLP protease regulatory subunit CLPX1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=CLPX1 PE=2 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 7.8e-149
Identity = 276/426 (64.79%), Postives = 338/426 (79.34%), Query Frame = 0

Query: 199 GDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTSGQGSLGI 258
           G  +L     TPKEI + L+KF++GQE+AKKVLSVAVYNHYKRIYH S QK S   +   
Sbjct: 150 GGSNLGSDFPTPKEICKGLNKFVIGQERAKKVLSVAVYNHYKRIYHESSQKRSAGETDST 209

Query: 259 ELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDV 318
             +  D+++VELEKSN+LLMGPTGSGKTLLAKTLAR VNVPF IADAT LTQA YVGEDV
Sbjct: 210 AAKPADDDMVELEKSNILLMGPTGSGKTLLAKTLARFVNVPFVIADATTLTQAGYVGEDV 269

Query: 319 ESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILEGT 378
           ESILYKLL  A++NV AAQ+GIVYIDEVDKITKK+ES+N  RDVSGEGVQQALLK+LEGT
Sbjct: 270 ESILYKLLTVADYNVAAAQQGIVYIDEVDKITKKAESLNISRDVSGEGVQQALLKMLEGT 329

Query: 379 VVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRASMR 438
           +V+V +KGARKHP  D IQ+DTK+ILFICGGAFV +EK IS+R+HDSSIGFGAPVRA+MR
Sbjct: 330 IVNVPEKGARKHPRGDNIQIDTKDILFICGGAFVDIEKTISERRHDSSIGFGAPVRANMR 389

Query: 439 TGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKNAI 498
            GG+T   V S+++E+VESSDL+ YGLIPEFVGRFP+LVSLSAL E++L+QVLT+PKNA+
Sbjct: 390 AGGVTNAAVASNLMETVESSDLIAYGLIPEFVGRFPVLVSLSALTENQLMQVLTEPKNAL 449

Query: 499 GKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPE---- 558
           GKQYKKM++MN V+LHFT+ ALR+IARKA+ KNTGARGLR++LE+IL ++M+E+P+    
Sbjct: 450 GKQYKKMYQMNSVKLHFTESALRLIARKAITKNTGARGLRALLESILMDSMYEIPDEGTG 509

Query: 559 TNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLKGKGVLADEGRFSNG 618
           ++ I+AV+VDEEAV      G GAK+L     L +Y      ++         +G     
Sbjct: 510 SDMIEAVVVDEEAVEGEGRRGSGAKILRGKGALARYLSETNSKDSPQTTKEGSDGETEVE 569

Query: 619 VEFPSV 621
            E PSV
Sbjct: 570 AEIPSV 575

BLAST of BhiUN159G30 vs. Swiss-Prot
Match: sp|Q11J59|CLPX_CHESB (ATP-dependent Clp protease ATP-binding subunit ClpX OS=Chelativorans sp. (strain BNC1) OX=266779 GN=clpX PE=3 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 9.3e-110
Identity = 209/366 (57.10%), Postives = 266/366 (72.68%), Query Frame = 0

Query: 209 TPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTSGQGSLGIELENDDNEIV 268
           TP+EI E LD +++GQ  AK+VLSVAV+NHYKR+ HA                   N  V
Sbjct: 67  TPQEIMEVLDDYVIGQTYAKRVLSVAVHNHYKRLAHAG-----------------KNADV 126

Query: 269 ELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQASYVGEDVESILYKLLLD 328
           EL KSN+LL+GPTG GKTLLA+TLAR+++VPFT+ADAT LT+A YVGEDVE+I+ KLL  
Sbjct: 127 ELSKSNILLIGPTGCGKTLLAQTLARIIDVPFTMADATTLTEAGYVGEDVENIILKLLQS 186

Query: 329 AEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILEGTVVDVSDKGAR 388
           A++NVE AQRGIVYIDE+DKI++KS++ +  RDVSGEGVQQALLKI+EGTV  V  +G R
Sbjct: 187 ADYNVERAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKIMEGTVASVPPQGGR 246

Query: 389 KHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRASMRTGGLTEDLVT 448
           KHP  + +Q+DT +ILFICGGAF GLEK ISDR   +SIGFGA V +        ED   
Sbjct: 247 KHPQQEFLQVDTSSILFICGGAFAGLEKIISDRGKKTSIGFGASVSS-------PEDRRA 306

Query: 449 SSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKNAIGKQYKKMFRM 508
             +L  VE  DL+ +GLIPEFVGR PIL +L  LDE+ LVQ+LT+PKNA+ KQY+++F M
Sbjct: 307 GELLRQVEPEDLLKFGLIPEFVGRLPILATLEDLDEEALVQILTEPKNALVKQYQRLFEM 366

Query: 509 NDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPETNSIKAVLVDEEAV 568
            +VEL F + ALR IARKA+++ TGARGLRSI+E IL + MFE+P    ++ V++ E+ V
Sbjct: 367 ENVELTFHENALRAIARKAIERKTGARGLRSIMEAILLDTMFELPALEGVQEVVISEDVV 408

Query: 569 GSVDAP 575
                P
Sbjct: 427 AGSARP 408

BLAST of BhiUN159G30 vs. Swiss-Prot
Match: sp|Q1GGF7|CLPX_RUEST (ATP-dependent Clp protease ATP-binding subunit ClpX OS=Ruegeria sp. (strain TM1040) OX=292414 GN=clpX PE=3 SV=1)

HSP 1 Score: 398.7 bits (1023), Expect = 1.2e-109
Identity = 215/383 (56.14%), Postives = 275/383 (71.80%), Query Frame = 0

Query: 192 ESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYKRIYHASLQKTS 251
           +++G++  DG     + TPK+I E LD +++GQ  AK+VLSVAV+NHYKR+ HA  QK  
Sbjct: 53  KASGMKATDG-----VPTPKDICEVLDDYVIGQATAKRVLSVAVHNHYKRLNHA--QKAG 112

Query: 252 GQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPFTIADATALTQA 311
                     ND    +EL KSN+LL+GPTG GKTLLA+TLAR+++VPFT+ADAT LT+A
Sbjct: 113 ----------ND----IELSKSNILLIGPTGCGKTLLAQTLARILDVPFTMADATTLTEA 172

Query: 312 SYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQAL 371
            YVGEDVE+I+ KLL  +E+NVE AQRGIVYIDEVDKIT+KSE+ +  RDVSGEGVQQAL
Sbjct: 173 GYVGEDVENIILKLLQASEYNVERAQRGIVYIDEVDKITRKSENPSITRDVSGEGVQQAL 232

Query: 372 LKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGA 431
           LK++EGTV  V  +G RKHP  + +Q+DT NILFICGGAF GL+K I  R   S++GFGA
Sbjct: 233 LKLMEGTVASVPPQGGRKHPQQEFLQVDTTNILFICGGAFAGLDKIIKQRGKGSAMGFGA 292

Query: 432 PVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVL 491
            VR     G              +E  DL+ +GLIPEFVGR P+L +L  LDED L+ +L
Sbjct: 293 DVREESDAG-------VGETFRDLEPEDLLKFGLIPEFVGRLPVLATLEDLDEDALITIL 352

Query: 492 TKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFE 551
           TKPKNA+ KQY+++F + D EL FTD AL  IA+KA+++ TGARGLRSILE+IL + MFE
Sbjct: 353 TKPKNALVKQYQRLFELEDTELDFTDEALSAIAKKAIERKTGARGLRSILEDILLDTMFE 407

Query: 552 VPETNSIKAVLVDEEAVGSVDAP 575
           +P   S+  V+V+EEAV S   P
Sbjct: 413 LPGMESVTKVVVNEEAVCSEAQP 407

BLAST of BhiUN159G30 vs. TrEMBL
Match: tr|A0A1S3CD89|A0A1S3CD89_CUCME (CLP protease regulatory subunit CLPX2, mitochondrial isoform X3 OS=Cucumis melo OX=3656 GN=LOC103499373 PE=4 SV=1)

HSP 1 Score: 1060.8 bits (2742), Expect = 1.2e-306
Identity = 546/623 (87.64%), Postives = 571/623 (91.65%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFR  NT FIKLFTS+RTITTTDS FH RF+TP+PSK    PHAA RSA   LW SQ
Sbjct: 1   MIPRFRFNNTHFIKLFTSIRTITTTDSLFHHRFLTPVPSK----PHAAPRSAVFTLWLSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVYQALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYQALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRP KLVPL GTFIEIGRVKGARDLD    NES+Q T+GD+GANC DD ALPPR NGSC 
Sbjct: 121 FRPLKLVPLHGTFIEIGRVKGARDLDHDADNESNQPTRGDIGANCADDFALPPRQNGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGGD NLGSVESNGV K + SL MQLLTPKEIS ALDKF+VGQEKAKKVLSVAVYNHYK
Sbjct: 181 -GGGDDNLGSVESNGVHKDEASLGMQLLTPKEISLALDKFVVGQEKAKKVLSVAVYNHYK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYH+SLQKTSGQGSLG ELENDDNE VELEKSN+LLMGPTGSGKTLLAKTLARVVNVPF
Sbjct: 241 RIYHSSLQKTSGQGSLGTELENDDNETVELEKSNLLLMGPTGSGKTLLAKTLARVVNVPF 300

Query: 301 TIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360
           TIADATALTQA YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR
Sbjct: 301 TIADATALTQAGYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360

Query: 361 DVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 420
           DVSGEGVQQALLK+LEGTVVDV D GAR+HP  DTIQMDTKNILFICGGAFVGLEKCISD
Sbjct: 361 DVSGEGVQQALLKMLEGTVVDVPDTGARRHPRGDTIQMDTKNILFICGGAFVGLEKCISD 420

Query: 421 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 480
           RQHDSSIGFGAPVRASMRT  LTEDLVTSSMLE+VESSDLVTYGLIPEFVGR PILVSLS
Sbjct: 421 RQHDSSIGFGAPVRASMRTARLTEDLVTSSMLENVESSDLVTYGLIPEFVGRCPILVSLS 480

Query: 481 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 540
           ALDED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSI
Sbjct: 481 ALDEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSI 540

Query: 541 LENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLK 600
           LENILTEAMFEVPE+NSIKAVLVD E+VGSVDAPGCGAK+LCDVDEL + SK EI+RNLK
Sbjct: 541 LENILTEAMFEVPESNSIKAVLVDGESVGSVDAPGCGAKILCDVDELTKCSKSEIIRNLK 600

Query: 601 GKGVLA-DEGRFSNGVEFPSVTM 623
           G  ++A DEGRFSNGVEFPSV M
Sbjct: 601 GNDMVADDEGRFSNGVEFPSVAM 617

BLAST of BhiUN159G30 vs. TrEMBL
Match: tr|A0A1S4E2X7|A0A1S4E2X7_CUCME (CLP protease regulatory subunit CLPX2, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499373 PE=4 SV=1)

HSP 1 Score: 1039.6 bits (2687), Expect = 2.8e-300
Identity = 546/667 (81.86%), Postives = 571/667 (85.61%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFR  NT FIKLFTS+RTITTTDS FH RF+TP+PSK    PHAA RSA   LW SQ
Sbjct: 1   MIPRFRFNNTHFIKLFTSIRTITTTDSLFHHRFLTPVPSK----PHAAPRSAVFTLWLSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVYQALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYQALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRP KLVPL GTFIEIGRVKGARDLD    NES+Q T+GD+GANC DD ALPPR NGSC 
Sbjct: 121 FRPLKLVPLHGTFIEIGRVKGARDLDHDADNESNQPTRGDIGANCADDFALPPRQNGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGGD NLGSVESNGV K + SL MQLLTPKEIS ALDKF+VGQEKAKKVLSVAVYNHYK
Sbjct: 181 -GGGDDNLGSVESNGVHKDEASLGMQLLTPKEISLALDKFVVGQEKAKKVLSVAVYNHYK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYH+SLQKTSGQGSLG ELENDDNE VELEKSN+LLMGPTGSGKTLLAKTLARVVNVPF
Sbjct: 241 RIYHSSLQKTSGQGSLGTELENDDNETVELEKSNLLLMGPTGSGKTLLAKTLARVVNVPF 300

Query: 301 TIADATALTQASYVGEDVESILYKLLL--------------------------------- 360
           TIADATALTQA YVGEDVESILYKLLL                                 
Sbjct: 301 TIADATALTQAGYVGEDVESILYKLLLGCKSFHKMLERFLMKHDYSIWYVKHLQTSLQLL 360

Query: 361 -----------DAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILE 420
                      DAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLK+LE
Sbjct: 361 SKSLGACYAEKDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKMLE 420

Query: 421 GTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRAS 480
           GTVVDV D GAR+HP  DTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRAS
Sbjct: 421 GTVVDVPDTGARRHPRGDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRAS 480

Query: 481 MRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKN 540
           MRT  LTEDLVTSSMLE+VESSDLVTYGLIPEFVGR PILVSLSALDED+LVQVLTKPKN
Sbjct: 481 MRTARLTEDLVTSSMLENVESSDLVTYGLIPEFVGRCPILVSLSALDEDQLVQVLTKPKN 540

Query: 541 AIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPETN 600
           A+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSILENILTEAMFEVPE+N
Sbjct: 541 ALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSILENILTEAMFEVPESN 600

Query: 601 SIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLKGKGVLA-DEGRFSNGV 623
           SIKAVLVD E+VGSVDAPGCGAK+LCDVDEL + SK EI+RNLKG  ++A DEGRFSNGV
Sbjct: 601 SIKAVLVDGESVGSVDAPGCGAKILCDVDELTKCSKSEIIRNLKGNDMVADDEGRFSNGV 660

BLAST of BhiUN159G30 vs. TrEMBL
Match: tr|A0A1S3CCC6|A0A1S3CCC6_CUCME (CLP protease regulatory subunit CLPX2, mitochondrial isoform X5 OS=Cucumis melo OX=3656 GN=LOC103499373 PE=4 SV=1)

HSP 1 Score: 999.2 bits (2582), Expect = 4.1e-288
Identity = 519/623 (83.31%), Postives = 544/623 (87.32%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFR  NT FIKLFTS+RTITTTDS FH RF+TP+PSK    PHAA RSA   LW SQ
Sbjct: 1   MIPRFRFNNTHFIKLFTSIRTITTTDSLFHHRFLTPVPSK----PHAAPRSAVFTLWLSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVYQALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYQALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRP KLVPL GTFIEIGRVKGARDLD    NES+Q T+GD+GANC DD ALPPR NGSC 
Sbjct: 121 FRPLKLVPLHGTFIEIGRVKGARDLDHDADNESNQPTRGDIGANCADDFALPPRQNGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGGD NLGSVESNGV K + SL MQLLTPKEIS ALDKF+VGQEKAKKVLSVAVYNHYK
Sbjct: 181 -GGGDDNLGSVESNGVHKDEASLGMQLLTPKEISLALDKFVVGQEKAKKVLSVAVYNHYK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYH+SLQKTSGQGSLG ELENDDNE VELEKSN+LLMGPTGSG                
Sbjct: 241 RIYHSSLQKTSGQGSLGTELENDDNETVELEKSNLLLMGPTGSG---------------- 300

Query: 301 TIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360
                       YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR
Sbjct: 301 ------------YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360

Query: 361 DVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 420
           DVSGEGVQQALLK+LEGTVVDV D GAR+HP  DTIQMDTKNILFICGGAFVGLEKCISD
Sbjct: 361 DVSGEGVQQALLKMLEGTVVDVPDTGARRHPRGDTIQMDTKNILFICGGAFVGLEKCISD 420

Query: 421 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 480
           RQHDSSIGFGAPVRASMRT  LTEDLVTSSMLE+VESSDLVTYGLIPEFVGR PILVSLS
Sbjct: 421 RQHDSSIGFGAPVRASMRTARLTEDLVTSSMLENVESSDLVTYGLIPEFVGRCPILVSLS 480

Query: 481 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 540
           ALDED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSI
Sbjct: 481 ALDEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSI 540

Query: 541 LENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLK 600
           LENILTEAMFEVPE+NSIKAVLVD E+VGSVDAPGCGAK+LCDVDEL + SK EI+RNLK
Sbjct: 541 LENILTEAMFEVPESNSIKAVLVDGESVGSVDAPGCGAKILCDVDELTKCSKSEIIRNLK 589

Query: 601 GKGVLA-DEGRFSNGVEFPSVTM 623
           G  ++A DEGRFSNGVEFPSV M
Sbjct: 601 GNDMVADDEGRFSNGVEFPSVAM 589

BLAST of BhiUN159G30 vs. TrEMBL
Match: tr|A0A1S4E2W3|A0A1S4E2W3_CUCME (CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499373 PE=4 SV=1)

HSP 1 Score: 978.0 bits (2527), Expect = 9.8e-282
Identity = 519/667 (77.81%), Postives = 544/667 (81.56%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFR  NT FIKLFTS+RTITTTDS FH RF+TP+PSK    PHAA RSA   LW SQ
Sbjct: 1   MIPRFRFNNTHFIKLFTSIRTITTTDSLFHHRFLTPVPSK----PHAAPRSAVFTLWLSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVYQALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYQALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRP KLVPL GTFIEIGRVKGARDLD    NES+Q T+GD+GANC DD ALPPR NGSC 
Sbjct: 121 FRPLKLVPLHGTFIEIGRVKGARDLDHDADNESNQPTRGDIGANCADDFALPPRQNGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGGD NLGSVESNGV K + SL MQLLTPKEIS ALDKF+VGQEKAKKVLSVAVYNHYK
Sbjct: 181 -GGGDDNLGSVESNGVHKDEASLGMQLLTPKEISLALDKFVVGQEKAKKVLSVAVYNHYK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYH+SLQKTSGQGSLG ELENDDNE VELEKSN+LLMGPTGSG                
Sbjct: 241 RIYHSSLQKTSGQGSLGTELENDDNETVELEKSNLLLMGPTGSG---------------- 300

Query: 301 TIADATALTQASYVGEDVESILYKLLL--------------------------------- 360
                       YVGEDVESILYKLLL                                 
Sbjct: 301 ------------YVGEDVESILYKLLLGCKSFHKMLERFLMKHDYSIWYVKHLQTSLQLL 360

Query: 361 -----------DAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKILE 420
                      DAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLK+LE
Sbjct: 361 SKSLGACYAEKDAEFNVEAAQRGIVYIDEVDKITKKSESINSGRDVSGEGVQQALLKMLE 420

Query: 421 GTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRAS 480
           GTVVDV D GAR+HP  DTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRAS
Sbjct: 421 GTVVDVPDTGARRHPRGDTIQMDTKNILFICGGAFVGLEKCISDRQHDSSIGFGAPVRAS 480

Query: 481 MRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLSALDEDELVQVLTKPKN 540
           MRT  LTEDLVTSSMLE+VESSDLVTYGLIPEFVGR PILVSLSALDED+LVQVLTKPKN
Sbjct: 481 MRTARLTEDLVTSSMLENVESSDLVTYGLIPEFVGRCPILVSLSALDEDQLVQVLTKPKN 540

Query: 541 AIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSILENILTEAMFEVPETN 600
           A+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSILENILTEAMFEVPE+N
Sbjct: 541 ALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSILENILTEAMFEVPESN 600

Query: 601 SIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLKGKGVLA-DEGRFSNGV 623
           SIKAVLVD E+VGSVDAPGCGAK+LCDVDEL + SK EI+RNLKG  ++A DEGRFSNGV
Sbjct: 601 SIKAVLVDGESVGSVDAPGCGAKILCDVDELTKCSKSEIIRNLKGNDMVADDEGRFSNGV 633

BLAST of BhiUN159G30 vs. TrEMBL
Match: tr|A0A1S3CE05|A0A1S3CE05_CUCME (CLP protease regulatory subunit CLPX2, mitochondrial isoform X4 OS=Cucumis melo OX=3656 GN=LOC103499373 PE=4 SV=1)

HSP 1 Score: 977.2 bits (2525), Expect = 1.7e-281
Identity = 509/626 (81.31%), Postives = 543/626 (86.74%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFR  NT FIKLFTS+RTITTTDS FH RF+TP+PSK    PHAA RSA   LW SQ
Sbjct: 1   MIPRFRFNNTHFIKLFTSIRTITTTDSLFHHRFLTPVPSK----PHAAPRSAVFTLWLSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVYQALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYQALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRP KLVPL GTFIEIGRVKGARDLD    NES+Q T+GD+GANC DD ALPPR NGSC 
Sbjct: 121 FRPLKLVPLHGTFIEIGRVKGARDLDHDADNESNQPTRGDIGANCADDFALPPRQNGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGGD NLGSVESNGV K + SL MQLLTPKEIS ALDKF+VGQEKAKKVLSVAVYNHYK
Sbjct: 181 -GGGDDNLGSVESNGVHKDEASLGMQLLTPKEISLALDKFVVGQEKAKKVLSVAVYNHYK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYH+SLQKTSGQGSLG ELENDDNE VELEKSN+LLMGPTGSG+    +    ++  P 
Sbjct: 241 RIYHSSLQKTSGQGSLGTELENDDNETVELEKSNLLLMGPTGSGRLCRRRCGIDIIQAP- 300

Query: 301 TIADATALTQASYVGEDVESILYKLL---LDAEFNVEAAQRGIVYIDEVDKITKKSESIN 360
                   +    + ++V  IL +     LDAEFNVEAAQRGIVYIDEVDKITKKSESIN
Sbjct: 301 --------SGLQILSQNVGKILDEARLFDLDAEFNVEAAQRGIVYIDEVDKITKKSESIN 360

Query: 361 SGRDVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKC 420
           SGRDVSGEGVQQALLK+LEGTVVDV D GAR+HP  DTIQMDTKNILFICGGAFVGLEKC
Sbjct: 361 SGRDVSGEGVQQALLKMLEGTVVDVPDTGARRHPRGDTIQMDTKNILFICGGAFVGLEKC 420

Query: 421 ISDRQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILV 480
           ISDRQHDSSIGFGAPVRASMRT  LTEDLVTSSMLE+VESSDLVTYGLIPEFVGR PILV
Sbjct: 421 ISDRQHDSSIGFGAPVRASMRTARLTEDLVTSSMLENVESSDLVTYGLIPEFVGRCPILV 480

Query: 481 SLSALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGL 540
           SLSALDED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGL
Sbjct: 481 SLSALDEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGL 540

Query: 541 RSILENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVR 600
           RSILENILTEAMFEVPE+NSIKAVLVD E+VGSVDAPGCGAK+LCDVDEL + SK EI+R
Sbjct: 541 RSILENILTEAMFEVPESNSIKAVLVDGESVGSVDAPGCGAKILCDVDELTKCSKSEIIR 600

Query: 601 NLKGKGVLA-DEGRFSNGVEFPSVTM 623
           NLKG  ++A DEGRFSNGVEFPSV M
Sbjct: 601 NLKGNDMVADDEGRFSNGVEFPSVAM 611

BLAST of BhiUN159G30 vs. NCBI nr
Match: XP_023530366.1 (CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1083.6 bits (2801), Expect = 0.0e+00
Identity = 554/625 (88.64%), Postives = 583/625 (93.28%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFRVKNT+F+ LFTS+RTITTTDS FH RF+TPLPSKP L+PHAAARSAG  LWFSQ
Sbjct: 1   MIPRFRVKNTKFLMLFTSIRTITTTDSLFHHRFLTPLPSKPSLEPHAAARSAGFILWFSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVY ALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYHALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRPFKLVPLQGTFIEIGRVKGARDLD+ V NE  Q TKGD+GAN  DDSALPPRW GSC 
Sbjct: 121 FRPFKLVPLQGTFIEIGRVKGARDLDNIVENEPCQPTKGDIGANSADDSALPPRWKGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGG G+LG VESNGV K + +   QLLTPKEISEALDKFIVGQEKAKK LSVAVYNH+K
Sbjct: 181 -GGGHGSLGLVESNGVPKDEANSGRQLLTPKEISEALDKFIVGQEKAKKALSVAVYNHHK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYHASLQKT GQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF
Sbjct: 241 RIYHASLQKTPGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300

Query: 301 TIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360
           TIADATALTQA YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSE++NSGR
Sbjct: 301 TIADATALTQAGYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSENVNSGR 360

Query: 361 DVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 420
           DVSGEGVQQALLK+LEGTVVDV DKGARKHP  DTIQ+DTKNILF+CGGAFVGLEKCISD
Sbjct: 361 DVSGEGVQQALLKMLEGTVVDVPDKGARKHPQGDTIQIDTKNILFVCGGAFVGLEKCISD 420

Query: 421 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 480
           RQHDSSIGFGAPVRASMRTGGLT+D+VTSS+LESVESSDLVTYGLIPEFVGRFPILVSLS
Sbjct: 421 RQHDSSIGFGAPVRASMRTGGLTDDIVTSSILESVESSDLVTYGLIPEFVGRFPILVSLS 480

Query: 481 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 540
           AL+ED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSI
Sbjct: 481 ALNEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSI 540

Query: 541 LENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLK 600
           LE+ILTEAMFEVPE+ SI+AVLVDEEAVGSVD PGCGAKVL DVDELKQ SK  IV NLK
Sbjct: 541 LEHILTEAMFEVPESKSIRAVLVDEEAVGSVDGPGCGAKVLYDVDELKQCSKSGIVINLK 600

Query: 601 GKGVLA-DEGRFSNGVEFPSVTMRL 625
           G G+LA DEGRFSNGVEFPSVTMRL
Sbjct: 601 GNGLLAEDEGRFSNGVEFPSVTMRL 623

BLAST of BhiUN159G30 vs. NCBI nr
Match: XP_022933080.1 (CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 [Cucurbita moschata])

HSP 1 Score: 1078.2 bits (2787), Expect = 0.0e+00
Identity = 554/626 (88.50%), Postives = 583/626 (93.13%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRF VKNT+FIKLFTS+R+ITTTDS FH RF+TPLP KP L+PHAAARSAG  LWFSQ
Sbjct: 1   MIPRFSVKNTKFIKLFTSIRSITTTDSLFHHRFLTPLPPKPSLEPHAAARSAGFILWFSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVY ALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYHALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRPFKLVPLQGTFIEIGRVKGARDLD+ V NES Q TKGD+GAN  DDSALPPRW GSC 
Sbjct: 121 FRPFKLVPLQGTFIEIGRVKGARDLDNIVENESCQPTKGDIGANSADDSALPPRWKGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGG G+LG VESNGV K + +   QLLTPKEISEALDKFIVGQEKAKK LSVAVYNH+K
Sbjct: 181 -GGGHGSLGLVESNGVPKDEANSGRQLLTPKEISEALDKFIVGQEKAKKALSVAVYNHHK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYHASLQKT GQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF
Sbjct: 241 RIYHASLQKTPGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300

Query: 301 TIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360
           TIADATALTQA YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSE++NSGR
Sbjct: 301 TIADATALTQAGYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSENVNSGR 360

Query: 361 DVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 420
           DVSGEGVQQALLK+LEGTVVDV DKGARKHP  DTIQ+DTKNILFICGGAFVGLEKCISD
Sbjct: 361 DVSGEGVQQALLKMLEGTVVDVPDKGARKHPQGDTIQIDTKNILFICGGAFVGLEKCISD 420

Query: 421 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 480
           RQHDSSIGFGAPVRASMRTGGLT+D+VTSS+LESVESSDLVTYGLIPEFVGRFPILVSLS
Sbjct: 421 RQHDSSIGFGAPVRASMRTGGLTDDIVTSSILESVESSDLVTYGLIPEFVGRFPILVSLS 480

Query: 481 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 540
           AL+ED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSI
Sbjct: 481 ALNEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSI 540

Query: 541 LENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVD-ELKQYSKGEIVRNL 600
           LE+ILTEAMFEVPE+ SI+AVLVDEEAVGSVD PGCGAKVL DVD ELKQ SK  IV NL
Sbjct: 541 LEHILTEAMFEVPESKSIRAVLVDEEAVGSVDGPGCGAKVLYDVDEELKQCSKSGIVINL 600

Query: 601 KGKGVLA-DEGRFSNGVEFPSVTMRL 625
           KG G+LA DEGRFSNGVEFPS+TMRL
Sbjct: 601 KGNGLLAEDEGRFSNGVEFPSITMRL 624

BLAST of BhiUN159G30 vs. NCBI nr
Match: XP_022997640.1 (CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 [Cucurbita maxima])

HSP 1 Score: 1078.2 bits (2787), Expect = 0.0e+00
Identity = 553/625 (88.48%), Postives = 583/625 (93.28%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFRVKNT+FI+LFTS+R+ITTTDS FH RF+TPLPSKP L+PHAAARSAG  L FSQ
Sbjct: 1   MIPRFRVKNTKFIRLFTSIRSITTTDSLFHHRFLTPLPSKPSLEPHAAARSAGFILRFSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVY ALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYHALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRPFKLVPLQGTFIEIGRVKGARDLD+ V NES Q +KGD+GAN  DDSALPPRW GSC 
Sbjct: 121 FRPFKLVPLQGTFIEIGRVKGARDLDNIVENESCQPSKGDIGANSADDSALPPRWKGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGG G+LG VESNGV K + +   QLLTPKEISEALDKFIVGQEKAKK LSVAVYNH+K
Sbjct: 181 -GGGHGSLGLVESNGVPKNEANSGRQLLTPKEISEALDKFIVGQEKAKKALSVAVYNHHK 240

Query: 241 RIYHASLQKTSGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300
           RIYHASLQKT GQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF
Sbjct: 241 RIYHASLQKTPGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVPF 300

Query: 301 TIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSGR 360
           TIADATALTQA YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSE++NSGR
Sbjct: 301 TIADATALTQAGYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSENVNSGR 360

Query: 361 DVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCISD 420
           DVSGEGVQQALLK+LEGTVVDV DKGARKHP  DTIQ+DTKNILFICGGAFVGLEKCISD
Sbjct: 361 DVSGEGVQQALLKMLEGTVVDVPDKGARKHPQGDTIQIDTKNILFICGGAFVGLEKCISD 420

Query: 421 RQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSLS 480
           RQHDSSIGFGAPVRASMRTGGLT+D+VTSS+LESVESSDLVTYGLIPEFVGRFPILVSLS
Sbjct: 421 RQHDSSIGFGAPVRASMRTGGLTDDIVTSSILESVESSDLVTYGLIPEFVGRFPILVSLS 480

Query: 481 ALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRSI 540
           AL+ED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRSI
Sbjct: 481 ALNEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRSI 540

Query: 541 LENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNLK 600
           LE+ILTEAMFEVPE+ SI+AVLVDEEAVGSVD PGCGAKVL DVDELKQ SK  IV NLK
Sbjct: 541 LEHILTEAMFEVPESKSIRAVLVDEEAVGSVDGPGCGAKVLYDVDELKQCSKSGIVINLK 600

Query: 601 GKGVLA-DEGRFSNGVEFPSVTMRL 625
           G G+L  DEGRFSNGVEFPSVTMRL
Sbjct: 601 GNGLLVEDEGRFSNGVEFPSVTMRL 623

BLAST of BhiUN159G30 vs. NCBI nr
Match: XP_023530359.1 (CLP protease regulatory subunit CLPX2, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 553/626 (88.34%), Postives = 582/626 (92.97%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRFRVKNT+F+ LFTS+RTITTTDS FH RF+TPLPSKP L+PHAAARSAG  LWFSQ
Sbjct: 1   MIPRFRVKNTKFLMLFTSIRTITTTDSLFHHRFLTPLPSKPSLEPHAAARSAGFILWFSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVY ALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYHALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRPFKLVPLQGTFIEIGRVKGARDLD+ V NE  Q TKGD+GAN  DDSALPPRW GSC 
Sbjct: 121 FRPFKLVPLQGTFIEIGRVKGARDLDNIVENEPCQPTKGDIGANSADDSALPPRWKGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGG G+LG VESNGV K + +   QLLTPKEISEALDKFIVGQEKAKK LSVAVYNH+K
Sbjct: 181 -GGGHGSLGLVESNGVPKDEANSGRQLLTPKEISEALDKFIVGQEKAKKALSVAVYNHHK 240

Query: 241 RIYHASLQKT-SGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVP 300
           RIYHASLQKT   QGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVP
Sbjct: 241 RIYHASLQKTLVWQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVP 300

Query: 301 FTIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSG 360
           FTIADATALTQA YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSE++NSG
Sbjct: 301 FTIADATALTQAGYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSENVNSG 360

Query: 361 RDVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCIS 420
           RDVSGEGVQQALLK+LEGTVVDV DKGARKHP  DTIQ+DTKNILF+CGGAFVGLEKCIS
Sbjct: 361 RDVSGEGVQQALLKMLEGTVVDVPDKGARKHPQGDTIQIDTKNILFVCGGAFVGLEKCIS 420

Query: 421 DRQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSL 480
           DRQHDSSIGFGAPVRASMRTGGLT+D+VTSS+LESVESSDLVTYGLIPEFVGRFPILVSL
Sbjct: 421 DRQHDSSIGFGAPVRASMRTGGLTDDIVTSSILESVESSDLVTYGLIPEFVGRFPILVSL 480

Query: 481 SALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRS 540
           SAL+ED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRS
Sbjct: 481 SALNEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRS 540

Query: 541 ILENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVDELKQYSKGEIVRNL 600
           ILE+ILTEAMFEVPE+ SI+AVLVDEEAVGSVD PGCGAKVL DVDELKQ SK  IV NL
Sbjct: 541 ILEHILTEAMFEVPESKSIRAVLVDEEAVGSVDGPGCGAKVLYDVDELKQCSKSGIVINL 600

Query: 601 KGKGVLA-DEGRFSNGVEFPSVTMRL 625
           KG G+LA DEGRFSNGVEFPSVTMRL
Sbjct: 601 KGNGLLAEDEGRFSNGVEFPSVTMRL 624

BLAST of BhiUN159G30 vs. NCBI nr
Match: XP_022933079.1 (CLP protease regulatory subunit CLPX2, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 1070.1 bits (2766), Expect = 2.8e-309
Identity = 553/627 (88.20%), Postives = 582/627 (92.82%), Query Frame = 0

Query: 1   MIPRFRVKNTEFIKLFTSMRTITTTDSPFHRRFITPLPSKPFLQPHAAARSAGLFLWFSQ 60
           MIPRF VKNT+FIKLFTS+R+ITTTDS FH RF+TPLP KP L+PHAAARSAG  LWFSQ
Sbjct: 1   MIPRFSVKNTKFIKLFTSIRSITTTDSLFHHRFLTPLPPKPSLEPHAAARSAGFILWFSQ 60

Query: 61  RRHSWKGASDYDYIRADVNCPRCSKQMPLIFSNRPLSITARETGVYQALNLCPNCKTAFY 120
           +RHSWKGASDYDYIRADVNCPRCSKQMP+IFSNRPLSIT RETGVY ALNLCPNCKTAFY
Sbjct: 61  KRHSWKGASDYDYIRADVNCPRCSKQMPVIFSNRPLSITGRETGVYHALNLCPNCKTAFY 120

Query: 121 FRPFKLVPLQGTFIEIGRVKGARDLDDHVGNESSQHTKGDLGANCTDDSALPPRWNGSCG 180
           FRPFKLVPLQGTFIEIGRVKGARDLD+ V NES Q TKGD+GAN  DDSALPPRW GSC 
Sbjct: 121 FRPFKLVPLQGTFIEIGRVKGARDLDNIVENESCQPTKGDIGANSADDSALPPRWKGSC- 180

Query: 181 GGGGDGNLGSVESNGVQKGDGSLEMQLLTPKEISEALDKFIVGQEKAKKVLSVAVYNHYK 240
            GGG G+LG VESNGV K + +   QLLTPKEISEALDKFIVGQEKAKK LSVAVYNH+K
Sbjct: 181 -GGGHGSLGLVESNGVPKDEANSGRQLLTPKEISEALDKFIVGQEKAKKALSVAVYNHHK 240

Query: 241 RIYHASLQKT-SGQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVP 300
           RIYHASLQKT   QGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVP
Sbjct: 241 RIYHASLQKTLVWQGSLGIELENDDNEIVELEKSNVLLMGPTGSGKTLLAKTLARVVNVP 300

Query: 301 FTIADATALTQASYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSESINSG 360
           FTIADATALTQA YVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSE++NSG
Sbjct: 301 FTIADATALTQAGYVGEDVESILYKLLLDAEFNVEAAQRGIVYIDEVDKITKKSENVNSG 360

Query: 361 RDVSGEGVQQALLKILEGTVVDVSDKGARKHPGDDTIQMDTKNILFICGGAFVGLEKCIS 420
           RDVSGEGVQQALLK+LEGTVVDV DKGARKHP  DTIQ+DTKNILFICGGAFVGLEKCIS
Sbjct: 361 RDVSGEGVQQALLKMLEGTVVDVPDKGARKHPQGDTIQIDTKNILFICGGAFVGLEKCIS 420

Query: 421 DRQHDSSIGFGAPVRASMRTGGLTEDLVTSSMLESVESSDLVTYGLIPEFVGRFPILVSL 480
           DRQHDSSIGFGAPVRASMRTGGLT+D+VTSS+LESVESSDLVTYGLIPEFVGRFPILVSL
Sbjct: 421 DRQHDSSIGFGAPVRASMRTGGLTDDIVTSSILESVESSDLVTYGLIPEFVGRFPILVSL 480

Query: 481 SALDEDELVQVLTKPKNAIGKQYKKMFRMNDVELHFTDRALRMIARKAMKKNTGARGLRS 540
           SAL+ED+LVQVLTKPKNA+GKQYKKMFRMNDVELHFT+ ALRMIARKAMKKNTGARGLRS
Sbjct: 481 SALNEDQLVQVLTKPKNALGKQYKKMFRMNDVELHFTENALRMIARKAMKKNTGARGLRS 540

Query: 541 ILENILTEAMFEVPETNSIKAVLVDEEAVGSVDAPGCGAKVLCDVD-ELKQYSKGEIVRN 600
           ILE+ILTEAMFEVPE+ SI+AVLVDEEAVGSVD PGCGAKVL DVD ELKQ SK  IV N
Sbjct: 541 ILEHILTEAMFEVPESKSIRAVLVDEEAVGSVDGPGCGAKVLYDVDEELKQCSKSGIVIN 600

Query: 601 LKGKGVLA-DEGRFSNGVEFPSVTMRL 625
           LKG G+LA DEGRFSNGVEFPS+TMRL
Sbjct: 601 LKGNGLLAEDEGRFSNGVEFPSITMRL 625

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT5G49840.19.9e-18760.84ATP-dependent Clp protease[more]
AT1G33360.11.4e-16157.65ATP-dependent Clp protease[more]
AT5G53350.14.3e-15064.79CLP protease regulatory subunit X[more]
AT5G64580.11.3e-0535.92AAA-type ATPase family protein[more]
AT1G09100.11.9e-0435.9226S proteasome AAA-ATPase subunit RPT5B[more]
Match NameE-valueIdentityDescription
sp|F4K7F6|CLPX2_ARATH1.8e-18560.84CLP protease regulatory subunit CLPX2, mitochondrial OS=Arabidopsis thaliana OX=... [more]
sp|Q66GN9|CLPX3_ARATH2.6e-16057.65CLP protease regulatory subunit CLPX3, mitochondrial OS=Arabidopsis thaliana OX=... [more]
sp|Q9FK07|CLPX1_ARATH7.8e-14964.79CLP protease regulatory subunit CLPX1, mitochondrial OS=Arabidopsis thaliana OX=... [more]
sp|Q11J59|CLPX_CHESB9.3e-11057.10ATP-dependent Clp protease ATP-binding subunit ClpX OS=Chelativorans sp. (strain... [more]
sp|Q1GGF7|CLPX_RUEST1.2e-10956.14ATP-dependent Clp protease ATP-binding subunit ClpX OS=Ruegeria sp. (strain TM10... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3CD89|A0A1S3CD89_CUCME1.2e-30687.64CLP protease regulatory subunit CLPX2, mitochondrial isoform X3 OS=Cucumis melo ... [more]
tr|A0A1S4E2X7|A0A1S4E2X7_CUCME2.8e-30081.86CLP protease regulatory subunit CLPX2, mitochondrial isoform X1 OS=Cucumis melo ... [more]
tr|A0A1S3CCC6|A0A1S3CCC6_CUCME4.1e-28883.31CLP protease regulatory subunit CLPX2, mitochondrial isoform X5 OS=Cucumis melo ... [more]
tr|A0A1S4E2W3|A0A1S4E2W3_CUCME9.8e-28277.81CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 OS=Cucumis melo ... [more]
tr|A0A1S3CE05|A0A1S3CE05_CUCME1.7e-28181.31CLP protease regulatory subunit CLPX2, mitochondrial isoform X4 OS=Cucumis melo ... [more]
Match NameE-valueIdentityDescription
XP_023530366.10.0e+0088.64CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 [Cucurbita pepo ... [more]
XP_022933080.10.0e+0088.50CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 [Cucurbita mosch... [more]
XP_022997640.10.0e+0088.48CLP protease regulatory subunit CLPX2, mitochondrial isoform X2 [Cucurbita maxim... [more]
XP_023530359.10.0e+0088.34CLP protease regulatory subunit CLPX2, mitochondrial isoform X1 [Cucurbita pepo ... [more]
XP_022933079.12.8e-30988.20CLP protease regulatory subunit CLPX2, mitochondrial isoform X1 [Cucurbita mosch... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0051082unfolded protein binding
GO:0005524ATP binding
Vocabulary: Biological Process
TermDefinition
GO:0006457protein folding
Vocabulary: INTERPRO
TermDefinition
IPR027417P-loop_NTPase
IPR004487Clp_protease_ATP-bd_su_ClpX
IPR003959ATPase_AAA_core
IPR003593AAA+_ATPase
IPR019489Clp_ATPase_C
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006457 protein folding
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005759 mitochondrial matrix
molecular_function GO:0005524 ATP binding
molecular_function GO:0008233 peptidase activity
molecular_function GO:0051082 unfolded protein binding
molecular_function GO:0004176 ATP-dependent peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
BhiUN159M30BhiUN159M30mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR019489Clp ATPase, C-terminalSMARTSM01086ClpB_D2_small_2coord: 482..576
e-value: 2.0E-27
score: 107.1
IPR019489Clp ATPase, C-terminalPFAMPF10431ClpB_D2-smallcoord: 482..559
e-value: 7.8E-16
score: 57.8
IPR003593AAA+ ATPase domainSMARTSM00382AAA_5coord: 271..402
e-value: 4.0E-9
score: 46.3
IPR003959ATPase, AAA-type, corePFAMPF07724AAA_2coord: 271..475
e-value: 1.5E-44
score: 152.0
IPR004487Clp protease, ATP-binding subunit ClpXTIGRFAMTIGR00382TIGR00382coord: 204..572
e-value: 3.6E-152
score: 505.1
IPR004487Clp protease, ATP-binding subunit ClpXPANTHERPTHR11262HSL AND CLP PROTEASEcoord: 19..590
NoneNo IPR availableGENE3DG3DSA:1.10.8.60coord: 481..580
e-value: 1.4E-40
score: 139.6
NoneNo IPR availableGENE3DG3DSA:3.40.50.300coord: 201..480
e-value: 2.5E-110
score: 370.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 176..195
NoneNo IPR availablePANTHERPTHR11262:SF15CLP PROTEASE REGULATORY SUBUNIT CLPX2, MITOCHONDRIALcoord: 19..590
NoneNo IPR availableCDDcd00009AAAcoord: 272..376
e-value: 2.06668E-13
score: 67.1711
IPR027417P-loop containing nucleoside triphosphate hydrolaseSUPERFAMILYSSF52540P-loop containing nucleoside triphosphate hydrolasescoord: 208..553