Bhi01G000923 (gene) Wax gourd (B227) v1

Overview
Name: Bhi01G000923
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr1: 25611281 .. 25620718 (-)
RNA-Seq Expression: Bhi01G000923
Synteny: Bhi01G000923
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTCACTTTCGTTCATTCTCCAACCCTCTCTGGTTCTCCAATCCTCCATCTTTGTAGCAGCCTCGAAGCCGCACATCGTCACCCACTCGCAAGTAGATCTTTTCTTCTCTTCCTCACTCCCTGCGGCAGGGCAGCCACACCCATCCCAACTCTTCTCCGTCTTTGATTTCTTTCTTCATCCATGCCAAATCTACAATATTTATTCATTTGCTTTTGAGTTTTCACATATTCATTCTTCTTCTTTTCTGGCAGAAAAATTAGTTGGGTGATTGCAGAGATGTAAGTATGTTTATGTTTCAAAAGCTTTGGCTTCCATTTGCAAAATATGTATTATCGCCTTTGCAAAATTTTTAAACCAACTGTTTCAGTTTGAGATGAAAATGGGAAGTCATGGATATTACTTAACTAGAATTCCTATCACTAGACGATTCAGAACATACCATTTTTCATTTATTCCCACCTCGATCCCATTTTGCTCTGATTCTACTTCTACCCAGAACCAGAACAAACCCAATCAAATTGAACAGGTTCTTAGTCATCAGGAAGTAACATTGAGCACCACCAAGAAACAACATTCTAACCCTTCAGAACCTTTGTGTCATGAATTAGTCCAAAAGTTTCGGATTTTACTTCAACAGGAGCGCACGGGTGCTGCCAAGAGGCTCATTAAGTCAGTAATTCTCTCCAAATCTCCTTTCTCATCACCTTGTGATCTTATTGAAGTTTTTTCTGTCCATTCTCCATCCTTGAAGCATGTCTTTTCAAATATGTTGTTTACGGCACTTTTGGATCTCAATATGACTGATGATGCCATAAGATTATACACCTCAATGAAGAAAAATGGTGTTGTTCCTGCTGTGGCTACCCTCAATATTTTATTTAAACTATTAATGTCTTCAAAAGAGTTTAAAAAAACGCTTGACTTCTTTTCTGAACTTGTTGAATCTGGTTTTCAACCAGATAGTTTCATGTATGGTAAGGCGGTTGAGGCGGCAATAAAACTAGGGGAGATGAATAAGGCATGTGATTTGGTCTGTTGCATGAAGAAGATAGGGATTAACCCTACTGTGTTTGTTTACAATGTGATAATTTCTGGCTTTTGCAAAGAGAAGAAGATAATTGATGCACAGAAGATATTTGATGAAATGATCAACAACGTGTCCCCAAATTTGGTTACTTATAATACAATTATCAATGGATACTGTAAGGCGGGAAAACTGGATAAAGCTTTTAGTTTGAAGGAGAGGATGAAGCATGAAAATTTGGGGCCTAATCTTGTGACATATAATTCTTTGCTTAGTGGGCTCTGTCAGGCAAAGCAGATGGAGGAAGCCAAAAAGCTATTGCATGAGATGGAAACTTATGGGTTTGCACCAGATGGATTTACCTATAGCATACTTTTTGATGGGTATTTGAGGTCTGGTGGTGGTGAAGCCTCAATTGTTCAATATGAAGAAGCAATGAAAAAAGGGGTGAAAATGAATAAATATACTACTTGTATTTTGTTGAATGGGTTATGTAAAGATGGGAAGGTGGAAAAAGCAGAAGAGATTCTGACGAAACTAATGATGAATGGATTGGTTCCAGATGAAATGATTTTTAATGTACTAGTGGATGGGTACTGCCGAAAAGGAAATATTGATGGTGCTATATCAACTATCCAAAGGATGGAAAATCAAGGCTTAACACCCAGTTGCATCACTTTTAATTCGTTGATCAATAAATTTTGTGAAATAAAAGAGTTGGACAAGGCAGAGGAATGGTTGAGGAAGATGATAGGGAAGGAAATTTGCCCTAGCATTGAAACCTTTAACATCCTTCTTGATGGTTATGGACGGGTGTGTCTTTTTGATAGATGTTTCCAAGTTCTCGAAGAAATGGAAAGTAAAGGGATAAAGCCAAATGTAGTAAGCTATGGAACTCTCATTAATTGTCTCTGCAAGGTTGGTAGATTTGTTGAAGCTGAAGTAGTTTTTGCTGATATGGATGGTAAAGGAGTTT
TTCCAAACGCTCAAATTTATAATATGCTGATTGATTATAATTGCACATCAGGGAAGATGCAAGGTGCTTTCAAGATTTTTGATGAAATGATTGATAGGGACATCACTCCAACACTTGCAACATACAATTCACTCATCAATGGACTGTGCAAGTTAGGGAGGATGATTGAAGCAGAAAAGCTAGTCAACCAAATTACAAACAGTGGTTTTACACCTGATTTGATCACATACAACTCCCTTATTACTGGTTATTGTAGTTCTGGGAACCCCCAAAAAGGTCTCGAGTTATATGAAACTATGAAAAAGCAAGGCATCAATCCTACATTAATAACATATCATCTTTTAATATCTGGATGTAGCAAGGCTGGTTTAGATACTGTAGAGAAACTGTTCAGTGAAATGTTGCACATGGATCTTGCTCCAGATAGAGGTATCTATAATGCACTGATTTTTTGCTATATAGAAAACAGAGATGTTCAAAAGGCATTTGTTTTGTACAAAAAGATGATAGATGAAGGAGTCCAACGAGACAAGATGACTTATAACAGCTTGATTTTAGGATGCTTGAGAGATGGTAAGGTTACAGAAGTACGGAAACTTGTTGAGGATATGAAGGCTAGGGGGTTGACCCCTAAGGCTGACACTTACAATATCGTAGTTAAGGGACTTTGTGAACTTGGTGATTATAGCGAGGCACATGCATGGTATAAAGAGATGTTCAAAAACAAGTTTTTGTTAAATTCCTCCGTTTGCAATCAACTCATTGATGGTCTTAAGAGAGAGGGGAGGTTTCAAGAAGCCCAACTTATTTTGTCTGAGATGTATGTCAAAGGACTGAATGTCTTGAATTTGAGTAACGAGCCTTCTGCATCTATGACTATGTAGGTGAAACACGTGATATGAAGAAACCTATGAACACATATGCTATCACTGTGCTGAAAAATGTTTCTACAGTGGGATTACCTTTCCATGGTTCATGGCCACAACTATTTTGTCATGATCATTTCTTCCTCATGTTTTAAATGTGACGATGCAGCTGTCGGGTCACTTGGTAAGCAGGATCCTCTTCTTAGCTTTGATTTTGTACTGACTCATTAACCTTATCGTTTATTTTTTCTGTGATTCAGACAAATGAGCTGTATAGACTTCTTGCTGTGACGGTGATGTGTTGCTTGCTGTAGCCTGTATGCTTGGGTAAGGCCTAGCCTGTATGCTTGGGTAAGGCCATATGTCTAATTTTATATGCTTTGGTTACTGATGGATTCATATTAATTCTACATTAAAAGGACGTTCCTTGCTGTTCGATGAGTCATTTCCCAATTCTCATTTACCTTTTTCTTTAAAATCAATTCTTAGATTGATTAATGATGCCTTTAAGCCAACATCATCATTCTTAAAACACTGGTTATTTCTCAAATTCCTTTCTAATCTCCAATTGGAGATCTATGTTGTAAACTTGTAATCATCTATAGGTGATGGGGTTTTCCCTTTATTTCGTTTATCAATGAAATGTTTTTTTTCCAAAAAAAAAAATGTTGTGGCTGCAACAGAGTACATAACATTTACTCGAGGAGCATCTTTCAAGCACTCAGTGATGTCTTGACTGTAAATATTGAAGTCTTGAAGTCAAGTTGATTACTTATGAATTATGAGCATGGTGAGAGCTTGGATTTAGATAATAGATAAAGAAATGGCAACTTATGTGATTGTCAATATATATGATTGAGAGGCCTCGAGACATGAAAATATGAAGTTAAATCTAAAATATATGAGCTTGGAAAGTGTCATGAAATGAAATTGAATAAAGGCTGTTTTGCTTTGAAATGTTGGACCTACTTCTACTATTGTTCAATTGCTCCTTAATTATTTAAAACCTATTTGGATCGACTTTCAAAAGTGTTTCCACTTACAATAAAAGTGATTTTAAGCACTTAGAAAATCAATCCAAACATGCTCTTAAATCATTTTACGCGAGTCATATTCCCTCTTACATTAGGCTTC
TAAATTGATGTAAGGCTTTGTAAAGTTTGGTGATCTCTTCCTTTTCAGGTGGAACTAATCTGCAACTAATTTGCAACTTTTTTCCTTTCAAGTATTCGATTGTTCATTCGTGTACATTTTTTGTGGAGCCATTCGGATATTTTGATAGCATCTATCATGGTGGTTGTGTTCGTCAAGACAGCAGTTGGCAACATTGAACCACACGCCATTTAAATATTGTACGGAGACTTCTTTTCTACTAAGGTTATAATTATTGATTTTCTTGAAGAATTTATTTTCTTAATATTTTAATCTGATTATTTTTCTTGACCCTTAGGTTGGGGTCGTGCTTGCCAGAATTGGAGAATTGGATTTGTTCTCCAAAGCTGTGCTCCAAATCTTCAGCTCACCAAGTTTATCTCTTGCACTCTCTTCATTTCAGATTTTCTTGAAAAATTTAGATCGACGATTTTCTCTCCTTCACATAGGAAAAGGAAGACGATAGGTTGTTCACATAGCACATATTTAGTTTTTTTCTTAGTGTGGAATCATAAATAACAATAAATACTGGATATATCAATGAGCAAGGTGTTGGATTAATGTTATTGCAACAGTATACATGCCAACATAAACATAATTGAATTGACATAACCCAAACTTAAAAAGAGAAAAAAATACAGTATTTATAAATTTCAACGTTTTGCTCTACTGGCCACTCAAACATAAATGTTCCAATGATAAAAAAAAGTTGCCTCCACAATCCTTAGGTTCGCCTTTAGATTTTAAGGAACCTAATGGTGTTTAGGCCCCCAATTTAAGTGGTTGGAGTAAGCTATTATAACCTATCCCATCTCACTCTTTCCCTTTTTGTTATGGTATCTACTATTTCTTATTTATATGAAAATAGTTTGCACCCCAAACACAGATTAATATAACATAGACTAAAATATTTTGCATTCCAGACACAAACTACTATAACTTATAGATTATTATAACCTACAAATTATAATAACTACTGATTATAATAATTAACCTGGCGCATCAAACACCACCCCTATAATGTTTCCTTTTAGTAGTCCTCTACGTCGAAATGTTAACAATGTAGCTTACCTTCTTACATCAAGGACAGTGCGGGCATAAAGCCCACTCTTTCCAAAGGAAGCTTACAGAAGGAAAACTCATAGGGAGGAGTGAGAGTAATTAGTGAACAATGTCTTTTCCGGCCAAAGGTACCACAAAGTTGGAGCTTTTCAGGTGAGATCCTATAGAGTCCAAAACAAGAAGGAAAAAAGAAAGGCATTGAGGAGCTGGGAAAAGTAGAACTTCCATCCAAAGGAACTGAAAAGAAGAAACCAAATGGAGTGTGCAAATTTGCAATGGAGGAAAAAGAAATATGTGGTTTCCACCCTATTTTCTCACGTTATACACCAACTTGGTCGATTGTTGTGAAGGTTTCTAGCTTGTGAGCTTTGTCTTTAGATATAGATACTAGTGTGGATGAACCACCATGAGTTGGCCTACCATAAAAGAACATCACCTCATTAAAGAATTAAGAGGTCATGGGTTCAATCCATAGTGATCACCTACCTTGGATATATATTATAAGTTTCCTTTGACACCCAAGTGTTGTAGGGTCAAACGGGGTTGTTCCAAGATATTAGTCAGGGTACACATAAACTAGCTTGTGCCTCTTTTAAGGCACCTTGAAGGTTTATTTGCCAAAATATTAGTCTGGTGGCTTTTTATATTTGTCGTAGGCTTGGTGATGGTTCTTCTACTTCTTTCTGGCATGACTGGCTTAGCTGTGGTATCCTATCAGCTATGTTTCTGCGACTTTATCGCCTTGCTCAATAGTCAGAGGCTTTTGTGGCTATCTTGTGGAACTCATCCTCTCATGCTTGAGATCTTAATCTTCGCCGTAATCTTACTGAGTTGGAAATTGCGGAATGGACTACTTTGTCGTATTTGTCGTTTTTCAAGTTACGCTCTTCCCCTGATTCCTAGATTTGGCATCTGCATCC
TTCTCAGGTATTTTCTGTTAAATCCCTTATGGTTGACTTGATGGATATTGGGACTTTGGCTTCCTCGGATCTTTACTAGATGATTTAGAAAGATAGCTATGCTAAGCGAATTAAATTTTTTTTTGGGAACTTTGCTTGGGTGCCATCAATACTTCTGACCATTTGTAGCTAAGGGTGGTCATTCAAACCACAAAAACCGAACTGAACCGAACTGAAAACTCAGTGGAATCGAAAAATGTGGTTCGATCCGATTTGAAAATTTTTTGAAACCGAATTAATTTGGTTCGGTTTAGTTTCACTTTCGAAAACCGAATTTGAAACCGAACCGAACCATTTTTAACTAATATATATATATATATATATTTATTTATATTTAAAAAAAGAGTAAAACCGAATTTAAAATCGAACTACTTTTAATTTAAAGAATAATATCGAACCAAACTGTTTTTTTTAATTAAAAAAATATAACATGGATTTTTTAAAAATGCCAAAACCGATTTTGAAATCGAACCGAACCGCATATATGTGGTTTGGTTCGGTTTGATTTGAACCAATAAATCGATTCGGTTCTGTTTCACATTTTAAAAGAATTGGTTCGGTTCGTTTTGACCATCAAACCGAATTGAACCGTTTTCACCCCTATTTGTAGCATCGTCTTCCAAGTATGTGTACCTGTGAAACCTAGAATTCCATATTCCTTTTTTATACAGGAAATAAAAGGGATTAACTAAGTATGAGTTAGATCCCAAGTTAAAGAGTAGGAAAAATCTAAGTGTTTAAATAGAATTTACTAAGTAGCTTTGGAAATAAGCTTTAACTGGTGAAAATTTGGTTAAGTATGAATCAAAACAGGACTATGCGTTGGAGGCTAGGGAAAAAAAATATATTAGACAAATGTGTTGGCCATAAATGTTTGAGAGGTTATTTGGGGCAAAAGAGGTTGACGTATGGCCGAGAGGAAAGGCTATACGGGGATAAGCATGCGGTAGCCTATGTTTGAGAGGTTAGCAGAGCTTGTGGTACCCTCAAGTTGTGCGGACATAGGCTCAAGGCACGGGAGATACACACGCATTGATCGTGTATGCGGCGATGGTATGCGCACGCATTGATCGTGTATGCGGCGATTGATCATGTATGCGGCGATGCTAGACGATGGATACGGCGATGCTACGCGAGGGTATGTGGTGATGCTACGCGAGGGTATGCGGCGATGTTACGCAATGGATGCGGCGATGTTACACAATGGATGCGGCGATGCTATGCGATGGATGCGGCGACCTTGCTAAAGGGTGAGCTAGGCGATGAAAGAAGCTATGATGGACGCATGGTGAATTGTGCTTGCGTTGCTAAGGCTTGCGGTAATAAGGGTTATGAGGTTGGTTGCGTTGGTCATAAGGGATGCGTTAGCAAGAACCTAGGCGATGGTGAGCTATGCTATGGGGTGTTAAGTTGAGAACCAAGTTTGACCCAAAGGTTGTTATGCGTTGGTGATGTGCTATGTGCCGAGGACATCCAAGGCATGTGGACGCATAGGACAAGGCTTGAGCAAATAGTATGCGGTAGAAGAGGGTGGTCATAATCCTAGTTGAGCGCATAAGGCAAAGGTAAGCCATGCGGTAGAGGGCATGCGGTCAGGGATGAGCTTTGCGAGCATCTAGCATATATGACCAAGTTACGTTAATGGAATGCACAAGGTAAGGTTGCGTTAATTGAAAATGGCACAACCAAGGAAAGTATGTCCCGTACAGGGAAAGGTGCTAGACGGCAATGACTTAAAGGGCGTTATGTGGAGATTTAAAGGACACCACATGGATGCAAAGATAGGATGGATGCATGCGCTAATTGGTGGTGTCCAGACATAAGAATGTTGCGGCAATTGGATGGGCGAATTTGTGGTTAGATGACCGCATATGAAGTTGGATGAACGCATATGAGGGAGGATGGACGCATTCAAGGTCGTCATCCAGACATGTGGCTTGTTAGGAAGAAGTCTTAGGCCT
TAAGTAAATGAGGTGGGTGCACAAGAGACATACAATTGTAAAGGCTATGTGTTGGCTAAAGAAAGGAGAAGACACCTAGGTTTGCGATGATGTAAGATTGCGTTGATAGTGTGAGGAAAAGACACATCAAAGAGGAGGTGGAAGGTGCTGGGCGTTGGCAGAAGATGAGGAAAACACCTGGATAGGATCAATGGAAGAGGTGTCGGGCAAAGGAGAAGGGTTAGACGCTTATAAAAGGGGCAGGGCGCCTCACTTCAACCCGCAGATCATTAAAGATACGAGATATTCATAGAGGGAAGGAAGGCATTGAATGGGCGGAGGAACAAGCAGAGGTGCCGAAGAAAGCCAGGTGTATGGACAAACGCATAGGCTAACTTCAAGCAGGAAGTTTTTCCAAACTACTTGTGCAAATCAGGTGATTTTTAGACTAAAAGTTTCTAAATTAAATGTAGTTTCATTGTGAGGGCTGCCTTAATGAAATTCTAATTCTAAGGCTGTCAAATTCAAGGAAAACCGAGAGGTGCTCACAATTGAGGAAACAGTCGGGCCAGGGGGAGTAAACTTGAGACTCGGAACTTTCACCAAGGAATTTCAAGGTGAAACTTTAAGGTAACATAGTTAAGACATTTTCACACATTTTTGAAGAAGGAATTGTCTTAAGATAAGGCCCAAAAGAGTGAGTAATCTAAGCCTGAAGTTAAATCTGAAATAGGGTCCAGCGAAAGAATAAGGGAAACCGGGTGAACTGGGAAGAAAGATCTCAGCAAGTCACTGTGAGTGGTTTCTTCTTTTTCTTTTTTAAGCATATTTTGAATTGATATGCTGGAGTTTATATTATGAAATGAGTATGACGTTGGAATGGTTAGGGGGTCTGTGGACCTAGAGCCTGAGCCTAAAAGAAATCCTAACAGGATGGTCAGGGGACCAAGGAGTCTAGTTCCTGAGTCTGTGGGCTGGAAAAATCTTGAATGGGATGGTTAGAGGGCCGACGGGCCTAGCTTCTAAGCTTACATGAAAGGGAGAATTATGTGCACATAGGTTGATATTTTTAGATATGAATGTTAACTATCTACCGTGTGTGTAGATAGGTTTGTGACAAACTTCATGGTTTGGTGGAGGTTTGCATTGGAACCTCGTGATAATACTTGTGATCTGTATAAGACTTGTATTTGTAATATCACTCACTGAGCTTTGTAAAGCTCATGGTTTGTTTTATGTGTTTCTCTCGCAGGTAGCAGATGAGTCCAGGTTGCGAAGTTGGTCAAGGTCTGCCACAAGTTTAGAAATAGTACTAGATGAGATTAGTAATCGGGGTAGTGTCAATTATGTAAATGCAATTTGAATTTGTAAATTAGCTTTCTAAATAAAGCATGTAAGAATATTTTTCCGCTGCTGTAATAAATAAGGTTCTAAAGGAAAACTTGCAAGTTAATAAAAA

mRNA sequence

GTCACTTTCGTTCATTCTCCAACCCTCTCTGGTTCTCCAATCCTCCATCTTTGTAGCAGCCTCGAAGCCGCACATCGTCACCCACTCGCAAGTAGATCTTTTCTTCTCTTCCTCACTCCCTGCGGCAGGGCAGCCACACCCATCCCAACTCTTCTCCGTCTTTGATTTCTTTCTTCATCCATGCCAAATCTACAATATTTATTCATTTGCTTTTGAGTTTTCACATATTCATTCTTCTTCTTTTCTGGCAGAAAAATTAGTTGGGTGATTGCAGAGATGTAAGTATGTTTATGTTTCAAAAGCTTTGGCTTCCATTTGCAAAATATGTATTATCGCCTTTGCAAAATTTTTAAACCAACTGTTTCAGTTTGAGATGAAAATGGGAAGTCATGGATATTACTTAACTAGAATTCCTATCACTAGACGATTCAGAACATACCATTTTTCATTTATTCCCACCTCGATCCCATTTTGCTCTGATTCTACTTCTACCCAGAACCAGAACAAACCCAATCAAATTGAACAGGTTCTTAGTCATCAGGAAGTAACATTGAGCACCACCAAGAAACAACATTCTAACCCTTCAGAACCTTTGTGTCATGAATTAGTCCAAAAGTTTCGGATTTTACTTCAACAGGAGCGCACGGGTGCTGCCAAGAGGCTCATTAAGTCAGTAATTCTCTCCAAATCTCCTTTCTCATCACCTTGTGATCTTATTGAAGTTTTTTCTGTCCATTCTCCATCCTTGAAGCATGTCTTTTCAAATATGTTGTTTACGGCACTTTTGGATCTCAATATGACTGATGATGCCATAAGATTATACACCTCAATGAAGAAAAATGGTGTTGTTCCTGCTGTGGCTACCCTCAATATTTTATTTAAACTATTAATGTCTTCAAAAGAGTTTAAAAAAACGCTTGACTTCTTTTCTGAACTTGTTGAATCTGGTTTTCAACCAGATAGTTTCATGTATGGTAAGGCGGTTGAGGCGGCAATAAAACTAGGGGAGATGAATAAGGCATGTGATTTGGTCTGTTGCATGAAGAAGATAGGGATTAACCCTACTGTGTTTGTTTACAATGTGATAATTTCTGGCTTTTGCAAAGAGAAGAAGATAATTGATGCACAGAAGATATTTGATGAAATGATCAACAACGTGTCCCCAAATTTGGTTACTTATAATACAATTATCAATGGATACTGTAAGGCGGGAAAACTGGATAAAGCTTTTAGTTTGAAGGAGAGGATGAAGCATGAAAATTTGGGGCCTAATCTTGTGACATATAATTCTTTGCTTAGTGGGCTCTGTCAGGCAAAGCAGATGGAGGAAGCCAAAAAGCTATTGCATGAGATGGAAACTTATGGGTTTGCACCAGATGGATTTACCTATAGCATACTTTTTGATGGGTATTTGAGGTCTGGTGGTGGTGAAGCCTCAATTGTTCAATATGAAGAAGCAATGAAAAAAGGGGTGAAAATGAATAAATATACTACTTGTATTTTGTTGAATGGGTTATGTAAAGATGGGAAGGTGGAAAAAGCAGAAGAGATTCTGACGAAACTAATGATGAATGGATTGGTTCCAGATGAAATGATTTTTAATGTACTAGTGGATGGGTACTGCCGAAAAGGAAATATTGATGGTGCTATATCAACTATCCAAAGGATGGAAAATCAAGGCTTAACACCCAGTTGCATCACTTTTAATTCGTTGATCAATAAATTTTGTGAAATAAAAGAGTTGGACAAGGCAGAGGAATGGTTGAGGAAGATGATAGGGAAGGAAATTTGCCCTAGCATTGAAACCTTTAACATCCTTCTTGATGGTTATGGACGGGTGTGTCTTTTTGATAGATGTTTCCAAGTTCTCGAAGAAATGGAAAGTAAAGGGATAAAGCCAAATGTAGTAAGCTATGGAACTCTCATTAATTGTCTCTGCAAGGTTGGTAGATTTGTTGAAGCTGAAGTAGTTTTTGCTGATATGGATGGTAAAGGAGTTT
TTCCAAACGCTCAAATTTATAATATGCTGATTGATTATAATTGCACATCAGGGAAGATGCAAGGTGCTTTCAAGATTTTTGATGAAATGATTGATAGGGACATCACTCCAACACTTGCAACATACAATTCACTCATCAATGGACTGTGCAAGTTAGGGAGGATGATTGAAGCAGAAAAGCTAGTCAACCAAATTACAAACAGTGGTTTTACACCTGATTTGATCACATACAACTCCCTTATTACTGGTTATTGTAGTTCTGGGAACCCCCAAAAAGGTCTCGAGTTATATGAAACTATGAAAAAGCAAGGCATCAATCCTACATTAATAACATATCATCTTTTAATATCTGGATGTAGCAAGGCTGGTTTAGATACTGTAGAGAAACTGTTCAGTGAAATGTTGCACATGGATCTTGCTCCAGATAGAGGTATCTATAATGCACTGATTTTTTGCTATATAGAAAACAGAGATGTTCAAAAGGCATTTGTTTTGTACAAAAAGATGATAGATGAAGGAGTCCAACGAGACAAGATGACTTATAACAGCTTGATTTTAGGATGCTTGAGAGATGGTAAGGTTACAGAAGTACGGAAACTTGTTGAGGATATGAAGGCTAGGGGGTTGACCCCTAAGGCTGACACTTACAATATCGTAGTTAAGGGACTTTGTGAACTTGGTGATTATAGCGAGGCACATGCATGGTATAAAGAGATGTTCAAAAACAAGTTTTTGTTAAATTCCTCCGTTTGCAATCAACTCATTGATGGTCTTAAGAGAGAGGGGAGGTTTCAAGAAGCCCAACTTATTTTGTCTGAGATGTATGTCAAAGGACTGAATGTCTTGAATTTGAGTAACGAGCCTTCTGCATCTATGACTATGTAGGTGAAACACGTGATATGAAGAAACCTATGAACACATATGCTATCACTGTGCTGAAAAATGTTTCTACAGTGGGATTACCTTTCCATGGTTCATGGCCACAACTATTTTGTCATGATCATTTCTTCCTCATGTTTTAAATGTGACGATGCAGCTGTCGGGTCACTTGACAAATGAGCTGTATAGACTTCTTGCTGTGACGGTGATGTGTTGCTTGCTGTAGCCTGTATGCTTGGGTAAGGCCTAGCCTGTATGCTTGGGTGGAACTAATCTGCAACTAATTTGCAACTTTTTTCCTTTCAAGTATTCGATTGTTCATTCGTGTACATTTTTTGTGGAGCCATTCGGATATTTTGATAGCATCTATCATGGTGGTTGTGTTCGTCAAGACAGCAGTTGGCAACATTGAACCACACGCCATTTAAATATTGTTGGGGTCGTGCTTGCCAGAATTGGAGAATTGGATTTGTTCTCCAAAGCTGTGCTCCAAATCTTCAGCTCACCAAGTAGCAGATGAGTCCAGGTTGCGAAGTTGGTCAAGGTCTGCCACAAGTTTAGAAATAGTACTAGATGAGATTAGTAATCGGGGTAGTGTCAATTATGTAAATGCAATTTGAATTTGTAAATTAGCTTTCTAAATAAAGCATGTAAGAATATTTTTCCGCTGCTGTAATAAATAAGGTTCTAAAGGAAAACTTGCAAGTTAATAAAAA

Coding sequence (CDS)

ATGAAAATGGGAAGTCATGGATATTACTTAACTAGAATTCCTATCACTAGACGATTCAGAACATACCATTTTTCATTTATTCCCACCTCGATCCCATTTTGCTCTGATTCTACTTCTACCCAGAACCAGAACAAACCCAATCAAATTGAACAGGTTCTTAGTCATCAGGAAGTAACATTGAGCACCACCAAGAAACAACATTCTAACCCTTCAGAACCTTTGTGTCATGAATTAGTCCAAAAGTTTCGGATTTTACTTCAACAGGAGCGCACGGGTGCTGCCAAGAGGCTCATTAAGTCAGTAATTCTCTCCAAATCTCCTTTCTCATCACCTTGTGATCTTATTGAAGTTTTTTCTGTCCATTCTCCATCCTTGAAGCATGTCTTTTCAAATATGTTGTTTACGGCACTTTTGGATCTCAATATGACTGATGATGCCATAAGATTATACACCTCAATGAAGAAAAATGGTGTTGTTCCTGCTGTGGCTACCCTCAATATTTTATTTAAACTATTAATGTCTTCAAAAGAGTTTAAAAAAACGCTTGACTTCTTTTCTGAACTTGTTGAATCTGGTTTTCAACCAGATAGTTTCATGTATGGTAAGGCGGTTGAGGCGGCAATAAAACTAGGGGAGATGAATAAGGCATGTGATTTGGTCTGTTGCATGAAGAAGATAGGGATTAACCCTACTGTGTTTGTTTACAATGTGATAATTTCTGGCTTTTGCAAAGAGAAGAAGATAATTGATGCACAGAAGATATTTGATGAAATGATCAACAACGTGTCCCCAAATTTGGTTACTTATAATACAATTATCAATGGATACTGTAAGGCGGGAAAACTGGATAAAGCTTTTAGTTTGAAGGAGAGGATGAAGCATGAAAATTTGGGGCCTAATCTTGTGACATATAATTCTTTGCTTAGTGGGCTCTGTCAGGCAAAGCAGATGGAGGAAGCCAAAAAGCTATTGCATGAGATGGAAACTTATGGGTTTGCACCAGATGGATTTACCTATAGCATACTTTTTGATGGGTATTTGAGGTCTGGTGGTGGTGAAGCCTCAATTGTTCAATATGAAGAAGCAATGAAAAAAGGGGTGAAAATGAATAAATATACTACTTGTATTTTGTTGAATGGGTTATGTAAAGATGGGAAGGTGGAAAAAGCAGAAGAGATTCTGACGAAACTAATGATGAATGGATTGGTTCCAGATGAAATGATTTTTAATGTACTAGTGGATGGGTACTGCCGAAAAGGAAATATTGATGGTGCTATATCAACTATCCAAAGGATGGAAAATCAAGGCTTAACACCCAGTTGCATCACTTTTAATTCGTTGATCAATAAATTTTGTGAAATAAAAGAGTTGGACAAGGCAGAGGAATGGTTGAGGAAGATGATAGGGAAGGAAATTTGCCCTAGCATTGAAACCTTTAACATCCTTCTTGATGGTTATGGACGGGTGTGTCTTTTTGATAGATGTTTCCAAGTTCTCGAAGAAATGGAAAGTAAAGGGATAAAGCCAAATGTAGTAAGCTATGGAACTCTCATTAATTGTCTCTGCAAGGTTGGTAGATTTGTTGAAGCTGAAGTAGTTTTTGCTGATATGGATGGTAAAGGAGTTTTTCCAAACGCTCAAATTTATAATATGCTGATTGATTATAATTGCACATCAGGGAAGATGCAAGGTGCTTTCAAGATTTTTGATGAAATGATTGATAGGGACATCACTCCAACACTTGCAACATACAATTCACTCATCAATGGACTGTGCAAGTTAGGGAGGATGATTGAAGCAGAAAAGCTAGTCAACCAAATTACAAACAGTGGTTTTACACCTGATTTGATCACATACAACTCCCTTATTACTGGTTATTGTAGTTCTGGGAACCCCCAAAAAGGTCTCGAGTTATATGAAACTATGAAAAAGCAAGGCATCAATCCTACATTAATAACATATCATCTTTTAATATCTGGATGTAGCAAGGCTGGTTTAGA
TACTGTAGAGAAACTGTTCAGTGAAATGTTGCACATGGATCTTGCTCCAGATAGAGGTATCTATAATGCACTGATTTTTTGCTATATAGAAAACAGAGATGTTCAAAAGGCATTTGTTTTGTACAAAAAGATGATAGATGAAGGAGTCCAACGAGACAAGATGACTTATAACAGCTTGATTTTAGGATGCTTGAGAGATGGTAAGGTTACAGAAGTACGGAAACTTGTTGAGGATATGAAGGCTAGGGGGTTGACCCCTAAGGCTGACACTTACAATATCGTAGTTAAGGGACTTTGTGAACTTGGTGATTATAGCGAGGCACATGCATGGTATAAAGAGATGTTCAAAAACAAGTTTTTGTTAAATTCCTCCGTTTGCAATCAACTCATTGATGGTCTTAAGAGAGAGGGGAGGTTTCAAGAAGCCCAACTTATTTTGTCTGAGATGTATGTCAAAGGACTGAATGTCTTGAATTTGAGTAACGAGCCTTCTGCATCTATGACTATGTAG

Protein sequence

MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTLSTTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSVHSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKKTLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCKEKKIIDAQKIFDEMINNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISGCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM
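
The protein sequence above should follow from the coding sequence under the standard genetic code. The following is a minimal sketch of that check, assuming the standard nuclear codon table (NCBI translation table 1); the function name `translate` and the placeholder `X` for unrecognized codons are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove any whitespace or line breaks inside the pasted sequence.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X' (an assumption).
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGAAAATGGGAAGTCATGGATATTACTTAACTAGA"))  # MKMGSHGYYLTR
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.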
Homology
BLAST of Bhi01G000923 vs. TAIR 10
Match: AT5G12100.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 759.2 bits (1959), Expect = 3.4e-219
Identity = 371/775 (47.87%), Positives = 544/775 (70.19%), Query Frame = 0

Query: 63  TKKQHSNPSEPLC-HELVQKFRILLQQERTGAAKRLIKSVILSKS-PFSSPCDLIEVFSV 122
           ++ + + P+ P+   E ++  R+LLQQ R   A+ ++ S++ S S PF+SP +L   FS+
Sbjct: 42  SQPEQAPPTNPVTGDEKLRNLRVLLQQNRIETARGVLSSLLRSDSTPFASPKELFSAFSL 101

Query: 123 HSPSLKHVFSNMLFTALL-DLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFK 182
            SPSLKH FS +L + LL +  M  +A  L+ +++  G+ P+  +L +L   L+ +K+F+
Sbjct: 102 SSPSLKHDFSYLLLSVLLNESKMISEAADLFFALRNEGIYPSSDSLTLLLDHLVKTKQFR 161

Query: 183 KTLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVII 242
            T++ F  ++ES F+P  FMYGKA++AA+KL ++ K  +L   MK   I P+VF+YNV+I
Sbjct: 162 VTINVFLNILESDFRPSKFMYGKAIQAAVKLSDVGKGLELFNRMKHDRIYPSVFIYNVLI 221

Query: 243 SGFCKEKKIIDAQKIFDEMI-NNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLG 302
            G CK K++ DA+++FDEM+   + P+L+TYNT+I+GYCKAG  +K+F ++ERMK +++ 
Sbjct: 222 DGLCKGKRMNDAEQLFDEMLARRLLPSLITYNTLIDGYCKAGNPEKSFKVRERMKADHIE 281

Query: 303 PNLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQ 362
           P+L+T+N+LL GL +A  +E+A+ +L EM+  GF PD FT+SILFDGY  +   EA++  
Sbjct: 282 PSLITFNTLLKGLFKAGMVEDAENVLKEMKDLGFVPDAFTFSILFDGYSSNEKAEAALGV 341

Query: 363 YEEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCR 422
           YE A+  GVKMN YT  ILLN LCK+GK+EKAEEIL + M  GLVP+E+I+N ++DGYCR
Sbjct: 342 YETAVDSGVKMNAYTCSILLNALCKEGKIEKAEEILGREMAKGLVPNEVIYNTMIDGYCR 401

Query: 423 KGNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIET 482
           KG++ GA   I+ ME QG+ P  + +N LI +FCE+ E++ AE+ + KM  K + PS+ET
Sbjct: 402 KGDLVGARMKIEAMEKQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVET 461

Query: 483 FNILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMD 542
           +NIL+ GYGR   FD+CF +L+EME  G  PNVVSYGTLINCLCK  + +EA++V  DM+
Sbjct: 462 YNILIGGYGRKYEFDKCFDILKEMEDNGTMPNVVSYGTLINCLCKGSKLLEAQIVKRDME 521

Query: 543 GKGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMI 602
            +GV P  +IYNMLID  C+ GK++ AF+   EM+ + I   L TYN+LI+GL   G++ 
Sbjct: 522 DRGVSPKVRIYNMLIDGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLS 581

Query: 603 EAEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLI 662
           EAE L+ +I+  G  PD+ TYNSLI+GY  +GN Q+ + LYE MK+ GI PTL TYHLLI
Sbjct: 582 EAEDLLLEISRKGLKPDVFTYNSLISGYGFAGNVQRCIALYEEMKRSGIKPTLKTYHLLI 641

Query: 663 SGCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQR 722
           S C+K G++  E+LF E   M L PD  +YN ++ CY  + D++KAF L K+MI++ +  
Sbjct: 642 SLCTKEGIELTERLFGE---MSLKPDLLVYNGVLHCYAVHGDMEKAFNLQKQMIEKSIGL 701

Query: 723 DKMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWY 782
           DK TYNSLILG L+ GK+ EVR L+++M AR + P+ADTYNI+VKG CE+ DY  A+ WY
Sbjct: 702 DKTTYNSLILGQLKVGKLCEVRSLIDEMNAREMEPEADTYNIIVKGHCEVKDYMSAYVWY 761

Query: 783 KEMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSAS 834
           +EM +  FLL+  + N+L+ GLK E R +EA++++SEM  + L  + +  + SA+
Sbjct: 762 REMQEKGFLLDVCIGNELVSGLKEEWRSKEAEIVISEMNGRMLGDVTVDEDLSAT 813
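
The percentages in each HSP summary line are the identical (or positively scoring) positions divided by the alignment length. A minimal sketch of recomputing them from the summary line, assuming the line layout shown in this record; the helper `parse_hsp_stats` is hypothetical, and it accepts either spelling of the "Positives" label.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity/positives percentages from a BLAST HSP summary line."""
    match = re.search(
        r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line
    )
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length, positives, pos_length = map(int, match.groups())
    return {
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 371/775 (47.87%), Positives = 544/775 (70.19%), Query Frame = 0"
)
print(stats)
```

For the AT5G12100.1 hit this gives 371/775 = 47.87% identity and 544/775 = 70.19% positives, matching the reported values.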

BLAST of Bhi01G000923 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 352.1 bits (902), Expect = 1.3e-96
Identity = 201/695 (28.92%), Positives = 344/695 (49.50%), Query Frame = 0

Query: 131 NMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKKTLDFFSELVE 190
           N+L   L      + +  L   M+K+G  P + T N +         FK  ++    +  
Sbjct: 237 NILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKS 296

Query: 191 SGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCKEKKIID 250
            G   D   Y   +    +   + K   L+  M+K  I+P    YN +I+GF  E K++ 
Sbjct: 297 KGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLI 356

Query: 251 AQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVTYNSLLS 310
           A ++ +EM++  +SPN VT+N +I+G+   G   +A  +   M+ + L P+ V+Y  LL 
Sbjct: 357 ASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLD 416

Query: 311 GLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAMKKGVKM 370
           GLC+  + + A+     M+  G      TY+ + DG  ++G  + ++V   E  K G+  
Sbjct: 417 GLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDP 476

Query: 371 NKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNIDGAISTI 430
           +  T   L+NG CK G+ + A+EI+ ++   GL P+ +I++ L+   CR G +  AI   
Sbjct: 477 DIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIY 536

Query: 431 QRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILLDGYGRV 490
           + M  +G T    TFN L+   C+  ++ +AEE++R M    I P+  +F+ L++GYG  
Sbjct: 537 EAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNS 596

Query: 491 CLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVFPNAQIY 550
               + F V +EM   G  P   +YG+L+  LCK G   EAE     +       +  +Y
Sbjct: 597 GEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMY 656

Query: 551 NMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKLVNQITN 610
           N L+   C SG +  A  +F EM+ R I P   TY SLI+GLC+ G+ + A     +   
Sbjct: 657 NTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAEA 716

Query: 611 SG-FTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISGCSKAG-LD 670
            G   P+ + Y   + G   +G  + G+   E M   G  P ++T + +I G S+ G ++
Sbjct: 717 RGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIE 776

Query: 671 TVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMTYNSLI 730
               L  EM + +  P+   YN L+  Y + +DV  +F+LY+ +I  G+  DK+T +SL+
Sbjct: 777 KTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLV 836

Query: 731 LGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFL 790
           LG      +    K+++    RG+     T+N+++   C  G+ + A    K M      
Sbjct: 837 LGICESNMLEIGLKILKAFICRGVEVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGIS 896

Query: 791 LNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLN 823
           L+   C+ ++  L R  RFQE++++L EM  +G++
Sbjct: 897 LDKDTCDAMVSVLNRNHRFQESRMVLHEMSKQGIS 931

BLAST of Bhi01G000923 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 345.5 bits (885), Expect = 1.2e-94
Identity = 204/668 (30.54%), Positives = 345/668 (51.65%), Query Frame = 0

Query: 164 TLNIL--FKL-----LMSSKEFKKTL-DFFSELVESGFQ-------PDSFMYGKAVEAAI 223
           TL+IL  FKL     +++     KTL D ++ LV    Q         S ++   V++  
Sbjct: 86  TLHILTKFKLYKTAQILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYS 145

Query: 224 KLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCKEKKIID-AQKIFDEMI-NNVSPNL 283
           +L  ++KA  +V   +  G  P V  YN ++    + K+ I  A+ +F EM+ + VSPN+
Sbjct: 146 RLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNV 205

Query: 284 VTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVTYNSLLSGLCQAKQMEEAKKLLHE 343
            TYN +I G+C AG +D A +L ++M+ +   PN+VTYN+L+ G C+ +++++  KLL  
Sbjct: 206 FTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRS 265

Query: 344 METYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAMKKGVKMNKYTTCILLNGLCKDGK 403
           M   G  P+  +Y+                                   +++NGLC++G+
Sbjct: 266 MALKGLEPNLISYN-----------------------------------VVINGLCREGR 325

Query: 404 VEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNIDGAISTIQRMENQGLTPSCITFNS 463
           +++   +LT++   G   DE+ +N L+ GYC++GN   A+     M   GLTPS IT+ S
Sbjct: 326 MKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTS 385

Query: 464 LINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILLDGYGRVCLFDRCFQVLEEMESKG 523
           LI+  C+   +++A E+L +M  + +CP+  T+  L+DG+ +    +  ++VL EM   G
Sbjct: 386 LIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNG 445

Query: 524 IKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVFPNAQIYNMLIDYNCTSGKMQGAF 583
             P+VV+Y  LIN  C  G+  +A  V  DM  KG+ P+   Y+ ++   C S  +  A 
Sbjct: 446 FSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEAL 505

Query: 584 KIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKLVNQITNSGFTPDLITYNSLITGY 643
           ++  EM+++ I P   TY+SLI G C+  R  EA  L  ++   G  PD  TY +LI  Y
Sbjct: 506 RVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAY 565

Query: 644 CSSGNPQKGLELYETMKKQGINPTLITYHLLISGCSK-AGLDTVEKLFSEMLHMDLAPDR 703
           C  G+ +K L+L+  M ++G+ P ++TY +LI+G +K +     ++L  ++ + +  P  
Sbjct: 566 CMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSD 625

Query: 704 GIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMTYNSLILGCLRDGKVTEVRKLVED 763
             Y+ L    IEN     + + +K ++            SLI G    G +TE  ++ E 
Sbjct: 626 VTYHTL----IEN----CSNIEFKSVV------------SLIKGFCMKGMMTEADQVFES 685

Query: 764 MKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFLLNSSVCNQLIDGLKREGR 814
           M  +   P    YNI++ G C  GD  +A+  YKEM K+ FLL++     L+  L +EG+
Sbjct: 686 MLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIALVKALHKEGK 698

BLAST of Bhi01G000923 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 317.0 bits (811), Expect = 4.5e-86
Identity = 193/657 (29.38%), Positives = 326/657 (49.62%), Query Frame = 0

Query: 185 FSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCK 244
           + E++E    P+ + Y K V    KLG + +A   V  + + G++P  F Y  +I G+C+
Sbjct: 206 YMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQ 265

Query: 245 EKKIIDAQKIFDEM-INNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVT 304
            K +  A K+F+EM +     N V Y  +I+G C A ++D+A  L  +MK +   P + T
Sbjct: 266 RKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRT 325

Query: 305 YNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAM 364
           Y  L+  LC +++  EA  L+ EME  G  P+  TY++L D                   
Sbjct: 326 YTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLID------------------- 385

Query: 365 KKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNID 424
                            LC   K EKA E+L +++  GL+P+ + +N L++GYC++G I+
Sbjct: 386 ----------------SLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIE 445

Query: 425 GAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILL 484
            A+  ++ ME++ L+P+  T+N LI  +C+   + KA   L KM+ +++ P + T+N L+
Sbjct: 446 DAVDVVELMESRKLSPNTRTYNELIKGYCK-SNVHKAMGVLNKMLERKVLPDVVTYNSLI 505

Query: 485 DGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVF 544
           DG  R   FD  +++L  M  +G+ P+  +Y ++I+ LCK  R  EA  +F  ++ KGV 
Sbjct: 506 DGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVN 565

Query: 545 PNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKL 604
           PN  +Y  LID  C +GK+  A  + ++M+ ++  P   T+N+LI+GLC  G++ EA  L
Sbjct: 566 PNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLL 625

Query: 605 VNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLI-SGCS 664
             ++   G  P + T   LI      G+       ++ M   G  P   TY   I + C 
Sbjct: 626 EEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCR 685

Query: 665 KAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMT 724
           +  L   E + ++M    ++PD   Y++LI  Y +      AF + K+M D G +  + T
Sbjct: 686 EGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHT 745

Query: 725 YNSLILGCL------RDGKVTE------------VRKLVEDMKARGLTPKADTYNIVVKG 784
           + SLI   L      + G   E            V +L+E M    +TP A +Y  ++ G
Sbjct: 746 FLSLIKHLLEMKYGKQKGSEPELCAMSNMMEFDTVVELLEKMVEHSVTPNAKSYEKLILG 805

Query: 785 LCELGDYSEAHAWYKEMFKNKFLLNSS-VCNQLIDGLKREGRFQEAQLILSEMYVKG 821
           +CE+G+   A   +  M +N+ +  S  V N L+    +  +  EA  ++ +M   G
Sbjct: 806 ICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVG 826

BLAST of Bhi01G000923 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 314.3 bits (804), Expect = 2.9e-85
Identity = 170/542 (31.37%), Positives = 292/542 (53.87%), Query Frame = 0

Query: 281 KLDKAFSLKERMKHENLGPNLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYS 340
           KLD A +L   M      P+++ ++ LLS + +  + +    L  +M+  G   + +TYS
Sbjct: 61  KLDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYS 120

Query: 341 ILFDGYLRSGGGEASIVQYEEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMN 400
           IL + + R      ++    + MK G + N  T   LLNG C   ++ +A  ++ ++ + 
Sbjct: 121 ILINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVT 180

Query: 401 GLVPDEMIFNVLVDGYCRKGNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKA 460
           G  P+ + FN L+ G         A++ I RM  +G  P  +T+  ++N  C+  + D A
Sbjct: 181 GYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLA 240

Query: 461 EEWLRKMIGKEICPSIETFNILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINC 520
              L KM   ++ P +  +N ++DG  +    D    + +EME+KGI+PNVV+Y +LI+C
Sbjct: 241 FNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISC 300

Query: 521 LCKVGRFVEAEVVFADMDGKGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPT 580
           LC  GR+ +A  + +DM  + + P+   ++ LID     GK+  A K++DEM+ R I P+
Sbjct: 301 LCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPS 360

Query: 581 LATYNSLINGLCKLGRMIEAEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYE 640
           + TY+SLING C   R+ EA+++   + +    PD++TYN+LI G+C     ++G+E++ 
Sbjct: 361 IVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFR 420

Query: 641 TMKKQGINPTLITYHLLISGCSKAG-LDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENR 700
            M ++G+    +TY++LI G  +AG  D  +++F EM+   + P+   YN L+    +N 
Sbjct: 421 EMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNG 480

Query: 701 DVQKAFVLYKKMIDEGVQRDKM-----TYNSLILGCLRDGKVTEVRKLVEDMKARGLTPK 760
            ++KA V++     E +QR KM     TYN +I G  + GKV +   L  ++  +G+ P 
Sbjct: 481 KLEKAMVVF-----EYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPD 540

Query: 761 ADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILS 817
              YN ++ G C  G   EA A +KEM ++  L NS   N LI    R+G  + +  ++ 
Sbjct: 541 VVAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIK 597

BLAST of Bhi01G000923 vs. ExPASy Swiss-Prot
Match: Q9FMQ1 (Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g12100 PE=2 SV=1)

HSP 1 Score: 759.2 bits (1959), Expect = 4.8e-218
Identity = 371/775 (47.87%), Positives = 544/775 (70.19%), Query Frame = 0

Query: 63  TKKQHSNPSEPLC-HELVQKFRILLQQERTGAAKRLIKSVILSKS-PFSSPCDLIEVFSV 122
           ++ + + P+ P+   E ++  R+LLQQ R   A+ ++ S++ S S PF+SP +L   FS+
Sbjct: 42  SQPEQAPPTNPVTGDEKLRNLRVLLQQNRIETARGVLSSLLRSDSTPFASPKELFSAFSL 101

Query: 123 HSPSLKHVFSNMLFTALL-DLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFK 182
            SPSLKH FS +L + LL +  M  +A  L+ +++  G+ P+  +L +L   L+ +K+F+
Sbjct: 102 SSPSLKHDFSYLLLSVLLNESKMISEAADLFFALRNEGIYPSSDSLTLLLDHLVKTKQFR 161

Query: 183 KTLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVII 242
            T++ F  ++ES F+P  FMYGKA++AA+KL ++ K  +L   MK   I P+VF+YNV+I
Sbjct: 162 VTINVFLNILESDFRPSKFMYGKAIQAAVKLSDVGKGLELFNRMKHDRIYPSVFIYNVLI 221

Query: 243 SGFCKEKKIIDAQKIFDEMI-NNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLG 302
            G CK K++ DA+++FDEM+   + P+L+TYNT+I+GYCKAG  +K+F ++ERMK +++ 
Sbjct: 222 DGLCKGKRMNDAEQLFDEMLARRLLPSLITYNTLIDGYCKAGNPEKSFKVRERMKADHIE 281

Query: 303 PNLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQ 362
           P+L+T+N+LL GL +A  +E+A+ +L EM+  GF PD FT+SILFDGY  +   EA++  
Sbjct: 282 PSLITFNTLLKGLFKAGMVEDAENVLKEMKDLGFVPDAFTFSILFDGYSSNEKAEAALGV 341

Query: 363 YEEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCR 422
           YE A+  GVKMN YT  ILLN LCK+GK+EKAEEIL + M  GLVP+E+I+N ++DGYCR
Sbjct: 342 YETAVDSGVKMNAYTCSILLNALCKEGKIEKAEEILGREMAKGLVPNEVIYNTMIDGYCR 401

Query: 423 KGNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIET 482
           KG++ GA   I+ ME QG+ P  + +N LI +FCE+ E++ AE+ + KM  K + PS+ET
Sbjct: 402 KGDLVGARMKIEAMEKQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVET 461

Query: 483 FNILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMD 542
           +NIL+ GYGR   FD+CF +L+EME  G  PNVVSYGTLINCLCK  + +EA++V  DM+
Sbjct: 462 YNILIGGYGRKYEFDKCFDILKEMEDNGTMPNVVSYGTLINCLCKGSKLLEAQIVKRDME 521

Query: 543 GKGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMI 602
            +GV P  +IYNMLID  C+ GK++ AF+   EM+ + I   L TYN+LI+GL   G++ 
Sbjct: 522 DRGVSPKVRIYNMLIDGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLS 581

Query: 603 EAEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLI 662
           EAE L+ +I+  G  PD+ TYNSLI+GY  +GN Q+ + LYE MK+ GI PTL TYHLLI
Sbjct: 582 EAEDLLLEISRKGLKPDVFTYNSLISGYGFAGNVQRCIALYEEMKRSGIKPTLKTYHLLI 641

Query: 663 SGCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQR 722
           S C+K G++  E+LF E   M L PD  +YN ++ CY  + D++KAF L K+MI++ +  
Sbjct: 642 SLCTKEGIELTERLFGE---MSLKPDLLVYNGVLHCYAVHGDMEKAFNLQKQMIEKSIGL 701

Query: 723 DKMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWY 782
           DK TYNSLILG L+ GK+ EVR L+++M AR + P+ADTYNI+VKG CE+ DY  A+ WY
Sbjct: 702 DKTTYNSLILGQLKVGKLCEVRSLIDEMNAREMEPEADTYNIIVKGHCEVKDYMSAYVWY 761

Query: 783 KEMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSAS 834
           +EM +  FLL+  + N+L+ GLK E R +EA++++SEM  + L  + +  + SA+
Sbjct: 762 REMQEKGFLLDVCIGNELVSGLKEEWRSKEAEIVISEMNGRMLGDVTVDEDLSAT 813
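
The two summary lines that open each hit (the "HSP 1 Score: ..." line and the "Identity = ..." line) follow a regular pattern, so they can be read programmatically. The following is a minimal Python sketch, not part of the database's own tooling: the function name `parse_hsp` and the dictionary keys are hypothetical, and the patterns assume the exact layout shown on this page. The percentages are recomputed from the reported counts (for example, 371/775 gives 47.87%), and the similarity label is matched under either spelling.

```python
import re

# Assumed layout: "HSP 1 Score: 759.2 bits (1959), Expect = 4.8e-218"
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
# Assumed layout: "Identity = 371/775 (47.87%), Positives = 544/775 (70.19%)"
# "Posi?tives" also accepts the variant spelling "Postives".
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse the two HSP summary lines into a dict (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognized HSP summary lines")
    identical = int(i.group(1))
    aligned_length = int(i.group(2))
    positives = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "expect": float(s.group(3)),
        # Recompute the percentages from the counts rather than trusting the text.
        "identity_pct": round(100 * identical / aligned_length, 2),
        "positives_pct": round(100 * positives / aligned_length, 2),
    }


if __name__ == "__main__":
    hit = parse_hsp(
        "HSP 1 Score: 759.2 bits (1959), Expect = 4.8e-218",
        "Identity = 371/775 (47.87%), Positives = 544/775 (70.19%), Query Frame = 0",
    )
    print(hit)
```

Recomputing the percentages from the counts gives a simple consistency check against the values printed in the report.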

BLAST of Bhi01G000923 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 352.1 bits (902), Expect = 1.8e-95
Identity = 201/695 (28.92%), Positives = 344/695 (49.50%), Query Frame = 0

Query: 131 NMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKKTLDFFSELVE 190
           N+L   L      + +  L   M+K+G  P + T N +         FK  ++    +  
Sbjct: 197 NILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKS 256

Query: 191 SGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCKEKKIID 250
            G   D   Y   +    +   + K   L+  M+K  I+P    YN +I+GF  E K++ 
Sbjct: 257 KGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLI 316

Query: 251 AQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVTYNSLLS 310
           A ++ +EM++  +SPN VT+N +I+G+   G   +A  +   M+ + L P+ V+Y  LL 
Sbjct: 317 ASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLD 376

Query: 311 GLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAMKKGVKM 370
           GLC+  + + A+     M+  G      TY+ + DG  ++G  + ++V   E  K G+  
Sbjct: 377 GLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDP 436

Query: 371 NKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNIDGAISTI 430
           +  T   L+NG CK G+ + A+EI+ ++   GL P+ +I++ L+   CR G +  AI   
Sbjct: 437 DIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIY 496

Query: 431 QRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILLDGYGRV 490
           + M  +G T    TFN L+   C+  ++ +AEE++R M    I P+  +F+ L++GYG  
Sbjct: 497 EAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNS 556

Query: 491 CLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVFPNAQIY 550
               + F V +EM   G  P   +YG+L+  LCK G   EAE     +       +  +Y
Sbjct: 557 GEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMY 616

Query: 551 NMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKLVNQITN 610
           N L+   C SG +  A  +F EM+ R I P   TY SLI+GLC+ G+ + A     +   
Sbjct: 617 NTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAEA 676

Query: 611 SG-FTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISGCSKAG-LD 670
            G   P+ + Y   + G   +G  + G+   E M   G  P ++T + +I G S+ G ++
Sbjct: 677 RGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIE 736

Query: 671 TVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMTYNSLI 730
               L  EM + +  P+   YN L+  Y + +DV  +F+LY+ +I  G+  DK+T +SL+
Sbjct: 737 KTNDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLV 796

Query: 731 LGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFL 790
           LG      +    K+++    RG+     T+N+++   C  G+ + A    K M      
Sbjct: 797 LGICESNMLEIGLKILKAFICRGVEVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGIS 856

Query: 791 LNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLN 823
           L+   C+ ++  L R  RFQE++++L EM  +G++
Sbjct: 857 LDKDTCDAMVSVLNRNHRFQESRMVLHEMSKQGIS 891

BLAST of Bhi01G000923 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 1.7e-93
Identity = 204/668 (30.54%), Positives = 345/668 (51.65%), Query Frame = 0

Query: 164 TLNIL--FKL-----LMSSKEFKKTL-DFFSELVESGFQ-------PDSFMYGKAVEAAI 223
           TL+IL  FKL     +++     KTL D ++ LV    Q         S ++   V++  
Sbjct: 86  TLHILTKFKLYKTAQILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYS 145

Query: 224 KLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCKEKKIID-AQKIFDEMI-NNVSPNL 283
           +L  ++KA  +V   +  G  P V  YN ++    + K+ I  A+ +F EM+ + VSPN+
Sbjct: 146 RLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNV 205

Query: 284 VTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVTYNSLLSGLCQAKQMEEAKKLLHE 343
            TYN +I G+C AG +D A +L ++M+ +   PN+VTYN+L+ G C+ +++++  KLL  
Sbjct: 206 FTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRS 265

Query: 344 METYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAMKKGVKMNKYTTCILLNGLCKDGK 403
           M   G  P+  +Y+                                   +++NGLC++G+
Sbjct: 266 MALKGLEPNLISYN-----------------------------------VVINGLCREGR 325

Query: 404 VEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNIDGAISTIQRMENQGLTPSCITFNS 463
           +++   +LT++   G   DE+ +N L+ GYC++GN   A+     M   GLTPS IT+ S
Sbjct: 326 MKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTS 385

Query: 464 LINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILLDGYGRVCLFDRCFQVLEEMESKG 523
           LI+  C+   +++A E+L +M  + +CP+  T+  L+DG+ +    +  ++VL EM   G
Sbjct: 386 LIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNG 445

Query: 524 IKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVFPNAQIYNMLIDYNCTSGKMQGAF 583
             P+VV+Y  LIN  C  G+  +A  V  DM  KG+ P+   Y+ ++   C S  +  A 
Sbjct: 446 FSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEAL 505

Query: 584 KIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKLVNQITNSGFTPDLITYNSLITGY 643
           ++  EM+++ I P   TY+SLI G C+  R  EA  L  ++   G  PD  TY +LI  Y
Sbjct: 506 RVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAY 565

Query: 644 CSSGNPQKGLELYETMKKQGINPTLITYHLLISGCSK-AGLDTVEKLFSEMLHMDLAPDR 703
           C  G+ +K L+L+  M ++G+ P ++TY +LI+G +K +     ++L  ++ + +  P  
Sbjct: 566 CMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSD 625

Query: 704 GIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMTYNSLILGCLRDGKVTEVRKLVED 763
             Y+ L    IEN     + + +K ++            SLI G    G +TE  ++ E 
Sbjct: 626 VTYHTL----IEN----CSNIEFKSVV------------SLIKGFCMKGMMTEADQVFES 685

Query: 764 MKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFLLNSSVCNQLIDGLKREGR 814
           M  +   P    YNI++ G C  GD  +A+  YKEM K+ FLL++     L+  L +EG+
Sbjct: 686 MLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLHTVTVIALVKALHKEGK 698

BLAST of Bhi01G000923 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 6.4e-85
Identity = 193/657 (29.38%), Positives = 326/657 (49.62%), Query Frame = 0

Query: 185 FSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGFCK 244
           + E++E    P+ + Y K V    KLG + +A   V  + + G++P  F Y  +I G+C+
Sbjct: 206 YMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQ 265

Query: 245 EKKIIDAQKIFDEM-INNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNLVT 304
            K +  A K+F+EM +     N V Y  +I+G C A ++D+A  L  +MK +   P + T
Sbjct: 266 RKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRT 325

Query: 305 YNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEEAM 364
           Y  L+  LC +++  EA  L+ EME  G  P+  TY++L D                   
Sbjct: 326 YTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLID------------------- 385

Query: 365 KKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGNID 424
                            LC   K EKA E+L +++  GL+P+ + +N L++GYC++G I+
Sbjct: 386 ----------------SLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIE 445

Query: 425 GAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNILL 484
            A+  ++ ME++ L+P+  T+N LI  +C+   + KA   L KM+ +++ P + T+N L+
Sbjct: 446 DAVDVVELMESRKLSPNTRTYNELIKGYCK-SNVHKAMGVLNKMLERKVLPDVVTYNSLI 505

Query: 485 DGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKGVF 544
           DG  R   FD  +++L  M  +G+ P+  +Y ++I+ LCK  R  EA  +F  ++ KGV 
Sbjct: 506 DGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVN 565

Query: 545 PNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAEKL 604
           PN  +Y  LID  C +GK+  A  + ++M+ ++  P   T+N+LI+GLC  G++ EA  L
Sbjct: 566 PNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLL 625

Query: 605 VNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLI-SGCS 664
             ++   G  P + T   LI      G+       ++ M   G  P   TY   I + C 
Sbjct: 626 EEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCR 685

Query: 665 KAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKMT 724
           +  L   E + ++M    ++PD   Y++LI  Y +      AF + K+M D G +  + T
Sbjct: 686 EGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHT 745

Query: 725 YNSLILGCL------RDGKVTE------------VRKLVEDMKARGLTPKADTYNIVVKG 784
           + SLI   L      + G   E            V +L+E M    +TP A +Y  ++ G
Sbjct: 746 FLSLIKHLLEMKYGKQKGSEPELCAMSNMMEFDTVVELLEKMVEHSVTPNAKSYEKLILG 805

Query: 785 LCELGDYSEAHAWYKEMFKNKFLLNSS-VCNQLIDGLKREGRFQEAQLILSEMYVKG 821
           +CE+G+   A   +  M +N+ +  S  V N L+    +  +  EA  ++ +M   G
Sbjct: 806 ICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVG 826

BLAST of Bhi01G000923 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 314.3 bits (804), Expect = 4.1e-84
Identity = 170/542 (31.37%), Positives = 292/542 (53.87%), Query Frame = 0

Query: 281 KLDKAFSLKERMKHENLGPNLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYS 340
           KLD A +L   M      P+++ ++ LLS + +  + +    L  +M+  G   + +TYS
Sbjct: 61  KLDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYS 120

Query: 341 ILFDGYLRSGGGEASIVQYEEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMN 400
           IL + + R      ++    + MK G + N  T   LLNG C   ++ +A  ++ ++ + 
Sbjct: 121 ILINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVT 180

Query: 401 GLVPDEMIFNVLVDGYCRKGNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKA 460
           G  P+ + FN L+ G         A++ I RM  +G  P  +T+  ++N  C+  + D A
Sbjct: 181 GYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLA 240

Query: 461 EEWLRKMIGKEICPSIETFNILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINC 520
              L KM   ++ P +  +N ++DG  +    D    + +EME+KGI+PNVV+Y +LI+C
Sbjct: 241 FNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISC 300

Query: 521 LCKVGRFVEAEVVFADMDGKGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPT 580
           LC  GR+ +A  + +DM  + + P+   ++ LID     GK+  A K++DEM+ R I P+
Sbjct: 301 LCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPS 360

Query: 581 LATYNSLINGLCKLGRMIEAEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYE 640
           + TY+SLING C   R+ EA+++   + +    PD++TYN+LI G+C     ++G+E++ 
Sbjct: 361 IVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFR 420

Query: 641 TMKKQGINPTLITYHLLISGCSKAG-LDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENR 700
            M ++G+    +TY++LI G  +AG  D  +++F EM+   + P+   YN L+    +N 
Sbjct: 421 EMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNG 480

Query: 701 DVQKAFVLYKKMIDEGVQRDKM-----TYNSLILGCLRDGKVTEVRKLVEDMKARGLTPK 760
            ++KA V++     E +QR KM     TYN +I G  + GKV +   L  ++  +G+ P 
Sbjct: 481 KLEKAMVVF-----EYLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPD 540

Query: 761 ADTYNIVVKGLCELGDYSEAHAWYKEMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILS 817
              YN ++ G C  G   EA A +KEM ++  L NS   N LI    R+G  + +  ++ 
Sbjct: 541 VVAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIRARLRDGDREASAELIK 597

BLAST of Bhi01G000923 vs. NCBI nr
Match: XP_038880496.1 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Benincasa hispida] >XP_038880500.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Benincasa hispida] >XP_038880503.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Benincasa hispida] >XP_038880509.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Benincasa hispida])

HSP 1 Score: 1689.1 bits (4373), Expect = 0.0e+00
Identity = 836/836 (100.00%), Positives = 836/836 (100.00%), Query Frame = 0

Query: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60
           MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL
Sbjct: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60

Query: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120
           STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV
Sbjct: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120

Query: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180
           HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK
Sbjct: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180

Query: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240
           TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS
Sbjct: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240

Query: 241 GFCKEKKIIDAQKIFDEMINNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPN 300
           GFCKEKKIIDAQKIFDEMINNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPN
Sbjct: 241 GFCKEKKIIDAQKIFDEMINNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPN 300

Query: 301 LVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYE 360
           LVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYE
Sbjct: 301 LVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYE 360

Query: 361 EAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKG 420
           EAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKG
Sbjct: 361 EAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKG 420

Query: 421 NIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFN 480
           NIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFN
Sbjct: 421 NIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFN 480

Query: 481 ILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGK 540
           ILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGK
Sbjct: 481 ILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGK 540

Query: 541 GVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEA 600
           GVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEA
Sbjct: 541 GVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEA 600

Query: 601 EKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISG 660
           EKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISG
Sbjct: 601 EKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISG 660

Query: 661 CSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDK 720
           CSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDK
Sbjct: 661 CSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDK 720

Query: 721 MTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKE 780
           MTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKE
Sbjct: 721 MTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKE 780

Query: 781 MFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM 837
           MFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM
Sbjct: 781 MFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM 836

BLAST of Bhi01G000923 vs. NCBI nr
Match: XP_022922939.1 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita moschata])

HSP 1 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 710/837 (84.83%), Positives = 770/837 (92.00%), Query Frame = 0

Query: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60
           + MG HGYYL+RIP+ RRFRT + +FIP+S PFCSDST    QN PNQIEQV SHQEVTL
Sbjct: 16  LTMGRHGYYLSRIPVIRRFRTQYLAFIPSSNPFCSDST----QNNPNQIEQVPSHQEVTL 75

Query: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120
           S+T+ QH N  EPLCHELVQK RILLQQERTGAAKRLIKS+ILSKSPFSSP DLI +FSV
Sbjct: 76  SSTRNQHPNHPEPLCHELVQKLRILLQQERTGAAKRLIKSIILSKSPFSSPFDLIALFSV 135

Query: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180
           HSP+LKHVFSNML  A LDL M DDAIRL TSMK+NGVVPAVATL++LFKLLMSSKEF+K
Sbjct: 136 HSPNLKHVFSNMLLMAFLDLKMLDDAIRLCTSMKENGVVPAVATLSVLFKLLMSSKEFRK 195

Query: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240
           TLDFFSELVESG +PDSFMYGKAVEAAIKLGE N+A DL+CCMKKIGI+PTVFV NV+IS
Sbjct: 196 TLDFFSELVESGIRPDSFMYGKAVEAAIKLGETNRAFDLMCCMKKIGISPTVFVSNVLIS 255

Query: 241 GFCKEKKIIDAQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGP 300
           G CKEKK+IDAQK+FDEMIN N+SPN VTYNTIINGYCK GKLDKAFSLKERMK ENL P
Sbjct: 256 GLCKEKKLIDAQKMFDEMINTNLSPNSVTYNTIINGYCKVGKLDKAFSLKERMKQENLEP 315

Query: 301 NLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQY 360
           NLVTYNSLLSGLCQAKQMEE KKLL EMETYGFAPDGFTYSILFDGYLRSG  EASIV Y
Sbjct: 316 NLVTYNSLLSGLCQAKQMEETKKLLREMETYGFAPDGFTYSILFDGYLRSGDDEASIVLY 375

Query: 361 EEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRK 420
           EE +KKGV++NKYT CILLNGLCKDGKVEKAEEILTKLMM+GLVPDE++FNVLVDGYCRK
Sbjct: 376 EETVKKGVRINKYTCCILLNGLCKDGKVEKAEEILTKLMMDGLVPDEILFNVLVDGYCRK 435

Query: 421 GNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETF 480
           GNIDGAISTIQRMENQGLTP+CITFNSLINKFCE+KE+DKAEEWLRKMI +E+CPSIET+
Sbjct: 436 GNIDGAISTIQRMENQGLTPNCITFNSLINKFCEMKEMDKAEEWLRKMIAREVCPSIETY 495

Query: 481 NILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDG 540
           NILLDGYGR CLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCK GRFVEAEV+FADMDG
Sbjct: 496 NILLDGYGRACLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKGGRFVEAEVIFADMDG 555

Query: 541 KGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIE 600
           KGV PNAQIYNMLID  CT GKMQ AFKIFDEMIDR+ITPTL+TYNSLINGLCK GR+IE
Sbjct: 556 KGVLPNAQIYNMLIDCKCTLGKMQDAFKIFDEMIDRNITPTLSTYNSLINGLCKKGRVIE 615

Query: 601 AEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLIS 660
           AE+LVNQITNS  TPD+ITYNSLI G+CSSG+PQKGLE+YET+KKQGINPTLITYH+LIS
Sbjct: 616 AEELVNQITNSSLTPDVITYNSLILGHCSSGDPQKGLEIYETLKKQGINPTLITYHVLIS 675

Query: 661 GCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRD 720
           GCS+ GLDTVEKLF EML MDLAPDR +YNA+I+CYIEN DVQKAFVLYKKMIDE V+ D
Sbjct: 676 GCSEVGLDTVEKLFGEMLLMDLAPDRVVYNAMIYCYIENGDVQKAFVLYKKMIDERVELD 735

Query: 721 KMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYK 780
           KMTYNSLILGCLRDGKVTEVRKLV+DMKARGL+PK DTYNI+VKGLCELGDY EAH WYK
Sbjct: 736 KMTYNSLILGCLRDGKVTEVRKLVDDMKARGLSPKGDTYNILVKGLCELGDYGEAHVWYK 795

Query: 781 EMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM 837
           EMFKN FLLN+SVCNQLIDGLKREGRFQEA LILSEM+VKGL+V N+SN+P+ASMTM
Sbjct: 796 EMFKNHFLLNASVCNQLIDGLKREGRFQEAHLILSEMHVKGLSVWNMSNQPAASMTM 848

BLAST of Bhi01G000923 vs. NCBI nr
Match: XP_023552569.1 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita pepo subsp. pepo] >XP_023552570.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1444.9 bits (3739), Expect = 0.0e+00
Identity = 711/837 (84.95%), Positives = 768/837 (91.76%), Query Frame = 0

Query: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60
           + MG HGYYL+RIP+ RRFRT + +FIP+S PFCSD T    QN PNQIEQV SHQEVTL
Sbjct: 16  LTMGRHGYYLSRIPVIRRFRTQYLAFIPSSNPFCSDYT----QNNPNQIEQVPSHQEVTL 75

Query: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120
           STT+ QH N  EPLCHELVQK RILLQQERTGAAKRLIKS+ILSKSPFSSPCDLI +FSV
Sbjct: 76  STTRNQHPNHPEPLCHELVQKLRILLQQERTGAAKRLIKSIILSKSPFSSPCDLIALFSV 135

Query: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180
           HSP+LKHVFSNML  A LDL M DDAIRL TSMK+NGVVPAVATL++LFKLLMSSKEF+K
Sbjct: 136 HSPNLKHVFSNMLLMAFLDLKMLDDAIRLCTSMKENGVVPAVATLSVLFKLLMSSKEFRK 195

Query: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240
           TLDFFSELVESG +PDSFMYGKAVEAAIKLGE N+A DL+CCMKKIGI+PTVFV NV+IS
Sbjct: 196 TLDFFSELVESGIRPDSFMYGKAVEAAIKLGETNRAFDLMCCMKKIGISPTVFVSNVLIS 255

Query: 241 GFCKEKKIIDAQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGP 300
           G CKEKK+IDAQK+FDEMIN N+SPN VTYNTIINGYCK GKLDKAFSLKERMK ENL P
Sbjct: 256 GLCKEKKLIDAQKMFDEMINTNLSPNSVTYNTIINGYCKVGKLDKAFSLKERMKQENLEP 315

Query: 301 NLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQY 360
           NLVTYNSLLSGLCQAKQMEEAKKLL EMETYGFAPDGFTYSILFDGYLRSG  EASIV Y
Sbjct: 316 NLVTYNSLLSGLCQAKQMEEAKKLLREMETYGFAPDGFTYSILFDGYLRSGDDEASIVLY 375

Query: 361 EEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRK 420
           EE +KKGV++NKYT CILLNGLCKDGKVEKAEEIL KLMM+GLVPDE+ FNVLVDGYCRK
Sbjct: 376 EETVKKGVRINKYTCCILLNGLCKDGKVEKAEEILMKLMMDGLVPDEIHFNVLVDGYCRK 435

Query: 421 GNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETF 480
           GNI+GAISTIQRMENQGLTP+CITFNSLIN FCE+KE+DKAEEWLRKMI +E+CPSIET+
Sbjct: 436 GNINGAISTIQRMENQGLTPNCITFNSLINTFCEMKEMDKAEEWLRKMIAREVCPSIETY 495

Query: 481 NILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDG 540
           NILLDGYGR CLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCK GRFVEAEV+FADMDG
Sbjct: 496 NILLDGYGRACLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKGGRFVEAEVIFADMDG 555

Query: 541 KGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIE 600
           KGV PNAQIYNMLID  CT GKMQ AFKIFDEMIDR+ITPTL+TYNSLINGLCK GR+IE
Sbjct: 556 KGVLPNAQIYNMLIDCKCTLGKMQDAFKIFDEMIDRNITPTLSTYNSLINGLCKKGRVIE 615

Query: 601 AEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLIS 660
           AE+LVNQITNSG TPD+ITYNSLI G+CSSGNPQKGLE+YET+KKQGINPTLITYH+LIS
Sbjct: 616 AEELVNQITNSGLTPDVITYNSLILGHCSSGNPQKGLEIYETLKKQGINPTLITYHVLIS 675

Query: 661 GCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRD 720
           GCS+ GLDTVEKLF EML MDLAPDR +YNA+I+CYIEN DVQKAFVLYKKMIDE V+ D
Sbjct: 676 GCSEVGLDTVEKLFGEMLLMDLAPDRVVYNAMIYCYIENGDVQKAFVLYKKMIDERVELD 735

Query: 721 KMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYK 780
           KMTYNSLILGCLRDGKVTEVRKLV+DMKARGL+PK DTYNI+VKGLCELGDY EAH WYK
Sbjct: 736 KMTYNSLILGCLRDGKVTEVRKLVDDMKARGLSPKGDTYNILVKGLCELGDYGEAHVWYK 795

Query: 781 EMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM 837
           EMFKN FLLN+SVCNQLIDGLKREGRFQEA LILSEM+VKGL V N+SN+P+ASMTM
Sbjct: 796 EMFKNHFLLNASVCNQLIDGLKREGRFQEAHLILSEMHVKGLCVWNMSNQPAASMTM 848

BLAST of Bhi01G000923 vs. NCBI nr
Match: XP_022984191.1 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita maxima] >XP_022984192.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita maxima] >XP_022984193.1 pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1436.0 bits (3716), Expect = 0.0e+00
Identity = 705/835 (84.43%), Positives = 765/835 (91.62%), Query Frame = 0

Query: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60
           + MG HGYYL+RIP+ RRFRT + +FIP+S PFCSDST    QN PNQIEQV SHQEVT 
Sbjct: 16  LTMGRHGYYLSRIPVIRRFRTQYLAFIPSSNPFCSDST----QNNPNQIEQVTSHQEVTS 75

Query: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120
           STT+ QH N  EPLCHELVQK RILLQQERTGAAKRLIKS+ILSKSPFSSPCDLI +FSV
Sbjct: 76  STTRNQHPNHPEPLCHELVQKLRILLQQERTGAAKRLIKSIILSKSPFSSPCDLIALFSV 135

Query: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180
           HSP+LKHVFSNML  A LDL M DDAIRL TSMK+NGVVPAVATL++LFKLLMSSKEF+K
Sbjct: 136 HSPNLKHVFSNMLLMAFLDLKMLDDAIRLCTSMKENGVVPAVATLSVLFKLLMSSKEFRK 195

Query: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240
           TLDFFSELVESG +PDSFMYGKAVEA+IKLGE N+A DL+CCMKKIGINPTVFV NV+IS
Sbjct: 196 TLDFFSELVESGIRPDSFMYGKAVEASIKLGETNRAFDLMCCMKKIGINPTVFVSNVLIS 255

Query: 241 GFCKEKKIIDAQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGP 300
             CKEKK+IDAQK+FDEMIN N+SPN VTYNTIINGYCK GKLDKAFSLKERMK ENL P
Sbjct: 256 SLCKEKKLIDAQKMFDEMINTNLSPNSVTYNTIINGYCKVGKLDKAFSLKERMKQENLEP 315

Query: 301 NLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQY 360
           NLVTYNSLLSGLCQAKQMEEAKKLL EMETYGFAPDGFTYSILFDGYLRSG  EASIV Y
Sbjct: 316 NLVTYNSLLSGLCQAKQMEEAKKLLREMETYGFAPDGFTYSILFDGYLRSGDDEASIVLY 375

Query: 361 EEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRK 420
           EE +KKGV++NKYT CILLNGLCKDGKVEKAEEIL KLMM+GLVPDE++FNVLVDGYCRK
Sbjct: 376 EETVKKGVRINKYTCCILLNGLCKDGKVEKAEEILMKLMMDGLVPDEILFNVLVDGYCRK 435

Query: 421 GNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETF 480
           G IDGAISTIQRMENQGLTP+CITFNSLINKFCE+KE+DKAEEWLRKMI +E+CPSIET+
Sbjct: 436 GIIDGAISTIQRMENQGLTPNCITFNSLINKFCEMKEMDKAEEWLRKMIAREVCPSIETY 495

Query: 481 NILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDG 540
           NILLDGYGR C F+RCFQVLEEMESKGIK NVVSYGTLINCLCK GRFVEAEV+FADMDG
Sbjct: 496 NILLDGYGRGCFFNRCFQVLEEMESKGIKANVVSYGTLINCLCKGGRFVEAEVIFADMDG 555

Query: 541 KGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIE 600
           KGV PNAQIYNMLID  CT GKMQ AFKIFDEMIDR+ITPTL+T+NSLINGLCK GR+IE
Sbjct: 556 KGVLPNAQIYNMLIDCKCTLGKMQDAFKIFDEMIDRNITPTLSTHNSLINGLCKKGRVIE 615

Query: 601 AEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLIS 660
           AE+LVNQITNSG TPD+ITYNSLI G+CSSGNPQKGLE+YET+KKQGINPTLITYH+L+S
Sbjct: 616 AEELVNQITNSGLTPDVITYNSLILGHCSSGNPQKGLEIYETLKKQGINPTLITYHVLLS 675

Query: 661 GCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRD 720
           GCSK GLDTVEKLF EML MDLAPDR +YNA+I+CYIEN DVQKAFVLYKKMIDE V+ D
Sbjct: 676 GCSKVGLDTVEKLFDEMLLMDLAPDRVVYNAMIYCYIENGDVQKAFVLYKKMIDERVELD 735

Query: 721 KMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYK 780
           KMTYNSLILGCLRDGKVTEVRKLV+DMKARGL+PK DTYNI+VKGLCELGDY EAH WYK
Sbjct: 736 KMTYNSLILGCLRDGKVTEVRKLVDDMKARGLSPKGDTYNILVKGLCELGDYGEAHVWYK 795

Query: 781 EMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASM 835
           EMFKN FLLN+SVCNQLIDGLKREGRFQEA LILSEM+VKGLNV N+SN+P+AS+
Sbjct: 796 EMFKNHFLLNASVCNQLIDGLKREGRFQEAHLILSEMHVKGLNVWNMSNQPAASI 846

BLAST of Bhi01G000923 vs. NCBI nr
Match: XP_008439118.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucumis melo])

HSP 1 Score: 1419.4 bits (3673), Expect = 0.0e+00
Identity = 706/822 (85.89%), Positives = 747/822 (90.88%), Query Frame = 0

Query: 3   MGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTLST 62
           MGSHGYYL RI ITRRF T+ F FIP+S PF SDSTST NQN PNQ+EQV S QEVTLST
Sbjct: 1   MGSHGYYLPRITITRRFGTHPFIFIPSSQPFSSDSTSTPNQNNPNQVEQVSSLQEVTLST 60

Query: 63  TKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSVHS 122
           T  QHSNP EPLC ELVQK R+LLQQ RT AA+ LIKSVILSKSPFSSP DLI +FSVHS
Sbjct: 61  TNNQHSNPLEPLCLELVQKLRVLLQQGRTCAAESLIKSVILSKSPFSSPSDLIPLFSVHS 120

Query: 123 PSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKKTL 182
           PSL HVFS  LF A LDL MTDDAIRL TSMKKNGVVPAV TLN LFKLLMSSKEFKKTL
Sbjct: 121 PSLNHVFSKTLFMAFLDLKMTDDAIRLCTSMKKNGVVPAVGTLNFLFKLLMSSKEFKKTL 180

Query: 183 DFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGF 242
           DFFSELVESG QPD FMYGKAVEAAIKLG+MNKAC LVCCMKK GINPT FVYNVIISGF
Sbjct: 181 DFFSELVESGIQPDRFMYGKAVEAAIKLGKMNKACHLVCCMKKKGINPTFFVYNVIISGF 240

Query: 243 CKEKKIIDAQKIFDEMI-NNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNL 302
           CKEKK++DAQKIFDEMI  N+SPNLVTYNTIINGYCKAGKLDKAFSLKERMK ENLGPNL
Sbjct: 241 CKEKKMVDAQKIFDEMITKNMSPNLVTYNTIINGYCKAGKLDKAFSLKERMKLENLGPNL 300

Query: 303 VTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEE 362
           VTYNSLLSGLC+A++MEEAKKLL EMETYGFAPDGFTYSILFDGYLRSG GEAS+V +EE
Sbjct: 301 VTYNSLLSGLCKAREMEEAKKLLLEMETYGFAPDGFTYSILFDGYLRSGDGEASVVLFEE 360

Query: 363 AMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGN 422
           A+KKGV++N+YT CILLNGLCKDGK EKAEEILTKLMMNGLVP+E+IFNVLVDGYCRKGN
Sbjct: 361 AVKKGVRINEYTFCILLNGLCKDGKTEKAEEILTKLMMNGLVPNEIIFNVLVDGYCRKGN 420

Query: 423 IDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNI 482
           IDGAIS IQRMENQGLTP+CITFNSLI+KFCEIKE+DKAEEWLRKMI +E+CPSIET+N 
Sbjct: 421 IDGAISIIQRMENQGLTPNCITFNSLIHKFCEIKEMDKAEEWLRKMIEREVCPSIETYNT 480

Query: 483 LLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKG 542
           LLDGYGR+ LFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRF+EAE VFADMDGKG
Sbjct: 481 LLDGYGRMHLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFIEAEAVFADMDGKG 540

Query: 543 VFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAE 602
           VFPNAQIYNMLID NCTSGKMQ AFK FDEMIDRDITPTLATYNSLINGLCK GRMIEAE
Sbjct: 541 VFPNAQIYNMLIDCNCTSGKMQDAFKTFDEMIDRDITPTLATYNSLINGLCKKGRMIEAE 600

Query: 603 KLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISGC 662
           +L NQITNSG TPD+ITYNSLI+GYCSSGNPQKGLELYETMKKQGINPTL TYHLLISGC
Sbjct: 601 ELANQITNSGLTPDVITYNSLISGYCSSGNPQKGLELYETMKKQGINPTLKTYHLLISGC 660

Query: 663 SKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKM 722
           SK GLDT+EKLF+EMLH DLAPDR +Y  LIFCY+EN DVQ+AFVLY KMI EGV  DK+
Sbjct: 661 SKVGLDTMEKLFNEMLHTDLAPDRVVYKELIFCYVENGDVQRAFVLYNKMIVEGVLLDKI 720

Query: 723 TYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEM 782
           TYNSLILGC R GKVTEVRKLVEDMKARGLTPK DTYNI+VKG CE GDY EAHAWYKEM
Sbjct: 721 TYNSLILGCSRGGKVTEVRKLVEDMKARGLTPKTDTYNILVKGFCEFGDYIEAHAWYKEM 780

Query: 783 FKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNV 824
            + KFLLNSSVCNQLIDGLKREGRFQEA+LILSE  VKGLNV
Sbjct: 781 SEKKFLLNSSVCNQLIDGLKREGRFQEARLILSETDVKGLNV 822

BLAST of Bhi01G000923 vs. ExPASy TrEMBL
Match: A0A6J1E4W4 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111430768 PE=4 SV=1)

HSP 1 Score: 1445.6 bits (3741), Expect = 0.0e+00
Identity = 710/837 (84.83%), Positives = 770/837 (92.00%), Query Frame = 0

Query: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60
           + MG HGYYL+RIP+ RRFRT + +FIP+S PFCSDST    QN PNQIEQV SHQEVTL
Sbjct: 16  LTMGRHGYYLSRIPVIRRFRTQYLAFIPSSNPFCSDST----QNNPNQIEQVPSHQEVTL 75

Query: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120
           S+T+ QH N  EPLCHELVQK RILLQQERTGAAKRLIKS+ILSKSPFSSP DLI +FSV
Sbjct: 76  SSTRNQHPNHPEPLCHELVQKLRILLQQERTGAAKRLIKSIILSKSPFSSPFDLIALFSV 135

Query: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180
           HSP+LKHVFSNML  A LDL M DDAIRL TSMK+NGVVPAVATL++LFKLLMSSKEF+K
Sbjct: 136 HSPNLKHVFSNMLLMAFLDLKMLDDAIRLCTSMKENGVVPAVATLSVLFKLLMSSKEFRK 195

Query: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240
           TLDFFSELVESG +PDSFMYGKAVEAAIKLGE N+A DL+CCMKKIGI+PTVFV NV+IS
Sbjct: 196 TLDFFSELVESGIRPDSFMYGKAVEAAIKLGETNRAFDLMCCMKKIGISPTVFVSNVLIS 255

Query: 241 GFCKEKKIIDAQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGP 300
           G CKEKK+IDAQK+FDEMIN N+SPN VTYNTIINGYCK GKLDKAFSLKERMK ENL P
Sbjct: 256 GLCKEKKLIDAQKMFDEMINTNLSPNSVTYNTIINGYCKVGKLDKAFSLKERMKQENLEP 315

Query: 301 NLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQY 360
           NLVTYNSLLSGLCQAKQMEE KKLL EMETYGFAPDGFTYSILFDGYLRSG  EASIV Y
Sbjct: 316 NLVTYNSLLSGLCQAKQMEETKKLLREMETYGFAPDGFTYSILFDGYLRSGDDEASIVLY 375

Query: 361 EEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRK 420
           EE +KKGV++NKYT CILLNGLCKDGKVEKAEEILTKLMM+GLVPDE++FNVLVDGYCRK
Sbjct: 376 EETVKKGVRINKYTCCILLNGLCKDGKVEKAEEILTKLMMDGLVPDEILFNVLVDGYCRK 435

Query: 421 GNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETF 480
           GNIDGAISTIQRMENQGLTP+CITFNSLINKFCE+KE+DKAEEWLRKMI +E+CPSIET+
Sbjct: 436 GNIDGAISTIQRMENQGLTPNCITFNSLINKFCEMKEMDKAEEWLRKMIAREVCPSIETY 495

Query: 481 NILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDG 540
           NILLDGYGR CLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCK GRFVEAEV+FADMDG
Sbjct: 496 NILLDGYGRACLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKGGRFVEAEVIFADMDG 555

Query: 541 KGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIE 600
           KGV PNAQIYNMLID  CT GKMQ AFKIFDEMIDR+ITPTL+TYNSLINGLCK GR+IE
Sbjct: 556 KGVLPNAQIYNMLIDCKCTLGKMQDAFKIFDEMIDRNITPTLSTYNSLINGLCKKGRVIE 615

Query: 601 AEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLIS 660
           AE+LVNQITNS  TPD+ITYNSLI G+CSSG+PQKGLE+YET+KKQGINPTLITYH+LIS
Sbjct: 616 AEELVNQITNSSLTPDVITYNSLILGHCSSGDPQKGLEIYETLKKQGINPTLITYHVLIS 675

Query: 661 GCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRD 720
           GCS+ GLDTVEKLF EML MDLAPDR +YNA+I+CYIEN DVQKAFVLYKKMIDE V+ D
Sbjct: 676 GCSEVGLDTVEKLFGEMLLMDLAPDRVVYNAMIYCYIENGDVQKAFVLYKKMIDERVELD 735

Query: 721 KMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYK 780
           KMTYNSLILGCLRDGKVTEVRKLV+DMKARGL+PK DTYNI+VKGLCELGDY EAH WYK
Sbjct: 736 KMTYNSLILGCLRDGKVTEVRKLVDDMKARGLSPKGDTYNILVKGLCELGDYGEAHVWYK 795

Query: 781 EMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASMTM 837
           EMFKN FLLN+SVCNQLIDGLKREGRFQEA LILSEM+VKGL+V N+SN+P+ASMTM
Sbjct: 796 EMFKNHFLLNASVCNQLIDGLKREGRFQEAHLILSEMHVKGLSVWNMSNQPAASMTM 848

BLAST of Bhi01G000923 vs. ExPASy TrEMBL
Match: A0A6J1J4J9 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111482583 PE=4 SV=1)

HSP 1 Score: 1436.0 bits (3716), Expect = 0.0e+00
Identity = 705/835 (84.43%), Positives = 765/835 (91.62%), Query Frame = 0

Query: 1   MKMGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTL 60
           + MG HGYYL+RIP+ RRFRT + +FIP+S PFCSDST    QN PNQIEQV SHQEVT 
Sbjct: 16  LTMGRHGYYLSRIPVIRRFRTQYLAFIPSSNPFCSDST----QNNPNQIEQVTSHQEVTS 75

Query: 61  STTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSV 120
           STT+ QH N  EPLCHELVQK RILLQQERTGAAKRLIKS+ILSKSPFSSPCDLI +FSV
Sbjct: 76  STTRNQHPNHPEPLCHELVQKLRILLQQERTGAAKRLIKSIILSKSPFSSPCDLIALFSV 135

Query: 121 HSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKK 180
           HSP+LKHVFSNML  A LDL M DDAIRL TSMK+NGVVPAVATL++LFKLLMSSKEF+K
Sbjct: 136 HSPNLKHVFSNMLLMAFLDLKMLDDAIRLCTSMKENGVVPAVATLSVLFKLLMSSKEFRK 195

Query: 181 TLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIIS 240
           TLDFFSELVESG +PDSFMYGKAVEA+IKLGE N+A DL+CCMKKIGINPTVFV NV+IS
Sbjct: 196 TLDFFSELVESGIRPDSFMYGKAVEASIKLGETNRAFDLMCCMKKIGINPTVFVSNVLIS 255

Query: 241 GFCKEKKIIDAQKIFDEMIN-NVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGP 300
             CKEKK+IDAQK+FDEMIN N+SPN VTYNTIINGYCK GKLDKAFSLKERMK ENL P
Sbjct: 256 SLCKEKKLIDAQKMFDEMINTNLSPNSVTYNTIINGYCKVGKLDKAFSLKERMKQENLEP 315

Query: 301 NLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQY 360
           NLVTYNSLLSGLCQAKQMEEAKKLL EMETYGFAPDGFTYSILFDGYLRSG  EASIV Y
Sbjct: 316 NLVTYNSLLSGLCQAKQMEEAKKLLREMETYGFAPDGFTYSILFDGYLRSGDDEASIVLY 375

Query: 361 EEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRK 420
           EE +KKGV++NKYT CILLNGLCKDGKVEKAEEIL KLMM+GLVPDE++FNVLVDGYCRK
Sbjct: 376 EETVKKGVRINKYTCCILLNGLCKDGKVEKAEEILMKLMMDGLVPDEILFNVLVDGYCRK 435

Query: 421 GNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETF 480
           G IDGAISTIQRMENQGLTP+CITFNSLINKFCE+KE+DKAEEWLRKMI +E+CPSIET+
Sbjct: 436 GIIDGAISTIQRMENQGLTPNCITFNSLINKFCEMKEMDKAEEWLRKMIAREVCPSIETY 495

Query: 481 NILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDG 540
           NILLDGYGR C F+RCFQVLEEMESKGIK NVVSYGTLINCLCK GRFVEAEV+FADMDG
Sbjct: 496 NILLDGYGRGCFFNRCFQVLEEMESKGIKANVVSYGTLINCLCKGGRFVEAEVIFADMDG 555

Query: 541 KGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIE 600
           KGV PNAQIYNMLID  CT GKMQ AFKIFDEMIDR+ITPTL+T+NSLINGLCK GR+IE
Sbjct: 556 KGVLPNAQIYNMLIDCKCTLGKMQDAFKIFDEMIDRNITPTLSTHNSLINGLCKKGRVIE 615

Query: 601 AEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLIS 660
           AE+LVNQITNSG TPD+ITYNSLI G+CSSGNPQKGLE+YET+KKQGINPTLITYH+L+S
Sbjct: 616 AEELVNQITNSGLTPDVITYNSLILGHCSSGNPQKGLEIYETLKKQGINPTLITYHVLLS 675

Query: 661 GCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRD 720
           GCSK GLDTVEKLF EML MDLAPDR +YNA+I+CYIEN DVQKAFVLYKKMIDE V+ D
Sbjct: 676 GCSKVGLDTVEKLFDEMLLMDLAPDRVVYNAMIYCYIENGDVQKAFVLYKKMIDERVELD 735

Query: 721 KMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYK 780
           KMTYNSLILGCLRDGKVTEVRKLV+DMKARGL+PK DTYNI+VKGLCELGDY EAH WYK
Sbjct: 736 KMTYNSLILGCLRDGKVTEVRKLVDDMKARGLSPKGDTYNILVKGLCELGDYGEAHVWYK 795

Query: 781 EMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNVLNLSNEPSASM 835
           EMFKN FLLN+SVCNQLIDGLKREGRFQEA LILSEM+VKGLNV N+SN+P+AS+
Sbjct: 796 EMFKNHFLLNASVCNQLIDGLKREGRFQEAHLILSEMHVKGLNVWNMSNQPAASI 846

BLAST of Bhi01G000923 vs. ExPASy TrEMBL
Match: A0A1S3AYN1 (pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103484008 PE=4 SV=1)

HSP 1 Score: 1419.4 bits (3673), Expect = 0.0e+00
Identity = 706/822 (85.89%), Positives = 747/822 (90.88%), Query Frame = 0

Query: 3   MGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTLST 62
           MGSHGYYL RI ITRRF T+ F FIP+S PF SDSTST NQN PNQ+EQV S QEVTLST
Sbjct: 1   MGSHGYYLPRITITRRFGTHPFIFIPSSQPFSSDSTSTPNQNNPNQVEQVSSLQEVTLST 60

Query: 63  TKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSVHS 122
           T  QHSNP EPLC ELVQK R+LLQQ RT AA+ LIKSVILSKSPFSSP DLI +FSVHS
Sbjct: 61  TNNQHSNPLEPLCLELVQKLRVLLQQGRTCAAESLIKSVILSKSPFSSPSDLIPLFSVHS 120

Query: 123 PSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKKTL 182
           PSL HVFS  LF A LDL MTDDAIRL TSMKKNGVVPAV TLN LFKLLMSSKEFKKTL
Sbjct: 121 PSLNHVFSKTLFMAFLDLKMTDDAIRLCTSMKKNGVVPAVGTLNFLFKLLMSSKEFKKTL 180

Query: 183 DFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGF 242
           DFFSELVESG QPD FMYGKAVEAAIKLG+MNKAC LVCCMKK GINPT FVYNVIISGF
Sbjct: 181 DFFSELVESGIQPDRFMYGKAVEAAIKLGKMNKACHLVCCMKKKGINPTFFVYNVIISGF 240

Query: 243 CKEKKIIDAQKIFDEMI-NNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNL 302
           CKEKK++DAQKIFDEMI  N+SPNLVTYNTIINGYCKAGKLDKAFSLKERMK ENLGPNL
Sbjct: 241 CKEKKMVDAQKIFDEMITKNMSPNLVTYNTIINGYCKAGKLDKAFSLKERMKLENLGPNL 300

Query: 303 VTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEE 362
           VTYNSLLSGLC+A++MEEAKKLL EMETYGFAPDGFTYSILFDGYLRSG GEAS+V +EE
Sbjct: 301 VTYNSLLSGLCKAREMEEAKKLLLEMETYGFAPDGFTYSILFDGYLRSGDGEASVVLFEE 360

Query: 363 AMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGN 422
           A+KKGV++N+YT CILLNGLCKDGK EKAEEILTKLMMNGLVP+E+IFNVLVDGYCRKGN
Sbjct: 361 AVKKGVRINEYTFCILLNGLCKDGKTEKAEEILTKLMMNGLVPNEIIFNVLVDGYCRKGN 420

Query: 423 IDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNI 482
           IDGAIS IQRMENQGLTP+CITFNSLI+KFCEIKE+DKAEEWLRKMI +E+CPSIET+N 
Sbjct: 421 IDGAISIIQRMENQGLTPNCITFNSLIHKFCEIKEMDKAEEWLRKMIEREVCPSIETYNT 480

Query: 483 LLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKG 542
           LLDGYGR+ LFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRF+EAE VFADMDGKG
Sbjct: 481 LLDGYGRMHLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFIEAEAVFADMDGKG 540

Query: 543 VFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAE 602
           VFPNAQIYNMLID NCTSGKMQ AFK FDEMIDRDITPTLATYNSLINGLCK GRMIEAE
Sbjct: 541 VFPNAQIYNMLIDCNCTSGKMQDAFKTFDEMIDRDITPTLATYNSLINGLCKKGRMIEAE 600

Query: 603 KLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISGC 662
           +L NQITNSG TPD+ITYNSLI+GYCSSGNPQKGLELYETMKKQGINPTL TYHLLISGC
Sbjct: 601 ELANQITNSGLTPDVITYNSLISGYCSSGNPQKGLELYETMKKQGINPTLKTYHLLISGC 660

Query: 663 SKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKM 722
           SK GLDT+EKLF+EMLH DLAPDR +Y  LIFCY+EN DVQ+AFVLY KMI EGV  DK+
Sbjct: 661 SKVGLDTMEKLFNEMLHTDLAPDRVVYKELIFCYVENGDVQRAFVLYNKMIVEGVLLDKI 720

Query: 723 TYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEM 782
           TYNSLILGC R GKVTEVRKLVEDMKARGLTPK DTYNI+VKG CE GDY EAHAWYKEM
Sbjct: 721 TYNSLILGCSRGGKVTEVRKLVEDMKARGLTPKTDTYNILVKGFCEFGDYIEAHAWYKEM 780

Query: 783 FKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNV 824
            + KFLLNSSVCNQLIDGLKREGRFQEA+LILSE  VKGLNV
Sbjct: 781 SEKKFLLNSSVCNQLIDGLKREGRFQEARLILSETDVKGLNV 822

BLAST of Bhi01G000923 vs. ExPASy TrEMBL
Match: A0A5A7SUA7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1044G00050 PE=4 SV=1)

HSP 1 Score: 1419.1 bits (3672), Expect = 0.0e+00
Identity = 705/822 (85.77%), Positives = 748/822 (91.00%), Query Frame = 0

Query: 3   MGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSDSTSTQNQNKPNQIEQVLSHQEVTLST 62
           MGSHGYYL RI ITRRFRT+ F FIP+S PF SDSTST NQN PNQ+EQV S QEVTLST
Sbjct: 1   MGSHGYYLPRITITRRFRTHPFIFIPSSQPFSSDSTSTPNQNNPNQVEQVSSLQEVTLST 60

Query: 63  TKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVFSVHS 122
           T  QHSNP EP C ELVQK R+LLQQ RT AA+ LIKSVILSKSPFSSP DLI +FSVHS
Sbjct: 61  TNNQHSNPLEPWCLELVQKLRVLLQQGRTCAAESLIKSVILSKSPFSSPSDLIPLFSVHS 120

Query: 123 PSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVATLNILFKLLMSSKEFKKTL 182
           PSL HVFS  LF A LDL MTDDAIRL TSMKKNGVVPAV TLN LFKLLMSSKEFKKTL
Sbjct: 121 PSLNHVFSKTLFMAFLDLKMTDDAIRLCTSMKKNGVVPAVGTLNFLFKLLMSSKEFKKTL 180

Query: 183 DFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNVIISGF 242
           DFFSELVESG QPD FMYGKAVEAAIKLG+MNKAC LVCCMKK GINPT FVYNVIISGF
Sbjct: 181 DFFSELVESGIQPDRFMYGKAVEAAIKLGKMNKACHLVCCMKKKGINPTFFVYNVIISGF 240

Query: 243 CKEKKIIDAQKIFDEMI-NNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHENLGPNL 302
           CKEKK++DAQKIFDEMI  N+SPNLVTYNTIINGYCKAGKLDKAFSLKERMK ENLGPNL
Sbjct: 241 CKEKKMVDAQKIFDEMITKNMSPNLVTYNTIINGYCKAGKLDKAFSLKERMKLENLGPNL 300

Query: 303 VTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASIVQYEE 362
           VTYNSLLSGLC+A++MEEAKKLL EMETYGFAPDGFTYSILFDGYLRSG GEAS+V +EE
Sbjct: 301 VTYNSLLSGLCKAREMEEAKKLLLEMETYGFAPDGFTYSILFDGYLRSGDGEASVVLFEE 360

Query: 363 AMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGYCRKGN 422
           A+KKGV++N+YT CILLNGLCKDGK EKAEEILTKLMMNGLVP+E+IFNVLVDGYCRKGN
Sbjct: 361 AVKKGVRINEYTFCILLNGLCKDGKTEKAEEILTKLMMNGLVPNEIIFNVLVDGYCRKGN 420

Query: 423 IDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSIETFNI 482
           IDGAIS IQRMENQGLTP+CITFNSLI+KFCEIKE+DKAEEWLRKMI +E+CPSIET+N 
Sbjct: 421 IDGAISIIQRMENQGLTPNCITFNSLIHKFCEIKEMDKAEEWLRKMIEREVCPSIETYNT 480

Query: 483 LLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFADMDGKG 542
           LLDGYGR+ LFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRF+EAE VFADMDGKG
Sbjct: 481 LLDGYGRMHLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFIEAEAVFADMDGKG 540

Query: 543 VFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGRMIEAE 602
           VFPNAQIYNMLID NCTSGKMQ AFK FDEMIDRDITPTLATYNSLINGLCK GRMIEAE
Sbjct: 541 VFPNAQIYNMLIDCNCTSGKMQDAFKTFDEMIDRDITPTLATYNSLINGLCKKGRMIEAE 600

Query: 603 KLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHLLISGC 662
           +L NQITNSG TPD+ITYNSLI+GYCSSGNPQKGLELYETMKKQGINPTL TYHLLISGC
Sbjct: 601 ELANQITNSGLTPDVITYNSLISGYCSSGNPQKGLELYETMKKQGINPTLKTYHLLISGC 660

Query: 663 SKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGVQRDKM 722
           SK GLDT+EKLF+EMLH DLAP+R +Y  LIFCY+EN DVQ+AFVLY KMI EGV  DK+
Sbjct: 661 SKVGLDTMEKLFNEMLHTDLAPERVVYKELIFCYVENGDVQRAFVLYNKMIVEGVLLDKI 720

Query: 723 TYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHAWYKEM 782
           TYNSLILGC R GKVTEVRKLVEDMKARGLTPK DTYNI+VKG CE GDYSEAHAWYKEM
Sbjct: 721 TYNSLILGCSRGGKVTEVRKLVEDMKARGLTPKTDTYNILVKGFCEFGDYSEAHAWYKEM 780

Query: 783 FKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYVKGLNV 824
            + KFLLNSSVCNQLIDGLKREGRF+EA+LILSE  VKGLNV
Sbjct: 781 SEKKFLLNSSVCNQLIDGLKREGRFREARLILSETDVKGLNV 822

BLAST of Bhi01G000923 vs. ExPASy TrEMBL
Match: A0A0A0L693 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G177910 PE=4 SV=1)

HSP 1 Score: 1395.2 bits (3610), Expect = 0.0e+00
Identity = 696/822 (84.67%), Positives = 744/822 (90.51%), Query Frame = 0

Query: 3   MGSHGYYLTRIPITRRFRTYHFSFIPTSIPFCSD----STSTQNQNKPNQIEQVLSHQEV 62
           MGSHGYYL RIPITRRFRT+ F FIP+S+PFCSD    STSTQNQN  NQ E V S QEV
Sbjct: 1   MGSHGYYLPRIPITRRFRTHPFVFIPSSLPFCSDSTSTSTSTQNQNNFNQFEHVSSLQEV 60

Query: 63  TLSTTKKQHSNPSEPLCHELVQKFRILLQQERTGAAKRLIKSVILSKSPFSSPCDLIEVF 122
           TLSTT  QHSNP EPLC ELVQK RILLQQ RTGAA+ LIKSVILSKSPFSSP DLI +F
Sbjct: 61  TLSTTSSQHSNPLEPLCLELVQKLRILLQQGRTGAAESLIKSVILSKSPFSSPSDLIPLF 120

Query: 123 SVHSPSLKHVFSNMLFTALLDLNMTDDAIRLYTSMKKNGVVPAVA-TLNILFKLLMSSKE 182
           SVH+PSL HVFS  LF   LDL MTDDAIRL TSMKKNGVVPAV  TLN+LFKLLMSSKE
Sbjct: 121 SVHAPSLNHVFSKTLFMVFLDLKMTDDAIRLCTSMKKNGVVPAVVDTLNVLFKLLMSSKE 180

Query: 183 FKKTLDFFSELVESGFQPDSFMYGKAVEAAIKLGEMNKACDLVCCMKKIGINPTVFVYNV 242
           FKKTLDFFSELVESG  PD FMYGKAVEAA+KLG MNKACDLVCCMKKIGI+PT FVYNV
Sbjct: 181 FKKTLDFFSELVESGILPDKFMYGKAVEAAMKLGNMNKACDLVCCMKKIGIDPTFFVYNV 240

Query: 243 IISGFCKEKKIIDAQKIFDEMI-NNVSPNLVTYNTIINGYCKAGKLDKAFSLKERMKHEN 302
           +ISGFCKEKK++DAQKIFDEMI  N+SPNLVTYNTIINGYCKAGKLDKAFSLKERMK EN
Sbjct: 241 LISGFCKEKKMVDAQKIFDEMITKNMSPNLVTYNTIINGYCKAGKLDKAFSLKERMKLEN 300

Query: 303 LGPNLVTYNSLLSGLCQAKQMEEAKKLLHEMETYGFAPDGFTYSILFDGYLRSGGGEASI 362
           LGPNLVTYNSLLSGLC+A+QMEEAKKLL EMET+GFAPDGFTYSILFDGYLRSG GEAS+
Sbjct: 301 LGPNLVTYNSLLSGLCKARQMEEAKKLLVEMETHGFAPDGFTYSILFDGYLRSGDGEASV 360

Query: 363 VQYEEAMKKGVKMNKYTTCILLNGLCKDGKVEKAEEILTKLMMNGLVPDEMIFNVLVDGY 422
           V +EEA+KKGV++N+YT CILLNGLCKDGK EKAEE LTKLMMNGLVP+E+IFNVLVDGY
Sbjct: 361 VLFEEAVKKGVRINEYTCCILLNGLCKDGKAEKAEEFLTKLMMNGLVPNEIIFNVLVDGY 420

Query: 423 CRKGNIDGAISTIQRMENQGLTPSCITFNSLINKFCEIKELDKAEEWLRKMIGKEICPSI 482
           CRKGNIDGAISTIQRMENQGLTP+CITFNSLI+KFCEIKE+DKAEEWLRKM+ +E+CPSI
Sbjct: 421 CRKGNIDGAISTIQRMENQGLTPNCITFNSLIHKFCEIKEMDKAEEWLRKMMEREVCPSI 480

Query: 483 ETFNILLDGYGRVCLFDRCFQVLEEMESKGIKPNVVSYGTLINCLCKVGRFVEAEVVFAD 542
           ET+N LLDGYGR+ LFDRCFQVLEEMESKGIKPNVVSYG LINCLCKVGRFVEAE VFAD
Sbjct: 481 ETYNTLLDGYGRMRLFDRCFQVLEEMESKGIKPNVVSYGALINCLCKVGRFVEAEAVFAD 540

Query: 543 MDGKGVFPNAQIYNMLIDYNCTSGKMQGAFKIFDEMIDRDITPTLATYNSLINGLCKLGR 602
           MDGKGVFPNAQIYNMLID NCTSGKMQ AFK FDEMIDRDITPTLATYNSLINGLCK GR
Sbjct: 541 MDGKGVFPNAQIYNMLIDCNCTSGKMQDAFKTFDEMIDRDITPTLATYNSLINGLCKKGR 600

Query: 603 MIEAEKLVNQITNSGFTPDLITYNSLITGYCSSGNPQKGLELYETMKKQGINPTLITYHL 662
           +IEAE+L NQIT SG TPD+ITYNSLI+GYCSSGN +KGLELYETMKKQGINPTLITYHL
Sbjct: 601 VIEAEELANQITKSGLTPDVITYNSLISGYCSSGNSKKGLELYETMKKQGINPTLITYHL 660

Query: 663 LISGCSKAGLDTVEKLFSEMLHMDLAPDRGIYNALIFCYIENRDVQKAFVLYKKMIDEGV 722
           LISG SK GLDT+E+LF+E+LH DLA D+ +YN LIFCY+EN DVQKAFVLY KMI EGV
Sbjct: 661 LISGRSKVGLDTMEELFNEVLHRDLALDKVVYNGLIFCYVENGDVQKAFVLYNKMIVEGV 720

Query: 723 QRDKMTYNSLILGCLRDGKVTEVRKLVEDMKARGLTPKADTYNIVVKGLCELGDYSEAHA 782
           Q DK+TYNSLILGC R GKVTEVRKLVEDMKARGLTPKADTYNI+VKGLCE GDYSEAH 
Sbjct: 721 QLDKITYNSLILGCSRGGKVTEVRKLVEDMKARGLTPKADTYNILVKGLCEFGDYSEAHT 780

Query: 783 WYKEMFKNKFLLNSSVCNQLIDGLKREGRFQEAQLILSEMYV 819
           WYKEMF+N  LLNS V NQLIDGLKREGRFQEA+LILSE YV
Sbjct: 781 WYKEMFENNLLLNSPVRNQLIDGLKREGRFQEARLILSETYV 822
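
Each HSP summary line above reports identity and positive counts together with a rounded percentage. The following is a minimal sketch, not part of the database output, of how those percentages can be re-derived from the counts as a consistency check. The names `HSP_RE`, `SUMMARY_LINES` and `check_hsp_summary` are hypothetical, and the pattern accepts either spelling of the positives label.

```python
import re

# Summary lines copied from the five TrEMBL alignments above.
SUMMARY_LINES = [
    "Identity = 710/837 (84.83%), Positives = 770/837 (92.00%), Query Frame = 0",
    "Identity = 705/835 (84.43%), Positives = 765/835 (91.62%), Query Frame = 0",
    "Identity = 706/822 (85.89%), Positives = 747/822 (90.88%), Query Frame = 0",
    "Identity = 705/822 (85.77%), Positives = 748/822 (91.00%), Query Frame = 0",
    "Identity = 696/822 (84.67%), Positives = 744/822 (90.51%), Query Frame = 0",
]

# "Posi?tives" matches both "Positives" and the "Postives" spelling.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_summary(line, tol=0.006):
    """Return True if both reported percentages match their counts.

    tol is set slightly above half a unit in the last reported digit
    (0.005) to absorb floating-point noise.
    """
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    id_calc = 100.0 * int(id_n) / int(id_len)
    pos_calc = 100.0 * int(pos_n) / int(pos_len)
    return (abs(id_calc - float(id_pct)) <= tol
            and abs(pos_calc - float(pos_pct)) <= tol)


if __name__ == "__main__":
    for line in SUMMARY_LINES:
        print(check_hsp_summary(line), line)
```

Note that the denominator is the alignment length including gaps (for example 837 for the Cucurbita moschata hit), not the length of either protein, so percentages from different hits are not computed over the same span.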

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
AT5G12100.1	3.4e-219	47.87	pentatricopeptide (PPR) repeat-containing protein
AT5G55840.1	1.3e-96	28.92	Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1	1.2e-94	30.54	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G65560.1	4.5e-86	29.38	Pentatricopeptide repeat (PPR) superfamily protein
AT1G62670.1	2.9e-85	31.37	rna processing factor 2

Match Name	E-value	Identity (%)	Description
Q9FMQ1	4.8e-218	47.87	Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidop...
Q9LVQ5	1.8e-95	28.92	Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Q9FIX3	1.7e-93	30.54	Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9LSL9	6.4e-85	29.38	Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Q9SXD1	4.1e-84	31.37	Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop...

Match Name	E-value	Identity (%)	Description
XP_038880496.1	0.0e+00	100.00	pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Benincasa ...
XP_022922939.1	0.0e+00	84.83	pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita ...
XP_023552569.1	0.0e+00	84.95	pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita ...
XP_022984191.1	0.0e+00	84.43	pentatricopeptide repeat-containing protein At5g12100, mitochondrial [Cucurbita ...
XP_008439118.1	0.0e+00	85.89	PREDICTED: pentatricopeptide repeat-containing protein At5g12100, mitochondrial ...

Match Name	E-value	Identity (%)	Description
A0A6J1E4W4	0.0e+00	84.83	pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Cucurbit...
A0A6J1J4J9	0.0e+00	84.43	pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Cucurbit...
A0A1S3AYN1	0.0e+00	85.89	pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Cucumis ...
A0A5A7SUA7	0.0e+00	85.77	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A0A0L693	0.0e+00	84.67	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G177910 PE=4 SV=1
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 262..327; e-value: 1.6E-24; score: 88.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 398..466; e-value: 2.9E-17; score: 64.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 328..397; e-value: 4.8E-14; score: 54.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 561..667; e-value: 4.4E-35; score: 122.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 668..832; e-value: 5.1E-39; score: 136.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 130..261; e-value: 6.0E-27; score: 96.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 467..560; e-value: 4.0E-27; score: 97.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 442..475; e-value: 4.9E-6; score: 24.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 687..719; e-value: 3.0E-6; score: 25.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 373..406; e-value: 4.8E-7; score: 27.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 512..546; e-value: 8.1E-9; score: 33.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 548..580; e-value: 2.7E-5; score: 22.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 408..441; e-value: 3.8E-7; score: 27.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 302..335; e-value: 1.2E-10; score: 38.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 722..753; e-value: 3.0E-6; score: 25.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 164..197; e-value: 0.0012; score: 16.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 131..160; e-value: 0.0014; score: 16.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 267..300; e-value: 2.1E-10; score: 38.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 757..789; e-value: 1.9E-7; score: 28.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 234..260; e-value: 6.3E-5; score: 20.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 478..511; e-value: 6.5E-9; score: 33.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 617..650; e-value: 1.6E-9; score: 35.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 583..615; e-value: 2.1E-8; score: 31.8
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 227..258; e-value: 1.3E-9; score: 37.6
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 365..396; e-value: 1.0E-5; score: 25.1
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 544..593; e-value: 3.3E-15; score: 56.0
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 474..523; e-value: 8.5E-13; score: 48.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 719..766; e-value: 3.9E-13; score: 49.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 614..663; e-value: 6.6E-14; score: 51.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 404..452; e-value: 9.6E-11; score: 41.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 264..313; e-value: 3.9E-18; score: 65.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 792..820; e-value: 0.01; score: 16.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 687..716; e-value: 5.5E-5; score: 23.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 131..158; e-value: 1.3; score: 9.4
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 475..509; score: 11.73961
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 300..334; score: 13.67976
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 335..369; score: 8.549871
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 265..299; score: 13.613992
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 580..614; score: 12.41921
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 719..753; score: 12.33152
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 126..160; score: 8.560833
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 789..823; score: 9.646002
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 615..649; score: 14.063405
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 545..579; score: 11.092894
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 231..261; score: 10.05157
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 405..439; score: 12.320559
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 440..474; score: 10.55579
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 510..544; score: 12.178061
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 161..195; score: 8.692369
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 754..788; score: 10.270796
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 370..404; score: 10.796938
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 684..718; score: 9.97484
None	No IPR available	PANTHER	PTHR47932:SF28	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN MITOCHONDRIAL	coord: 62..818
None	No IPR available	PANTHER	PTHR47932	ATPASE EXPRESSION PROTEIN 3	coord: 62..818
None	No IPR available	SUPERFAMILY	81901	HCP-like	coord: 236..445
None	No IPR available	SUPERFAMILY	81901	HCP-like	coord: 622..812
None	No IPR available	SUPERFAMILY	81901	HCP-like	coord: 377..656
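
The PROSITE PS51375 hits in the InterPro table are listed out of positional order. As an illustration only, the sketch below sorts their `coord:` ranges and reports the stretches of the protein between the first and last hit that no PROSITE PPR hit covers. The names `PROSITE_PPR`, `parse_coords` and `uncovered_gaps` are hypothetical; the coordinates are copied from the table.

```python
import re

# "coord:" values of the PROSITE PS51375 (PPR) hits, in the order listed.
PROSITE_PPR = """
coord: 475..509
coord: 300..334
coord: 335..369
coord: 265..299
coord: 580..614
coord: 719..753
coord: 126..160
coord: 789..823
coord: 615..649
coord: 545..579
coord: 231..261
coord: 405..439
coord: 440..474
coord: 510..544
coord: 161..195
coord: 754..788
coord: 370..404
coord: 684..718
"""

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) tuples sorted by start position."""
    return sorted((int(a), int(b)) for a, b in COORD_RE.findall(text))


def uncovered_gaps(hits):
    """Return inclusive (start, end) ranges between hits that no hit covers."""
    if not hits:
        return []
    gaps = []
    covered_to = hits[0][1]
    for start, end in hits[1:]:
        if start > covered_to + 1:
            gaps.append((covered_to + 1, start - 1))
        covered_to = max(covered_to, end)
    return gaps


if __name__ == "__main__":
    hits = parse_coords(PROSITE_PPR)
    print(len(hits), "PROSITE PPR hits spanning", hits[0][0], "to", hits[-1][1])
    for start, end in uncovered_gaps(hits):
        print(f"uncovered: {start}..{end}")
```

The same two functions can be applied to the TIGRFAM or PFAM rows by pasting their `coord:` values instead; `uncovered_gaps` tracks the furthest covered position, so overlapping hits such as the PF13041 ranges are handled as well.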

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Bhi01M000923	Bhi01M000923	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding