Bhi09G001310 (gene) Wax gourd (B227) v1

Overview
Name: Bhi09G001310
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr9: 42130934 .. 42136007 (-)
RNA-Seq Expression: Bhi09G001310
Synteny: Bhi09G001310
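
The location field can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Hypothetical helper: splits a location string such as
# "chr9: 42130934 .. 42136007 (-)" into its parts.
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr9: 42130934 .. 42136007 (-)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 5074 bp on the minus strand of chr9.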
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTGATATAAAAGTTTTTGGCTTTTGAGCCAGAAATTTTCCATTGTTGATCTGTTTTTGGTCGTTTTTTCAAACTCATTTGGCCATTTGCATTTAAATGTAGTTAAGGATACCGGTTAATATAATTTCTTGCAAAAACACTGTCACAATGTAAGATGTCATGCTTTCTTCCCCCTTTTTTCCCCTTTTCCATTCTCCCGACTTTCTTGTTCCAATTCAAGTCCCCGTTGGCCGTCAATTTTCCCGGGAAAATTCGGGATGGTATTTTCCCAGAAAATCTATTCAATTGTATTTGGGAAAATGGCGAATCTTAGTTCCGAAAATAAAGTATTTCTGAACTGGGTCTGAAATTTTTTATAAACGAAAGACAAAAGGGTTAAATTCTTCCATCTTTTATTCGTTTTCTGGATTTTGATCGTGCAACAGGTCTGTTTCAATGGAAAAAGAAGTTGAACCATGAGAGGATGAGGCTTCCGGGGGCTTTAGAAACAAGGTAAATATGCAACTCCACACAATTGTAAGATTAGAGAAACTCATTGATCCATCTTTTAGGTCATTTTTCTATTGTAGTTTATTTTTTGTTCTCGTGTTGCATAATACCTGCCTCTGTATTGGGTCCTCAAATGATAAGAGTTCTGTGTAATTACCTGCTTCAAATCCACCAGCTTCGTTGGTCACCATCTCTCACTTTATTCATACCTAGAAAGTTCTTTTTATATGTTCAATCCCCAGTAGTTCTGAGATGCCGAAATAAGTGTACCACCATAAATTTATCTTCCATTGACTGCTCTGGCATTGCACAATCTCTCATATCAAGGTGTTCAGTTTTGCTTGAGAAGGAAGGGTATGCCTCAGCATTACCTAATCCTTCTTTTAAGGAATTTTTATTGGAGATCTCTGATGTTGTACCGGAGTATGTGCGCAGAATTAGGCGAGTTGCAGAGTTAAAGCCTGAGGATGTGCTTAAATTGTTTCTTGGGTTTCAATCAGCGGTTGGGAATAATGGGATTCAAATTAAGAAAGTTGAGTGTTTGTGGAGAATTTTGAAATTCACTAATGAAAGTAGTAGGAACTTCAAGCATATACCAAGGTCGTGTGAGATTATGGCCTCTCTTCTCATTCGAGTTGGGAAGTTTAAGGAAGTCGAGCACTTTCTTTCTGAGATGGAGAGTCAAGGAATTTTACTGGATAATCCTGAAGTTTTCAGTTGTTTAATTCAGGGTTTTGTGTGTGAAGGTAATCTAGAAAGGGCTGTTTCGATATACGAAAAAGTGAGGCAGCGATGTATATCTCCATCATTGTCATGTTATCATGTTCTGCTTGATTCTTTGGTTCGGATGAAGAAAACACAAGTAGCACTTGGAGTATGTATGGATATGGTGGAGATGGGATTTGGTTTGGGGGATGAAGAGAAGGCTGTTTTTGATAATGTCATTGGACTACTTTGTTGGCAGGGAAAGGTTCTTGAAGCTAGGAACCTTGTGAAGAAGTTTGTGGCTTTGGACTTTAGGCCTAGCGATGAGGTTCTTTATCAAATTACAAGGGGTTACTGTGAGAAGAAGGATTTTGAAGATTTGCTGAGTTTCTTCTTTGAAATTAAAAGTCCCCCAAATGTTTCTTCTGGCAACAAAATCATTTATTCTCTTTGCAAAGATTTTGGCTCTGAAAGTGCATACTTGTATCTACGAGAACTTGAGCATACAGGCTTCAAGCCTGATGAAATAACCTTTGGGATTTTGATTTGTTGGAGCTGTCGTGAGGGAAATCTTCGAAAAGCTTTTATTTATATGTCTGAGTTATTGTTTAGTGGCCTAAAGCCAGATTTACATTCATATAATGCTCTCATCAGCGGGATGTTGAAGGAGGGCCTCTGGGAGAATGCCCAAGGCGTTCTTGCTGAAATGGTAGATCAGGGGATTGAACCTAATTTATCAACCTTCAGAATTATTTTAGCAGGCTATTGCAAAGCTAGACAATTTGAAGAAGCAAAAAAGACCGTTCT
TGAAATGGAAAGATGTGGTTTTATTCAACTTTCTTCAGTAGATGATCTATTGTGCAGAATATTCTCTTTCTTGGGGTTTAATGACTCAGCAGTGAGGTTGAAAAGAGACAGCAATACTGGTGTTTCTAAAACAGAGTTCTTCGATACCCTTGGAAATGGGCTTTATCTGGACACTGATGTGGATGAATATGAGAAAAGGCTTACTGAAATTCTCAAAGAGTCTGTAGTACCTGATTTTAATTTGCTTATAATCGAGGAGTGTAAAAACAGAGACCCTAAGGCTGTAGTAGGATTGGCAGCTGAAATGGATCGATGGGGGCAAGAACTAACTTCAGTAGGTTTGATGGGTTTCTTGAAAAGACATTGTACATTGAATTCCAGAATCAAGCCTATCATTGATGTTTGGGAGAGAAGGCCATATATGATTGCTCAACTAGGAGCAGACACTTTAAATTTACTTGTGCAAGCATATAGCAAATGCAGGTTGACTTCTAGTGGAATTGGAATACTAAATGAAATGAGCCAAATGCATGTTGTAATAGAGAAAGAAACATACAGTGTTCTGATAAATAGTTTGTGCAAAACAGGAAACTTAAATGATCTTCTTGGTTGTTGGGATAGAGCCCGAAAAGATGGTTGGGTTCCAGGGTTGCATGACTGTAAATTACTTATCAGTTGCCTTTGCAAGAAAAGAAAACTCAAAGAAGTTTTCTCCCTCCTCCAAACCATGCTGGTGTCTTATCCACATTCAAGGTTGGATATACTTAATATCTTCCTTGAAAGGCTTTCAGAAGCAGGGTTCGCTGCAATTGGACAAGTATTGGCAAAAGAGCTTATGGCTCTTGGATTTTACTTGGATCAAAAGGCATATGAACTTCTTATCATAGGATTGTGTAAAGAGAACAATATTTCAATAGCGATTAATTTATTGGATGATATAATGGCTATGAGTATGGTTCCGTGCATTGATGTTTGTCTTCTGGTAATTCCTATATTATGTAAGGTTGGTAGATATGAAACCGCAATTGCATTAAAAGAGATTGGAACTACCAAGCTATCGTCTTCTTCGATTAGAGTGTTTGGTGCACTAATGAAAGGTTTCTTTATGATGGGAAAGGTTAGAGAAACCTTGCCTCTAGTTCAGGATATGTTGTCTAAAGGTATTTCTCTAGATGCTGAGATTTATAATAATCTGGTTCAAGGGCATTGCAAAGTGAAAAACTTGGATAAAGTGCGGGAGCTACTGGGCATTATTGTTAGGAAGGACTTGAGCCTTTCGATATCAAGTTACAAGAAATTAGTGTGTTTGATGTGTATGGAAGGAAGAAGTCTCCAGGCATTGCATCTAAAGGACCTCATGCTTAGAAACAGCAAATCTCATGACTGTGTTATCTATAACATTCTGATTTTTTACATTTTTCAAAGCGGGAACAGTTTGCTTGTGCCAAAAATTTTGGACGAACTATTGTACAAGAGGAAATTGTTACCTGATAACATGACCTATGATTTTCTAGTATATGGATTTTCCAAGTGCAAAAACTTTTCCAGTTCCACATTATATCTCTTTACCATGATCCAACAGGAGTTTCGTCCCAGCAATCGGAGCTTGAATACTGTAATAAGCTACCTTTGTAATACTGGACAGCTCGGAAAAGCGTTGGATCTGAGCCGGAAGATGGAATCTAGGGGATGGATACATAGTTCAGCTGTACAGAATGCAATAACAGAGTGTCTCATTGCAAATGGTAAGCTTCGAGAAGCAGAATGTTTTTTAAATAGAATGGTTGAGAAGAGTCTCATTCCTGAACATGTAGATTACAACAACATAATCAAGCAATTTTGTCAAAGTGGAAGATGGTTGAATGCAATCAATCTTATAAACTTAATGCTTGAGAAAGGAAACATCCCAAATGCTACAAGTTATGATTTTGTCATTCAATGTTGCTGTACCTACAAGAAGTTGGAAGAAGCAGTAGATTTCCATACCGAGATGTTGGACC
GGCATCTAAAGCCGAGCATCAGGACGTGGGATAAACTTGTTTCTTTATTATGCAGAGAAGGGCAGACAAAAGAAGCAGAAAGGGTTTTGATAAGCATGACAGAGATGGGTGAAAAACCAAGCAAAGATGCATATTGCTCCATGCTGGACAAATACCGCTATGAGAATGATCTTGAAAAGGCATCAGAGACAATGAGAGCAATGCAAGAAAGTGGTTACGAGTTGGATTTTGAGAGACAGTGGTCTCTCATAAGCAAACTAAACGACACCAATCTCAAGGACGGCAACAACAATAATAGTAACAAAGGTTTTCTCTCGGGACTTCTTTCCAAGAGTGGATTTTCTCGGGCATTGATTCCTTAGCAAAGCCAAGATAGAGATTAGCAAAGGTTAGCTTTGTTCATTTTTGTTACATCAATAGCTTTCTCTCTTGTCGTCTAGAATAAATTGTTCAATAAAATTTTACTTTAGCAATGCTAAACCTTTGTCATCAACGATAGGACCGATTTACGATCTACCTTGAAAATCAAATGTAGTAGGATTAGGTGGTTGTCTTGCAAGAATAATAAGGTATGTGCAAGTTGACTTGATTTGACATTGACGGATGTAAAAAAAAGGTGGGGGGGGAGGAAACTTGTGGTGGACAAATAGTGTGTCAAACGTAAAAACCCCTGAAAAAGATAGTTATTAGTCAAAATAATTTAATTGAGAATGTAGATGTTTCAAAAGAAAACTTGAACCTTACTACTCATAAGAATATCTAAGGTCATGTATTTGAAGTTAAGAATAATATTTTAAAAAAAAAGATGAACCGAAATGATATAGTTGTTGACGATTTTTTTGCTCATAATATTATTTCTGTAAAAGTTAATTATCAATCAAAATTTGTCGAAGAATCTTGATACAGAAAGAATTGGCTTAAGTAAAAAGAAGCAATTAAGGTAGAATTACACTCTTTTTCCAAACATGAGATTTTTAAACCATTAGTCTAGATACTAAAAGATGTCAATTCTCTAAGATACACATGAGTATTTGGAAGGAGGAAAAAATGAAATAATGAACCATATGATATAAT

mRNA sequence

TTGATATAAAAGTTTTTGGCTTTTGAGCCAGAAATTTTCCATTGTTGATCTGTTTTTGGTCGTTTTTTCAAACTCATTTGGCCATTTGCATTTAAATGTAGTTAAGGATACCGGTTAATATAATTTCTTGCAAAAACACTGTCACAATGTAAGATGTCATGCTTTCTTCCCCCTTTTTTCCCCTTTTCCATTCTCCCGACTTTCTTGTTCCAATTCAAGTCCCCGTTGGCCGTCAATTTTCCCGGGAAAATTCGGGATGGTATTTTCCCAGAAAATCTATTCAATTGTATTTGGGAAAATGGCGAATCTTAGTTCCGAAAATAAAGTATTTCTGAACTGGGTCTGAAATTTTTTATAAACGAAAGACAAAAGGGTTAAATTCTTCCATCTTTTATTCGTTTTCTGGATTTTGATCGTGCAACAGGTCTGTTTCAATGGAAAAAGAAGTTGAACCATGAGAGGATGAGGCTTCCGGGGGCTTTAGAAACAAGGTAAATATGCAACTCCACACAATTGTAAGATTAGAGAAACTCATTGATCCATCTTTTAGGTCATTTTTCTATTGTAGTTTATTTTTTGTTCTCGTGTTGCATAATACCTGCCTCTGTATTGGGTCCTCAAATGATAAGAGTTCTGTGTAATTACCTGCTTCAAATCCACCAGCTTCGTTGGTCACCATCTCTCACTTTATTCATACCTAGAAAGTTCTTTTTATATGTTCAATCCCCAGTAGTTCTGAGATGCCGAAATAAGTGTACCACCATAAATTTATCTTCCATTGACTGCTCTGGCATTGCACAATCTCTCATATCAAGGTGTTCAGTTTTGCTTGAGAAGGAAGGGTATGCCTCAGCATTACCTAATCCTTCTTTTAAGGAATTTTTATTGGAGATCTCTGATGTTGTACCGGAGTATGTGCGCAGAATTAGGCGAGTTGCAGAGTTAAAGCCTGAGGATGTGCTTAAATTGTTTCTTGGGTTTCAATCAGCGGTTGGGAATAATGGGATTCAAATTAAGAAAGTTGAGTGTTTGTGGAGAATTTTGAAATTCACTAATGAAAGTAGTAGGAACTTCAAGCATATACCAAGGTCGTGTGAGATTATGGCCTCTCTTCTCATTCGAGTTGGGAAGTTTAAGGAAGTCGAGCACTTTCTTTCTGAGATGGAGAGTCAAGGAATTTTACTGGATAATCCTGAAGTTTTCAGTTGTTTAATTCAGGGTTTTGTGTGTGAAGGTAATCTAGAAAGGGCTGTTTCGATATACGAAAAAGTGAGGCAGCGATGTATATCTCCATCATTGTCATGTTATCATGTTCTGCTTGATTCTTTGGTTCGGATGAAGAAAACACAAGTAGCACTTGGAGTATGTATGGATATGGTGGAGATGGGATTTGGTTTGGGGGATGAAGAGAAGGCTGTTTTTGATAATGTCATTGGACTACTTTGTTGGCAGGGAAAGGTTCTTGAAGCTAGGAACCTTGTGAAGAAGTTTGTGGCTTTGGACTTTAGGCCTAGCGATGAGGTTCTTTATCAAATTACAAGGGGTTACTGTGAGAAGAAGGATTTTGAAGATTTGCTGAGTTTCTTCTTTGAAATTAAAAGTCCCCCAAATGTTTCTTCTGGCAACAAAATCATTTATTCTCTTTGCAAAGATTTTGGCTCTGAAAGTGCATACTTGTATCTACGAGAACTTGAGCATACAGGCTTCAAGCCTGATGAAATAACCTTTGGGATTTTGATTTGTTGGAGCTGTCGTGAGGGAAATCTTCGAAAAGCTTTTATTTATATGTCTGAGTTATTGTTTAGTGGCCTAAAGCCAGATTTACATTCATATAATGCTCTCATCAGCGGGATGTTGAAGGAGGGCCTCTGGGAGAATGCCCAAGGCGTTCTTGCTGAAATGGTAGATCAGGGGATTGAACCTAATTTATCAACCTTCAGAATTATTTTAGCAGGCTATTGCAAAGCTAGACAATTTGAAGAAGCAAAAAAGACCGTTCT
TGAAATGGAAAGATGTGGTTTTATTCAACTTTCTTCAGTAGATGATCTATTGTGCAGAATATTCTCTTTCTTGGGGTTTAATGACTCAGCAGTGAGGTTGAAAAGAGACAGCAATACTGGTGTTTCTAAAACAGAGTTCTTCGATACCCTTGGAAATGGGCTTTATCTGGACACTGATGTGGATGAATATGAGAAAAGGCTTACTGAAATTCTCAAAGAGTCTGTAGTACCTGATTTTAATTTGCTTATAATCGAGGAGTGTAAAAACAGAGACCCTAAGGCTGTAGTAGGATTGGCAGCTGAAATGGATCGATGGGGGCAAGAACTAACTTCAGTAGGTTTGATGGGTTTCTTGAAAAGACATTGTACATTGAATTCCAGAATCAAGCCTATCATTGATGTTTGGGAGAGAAGGCCATATATGATTGCTCAACTAGGAGCAGACACTTTAAATTTACTTGTGCAAGCATATAGCAAATGCAGGTTGACTTCTAGTGGAATTGGAATACTAAATGAAATGAGCCAAATGCATGTTGTAATAGAGAAAGAAACATACAGTGTTCTGATAAATAGTTTGTGCAAAACAGGAAACTTAAATGATCTTCTTGGTTGTTGGGATAGAGCCCGAAAAGATGGTTGGGTTCCAGGGTTGCATGACTGTAAATTACTTATCAGTTGCCTTTGCAAGAAAAGAAAACTCAAAGAAGTTTTCTCCCTCCTCCAAACCATGCTGGTGTCTTATCCACATTCAAGGTTGGATATACTTAATATCTTCCTTGAAAGGCTTTCAGAAGCAGGGTTCGCTGCAATTGGACAAGTATTGGCAAAAGAGCTTATGGCTCTTGGATTTTACTTGGATCAAAAGGCATATGAACTTCTTATCATAGGATTGTGTAAAGAGAACAATATTTCAATAGCGATTAATTTATTGGATGATATAATGGCTATGAGTATGGTTCCGTGCATTGATGTTTGTCTTCTGGTAATTCCTATATTATGTAAGGTTGGTAGATATGAAACCGCAATTGCATTAAAAGAGATTGGAACTACCAAGCTATCGTCTTCTTCGATTAGAGTGTTTGGTGCACTAATGAAAGGTTTCTTTATGATGGGAAAGGTTAGAGAAACCTTGCCTCTAGTTCAGGATATGTTGTCTAAAGGTATTTCTCTAGATGCTGAGATTTATAATAATCTGGTTCAAGGGCATTGCAAAGTGAAAAACTTGGATAAAGTGCGGGAGCTACTGGGCATTATTGTTAGGAAGGACTTGAGCCTTTCGATATCAAGTTACAAGAAATTAGTGTGTTTGATGTGTATGGAAGGAAGAAGTCTCCAGGCATTGCATCTAAAGGACCTCATGCTTAGAAACAGCAAATCTCATGACTGTGTTATCTATAACATTCTGATTTTTTACATTTTTCAAAGCGGGAACAGTTTGCTTGTGCCAAAAATTTTGGACGAACTATTGTACAAGAGGAAATTGTTACCTGATAACATGACCTATGATTTTCTAGTATATGGATTTTCCAAGTGCAAAAACTTTTCCAGTTCCACATTATATCTCTTTACCATGATCCAACAGGAGTTTCGTCCCAGCAATCGGAGCTTGAATACTGTAATAAGCTACCTTTGTAATACTGGACAGCTCGGAAAAGCGTTGGATCTGAGCCGGAAGATGGAATCTAGGGGATGGATACATAGTTCAGCTGTACAGAATGCAATAACAGAGTGTCTCATTGCAAATGGTAAGCTTCGAGAAGCAGAATGTTTTTTAAATAGAATGGTTGAGAAGAGTCTCATTCCTGAACATGTAGATTACAACAACATAATCAAGCAATTTTGTCAAAGTGGAAGATGGTTGAATGCAATCAATCTTATAAACTTAATGCTTGAGAAAGGAAACATCCCAAATGCTACAAGTTATGATTTTGTCATTCAATGTTGCTGTACCTACAAGAAGTTGGAAGAAGCAGTAGATTTCCATACCGAGATGTTGGACC
GGCATCTAAAGCCGAGCATCAGGACGTGGGATAAACTTGTTTCTTTATTATGCAGAGAAGGGCAGACAAAAGAAGCAGAAAGGGTTTTGATAAGCATGACAGAGATGGGTGAAAAACCAAGCAAAGATGCATATTGCTCCATGCTGGACAAATACCGCTATGAGAATGATCTTGAAAAGGCATCAGAGACAATGAGAGCAATGCAAGAAAGTGGTTACGAGTTGGATTTTGAGAGACAGTGGTCTCTCATAAGCAAACTAAACGACACCAATCTCAAGGACGGCAACAACAATAATAGTAACAAAGGTTTTCTCTCGGGACTTCTTTCCAAGAGTGGATTTTCTCGGGCATTGATTCCTTAGCAAAGCCAAGATAGAGATTAGCAAAGGTTAGCTTTGTTCATTTTTGTTACATCAATAGCTTTCTCTCTTGTCGTCTAGAATAAATTGTTCAATAAAATTTTACTTTAGCAATGCTAAACCTTTGTCATCAACGATAGGACCGATTTACGATCTACCTTGAAAATCAAATGTAGTAGGATTAGGTGGTTGTCTTGCAAGAATAATAAGGTATGTGCAAGTTGACTTGATTTGACATTGACGGATGTAAAAAAAAGGTGGGGGGGGAGGAAACTTGTGGTGGACAAATAGTGTGTCAAACGTAAAAACCCCTGAAAAAGATAGTTATTAGTCAAAATAATTTAATTGAGAATGTAGATGTTTCAAAAGAAAACTTGAACCTTACTACTCATAAGAATATCTAAGGTCATGTATTTGAAGTTAAGAATAATATTTTAAAAAAAAAGATGAACCGAAATGATATAGTTGTTGACGATTTTTTTGCTCATAATATTATTTCTGTAAAAGTTAATTATCAATCAAAATTTGTCGAAGAATCTTGATACAGAAAGAATTGGCTTAAGTAAAAAGAAGCAATTAAGGTAGAATTACACTCTTTTTCCAAACATGAGATTTTTAAACCATTAGTCTAGATACTAAAAGATGTCAATTCTCTAAGATACACATGAGTATTTGGAAGGAGGAAAAAATGAAATAATGAACCATATGATATAAT

Coding sequence (CDS)

ATGATAAGAGTTCTGTGTAATTACCTGCTTCAAATCCACCAGCTTCGTTGGTCACCATCTCTCACTTTATTCATACCTAGAAAGTTCTTTTTATATGTTCAATCCCCAGTAGTTCTGAGATGCCGAAATAAGTGTACCACCATAAATTTATCTTCCATTGACTGCTCTGGCATTGCACAATCTCTCATATCAAGGTGTTCAGTTTTGCTTGAGAAGGAAGGGTATGCCTCAGCATTACCTAATCCTTCTTTTAAGGAATTTTTATTGGAGATCTCTGATGTTGTACCGGAGTATGTGCGCAGAATTAGGCGAGTTGCAGAGTTAAAGCCTGAGGATGTGCTTAAATTGTTTCTTGGGTTTCAATCAGCGGTTGGGAATAATGGGATTCAAATTAAGAAAGTTGAGTGTTTGTGGAGAATTTTGAAATTCACTAATGAAAGTAGTAGGAACTTCAAGCATATACCAAGGTCGTGTGAGATTATGGCCTCTCTTCTCATTCGAGTTGGGAAGTTTAAGGAAGTCGAGCACTTTCTTTCTGAGATGGAGAGTCAAGGAATTTTACTGGATAATCCTGAAGTTTTCAGTTGTTTAATTCAGGGTTTTGTGTGTGAAGGTAATCTAGAAAGGGCTGTTTCGATATACGAAAAAGTGAGGCAGCGATGTATATCTCCATCATTGTCATGTTATCATGTTCTGCTTGATTCTTTGGTTCGGATGAAGAAAACACAAGTAGCACTTGGAGTATGTATGGATATGGTGGAGATGGGATTTGGTTTGGGGGATGAAGAGAAGGCTGTTTTTGATAATGTCATTGGACTACTTTGTTGGCAGGGAAAGGTTCTTGAAGCTAGGAACCTTGTGAAGAAGTTTGTGGCTTTGGACTTTAGGCCTAGCGATGAGGTTCTTTATCAAATTACAAGGGGTTACTGTGAGAAGAAGGATTTTGAAGATTTGCTGAGTTTCTTCTTTGAAATTAAAAGTCCCCCAAATGTTTCTTCTGGCAACAAAATCATTTATTCTCTTTGCAAAGATTTTGGCTCTGAAAGTGCATACTTGTATCTACGAGAACTTGAGCATACAGGCTTCAAGCCTGATGAAATAACCTTTGGGATTTTGATTTGTTGGAGCTGTCGTGAGGGAAATCTTCGAAAAGCTTTTATTTATATGTCTGAGTTATTGTTTAGTGGCCTAAAGCCAGATTTACATTCATATAATGCTCTCATCAGCGGGATGTTGAAGGAGGGCCTCTGGGAGAATGCCCAAGGCGTTCTTGCTGAAATGGTAGATCAGGGGATTGAACCTAATTTATCAACCTTCAGAATTATTTTAGCAGGCTATTGCAAAGCTAGACAATTTGAAGAAGCAAAAAAGACCGTTCTTGAAATGGAAAGATGTGGTTTTATTCAACTTTCTTCAGTAGATGATCTATTGTGCAGAATATTCTCTTTCTTGGGGTTTAATGACTCAGCAGTGAGGTTGAAAAGAGACAGCAATACTGGTGTTTCTAAAACAGAGTTCTTCGATACCCTTGGAAATGGGCTTTATCTGGACACTGATGTGGATGAATATGAGAAAAGGCTTACTGAAATTCTCAAAGAGTCTGTAGTACCTGATTTTAATTTGCTTATAATCGAGGAGTGTAAAAACAGAGACCCTAAGGCTGTAGTAGGATTGGCAGCTGAAATGGATCGATGGGGGCAAGAACTAACTTCAGTAGGTTTGATGGGTTTCTTGAAAAGACATTGTACATTGAATTCCAGAATCAAGCCTATCATTGATGTTTGGGAGAGAAGGCCATATATGATTGCTCAACTAGGAGCAGACACTTTAAATTTACTTGTGCAAGCATATAGCAAATGCAGGTTGACTTCTAGTGGAATTGGAATACTAAATGAAATGAGCCAAATGCATGTTGTAATAGAGAAAGAAACATACAGTGTTCTGATAAATAGTTTGTGCAAAACAGGAAACTTAAATGATCTTCTTGGTTGTTGGGATAG
AGCCCGAAAAGATGGTTGGGTTCCAGGGTTGCATGACTGTAAATTACTTATCAGTTGCCTTTGCAAGAAAAGAAAACTCAAAGAAGTTTTCTCCCTCCTCCAAACCATGCTGGTGTCTTATCCACATTCAAGGTTGGATATACTTAATATCTTCCTTGAAAGGCTTTCAGAAGCAGGGTTCGCTGCAATTGGACAAGTATTGGCAAAAGAGCTTATGGCTCTTGGATTTTACTTGGATCAAAAGGCATATGAACTTCTTATCATAGGATTGTGTAAAGAGAACAATATTTCAATAGCGATTAATTTATTGGATGATATAATGGCTATGAGTATGGTTCCGTGCATTGATGTTTGTCTTCTGGTAATTCCTATATTATGTAAGGTTGGTAGATATGAAACCGCAATTGCATTAAAAGAGATTGGAACTACCAAGCTATCGTCTTCTTCGATTAGAGTGTTTGGTGCACTAATGAAAGGTTTCTTTATGATGGGAAAGGTTAGAGAAACCTTGCCTCTAGTTCAGGATATGTTGTCTAAAGGTATTTCTCTAGATGCTGAGATTTATAATAATCTGGTTCAAGGGCATTGCAAAGTGAAAAACTTGGATAAAGTGCGGGAGCTACTGGGCATTATTGTTAGGAAGGACTTGAGCCTTTCGATATCAAGTTACAAGAAATTAGTGTGTTTGATGTGTATGGAAGGAAGAAGTCTCCAGGCATTGCATCTAAAGGACCTCATGCTTAGAAACAGCAAATCTCATGACTGTGTTATCTATAACATTCTGATTTTTTACATTTTTCAAAGCGGGAACAGTTTGCTTGTGCCAAAAATTTTGGACGAACTATTGTACAAGAGGAAATTGTTACCTGATAACATGACCTATGATTTTCTAGTATATGGATTTTCCAAGTGCAAAAACTTTTCCAGTTCCACATTATATCTCTTTACCATGATCCAACAGGAGTTTCGTCCCAGCAATCGGAGCTTGAATACTGTAATAAGCTACCTTTGTAATACTGGACAGCTCGGAAAAGCGTTGGATCTGAGCCGGAAGATGGAATCTAGGGGATGGATACATAGTTCAGCTGTACAGAATGCAATAACAGAGTGTCTCATTGCAAATGGTAAGCTTCGAGAAGCAGAATGTTTTTTAAATAGAATGGTTGAGAAGAGTCTCATTCCTGAACATGTAGATTACAACAACATAATCAAGCAATTTTGTCAAAGTGGAAGATGGTTGAATGCAATCAATCTTATAAACTTAATGCTTGAGAAAGGAAACATCCCAAATGCTACAAGTTATGATTTTGTCATTCAATGTTGCTGTACCTACAAGAAGTTGGAAGAAGCAGTAGATTTCCATACCGAGATGTTGGACCGGCATCTAAAGCCGAGCATCAGGACGTGGGATAAACTTGTTTCTTTATTATGCAGAGAAGGGCAGACAAAAGAAGCAGAAAGGGTTTTGATAAGCATGACAGAGATGGGTGAAAAACCAAGCAAAGATGCATATTGCTCCATGCTGGACAAATACCGCTATGAGAATGATCTTGAAAAGGCATCAGAGACAATGAGAGCAATGCAAGAAAGTGGTTACGAGTTGGATTTTGAGAGACAGTGGTCTCTCATAAGCAAACTAAACGACACCAATCTCAAGGACGGCAACAACAATAATAGTAACAAAGGTTTTCTCTCGGGACTTCTTTCCAAGAGTGGATTTTCTCGGGCATTGATTCCTTAG

Protein sequence

MIRVLCNYLLQIHQLRWSPSLTLFIPRKFFLYVQSPVVLRCRNKCTTINLSSIDCSGIAQSLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLKLFLGFQSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVEHFLSEMESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDSLVRMKKTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDFRPSDEVLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHTGFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEGLWENAQGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDDLLCRIFSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESVVPDFNLLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPYMIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNLNDLLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLERLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCMEGRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKMESRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWLNAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLVSLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYELDFERQWSLISKLNDTNLKDGNNNNSNKGFLSGLLSKSGFSRALIP
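
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and not something the database provides.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0; stop codons are rendered as '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First eight and last three codons of the CDS listed above.
print(translate("ATGATAAGAGTTCTGTGTAATTAC"))
print(translate("ATTCCTTAG"))
```

The first call yields the opening residues of the protein sequence (MIRVLCNY), and the second shows the terminal residues followed by the stop codon, which is not part of the listed protein.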
Homology
BLAST of Bhi09G001310 vs. TAIR 10
Match: AT5G15280.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 938.3 bits (2424), Expect = 6.2e-273
Identity = 511/1188 (43.01%), Positives = 747/1188 (62.88%), Query Frame = 0

Query: 56   SGIAQSLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLK 115
            S I ++  S    LL +      L   S K+ L ++SDVVP   RR RR   LKPEDVL+
Sbjct: 48   SAIPRNYESSSFNLLSRSKEKRDLTGSSLKDLLFDLSDVVPNITRRFRRFPGLKPEDVLE 107

Query: 116  LFLGFQSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVE 175
            L LGF+S +   GI   KV+ LW I ++ +   + FKH+P++CEIMAS+LIR G  KEVE
Sbjct: 108  LSLGFESELQRGGIGNIKVQALWEIFRWASVQYQGFKHLPQACEIMASMLIREGMVKEVE 167

Query: 176  HFLSEMESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDS 235
              L EME  G  + N  +F  LI  +V + +  +AV +++ +R++ + P  SCY +L+D 
Sbjct: 168  LLLMEMERHGDTMVNEGIFCDLIGKYVDDFDSRKAVMLFDWMRRKGLVPLTSCYQILIDQ 227

Query: 236  LVRMKKTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDF 295
            LVR+ +T+ A  +C+D VE    L          VI LLC   KV EAR L +K VAL  
Sbjct: 228  LVRVHRTESAYRICLDWVETRAELNHMNIDSIGKVIELLCLDQKVQEARVLARKLVALGC 287

Query: 296  RPSDEVLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLR 355
              +  +  +IT GY EK+DFEDLLSF  E+K  P+V  GN+I++SLC+ FGSE AY+Y+ 
Sbjct: 288  ILNSSIYSKITIGYNEKQDFEDLLSFIGEVKYEPDVFVGNRILHSLCRRFGSERAYVYME 347

Query: 356  ELEHTGFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEG 415
            ELEH GFK DE+TFGILI W C EG++++A +Y+SE++  G KPD++SYNA++SG+ ++G
Sbjct: 348  ELEHLGFKQDEVTFGILIGWCCYEGDIKRAVLYLSEIMSKGYKPDVYSYNAILSGLFRKG 407

Query: 416  LWENAQGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDD 475
            LW++   +L EM + G+  +LSTF+I++ GYCKARQFEEAK+ V +M   G I+ S V+D
Sbjct: 408  LWQHTHCILDEMKENGMMLSLSTFKIMVTGYCKARQFEEAKRIVNKMFGYGLIEASKVED 467

Query: 476  LLCRIFSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESV 535
             L   FS +GF+  AVRLKRD+++  SK EFFD LGNGLYL TD+D YE+R+  +L  SV
Sbjct: 468  PLSEAFSLVGFDPLAVRLKRDNDSTFSKAEFFDDLGNGLYLHTDLDAYEQRVNMVLDRSV 527

Query: 536  VPDFNLLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVW 595
            +P+FN LI+   ++ D +  + L  EM RWGQ+L+       ++  C   + ++  I + 
Sbjct: 528  LPEFNSLIVRASEDGDLQTALRLLDEMARWGQKLSRRSFAVLMRSLCASRAHLRVSISLL 587

Query: 596  ERRPYMIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKT 655
            E+ P +  QL  +TLN LVQ Y K   +     I ++M QMH  I+  TY+ LI   CK 
Sbjct: 588  EKWPKLAYQLDGETLNFLVQEYCKKGFSRHSKLIFHKMVQMHHPIDNVTYTSLIRCFCKK 647

Query: 656  GNLNDLLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDIL 715
              LNDLL  W  A+ D W+P L+DC  L +CL +K  ++EV  L + + +SYP S+ +  
Sbjct: 648  ETLNDLLNVWGAAQNDNWLPDLNDCGDLWNCLVRKGLVEEVVQLFERVFISYPLSQSEAC 707

Query: 716  NIFLERLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMA 775
             IF+E+L+  GF+ I   + K L   G  ++Q+ Y  LI GLC E   S A  +LD+++ 
Sbjct: 708  RIFVEKLTVLGFSCIAHSVVKRLEGEGCIVEQEVYNHLIKGLCTEKKDSAAFAILDEMLD 767

Query: 776  MSMVPCIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRE 835
               +P +  CL++IP LC+  +  TA  L E    ++ SS +    AL+KG  + GK+ +
Sbjct: 768  KKHIPSLGSCLMLIPRLCRANKAGTAFNLAE----QIDSSYVHY--ALIKGLSLAGKMLD 827

Query: 836  TLPLVQDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVC 895
                ++ MLS G+S   +IYN + QG+CK  N  KV E+LG++VRK++  S+ SY++ V 
Sbjct: 828  AENQLRIMLSNGLSSYNKIYNVMFQGYCKGNNWMKVEEVLGLMVRKNIICSVKSYREYVR 887

Query: 896  LMCMEGRSLQALHLKD-LMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKL 955
             MC+E +SL A+ LK+ L+L  S     +IYN+LIFY+F++ N L V K+L E +  R +
Sbjct: 888  KMCLEPQSLSAISLKEFLLLGESNPGGVIIYNMLIFYMFRAKNHLEVNKVLLE-MQGRGV 947

Query: 956  LPDNMTYDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALD 1015
            LPD  T++FLV+G+S   ++SSS  YL  MI +  +P+NRSL  V S LC+ G + KALD
Sbjct: 948  LPDETTFNFLVHGYSSSADYSSSLRYLSAMISKGMKPNNRSLRAVTSSLCDNGDVKKALD 1007

Query: 1016 LSRKMESRGW-IHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQF 1075
            L + MES+GW + SS VQ  I E LI+ G++ +AE FL R+    ++    +Y+NIIK+ 
Sbjct: 1008 LWQVMESKGWNLGSSVVQTKIVETLISKGEIPKAEDFLTRVTRNGMMAP--NYDNIIKKL 1067

Query: 1076 CQSGRWLNAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSI 1135
               G    A++L+N ML+  +IP ++SYD VI     Y +L++A+DFHTEM++  L PSI
Sbjct: 1068 SDRGNLDIAVHLLNTMLKNQSIPGSSSYDSVINGLLRYNQLDKAMDFHTEMVELGLSPSI 1127

Query: 1136 RTWDKLVSLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRA 1195
             TW  LV   C   Q  E+ER++ SM  +GE PS++ + +++D++R E +  KASE M  
Sbjct: 1128 STWSGLVHKFCEACQVLESERLIKSMVGLGESPSQEMFKTVIDRFRVEKNTVKASEMMEM 1187

Query: 1196 MQESGYELDFERQWSLISKLNDTNLKDGNNNNSNKGFLSGLLSKSGFS 1242
            MQ+ GYE+DFE  WSLIS ++ +  K+     + +GFLS LLS +GF+
Sbjct: 1188 MQKCGYEVDFETHWSLISNMSSS--KEKKTTTAGEGFLSRLLSGNGFT 1224
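
The percentages in an HSP summary line follow directly from the match counts and the alignment length. As a rough sketch, the hypothetical helper `hsp_fractions` below pulls every `count/length` pair out of such a line and recomputes the percentage; it relies only on the line layout shown in this report.

```python
import re

# Matches fields of the form "<label> = <count>/<length>" in an HSP summary line.
FRACTION_RE = re.compile(r"(\w+) = (\d+)/(\d+)")

def hsp_fractions(line):
    """Return (count, length, percent) for each fraction, in order of appearance."""
    results = []
    for _label, count, length in FRACTION_RE.findall(line):
        count, length = int(count), int(length)
        results.append((count, length, round(100 * count / length, 2)))
    return results

summary = "Identity = 511/1188 (43.01%), Positives = 747/1188 (62.88%), Query Frame = 0"
for count, length, percent in hsp_fractions(summary):
    print(f"{count}/{length} = {percent}%")
```

For the AT5G15280.1 hit this reproduces the reported 43.01% identity and 62.88% similarity over the 1188-column alignment.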

BLAST of Bhi09G001310 vs. TAIR 10
Match: AT1G12620.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 163.7 bits (413), Expect = 9.5e-40
Identity = 130/550 (23.64%), Positives = 243/550 (44.18%), Query Frame = 0

Query: 675  PGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE-----RLSEAGFAA 734
            P L D   L S + + ++   V  L + M +      L  L+I +      R     F+A
Sbjct: 70   PRLIDFSRLFSVVARTKQYDLVLDLCKQMELKGIAHNLYTLSIMINCCCRCRKLSLAFSA 129

Query: 735  IGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVI 794
            +G+++      LG+  D   +  LI GLC E  +S A+ L+D ++ M   P +     ++
Sbjct: 130  MGKII-----KLGYEPDTVTFSTLINGLCLEGRVSEALELVDRMVEMGHKPTLITLNALV 189

Query: 795  PILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGIS 854
              LC  G+   A+ L +         +   +G ++K     G+    + L++ M  + I 
Sbjct: 190  NGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKMEERKIK 249

Query: 855  LDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCMEGRSLQALHL 914
            LDA  Y+ ++ G CK  +LD    L   +  K     I  Y  L+   C  GR      L
Sbjct: 250  LDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKL 309

Query: 915  KDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFS 974
               M++   + D V ++ LI    + G      ++  E++ +R + PD +TY  L+ GF 
Sbjct: 310  LRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMI-QRGISPDTVTYTSLIDGFC 369

Query: 975  KCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKMESRGWIHSSA 1034
            K      +   L  M+ +   P+ R+ N +I+  C    +   L+L RKM  RG +  + 
Sbjct: 370  KENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTV 429

Query: 1035 VQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWLNAINLINLM 1094
              N + +     GKL  A+     MV + + P+ V Y  ++   C +G    A+ +   +
Sbjct: 430  TYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIFEKI 489

Query: 1095 LEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLVSLLCREGQT 1154
             +     +   Y+ +I   C   K+++A D    +  + +KP ++T++ ++  LC++G  
Sbjct: 490  EKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKKGSL 549

Query: 1155 KEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYELDFERQWSL 1214
             EA+ +   M E G  P+   Y  ++  +  E D  K+++ +  ++  G+ +D      +
Sbjct: 550  SEADLLFRKMEEDGHSPNGCTYNILIRAHLGEGDATKSAKLIEEIKRCGFSVDASTVKMV 609

Query: 1215 ISKLNDTNLK 1220
            +  L+D  LK
Sbjct: 610  VDMLSDGRLK 613

BLAST of Bhi09G001310 vs. TAIR 10
Match: AT2G17140.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 161.8 bits (408), Expect = 3.6e-39
Identity = 153/694 (22.05%), Positives = 293/694 (42.22%), Query Frame = 0

Query: 539  FNLLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERR 598
            +NLL+    K R  + V  L  +M   G    +      ++  C  +S +    ++++  
Sbjct: 115  YNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCD-SSCVDAARELFDEM 174

Query: 599  PYMIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNL 658
            P    +    T  +LV+ Y K  LT  G+ +LN M    V+  K  Y+ +++S C+ G  
Sbjct: 175  PEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCREGRN 234

Query: 659  NDLLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTM----LVSYPHSRLDI 718
            +D     ++ R++G VP +      IS LCK+ K+ +   +   M     +  P      
Sbjct: 235  DDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSIT 294

Query: 719  LNIFLERLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIM 778
             N+ L+   + G     + L + +         ++Y + + GL +      A  +L  + 
Sbjct: 295  YNLMLKGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEAETVLKQMT 354

Query: 779  AMSMVPCIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVR 838
               + P I    +++  LCK+G    A  +  +            +G L+ G+  +GKV 
Sbjct: 355  DKGIGPSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVD 414

Query: 839  ETLPLVQDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLV 898
                L+Q+M+      +A   N L+    K+  + +  ELL  +  K   L   +   +V
Sbjct: 415  AAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIV 474

Query: 899  CLMCMEGRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKL 958
              +C  G   +A+ +    ++  + H       L       GNS +   ++D+ L +   
Sbjct: 475  DGLCGSGELDKAIEI----VKGMRVHGSAALGNL-------GNSYI--GLVDDSLIENNC 534

Query: 959  LPDNMTYDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALD 1018
            LPD +TY  L+ G  K   F+ +      M+ ++ +P + + N  I + C  G++  A  
Sbjct: 535  LPDLITYSTLLNGLCKAGRFAEAKNLFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFR 594

Query: 1019 LSRKMESRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFC 1078
            + + ME +G   S    N++   L    ++ E    ++ M EK + P    YN  I+  C
Sbjct: 595  VLKDMEKKGCHKSLETYNSLILGLGIKNQIFEIHGLMDEMKEKGISPNICTYNTAIQYLC 654

Query: 1079 QSGRWLNAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVD-FHTEMLDRHLKPSI 1138
            +  +  +A NL++ M++K   PN  S+ ++I+  C     + A + F T +     K  +
Sbjct: 655  EGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCKVPDFDMAQEVFETAVSICGQKEGL 714

Query: 1139 RTWDKLVSLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRA 1198
              +  + + L   GQ  +A  +L ++ + G +     Y  +++    +++LE AS  +  
Sbjct: 715  --YSLMFNELLAAGQLLKATELLEAVLDRGFELGTFLYKDLVESLCKKDELEVASGILHK 774

Query: 1199 MQESGYELDFERQWSLISKLNDTNLKDGNNNNSN 1228
            M + GY  D      +I  L     K GN   +N
Sbjct: 775  MIDRGYGFDPAALMPVIDGLG----KMGNKKEAN 788

BLAST of Bhi09G001310 vs. TAIR 10
Match: AT1G12300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 161.4 bits (407), Expect = 4.7e-39
Identity = 126/538 (23.42%), Positives = 245/538 (45.54%), Query Frame = 0

Query: 683  LISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE-RLSEAGFAAIGQVLAKELMAL 742
            L+  LCK+ +LK +   L T+ +        ++N F   R     F+A+G+++      L
Sbjct: 106  LVLALCKQMELKGIAHNLYTLSI--------MINCFCRCRKLCLAFSAMGKII-----KL 165

Query: 743  GFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVIPILCKVGRYETA 802
            G+  +   +  LI GLC E  +S A+ L+D ++ M   P +     ++  LC  G+   A
Sbjct: 166  GYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEA 225

Query: 803  IALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGISLDAEIYNNLVQG 862
            + L +         +   +G ++      G+    + L++ M  + I LDA  Y+ ++ G
Sbjct: 226  MLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDG 285

Query: 863  HCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCMEGRSLQALHLKDLMLRNSKSHD 922
             CK  +LD    L   +  K ++ +I +Y  L+   C  GR      L   M++   + +
Sbjct: 286  LCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPN 345

Query: 923  CVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFSKCKNFSSSTLYL 982
             V +++LI    + G      ++  E+++ R + PD +TY  L+ GF K  +   +   +
Sbjct: 346  VVTFSVLIDSFVKEGKLREAEELHKEMIH-RGIAPDTITYTSLIDGFCKENHLDKANQMV 405

Query: 983  FTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKMESRGWIHSSAVQNAITECLIAN 1042
              M+ +   P+ R+ N +I+  C   ++   L+L RKM  RG +  +   N + +     
Sbjct: 406  DLMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCEL 465

Query: 1043 GKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWLNAINLINLMLEKGNIPNATSY 1102
            GKL  A+     MV + + P  V Y  ++   C +G    A+ +   + +     +   Y
Sbjct: 466  GKLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGESEKALEIFEKIEKSKMELDIGIY 525

Query: 1103 DFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLVSLLCREGQTKEAERVLISMTE 1162
            + +I   C   K+++A D    +  + +KP ++T++ ++  LC++G   EAE +   M E
Sbjct: 526  NIIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKMEE 585

Query: 1163 MGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYELDFERQWSLISKLNDTNLK 1220
             G  P    Y  ++  +  + D  K+ + +  ++  G+ +D      +I  L+D  LK
Sbjct: 586  DGHAPDGWTYNILIRAHLGDGDATKSVKLIEELKRCGFSVDASTIKMVIDMLSDGRLK 629

BLAST of Bhi09G001310 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 158.3 bits (399), Expect = 4.0e-38
Identity = 191/917 (20.83%), Positives = 357/917 (38.93%), Query Frame = 0

Query: 329  PNVSSGNKIIYSLCKDFGSESAYLYLRELEHTGFKPDEITFGILICWSCREGNLRKAFIY 388
            P+V + N I+ S+ K     S + +L+E+      PD  TF ILI   C EG+  K+   
Sbjct: 196  PSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYL 255

Query: 389  MSELLFSGLKPDLHSYNALISGMLKEGLWENAQGVLAEMVDQGIEPNLSTFRIILAGYCK 448
            M ++  SG  P + +YN ++    K+G ++ A  +L  M  +G++ ++ T+ +++   C+
Sbjct: 256  MQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCR 315

Query: 449  ARQFEEAKKTVLEMERCGFIQLSSVDDLLCRIFSFLGFNDSAVRLKRDSNTGVSKTEFFD 508
            + +  +                            +L   D   R+   +         ++
Sbjct: 316  SNRIAK---------------------------GYLLLRDMRKRMIHPNEV------TYN 375

Query: 509  TLGNGLYLDTDVDEYEKRLTEILKESVVPD---FNLLIIEECKNRDPKAVVGLAAEMDRW 568
            TL NG   +  V    + L E+L   + P+   FN LI       + K  + +   M+  
Sbjct: 376  TLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAK 435

Query: 569  GQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPYMIAQLGADTLNLLVQAYSKCRLTSS 628
            G   + V     L   C  N+        + R       +G  T   ++    K      
Sbjct: 436  GLTPSEVSYGVLLDGLCK-NAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDE 495

Query: 629  GIGILNEMSQMHVVIEKETYSVLINSLCKTGNLNDLLGCWDRARKDGWVP-GLHDCKLLI 688
             + +LNEMS+  +  +  TYS LIN  CK G          R  + G  P G+    L+ 
Sbjct: 496  AVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIY 555

Query: 689  SCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLERLSEAGFAAIGQVLAKELMALGFY 748
            +C C+   LKE   + + M++          N+ +  L +AG  A  +   + + + G  
Sbjct: 556  NC-CRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGIL 615

Query: 749  LDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVIPILCKVGRY-ETAIA 808
             +  +++ LI G         A ++ D++  +   P       ++  LCK G   E    
Sbjct: 616  PNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKF 675

Query: 809  LKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGISLDAEIYNNLVQGHC 868
            LK +     +  ++ ++  L+      G + + + L  +M+ + I  D+  Y +L+ G C
Sbjct: 676  LKSLHAVPAAVDTV-MYNTLLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLC 735

Query: 869  KVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCM------EGRSLQALHLKDLMLRNS 928
            +     K + ++ I+  K+     +     V   C        G+    ++ ++ M    
Sbjct: 736  R-----KGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLG 795

Query: 929  KSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFSKCKNFSSS 988
             + D V  N +I    + G       +L E +  +   P+  TY+ L++G+SK K+ S+S
Sbjct: 796  HTPDIVTTNAMIDGYSRMGKIEKTNDLLPE-MGNQNGGPNLTTYNILLHGYSKRKDVSTS 855

Query: 989  TLYLFTMIQQEFRP-----------------------------------SNRSLNTVISY 1048
             L   ++I     P                                      + N +IS 
Sbjct: 856  FLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGVEVDRYTFNMLISK 915

Query: 1049 LCNTGQLGKALDLSRKMESRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPE 1108
             C  G++  A DL + M S G        +A+   L  N + +E+   L+ M ++ + PE
Sbjct: 916  CCANGEINWAFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESRMVLHEMSKQGISPE 975

Query: 1109 HVDYNNIIKQFCQSGRWLNAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHT 1168
               Y  +I   C+ G    A  +   M+     P   +   +++      K +EA     
Sbjct: 976  SRKYIGLINGLCRVGDIKTAFVVKEEMIAHKICPPNVAESAMVRALAKCGKADEATLLLR 1035

Query: 1169 EMLDRHLKPSIRTWDKLVSLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYEN 1200
             ML   L P+I ++  L+ L C+ G   EA  + + M+  G K    +Y  ++     + 
Sbjct: 1036 FMLKMKLVPTIASFTTLMHLCCKNGNVIEALELRVVMSNCGLKLDLVSYNVLITGLCAKG 1070

BLAST of Bhi09G001310 vs. ExPASy Swiss-Prot
Match: Q9LXF4 (Pentatricopeptide repeat-containing protein At5g15280, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g15280 PE=1 SV=1)

HSP 1 Score: 938.3 bits (2424), Expect = 8.7e-272
Identity = 511/1188 (43.01%), Positives = 747/1188 (62.88%), Query Frame = 0

Query: 56   SGIAQSLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLK 115
            S I ++  S    LL +      L   S K+ L ++SDVVP   RR RR   LKPEDVL+
Sbjct: 48   SAIPRNYESSSFNLLSRSKEKRDLTGSSLKDLLFDLSDVVPNITRRFRRFPGLKPEDVLE 107

Query: 116  LFLGFQSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVE 175
            L LGF+S +   GI   KV+ LW I ++ +   + FKH+P++CEIMAS+LIR G  KEVE
Sbjct: 108  LSLGFESELQRGGIGNIKVQALWEIFRWASVQYQGFKHLPQACEIMASMLIREGMVKEVE 167

Query: 176  HFLSEMESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDS 235
              L EME  G  + N  +F  LI  +V + +  +AV +++ +R++ + P  SCY +L+D 
Sbjct: 168  LLLMEMERHGDTMVNEGIFCDLIGKYVDDFDSRKAVMLFDWMRRKGLVPLTSCYQILIDQ 227

Query: 236  LVRMKKTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDF 295
            LVR+ +T+ A  +C+D VE    L          VI LLC   KV EAR L +K VAL  
Sbjct: 228  LVRVHRTESAYRICLDWVETRAELNHMNIDSIGKVIELLCLDQKVQEARVLARKLVALGC 287

Query: 296  RPSDEVLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLR 355
              +  +  +IT GY EK+DFEDLLSF  E+K  P+V  GN+I++SLC+ FGSE AY+Y+ 
Sbjct: 288  ILNSSIYSKITIGYNEKQDFEDLLSFIGEVKYEPDVFVGNRILHSLCRRFGSERAYVYME 347

Query: 356  ELEHTGFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEG 415
            ELEH GFK DE+TFGILI W C EG++++A +Y+SE++  G KPD++SYNA++SG+ ++G
Sbjct: 348  ELEHLGFKQDEVTFGILIGWCCYEGDIKRAVLYLSEIMSKGYKPDVYSYNAILSGLFRKG 407

Query: 416  LWENAQGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDD 475
            LW++   +L EM + G+  +LSTF+I++ GYCKARQFEEAK+ V +M   G I+ S V+D
Sbjct: 408  LWQHTHCILDEMKENGMMLSLSTFKIMVTGYCKARQFEEAKRIVNKMFGYGLIEASKVED 467

Query: 476  LLCRIFSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESV 535
             L   FS +GF+  AVRLKRD+++  SK EFFD LGNGLYL TD+D YE+R+  +L  SV
Sbjct: 468  PLSEAFSLVGFDPLAVRLKRDNDSTFSKAEFFDDLGNGLYLHTDLDAYEQRVNMVLDRSV 527

Query: 536  VPDFNLLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVW 595
            +P+FN LI+   ++ D +  + L  EM RWGQ+L+       ++  C   + ++  I + 
Sbjct: 528  LPEFNSLIVRASEDGDLQTALRLLDEMARWGQKLSRRSFAVLMRSLCASRAHLRVSISLL 587

Query: 596  ERRPYMIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKT 655
            E+ P +  QL  +TLN LVQ Y K   +     I ++M QMH  I+  TY+ LI   CK 
Sbjct: 588  EKWPKLAYQLDGETLNFLVQEYCKKGFSRHSKLIFHKMVQMHHPIDNVTYTSLIRCFCKK 647

Query: 656  GNLNDLLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDIL 715
              LNDLL  W  A+ D W+P L+DC  L +CL +K  ++EV  L + + +SYP S+ +  
Sbjct: 648  ETLNDLLNVWGAAQNDNWLPDLNDCGDLWNCLVRKGLVEEVVQLFERVFISYPLSQSEAC 707

Query: 716  NIFLERLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMA 775
             IF+E+L+  GF+ I   + K L   G  ++Q+ Y  LI GLC E   S A  +LD+++ 
Sbjct: 708  RIFVEKLTVLGFSCIAHSVVKRLEGEGCIVEQEVYNHLIKGLCTEKKDSAAFAILDEMLD 767

Query: 776  MSMVPCIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRE 835
               +P +  CL++IP LC+  +  TA  L E    ++ SS +    AL+KG  + GK+ +
Sbjct: 768  KKHIPSLGSCLMLIPRLCRANKAGTAFNLAE----QIDSSYVHY--ALIKGLSLAGKMLD 827

Query: 836  TLPLVQDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVC 895
                ++ MLS G+S   +IYN + QG+CK  N  KV E+LG++VRK++  S+ SY++ V 
Sbjct: 828  AENQLRIMLSNGLSSYNKIYNVMFQGYCKGNNWMKVEEVLGLMVRKNIICSVKSYREYVR 887

Query: 896  LMCMEGRSLQALHLKD-LMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKL 955
             MC+E +SL A+ LK+ L+L  S     +IYN+LIFY+F++ N L V K+L E +  R +
Sbjct: 888  KMCLEPQSLSAISLKEFLLLGESNPGGVIIYNMLIFYMFRAKNHLEVNKVLLE-MQGRGV 947

Query: 956  LPDNMTYDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALD 1015
            LPD  T++FLV+G+S   ++SSS  YL  MI +  +P+NRSL  V S LC+ G + KALD
Sbjct: 948  LPDETTFNFLVHGYSSSADYSSSLRYLSAMISKGMKPNNRSLRAVTSSLCDNGDVKKALD 1007

Query: 1016 LSRKMESRGW-IHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQF 1075
            L + MES+GW + SS VQ  I E LI+ G++ +AE FL R+    ++    +Y+NIIK+ 
Sbjct: 1008 LWQVMESKGWNLGSSVVQTKIVETLISKGEIPKAEDFLTRVTRNGMMAP--NYDNIIKKL 1067

Query: 1076 CQSGRWLNAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSI 1135
               G    A++L+N ML+  +IP ++SYD VI     Y +L++A+DFHTEM++  L PSI
Sbjct: 1068 SDRGNLDIAVHLLNTMLKNQSIPGSSSYDSVINGLLRYNQLDKAMDFHTEMVELGLSPSI 1127

Query: 1136 RTWDKLVSLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRA 1195
             TW  LV   C   Q  E+ER++ SM  +GE PS++ + +++D++R E +  KASE M  
Sbjct: 1128 STWSGLVHKFCEACQVLESERLIKSMVGLGESPSQEMFKTVIDRFRVEKNTVKASEMMEM 1187

Query: 1196 MQESGYELDFERQWSLISKLNDTNLKDGNNNNSNKGFLSGLLSKSGFS 1242
            MQ+ GYE+DFE  WSLIS ++ +  K+     + +GFLS LLS +GF+
Sbjct: 1188 MQKCGYEVDFETHWSLISNMSSS--KEKKTTTAGEGFLSRLLSGNGFT 1224

BLAST of Bhi09G001310 vs. ExPASy Swiss-Prot
Match: Q9ASZ8 (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana OX=3702 GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 1.3e-38
Identity = 130/550 (23.64%), Positives = 243/550 (44.18%), Query Frame = 0

Query: 675  PGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE-----RLSEAGFAA 734
            P L D   L S + + ++   V  L + M +      L  L+I +      R     F+A
Sbjct: 70   PRLIDFSRLFSVVARTKQYDLVLDLCKQMELKGIAHNLYTLSIMINCCCRCRKLSLAFSA 129

Query: 735  IGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVI 794
            +G+++      LG+  D   +  LI GLC E  +S A+ L+D ++ M   P +     ++
Sbjct: 130  MGKII-----KLGYEPDTVTFSTLINGLCLEGRVSEALELVDRMVEMGHKPTLITLNALV 189

Query: 795  PILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGIS 854
              LC  G+   A+ L +         +   +G ++K     G+    + L++ M  + I 
Sbjct: 190  NGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKMEERKIK 249

Query: 855  LDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCMEGRSLQALHL 914
            LDA  Y+ ++ G CK  +LD    L   +  K     I  Y  L+   C  GR      L
Sbjct: 250  LDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKL 309

Query: 915  KDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFS 974
               M++   + D V ++ LI    + G      ++  E++ +R + PD +TY  L+ GF 
Sbjct: 310  LRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMI-QRGISPDTVTYTSLIDGFC 369

Query: 975  KCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKMESRGWIHSSA 1034
            K      +   L  M+ +   P+ R+ N +I+  C    +   L+L RKM  RG +  + 
Sbjct: 370  KENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTV 429

Query: 1035 VQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWLNAINLINLM 1094
              N + +     GKL  A+     MV + + P+ V Y  ++   C +G    A+ +   +
Sbjct: 430  TYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIFEKI 489

Query: 1095 LEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLVSLLCREGQT 1154
             +     +   Y+ +I   C   K+++A D    +  + +KP ++T++ ++  LC++G  
Sbjct: 490  EKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKKGSL 549

Query: 1155 KEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYELDFERQWSL 1214
             EA+ +   M E G  P+   Y  ++  +  E D  K+++ +  ++  G+ +D      +
Sbjct: 550  SEADLLFRKMEEDGHSPNGCTYNILIRAHLGEGDATKSAKLIEEIKRCGFSVDASTVKMV 609

Query: 1215 ISKLNDTNLK 1220
            +  L+D  LK
Sbjct: 610  VDMLSDGRLK 613

BLAST of Bhi09G001310 vs. ExPASy Swiss-Prot
Match: Q0WPZ6 (Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX=3702 GN=At2g17140 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 5.1e-38
Identity = 153/694 (22.05%), Positives = 293/694 (42.22%), Query Frame = 0

Query: 539  FNLLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERR 598
            +NLL+    K R  + V  L  +M   G    +      ++  C  +S +    ++++  
Sbjct: 115  YNLLLESCIKERRVEFVSWLYKDMVLCGIAPQTYTFNLLIRALCD-SSCVDAARELFDEM 174

Query: 599  PYMIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNL 658
            P    +    T  +LV+ Y K  LT  G+ +LN M    V+  K  Y+ +++S C+ G  
Sbjct: 175  PEKGCKPNEFTFGILVRGYCKAGLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCREGRN 234

Query: 659  NDLLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTM----LVSYPHSRLDI 718
            +D     ++ R++G VP +      IS LCK+ K+ +   +   M     +  P      
Sbjct: 235  DDSEKMVEKMREEGLVPDIVTFNSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSIT 294

Query: 719  LNIFLERLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIM 778
             N+ L+   + G     + L + +         ++Y + + GL +      A  +L  + 
Sbjct: 295  YNLMLKGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQGLVRHGKFIEAETVLKQMT 354

Query: 779  AMSMVPCIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVR 838
               + P I    +++  LCK+G    A  +  +            +G L+ G+  +GKV 
Sbjct: 355  DKGIGPSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVD 414

Query: 839  ETLPLVQDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLV 898
                L+Q+M+      +A   N L+    K+  + +  ELL  +  K   L   +   +V
Sbjct: 415  AAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIV 474

Query: 899  CLMCMEGRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKL 958
              +C  G   +A+ +    ++  + H       L       GNS +   ++D+ L +   
Sbjct: 475  DGLCGSGELDKAIEI----VKGMRVHGSAALGNL-------GNSYI--GLVDDSLIENNC 534

Query: 959  LPDNMTYDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALD 1018
            LPD +TY  L+ G  K   F+ +      M+ ++ +P + + N  I + C  G++  A  
Sbjct: 535  LPDLITYSTLLNGLCKAGRFAEAKNLFAEMMGEKLQPDSVAYNIFIHHFCKQGKISSAFR 594

Query: 1019 LSRKMESRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFC 1078
            + + ME +G   S    N++   L    ++ E    ++ M EK + P    YN  I+  C
Sbjct: 595  VLKDMEKKGCHKSLETYNSLILGLGIKNQIFEIHGLMDEMKEKGISPNICTYNTAIQYLC 654

Query: 1079 QSGRWLNAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVD-FHTEMLDRHLKPSI 1138
            +  +  +A NL++ M++K   PN  S+ ++I+  C     + A + F T +     K  +
Sbjct: 655  EGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFCKVPDFDMAQEVFETAVSICGQKEGL 714

Query: 1139 RTWDKLVSLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRA 1198
              +  + + L   GQ  +A  +L ++ + G +     Y  +++    +++LE AS  +  
Sbjct: 715  --YSLMFNELLAAGQLLKATELLEAVLDRGFELGTFLYKDLVESLCKKDELEVASGILHK 774

Query: 1199 MQESGYELDFERQWSLISKLNDTNLKDGNNNNSN 1228
            M + GY  D      +I  L     K GN   +N
Sbjct: 775  MIDRGYGFDPAALMPVIDGLG----KMGNKKEAN 788

BLAST of Bhi09G001310 vs. ExPASy Swiss-Prot
Match: Q0WKV3 (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 6.7e-38
Identity = 126/538 (23.42%), Positives = 245/538 (45.54%), Query Frame = 0

Query: 683  LISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE-RLSEAGFAAIGQVLAKELMAL 742
            L+  LCK+ +LK +   L T+ +        ++N F   R     F+A+G+++      L
Sbjct: 106  LVLALCKQMELKGIAHNLYTLSI--------MINCFCRCRKLCLAFSAMGKII-----KL 165

Query: 743  GFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVIPILCKVGRYETA 802
            G+  +   +  LI GLC E  +S A+ L+D ++ M   P +     ++  LC  G+   A
Sbjct: 166  GYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEA 225

Query: 803  IALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGISLDAEIYNNLVQG 862
            + L +         +   +G ++      G+    + L++ M  + I LDA  Y+ ++ G
Sbjct: 226  MLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDG 285

Query: 863  HCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCMEGRSLQALHLKDLMLRNSKSHD 922
             CK  +LD    L   +  K ++ +I +Y  L+   C  GR      L   M++   + +
Sbjct: 286  LCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPN 345

Query: 923  CVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFSKCKNFSSSTLYL 982
             V +++LI    + G      ++  E+++ R + PD +TY  L+ GF K  +   +   +
Sbjct: 346  VVTFSVLIDSFVKEGKLREAEELHKEMIH-RGIAPDTITYTSLIDGFCKENHLDKANQMV 405

Query: 983  FTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKMESRGWIHSSAVQNAITECLIAN 1042
              M+ +   P+ R+ N +I+  C   ++   L+L RKM  RG +  +   N + +     
Sbjct: 406  DLMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCEL 465

Query: 1043 GKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWLNAINLINLMLEKGNIPNATSY 1102
            GKL  A+     MV + + P  V Y  ++   C +G    A+ +   + +     +   Y
Sbjct: 466  GKLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGESEKALEIFEKIEKSKMELDIGIY 525

Query: 1103 DFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLVSLLCREGQTKEAERVLISMTE 1162
            + +I   C   K+++A D    +  + +KP ++T++ ++  LC++G   EAE +   M E
Sbjct: 526  NIIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKMEE 585

Query: 1163 MGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYELDFERQWSLISKLNDTNLK 1220
             G  P    Y  ++  +  + D  K+ + +  ++  G+ +D      +I  L+D  LK
Sbjct: 586  DGHAPDGWTYNILIRAHLGDGDATKSVKLIEELKRCGFSVDASTIKMVIDMLSDGRLK 629

BLAST of Bhi09G001310 vs. ExPASy Swiss-Prot
Match: Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 1.6e-36
Identity = 125/537 (23.28%), Positives = 240/537 (44.69%), Query Frame = 0

Query: 683  LISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE-RLSEAGFAAIGQVLAKELMAL 742
            L+  LCK+ + K +   + T+ +        ++N F   R     F+ +G++     M L
Sbjct: 106  LVLALCKQMESKGIAHSIYTLSI--------MINCFCRCRKLSYAFSTMGKI-----MKL 165

Query: 743  GFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVPCIDVCLLVIPILCKVGRYETA 802
            G+  D   +  L+ GLC E  +S A+ L+D ++ M   P +     ++  LC  G+   A
Sbjct: 166  GYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDA 225

Query: 803  IALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLVQDMLSKGISLDAEIYNNLVQG 862
            + L +         +   +G ++      G+    + L++ M  + I LDA  Y+ ++ G
Sbjct: 226  VVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDG 285

Query: 863  HCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCMEGRSLQALHLKDLMLRNSKSHD 922
             CK  +LD    L   +  K     I +Y  L+   C  GR      L   M++   S +
Sbjct: 286  LCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPN 345

Query: 923  CVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMTYDFLVYGFSKCKNFSSSTLYL 982
             V +++LI    + G      ++L E++ +R + P+ +TY+ L+ GF K      +   +
Sbjct: 346  VVTFSVLIDSFVKEGKLREADQLLKEMM-QRGIAPNTITYNSLIDGFCKENRLEEAIQMV 405

Query: 983  FTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKMESRGWIHSSAVQNAITECLIAN 1042
              MI +   P   + N +I+  C   ++   L+L R+M  RG I ++   N + +    +
Sbjct: 406  DLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQS 465

Query: 1043 GKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWLNAINLINLMLEKGNIPNATSY 1102
            GKL  A+     MV + + P+ V Y  ++   C +G    A+ +   + +     +   Y
Sbjct: 466  GKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKIEKSKMELDIGIY 525

Query: 1103 DFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLVSLLCREGQTKEAERVLISMTE 1162
              +I   C   K+++A D    +  + +K   R ++ ++S LCR+    +A+ +   MTE
Sbjct: 526  MIIIHGMCNASKVDDAWDLFCSLPLKGVKLDARAYNIMISELCRKDSLSKADILFRKMTE 585

Query: 1163 MGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYELDFERQWSLISKLNDTNL 1219
             G  P +  Y  ++  +  ++D   A+E +  M+ SG+  D      +I+ L+   L
Sbjct: 586  EGHAPDELTYNILIRAHLGDDDATTAAELIEEMKSSGFPADVSTVKMVINMLSSGEL 628

BLAST of Bhi09G001310 vs. ExPASy TrEMBL
Match: A0A0A0K9E7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G290590 PE=4 SV=1)

HSP 1 Score: 2170.6 bits (5623), Expect = 0.0e+00
Identity = 1074/1246 (86.20%), Positives = 1154/1246 (92.62%), Query Frame = 0

Query: 1    MIRVLCNYLLQIHQLRWSPSLTLFIPRKFFLYVQSPVVLRCRNKCTTINLSSIDCSGIAQ 60
            MIR+LCNY LQIH+LR SPSLTLFIPRKFFL VQSP VLRCRNKCTTINLSSIDCSG+AQ
Sbjct: 1    MIRILCNYFLQIHRLRCSPSLTLFIPRKFFLSVQSPGVLRCRNKCTTINLSSIDCSGLAQ 60

Query: 61   SLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLKLFLGF 120
            S+ISRCS+ LE EG  SALPNPS  +FLLEISDVVPEY RRIRR+ ELKPEDVLKLF+ F
Sbjct: 61   SVISRCSLFLENEGNGSALPNPSLIDFLLEISDVVPEYARRIRRIPELKPEDVLKLFIEF 120

Query: 121  QSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVEHFLSE 180
            QS VG NGIQ+KKVECLWRI KF NESS NFKH+PRSCEIMASLL+RVGKFKEVEHFLSE
Sbjct: 121  QSEVGKNGIQVKKVECLWRIFKFANESSGNFKHLPRSCEIMASLLVRVGKFKEVEHFLSE 180

Query: 181  MESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDSLVRMK 240
            MESQGILLDNPEVFSCLIQG VCEGNLERAV IYEKVR+RC SPSLSCYH LLDSLV+ K
Sbjct: 181  MESQGILLDNPEVFSCLIQGLVCEGNLERAVLIYEKVRRRCNSPSLSCYHALLDSLVQKK 240

Query: 241  KTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDFRPSDE 300
            KTQVAL VC DMVEMGFGLGDEEKA FDNVI LLCWQG VLEARNLVKKFVALDFRPSDE
Sbjct: 241  KTQVALAVCTDMVEMGFGLGDEEKASFDNVIRLLCWQGNVLEARNLVKKFVALDFRPSDE 300

Query: 301  VLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360
            VLYQITRGYC+KKDFEDLLSFFFEIK+PPNVSSGNKIIYSLCKDFGSESAYL+LRELEHT
Sbjct: 301  VLYQITRGYCDKKDFEDLLSFFFEIKTPPNVSSGNKIIYSLCKDFGSESAYLFLRELEHT 360

Query: 361  GFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEGLWENA 420
            GFKPDEITFGILICWSC EGNLR+AFIYMSELL SGLKPDLHSYNALISGM K+GLWENA
Sbjct: 361  GFKPDEITFGILICWSCHEGNLRQAFIYMSELLSSGLKPDLHSYNALISGMFKKGLWENA 420

Query: 421  QGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDDLLCRI 480
            QG+LAEMVDQGIEPNLSTFRI+LAGYCKARQFEEAKK V+EME CGFI+LSSVDD LC+I
Sbjct: 421  QGILAEMVDQGIEPNLSTFRILLAGYCKARQFEEAKKIVIEMEICGFIKLSSVDDQLCKI 480

Query: 481  FSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESVVPDFN 540
            FSFLGF++S+VRLKRD+NTGVSKTEFFDTLGNGLYLDTD+DEYEKRLT++L+ES++PDFN
Sbjct: 481  FSFLGFSESSVRLKRDNNTGVSKTEFFDTLGNGLYLDTDLDEYEKRLTKVLEESILPDFN 540

Query: 541  LLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPY 600
            L IIE+CKNRD KAV+GL AEMDRWGQELTSVGLM  LKR+C LNS+IKPIIDVWERRPY
Sbjct: 541  LFIIEDCKNRDCKAVLGLVAEMDRWGQELTSVGLMSLLKRNCKLNSKIKPIIDVWERRPY 600

Query: 601  MIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNLND 660
            MIAQLGADTL+LLVQAY K R TSSGIGILNEM QM   I+ ETY  LINSLCK GNLND
Sbjct: 601  MIAQLGADTLSLLVQAYGKSRSTSSGIGILNEMIQMRTEIKNETYKALINSLCKKGNLND 660

Query: 661  LLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE 720
            LL CWDRARKDGWVP LHDCK LISCLCKK KLKEVFSLL+TMLVS+ HSRLDILNIFLE
Sbjct: 661  LLHCWDRARKDGWVPELHDCKSLISCLCKKGKLKEVFSLLETMLVSHTHSRLDILNIFLE 720

Query: 721  RLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVP 780
            RLSE GFA IGQVLA+ELM+LGF +DQKAYELLIIGLCK NNISIA ++LDDIM  SMVP
Sbjct: 721  RLSEVGFATIGQVLAEELMSLGFSVDQKAYELLIIGLCKVNNISIAFSILDDIMGRSMVP 780

Query: 781  CIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLV 840
             IDVCL +IPILCKVGRYETA+ALKE+G +KLSSSS RVFGALMKGFFMMGKVRETLPL+
Sbjct: 781  SIDVCLRLIPILCKVGRYETAVALKEMGASKLSSSSHRVFGALMKGFFMMGKVRETLPLI 840

Query: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCME 900
            QDMLSKGISLDAEIYNNLVQGHCKVKN DKVRELLGIIVRKD SLS+ SYKKLVC MCME
Sbjct: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNFDKVRELLGIIVRKDFSLSMPSYKKLVCFMCME 900

Query: 901  GRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMT 960
            GRSLQALH+KDLMLRNSKSHDCVIYNILIFYI +SGN  LVPKILDELL+ RKL+PD +T
Sbjct: 901  GRSLQALHIKDLMLRNSKSHDCVIYNILIFYILRSGNGSLVPKILDELLHGRKLIPDGVT 960

Query: 961  YDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKME 1020
            YDFLVYGFSKCK+FSSS LYLFTMIQ  FRPSNRSLN VIS+LC+ GQL KAL+LS++ME
Sbjct: 961  YDFLVYGFSKCKDFSSSKLYLFTMIQLGFRPSNRSLNAVISHLCDIGQLEKALELSQEME 1020

Query: 1021 SRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWL 1080
            S+GW+HSSAVQ+AI ECLI+NGKL+EAECFLNRMVE SLIPEHVDYNNII++FCQ+GRWL
Sbjct: 1021 SKGWVHSSAVQDAIAECLISNGKLQEAECFLNRMVEMSLIPEHVDYNNIIRKFCQNGRWL 1080

Query: 1081 NAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLV 1140
             AI+LIN+ML+KGNIPNATSYDFVIQ CC YKKLEEAVDFHTEMLDR LKPSIRTWDKLV
Sbjct: 1081 KAIDLINIMLKKGNIPNATSYDFVIQSCCAYKKLEEAVDFHTEMLDRRLKPSIRTWDKLV 1140

Query: 1141 SLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYE 1200
             LLCREGQTKEAERVL+SMT MGEKPSKDAYCSMLD+YRYEN+LEKASETM+AMQESGYE
Sbjct: 1141 YLLCREGQTKEAERVLMSMTAMGEKPSKDAYCSMLDRYRYENNLEKASETMKAMQESGYE 1200

Query: 1201 LDFERQWSLISKLNDTNLKDGNNNNSNKGFLSGLLSKSGFSRALIP 1247
            LDFE QWSLISKLNDTNLKD NN+NSNKGFL+GLLSKSGFSRALIP
Sbjct: 1201 LDFETQWSLISKLNDTNLKDSNNSNSNKGFLAGLLSKSGFSRALIP 1246

BLAST of Bhi09G001310 vs. ExPASy TrEMBL
Match: A0A5D3CCW5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold39G00010 PE=4 SV=1)

HSP 1 Score: 2151.7 bits (5574), Expect = 0.0e+00
Identity = 1075/1246 (86.28%), Positives = 1146/1246 (91.97%), Query Frame = 0

Query: 1    MIRVLCNYLLQIHQLRWSPSLTLFIPRKFFLYVQSPVVLRCRNKCTTINLSSIDCSGIAQ 60
            MIR+LCNYLLQIH+LR S SLTLFIPRKFFL VQSPV LRCRNK TTINLSSI+CSGIAQ
Sbjct: 1    MIRILCNYLLQIHRLRCSSSLTLFIPRKFFLSVQSPVALRCRNKSTTINLSSINCSGIAQ 60

Query: 61   SLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLKLFLGF 120
            SLISRCSVLLE EG  S LPN S  + LLEISDVVPEY RRIRR+ ELKPEDVLKLF+ F
Sbjct: 61   SLISRCSVLLENEGNGSTLPNASLMDLLLEISDVVPEYARRIRRIPELKPEDVLKLFIEF 120

Query: 121  QSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVEHFLSE 180
            QS VGNNGIQ+KKVECLWRI KF NESS NFKH+PRSCEIMASLL RVGKFKEVEHFLSE
Sbjct: 121  QSEVGNNGIQVKKVECLWRIFKFANESSGNFKHLPRSCEIMASLLSRVGKFKEVEHFLSE 180

Query: 181  MESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDSLVRMK 240
            MESQGILLDNPEVF CLIQG VCEGNLERAV IYEK RQRCISPSLSCYHVLLDSLV+MK
Sbjct: 181  MESQGILLDNPEVFGCLIQGLVCEGNLERAVLIYEKARQRCISPSLSCYHVLLDSLVQMK 240

Query: 241  KTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDFRPSDE 300
            KTQVALGVC DMVEMGFGLGDEEKA FDNVI LLCWQG VLEARNLVKKFVALDFRPSDE
Sbjct: 241  KTQVALGVCTDMVEMGFGLGDEEKASFDNVIRLLCWQGNVLEARNLVKKFVALDFRPSDE 300

Query: 301  VLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360
            VLYQI+RGYC+KKDFEDLLSFFFEIK+PPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT
Sbjct: 301  VLYQISRGYCDKKDFEDLLSFFFEIKTPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360

Query: 361  GFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEGLWENA 420
            GFKPDEITFGILI WSC EGNLRKAFIY+SELL SGLKPDL SYNALISGM KEGLWENA
Sbjct: 361  GFKPDEITFGILIGWSCHEGNLRKAFIYLSELLSSGLKPDLLSYNALISGMFKEGLWENA 420

Query: 421  QGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDDLLCRI 480
            QG+LAEMVDQGIEPNLSTF+I+LAGYCKARQFEEAK  VLEME CGFI+LSSVDD LC+I
Sbjct: 421  QGILAEMVDQGIEPNLSTFKILLAGYCKARQFEEAKSIVLEMETCGFIKLSSVDDQLCKI 480

Query: 481  FSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESVVPDFN 540
            FSFLGF++S+VRLKRD+NTGVSKTEFFDTLGNGLYLDTD+DEYEKRLT++L+ES++PDFN
Sbjct: 481  FSFLGFSESSVRLKRDNNTGVSKTEFFDTLGNGLYLDTDIDEYEKRLTKVLEESILPDFN 540

Query: 541  LLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPY 600
            LLII+ECKNRD KAV+GL AEMDRWGQE TSVGLM  LK +C L S+IKP IDVWER+PY
Sbjct: 541  LLIIDECKNRDCKAVLGLVAEMDRWGQEFTSVGLMSLLKSNCKLISKIKPNIDVWERKPY 600

Query: 601  MIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNLND 660
            MIAQLGADTL+LLVQAYSK R TSSGIGILNEM QM V I+ E Y  LINSLCK GNLND
Sbjct: 601  MIAQLGADTLSLLVQAYSKSRSTSSGIGILNEMIQMRVEIKNEAYKALINSLCKKGNLND 660

Query: 661  LLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE 720
            LL CWDRARKDGWVPGLHDCK LISCLC+K KLKEVFSLL+TMLVS+P SRLDILNIFLE
Sbjct: 661  LLFCWDRARKDGWVPGLHDCKSLISCLCEKGKLKEVFSLLETMLVSHPLSRLDILNIFLE 720

Query: 721  RLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVP 780
            RLSEAGFAAIGQ LA+EL +LGF LDQKAYELLIIGLCK NNIS+A ++LD IM  SMVP
Sbjct: 721  RLSEAGFAAIGQELAEELTSLGFSLDQKAYELLIIGLCKVNNISMAFSVLDYIMGRSMVP 780

Query: 781  CIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLV 840
             IDVCL +IPILCKVGRYETA+ALKE+G +KLSS S RVFGALMKGFFMMGKVRETLPL+
Sbjct: 781  SIDVCLRLIPILCKVGRYETAVALKEMGGSKLSSCSHRVFGALMKGFFMMGKVRETLPLL 840

Query: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCME 900
            QDMLSKGISLDAEIYNNLVQGHCKVKN DKVRELLGIIVRKD+SLS+SSYKKLVC MCME
Sbjct: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNFDKVRELLGIIVRKDVSLSMSSYKKLVCFMCME 900

Query: 901  GRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMT 960
            GRSLQALHLKDLMLRNSKS+DCVIYNILIFYIFQSGN  LVPKILDELL+ RKL+PD +T
Sbjct: 901  GRSLQALHLKDLMLRNSKSYDCVIYNILIFYIFQSGNGSLVPKILDELLHGRKLIPDRVT 960

Query: 961  YDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKME 1020
            YDFL YGFSKCK+FSSSTLYLFTMIQ EFRPSNRSLN VIS LC+ G L KAL+LS++ME
Sbjct: 961  YDFLAYGFSKCKDFSSSTLYLFTMIQLEFRPSNRSLNAVISLLCDIGHLEKALELSQEME 1020

Query: 1021 SRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWL 1080
            SRGW+HSSAVQ+AI ECLI+NGKL EAECFLNRMVE SLIPEHVDYNNII+QFC +GRWL
Sbjct: 1021 SRGWVHSSAVQDAIAECLISNGKLLEAECFLNRMVEMSLIPEHVDYNNIIRQFCHNGRWL 1080

Query: 1081 NAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLV 1140
             AI+LIN+ML+KGNIPNATSYDFVIQ CC YKKLEEAVDFHTEMLDR LKPSIRTWDKLV
Sbjct: 1081 KAIDLINIMLKKGNIPNATSYDFVIQSCCAYKKLEEAVDFHTEMLDRRLKPSIRTWDKLV 1140

Query: 1141 SLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYE 1200
             LLCREGQTKEAERVL+SMT MGEKPSKDAYCSMLD+YRYENDLEKASETMRAMQESGYE
Sbjct: 1141 YLLCREGQTKEAERVLMSMTAMGEKPSKDAYCSMLDRYRYENDLEKASETMRAMQESGYE 1200

Query: 1201 LDFERQWSLISKLNDTNLKDGNNNNSNKGFLSGLLSKSGFSRALIP 1247
            LDFE QWSLI+KLNDTNLKD NNNNSNKGFL+GLLSKSGFSRA IP
Sbjct: 1201 LDFETQWSLINKLNDTNLKDSNNNNSNKGFLAGLLSKSGFSRAWIP 1246

BLAST of Bhi09G001310 vs. ExPASy TrEMBL
Match: A0A1S3CES3 (pentatricopeptide repeat-containing protein At5g15280 OS=Cucumis melo OX=3656 GN=LOC103500046 PE=4 SV=1)

HSP 1 Score: 2149.0 bits (5567), Expect = 0.0e+00
Identity = 1072/1246 (86.04%), Positives = 1147/1246 (92.05%), Query Frame = 0

Query: 1    MIRVLCNYLLQIHQLRWSPSLTLFIPRKFFLYVQSPVVLRCRNKCTTINLSSIDCSGIAQ 60
            MIR+LCNYLLQIH+LR S SLTLFIPRKFFL VQSPV LRCRNK TTINLSSI+CSGIAQ
Sbjct: 1    MIRILCNYLLQIHRLRCSSSLTLFIPRKFFLSVQSPVALRCRNKSTTINLSSINCSGIAQ 60

Query: 61   SLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLKLFLGF 120
            SLISRCSVLLE EG  S LPN S  + LLEISDVVPEY RRIRR+ ELKPEDVLKLF+ F
Sbjct: 61   SLISRCSVLLENEGNGSTLPNASLMDLLLEISDVVPEYARRIRRIPELKPEDVLKLFIEF 120

Query: 121  QSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVEHFLSE 180
            QS VGNNGIQ+KKVECLWRI KF NESS NFKH+PRSCEIMASLL RVGKFKEVEHFLSE
Sbjct: 121  QSEVGNNGIQVKKVECLWRIFKFANESSGNFKHLPRSCEIMASLLSRVGKFKEVEHFLSE 180

Query: 181  MESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDSLVRMK 240
            MESQGILLDNPEVF CLIQG VCEGNLERAV IYEK RQRCISPSLSCYHVLLDSLV+MK
Sbjct: 181  MESQGILLDNPEVFGCLIQGLVCEGNLERAVLIYEKARQRCISPSLSCYHVLLDSLVQMK 240

Query: 241  KTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDFRPSDE 300
            +TQVALGVC DMVEMGFGLGDEEKA FDNVI LLCWQG VLEARNLVKKFVALDFRPSDE
Sbjct: 241  ETQVALGVCTDMVEMGFGLGDEEKASFDNVIRLLCWQGNVLEARNLVKKFVALDFRPSDE 300

Query: 301  VLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360
            VLYQI+RGYC+KKDFEDLLSFFFEIK+PPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT
Sbjct: 301  VLYQISRGYCDKKDFEDLLSFFFEIKTPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360

Query: 361  GFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEGLWENA 420
            GFKPDEITFGILI WSC EGNLRKAFIY+SELL SGLKPDL SYNALISGM KEGLWENA
Sbjct: 361  GFKPDEITFGILIGWSCHEGNLRKAFIYLSELLSSGLKPDLLSYNALISGMFKEGLWENA 420

Query: 421  QGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDDLLCRI 480
            QG+LAEMVDQGIEPNLSTF+I+LAGYCKARQFEEAK  VLEME CGFI+LSSVDD LC+I
Sbjct: 421  QGILAEMVDQGIEPNLSTFKILLAGYCKARQFEEAKSIVLEMETCGFIKLSSVDDQLCKI 480

Query: 481  FSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESVVPDFN 540
            FSFLGF++S+VRLKRD+NTGVSKTEFFDTLGNGLYLDTD+DEYEKRLT++L+ES++PDFN
Sbjct: 481  FSFLGFSESSVRLKRDNNTGVSKTEFFDTLGNGLYLDTDIDEYEKRLTKVLEESILPDFN 540

Query: 541  LLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPY 600
            LLII+ECKNRD KAV+GL AEMDRWGQE TSVGLM  LK +C L S+IKP IDVWER+PY
Sbjct: 541  LLIIDECKNRDCKAVLGLVAEMDRWGQEFTSVGLMSLLKSNCKLISKIKPNIDVWERKPY 600

Query: 601  MIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNLND 660
            MIAQLGADTL+LLVQAYSK R TSSGIGILNEM QM V I+ E Y  LINSLCK GNLND
Sbjct: 601  MIAQLGADTLSLLVQAYSKSRSTSSGIGILNEMIQMRVEIKNEAYKALINSLCKKGNLND 660

Query: 661  LLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE 720
            LL CWDRARKDGWVPGLHDCK LISCLC+K KLKEVFSLL+TMLVS+P SRLDILNIFLE
Sbjct: 661  LLFCWDRARKDGWVPGLHDCKSLISCLCEKGKLKEVFSLLETMLVSHPLSRLDILNIFLE 720

Query: 721  RLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVP 780
            RLSEAGFAAIGQVL++EL +LGF LDQKAYELLIIGLCK NNIS+A ++LDDIM  SMVP
Sbjct: 721  RLSEAGFAAIGQVLSEELTSLGFSLDQKAYELLIIGLCKVNNISMAFSVLDDIMGRSMVP 780

Query: 781  CIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLV 840
             IDVCL +IPILCKVGRYETA+ALKE+G +KLSS S RVFGALMKGFFMMGKVRETLPL+
Sbjct: 781  SIDVCLRLIPILCKVGRYETAVALKEMGGSKLSSCSHRVFGALMKGFFMMGKVRETLPLL 840

Query: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCME 900
            QDMLSKGISLDAEIYNNLVQGHCKVKN DKV ELLGIIVRKD+SLS+SSYKKLVC MCME
Sbjct: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNFDKVWELLGIIVRKDVSLSMSSYKKLVCFMCME 900

Query: 901  GRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMT 960
            GRSLQALHLKDLMLRNSKS+DCVIYNILIFYIF+SGN  LVPKILDELL+ RKL+PD +T
Sbjct: 901  GRSLQALHLKDLMLRNSKSYDCVIYNILIFYIFRSGNGSLVPKILDELLHGRKLIPDRVT 960

Query: 961  YDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKME 1020
            YDFLVYGFSKCK+FSSSTLYLFTMIQ EFRPSNRSLN VIS LC+ G L KAL+LS++ME
Sbjct: 961  YDFLVYGFSKCKDFSSSTLYLFTMIQLEFRPSNRSLNAVISLLCDIGHLEKALELSQEME 1020

Query: 1021 SRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWL 1080
            SRGW+HSS VQ+AI ECLI+NGKL EAECFLNRMVE SLIPEHVDYNNII+QFC +GRWL
Sbjct: 1021 SRGWVHSSVVQDAIAECLISNGKLLEAECFLNRMVEMSLIPEHVDYNNIIRQFCHNGRWL 1080

Query: 1081 NAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLV 1140
             AI+LIN+ML+KGNIPNATSYDFVIQ CC YKKLEEAVDFHTEMLDR LKPSIRTWDKLV
Sbjct: 1081 KAIDLINIMLKKGNIPNATSYDFVIQSCCAYKKLEEAVDFHTEMLDRRLKPSIRTWDKLV 1140

Query: 1141 SLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYE 1200
             LLCREGQTKE+ERVL+SMT MGEKPSKDAYCSMLD+YRYENDLEKASETMRAMQESGYE
Sbjct: 1141 YLLCREGQTKESERVLMSMTAMGEKPSKDAYCSMLDRYRYENDLEKASETMRAMQESGYE 1200

Query: 1201 LDFERQWSLISKLNDTNLKDGNNNNSNKGFLSGLLSKSGFSRALIP 1247
            LDFE QWSLI+KLNDTNLKD NNNNSNKGFL+GLLSKSGFSRA IP
Sbjct: 1201 LDFETQWSLINKLNDTNLKDSNNNNSNKGFLAGLLSKSGFSRAWIP 1246

BLAST of Bhi09G001310 vs. ExPASy TrEMBL
Match: A0A6J1F202 (pentatricopeptide repeat-containing protein At5g15280, mitochondrial isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439030 PE=4 SV=1)

HSP 1 Score: 2060.0 bits (5336), Expect = 0.0e+00
Identity = 1035/1247 (83.00%), Positives = 1132/1247 (90.78%), Query Frame = 0

Query: 1    MIRVLCNYLLQIHQLRWSPSLTLFIPRKFFLYVQSPVVLRCRNKCTTINLSSIDCSGIAQ 60
            MIRVLCNYL QIHQLR S  L LFIPR F+L+VQSPV LRCRNKCTTIN SSI+C GIAQ
Sbjct: 1    MIRVLCNYLPQIHQLRSSIPLILFIPRNFYLFVQSPVTLRCRNKCTTIN-SSINCCGIAQ 60

Query: 61   SLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLKLFLGF 120
            +LISRCSVLLEKE   S LPN   K+FLLEISDVVPE+VRRIRRV+ELKPEDVLKLFLGF
Sbjct: 61   TLISRCSVLLEKEENGSVLPNSCLKDFLLEISDVVPEHVRRIRRVSELKPEDVLKLFLGF 120

Query: 121  QSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVEHFLSE 180
            QS VG+NGIQ+KKVECLWRILKF NES+ +FK +PR  E+MASLL++VGK+KEVE FLSE
Sbjct: 121  QSEVGDNGIQVKKVECLWRILKFVNESNGSFKQLPRLYEVMASLLVQVGKYKEVEQFLSE 180

Query: 181  MESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDSLVRMK 240
            ME QGILLDNPEVFSC+IQGFVCEGNLE+A+ IYEK RQRCISPSLSCY VLLDSLVR+K
Sbjct: 181  MEIQGILLDNPEVFSCIIQGFVCEGNLEKAILIYEKARQRCISPSLSCYRVLLDSLVRIK 240

Query: 241  KTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDFRPSDE 300
            KTQVALGVC DMVEMGF LGD+EKA F+NV+GLLCWQGKVLEARNLVKKFVA DFRPSDE
Sbjct: 241  KTQVALGVCTDMVEMGFDLGDDEKAAFENVVGLLCWQGKVLEARNLVKKFVASDFRPSDE 300

Query: 301  VLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360
            VLY+ITRGYCEKKDFEDLLSFFFEIKSPPNV SGNKII+SLCK+FGSESA LYLRELE T
Sbjct: 301  VLYRITRGYCEKKDFEDLLSFFFEIKSPPNVFSGNKIIHSLCKNFGSESACLYLRELECT 360

Query: 361  GFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEGLWENA 420
            GFKPDEITFGILI WSCREGNLR AFIYMSELLFSGLKPDLHSYNALIS MLKEGLWEN 
Sbjct: 361  GFKPDEITFGILISWSCREGNLRSAFIYMSELLFSGLKPDLHSYNALISAMLKEGLWENG 420

Query: 421  QGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDDLLCRI 480
            QG+LAEMV++G EPNLSTFRI+LAGYCKARQFEEAKK VLEMERCGFIQLS VDDLLC+I
Sbjct: 421  QGILAEMVERGTEPNLSTFRILLAGYCKARQFEEAKKIVLEMERCGFIQLSPVDDLLCKI 480

Query: 481  FSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESVVPDFN 540
            FSFLGFNDSA+RLKRD+N GVSKTEFFDTLGNGLYLDTDVDEYEK LTE+L++S++PDFN
Sbjct: 481  FSFLGFNDSAIRLKRDNNVGVSKTEFFDTLGNGLYLDTDVDEYEKTLTEVLEKSILPDFN 540

Query: 541  LLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPY 600
            L I++ECKNRD KAV+ L AEMDRWGQELTSVGLMG LK HC  NSRIKPIIDVW+RRP 
Sbjct: 541  LFIVKECKNRDLKAVLRLTAEMDRWGQELTSVGLMGLLKSHCKSNSRIKPIIDVWKRRPD 600

Query: 601  MIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNLND 660
            MIAQL ADTLNLLVQAYSK RLTS GIG LNEM +M V IEKETYS LINSLCK GNL+D
Sbjct: 601  MIAQLEADTLNLLVQAYSKNRLTSCGIGTLNEMIRMDVRIEKETYSALINSLCKIGNLSD 660

Query: 661  LLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE 720
            L+GCWDRARKDGWVPGL D K LISCLCKK +LK+V  LL+TMLVSYPHSRLDILNIFLE
Sbjct: 661  LVGCWDRARKDGWVPGLLDFKSLISCLCKKGELKQVVVLLETMLVSYPHSRLDILNIFLE 720

Query: 721  RLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVP 780
            RLSEAGF AIG+VLAKEL +LGF LDQKAYELLIIGLCKEN +SIAIN+LDD+MAMSMVP
Sbjct: 721  RLSEAGFPAIGRVLAKELTSLGFSLDQKAYELLIIGLCKENTVSIAINMLDDMMAMSMVP 780

Query: 781  CIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLV 840
            CIDVCLL+IP LCK+GRYETAIALKEIGTTKLSSSS RV+GALMKGFF  GKVRE L L+
Sbjct: 781  CIDVCLLLIPTLCKIGRYETAIALKEIGTTKLSSSSRRVYGALMKGFFTTGKVREALALL 840

Query: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCME 900
            +DMLSKG+SLDAEIYN L+QGHCK KN +KVRELLG+++RKDLSLSISSY KLV LMC E
Sbjct: 841  EDMLSKGLSLDAEIYNLLIQGHCKAKNFEKVRELLGVMLRKDLSLSISSYGKLVRLMCRE 900

Query: 901  GRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMT 960
            GRSLQALHLKD+MLRNSKSHD VIYNILIFYIF+SGN  LV KILDE      LLPDN+T
Sbjct: 901  GRSLQALHLKDIMLRNSKSHDSVIYNILIFYIFRSGNCFLVGKILDE------LLPDNVT 960

Query: 961  YDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKME 1020
            Y+FLVY FS+CK+FSSST YLFTMI++EFRPSNRSLN VIS+LC+TGQL KAL++SR+ME
Sbjct: 961  YNFLVYRFSQCKDFSSSTYYLFTMIRREFRPSNRSLNAVISHLCDTGQLEKALEVSREME 1020

Query: 1021 SRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWL 1080
             RGWIH+SAVQNAI EC I+ GKL+EAECFLNRMVEKSLIP+HVDYNNIIKQFCQSGRWL
Sbjct: 1021 FRGWIHNSAVQNAIVECFISYGKLQEAECFLNRMVEKSLIPKHVDYNNIIKQFCQSGRWL 1080

Query: 1081 NAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLV 1140
             A++LIN+ML++GNIPNA+SYDFVIQCCC YKKLEEA+D HTEMLDR LKPSI T DKLV
Sbjct: 1081 KAMDLINIMLKQGNIPNASSYDFVIQCCCNYKKLEEALDLHTEMLDRCLKPSITTCDKLV 1140

Query: 1141 SLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYE 1200
            S LCREGQ KEAERVL+S+ EMGE PSKDAYCSML++YRYENDLEKASETMRAMQ+SGYE
Sbjct: 1141 SSLCREGQMKEAERVLMSILEMGEIPSKDAYCSMLNRYRYENDLEKASETMRAMQQSGYE 1200

Query: 1201 LDFERQWSLISKLNDTNLK-DGNNNNSNKGFLSGLLSKSGFSRALIP 1247
            LDFE QWSLISKL+DT+L+ + NNNNSNKGFLSGLLSKSGFSRA IP
Sbjct: 1201 LDFETQWSLISKLSDTSLENNNNNNNSNKGFLSGLLSKSGFSRASIP 1240

BLAST of Bhi09G001310 vs. ExPASy TrEMBL
Match: A0A6J1EX14 (pentatricopeptide repeat-containing protein At5g15280, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439030 PE=4 SV=1)

HSP 1 Score: 2060.0 bits (5336), Expect = 0.0e+00
Identity = 1035/1247 (83.00%), Positives = 1132/1247 (90.78%), Query Frame = 0

Query: 1    MIRVLCNYLLQIHQLRWSPSLTLFIPRKFFLYVQSPVVLRCRNKCTTINLSSIDCSGIAQ 60
            MIRVLCNYL QIHQLR S  L LFIPR F+L+VQSPV LRCRNKCTTIN SSI+C GIAQ
Sbjct: 11   MIRVLCNYLPQIHQLRSSIPLILFIPRNFYLFVQSPVTLRCRNKCTTIN-SSINCCGIAQ 70

Query: 61   SLISRCSVLLEKEGYASALPNPSFKEFLLEISDVVPEYVRRIRRVAELKPEDVLKLFLGF 120
            +LISRCSVLLEKE   S LPN   K+FLLEISDVVPE+VRRIRRV+ELKPEDVLKLFLGF
Sbjct: 71   TLISRCSVLLEKEENGSVLPNSCLKDFLLEISDVVPEHVRRIRRVSELKPEDVLKLFLGF 130

Query: 121  QSAVGNNGIQIKKVECLWRILKFTNESSRNFKHIPRSCEIMASLLIRVGKFKEVEHFLSE 180
            QS VG+NGIQ+KKVECLWRILKF NES+ +FK +PR  E+MASLL++VGK+KEVE FLSE
Sbjct: 131  QSEVGDNGIQVKKVECLWRILKFVNESNGSFKQLPRLYEVMASLLVQVGKYKEVEQFLSE 190

Query: 181  MESQGILLDNPEVFSCLIQGFVCEGNLERAVSIYEKVRQRCISPSLSCYHVLLDSLVRMK 240
            ME QGILLDNPEVFSC+IQGFVCEGNLE+A+ IYEK RQRCISPSLSCY VLLDSLVR+K
Sbjct: 191  MEIQGILLDNPEVFSCIIQGFVCEGNLEKAILIYEKARQRCISPSLSCYRVLLDSLVRIK 250

Query: 241  KTQVALGVCMDMVEMGFGLGDEEKAVFDNVIGLLCWQGKVLEARNLVKKFVALDFRPSDE 300
            KTQVALGVC DMVEMGF LGD+EKA F+NV+GLLCWQGKVLEARNLVKKFVA DFRPSDE
Sbjct: 251  KTQVALGVCTDMVEMGFDLGDDEKAAFENVVGLLCWQGKVLEARNLVKKFVASDFRPSDE 310

Query: 301  VLYQITRGYCEKKDFEDLLSFFFEIKSPPNVSSGNKIIYSLCKDFGSESAYLYLRELEHT 360
            VLY+ITRGYCEKKDFEDLLSFFFEIKSPPNV SGNKII+SLCK+FGSESA LYLRELE T
Sbjct: 311  VLYRITRGYCEKKDFEDLLSFFFEIKSPPNVFSGNKIIHSLCKNFGSESACLYLRELECT 370

Query: 361  GFKPDEITFGILICWSCREGNLRKAFIYMSELLFSGLKPDLHSYNALISGMLKEGLWENA 420
            GFKPDEITFGILI WSCREGNLR AFIYMSELLFSGLKPDLHSYNALIS MLKEGLWEN 
Sbjct: 371  GFKPDEITFGILISWSCREGNLRSAFIYMSELLFSGLKPDLHSYNALISAMLKEGLWENG 430

Query: 421  QGVLAEMVDQGIEPNLSTFRIILAGYCKARQFEEAKKTVLEMERCGFIQLSSVDDLLCRI 480
            QG+LAEMV++G EPNLSTFRI+LAGYCKARQFEEAKK VLEMERCGFIQLS VDDLLC+I
Sbjct: 431  QGILAEMVERGTEPNLSTFRILLAGYCKARQFEEAKKIVLEMERCGFIQLSPVDDLLCKI 490

Query: 481  FSFLGFNDSAVRLKRDSNTGVSKTEFFDTLGNGLYLDTDVDEYEKRLTEILKESVVPDFN 540
            FSFLGFNDSA+RLKRD+N GVSKTEFFDTLGNGLYLDTDVDEYEK LTE+L++S++PDFN
Sbjct: 491  FSFLGFNDSAIRLKRDNNVGVSKTEFFDTLGNGLYLDTDVDEYEKTLTEVLEKSILPDFN 550

Query: 541  LLIIEECKNRDPKAVVGLAAEMDRWGQELTSVGLMGFLKRHCTLNSRIKPIIDVWERRPY 600
            L I++ECKNRD KAV+ L AEMDRWGQELTSVGLMG LK HC  NSRIKPIIDVW+RRP 
Sbjct: 551  LFIVKECKNRDLKAVLRLTAEMDRWGQELTSVGLMGLLKSHCKSNSRIKPIIDVWKRRPD 610

Query: 601  MIAQLGADTLNLLVQAYSKCRLTSSGIGILNEMSQMHVVIEKETYSVLINSLCKTGNLND 660
            MIAQL ADTLNLLVQAYSK RLTS GIG LNEM +M V IEKETYS LINSLCK GNL+D
Sbjct: 611  MIAQLEADTLNLLVQAYSKNRLTSCGIGTLNEMIRMDVRIEKETYSALINSLCKIGNLSD 670

Query: 661  LLGCWDRARKDGWVPGLHDCKLLISCLCKKRKLKEVFSLLQTMLVSYPHSRLDILNIFLE 720
            L+GCWDRARKDGWVPGL D K LISCLCKK +LK+V  LL+TMLVSYPHSRLDILNIFLE
Sbjct: 671  LVGCWDRARKDGWVPGLLDFKSLISCLCKKGELKQVVVLLETMLVSYPHSRLDILNIFLE 730

Query: 721  RLSEAGFAAIGQVLAKELMALGFYLDQKAYELLIIGLCKENNISIAINLLDDIMAMSMVP 780
            RLSEAGF AIG+VLAKEL +LGF LDQKAYELLIIGLCKEN +SIAIN+LDD+MAMSMVP
Sbjct: 731  RLSEAGFPAIGRVLAKELTSLGFSLDQKAYELLIIGLCKENTVSIAINMLDDMMAMSMVP 790

Query: 781  CIDVCLLVIPILCKVGRYETAIALKEIGTTKLSSSSIRVFGALMKGFFMMGKVRETLPLV 840
            CIDVCLL+IP LCK+GRYETAIALKEIGTTKLSSSS RV+GALMKGFF  GKVRE L L+
Sbjct: 791  CIDVCLLLIPTLCKIGRYETAIALKEIGTTKLSSSSRRVYGALMKGFFTTGKVREALALL 850

Query: 841  QDMLSKGISLDAEIYNNLVQGHCKVKNLDKVRELLGIIVRKDLSLSISSYKKLVCLMCME 900
            +DMLSKG+SLDAEIYN L+QGHCK KN +KVRELLG+++RKDLSLSISSY KLV LMC E
Sbjct: 851  EDMLSKGLSLDAEIYNLLIQGHCKAKNFEKVRELLGVMLRKDLSLSISSYGKLVRLMCRE 910

Query: 901  GRSLQALHLKDLMLRNSKSHDCVIYNILIFYIFQSGNSLLVPKILDELLYKRKLLPDNMT 960
            GRSLQALHLKD+MLRNSKSHD VIYNILIFYIF+SGN  LV KILDE      LLPDN+T
Sbjct: 911  GRSLQALHLKDIMLRNSKSHDSVIYNILIFYIFRSGNCFLVGKILDE------LLPDNVT 970

Query: 961  YDFLVYGFSKCKNFSSSTLYLFTMIQQEFRPSNRSLNTVISYLCNTGQLGKALDLSRKME 1020
            Y+FLVY FS+CK+FSSST YLFTMI++EFRPSNRSLN VIS+LC+TGQL KAL++SR+ME
Sbjct: 971  YNFLVYRFSQCKDFSSSTYYLFTMIRREFRPSNRSLNAVISHLCDTGQLEKALEVSREME 1030

Query: 1021 SRGWIHSSAVQNAITECLIANGKLREAECFLNRMVEKSLIPEHVDYNNIIKQFCQSGRWL 1080
             RGWIH+SAVQNAI EC I+ GKL+EAECFLNRMVEKSLIP+HVDYNNIIKQFCQSGRWL
Sbjct: 1031 FRGWIHNSAVQNAIVECFISYGKLQEAECFLNRMVEKSLIPKHVDYNNIIKQFCQSGRWL 1090

Query: 1081 NAINLINLMLEKGNIPNATSYDFVIQCCCTYKKLEEAVDFHTEMLDRHLKPSIRTWDKLV 1140
             A++LIN+ML++GNIPNA+SYDFVIQCCC YKKLEEA+D HTEMLDR LKPSI T DKLV
Sbjct: 1091 KAMDLINIMLKQGNIPNASSYDFVIQCCCNYKKLEEALDLHTEMLDRCLKPSITTCDKLV 1150

Query: 1141 SLLCREGQTKEAERVLISMTEMGEKPSKDAYCSMLDKYRYENDLEKASETMRAMQESGYE 1200
            S LCREGQ KEAERVL+S+ EMGE PSKDAYCSML++YRYENDLEKASETMRAMQ+SGYE
Sbjct: 1151 SSLCREGQMKEAERVLMSILEMGEIPSKDAYCSMLNRYRYENDLEKASETMRAMQQSGYE 1210

Query: 1201 LDFERQWSLISKLNDTNLK-DGNNNNSNKGFLSGLLSKSGFSRALIP 1247
            LDFE QWSLISKL+DT+L+ + NNNNSNKGFLSGLLSKSGFSRA IP
Sbjct: 1211 LDFETQWSLISKLSDTSLENNNNNNNSNKGFLSGLLSKSGFSRASIP 1250
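
The Identity and Positives percentages reported with each alignment are the match counts divided by the alignment length. The sketch below reproduces that arithmetic and also counts matches from a midline string, using the standard BLAST convention that a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The helper names `percent` and `count_midline` are hypothetical and not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length (2 decimals)."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


def count_midline(midline: str) -> tuple[int, int]:
    """Count (identities, positives) in one BLAST midline string.

    Letters are identical residues, '+' is a conservative substitution,
    and spaces are mismatches or gaps. Positives include identities.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative


# Counts taken from the alignment header: 1035 identities and
# 1132 positives over an alignment length of 1247.
identity_pct = percent(1035, 1247)
positive_pct = percent(1132, 1247)
print(f"Identity {identity_pct:.2f}%, Positives {positive_pct:.2f}%")
```

Summing `count_midline` over every midline row of an alignment should give the same totals as the header, provided the leading indentation of each row is stripped first.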

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
AT5G15280.1 | 6.2e-273 | 43.01 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G12620.1 | 9.5e-40 | 23.64 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G17140.1 | 3.6e-39 | 22.05 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G12300.1 | 4.7e-39 | 23.42 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1 | 4.0e-38 | 20.83 | Pentatricopeptide repeat (PPR) superfamily protein

Match Name | E-value | Identity (%) | Description
Q9LXF4 | 8.7e-272 | 43.01 | Pentatricopeptide repeat-containing protein At5g15280, mitochondrial OS=Arabidop...
Q9ASZ8 | 1.3e-38 | 23.64 | Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana OX...
Q0WPZ6 | 5.1e-38 | 22.05 | Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX...
Q0WKV3 | 6.7e-38 | 23.42 | Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop...
Q9LPX2 | 1.6e-36 | 23.28 | Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop...

Match Name | E-value | Identity (%) | Description
A0A0A0K9E7 | 0.0e+00 | 86.20 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G290590 PE=4 SV=1
A0A5D3CCW5 | 0.0e+00 | 86.28 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CES3 | 0.0e+00 | 86.04 | pentatricopeptide repeat-containing protein At5g15280 OS=Cucumis melo OX=3656 GN...
A0A6J1F202 | 0.0e+00 | 83.00 | pentatricopeptide repeat-containing protein At5g15280, mitochondrial isoform X2 ...
A0A6J1EX14 | 0.0e+00 | 83.00 | pentatricopeptide repeat-containing protein At5g15280, mitochondrial isoform X1 ...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term | IPR Description | Source | Source Term | Source Description | Coordinates | E-value | Score
None | No IPR available | COILS | Coil | Coil | 1177..1197 | - | -
None | No IPR available | PANTHER | PTHR13547:SF14 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | 56..1242 | - | -
None | No IPR available | PANTHER | PTHR13547 | UNCHARACTERIZED | 56..1242 | - | -
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | 1064..1110 | 5.0E-8 | 33.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | 399..448 | 1.2E-13 | 51.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 995..1024 | 1.7E-4 | 19.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 438..467 | 1.8E-5 | 22.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 402..435 | 4.6E-6 | 24.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 1135..1167 | 3.3E-4 | 18.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 193..225 | 5.8E-4 | 17.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 1100..1132 | 7.8E-4 | 17.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 644..675 | 8.5E-6 | 23.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | 1066..1098 | 5.0E-4 | 18.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | 854..881 | 0.079 | 13.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | 995..1023 | 0.16 | 12.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | 609..636 | 0.66 | 10.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | 644..672 | 0.019 | 15.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | 682..704 | 0.058 | 13.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | 1123..1177 | 0.0071 | 16.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | 178..235 | 1.1E-4 | 22.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 746..780 | - | 8.659485
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 435..469 | - | 10.413293
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 851..885 | - | 8.95544
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 1062..1096 | - | 9.843305
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 816..850 | - | 8.560833
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 1097..1131 | - | 10.522905
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 400..434 | - | 12.375365
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 992..1026 | - | 8.79102
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 1132..1166 | - | 10.884628
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 190..224 | - | 9.689847
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 641..675 | - | 10.13926
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | 365..399 | - | 9.152743
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 130..309 | 1.9E-16 | 62.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 310..503 | 7.0E-32 | 113.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 880..1058 | 4.0E-20 | 74.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 1080..1229 | 5.0E-24 | 87.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 604..706 | 4.1E-11 | 44.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 811..878 | 2.5E-8 | 35.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | 711..808 | 4.0E-5 | 25.2
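
The InterPro alignment coordinates above are inclusive residue ranges written as start..end. As a minimal sketch, assuming those ranges have been copied into Python strings, the code below parses them and computes how many distinct residues a set of hits covers, merging overlaps so that no residue is counted twice. The names `parse_coord` and `covered_residues` are hypothetical helpers, not part of InterProScan.

```python
import re

# Matches an inclusive range such as "746..780", with or without a "coord:" prefix.
COORD_RE = re.compile(r"(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Extract an inclusive (start, end) residue range from a coordinate string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start > end:
        raise ValueError(f"start exceeds end in {text!r}")
    return start, end


def covered_residues(ranges: list[tuple[int, int]]) -> int:
    """Count residues covered by at least one inclusive range."""
    total = 0
    current_start = current_end = None
    for start, end in sorted(ranges):
        if current_end is None or start > current_end:
            # Close the previous merged block before opening a new one.
            if current_end is not None:
                total += current_end - current_start + 1
            current_start, current_end = start, end
        else:
            # Overlapping hit: extend the current merged block.
            current_end = max(current_end, end)
    if current_end is not None:
        total += current_end - current_start + 1
    return total


# Three of the PROSITE PS51375 hits listed for this protein.
hits = [parse_coord(s) for s in ("coord: 365..399", "coord: 400..434", "coord: 435..469")]
print(covered_residues(hits))
```

Because the ranges are inclusive, a hit such as 746..780 spans 780 - 746 + 1 = 35 residues.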

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Bhi09M001310 | Bhi09M001310 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding