Bhi02G000898 (gene) Wax gourd (B227) v1

Overview
Name: Bhi02G000898
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr2: 24301260 .. 24306473 (+)
RNA-Seq Expression: Bhi02G000898
Synteny: Bhi02G000898
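
The Location field can be turned into structured coordinates with a few lines of code. The following is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a string such as 'chr2: 24301260 .. 24306473 (+)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seqid, start, end, strand = parse_location("chr2: 24301260 .. 24306473 (+)")
gene_length = span_length(start, end)
print(seqid, start, end, strand, gene_length)
```

Under the inclusive-coordinate assumption, the span covers 5214 bases on the forward strand of chr2.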
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TACATATTAAGTTTCATCCGAGTATTGCTCTATTTTAAAATTTTAGACACCAAATGCTAAAGCTACGGAAGGCGGAGGCGGTTAGGCTGTTACCCAATCCGTTGATATTTTGCTTTGGAAAAAGAACAGATACAGAAGCTCTTAGTCTCTTCTGTTGCTTCAGCTTCCTTTTCTTCTCCTCCATGTATGGCTGCTCTTCCTTTTCCCGCTCTCACTAACCCGCTCGCTTCTCTCTACATTCCCAGAAAACCCCACAGCTCCCCTACCCATTTTGCGAACTTGAACCAAACTGCTGGAAACGTTCAAATCTCTTATAAATCTTATCTCAACCAGATATCTTCTCTCTGCAAAGAAGCCCACCTTCGAGAAGCCGTGAATTTGGTTGCTGATATGGAATTGGAAAATATCACAATCGGACCTGATGTATACGGAGAACTCCTTCAGGGTTGCGTTTACGAGAGAGCTCTTTCGTTGGGTCAGCAAATCCACGGTCGAATTCTCAAGAATGGTGAGTATATTGCAAAGAATGAGTACATCGAGACCAAATTGGTGATCTTCTATTCAAAATGTGACGAGTCAGAAATTGCCAACCGTTTGTTTGGCAAGCTGCTGGTACAGAACGAGTTTGCTTGGGCTGCTATTATGGGATTGAAAAGTAGAATTGGGTTTAATGAAGAAGCTTTGATGGGTTTTTGTGAGATGCACGAAAATGGGCTACTTTTGGATAATTTTGTTATTCCAATTGCTTTGAAGGCTTCTGGTGCTCTGCAATGGATTGGGTTTGGGAAATCCGTACAAGGCTATGTAGTCAAGATGGGTTTAGGCGGGTGTATCTATGTTGCTAGTAGTCTTCTGGATATGTATGGTAAATGTGGGTTATGTGGAGATGCAAAGAAGGTGTTTGATAAAATTCCTGAGAAGAATATAGTAGCTTGGAATTCGATGATTGTAAATTTTACTCAGAATGGACTGAATGCTGAAGCAATTGAGACGTTTTATGAGATGAGGGTGGAAGGTGTTGTACCCACTCAAGTGACTCTATCAAGTTTTCTTTCAGCTTCAGCTAATTTGAGTGTGATCGATGAGGGTAAGCAAGGGCACGCCTTAGCAGTGTTATCTGGACTGGAACTGACCAACATATTGGGAAGTTCGCTCATAAATTTTTATTCCAAGGTTGGTTTGGTTGAGGATGCTGAGCAGGTTTTCAGTGAAATGTTGGAGAAAGATATGGTGACATGGAATTTGCTGGTCTCTGGTTATGTGCATAATGGGCTGGTTGATCGGGCACTTGACTTATGTCACGTAATGCAGTCAGAAAATTTGAGGTTTGATTCTGTGACTCTTGCTTCAATAATGGCTGCGGCTGCTGACTCTAAAAATTTGAAACTAGGGAAGGAAGGGCATTCTTTTTGTGTTAGAAACAACCTTGAATCTGATATTGCTGTTGCTAGTAGTATAGTAGATATGTATGCCAAATGTGAAAAATTGGAATGTGCAAGACAAGTTTTTGACACAACGGTAAAGAGAGACCTTATAATGTGGAACACACTGTTGGCTGCCTATGCAGAGCAGGGTCAGAGTGGTGAAACATTAAAATTGTTTTATCAGATGCAGTTAGAAGGTCTGCCACCAAATGTGATATCCTGGAACTCTGTGATTTTGGGTCTTTTGAATAAAGGTGAAGTTGATGAGGCTAAAGACATGTTCTTGGAGATGCAGTCTCTTGGTGTCTGTCCTAATTTAATTACTTGGACTACTCTCATATGTGGACTCGCTCAGAATGGTCTTGGTGATGAAGCATTCCTGACATTTCAGTCAATGGAAGAAGCTGGCATTAAACCCAACAGTTTGAGTATTAGCTCGCTACTTTCAGCTTGCACAACTATGGCATCTCTGCCTCATGGAAGAGCAATTCATTGTTACATCATAAGACATGACCTTTTGGTATCAACACCGGTCTTATGCTCCTTAGTGAATATGTATGCTAAATGTGGTAGTAT
AAATCAAGCAAAGACGGTGTTTGATATGATAATGAAAAAGGAATTGCCCATCTATAATGCAATGATCTCTGGCTATGCATTACACGGTCAAGCAGTGGAAGCTCTTTCACTCTTTAGACGTCTAAAAGAGGATTGTATAAAACCAGATGAAATAACCTTTACCAGTATCCTTTCAGCATGCAGTCATGCTGGACTTGTAACAGAAGGGTTAGAGCTTTTCATCGATATGGTTTCTAATCATAAAATAGTAGCACAAGCAGAGCATTATGGTTGTCTCATTAGTATTCTTTCTAGGTGTCATAACTTAGACGAAGCTTTAAGACTTATTTTAGGTATGCCTTTTGAGCCTGATGCATCTATATTTGGATCTCTACTTGCTGCGTGCAGAGAGCATCCTGACTTAGAACTCAAAGAACGTTTATTTGAACACTTGTTGAAATTGGAGCCAGATAATTCAGGAAACTATGTGGCATTATCAAATGCATATGCTGCTACTGGAATGTGGGATGAAGCATCAAAAGTGAGGGTTCTGATGAAAGAAAGGGGTCTTAGGAAGACTCCTGGGCATAGCTTGATTCAGATTGGAAATGAAACACATGTATTTTTCGCTGGAGATAAATCACACTCCAGAACAAAAGAAATTTATATGATGTTGGCACTTCTTAGAGTGGAAATGCAATCCACAAGATGTATACCTGTGATCAGTTAAGCTGCTATCTTTTATTTCCATTCTTGAGAAGAGGAAGTCATGCCTGTACTGCATGATGCAGATTCTTTTTGTCTAGGAAAAATCTAGAAATTAGTGACGGTTTAAACGATTGAAGGCCTCACGGACTTGGATAGACAGCTGCAAAATATCATTTACCCTTGCTTGCAGCAACTCATTGACGACCTGTCACATGGGCTGGAAAAATATGATCTGAACATTGAATTCTCAAGAAATATTGAGTAGTGGTTTTTGAACTGTTATTCCTTTTGGACGGCTAGAAATCCTCTACCTAGTTTAGGAGGCAGCGCTCCTCCTGTTATGAGTGATTTGAAACCTTTTGGGTTGTTTTCACCTGTGATAGAAATCTAGGTACAAGTAGATAAACCAATTCCAATCTGAGTCCGACATCAAATAAACTTGAGTTGATTAGCAACTATGGAGTTCAAAACATCATGATGGAATGCTCGAGAAACCCATTTGTCATTGTCTATATTGACAAGCAATGCAGAGAAGCAGTGGTGTGTAGTTGTGATGGATTGTTTAAACGAGCTGTACGAGTTGCTTTCATGTCAAACTCAGGTTTTTGGTCTCTTTCGTAAAGCCTGTAGATGATTTGATTCTTTTACTGTATATCATTCAGATCCTCTTACCAATTGAATAATCCACTAGAAACACAACTTTGCTTGGTCTCTACTATGAACATACCAAAGCATATGAGATGGAATGGCCATAACATACTTTCTGCGCTAGCTTCTATTTGGTGCGAGATTATTCTCGTTAAGATTTGCACTTTGAAATTTTCAATTAAATAGTTAACTCTAGCCTAAAATGCTTGTACTTTTTTCTGTTTAAATGATGGGGATTCCTGACTTTCATACACCATGCTCAATATTTGATGCTTTTCTCTTCCTATAAAGCAGCCAAGATGATGCACCCCACCGGACTTAATTGTATGGTGTTTGAGAAGTGAGACTTACAGGACAAATTTTCAGTTTCAGGATTGTCATTATACAATACACTGCAGATTAACAGGAACTGGTACTAGGAAATGATCTCAGTACAAAACATCAATAAATATTCTTACTTCTGCCAGGTTAGTACCTGTGAACATCTGTTATTTGTTTCTCATTTTCTTGTTACATGTTCAACCATTCCTAGTCCATTTATATACTCAAGTTTTGGTATCATTCAATGCAGAGCCCCTGTACAATCTCAAATGACATTTTCTTTTTCCTTCTCTCTCCCTCCTCGTTTTCTGTATATGAAGATTTAGAAATCTTTGTCATGAGCA
TAAACACAGAGATGCTCTTGAGCTTTTTGTGATTTCAGTGTTTTTTTACATTAAGTTTTTACATGAAACTGAAAATGTGGTCCTTGAACATGTGTTTGTTTACCTGTTTTGGATTGATATCATTAATACATGCACTCTTCCAAGTCTTGTAGTGAAGCATCCTATGCTGTAATGAGTAAAGACCAAATAAAACTTGCCCTTTTCTTTTCTTTTCTTTTCTTTTTTGTTATTATTATTATTATTTTTAATTTTATTTTTTTGGGTGATTCGTACTATCCAAGGAAAGTGAAATGATGAAGAAACGACTTCTTTTTACATTTTGTTCAGTGGGCATCGACGAGTTATGATTATTATCTACACTTTTTTTTTTTTTTTTTTGTTTCACAGCAGGGACTAGAAGACCGTGCTACTTTAGAGAAAGACATTGCTTTCTTTCGTCGGTATGTCTTTCCTTCTCTCTCTTATGTATCTAGTTAACCTTGCTTTCATAGAAATAGTAATGACTCTATGTTGTTCTTTTATCCTTGCTACATCCACATGGGCATTGATTTCATTGCATAAAACTAAAGTGGCTATCATTGATGGATGCACAATCTTTAACTTTATACTCTCTTTTAAACCTAGTAGTAGTTGAATAGCCATTTAACCATTTGCTGTTAATTGAAAATCATATGATGAAAAATAATAGAGGGTTGTTATATAATTCACATATATTGTGTTAGTTATTTATTTTATTCGGAGTCCAACAATTGCGAGGTGAGGGATTTGTTAAACGATATAATATTAAATATACCTTTATCCATTAATTTAAGCTTTTGGATCAATCGATAATTTAACATGGTATCAAAATAGGTAGTCCAAGAGGTGATGTGTTCAAACCCCTACGATGTTATTTCTTTCCTGAACTATCAATCTTTCGAATAATGTTATAAGTTAACTTAGAAATAAAAATGGATAAAAGAAAAAAATGTGTATTAGGAAAAGGCAAAAGTATTCAATAGGGCATATAGAGGAAAAGGGTATTTTAAGAGACATCATTAAGGCTTTGCATGGATTCCATAAAAAATATTATAAGTTCAATCAAATCTCCACCAACTAGTTGGTGGTTGTCCTTTACTTGATTTCAATTGTTTTCAGTAGAGCTATATATGAAAATGTACCTATCTTTCGGTGGTTCTTTTAGTTAC

mRNA sequence

TACATATTAAGTTTCATCCGAGTATTGCTCTATTTTAAAATTTTAGACACCAAATGCTAAAGCTACGGAAGGCGGAGGCGGTTAGGCTGTTACCCAATCCGTTGATATTTTGCTTTGGAAAAAGAACAGATACAGAAGCTCTTAGTCTCTTCTGTTGCTTCAGCTTCCTTTTCTTCTCCTCCATGTATGGCTGCTCTTCCTTTTCCCGCTCTCACTAACCCGCTCGCTTCTCTCTACATTCCCAGAAAACCCCACAGCTCCCCTACCCATTTTGCGAACTTGAACCAAACTGCTGGAAACGTTCAAATCTCTTATAAATCTTATCTCAACCAGATATCTTCTCTCTGCAAAGAAGCCCACCTTCGAGAAGCCGTGAATTTGGTTGCTGATATGGAATTGGAAAATATCACAATCGGACCTGATGTATACGGAGAACTCCTTCAGGGTTGCGTTTACGAGAGAGCTCTTTCGTTGGGTCAGCAAATCCACGGTCGAATTCTCAAGAATGGTGAGTATATTGCAAAGAATGAGTACATCGAGACCAAATTGGTGATCTTCTATTCAAAATGTGACGAGTCAGAAATTGCCAACCGTTTGTTTGGCAAGCTGCTGGTACAGAACGAGTTTGCTTGGGCTGCTATTATGGGATTGAAAAGTAGAATTGGGTTTAATGAAGAAGCTTTGATGGGTTTTTGTGAGATGCACGAAAATGGGCTACTTTTGGATAATTTTGTTATTCCAATTGCTTTGAAGGCTTCTGGTGCTCTGCAATGGATTGGGTTTGGGAAATCCGTACAAGGCTATGTAGTCAAGATGGGTTTAGGCGGGTGTATCTATGTTGCTAGTAGTCTTCTGGATATGTATGGTAAATGTGGGTTATGTGGAGATGCAAAGAAGGTGTTTGATAAAATTCCTGAGAAGAATATAGTAGCTTGGAATTCGATGATTGTAAATTTTACTCAGAATGGACTGAATGCTGAAGCAATTGAGACGTTTTATGAGATGAGGGTGGAAGGTGTTGTACCCACTCAAGTGACTCTATCAAGTTTTCTTTCAGCTTCAGCTAATTTGAGTGTGATCGATGAGGGTAAGCAAGGGCACGCCTTAGCAGTGTTATCTGGACTGGAACTGACCAACATATTGGGAAGTTCGCTCATAAATTTTTATTCCAAGGTTGGTTTGGTTGAGGATGCTGAGCAGGTTTTCAGTGAAATGTTGGAGAAAGATATGGTGACATGGAATTTGCTGGTCTCTGGTTATGTGCATAATGGGCTGGTTGATCGGGCACTTGACTTATGTCACGTAATGCAGTCAGAAAATTTGAGGTTTGATTCTGTGACTCTTGCTTCAATAATGGCTGCGGCTGCTGACTCTAAAAATTTGAAACTAGGGAAGGAAGGGCATTCTTTTTGTGTTAGAAACAACCTTGAATCTGATATTGCTGTTGCTAGTAGTATAGTAGATATGTATGCCAAATGTGAAAAATTGGAATGTGCAAGACAAGTTTTTGACACAACGGTAAAGAGAGACCTTATAATGTGGAACACACTGTTGGCTGCCTATGCAGAGCAGGGTCAGAGTGGTGAAACATTAAAATTGTTTTATCAGATGCAGTTAGAAGGTCTGCCACCAAATGTGATATCCTGGAACTCTGTGATTTTGGGTCTTTTGAATAAAGGTGAAGTTGATGAGGCTAAAGACATGTTCTTGGAGATGCAGTCTCTTGGTGTCTGTCCTAATTTAATTACTTGGACTACTCTCATATGTGGACTCGCTCAGAATGGTCTTGGTGATGAAGCATTCCTGACATTTCAGTCAATGGAAGAAGCTGGCATTAAACCCAACAGTTTGAGTATTAGCTCGCTACTTTCAGCTTGCACAACTATGGCATCTCTGCCTCATGGAAGAGCAATTCATTGTTACATCATAAGACATGACCTTTTGGTATCAACACCGGTCTTATGCTCCTTAGTGAATATGTATGCTAAATGTGGTAGTAT
AAATCAAGCAAAGACGGTGTTTGATATGATAATGAAAAAGGAATTGCCCATCTATAATGCAATGATCTCTGGCTATGCATTACACGGTCAAGCAGTGGAAGCTCTTTCACTCTTTAGACGTCTAAAAGAGGATTGTATAAAACCAGATGAAATAACCTTTACCAGTATCCTTTCAGCATGCAGTCATGCTGGACTTGTAACAGAAGGGTTAGAGCTTTTCATCGATATGGTTTCTAATCATAAAATAGTAGCACAAGCAGAGCATTATGGTTGTCTCATTAGTATTCTTTCTAGGTGTCATAACTTAGACGAAGCTTTAAGACTTATTTTAGGTATGCCTTTTGAGCCTGATGCATCTATATTTGGATCTCTACTTGCTGCGTGCAGAGAGCATCCTGACTTAGAACTCAAAGAACGTTTATTTGAACACTTGTTGAAATTGGAGCCAGATAATTCAGGAAACTATGTGGCATTATCAAATGCATATGCTGCTACTGGAATGTGGGATGAAGCATCAAAAGTGAGGGTTCTGATGAAAGAAAGGGGTCTTAGGAAGACTCCTGGGCATAGCTTGATTCAGATTGGAAATGAAACACATGTATTTTTCGCTGGAGATAAATCACACTCCAGAACAAAAGAAATTTATATGATGTTGGCACTTCTTAGAGTGGAAATGCAATCCACAAGATGTATACCTGTGATCAGTTAAGCTGCTATCTTTTATTTCCATTCTTGAGAAGAGGAAGTCATGCCTGTACTGCATGATGCAGATTCTTTTTGTCTAGGAAAAATCTAGAAATTAGTGACGGTTTAAACGATTGAAGGCCTCACGGACTTGGATAGACAGCTGCAAAATATCATTTACCCTTGCTTGCAGCAACTCATTGACGACCTGTCACATGGGCTGGAAAAATATGATCTGAACATTGAATTCTCAAGAAATATTGAGTAGTGGTTTTTGAACTGTTATTCCTTTTGGACGGCTAGAAATCCTCTACCTAGTTTAGGAGGCAGCGCTCCTCCTGTTATGAGTGATTTGAAACCTTTTGGGTTGTTTTCACCTGTGATAGAAATCTAGGTACAAGTAGATAAACCAATTCCAATCTGAGTCCGACATCAAATAAACTTGAGTTGATTAGCAACTATGGAGTTCAAAACATCATGATGGAATGCTCGAGAAACCCATTTGTCATTGTCTATATTGACAAGCAATGCAGAGAAGCAGTGGTGTGTAGTTGTGATGGATTGTTTAAACGAGCTGTACGAGTTGCTTTCATGTCAAACTCAGGTTTTTGGTCTCTTTCGTAAAGCCTGTAGATGATTTGATTCTTTTACTGTATATCATTCAGATCCTCTTACCAATTGAATAATCCACTAGAAACACAACTTTGCTTGGTCTCTACTATGAACATACCAAAGCATATGAGATGGAATGGCCATAACATACTTTCTGCGCTAGCTTCTATTTGGTGCGAGATTATTCTCGTTAAGATTTGCACTTTGAAATTTTCAATTAAATAGTTAACTCTAGCCTAAAATGCTTGTACTTTTTTCTGTTTAAATGATGGGGATTCCTGACTTTCATACACCATGCTCAATATTTGATGCTTTTCTCTTCCTATAAAGCAGCCAAGATGATGCACCCCACCGGACTTAATTGTATGGTGTTTGAGAAGTGAGACTTACAGGACAAATTTTCAGTTTCAGGATTGTCATTATACAATACACTGCAGATTAACAGGAACTGGTACTAGGAAATGATCTCAGTACAAAACATCAATAAATATTCTTACTTCTGCCAGGGACTAGAAGACCGTGCTACTTTAGAGAAAGACATTGCTTTCTTTCGTCGGTATGTCTTTCCTTCTCTCTCTTATGTATCTAGTTAACCTTGCTTTCATAGAAATAGTAATGACTCTATGTTGTTCTTTTATCCTTGCTACATCCACATGGGCATTGATTTCATTGCATAAAACTAAAGTGGCTATCATTGATGGATG
CACAATCTTTAACTTTATACTCTCTTTTAAACCTAGTAGTAGTTGAATAGCCATTTAACCATTTGCTGTTAATTGAAAATCATATGATGAAAAATAATAGAGGGTTGTTATATAATTCACATATATTGTGTTAGTTATTTATTTTATTCGGAGTCCAACAATTGCGAGGTGAGGGATTTGTTAAACGATATAATATTAAATATACCTTTATCCATTAATTTAAGCTTTTGGATCAATCGATAATTTAACATGGTATCAAAATAGGTAGTCCAAGAGGTGATGTGTTCAAACCCCTACGATGTTATTTCTTTCCTGAACTATCAATCTTTCGAATAATGTTATAAGTTAACTTAGAAATAAAAATGGATAAAAGAAAAAAATGTGTATTAGGAAAAGGCAAAAGTATTCAATAGGGCATATAGAGGAAAAGGGTATTTTAAGAGACATCATTAAGGCTTTGCATGGATTCCATAAAAAATATTATAAGTTCAATCAAATCTCCACCAACTAGTTGGTGGTTGTCCTTTACTTGATTTCAATTGTTTTCAGTAGAGCTATATATGAAAATGTACCTATCTTTCGGTGGTTCTTTTAGTTAC

Coding sequence (CDS)

ATGGCTGCTCTTCCTTTTCCCGCTCTCACTAACCCGCTCGCTTCTCTCTACATTCCCAGAAAACCCCACAGCTCCCCTACCCATTTTGCGAACTTGAACCAAACTGCTGGAAACGTTCAAATCTCTTATAAATCTTATCTCAACCAGATATCTTCTCTCTGCAAAGAAGCCCACCTTCGAGAAGCCGTGAATTTGGTTGCTGATATGGAATTGGAAAATATCACAATCGGACCTGATGTATACGGAGAACTCCTTCAGGGTTGCGTTTACGAGAGAGCTCTTTCGTTGGGTCAGCAAATCCACGGTCGAATTCTCAAGAATGGTGAGTATATTGCAAAGAATGAGTACATCGAGACCAAATTGGTGATCTTCTATTCAAAATGTGACGAGTCAGAAATTGCCAACCGTTTGTTTGGCAAGCTGCTGGTACAGAACGAGTTTGCTTGGGCTGCTATTATGGGATTGAAAAGTAGAATTGGGTTTAATGAAGAAGCTTTGATGGGTTTTTGTGAGATGCACGAAAATGGGCTACTTTTGGATAATTTTGTTATTCCAATTGCTTTGAAGGCTTCTGGTGCTCTGCAATGGATTGGGTTTGGGAAATCCGTACAAGGCTATGTAGTCAAGATGGGTTTAGGCGGGTGTATCTATGTTGCTAGTAGTCTTCTGGATATGTATGGTAAATGTGGGTTATGTGGAGATGCAAAGAAGGTGTTTGATAAAATTCCTGAGAAGAATATAGTAGCTTGGAATTCGATGATTGTAAATTTTACTCAGAATGGACTGAATGCTGAAGCAATTGAGACGTTTTATGAGATGAGGGTGGAAGGTGTTGTACCCACTCAAGTGACTCTATCAAGTTTTCTTTCAGCTTCAGCTAATTTGAGTGTGATCGATGAGGGTAAGCAAGGGCACGCCTTAGCAGTGTTATCTGGACTGGAACTGACCAACATATTGGGAAGTTCGCTCATAAATTTTTATTCCAAGGTTGGTTTGGTTGAGGATGCTGAGCAGGTTTTCAGTGAAATGTTGGAGAAAGATATGGTGACATGGAATTTGCTGGTCTCTGGTTATGTGCATAATGGGCTGGTTGATCGGGCACTTGACTTATGTCACGTAATGCAGTCAGAAAATTTGAGGTTTGATTCTGTGACTCTTGCTTCAATAATGGCTGCGGCTGCTGACTCTAAAAATTTGAAACTAGGGAAGGAAGGGCATTCTTTTTGTGTTAGAAACAACCTTGAATCTGATATTGCTGTTGCTAGTAGTATAGTAGATATGTATGCCAAATGTGAAAAATTGGAATGTGCAAGACAAGTTTTTGACACAACGGTAAAGAGAGACCTTATAATGTGGAACACACTGTTGGCTGCCTATGCAGAGCAGGGTCAGAGTGGTGAAACATTAAAATTGTTTTATCAGATGCAGTTAGAAGGTCTGCCACCAAATGTGATATCCTGGAACTCTGTGATTTTGGGTCTTTTGAATAAAGGTGAAGTTGATGAGGCTAAAGACATGTTCTTGGAGATGCAGTCTCTTGGTGTCTGTCCTAATTTAATTACTTGGACTACTCTCATATGTGGACTCGCTCAGAATGGTCTTGGTGATGAAGCATTCCTGACATTTCAGTCAATGGAAGAAGCTGGCATTAAACCCAACAGTTTGAGTATTAGCTCGCTACTTTCAGCTTGCACAACTATGGCATCTCTGCCTCATGGAAGAGCAATTCATTGTTACATCATAAGACATGACCTTTTGGTATCAACACCGGTCTTATGCTCCTTAGTGAATATGTATGCTAAATGTGGTAGTATAAATCAAGCAAAGACGGTGTTTGATATGATAATGAAAAAGGAATTGCCCATCTATAATGCAATGATCTCTGGCTATGCATTACACGGTCAAGCAGTGGAAGCTCTTTCACTCTTTAGACGTCTAAAAGAGGATTGTATAAAACCAGATGAAATAACCTTTACCAGTATCCTTTCAGCATGCAGTCA
TGCTGGACTTGTAACAGAAGGGTTAGAGCTTTTCATCGATATGGTTTCTAATCATAAAATAGTAGCACAAGCAGAGCATTATGGTTGTCTCATTAGTATTCTTTCTAGGTGTCATAACTTAGACGAAGCTTTAAGACTTATTTTAGGTATGCCTTTTGAGCCTGATGCATCTATATTTGGATCTCTACTTGCTGCGTGCAGAGAGCATCCTGACTTAGAACTCAAAGAACGTTTATTTGAACACTTGTTGAAATTGGAGCCAGATAATTCAGGAAACTATGTGGCATTATCAAATGCATATGCTGCTACTGGAATGTGGGATGAAGCATCAAAAGTGAGGGTTCTGATGAAAGAAAGGGGTCTTAGGAAGACTCCTGGGCATAGCTTGATTCAGATTGGAAATGAAACACATGTATTTTTCGCTGGAGATAAATCACACTCCAGAACAAAAGAAATTTATATGATGTTGGCACTTCTTAGAGTGGAAATGCAATCCACAAGATGTATACCTGTGATCAGTTAA

Protein sequence

MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLREAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLDNFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFDKIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGLPPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS
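
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The sketch below uses only the Python standard library; the `translate` function and the two short excerpts (`cds_start`, `cds_end`, copied from the first and last codons of the CDS listed above) are illustrative choices, and the use of the standard nuclear codon table is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 7 codons of the CDS shown above.
cds_start = "ATGGCTGCTCTTCCTTTTCCCGCTCTCACTAACCCGCTC"
cds_end = "TGTATACCTGTGATCAGTTAA"

print(translate(cds_start))  # should match the start of the protein sequence
print(translate(cds_end))    # should match the end of the protein sequence
```

The two excerpts translate to `MAALPFPALTNPL` and `CIPVIS`, which agree with the first and last residues of the protein sequence. Passing the full CDS to the same function would allow the whole protein to be compared.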
Homology
BLAST of Bhi02G000898 vs. TAIR 10
Match: AT5G55740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 924.9 bits (2389), Expect = 4.8e-269
Identity = 464/835 (55.57%), Positives = 614/835 (73.53%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MA+LPF  + N      +P    S P+   +  Q       S  SY +++SSLCK   ++
Sbjct: 1   MASLPFNTIPNK-----VPFSVSSKPSSKHHDEQAHSP---SSTSYFHRVSSLCKNGEIK 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV +M+  N+ IGP++YGE+LQGCVYER LS G+QIH RILKNG++ A+NEYIETK
Sbjct: 61  EALSLVTEMDFRNLRIGPEIYGEILQGCVYERDLSTGKQIHARILKNGDFYARNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFY+KCD  EIA  LF KL V+N F+WAAI+G+K RIG  E ALMGF EM EN +  D
Sbjct: 121 LVIFYAKCDALEIAEVLFSKLRVRNVFSWAAIIGVKCRIGLCEGALMGFVEMLENEIFPD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFV+P   KA GAL+W  FG+ V GYVVK GL  C++VASSL DMYGKCG+  DA KVFD
Sbjct: 181 NFVVPNVCKACGALKWSRFGRGVHGYVVKSGLEDCVFVASSLADMYGKCGVLDDASKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           +IP++N VAWN+++V + QNG N EAI  F +MR +GV PT+VT+S+ LSASAN+  ++E
Sbjct: 241 EIPDRNAVAWNALMVGYVQNGKNEEAIRLFSDMRKQGVEPTRVTVSTCLSASANMGGVEE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQ HA+A+++G+EL NILG+SL+NFY KVGL+E AE VF  M EKD+VTWNL++SGYV 
Sbjct: 301 GKQSHAIAIVNGMELDNILGTSLLNFYCKVGLIEYAEMVFDRMFEKDVVTWNLIISGYVQ 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
            GLV+ A+ +C +M+ E L++D VTLA++M+AAA ++NLKLGKE   +C+R++ ESDI +
Sbjct: 361 QGLVEDAIYMCQLMRLEKLKYDCVTLATLMSAAARTENLKLGKEVQCYCIRHSFESDIVL 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           AS+++DMYAKC  +  A++VFD+TV++DLI+WNTLLAAYAE G SGE L+LFY MQLEG+
Sbjct: 421 ASTVMDMYAKCGSIVDAKKVFDSTVEKDLILWNTLLAAYAESGLSGEALRLFYGMQLEGV 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVI+WN +IL LL  G+VDEAKDMFL+MQS G+ PNLI+WTT++ G+ QNG  +EA L
Sbjct: 481 PPNVITWNLIILSLLRNGQVDEAKDMFLQMQSSGIIPNLISWTTMMNGMVQNGCSEEAIL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIR---HDLLVSTPVLCSLVN 600
             + M+E+G++PN+ SI+  LSAC  +ASL  GR IH YIIR   H  LVS  +  SLV+
Sbjct: 541 FLRKMQESGLRPNAFSITVALSACAHLASLHIGRTIHGYIIRNLQHSSLVS--IETSLVD 600

Query: 601 MYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEIT 660
           MYAKCG IN+A+ VF   +  ELP+ NAMIS YAL+G   EA++L+R L+   +KPD IT
Sbjct: 601 MYAKCGDINKAEKVFGSKLYSELPLSNAMISAYALYGNLKEAIALYRSLEGVGLKPDNIT 660

Query: 661 FTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGM 720
            T++LSAC+HAG + + +E+F D+VS   +    EHYG ++ +L+     ++ALRLI  M
Sbjct: 661 ITNVLSACNHAGDINQAIEIFTDIVSKRSMKPCLEHYGLMVDLLASAGETEKALRLIEEM 720

Query: 721 PFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEAS 780
           PF+PDA +  SL+A+C +    EL + L   LL+ EP+NSGNYV +SNAYA  G WDE  
Sbjct: 721 PFKPDARMIQSLVASCNKQRKTELVDYLSRKLLESEPENSGNYVTISNAYAVEGSWDEVV 780

Query: 781 KVRVLMKERGLRKTPGHSLIQIGNE--THVFFAGDKSHSRTKEIYMMLALLRVEM 831
           K+R +MK +GL+K PG S IQI  E   HVF A DK+H+R  EI MMLALL  +M
Sbjct: 781 KMREMMKAKGLKKKPGCSWIQITGEEGVHVFVANDKTHTRINEIQMMLALLLYDM 825
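
The identity and positives percentages in each HSP summary are the match counts divided by the alignment length. The sketch below parses one summary line and recomputes both percentages. The function name `check_hsp` and the returned dictionary keys are hypothetical, and the regular expression is written only for the line layout shown on this page, not for BLAST output in general.

```python
import re

# Matches the HSP summary line as it appears on this page.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp(line):
    """Recompute identity/positives percentages from an HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = m.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return {
        "alignment_length": int(ident_len),
        "identity_pct": identity_pct,
        "positives_pct": positives_pct,
        # True when the recomputed values agree with the reported ones.
        "consistent": (
            abs(identity_pct - float(ident_rep)) < 0.006
            and abs(positives_pct - float(pos_rep)) < 0.006
            and ident_len == pos_len
        ),
    }


result = check_hsp(
    "Identity = 464/835 (55.57%), Positives = 614/835 (73.53%), Query Frame = 0"
)
print(result)
```

For the AT5G55740.1 hit, 464 identical and 614 positive positions over an 835-column alignment reproduce the reported 55.57% and 73.53%.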

BLAST of Bhi02G000898 vs. TAIR 10
Match: AT3G63370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 433.7 bits (1114), Expect = 3.3e-121
Identity = 253/778 (32.52%), Positives = 421/778 (54.11%), Query Frame = 0

Query: 79  DVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLF 138
           + +  +L+ C   RA+S G+Q+H RI K      + +++  KLV  Y KC   + A ++F
Sbjct: 81  EAFAYVLELCGKRRAVSQGRQLHSRIFKTFPSF-ELDFLAGKLVFMYGKCGSLDDAEKVF 140

Query: 139 GKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLDNFVIPIALKASGALQWIG 198
            ++  +  FAW  ++G     G    AL  +  M   G+ L     P  LKA   L+ I 
Sbjct: 141 DEMPDRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIR 200

Query: 199 FGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFDKIPEK-NIVAWNSMIVNF 258
            G  +   +VK+G     ++ ++L+ MY K      A+++FD   EK + V WNS++ ++
Sbjct: 201 SGSELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSY 260

Query: 259 TQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTN 318
           + +G + E +E F EM + G  P   T+ S L+A    S    GK+ HA  + S    + 
Sbjct: 261 STSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSE 320

Query: 319 I-LGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQS 378
           + + ++LI  Y++ G +  AE++  +M   D+VTWN L+ GYV N +   AL+    M +
Sbjct: 321 LYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIA 380

Query: 379 ENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLEC 438
              + D V++ SI+AA+    NL  G E H++ +++  +S++ V ++++DMY+KC     
Sbjct: 381 AGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCY 440

Query: 439 ARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLF-----YQMQL-EGLPPNVISWNSV 498
             + F     +DLI W T++A YA+     E L+LF      +M++ E +  +++  +SV
Sbjct: 441 MGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSV 500

Query: 499 ILG----------LLNKGEVD-----EAKDMFLEMQSLGVC---------PNLITWTTLI 558
           +            +L KG +D     E  D++ + +++G            ++++WT++I
Sbjct: 501 LKSMLIVKEIHCHILRKGLLDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMI 560

Query: 559 CGLAQNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLL 618
              A NG   EA   F+ M E G+  +S+++  +LSA  ++++L  GR IHCY++R    
Sbjct: 561 SSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFC 620

Query: 619 VSTPVLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRL 678
           +   +  ++V+MYA CG +  AK VFD I +K L  Y +MI+ Y +HG    A+ LF ++
Sbjct: 621 LEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKM 680

Query: 679 KEDCIKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHN 738
           + + + PD I+F ++L ACSHAGL+ EG      M   +++    EHY CL+ +L R + 
Sbjct: 681 RHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANC 740

Query: 739 LDEALRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNA 798
           + EA   +  M  EP A ++ +LLAACR H + E+ E   + LL+LEP N GN V +SN 
Sbjct: 741 VVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNV 800

Query: 799 YAATGMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLA 825
           +A  G W++  KVR  MK  G+ K PG S I++  + H F A DKSH  +KEIY  L+
Sbjct: 801 FAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYEKLS 857

BLAST of Bhi02G000898 vs. TAIR 10
Match: AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 428.7 bits (1101), Expect = 1.1e-119
Identity = 277/885 (31.30%), Positives = 435/885 (49.15%), Query Frame = 0

Query: 42   SYKSYLNQISSLCKEAH-LREAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQI 101
            ++ S L+  SS+ K    LR  V+L  +    N       +  +L  C  E  +  G+QI
Sbjct: 127  AWNSMLSMYSSIGKPGKVLRSFVSLFENQIFPN----KFTFSIVLSTCARETNVEFGRQI 186

Query: 102  HGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIG 161
            H  ++K G  + +N Y    LV  Y+KCD    A R+F  ++  N   W  +     + G
Sbjct: 187  HCSMIKMG--LERNSYCGGALVDMYAKCDRISDARRVFEWIVDPNTVCWTCLFSGYVKAG 246

Query: 162  FNEEALMGFCEMHENGLLLDN--FVIPI-------------------------------- 221
              EEA++ F  M + G   D+  FV  I                                
Sbjct: 247  LPEEAVLVFERMRDEGHRPDHLAFVTVINTYIRLGKLKDARLLFGEMSSPDVVAWNVMIS 306

Query: 222  --------------------------------ALKASGALQWIGFGKSVQGYVVKMGLGG 281
                                             L A G +  +  G  V    +K+GL  
Sbjct: 307  GHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIKLGLAS 366

Query: 282  CIYVASSLLDMYGKCGLCGDAKKVFDKIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMR 341
             IYV SSL+ MY KC     A KVF+ + EKN V WN+MI  +  NG + + +E F +M+
Sbjct: 367  NIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMK 426

Query: 342  VEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTNILGSSLINFYSKVGLVE 401
              G      T +S LS  A    ++ G Q H++ +   L     +G++L++ Y+K G +E
Sbjct: 427  SSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALE 486

Query: 402  DAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQSENLRFDSVTLASIMAAAA 461
            DA Q+F  M ++D VTWN ++  YV +     A DL   M    +  D   LAS + A  
Sbjct: 487  DARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACT 546

Query: 462  DSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNT 521
                L  GK+ H   V+  L+ D+   SS++DMY+KC  ++ AR+VF +  +  ++  N 
Sbjct: 547  HVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNA 606

Query: 522  LLAAYAEQGQSGETLKLFYQMQLEGLPPNVISWNSVI----------LGLLNKGEVDE-- 581
            L+A Y+ Q    E + LF +M   G+ P+ I++ +++          LG    G++ +  
Sbjct: 607  LIAGYS-QNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRG 666

Query: 582  --AKDMFLEMQSLGVCPN-------------------LITWTTLICGLAQNGLGDEAFLT 641
              ++  +L +  LG+  N                   ++ WT ++ G +QNG  +EA   
Sbjct: 667  FSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKF 726

Query: 642  FQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYI--IRHDLLVSTPVLCSLVNMY 701
            ++ M   G+ P+  +  ++L  C+ ++SL  GRAIH  I  + HDL   T    +L++MY
Sbjct: 727  YKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTS--NTLIDMY 786

Query: 702  AKCGSINQAKTVFD-MIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITF 761
            AKCG +  +  VFD M  +  +  +N++I+GYA +G A +AL +F  +++  I PDEITF
Sbjct: 787  AKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITF 846

Query: 762  TSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMP 821
              +L+ACSHAG V++G ++F  M+  + I A+ +H  C++ +L R   L EA   I    
Sbjct: 847  LGVLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQN 906

Query: 822  FEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASK 824
             +PDA ++ SLL ACR H D    E   E L++LEP NS  YV LSN YA+ G W++A+ 
Sbjct: 907  LKPDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANA 966

BLAST of Bhi02G000898 vs. TAIR 10
Match: AT1G19720.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 422.2 bits (1084), Expect = 1.0e-117
Identity = 256/833 (30.73%), Positives = 423/833 (50.78%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLN-----QISSLCK 60
           M  L  P+      +   P K  +SP    +      N+  + K   N     Q   LC+
Sbjct: 1   MEKLFVPSFPKTFLNYQTPAKVENSPE--LHPKSRKKNLSFTKKKEPNIIPDEQFDYLCR 60

Query: 61  EAHLREAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNE 120
              L EA   +  +  +   +    Y +LL+ C+   ++ LG+ +H R    G +   + 
Sbjct: 61  NGSLLEAEKALDSLFQQGSKVKRSTYLKLLESCIDSGSIHLGRILHARF---GLFTEPDV 120

Query: 121 YIETKLVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHEN 180
           ++ETKL+  Y+KC     A ++F  +  +N F W+A++G  SR     E    F  M ++
Sbjct: 121 FVETKLLSMYAKCGCIADARKVFDSMRERNLFTWSAMIGAYSRENRWREVAKLFRLMMKD 180

Query: 181 GLLLDNFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDA 240
           G+L D+F+ P  L+       +  GK +   V+K+G+  C+ V++S+L +Y KCG    A
Sbjct: 181 GVLPDDFLFPKILQGCANCGDVEAGKVIHSVVIKLGMSSCLRVSNSILAVYAKCGELDFA 240

Query: 241 KKVFDKIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANL 300
            K F ++ E++++AWNS+++ + QNG + EA+E   EM  EG+ P               
Sbjct: 241 TKFFRRMRERDVIAWNSVLLAYCQNGKHEEAVELVKEMEKEGISP--------------- 300

Query: 301 SVIDEGKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEM----LEKDMVTW 360
                           GL   NI    LI  Y+++G  + A  +  +M    +  D+ TW
Sbjct: 301 ----------------GLVTWNI----LIGGYNQLGKCDAAMDLMQKMETFGITADVFTW 360

Query: 361 NLLVSGYVHNGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVR 420
             ++SG +HNG+  +ALD+   M    +  ++VT+ S ++A +  K +  G E HS  V+
Sbjct: 361 TAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSIAVK 420

Query: 421 NNLESDIAVASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKL 480
                D+ V +S+VDMY+KC KLE AR+VFD+   +D+  WN+++  Y + G  G+  +L
Sbjct: 421 MGFIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCQAGYCGKAYEL 480

Query: 481 FYQMQLEGLPPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLG-VCPNLITWTTLICGLA 540
           F +MQ   L PN+I+WN++I G +  G+  EA D+F  M+  G V  N  TW  +I G  
Sbjct: 481 FTRMQDANLRPNIITWNTMISGYIKNGDEGEAMDLFQRMEKDGKVQRNTATWNLIIAGYI 540

Query: 541 QNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTP 600
           QNG  DEA   F+ M+ +   PNS++I SLL AC  +      R IH  ++R +L     
Sbjct: 541 QNGKKDEALELFRKMQFSRFMPNSVTILSLLPACANLLGAKMVREIHGCVLRRNLDAIHA 600

Query: 601 VLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDC 660
           V  +L + YAK G I  ++T+F  +  K++  +N++I GY LHG    AL+LF ++K   
Sbjct: 601 VKNALTDTYAKSGDIEYSRTIFLGMETKDIITWNSLIGGYVLHGSYGPALALFNQMKTQG 660

Query: 661 IKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEA 720
           I P+  T +SI+ A    G V EG ++F  + +++ I+   EH   ++ +  R + L+EA
Sbjct: 661 ITPNRGTLSSIILAHGLMGNVDEGKKVFYSIANDYHIIPALEHCSAMVYLYGRANRLEEA 720

Query: 721 LRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAAT 780
           L+ I  M  + +  I+ S L  CR H D+++     E+L  LEP+N+     +S  YA  
Sbjct: 721 LQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAENLFSLEPENTATESIVSQIYALG 780

Query: 781 GMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMML 824
                + +     ++  L+K  G S I++ N  H F  GD+S   T  +Y ++
Sbjct: 781 AKLGRSLEGNKPRRDNLLKKPLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPLV 793

BLAST of Bhi02G000898 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 421.0 bits (1081), Expect = 2.2e-117
Identity = 247/788 (31.35%), Positives = 420/788 (53.30%), Query Frame = 0

Query: 84  LLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLFGKLLV 143
           LLQ C     L  G+Q+H  ++ N   I+ + Y + +++  Y+ C       ++F +L +
Sbjct: 41  LLQACSNPNLLRQGKQVHAFLIVNS--ISGDSYTDERILGMYAMCGSFSDCGKMFYRLDL 100

Query: 144 QNEF--AWAAIMGLKSRIGFNEEALMGFCEMHENGLLLDNFVIPIALKASGALQWIGFGK 203
           +      W +I+    R G   +AL  + +M   G+  D    P  +KA  AL+      
Sbjct: 101 RRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVSTFPCLVKACVALKNFKGID 160

Query: 204 SVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFDKIPEKNIVAWNSMIVNFTQNG 263
            +   V  +G+    +VASSL+  Y + G      K+FD++ +K+ V WN M+  + + G
Sbjct: 161 FLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCG 220

Query: 264 LNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTNILGS 323
                I+ F  MR++ + P  VT    LS  A+  +ID G Q H L V+SG++    + +
Sbjct: 221 ALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKN 280

Query: 324 SLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQSENLRF 383
           SL++ YSK G  +DA ++F  M   D VTWN ++SGYV +GL++ +L   + M S  +  
Sbjct: 281 SLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLP 340

Query: 384 DSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLECARQVF 443
           D++T +S++ + +  +NL+  K+ H + +R+++  DI + S+++D Y KC  +  A+ +F
Sbjct: 341 DAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIF 400

Query: 444 DTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGLPPNVISWNSV--ILGLL---- 503
                 D++++  +++ Y   G   ++L++F  +    + PN I+  S+  ++G+L    
Sbjct: 401 SQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALK 460

Query: 504 ----------NKGEVDEAK------DMFLEMQSLGVC---------PNLITWTTLICGLA 563
                      KG  +         DM+ +   + +           ++++W ++I   A
Sbjct: 461 LGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCA 520

Query: 564 QNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTP 623
           Q+     A   F+ M  +GI  + +SIS+ LSAC  + S   G+AIH ++I+H L     
Sbjct: 521 QSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVY 580

Query: 624 VLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDC 683
              +L++MYAKCG++  A  VF  + +K +  +N++I+    HG+  ++L LF  + E  
Sbjct: 581 SESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKS 640

Query: 684 -IKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDE 743
            I+PD+ITF  I+S+C H G V EG+  F  M  ++ I  Q EHY C++ +  R   L E
Sbjct: 641 GIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTE 700

Query: 744 ALRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAA 803
           A   +  MPF PDA ++G+LL ACR H ++EL E     L+ L+P NSG YV +SNA+A 
Sbjct: 701 AYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHAN 760

Query: 804 TGMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVE 838
              W+  +KVR LMKER ++K PG+S I+I   TH+F +GD +H  +  IY +L  L  E
Sbjct: 761 AREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGE 820

BLAST of Bhi02G000898 vs. ExPASy Swiss-Prot
Match: Q9FM64 (Pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR21 PE=2 SV=1)

HSP 1 Score: 924.9 bits (2389), Expect = 6.7e-268
Identity = 464/835 (55.57%), Positives = 614/835 (73.53%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MA+LPF  + N      +P    S P+   +  Q       S  SY +++SSLCK   ++
Sbjct: 1   MASLPFNTIPNK-----VPFSVSSKPSSKHHDEQAHSP---SSTSYFHRVSSLCKNGEIK 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV +M+  N+ IGP++YGE+LQGCVYER LS G+QIH RILKNG++ A+NEYIETK
Sbjct: 61  EALSLVTEMDFRNLRIGPEIYGEILQGCVYERDLSTGKQIHARILKNGDFYARNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFY+KCD  EIA  LF KL V+N F+WAAI+G+K RIG  E ALMGF EM EN +  D
Sbjct: 121 LVIFYAKCDALEIAEVLFSKLRVRNVFSWAAIIGVKCRIGLCEGALMGFVEMLENEIFPD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFV+P   KA GAL+W  FG+ V GYVVK GL  C++VASSL DMYGKCG+  DA KVFD
Sbjct: 181 NFVVPNVCKACGALKWSRFGRGVHGYVVKSGLEDCVFVASSLADMYGKCGVLDDASKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           +IP++N VAWN+++V + QNG N EAI  F +MR +GV PT+VT+S+ LSASAN+  ++E
Sbjct: 241 EIPDRNAVAWNALMVGYVQNGKNEEAIRLFSDMRKQGVEPTRVTVSTCLSASANMGGVEE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQ HA+A+++G+EL NILG+SL+NFY KVGL+E AE VF  M EKD+VTWNL++SGYV 
Sbjct: 301 GKQSHAIAIVNGMELDNILGTSLLNFYCKVGLIEYAEMVFDRMFEKDVVTWNLIISGYVQ 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
            GLV+ A+ +C +M+ E L++D VTLA++M+AAA ++NLKLGKE   +C+R++ ESDI +
Sbjct: 361 QGLVEDAIYMCQLMRLEKLKYDCVTLATLMSAAARTENLKLGKEVQCYCIRHSFESDIVL 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           AS+++DMYAKC  +  A++VFD+TV++DLI+WNTLLAAYAE G SGE L+LFY MQLEG+
Sbjct: 421 ASTVMDMYAKCGSIVDAKKVFDSTVEKDLILWNTLLAAYAESGLSGEALRLFYGMQLEGV 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVI+WN +IL LL  G+VDEAKDMFL+MQS G+ PNLI+WTT++ G+ QNG  +EA L
Sbjct: 481 PPNVITWNLIILSLLRNGQVDEAKDMFLQMQSSGIIPNLISWTTMMNGMVQNGCSEEAIL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIR---HDLLVSTPVLCSLVN 600
             + M+E+G++PN+ SI+  LSAC  +ASL  GR IH YIIR   H  LVS  +  SLV+
Sbjct: 541 FLRKMQESGLRPNAFSITVALSACAHLASLHIGRTIHGYIIRNLQHSSLVS--IETSLVD 600

Query: 601 MYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEIT 660
           MYAKCG IN+A+ VF   +  ELP+ NAMIS YAL+G   EA++L+R L+   +KPD IT
Sbjct: 601 MYAKCGDINKAEKVFGSKLYSELPLSNAMISAYALYGNLKEAIALYRSLEGVGLKPDNIT 660

Query: 661 FTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGM 720
            T++LSAC+HAG + + +E+F D+VS   +    EHYG ++ +L+     ++ALRLI  M
Sbjct: 661 ITNVLSACNHAGDINQAIEIFTDIVSKRSMKPCLEHYGLMVDLLASAGETEKALRLIEEM 720

Query: 721 PFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEAS 780
           PF+PDA +  SL+A+C +    EL + L   LL+ EP+NSGNYV +SNAYA  G WDE  
Sbjct: 721 PFKPDARMIQSLVASCNKQRKTELVDYLSRKLLESEPENSGNYVTISNAYAVEGSWDEVV 780

Query: 781 KVRVLMKERGLRKTPGHSLIQIGNE--THVFFAGDKSHSRTKEIYMMLALLRVEM 831
           K+R +MK +GL+K PG S IQI  E   HVF A DK+H+R  EI MMLALL  +M
Sbjct: 781 KMREMMKAKGLKKKPGCSWIQITGEEGVHVFVANDKTHTRINEIQMMLALLLYDM 825
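
The percentages in an HSP summary line follow directly from the counts printed beside them: the number of identical (or positively scoring) columns divided by the alignment length. The following is a minimal Python sketch, assuming the summary line keeps the layout used in this report; the names `HSP_STATS`, `percent` and `parse_hsp_stats` are illustrative and do not belong to any BLAST tool.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of a summary line.
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, _, positives, _, _ = match.groups()
    return {
        "identities": int(identities),
        "positives": int(positives),
        "length": int(length),
        "identity_pct": percent(int(identities), int(length)),
        "positive_pct": percent(int(positives), int(length)),
    }


# Summary line of the Q9FM64 hit, used here as sample input.
line = "Identity = 464/835 (55.57%), Positives = 614/835 (73.53%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, is a cheap consistency check when such reports are scraped in bulk.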

BLAST of Bhi02G000898 vs. ExPASy Swiss-Prot
Match: Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 433.7 bits (1114), Expect = 4.7e-120
Identity = 253/778 (32.52%), Positives = 421/778 (54.11%), Query Frame = 0

Query: 79  DVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLF 138
           + +  +L+ C   RA+S G+Q+H RI K      + +++  KLV  Y KC   + A ++F
Sbjct: 81  EAFAYVLELCGKRRAVSQGRQLHSRIFKTFPSF-ELDFLAGKLVFMYGKCGSLDDAEKVF 140

Query: 139 GKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLDNFVIPIALKASGALQWIG 198
            ++  +  FAW  ++G     G    AL  +  M   G+ L     P  LKA   L+ I 
Sbjct: 141 DEMPDRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIR 200

Query: 199 FGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFDKIPEK-NIVAWNSMIVNF 258
            G  +   +VK+G     ++ ++L+ MY K      A+++FD   EK + V WNS++ ++
Sbjct: 201 SGSELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSY 260

Query: 259 TQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTN 318
           + +G + E +E F EM + G  P   T+ S L+A    S    GK+ HA  + S    + 
Sbjct: 261 STSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSE 320

Query: 319 I-LGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQS 378
           + + ++LI  Y++ G +  AE++  +M   D+VTWN L+ GYV N +   AL+    M +
Sbjct: 321 LYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIA 380

Query: 379 ENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLEC 438
              + D V++ SI+AA+    NL  G E H++ +++  +S++ V ++++DMY+KC     
Sbjct: 381 AGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCY 440

Query: 439 ARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLF-----YQMQL-EGLPPNVISWNSV 498
             + F     +DLI W T++A YA+     E L+LF      +M++ E +  +++  +SV
Sbjct: 441 MGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSV 500

Query: 499 ILG----------LLNKGEVD-----EAKDMFLEMQSLGVC---------PNLITWTTLI 558
           +            +L KG +D     E  D++ + +++G            ++++WT++I
Sbjct: 501 LKSMLIVKEIHCHILRKGLLDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMI 560

Query: 559 CGLAQNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLL 618
              A NG   EA   F+ M E G+  +S+++  +LSA  ++++L  GR IHCY++R    
Sbjct: 561 SSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGFC 620

Query: 619 VSTPVLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRL 678
           +   +  ++V+MYA CG +  AK VFD I +K L  Y +MI+ Y +HG    A+ LF ++
Sbjct: 621 LEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKM 680

Query: 679 KEDCIKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHN 738
           + + + PD I+F ++L ACSHAGL+ EG      M   +++    EHY CL+ +L R + 
Sbjct: 681 RHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANC 740

Query: 739 LDEALRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNA 798
           + EA   +  M  EP A ++ +LLAACR H + E+ E   + LL+LEP N GN V +SN 
Sbjct: 741 VVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNV 800

Query: 799 YAATGMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLA 825
           +A  G W++  KVR  MK  G+ K PG S I++  + H F A DKSH  +KEIY  L+
Sbjct: 801 FAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYEKLS 857
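
The Swiss-Prot and TrEMBL "Match:" lines carry UniProt header tags after the protein description: `OS` (organism name), `OX` (taxonomy identifier), `GN` (gene name), `PE` (protein existence level) and `SV` (sequence version). The sketch below, in Python, splits one such line into its parts. It assumes the "Match: ACCESSION (description TAG=value ...)" layout used in this report, and the function name `parse_match_line` is hypothetical. It is not meant for the NCBI nr lines, which chain several records with ">".

```python
import re

TAGS = "OS|OX|GN|PE|SV"

# A tag, "=", then the shortest value that is followed by the next tag
# or by the (optional) closing parenthesis at the end of the line.
FIELD = re.compile(r"(%s)=(.+?)(?=\s(?:%s)=|\)?$)" % (TAGS, TAGS))


def parse_match_line(line):
    """Split a UniProt-style 'Match:' line into accession, description, tags."""
    head = re.match(r"Match:\s+(\S+)\s+\((.*)$", line.strip())
    if head is None:
        raise ValueError("not a Match line: %r" % line)
    accession, rest = head.groups()
    # The description is everything before the first TAG= field.
    description = re.split(r"\s(?:%s)=" % TAGS, rest, maxsplit=1)[0]
    record = {"accession": accession, "description": description.rstrip(")")}
    record.update(dict(FIELD.findall(rest)))
    return record


line = ("Match: Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, "
        "chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)")
record = parse_match_line(line)
print(record["accession"], record["OS"], record["GN"])
```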

BLAST of Bhi02G000898 vs. ExPASy Swiss-Prot
Match: Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 1.5e-118
Identity = 277/885 (31.30%), Positives = 435/885 (49.15%), Query Frame = 0

Query: 42   SYKSYLNQISSLCKEAH-LREAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQI 101
            ++ S L+  SS+ K    LR  V+L  +    N       +  +L  C  E  +  G+QI
Sbjct: 127  AWNSMLSMYSSIGKPGKVLRSFVSLFENQIFPN----KFTFSIVLSTCARETNVEFGRQI 186

Query: 102  HGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIG 161
            H  ++K G  + +N Y    LV  Y+KCD    A R+F  ++  N   W  +     + G
Sbjct: 187  HCSMIKMG--LERNSYCGGALVDMYAKCDRISDARRVFEWIVDPNTVCWTCLFSGYVKAG 246

Query: 162  FNEEALMGFCEMHENGLLLDN--FVIPI-------------------------------- 221
              EEA++ F  M + G   D+  FV  I                                
Sbjct: 247  LPEEAVLVFERMRDEGHRPDHLAFVTVINTYIRLGKLKDARLLFGEMSSPDVVAWNVMIS 306

Query: 222  --------------------------------ALKASGALQWIGFGKSVQGYVVKMGLGG 281
                                             L A G +  +  G  V    +K+GL  
Sbjct: 307  GHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIKLGLAS 366

Query: 282  CIYVASSLLDMYGKCGLCGDAKKVFDKIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMR 341
             IYV SSL+ MY KC     A KVF+ + EKN V WN+MI  +  NG + + +E F +M+
Sbjct: 367  NIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMELFMDMK 426

Query: 342  VEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTNILGSSLINFYSKVGLVE 401
              G      T +S LS  A    ++ G Q H++ +   L     +G++L++ Y+K G +E
Sbjct: 427  SSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAKCGALE 486

Query: 402  DAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQSENLRFDSVTLASIMAAAA 461
            DA Q+F  M ++D VTWN ++  YV +     A DL   M    +  D   LAS + A  
Sbjct: 487  DARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGACLASTLKACT 546

Query: 462  DSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNT 521
                L  GK+ H   V+  L+ D+   SS++DMY+KC  ++ AR+VF +  +  ++  N 
Sbjct: 547  HVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLPEWSVVSMNA 606

Query: 522  LLAAYAEQGQSGETLKLFYQMQLEGLPPNVISWNSVI----------LGLLNKGEVDE-- 581
            L+A Y+ Q    E + LF +M   G+ P+ I++ +++          LG    G++ +  
Sbjct: 607  LIAGYS-QNNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQFHGQITKRG 666

Query: 582  --AKDMFLEMQSLGVCPN-------------------LITWTTLICGLAQNGLGDEAFLT 641
              ++  +L +  LG+  N                   ++ WT ++ G +QNG  +EA   
Sbjct: 667  FSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWTGMMSGHSQNGFYEEALKF 726

Query: 642  FQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYI--IRHDLLVSTPVLCSLVNMY 701
            ++ M   G+ P+  +  ++L  C+ ++SL  GRAIH  I  + HDL   T    +L++MY
Sbjct: 727  YKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHLAHDLDELTS--NTLIDMY 786

Query: 702  AKCGSINQAKTVFD-MIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITF 761
            AKCG +  +  VFD M  +  +  +N++I+GYA +G A +AL +F  +++  I PDEITF
Sbjct: 787  AKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKIFDSMRQSHIMPDEITF 846

Query: 762  TSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMP 821
              +L+ACSHAG V++G ++F  M+  + I A+ +H  C++ +L R   L EA   I    
Sbjct: 847  LGVLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLGRWGYLQEADDFIEAQN 906

Query: 822  FEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASK 824
             +PDA ++ SLL ACR H D    E   E L++LEP NS  YV LSN YA+ G W++A+ 
Sbjct: 907  LKPDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVLLSNIYASQGCWEKANA 966

BLAST of Bhi02G000898 vs. ExPASy Swiss-Prot
Match: Q9FXH1 (Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana OX=3702 GN=DYW7 PE=2 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 1.4e-116
Identity = 256/833 (30.73%), Positives = 423/833 (50.78%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLN-----QISSLCK 60
           M  L  P+      +   P K  +SP    +      N+  + K   N     Q   LC+
Sbjct: 1   MEKLFVPSFPKTFLNYQTPAKVENSPE--LHPKSRKKNLSFTKKKEPNIIPDEQFDYLCR 60

Query: 61  EAHLREAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNE 120
              L EA   +  +  +   +    Y +LL+ C+   ++ LG+ +H R    G +   + 
Sbjct: 61  NGSLLEAEKALDSLFQQGSKVKRSTYLKLLESCIDSGSIHLGRILHARF---GLFTEPDV 120

Query: 121 YIETKLVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHEN 180
           ++ETKL+  Y+KC     A ++F  +  +N F W+A++G  SR     E    F  M ++
Sbjct: 121 FVETKLLSMYAKCGCIADARKVFDSMRERNLFTWSAMIGAYSRENRWREVAKLFRLMMKD 180

Query: 181 GLLLDNFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDA 240
           G+L D+F+ P  L+       +  GK +   V+K+G+  C+ V++S+L +Y KCG    A
Sbjct: 181 GVLPDDFLFPKILQGCANCGDVEAGKVIHSVVIKLGMSSCLRVSNSILAVYAKCGELDFA 240

Query: 241 KKVFDKIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANL 300
            K F ++ E++++AWNS+++ + QNG + EA+E   EM  EG+ P               
Sbjct: 241 TKFFRRMRERDVIAWNSVLLAYCQNGKHEEAVELVKEMEKEGISP--------------- 300

Query: 301 SVIDEGKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEM----LEKDMVTW 360
                           GL   NI    LI  Y+++G  + A  +  +M    +  D+ TW
Sbjct: 301 ----------------GLVTWNI----LIGGYNQLGKCDAAMDLMQKMETFGITADVFTW 360

Query: 361 NLLVSGYVHNGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVR 420
             ++SG +HNG+  +ALD+   M    +  ++VT+ S ++A +  K +  G E HS  V+
Sbjct: 361 TAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSIAVK 420

Query: 421 NNLESDIAVASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKL 480
                D+ V +S+VDMY+KC KLE AR+VFD+   +D+  WN+++  Y + G  G+  +L
Sbjct: 421 MGFIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCQAGYCGKAYEL 480

Query: 481 FYQMQLEGLPPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLG-VCPNLITWTTLICGLA 540
           F +MQ   L PN+I+WN++I G +  G+  EA D+F  M+  G V  N  TW  +I G  
Sbjct: 481 FTRMQDANLRPNIITWNTMISGYIKNGDEGEAMDLFQRMEKDGKVQRNTATWNLIIAGYI 540

Query: 541 QNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTP 600
           QNG  DEA   F+ M+ +   PNS++I SLL AC  +      R IH  ++R +L     
Sbjct: 541 QNGKKDEALELFRKMQFSRFMPNSVTILSLLPACANLLGAKMVREIHGCVLRRNLDAIHA 600

Query: 601 VLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDC 660
           V  +L + YAK G I  ++T+F  +  K++  +N++I GY LHG    AL+LF ++K   
Sbjct: 601 VKNALTDTYAKSGDIEYSRTIFLGMETKDIITWNSLIGGYVLHGSYGPALALFNQMKTQG 660

Query: 661 IKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEA 720
           I P+  T +SI+ A    G V EG ++F  + +++ I+   EH   ++ +  R + L+EA
Sbjct: 661 ITPNRGTLSSIILAHGLMGNVDEGKKVFYSIANDYHIIPALEHCSAMVYLYGRANRLEEA 720

Query: 721 LRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAAT 780
           L+ I  M  + +  I+ S L  CR H D+++     E+L  LEP+N+     +S  YA  
Sbjct: 721 LQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAENLFSLEPENTATESIVSQIYALG 780

Query: 781 GMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMML 824
                + +     ++  L+K  G S I++ N  H F  GD+S   T  +Y ++
Sbjct: 781 AKLGRSLEGNKPRRDNLLKKPLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPLV 793

BLAST of Bhi02G000898 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 3.1e-116
Identity = 247/788 (31.35%), Positives = 420/788 (53.30%), Query Frame = 0

Query: 84  LLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETKLVIFYSKCDESEIANRLFGKLLV 143
           LLQ C     L  G+Q+H  ++ N   I+ + Y + +++  Y+ C       ++F +L +
Sbjct: 41  LLQACSNPNLLRQGKQVHAFLIVNS--ISGDSYTDERILGMYAMCGSFSDCGKMFYRLDL 100

Query: 144 QNEF--AWAAIMGLKSRIGFNEEALMGFCEMHENGLLLDNFVIPIALKASGALQWIGFGK 203
           +      W +I+    R G   +AL  + +M   G+  D    P  +KA  AL+      
Sbjct: 101 RRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVSTFPCLVKACVALKNFKGID 160

Query: 204 SVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFDKIPEKNIVAWNSMIVNFTQNG 263
            +   V  +G+    +VASSL+  Y + G      K+FD++ +K+ V WN M+  + + G
Sbjct: 161 FLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCG 220

Query: 264 LNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDEGKQGHALAVLSGLELTNILGS 323
                I+ F  MR++ + P  VT    LS  A+  +ID G Q H L V+SG++    + +
Sbjct: 221 ALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKN 280

Query: 324 SLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVHNGLVDRALDLCHVMQSENLRF 383
           SL++ YSK G  +DA ++F  M   D VTWN ++SGYV +GL++ +L   + M S  +  
Sbjct: 281 SLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLP 340

Query: 384 DSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAVASSIVDMYAKCEKLECARQVF 443
           D++T +S++ + +  +NL+  K+ H + +R+++  DI + S+++D Y KC  +  A+ +F
Sbjct: 341 DAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIF 400

Query: 444 DTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGLPPNVISWNSV--ILGLL---- 503
                 D++++  +++ Y   G   ++L++F  +    + PN I+  S+  ++G+L    
Sbjct: 401 SQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALK 460

Query: 504 ----------NKGEVDEAK------DMFLEMQSLGVC---------PNLITWTTLICGLA 563
                      KG  +         DM+ +   + +           ++++W ++I   A
Sbjct: 461 LGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCA 520

Query: 564 QNGLGDEAFLTFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTP 623
           Q+     A   F+ M  +GI  + +SIS+ LSAC  + S   G+AIH ++I+H L     
Sbjct: 521 QSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVY 580

Query: 624 VLCSLVNMYAKCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDC 683
              +L++MYAKCG++  A  VF  + +K +  +N++I+    HG+  ++L LF  + E  
Sbjct: 581 SESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKS 640

Query: 684 -IKPDEITFTSILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDE 743
            I+PD+ITF  I+S+C H G V EG+  F  M  ++ I  Q EHY C++ +  R   L E
Sbjct: 641 GIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTE 700

Query: 744 ALRLILGMPFEPDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAA 803
           A   +  MPF PDA ++G+LL ACR H ++EL E     L+ L+P NSG YV +SNA+A 
Sbjct: 701 AYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHAN 760

Query: 804 TGMWDEASKVRVLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVE 838
              W+  +KVR LMKER ++K PG+S I+I   TH+F +GD +H  +  IY +L  L  E
Sbjct: 761 AREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGE 820

BLAST of Bhi02G000898 vs. NCBI nr
Match: XP_038880665.1 (pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Benincasa hispida] >XP_038880666.1 pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Benincasa hispida] >XP_038880667.1 pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Benincasa hispida] >XP_038880668.1 pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Benincasa hispida])

HSP 1 Score: 1671.0 bits (4326), Expect = 0.0e+00
Identity = 840/840 (100.00%), Positives = 840/840 (100.00%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR
Sbjct: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK
Sbjct: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD
Sbjct: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD
Sbjct: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE
Sbjct: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV
Sbjct: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL
Sbjct: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS
Sbjct: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE
Sbjct: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR
Sbjct: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
           VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS
Sbjct: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
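
In each three-line alignment row, the middle line repeats the residue letter where query and subject are identical and shows "+" where the substitution scores positively; blanks mark the remaining columns. Summing these marks over all rows of a hit gives the counts behind the identity and positive figures in its summary line. A minimal Python sketch of that tally, with illustrative function names (`count_midline`, `tally`) that are assumptions and not part of any BLAST library:

```python
def count_midline(midline):
    """Return (identities, positives) for one alignment midline.

    Letters mark identical columns; '+' marks conservative substitutions.
    Positives include the identities, as in the BLAST summary line.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, identities + midline.count("+")


def tally(midlines):
    """Sum identities and positives over all rows of one HSP."""
    total_identities = total_positives = 0
    for midline in midlines:
        identities, positives = count_midline(midline)
        total_identities += identities
        total_positives += positives
    return total_identities, total_positives


# Last row of the self-hit above: every column is an identity.
self_hit_row = "VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS"
print(count_midline(self_hit_row))
```

For a full hit, pass every midline of the alignment to `tally` and compare the result with the counts printed in the summary line. Leading padding does not matter, because only letters and "+" signs are counted.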

BLAST of Bhi02G000898 vs. NCBI nr
Match: XP_011656577.1 (pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis sativus] >XP_011656582.1 pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis sativus] >KGN65393.1 hypothetical protein Csa_019708 [Cucumis sativus])

HSP 1 Score: 1533.1 bits (3968), Expect = 0.0e+00
Identity = 766/840 (91.19%), Positives = 797/840 (94.88%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNP+ SLY PRKPH SPTHFA+ +Q A NVQISYKSYLN ISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPIYSLYTPRKPHYSPTHFASFSQIASNVQISYKSYLNHISSLCKQGHLL 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+ELE+ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGE IAKNEYIETK
Sbjct: 61  EALDLVTDLELEDITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGESIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESEIANRLFGKL VQNEF+WAAIMGLKSR+GFN+EALMGF EMHE GLLLD
Sbjct: 121 LVIFYSKCDESEIANRLFGKLQVQNEFSWAAIMGLKSRMGFNQEALMGFREMHEYGLLLD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIA KASGAL+WIGFGKSV  YVVKMGLGGCIYVA+SLLDMYGKCGLC +AKKVFD
Sbjct: 181 NFVIPIAFKASGALRWIGFGKSVHAYVVKMGLGGCIYVATSLLDMYGKCGLCEEAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KI EKNIVAWNSMIVNFTQNGLNAEA+ETFYEMRVEGV PTQVTLSSFLSASANLSVIDE
Sbjct: 241 KILEKNIVAWNSMIVNFTQNGLNAEAVETFYEMRVEGVAPTQVTLSSFLSASANLSVIDE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCEKLECAR+VFD T KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCEKLECARRVFDATAKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKG+VD+AKD F+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGKVDQAKDTFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI RH+L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRHELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQAVEALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAVEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSAC HAGLV EGLELFIDMVSNHKIVAQAEHYGCL+SILSR HNLDEALR+ILGMPFE
Sbjct: 661 ILSACGHAGLVREGLELFIDMVSNHKIVAQAEHYGCLVSILSRSHNLDEALRIILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLLAACREHPD ELKERLFE LLKLEPDNSGNYVALSNAYAATGMWDEASKVR
Sbjct: 721 PDAFIFGSLLAACREHPDFELKERLFERLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER L K PGHSLIQIGN+THVFFAGDKSHSRTKEIYMMLALLRVEMQ TRCI VIS
Sbjct: 781 GLMKERSLSKIPGHSLIQIGNKTHVFFAGDKSHSRTKEIYMMLALLRVEMQFTRCISVIS 840

BLAST of Bhi02G000898 vs. NCBI nr
Match: TYJ98107.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1511.5 bits (3912), Expect = 0.0e+00
Identity = 760/840 (90.48%), Positives = 791/840 (94.17%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNPL SLY PRK H+S T+FA+LNQ AGNVQISYKSYLNQISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPLPSLYTPRKLHNSSTYFASLNQIAGNVQISYKSYLNQISSLCKQGHLP 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+EL +ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK
Sbjct: 61  EALDLVTDLELADITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESE ANRLF KL VQNEF+WAAIMGLKSR+ FNEEALMGF EMHE GL+LD
Sbjct: 121 LVIFYSKCDESETANRLFDKLQVQNEFSWAAIMGLKSRMRFNEEALMGFREMHEYGLILD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASGAL+WIGFGKSV GYVVKMGLG CIYVASSLLDMYGKCGLCGDAKKVFD
Sbjct: 181 NFVIPIALKASGALRWIGFGKSVHGYVVKMGLGVCIYVASSLLDMYGKCGLCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIV+FTQNG NAEAIETFYEMRVEGV PTQVTLSSFLSASANL VI E
Sbjct: 241 KIPEKNIVAWNSMIVSFTQNGRNAEAIETFYEMRVEGVAPTQVTLSSFLSASANLGVIVE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVE+AE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVENAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCE LECAR+VF+  +KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCENLECARRVFNAMIKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKGEVD+AKDMF+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGEVDKAKDMFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI R +L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRRELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQA EALSLFRRLKE+ IKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAAEALSLFRRLKEEYIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLV EGLELFIDMVSNHKIVAQAEHYGCL+SILSR HNLDEALRLILGMPFE
Sbjct: 661 ILSACSHAGLVREGLELFIDMVSNHKIVAQAEHYGCLVSILSRSHNLDEALRLILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLL ACREHPD ELKE LFE LLKLEPDNSGNYVALSNAYAATGMWDEA KVR
Sbjct: 721 PDAFIFGSLLTACREHPDFELKEHLFERLLKLEPDNSGNYVALSNAYAATGMWDEALKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER LRK PGHSLIQIGN+THVFFAGDKSHSRTKEIYM LALLR+EMQSTRCI VIS
Sbjct: 781 GLMKERSLRKIPGHSLIQIGNKTHVFFAGDKSHSRTKEIYMTLALLRMEMQSTRCISVIS 840

BLAST of Bhi02G000898 vs. NCBI nr
Match: XP_008463338.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis melo] >XP_008463339.1 PREDICTED: pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis melo] >XP_008463340.1 PREDICTED: pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis melo] >XP_008463341.1 PREDICTED: pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis melo] >KAA0043370.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1506.5 bits (3899), Expect = 0.0e+00
Identity = 758/840 (90.24%), Positives = 790/840 (94.05%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNPL SLY  RK H+S T+FA+LNQ AGNVQISYKSYLNQISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPLPSLYTSRKLHNSSTYFASLNQIAGNVQISYKSYLNQISSLCKQGHLP 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+EL +ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK
Sbjct: 61  EALDLVTDLELADITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESE ANRLF KL VQNEF+WAAIMGLKSR+ FNEEALMGF EMHE GL+LD
Sbjct: 121 LVIFYSKCDESETANRLFDKLQVQNEFSWAAIMGLKSRMRFNEEALMGFREMHEYGLILD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASGAL+WIGFGKSV GYVVKMGLG CIYVASSLLDMYGKCGLCGDAKKVFD
Sbjct: 181 NFVIPIALKASGALRWIGFGKSVHGYVVKMGLGVCIYVASSLLDMYGKCGLCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIV+FTQNG NAEAIETFYEMRVEGV PTQVTLSSFLSASANL VI E
Sbjct: 241 KIPEKNIVAWNSMIVSFTQNGRNAEAIETFYEMRVEGVAPTQVTLSSFLSASANLGVIVE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVE+AE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVENAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCE LECAR+VF+  +KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCENLECARRVFNAMIKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKGEVD+AKDMF+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGEVDKAKDMFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI R +L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRRELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQA EALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAAEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLV EGLELFIDMVS HKIVAQAEHYGCL+SILSR HNLDEALRLILGMPFE
Sbjct: 661 ILSACSHAGLVREGLELFIDMVSFHKIVAQAEHYGCLVSILSRSHNLDEALRLILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLL ACREHPD ELKE LFE LLKLEPDNSGNYVALSNAYAATGMWDEA KVR
Sbjct: 721 PDAFIFGSLLTACREHPDFELKEHLFERLLKLEPDNSGNYVALSNAYAATGMWDEALKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER LRK PGHSLIQIGN+THVFFAGDKS+SRTKEIYM LALLR+EMQSTRCI VIS
Sbjct: 781 GLMKERSLRKIPGHSLIQIGNKTHVFFAGDKSNSRTKEIYMTLALLRMEMQSTRCISVIS 840

BLAST of Bhi02G000898 vs. NCBI nr
Match: XP_023531196.1 (pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1491.1 bits (3859), Expect = 0.0e+00
Identity = 745/840 (88.69%), Positives = 791/840 (94.17%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPF   T PLASLY  RK H+SPTH A LN++AGN QISYKSYLN+ISSLCKE  LR
Sbjct: 1   MAALPFVTPTYPLASLYSTRKLHNSPTHAAKLNESAGNFQISYKSYLNRISSLCKEGDLR 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
            AV+LV+++EL+ IT+GPDVYGELLQGCVYERALSLGQQIHGRILKNGE+IAKNEYIETK
Sbjct: 61  AAVDLVSNLELQGITVGPDVYGELLQGCVYERALSLGQQIHGRILKNGEFIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESEIANRLF KL VQNEF+WAAIMGLK RIGFNEEAL+ FC+MHENGL LD
Sbjct: 121 LVIFYSKCDESEIANRLFRKLRVQNEFSWAAIMGLKCRIGFNEEALLCFCDMHENGLFLD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASG+LQWIGFGK++ GY VKMGLGGCI+VASSLLDMYGKCG+CGDA+KVFD
Sbjct: 181 NFVIPIALKASGSLQWIGFGKAIHGYAVKMGLGGCIFVASSLLDMYGKCGVCGDARKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIVNFT NGL  EAIETFY+MRVEGV PTQVTLS+FLSASANLS+I+E
Sbjct: 241 KIPEKNIVAWNSMIVNFTHNGLYEEAIETFYDMRVEGVEPTQVTLSTFLSASANLSLINE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSK+GLVEDAE VFSEMLEKD+VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKIGLVEDAELVFSEMLEKDVVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LC VMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCRVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSIVD YAKC KLECAR+VF+ T+KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIVDTYAKCGKLECARRVFELTIKRDLIMWNTLLAAYAEQGWSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPN+ISWNSVILGLLNKGEV +AKDMFLEMQSLGVCPNL+TWTTLI GL+QNGLGDEAFL
Sbjct: 481 PPNLISWNSVILGLLNKGEVSKAKDMFLEMQSLGVCPNLVTWTTLISGLSQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSM+EAGIKPNSLSIS LLSACTTMASL HGRAIH YI R +L +STPVLCSLVNMYA
Sbjct: 541 TFQSMQEAGIKPNSLSISPLLSACTTMASLRHGRAIHGYITRRELSLSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK +FDMI+KKELPIYNAMISGYALHGQAVEALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRIFDMILKKELPIYNAMISGYALHGQAVEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCL+SILSRCHNLDEALRLIL MPFE
Sbjct: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLVSILSRCHNLDEALRLILAMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLLAACREHPD+ELKERL E LLKLEPDNSGNYVALSNAYAATGMWDEASKVR
Sbjct: 721 PDAFIFGSLLAACREHPDVELKERLSERLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKERGLRKTPGHSLIQIGN+THVFFAGDKSHS+TKEIY MLALL +EMQ TRCIPVIS
Sbjct: 781 DLMKERGLRKTPGHSLIQIGNKTHVFFAGDKSHSKTKEIYKMLALLGIEMQVTRCIPVIS 840

BLAST of Bhi02G000898 vs. ExPASy TrEMBL
Match: A0A0A0LUC4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G407170 PE=4 SV=1)

HSP 1 Score: 1533.1 bits (3968), Expect = 0.0e+00
Identity = 766/840 (91.19%), Positives = 797/840 (94.88%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNP+ SLY PRKPH SPTHFA+ +Q A NVQISYKSYLN ISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPIYSLYTPRKPHYSPTHFASFSQIASNVQISYKSYLNHISSLCKQGHLL 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+ELE+ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGE IAKNEYIETK
Sbjct: 61  EALDLVTDLELEDITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGESIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESEIANRLFGKL VQNEF+WAAIMGLKSR+GFN+EALMGF EMHE GLLLD
Sbjct: 121 LVIFYSKCDESEIANRLFGKLQVQNEFSWAAIMGLKSRMGFNQEALMGFREMHEYGLLLD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIA KASGAL+WIGFGKSV  YVVKMGLGGCIYVA+SLLDMYGKCGLC +AKKVFD
Sbjct: 181 NFVIPIAFKASGALRWIGFGKSVHAYVVKMGLGGCIYVATSLLDMYGKCGLCEEAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KI EKNIVAWNSMIVNFTQNGLNAEA+ETFYEMRVEGV PTQVTLSSFLSASANLSVIDE
Sbjct: 241 KILEKNIVAWNSMIVNFTQNGLNAEAVETFYEMRVEGVAPTQVTLSSFLSASANLSVIDE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCEKLECAR+VFD T KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCEKLECARRVFDATAKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKG+VD+AKD F+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGKVDQAKDTFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI RH+L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRHELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQAVEALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAVEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSAC HAGLV EGLELFIDMVSNHKIVAQAEHYGCL+SILSR HNLDEALR+ILGMPFE
Sbjct: 661 ILSACGHAGLVREGLELFIDMVSNHKIVAQAEHYGCLVSILSRSHNLDEALRIILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLLAACREHPD ELKERLFE LLKLEPDNSGNYVALSNAYAATGMWDEASKVR
Sbjct: 721 PDAFIFGSLLAACREHPDFELKERLFERLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER L K PGHSLIQIGN+THVFFAGDKSHSRTKEIYMMLALLRVEMQ TRCI VIS
Sbjct: 781 GLMKERSLSKIPGHSLIQIGNKTHVFFAGDKSHSRTKEIYMMLALLRVEMQFTRCISVIS 840
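
The Identity and Positives figures in an HSP header can be recomputed from the alignment midline, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch under that reading of the midline; the function names `midline_counts` and `percent` are hypothetical helpers introduced here, and the example block is residues 61 to 120 of the alignment against A0A0A0LUC4 above.

```python
def midline_counts(query, midline, subject):
    """Return (identities, positives, aligned_length) for one alignment block."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("query, midline and subject must have equal length")
    identities = 0
    positives = 0
    for q, m, s in zip(query, midline, subject):
        if m.isalpha():
            # A letter in the midline means query and subject share that residue.
            if not (q == m == s):
                raise ValueError("midline letter disagrees with the sequences")
            identities += 1
            positives += 1
        elif m == "+":
            # '+' marks a conservative (positively scoring) substitution.
            positives += 1
    return identities, positives, len(query)


def percent(part, whole):
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * part / whole, 2)


# Residues 61-120 of the query against A0A0A0LUC4, copied from the alignment.
QUERY = "EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK"
MIDLINE = "EA++LV D+ELE+ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGE IAKNEYIETK"
SUBJECT = "EALDLVTDLELEDITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGESIAKNEYIETK"

block = midline_counts(QUERY, MIDLINE, SUBJECT)
print(block)                # counts for this 60-residue block only
print(percent(766, 840))    # whole-HSP identity from the header counts
print(percent(797, 840))    # whole-HSP positives from the header counts
```

Summing the per-block counts over all fourteen blocks of an HSP should reproduce the header totals, provided trailing spaces in the midline were not lost when the page was copied.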

BLAST of Bhi02G000898 vs. ExPASy TrEMBL
Match: A0A5D3BG60 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold222G00190 PE=4 SV=1)

HSP 1 Score: 1511.5 bits (3912), Expect = 0.0e+00
Identity = 760/840 (90.48%), Positives = 791/840 (94.17%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNPL SLY PRK H+S T+FA+LNQ AGNVQISYKSYLNQISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPLPSLYTPRKLHNSSTYFASLNQIAGNVQISYKSYLNQISSLCKQGHLP 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+EL +ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK
Sbjct: 61  EALDLVTDLELADITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESE ANRLF KL VQNEF+WAAIMGLKSR+ FNEEALMGF EMHE GL+LD
Sbjct: 121 LVIFYSKCDESETANRLFDKLQVQNEFSWAAIMGLKSRMRFNEEALMGFREMHEYGLILD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASGAL+WIGFGKSV GYVVKMGLG CIYVASSLLDMYGKCGLCGDAKKVFD
Sbjct: 181 NFVIPIALKASGALRWIGFGKSVHGYVVKMGLGVCIYVASSLLDMYGKCGLCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIV+FTQNG NAEAIETFYEMRVEGV PTQVTLSSFLSASANL VI E
Sbjct: 241 KIPEKNIVAWNSMIVSFTQNGRNAEAIETFYEMRVEGVAPTQVTLSSFLSASANLGVIVE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVE+AE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVENAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCE LECAR+VF+  +KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCENLECARRVFNAMIKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKGEVD+AKDMF+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGEVDKAKDMFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI R +L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRRELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQA EALSLFRRLKE+ IKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAAEALSLFRRLKEEYIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLV EGLELFIDMVSNHKIVAQAEHYGCL+SILSR HNLDEALRLILGMPFE
Sbjct: 661 ILSACSHAGLVREGLELFIDMVSNHKIVAQAEHYGCLVSILSRSHNLDEALRLILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLL ACREHPD ELKE LFE LLKLEPDNSGNYVALSNAYAATGMWDEA KVR
Sbjct: 721 PDAFIFGSLLTACREHPDFELKEHLFERLLKLEPDNSGNYVALSNAYAATGMWDEALKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER LRK PGHSLIQIGN+THVFFAGDKSHSRTKEIYM LALLR+EMQSTRCI VIS
Sbjct: 781 GLMKERSLRKIPGHSLIQIGNKTHVFFAGDKSHSRTKEIYMTLALLRMEMQSTRCISVIS 840

BLAST of Bhi02G000898 vs. ExPASy TrEMBL
Match: A0A1S3CJ17 (pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103501523 PE=4 SV=1)

HSP 1 Score: 1506.5 bits (3899), Expect = 0.0e+00
Identity = 758/840 (90.24%), Positives = 790/840 (94.05%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNPL SLY  RK H+S T+FA+LNQ AGNVQISYKSYLNQISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPLPSLYTSRKLHNSSTYFASLNQIAGNVQISYKSYLNQISSLCKQGHLP 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+EL +ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK
Sbjct: 61  EALDLVTDLELADITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESE ANRLF KL VQNEF+WAAIMGLKSR+ FNEEALMGF EMHE GL+LD
Sbjct: 121 LVIFYSKCDESETANRLFDKLQVQNEFSWAAIMGLKSRMRFNEEALMGFREMHEYGLILD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASGAL+WIGFGKSV GYVVKMGLG CIYVASSLLDMYGKCGLCGDAKKVFD
Sbjct: 181 NFVIPIALKASGALRWIGFGKSVHGYVVKMGLGVCIYVASSLLDMYGKCGLCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIV+FTQNG NAEAIETFYEMRVEGV PTQVTLSSFLSASANL VI E
Sbjct: 241 KIPEKNIVAWNSMIVSFTQNGRNAEAIETFYEMRVEGVAPTQVTLSSFLSASANLGVIVE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVE+AE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVENAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCE LECAR+VF+  +KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCENLECARRVFNAMIKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKGEVD+AKDMF+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGEVDKAKDMFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI R +L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRRELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQA EALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAAEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLV EGLELFIDMVS HKIVAQAEHYGCL+SILSR HNLDEALRLILGMPFE
Sbjct: 661 ILSACSHAGLVREGLELFIDMVSFHKIVAQAEHYGCLVSILSRSHNLDEALRLILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLL ACREHPD ELKE LFE LLKLEPDNSGNYVALSNAYAATGMWDEA KVR
Sbjct: 721 PDAFIFGSLLTACREHPDFELKEHLFERLLKLEPDNSGNYVALSNAYAATGMWDEALKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER LRK PGHSLIQIGN+THVFFAGDKS+SRTKEIYM LALLR+EMQSTRCI VIS
Sbjct: 781 GLMKERSLRKIPGHSLIQIGNKTHVFFAGDKSNSRTKEIYMTLALLRMEMQSTRCISVIS 840

BLAST of Bhi02G000898 vs. ExPASy TrEMBL
Match: A0A5A7TJA9 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold588G00310 PE=4 SV=1)

HSP 1 Score: 1506.5 bits (3899), Expect = 0.0e+00
Identity = 758/840 (90.24%), Positives = 790/840 (94.05%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MAALPFP  TNPL SLY  RK H+S T+FA+LNQ AGNVQISYKSYLNQISSLCK+ HL 
Sbjct: 1   MAALPFPLPTNPLPSLYTSRKLHNSSTYFASLNQIAGNVQISYKSYLNQISSLCKQGHLP 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
           EA++LV D+EL +ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK
Sbjct: 61  EALDLVTDLELADITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESE ANRLF KL VQNEF+WAAIMGLKSR+ FNEEALMGF EMHE GL+LD
Sbjct: 121 LVIFYSKCDESETANRLFDKLQVQNEFSWAAIMGLKSRMRFNEEALMGFREMHEYGLILD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKASGAL+WIGFGKSV GYVVKMGLG CIYVASSLLDMYGKCGLCGDAKKVFD
Sbjct: 181 NFVIPIALKASGALRWIGFGKSVHGYVVKMGLGVCIYVASSLLDMYGKCGLCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIV+FTQNG NAEAIETFYEMRVEGV PTQVTLSSFLSASANL VI E
Sbjct: 241 KIPEKNIVAWNSMIVSFTQNGRNAEAIETFYEMRVEGVAPTQVTLSSFLSASANLGVIVE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSKVGLVE+AE VFSEMLEKD VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVENAELVFSEMLEKDTVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LCHVMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCHVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSI+DMYAKCE LECAR+VF+  +KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIIDMYAKCENLECARRVFNAMIKRDLIMWNTLLAAYAEQGHSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPNVISWNSVILGLLNKGEVD+AKDMF+EMQSLG+CPNLITWTTLICGLAQNGLGDEAFL
Sbjct: 481 PPNVISWNSVILGLLNKGEVDKAKDMFMEMQSLGICPNLITWTTLICGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQSMEEAGIKPNSLSISSLLSAC+TMASLPHGRAIHCYI R +L VSTPVLCSLVNMYA
Sbjct: 541 TFQSMEEAGIKPNSLSISSLLSACSTMASLPHGRAIHCYITRRELSVSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK VFDMI+KKELP+YNAMISGYALHGQA EALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRVFDMILKKELPVYNAMISGYALHGQAAEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           ILSACSHAGLV EGLELFIDMVS HKIVAQAEHYGCL+SILSR HNLDEALRLILGMPFE
Sbjct: 661 ILSACSHAGLVREGLELFIDMVSFHKIVAQAEHYGCLVSILSRSHNLDEALRLILGMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLL ACREHPD ELKE LFE LLKLEPDNSGNYVALSNAYAATGMWDEA KVR
Sbjct: 721 PDAFIFGSLLTACREHPDFELKEHLFERLLKLEPDNSGNYVALSNAYAATGMWDEALKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKER LRK PGHSLIQIGN+THVFFAGDKS+SRTKEIYM LALLR+EMQSTRCI VIS
Sbjct: 781 GLMKERSLRKIPGHSLIQIGNKTHVFFAGDKSNSRTKEIYMTLALLRMEMQSTRCISVIS 840

BLAST of Bhi02G000898 vs. ExPASy TrEMBL
Match: A0A6J1EZY3 (pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111441078 PE=4 SV=1)

HSP 1 Score: 1479.2 bits (3828), Expect = 0.0e+00
Identity = 740/840 (88.10%), Positives = 784/840 (93.33%), Query Frame = 0

Query: 1   MAALPFPALTNPLASLYIPRKPHSSPTHFANLNQTAGNVQISYKSYLNQISSLCKEAHLR 60
           MA+LPF   T PLA+LY  RK  +SPTH A LN++AGN QISYKSYLN+ISSLCKE  LR
Sbjct: 1   MASLPFVTPTYPLATLYSTRKLQNSPTHAAKLNESAGNFQISYKSYLNRISSLCKEGDLR 60

Query: 61  EAVNLVADMELENITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEYIAKNEYIETK 120
            AV+LV++ EL+ ITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGE+IAKNEYIETK
Sbjct: 61  AAVDLVSNFELQGITIGPDVYGELLQGCVYERALSLGQQIHGRILKNGEFIAKNEYIETK 120

Query: 121 LVIFYSKCDESEIANRLFGKLLVQNEFAWAAIMGLKSRIGFNEEALMGFCEMHENGLLLD 180
           LVIFYSKCDESEIANRLF KL VQNEF+WAAIMGLK RIGFNEEAL+  CEMHENGL LD
Sbjct: 121 LVIFYSKCDESEIANRLFRKLRVQNEFSWAAIMGLKCRIGFNEEALLCCCEMHENGLFLD 180

Query: 181 NFVIPIALKASGALQWIGFGKSVQGYVVKMGLGGCIYVASSLLDMYGKCGLCGDAKKVFD 240
           NFVIPIALKA+G+LQWIGFGK++ GY VKM LGGCI+VASSLLDMYGKCG+CGDAKKVFD
Sbjct: 181 NFVIPIALKAAGSLQWIGFGKAIHGYAVKMDLGGCIFVASSLLDMYGKCGVCGDAKKVFD 240

Query: 241 KIPEKNIVAWNSMIVNFTQNGLNAEAIETFYEMRVEGVVPTQVTLSSFLSASANLSVIDE 300
           KIPEKNIVAWNSMIVNFT NGL  EA+ETFY+MRVEGV PTQVTLSSFLSASANLS+I+E
Sbjct: 241 KIPEKNIVAWNSMIVNFTHNGLYEEAVETFYDMRVEGVEPTQVTLSSFLSASANLSLINE 300

Query: 301 GKQGHALAVLSGLELTNILGSSLINFYSKVGLVEDAEQVFSEMLEKDMVTWNLLVSGYVH 360
           GKQGHALAVLSGLELTNILGSSLINFYSK+GLVEDAE VFSEMLEKD+VTWNLLVSGYVH
Sbjct: 301 GKQGHALAVLSGLELTNILGSSLINFYSKIGLVEDAELVFSEMLEKDVVTWNLLVSGYVH 360

Query: 361 NGLVDRALDLCHVMQSENLRFDSVTLASIMAAAADSKNLKLGKEGHSFCVRNNLESDIAV 420
           NGLVDRAL LC VMQSENLRFDSVTLASIMAAAADS+NLKLGKEGHSFCVRNNLESD+AV
Sbjct: 361 NGLVDRALGLCRVMQSENLRFDSVTLASIMAAAADSRNLKLGKEGHSFCVRNNLESDVAV 420

Query: 421 ASSIVDMYAKCEKLECARQVFDTTVKRDLIMWNTLLAAYAEQGQSGETLKLFYQMQLEGL 480
           ASSIVD YAKC KLECAR+VFD  +KRDLIMWNTLLAAYAEQG SGETLKLFYQMQLEGL
Sbjct: 421 ASSIVDTYAKCGKLECARRVFDLAIKRDLIMWNTLLAAYAEQGWSGETLKLFYQMQLEGL 480

Query: 481 PPNVISWNSVILGLLNKGEVDEAKDMFLEMQSLGVCPNLITWTTLICGLAQNGLGDEAFL 540
           PPN+ISWNSVILGLLNKGEV +AKDMFLEMQSLGVCPNL+TWTTLI GLAQNGLGDEAFL
Sbjct: 481 PPNLISWNSVILGLLNKGEVSKAKDMFLEMQSLGVCPNLVTWTTLISGLAQNGLGDEAFL 540

Query: 541 TFQSMEEAGIKPNSLSISSLLSACTTMASLPHGRAIHCYIIRHDLLVSTPVLCSLVNMYA 600
           TFQ M+EAGIKPNSLSIS LLSACT MASL HGRAIH YI R +L +STPVLCSLVNMYA
Sbjct: 541 TFQLMQEAGIKPNSLSISPLLSACTAMASLRHGRAIHGYITRRELSLSTPVLCSLVNMYA 600

Query: 601 KCGSINQAKTVFDMIMKKELPIYNAMISGYALHGQAVEALSLFRRLKEDCIKPDEITFTS 660
           KCGSINQAK +FDMI+KKELPIYNAMISGYALHGQAVEALSLFRRLKE+CIKPDEITFTS
Sbjct: 601 KCGSINQAKRIFDMILKKELPIYNAMISGYALHGQAVEALSLFRRLKEECIKPDEITFTS 660

Query: 661 ILSACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLISILSRCHNLDEALRLILGMPFE 720
           I+SACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCL+SILSRCHNLDEALRL+L MPFE
Sbjct: 661 IISACSHAGLVTEGLELFIDMVSNHKIVAQAEHYGCLVSILSRCHNLDEALRLVLAMPFE 720

Query: 721 PDASIFGSLLAACREHPDLELKERLFEHLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780
           PDA IFGSLLAACREHPD+ELKERLFE LLKLEPDNSGNYVALSNAYAATGMWDEASKVR
Sbjct: 721 PDAFIFGSLLAACREHPDIELKERLFERLLKLEPDNSGNYVALSNAYAATGMWDEASKVR 780

Query: 781 VLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSRTKEIYMMLALLRVEMQSTRCIPVIS 840
            LMKERGLRKTPGHSLIQIGNETHVFFAGDKSHS+TKEIY MLALLR+EMQ TRCI V S
Sbjct: 781 DLMKERGLRKTPGHSLIQIGNETHVFFAGDKSHSKTKEIYKMLALLRIEMQVTRCIHVTS 840

The following BLAST results are available for this feature:

Match Name      E-value   Identity  Description
AT5G55740.1     4.8e-269  55.57     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G63370.1     3.3e-121  32.52     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09040.1     1.1e-119  31.30     Pentatricopeptide repeat (PPR) superfamily protein
AT1G19720.1     1.0e-117  30.73     Pentatricopeptide repeat (PPR-like) superfamily protein
AT4G21300.1     2.2e-117  31.35     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name      E-value   Identity  Description
Q9FM64          6.7e-268  55.57     Pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Arabidop...
Q9M1V3          4.7e-120  32.52     Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...
Q9SS83          1.5e-118  31.30     Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop...
Q9FXH1          1.4e-116  30.73     Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana OX...
Q9STE1          3.1e-116  31.35     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
XP_038880665.1  0.0e+00   100.00    pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Benincasa ...
XP_011656577.1  0.0e+00   91.19     pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucumis sa...
TYJ98107.1      0.0e+00   90.48     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_008463338.1  0.0e+00   90.24     PREDICTED: pentatricopeptide repeat-containing protein At5g55740, chloroplastic ...
XP_023531196.1  0.0e+00   88.69     pentatricopeptide repeat-containing protein At5g55740, chloroplastic [Cucurbita ...

Match Name      E-value   Identity  Description
A0A0A0LUC4      0.0e+00   91.19     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G407170 PE=4 SV=1
A0A5D3BG60      0.0e+00   90.48     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CJ17      0.0e+00   90.24     pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Cucumis ...
A0A5A7TJA9      0.0e+00   90.24     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1EZY3      0.0e+00   88.10     pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Cucurbit...

InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 210..299  e-value: 1.8E-16  score: 62.0
    coord: 300..407  e-value: 2.6E-19  score: 71.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 588..757  e-value: 6.4E-36  score: 126.3
    coord: 44..209  e-value: 1.5E-11  score: 46.2
    coord: 408..585  e-value: 3.6E-40  score: 140.2
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 457..777
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 248..278  e-value: 2.6E-4  score: 21.0
    coord: 321..347  e-value: 1.0E-4  score: 22.3
    coord: 594..618  e-value: 1.1  score: 9.7
    coord: 349..378  e-value: 5.5E-6  score: 26.3
    coord: 450..480  e-value: 6.4E-4  score: 19.8
    coord: 220..247  e-value: 0.0077  score: 16.4
    coord: 693..717  e-value: 0.77  score: 10.1
    coord: 760..788  e-value: 0.019  score: 15.2
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 622..666  e-value: 4.1E-9  score: 36.5
    coord: 482..530  e-value: 5.1E-13  score: 49.0
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 248..281  e-value: 0.0016  score: 16.5
    coord: 349..382  e-value: 1.1E-5  score: 23.3
    coord: 322..348  e-value: 4.8E-5  score: 21.3
    coord: 656..684  e-value: 0.0016  score: 16.5
    coord: 622..655  e-value: 4.0E-7  score: 27.8
    coord: 520..554  e-value: 1.6E-8  score: 32.2
    coord: 450..484  e-value: 2.5E-6  score: 25.3
    coord: 485..518  e-value: 1.3E-7  score: 29.3
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 246..280  score: 10.818861
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 756..790  score: 9.163705
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 448..482  score: 11.443655
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 483..517  score: 11.936913
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 654..684  score: 8.988323
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 619..653  score: 10.55579
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 347..381  score: 10.457138
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 518..552  score: 11.816339
None  No IPR available  PANTHER  PTHR24015  OS07G0578800 PROTEIN-RELATED
    coord: 357..539
    coord: 33..345
None  No IPR available  PANTHER  PTHR24015:SF1603  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 529..819
None  No IPR available  PANTHER  PTHR24015  OS07G0578800 PROTEIN-RELATED
    coord: 529..819
None  No IPR available  PANTHER  PTHR24015:SF1603  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 357..539
    coord: 33..345
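
The eight PROSITE PS51375 hits in the InterPro table are listed out of positional order, which hides how the PPR motifs are arranged along the protein. The sketch below sorts them and groups motifs that follow one another without a gap. It treats two motifs as tandem when the next one starts exactly one residue after the previous one ends, which is an assumption made for illustration rather than a definition taken from the database; `parse_coord` and `tandem_runs` are hypothetical helper names.

```python
import re

# PROSITE PS51375 (PPR) coordinates, copied in the order the table lists them.
PROSITE_PPR_COORDS = [
    "coord: 246..280", "coord: 756..790", "coord: 448..482", "coord: 483..517",
    "coord: 654..684", "coord: 619..653", "coord: 347..381", "coord: 518..552",
]


def parse_coord(text):
    """Turn a 'coord: start..end' string into a (start, end) tuple of ints."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def tandem_runs(spans):
    """Group sorted spans into runs where each span starts right after the last."""
    runs = []
    for start, end in sorted(spans):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))   # directly adjacent: extend the run
        else:
            runs.append([(start, end)])     # gap before this motif: new run
    return runs


spans = [parse_coord(c) for c in PROSITE_PPR_COORDS]
runs = tandem_runs(spans)
for run in runs:
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} motif(s)")
```

The same grouping could be applied to the PFAM or TIGRFAM hits, although their boundaries differ slightly from the PROSITE ones, so the runs would not necessarily match.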

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi02M000898  Bhi02M000898  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0005515      protein binding