Bhi01G001040 (gene) Wax gourd (B227) v1

Overview
NameBhi01G001040
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationchr1: 28934351 .. 28937002 (-)
RNA-Seq ExpressionBhi01G001040
SyntenyBhi01G001040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTATGTAGAGCAGTCCCCAAACTTCTCAACAGAAATGAACTCTATCGTCTGGAAGAAAGCTGCTCTCAGCTTATTTCAATTTGCAATTCCAAATCCTTAAAAGAGGGCGTTTGTGTTCATAGCCCGATTATCAAGCTCGGTCTTCATGGTAATCTGTATCTGAGCAATAATTTACTAGCTCTTTATGCTAAACGATTTGGACTCAAACAGGCACGTAACCTGTTCGATGAAATGCCTGATAGAGATGTGGTGTCCTGGACCACGATGCAGGCTGCTTATGTCAGGAACAGAAGCTACATTGAGGCTTTTGAATTGTTTGATTTGATGCTAACATTGGGTCACTGTCCAAATGAGTTTACACTTTCGAATTTGATCCGATCGTGCTCTGAAACGGGAGAACTGGAGCTTGGAAGTTGTGTCCATGGCTATGTTATAAAGGGTGGCTTTGAGACGAAGCCAGTGCTGGGATGCACCTTGATTAATCTTTATGCAAAGTGTGATTTCTCTGAGGAAGCTTATGAAGTTTTCAGAAATATGGACGATGTCGATACTGTTACTTGGACCGTGATGATTTCTTCACTAGTGCAAGCACAGAAATGGGATGAGGCTCTTCAGTTATATATCACTATGATGAATTCTGGGGTCACTCCTAATGAGTTCACTTTTACAAAACTTTTAGCCACAACCAATTTTCTGGGTTTGAAATATGGGAAGTTACTCCATTGTCACATGATAACATTGGGAGTCAATCTGAACGTAGTTCTAAAGACGGCGCTCGTCGATGTGTATTCAAGATACCAAGAGTTAGAAGATGCAATGAAGGTTGCAAATCAAACACCTGAGAAAGACGTGTTTTTGTGGACCTCTATTATCTCCTTCTTCAATCAGAATTTGAAGGTCAAGGAGGCTATTGCTGCATTCCAAGAGATGAGAATGTCTGGAATTTTACCAAACAGTTTCACATATTCCAGTGCGTTAAGTGCCTGCACATCGATCCCGTCGCTTAAATTAGGTAAGCAAATTCACTTGCAGGTAATATTGGCTGGGTTGGAGGCTGACGTTTGTGCTGGGAGTGCACTAATTAATATGTACATGAAATGTTCTAACTTCATAGATGATGCCTTGAGAGTGTTTAGGACAATAACTTCCCCAAGTGTTATTTGTTGGACTTCTTTAATATCTGGTCTTGCCGAGCATGGTTGTGAACAAGATTGTTATAGATATTTTTTGGATATGCAAGCAGCAGGAGTGCAGCCAAATGCCTTTACTCTTTCTAGTATCCTTGGGGCCAGCAGTTCAGCGAAATCACAAAATCAAACATCGATGTTCCATGGATATATACTAAAAATGAGGGCTCACCATGATATTGTTGTTGGAAATGCTCTTGTGGATGCTTATGCTCGATCTGCAAAGGTGGATGATGCTTGCCGAGTGATTAGCACCATGAATCATCGGGATGCCATCACTTATACTAGCTTAGCCACGAGATTGAATCAGATGGGTGATCATGAAATGGCACTAAAAATCATTGATTCCATGCGTGCTGACAATGTTGAGATGGATGAAATTAGCTTGACAAGTTTGGTATCTGCATTGACAGGCCTAGGTATAGTTGAAACCGGGAAACAACTTCATTGCTATTCTTTGAAGTATGGCTTAGACAACACCTGCTCAGTAAAAAATAGTTTGATGGACTTATATGGCAAGGTTGGATGCTTGAAGGATGCCAATAAAGTTTTTGAAGAAATAAGCAAACCAGACGTCGTTTCTTGGAATGGAATGATATCTATATTAGCATTCAACGGGCATATCTCCTCTGCTCTTGCTGCCTTTGACAATATGAGATTAGCTGGCCTAGAGCCCGATTCAATCACATTCCTATCAATACTTTCAGCTTGCAGTCAAGGTGGTTTGGTTGATTTTGGAATGCACTACTTTCATTCTATGAAAGCAACCCATAAAATAGAGCCAGAATTGGATCATTATGCTTGTATAATTGATCTCCTAGGCCGCGTTGGACAACTAGAGAACGCAATGGAAATCGTAGAATCCATGCCATATGAGGCAGATGCTAAAATCTACAAGACATTGTTGAAAGCCTGCAATTTCCATGGGAACATGCTGCTTGGAGAAGATGTGGCAATAAGAGGACTTCAACTTAACCCAAACGATTCATCTTTCTATTTGCTGCTGGCCAACTTGTACGATGGATACAACCGACAAGATTTAAGTGCAAAAACTCGTAAGCTGATGCGAGATCGTGGAGTGAGGAAGAGTCCTGGCCAAAGTTGGATAGAATTACATAGCAAGATTCATCTCTTTGTCACAGGAGAGAGAACACATCCTCAAATCAATGACATCCAAGAAAAGTTAGAATTCCTCAGAGCTGAGTTCAAGAGTAGGGGGTTTATGTATCATGAAGATGAAAATTCATCCCATCATAGTGAAAAATTGGCTCTTGCATTTGGTCTTGTTAATTTGCCACCCACAGCTGTTGTACGAATAATGAAGAACATAAGCATTTGCAGAGAATGCCATGACTTCATATTGCTAGTAACAAAGGTGGTAGAGAGGGAAATAATTGTGAGAGATGGGCGCGGGCTCCATGTGCTAAAAAATGGAAGCTGCTCTTGCAGCCATTACTCATGA

mRNA sequence

ATGCTATGTAGAGCAGTCCCCAAACTTCTCAACAGAAATGAACTCTATCGTCTGGAAGAAAGCTGCTCTCAGCTTATTTCAATTTGCAATTCCAAATCCTTAAAAGAGGGCGTTTGTGTTCATAGCCCGATTATCAAGCTCGGTCTTCATGGTAATCTGTATCTGAGCAATAATTTACTAGCTCTTTATGCTAAACGATTTGGACTCAAACAGGCACGTAACCTGTTCGATGAAATGCCTGATAGAGATGTGGTGTCCTGGACCACGATGCAGGCTGCTTATGTCAGGAACAGAAGCTACATTGAGGCTTTTGAATTGTTTGATTTGATGCTAACATTGGGTCACTGTCCAAATGAGTTTACACTTTCGAATTTGATCCGATCGTGCTCTGAAACGGGAGAACTGGAGCTTGGAAGTTGTGTCCATGGCTATGTTATAAAGGGTGGCTTTGAGACGAAGCCAGTGCTGGGATGCACCTTGATTAATCTTTATGCAAAGTGTGATTTCTCTGAGGAAGCTTATGAAGTTTTCAGAAATATGGACGATGTCGATACTGTTACTTGGACCGTGATGATTTCTTCACTAGTGCAAGCACAGAAATGGGATGAGGCTCTTCAGTTATATATCACTATGATGAATTCTGGGGTCACTCCTAATGAGTTCACTTTTACAAAACTTTTAGCCACAACCAATTTTCTGGGTTTGAAATATGGGAAGTTACTCCATTGTCACATGATAACATTGGGAGTCAATCTGAACGTAGTTCTAAAGACGGCGCTCGTCGATGTGTATTCAAGATACCAAGAGTTAGAAGATGCAATGAAGGTTGCAAATCAAACACCTGAGAAAGACGTGTTTTTGTGGACCTCTATTATCTCCTTCTTCAATCAGAATTTGAAGGTCAAGGAGGCTATTGCTGCATTCCAAGAGATGAGAATGTCTGGAATTTTACCAAACAGTTTCACATATTCCAGTGCGTTAAGTGCCTGCACATCGATCCCGTCGCTTAAATTAGGTAAGCAAATTCACTTGCAGGTAATATTGGCTGGGTTGGAGGCTGACGTTTGTGCTGGGAGTGCACTAATTAATATGTACATGAAATGTTCTAACTTCATAGATGATGCCTTGAGAGTGTTTAGGACAATAACTTCCCCAAGTGTTATTTGTTGGACTTCTTTAATATCTGGTCTTGCCGAGCATGGTTGTGAACAAGATTGTTATAGATATTTTTTGGATATGCAAGCAGCAGGAGTGCAGCCAAATGCCTTTACTCTTTCTAGTATCCTTGGGGCCAGCAGTTCAGCGAAATCACAAAATCAAACATCGATGTTCCATGGATATATACTAAAAATGAGGGCTCACCATGATATTGTTGTTGGAAATGCTCTTGTGGATGCTTATGCTCGATCTGCAAAGGTGGATGATGCTTGCCGAGTGATTAGCACCATGAATCATCGGGATGCCATCACTTATACTAGCTTAGCCACGAGATTGAATCAGATGGGTGATCATGAAATGGCACTAAAAATCATTGATTCCATGCGTGCTGACAATGTTGAGATGGATGAAATTAGCTTGACAAGTTTGGTATCTGCATTGACAGGCCTAGGTATAGTTGAAACCGGGAAACAACTTCATTGCTATTCTTTGAAGTATGGCTTAGACAACACCTGCTCAGTAAAAAATAGTTTGATGGACTTATATGGCAAGGTTGGATGCTTGAAGGATGCCAATAAAGTTTTTGAAGAAATAAGCAAACCAGACGTCGTTTCTTGGAATGGAATGATATCTATATTAGCATTCAACGGGCATATCTCCTCTGCTCTTGCTGCCTTTGACAATATGAGATTAGCTGGCCTAGAGCCCGATTCAATCACATTCCTATCAATACTTTCAGCTTGCAGTCAAGGTGGTTTGGTTGATTTTGGAATGCACTACTTTCATTCTATGAAAGCAACCCATAAAATAGAGCCAGAATTGGATCATTATGCTTGTATAATTGATCTCCTAGGCCGCGTTGGACAACTAGAGAACGCAATGGAAATCGTAGAATCCATGCCATATGAGGCAGATGCTAAAATCTACAAGACATTGTTGAAAGCCTGCAATTTCCATGGGAACATGCTGCTTGGAGAAGATGTGGCAATAAGAGGACTTCAACTTAACCCAAACGATTCATCTTTCTATTTGCTGCTGGCCAACTTGTACGATGGATACAACCGACAAGATTTAAGTGCAAAAACTCGTAAGCTGATGCGAGATCGTGGAGTGAGGAAGAGTCCTGGCCAAAGTTGGATAGAATTACATAGCAAGATTCATCTCTTTGTCACAGGAGAGAGAACACATCCTCAAATCAATGACATCCAAGAAAAGTTAGAATTCCTCAGAGCTGAGTTCAAGAGTAGGGGGTTTATGTATCATGAAGATGAAAATTCATCCCATCATAGTGAAAAATTGGCTCTTGCATTTGGTCTTGTTAATTTGCCACCCACAGCTGTTGTACGAATAATGAAGAACATAAGCATTTGCAGAGAATGCCATGACTTCATATTGCTAGTAACAAAGGTGGTAGAGAGGGAAATAATTGTGAGAGATGGGCGCGGGCTCCATGTGCTAAAAAATGGAAGCTGCTCTTGCAGCCATTACTCATGA

Coding sequence (CDS)

ATGCTATGTAGAGCAGTCCCCAAACTTCTCAACAGAAATGAACTCTATCGTCTGGAAGAAAGCTGCTCTCAGCTTATTTCAATTTGCAATTCCAAATCCTTAAAAGAGGGCGTTTGTGTTCATAGCCCGATTATCAAGCTCGGTCTTCATGGTAATCTGTATCTGAGCAATAATTTACTAGCTCTTTATGCTAAACGATTTGGACTCAAACAGGCACGTAACCTGTTCGATGAAATGCCTGATAGAGATGTGGTGTCCTGGACCACGATGCAGGCTGCTTATGTCAGGAACAGAAGCTACATTGAGGCTTTTGAATTGTTTGATTTGATGCTAACATTGGGTCACTGTCCAAATGAGTTTACACTTTCGAATTTGATCCGATCGTGCTCTGAAACGGGAGAACTGGAGCTTGGAAGTTGTGTCCATGGCTATGTTATAAAGGGTGGCTTTGAGACGAAGCCAGTGCTGGGATGCACCTTGATTAATCTTTATGCAAAGTGTGATTTCTCTGAGGAAGCTTATGAAGTTTTCAGAAATATGGACGATGTCGATACTGTTACTTGGACCGTGATGATTTCTTCACTAGTGCAAGCACAGAAATGGGATGAGGCTCTTCAGTTATATATCACTATGATGAATTCTGGGGTCACTCCTAATGAGTTCACTTTTACAAAACTTTTAGCCACAACCAATTTTCTGGGTTTGAAATATGGGAAGTTACTCCATTGTCACATGATAACATTGGGAGTCAATCTGAACGTAGTTCTAAAGACGGCGCTCGTCGATGTGTATTCAAGATACCAAGAGTTAGAAGATGCAATGAAGGTTGCAAATCAAACACCTGAGAAAGACGTGTTTTTGTGGACCTCTATTATCTCCTTCTTCAATCAGAATTTGAAGGTCAAGGAGGCTATTGCTGCATTCCAAGAGATGAGAATGTCTGGAATTTTACCAAACAGTTTCACATATTCCAGTGCGTTAAGTGCCTGCACATCGATCCCGTCGCTTAAATTAGGTAAGCAAATTCACTTGCAGGTAATATTGGCTGGGTTGGAGGCTGACGTTTGTGCTGGGAGTGCACTAATTAATATGTACATGAAATGTTCTAACTTCATAGATGATGCCTTGAGAGTGTTTAGGACAATAACTTCCCCAAGTGTTATTTGTTGGACTTCTTTAATATCTGGTCTTGCCGAGCATGGTTGTGAACAAGATTGTTATAGATATTTTTTGGATATGCAAGCAGCAGGAGTGCAGCCAAATGCCTTTACTCTTTCTAGTATCCTTGGGGCCAGCAGTTCAGCGAAATCACAAAATCAAACATCGATGTTCCATGGATATATACTAAAAATGAGGGCTCACCATGATATTGTTGTTGGAAATGCTCTTGTGGATGCTTATGCTCGATCTGCAAAGGTGGATGATGCTTGCCGAGTGATTAGCACCATGAATCATCGGGATGCCATCACTTATACTAGCTTAGCCACGAGATTGAATCAGATGGGTGATCATGAAATGGCACTAAAAATCATTGATTCCATGCGTGCTGACAATGTTGAGATGGATGAAATTAGCTTGACAAGTTTGGTATCTGCATTGACAGGCCTAGGTATAGTTGAAACCGGGAAACAACTTCATTGCTATTCTTTGAAGTATGGCTTAGACAACACCTGCTCAGTAAAAAATAGTTTGATGGACTTATATGGCAAGGTTGGATGCTTGAAGGATGCCAATAAAGTTTTTGAAGAAATAAGCAAACCAGACGTCGTTTCTTGGAATGGAATGATATCTATATTAGCATTCAACGGGCATATCTCCTCTGCTCTTGCTGCCTTTGACAATATGAGATTAGCTGGCCTAGAGCCCGATTCAATCACATTCCTATCAATACTTTCAGCTTGCAGTCAAGGTGGTTTGGTTGATTTTGGAATGCACTACTTTCATTCTATGAAAGCAACCCATAAAATAGAGCCAGAATTGGATCATTATGCTTGTATAATTGATCTCCTAGGCCGCGTTGGACAACTAGAGAACGCAATGGAAATCGTAGAATCCATGCCATATGAGGCAGATGCTAAAATCTACAAGACATTGTTGAAAGCCTGCAATTTCCATGGGAACATGCTGCTTGGAGAAGATGTGGCAATAAGAGGACTTCAACTTAACCCAAACGATTCATCTTTCTATTTGCTGCTGGCCAACTTGTACGATGGATACAACCGACAAGATTTAAGTGCAAAAACTCGTAAGCTGATGCGAGATCGTGGAGTGAGGAAGAGTCCTGGCCAAAGTTGGATAGAATTACATAGCAAGATTCATCTCTTTGTCACAGGAGAGAGAACACATCCTCAAATCAATGACATCCAAGAAAAGTTAGAATTCCTCAGAGCTGAGTTCAAGAGTAGGGGGTTTATGTATCATGAAGATGAAAATTCATCCCATCATAGTGAAAAATTGGCTCTTGCATTTGGTCTTGTTAATTTGCCACCCACAGCTGTTGTACGAATAATGAAGAACATAAGCATTTGCAGAGAATGCCATGACTTCATATTGCTAGTAACAAAGGTGGTAGAGAGGGAAATAATTGTGAGAGATGGGCGCGGGCTCCATGTGCTAAAAAATGGAAGCTGCTCTTGCAGCCATTACTCATGA

Protein sequence

MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS
Homology
BLAST of Bhi01G001040 vs. TAIR 10
Match: AT5G52850.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 865.1 bits (2234), Expect = 4.7e-251
Identity = 425/872 (48.74%), Postives = 598/872 (68.58%), Query Frame = 0

Query: 9   LLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFG 68
           L   NEL  L++SC +++S C S S + G+ +H P+IK GL  NL L NNLL+LY K  G
Sbjct: 14  LSRTNELGNLQKSCIRILSFCESNSSRIGLHIHCPVIKFGLLENLDLCNNLLSLYLKTDG 73

Query: 69  LKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRS 128
           +  AR LFDEM  R V +WT M +A+ +++ +  A  LF+ M+  G  PNEFT S+++RS
Sbjct: 74  IWNARKLFDEMSHRTVFAWTVMISAFTKSQEFASALSLFEEMMASGTHPNEFTFSSVVRS 133

Query: 129 CSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTW 188
           C+   ++  G  VHG VIK GFE   V+G +L +LY+KC   +EA E+F ++ + DT++W
Sbjct: 134 CAGLRDISYGGRVHGSVIKTGFEGNSVVGSSLSDLYSKCGQFKEACELFSSLQNADTISW 193

Query: 189 TVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITL 248
           T+MISSLV A+KW EALQ Y  M+ +GV PNEFTF KLL  ++FLGL++GK +H ++I  
Sbjct: 194 TMMISSLVGARKWREALQFYSEMVKAGVPPNEFTFVKLLGASSFLGLEFGKTIHSNIIVR 253

Query: 249 GVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAF 308
           G+ LNVVLKT+LVD YS++ ++EDA++V N + E+DVFLWTS++S F +NL+ KEA+  F
Sbjct: 254 GIPLNVVLKTSLVDFYSQFSKMEDAVRVLNSSGEQDVFLWTSVVSGFVRNLRAKEAVGTF 313

Query: 309 QEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKC 368
            EMR  G+ PN+FTYS+ LS C+++ SL  GKQIH Q I  G E     G+AL++MYMKC
Sbjct: 314 LEMRSLGLQPNNFTYSAILSLCSAVRSLDFGKQIHSQTIKVGFEDSTDVGNALVDMYMKC 373

Query: 369 SNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSI 428
           S    +A RVF  + SP+V+ WT+LI GL +HG  QDC+   ++M    V+PN  TLS +
Sbjct: 374 SASEVEASRVFGAMVSPNVVSWTTLILGLVDHGFVQDCFGLLMEMVKREVEPNVVTLSGV 433

Query: 429 LGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDA 488
           L A S  +   +    H Y+L+     ++VVGN+LVDAYA S KVD A  VI +M  RD 
Sbjct: 434 LRACSKLRHVRRVLEIHAYLLRRHVDGEMVVGNSLVDAYASSRKVDYAWNVIRSMKRRDN 493

Query: 489 ITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCY 548
           ITYTSL TR N++G HEMAL +I+ M  D + MD++SL   +SA   LG +ETGK LHCY
Sbjct: 494 ITYTSLVTRFNELGKHEMALSVINYMYGDGIRMDQLSLPGFISASANLGALETGKHLHCY 553

Query: 549 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 608
           S+K G     SV NSL+D+Y K G L+DA KVFEEI+ PDVVSWNG++S LA NG ISSA
Sbjct: 554 SVKSGFSGAASVLNSLVDMYSKCGSLEDAKKVFEEIATPDVVSWNGLVSGLASNGFISSA 613

Query: 609 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 668
           L+AF+ MR+   EPDS+TFL +LSACS G L D G+ YF  MK  + IEP+++HY  ++ 
Sbjct: 614 LSAFEEMRMKETEPDSVTFLILLSACSNGRLTDLGLEYFQVMKKIYNIEPQVEHYVHLVG 673

Query: 669 LLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSF 728
           +LGR G+LE A  +VE+M  + +A I+KTLL+AC + GN+ LGED+A +GL L P+D + 
Sbjct: 674 ILGRAGRLEEATGVVETMHLKPNAMIFKTLLRACRYRGNLSLGEDMANKGLALAPSDPAL 733

Query: 729 YLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTH-PQIND 788
           Y+LLA+LYD   + +L+ KTR LM ++ + K  G+S +E+  K+H FV+ + T   + N 
Sbjct: 734 YILLADLYDESGKPELAQKTRNLMTEKRLSKKLGKSTVEVQGKVHSFVSEDVTRVDKTNG 793

Query: 789 IQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECH 848
           I  ++E ++ E K  G  Y  +EN+S HS K A+ +G +   P A V ++KN  +C++CH
Sbjct: 794 IYAEIESIKEEIKRFGSPYRGNENASFHSAKQAVVYGFIYASPEAPVHVVKNKILCKDCH 853

Query: 849 DFILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           +F+ ++T++V+++I VRDG  +H+ KNG CSC
Sbjct: 854 EFVSILTRLVDKKITVRDGNQVHIFKNGECSC 885

BLAST of Bhi01G001040 vs. TAIR 10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 501.5 bits (1290), Expect = 1.4e-141
Identity = 279/879 (31.74%), Postives = 467/879 (53.13%), Query Frame = 0

Query: 19   EESCSQLISICNSKSLKEGVC--VHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLF 78
            E + S ++  C   S+   V   +H+ I+  GL  +  + N L+ LY++   +  AR +F
Sbjct: 186  EGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVF 245

Query: 79   DEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELE 138
            D +  +D  SW  M +   +N    EA  LF  M  LG  P  +  S+++ +C +   LE
Sbjct: 246  DGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLE 305

Query: 139  LGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLV 198
            +G  +HG V+K GF +   +   L++LY        A  +F NM   D VT+  +I+ L 
Sbjct: 306  IGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLS 365

Query: 199  QAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKY-GKLLHCHMITLGVNLNVV 258
            Q    ++A++L+  M   G+ P+  T   L+   +  G  + G+ LH +   LG   N  
Sbjct: 366  QCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNK 425

Query: 259  LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 318
            ++ AL+++Y++  ++E A+    +T  ++V LW  ++  +     ++ +   F++M++  
Sbjct: 426  IEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEE 485

Query: 319  ILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDA 378
            I+PN +TY S L  C  +  L+LG+QIH Q+I    + +    S LI+MY K    +D A
Sbjct: 486  IVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGK-LDTA 545

Query: 379  LRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSA 438
              +        V+ WT++I+G  ++  +      F  M   G++ +   L++ + A +  
Sbjct: 546  WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 605

Query: 439  KSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLA 498
            ++  +    H          D+   NALV  Y+R  K++++          D I + +L 
Sbjct: 606  QALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALV 665

Query: 499  TRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLD 558
            +   Q G++E AL++   M  + ++ +  +  S V A +    ++ GKQ+H    K G D
Sbjct: 666  SGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYD 725

Query: 559  NTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNM 618
            +   V N+L+ +Y K G + DA K F E+S  + VSWN +I+  + +G  S AL +FD M
Sbjct: 726  SETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQM 785

Query: 619  RLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQ 678
              + + P+ +T + +LSACS  GLVD G+ YF SM + + + P+ +HY C++D+L R G 
Sbjct: 786  IHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGL 845

Query: 679  LENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANL 738
            L  A E ++ MP + DA +++TLL AC  H NM +GE  A   L+L P DS+ Y+LL+NL
Sbjct: 846  LSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNL 905

Query: 739  YDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFL 798
            Y    + D    TR+ M+++GV+K PGQSWIE+ + IH F  G++ HP  ++I E  + L
Sbjct: 906  YAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDL 965

Query: 799  RAEFKSRGF----------MYHEDENS--SHHSEKLALAFGLVNLPPTAVVRIMKNISIC 858
                   G+          + HE ++     HSEKLA++FGL++LP T  + +MKN+ +C
Sbjct: 966  TKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVC 1025

Query: 859  RECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
             +CH +I  V+KV  REIIVRD    H  + G+CSC  Y
Sbjct: 1026 NDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDY 1063

BLAST of Bhi01G001040 vs. TAIR 10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 488.8 bits (1257), Expect = 9.2e-138
Identity = 266/827 (32.16%), Postives = 460/827 (55.62%), Query Frame = 0

Query: 87  WTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYVI 146
           W  +  + VR+    EA   +  M+ LG  P+ +    L+++ ++  ++ELG  +H +V 
Sbjct: 65  WIDLLRSKVRSNLLREAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVY 124

Query: 147 KGGFETKPV-LGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           K G+    V +  TL+NLY KC      Y+VF  + + + V+W  +ISSL   +KW+ AL
Sbjct: 125 KFGYGVDSVTVANTLVNLYRKCGDFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMAL 184

Query: 207 QLYITMMNSGVTPNEFTFTKLLATTNFL----GLKYGKLLHCHMITLGVNLNVVLKTALV 266
           + +  M++  V P+ FT   ++   + L    GL  GK +H + +  G  LN  +   LV
Sbjct: 185 EAFRCMLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKG-ELNSFIINTLV 244

Query: 267 DVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSF 326
            +Y +  +L  +  +      +D+  W +++S   QN ++ EA+   +EM + G+ P+ F
Sbjct: 245 AMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEF 304

Query: 327 TYSSALSACTSIPSLKLGKQIHLQVILAG-LEADVCAGSALINMYMKCSNFIDDALRVFR 386
           T SS L AC+ +  L+ GK++H   +  G L+ +   GSAL++MY  C   +    RVF 
Sbjct: 305 TISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVL-SGRRVFD 364

Query: 387 TITSPSVICWTSLISGLAEHGCEQDCYRYFLDM-QAAGVQPNAFTLSSILGASSSAKSQN 446
            +    +  W ++I+G +++  +++    F+ M ++AG+  N+ T++ ++ A   + + +
Sbjct: 365 GMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFS 424

Query: 447 QTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLN 506
           +    HG+++K     D  V N L+D Y+R  K+D A R+   M  RD +T+ ++ T   
Sbjct: 425 RKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYV 484

Query: 507 QMGDHEMALKIIDSMR---------ADNVEM--DEISLTSLVSALTGLGIVETGKQLHCY 566
               HE AL ++  M+         A  V +  + I+L +++ +   L  +  GK++H Y
Sbjct: 485 FSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAY 544

Query: 567 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 626
           ++K  L    +V ++L+D+Y K GCL+ + KVF++I + +V++WN +I     +G+   A
Sbjct: 545 AIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEA 604

Query: 627 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 686
           +     M + G++P+ +TF+S+ +ACS  G+VD G+  F+ MK  + +EP  DHYAC++D
Sbjct: 605 IDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVD 664

Query: 687 LLGRVGQLENAMEIVESMPYEAD-AKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSS 746
           LLGR G+++ A +++  MP + + A  + +LL A   H N+ +GE  A   +QL PN +S
Sbjct: 665 LLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVAS 724

Query: 747 FYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQIND 806
            Y+LLAN+Y      D + + R+ M+++GVRK PG SWIE   ++H FV G+ +HPQ   
Sbjct: 725 HYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEK 784

Query: 807 IQEKLEFLRAEFKSRGFM---------YHEDENS---SHHSEKLALAFGLVNLPPTAVVR 866
           +   LE L    +  G++           EDE       HSEKLA+AFG++N  P  ++R
Sbjct: 785 LSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIR 844

Query: 867 IMKNISICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           + KN+ +C +CH     ++K+V+REII+RD R  H  KNG+CSC  Y
Sbjct: 845 VAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDY 889

BLAST of Bhi01G001040 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 485.0 bits (1247), Expect = 1.3e-136
Identity = 267/868 (30.76%), Postives = 470/868 (54.15%), Query Frame = 0

Query: 27  SICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEM-PDRDVV 86
           ++ +S +L E   +H+ +I LGL  + + S  L+  Y+       + ++F  + P ++V 
Sbjct: 13  ALSSSSNLNELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVY 72

Query: 87  SWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 146
            W ++  A+ +N  + EA E +  +      P+++T  ++I++C+   + E+G  V+  +
Sbjct: 73  LWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQI 132

Query: 147 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           +  GFE+   +G  L+++Y++      A +VF  M   D V+W  +IS       ++EAL
Sbjct: 133 LDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEAL 192

Query: 207 QLYITMMNSGVTPNEFTFTKLL-ATTNFLGLKYGKLLHCHMITLGVNLNVVLKTALVDVY 266
           ++Y  + NS + P+ FT + +L A  N L +K G+ LH   +  GVN  VV+   LV +Y
Sbjct: 193 EIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMY 252

Query: 267 SRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSFTYS 326
            +++   DA +V ++   +D   + ++I  + +   V+E++  F E  +    P+  T S
Sbjct: 253 LKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVS 312

Query: 327 SALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDALRVFRTITS 386
           S L AC  +  L L K I+  ++ AG   +    + LI++Y KC + I  A  VF ++  
Sbjct: 313 SVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMI-TARDVFNSMEC 372

Query: 387 PSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSAKSQNQTSMF 446
              + W S+ISG  + G   +  + F  M     Q +  T   ++  S+           
Sbjct: 373 KDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGL 432

Query: 447 HGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLNQMGDH 506
           H   +K     D+ V NAL+D YA+  +V D+ ++ S+M   D +T+ ++ +   + GD 
Sbjct: 433 HSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDF 492

Query: 507 EMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLDNTCSVKNSL 566
              L++   MR   V  D  +    +     L     GK++HC  L++G ++   + N+L
Sbjct: 493 ATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNAL 552

Query: 567 MDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNMRLAGLEPDS 626
           +++Y K GCL+++++VFE +S+ DVV+W GMI      G    AL  F +M  +G+ PDS
Sbjct: 553 IEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDS 612

Query: 627 ITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQLENAMEIVE 686
           + F++I+ ACS  GLVD G+  F  MK  +KI+P ++HYAC++DLL R  ++  A E ++
Sbjct: 613 VVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQ 672

Query: 687 SMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANLYDGYNRQDL 746
           +MP + DA I+ ++L+AC   G+M   E V+ R ++LNP+D  + +L +N Y    + D 
Sbjct: 673 AMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDK 732

Query: 747 SAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFLRAEFKSRGF 806
            +  RK ++D+ + K+PG SWIE+   +H+F +G+ + PQ   I + LE L +     G+
Sbjct: 733 VSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILYSLMAKEGY 792

Query: 807 MYHEDENSSH-------------HSEKLALAFGLVNLPPTAVVRIMKNISICRECHDFIL 866
           +    E S +             HSE+LA+AFGL+N  P   +++MKN+ +C +CH+   
Sbjct: 793 IPDPREVSQNLEEEEEKRRLICGHSERLAIAFGLLNTEPGTPLQVMKNLRVCGDCHEVTK 852

Query: 867 LVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           L++K+V REI+VRD    H+ K+G+CSC
Sbjct: 853 LISKIVGREILVRDANRFHLFKDGTCSC 878

BLAST of Bhi01G001040 vs. TAIR 10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 462.6 bits (1189), Expect = 7.0e-130
Identity = 276/940 (29.36%), Postives = 466/940 (49.57%), Query Frame = 0

Query: 31  SKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEMPDRDVVSWTTM 90
           S  L  G C H+ I+    +   +L NNL+++Y+K   L  AR +FD+MPDRD+VSW ++
Sbjct: 52  SSDLMLGKCTHARILTFEENPERFLINNLISMYSKCGSLTYARRVFDKMPDRDLVSWNSI 111

Query: 91  QAAYVRNRSYI-----EAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 150
            AAY ++   +     +AF LF ++       +  TLS +++ C  +G +      HGY 
Sbjct: 112 LAAYAQSSECVVENIQQAFLLFRILRQDVVYTSRMTLSPMLKLCLHSGYVWASESFHGYA 171

Query: 151 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 210
            K G +    +   L+N+Y K    +E   +F  M   D V W +M+ + ++    +EA+
Sbjct: 172 CKIGLDGDEFVAGALVNIYLKFGKVKEGKVLFEEMPYRDVVLWNLMLKAYLEMGFKEEAI 231

Query: 211 QLYITMMNSGVTPNEF-------------------------------------------- 270
            L     +SG+ PNE                                             
Sbjct: 232 DLSSAFHSSGLNPNEITLRLLARISGDDSDAGQVKSFANGNDASSVSEIIFRNKGLSEYL 291

Query: 271 -------------------------TFTKLLAT-TNFLGLKYGKLLHCHMITLGVNLNVV 330
                                    TF  +LAT      L  G+ +HC  + LG++L + 
Sbjct: 292 HSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLALGQQVHCMALKLGLDLMLT 351

Query: 331 LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 390
           +  +L+++Y + ++   A  V +   E+D+  W S+I+   QN    EA+  F ++   G
Sbjct: 352 VSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG 411

Query: 391 ILPNSFTYSSALSACTSIP-SLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDD 450
           + P+ +T +S L A +S+P  L L KQ+H+  I     +D    +ALI+ Y + +  + +
Sbjct: 412 LKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSR-NRCMKE 471

Query: 451 ALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSS 510
           A  +F    +  ++ W ++++G  +        + F  M   G + + FTL+++      
Sbjct: 472 AEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGF 531

Query: 511 AKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSL 570
             + NQ    H Y +K     D+ V + ++D Y +   +  A     ++   D + +T++
Sbjct: 532 LFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTM 591

Query: 571 ATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGL 630
            +   + G+ E A  +   MR   V  DE ++ +L  A + L  +E G+Q+H  +LK   
Sbjct: 592 ISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNC 651

Query: 631 DNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDN 690
            N   V  SL+D+Y K G + DA  +F+ I   ++ +WN M+  LA +G     L  F  
Sbjct: 652 TNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQ 711

Query: 691 MRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVG 750
           M+  G++PD +TF+ +LSACS  GLV     +  SM   + I+PE++HY+C+ D LGR G
Sbjct: 712 MKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAG 771

Query: 751 QLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLAN 810
            ++ A  ++ESM  EA A +Y+TLL AC   G+   G+ VA + L+L P DSS Y+LL+N
Sbjct: 772 LVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSN 831

Query: 811 LYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEF 870
           +Y   ++ D     R +M+   V+K PG SWIE+ +KIH+FV  +R++ Q   I  K++ 
Sbjct: 832 MYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKD 891

Query: 871 LRAEFKSRGFM---------YHEDENSS---HHSEKLALAFGLVNLPPTAVVRIMKNISI 883
           +  + K  G++           E+E      +HSEKLA+AFGL++ PP+  +R++KN+ +
Sbjct: 892 MIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRV 951

BLAST of Bhi01G001040 vs. ExPASy Swiss-Prot
Match: Q9FLX6 (Pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H31 PE=2 SV=1)

HSP 1 Score: 865.1 bits (2234), Expect = 6.6e-250
Identity = 425/872 (48.74%), Postives = 598/872 (68.58%), Query Frame = 0

Query: 9   LLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFG 68
           L   NEL  L++SC +++S C S S + G+ +H P+IK GL  NL L NNLL+LY K  G
Sbjct: 14  LSRTNELGNLQKSCIRILSFCESNSSRIGLHIHCPVIKFGLLENLDLCNNLLSLYLKTDG 73

Query: 69  LKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRS 128
           +  AR LFDEM  R V +WT M +A+ +++ +  A  LF+ M+  G  PNEFT S+++RS
Sbjct: 74  IWNARKLFDEMSHRTVFAWTVMISAFTKSQEFASALSLFEEMMASGTHPNEFTFSSVVRS 133

Query: 129 CSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTW 188
           C+   ++  G  VHG VIK GFE   V+G +L +LY+KC   +EA E+F ++ + DT++W
Sbjct: 134 CAGLRDISYGGRVHGSVIKTGFEGNSVVGSSLSDLYSKCGQFKEACELFSSLQNADTISW 193

Query: 189 TVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITL 248
           T+MISSLV A+KW EALQ Y  M+ +GV PNEFTF KLL  ++FLGL++GK +H ++I  
Sbjct: 194 TMMISSLVGARKWREALQFYSEMVKAGVPPNEFTFVKLLGASSFLGLEFGKTIHSNIIVR 253

Query: 249 GVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAF 308
           G+ LNVVLKT+LVD YS++ ++EDA++V N + E+DVFLWTS++S F +NL+ KEA+  F
Sbjct: 254 GIPLNVVLKTSLVDFYSQFSKMEDAVRVLNSSGEQDVFLWTSVVSGFVRNLRAKEAVGTF 313

Query: 309 QEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKC 368
            EMR  G+ PN+FTYS+ LS C+++ SL  GKQIH Q I  G E     G+AL++MYMKC
Sbjct: 314 LEMRSLGLQPNNFTYSAILSLCSAVRSLDFGKQIHSQTIKVGFEDSTDVGNALVDMYMKC 373

Query: 369 SNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSI 428
           S    +A RVF  + SP+V+ WT+LI GL +HG  QDC+   ++M    V+PN  TLS +
Sbjct: 374 SASEVEASRVFGAMVSPNVVSWTTLILGLVDHGFVQDCFGLLMEMVKREVEPNVVTLSGV 433

Query: 429 LGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDA 488
           L A S  +   +    H Y+L+     ++VVGN+LVDAYA S KVD A  VI +M  RD 
Sbjct: 434 LRACSKLRHVRRVLEIHAYLLRRHVDGEMVVGNSLVDAYASSRKVDYAWNVIRSMKRRDN 493

Query: 489 ITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCY 548
           ITYTSL TR N++G HEMAL +I+ M  D + MD++SL   +SA   LG +ETGK LHCY
Sbjct: 494 ITYTSLVTRFNELGKHEMALSVINYMYGDGIRMDQLSLPGFISASANLGALETGKHLHCY 553

Query: 549 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 608
           S+K G     SV NSL+D+Y K G L+DA KVFEEI+ PDVVSWNG++S LA NG ISSA
Sbjct: 554 SVKSGFSGAASVLNSLVDMYSKCGSLEDAKKVFEEIATPDVVSWNGLVSGLASNGFISSA 613

Query: 609 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 668
           L+AF+ MR+   EPDS+TFL +LSACS G L D G+ YF  MK  + IEP+++HY  ++ 
Sbjct: 614 LSAFEEMRMKETEPDSVTFLILLSACSNGRLTDLGLEYFQVMKKIYNIEPQVEHYVHLVG 673

Query: 669 LLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSF 728
           +LGR G+LE A  +VE+M  + +A I+KTLL+AC + GN+ LGED+A +GL L P+D + 
Sbjct: 674 ILGRAGRLEEATGVVETMHLKPNAMIFKTLLRACRYRGNLSLGEDMANKGLALAPSDPAL 733

Query: 729 YLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTH-PQIND 788
           Y+LLA+LYD   + +L+ KTR LM ++ + K  G+S +E+  K+H FV+ + T   + N 
Sbjct: 734 YILLADLYDESGKPELAQKTRNLMTEKRLSKKLGKSTVEVQGKVHSFVSEDVTRVDKTNG 793

Query: 789 IQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECH 848
           I  ++E ++ E K  G  Y  +EN+S HS K A+ +G +   P A V ++KN  +C++CH
Sbjct: 794 IYAEIESIKEEIKRFGSPYRGNENASFHSAKQAVVYGFIYASPEAPVHVVKNKILCKDCH 853

Query: 849 DFILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           +F+ ++T++V+++I VRDG  +H+ KNG CSC
Sbjct: 854 EFVSILTRLVDKKITVRDGNQVHIFKNGECSC 885

BLAST of Bhi01G001040 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 501.5 bits (1290), Expect = 1.9e-140
Identity = 279/879 (31.74%), Postives = 467/879 (53.13%), Query Frame = 0

Query: 19   EESCSQLISICNSKSLKEGVC--VHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLF 78
            E + S ++  C   S+   V   +H+ I+  GL  +  + N L+ LY++   +  AR +F
Sbjct: 186  EGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVF 245

Query: 79   DEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELE 138
            D +  +D  SW  M +   +N    EA  LF  M  LG  P  +  S+++ +C +   LE
Sbjct: 246  DGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLE 305

Query: 139  LGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLV 198
            +G  +HG V+K GF +   +   L++LY        A  +F NM   D VT+  +I+ L 
Sbjct: 306  IGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLS 365

Query: 199  QAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKY-GKLLHCHMITLGVNLNVV 258
            Q    ++A++L+  M   G+ P+  T   L+   +  G  + G+ LH +   LG   N  
Sbjct: 366  QCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNK 425

Query: 259  LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 318
            ++ AL+++Y++  ++E A+    +T  ++V LW  ++  +     ++ +   F++M++  
Sbjct: 426  IEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEE 485

Query: 319  ILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDA 378
            I+PN +TY S L  C  +  L+LG+QIH Q+I    + +    S LI+MY K    +D A
Sbjct: 486  IVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGK-LDTA 545

Query: 379  LRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSA 438
              +        V+ WT++I+G  ++  +      F  M   G++ +   L++ + A +  
Sbjct: 546  WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 605

Query: 439  KSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLA 498
            ++  +    H          D+   NALV  Y+R  K++++          D I + +L 
Sbjct: 606  QALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALV 665

Query: 499  TRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLD 558
            +   Q G++E AL++   M  + ++ +  +  S V A +    ++ GKQ+H    K G D
Sbjct: 666  SGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYD 725

Query: 559  NTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNM 618
            +   V N+L+ +Y K G + DA K F E+S  + VSWN +I+  + +G  S AL +FD M
Sbjct: 726  SETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQM 785

Query: 619  RLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQ 678
              + + P+ +T + +LSACS  GLVD G+ YF SM + + + P+ +HY C++D+L R G 
Sbjct: 786  IHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGL 845

Query: 679  LENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANL 738
            L  A E ++ MP + DA +++TLL AC  H NM +GE  A   L+L P DS+ Y+LL+NL
Sbjct: 846  LSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNL 905

Query: 739  YDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFL 798
            Y    + D    TR+ M+++GV+K PGQSWIE+ + IH F  G++ HP  ++I E  + L
Sbjct: 906  YAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDL 965

Query: 799  RAEFKSRGF----------MYHEDENS--SHHSEKLALAFGLVNLPPTAVVRIMKNISIC 858
                   G+          + HE ++     HSEKLA++FGL++LP T  + +MKN+ +C
Sbjct: 966  TKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVC 1025

Query: 859  RECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
             +CH +I  V+KV  REIIVRD    H  + G+CSC  Y
Sbjct: 1026 NDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDY 1063

BLAST of Bhi01G001040 vs. ExPASy Swiss-Prot
Match: Q7Y211 (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 488.8 bits (1257), Expect = 1.3e-136
Identity = 266/827 (32.16%), Postives = 460/827 (55.62%), Query Frame = 0

Query: 87  WTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYVI 146
           W  +  + VR+    EA   +  M+ LG  P+ +    L+++ ++  ++ELG  +H +V 
Sbjct: 65  WIDLLRSKVRSNLLREAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVY 124

Query: 147 KGGFETKPV-LGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           K G+    V +  TL+NLY KC      Y+VF  + + + V+W  +ISSL   +KW+ AL
Sbjct: 125 KFGYGVDSVTVANTLVNLYRKCGDFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMAL 184

Query: 207 QLYITMMNSGVTPNEFTFTKLLATTNFL----GLKYGKLLHCHMITLGVNLNVVLKTALV 266
           + +  M++  V P+ FT   ++   + L    GL  GK +H + +  G  LN  +   LV
Sbjct: 185 EAFRCMLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKG-ELNSFIINTLV 244

Query: 267 DVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSF 326
            +Y +  +L  +  +      +D+  W +++S   QN ++ EA+   +EM + G+ P+ F
Sbjct: 245 AMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEF 304

Query: 327 TYSSALSACTSIPSLKLGKQIHLQVILAG-LEADVCAGSALINMYMKCSNFIDDALRVFR 386
           T SS L AC+ +  L+ GK++H   +  G L+ +   GSAL++MY  C   +    RVF 
Sbjct: 305 TISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVL-SGRRVFD 364

Query: 387 TITSPSVICWTSLISGLAEHGCEQDCYRYFLDM-QAAGVQPNAFTLSSILGASSSAKSQN 446
            +    +  W ++I+G +++  +++    F+ M ++AG+  N+ T++ ++ A   + + +
Sbjct: 365 GMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFS 424

Query: 447 QTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLN 506
           +    HG+++K     D  V N L+D Y+R  K+D A R+   M  RD +T+ ++ T   
Sbjct: 425 RKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYV 484

Query: 507 QMGDHEMALKIIDSMR---------ADNVEM--DEISLTSLVSALTGLGIVETGKQLHCY 566
               HE AL ++  M+         A  V +  + I+L +++ +   L  +  GK++H Y
Sbjct: 485 FSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAY 544

Query: 567 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 626
           ++K  L    +V ++L+D+Y K GCL+ + KVF++I + +V++WN +I     +G+   A
Sbjct: 545 AIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEA 604

Query: 627 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 686
           +     M + G++P+ +TF+S+ +ACS  G+VD G+  F+ MK  + +EP  DHYAC++D
Sbjct: 605 IDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVD 664

Query: 687 LLGRVGQLENAMEIVESMPYEAD-AKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSS 746
           LLGR G+++ A +++  MP + + A  + +LL A   H N+ +GE  A   +QL PN +S
Sbjct: 665 LLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVAS 724

Query: 747 FYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQIND 806
            Y+LLAN+Y      D + + R+ M+++GVRK PG SWIE   ++H FV G+ +HPQ   
Sbjct: 725 HYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEK 784

Query: 807 IQEKLEFLRAEFKSRGFM---------YHEDENS---SHHSEKLALAFGLVNLPPTAVVR 866
           +   LE L    +  G++           EDE       HSEKLA+AFG++N  P  ++R
Sbjct: 785 LSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIR 844

Query: 867 IMKNISICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           + KN+ +C +CH     ++K+V+REII+RD R  H  KNG+CSC  Y
Sbjct: 845 VAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDY 889

BLAST of Bhi01G001040 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 485.0 bits (1247), Expect = 1.9e-135
Identity = 267/868 (30.76%), Postives = 470/868 (54.15%), Query Frame = 0

Query: 27  SICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEM-PDRDVV 86
           ++ +S +L E   +H+ +I LGL  + + S  L+  Y+       + ++F  + P ++V 
Sbjct: 13  ALSSSSNLNELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVY 72

Query: 87  SWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 146
            W ++  A+ +N  + EA E +  +      P+++T  ++I++C+   + E+G  V+  +
Sbjct: 73  LWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQI 132

Query: 147 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           +  GFE+   +G  L+++Y++      A +VF  M   D V+W  +IS       ++EAL
Sbjct: 133 LDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEAL 192

Query: 207 QLYITMMNSGVTPNEFTFTKLL-ATTNFLGLKYGKLLHCHMITLGVNLNVVLKTALVDVY 266
           ++Y  + NS + P+ FT + +L A  N L +K G+ LH   +  GVN  VV+   LV +Y
Sbjct: 193 EIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMY 252

Query: 267 SRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSFTYS 326
            +++   DA +V ++   +D   + ++I  + +   V+E++  F E  +    P+  T S
Sbjct: 253 LKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVS 312

Query: 327 SALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDALRVFRTITS 386
           S L AC  +  L L K I+  ++ AG   +    + LI++Y KC + I  A  VF ++  
Sbjct: 313 SVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMI-TARDVFNSMEC 372

Query: 387 PSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSAKSQNQTSMF 446
              + W S+ISG  + G   +  + F  M     Q +  T   ++  S+           
Sbjct: 373 KDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGL 432

Query: 447 HGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLNQMGDH 506
           H   +K     D+ V NAL+D YA+  +V D+ ++ S+M   D +T+ ++ +   + GD 
Sbjct: 433 HSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDF 492

Query: 507 EMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLDNTCSVKNSL 566
              L++   MR   V  D  +    +     L     GK++HC  L++G ++   + N+L
Sbjct: 493 ATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNAL 552

Query: 567 MDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNMRLAGLEPDS 626
           +++Y K GCL+++++VFE +S+ DVV+W GMI      G    AL  F +M  +G+ PDS
Sbjct: 553 IEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDS 612

Query: 627 ITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQLENAMEIVE 686
           + F++I+ ACS  GLVD G+  F  MK  +KI+P ++HYAC++DLL R  ++  A E ++
Sbjct: 613 VVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQ 672

Query: 687 SMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANLYDGYNRQDL 746
           +MP + DA I+ ++L+AC   G+M   E V+ R ++LNP+D  + +L +N Y    + D 
Sbjct: 673 AMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDK 732

Query: 747 SAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFLRAEFKSRGF 806
            +  RK ++D+ + K+PG SWIE+   +H+F +G+ + PQ   I + LE L +     G+
Sbjct: 733 VSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILYSLMAKEGY 792

Query: 807 MYHEDENSSH-------------HSEKLALAFGLVNLPPTAVVRIMKNISICRECHDFIL 866
           +    E S +             HSE+LA+AFGL+N  P   +++MKN+ +C +CH+   
Sbjct: 793 IPDPREVSQNLEEEEEKRRLICGHSERLAIAFGLLNTEPGTPLQVMKNLRVCGDCHEVTK 852

Query: 867 LVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           L++K+V REI+VRD    H+ K+G+CSC
Sbjct: 853 LISKIVGREILVRDANRFHLFKDGTCSC 878

BLAST of Bhi01G001040 vs. ExPASy Swiss-Prot
Match: Q9SMZ2 (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 9.9e-129
Identity = 276/940 (29.36%), Postives = 466/940 (49.57%), Query Frame = 0

Query: 31  SKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEMPDRDVVSWTTM 90
           S  L  G C H+ I+    +   +L NNL+++Y+K   L  AR +FD+MPDRD+VSW ++
Sbjct: 52  SSDLMLGKCTHARILTFEENPERFLINNLISMYSKCGSLTYARRVFDKMPDRDLVSWNSI 111

Query: 91  QAAYVRNRSYI-----EAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 150
            AAY ++   +     +AF LF ++       +  TLS +++ C  +G +      HGY 
Sbjct: 112 LAAYAQSSECVVENIQQAFLLFRILRQDVVYTSRMTLSPMLKLCLHSGYVWASESFHGYA 171

Query: 151 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 210
            K G +    +   L+N+Y K    +E   +F  M   D V W +M+ + ++    +EA+
Sbjct: 172 CKIGLDGDEFVAGALVNIYLKFGKVKEGKVLFEEMPYRDVVLWNLMLKAYLEMGFKEEAI 231

Query: 211 QLYITMMNSGVTPNEF-------------------------------------------- 270
            L     +SG+ PNE                                             
Sbjct: 232 DLSSAFHSSGLNPNEITLRLLARISGDDSDAGQVKSFANGNDASSVSEIIFRNKGLSEYL 291

Query: 271 -------------------------TFTKLLAT-TNFLGLKYGKLLHCHMITLGVNLNVV 330
                                    TF  +LAT      L  G+ +HC  + LG++L + 
Sbjct: 292 HSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLALGQQVHCMALKLGLDLMLT 351

Query: 331 LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 390
           +  +L+++Y + ++   A  V +   E+D+  W S+I+   QN    EA+  F ++   G
Sbjct: 352 VSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG 411

Query: 391 ILPNSFTYSSALSACTSIP-SLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDD 450
           + P+ +T +S L A +S+P  L L KQ+H+  I     +D    +ALI+ Y + +  + +
Sbjct: 412 LKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSR-NRCMKE 471

Query: 451 ALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSS 510
           A  +F    +  ++ W ++++G  +        + F  M   G + + FTL+++      
Sbjct: 472 AEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGF 531

Query: 511 AKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSL 570
             + NQ    H Y +K     D+ V + ++D Y +   +  A     ++   D + +T++
Sbjct: 532 LFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTM 591

Query: 571 ATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGL 630
            +   + G+ E A  +   MR   V  DE ++ +L  A + L  +E G+Q+H  +LK   
Sbjct: 592 ISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNC 651

Query: 631 DNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDN 690
            N   V  SL+D+Y K G + DA  +F+ I   ++ +WN M+  LA +G     L  F  
Sbjct: 652 TNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQ 711

Query: 691 MRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVG 750
           M+  G++PD +TF+ +LSACS  GLV     +  SM   + I+PE++HY+C+ D LGR G
Sbjct: 712 MKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAG 771

Query: 751 QLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLAN 810
            ++ A  ++ESM  EA A +Y+TLL AC   G+   G+ VA + L+L P DSS Y+LL+N
Sbjct: 772 LVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSN 831

Query: 811 LYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEF 870
           +Y   ++ D     R +M+   V+K PG SWIE+ +KIH+FV  +R++ Q   I  K++ 
Sbjct: 832 MYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKD 891

Query: 871 LRAEFKSRGFM---------YHEDENSS---HHSEKLALAFGLVNLPPTAVVRIMKNISI 883
           +  + K  G++           E+E      +HSEKLA+AFGL++ PP+  +R++KN+ +
Sbjct: 892 MIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRV 951

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_038874958.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Benincasa hispida])

HSP 1 Score: 1769.2 bits (4581), Expect = 0.0e+00
Identity = 883/883 (100.00%), Postives = 883/883 (100.00%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL
Sbjct: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF
Sbjct: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM
Sbjct: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL
Sbjct: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK
Sbjct: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI
Sbjct: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
           STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE
Sbjct: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA
Sbjct: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
           FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL
Sbjct: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ
Sbjct: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER
Sbjct: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI
Sbjct: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 884
           SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS
Sbjct: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 883

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_022141235.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Momordica charantia])

HSP 1 Score: 1522.3 bits (3940), Expect = 0.0e+00
Identity = 750/883 (84.94%), Postives = 807/883 (91.39%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           M+CR VPK LNRNEL RLEE+CS LISICNSKSLKEG+CVHSPIIKLGL+GNLYLSNNLL
Sbjct: 1   MICRTVPKFLNRNELNRLEETCSHLISICNSKSLKEGICVHSPIIKLGLYGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
            LYAKRFGLKQARNLFDEMPD+DVVSWTTMQAAYVRNRSYIEAFELFDLM+ LGHCPNEF
Sbjct: 61  TLYAKRFGLKQARNLFDEMPDKDVVSWTTMQAAYVRNRSYIEAFELFDLMVILGHCPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS L+RSCSETGELELG+CVHGY IKGGFE+KPVLGCTLI++YAKCD +EEA EVFRNM
Sbjct: 121 TLSTLLRSCSETGELELGACVHGYAIKGGFESKPVLGCTLIDMYAKCDCTEEACEVFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D+ DTVTWT  ISSLVQAQKW+EALQLYITM+ SGVTPNEFTFTKLLAT NFL LKYGKL
Sbjct: 181 DNADTVTWTATISSLVQAQKWNEALQLYITMIESGVTPNEFTFTKLLATINFLDLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+IT GV+LNV+LKT LVD+YSRYQELEDAMKVANQT EKDV LWTSIIS FNQNLK
Sbjct: 241 LHNHVITFGVDLNVLLKTTLVDMYSRYQELEDAMKVANQTAEKDVHLWTSIISCFNQNLK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIA  QEMR+SGI PNSFTYSS LSACT IPSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIATLQEMRISGIPPNSFTYSSVLSACTLIPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMKCS+ I+DALRVFRTITSP+VICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMKCSDSINDALRVFRTITSPNVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA SSAKSQN+TSMFHGYILK+RAHHDI+VGNALVDAYARS  VD+A RVI
Sbjct: 421 NSFTLSSILGACSSAKSQNRTSMFHGYILKIRAHHDIIVGNALVDAYARSRMVDEAWRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
           STMNHRDAITYTSLATRLNQMGDHEMALK I SMR DNV  DE+SL SL+SA TGLG V+
Sbjct: 481 STMNHRDAITYTSLATRLNQMGDHEMALKTISSMRDDNVRKDEVSLASLISAATGLGTVK 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            G+QLHCYSLKYGL NT SVKNSL+DLYGKVGCLKDA K FEEI++PDVVSWNGMIS+LA
Sbjct: 541 IGEQLHCYSLKYGLYNTRSVKNSLIDLYGKVGCLKDAQKAFEEITEPDVVSWNGMISVLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGH+SSAL+AFDNMRLAGL+PDSITFL ILSACSQGGLVDFGMHYF SM+  H +EPEL
Sbjct: 601 LNGHVSSALSAFDNMRLAGLKPDSITFLLILSACSQGGLVDFGMHYFQSMREIHYVEPEL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C++DLLGR GQLE AME+VESMP+EADAKIYKTLL AC  H NMLLGEDVA RGLQ
Sbjct: 661 DHYVCLVDLLGRAGQLEKAMEVVESMPFEADAKIYKTLLSACKLHKNMLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLANLYD  NR DLS +TRKLMRDRGVRKSP QSW EL + IHLF+TG+R
Sbjct: 721 LDPYDSSFYLLLANLYDELNRPDLSKETRKLMRDRGVRKSPSQSWTELSNSIHLFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQINDIQEKLEFL+AEFK RGF+YH DENSSHHSEKLALAFGL+NLPP AV+RIMKNI
Sbjct: 781 SHPQINDIQEKLEFLKAEFKVRGFLYHGDENSSHHSEKLALAFGLINLPPKAVIRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 884
           SICRECHDFILLVTKV EREI+VRDG  LHV KNGSCSC HYS
Sbjct: 841 SICRECHDFILLVTKVAEREIVVRDGSRLHVFKNGSCSCRHYS 883

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_023542503.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1468.4 bits (3800), Expect = 0.0e+00
Identity = 733/882 (83.11%), Postives = 796/882 (90.25%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEGVCVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGVCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFG+KQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGIKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LG CVHGY IKGGFE+KPVLGCTLI+LYAKCD +EEAYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGGCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTEEAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EALQLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEALQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE AMKVANQTPEKDVFLWTSIIS F+QN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYAMKVANQTPEKDVFLWTSIISCFSQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ IDDALRVF +I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIDDALRVFGSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGYILK  A+HDIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYILKSMAYHDIVVGNALVDAYARSKMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TG+G +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGVGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHCYSL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVS NG+ISILA
Sbjct: 541 TGKQLHCYSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSCNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL+PDSIT LS+LSACSQGGLVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLKPDSITLLSVLSACSQGGLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDL GR GQLE AMEIVESMP+EADAKIY+TLL AC  H N+LLGEDVA RGLQ
Sbjct: 661 DHYVCVIDLHGRAGQLEKAMEIVESMPFEADAKIYRTLLSACKLHRNVLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLMRDRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMRDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP+IND++EKLEFLRAEFKSRGF+YH+DE+S HHSEKLALAFGLV++PP  VVRIMKNI
Sbjct: 781 SHPEINDMEEKLEFLRAEFKSRGFLYHDDEDSCHHSEKLALAFGLVSMPPKGVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           SICRECHDFILL TKVVEREI+VRDG  LHVLKNGSCSC HY
Sbjct: 841 SICRECHDFILLATKVVEREIVVRDGSRLHVLKNGSCSCKHY 876

BLAST of Bhi01G001040 vs. NCBI nr
Match: KAG6574209.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1460.7 bits (3780), Expect = 0.0e+00
Identity = 727/882 (82.43%), Postives = 794/882 (90.02%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEGVCVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGVCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LGSCVHGY IKGGFE+KPVLGCTLI+LYAKCD +++AYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGSCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTKQAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EALQLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEALQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE AMKVANQTPEKDVFLWTSIIS FNQN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYAMKVANQTPEKDVFLWTSIISCFNQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ IDDALRVF +I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIDDALRVFGSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGYILK  A+HDIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYILKSMAYHDIVVGNALVDAYARSGMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TG+G +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGVGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHCYSL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVSWNG+ISILA
Sbjct: 541 TGKQLHCYSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSWNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL+PDSIT LS+LSACSQG LVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLKPDSITLLSVLSACSQGRLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDL GR GQLE AMEIVE+MP+EADAK+YKTLL AC  H N+LLGEDVA RGLQ
Sbjct: 661 DHYVCVIDLHGRAGQLEKAMEIVEAMPFEADAKVYKTLLSACKLHRNVLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLM+DRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMQDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP++ND++EKLEFLRAEFKSRGF+Y +DE+S HHSEKLALAFGLV++PP  VVRIMKNI
Sbjct: 781 SHPEMNDMEEKLEFLRAEFKSRGFLYGDDEDSCHHSEKLALAFGLVSMPPKGVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           SICRECHDFILL TKV+EREI+VRDG  LHV  NGSCSC HY
Sbjct: 841 SICRECHDFILLATKVLEREIVVRDGSRLHVFNNGSCSCKHY 876

BLAST of Bhi01G001040 vs. NCBI nr
Match: KAG7013273.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1458.4 bits (3774), Expect = 0.0e+00
Identity = 726/882 (82.31%), Postives = 792/882 (89.80%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEGVCVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGVCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LGSCVHGY IKGGFE+KPVLGCTLI+LYAKCD +++AYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGSCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTKQAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EALQLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEALQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE A+KVANQTPEKDVFLWTSIIS FNQN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYAIKVANQTPEKDVFLWTSIISCFNQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ IDDALRVF +I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIDDALRVFGSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGYILK  A+HDIVVGNALVD YARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYILKSMAYHDIVVGNALVDGYARSGMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TG+G +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGVGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHCYSL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVSWNG+ISILA
Sbjct: 541 TGKQLHCYSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSWNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL+PDSIT LS+LSACSQG LVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLKPDSITLLSVLSACSQGRLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDL GR GQLE AMEIVE+MP+EADAK+YKTLL AC  H N+LLGEDVA RGLQ
Sbjct: 661 DHYVCVIDLHGRAGQLEKAMEIVEAMPFEADAKVYKTLLSACKLHRNVLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLM+DRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMQDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP++ND++EKLEFLRAEFKSRGF+Y +DE+S HHSEKLALAFGLV +PP  VVRIMKNI
Sbjct: 781 SHPEMNDMEEKLEFLRAEFKSRGFLYGDDEDSCHHSEKLALAFGLVRMPPKGVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           SICRECHDFILL TKVVEREI+VRDG  LHV  NGSCSC HY
Sbjct: 841 SICRECHDFILLATKVVEREIVVRDGSRLHVFNNGSCSCKHY 876

BLAST of Bhi01G001040 vs. ExPASy TrEMBL
Match: A0A6J1CHG9 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111011682 PE=3 SV=1)

HSP 1 Score: 1522.3 bits (3940), Expect = 0.0e+00
Identity = 750/883 (84.94%), Postives = 807/883 (91.39%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           M+CR VPK LNRNEL RLEE+CS LISICNSKSLKEG+CVHSPIIKLGL+GNLYLSNNLL
Sbjct: 1   MICRTVPKFLNRNELNRLEETCSHLISICNSKSLKEGICVHSPIIKLGLYGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
            LYAKRFGLKQARNLFDEMPD+DVVSWTTMQAAYVRNRSYIEAFELFDLM+ LGHCPNEF
Sbjct: 61  TLYAKRFGLKQARNLFDEMPDKDVVSWTTMQAAYVRNRSYIEAFELFDLMVILGHCPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS L+RSCSETGELELG+CVHGY IKGGFE+KPVLGCTLI++YAKCD +EEA EVFRNM
Sbjct: 121 TLSTLLRSCSETGELELGACVHGYAIKGGFESKPVLGCTLIDMYAKCDCTEEACEVFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D+ DTVTWT  ISSLVQAQKW+EALQLYITM+ SGVTPNEFTFTKLLAT NFL LKYGKL
Sbjct: 181 DNADTVTWTATISSLVQAQKWNEALQLYITMIESGVTPNEFTFTKLLATINFLDLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+IT GV+LNV+LKT LVD+YSRYQELEDAMKVANQT EKDV LWTSIIS FNQNLK
Sbjct: 241 LHNHVITFGVDLNVLLKTTLVDMYSRYQELEDAMKVANQTAEKDVHLWTSIISCFNQNLK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIA  QEMR+SGI PNSFTYSS LSACT IPSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIATLQEMRISGIPPNSFTYSSVLSACTLIPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMKCS+ I+DALRVFRTITSP+VICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMKCSDSINDALRVFRTITSPNVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA SSAKSQN+TSMFHGYILK+RAHHDI+VGNALVDAYARS  VD+A RVI
Sbjct: 421 NSFTLSSILGACSSAKSQNRTSMFHGYILKIRAHHDIIVGNALVDAYARSRMVDEAWRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
           STMNHRDAITYTSLATRLNQMGDHEMALK I SMR DNV  DE+SL SL+SA TGLG V+
Sbjct: 481 STMNHRDAITYTSLATRLNQMGDHEMALKTISSMRDDNVRKDEVSLASLISAATGLGTVK 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            G+QLHCYSLKYGL NT SVKNSL+DLYGKVGCLKDA K FEEI++PDVVSWNGMIS+LA
Sbjct: 541 IGEQLHCYSLKYGLYNTRSVKNSLIDLYGKVGCLKDAQKAFEEITEPDVVSWNGMISVLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGH+SSAL+AFDNMRLAGL+PDSITFL ILSACSQGGLVDFGMHYF SM+  H +EPEL
Sbjct: 601 LNGHVSSALSAFDNMRLAGLKPDSITFLLILSACSQGGLVDFGMHYFQSMREIHYVEPEL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C++DLLGR GQLE AME+VESMP+EADAKIYKTLL AC  H NMLLGEDVA RGLQ
Sbjct: 661 DHYVCLVDLLGRAGQLEKAMEVVESMPFEADAKIYKTLLSACKLHKNMLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLANLYD  NR DLS +TRKLMRDRGVRKSP QSW EL + IHLF+TG+R
Sbjct: 721 LDPYDSSFYLLLANLYDELNRPDLSKETRKLMRDRGVRKSPSQSWTELSNSIHLFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQINDIQEKLEFL+AEFK RGF+YH DENSSHHSEKLALAFGL+NLPP AV+RIMKNI
Sbjct: 781 SHPQINDIQEKLEFLKAEFKVRGFLYHGDENSSHHSEKLALAFGLINLPPKAVIRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 884
           SICRECHDFILLVTKV EREI+VRDG  LHV KNGSCSC HYS
Sbjct: 841 SICRECHDFILLVTKVAEREIVVRDGSRLHVFKNGSCSCRHYS 883

BLAST of Bhi01G001040 vs. ExPASy TrEMBL
Match: A0A6J1HYM3 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111467799 PE=3 SV=1)

HSP 1 Score: 1457.6 bits (3772), Expect = 0.0e+00
Identity = 727/883 (82.33%), Postives = 793/883 (89.81%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEGVCVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGVCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LGSCVHGY IKGGFE+KPVLGCTLI+LYAKCD ++EAYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGSCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTKEAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EA QLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEAPQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE AMKVANQTPEKDVFLWTSIIS FNQN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYAMKVANQTPEKDVFLWTSIISCFNQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQ+ILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQIILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ I+DALRVFR+I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIEDALRVFRSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGY+LK  A+ DIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYVLKSMAYQDIVVGNALVDAYARSGMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TGLG +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGLGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHC+SL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVSWNG+ISILA
Sbjct: 541 TGKQLHCFSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSWNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL PDSIT LS+LSACSQGGLVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLNPDSITLLSVLSACSQGGLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY  +IDL GR GQLE AMEIVESMP+EADAKIYKTLL AC  H N+LLGEDVA RGL 
Sbjct: 661 DHYVRVIDLHGRAGQLEKAMEIVESMPFEADAKIYKTLLSACKLHRNVLLGEDVARRGLH 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLMRDRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMRDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP++ND++EKLEFLRAEFKSRGF+Y +DE+S HHSEKLALAFGLV++PP AV+RIMKNI
Sbjct: 781 SHPEMNDMEEKLEFLRAEFKSRGFLYRDDEDSCHHSEKLALAFGLVSMPPEAVIRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 884
           SICRECHDFI+L TKVVEREI+VRD   LHV KNGSCSC HYS
Sbjct: 841 SICRECHDFIVLATKVVEREIVVRDRSRLHVFKNGSCSCKHYS 877

BLAST of Bhi01G001040 vs. ExPASy TrEMBL
Match: A0A6J1G1W5 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111449930 PE=3 SV=1)

HSP 1 Score: 1456.0 bits (3768), Expect = 0.0e+00
Identity = 726/882 (82.31%), Postives = 790/882 (89.57%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEG+CVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGLCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LGSCVHGY IKGGFE+KPVLGCTLI+LYAKCD ++EAYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGSCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTKEAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EALQLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEALQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE A KVANQTPEKDVFLWTSIIS FNQN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYATKVANQTPEKDVFLWTSIISCFNQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAF EMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFLEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ IDDALRVF +I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIDDALRVFGSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGYILK  A+HDIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYILKSMAYHDIVVGNALVDAYARSGMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TG+G +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGVGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            GKQLHCYSL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVSWNG+ISILA
Sbjct: 541 AGKQLHCYSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSWNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL+PDSIT LS+LSACSQGGLVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLKPDSITLLSVLSACSQGGLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDL GR GQLE AMEIVE MP+EADAK+YKTLL AC  H N+LLGEDVA RGLQ
Sbjct: 661 DHYVCVIDLHGRAGQLEKAMEIVEGMPFEADAKVYKTLLSACKLHRNVLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLMRDRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMRDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP++ND++EKLEFLRAEFKSRGF+Y +DE+S HHSEKLALAFGLV++PP  VVRIMKNI
Sbjct: 781 SHPEMNDMEEKLEFLRAEFKSRGFLYGDDEDSCHHSEKLALAFGLVSMPPKGVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           SICRECHDFILL TKVVEREI+VRDG  LHV  NGSCSC  Y
Sbjct: 841 SICRECHDFILLATKVVEREIVVRDGSRLHVFNNGSCSCKRY 876

BLAST of Bhi01G001040 vs. ExPASy TrEMBL
Match: A0A7N2RAB8 (DYW_deaminase domain-containing protein OS=Quercus lobata OX=97700 PE=3 SV=1)

HSP 1 Score: 1080.5 bits (2793), Expect = 0.0e+00
Identity = 524/882 (59.41%), Postives = 685/882 (77.66%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLC+ V K  +R ELYR ++ C +++S+CNSKSLKEGVCVHSPIIK+GL  ++YL+NNLL
Sbjct: 1   MLCKTVTKTCHRTELYRFQDICLRVVSLCNSKSLKEGVCVHSPIIKMGLQDDMYLNNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAK FG+  A + FDEMP +DVVSWT + ++YV N ++ +A  LFD ML     PNEF
Sbjct: 61  SLYAKCFGVDHAHHFFDEMPCKDVVSWTGILSSYVINENHEQALRLFDSMLNSSQYPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS+++RSCS  GE + G+ +  Y+IK GF + P+L   LI+LY+KC+ ++EAY+VF  +
Sbjct: 121 TLSSVLRSCSALGEFDYGTLIQAYMIKNGFHSNPILASALIDLYSKCNCTKEAYKVFECV 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D  DTV+WT MISSLVQAQKW +ALQLYI M+   V PNEFTF KLLA +  LG  YGKL
Sbjct: 181 DGGDTVSWTTMISSLVQAQKWSQALQLYIRMIEKKVPPNEFTFVKLLAASGSLGSSYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           +H HMI LG+ LNV+LKTALVD+YS+   +EDA+KV+NQTPE+DVFLWT+IIS F QN+K
Sbjct: 241 VHAHMILLGIELNVILKTALVDMYSKCHRMEDAVKVSNQTPERDVFLWTAIISGFIQNMK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAA  EM MSGI+PN+F+YS+ L+A +SI SL+LG+Q+H +VI AGLE D+  G+A
Sbjct: 301 VKEAIAALSEMVMSGIVPNNFSYSTILNASSSILSLELGEQVHSRVIKAGLEDDISVGNA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LI+MYMKCSN ID+ALRVFR +TSP+VI WTSLI+G A+HG E+D +R F +M+A G+ P
Sbjct: 361 LIDMYMKCSNLIDNALRVFRGVTSPNVITWTSLIAGFAKHGFEEDSFRSFEEMRALGLAP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA S+ KS +QT   HGYI+K++A  DIVVGNALVDAYA    VD+A  VI
Sbjct: 421 NSFTLSSILGACSTMKSHSQTMKLHGYIIKIKADCDIVVGNALVDAYAGLGMVDEARCVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
             M+HRDAITYTSLATR+NQMG H+ AL+II  M  D+V+MD  S++S +SA  GLG ++
Sbjct: 481 RKMDHRDAITYTSLATRINQMGYHDRALEIIKYMNKDDVKMDGFSMSSFLSAAAGLGSMK 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            G QLHC+S+K GL    SV N ++DLYGK GC+ DA++ F EI++PDV SWNG IS LA
Sbjct: 541 AGMQLHCFSVKSGLRCWLSVSNGVVDLYGKCGCIHDAHRAFGEITEPDVASWNGWISGLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NG+ISSAL+AF++MRL G++PD +TFL +L ACS GGLVD G+ YFHSM+ TH I P+L
Sbjct: 601 SNGYISSALSAFEDMRLVGVKPDLVTFLLVLFACSHGGLVDLGLEYFHSMRETHGIAPQL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDLLGR GQLE AM ++++MP+  DA IYKTLL A   HGN+ LGED+A +G+ 
Sbjct: 661 DHYVCLIDLLGRAGQLEEAMGVIKTMPFRPDALIYKTLLSASKLHGNVPLGEDMARQGID 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P+D +FY+LLANLYD   R DLS K R LMR+RG+ K+P QSW+E+ ++IH F   +R
Sbjct: 721 LDPSDPAFYILLANLYDRSGRSDLSEKARGLMRERGLTKNPCQSWMEIRNQIHHFTAEDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQIN I EK+E L  EFK RG++Y ++ + S+HSEKLA+AFGL++ P  A + I+K++
Sbjct: 781 SHPQINQIHEKIESLMTEFKYRGYLYRDNRDKSYHSEKLAVAFGLLSTPSKAPILIIKDM 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
            IC +CH F++LVT++V+REII+R+G  +H  K G+CSC  Y
Sbjct: 841 RICMDCHYFVMLVTELVDREIILREGNRVHSFKKGNCSCRGY 882

BLAST of Bhi01G001040 vs. ExPASy TrEMBL
Match: A0A6P5TID2 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic isoform X1 OS=Prunus avium OX=42229 GN=LOC110767578 PE=3 SV=1)

HSP 1 Score: 1070.5 bits (2767), Expect = 3.9e-309
Identity = 522/871 (59.93%), Postives = 681/871 (78.19%), Query Frame = 0

Query: 9   LLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFG 68
           ++N  ++   +E+C +++S+CNS++LKEGVCVHSPI KLGL  +LYLSNNLL+LYAK FG
Sbjct: 10  VINTTQVNCFKETCLRVLSLCNSRALKEGVCVHSPITKLGLQEDLYLSNNLLSLYAKCFG 69

Query: 69  LKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRS 128
           ++ AR+ FDEMPDRDVVSWT M +AYVRN  Y EA E FDLM   G CPNEFTLS+++RS
Sbjct: 70  VEPARHFFDEMPDRDVVSWTGMLSAYVRNGRYDEALEFFDLMSISGQCPNEFTLSSVLRS 129

Query: 129 CSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTW 188
           CS  G+ + G+ +H YVIK GFE+   LG T+I+LYAKC F++EA ++FRNMD+ DT++W
Sbjct: 130 CSLLGDFDYGTRIHAYVIKLGFESNQYLGSTMIDLYAKCGFTDEACKIFRNMDNRDTISW 189

Query: 189 TVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITL 248
           T +ISSLVQA+K+ +AL  Y+ M+ +GV PNEFTF KLLA    LGL YGKLLH H+I+L
Sbjct: 190 TTIISSLVQAEKFSQALAHYMEMICAGVHPNEFTFVKLLAAPYSLGLNYGKLLHAHLISL 249

Query: 249 GVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAF 308
           G+ LN+VLKTALV++YS+ Q++EDA+KV+NQTP+ DV LWTS+IS F Q+L+V +AIAA 
Sbjct: 250 GMRLNLVLKTALVNMYSKCQKMEDAIKVSNQTPDYDVLLWTSVISGFTQSLRVTDAIAAL 309

Query: 309 QEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKC 368
            EM +SGI+PN+FTYSS L A + I SL+LGKQIH ++I AGLE D CAG AL++MYMKC
Sbjct: 310 HEMELSGIVPNNFTYSSILKASSEILSLELGKQIHSRIIKAGLEDDTCAGGALVDMYMKC 369

Query: 369 SNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSI 428
           S+  +DAL  FR ITSPSVI WTSLI+G +EHG E+D ++ F +M+A GVQPN+FTLSSI
Sbjct: 370 SDLAEDALGAFRDITSPSVITWTSLIAGFSEHGFEKDSFQSFEEMRAVGVQPNSFTLSSI 429

Query: 429 LGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDA 488
           L A S+ KS +QT   HG I+K +A  D VVGNALVDAYA    VDDA  V+++M HRDA
Sbjct: 430 LRACSTVKSHSQTVKLHGLIVKTKAGCDTVVGNALVDAYAALGMVDDAWHVVTSMIHRDA 489

Query: 489 ITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCY 548
           ITYT LATR+NQMG +E+AL +I  M  D+VEMD  S+ S +S+  GL  +ETG+QLHC 
Sbjct: 490 ITYTCLATRMNQMGRYEVALDVIVRMHMDDVEMDGFSMASFLSSSAGLAAMETGRQLHCC 549

Query: 549 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 608
           S+K GL +  SV N+L+D YGK GC  DA + F+ IS+PD+VSWNG+IS LA  GHISSA
Sbjct: 550 SIKAGLASGISVSNALVDFYGKCGCTDDAYRAFKGISEPDIVSWNGLISGLASTGHISSA 609

Query: 609 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 668
           L+ FD+MRLAG +PD ITFL +L ACS GGLV+ G+ +F SM+  H+I P+LDHYAC++D
Sbjct: 610 LSTFDDMRLAGFKPDYITFLLVLFACSHGGLVELGLEHFQSMREKHEIAPQLDHYACLVD 669

Query: 669 LLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSF 728
           LLGR G+LE+AME++ +MP++ DA IYKTLL AC  H N+ LGE VA +G +L+P+D +F
Sbjct: 670 LLGRAGRLEDAMEVIMTMPFKPDALIYKTLLGACKSHRNIALGEYVARQGTELDPSDPAF 729

Query: 729 YLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDI 788
           Y+LLANLY+   + DL+  TR++MR+RG++K+PGQ W+E+ +K+HLF  G+R+HPQIN+I
Sbjct: 730 YVLLANLYEESGQPDLAKSTRRVMRERGLKKNPGQCWMEIRNKVHLFNAGDRSHPQINEI 789

Query: 789 QEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECHD 848
            EK+E L  E K+RG +Y + E+SS+HSEKLA+AFGL+  P  A +RI KN+ IC ECH+
Sbjct: 790 HEKVESLITELKNRGNLYQDYEDSSYHSEKLAVAFGLLRTPRNASIRISKNMLICSECHN 849

Query: 849 FILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           FI+LVT+ V+REIIVRDG  LHV K G CSC
Sbjct: 850 FIMLVTQFVDREIIVRDGNRLHVFKKGECSC 880

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT5G52850.14.7e-25148.74Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G13650.11.4e-14131.74Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G57430.19.2e-13832.16Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G03580.11.3e-13630.76Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G33170.17.0e-13029.36Tetratricopeptide repeat (TPR)-like superfamily protein [more]
Match NameE-valueIdentityDescription
Q9FLX66.6e-25048.74Pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Arabidop... [more]
Q9SVP71.9e-14031.74Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX... [more]
Q7Y2111.3e-13632.16Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop... [more]
Q9SS601.9e-13530.76Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX... [more]
Q9SMZ29.9e-12929.36Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_038874958.10.0e+00100.00pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Benincasa ... [more]
XP_022141235.10.0e+0084.94pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Momordica ... [more]
XP_023542503.10.0e+0083.11pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita ... [more]
KAG6574209.10.0e+0082.43Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
KAG7013273.10.0e+0082.31Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
A0A6J1CHG90.0e+0084.94pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Momordic... [more]
A0A6J1HYM30.0e+0082.33pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Cucurbit... [more]
A0A6J1G1W50.0e+0082.31pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Cucurbit... [more]
A0A7N2RAB80.0e+0059.41DYW_deaminase domain-containing protein OS=Quercus lobata OX=97700 PE=3 SV=1[more]
A0A6P5TID23.9e-30959.93pentatricopeptide repeat-containing protein At5g52850, chloroplastic isoform X1 ... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 85..119
e-value: 4.3E-5
score: 21.4
coord: 388..422
e-value: 1.0E-5
score: 23.4
coord: 590..624
e-value: 8.3E-4
score: 17.4
coord: 489..523
e-value: 0.0032
score: 15.5
coord: 186..220
e-value: 9.4E-8
score: 29.8
coord: 287..319
e-value: 5.9E-4
score: 17.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 184..227
e-value: 1.1E-10
score: 41.5
coord: 587..634
e-value: 2.1E-10
score: 40.7
coord: 82..129
e-value: 5.0E-8
score: 33.0
coord: 283..330
e-value: 3.2E-9
score: 36.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 562..584
e-value: 0.023
score: 14.9
coord: 159..180
e-value: 0.016
score: 15.4
coord: 489..518
e-value: 0.039
score: 14.2
coord: 663..686
e-value: 0.78
score: 10.1
coord: 389..418
e-value: 0.0018
score: 18.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 386..420
score: 9.63504
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 284..318
score: 9.985802
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 487..521
score: 9.108898
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 83..117
score: 10.47906
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 588..622
score: 10.457138
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 184..218
score: 12.517862
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 143..232
e-value: 2.2E-18
score: 68.3
coord: 339..444
e-value: 3.8E-16
score: 60.9
coord: 236..338
e-value: 1.6E-16
score: 62.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 445..551
e-value: 1.4E-13
score: 52.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 559..815
e-value: 6.7E-34
score: 119.7
coord: 11..142
e-value: 3.5E-20
score: 74.6
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 761..872
e-value: 1.0E-20
score: 73.9
NoneNo IPR availablePANTHERPTHR47929:SF11BNACNNG15330D PROTEINcoord: 308..533
coord: 502..816
NoneNo IPR availablePANTHERPTHR47929FAMILY NOT NAMEDcoord: 18..404
NoneNo IPR availablePANTHERPTHR47929FAMILY NOT NAMEDcoord: 308..533
coord: 502..816
NoneNo IPR availablePANTHERPTHR47929:SF11BNACNNG15330D PROTEINcoord: 18..404

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi01M001040Bhi01M001040mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding