Bhi01G001040 (gene) Wax gourd

Name: Bhi01G001040
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Pentatricopeptide repeat-containing protein, putative
Location: chr1: 28934351..28937002 (-)
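
The length of the feature follows directly from the location given above. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (a common convention that this record does not state), and `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts.

    Hypothetical helper; the pattern tolerates optional whitespace
    around the ':' and '..' separators.
    """
    pattern = r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr1 : 28934351 .. 28937002 (-)")

# Assumption: coordinates are 1-based and inclusive, hence the +1.
length = end - start + 1
print(chrom, strand, length)  # chr1 - 2652
```

Under that assumption the feature spans 2652 bp on the minus strand, which is a whole number of codons (884 x 3).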
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTATGTAGAGCAGTCCCCAAACTTCTCAACAGAAATGAACTCTATCGTCTGGAAGAAAGCTGCTCTCAGCTTATTTCAATTTGCAATTCCAAATCCTTAAAAGAGGGCGTTTGTGTTCATAGCCCGATTATCAAGCTCGGTCTTCATGGTAATCTGTATCTGAGCAATAATTTACTAGCTCTTTATGCTAAACGATTTGGACTCAAACAGGCACGTAACCTGTTCGATGAAATGCCTGATAGAGATGTGGTGTCCTGGACCACGATGCAGGCTGCTTATGTCAGGAACAGAAGCTACATTGAGGCTTTTGAATTGTTTGATTTGATGCTAACATTGGGTCACTGTCCAAATGAGTTTACACTTTCGAATTTGATCCGATCGTGCTCTGAAACGGGAGAACTGGAGCTTGGAAGTTGTGTCCATGGCTATGTTATAAAGGGTGGCTTTGAGACGAAGCCAGTGCTGGGATGCACCTTGATTAATCTTTATGCAAAGTGTGATTTCTCTGAGGAAGCTTATGAAGTTTTCAGAAATATGGACGATGTCGATACTGTTACTTGGACCGTGATGATTTCTTCACTAGTGCAAGCACAGAAATGGGATGAGGCTCTTCAGTTATATATCACTATGATGAATTCTGGGGTCACTCCTAATGAGTTCACTTTTACAAAACTTTTAGCCACAACCAATTTTCTGGGTTTGAAATATGGGAAGTTACTCCATTGTCACATGATAACATTGGGAGTCAATCTGAACGTAGTTCTAAAGACGGCGCTCGTCGATGTGTATTCAAGATACCAAGAGTTAGAAGATGCAATGAAGGTTGCAAATCAAACACCTGAGAAAGACGTGTTTTTGTGGACCTCTATTATCTCCTTCTTCAATCAGAATTTGAAGGTCAAGGAGGCTATTGCTGCATTCCAAGAGATGAGAATGTCTGGAATTTTACCAAACAGTTTCACATATTCCAGTGCGTTAAGTGCCTGCACATCGATCCCGTCGCTTAAATTAGGTAAGCAAATTCACTTGCAGGTAATATTGGCTGGGTTGGAGGCTGACGTTTGTGCTGGGAGTGCACTAATTAATATGTACATGAAATGTTCTAACTTCATAGATGATGCCTTGAGAGTGTTTAGGACAATAACTTCCCCAAGTGTTATTTGTTGGACTTCTTTAATATCTGGTCTTGCCGAGCATGGTTGTGAACAAGATTGTTATAGATATTTTTTGGATATGCAAGCAGCAGGAGTGCAGCCAAATGCCTTTACTCTTTCTAGTATCCTTGGGGCCAGCAGTTCAGCGAAATCACAAAATCAAACATCGATGTTCCATGGATATATACTAAAAATGAGGGCTCACCATGATATTGTTGTTGGAAATGCTCTTGTGGATGCTTATGCTCGATCTGCAAAGGTGGATGATGCTTGCCGAGTGATTAGCACCATGAATCATCGGGATGCCATCACTTATACTAGCTTAGCCACGAGATTGAATCAGATGGGTGATCATGAAATGGCACTAAAAATCATTGATTCCATGCGTGCTGACAATGTTGAGATGGATGAAATTAGCTTGACAAGTTTGGTATCTGCATTGACAGGCCTAGGTATAGTTGAAACCGGGAAACAACTTCATTGCTATTCTTTGAAGTATGGCTTAGACAACACCTGCTCAGTAAAAAATAGTTTGATGGACTTATATGGCAAGGTTGGATGCTTGAAGGATGCCAATAAAGTTTTTGAAGAAATAAGCAAACCAGACGTCGTTTCTTGGAATGGAATGATATCTATATTAGCATTCAACGGGCATATCTCCTCTGCTCTTGCTGCCTTTGACAATATGAGATTAGCTGGCCTAGAGCCCGATTCAATCACATTCCTATCAATACTTTCAGCTTGCAGTCAAGGTGGTTTGGTTGATTTTGGAATGCACTACTTTCATTCTATGAAAGCAACCCATAAAATAGAGCCAGAATTGGATCATTATGCTTGTATAAT
TGATCTCCTAGGCCGCGTTGGACAACTAGAGAACGCAATGGAAATCGTAGAATCCATGCCATATGAGGCAGATGCTAAAATCTACAAGACATTGTTGAAAGCCTGCAATTTCCATGGGAACATGCTGCTTGGAGAAGATGTGGCAATAAGAGGACTTCAACTTAACCCAAACGATTCATCTTTCTATTTGCTGCTGGCCAACTTGTACGATGGATACAACCGACAAGATTTAAGTGCAAAAACTCGTAAGCTGATGCGAGATCGTGGAGTGAGGAAGAGTCCTGGCCAAAGTTGGATAGAATTACATAGCAAGATTCATCTCTTTGTCACAGGAGAGAGAACACATCCTCAAATCAATGACATCCAAGAAAAGTTAGAATTCCTCAGAGCTGAGTTCAAGAGTAGGGGGTTTATGTATCATGAAGATGAAAATTCATCCCATCATAGTGAAAAATTGGCTCTTGCATTTGGTCTTGTTAATTTGCCACCCACAGCTGTTGTACGAATAATGAAGAACATAAGCATTTGCAGAGAATGCCATGACTTCATATTGCTAGTAACAAAGGTGGTAGAGAGGGAAATAATTGTGAGAGATGGGCGCGGGCTCCATGTGCTAAAAAATGGAAGCTGCTCTTGCAGCCATTACTCATGA

mRNA sequence

ATGCTATGTAGAGCAGTCCCCAAACTTCTCAACAGAAATGAACTCTATCGTCTGGAAGAAAGCTGCTCTCAGCTTATTTCAATTTGCAATTCCAAATCCTTAAAAGAGGGCGTTTGTGTTCATAGCCCGATTATCAAGCTCGGTCTTCATGGTAATCTGTATCTGAGCAATAATTTACTAGCTCTTTATGCTAAACGATTTGGACTCAAACAGGCACGTAACCTGTTCGATGAAATGCCTGATAGAGATGTGGTGTCCTGGACCACGATGCAGGCTGCTTATGTCAGGAACAGAAGCTACATTGAGGCTTTTGAATTGTTTGATTTGATGCTAACATTGGGTCACTGTCCAAATGAGTTTACACTTTCGAATTTGATCCGATCGTGCTCTGAAACGGGAGAACTGGAGCTTGGAAGTTGTGTCCATGGCTATGTTATAAAGGGTGGCTTTGAGACGAAGCCAGTGCTGGGATGCACCTTGATTAATCTTTATGCAAAGTGTGATTTCTCTGAGGAAGCTTATGAAGTTTTCAGAAATATGGACGATGTCGATACTGTTACTTGGACCGTGATGATTTCTTCACTAGTGCAAGCACAGAAATGGGATGAGGCTCTTCAGTTATATATCACTATGATGAATTCTGGGGTCACTCCTAATGAGTTCACTTTTACAAAACTTTTAGCCACAACCAATTTTCTGGGTTTGAAATATGGGAAGTTACTCCATTGTCACATGATAACATTGGGAGTCAATCTGAACGTAGTTCTAAAGACGGCGCTCGTCGATGTGTATTCAAGATACCAAGAGTTAGAAGATGCAATGAAGGTTGCAAATCAAACACCTGAGAAAGACGTGTTTTTGTGGACCTCTATTATCTCCTTCTTCAATCAGAATTTGAAGGTCAAGGAGGCTATTGCTGCATTCCAAGAGATGAGAATGTCTGGAATTTTACCAAACAGTTTCACATATTCCAGTGCGTTAAGTGCCTGCACATCGATCCCGTCGCTTAAATTAGGTAAGCAAATTCACTTGCAGGTAATATTGGCTGGGTTGGAGGCTGACGTTTGTGCTGGGAGTGCACTAATTAATATGTACATGAAATGTTCTAACTTCATAGATGATGCCTTGAGAGTGTTTAGGACAATAACTTCCCCAAGTGTTATTTGTTGGACTTCTTTAATATCTGGTCTTGCCGAGCATGGTTGTGAACAAGATTGTTATAGATATTTTTTGGATATGCAAGCAGCAGGAGTGCAGCCAAATGCCTTTACTCTTTCTAGTATCCTTGGGGCCAGCAGTTCAGCGAAATCACAAAATCAAACATCGATGTTCCATGGATATATACTAAAAATGAGGGCTCACCATGATATTGTTGTTGGAAATGCTCTTGTGGATGCTTATGCTCGATCTGCAAAGGTGGATGATGCTTGCCGAGTGATTAGCACCATGAATCATCGGGATGCCATCACTTATACTAGCTTAGCCACGAGATTGAATCAGATGGGTGATCATGAAATGGCACTAAAAATCATTGATTCCATGCGTGCTGACAATGTTGAGATGGATGAAATTAGCTTGACAAGTTTGGTATCTGCATTGACAGGCCTAGGTATAGTTGAAACCGGGAAACAACTTCATTGCTATTCTTTGAAGTATGGCTTAGACAACACCTGCTCAGTAAAAAATAGTTTGATGGACTTATATGGCAAGGTTGGATGCTTGAAGGATGCCAATAAAGTTTTTGAAGAAATAAGCAAACCAGACGTCGTTTCTTGGAATGGAATGATATCTATATTAGCATTCAACGGGCATATCTCCTCTGCTCTTGCTGCCTTTGACAATATGAGATTAGCTGGCCTAGAGCCCGATTCAATCACATTCCTATCAATACTTTCAGCTTGCAGTCAAGGTGGTTTGGTTGATTTTGGAATGCACTACTTTCATTCTATGAAAGCAACCCATAAAATAGAGCCAGAATTGGATCATTATGCTTGTATAAT
TGATCTCCTAGGCCGCGTTGGACAACTAGAGAACGCAATGGAAATCGTAGAATCCATGCCATATGAGGCAGATGCTAAAATCTACAAGACATTGTTGAAAGCCTGCAATTTCCATGGGAACATGCTGCTTGGAGAAGATGTGGCAATAAGAGGACTTCAACTTAACCCAAACGATTCATCTTTCTATTTGCTGCTGGCCAACTTGTACGATGGATACAACCGACAAGATTTAAGTGCAAAAACTCGTAAGCTGATGCGAGATCGTGGAGTGAGGAAGAGTCCTGGCCAAAGTTGGATAGAATTACATAGCAAGATTCATCTCTTTGTCACAGGAGAGAGAACACATCCTCAAATCAATGACATCCAAGAAAAGTTAGAATTCCTCAGAGCTGAGTTCAAGAGTAGGGGGTTTATGTATCATGAAGATGAAAATTCATCCCATCATAGTGAAAAATTGGCTCTTGCATTTGGTCTTGTTAATTTGCCACCCACAGCTGTTGTACGAATAATGAAGAACATAAGCATTTGCAGAGAATGCCATGACTTCATATTGCTAGTAACAAAGGTGGTAGAGAGGGAAATAATTGTGAGAGATGGGCGCGGGCTCCATGTGCTAAAAAATGGAAGCTGCTCTTGCAGCCATTACTCATGA

Coding sequence (CDS)

ATGCTATGTAGAGCAGTCCCCAAACTTCTCAACAGAAATGAACTCTATCGTCTGGAAGAAAGCTGCTCTCAGCTTATTTCAATTTGCAATTCCAAATCCTTAAAAGAGGGCGTTTGTGTTCATAGCCCGATTATCAAGCTCGGTCTTCATGGTAATCTGTATCTGAGCAATAATTTACTAGCTCTTTATGCTAAACGATTTGGACTCAAACAGGCACGTAACCTGTTCGATGAAATGCCTGATAGAGATGTGGTGTCCTGGACCACGATGCAGGCTGCTTATGTCAGGAACAGAAGCTACATTGAGGCTTTTGAATTGTTTGATTTGATGCTAACATTGGGTCACTGTCCAAATGAGTTTACACTTTCGAATTTGATCCGATCGTGCTCTGAAACGGGAGAACTGGAGCTTGGAAGTTGTGTCCATGGCTATGTTATAAAGGGTGGCTTTGAGACGAAGCCAGTGCTGGGATGCACCTTGATTAATCTTTATGCAAAGTGTGATTTCTCTGAGGAAGCTTATGAAGTTTTCAGAAATATGGACGATGTCGATACTGTTACTTGGACCGTGATGATTTCTTCACTAGTGCAAGCACAGAAATGGGATGAGGCTCTTCAGTTATATATCACTATGATGAATTCTGGGGTCACTCCTAATGAGTTCACTTTTACAAAACTTTTAGCCACAACCAATTTTCTGGGTTTGAAATATGGGAAGTTACTCCATTGTCACATGATAACATTGGGAGTCAATCTGAACGTAGTTCTAAAGACGGCGCTCGTCGATGTGTATTCAAGATACCAAGAGTTAGAAGATGCAATGAAGGTTGCAAATCAAACACCTGAGAAAGACGTGTTTTTGTGGACCTCTATTATCTCCTTCTTCAATCAGAATTTGAAGGTCAAGGAGGCTATTGCTGCATTCCAAGAGATGAGAATGTCTGGAATTTTACCAAACAGTTTCACATATTCCAGTGCGTTAAGTGCCTGCACATCGATCCCGTCGCTTAAATTAGGTAAGCAAATTCACTTGCAGGTAATATTGGCTGGGTTGGAGGCTGACGTTTGTGCTGGGAGTGCACTAATTAATATGTACATGAAATGTTCTAACTTCATAGATGATGCCTTGAGAGTGTTTAGGACAATAACTTCCCCAAGTGTTATTTGTTGGACTTCTTTAATATCTGGTCTTGCCGAGCATGGTTGTGAACAAGATTGTTATAGATATTTTTTGGATATGCAAGCAGCAGGAGTGCAGCCAAATGCCTTTACTCTTTCTAGTATCCTTGGGGCCAGCAGTTCAGCGAAATCACAAAATCAAACATCGATGTTCCATGGATATATACTAAAAATGAGGGCTCACCATGATATTGTTGTTGGAAATGCTCTTGTGGATGCTTATGCTCGATCTGCAAAGGTGGATGATGCTTGCCGAGTGATTAGCACCATGAATCATCGGGATGCCATCACTTATACTAGCTTAGCCACGAGATTGAATCAGATGGGTGATCATGAAATGGCACTAAAAATCATTGATTCCATGCGTGCTGACAATGTTGAGATGGATGAAATTAGCTTGACAAGTTTGGTATCTGCATTGACAGGCCTAGGTATAGTTGAAACCGGGAAACAACTTCATTGCTATTCTTTGAAGTATGGCTTAGACAACACCTGCTCAGTAAAAAATAGTTTGATGGACTTATATGGCAAGGTTGGATGCTTGAAGGATGCCAATAAAGTTTTTGAAGAAATAAGCAAACCAGACGTCGTTTCTTGGAATGGAATGATATCTATATTAGCATTCAACGGGCATATCTCCTCTGCTCTTGCTGCCTTTGACAATATGAGATTAGCTGGCCTAGAGCCCGATTCAATCACATTCCTATCAATACTTTCAGCTTGCAGTCAAGGTGGTTTGGTTGATTTTGGAATGCACTACTTTCATTCTATGAAAGCAACCCATAAAATAGAGCCAGAATTGGATCATTATGCTTGTATAAT
TGATCTCCTAGGCCGCGTTGGACAACTAGAGAACGCAATGGAAATCGTAGAATCCATGCCATATGAGGCAGATGCTAAAATCTACAAGACATTGTTGAAAGCCTGCAATTTCCATGGGAACATGCTGCTTGGAGAAGATGTGGCAATAAGAGGACTTCAACTTAACCCAAACGATTCATCTTTCTATTTGCTGCTGGCCAACTTGTACGATGGATACAACCGACAAGATTTAAGTGCAAAAACTCGTAAGCTGATGCGAGATCGTGGAGTGAGGAAGAGTCCTGGCCAAAGTTGGATAGAATTACATAGCAAGATTCATCTCTTTGTCACAGGAGAGAGAACACATCCTCAAATCAATGACATCCAAGAAAAGTTAGAATTCCTCAGAGCTGAGTTCAAGAGTAGGGGGTTTATGTATCATGAAGATGAAAATTCATCCCATCATAGTGAAAAATTGGCTCTTGCATTTGGTCTTGTTAATTTGCCACCCACAGCTGTTGTACGAATAATGAAGAACATAAGCATTTGCAGAGAATGCCATGACTTCATATTGCTAGTAACAAAGGTGGTAGAGAGGGAAATAATTGTGAGAGATGGGCGCGGGCTCCATGTGCTAAAAAATGGAAGCTGCTCTTGCAGCCATTACTCATGA

Protein sequence

MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS
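
The protein sequence above can be related to the CDS by codon-wise translation. The following is a minimal sketch, assuming the standard genetic code (the record does not name a translation table); `translate` and `CODON_TABLE` are illustrative names, and only the first ten codons and the last six codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

start_of_cds = "ATGCTATGTAGAGCAGTCCCCAAACTTCTC"  # first 30 nt of the CDS
end_of_cds = "TGCAGCCATTACTCATGA"                # last 18 nt of the CDS

print(translate(start_of_cds))  # MLCRAVPKLL
print(translate(end_of_cds))    # CSHYS*
```

Both fragments agree with the start (MLCRAVPKLL) and end (CSHYS) of the listed protein, with TGA as the terminating codon.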
BLAST of Bhi01G001040 vs. TAIR10
Match: AT5G52850.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 865.1 bits (2234), Expect = 3.6e-251
Identity = 425/872 (48.74%), Positives = 598/872 (68.58%), Query Frame = 0

Query: 9   LLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFG 68
           L   NEL  L++SC +++S C S S + G+ +H P+IK GL  NL L NNLL+LY K  G
Sbjct: 14  LSRTNELGNLQKSCIRILSFCESNSSRIGLHIHCPVIKFGLLENLDLCNNLLSLYLKTDG 73

Query: 69  LKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRS 128
           +  AR LFDEM  R V +WT M +A+ +++ +  A  LF+ M+  G  PNEFT S+++RS
Sbjct: 74  IWNARKLFDEMSHRTVFAWTVMISAFTKSQEFASALSLFEEMMASGTHPNEFTFSSVVRS 133

Query: 129 CSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTW 188
           C+   ++  G  VHG VIK GFE   V+G +L +LY+KC   +EA E+F ++ + DT++W
Sbjct: 134 CAGLRDISYGGRVHGSVIKTGFEGNSVVGSSLSDLYSKCGQFKEACELFSSLQNADTISW 193

Query: 189 TVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITL 248
           T+MISSLV A+KW EALQ Y  M+ +GV PNEFTF KLL  ++FLGL++GK +H ++I  
Sbjct: 194 TMMISSLVGARKWREALQFYSEMVKAGVPPNEFTFVKLLGASSFLGLEFGKTIHSNIIVR 253

Query: 249 GVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAF 308
           G+ LNVVLKT+LVD YS++ ++EDA++V N + E+DVFLWTS++S F +NL+ KEA+  F
Sbjct: 254 GIPLNVVLKTSLVDFYSQFSKMEDAVRVLNSSGEQDVFLWTSVVSGFVRNLRAKEAVGTF 313

Query: 309 QEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKC 368
            EMR  G+ PN+FTYS+ LS C+++ SL  GKQIH Q I  G E     G+AL++MYMKC
Sbjct: 314 LEMRSLGLQPNNFTYSAILSLCSAVRSLDFGKQIHSQTIKVGFEDSTDVGNALVDMYMKC 373

Query: 369 SNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSI 428
           S    +A RVF  + SP+V+ WT+LI GL +HG  QDC+   ++M    V+PN  TLS +
Sbjct: 374 SASEVEASRVFGAMVSPNVVSWTTLILGLVDHGFVQDCFGLLMEMVKREVEPNVVTLSGV 433

Query: 429 LGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDA 488
           L A S  +   +    H Y+L+     ++VVGN+LVDAYA S KVD A  VI +M  RD 
Sbjct: 434 LRACSKLRHVRRVLEIHAYLLRRHVDGEMVVGNSLVDAYASSRKVDYAWNVIRSMKRRDN 493

Query: 489 ITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCY 548
           ITYTSL TR N++G HEMAL +I+ M  D + MD++SL   +SA   LG +ETGK LHCY
Sbjct: 494 ITYTSLVTRFNELGKHEMALSVINYMYGDGIRMDQLSLPGFISASANLGALETGKHLHCY 553

Query: 549 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 608
           S+K G     SV NSL+D+Y K G L+DA KVFEEI+ PDVVSWNG++S LA NG ISSA
Sbjct: 554 SVKSGFSGAASVLNSLVDMYSKCGSLEDAKKVFEEIATPDVVSWNGLVSGLASNGFISSA 613

Query: 609 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 668
           L+AF+ MR+   EPDS+TFL +LSACS G L D G+ YF  MK  + IEP+++HY  ++ 
Sbjct: 614 LSAFEEMRMKETEPDSVTFLILLSACSNGRLTDLGLEYFQVMKKIYNIEPQVEHYVHLVG 673

Query: 669 LLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSF 728
           +LGR G+LE A  +VE+M  + +A I+KTLL+AC + GN+ LGED+A +GL L P+D + 
Sbjct: 674 ILGRAGRLEEATGVVETMHLKPNAMIFKTLLRACRYRGNLSLGEDMANKGLALAPSDPAL 733

Query: 729 YLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTH-PQIND 788
           Y+LLA+LYD   + +L+ KTR LM ++ + K  G+S +E+  K+H FV+ + T   + N 
Sbjct: 734 YILLADLYDESGKPELAQKTRNLMTEKRLSKKLGKSTVEVQGKVHSFVSEDVTRVDKTNG 793

Query: 789 IQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECH 848
           I  ++E ++ E K  G  Y  +EN+S HS K A+ +G +   P A V ++KN  +C++CH
Sbjct: 794 IYAEIESIKEEIKRFGSPYRGNENASFHSAKQAVVYGFIYASPEAPVHVVKNKILCKDCH 853

Query: 849 DFILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           +F+ ++T++V+++I VRDG  +H+ KNG CSC
Sbjct: 854 EFVSILTRLVDKKITVRDGNQVHIFKNGECSC 885

BLAST of Bhi01G001040 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 501.5 bits (1290), Expect = 1.0e-141
Identity = 279/879 (31.74%), Positives = 467/879 (53.13%), Query Frame = 0

Query: 19   EESCSQLISICNSKSLKEGVC--VHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLF 78
            E + S ++  C   S+   V   +H+ I+  GL  +  + N L+ LY++   +  AR +F
Sbjct: 186  EGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVF 245

Query: 79   DEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELE 138
            D +  +D  SW  M +   +N    EA  LF  M  LG  P  +  S+++ +C +   LE
Sbjct: 246  DGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLE 305

Query: 139  LGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLV 198
            +G  +HG V+K GF +   +   L++LY        A  +F NM   D VT+  +I+ L 
Sbjct: 306  IGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLS 365

Query: 199  QAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKY-GKLLHCHMITLGVNLNVV 258
            Q    ++A++L+  M   G+ P+  T   L+   +  G  + G+ LH +   LG   N  
Sbjct: 366  QCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNK 425

Query: 259  LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 318
            ++ AL+++Y++  ++E A+    +T  ++V LW  ++  +     ++ +   F++M++  
Sbjct: 426  IEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEE 485

Query: 319  ILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDA 378
            I+PN +TY S L  C  +  L+LG+QIH Q+I    + +    S LI+MY K    +D A
Sbjct: 486  IVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGK-LDTA 545

Query: 379  LRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSA 438
              +        V+ WT++I+G  ++  +      F  M   G++ +   L++ + A +  
Sbjct: 546  WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 605

Query: 439  KSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLA 498
            ++  +    H          D+   NALV  Y+R  K++++          D I + +L 
Sbjct: 606  QALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALV 665

Query: 499  TRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLD 558
            +   Q G++E AL++   M  + ++ +  +  S V A +    ++ GKQ+H    K G D
Sbjct: 666  SGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYD 725

Query: 559  NTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNM 618
            +   V N+L+ +Y K G + DA K F E+S  + VSWN +I+  + +G  S AL +FD M
Sbjct: 726  SETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQM 785

Query: 619  RLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQ 678
              + + P+ +T + +LSACS  GLVD G+ YF SM + + + P+ +HY C++D+L R G 
Sbjct: 786  IHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGL 845

Query: 679  LENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANL 738
            L  A E ++ MP + DA +++TLL AC  H NM +GE  A   L+L P DS+ Y+LL+NL
Sbjct: 846  LSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNL 905

Query: 739  YDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFL 798
            Y    + D    TR+ M+++GV+K PGQSWIE+ + IH F  G++ HP  ++I E  + L
Sbjct: 906  YAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDL 965

Query: 799  RAEFKSRGF----------MYHEDENS--SHHSEKLALAFGLVNLPPTAVVRIMKNISIC 858
                   G+          + HE ++     HSEKLA++FGL++LP T  + +MKN+ +C
Sbjct: 966  TKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVC 1025

Query: 859  RECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
             +CH +I  V+KV  REIIVRD    H  + G+CSC  Y
Sbjct: 1026 NDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDY 1063

BLAST of Bhi01G001040 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 488.8 bits (1257), Expect = 7.0e-138
Identity = 266/827 (32.16%), Positives = 460/827 (55.62%), Query Frame = 0

Query: 87  WTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYVI 146
           W  +  + VR+    EA   +  M+ LG  P+ +    L+++ ++  ++ELG  +H +V 
Sbjct: 65  WIDLLRSKVRSNLLREAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVY 124

Query: 147 KGGFETKPV-LGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           K G+    V +  TL+NLY KC      Y+VF  + + + V+W  +ISSL   +KW+ AL
Sbjct: 125 KFGYGVDSVTVANTLVNLYRKCGDFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMAL 184

Query: 207 QLYITMMNSGVTPNEFTFTKLLATTNFL----GLKYGKLLHCHMITLGVNLNVVLKTALV 266
           + +  M++  V P+ FT   ++   + L    GL  GK +H + +  G  LN  +   LV
Sbjct: 185 EAFRCMLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKG-ELNSFIINTLV 244

Query: 267 DVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSF 326
            +Y +  +L  +  +      +D+  W +++S   QN ++ EA+   +EM + G+ P+ F
Sbjct: 245 AMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEF 304

Query: 327 TYSSALSACTSIPSLKLGKQIHLQVILAG-LEADVCAGSALINMYMKCSNFIDDALRVFR 386
           T SS L AC+ +  L+ GK++H   +  G L+ +   GSAL++MY  C   +    RVF 
Sbjct: 305 TISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVL-SGRRVFD 364

Query: 387 TITSPSVICWTSLISGLAEHGCEQDCYRYFLDM-QAAGVQPNAFTLSSILGASSSAKSQN 446
            +    +  W ++I+G +++  +++    F+ M ++AG+  N+ T++ ++ A   + + +
Sbjct: 365 GMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFS 424

Query: 447 QTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLN 506
           +    HG+++K     D  V N L+D Y+R  K+D A R+   M  RD +T+ ++ T   
Sbjct: 425 RKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYV 484

Query: 507 QMGDHEMALKIIDSMR---------ADNVEM--DEISLTSLVSALTGLGIVETGKQLHCY 566
               HE AL ++  M+         A  V +  + I+L +++ +   L  +  GK++H Y
Sbjct: 485 FSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAY 544

Query: 567 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 626
           ++K  L    +V ++L+D+Y K GCL+ + KVF++I + +V++WN +I     +G+   A
Sbjct: 545 AIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEA 604

Query: 627 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 686
           +     M + G++P+ +TF+S+ +ACS  G+VD G+  F+ MK  + +EP  DHYAC++D
Sbjct: 605 IDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVD 664

Query: 687 LLGRVGQLENAMEIVESMPYEAD-AKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSS 746
           LLGR G+++ A +++  MP + + A  + +LL A   H N+ +GE  A   +QL PN +S
Sbjct: 665 LLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVAS 724

Query: 747 FYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQIND 806
            Y+LLAN+Y      D + + R+ M+++GVRK PG SWIE   ++H FV G+ +HPQ   
Sbjct: 725 HYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEK 784

Query: 807 IQEKLEFLRAEFKSRGFM---------YHEDENS---SHHSEKLALAFGLVNLPPTAVVR 866
           +   LE L    +  G++           EDE       HSEKLA+AFG++N  P  ++R
Sbjct: 785 LSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIR 844

Query: 867 IMKNISICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           + KN+ +C +CH     ++K+V+REII+RD R  H  KNG+CSC  Y
Sbjct: 845 VAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDY 889

BLAST of Bhi01G001040 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 462.6 bits (1189), Expect = 5.4e-130
Identity = 276/940 (29.36%), Positives = 466/940 (49.57%), Query Frame = 0

Query: 31  SKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEMPDRDVVSWTTM 90
           S  L  G C H+ I+    +   +L NNL+++Y+K   L  AR +FD+MPDRD+VSW ++
Sbjct: 52  SSDLMLGKCTHARILTFEENPERFLINNLISMYSKCGSLTYARRVFDKMPDRDLVSWNSI 111

Query: 91  QAAYVRNRSYI-----EAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 150
            AAY ++   +     +AF LF ++       +  TLS +++ C  +G +      HGY 
Sbjct: 112 LAAYAQSSECVVENIQQAFLLFRILRQDVVYTSRMTLSPMLKLCLHSGYVWASESFHGYA 171

Query: 151 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 210
            K G +    +   L+N+Y K    +E   +F  M   D V W +M+ + ++    +EA+
Sbjct: 172 CKIGLDGDEFVAGALVNIYLKFGKVKEGKVLFEEMPYRDVVLWNLMLKAYLEMGFKEEAI 231

Query: 211 QLYITMMNSGVTPNEF-------------------------------------------- 270
            L     +SG+ PNE                                             
Sbjct: 232 DLSSAFHSSGLNPNEITLRLLARISGDDSDAGQVKSFANGNDASSVSEIIFRNKGLSEYL 291

Query: 271 -------------------------TFTKLLAT-TNFLGLKYGKLLHCHMITLGVNLNVV 330
                                    TF  +LAT      L  G+ +HC  + LG++L + 
Sbjct: 292 HSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLALGQQVHCMALKLGLDLMLT 351

Query: 331 LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 390
           +  +L+++Y + ++   A  V +   E+D+  W S+I+   QN    EA+  F ++   G
Sbjct: 352 VSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG 411

Query: 391 ILPNSFTYSSALSACTSIP-SLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDD 450
           + P+ +T +S L A +S+P  L L KQ+H+  I     +D    +ALI+ Y + +  + +
Sbjct: 412 LKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSR-NRCMKE 471

Query: 451 ALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSS 510
           A  +F    +  ++ W ++++G  +        + F  M   G + + FTL+++      
Sbjct: 472 AEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGF 531

Query: 511 AKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSL 570
             + NQ    H Y +K     D+ V + ++D Y +   +  A     ++   D + +T++
Sbjct: 532 LFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTM 591

Query: 571 ATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGL 630
            +   + G+ E A  +   MR   V  DE ++ +L  A + L  +E G+Q+H  +LK   
Sbjct: 592 ISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNC 651

Query: 631 DNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDN 690
            N   V  SL+D+Y K G + DA  +F+ I   ++ +WN M+  LA +G     L  F  
Sbjct: 652 TNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQ 711

Query: 691 MRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVG 750
           M+  G++PD +TF+ +LSACS  GLV     +  SM   + I+PE++HY+C+ D LGR G
Sbjct: 712 MKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAG 771

Query: 751 QLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLAN 810
            ++ A  ++ESM  EA A +Y+TLL AC   G+   G+ VA + L+L P DSS Y+LL+N
Sbjct: 772 LVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSN 831

Query: 811 LYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEF 870
           +Y   ++ D     R +M+   V+K PG SWIE+ +KIH+FV  +R++ Q   I  K++ 
Sbjct: 832 MYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKD 891

Query: 871 LRAEFKSRGFM---------YHEDENSS---HHSEKLALAFGLVNLPPTAVVRIMKNISI 883
           +  + K  G++           E+E      +HSEKLA+AFGL++ PP+  +R++KN+ +
Sbjct: 892 MIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRV 951

BLAST of Bhi01G001040 vs. TAIR10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 461.5 bits (1186), Expect = 1.2e-129
Identity = 260/868 (29.95%), Positives = 457/868 (52.65%), Query Frame = 0

Query: 27  SICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEM-PDRDVV 86
           ++ +S +L E   +H+ +I LGL  + + S  L+  Y+       + ++F  + P ++V 
Sbjct: 13  ALSSSSNLNELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVY 72

Query: 87  SWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 146
            W ++  A+ +N  + EA E +  +      P+++T  ++I++C+   + E+G  V+  +
Sbjct: 73  LWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQI 132

Query: 147 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           +  GFE+   +G  L+++Y++      A +VF  M   D V                   
Sbjct: 133 LDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVXXXXXXXXXXXXXXXXXXX 192

Query: 207 QLYITMMNSGVTPNEFTFTKLL-ATTNFLGLKYGKLLHCHMITLGVNLNVVLKTALVDVY 266
                + NS + P+ FT + +L A  N L +K G+ LH   +  GVN  VV+   LV +Y
Sbjct: 193 XXXHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMY 252

Query: 267 SRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSFTYS 326
            +++   DA +V ++   +D   + ++I  + +   V+E++  F E  +    P+  T S
Sbjct: 253 LKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVS 312

Query: 327 SALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDALRVFRTITS 386
           S L AC  +  L L K I+  ++ AG   +    + LI++Y KC + I  A  VF ++  
Sbjct: 313 SVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMI-TARDVFNSMEC 372

Query: 387 PSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSAKSQNQTSMF 446
              + W S+ISG  + G   +  + F  M     Q +  T   ++  S+           
Sbjct: 373 KDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGL 432

Query: 447 HGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLNQMGDH 506
           H   +K     D+ V NAL+D YA+  +V D+ ++ S+M   D +T+ ++ +   + GD 
Sbjct: 433 HSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDF 492

Query: 507 EMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLDNTCSVKNSL 566
              L++   MR   V  D  +    +     L     GK++HC  L++G ++   + N+L
Sbjct: 493 ATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNAL 552

Query: 567 MDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNMRLAGLEPDS 626
           +++Y K GCL+++++VFE +S+ DVV+W GMI      G    AL  F +M  +G+ PDS
Sbjct: 553 IEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDS 612

Query: 627 ITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQLENAMEIVE 686
           + F++I+ ACS  GLVD G+  F  MK  +KI+P ++HYAC++DLL R  ++  A E ++
Sbjct: 613 VVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQ 672

Query: 687 SMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANLYDGYNRQDL 746
           +MP + DA I+ ++L+AC   G+M   E V+ R ++LNP+D  + +L +N Y    + D 
Sbjct: 673 AMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDK 732

Query: 747 SAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFLRAEFKSRGF 806
            +  RK ++D+ + K+PG SWIE+   +H+F +G+ + PQ   I + LE L +     G+
Sbjct: 733 VSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILYSLMAKEGY 792

Query: 807 MYHEDENSSH-------------HSEKLALAFGLVNLPPTAVVRIMKNISICRECHDFIL 866
           +    E S +             HSE+LA+AFGL+N  P   +++MKN+ +C +CH+   
Sbjct: 793 IPDPREVSQNLEEEEEKRRLICGHSERLAIAFGLLNTEPGTPLQVMKNLRVCGDCHEVTK 852

Query: 867 LVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           L++K+V REI+VRD    H+ K+G+CSC
Sbjct: 853 LISKIVGREILVRDANRFHLFKDGTCSC 878

BLAST of Bhi01G001040 vs. Swiss-Prot
Match: sp|Q9FLX6|PP430_ARATH (Pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H31 PE=2 SV=1)

HSP 1 Score: 865.1 bits (2234), Expect = 6.5e-250
Identity = 425/872 (48.74%), Positives = 598/872 (68.58%), Query Frame = 0

Query: 9   LLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFG 68
           L   NEL  L++SC +++S C S S + G+ +H P+IK GL  NL L NNLL+LY K  G
Sbjct: 14  LSRTNELGNLQKSCIRILSFCESNSSRIGLHIHCPVIKFGLLENLDLCNNLLSLYLKTDG 73

Query: 69  LKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRS 128
           +  AR LFDEM  R V +WT M +A+ +++ +  A  LF+ M+  G  PNEFT S+++RS
Sbjct: 74  IWNARKLFDEMSHRTVFAWTVMISAFTKSQEFASALSLFEEMMASGTHPNEFTFSSVVRS 133

Query: 129 CSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTW 188
           C+   ++  G  VHG VIK GFE   V+G +L +LY+KC   +EA E+F ++ + DT++W
Sbjct: 134 CAGLRDISYGGRVHGSVIKTGFEGNSVVGSSLSDLYSKCGQFKEACELFSSLQNADTISW 193

Query: 189 TVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITL 248
           T+MISSLV A+KW EALQ Y  M+ +GV PNEFTF KLL  ++FLGL++GK +H ++I  
Sbjct: 194 TMMISSLVGARKWREALQFYSEMVKAGVPPNEFTFVKLLGASSFLGLEFGKTIHSNIIVR 253

Query: 249 GVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAF 308
           G+ LNVVLKT+LVD YS++ ++EDA++V N + E+DVFLWTS++S F +NL+ KEA+  F
Sbjct: 254 GIPLNVVLKTSLVDFYSQFSKMEDAVRVLNSSGEQDVFLWTSVVSGFVRNLRAKEAVGTF 313

Query: 309 QEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKC 368
            EMR  G+ PN+FTYS+ LS C+++ SL  GKQIH Q I  G E     G+AL++MYMKC
Sbjct: 314 LEMRSLGLQPNNFTYSAILSLCSAVRSLDFGKQIHSQTIKVGFEDSTDVGNALVDMYMKC 373

Query: 369 SNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSI 428
           S    +A RVF  + SP+V+ WT+LI GL +HG  QDC+   ++M    V+PN  TLS +
Sbjct: 374 SASEVEASRVFGAMVSPNVVSWTTLILGLVDHGFVQDCFGLLMEMVKREVEPNVVTLSGV 433

Query: 429 LGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDA 488
           L A S  +   +    H Y+L+     ++VVGN+LVDAYA S KVD A  VI +M  RD 
Sbjct: 434 LRACSKLRHVRRVLEIHAYLLRRHVDGEMVVGNSLVDAYASSRKVDYAWNVIRSMKRRDN 493

Query: 489 ITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCY 548
           ITYTSL TR N++G HEMAL +I+ M  D + MD++SL   +SA   LG +ETGK LHCY
Sbjct: 494 ITYTSLVTRFNELGKHEMALSVINYMYGDGIRMDQLSLPGFISASANLGALETGKHLHCY 553

Query: 549 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 608
           S+K G     SV NSL+D+Y K G L+DA KVFEEI+ PDVVSWNG++S LA NG ISSA
Sbjct: 554 SVKSGFSGAASVLNSLVDMYSKCGSLEDAKKVFEEIATPDVVSWNGLVSGLASNGFISSA 613

Query: 609 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 668
           L+AF+ MR+   EPDS+TFL +LSACS G L D G+ YF  MK  + IEP+++HY  ++ 
Sbjct: 614 LSAFEEMRMKETEPDSVTFLILLSACSNGRLTDLGLEYFQVMKKIYNIEPQVEHYVHLVG 673

Query: 669 LLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSF 728
           +LGR G+LE A  +VE+M  + +A I+KTLL+AC + GN+ LGED+A +GL L P+D + 
Sbjct: 674 ILGRAGRLEEATGVVETMHLKPNAMIFKTLLRACRYRGNLSLGEDMANKGLALAPSDPAL 733

Query: 729 YLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTH-PQIND 788
           Y+LLA+LYD   + +L+ KTR LM ++ + K  G+S +E+  K+H FV+ + T   + N 
Sbjct: 734 YILLADLYDESGKPELAQKTRNLMTEKRLSKKLGKSTVEVQGKVHSFVSEDVTRVDKTNG 793

Query: 789 IQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECH 848
           I  ++E ++ E K  G  Y  +EN+S HS K A+ +G +   P A V ++KN  +C++CH
Sbjct: 794 IYAEIESIKEEIKRFGSPYRGNENASFHSAKQAVVYGFIYASPEAPVHVVKNKILCKDCH 853

Query: 849 DFILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           +F+ ++T++V+++I VRDG  +H+ KNG CSC
Sbjct: 854 EFVSILTRLVDKKITVRDGNQVHIFKNGECSC 885

BLAST of Bhi01G001040 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 501.5 bits (1290), Expect = 1.9e-140
Identity = 279/879 (31.74%), Positives = 467/879 (53.13%), Query Frame = 0

Query: 19   EESCSQLISICNSKSLKEGVC--VHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLF 78
            E + S ++  C   S+   V   +H+ I+  GL  +  + N L+ LY++   +  AR +F
Sbjct: 186  EGTFSGVLEACRGGSVAFDVVEQIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVF 245

Query: 79   DEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELE 138
            D +  +D  SW  M +   +N    EA  LF  M  LG  P  +  S+++ +C +   LE
Sbjct: 246  DGLRLKDHSSWVAMISGLSKNECEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLE 305

Query: 139  LGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLV 198
            +G  +HG V+K GF +   +   L++LY        A  +F NM   D VT+  +I+ L 
Sbjct: 306  IGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLS 365

Query: 199  QAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKY-GKLLHCHMITLGVNLNVV 258
            Q    ++A++L+  M   G+ P+  T   L+   +  G  + G+ LH +   LG   N  
Sbjct: 366  QCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNK 425

Query: 259  LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 318
            ++ AL+++Y++  ++E A+    +T  ++V LW  ++  +     ++ +   F++M++  
Sbjct: 426  IEGALLNLYAKCADIETALDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEE 485

Query: 319  ILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDA 378
            I+PN +TY S L  C  +  L+LG+QIH Q+I    + +    S LI+MY K    +D A
Sbjct: 486  IVPNQYTYPSILKTCIRLGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGK-LDTA 545

Query: 379  LRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSA 438
              +        V+ WT++I+G  ++  +      F  M   G++ +   L++ + A +  
Sbjct: 546  WDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGL 605

Query: 439  KSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLA 498
            ++  +    H          D+   NALV  Y+R  K++++          D I + +L 
Sbjct: 606  QALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALV 665

Query: 499  TRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLD 558
            +   Q G++E AL++   M  + ++ +  +  S V A +    ++ GKQ+H    K G D
Sbjct: 666  SGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYD 725

Query: 559  NTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNM 618
            +   V N+L+ +Y K G + DA K F E+S  + VSWN +I+  + +G  S AL +FD M
Sbjct: 726  SETEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQM 785

Query: 619  RLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQ 678
              + + P+ +T + +LSACS  GLVD G+ YF SM + + + P+ +HY C++D+L R G 
Sbjct: 786  IHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGL 845

Query: 679  LENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANL 738
            L  A E ++ MP + DA +++TLL AC  H NM +GE  A   L+L P DS+ Y+LL+NL
Sbjct: 846  LSRAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNL 905

Query: 739  YDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFL 798
            Y    + D    TR+ M+++GV+K PGQSWIE+ + IH F  G++ HP  ++I E  + L
Sbjct: 906  YAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDL 965

Query: 799  RAEFKSRGF----------MYHEDENS--SHHSEKLALAFGLVNLPPTAVVRIMKNISIC 858
                   G+          + HE ++     HSEKLA++FGL++LP T  + +MKN+ +C
Sbjct: 966  TKRASEIGYVQDCFSLLNELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVC 1025

Query: 859  RECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
             +CH +I  V+KV  REIIVRD    H  + G+CSC  Y
Sbjct: 1026 NDCHAWIKFVSKVSNREIIVRDAYRFHHFEGGACSCKDY 1063

BLAST of Bhi01G001040 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 488.8 bits (1257), Expect = 1.3e-136
Identity = 266/827 (32.16%), Positives = 460/827 (55.62%), Query Frame = 0

Query: 87  WTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYVI 146
           W  +  + VR+    EA   +  M+ LG  P+ +    L+++ ++  ++ELG  +H +V 
Sbjct: 65  WIDLLRSKVRSNLLREAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVY 124

Query: 147 KGGFETKPV-LGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           K G+    V +  TL+NLY KC      Y+VF  + + + V+W  +ISSL   +KW+ AL
Sbjct: 125 KFGYGVDSVTVANTLVNLYRKCGDFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMAL 184

Query: 207 QLYITMMNSGVTPNEFTFTKLLATTNFL----GLKYGKLLHCHMITLGVNLNVVLKTALV 266
           + +  M++  V P+ FT   ++   + L    GL  GK +H + +  G  LN  +   LV
Sbjct: 185 EAFRCMLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKG-ELNSFIINTLV 244

Query: 267 DVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSF 326
            +Y +  +L  +  +      +D+  W +++S   QN ++ EA+   +EM + G+ P+ F
Sbjct: 245 AMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEF 304

Query: 327 TYSSALSACTSIPSLKLGKQIHLQVILAG-LEADVCAGSALINMYMKCSNFIDDALRVFR 386
           T SS L AC+ +  L+ GK++H   +  G L+ +   GSAL++MY  C   +    RVF 
Sbjct: 305 TISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVL-SGRRVFD 364

Query: 387 TITSPSVICWTSLISGLAEHGCEQDCYRYFLDM-QAAGVQPNAFTLSSILGASSSAKSQN 446
            +    +  W ++I+G +++  +++    F+ M ++AG+  N+ T++ ++ A   + + +
Sbjct: 365 GMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFS 424

Query: 447 QTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLN 506
           +    HG+++K     D  V N L+D Y+R  K+D A R+   M  RD +T+ ++ T   
Sbjct: 425 RKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYV 484

Query: 507 QMGDHEMALKIIDSMR---------ADNVEM--DEISLTSLVSALTGLGIVETGKQLHCY 566
               HE AL ++  M+         A  V +  + I+L +++ +   L  +  GK++H Y
Sbjct: 485 FSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAY 544

Query: 567 SLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSA 626
           ++K  L    +V ++L+D+Y K GCL+ + KVF++I + +V++WN +I     +G+   A
Sbjct: 545 AIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEA 604

Query: 627 LAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIID 686
           +     M + G++P+ +TF+S+ +ACS  G+VD G+  F+ MK  + +EP  DHYAC++D
Sbjct: 605 IDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVD 664

Query: 687 LLGRVGQLENAMEIVESMPYEAD-AKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSS 746
           LLGR G+++ A +++  MP + + A  + +LL A   H N+ +GE  A   +QL PN +S
Sbjct: 665 LLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVAS 724

Query: 747 FYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQIND 806
            Y+LLAN+Y      D + + R+ M+++GVRK PG SWIE   ++H FV G+ +HPQ   
Sbjct: 725 HYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEK 784

Query: 807 IQEKLEFLRAEFKSRGFM---------YHEDENS---SHHSEKLALAFGLVNLPPTAVVR 866
           +   LE L    +  G++           EDE       HSEKLA+AFG++N  P  ++R
Sbjct: 785 LSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIR 844

Query: 867 IMKNISICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           + KN+ +C +CH     ++K+V+REII+RD R  H  KNG+CSC  Y
Sbjct: 845 VAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDY 889

BLAST of Bhi01G001040 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 9.7e-129
Identity = 276/940 (29.36%), Positives = 466/940 (49.57%), Query Frame = 0

Query: 31  SKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEMPDRDVVSWTTM 90
           S  L  G C H+ I+    +   +L NNL+++Y+K   L  AR +FD+MPDRD+VSW ++
Sbjct: 52  SSDLMLGKCTHARILTFEENPERFLINNLISMYSKCGSLTYARRVFDKMPDRDLVSWNSI 111

Query: 91  QAAYVRNRSYI-----EAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 150
            AAY ++   +     +AF LF ++       +  TLS +++ C  +G +      HGY 
Sbjct: 112 LAAYAQSSECVVENIQQAFLLFRILRQDVVYTSRMTLSPMLKLCLHSGYVWASESFHGYA 171

Query: 151 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 210
            K G +    +   L+N+Y K    +E   +F  M   D V W +M+ + ++    +EA+
Sbjct: 172 CKIGLDGDEFVAGALVNIYLKFGKVKEGKVLFEEMPYRDVVLWNLMLKAYLEMGFKEEAI 231

Query: 211 QLYITMMNSGVTPNEF-------------------------------------------- 270
            L     +SG+ PNE                                             
Sbjct: 232 DLSSAFHSSGLNPNEITLRLLARISGDDSDAGQVKSFANGNDASSVSEIIFRNKGLSEYL 291

Query: 271 -------------------------TFTKLLAT-TNFLGLKYGKLLHCHMITLGVNLNVV 330
                                    TF  +LAT      L  G+ +HC  + LG++L + 
Sbjct: 292 HSGQYSALLKCFADMVESDVECDQVTFILMLATAVKVDSLALGQQVHCMALKLGLDLMLT 351

Query: 331 LKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSG 390
           +  +L+++Y + ++   A  V +   E+D+  W S+I+   QN    EA+  F ++   G
Sbjct: 352 VSNSLINMYCKLRKFGFARTVFDNMSERDLISWNSVIAGIAQNGLEVEAVCLFMQLLRCG 411

Query: 391 ILPNSFTYSSALSACTSIP-SLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDD 450
           + P+ +T +S L A +S+P  L L KQ+H+  I     +D    +ALI+ Y + +  + +
Sbjct: 412 LKPDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSR-NRCMKE 471

Query: 451 ALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSS 510
           A  +F    +  ++ W ++++G  +        + F  M   G + + FTL+++      
Sbjct: 472 AEILFER-HNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGF 531

Query: 511 AKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSL 570
             + NQ    H Y +K     D+ V + ++D Y +   +  A     ++   D + +T++
Sbjct: 532 LFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTM 591

Query: 571 ATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGL 630
            +   + G+ E A  +   MR   V  DE ++ +L  A + L  +E G+Q+H  +LK   
Sbjct: 592 ISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNC 651

Query: 631 DNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDN 690
            N   V  SL+D+Y K G + DA  +F+ I   ++ +WN M+  LA +G     L  F  
Sbjct: 652 TNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQ 711

Query: 691 MRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVG 750
           M+  G++PD +TF+ +LSACS  GLV     +  SM   + I+PE++HY+C+ D LGR G
Sbjct: 712 MKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAG 771

Query: 751 QLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLAN 810
            ++ A  ++ESM  EA A +Y+TLL AC   G+   G+ VA + L+L P DSS Y+LL+N
Sbjct: 772 LVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSN 831

Query: 811 LYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEF 870
           +Y   ++ D     R +M+   V+K PG SWIE+ +KIH+FV  +R++ Q   I  K++ 
Sbjct: 832 MYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKD 891

Query: 871 LRAEFKSRGFM---------YHEDENSS---HHSEKLALAFGLVNLPPTAVVRIMKNISI 883
           +  + K  G++           E+E      +HSEKLA+AFGL++ PP+  +R++KN+ +
Sbjct: 892 MIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRV 951

BLAST of Bhi01G001040 vs. Swiss-Prot
Match: sp|Q9SS60|PP210_ARATH (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 461.5 bits (1186), Expect = 2.2e-128
Identity = 260/868 (29.95%), Positives = 457/868 (52.65%), Query Frame = 0

Query: 27  SICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGLKQARNLFDEM-PDRDVV 86
           ++ +S +L E   +H+ +I LGL  + + S  L+  Y+       + ++F  + P ++V 
Sbjct: 13  ALSSSSNLNELRRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVSPAKNVY 72

Query: 87  SWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSCSETGELELGSCVHGYV 146
            W ++  A+ +N  + EA E +  +      P+++T  ++I++C+   + E+G  V+  +
Sbjct: 73  LWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQI 132

Query: 147 IKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWTVMISSLVQAQKWDEAL 206
           +  GFE+   +G  L+++Y++      A +VF  M   D V                   
Sbjct: 133 LDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVXXXXXXXXXXXXXXXXXXX 192

Query: 207 QLYITMMNSGVTPNEFTFTKLL-ATTNFLGLKYGKLLHCHMITLGVNLNVVLKTALVDVY 266
                + NS + P+ FT + +L A  N L +K G+ LH   +  GVN  VV+   LV +Y
Sbjct: 193 XXXHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMY 252

Query: 267 SRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQEMRMSGILPNSFTYS 326
            +++   DA +V ++   +D   + ++I  + +   V+E++  F E  +    P+  T S
Sbjct: 253 LKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVS 312

Query: 327 SALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCSNFIDDALRVFRTITS 386
           S L AC  +  L L K I+  ++ AG   +    + LI++Y KC + I  A  VF ++  
Sbjct: 313 SVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMI-TARDVFNSMEC 372

Query: 387 PSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSILGASSSAKSQNQTSMF 446
              + W S+ISG  + G   +  + F  M     Q +  T   ++  S+           
Sbjct: 373 KDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGL 432

Query: 447 HGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAITYTSLATRLNQMGDH 506
           H   +K     D+ V NAL+D YA+  +V D+ ++ S+M   D +T+ ++ +   + GD 
Sbjct: 433 HSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDF 492

Query: 507 EMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYSLKYGLDNTCSVKNSL 566
              L++   MR   V  D  +    +     L     GK++HC  L++G ++   + N+L
Sbjct: 493 ATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNAL 552

Query: 567 MDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSALAAFDNMRLAGLEPDS 626
           +++Y K GCL+++++VFE +S+ DVV+W GMI      G    AL  F +M  +G+ PDS
Sbjct: 553 IEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDS 612

Query: 627 ITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDLLGRVGQLENAMEIVE 686
           + F++I+ ACS  GLVD G+  F  MK  +KI+P ++HYAC++DLL R  ++  A E ++
Sbjct: 613 VVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQ 672

Query: 687 SMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFYLLLANLYDGYNRQDL 746
           +MP + DA I+ ++L+AC   G+M   E V+ R ++LNP+D  + +L +N Y    + D 
Sbjct: 673 AMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDK 732

Query: 747 SAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQEKLEFLRAEFKSRGF 806
            +  RK ++D+ + K+PG SWIE+   +H+F +G+ + PQ   I + LE L +     G+
Sbjct: 733 VSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILYSLMAKEGY 792

Query: 807 MYHEDENSSH-------------HSEKLALAFGLVNLPPTAVVRIMKNISICRECHDFIL 866
           +    E S +             HSE+LA+AFGL+N  P   +++MKN+ +C +CH+   
Sbjct: 793 IPDPREVSQNLEEEEEKRRLICGHSERLAIAFGLLNTEPGTPLQVMKNLRVCGDCHEVTK 852

Query: 867 LVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           L++K+V REI+VRD    H+ K+G+CSC
Sbjct: 853 LISKIVGREILVRDANRFHLFKDGTCSC 878

BLAST of Bhi01G001040 vs. TrEMBL
Match: tr|A0A2P5FGP2|A0A2P5FGP2_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_073370 PE=4 SV=1)

HSP 1 Score: 1045.0 bits (2701), Expect = 9.3e-302
Identity = 503/873 (57.62%), Positives = 657/873 (75.26%), Query Frame = 0

Query: 10  LNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGL 69
           LNR+E +R E+ C +++S CNS+SL++G+CVHSP+IKLGLHGNL+LSNNLL+LYAK FG 
Sbjct: 14  LNRSEAHRFEDVCLRIVSSCNSQSLEQGLCVHSPVIKLGLHGNLFLSNNLLSLYAKCFGA 73

Query: 70  KQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSC 129
           + A +LFDEMP RDVVSWT + +AY R+  + +A ELFD M+  G  PNEFT S+++RSC
Sbjct: 74  EHAHDLFDEMPYRDVVSWTGLVSAYSRSGKHGKALELFDSMIASGESPNEFTFSSVLRSC 133

Query: 130 SETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWT 189
           S  GE ++G+ +H Y+IK G ++   L  ++I+ YAKC  SEEA+ VFR MD  +TVTWT
Sbjct: 134 SAVGEFDMGTRIHNYMIKLGLDSNNFLLSSMIDFYAKCGCSEEAHNVFRGMDSGNTVTWT 193

Query: 190 VMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITLG 249
            MISSLVQAQKW  AL+ Y+ M+N+ V PNEFTF  +L  + +L L YGKLLH H+IT G
Sbjct: 194 TMISSLVQAQKWILALKHYVDMINAQVPPNEFTFAMILEASCYLDLDYGKLLHAHLITWG 253

Query: 250 VNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQ 309
           + L+++LKTALV++YS  Q ++DA+KV  QTPE DV LWTS+IS F  + KVKEA++A Q
Sbjct: 254 IRLSLILKTALVNMYSSSQRMKDALKVLRQTPEYDVVLWTSVISGFTHDSKVKEAVSALQ 313

Query: 310 EMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCS 369
           EM + G +PNSFTYS+ L AC+++  L LGKQIH + I  GLEADVC G+AL++MYMKCS
Sbjct: 314 EMEIFGFVPNSFTYSNLLKACSTVSLLDLGKQIHSRSIRTGLEADVCVGNALVDMYMKCS 373

Query: 370 NFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSIL 429
           N ++D  RVFR ITSP+VI WTSLI+G A+HG EQD + +FL M+A GVQPN+FTLS++L
Sbjct: 374 NLVEDGSRVFRGITSPNVISWTSLIAGFADHGFEQDSFHFFLQMRAVGVQPNSFTLSAVL 433

Query: 430 GASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAI 489
            A  + KS +QT   HGYI+K +   + VV NALVDAYA    V+D+ RV   M  RD I
Sbjct: 434 RACRTNKSLSQTLKIHGYIIKAKECSEAVVANALVDAYAALGMVNDSWRVTRKMKDRDII 493

Query: 490 TYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYS 549
           TYTSLATR+N++G+HEMAL +I  ++ DN+EMD  SL S +SA   L  +ETGKQLHCYS
Sbjct: 494 TYTSLATRMNKLGNHEMALNVIKHIKDDNIEMDGFSLASFLSASAALATMETGKQLHCYS 553

Query: 550 LKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSAL 609
           +K G  +  SV N L+DLY K G   D  + FEE+S PDVVSWN +IS LA NG+++SAL
Sbjct: 554 IKSGFSSCTSVSNGLVDLYWKCGYADDGYRAFEEMSDPDVVSWNQLISGLASNGYVNSAL 613

Query: 610 AAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDL 669
           +AFD+MRLAGL+PDSITFLS+L ACS+GGLVD G+ YFHSMK  + I P+LDHY C++DL
Sbjct: 614 SAFDDMRLAGLKPDSITFLSVLFACSRGGLVDLGVDYFHSMKNKYYITPQLDHYVCLVDL 673

Query: 670 LGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFY 729
           LGR GQLENAME++ +MP+  D  IYKTLL +C  HGN+ L ED+A RGL+L+P+D +FY
Sbjct: 674 LGRAGQLENAMEVIVNMPFNPDPLIYKTLLGSCKLHGNIPLAEDMARRGLELDPSDPAFY 733

Query: 730 LLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQ 789
           LLLANLYD     +L  KTR+LM D+GVRK+P QSWIE+ +K+H+F  G+R+HP  N+I 
Sbjct: 734 LLLANLYDEMGHSNLGKKTRQLMTDKGVRKNPSQSWIEIRNKVHIFNAGDRSHPLTNEIH 793

Query: 790 EKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECHDF 849
           +K+  L  E K RG+   + E SS+HSEKLA+ FGL+N P  A +RI KN+ IC ECHDF
Sbjct: 794 DKIVSLTNELKRRGYSLKDTEGSSYHSEKLAVGFGLLNTPSKAAIRISKNMRICSECHDF 853

Query: 850 ILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           I+L+T+ ++R++IVRDG  +H  K G CSC  +
Sbjct: 854 IMLLTRFIDRDVIVRDGNRIHAFKRGQCSCKGF 886

BLAST of Bhi01G001040 vs. TrEMBL
Match: tr|A0A2N9F2C3|A0A2N9F2C3_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13008 PE=4 SV=1)

HSP 1 Score: 1038.1 bits (2683), Expect = 1.1e-299
Identity = 518/872 (59.40%), Positives = 665/872 (76.26%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           ML + V K   RNEL R E+ C +++S+CNSKSLKEGVCVHSPIIK+GL  +LYL+NNLL
Sbjct: 1   MLSKTVSKTCYRNELNRFEDICLRVVSLCNSKSLKEGVCVHSPIIKMGLQDDLYLNNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAK FG+  A + FDEMP +DVVSWT + +AYVRN ++ +A  LFD M+     PNEF
Sbjct: 61  SLYAKCFGVDHAHHFFDEMPYKDVVSWTGILSAYVRNENHEQALRLFDSMIHNSQYPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS+++RSCS  GE + G+ V  Y+IK GF++  VL   LI+LY+KC  +EEAY+VF +M
Sbjct: 121 TLSSVLRSCSALGEFDYGTLVQAYMIKNGFDSNRVLASALIDLYSKCGCTEEAYKVFESM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D  DTV+WT MISSLVQ QKW +ALQLYI M+ + V PNEFTF KLLA + FLGL YGKL
Sbjct: 181 DGGDTVSWTTMISSLVQGQKWSQALQLYIRMIEARVHPNEFTFVKLLAASGFLGLSYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           +H HMI LG+ LNV+LKTALVD+YS+ Q + DA+KV+NQTPE+DVFLWT+IIS F Q +K
Sbjct: 241 VHAHMILLGIELNVILKTALVDMYSKCQRMGDAVKVSNQTPERDVFLWTAIISGFIQIMK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           V+EAIAA +EM MSG +PN+F+YS+ L+AC+SI SL+LG+Q+H  VI AGLE D+  G+A
Sbjct: 301 VREAIAALREMVMSGTVPNNFSYSAILNACSSISSLELGEQVHSWVIRAGLEDDIYVGNA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           L++MYMKCSN ID+AL VFR +T P+VI WTSLISG A+HG EQD ++ F +MQA G+ P
Sbjct: 361 LLDMYMKCSNLIDNALLVFRGMTLPNVITWTSLISGFAKHGFEQDSFQSFEEMQALGLTP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSS+LGA S+ KS +QT   HGYI+K +A  DIVVGNALVDAYA    VD+A  V+
Sbjct: 421 NSFTLSSVLGACSTMKSHSQTMKLHGYIIKTKADCDIVVGNALVDAYAGIGMVDEAWCVV 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
             + HRDAITYTSLATR+NQMG H+ AL II  M+ D+V+MD  S+   +SA  GLG ++
Sbjct: 481 RKIGHRDAITYTSLATRINQMGYHDGALDIIKYMKNDDVKMDGFSMAGFLSASAGLGSMK 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            G QLHC+S+K GL    SV NSL+DLYGK GC++DAN+ F+EI++ DV SWNG IS LA
Sbjct: 541 AGMQLHCFSVKSGLGCWLSVSNSLVDLYGKCGCIRDANRAFKEITERDVASWNGWISGLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NG+ISSAL+AF++MRLAG++PDS+TFL +L ACS GGLVD G+ YFHSM+ TH I P+L
Sbjct: 601 SNGYISSALSAFEDMRLAGVKPDSVTFLLVLIACSHGGLVDLGLEYFHSMRETHGIAPQL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDLLGR GQLE AM ++++MP+  DA IYKTLL A   HGN+ LGED+A +GL 
Sbjct: 661 DHYVCLIDLLGRAGQLEEAMGVIKTMPFRPDALIYKTLLSASKLHGNVPLGEDMARQGLN 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P+D +FY+LLANLYD   + DL  KTR+LMR+RG+ K+  QS +E+ ++IHLF + + 
Sbjct: 721 LDPSDPAFYILLANLYDNSGQSDLGEKTRRLMRERGLMKNTCQSRVEIRNQIHLFTSEDS 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQIN I EK+E L  EFK RG++Y +  + S+HSEKLA+AFGL++ P  A + I+KN 
Sbjct: 781 SHPQINQINEKIESLITEFKCRGYLYRDSTDQSYHSEKLAVAFGLLSTPSKAPILIIKNK 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVL 873
            IC +CH FI+L   VV     V  GR  H L
Sbjct: 841 RICMDCHHFIMLRLLVV-----VITGRMFHNL 867

BLAST of Bhi01G001040 vs. TrEMBL
Match: tr|A0A2I4GTT8|A0A2I4GTT8_9ROSI (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Juglans regia OX=51240 GN=LOC109010839 PE=4 SV=1)

HSP 1 Score: 1036.9 bits (2680), Expect = 2.5e-299
Identity = 502/879 (57.11%), Positives = 667/879 (75.88%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLC+ V K+ +RNELY        ++S+CNS SLKEG+CVHSPIIKLGL  +LYL+N+LL
Sbjct: 1   MLCKTVTKICSRNELY------IGVLSLCNSMSLKEGLCVHSPIIKLGLQRDLYLNNHLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAK FG+  AR  FDEMP +DVVSWT + +  VRN ++ +A ELF  M+  G+ PNEF
Sbjct: 61  SLYAKCFGVGNARYFFDEMPYKDVVSWTGILSTCVRNGNHAQALELFGSMINFGYYPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS+ +RSCS  G+   G+ +  Y++K G+++ P+L   LI LY+KCD +EEAYEVF +M
Sbjct: 121 TLSSALRSCSALGDFHYGTRIQAYMVKNGYDSNPILASALIGLYSKCDCTEEAYEVFAHM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D  D V+W+ MISSLVQAQ+W +ALQLYI M+  GV PNEFTF KLLA  + LG+ +GKL
Sbjct: 181 DGGDVVSWSTMISSLVQAQQWSQALQLYIRMIKVGVPPNEFTFVKLLAACSSLGISFGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           +H HMI LG+ +NV+LKTALVD+YS+ + +EDA+ V+N+TPE DVFLWT+IIS F++N+K
Sbjct: 241 VHAHMILLGIEVNVILKTALVDMYSKCRRMEDAVVVSNRTPEHDVFLWTAIISGFSRNMK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           V+EA+ A  +M MS I+PN+FTYS+ L+AC+SI SL+LG+QIH +VI+AGLE DV  G+A
Sbjct: 301 VREAVGALHQMEMSEIVPNNFTYSTVLNACSSILSLELGEQIHSRVIMAGLEDDVSVGNA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           L++MYMKCSN ID+AL+VFR + SP+VI WTSLI+G AEHG E D ++ FL+M+A G+QP
Sbjct: 361 LVDMYMKCSNLIDNALKVFRGMASPNVITWTSLIAGFAEHGFEADSFQAFLEMRAVGLQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA  + KS  QT   HGYI+K +A  DIV+GNALVDAYA    VD+A  VI
Sbjct: 421 NSFTLSSILGACRTIKSHGQTMKLHGYIIKTKAEKDIVLGNALVDAYAGLGMVDEAWCVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
             M+ RDAITYT+LATR+NQMG H  AL II  M  +++EMD  S+ S +SA   LG +E
Sbjct: 481 RKMSRRDAITYTTLATRMNQMGHHGSALNIITHMNYEDIEMDGFSMASFLSASASLGSME 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            GKQLHC+S+K GL    SV NSL+DLYGK G + DA++ F EI +PDV SWNG+IS LA
Sbjct: 541 AGKQLHCWSVKSGLGCWLSVSNSLVDLYGKCGRMHDAHRAFREIDEPDVASWNGLISGLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NG+ S AL+AF++MRLAG++PDS+TFL +LSACS G L+D G+ +F SM+  H + P+L
Sbjct: 601 SNGYFSFALSAFEDMRLAGVKPDSVTFLLVLSACSHGDLLDLGLEHFRSMRNIHNMVPQL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDLLGR GQL  AM++++++P+  DA IYKTLL AC  HGN+ LGED+A +GL 
Sbjct: 661 DHYVCLIDLLGRAGQLGEAMKVIKTLPFRPDALIYKTLLSACRLHGNVPLGEDMARQGLN 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P+D +FY+LLANLYD   + D   KTR+LMR+R +R SP QSW+E  ++IH+F  G  
Sbjct: 721 LHPSDPAFYILLANLYDRSGQSDFGEKTRRLMRERXLRSSPCQSWMETRNQIHVFTAGHI 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP+I +I+ K+E L  EFK RG++YH + + S+HSEKLA AFGL+N P  A + I+KN 
Sbjct: 781 SHPEITNIKAKIESLMIEFKHRGYLYHGNGDRSYHSEKLATAFGLLNTPSKAPILIIKNT 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
            +C +CH FI+ VT++V+REI++R+G  +H  K G CSC
Sbjct: 841 RMCMDCHSFIMHVTELVDREIVLREGNRVHSFKKGRCSC 873

BLAST of Bhi01G001040 vs. TrEMBL
Match: tr|A0A2P5CB14|A0A2P5CB14_PARAD (DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_168210 PE=4 SV=1)

HSP 1 Score: 1035.8 bits (2677), Expect = 5.6e-299
Identity = 500/870 (57.47%), Positives = 654/870 (75.17%), Query Frame = 0

Query: 10  LNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLLALYAKRFGL 69
           LNR+E +R E+ C +++S CNS+SL++G+CVHSP+IKLGLHGNL+LSNNLL+LYAK FG 
Sbjct: 13  LNRSEAHRFEDVCLRIVSSCNSQSLEQGLCVHSPVIKLGLHGNLFLSNNLLSLYAKCFGA 72

Query: 70  KQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEFTLSNLIRSC 129
           + A +LFDEMP RDVVSWT + +AY R+  + +A ELFD M+  G  PNEFT S+++RSC
Sbjct: 73  EHAHDLFDEMPCRDVVSWTGLVSAYSRSGKHDKALELFDSMIASGESPNEFTFSSVLRSC 132

Query: 130 SETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNMDDVDTVTWT 189
           S   E ++G+ +H ++IK G ++   L  ++I+ YAKC  SEEA+ VFR MD  +TVTWT
Sbjct: 133 SAVEEFDMGTRIHNHMIKRGLDSNSFLLSSMIDFYAKCGCSEEAHNVFRGMDSGNTVTWT 192

Query: 190 VMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKLLHCHMITLG 249
            MISSLVQAQKW  AL+ Y+ M+N+ V PNEFTF K+L  + +L L YGKLLH H IT G
Sbjct: 193 TMISSLVQAQKWILALKHYVDMINAQVPPNEFTFAKILEASCYLDLDYGKLLHAHSITRG 252

Query: 250 VNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLKVKEAIAAFQ 309
           + L+++LKTALV++YS  Q ++DA KV  QTPE DV LWTS+IS    + KVKEA++A Q
Sbjct: 253 IRLSLILKTALVNMYSSSQRMKDAFKVLRQTPEYDVVLWTSVISGLTHDSKVKEAVSALQ 312

Query: 310 EMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSALINMYMKCS 369
           EM +SG +PNSFTYS+ L AC+++  L+LGKQIH + I  GLEADVC G+AL++MYMKCS
Sbjct: 313 EMEISGFVPNSFTYSNLLKACSTVSFLELGKQIHSRSIRTGLEADVCVGNALVDMYMKCS 372

Query: 370 NFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQPNAFTLSSIL 429
           N ++D L+VFR +TSP+VI WTSLI+G A+HG EQD +  FL M+A GVQPN+FTLS++L
Sbjct: 373 NLVEDGLKVFRGMTSPNVISWTSLIAGFADHGFEQDSFHVFLQMRAVGVQPNSFTLSAVL 432

Query: 430 GASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVISTMNHRDAI 489
            A  + KS +QT   HGYI+K +   + VV NALVDAYA    V+ + RV   M  RD I
Sbjct: 433 RACRTNKSLSQTLKIHGYIIKAKECSEAVVANALVDAYAALGMVNYSWRVTRKMKDRDII 492

Query: 490 TYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVETGKQLHCYS 549
           TYTSLATR+N++G+HEMAL +I  M+ DN+EMD  SL S +SA   L  +ETGKQLHCYS
Sbjct: 493 TYTSLATRMNKLGNHEMALNVIKHMKDDNIEMDGFSLASFLSASAALATMETGKQLHCYS 552

Query: 550 LKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILAFNGHISSAL 609
           +K G  +  SV N L+DLY K G   D  + FEEIS PDVVSWN +IS LA NG+++SAL
Sbjct: 553 IKSGFSSCTSVSNGLVDLYWKCGYADDGYRAFEEISDPDVVSWNQLISGLASNGYVNSAL 612

Query: 610 AAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPELDHYACIIDL 669
           +AFD+MRLAGL+PDSITF+S+L ACS+GGLVD G+ YF SMK  + I P+LDHY C++DL
Sbjct: 613 SAFDDMRLAGLKPDSITFVSVLFACSRGGLVDLGVDYFRSMKNKYYITPQLDHYVCLVDL 672

Query: 670 LGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQLNPNDSSFY 729
           LGR GQLENAME++ +MP+  D  IYKTLL +C  HGN+ L ED+A RGL+L+P+D +FY
Sbjct: 673 LGRAGQLENAMEVIVNMPFNPDPLIYKTLLGSCKLHGNIPLAEDMARRGLELDPSDPAFY 732

Query: 730 LLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGERTHPQINDIQ 789
           LLLANLYD   + +L  KTR+LM D+GVRK+P QSWIE+ + +H+F  G+R HP  N+I 
Sbjct: 733 LLLANLYDEMGQSNLGKKTRQLMTDKGVRKNPSQSWIEIRNTVHIFNAGDRPHPLTNEIH 792

Query: 790 EKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNISICRECHDF 849
           +K+  L  E K RG+ + + E SS+HSEKLA+AFGL+N P  A +RI KN+ IC ECHDF
Sbjct: 793 DKIVSLTNELKRRGYSFKDTEGSSYHSEKLAVAFGLLNTPSKAAIRISKNMRICSECHDF 852

Query: 850 ILLVTKVVEREIIVRDGRGLHVLKNGSCSC 880
           I+L+T+ ++R++IVRDG  +H  K G CSC
Sbjct: 853 IMLLTRFIDRDVIVRDGNRIHAFKRGQCSC 882

BLAST of Bhi01G001040 vs. TrEMBL
Match: tr|A0A1Q3C880|A0A1Q3C880_CEPFO (PPR domain-containing protein/PPR_2 domain-containing protein/DYW_deaminase domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_19797 PE=4 SV=1)

HSP 1 Score: 1031.2 bits (2665), Expect = 1.4e-297
Identity = 494/882 (56.01%), Positives = 662/882 (75.06%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           M+C+   KL NR+ELYR E  CS+++S+CNSKSLKEG+CVHSPIIKLG H +LY++NNLL
Sbjct: 1   MICKTAAKLTNRSELYRFEYVCSRVVSLCNSKSLKEGICVHSPIIKLGFHHHLYMNNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LY K FG   AR+ FD+MP +DVVSWT + +AYV++  Y +A ELFDLM+ LG  PNEF
Sbjct: 61  SLYHKCFGADHARHFFDKMPYKDVVSWTAIMSAYVQSAIYEKALELFDLMVILGQSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS+ IRSC+   + + G+ +  +VIK GF+  P+LG  LI+LY+KC F  E+YEVF+ +
Sbjct: 121 TLSSAIRSCAALRDFDQGTRLQAFVIKHGFDMNPILGSNLIDLYSKCKFIMESYEVFKLL 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
            + D V WT +ISS VQ +KW + L+LYI M+ + + PNEFTF KLL  + FLGLK G+L
Sbjct: 181 GNGDIVCWTTVISSFVQDRKWSQGLELYIRMIEARIPPNEFTFVKLLVASAFLGLKCGRL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           +H HMI  GV LNVVLKTAL D+YS+ Q +E A+KV+  TPE DVFLWT+IIS F +NLK
Sbjct: 241 VHAHMIMWGVKLNVVLKTALADMYSKCQMMEYAIKVSKLTPECDVFLWTAIISGFTKNLK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
            +EA+AA+ EM  SGILPN +TY+S L+AC+SIPSL+LGKQIH + I++GLE +V  G+A
Sbjct: 301 FREAVAAYLEMEKSGILPNYYTYNSVLNACSSIPSLELGKQIHSRAIMSGLENEVSVGNA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           L++MYMKCSN I+DALRVFR +   +VI WTSLI+G+ E+G EQD +  F++M+A GV P
Sbjct: 361 LVDMYMKCSNGIEDALRVFRGMNLLNVISWTSLIAGIIENGFEQDSFHLFMEMRAVGVPP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLS IL A  +  S +Q    HGY++K +  HD VVGNALV AYAR   +DDA  V 
Sbjct: 421 NSFTLSVILRACGTVNSSSQLMKLHGYVIKTKLDHDTVVGNALVHAYARLGMLDDAWHVF 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
             M+HRDAITYTSLA+ +NQ G HEMALKII  M  D+V+MD  SL + +SA  GL   E
Sbjct: 481 GMMSHRDAITYTSLASIMNQRGSHEMALKIISRMNNDDVKMDGFSLATFLSASAGLTSTE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHC+SLK GL +  SV N ++DLYGK G + D  + FEEI+ PDV SWNG+IS LA
Sbjct: 541 TGKQLHCHSLKSGLGSWNSVSNGVVDLYGKCGYIDDVRQAFEEITAPDVFSWNGLISALA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            +GHI  AL+ FD+MRLAG++PDS+TF  +L ACSQG +VD G+ YF SMK  H IEP L
Sbjct: 601 SSGHIYCALSTFDDMRLAGVKPDSVTFFLVLFACSQGKMVDMGLEYFQSMKEKHGIEPWL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C++D+L + G+L+ A+ ++E+MP+  DA +YKTLL AC  H N+ LGED+A RGL+
Sbjct: 661 DHYVCLVDVLSQAGRLQEALGVIETMPFAPDAMVYKTLLNACKLHRNVPLGEDMARRGLE 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P+D +FY+LLANLYD   + +   KTR+LMR+RG++++PGQSW+E+ +K+HLFVTG+ 
Sbjct: 721 LHPSDPAFYILLANLYDACGQCEFGEKTRRLMRERGLKRNPGQSWMEIRNKVHLFVTGDN 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQI+ I+EK+E + +EF   G++Y +  +SS+HSE+LA+AFG +  P  A + I+ N+
Sbjct: 781 SHPQIHAIREKIESIISEFNDCGYVYQDSGDSSYHSERLAVAFGFLTTPSKAPICIINNM 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
            ICR+CH+F+ L+T++V+++IIVR G  +H   NG CSC  Y
Sbjct: 841 PICRDCHNFLTLLTQLVDKKIIVRVGNRIHTFGNGECSCQDY 882

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_022141235.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Momordica charantia])

HSP 1 Score: 1522.3 bits (3940), Expect = 0.0e+00
Identity = 750/883 (84.94%), Positives = 807/883 (91.39%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           M+CR VPK LNRNEL RLEE+CS LISICNSKSLKEG+CVHSPIIKLGL+GNLYLSNNLL
Sbjct: 1   MICRTVPKFLNRNELNRLEETCSHLISICNSKSLKEGICVHSPIIKLGLYGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
            LYAKRFGLKQARNLFDEMPD+DVVSWTTMQAAYVRNRSYIEAFELFDLM+ LGHCPNEF
Sbjct: 61  TLYAKRFGLKQARNLFDEMPDKDVVSWTTMQAAYVRNRSYIEAFELFDLMVILGHCPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS L+RSCSETGELELG+CVHGY IKGGFE+KPVLGCTLI++YAKCD +EEA EVFRNM
Sbjct: 121 TLSTLLRSCSETGELELGACVHGYAIKGGFESKPVLGCTLIDMYAKCDCTEEACEVFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D+ DTVTWT  ISSLVQAQKW+EALQLYITM+ SGVTPNEFTFTKLLAT NFL LKYGKL
Sbjct: 181 DNADTVTWTATISSLVQAQKWNEALQLYITMIESGVTPNEFTFTKLLATINFLDLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+IT GV+LNV+LKT LVD+YSRYQELEDAMKVANQT EKDV LWTSIIS FNQNLK
Sbjct: 241 LHNHVITFGVDLNVLLKTTLVDMYSRYQELEDAMKVANQTAEKDVHLWTSIISCFNQNLK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIA  QEMR+SGI PNSFTYSS LSACT IPSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIATLQEMRISGIPPNSFTYSSVLSACTLIPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMKCS+ I+DALRVFRTITSP+VICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMKCSDSINDALRVFRTITSPNVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA SSAKSQN+TSMFHGYILK+RAHHDI+VGNALVDAYARS  VD+A RVI
Sbjct: 421 NSFTLSSILGACSSAKSQNRTSMFHGYILKIRAHHDIIVGNALVDAYARSRMVDEAWRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
           STMNHRDAITYTSLATRLNQMGDHEMALK I SMR DNV  DE+SL SL+SA TGLG V+
Sbjct: 481 STMNHRDAITYTSLATRLNQMGDHEMALKTISSMRDDNVRKDEVSLASLISAATGLGTVK 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            G+QLHCYSLKYGL NT SVKNSL+DLYGKVGCLKDA K FEEI++PDVVSWNGMIS+LA
Sbjct: 541 IGEQLHCYSLKYGLYNTRSVKNSLIDLYGKVGCLKDAQKAFEEITEPDVVSWNGMISVLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGH+SSAL+AFDNMRLAGL+PDSITFL ILSACSQGGLVDFGMHYF SM+  H +EPEL
Sbjct: 601 LNGHVSSALSAFDNMRLAGLKPDSITFLLILSACSQGGLVDFGMHYFQSMREIHYVEPEL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C++DLLGR GQLE AME+VESMP+EADAKIYKTLL AC  H NMLLGEDVA RGLQ
Sbjct: 661 DHYVCLVDLLGRAGQLEKAMEVVESMPFEADAKIYKTLLSACKLHKNMLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLANLYD  NR DLS +TRKLMRDRGVRKSP QSW EL + IHLF+TG+R
Sbjct: 721 LDPYDSSFYLLLANLYDELNRPDLSKETRKLMRDRGVRKSPSQSWTELSNSIHLFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQINDIQEKLEFL+AEFK RGF+YH DENSSHHSEKLALAFGL+NLPP AV+RIMKNI
Sbjct: 781 SHPQINDIQEKLEFLKAEFKVRGFLYHGDENSSHHSEKLALAFGLINLPPKAVIRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 884
           SICRECHDFILLVTKV EREI+VRDG  LHV KNGSCSC HYS
Sbjct: 841 SICRECHDFILLVTKVAEREIVVRDGSRLHVFKNGSCSCRHYS 883
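
The percentages in an HSP header are column counts divided by the alignment length, and the counts themselves can be read off the middle line of the alignment: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch of that bookkeeping, not part of any BLAST tool; the function names `count_midline` and `percent` are hypothetical, and the eight-column excerpt is taken from the start of the XP_022141235.1 alignment above.

```python
def count_midline(midline: str) -> tuple[int, int]:
    """Count identities and positives in a BLAST midline string.

    Letters are identical residues; '+' marks a conservative substitution.
    Positives include both identities and '+' columns.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length (2 decimals)."""
    return round(100.0 * matches / alignment_length, 2)


# First eight alignment columns of the XP_022141235.1 hit (query MLCRAVPK).
excerpt_midline = "M+CR VPK"
excerpt_counts = count_midline(excerpt_midline)

# Whole-alignment counts copied from the HSP header of the same hit.
identity_pct = percent(750, 883)
positives_pct = percent(807, 883)

print(excerpt_counts)
print(f"Identity = {identity_pct}%, Positives = {positives_pct}%")
```

The same `percent` call applied to the counts in the other HSP headers should give the percentages printed beside them.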

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_023542503.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1468.4 bits (3800), Expect = 0.0e+00
Identity = 733/882 (83.11%), Positives = 796/882 (90.25%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEGVCVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGVCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFG+KQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGIKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LG CVHGY IKGGFE+KPVLGCTLI+LYAKCD +EEAYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGGCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTEEAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EALQLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEALQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE AMKVANQTPEKDVFLWTSIIS F+QN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYAMKVANQTPEKDVFLWTSIISCFSQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ IDDALRVF +I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIDDALRVFGSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGYILK  A+HDIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYILKSMAYHDIVVGNALVDAYARSKMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TG+G +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGVGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHCYSL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVS NG+ISILA
Sbjct: 541 TGKQLHCYSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSCNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL+PDSIT LS+LSACSQGGLVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLKPDSITLLSVLSACSQGGLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDL GR GQLE AMEIVESMP+EADAKIY+TLL AC  H N+LLGEDVA RGLQ
Sbjct: 661 DHYVCVIDLHGRAGQLEKAMEIVESMPFEADAKIYRTLLSACKLHRNVLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLMRDRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMRDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP+IND++EKLEFLRAEFKSRGF+YH+DE+S HHSEKLALAFGLV++PP  VVRIMKNI
Sbjct: 781 SHPEINDMEEKLEFLRAEFKSRGFLYHDDEDSCHHSEKLALAFGLVSMPPKGVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           SICRECHDFILL TKVVEREI+VRDG  LHVLKNGSCSC HY
Sbjct: 841 SICRECHDFILLATKVVEREIVVRDGSRLHVLKNGSCSCKHY 876

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_022968638.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1457.6 bits (3772), Expect = 0.0e+00
Identity = 727/883 (82.33%), Positives = 793/883 (89.81%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEGVCVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGVCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LGSCVHGY IKGGFE+KPVLGCTLI+LYAKCD ++EAYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGSCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTKEAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EA QLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEAPQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE AMKVANQTPEKDVFLWTSIIS FNQN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYAMKVANQTPEKDVFLWTSIISCFNQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAFQEMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQ+ILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFQEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQIILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ I+DALRVFR+I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIEDALRVFRSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGY+LK  A+ DIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYVLKSMAYQDIVVGNALVDAYARSGMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TGLG +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGLGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
           TGKQLHC+SL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVSWNG+ISILA
Sbjct: 541 TGKQLHCFSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSWNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL PDSIT LS+LSACSQGGLVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLNPDSITLLSVLSACSQGGLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY  +IDL GR GQLE AMEIVESMP+EADAKIYKTLL AC  H N+LLGEDVA RGL 
Sbjct: 661 DHYVRVIDLHGRAGQLEKAMEIVESMPFEADAKIYKTLLSACKLHRNVLLGEDVARRGLH 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLMRDRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMRDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP++ND++EKLEFLRAEFKSRGF+Y +DE+S HHSEKLALAFGLV++PP AV+RIMKNI
Sbjct: 781 SHPEMNDMEEKLEFLRAEFKSRGFLYRDDEDSCHHSEKLALAFGLVSMPPEAVIRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHYS 884
           SICRECHDFI+L TKVVEREI+VRD   LHV KNGSCSC HYS
Sbjct: 841 SICRECHDFIVLATKVVEREIVVRDRSRLHVFKNGSCSCKHYS 877

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_022945787.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1456.0 bits (3768), Expect = 0.0e+00
Identity = 726/882 (82.31%), Positives = 790/882 (89.57%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLCR VPK +N NELYRLEE CSQLISICNSKSLKEG+CVHSPIIKLGL GNLYLSNNLL
Sbjct: 1   MLCRTVPKFVNINELYRLEEGCSQLISICNSKSLKEGLCVHSPIIKLGLLGNLYLSNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVR+ +Y +AFELFDLM TLG+ PNEF
Sbjct: 61  SLYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRHGNYNDAFELFDLMTTLGNSPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS LIRSCSET EL+LGSCVHGY IKGGFE+KPVLGCTLI+LYAKCD ++EAYE FRNM
Sbjct: 121 TLSTLIRSCSETRELKLGSCVHGYAIKGGFESKPVLGCTLIDLYAKCDCTKEAYETFRNM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           DD DTVTWT MISSLVQAQKW EALQLYITM+ SGV PNEFTFTKLLATT+F+GLKYGKL
Sbjct: 181 DDADTVTWTTMISSLVQAQKWAEALQLYITMLESGVAPNEFTFTKLLATTSFMGLKYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           LH H+I+LGVNLNVVLKTALVD+YS YQELE A KVANQTPEKDVFLWTSIIS FNQN K
Sbjct: 241 LHSHLISLGVNLNVVLKTALVDMYSGYQELEYATKVANQTPEKDVFLWTSIISCFNQNSK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAAF EMRMSGI P+SFTYSSALSACT +PSL+LGKQIHLQVILAGLEADVCAGSA
Sbjct: 301 VKEAIAAFLEMRMSGIPPHSFTYSSALSACTLLPSLELGKQIHLQVILAGLEADVCAGSA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LINMYMK S+ IDDALRVF +I +PSVICWTSLISGLAEHG EQDCYRYFLDMQAAGVQP
Sbjct: 361 LINMYMK-SDLIDDALRVFGSIATPSVICWTSLISGLAEHGFEQDCYRYFLDMQAAGVQP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA      +NQ SMFHGYILK  A+HDIVVGNALVDAYARS  VDDA RVI
Sbjct: 421 NSFTLSSILGA-----CKNQISMFHGYILKSMAYHDIVVGNALVDAYARSGMVDDARRVI 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
            TM HRD ITYTSLATRLNQMGDHEMALK IDSMRADNV+MDEISL SLVSA TG+G +E
Sbjct: 481 RTMKHRDPITYTSLATRLNQMGDHEMALKTIDSMRADNVKMDEISLASLVSAATGVGTIE 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            GKQLHCYSL+YGLDNT SVKNSL+D YGKVGCLKDA K FEEI++PDVVSWNG+ISILA
Sbjct: 541 AGKQLHCYSLRYGLDNTRSVKNSLVDFYGKVGCLKDACKAFEEITEPDVVSWNGLISILA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NGHIS+AL+AFDNMRLAGL+PDSIT LS+LSACSQGGLVDFGMHYF +M+ TH IEP L
Sbjct: 601 LNGHISAALSAFDNMRLAGLKPDSITLLSVLSACSQGGLVDFGMHYFQTMRETHNIEPAL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDL GR GQLE AMEIVE MP+EADAK+YKTLL AC  H N+LLGEDVA RGLQ
Sbjct: 661 DHYVCVIDLHGRAGQLEKAMEIVEGMPFEADAKVYKTLLSACKLHRNVLLGEDVARRGLQ 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P DSSFYLLLA+LYD  +R DLS KTRKLMRDRG+RKSP QSW+EL  KIH+F+TG+R
Sbjct: 721 LDPYDSSFYLLLASLYDELDRPDLSTKTRKLMRDRGMRKSPSQSWVELSGKIHVFITGDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HP++ND++EKLEFLRAEFKSRGF+Y +DE+S HHSEKLALAFGLV++PP  VVRIMKNI
Sbjct: 781 SHPEMNDMEEKLEFLRAEFKSRGFLYGDDEDSCHHSEKLALAFGLVSMPPKGVVRIMKNI 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
           SICRECHDFILL TKVVEREI+VRDG  LHV  NGSCSC  Y
Sbjct: 841 SICRECHDFILLATKVVEREIVVRDGSRLHVFNNGSCSCKRY 876

BLAST of Bhi01G001040 vs. NCBI nr
Match: XP_023875935.1 (pentatricopeptide repeat-containing protein At5g52850, chloroplastic-like [Quercus suber] >XP_023882138.1 pentatricopeptide repeat-containing protein At5g52850, chloroplastic-like [Quercus suber])

HSP 1 Score: 1082.0 bits (2797), Expect = 0.0e+00
Identity = 523/882 (59.30%), Positives = 685/882 (77.66%), Query Frame = 0

Query: 1   MLCRAVPKLLNRNELYRLEESCSQLISICNSKSLKEGVCVHSPIIKLGLHGNLYLSNNLL 60
           MLC+ V K  +RNELY  ++ C +++S+CNSKSLKEGVCVHSPIIK+GL  ++YL+NNLL
Sbjct: 1   MLCKTVTKTCHRNELYHFQDICLRVVSLCNSKSLKEGVCVHSPIIKMGLQDDMYLNNNLL 60

Query: 61  ALYAKRFGLKQARNLFDEMPDRDVVSWTTMQAAYVRNRSYIEAFELFDLMLTLGHCPNEF 120
           +LYAK FG+  A + FDEMP +DVVSWT + ++YVRN ++ +A  LFD ML     PNEF
Sbjct: 61  SLYAKCFGVDHAHHFFDEMPCKDVVSWTGILSSYVRNENHEQALRLFDSMLNSSQYPNEF 120

Query: 121 TLSNLIRSCSETGELELGSCVHGYVIKGGFETKPVLGCTLINLYAKCDFSEEAYEVFRNM 180
           TLS+++RSCS  GE + G+ +  Y+IK GF++ P+L   LI+LY+KC+ +EEAY+VF  M
Sbjct: 121 TLSSVLRSCSALGEFDYGTLIQAYMIKNGFDSNPILASVLIDLYSKCNCTEEAYKVFECM 180

Query: 181 DDVDTVTWTVMISSLVQAQKWDEALQLYITMMNSGVTPNEFTFTKLLATTNFLGLKYGKL 240
           D  DTV+WT MISSLVQAQKW +ALQ YI M+   V PNEFTF KLLA +  LG  YGKL
Sbjct: 181 DGGDTVSWTTMISSLVQAQKWSQALQFYIQMIEKKVPPNEFTFVKLLAASGSLGSSYGKL 240

Query: 241 LHCHMITLGVNLNVVLKTALVDVYSRYQELEDAMKVANQTPEKDVFLWTSIISFFNQNLK 300
           +H HMI LG+ LNV+LKT+LVD+YS+   +EDA+KV+NQTPE+DVFLWT+IIS F QN+K
Sbjct: 241 VHAHMILLGIELNVILKTSLVDMYSKCHRMEDAVKVSNQTPERDVFLWTAIISGFIQNMK 300

Query: 301 VKEAIAAFQEMRMSGILPNSFTYSSALSACTSIPSLKLGKQIHLQVILAGLEADVCAGSA 360
           VKEAIAA  EM MSGI+PN+F+YS+ L+A +SI SL+LG+Q+H +VI AGLE D+  G+A
Sbjct: 301 VKEAIAALSEMVMSGIVPNNFSYSTILNASSSILSLELGEQVHSRVIKAGLEDDISVGNA 360

Query: 361 LINMYMKCSNFIDDALRVFRTITSPSVICWTSLISGLAEHGCEQDCYRYFLDMQAAGVQP 420
           LI+MYMKCSN ID+ALRVFR ++SP+VI WTSLI+G A+HG E+D +R F +M+A G+ P
Sbjct: 361 LIDMYMKCSNLIDNALRVFRGMSSPNVITWTSLIAGFAKHGFEEDSFRSFEEMRALGLAP 420

Query: 421 NAFTLSSILGASSSAKSQNQTSMFHGYILKMRAHHDIVVGNALVDAYARSAKVDDACRVI 480
           N+FTLSSILGA S+ KS +QT   HGYI+K++A  DIVVGNALVDAYA    VD+A  V+
Sbjct: 421 NSFTLSSILGACSTMKSHSQTMKLHGYIIKIKADCDIVVGNALVDAYAGLGMVDEARCVV 480

Query: 481 STMNHRDAITYTSLATRLNQMGDHEMALKIIDSMRADNVEMDEISLTSLVSALTGLGIVE 540
             M+HRDAITYTSLATR+NQMG H+ AL+II  M  D+V+MD  S++S +SA  GLG ++
Sbjct: 481 RKMDHRDAITYTSLATRINQMGYHDRALEIIKYMNKDDVKMDGFSMSSFLSAAAGLGSMK 540

Query: 541 TGKQLHCYSLKYGLDNTCSVKNSLMDLYGKVGCLKDANKVFEEISKPDVVSWNGMISILA 600
            G QLHC+S+K GL    SV N L+DLYGK GC+ DA++ F EI++PDV SWNG IS LA
Sbjct: 541 AGMQLHCFSVKSGLRCWLSVSNGLVDLYGKCGCIHDAHRAFGEITEPDVASWNGWISGLA 600

Query: 601 FNGHISSALAAFDNMRLAGLEPDSITFLSILSACSQGGLVDFGMHYFHSMKATHKIEPEL 660
            NG+ISSAL+AF++MRL G++PD +TFL +L ACS GGLVD G+ YFHSM+ TH I P+L
Sbjct: 601 SNGYISSALSAFEDMRLVGVKPDLVTFLLVLFACSHGGLVDLGLDYFHSMRETHGIAPQL 660

Query: 661 DHYACIIDLLGRVGQLENAMEIVESMPYEADAKIYKTLLKACNFHGNMLLGEDVAIRGLQ 720
           DHY C+IDLLGR GQLE AM ++++MP+  DA IYKTLL A   HGN+ LGED+A +G+ 
Sbjct: 661 DHYVCLIDLLGRAGQLEEAMGVIKTMPFRPDALIYKTLLSASKLHGNVPLGEDMARQGID 720

Query: 721 LNPNDSSFYLLLANLYDGYNRQDLSAKTRKLMRDRGVRKSPGQSWIELHSKIHLFVTGER 780
           L+P+D +FY+LLANLYD   R DLS K R LM++RG+ K+P QSW+E+ ++IH F   +R
Sbjct: 721 LDPSDPAFYILLANLYDRSGRSDLSEKARGLMKERGLMKNPCQSWMEIRNQIHHFTAEDR 780

Query: 781 THPQINDIQEKLEFLRAEFKSRGFMYHEDENSSHHSEKLALAFGLVNLPPTAVVRIMKNI 840
           +HPQIN I EK+E L  EFK RG++Y ++ + S+HSEKLA+AFGL++ P  A + I+KN 
Sbjct: 781 SHPQINQIHEKIESLMTEFKYRGYLYRDNRDKSYHSEKLAVAFGLLSTPSKAPILIIKNT 840

Query: 841 SICRECHDFILLVTKVVEREIIVRDGRGLHVLKNGSCSCSHY 883
            IC +CH F++LVT++V+REII+R+G  +H  + G+CSC  Y
Sbjct: 841 RICMDCHYFVMLVTELVDREIILREGNRVHSFRKGNCSCRGY 882

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
AT5G52850.1   3.6e-251  48.74     Pentatricopeptide repeat (PPR) superfamily protein
AT4G13650.1   1.0e-141  31.74     Pentatricopeptide repeat (PPR) superfamily protein
AT3G57430.1   7.0e-138  32.16     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33170.1   5.4e-130  29.36     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1   1.2e-129  29.95     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name             E-value   Identity  Description
sp|Q9FLX6|PP430_ARATH  6.5e-250  48.74     Pentatricopeptide repeat-containing protein At5g52850, chloroplastic OS=Arabidop...
sp|Q9SVP7|PP307_ARATH  1.9e-140  31.74     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
sp|Q7Y211|PP285_ARATH  1.3e-136  32.16     Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
sp|Q9SMZ2|PP347_ARATH  9.7e-129  29.36     Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...
sp|Q9SS60|PP210_ARATH  2.2e-128  29.95     Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...

Match Name                      E-value   Identity  Description
tr|A0A2P5FGP2|A0A2P5FGP2_9ROSA  9.3e-302  57.62     DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_073370 ...
tr|A0A2N9F2C3|A0A2N9F2C3_FAGSY  1.1e-299  59.40     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS13008 PE=4 SV=1
tr|A0A2I4GTT8|A0A2I4GTT8_9ROSI  2.5e-299  57.11     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g52850, chlo...
tr|A0A2P5CB14|A0A2P5CB14_PARAD  5.6e-299  57.47     DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_168...
tr|A0A1Q3C880|A0A1Q3C880_CEPFO  1.4e-297  56.01     PPR domain-containing protein/PPR_2 domain-containing protein/DYW_deaminase doma...

Match Name      E-value  Identity  Description
XP_022141235.1  0.0e+00  84.94     pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Momordica ...
XP_023542503.1  0.0e+00  83.11     pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita ...
XP_022968638.1  0.0e+00  82.33     pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita ...
XP_022945787.1  0.0e+00  82.31     pentatricopeptide repeat-containing protein At5g52850, chloroplastic [Cucurbita ...
XP_023875935.1  0.0e+00  59.30     pentatricopeptide repeat-containing protein At5g52850, chloroplastic-like [Querc...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR032867  DYW_dom
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Bhi01M001040  Bhi01M001040  mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term   IPR Description   Source   Source Term   Source Description
    Alignment (one line per match)

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10  -
    coord: 4..144    e-value: 2.7E-20  score: 75.1
    coord: 442..551  e-value: 8.9E-15  score: 57.0
    coord: 231..333  e-value: 2.8E-16  score: 61.7
    coord: 561..798  e-value: 1.6E-33  score: 118.4
    coord: 335..441  e-value: 4.1E-15  score: 57.5
    coord: 145..230  e-value: 2.7E-18  score: 67.9

IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 761..872  e-value: 9.5E-21  score: 73.9

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 663..686  e-value: 0.74   score: 10.1
    coord: 159..180  e-value: 0.014  score: 15.5
    coord: 562..584  e-value: 0.021  score: 15.0
    coord: 489..518  e-value: 0.036  score: 14.2

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 590..624  e-value: 8.3E-4  score: 17.4
    coord: 186..220  e-value: 9.4E-8  score: 29.8
    coord: 287..319  e-value: 5.9E-4  score: 17.8
    coord: 489..523  e-value: 0.0032  score: 15.5
    coord: 388..422  e-value: 1.0E-5  score: 23.4
    coord: 85..119   e-value: 4.3E-5  score: 21.4

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 385..430  e-value: 3.1E-7   score: 30.3
    coord: 184..227  e-value: 6.9E-11  score: 42.1
    coord: 83..129   e-value: 5.9E-8   score: 32.7
    coord: 587..635  e-value: 1.9E-10  score: 40.6
    coord: 283..330  e-value: 2.5E-9   score: 37.1

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 725..759  score: 5.448
    coord: 184..218  score: 12.518
    coord: 284..318  score: 9.986
    coord: 623..653  score: 7.432
    coord: 153..183  score: 6.325
    coord: 319..353  score: 6.182
    coord: 487..521  score: 9.109
    coord: 52..82    score: 6.665
    coord: 83..117   score: 10.479
    coord: 118..152  score: 7.158
    coord: 659..689  score: 6.84
    coord: 456..486  score: 7.87
    coord: 253..283  score: 5.897
    coord: 588..622  score: 10.457
    coord: 354..385  score: 6.073
    coord: 522..556  score: 5.985
    coord: 557..587  score: 6.632
    coord: 386..420  score: 9.635

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 28..204
    coord: 199..302
    coord: 318..799

None  No IPR available  PANTHER  PTHR24015:SF521  SUBFAMILY NOT NAMED
    coord: 199..302
    coord: 28..204
    coord: 318..799
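
The PROSITE PS51375 matches listed above are reported one motif at a time and out of positional order, which makes it hard to see how the repeats are arranged along the protein. The Python sketch below sorts the coordinates and joins motifs that overlap or directly abut into contiguous blocks. Treating abutting motifs as one block is an assumption made here for illustration, not something the InterPro analysis reports, and the function name `merge_adjacent` is hypothetical.

```python
def merge_adjacent(intervals):
    """Merge 1-based inclusive intervals that overlap or directly abut."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extends (or touches) the previous block: grow it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


# PROSITE PS51375 coordinates copied from the table above, in listed order.
ppr_motifs = [
    (725, 759), (184, 218), (284, 318), (623, 653), (153, 183), (319, 353),
    (487, 521), (52, 82), (83, 117), (118, 152), (659, 689), (456, 486),
    (253, 283), (588, 622), (354, 385), (522, 556), (557, 587), (386, 420),
]

blocks = merge_adjacent(ppr_motifs)
for start, end in blocks:
    print(f"{start}..{end} ({end - start + 1} residues)")
```

Under that assumption the 18 motifs collapse into five blocks (52..218, 253..420, 456..653, 659..689 and 725..759), the last of which ends just before the DYW domain match at 761..872.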