Bhi04G000840 (gene) Wax gourd (B227) v1

Overview
Name: Bhi04G000840
Type: gene
Organism: Benincasa hispida (Wax gourd (B227) v1)
Description: Pentatricopeptide repeat-containing protein
Location: chr4: 25949943 .. 25954895 (-)
RNA-Seq Expression: Bhi04G000840
Synteny: Bhi04G000840
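
The location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a string like 'chr4: 25949943 .. 25954895 (-)'.

    Returns (seqid, start, end, strand). The pattern is an assumption
    based on the single location string shown in this record.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("chr4: 25949943 .. 25954895 (-)")
print(seqid, strand, span_length(start, end))  # chr4 - 4953
```

Under that assumption the gene spans 4953 bp on the minus strand of chr4.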
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTATTAAAAAAAAGCATTCAATTAATCCAATTCATGAATTGAAAATTTGGAGTGAAAAAAAAACCCATGTCCCATTGGACTTTTTCCCCCTTAAAATAGCTCTCTCTCTCTCTCTCTCCTCCCTTCCCCGCCGCCGATTGCACCGTCAGTCTCTCCTTCTTCAGTTCAGCCTCTGCGTTCGCCGGATTTTTCATCTTCTTCCTCTCCTTCGAACCGCAGACCAGATATCCTTCCAAAATCCTCAAAGACGGGTTGGTTCTCACTATCTCGACGGTTCGGTTTCAGGTTTTAAAGCCGAACCGCAAACACAACTGTGATTTTTTAGCTTAAATTTAGAGCATTTTGATTAAGCAGAGGGCTTCTGCCCCTAAAACGCGAATGCTTCGCCTCTGCATTTGTTCGTCCTCTCCTTGCCGCCATTGGAGTTCGAAGCTCAATTACTCCCAGTTCAACCTGCCATGCGTTTTGGTAAAATTGCCGCAATTAATCACCCACACCAACTTCGCCGGAATGTTTGCTTATTGATGAATTATTTTTCGTCCTGTGCTCTCAGCAGTTTGAATATCATCGAAGAGAGCAATACCCACAACTGGAATTACCTGGAGTTGCAGTCTCGAATGCAGAACTATGCGGCTTCTGGTGATCTTGCTGAAGCTCTTGAGACTTTGAATTTTATGAGAAATGTTGCTGGGAAGCCCTCTGTGTATGATTTCAACGCTTTGTTTCATAGATATTTGAGTTCTGGAAATGTTCTGTTGGAACCATTGGTTCAAGTGTATATAGGAATGAAGAGGTTTGGGCCAACCCCAAATAAAACGACTTTCAACATACTCCTTAATGGACTTATGTCGTTGGGTTATCTTAGAGATGCATATTTTTTTGCAGAAGAGATGAGCAAGAGTGGGATAAATCCATCCTTTACATCCTTATCCAAATTGCTTAAAATTTCGATGAAATCAGGTAGTTTAGTTCATTCTATTTGGATATTCAAGCTCATGTTGAGGTTAAATCATTTGCCAACTGAACCTACTTTAGCCATGTTTGTTTGTATGCTTTGTAAAGCCGGGATGTTGGAAGAGGCATTCAGCTTGTGTGCTGCAATTCTATCCAAAAGTTTTAATTTTCAAGCATATGTATTTAATCCTGTTCTTTGGGCTTTATGTAAGTGTGGCAAGAGTTTTATAGCTTTGCAGTTTTTTTATATGATGAAAAAGAAAGGCATGACTCATAATGTATGTTCATATACTGCTTTGCTTTATGGATTTGGAAGGGAACGTTTGTGGGTACATCTTTATTGTTGTTTAGATCAAATGAGAAGTGATGGATGTAAGCCCAATGTCATTACTTACACGGTCATTATTAAGTTTCTTTGTGATGATGGAAGGATTGGTGAAGCATTTTATTTCTTGAAACTCATGGAAGGGGAGGGATGTGATCCAGACTTGGTAACTTACAATATAATTATCCGTGCTCTTTGCCTTCACGATAAAGCATATGATGCTGCTGAGATTTTGCAGGTGATTCATCACAGAGGTTTCTCTCCTGATGCATATACGTATACTGCTTTGGCTGGAGGTATAATGAAAGTAGGAAAGTCAGACATTGCTTATGAGTTATTGTGCAATGTGTTCTCGAGAAACTGTACAGTTGACGTTGTTGTGTACAATATATACTTACATTGCTTGTGTCAAAATACTAGATCAAGAGAAGCACTTTCTCTGTTGAAAAGTATGAAAGAAGGAGGTATTGCTCCAACTACTGTGTCATATAACACAGTTTTAAGGGGCTTTTGTAGAGATCATAAACTTGAACATGCATTGAAGCTATTAGAATGCTTCGAGTGGCTTGAGAGCGGCCCTGATGTGATTTCATTCAATACAGTTCTCTCTGCAGCATGTAAACTTGGGAATTTAGTTCTAATTCGCAGGGTCTTGCATTGTATGGAATTTAGAGGTGTTGAGCCAGATGTAATAAGCTTGACCTGTTTGGTTCAATATTT
GTCTACTATGGGAAGATACTCAGAATGCTTGAAATTATTGGAATACATGGTATGTAATGGTCCTGCTCCCTCAAGTGTCACTTTCAATATTCTCCTTGACAAGCTTTGCAGAAGTGGATTTATAAGCACTGCATACCAGATATTTGAGGATCTCCAAAATGCTGGATTGTTGCTTGACAGAAAAACTTACAGTATTCTTCTACGTGCCTTATTAAGGAAGTGTGATGACAATTTGATTGAACGACTGCTTCAGGATATGTACAAGCAGAGATTGTCTCCTGATCTTTCTATTTATGGTTCAAACACTAATGATATTTGTCAGGATGGTAATATATCTACTGCTCTCTTTACCAGGGGTCAAACTCTAGGGAATGGACTTAGTCCCTCTATGGAAATGTACAATAGATTGTTGAAGGCTGTGGTGCCAAAAAAATAGGCATTGGATTATGACTTATGACAGGATGGTGAAAAACTAATGGCACGATTTCAATTATTGGAGTTCTAAAGAAGAATTTTCATGGGTTATGGATCATGGTTTCTATAGCATGGGGCAATTGATTGAAATACATAAAGTGCTGGTGAATGGCTAATGTGTGTACTGATGGAGCAGTTGACCACATTCTCATCTCATGCACCAGCATCAGAAAACGCGAAGAGCAGCATTGAGGGACCTTTTCCTTGATATTTTGCCTGCTTTTCCAGGTATTAAGATTTCATTATGATATTTTGATTTTGTCATGAACCTCCATAATCCATTACTTAAGATCAGATAAATTTTACACCTTGCTTCCTCCATCTCGTATCTTCTTTCTAAGACACAATTCAGAAAATGCAGGCAGAAAATTTTGTTGGGGGGTAATTTTAGTTTGGACCTCAGCCTCTTCTTCGGGTCTTGGAGATGACATCTCACTATGTACTTTGCTGTTATGGACATTAGTGGCTACGAGGGTCGAAGATGATAACTAAGATCCAATTAAGCTATGTTTATTGAGCATTAGGAGATAAAGATATGAAATTGCTCCTTTTTTATGCGACTTCTTTGGATCTTGGAAAAGTCATCTACGTAGTCGGCTGTTATGGAAATTAGTGGCTACGGGGGTCGAAGATGATATCTAAGATCCAAGTAAGTTATGTTTAGGTGAGCATTAGTTTGAGTGGGAGATAAAGGTATGAAACTATGCAACTTTTTCTTTAAAAAGCAGACGTGCTGTAGTAACTGCTAGAAGTGATGATATTTTTTCTTCTTAGGATCTCTGTGAGATATAAACACAAACCGTCTACCACACTTCATTTTATTTTGTTTCTGATCTTTTCACTGCCCTTGATTATTTTCCCGGTTTGTTTTGGGAGGATTTTATATTTAGCTGCTACTTGACTCTAGATTTTGAGGTTCTAGCATGCAGGCCCTTTGGATTCAGAGTTTAGGATAGTTCATTTGTGGTGTTGGATATACTTCCCTTGAGAAACTATTTTAGGCTTAGAGTTCTTTGATGATACATGATATCATCTTTATTTTGTTTCAATGTTTCTATATGTTTGCTCAATTAACGACATAAAAGACTTGTTCGTTGAGAAAAATGTTTGGTGAATTCAGGTATTGAGCAAACTCAGCAAAATGCGTTTTATGGTTGTTCACTCGTGGATGGGTTGAAATTTGGTGCTCGGACCGTGTGGATGATTGATTTCTAGCTACATGTAGCAATGGAACAGCCAAGTTGATTGATATGTGAATGGAAGATATCTGCCATTCTTATTCTCAGACCAGACAGCCTTCCGTTCAAGTTTTTTATAAAGTTTGAGAGGTGTCCACGGTCTCAGGTTCCTTCCACCTTGAAAGGCCATGAGATTTCATGCTATACGCACCGGGATCAGATATGCAAATGCATCACGTTATGATATTCAAATTATCGTCGATCTTCAAGAAATTATTCTGCTTTTCAAGAATGCATCGAGTTGGAAATTCTGTTTATACGAACTGTGGCCTACAGTACAGTCCCAAA
GAATAACATTATATGGCCTTGAACAGAGATAGGTAAGTTTAGCGTTTGTAATTTACCTTTCTCACAAGTTTATTAATATTGAACTTTGTGGAATGTCTGTATATGACACAAACATCAATATAGCTCTACTAGCATTTTAGCAAAAACAATTTAAATTATCAACAAATTTGATTATTGAATTGATAAAGCTCTTCAAATGTTGAGTACCATTTTGAG

mRNA sequence

ATTATTAAAAAAAAGCATTCAATTAATCCAATTCATGAATTGAAAATTTGGAGTGAAAAAAAAACCCATGTCCCATTGGACTTTTTCCCCCTTAAAATAGCTCTCTCTCTCTCTCTCTCCTCCCTTCCCCGCCGCCGATTGCACCGTCAGTCTCTCCTTCTTCAGTTCAGCCTCTGCGTTCGCCGGATTTTTCATCTTCTTCCTCTCCTTCGAACCGCAGACCAGATATCCTTCCAAAATCCTCAAAGACGGGTTGGTTCTCACTATCTCGACGGTTCGGTTTCAGGTTTTAAAGCCGAACCGCAAACACAACTGTGATTTTTTAGCTTAAATTTAGAGCATTTTGATTAAGCAGAGGGCTTCTGCCCCTAAAACGCGAATGCTTCGCCTCTGCATTTGTTCGTCCTCTCCTTGCCGCCATTGGAGTTCGAAGCTCAATTACTCCCAGTTCAACCTGCCATGCGTTTTGCAGTTTGAATATCATCGAAGAGAGCAATACCCACAACTGGAATTACCTGGAGTTGCAGTCTCGAATGCAGAACTATGCGGCTTCTGGTGATCTTGCTGAAGCTCTTGAGACTTTGAATTTTATGAGAAATGTTGCTGGGAAGCCCTCTGTGTATGATTTCAACGCTTTGTTTCATAGATATTTGAGTTCTGGAAATGTTCTGTTGGAACCATTGGTTCAAGTGTATATAGGAATGAAGAGGTTTGGGCCAACCCCAAATAAAACGACTTTCAACATACTCCTTAATGGACTTATGTCGTTGGGTTATCTTAGAGATGCATATTTTTTTGCAGAAGAGATGAGCAAGAGTGGGATAAATCCATCCTTTACATCCTTATCCAAATTGCTTAAAATTTCGATGAAATCAGGTAGTTTAGTTCATTCTATTTGGATATTCAAGCTCATGTTGAGGTTAAATCATTTGCCAACTGAACCTACTTTAGCCATGTTTGTTTGTATGCTTTGTAAAGCCGGGATGTTGGAAGAGGCATTCAGCTTGTGTGCTGCAATTCTATCCAAAAGTTTTAATTTTCAAGCATATGTATTTAATCCTGTTCTTTGGGCTTTATGTAAGTGTGGCAAGAGTTTTATAGCTTTGCAGTTTTTTTATATGATGAAAAAGAAAGGCATGACTCATAATGTATGTTCATATACTGCTTTGCTTTATGGATTTGGAAGGGAACGTTTGTGGGTACATCTTTATTGTTGTTTAGATCAAATGAGAAGTGATGGATGTAAGCCCAATGTCATTACTTACACGGTCATTATTAAGTTTCTTTGTGATGATGGAAGGATTGGTGAAGCATTTTATTTCTTGAAACTCATGGAAGGGGAGGGATGTGATCCAGACTTGGTAACTTACAATATAATTATCCGTGCTCTTTGCCTTCACGATAAAGCATATGATGCTGCTGAGATTTTGCAGGTGATTCATCACAGAGGTTTCTCTCCTGATGCATATACGTATACTGCTTTGGCTGGAGGTATAATGAAAGTAGGAAAGTCAGACATTGCTTATGAGTTATTGTGCAATGTGTTCTCGAGAAACTGTACAGTTGACGTTGTTGTGTACAATATATACTTACATTGCTTGTGTCAAAATACTAGATCAAGAGAAGCACTTTCTCTGTTGAAAAGTATGAAAGAAGGAGGTATTGCTCCAACTACTGTGTCATATAACACAGTTTTAAGGGGCTTTTGTAGAGATCATAAACTTGAACATGCATTGAAGCTATTAGAATGCTTCGAGTGGCTTGAGAGCGGCCCTGATGTGATTTCATTCAATACAGTTCTCTCTGCAGCATGTAAACTTGGGAATTTAGTTCTAATTCGCAGGGTCTTGCATTGTATGGAATTTAGAGGTGTTGAGCCAGATGTAATAAGCTTGACCTGTTTGGTTCAATATTTGTCTACTATGGGAAGATACTCAGAATGCTTGAAATTATTGGAATACATGGTATGTAATGGTCCTGCTCCCTCAAGTGTCACTTTC
AATATTCTCCTTGACAAGCTTTGCAGAAGTGGATTTATAAGCACTGCATACCAGATATTTGAGGATCTCCAAAATGCTGGATTGTTGCTTGACAGAAAAACTTACAGTATTCTTCTACGTGCCTTATTAAGGAAGTGTGATGACAATTTGATTGAACGACTGCTTCAGGATATGTACAAGCAGAGATTGTCTCCTGATCTTTCTATTTATGGTTCAAACACTAATGATATTTGTCAGGATGGTAATATATCTACTGCTCTCTTTACCAGGGGTCAAACTCTAGGGAATGGACTTAGTCCCTCTATGGAAATGTACAATAGATTGTTGAAGGCTGTGGTGCCAAAAAAATAGGCATTGGATTATGACTTATGACAGGATGGTGAAAAACTAATGGCACGATTTCAATTATTGGAGTTCTAAAGAAGAATTTTCATGGGTTATGGATCATGGTTTCTATAGCATGGGGCAATTGATTGAAATACATAAAGTGCTGGTGAATGGCTAATGTGTGTACTGATGGAGCAGTTGACCACATTCTCATCTCATGCACCAGCATCAGAAAACGCGAAGAGCAGCATTGAGGGACCTTTTCCTTGATATTTTGCCTGCTTTTCCAGGTATTGAGCAAACTCAGCAAAATGCGTTTTATGGTTGTTCACTCGTGGATGGGTTGAAATTTGGTGCTCGGACCGTGTGGATGATTGATTTCTAGCTACATGTAGCAATGGAACAGCCAAGTTGATTGATATGTGAATGGAAGATATCTGCCATTCTTATTCTCAGACCAGACAGCCTTCCGTTCAAGTTTTTTATAAAGTTTGAGAGGTGTCCACGGTCTCAGGTTCCTTCCACCTTGAAAGGCCATGAGATTTCATGCTATACGCACCGGGATCAGATATGCAAATGCATCACGTTATGATATTCAAATTATCGTCGATCTTCAAGAAATTATTCTGCTTTTCAAGAATGCATCGAGTTGGAAATTCTGTTTATACGAACTGTGGCCTACAGTACAGTCCCAAAGAATAACATTATATGGCCTTGAACAGAGATAGGTAAGTTTAGCGTTTGTAATTTACCTTTCTCACAAGTTTATTAATATTGAACTTTGTGGAATGTCTGTATATGACACAAACATCAATATAGCTCTACTAGCATTTTAGCAAAAACAATTTAAATTATCAACAAATTTGATTATTGAATTGATAAAGCTCTTCAAATGTTGAGTACCATTTTGAG

Coding sequence (CDS)

ATGCAGAACTATGCGGCTTCTGGTGATCTTGCTGAAGCTCTTGAGACTTTGAATTTTATGAGAAATGTTGCTGGGAAGCCCTCTGTGTATGATTTCAACGCTTTGTTTCATAGATATTTGAGTTCTGGAAATGTTCTGTTGGAACCATTGGTTCAAGTGTATATAGGAATGAAGAGGTTTGGGCCAACCCCAAATAAAACGACTTTCAACATACTCCTTAATGGACTTATGTCGTTGGGTTATCTTAGAGATGCATATTTTTTTGCAGAAGAGATGAGCAAGAGTGGGATAAATCCATCCTTTACATCCTTATCCAAATTGCTTAAAATTTCGATGAAATCAGGTAGTTTAGTTCATTCTATTTGGATATTCAAGCTCATGTTGAGGTTAAATCATTTGCCAACTGAACCTACTTTAGCCATGTTTGTTTGTATGCTTTGTAAAGCCGGGATGTTGGAAGAGGCATTCAGCTTGTGTGCTGCAATTCTATCCAAAAGTTTTAATTTTCAAGCATATGTATTTAATCCTGTTCTTTGGGCTTTATGTAAGTGTGGCAAGAGTTTTATAGCTTTGCAGTTTTTTTATATGATGAAAAAGAAAGGCATGACTCATAATGTATGTTCATATACTGCTTTGCTTTATGGATTTGGAAGGGAACGTTTGTGGGTACATCTTTATTGTTGTTTAGATCAAATGAGAAGTGATGGATGTAAGCCCAATGTCATTACTTACACGGTCATTATTAAGTTTCTTTGTGATGATGGAAGGATTGGTGAAGCATTTTATTTCTTGAAACTCATGGAAGGGGAGGGATGTGATCCAGACTTGGTAACTTACAATATAATTATCCGTGCTCTTTGCCTTCACGATAAAGCATATGATGCTGCTGAGATTTTGCAGGTGATTCATCACAGAGGTTTCTCTCCTGATGCATATACGTATACTGCTTTGGCTGGAGGTATAATGAAAGTAGGAAAGTCAGACATTGCTTATGAGTTATTGTGCAATGTGTTCTCGAGAAACTGTACAGTTGACGTTGTTGTGTACAATATATACTTACATTGCTTGTGTCAAAATACTAGATCAAGAGAAGCACTTTCTCTGTTGAAAAGTATGAAAGAAGGAGGTATTGCTCCAACTACTGTGTCATATAACACAGTTTTAAGGGGCTTTTGTAGAGATCATAAACTTGAACATGCATTGAAGCTATTAGAATGCTTCGAGTGGCTTGAGAGCGGCCCTGATGTGATTTCATTCAATACAGTTCTCTCTGCAGCATGTAAACTTGGGAATTTAGTTCTAATTCGCAGGGTCTTGCATTGTATGGAATTTAGAGGTGTTGAGCCAGATGTAATAAGCTTGACCTGTTTGGTTCAATATTTGTCTACTATGGGAAGATACTCAGAATGCTTGAAATTATTGGAATACATGGTATGTAATGGTCCTGCTCCCTCAAGTGTCACTTTCAATATTCTCCTTGACAAGCTTTGCAGAAGTGGATTTATAAGCACTGCATACCAGATATTTGAGGATCTCCAAAATGCTGGATTGTTGCTTGACAGAAAAACTTACAGTATTCTTCTACGTGCCTTATTAAGGAAGTGTGATGACAATTTGATTGAACGACTGCTTCAGGATATGTACAAGCAGAGATTGTCTCCTGATCTTTCTATTTATGGTTCAAACACTAATGATATTTGTCAGGATGGTAATATATCTACTGCTCTCTTTACCAGGGGTCAAACTCTAGGGAATGGACTTAGTCCCTCTATGGAAATGTACAATAGATTGTTGAAGGCTGTGGTGCCAAAAAAATAG

Protein sequence

MQNYAASGDLAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHSIWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLIERLLQDMYKQRLSPDLSIYGSNTNDICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKAVVPKK
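
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard nuclear genetic code (NCBI translation table 1), the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input here rather than the full sequence.

```python
# Build the standard genetic code (NCBI translation table 1).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)

# First ten codons of the CDS listed above.
print(translate("ATGCAGAACTATGCGGCTTCTGGTGATCTT"))  # MQNYAASGDL
# Last five codons of the CDS, ending in the TAG stop codon.
print(translate("GTGCCAAAAAAATAG"))  # VPKK
```

Both fragments agree with the start (MQNYAASGDL) and end (VPKK) of the protein sequence given in the record.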
Homology
BLAST of Bhi04G000840 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 212.2 bits (539), Expect = 1.1e-54
Identity = 131/459 (28.54%), Positives = 223/459 (48.58%), Query Frame = 0

Query: 73  LNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHSIWIFKLMLRLNH 132
           L  ++  G L + + F E M   G  P     + L++   + G    +  I +++     
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 133 LPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFIALQ 192
           +P   T  + +   CKAG +  A S+   +   S +     +N +L +LC  GK   A++
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 193 FFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLC 252
               M ++    +V +YT L+    R+    H    LD+MR  GC P+V+TY V++  +C
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 253 DDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAY 312
            +GR+ EA  FL  M   GC P+++T+NII+R++C   +  DA ++L  +  +GFSP   
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 313 TYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSM 372
           T+  L   + + G    A ++L  +    C  + + YN  LH  C+  +   A+  L+ M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 373 KEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNL 432
              G  P  V+YNT+L   C+D K+E A+++L         P +I++NTV+    K G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 433 VLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNIL 492
               ++L  M  + ++PD I+ + LV  LS  G+  E +K        G  P++VTFN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 493 LDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRAL 532
           +  LC+S     A      + N G   +  +Y+IL+  L
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGL 564

BLAST of Bhi04G000840 vs. TAIR 10
Match: AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 211.5 bits (537), Expect = 1.9e-54
Identity = 142/565 (25.13%), Positives = 253/565 (44.78%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNV-AGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKR 60
           M++Y  +G   +    +  MRNV + +P+   +N +    L SGN   +    V+  M  
Sbjct: 153 MRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVL-EILVSGN-CHKVAANVFYDMLS 212

Query: 61  FGPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVH 120
               P   TF +++    ++  +  A     +M+K G  P+      L+    K   +  
Sbjct: 213 RKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNE 272

Query: 121 SIWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLW 180
           ++ + + M  +  +P   T    +  LCK   + EA  +   +L + F      +  ++ 
Sbjct: 273 ALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMN 332

Query: 181 ALCKCGKSFIALQFFY--------------------------------MMKKKGMTHNVC 240
            LCK G+   A   FY                                M+   G+  +VC
Sbjct: 333 GLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVC 392

Query: 241 SYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLCDDGRIGEAFYFLKLM 300
           +Y +L+YG+ +E L       L  MR+ GCKPNV +YT+++   C  G+I EA+  L  M
Sbjct: 393 TYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEM 452

Query: 301 EGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAYTYTALAGGIMKVGKS 360
             +G  P+ V +N +I A C   +  +A EI + +  +G  PD YT+ +L  G+ +V + 
Sbjct: 453 SADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVDEI 512

Query: 361 DIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSMKEGGIAPTTVSYNTV 420
             A  LL ++ S     + V YN  ++   +    +EA  L+  M   G     ++YN++
Sbjct: 513 KHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLVNEMVFQGSPLDEITYNSL 572

Query: 421 LRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNLVLIRRVLHCMEFRGV 480
           ++G CR  +++ A  L E        P  IS N +++  C+ G +         M  RG 
Sbjct: 573 IKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLRGS 632

Query: 481 EPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNILLDKLCRSGFISTAYQ 533
            PD+++   L+  L   GR  + L +   +   G  P +VTFN L+  LC+ GF+  A  
Sbjct: 633 TPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAEGIPPDTVTFNTLMSWLCKGGFVYDACL 692

BLAST of Bhi04G000840 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 209.9 bits (533), Expect = 5.6e-54
Identity = 145/553 (26.22%), Positives = 268/553 (48.46%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAG-KPSVYDFNALFHRYLSSGNVLLEPLVQV-YIGMK 60
           +++YA      E L  +++M +  G KP  + +N + +  L  GN L   LV++ +  M 
Sbjct: 125 IESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLN-LLVDGNSL--KLVEISHAKMS 184

Query: 61  RFGPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLV 120
            +G  P+ +TFN+L+  L     LR A    E+M   G+ P   + + +++  ++ G L 
Sbjct: 185 VWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLD 244

Query: 121 HSIWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNF-QAYVFNPV 180
            ++ I + M+      +  ++ + V   CK G +E+A +    + ++   F   Y FN +
Sbjct: 245 GALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTL 304

Query: 181 LWALCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGC 240
           +  LCK G    A++   +M ++G   +V +Y +++ G  +          LDQM +  C
Sbjct: 305 VNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDC 364

Query: 241 KPNVITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAE 300
            PN +TY  +I  LC + ++ EA    +++  +G  PD+ T+N +I+ LCL      A E
Sbjct: 365 SPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAME 424

Query: 301 ILQVIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLC 360
           + + +  +G  PD +TY  L   +   GK D A  +L  +    C   V+ YN  +   C
Sbjct: 425 LFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFC 484

Query: 361 QNTRSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVI 420
           +  ++REA  +   M+  G++  +V+YNT++ G C+  ++E A +L++        PD  
Sbjct: 485 KANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKY 544

Query: 421 SFNTVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYM 480
           ++N++L+  C+ G++     ++  M   G EPD+++   L+  L   GR     KLL  +
Sbjct: 545 TYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSI 604

Query: 481 VCNGPAPSSVTFNILLDKLCRSGFISTAYQIF-EDLQNAGLLLDRKTYSILLRAL----- 540
              G   +   +N ++  L R    + A  +F E L+      D  +Y I+ R L     
Sbjct: 605 QMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGG 664

Query: 541 -LRKCDDNLIERL 544
            +R+  D L+E L
Sbjct: 665 PIREAVDFLVELL 674

BLAST of Bhi04G000840 vs. TAIR 10
Match: AT3G07290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 201.4 bits (511), Expect = 2.0e-51
Identity = 133/458 (29.04%), Positives = 216/458 (47.16%), Query Frame = 0

Query: 146 LCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFIALQFFYMMKKK-GMTH 205
           LCK G  E A    + IL   F   +++   +L   C+      AL+ F +M K+     
Sbjct: 205 LCKNGYTEAAEMFMSKILKIGFVLDSHIGTSLLLGFCRGLNLRDALKVFDVMSKEVTCAP 264

Query: 206 NVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLCDDGRIGEAFYFL 265
           N  SY+ L++G          +   DQM   GC+P+  TYTV+IK LCD G I +AF   
Sbjct: 265 NSVSYSILIHGLCEVGRLEEAFGLKDQMGEKGCQPSTRTYTVLIKALCDRGLIDKAFNLF 324

Query: 266 KLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAYTYTALAGGIMKV 325
             M   GC P++ TY ++I  LC   K  +A  + + +      P   TY AL  G  K 
Sbjct: 325 DEMIPRGCKPNVHTYTVLIDGLCRDGKIEEANGVCRKMVKDRIFPSVITYNALINGYCKD 384

Query: 326 GKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSMKEGGIAPTTVSY 385
           G+   A+ELL  +  R C  +V  +N  +  LC+  +  +A+ LLK M + G++P  VSY
Sbjct: 385 GRVVPAFELLTVMEKRACKPNVRTFNELMEGLCRVGKPYKAVHLLKRMLDNGLSPDIVSY 444

Query: 386 NTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNLVLIRRVLHCMEF 445
           N ++ G CR+  +  A KLL      +  PD ++F  +++A CK G   +    L  M  
Sbjct: 445 NVLIDGLCREGHMNTAYKLLSSMNCFDIEPDCLTFTAIINAFCKQGKADVASAFLGLMLR 504

Query: 446 RGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNILLDKLCRSGFIST 505
           +G+  D ++ T L+  +  +G+  + L +LE +V      +  + N++LD L +   +  
Sbjct: 505 KGISLDEVTGTTLIDGVCKVGKTRDALFILETLVKMRILTTPHSLNVILDMLSKGCKVKE 564

Query: 506 AYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLIERLLQDMYKQRLSPDLSIYGSNTN 565
              +   +   GL+    TY+ L+  L+R  D     R+L+ M      P++  Y    N
Sbjct: 565 ELAMLGKINKLGLVPSVVTYTTLVDGLIRSGDITGSFRILELMKLSGCLPNVYPYTIIIN 624

Query: 566 DICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKAVV 603
            +CQ G +  A         +G+SP+   Y  ++K  V
Sbjct: 625 GLCQFGRVEEAEKLLSAMQDSGVSPNHVTYTVMVKGYV 662

BLAST of Bhi04G000840 vs. TAIR 10
Match: AT1G63080.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 201.1 bits (510), Expect = 2.6e-51
Identity = 139/563 (24.69%), Positives = 251/563 (44.58%), Query Frame = 0

Query: 10  LAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTF 69
           L EA++    M      PS+ +F+ L            + ++     M+  G + N  T+
Sbjct: 46  LDEAVDLFGEMVKSRPFPSIVEFSKLLSAIAKMKK--FDLVISFGEKMEILGVSHNLYTY 105

Query: 70  NILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHSIWIFKLMLR 129
           NI++N L     L  A     +M K G  PS  +L+ LL        +  ++ +   M+ 
Sbjct: 106 NIMINCLCRRSQLSFALAILGKMMKLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVE 165

Query: 130 LNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFI 189
           + + P   T    V  L +     EA +L   ++ K        +  V+  LCK G+  +
Sbjct: 166 MGYQPDTVTFTTLVHGLFQHNKASEAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDL 225

Query: 190 ALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIK 249
           AL     M+K  +  +V  Y+ ++    + R          +M + G +P+V TY+ +I 
Sbjct: 226 ALNLLNKMEKGKIEADVVIYSTVIDSLCKYRHVDDALNLFTEMDNKGIRPDVFTYSSLIS 285

Query: 250 FLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSP 309
            LC+ GR  +A   L  M     +P++VT+N +I A     K  +A ++   +  R   P
Sbjct: 286 CLCNYGRWSDASRLLSDMLERKINPNVVTFNSLIDAFAKEGKLIEAEKLFDEMIQRSIDP 345

Query: 310 DAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLL 369
           +  TY +L  G     + D A ++   + S++C  DVV YN  ++  C+  +  + + L 
Sbjct: 346 NIVTYNSLINGFCMHDRLDEAQQIFTLMVSKDCLPDVVTYNTLINGFCKAKKVVDGMELF 405

Query: 370 KSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKL 429
           + M   G+   TV+Y T++ GF +    ++A  + +        P+++++NT+L   CK 
Sbjct: 406 RDMSRRGLVGNTVTYTTLIHGFFQASDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKN 465

Query: 430 GNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTF 489
           G L     V   ++   +EPD+ +   + + +   G+  +   L   +   G  P  + +
Sbjct: 466 GKLEKAMVVFEYLQKSKMEPDIYTYNIMSEGMCKAGKVEDGWDLFCSLSLKGVKPDVIAY 525

Query: 490 NILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLIERLLQDMYK 549
           N ++   C+ G    AY +F  ++  G L D  TY+ L+RA LR  D      L+++M  
Sbjct: 526 NTMISGFCKKGLKEEAYTLFIKMKEDGPLPDSGTYNTLIRAHLRDGDKAASAELIKEMRS 585

Query: 550 QRLSPDLSIYGSNTNDICQDGNI 573
            R + D S YG  T D+  DG +
Sbjct: 586 CRFAGDASTYGLVT-DMLHDGRL 605

BLAST of Bhi04G000840 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 212.2 bits (539), Expect = 1.6e-53
Identity = 131/459 (28.54%), Positives = 223/459 (48.58%), Query Frame = 0

Query: 73  LNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHSIWIFKLMLRLNH 132
           L  ++  G L + + F E M   G  P     + L++   + G    +  I +++     
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 133 LPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFIALQ 192
           +P   T  + +   CKAG +  A S+   +   S +     +N +L +LC  GK   A++
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 193 FFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLC 252
               M ++    +V +YT L+    R+    H    LD+MR  GC P+V+TY V++  +C
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 253 DDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAY 312
            +GR+ EA  FL  M   GC P+++T+NII+R++C   +  DA ++L  +  +GFSP   
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 313 TYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSM 372
           T+  L   + + G    A ++L  +    C  + + YN  LH  C+  +   A+  L+ M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 373 KEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNL 432
              G  P  V+YNT+L   C+D K+E A+++L         P +I++NTV+    K G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 433 VLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNIL 492
               ++L  M  + ++PD I+ + LV  LS  G+  E +K        G  P++VTFN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 493 LDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRAL 532
           +  LC+S     A      + N G   +  +Y+IL+  L
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGL 564

BLAST of Bhi04G000840 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 2.7e-53
Identity = 142/565 (25.13%), Positives = 253/565 (44.78%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNV-AGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKR 60
           M++Y  +G   +    +  MRNV + +P+   +N +    L SGN   +    V+  M  
Sbjct: 153 MRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVL-EILVSGN-CHKVAANVFYDMLS 212

Query: 61  FGPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVH 120
               P   TF +++    ++  +  A     +M+K G  P+      L+    K   +  
Sbjct: 213 RKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNE 272

Query: 121 SIWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLW 180
           ++ + + M  +  +P   T    +  LCK   + EA  +   +L + F      +  ++ 
Sbjct: 273 ALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMN 332

Query: 181 ALCKCGKSFIALQFFY--------------------------------MMKKKGMTHNVC 240
            LCK G+   A   FY                                M+   G+  +VC
Sbjct: 333 GLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVC 392

Query: 241 SYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLCDDGRIGEAFYFLKLM 300
           +Y +L+YG+ +E L       L  MR+ GCKPNV +YT+++   C  G+I EA+  L  M
Sbjct: 393 TYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEM 452

Query: 301 EGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAYTYTALAGGIMKVGKS 360
             +G  P+ V +N +I A C   +  +A EI + +  +G  PD YT+ +L  G+ +V + 
Sbjct: 453 SADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVDEI 512

Query: 361 DIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSMKEGGIAPTTVSYNTV 420
             A  LL ++ S     + V YN  ++   +    +EA  L+  M   G     ++YN++
Sbjct: 513 KHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLVNEMVFQGSPLDEITYNSL 572

Query: 421 LRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNLVLIRRVLHCMEFRGV 480
           ++G CR  +++ A  L E        P  IS N +++  C+ G +         M  RG 
Sbjct: 573 IKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLRGS 632

Query: 481 EPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNILLDKLCRSGFISTAYQ 533
            PD+++   L+  L   GR  + L +   +   G  P +VTFN L+  LC+ GF+  A  
Sbjct: 633 TPDIVTFNSLINGLCRAGRIEDGLTMFRKLQAEGIPPDTVTFNTLMSWLCKGGFVYDACL 692

BLAST of Bhi04G000840 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 7.9e-53
Identity = 145/553 (26.22%), Positives = 268/553 (48.46%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAG-KPSVYDFNALFHRYLSSGNVLLEPLVQV-YIGMK 60
           +++YA      E L  +++M +  G KP  + +N + +  L  GN L   LV++ +  M 
Sbjct: 125 IESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLN-LLVDGNSL--KLVEISHAKMS 184

Query: 61  RFGPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLV 120
            +G  P+ +TFN+L+  L     LR A    E+M   G+ P   + + +++  ++ G L 
Sbjct: 185 VWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLD 244

Query: 121 HSIWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNF-QAYVFNPV 180
            ++ I + M+      +  ++ + V   CK G +E+A +    + ++   F   Y FN +
Sbjct: 245 GALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTL 304

Query: 181 LWALCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGC 240
           +  LCK G    A++   +M ++G   +V +Y +++ G  +          LDQM +  C
Sbjct: 305 VNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDC 364

Query: 241 KPNVITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAE 300
            PN +TY  +I  LC + ++ EA    +++  +G  PD+ T+N +I+ LCL      A E
Sbjct: 365 SPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAME 424

Query: 301 ILQVIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLC 360
           + + +  +G  PD +TY  L   +   GK D A  +L  +    C   V+ YN  +   C
Sbjct: 425 LFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFC 484

Query: 361 QNTRSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVI 420
           +  ++REA  +   M+  G++  +V+YNT++ G C+  ++E A +L++        PD  
Sbjct: 485 KANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKY 544

Query: 421 SFNTVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYM 480
           ++N++L+  C+ G++     ++  M   G EPD+++   L+  L   GR     KLL  +
Sbjct: 545 TYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSI 604

Query: 481 VCNGPAPSSVTFNILLDKLCRSGFISTAYQIF-EDLQNAGLLLDRKTYSILLRAL----- 540
              G   +   +N ++  L R    + A  +F E L+      D  +Y I+ R L     
Sbjct: 605 QMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGG 664

Query: 541 -LRKCDDNLIERL 544
            +R+  D L+E L
Sbjct: 665 PIREAVDFLVELL 674

BLAST of Bhi04G000840 vs. ExPASy Swiss-Prot
Match: Q9SFV9 (Pentatricopeptide repeat-containing protein At3g07290, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g07290 PE=2 SV=1)

HSP 1 Score: 201.4 bits (511), Expect = 2.8e-50
Identity = 133/458 (29.04%), Positives = 216/458 (47.16%), Query Frame = 0

Query: 146 LCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFIALQFFYMMKKK-GMTH 205
           LCK G  E A    + IL   F   +++   +L   C+      AL+ F +M K+     
Sbjct: 205 LCKNGYTEAAEMFMSKILKIGFVLDSHIGTSLLLGFCRGLNLRDALKVFDVMSKEVTCAP 264

Query: 206 NVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIKFLCDDGRIGEAFYFL 265
           N  SY+ L++G          +   DQM   GC+P+  TYTV+IK LCD G I +AF   
Sbjct: 265 NSVSYSILIHGLCEVGRLEEAFGLKDQMGEKGCQPSTRTYTVLIKALCDRGLIDKAFNLF 324

Query: 266 KLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSPDAYTYTALAGGIMKV 325
             M   GC P++ TY ++I  LC   K  +A  + + +      P   TY AL  G  K 
Sbjct: 325 DEMIPRGCKPNVHTYTVLIDGLCRDGKIEEANGVCRKMVKDRIFPSVITYNALINGYCKD 384

Query: 326 GKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLLKSMKEGGIAPTTVSY 385
           G+   A+ELL  +  R C  +V  +N  +  LC+  +  +A+ LLK M + G++P  VSY
Sbjct: 385 GRVVPAFELLTVMEKRACKPNVRTFNELMEGLCRVGKPYKAVHLLKRMLDNGLSPDIVSY 444

Query: 386 NTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKLGNLVLIRRVLHCMEF 445
           N ++ G CR+  +  A KLL      +  PD ++F  +++A CK G   +    L  M  
Sbjct: 445 NVLIDGLCREGHMNTAYKLLSSMNCFDIEPDCLTFTAIINAFCKQGKADVASAFLGLMLR 504

Query: 446 RGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTFNILLDKLCRSGFIST 505
           +G+  D ++ T L+  +  +G+  + L +LE +V      +  + N++LD L +   +  
Sbjct: 505 KGISLDEVTGTTLIDGVCKVGKTRDALFILETLVKMRILTTPHSLNVILDMLSKGCKVKE 564

Query: 506 AYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLIERLLQDMYKQRLSPDLSIYGSNTN 565
              +   +   GL+    TY+ L+  L+R  D     R+L+ M      P++  Y    N
Sbjct: 565 ELAMLGKINKLGLVPSVVTYTTLVDGLIRSGDITGSFRILELMKLSGCLPNVYPYTIIIN 624

Query: 566 DICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKAVV 603
            +CQ G +  A         +G+SP+   Y  ++K  V
Sbjct: 625 GLCQFGRVEEAEKLLSAMQDSGVSPNHVTYTVMVKGYV 662

BLAST of Bhi04G000840 vs. ExPASy Swiss-Prot
Match: Q9CAN5 (Pentatricopeptide repeat-containing protein At1g63080, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g63080 PE=2 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 3.7e-50
Identity = 139/563 (24.69%), Positives = 251/563 (44.58%), Query Frame = 0

Query: 10  LAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRFGPTPNKTTF 69
           L EA++    M      PS+ +F+ L            + ++     M+  G + N  T+
Sbjct: 46  LDEAVDLFGEMVKSRPFPSIVEFSKLLSAIAKMKK--FDLVISFGEKMEILGVSHNLYTY 105

Query: 70  NILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHSIWIFKLMLR 129
           NI++N L     L  A     +M K G  PS  +L+ LL        +  ++ +   M+ 
Sbjct: 106 NIMINCLCRRSQLSFALAILGKMMKLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVE 165

Query: 130 LNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWALCKCGKSFI 189
           + + P   T    V  L +     EA +L   ++ K        +  V+  LCK G+  +
Sbjct: 166 MGYQPDTVTFTTLVHGLFQHNKASEAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDL 225

Query: 190 ALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPNVITYTVIIK 249
           AL     M+K  +  +V  Y+ ++    + R          +M + G +P+V TY+ +I 
Sbjct: 226 ALNLLNKMEKGKIEADVVIYSTVIDSLCKYRHVDDALNLFTEMDNKGIRPDVFTYSSLIS 285

Query: 250 FLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQVIHHRGFSP 309
            LC+ GR  +A   L  M     +P++VT+N +I A     K  +A ++   +  R   P
Sbjct: 286 CLCNYGRWSDASRLLSDMLERKINPNVVTFNSLIDAFAKEGKLIEAEKLFDEMIQRSIDP 345

Query: 310 DAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNTRSREALSLL 369
           +  TY +L  G     + D A ++   + S++C  DVV YN  ++  C+  +  + + L 
Sbjct: 346 NIVTYNSLINGFCMHDRLDEAQQIFTLMVSKDCLPDVVTYNTLINGFCKAKKVVDGMELF 405

Query: 370 KSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFNTVLSAACKL 429
           + M   G+   TV+Y T++ GF +    ++A  + +        P+++++NT+L   CK 
Sbjct: 406 RDMSRRGLVGNTVTYTTLIHGFFQASDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKN 465

Query: 430 GNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCNGPAPSSVTF 489
           G L     V   ++   +EPD+ +   + + +   G+  +   L   +   G  P  + +
Sbjct: 466 GKLEKAMVVFEYLQKSKMEPDIYTYNIMSEGMCKAGKVEDGWDLFCSLSLKGVKPDVIAY 525

Query: 490 NILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLIERLLQDMYK 549
           N ++   C+ G    AY +F  ++  G L D  TY+ L+RA LR  D      L+++M  
Sbjct: 526 NTMISGFCKKGLKEEAYTLFIKMKEDGPLPDSGTYNTLIRAHLRDGDKAASAELIKEMRS 585

Query: 550 QRLSPDLSIYGSNTNDICQDGNI 573
            R + D S YG  T D+  DG +
Sbjct: 586 CRFAGDASTYGLVT-DMLHDGRL 605
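
As a quick consistency check, the percentage reported in an HSP summary can be recomputed from its match counts. The sketch below is illustrative only: it assumes the identity field has the form shown for the Q9CAN5 hit above, and the helper name `parse_identity` is hypothetical, not part of any BLAST tool.

```python
import re

# Identity field copied from the Q9CAN5 HSP summary above.
hsp_line = "Identity = 139/563 (24.69%)"


def parse_identity(line):
    """Return (matches, aligned_length, reported_percent) from an identity field."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)", line)
    if m is None:
        raise ValueError("no identity field found")
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


matches, length, reported = parse_identity(hsp_line)

# Percent identity = identical positions / alignment length (gaps included).
recomputed = round(100.0 * matches / length, 2)

print(matches, length, reported, recomputed)
```

For this hit, 139 identical positions over an alignment length of 563 recompute to 24.69%, which agrees with the reported value.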

BLAST of Bhi04G000840 vs. ExPASy TrEMBL
Match: A0A6J1KPX6 (pentatricopeptide repeat-containing protein At3g53700, chloroplastic-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111496160 PE=4 SV=1)

HSP 1 Score: 997.7 bits (2578), Expect = 2.2e-287
Identity = 486/604 (80.46%), Positives = 537/604 (88.91%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRF 60
           MQNYAASGDL EALETLNFM+NVAGKPS+YD+NALFHRYLSSGNV LE LVQVYIGMK F
Sbjct: 54  MQNYAASGDLPEALETLNFMKNVAGKPSIYDYNALFHRYLSSGNVSLEQLVQVYIGMKNF 113

Query: 61  GPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHS 120
           GP+PN+TTFNILLNG +SLGYLRDAYFFAEEM+KSG+NPSFTSLSKLLK SMKSG+LV S
Sbjct: 114 GPSPNRTTFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKSGNLVDS 173

Query: 121 IWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWA 180
           IWIFK MLRL+HLPTEPT+AMF+CMLCKA MLEEA+  CA ++SK+ NFQAYVFNPVLWA
Sbjct: 174 IWIFKFMLRLDHLPTEPTVAMFICMLCKARMLEEAYRFCAKLISKNLNFQAYVFNPVLWA 233

Query: 181 LCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPN 240
           LCKCGKS +ALQ FYMMKK G+ HNVCSYTALLYGFGRE LWV LY  LDQMRSDGCKPN
Sbjct: 234 LCKCGKSSLALQLFYMMKKNGIAHNVCSYTALLYGFGRECLWVDLYSFLDQMRSDGCKPN 293

Query: 241 VITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQ 300
           V+TYTVIIKFLCDDGRI EAF  LK ME EGCDPDLVTYNIIIRALCL+D+A D  E+LQ
Sbjct: 294 VVTYTVIIKFLCDDGRIVEAFEILKSMEIEGCDPDLVTYNIIIRALCLYDRACDVLELLQ 353

Query: 301 VIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNT 360
           +IH RGFSPD YTY ALAGGIMKVGK+ IAYELL  VF+RNCTVDVVVYNIY HCLC+N 
Sbjct: 354 LIHRRGFSPDPYTYAALAGGIMKVGKTKIAYELLRKVFTRNCTVDVVVYNIYFHCLCRNN 413

Query: 361 RSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFN 420
           RSREA SLLKSM +GGI PTTVSYNTVLRGFCRD++++HALKLLECFEW ESGPDV+SFN
Sbjct: 414 RSREAFSLLKSMTKGGIVPTTVSYNTVLRGFCRDNEIQHALKLLECFEWPESGPDVVSFN 473

Query: 421 TVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCN 480
           TVLSAACKLG+LVLI+RVL  ME +GVEPDV SLTCLV+YLST+GRYSEC +LLEYM+CN
Sbjct: 474 TVLSAACKLGDLVLIQRVLQYMECKGVEPDVRSLTCLVRYLSTVGRYSECWRLLEYMICN 533

Query: 481 GPAPSSVTFNILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLI 540
           GP PSSVTFNI LDKLCR+GF S AYQIFE +Q AGL LDRKTY+ILLR+ LRK D NL+
Sbjct: 534 GPVPSSVTFNIFLDKLCRNGFTSKAYQIFERIQKAGLSLDRKTYNILLRSFLRKRDINLV 593

Query: 541 ERLLQDMYKQRLSPDLSIYGSNTNDICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKA 600
           E L+QDMYKQRL PDL IYGS  + +CQ+GNISTALFTR +TLGNGL+PSMEM NR LK 
Sbjct: 594 ECLIQDMYKQRLDPDLFIYGSKISGLCQEGNISTALFTRDRTLGNGLTPSMEMCNRSLKT 653

Query: 601 VVPK 605
           V+ K
Sbjct: 654 VMHK 657

BLAST of Bhi04G000840 vs. ExPASy TrEMBL
Match: A0A6J1GI38 (putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454134 PE=4 SV=1)

HSP 1 Score: 986.9 bits (2550), Expect = 3.8e-284
Identity = 480/604 (79.47%), Positives = 534/604 (88.41%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRF 60
           MQNYAASGDL EALETLNFM+NVAGKPSVYD+NALFHRYLSSGNV LE LVQVYIGMK F
Sbjct: 54  MQNYAASGDLPEALETLNFMKNVAGKPSVYDYNALFHRYLSSGNVSLEQLVQVYIGMKNF 113

Query: 61  GPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHS 120
           GP+PN+TTFNILLNG +SLGYLRDAYFFAEEM+KSG+NPSFTSLSKLLK SMKSG++V S
Sbjct: 114 GPSPNRTTFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKSGNVVDS 173

Query: 121 IWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWA 180
           IWIFK MLRL+HLPTEPT+AMF+CMLCKA MLEEA+  CA ++SK+ NFQAYVFNPVLWA
Sbjct: 174 IWIFKFMLRLDHLPTEPTVAMFICMLCKARMLEEAYRFCAKLISKNLNFQAYVFNPVLWA 233

Query: 181 LCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPN 240
           LCKCG S +ALQ FYMMKK G+ HNVCSYTALLYGFGRE LWV LY  L QMRSDGCKPN
Sbjct: 234 LCKCGNSSLALQLFYMMKKNGIPHNVCSYTALLYGFGRECLWVDLYSFLHQMRSDGCKPN 293

Query: 241 VITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQ 300
           V+TYTVIIKFLCDDGRI EAF  LK ME EGCDPDLVTYNIIIRALCL+D+  D  E+LQ
Sbjct: 294 VVTYTVIIKFLCDDGRIVEAFEILKSMEIEGCDPDLVTYNIIIRALCLYDRTCDVVELLQ 353

Query: 301 VIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNT 360
           ++H RGFSPD YTY ALAGGIMKVGK++IAYELL  VF+RNCTVDVVVYNIY HCLC+N 
Sbjct: 354 LVHRRGFSPDPYTYAALAGGIMKVGKTEIAYELLRKVFTRNCTVDVVVYNIYFHCLCRNN 413

Query: 361 RSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFN 420
           RSREA SLLKSM +GGI PTTVSYNTVLRGFCRD++++HALKLLECFEW ESGPDV+SFN
Sbjct: 414 RSREAFSLLKSMTKGGIVPTTVSYNTVLRGFCRDNEIQHALKLLECFEWPESGPDVVSFN 473

Query: 421 TVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCN 480
           TVLSAACKLG+LVLI+RVL  ME +GVEPDV SLTCLV+YLST+GRYSEC +LLEYM+CN
Sbjct: 474 TVLSAACKLGDLVLIQRVLQYMECKGVEPDVRSLTCLVRYLSTVGRYSECWRLLEYMICN 533

Query: 481 GPAPSSVTFNILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLI 540
           G  PSSVTFNI LDKLCR+GF S AYQIFE +Q AGL LDRKTY+ILLR+ LRK D +L+
Sbjct: 534 GSVPSSVTFNIFLDKLCRNGFTSKAYQIFERIQKAGLSLDRKTYNILLRSFLRKRDIDLV 593

Query: 541 ERLLQDMYKQRLSPDLSIYGSNTNDICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKA 600
           ERL+QDMYKQRL PDL IYGS  + +CQ+GNISTALFTRG+TLGNGL+ SME  NR LK 
Sbjct: 594 ERLIQDMYKQRLDPDLFIYGSKISGLCQEGNISTALFTRGRTLGNGLTLSMETCNRSLKT 653

Query: 601 VVPK 605
           V+ K
Sbjct: 654 VIHK 657

BLAST of Bhi04G000840 vs. ExPASy TrEMBL
Match: A0A6J1KNF9 (pentatricopeptide repeat-containing protein At1g64583, mitochondrial-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111496160 PE=4 SV=1)

HSP 1 Score: 921.8 bits (2381), Expect = 1.5e-264
Identity = 458/604 (75.83%), Positives = 505/604 (83.61%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRF 60
           MQNYAASGDL EALETLNFM+NVAGKPS+YD+NALFHRYLSSGNV LE LVQVYIGMK F
Sbjct: 54  MQNYAASGDLPEALETLNFMKNVAGKPSIYDYNALFHRYLSSGNVSLEQLVQVYIGMKNF 113

Query: 61  GPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHS 120
           GP+PN+TTFNILLNG +SLGYLRDAYFFAEEM+KSG+NPSFTSLSKLLK SMKS      
Sbjct: 114 GPSPNRTTFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKS------ 173

Query: 121 IWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWA 180
                                       A MLEEA+  CA ++SK+ NFQAYVFNPVLWA
Sbjct: 174 ----------------------------ARMLEEAYRFCAKLISKNLNFQAYVFNPVLWA 233

Query: 181 LCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPN 240
           LCKCGKS +ALQ FYMMKK G+ HNVCSYTALLYGFGRE LWV LY  LDQMRSDGCKPN
Sbjct: 234 LCKCGKSSLALQLFYMMKKNGIAHNVCSYTALLYGFGRECLWVDLYSFLDQMRSDGCKPN 293

Query: 241 VITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQ 300
           V+TYTVIIKFLCDDGRI EAF  LK ME EGCDPDLVTYNIIIRALCL+D+A D  E+LQ
Sbjct: 294 VVTYTVIIKFLCDDGRIVEAFEILKSMEIEGCDPDLVTYNIIIRALCLYDRACDVLELLQ 353

Query: 301 VIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNT 360
           +IH RGFSPD YTY ALAGGIMKVGK+ IAYELL  VF+RNCTVDVVVYNIY HCLC+N 
Sbjct: 354 LIHRRGFSPDPYTYAALAGGIMKVGKTKIAYELLRKVFTRNCTVDVVVYNIYFHCLCRNN 413

Query: 361 RSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFN 420
           RSREA SLLKSM +GGI PTTVSYNTVLRGFCRD++++HALKLLECFEW ESGPDV+SFN
Sbjct: 414 RSREAFSLLKSMTKGGIVPTTVSYNTVLRGFCRDNEIQHALKLLECFEWPESGPDVVSFN 473

Query: 421 TVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCN 480
           TVLSAACKLG+LVLI+RVL  ME +GVEPDV SLTCLV+YLST+GRYSEC +LLEYM+CN
Sbjct: 474 TVLSAACKLGDLVLIQRVLQYMECKGVEPDVRSLTCLVRYLSTVGRYSECWRLLEYMICN 533

Query: 481 GPAPSSVTFNILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLI 540
           GP PSSVTFNI LDKLCR+GF S AYQIFE +Q AGL LDRKTY+ILLR+ LRK D NL+
Sbjct: 534 GPVPSSVTFNIFLDKLCRNGFTSKAYQIFERIQKAGLSLDRKTYNILLRSFLRKRDINLV 593

Query: 541 ERLLQDMYKQRLSPDLSIYGSNTNDICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKA 600
           E L+QDMYKQRL PDL IYGS  + +CQ+GNISTALFTR +TLGNGL+PSMEM NR LK 
Sbjct: 594 ECLIQDMYKQRLDPDLFIYGSKISGLCQEGNISTALFTRDRTLGNGLTPSMEMCNRSLKT 623

Query: 601 VVPK 605
           V+ K
Sbjct: 654 VMHK 623

BLAST of Bhi04G000840 vs. ExPASy TrEMBL
Match: A0A6J1GI66 (pentatricopeptide repeat-containing protein At1g09900-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454134 PE=4 SV=1)

HSP 1 Score: 912.1 bits (2356), Expect = 1.2e-261
Identity = 453/604 (75.00%), Positives = 502/604 (83.11%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRF 60
           MQNYAASGDL EALETLNFM+NVAGKPSVYD+NALFHRYLSSGNV LE LVQVYIGMK F
Sbjct: 54  MQNYAASGDLPEALETLNFMKNVAGKPSVYDYNALFHRYLSSGNVSLEQLVQVYIGMKNF 113

Query: 61  GPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHS 120
           GP+PN+TTFNILLNG +SLGYLRDAYFFAEEM+KSG+NPSFTSLSKLLK SMKS      
Sbjct: 114 GPSPNRTTFNILLNGFLSLGYLRDAYFFAEEMTKSGMNPSFTSLSKLLKSSMKS------ 173

Query: 121 IWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWA 180
                                       A MLEEA+  CA ++SK+ NFQAYVFNPVLWA
Sbjct: 174 ----------------------------ARMLEEAYRFCAKLISKNLNFQAYVFNPVLWA 233

Query: 181 LCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPN 240
           LCKCG S +ALQ FYMMKK G+ HNVCSYTALLYGFGRE LWV LY  L QMRSDGCKPN
Sbjct: 234 LCKCGNSSLALQLFYMMKKNGIPHNVCSYTALLYGFGRECLWVDLYSFLHQMRSDGCKPN 293

Query: 241 VITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQ 300
           V+TYTVIIKFLCDDGRI EAF  LK ME EGCDPDLVTYNIIIRALCL+D+  D  E+LQ
Sbjct: 294 VVTYTVIIKFLCDDGRIVEAFEILKSMEIEGCDPDLVTYNIIIRALCLYDRTCDVVELLQ 353

Query: 301 VIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNT 360
           ++H RGFSPD YTY ALAGGIMKVGK++IAYELL  VF+RNCTVDVVVYNIY HCLC+N 
Sbjct: 354 LVHRRGFSPDPYTYAALAGGIMKVGKTEIAYELLRKVFTRNCTVDVVVYNIYFHCLCRNN 413

Query: 361 RSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFN 420
           RSREA SLLKSM +GGI PTTVSYNTVLRGFCRD++++HALKLLECFEW ESGPDV+SFN
Sbjct: 414 RSREAFSLLKSMTKGGIVPTTVSYNTVLRGFCRDNEIQHALKLLECFEWPESGPDVVSFN 473

Query: 421 TVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMVCN 480
           TVLSAACKLG+LVLI+RVL  ME +GVEPDV SLTCLV+YLST+GRYSEC +LLEYM+CN
Sbjct: 474 TVLSAACKLGDLVLIQRVLQYMECKGVEPDVRSLTCLVRYLSTVGRYSECWRLLEYMICN 533

Query: 481 GPAPSSVTFNILLDKLCRSGFISTAYQIFEDLQNAGLLLDRKTYSILLRALLRKCDDNLI 540
           G  PSSVTFNI LDKLCR+GF S AYQIFE +Q AGL LDRKTY+ILLR+ LRK D +L+
Sbjct: 534 GSVPSSVTFNIFLDKLCRNGFTSKAYQIFERIQKAGLSLDRKTYNILLRSFLRKRDIDLV 593

Query: 541 ERLLQDMYKQRLSPDLSIYGSNTNDICQDGNISTALFTRGQTLGNGLSPSMEMYNRLLKA 600
           ERL+QDMYKQRL PDL IYGS  + +CQ+GNISTALFTRG+TLGNGL+ SME  NR LK 
Sbjct: 594 ERLIQDMYKQRLDPDLFIYGSKISGLCQEGNISTALFTRGRTLGNGLTLSMETCNRSLKT 623

Query: 601 VVPK 605
           V+ K
Sbjct: 654 VIHK 623

BLAST of Bhi04G000840 vs. ExPASy TrEMBL
Match: A0A6J1BT58 (pentatricopeptide repeat-containing protein At1g09900-like OS=Momordica charantia OX=3673 GN=LOC111004610 PE=4 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 3.5e-229
Identity = 389/478 (81.38%), Positives = 431/478 (90.17%), Query Frame = 0

Query: 1   MQNYAASGDLAEALETLNFMRNVAGKPSVYDFNALFHRYLSSGNVLLEPLVQVYIGMKRF 60
           MQ+ AASGDLAEALETLNFMR++ GKPSVYD+NALF RYLSS NVLLE LVQVYIGMKRF
Sbjct: 54  MQDRAASGDLAEALETLNFMRSITGKPSVYDYNALFCRYLSSENVLLEQLVQVYIGMKRF 113

Query: 61  GPTPNKTTFNILLNGLMSLGYLRDAYFFAEEMSKSGINPSFTSLSKLLKISMKSGSLVHS 120
           GP PNKTTFNILLNGL+SLG+LRDAYFF EEM+KSGINPSFT LSK LK S+KSG+LV S
Sbjct: 114 GPAPNKTTFNILLNGLLSLGFLRDAYFFVEEMTKSGINPSFTFLSKWLKKSLKSGNLVDS 173

Query: 121 IWIFKLMLRLNHLPTEPTLAMFVCMLCKAGMLEEAFSLCAAILSKSFNFQAYVFNPVLWA 180
           IWIF+ MLRL+HLPTEPTLAMF+C+LCK+ MLEEA   CAA+LSK+  FQAYVFNP++WA
Sbjct: 174 IWIFEFMLRLDHLPTEPTLAMFICLLCKSKMLEEASRFCAALLSKNLTFQAYVFNPIIWA 233

Query: 181 LCKCGKSFIALQFFYMMKKKGMTHNVCSYTALLYGFGRERLWVHLYCCLDQMRSDGCKPN 240
           LCK GKSF+ALQ FYMMKKKGMTHNVCSYTALLYGFGRE LWV LY CLDQMRSDG KPN
Sbjct: 234 LCKSGKSFLALQLFYMMKKKGMTHNVCSYTALLYGFGRECLWVDLYRCLDQMRSDGFKPN 293

Query: 241 VITYTVIIKFLCDDGRIGEAFYFLKLMEGEGCDPDLVTYNIIIRALCLHDKAYDAAEILQ 300
           VITYTVI+KFLCDDGRIGEAF  LK ME EGCDPDLVTYN+IIRALCLHD+AYD AE+LQ
Sbjct: 294 VITYTVIVKFLCDDGRIGEAFEILKFMEREGCDPDLVTYNVIIRALCLHDRAYDVAELLQ 353

Query: 301 VIHHRGFSPDAYTYTALAGGIMKVGKSDIAYELLCNVFSRNCTVDVVVYNIYLHCLCQNT 360
           VIH+RGFSPDAYTY+AL+GGIMKVGKS IAYELLC VFS+NCTVD+VVYNIY HCLCQN+
Sbjct: 354 VIHNRGFSPDAYTYSALSGGIMKVGKSQIAYELLCYVFSKNCTVDIVVYNIYFHCLCQNS 413

Query: 361 RSREALSLLKSMKEGGIAPTTVSYNTVLRGFCRDHKLEHALKLLECFEWLESGPDVISFN 420
           RSREALSLL  M  GGI PT VSYNT+LRGFC+ +K+E+ALKL E FEW +SGPDV+SFN
Sbjct: 414 RSREALSLLNRMAGGGIVPTIVSYNTILRGFCKYYKVEYALKLFEYFEWPKSGPDVVSFN 473

Query: 421 TVLSAACKLGNLVLIRRVLHCMEFRGVEPDVISLTCLVQYLSTMGRYSECLKLLEYMV 479
           TVLSAACK G+ VLI+RVLH ME+RGVEPDVISLTCLV YLSTMGR+SE L+LLE+MV
Sbjct: 474 TVLSAACKQGDFVLIQRVLHYMEYRGVEPDVISLTCLVHYLSTMGRFSESLRLLEFMV 531

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
AT1G09900.1 | 1.1e-54 | 28.54 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G64320.1 | 1.9e-54 | 25.13 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G53700.1 | 5.6e-54 | 26.22 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G07290.1 | 2.0e-51 | 29.04 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G63080.1 | 2.6e-51 | 24.69 | Pentatricopeptide repeat (PPR) superfamily protein

Match Name | E-value | Identity (%) | Description
Q3EDF8 | 1.6e-53 | 28.54 | Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...
Q9FMF6 | 2.7e-53 | 25.13 | Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...
Q9LFF1 | 7.9e-53 | 26.22 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q9SFV9 | 2.8e-50 | 29.04 | Pentatricopeptide repeat-containing protein At3g07290, mitochondrial OS=Arabidop...
Q9CAN5 | 3.7e-50 | 24.69 | Pentatricopeptide repeat-containing protein At1g63080, mitochondrial OS=Arabidop...

Match Name | E-value | Identity (%) | Description
A0A6J1KPX6 | 2.2e-287 | 80.46 | pentatricopeptide repeat-containing protein At3g53700, chloroplastic-like isofor...
A0A6J1GI38 | 3.8e-284 | 79.47 | putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial is...
A0A6J1KNF9 | 1.5e-264 | 75.83 | pentatricopeptide repeat-containing protein At1g64583, mitochondrial-like isofor...
A0A6J1GI66 | 1.2e-261 | 75.00 | pentatricopeptide repeat-containing protein At1g09900-like isoform X2 OS=Cucurbi...
A0A6J1BT58 | 3.5e-229 | 81.38 | pentatricopeptide repeat-containing protein At1g09900-like OS=Momordica charanti...
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
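IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 1..168; e-value: 2.8E-25; score: 91.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 398..498; e-value: 8.3E-24; score: 85.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 174..293; e-value: 4.6E-31; score: 109.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 294..397; e-value: 1.8E-24; score: 88.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 499..605; e-value: 2.0E-11; score: 45.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 508..559; e-value: 1.8E-4; score: 21.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 26..74; e-value: 8.8E-4; score: 19.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 167..217; e-value: 4.3E-4; score: 20.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 449..498; e-value: 2.9E-11; score: 43.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 239..287; e-value: 1.0E-13; score: 51.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 345..393; e-value: 2.3E-14; score: 53.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 242..275; e-value: 3.1E-7; score: 28.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 523..555; e-value: 0.0031; score: 15.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 417..451; e-value: 6.2E-4; score: 17.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 487..520; e-value: 8.5E-5; score: 20.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 347..380; e-value: 7.5E-7; score: 26.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 68..100; e-value: 8.4E-4; score: 17.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 382..404; e-value: 4.3E-4; score: 18.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 208..241; e-value: 1.5E-6; score: 26.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 277..311; e-value: 6.4E-5; score: 20.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 144..159; e-value: 1.0; score: 9.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 417..447; e-value: 0.082; score: 13.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 450..484; score: 8.780059
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 205..239; score: 9.361008
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 380..414; score: 8.878711
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 65..99; score: 10.47906
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 240..274; score: 10.994242
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 345..379; score: 11.728648
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 275..309; score: 10.205028
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 170..204; score: 8.527949
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 415..449; score: 10.040608
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 520..554; score: 8.549871
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 485..519; score: 10.358486
None | No IPR available | PANTHER | PTHR47932 | ATPASE EXPRESSION PROTEIN 3 | coord: 1..601
None | No IPR available | PANTHER | PTHR47932:SF43 | OS04G0475800 PROTEIN | coord: 1..601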
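
The PROSITE PS51375 coordinates listed above can be checked for internal consistency. The sketch below is an illustration, not part of the InterPro analysis: it computes the length of each reported range and counts how many repeats start immediately after the previous one ends. The variable and function names are hypothetical, and the coordinates are copied by hand from the annotation.

```python
# PROSITE PS51375 (PPR) coordinate ranges copied from the InterPro annotation.
ppr_coords = [
    "450..484", "205..239", "380..414", "65..99", "240..274", "345..379",
    "275..309", "170..204", "415..449", "520..554", "485..519",
]


def parse_span(coord):
    """Convert a 'start..end' string into a (start, end) tuple of ints."""
    start, end = (int(part) for part in coord.split(".."))
    return start, end


# Sort the repeats by position along the polypeptide.
spans = sorted(parse_span(c) for c in ppr_coords)

# Coordinates are inclusive, so the length is end - start + 1.
lengths = [end - start + 1 for start, end in spans]

# Count neighbouring repeats that abut with no gap between them.
tandem_pairs = sum(
    1 for (_, prev_end), (next_start, _) in zip(spans, spans[1:])
    if next_start == prev_end + 1
)

print(len(spans), sorted(set(lengths)), tandem_pairs)
```

All 11 ranges span 35 residues, which matches the usual description of the PPR motif as a degenerate 35-amino-acid repeat, and 8 of the 10 neighbouring pairs are directly adjacent.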

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Bhi04M000840 | Bhi04M000840 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding