Bhi02G000191 (gene) Wax gourd

Name: Bhi02G000191
Type: gene
Organism: Benincasa hispida (Wax gourd)
Description: Pentatricopeptide repeat-containing protein
Location: chr2 : 5310331 .. 5314085 (-)
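
The span covered by the feature can be recomputed from the Location field as a quick check. The sketch below assumes the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page); the `parse_location` helper is hypothetical and written only for this illustration.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr2 : 5310331 .. 5314085 (-)")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 3755 bp on the minus strand of chr2.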
The following sequences are available for this feature:

Gene sequence (with intron)

GAAACGATCTGCTTCGCTGGGCCGCCGGCTTCAGATGTTTAAGAAATAGTGAACTGCTAACTGCAGAGTCTAATCCAGTCCGTCCATGAAAATGTATTTCCGGCTTCTGCCTCTCTCATGTGGAATAATTCAAAGGTCTCGGCTCCAAGAAATTTGTACAATCTTGAACTCGGTTATTTTAGAATCAGAAATGTCGAAATTTGTGCATACCCAAGCGATGGATCTTCCGCCTCCGAGAACTAACGAGAGAAAGATTCCTGATTACAAGGATGCTCTCCACAAGGAAGGCAATGACGTGCGGAGGGACGGTTATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCTTATTTTCTCCAAGTTTCGTAGTTCTATTAATTCTCAGCTCTGTAACTCGATGATCAGGGGTTATTTGGATTTGAATAAGCATTTAAATTCACTCTACATTTTCGCCCACATGCATAAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTGCTGTTCTTAAAGCAACCGCACAGCTATGTGATACTGAAGTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATTCATGATGTCTACACAAGTACCGCTCTAGTTCACATGTATTGTGCCTGTTTGTCTATATCTGATGCTTCTCGGGTGTTCGATGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTCTTATTACTGGTTATACTCATAATAGAAAGTTTATGGAAGCTATCAATGCTTTTCGAGGCATGCTGGCAGCTGGAGCTGAACCGAGTGAGCGAACCATGGTGGTAGTTCTATCAGCTTGTTCTCATTTGGGAGCTCTGAATCAGGGAAAGTGGGTCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTATTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCTGAGAAGGTCTTTGAAGAAATTAGAGAGAAGAATGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGACAAGGCGATGCTGCTTTGGCGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGCATCTTGTGCGCATGCTGTCACCAAGGTCTGGTCACGGAAGGGCGCAGGCAATTCATGAGCATGAAACAACATTTTGGACTGCAGCCAAAGATAGAGCATTATGGGTGTATGGTTGACCTACTTGGTCGAGCAGGATTCTTGGATGAAGCTCTAGAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAACTAGAACCAAACAATGGTGAGAATTATGTCTTGTTGTCAAATCTTTACTCAAGGGAACAACGGTGGGCTGAAGTAGGGAAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTTGCTGCATCAAATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGAGTGAGAAATTGAAAGAAAATGGTTACGTTACAGGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTGGGTTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAATTTTTCAAGGTTGTATCGATTGTCTATCAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTCTGAAGGTTTTTGTTCGTGCCGAGACTACTGGTGAATAATTAAATGTTATAGAACTATTTCTTCGACATGATGGAAATCACGCAAATCATCTGGATCAACAT
GATACCATTGGCTTTTTGTGAAAAAATGAAGAATTCTGAAATTGGGCAGCCAAACTACTCAGCCATTGTTCAAGCCGCCATACTTAGTTGGCATGCCTAAGAAACAACTGCATTCCATACCAAACAAAATAGTACATTTACTAATTTGTAATATTTTATAGAACCCCAATTCAATTCTGAAGTTCTATTAAATGTTGGAGGATGGGAAATGGGTTCCATCAATGTTTGAGAGCCAAAGGGATCAGACGATAAAATGAACAAAATATTGGCATTCTGTTCCTTTTCCTTCGCAGGAGACTGCCAATATTGAGCTCTAGGATTCTGAATAGTTGGGGAGATCAACTTTGAAGTAATTTAATGAAGCTGTCAAGGAAAGCATAGCTCTCTGACCTCAATCCTGTGGATAAATCCATATCATATTTTTAGGGTAAGAAAAAGAATAAGAAGCTATGGGGGGTTTTCTGAAAAGAAGTGTCGGGAAAAGCAGTTTTTTGTTTTCAGGACTGATTATTAGTGTTTCTTGTTTTTTGTATAGTATGTTTTCACAATAGAGGTTGAACTCAAACTTATCTCTCTAAAATTCCTCATTACCAAAAGAAGAATGTACATCATTTTGGGAAAGGCTAAAAGTTAGCTTAGGGAGCTAAAATTCCTTTTCAAGTCCTCTTTGATAAAGAAAAGTTGGGTCTTATAAAAGATGCCACTTACGTTTAAATTCCTCTGTATGAGTCTTTGCGTGTCATTCATTTTCGGGAGATTCAGTCAGGAAGAAGAGATAAGATGCCAATGATTTAATTTATCTTTCCGCATCATGCTTTTACTTATTTTAGACTTAGGAATAGTGATTCTTGGAATTCTAAAAAAAAAAAAAAAAAAAAATGGTTATCGATAGAATATGTCATTTCCATCCACCAAAGTTACTAGAGAAAGCCTCTAAACATTTGGTTGAAATGAGGCATATCTAACTATTCAACTAAAACTACTTTTTGTTGGATTCAACCATATGCTGGAGCTTCTTTTGGCTATCTCTGATGCATATCCTCCTTGATACCTTGTAATTGCATTAATTTGCTTGTCTGCCGTGGAGTCCTGTTCACTTCTTGGCAATATAGTGTCACCATCTTACTGTAGAAATCCATATCCATTGACTGCAAGAACAAAAAGTCAGCAATTAAATACAGAAAGGAAGGGGGAAAAACCGTGAAATGCTTTTGCATCTTGAATTTGATCTTCAATGAATAGTATCTCTTTATATATTTTCATCGTTATTGTTGGTTGCACATTCATTTTATTCATGTCAGGTGTGTATATAACATCCATTTTTAGTCTAACCAAATGACTCTTTGGAAAACTGAATCGGCAACCGACTAAAACCAACTTTTGAATTCTGATCAATGTGTTACTGAAACCAACTAAACCGATCAATATAGTTTGATCAGTTCTCAGTTGCTTTTTTTATTTTTTGCATGTATATATGGTTCATGCTGGACTTACTTGTGCCAAAAAAGTGCAGAAAGGATCGGTTGGAGGGACAAAAGAACTGGTCGAAAATTTAGATTTTGGCTTTGTTGTTGAGATGATGGAAGGCAGTAGAATTGGAGGAAATGTAGCAGCACAAGGGAAGGAACTGGAGCTGGAAGCCATTGAAGCAGCATCGAGCGGTCCCATACGAGCTGCGAGCATTGACATTTGAAGCTGCTGCTGCTGCTGCTGTATTCCTAGAGGCATGGCCATCTGTGGCACAGCTGATCTAAT

mRNA sequence

GAAACGATCTGCTTCGCTGGGCCGCCGGCTTCAGATGTTTAAGAAATAGTGAACTGCTAACTGCAGAGTCTAATCCAGTCCGTCCATGAAAATGTATTTCCGGCTTCTGCCTCTCTCATGTGGAATAATTCAAAGGTCTCGGCTCCAAGAAATTTGTACAATCTTGAACTCGGTTATTTTAGAATCAGAAATGTCGAAATTTGTGCATACCCAAGCGATGGATCTTCCGCCTCCGAGAACTAACGAGAGAAAGATTCCTGATTACAAGGATGCTCTCCACAAGGAAGGCAATGACGTGCGGAGGGACGGTTATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCTTATTTTCTCCAAGTTTCGTAGTTCTATTAATTCTCAGCTCTGTAACTCGATGATCAGGGGTTATTTGGATTTGAATAAGCATTTAAATTCACTCTACATTTTCGCCCACATGCATAAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTGCTGTTCTTAAAGCAACCGCACAGCTATGTGATACTGAAGTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATTCATGATGTCTACACAAGTACCGCTCTAGTTCACATGTATTGTGCCTGTTTGTCTATATCTGATGCTTCTCGGGTGTTCGATGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTCTTATTACTGGTTATACTCATAATAGAAAGTTTATGGAAGCTATCAATGCTTTTCGAGGCATGCTGGCAGCTGGAGCTGAACCGAGTGAGCGAACCATGGTGGTAGTTCTATCAGCTTGTTCTCATTTGGGAGCTCTGAATCAGGGAAAGTGGGTCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTATTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCTGAGAAGGTCTTTGAAGAAATTAGAGAGAAGAATGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGACAAGGCGATGCTGCTTTGGCGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGCATCTTGTGCGCATGCTGTCACCAAGGTCTGGTCACGGAAGGGCGCAGGCAATTCATGAGCATGAAACAACATTTTGGACTGCAGCCAAAGATAGAGCATTATGGGTGTATGGTTGACCTACTTGGTCGAGCAGGATTCTTGGATGAAGCTCTAGAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAACTAGAACCAAACAATGGTGAGAATTATGTCTTGTTGTCAAATCTTTACTCAAGGGAACAACGGTGGGCTGAAGTAGGGAAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTTGCTGCATCAAATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGAGTGAGAAATTGAAAGAAAATGGTTACGTTACAGGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTGGGTTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAATTTTTCAAGGTTGTATCGATTGTCTATCAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTCTGAAGGTTTTTGTTCGTGCCGAGACTACTGGTGAATAATTAAATGTTATAGAACTATTTCTTCGACATGATGGAAATCACGCAAATCATCTGGATCAACAT
GATACCATTGGCTTTTTGTGAAAAAATGAAGAATTCTGAAATTGGGCAGCCAAACTACTCAGCCATTGTTCAAGCCGCCATACTTAGTTGGCATGCCTAAGAAACAACTGCATTCCATACCAAACAAAATAGTACATTTACTAATTTGTAATATTTTATAGAACCCCAATTCAATTCTGAAGTTCTATTAAATGTTGGAGGATGGGAAATGGGTTCCATCAATGTTTGAGAGCCAAAGGGATCAGACGATAAAATGAACAAAATATTGGCATTCTGTTCCTTTTCCTTCGCAGGAGACTGCCAATATTGAGCTCTAGGATTCTGAATAGTTGGGGAGATCAACTTTGAAGTAATTTAATGAAGCTGTCAAGGAAAGCATAGCTCTCTGACCTCAATCCTGTGGATAAATCCATATCATATTTTTAGGAAAGGATCGGTTGGAGGGACAAAAGAACTGGTCGAAAATTTAGATTTTGGCTTTGTTGTTGAGATGATGGAAGGCAGTAGAATTGGAGGAAATGTAGCAGCACAAGGGAAGGAACTGGAGCTGGAAGCCATTGAAGCAGCATCGAGCGGTCCCATACGAGCTGCGAGCATTGACATTTGAAGCTGCTGCTGCTGCTGCTGTATTCCTAGAGGCATGGCCATCTGTGGCACAGCTGATCTAAT

Coding sequence (CDS)

ATGAAAATGTATTTCCGGCTTCTGCCTCTCTCATGTGGAATAATTCAAAGGTCTCGGCTCCAAGAAATTTGTACAATCTTGAACTCGGTTATTTTAGAATCAGAAATGTCGAAATTTGTGCATACCCAAGCGATGGATCTTCCGCCTCCGAGAACTAACGAGAGAAAGATTCCTGATTACAAGGATGCTCTCCACAAGGAAGGCAATGACGTGCGGAGGGACGGTTATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCTTATTTTCTCCAAGTTTCGTAGTTCTATTAATTCTCAGCTCTGTAACTCGATGATCAGGGGTTATTTGGATTTGAATAAGCATTTAAATTCACTCTACATTTTCGCCCACATGCATAAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTGCTGTTCTTAAAGCAACCGCACAGCTATGTGATACTGAAGTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATTCATGATGTCTACACAAGTACCGCTCTAGTTCACATGTATTGTGCCTGTTTGTCTATATCTGATGCTTCTCGGGTGTTCGATGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTCTTATTACTGGTTATACTCATAATAGAAAGTTTATGGAAGCTATCAATGCTTTTCGAGGCATGCTGGCAGCTGGAGCTGAACCGAGTGAGCGAACCATGGTGGTAGTTCTATCAGCTTGTTCTCATTTGGGAGCTCTGAATCAGGGAAAGTGGGTCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTATTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCTGAGAAGGTCTTTGAAGAAATTAGAGAGAAGAATGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGACAAGGCGATGCTGCTTTGGCGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGCATCTTGTGCGCATGCTGTCACCAAGGTCTGGTCACGGAAGGGCGCAGGCAATTCATGAGCATGAAACAACATTTTGGACTGCAGCCAAAGATAGAGCATTATGGGTGTATGGTTGACCTACTTGGTCGAGCAGGATTCTTGGATGAAGCTCTAGAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAACTAGAACCAAACAATGGTGAGAATTATGTCTTGTTGTCAAATCTTTACTCAAGGGAACAACGGTGGGCTGAAGTAGGGAAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTTGCTGCATCAAATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGAGTGAGAAATTGAAAGAAAATGGTTACGTTACAGGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTGGGTTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAATTTTTCAAGGTTGTATCGATTGTCTATCAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTCTGAAGGTTTTTGTTCGTGCCGAGACTACTGGTGA

Protein sequence

MKMYFRLLPLSCGIIQRSRLQEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPDYKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW
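
The protein sequence above should follow from the CDS by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene but is not stated on this page; the `translate` function is an illustrative helper, not a tool used by the database. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in reading frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGAAAATGTATTTCCGG"))  # first six codons of the CDS
print(translate("GACTACTGGTGA"))        # last four codons, including the stop
```

The two fragments translate to `MKMYFR` and `DYW*`, matching the start and end of the protein sequence listed above.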
BLAST of Bhi02G000191 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 2.8e-127
Identity = 217/484 (44.83%), Positives = 320/484 (66.12%), Query Frame = 0

Query: 135 KFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYCACLSIS 194
           K ++ PD ST   V+ A AQ    E+G+ +H  +   GF  ++    AL+ +Y  C  + 
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 195 DASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSERTMVVVLSACS 254
            A  +F+ +P ++ ++WN LI GYTH   + EA+  F+ ML +G  P++ TM+ +L AC+
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACA 378

Query: 255 HLGALNQGKWVHEFIYHNRLR--LNV-FVGTALIDMYAKCGAVDEAEKVFEEIREKNVYT 314
           HLGA++ G+W+H +I   RL+   N   + T+LIDMYAKCG ++ A +VF  I  K++ +
Sbjct: 379 HLGAIDIGRWIHVYI-DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 438

Query: 315 WNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFMSMK 374
           WN +I G+AM+G+ DA+   FSRM     +PD++TF+G+L AC H G++  GR  F +M 
Sbjct: 439 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 498

Query: 375 QHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNTKLG 434
           Q + + PK+EHYGCM+DLLG +G   EA E+I  M MEPD +IW +LL AC++HGN +LG
Sbjct: 499 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 558

Query: 435 EYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEINNV 494
           E     LI++EP N  +YVLLSN+Y+   RW EV K R +++ +G+ KVPGCSSIEI++V
Sbjct: 559 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 618

Query: 495 VYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKL 554
           V+EF   +   P    IY  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKL
Sbjct: 619 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKL 678

Query: 555 ALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGFCSC 614
           A+AFGL+++  G  L IVKNLR+C +CHE  K++S +Y+R I+ RDR RFHHF +G CSC
Sbjct: 679 AIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSC 738

Query: 615 RDYW 616
            DYW
Sbjct: 739 NDYW 741
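
The percentages in the HSP header of this alignment are the match counts divided by the alignment length (484 here). A minimal sketch that recomputes them from the header text; the `hsp_percentages` helper and its regular expression are illustrative assumptions, not part of BLAST.

```python
import re

def hsp_percentages(header):
    """Return {label: percent} for each 'Label = n/m' pair in an HSP header line."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", header)
    return {label: round(100 * int(n) / int(m), 2) for label, n, m in pairs}

header = "Identity = 217/484 (44.83%), Positives = 320/484 (66.12%)"
print(hsp_percentages(header))
```

The recomputed values agree with the 44.83% and 66.12% reported in the header.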

BLAST of Bhi02G000191 vs. Swiss-Prot
Match: sp|Q8LK93|PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 437.6 bits (1124), Expect = 2.3e-121
Identity = 216/506 (42.69%), Positives = 312/506 (61.66%), Query Frame = 0

Query: 111 NSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQ 170
           NSM RGY      L    +F  + +  ILPD+ TFP++LKA A     E G+ +H   ++
Sbjct: 98  NSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMK 157

Query: 171 MGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINA 230
           +G   +VY    L++MY  C  +  A  VFD + E   V +NA+ITGY    +  EA++ 
Sbjct: 158 LGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSL 217

Query: 231 FRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAK 290
           FR M     +P+E T++ VLS+C+ LG+L+ GKW+H++   +     V V TALIDM+AK
Sbjct: 218 FREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAK 277

Query: 291 CGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGI 350
           CG++D+A  +FE++R K+   W+                           +PDE+TFLG+
Sbjct: 278 CGSLDDAVSIFEKMRYKDTQAWSXXXXXXXXXXXXXXXXXXXXXXXXXXXQPDEITFLGL 337

Query: 351 LCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEP 410
           L AC H G V EGR+ F  M   FG+ P I+HYG MVDLL RAG L++A E I  + + P
Sbjct: 338 LNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISP 397

Query: 411 DPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRG 470
            P++WR LL AC  H N  L E    R+ EL+ ++G +YV+LSNLY+R ++W  V  LR 
Sbjct: 398 TPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRK 457

Query: 471 MMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDM 530
           +M  R   KVPGCSSIE+NNVV+EF + +  K     +++ LD + ++LK +GYV  T M
Sbjct: 458 VMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSM 517

Query: 531 ALY-DIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVY 590
            ++ ++  +EKE ++ YHSEKLA+ FGLLN+P G T+R+VKNLR+C DCH   K++S+++
Sbjct: 518 VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIF 577

Query: 591 QRYIVVRDRNRFHHFSEGFCSCRDYW 616
            R +V+RD  RFHHF +G CSC D+W
Sbjct: 578 GRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Bhi02G000191 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 4.0e-121
Identity = 230/525 (43.81%), Positives = 320/525 (60.95%), Query Frame = 0

Query: 101 FRSSINSQL--CNSMIRGYLDLNKHLNSLYIFAHMHKFSILPD---SSTFPAVLKATAQL 160
           F   +N  L  CN+MIR +           +F  + + S LP    SS+F   LK   + 
Sbjct: 69  FSQRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSF--ALKCCIKS 128

Query: 161 CDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALI 220
            D   G  IHG +   GF+ D    T L+ +Y  C + +DA +VFDE+P+R+ V+WN L 
Sbjct: 129 GDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLF 188

Query: 221 TGYTHNRKFMEAINAFRGM---LAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHN 280
           + Y  N++  + +  F  M   +    +P   T ++ L AC++LGAL+ GK VH+FI  N
Sbjct: 189 SCYLRNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDEN 248

Query: 281 RLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAF 340
            L   + +   L+ MY++CG++D+A +VF  +RE+NV +W  LISG AMNG G  A+ AF
Sbjct: 249 GLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAF 308

Query: 341 SRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFMSMKQ-HFGLQPKIEHYGCMVDLLG 400
           + ML     P+E T  G+L AC H GLV EG   F  M+   F ++P + HYGC+VDLLG
Sbjct: 309 NEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLG 368

Query: 401 RAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVL 460
           RA  LD+A  LI+SM M+PD  IWR LL ACRVHG+ +LGE  I  LIEL+     +YVL
Sbjct: 369 RARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVL 428

Query: 461 LSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQ 520
           L N YS   +W +V +LR +M  + I   PGCS+IE+   V+EF   +   P  E IYK 
Sbjct: 429 LLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKM 488

Query: 521 LDNLSEKLKENGYVTGTDMALYDIE-KEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVK 580
           L  ++++LK  GYV      L+++E +EEK +++ YHSEKLA+AFG+L +P G T+R+ K
Sbjct: 489 LAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTK 548

Query: 581 NLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW 616
           NLR C+DCH F K VS VY R ++VRDR+RFHHF  G CSC D+W
Sbjct: 549 NLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Bhi02G000191 vs. Swiss-Prot
Match: sp|Q9SUH6|PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 435.6 bits (1119), Expect = 8.9e-121
Identity = 214/505 (42.38%), Positives = 311/505 (61.58%), Query Frame = 0

Query: 111 NSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQ 170
           N+MI GY    +   SL +F  +        SST  +++  +  L    +   IHG  ++
Sbjct: 291 NAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHLM---LIYAIHGYCLK 350

Query: 171 MGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINA 230
             F+     STAL  +Y     I  A ++FDE PE++  +WNA+I+GYT N    +AI+ 
Sbjct: 351 SNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISL 410

Query: 231 FRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAK 290
           FR M  +   P+  T+  +LSAC+ LGAL+ GKWVH+ +       +++V TALI MYAK
Sbjct: 411 FREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAK 470

Query: 291 CGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGI 350
           CG++ EA ++F+ + +KN  TWN +ISGY ++GQG  AL  F  ML     P  VTFL +
Sbjct: 471 CGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCV 530

Query: 351 LCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEP 410
           L AC H GLV EG   F SM   +G +P ++HY CMVD+LGRAG L  AL+ I++MS+EP
Sbjct: 531 LYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEP 590

Query: 411 DPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRG 470
              +W  LL ACR+H +T L      +L EL+P+N   +VLLSN++S ++ + +   +R 
Sbjct: 591 GSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQ 650

Query: 471 MMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDM 530
               R + K PG + IEI    + F + +   P+ + IY++L+ L  K++E GY   T++
Sbjct: 651 TAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETEL 710

Query: 531 ALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQ 590
           AL+D+E+EE+E  V  HSE+LA+AFGL+ +  G  +RI+KNLR+CLDCH   K++S + +
Sbjct: 711 ALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITE 770

Query: 591 RYIVVRDRNRFHHFSEGFCSCRDYW 616
           R IVVRD NRFHHF +G CSC DYW
Sbjct: 771 RVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of Bhi02G000191 vs. Swiss-Prot
Match: sp|Q9C6T2|PPR68_ARATH (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 5.7e-120
Identity = 204/506 (40.32%), Positives = 322/506 (63.64%), Query Frame = 0

Query: 111 NSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQ 170
           N+MIRGY+++     +L  +  M +    PD+ T+P +LKA  +L     GK IHG V +
Sbjct: 101 NTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSIREGKQIHGQVFK 160

Query: 171 MGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINA 230
           +G   DV+   +L++MY  C  +  +S VF+++  + A +W+++++       + E +  
Sbjct: 161 LGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSARAGMGMWSECLLL 220

Query: 231 FRGMLA-AGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYA 290
           FRGM +    +  E  MV  L AC++ GALN G  +H F+  N   LN+ V T+L+DMY 
Sbjct: 221 FRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISELNIIVQTSLVDMYV 280

Query: 291 KCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLG 350
           KCG +D+A  +F+++ ++N  T++ +ISG A++G+G++AL  FS+M+ E  +PD V ++ 
Sbjct: 281 KCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESALRMFSKMIKEGLEPDHVVYVS 340

Query: 351 ILCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSME 410
           +L AC H GLV EGRR F  M +   ++P  EHYGC+VDLLGRAG L+EALE IQS+ +E
Sbjct: 341 VLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLLEEALETIQSIPIE 400

Query: 411 PDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLR 470
            + +IWR  L  CRV  N +LG+   + L++L  +N  +Y+L+SNLYS+ Q W +V + R
Sbjct: 401 KNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLYSQGQMWDDVARTR 460

Query: 471 GMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTD 530
             ++++G+ + PG S +E+    + F + +   P+ + IYK L  +  +LK  GY     
Sbjct: 461 TEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQMEWQLKFEGYSPDLT 520

Query: 531 MALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVY 590
             L ++++EEK+  +  HS+K+A+AFGLL +P G  ++I +NLR+C DCH + K +S++Y
Sbjct: 521 QILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRMCSDCHTYTKKISMIY 580

Query: 591 QRYIVVRDRNRFHHFSEGFCSCRDYW 616
           +R IVVRDRNRFH F  G CSC+DYW
Sbjct: 581 EREIVVRDRNRFHLFKGGTCSCKDYW 606

BLAST of Bhi02G000191 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 457.2 bits (1175), Expect = 1.6e-128
Identity = 217/484 (44.83%), Positives = 320/484 (66.12%), Query Frame = 0

Query: 135 KFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYCACLSIS 194
           K ++ PD ST   V+ A AQ    E+G+ +H  +   GF  ++    AL+ +Y  C  + 
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 195 DASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSERTMVVVLSACS 254
            A  +F+ +P ++ ++WN LI GYTH   + EA+  F+ ML +G  P++ TM+ +L AC+
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACA 378

Query: 255 HLGALNQGKWVHEFIYHNRLR--LNV-FVGTALIDMYAKCGAVDEAEKVFEEIREKNVYT 314
           HLGA++ G+W+H +I   RL+   N   + T+LIDMYAKCG ++ A +VF  I  K++ +
Sbjct: 379 HLGAIDIGRWIHVYI-DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 438

Query: 315 WNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFMSMK 374
           WN +I G+AM+G+ DA+   FSRM     +PD++TF+G+L AC H G++  GR  F +M 
Sbjct: 439 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 498

Query: 375 QHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNTKLG 434
           Q + + PK+EHYGCM+DLLG +G   EA E+I  M MEPD +IW +LL AC++HGN +LG
Sbjct: 499 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 558

Query: 435 EYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEINNV 494
           E     LI++EP N  +YVLLSN+Y+   RW EV K R +++ +G+ KVPGCSSIEI++V
Sbjct: 559 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 618

Query: 495 VYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKL 554
           V+EF   +   P    IY  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKL
Sbjct: 619 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKL 678

Query: 555 ALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGFCSC 614
           A+AFGL+++  G  L IVKNLR+C +CHE  K++S +Y+R I+ RDR RFHHF +G CSC
Sbjct: 679 AIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSC 738

Query: 615 RDYW 616
            DYW
Sbjct: 739 NDYW 741

BLAST of Bhi02G000191 vs. TAIR10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 437.6 bits (1124), Expect = 1.3e-122
Identity = 216/506 (42.69%), Positives = 312/506 (61.66%), Query Frame = 0

Query: 111 NSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQ 170
           NSM RGY      L    +F  + +  ILPD+ TFP++LKA A     E G+ +H   ++
Sbjct: 98  NSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMK 157

Query: 171 MGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINA 230
           +G   +VY    L++MY  C  +  A  VFD + E   V +NA+ITGY    +  EA++ 
Sbjct: 158 LGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSL 217

Query: 231 FRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAK 290
           FR M     +P+E T++ VLS+C+ LG+L+ GKW+H++   +     V V TALIDM+AK
Sbjct: 218 FREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAK 277

Query: 291 CGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGI 350
           CG++D+A  +FE++R K+   W+                           +PDE+TFLG+
Sbjct: 278 CGSLDDAVSIFEKMRYKDTQAWSXXXXXXXXXXXXXXXXXXXXXXXXXXXQPDEITFLGL 337

Query: 351 LCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEP 410
           L AC H G V EGR+ F  M   FG+ P I+HYG MVDLL RAG L++A E I  + + P
Sbjct: 338 LNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISP 397

Query: 411 DPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRG 470
            P++WR LL AC  H N  L E    R+ EL+ ++G +YV+LSNLY+R ++W  V  LR 
Sbjct: 398 TPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRK 457

Query: 471 MMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDM 530
           +M  R   KVPGCSSIE+NNVV+EF + +  K     +++ LD + ++LK +GYV  T M
Sbjct: 458 VMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSM 517

Query: 531 ALY-DIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVY 590
            ++ ++  +EKE ++ YHSEKLA+ FGLLN+P G T+R+VKNLR+C DCH   K++S+++
Sbjct: 518 VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIF 577

Query: 591 QRYIVVRDRNRFHHFSEGFCSCRDYW 616
            R +V+RD  RFHHF +G CSC D+W
Sbjct: 578 GRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of Bhi02G000191 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 436.8 bits (1122), Expect = 2.2e-122
Identity = 230/525 (43.81%), Positives = 320/525 (60.95%), Query Frame = 0

Query: 101 FRSSINSQL--CNSMIRGYLDLNKHLNSLYIFAHMHKFSILPD---SSTFPAVLKATAQL 160
           F   +N  L  CN+MIR +           +F  + + S LP    SS+F   LK   + 
Sbjct: 69  FSQRLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSF--ALKCCIKS 128

Query: 161 CDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALI 220
            D   G  IHG +   GF+ D    T L+ +Y  C + +DA +VFDE+P+R+ V+WN L 
Sbjct: 129 GDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLF 188

Query: 221 TGYTHNRKFMEAINAFRGM---LAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHN 280
           + Y  N++  + +  F  M   +    +P   T ++ L AC++LGAL+ GK VH+FI  N
Sbjct: 189 SCYLRNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDEN 248

Query: 281 RLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAF 340
            L   + +   L+ MY++CG++D+A +VF  +RE+NV +W  LISG AMNG G  A+ AF
Sbjct: 249 GLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAF 308

Query: 341 SRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFMSMKQ-HFGLQPKIEHYGCMVDLLG 400
           + ML     P+E T  G+L AC H GLV EG   F  M+   F ++P + HYGC+VDLLG
Sbjct: 309 NEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLG 368

Query: 401 RAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVL 460
           RA  LD+A  LI+SM M+PD  IWR LL ACRVHG+ +LGE  I  LIEL+     +YVL
Sbjct: 369 RARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVL 428

Query: 461 LSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQ 520
           L N YS   +W +V +LR +M  + I   PGCS+IE+   V+EF   +   P  E IYK 
Sbjct: 429 LLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKM 488

Query: 521 LDNLSEKLKENGYVTGTDMALYDIE-KEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVK 580
           L  ++++LK  GYV      L+++E +EEK +++ YHSEKLA+AFG+L +P G T+R+ K
Sbjct: 489 LAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTK 548

Query: 581 NLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW 616
           NLR C+DCH F K VS VY R ++VRDR+RFHHF  G CSC D+W
Sbjct: 549 NLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Bhi02G000191 vs. TAIR10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 435.6 bits (1119), Expect = 4.9e-122
Identity = 214/505 (42.38%), Positives = 311/505 (61.58%), Query Frame = 0

Query: 111 NSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQ 170
           N+MI GY    +   SL +F  +        SST  +++  +  L    +   IHG  ++
Sbjct: 291 NAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHLM---LIYAIHGYCLK 350

Query: 171 MGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINA 230
             F+     STAL  +Y     I  A ++FDE PE++  +WNA+I+GYT N    +AI+ 
Sbjct: 351 SNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISL 410

Query: 231 FRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAK 290
           FR M  +   P+  T+  +LSAC+ LGAL+ GKWVH+ +       +++V TALI MYAK
Sbjct: 411 FREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAK 470

Query: 291 CGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGI 350
           CG++ EA ++F+ + +KN  TWN +ISGY ++GQG  AL  F  ML     P  VTFL +
Sbjct: 471 CGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCV 530

Query: 351 LCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEP 410
           L AC H GLV EG   F SM   +G +P ++HY CMVD+LGRAG L  AL+ I++MS+EP
Sbjct: 531 LYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEP 590

Query: 411 DPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRG 470
              +W  LL ACR+H +T L      +L EL+P+N   +VLLSN++S ++ + +   +R 
Sbjct: 591 GSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQ 650

Query: 471 MMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDM 530
               R + K PG + IEI    + F + +   P+ + IY++L+ L  K++E GY   T++
Sbjct: 651 TAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETEL 710

Query: 531 ALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQ 590
           AL+D+E+EE+E  V  HSE+LA+AFGL+ +  G  +RI+KNLR+CLDCH   K++S + +
Sbjct: 711 ALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITE 770

Query: 591 RYIVVRDRNRFHHFSEGFCSCRDYW 616
           R IVVRD NRFHHF +G CSC DYW
Sbjct: 771 RVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of Bhi02G000191 vs. TAIR10
Match: AT1G31920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 433.0 bits (1112), Expect = 3.2e-121
Identity = 204/506 (40.32%), Positives = 322/506 (63.64%), Query Frame = 0

Query: 111 NSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQ 170
           N+MIRGY+++     +L  +  M +    PD+ T+P +LKA  +L     GK IHG V +
Sbjct: 101 NTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSIREGKQIHGQVFK 160

Query: 171 MGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINA 230
           +G   DV+   +L++MY  C  +  +S VF+++  + A +W+++++       + E +  
Sbjct: 161 LGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSARAGMGMWSECLLL 220

Query: 231 FRGMLA-AGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYA 290
           FRGM +    +  E  MV  L AC++ GALN G  +H F+  N   LN+ V T+L+DMY 
Sbjct: 221 FRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISELNIIVQTSLVDMYV 280

Query: 291 KCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLG 350
           KCG +D+A  +F+++ ++N  T++ +ISG A++G+G++AL  FS+M+ E  +PD V ++ 
Sbjct: 281 KCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESALRMFSKMIKEGLEPDHVVYVS 340

Query: 351 ILCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSME 410
           +L AC H GLV EGRR F  M +   ++P  EHYGC+VDLLGRAG L+EALE IQS+ +E
Sbjct: 341 VLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLLEEALETIQSIPIE 400

Query: 411 PDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLR 470
            + +IWR  L  CRV  N +LG+   + L++L  +N  +Y+L+SNLYS+ Q W +V + R
Sbjct: 401 KNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLYSQGQMWDDVARTR 460

Query: 471 GMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTD 530
             ++++G+ + PG S +E+    + F + +   P+ + IYK L  +  +LK  GY     
Sbjct: 461 TEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQMEWQLKFEGYSPDLT 520

Query: 531 MALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVY 590
             L ++++EEK+  +  HS+K+A+AFGLL +P G  ++I +NLR+C DCH + K +S++Y
Sbjct: 521 QILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRMCSDCHTYTKKISMIY 580

Query: 591 QRYIVVRDRNRFHHFSEGFCSCRDYW 616
           +R IVVRDRNRFH F  G CSC+DYW
Sbjct: 581 EREIVVRDRNRFHLFKGGTCSCKDYW 606

BLAST of Bhi02G000191 vs. TrEMBL
Match: tr|A0A0A0LUK8|A0A0A0LUK8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=4 SV=1)

HSP 1 Score: 1090.9 bits (2820), Expect = 0.0e+00
Identity = 543/616 (88.15%), Positives = 568/616 (92.21%), Query Frame = 0

Query: 1   MKMYFRLLPLSCGIIQRSRL-QEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPD 60
           MKMY RLLP S  II+RSR+ QEICTI N   LESEM KFVHTQAMDLP   TN  KIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLD 120
           Y        NDVRR G+FLMKLIDDSVS NGFESIA IFSK+R SINSQ CNSMIR YLD
Sbjct: 61  Y--------NDVRR-GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120

Query: 121 LNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYT 180
           LNKHLNSLYIFA MHKFSILPDSSTFPAVLKATAQLCDT VGKMIHG VIQMGFI DVYT
Sbjct: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180

Query: 181 STALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGA 240
           STALVH+YC CLSISDAS++FDEMPERNAVTWNALITGYTHNRKF++AI+AFRGMLA GA
Sbjct: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240

Query: 241 EPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300
           +PSERT+VVVLSACSHLGA NQGKW+HEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300

Query: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGL 360
           VFEEIREKNVYTWNVLISGYAMNGQGDAAL AFSRMLMENFKPDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360

Query: 361 VTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALL 420
           VTEGR QFMSMKQ FGLQP+IEHYGCMVDLLGRAG L+EALELIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRE+RWAEVGKLRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480

Query: 481 VPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEE 540
           VPGCSSIEINNVVYEF ASNDRKPEFEAIYKQLDNL +KLKENGYVTGTDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRN 600
           KEHSVMYHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKV+S+VY+RYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600

Query: 601 RFHHFSEGFCSCRDYW 616
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFYEGFCSCRDYW 607

BLAST of Bhi02G000191 vs. TrEMBL
Match: tr|A0A1S4DZH3|A0A1S4DZH3_CUCME (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103494017 PE=4 SV=1)

HSP 1 Score: 1048.5 bits (2710), Expect = 5.8e-303
Identity = 510/571 (89.32%), Positives = 537/571 (94.05%), Query Frame = 0

Query: 45  MDLPPPRTNERKIPDYKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSS 104
           MDLP   TN+RK PDY        NDVRR G+F+MKLIDDSVSHNGFESIA IFSK+R S
Sbjct: 1   MDLPFQETNDRKTPDY--------NDVRR-GHFVMKLIDDSVSHNGFESIARIFSKYRGS 60

Query: 105 INSQLCNSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMI 164
           INSQ CNSMIR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDTEVGKMI
Sbjct: 61  INSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMI 120

Query: 165 HGTVIQMGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKF 224
           HG VIQMGFI DVYTSTALVHMY  CLSISDAS+VFDEM ERNAVTWNALITGYTHNRKF
Sbjct: 121 HGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKF 180

Query: 225 MEAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTAL 284
           MEAI+AFRGMLAAGA+PSERT+V+VLSACSHLGALNQGKW+H+FIYHNRLRLNVFVGTAL
Sbjct: 181 MEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTAL 240

Query: 285 IDMYAKCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDE 344
           IDMYAKCGAVDE EKVFEEIREKNVYTWNVLISGYAMNGQGDAAL AFSRMLMENFKPDE
Sbjct: 241 IDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 300

Query: 345 VTFLGILCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQ 404
           VTFLG+LCACCHQGLVTEGRRQFMSMKQ FGLQP+IEHYGCMVDLLGRAG L+EALELIQ
Sbjct: 301 VTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 360

Query: 405 SMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAE 464
           SMSMEPDPIIWRALLCACRVHGNTKLGEY ++RL+ELEPNNGENYVLLSN+Y+RE+RWAE
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAE 420

Query: 465 VGKLRGMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGY 524
           VGKLRGMM+LRGI KVPGCSSIEINNVVYEF ASNDRKPE+EAIYKQLDNL +KLKENGY
Sbjct: 421 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGY 480

Query: 525 VTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKV 584
           VTGTDMALYD+EKEEKEHS+MYHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKV
Sbjct: 481 VTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 540

Query: 585 VSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VS+VY+RYIVVRDRNRFHHF EGFCSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 562

BLAST of Bhi02G000191 vs. TrEMBL
Match: tr|D7SI59|D7SI59_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_17s0000g06770 PE=4 SV=1)

HSP 1 Score: 790.4 bits (2040), Expect = 2.9e-225
Identity = 382/547 (69.84%), Positives = 449/547 (82.08%), Query Frame = 0

Query: 69  NDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLDLNKHLNSLY 128
           N +  D   LMKLID SVS +GF + AL+F++F   I+S LCNSMIR Y D NKHL+S++
Sbjct: 69  NYIPIDHLNLMKLIDFSVSSHGFAASALLFTQFYGFIDSDLCNSMIRCYTDSNKHLHSVF 128

Query: 129 IFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYC 188
           I+  M K  I PDSSTFP VLK+ AQLC  E+GK IH  +IQMGF  +VY STALV+MY 
Sbjct: 129 IYTQMWKNGIFPDSSTFPTVLKSVAQLCRQELGKAIHCCIIQMGFESNVYVSTALVNMYG 188

Query: 189 ACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSERTMVV 248
            C S+SDA +VFDE+P+RN V+WNALITGY HNR F + I+ FR M  AGA+P E TMV 
Sbjct: 189 TCSSVSDARQVFDEIPDRNIVSWNALITGYNHNRMFRKVIDVFREMQIAGAKPVEVTMVG 248

Query: 249 VLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIREKN 308
           VL AC+HLGALNQG+W+ ++I HNRLRLNVFVGTALIDMYAKCG VDEAEK+F+ +R KN
Sbjct: 249 VLLACAHLGALNQGRWIDDYIDHNRLRLNVFVGTALIDMYAKCGVVDEAEKIFKAMRVKN 308

Query: 309 VYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFM 368
           VYTWNVLISGYAMNG+G++AL AFSRM+ME FKPDEVTFLG+LCACCHQGLV EGR  F 
Sbjct: 309 VYTWNVLISGYAMNGRGESALQAFSRMIMEKFKPDEVTFLGVLCACCHQGLVNEGRTYFT 368

Query: 369 SMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNT 428
           SMK+ FGL+P+IEHYGCMVDLLGRAGFLDEA +LIQ+MSM+PDPIIWR LL ACR+HGN 
Sbjct: 369 SMKEEFGLRPRIEHYGCMVDLLGRAGFLDEAQQLIQAMSMQPDPIIWRELLGACRIHGNI 428

Query: 429 KLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEI 488
           +LGE+ I++L+ELEPNNGENYVLL+NLY+R+QRW +VG++R MM  R + KVPGCSSIEI
Sbjct: 429 QLGEFAIKKLLELEPNNGENYVLLANLYARDQRWDKVGEVREMMDCRRVRKVPGCSSIEI 488

Query: 489 NNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 548
           +NVVYEF  SN  KP FE +YK L ++++KLK  GYV  T MA YDIE+EEKEHS+MYHS
Sbjct: 489 DNVVYEFVVSNYIKPGFEEVYKLLADMNKKLKLAGYVADTGMASYDIEEEEKEHSLMYHS 548

Query: 549 EKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGF 608
           EKLALAFGLL SP G TLRIVKNLRIC DCH FFK+VS VY+R I VRDRNRFHHF  G 
Sbjct: 549 EKLALAFGLLKSPSGLTLRIVKNLRICQDCHGFFKIVSKVYRRDISVRDRNRFHHFVGGA 608

Query: 609 CSCRDYW 616
           CSC+DYW
Sbjct: 609 CSCKDYW 615

BLAST of Bhi02G000191 vs. TrEMBL
Match: tr|A0A2R6PCT2|A0A2R6PCT2_ACTCH (Pentatricopeptide repeat-containing protein OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc29396 PE=4 SV=1)

HSP 1 Score: 754.6 bits (1947), Expect = 1.7e-214
Identity = 359/540 (66.48%), Positives = 434/540 (80.37%), Query Frame = 0

Query: 76  YFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLDLNKHLNSLYIFAHMHK 135
           + LMKLI+ SVS  GF S A +F++F   I+S+LCNS+IR Y  LNKH++S++++  M K
Sbjct: 49  FSLMKLINSSVSSYGFASSAPLFAQFNHFIDSELCNSVIRSYTHLNKHVHSVFVYTQMCK 108

Query: 136 FSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTSTALVHMYCACLSISD 195
             I PDSSTFPAVLK+  +L   ++GK IH  V++MGF+ D+YT+TALVHMYC CL   +
Sbjct: 109 AGISPDSSTFPAVLKSVTKLGRGDIGKSIHCCVVKMGFVSDLYTNTALVHMYCTCLLPGE 168

Query: 196 ASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSERTMVVVLSACSH 255
             +VFD MPERNAV+WNALI+GY HNRKF EAI+AFR M AAGA+P E TMV VLSACSH
Sbjct: 169 GRQVFDVMPERNAVSWNALISGYAHNRKFREAIDAFRDMQAAGAKPGEVTMVGVLSACSH 228

Query: 256 LGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIREKNVYTWNVL 315
           LGALNQGKW+H++I  N+LRLNVFVGTALIDMYAKCG VDEA++VF  +R KNVYTWNVL
Sbjct: 229 LGALNQGKWIHDYIVRNKLRLNVFVGTALIDMYAKCGVVDEAQRVFGAVRVKNVYTWNVL 288

Query: 316 ISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLVTEGRRQFMSMKQHFG 375
           ISGYAMNGQG+AAL AF  M++EN++PDEVTFLGILCACCHQGLV  GRR   +MK+ +G
Sbjct: 289 ISGYAMNGQGEAALQAFETMIVENYRPDEVTFLGILCACCHQGLVEVGRRHLRNMKEEYG 348

Query: 376 LQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTI 435
           L P+IEHYGCMVDLLGRAG   EA EL+ +M+M+PDPIIWRA L ACR+HG+T+LGE  I
Sbjct: 349 LNPRIEHYGCMVDLLGRAGLFVEAQELMHTMNMKPDPIIWRAFLGACRIHGHTQLGETAI 408

Query: 436 RRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKVPGCSSIEINNVVYEF 495
           + LIELEP NGENY+LLSNLY+R+ +W+EVG++R +M+  GI KVPGCSSIEI N VYEF
Sbjct: 409 KNLIELEPENGENYILLSNLYARDHKWSEVGEVREIMNRGGIRKVPGCSSIEIENAVYEF 468

Query: 496 AASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAF 555
             SN   P +E +YK L N+  +LK  GY   TDM  YDIE+EEKE ++ YHSEKLALAF
Sbjct: 469 VVSNLMGPGYEELYKLLANVKRELKVAGYAEYTDMVSYDIEEEEKEQTLTYHSEKLALAF 528

Query: 556 GLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW 615
           GLLNS    TLRI+KNLRIC DCH+FFK+VS +Y+R I VRDRNRFHHFS G CSC+DYW
Sbjct: 529 GLLNSLPDTTLRILKNLRICQDCHQFFKLVSELYRRDITVRDRNRFHHFSGGVCSCKDYW 588

BLAST of Bhi02G000191 vs. TrEMBL
Match: tr|A0A1U8A8W5|A0A1U8A8W5_NELNU (pentatricopeptide repeat-containing protein At4g21065-like OS=Nelumbo nucifera OX=4432 GN=LOC104598183 PE=4 SV=1)

HSP 1 Score: 743.0 bits (1917), Expect = 5.3e-211
Identity = 361/614 (58.79%), Positives = 455/614 (74.10%), Query Frame = 0

Query: 2   KMYFRLLPLSCGIIQRSRLQEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPDYK 61
           + +F  L +S    +  +   ICT   S++ +  +      Q + L    T+   +  + 
Sbjct: 4   RSFFSSLGISTTSSRWIQFSSICTSSISLLTKCPVQMTQQQQPLFLLKSITDYNHMKQFL 63

Query: 62  DALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLDLN 121
              H   N++  D + LMKLID S S +G +  A +F++F+  INS++C SMIR +   N
Sbjct: 64  G--HIITNNIAIDEFSLMKLIDLSFSSSGSDVSAHLFTQFQDFINSEICTSMIRSFTHSN 123

Query: 122 KHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTST 181
           KH  S++++  MHK+  +PDSSTFPAVLK+TAQ+C    GK +H  + Q GF  DV+T+T
Sbjct: 124 KHFLSIFVYIMMHKYGYVPDSSTFPAVLKSTAQVCRRRFGKSVHAYIFQTGFNSDVFTNT 183

Query: 182 ALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAEP 241
           ALVHMY  C SI +A R+FDEMP +N+V+WNALITGYTHNRKF EAI+ FR M  +G EP
Sbjct: 184 ALVHMYATCTSIGEARRLFDEMPVKNSVSWNALITGYTHNRKFREAISTFREMQISGFEP 243

Query: 242 SERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVF 301
            E TMV VLSAC HLGALNQGKW+H++I   RLRLNVFVGTALIDMYAKCG VDEAEKVF
Sbjct: 244 GEVTMVGVLSACGHLGALNQGKWIHDYIVQKRLRLNVFVGTALIDMYAKCGVVDEAEKVF 303

Query: 302 EEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLVT 361
             +R KNVYTWNVLISG+ MNGQG+AAL AFSRM+MENFKPD VT L +LCACC QGL+ 
Sbjct: 304 GAMRVKNVYTWNVLISGFTMNGQGEAALQAFSRMVMENFKPDGVTLLAVLCACCRQGLIK 363

Query: 362 EGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLCA 421
           EGRR F+SM++ +GL+P IEHYGCMVDLLGRAGFL+EA ELI++M  +PDP++WRALL A
Sbjct: 364 EGRRYFVSMEKEYGLRPGIEHYGCMVDLLGRAGFLNEAQELIRTMPYKPDPVVWRALLGA 423

Query: 422 CRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKVP 481
           CR+HG+T+LGE  IR L+ LEPNNGENYVLLSNLY+R  RW +VG++R MM+ +GI K+P
Sbjct: 424 CRIHGSTQLGEVAIRNLLGLEPNNGENYVLLSNLYARGHRWTKVGEVRDMMNRKGIRKIP 483

Query: 482 GCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEKE 541
           GCSSIE+++ VYEF  SN    E   +Y  L ++  ++K  GYV  T+M  YDIE+EEKE
Sbjct: 484 GCSSIEVDDAVYEFVVSNSLDVELGEVYNMLADMKNEMKLAGYVAETEMVSYDIEEEEKE 543

Query: 542 HSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNRF 601
           +S+MYHSEKLALAFGLL +    T+RIVKNLRIC DCH F K+VS +Y+R IVVRDRN F
Sbjct: 544 NSLMYHSEKLALAFGLLKTSSDSTIRIVKNLRICKDCHGFCKIVSKIYKRNIVVRDRNLF 603

Query: 602 HHFSEGFCSCRDYW 616
           HHF+ G CSC+DYW
Sbjct: 604 HHFAGGLCSCKDYW 615

BLAST of Bhi02G000191 vs. NCBI nr
Match: XP_004138309.2 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis sativus] >KGN63701.1 hypothetical protein Csa_1G011530 [Cucumis sativus])

HSP 1 Score: 1090.9 bits (2820), Expect = 0.0e+00
Identity = 543/616 (88.15%), Positives = 568/616 (92.21%), Query Frame = 0

Query: 1   MKMYFRLLPLSCGIIQRSRL-QEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPD 60
           MKMY RLLP S  II+RSR+ QEICTI N   LESEM KFVHTQAMDLP   TN  KIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLD 120
           Y        NDVRR G+FLMKLIDDSVS NGFESIA IFSK+R SINSQ CNSMIR YLD
Sbjct: 61  Y--------NDVRR-GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120

Query: 121 LNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYT 180
           LNKHLNSLYIFA MHKFSILPDSSTFPAVLKATAQLCDT VGKMIHG VIQMGFI DVYT
Sbjct: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180

Query: 181 STALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGA 240
           STALVH+YC CLSISDAS++FDEMPERNAVTWNALITGYTHNRKF++AI+AFRGMLA GA
Sbjct: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240

Query: 241 EPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300
           +PSERT+VVVLSACSHLGA NQGKW+HEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300

Query: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGL 360
           VFEEIREKNVYTWNVLISGYAMNGQGDAAL AFSRMLMENFKPDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360

Query: 361 VTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALL 420
           VTEGR QFMSMKQ FGLQP+IEHYGCMVDLLGRAG L+EALELIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRE+RWAEVGKLRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480

Query: 481 VPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEE 540
           VPGCSSIEINNVVYEF ASNDRKPEFEAIYKQLDNL +KLKENGYVTGTDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRN 600
           KEHSVMYHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKV+S+VY+RYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600

Query: 601 RFHHFSEGFCSCRDYW 616
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFYEGFCSCRDYW 607

BLAST of Bhi02G000191 vs. NCBI nr
Match: XP_023529316.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1088.2 bits (2813), Expect = 0.0e+00
Identity = 530/617 (85.90%), Positives = 570/617 (92.38%), Query Frame = 0

Query: 1   MKMYFRLLPLSCGIIQRSRLQEICTILNSVIL--ESEMSKFVHTQAMDLPPPRTNERKIP 60
           MKM  R LP S  +I+R+RLQ+ CTI N   L  +S++S+FVHT+ M+LP     ERKIP
Sbjct: 1   MKMDLRFLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYL 120
           D  DA  KEGND+R DGYFLMKLI+DSVS+NGFESIALIFSKFR SINSQ+CNSMIRGYL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVY 180
           DLN+HLNSL IFAHMHKFSILPDSSTFPAVLKATAQLCD ++GKMIHG V+QMGFI DVY
Sbjct: 121 DLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 TSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAG 240
           TSTALVHMYC+CLSISDAS++FDEMPERN+VTWNALITGYTHNRKF EAINAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFKEAINAFRGMLAAG 240

Query: 241 AEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERT+VVVLSACSHLGALNQGKW+H+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQG 360
           KVFEEIR+KNVYTWNVLISGY MNGQGDAAL AFSRMLMENFKPD VTFLG+LCACCHQG
Sbjct: 301 KVFEEIRDKNVYTWNVLISGYGMNGQGDAALQAFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRAL 420
           LVTEGRRQF+SMKQ FGLQPKIEHYGCMVDLLGRAG L+EALELI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRE+RW EVGKLRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKE 540
           KVPGCSSIEINN V+EF ASNDRK EF AIYKQLDN+ +KLKENGYVTGTDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNSVHEFTASNDRKLEFNAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDR 600
           EKEHSVMYHSEKLALAFGLLNSPL CTLRIVKNLRIC DCHEFFKVVS+VY+RYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of Bhi02G000191 vs. NCBI nr
Match: XP_022925029.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata])

HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 527/617 (85.41%), Positives = 569/617 (92.22%), Query Frame = 0

Query: 1   MKMYFRLLPLSCGIIQRSRLQEICTILNSVIL--ESEMSKFVHTQAMDLPPPRTNERKIP 60
           MKM  RLLP S  +I+R+RLQ+ CTI N   L  +S++S+FVHT+ M+LP     ERKIP
Sbjct: 1   MKMDLRLLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYL 120
           D  DA  KEGND+R DGYFLMKLI+DSVS+NGFESIALIFSKFR SINSQ+CNSMIRGYL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVY 180
           D N+HLNSL IFAHMHKFSILPDSSTFPAVLKATAQLCD ++GKMIHG V+QMGFI DVY
Sbjct: 121 DSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 TSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAG 240
           TSTALVHMYC+CLSISDAS++FDEMPERN+VTWNALITGYTHNRKF EAINAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAAG 240

Query: 241 AEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERT+VVVLSACSHLGALNQGKW+H+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQG 360
           KVFEEIR++NVYTWNVLISGY MNGQG+AAL  FSRMLMENFKPD VTFLG+LCACCHQG
Sbjct: 301 KVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRAL 420
           LVTEGRRQF+SMKQ FGLQPKIEHYGCMVDLLGRAG L+EALELI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRE+RW EVGKLRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKE 540
           KVPGCSSIEINN V+EF ASNDRK EF AIYKQLDN+ +KLKENGYVTGTDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDR 600
           EKEHSVMYHSEKLALAFGLLNSPL CTLRIVKNLRIC DCHEFFKVVS+VY+RYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of Bhi02G000191 vs. NCBI nr
Match: XP_023003968.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima])

HSP 1 Score: 1050.8 bits (2716), Expect = 1.8e-303
Identity = 507/571 (88.79%), Positives = 538/571 (94.22%), Query Frame = 0

Query: 45  MDLPPPRTNERKIPDYKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSS 104
           M+LP     ERKIPD  DAL KEGND+R DGYFLMKLI+DSVS+NGFESIALIFSKFR S
Sbjct: 1   MNLPSQGGIERKIPDCLDALRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGS 60

Query: 105 INSQLCNSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMI 164
           INSQ+CNSMIRGYLDLN+HLNSL IFAHMHKFSILPDSSTFPAVLKATAQLCD ++GKMI
Sbjct: 61  INSQICNSMIRGYLDLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMI 120

Query: 165 HGTVIQMGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKF 224
           HG V+QMGFI DVYTSTALVHMYC+CLSISDAS++FDEMPERN+VTWNALITGYTHNRKF
Sbjct: 121 HGAVVQMGFIRDVYTSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKF 180

Query: 225 MEAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTAL 284
            EAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKW+H+FIY N+LRLNVFVGTAL
Sbjct: 181 KEAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTAL 240

Query: 285 IDMYAKCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDE 344
           IDMYAKCG V+EAEKVFEEIR+KNVYTWNVLISGY MNGQG+AAL AFSRMLMENFKPD 
Sbjct: 241 IDMYAKCGVVEEAEKVFEEIRDKNVYTWNVLISGYGMNGQGNAALQAFSRMLMENFKPDA 300

Query: 345 VTFLGILCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQ 404
           VTFLG+LCACCHQGLVTEGRRQF+SMKQ FGLQPKIEHYGCMVDLLGRAG L+EALELI+
Sbjct: 301 VTFLGLLCACCHQGLVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIE 360

Query: 405 SMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAE 464
           SMSMEPDPIIWRALLCACRVHGNTK+GEYTIRRLIELEPNNGENYVLLSNLYSRE+RW E
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKMGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIE 420

Query: 465 VGKLRGMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGY 524
           VGKLRGMMSLRGI KVPGCSSIEINN VYEF ASNDRK EF AIYKQLDN+ +KLKENGY
Sbjct: 421 VGKLRGMMSLRGIEKVPGCSSIEINNAVYEFTASNDRKLEFSAIYKQLDNVMKKLKENGY 480

Query: 525 VTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKV 584
           VTGTDM+L+DIEKEEKEHSVMYHSEKLALAFGLLNSPL CTLRIVKNLRIC DCHEFFKV
Sbjct: 481 VTGTDMSLFDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKV 540

Query: 585 VSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VS+VY+RYIVVRDRNRFHHFSE  CSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFSERVCSCRDYW 571

BLAST of Bhi02G000191 vs. NCBI nr
Match: XP_016901378.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1048.5 bits (2710), Expect = 8.8e-303
Identity = 510/571 (89.32%), Positives = 537/571 (94.05%), Query Frame = 0

Query: 45  MDLPPPRTNERKIPDYKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSS 104
           MDLP   TN+RK PDY        NDVRR G+F+MKLIDDSVSHNGFESIA IFSK+R S
Sbjct: 1   MDLPFQETNDRKTPDY--------NDVRR-GHFVMKLIDDSVSHNGFESIARIFSKYRGS 60

Query: 105 INSQLCNSMIRGYLDLNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMI 164
           INSQ CNSMIR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDTEVGKMI
Sbjct: 61  INSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMI 120

Query: 165 HGTVIQMGFIHDVYTSTALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKF 224
           HG VIQMGFI DVYTSTALVHMY  CLSISDAS+VFDEM ERNAVTWNALITGYTHNRKF
Sbjct: 121 HGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKF 180

Query: 225 MEAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTAL 284
           MEAI+AFRGMLAAGA+PSERT+V+VLSACSHLGALNQGKW+H+FIYHNRLRLNVFVGTAL
Sbjct: 181 MEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTAL 240

Query: 285 IDMYAKCGAVDEAEKVFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDE 344
           IDMYAKCGAVDE EKVFEEIREKNVYTWNVLISGYAMNGQGDAAL AFSRMLMENFKPDE
Sbjct: 241 IDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 300

Query: 345 VTFLGILCACCHQGLVTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQ 404
           VTFLG+LCACCHQGLVTEGRRQFMSMKQ FGLQP+IEHYGCMVDLLGRAG L+EALELIQ
Sbjct: 301 VTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 360

Query: 405 SMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAE 464
           SMSMEPDPIIWRALLCACRVHGNTKLGEY ++RL+ELEPNNGENYVLLSN+Y+RE+RWAE
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAE 420

Query: 465 VGKLRGMMSLRGIGKVPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGY 524
           VGKLRGMM+LRGI KVPGCSSIEINNVVYEF ASNDRKPE+EAIYKQLDNL +KLKENGY
Sbjct: 421 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGY 480

Query: 525 VTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKV 584
           VTGTDMALYD+EKEEKEHS+MYHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKV
Sbjct: 481 VTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 540

Query: 585 VSIVYQRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VS+VY+RYIVVRDRNRFHHF EGFCSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 562

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
sp|Q9LN01|PPR21_ARATH	2.8e-127	44.83	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|Q8LK93|PP145_ARATH	2.3e-121	42.69	Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
sp|Q9SN85|PP267_ARATH	4.0e-121	43.81	Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...
sp|Q9SUH6|PP341_ARATH	8.9e-121	42.38	Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX...
sp|Q9C6T2|PPR68_ARATH	5.7e-120	40.32	Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
AT1G08070.1	1.6e-128	44.83	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1	1.3e-122	42.69	Pentatricopeptide repeat (PPR) superfamily protein
AT3G47530.1	2.2e-122	43.81	Pentatricopeptide repeat (PPR) superfamily protein
AT4G30700.1	4.9e-122	42.38	Pentatricopeptide repeat (PPR) superfamily protein
AT1G31920.1	3.2e-121	40.32	Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name	E-value	Identity	Description
tr|A0A0A0LUK8|A0A0A0LUK8_CUCSA	0.0e+00	88.15	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=4 SV=1
tr|A0A1S4DZH3|A0A1S4DZH3_CUCME	5.8e-303	89.32	pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36...
tr|D7SI59|D7SI59_VITVI	2.9e-225	69.84	Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_17s0000g06770 PE=4 SV=...
tr|A0A2R6PCT2|A0A2R6PCT2_ACTCH	1.7e-214	66.48	Pentatricopeptide repeat-containing protein OS=Actinidia chinensis var. chinensi...
tr|A0A1U8A8W5|A0A1U8A8W5_NELNU	5.3e-211	58.79	pentatricopeptide repeat-containing protein At4g21065-like OS=Nelumbo nucifera O...

Match Name	E-value	Identity	Description
XP_004138309.2	0.0e+00	88.15	PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis s...
XP_023529316.1	0.0e+00	85.90	pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp...
XP_022925029.1	0.0e+00	85.41	pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata]
XP_023003968.1	1.8e-303	88.79	pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima]
XP_016901378.1	8.8e-303	89.32	PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR032867	DYW_dom
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Bhi02M000191	Bhi02M000191	mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 310..344, e-value: 7.4E-8, score: 30.1; coord: 282..309, e-value: 0.0014, score: 16.7; coord: 209..242, e-value: 6.5E-6, score: 24.0
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 307..356, e-value: 6.4E-11, score: 42.2; coord: 207..254, e-value: 2.0E-9, score: 37.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 382..407, e-value: 0.0026, score: 17.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 141..175, score: 5.086
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 343..378, score: 7.509
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 242..276, score: 5.371
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 277..307, score: 9.449
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 445..479, score: 6.018
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 176..206, score: 8.385
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 308..342, score: 11.794
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 411..441, score: 5.251
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 379..409, score: 7.585
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 106..140, score: 8.155
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 207..241, score: 11.246
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 63..173, e-value: 2.8E-5, score: 25.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 193..308, e-value: 3.4E-25, score: 91.0; coord: 309..492, e-value: 3.7E-33, score: 117.2
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 288..332; coord: 394..464
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 481..604, e-value: 1.3E-36, score: 125.2
None	No IPR available	PANTHER	PTHR24015:SF1602	SUBFAMILY NOT NAMED	coord: 184..488
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 184..488
None	No IPR available	PANTHER	PTHR24015:SF1602	SUBFAMILY NOT NAMED	coord: 100..218
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 100..218
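
The `coord` values in the InterPro table are 1-based, inclusive residue ranges on the 616-residue protein, so a domain's length is `end - start + 1`, and back-to-back ranges can be merged to see how much of the protein the repeats tile. The sketch below, in Python, works through that arithmetic for the eleven PROSITE PS51375 ranges and the DYW domain listed above. The helper names `merge_ranges` and `covered_length` are hypothetical, and treating directly adjacent ranges as one contiguous stretch is an assumption made for illustration.

```python
def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(ranges):
        # Adjacent means the new range starts right after the previous end.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(ranges):
    """Number of residues covered by at least one range."""
    return sum(end - start + 1 for start, end in merge_ranges(ranges))


# PROSITE PS51375 (PPR) coordinates, in the order listed in the table.
PPR_PROSITE = [
    (141, 175), (343, 378), (242, 276), (277, 307), (445, 479), (176, 206),
    (308, 342), (411, 441), (379, 409), (106, 140), (207, 241),
]
# PFAM PF14432 (DYW_deaminase) coordinates.
DYW = [(481, 604)]

print(merge_ranges(PPR_PROSITE))
print(covered_length(PPR_PROSITE), covered_length(DYW))
```

Under these assumptions the PPR profile hits collapse into three stretches, with single-residue or short gaps between them, followed by the C-terminal DYW domain.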

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Bhi02G000191	Wax gourd	wgowgoB121
Bhi02G000191	Wax gourd	wgowgoB125
Bhi02G000191	Cucumber (Gy14) v1	cgywgoB188
Bhi02G000191	Cucumber (Gy14) v1	cgywgoB585
Bhi02G000191	Cucumber (Gy14) v2	cgybwgoB107
Bhi02G000191	Cucumber (Gy14) v2	cgybwgoB092
Bhi02G000191	Cucurbita maxima (Rimu)	cmawgoB0631
Bhi02G000191	Cucurbita maxima (Rimu)	cmawgoB0916
Bhi02G000191	Cucurbita moschata (Rifu)	cmowgoB0081
Bhi02G000191	Cucurbita moschata (Rifu)	cmowgoB0631
Bhi02G000191	Cucurbita pepo (Zucchini)	cpewgoB0737
Bhi02G000191	Cucurbita moschata (Rifu)	cmowgoB0917
Bhi02G000191	Cucurbita pepo (Zucchini)	cpewgoB0062
Bhi02G000191	Cucurbita pepo (Zucchini)	cpewgoB0459
Bhi02G000191	Cucurbita pepo (Zucchini)	cpewgoB0581
Bhi02G000191	Cucurbita pepo (Zucchini)	cpewgoB0883
Bhi02G000191	Wild cucumber (PI 183967)	cpiwgoB116
Bhi02G000191	Wild cucumber (PI 183967)	cpiwgoB528
Bhi02G000191	Cucumber (Chinese Long) v3	cucwgoB103
Bhi02G000191	Cucumber (Chinese Long) v3	cucwgoB121
Bhi02G000191	Cucumber (Chinese Long) v3	cucwgoB537
Bhi02G000191	Cucumber (Chinese Long) v2	cuwgoB115
Bhi02G000191	Cucumber (Chinese Long) v2	cuwgoB525
Bhi02G000191	Bottle gourd (USVL1VR-Ls)	lsiwgoB291
Bhi02G000191	Bottle gourd (USVL1VR-Ls)	lsiwgoB408
Bhi02G000191	Melon (DHL92) v3.6.1	medwgoB090
Bhi02G000191	Melon (DHL92) v3.6.1	medwgoB194
Bhi02G000191	Melon (DHL92) v3.5.1	mewgoB089
Bhi02G000191	Melon (DHL92) v3.5.1	mewgoB192
Bhi02G000191	Melon (DHL92) v3.5.1	mewgoB291
Bhi02G000191	Watermelon (Charleston Gray)	wcgwgoB407
Bhi02G000191	Watermelon (Charleston Gray)	wcgwgoB487
Bhi02G000191	Watermelon (97103) v2	wgowmbB644
Bhi02G000191	Watermelon (97103) v2	wgowmbB647
Bhi02G000191	Watermelon (97103) v1	wgowmB671
Bhi02G000191	Watermelon (97103) v1	wgowmB666
Bhi02G000191	Silver-seed gourd	carwgoB0063