Bhi04G000073 (gene) Wax gourd

NameBhi04G000073
Typegene
OrganismBenincasa hispida (Wax gourd)
DescriptionPentatricopeptide repeat-containing protein
Locationchr4 : 2314805 .. 2316586 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TCAAGAACACAATACCCACTTCTTCTTCGGTCATTTCGCCTGACCCGTCAATGCCCAACTGCAGAAGCGCTTGTTTCAGCTTTATTGATGGCTGTAAACTCTTGCACTTCCATCTCCAATTGCCGGAAAATTCATGCCCGAGTAATCAAATCTTTTCTTTATAGAGAGGGTTTCATTGGGGATCAGCTGGTCACTTGTTATAATAAACTGGGTTATGTTGAAGATGCACAGAAGCTGTTTGATGATATGCCTGTTAAAGATTTGATTTCTTGGAACTCACTGATTTCTGGTTTTTCTTGGTGTCTTGATATTACTCTCGAAGCATTTTATACCATGAAGTTTGAAAGGTCAGTTAAACCCAATGAGATCACAATTCTATCGATGATATCGGCTTGCAATGGAGCTTTGGGTGCAGGGAAATATATTCATGGCTTTGCAATTAAAATTGGTGTTTCTTTCGAAGTTGAGGTCGTTAATTCTCTCATTAACATGTATGGAAAGTCTGAAGATTTAACATCAGCTTGTAGATTATTTGAGGCCATTCCAGACCCAAATACAGTATCTTGGAATTCAATCATTGCTGCCCAAGTTACCAATGGCTGTGCACGAGAAGGAATTGATTGTTTCAATAAGATGAGAAGGTTTGGAATTGAGCCGGATGAAGGAACTATCCTGGCACTGCTTCAAGCTTGCCTACAATTGAGTGTAGGAAAATTGGCTGAAAGCATTCATGGTTTAATCTTCTGCTCTGGTCTAGGCGCAAAAATCACCATAGCAACTGCACTTTTAGATTTGTATGCGAAATTAGGAAGATTAAGTGCTTCATGTGACGTCTTTAGGGAGGTGGGTTGTGCTGACAGAGTTGCTTGGACTGCCATGCTTGCAGGATATGCAGCACATGGATTAGGTAGAGAAGCAATCAAGCTTTTTGAGAGCATGACCAAGAAAGGTTTGGAGCCTGATCATGTGACTTTTATTCATTTGCTTAGCGCATGTAGTCATTCAGGGCTAGTCAGAGAGGGGAAAAGTTACTTCAGTATGATGTCTAAAGTGTATGGAATTGAGCCTAGGATGGATCATTATTCATGTATGGTTGATCTACTTGGTCGCCGCGGACTTTTGAACGATGCTTATGACGTGATACAAAACATGCCTATGGAGCCCAATGCTGGTGTGTGGAGTGCTCTTCTTGGTGCTTGTAGGGTCTATGGTAACATTGAACTTGGTAAGGAAGTTGCAGAGCATTTGATTAATTTGGAACCTTTGGACCCCAGAAACTATATCATGTTATCAAATATGTATTCTGCATCTCGTTCTTGGAAGGATGCTGCCAAAGTTAGGGCCTTGCTAAAGGAGAGAGGTCTGAAAAGAACCCCAGGATGTAGCTCCATTGAATATAGAAACAAGATCCACCGCTTCTTTGTGGGCGATCGTTCTCACTCTGAGACGGGTAAGATCTATTCCAAACTCGAAGAATTGCTTGGAAAAATAAGGCAAGCTGGATATAGTTCCAAAACAGAATATGTTCTGCAAAATGTTGAAGAAGAAGTCAAGGAGGATATGATAAACAAGCACAGCGAGAAGTTAGCCATTGCTTTCGGGATTTTGGTGAGTAAAGAAGGTGCACCTTTAATCATCACAAAGAATATCAGAATTTGTGGAGATTGTCATACCACTGCAAAGCTCATATCATTGATTGAGAAGCGTACCATTATTATCCGAGATCCAAAACGCTTTCATCATTTCTCTGATGGATTGTGTTCTTGTGCAGATTACTGGTAA

mRNA sequence

TCAAGAACACAATACCCACTTCTTCTTCGGTCATTTCGCCTGACCCGTCAATGCCCAACTGCAGAAGCGCTTGTTTCAGCTTTATTGATGGCTGTAAACTCTTGCACTTCCATCTCCAATTGCCGGAAAATTCATGCCCGAGTAATCAAATCTTTTCTTTATAGAGAGGGTTTCATTGGGGATCAGCTGGTCACTTGTTATAATAAACTGGGTTATGTTGAAGATGCACAGAAGCTGTTTGATGATATGCCTGTTAAAGATTTGATTTCTTGGAACTCACTGATTTCTGGTTTTTCTTGGTGTCTTGATATTACTCTCGAAGCATTTTATACCATGAAGTTTGAAAGGTCAGTTAAACCCAATGAGATCACAATTCTATCGATGATATCGGCTTGCAATGGAGCTTTGGGTGCAGGGAAATATATTCATGGCTTTGCAATTAAAATTGGTGTTTCTTTCGAAGTTGAGGTCGTTAATTCTCTCATTAACATGTATGGAAAGTCTGAAGATTTAACATCAGCTTGTAGATTATTTGAGGCCATTCCAGACCCAAATACAGTATCTTGGAATTCAATCATTGCTGCCCAAGTTACCAATGGCTGTGCACGAGAAGGAATTGATTGTTTCAATAAGATGAGAAGGTTTGGAATTGAGCCGGATGAAGGAACTATCCTGGCACTGCTTCAAGCTTGCCTACAATTGAGTGTAGGAAAATTGGCTGAAAGCATTCATGGTTTAATCTTCTGCTCTGGTCTAGGCGCAAAAATCACCATAGCAACTGCACTTTTAGATTTGTATGCGAAATTAGGAAGATTAAGTGCTTCATGTGACGTCTTTAGGGAGGTGGGTTGTGCTGACAGAGTTGCTTGGACTGCCATGCTTGCAGGATATGCAGCACATGGATTAGGTAGAGAAGCAATCAAGCTTTTTGAGAGCATGACCAAGAAAGGTTTGGAGCCTGATCATGTGACTTTTATTCATTTGCTTAGCGCATGTAGTCATTCAGGGCTAGTCAGAGAGGGGAAAAGTTACTTCAGTATGATGTCTAAAGTGTATGGAATTGAGCCTAGGATGGATCATTATTCATGTATGGTTGATCTACTTGGTCGCCGCGGACTTTTGAACGATGCTTATGACGTGATACAAAACATGCCTATGGAGCCCAATGCTGGTGTGTGGAGTGCTCTTCTTGGTGCTTGTAGGGTCTATGGTAACATTGAACTTGGTAAGGAAGTTGCAGAGCATTTGATTAATTTGGAACCTTTGGACCCCAGAAACTATATCATGTTATCAAATATGTATTCTGCATCTCGTTCTTGGAAGGATGCTGCCAAAGTTAGGGCCTTGCTAAAGGAGAGAGGTCTGAAAAGAACCCCAGGATGTAGCTCCATTGAATATAGAAACAAGATCCACCGCTTCTTTGTGGGCGATCGTTCTCACTCTGAGACGGGTAAGATCTATTCCAAACTCGAAGAATTGCTTGGAAAAATAAGGCAAGCTGGATATAGTTCCAAAACAGAATATGTTCTGCAAAATGTTGAAGAAGAAGTCAAGGAGGATATGATAAACAAGCACAGCGAGAAGTTAGCCATTGCTTTCGGGATTTTGGTGAGTAAAGAAGGTGCACCTTTAATCATCACAAAGAATATCAGAATTTGTGGAGATTGTCATACCACTGCAAAGCTCATATCATTGATTGAGAAGCGTACCATTATTATCCGAGATCCAAAACGCTTTCATCATTTCTCTGATGGATTGTGTTCTTGTGCAGATTACTGGTAA

Coding sequence (CDS)

TCAAGAACACAATACCCACTTCTTCTTCGGTCATTTCGCCTGACCCGTCAATGCCCAACTGCAGAAGCGCTTGTTTCAGCTTTATTGATGGCTGTAAACTCTTGCACTTCCATCTCCAATTGCCGGAAAATTCATGCCCGAGTAATCAAATCTTTTCTTTATAGAGAGGGTTTCATTGGGGATCAGCTGGTCACTTGTTATAATAAACTGGGTTATGTTGAAGATGCACAGAAGCTGTTTGATGATATGCCTGTTAAAGATTTGATTTCTTGGAACTCACTGATTTCTGGTTTTTCTTGGTGTCTTGATATTACTCTCGAAGCATTTTATACCATGAAGTTTGAAAGGTCAGTTAAACCCAATGAGATCACAATTCTATCGATGATATCGGCTTGCAATGGAGCTTTGGGTGCAGGGAAATATATTCATGGCTTTGCAATTAAAATTGGTGTTTCTTTCGAAGTTGAGGTCGTTAATTCTCTCATTAACATGTATGGAAAGTCTGAAGATTTAACATCAGCTTGTAGATTATTTGAGGCCATTCCAGACCCAAATACAGTATCTTGGAATTCAATCATTGCTGCCCAAGTTACCAATGGCTGTGCACGAGAAGGAATTGATTGTTTCAATAAGATGAGAAGGTTTGGAATTGAGCCGGATGAAGGAACTATCCTGGCACTGCTTCAAGCTTGCCTACAATTGAGTGTAGGAAAATTGGCTGAAAGCATTCATGGTTTAATCTTCTGCTCTGGTCTAGGCGCAAAAATCACCATAGCAACTGCACTTTTAGATTTGTATGCGAAATTAGGAAGATTAAGTGCTTCATGTGACGTCTTTAGGGAGGTGGGTTGTGCTGACAGAGTTGCTTGGACTGCCATGCTTGCAGGATATGCAGCACATGGATTAGGTAGAGAAGCAATCAAGCTTTTTGAGAGCATGACCAAGAAAGGTTTGGAGCCTGATCATGTGACTTTTATTCATTTGCTTAGCGCATGTAGTCATTCAGGGCTAGTCAGAGAGGGGAAAAGTTACTTCAGTATGATGTCTAAAGTGTATGGAATTGAGCCTAGGATGGATCATTATTCATGTATGGTTGATCTACTTGGTCGCCGCGGACTTTTGAACGATGCTTATGACGTGATACAAAACATGCCTATGGAGCCCAATGCTGGTGTGTGGAGTGCTCTTCTTGGTGCTTGTAGGGTCTATGGTAACATTGAACTTGGTAAGGAAGTTGCAGAGCATTTGATTAATTTGGAACCTTTGGACCCCAGAAACTATATCATGTTATCAAATATGTATTCTGCATCTCGTTCTTGGAAGGATGCTGCCAAAGTTAGGGCCTTGCTAAAGGAGAGAGGTCTGAAAAGAACCCCAGGATGTAGCTCCATTGAATATAGAAACAAGATCCACCGCTTCTTTGTGGGCGATCGTTCTCACTCTGAGACGGGTAAGATCTATTCCAAACTCGAAGAATTGCTTGGAAAAATAAGGCAAGCTGGATATAGTTCCAAAACAGAATATGTTCTGCAAAATGTTGAAGAAGAAGTCAAGGAGGATATGATAAACAAGCACAGCGAGAAGTTAGCCATTGCTTTCGGGATTTTGGTGAGTAAAGAAGGTGCACCTTTAATCATCACAAAGAATATCAGAATTTGTGGAGATTGTCATACCACTGCAAAGCTCATATCATTGATTGAGAAGCGTACCATTATTATCCGAGATCCAAAACGCTTTCATCATTTCTCTGATGGATTGTGTTCTTGTGCAGATTACTGGTAA

Protein sequence

SRTQYPLLLRSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSVKPNEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW
BLAST of Bhi04G000073 vs. Swiss-Prot
Match: sp|Q9FND6|PP411_ARATH (Pentatricopeptide repeat-containing protein At5g40410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H15 PE=2 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 2.8e-196
Identity = 342/581 (58.86%), Postives = 422/581 (72.63%), Query Frame = 0

Query: 22  EALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFD 81
           +A VS+L+ AV SC SI  CR +H +V+KS  YR GFIGDQLV CY +LG+   A+KLFD
Sbjct: 31  DANVSSLIAAVKSCVSIELCRLLHCKVVKSVSYRHGFIGDQLVGCYLRLGHDVCAEKLFD 90

Query: 82  DMPVKDLISWNSLISGFSW------CLDITLEAFYTMKFERSVKPNEITILSMISAC--N 141
           +MP +DL+SWNSLISG+S       C ++       M  E   +PNE+T LSMISAC   
Sbjct: 91  EMPERDLVSWNSLISGYSGRGYLGKCFEVLSR---MMISEVGFRPNEVTFLSMISACVYG 150

Query: 142 GALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSII 201
           G+   G+ IHG  +K GV  EV+VVN+ IN YGK+ DLTS+C+LFE +   N VSWN++I
Sbjct: 151 GSKEEGRCIHGLVMKFGVLEEVKVVNAFINWYGKTGDLTSSCKLFEDLSIKNLVSWNTMI 210

Query: 202 AAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLG 261
              + NG A +G+  FN  RR G EPD+ T LA+L++C  + V +LA+ IHGLI   G  
Sbjct: 211 VIHLQNGLAEKGLAYFNMSRRVGHEPDQATFLAVLRSCEDMGVVRLAQGIHGLIMFGGFS 270

Query: 262 AKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESM 321
               I TALLDLY+KLGRL  S  VF E+   D +AWTAMLA YA HG GR+AIK FE M
Sbjct: 271 GNKCITTALLDLYSKLGRLEDSSTVFHEITSPDSMAWTAMLAAYATHGFGRDAIKHFELM 330

Query: 322 TKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGL 381
              G+ PDHVTF HLL+ACSHSGLV EGK YF  MSK Y I+PR+DHYSCMVDLLGR GL
Sbjct: 331 VHYGISPDHVTFTHLLNACSHSGLVEEGKHYFETMSKRYRIDPRLDHYSCMVDLLGRSGL 390

Query: 382 LNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNM 441
           L DAY +I+ MPMEP++GVW ALLGACRVY + +LG + AE L  LEP D RNY+MLSN+
Sbjct: 391 LQDAYGLIKEMPMEPSSGVWGALLGACRVYKDTQLGTKAAERLFELEPRDGRNYVMLSNI 450

Query: 442 YSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEEL 501
           YSAS  WKDA+++R L+K++GL R  GCS IE+ NKIH+F VGD SH E+ KI  KL+E+
Sbjct: 451 YSASGLWKDASRIRNLMKQKGLVRASGCSYIEHGNKIHKFVVGDWSHPESEKIQKKLKEI 510

Query: 502 LGKIR-QAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRI 561
             K++ + GY SKTE+VL +V E+VKE+MIN+HSEK+A+AFG+LV     P+II KN+RI
Sbjct: 511 RKKMKSEMGYKSKTEFVLHDVGEDVKEEMINQHSEKIAMAFGLLVVSPMEPIIIRKNLRI 570

Query: 562 CGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           CGDCH TAK ISLIEKR IIIRD KRFHHF DG CSC+DYW
Sbjct: 571 CGDCHETAKAISLIEKRRIIIRDSKRFHHFLDGSCSCSDYW 608

BLAST of Bhi04G000073 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 3.0e-126
Identity = 225/573 (39.27%), Postives = 355/573 (61.95%), Query Frame = 0

Query: 25  VSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFDDMP 84
           + ++L AV++   IS  ++IH   ++S       I   LV  Y K G +E A++LFD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 85  VKDLISWNSLISGFSWCLDITLEA--FYTMKFERSVKPNEITILSMISACN--GALGAGK 144
            ++++SWNS+I  +    +   EA   +    +  VKP +++++  + AC   G L  G+
Sbjct: 299 ERNVVSWNSMIDAYVQ-NENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGR 358

Query: 145 YIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQVTNG 204
           +IH  ++++G+   V VVNSLI+MY K +++ +A  +F  +     VSWN++I     NG
Sbjct: 359 FIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNG 418

Query: 205 CAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKITIAT 264
              + ++ F++MR   ++PD  T ++++ A  +LS+   A+ IHG++  S L   + + T
Sbjct: 419 RPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTT 478

Query: 265 ALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKGLEP 324
           AL+D+YAK G +  +  +F  +       W AM+ GY  HG G+ A++LFE M K  ++P
Sbjct: 479 ALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKP 538

Query: 325 DHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDAYDV 384
           + VTF+ ++SACSHSGLV  G   F MM + Y IE  MDHY  MVDLLGR G LN+A+D 
Sbjct: 539 NGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDF 598

Query: 385 IQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSASRSW 444
           I  MP++P   V+ A+LGAC+++ N+   ++ AE L  L P D   +++L+N+Y A+  W
Sbjct: 599 IMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMW 658

Query: 445 KDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLGKIRQA 504
           +   +VR  +  +GL++TPGCS +E +N++H FF G  +H ++ KIY+ LE+L+  I++A
Sbjct: 659 EKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEA 718

Query: 505 GYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGDCHTTA 564
           GY   T  VL  VE +VKE +++ HSEKLAI+FG+L +  G  + + KN+R+C DCH   
Sbjct: 719 GYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNAT 778

Query: 565 KLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           K ISL+  R I++RD +RFHHF +G CSC DYW
Sbjct: 779 KYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of Bhi04G000073 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 449.1 bits (1154), Expect = 7.5e-125
Identity = 223/575 (38.78%), Postives = 352/575 (61.22%), Query Frame = 0

Query: 25  VSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFDDMP 84
           V +LL A       +    IH+  IK  L  E F+ ++L+  Y + G + D QK+FD M 
Sbjct: 250 VVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMY 309

Query: 85  VKDLISWNSLISGFSWCLD--ITLEAFYTMKFERSVKPNEITILSMISACN--GALGAGK 144
           V+DLISWNS+I  +         +  F  M+  R ++P+ +T++S+ S  +  G + A +
Sbjct: 310 VRDLISWNSIIKAYELNEQPLRAISLFQEMRLSR-IQPDCLTLISLASILSQLGDIRACR 369

Query: 145 YIHGFAIKIGVSFE-VEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQVTN 204
            + GF ++ G   E + + N+++ MY K   + SA  +F  +P+ + +SWN+II+    N
Sbjct: 370 SVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQN 429

Query: 205 GCAREGIDCFNKMRRFG-IEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKITI 264
           G A E I+ +N M   G I  ++GT +++L AC Q    +    +HG +  +GL   + +
Sbjct: 430 GFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFV 489

Query: 265 ATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKGL 324
            T+L D+Y K GRL  +  +F ++   + V W  ++A +  HG G +A+ LF+ M  +G+
Sbjct: 490 VTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGV 549

Query: 325 EPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDAY 384
           +PDH+TF+ LLSACSHSGLV EG+  F MM   YGI P + HY CMVD+ GR G L  A 
Sbjct: 550 KPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHYGCMVDMYGRAGQLETAL 609

Query: 385 DVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSASR 444
             I++M ++P+A +W ALL ACRV+GN++LGK  +EHL  +EP     +++LSNMY+++ 
Sbjct: 610 KFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAG 669

Query: 445 SWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLGKIR 504
            W+   ++R++   +GL++TPG SS+E  NK+  F+ G+++H    ++Y +L  L  K++
Sbjct: 670 KWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLK 729

Query: 505 QAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGDCHT 564
             GY     +VLQ+VE++ KE ++  HSE+LAIAF ++ +     + I KN+R+CGDCH+
Sbjct: 730 MIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFALIATPAKTTIRIFKNLRVCGDCHS 789

Query: 565 TAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
             K IS I +R II+RD  RFHHF +G+CSC DYW
Sbjct: 790 VTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

BLAST of Bhi04G000073 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 436.0 bits (1120), Expect = 6.5e-121
Identity = 220/579 (38.00%), Postives = 346/579 (59.76%), Query Frame = 0

Query: 19  PTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQK 78
           P    + S L  A +    +S  +++H   IK     + F+   L+  Y++   +++A+ 
Sbjct: 414 PDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEI 473

Query: 79  LFDDMPVKDLISWNSLISGFSWCLD--ITLEAFYTMKFERSVKPNEITILSMISACN--G 138
           LF+     DL++WN++++G++   D   TL+ F  M  ++  + ++ T+ ++   C    
Sbjct: 474 LFERHNF-DLVAWNAMMAGYTQSHDGHKTLKLFALM-HKQGERSDDFTLATVFKTCGFLF 533

Query: 139 ALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIA 198
           A+  GK +H +AIK G   ++ V + +++MY K  D+++A   F++IP P+ V+W ++I+
Sbjct: 534 AINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMIS 593

Query: 199 AQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGA 258
             + NG        F++MR  G+ PDE TI  L +A   L+  +    IH          
Sbjct: 594 GCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTN 653

Query: 259 KITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMT 318
              + T+L+D+YAK G +  +  +F+ +   +  AW AML G A HG G+E ++LF+ M 
Sbjct: 654 DPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMK 713

Query: 319 KKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLL 378
             G++PD VTFI +LSACSHSGLV E   +   M   YGI+P ++HYSC+ D LGR GL+
Sbjct: 714 SLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLV 773

Query: 379 NDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMY 438
             A ++I++M ME +A ++  LL ACRV G+ E GK VA  L+ LEPLD   Y++LSNMY
Sbjct: 774 KQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMY 833

Query: 439 SASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELL 498
           +A+  W +    R ++K   +K+ PG S IE +NKIH F V DRS+ +T  IY K+++++
Sbjct: 834 AAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMI 893

Query: 499 GKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICG 558
             I+Q GY  +T++ L +VEEE KE  +  HSEKLA+AFG+L +    P+ + KN+R+CG
Sbjct: 894 RDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRVCG 953

Query: 559 DCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           DCH   K I+ +  R I++RD  RFH F DG+CSC DYW
Sbjct: 954 DCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCGDYW 990

BLAST of Bhi04G000073 vs. Swiss-Prot
Match: sp|Q9LW63|PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 433.0 bits (1112), Expect = 5.5e-120
Identity = 224/592 (37.84%), Postives = 356/592 (60.14%), Query Frame = 0

Query: 13  RLTRQCP--TAEALVS--ALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLV---T 72
           RL   C   T  AL++  A L+ + S  S+ N       V      R    GD+ V   T
Sbjct: 133 RLGMDCDLYTGNALMNMYAKLLGMGSKISVGN-------VFDEMPQRTSNSGDEDVKAET 192

Query: 73  CYNKLGYVEDAQKLFDDMPVKDLISWNSLISGF--SWCLDITLEAFYTMKFERSVKPNEI 132
           C    G ++  +++F+ MP KD++S+N++I+G+  S   +  L     M     +KP+  
Sbjct: 193 CIMPFG-IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMG-TTDLKPDSF 252

Query: 133 TILSMISACNGALGA--GKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAI 192
           T+ S++   +  +    GK IHG+ I+ G+  +V + +SL++MY KS  +  + R+F  +
Sbjct: 253 TLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRL 312

Query: 193 PDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAE 252
              + +SWNS++A  V NG   E +  F +M    ++P      +++ AC  L+   L +
Sbjct: 313 YCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGK 372

Query: 253 SIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHG 312
            +HG +   G G+ I IA+AL+D+Y+K G + A+  +F  +   D V+WTA++ G+A HG
Sbjct: 373 QLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHG 432

Query: 313 LGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHY 372
            G EA+ LFE M ++G++P+ V F+ +L+ACSH GLV E   YF+ M+KVYG+   ++HY
Sbjct: 433 HGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHY 492

Query: 373 SCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEP 432
           + + DLLGR G L +AY+ I  M +EP   VWS LL +C V+ N+EL ++VAE +  ++ 
Sbjct: 493 AAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDS 552

Query: 433 LDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHS 492
            +   Y+++ NMY+++  WK+ AK+R  ++++GL++ P CS IE +NK H F  GDRSH 
Sbjct: 553 ENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHP 612

Query: 493 ETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEG 552
              KI   L+ ++ ++ + GY + T  VL +V+EE K +++  HSE+LA+AFGI+ ++ G
Sbjct: 613 SMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPG 672

Query: 553 APLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
             + +TKNIRIC DCH   K IS I +R II+RD  RFHHF+ G CSC DYW
Sbjct: 673 TTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Bhi04G000073 vs. TAIR10
Match: AT5G40410.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 686.4 bits (1770), Expect = 1.5e-197
Identity = 342/581 (58.86%), Postives = 422/581 (72.63%), Query Frame = 0

Query: 22  EALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFD 81
           +A VS+L+ AV SC SI  CR +H +V+KS  YR GFIGDQLV CY +LG+   A+KLFD
Sbjct: 31  DANVSSLIAAVKSCVSIELCRLLHCKVVKSVSYRHGFIGDQLVGCYLRLGHDVCAEKLFD 90

Query: 82  DMPVKDLISWNSLISGFSW------CLDITLEAFYTMKFERSVKPNEITILSMISAC--N 141
           +MP +DL+SWNSLISG+S       C ++       M  E   +PNE+T LSMISAC   
Sbjct: 91  EMPERDLVSWNSLISGYSGRGYLGKCFEVLSR---MMISEVGFRPNEVTFLSMISACVYG 150

Query: 142 GALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSII 201
           G+   G+ IHG  +K GV  EV+VVN+ IN YGK+ DLTS+C+LFE +   N VSWN++I
Sbjct: 151 GSKEEGRCIHGLVMKFGVLEEVKVVNAFINWYGKTGDLTSSCKLFEDLSIKNLVSWNTMI 210

Query: 202 AAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLG 261
              + NG A +G+  FN  RR G EPD+ T LA+L++C  + V +LA+ IHGLI   G  
Sbjct: 211 VIHLQNGLAEKGLAYFNMSRRVGHEPDQATFLAVLRSCEDMGVVRLAQGIHGLIMFGGFS 270

Query: 262 AKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESM 321
               I TALLDLY+KLGRL  S  VF E+   D +AWTAMLA YA HG GR+AIK FE M
Sbjct: 271 GNKCITTALLDLYSKLGRLEDSSTVFHEITSPDSMAWTAMLAAYATHGFGRDAIKHFELM 330

Query: 322 TKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGL 381
              G+ PDHVTF HLL+ACSHSGLV EGK YF  MSK Y I+PR+DHYSCMVDLLGR GL
Sbjct: 331 VHYGISPDHVTFTHLLNACSHSGLVEEGKHYFETMSKRYRIDPRLDHYSCMVDLLGRSGL 390

Query: 382 LNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNM 441
           L DAY +I+ MPMEP++GVW ALLGACRVY + +LG + AE L  LEP D RNY+MLSN+
Sbjct: 391 LQDAYGLIKEMPMEPSSGVWGALLGACRVYKDTQLGTKAAERLFELEPRDGRNYVMLSNI 450

Query: 442 YSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEEL 501
           YSAS  WKDA+++R L+K++GL R  GCS IE+ NKIH+F VGD SH E+ KI  KL+E+
Sbjct: 451 YSASGLWKDASRIRNLMKQKGLVRASGCSYIEHGNKIHKFVVGDWSHPESEKIQKKLKEI 510

Query: 502 LGKIR-QAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRI 561
             K++ + GY SKTE+VL +V E+VKE+MIN+HSEK+A+AFG+LV     P+II KN+RI
Sbjct: 511 RKKMKSEMGYKSKTEFVLHDVGEDVKEEMINQHSEKIAMAFGLLVVSPMEPIIIRKNLRI 570

Query: 562 CGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           CGDCH TAK ISLIEKR IIIRD KRFHHF DG CSC+DYW
Sbjct: 571 CGDCHETAKAISLIEKRRIIIRDSKRFHHFLDGSCSCSDYW 608

BLAST of Bhi04G000073 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 453.8 bits (1166), Expect = 1.7e-127
Identity = 225/573 (39.27%), Postives = 355/573 (61.95%), Query Frame = 0

Query: 25  VSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFDDMP 84
           + ++L AV++   IS  ++IH   ++S       I   LV  Y K G +E A++LFD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 85  VKDLISWNSLISGFSWCLDITLEA--FYTMKFERSVKPNEITILSMISACN--GALGAGK 144
            ++++SWNS+I  +    +   EA   +    +  VKP +++++  + AC   G L  G+
Sbjct: 299 ERNVVSWNSMIDAYVQ-NENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGR 358

Query: 145 YIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQVTNG 204
           +IH  ++++G+   V VVNSLI+MY K +++ +A  +F  +     VSWN++I     NG
Sbjct: 359 FIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNG 418

Query: 205 CAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKITIAT 264
              + ++ F++MR   ++PD  T ++++ A  +LS+   A+ IHG++  S L   + + T
Sbjct: 419 RPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTT 478

Query: 265 ALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKGLEP 324
           AL+D+YAK G +  +  +F  +       W AM+ GY  HG G+ A++LFE M K  ++P
Sbjct: 479 ALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKP 538

Query: 325 DHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDAYDV 384
           + VTF+ ++SACSHSGLV  G   F MM + Y IE  MDHY  MVDLLGR G LN+A+D 
Sbjct: 539 NGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDF 598

Query: 385 IQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSASRSW 444
           I  MP++P   V+ A+LGAC+++ N+   ++ AE L  L P D   +++L+N+Y A+  W
Sbjct: 599 IMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMW 658

Query: 445 KDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLGKIRQA 504
           +   +VR  +  +GL++TPGCS +E +N++H FF G  +H ++ KIY+ LE+L+  I++A
Sbjct: 659 EKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEA 718

Query: 505 GYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGDCHTTA 564
           GY   T  VL  VE +VKE +++ HSEKLAI+FG+L +  G  + + KN+R+C DCH   
Sbjct: 719 GYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNAT 778

Query: 565 KLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           K ISL+  R I++RD +RFHHF +G CSC DYW
Sbjct: 779 KYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of Bhi04G000073 vs. TAIR10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 449.1 bits (1154), Expect = 4.1e-126
Identity = 223/575 (38.78%), Postives = 352/575 (61.22%), Query Frame = 0

Query: 25  VSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFDDMP 84
           V +LL A       +    IH+  IK  L  E F+ ++L+  Y + G + D QK+FD M 
Sbjct: 250 VVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMY 309

Query: 85  VKDLISWNSLISGFSWCLD--ITLEAFYTMKFERSVKPNEITILSMISACN--GALGAGK 144
           V+DLISWNS+I  +         +  F  M+  R ++P+ +T++S+ S  +  G + A +
Sbjct: 310 VRDLISWNSIIKAYELNEQPLRAISLFQEMRLSR-IQPDCLTLISLASILSQLGDIRACR 369

Query: 145 YIHGFAIKIGVSFE-VEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQVTN 204
            + GF ++ G   E + + N+++ MY K   + SA  +F  +P+ + +SWN+II+    N
Sbjct: 370 SVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQN 429

Query: 205 GCAREGIDCFNKMRRFG-IEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKITI 264
           G A E I+ +N M   G I  ++GT +++L AC Q    +    +HG +  +GL   + +
Sbjct: 430 GFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFV 489

Query: 265 ATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKGL 324
            T+L D+Y K GRL  +  +F ++   + V W  ++A +  HG G +A+ LF+ M  +G+
Sbjct: 490 VTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGV 549

Query: 325 EPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDAY 384
           +PDH+TF+ LLSACSHSGLV EG+  F MM   YGI P + HY CMVD+ GR G L  A 
Sbjct: 550 KPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHYGCMVDMYGRAGQLETAL 609

Query: 385 DVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSASR 444
             I++M ++P+A +W ALL ACRV+GN++LGK  +EHL  +EP     +++LSNMY+++ 
Sbjct: 610 KFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAG 669

Query: 445 SWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLGKIR 504
            W+   ++R++   +GL++TPG SS+E  NK+  F+ G+++H    ++Y +L  L  K++
Sbjct: 670 KWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLK 729

Query: 505 QAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGDCHT 564
             GY     +VLQ+VE++ KE ++  HSE+LAIAF ++ +     + I KN+R+CGDCH+
Sbjct: 730 MIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFALIATPAKTTIRIFKNLRVCGDCHS 789

Query: 565 TAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
             K IS I +R II+RD  RFHHF +G+CSC DYW
Sbjct: 790 VTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

BLAST of Bhi04G000073 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 436.0 bits (1120), Expect = 3.6e-122
Identity = 220/579 (38.00%), Postives = 346/579 (59.76%), Query Frame = 0

Query: 19  PTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQK 78
           P    + S L  A +    +S  +++H   IK     + F+   L+  Y++   +++A+ 
Sbjct: 414 PDQYTMTSVLKAASSLPEGLSLSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEI 473

Query: 79  LFDDMPVKDLISWNSLISGFSWCLD--ITLEAFYTMKFERSVKPNEITILSMISACN--G 138
           LF+     DL++WN++++G++   D   TL+ F  M  ++  + ++ T+ ++   C    
Sbjct: 474 LFERHNF-DLVAWNAMMAGYTQSHDGHKTLKLFALM-HKQGERSDDFTLATVFKTCGFLF 533

Query: 139 ALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIA 198
           A+  GK +H +AIK G   ++ V + +++MY K  D+++A   F++IP P+ V+W ++I+
Sbjct: 534 AINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMIS 593

Query: 199 AQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGA 258
             + NG        F++MR  G+ PDE TI  L +A   L+  +    IH          
Sbjct: 594 GCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTN 653

Query: 259 KITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMT 318
              + T+L+D+YAK G +  +  +F+ +   +  AW AML G A HG G+E ++LF+ M 
Sbjct: 654 DPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMK 713

Query: 319 KKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLL 378
             G++PD VTFI +LSACSHSGLV E   +   M   YGI+P ++HYSC+ D LGR GL+
Sbjct: 714 SLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLV 773

Query: 379 NDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMY 438
             A ++I++M ME +A ++  LL ACRV G+ E GK VA  L+ LEPLD   Y++LSNMY
Sbjct: 774 KQAENLIESMSMEASASMYRTLLAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMY 833

Query: 439 SASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELL 498
           +A+  W +    R ++K   +K+ PG S IE +NKIH F V DRS+ +T  IY K+++++
Sbjct: 834 AAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMI 893

Query: 499 GKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICG 558
             I+Q GY  +T++ L +VEEE KE  +  HSEKLA+AFG+L +    P+ + KN+R+CG
Sbjct: 894 RDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRVCG 953

Query: 559 DCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           DCH   K I+ +  R I++RD  RFH F DG+CSC DYW
Sbjct: 954 DCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCGDYW 990

BLAST of Bhi04G000073 vs. TAIR10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 433.0 bits (1112), Expect = 3.1e-121
Identity = 224/592 (37.84%), Postives = 356/592 (60.14%), Query Frame = 0

Query: 13  RLTRQCP--TAEALVS--ALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLV---T 72
           RL   C   T  AL++  A L+ + S  S+ N       V      R    GD+ V   T
Sbjct: 133 RLGMDCDLYTGNALMNMYAKLLGMGSKISVGN-------VFDEMPQRTSNSGDEDVKAET 192

Query: 73  CYNKLGYVEDAQKLFDDMPVKDLISWNSLISGF--SWCLDITLEAFYTMKFERSVKPNEI 132
           C    G ++  +++F+ MP KD++S+N++I+G+  S   +  L     M     +KP+  
Sbjct: 193 CIMPFG-IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMG-TTDLKPDSF 252

Query: 133 TILSMISACNGALGA--GKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAI 192
           T+ S++   +  +    GK IHG+ I+ G+  +V + +SL++MY KS  +  + R+F  +
Sbjct: 253 TLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRL 312

Query: 193 PDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAE 252
              + +SWNS++A  V NG   E +  F +M    ++P      +++ AC  L+   L +
Sbjct: 313 YCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGK 372

Query: 253 SIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHG 312
            +HG +   G G+ I IA+AL+D+Y+K G + A+  +F  +   D V+WTA++ G+A HG
Sbjct: 373 QLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHG 432

Query: 313 LGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHY 372
            G EA+ LFE M ++G++P+ V F+ +L+ACSH GLV E   YF+ M+KVYG+   ++HY
Sbjct: 433 HGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHY 492

Query: 373 SCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEP 432
           + + DLLGR G L +AY+ I  M +EP   VWS LL +C V+ N+EL ++VAE +  ++ 
Sbjct: 493 AAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDS 552

Query: 433 LDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHS 492
            +   Y+++ NMY+++  WK+ AK+R  ++++GL++ P CS IE +NK H F  GDRSH 
Sbjct: 553 ENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHP 612

Query: 493 ETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEG 552
              KI   L+ ++ ++ + GY + T  VL +V+EE K +++  HSE+LA+AFGI+ ++ G
Sbjct: 613 SMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPG 672

Query: 553 APLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
             + +TKNIRIC DCH   K IS I +R II+RD  RFHHF+ G CSC DYW
Sbjct: 673 TTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of Bhi04G000073 vs. TrEMBL
Match: tr|A0A1S3BBW7|A0A1S3BBW7_CUCME (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g40410, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103488375 PE=4 SV=1)

HSP 1 Score: 1058.1 bits (2735), Expect = 7.1e-306
Identity = 521/595 (87.56%), Postives = 551/595 (92.61%), Query Frame = 0

Query: 1   SRTQYP--LLLRSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGF 60
           SRTQYP  LLLRSF L R C  +EALVS LL+AV SCTSISNCR+IHARV KS LYR+GF
Sbjct: 35  SRTQYPLLLLLRSFHLIRPCAASEALVSDLLIAVKSCTSISNCREIHARVFKSLLYRDGF 94

Query: 61  IGDQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSV 120
           IGDQLVTCYNKLGY EDAQKLFDDMP KDL+SWNSLISGFS CL +TL AFYTMKFE S+
Sbjct: 95  IGDQLVTCYNKLGYAEDAQKLFDDMPHKDLVSWNSLISGFSRCLHMTLTAFYTMKFEMSI 154

Query: 121 KPNEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLF 180
           KPNE+TILSMISACNGAL AGKYIHGFAIK+G + EV+V NSLINMYGKS DLTSACRLF
Sbjct: 155 KPNEVTILSMISACNGALDAGKYIHGFAIKVGGTLEVKVANSLINMYGKSGDLTSACRLF 214

Query: 181 EAIPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGK 240
           EAIPDPNTVSWNSIIAAQVTNGCAREGI  FNKMRRFGIE DEGTILALLQACL L VGK
Sbjct: 215 EAIPDPNTVSWNSIIAAQVTNGCAREGIXFFNKMRRFGIEQDEGTILALLQACLHLGVGK 274

Query: 241 LAESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYA 300
           LAESIH L+FC+G GAKITIATALLD YAKLGRLSASCDVFREVG ADRVAWTAMLAGYA
Sbjct: 275 LAESIHALMFCTGFGAKITIATALLDTYAKLGRLSASCDVFREVGFADRVAWTAMLAGYA 334

Query: 301 AHGLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRM 360
           AHGLGREAIKLFESM  +GLEPDHVTF HLLSACSHSGLV EGKSYF++MS+VYGIEPR+
Sbjct: 335 AHGLGREAIKLFESMVNEGLEPDHVTFTHLLSACSHSGLVNEGKSYFNVMSEVYGIEPRV 394

Query: 361 DHYSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLIN 420
           DHYSCMVDLLGR GLLNDAY+VI+NMPMEPNAGVW ALLGACRV+GN+ELGKEVAEHLIN
Sbjct: 395 DHYSCMVDLLGRCGLLNDAYEVIRNMPMEPNAGVWGALLGACRVHGNVELGKEVAEHLIN 454

Query: 421 LEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDR 480
           LEPLDPRNYIMLSN+YSASRSWKDAAK+RALLKERGLKRTPGCSSIEY NK H FFVGDR
Sbjct: 455 LEPLDPRNYIMLSNIYSASRSWKDAAKMRALLKERGLKRTPGCSSIEYGNKNHHFFVGDR 514

Query: 481 SHSETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVS 540
           SH ET KIYSKLEELLGKI++AGYSSKTEYVLQ+VEEEVKEDMINKHSEKLAIAFG+LVS
Sbjct: 515 SHPETEKIYSKLEELLGKIKKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVS 574

Query: 541 KEGAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           KEG PLIITKN+RICGDCH+TAKLISLIEKRTIIIRDPKRFHHFSDG CSCADYW
Sbjct: 575 KEGEPLIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSDGFCSCADYW 629

BLAST of Bhi04G000073 vs. TrEMBL
Match: tr|A0A0A0LKE6|A0A0A0LKE6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G348160 PE=4 SV=1)

HSP 1 Score: 1049.3 bits (2712), Expect = 3.3e-303
Identity = 521/594 (87.71%), Postives = 547/594 (92.09%), Query Frame = 0

Query: 1   SRTQYPLLL-RSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFI 60
           SRTQYPLLL RSF L RQC T EA+VSALL+AVNSC SISNCR+IHARV KS LYR+GFI
Sbjct: 35  SRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIHARVFKSLLYRDGFI 94

Query: 61  GDQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSVK 120
           GDQLVTCYNKLGY EDA KLFDDMP KDL+SWNSLISGFS CL ++L AFYTMKFE SVK
Sbjct: 95  GDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLHMSLTAFYTMKFEMSVK 154

Query: 121 PNEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFE 180
           PNE+TILSMISAC+GAL AGKYIHGF IK+G + EV+V NSLINMYGKS DLTSACRLFE
Sbjct: 155 PNEVTILSMISACSGALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFE 214

Query: 181 AIPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKL 240
           AIPDPNTVSWNSIIAAQVTNGCAREGID FNKMRR GIE DEGTILALLQACL L VGKL
Sbjct: 215 AIPDPNTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKL 274

Query: 241 AESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAA 300
           AESIHGL+FC+G GAKITIATALLD YAKLGRLSAS  VF EVG ADRVAWTAMLAGYAA
Sbjct: 275 AESIHGLMFCTGFGAKITIATALLDTYAKLGRLSASYGVFTEVGFADRVAWTAMLAGYAA 334

Query: 301 HGLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMD 360
           HGLGREAIKLFESM  KGLEPDHVTF HLLSACSHSGLV EGKSYF++MS+VYGIEPR+D
Sbjct: 335 HGLGREAIKLFESMANKGLEPDHVTFTHLLSACSHSGLVNEGKSYFNVMSEVYGIEPRVD 394

Query: 361 HYSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINL 420
           HYSCMVDLLGR GLLNDAY+VIQNMPMEPNAGVW ALLGACRV+GNIELGKEVAEHLIN+
Sbjct: 395 HYSCMVDLLGRCGLLNDAYEVIQNMPMEPNAGVWGALLGACRVHGNIELGKEVAEHLINM 454

Query: 421 EPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRS 480
           EPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPG SSIEY NK H FFVGDRS
Sbjct: 455 EPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRS 514

Query: 481 HSETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSK 540
           H ET KIYSKLEELLGKIR+AGYSSKTEYVLQ+VEEEVKEDMINKHSEKLAIAFG+LVSK
Sbjct: 515 HPETEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSK 574

Query: 541 EGAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           EG  LIITKN+RICGDCH+TAKLISLIEKRTIIIRDPKRFHHFSDG CSCADYW
Sbjct: 575 EGEALIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSDGFCSCADYW 628

BLAST of Bhi04G000073 vs. TrEMBL
Match: tr|A0A251MQR1|A0A251MQR1_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G006400 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 2.0e-228
Identity = 388/579 (67.01%), Postives = 473/579 (81.69%), Query Frame = 0

Query: 19  PTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQK 78
           P  ++L+S L+ AV+SC+SIS  R IH+ VIKSF Y +GFIGDQLV+CY +LG  +DA+ 
Sbjct: 66  PNPDSLLSYLISAVSSCSSISYSRAIHSCVIKSFNYTDGFIGDQLVSCYTRLGRADDARN 125

Query: 79  LFDDMPVKDLISWNSLISGFS--WCLDITLEAFYTMKFERSVKPNEITILSMISAC--NG 138
           LFD+MP KDLISWNSLISGFS    +D  L+AF+ MKFE  ++P+E+T++S+ SAC   G
Sbjct: 126 LFDEMPNKDLISWNSLISGFSRRGYVDKCLDAFFRMKFEMGIEPDEVTLISITSACASRG 185

Query: 139 ALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIA 198
           A+  GKYIHGFA+K+GV +EV++VNSLIN+YGKS  L + CRL E +P  N VSWN +I 
Sbjct: 186 AVDEGKYIHGFALKLGVLWEVKLVNSLINLYGKSGYLDAVCRLVETMPVGNIVSWNLMIV 245

Query: 199 AQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGA 258
           +   NG A +G+  FN MRR GI PD+GT+L+LL+AC  L + KLAE +HGLI   GL A
Sbjct: 246 SHAQNGSAADGVGYFNLMRRAGINPDDGTVLSLLEACENLGLQKLAEGVHGLITKCGLYA 305

Query: 259 KITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMT 318
             T+AT LLDLYAKLGRL+ S  VF EV   D+VAWTAMLAG A HG GREA++LFE M 
Sbjct: 306 NATVATGLLDLYAKLGRLNYSLKVFGEVNNPDKVAWTAMLAGNAVHGNGREAMELFEGMV 365

Query: 319 KKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLL 378
           K G+EPDHVTF HLLSACSHSGLV+EGK+YF +MS+VYGIEPR+DHYSCMVDLLGR GLL
Sbjct: 366 KVGVEPDHVTFTHLLSACSHSGLVKEGKNYFDIMSQVYGIEPRLDHYSCMVDLLGRSGLL 425

Query: 379 NDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMY 438
           NDAY++I+ MP++PN+ VW AL GACRVYGNIELGKEVAE L +L+P D RNYIMLSNMY
Sbjct: 426 NDAYELIKRMPLKPNSAVWGALFGACRVYGNIELGKEVAERLFSLDPSDSRNYIMLSNMY 485

Query: 439 SASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELL 498
           SA+  W+DA+KVRAL+KE+GL R PGCS IE+ NKIHRF VGDRSH E+ KIY+KLEE++
Sbjct: 486 SAAGLWRDASKVRALMKEKGLIRNPGCSFIEHGNKIHRFAVGDRSHPESEKIYTKLEEMI 545

Query: 499 GKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICG 558
           GKIR+AG+ SKTE++L +VE+ VKEDMI+KHSEKLAIAFG+LV+  G P+IITKN+RICG
Sbjct: 546 GKIREAGFVSKTEFILHDVEQAVKEDMISKHSEKLAIAFGLLVTNAGMPIIITKNLRICG 605

Query: 559 DCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           DCH+TAKLISLIEKRTIIIRD KRFHHF+ G+CSC DYW
Sbjct: 606 DCHSTAKLISLIEKRTIIIRDSKRFHHFAAGICSCGDYW 644

BLAST of Bhi04G000073 vs. TrEMBL
Match: tr|M5VWG5|M5VWG5_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa021080mg PE=4 SV=1)

HSP 1 Score: 790.4 bits (2040), Expect = 2.8e-225
Identity = 382/564 (67.73%), Postives = 462/564 (81.91%), Query Frame = 0

Query: 34  SCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNS 93
           SC+SIS  R IH+ VIKSF Y +GFIGDQLV+CY +LG  +DA+ LFD+MP KDLISWNS
Sbjct: 1   SCSSISYSRAIHSCVIKSFNYTDGFIGDQLVSCYTRLGRADDARNLFDEMPNKDLISWNS 60

Query: 94  LISGFS--WCLDITLEAFYTMKFERSVKPNEITILSMISAC--NGALGAGKYIHGFAIKI 153
           LISGFS    +D  L+AF+ MKFE  ++P+E+T++S+ SAC   GA+  GKYIHGFA+K+
Sbjct: 61  LISGFSRRGYVDKCLDAFFRMKFEMGIEPDEVTLISITSACASRGAVDEGKYIHGFALKL 120

Query: 154 GVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQVTNGCAREGIDCF 213
           GV +EV++VNSLIN+YGKS  L + CRL E +P  N VSWN +I +   NG A +G+  F
Sbjct: 121 GVLWEVKLVNSLINLYGKSGYLDAVCRLVETMPVGNIVSWNLMIVSHAQNGSAADGVGYF 180

Query: 214 NKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKITIATALLDLYAKL 273
           N MRR GI PD+GT+L+LL+AC  L + KLAE +HGLI   GL A  T+AT LLDLYAKL
Sbjct: 181 NLMRRAGINPDDGTVLSLLEACENLGLQKLAEGVHGLITKCGLYANATVATGLLDLYAKL 240

Query: 274 GRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKGLEPDHVTFIHLL 333
           GRL+ S  VF EV   D+VAWTAMLAG A HG GREA++LFE M K G+EPDHVTF HLL
Sbjct: 241 GRLNYSLKVFGEVNNPDKVAWTAMLAGNAVHGNGREAMELFEGMVKVGVEPDHVTFTHLL 300

Query: 334 SACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDAYDVIQNMPMEPN 393
           SACSHSGLV+EGK+YF +MS+VYGIEPR+DHYSCMVDLLGR GLLNDAY++I+ MP++PN
Sbjct: 301 SACSHSGLVKEGKNYFDIMSQVYGIEPRLDHYSCMVDLLGRSGLLNDAYELIKRMPLKPN 360

Query: 394 AGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSASRSWKDAAKVRAL 453
           + VW AL GACRVYGNIELGKEVAE L +L+P D RNYIMLSNMYSA+  W+DA+KVRAL
Sbjct: 361 SAVWGALFGACRVYGNIELGKEVAERLFSLDPSDSRNYIMLSNMYSAAGLWRDASKVRAL 420

Query: 454 LKERGLKRTPGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLGKIRQAGYSSKTEYV 513
           +KE+GL R PGCS IE+ NKIHRF VGDRSH E+ KIY+KLEE++GKIR+AG+ SKTE++
Sbjct: 421 MKEKGLIRNPGCSFIEHGNKIHRFAVGDRSHPESEKIYTKLEEMIGKIREAGFVSKTEFI 480

Query: 514 LQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGDCHTTAKLISLIEKR 573
           L +VE+ VKEDMI+KHSEKLAIAFG+LV+  G P+IITKN+RICGDCH+TAKLISLIEKR
Sbjct: 481 LHDVEQAVKEDMISKHSEKLAIAFGLLVTNAGMPIIITKNLRICGDCHSTAKLISLIEKR 540

Query: 574 TIIIRDPKRFHHFSDGLCSCADYW 594
           TIIIRD KRFHHF+ G+CSC DYW
Sbjct: 541 TIIIRDSKRFHHFAAGICSCGDYW 564

BLAST of Bhi04G000073 vs. TrEMBL
Match: tr|A0A2P5F861|A0A2P5F861_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_103730 PE=4 SV=1)

HSP 1 Score: 780.8 bits (2015), Expect = 2.2e-222
Identity = 382/578 (66.09%), Postives = 462/578 (79.93%), Query Frame = 0

Query: 22  EALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIGDQLVTCYNKLGYVEDAQKLFD 81
           E LVS L++A++SC+S+  C  IHA VIKS  Y +GFIGDQLV+CY KLG  + AQ+LFD
Sbjct: 63  ETLVSTLILAISSCSSVPRCHAIHAHVIKSVNYSDGFIGDQLVSCYAKLGCAKSAQQLFD 122

Query: 82  DMPVKDLISWNSLISGFS--WCLDITLEAFYTMKFERSVKPNEITILSMISAC--NGALG 141
           +MP KDL+SWN+LISGFS    LD  L AF+ MKF+  ++PNE+T++++ISAC   G + 
Sbjct: 123 EMPNKDLVSWNNLISGFSRKSLLDKCLTAFFRMKFDFDMQPNEVTLIALISACIGCGTVD 182

Query: 142 AGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEAIPDPNTVSWNSIIAAQV 201
            G Y+HG A+K+G+  EV+VVN L NMYG+     +AC+LFEA+P  N VSWN +++   
Sbjct: 183 MGNYVHGIALKLGLLLEVKVVNCLTNMYGRFGYPDAACQLFEAMPVRNVVSWNLMVSVPS 242

Query: 202 TNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLAESIHGLIFCSGLGAKIT 261
            NG    GI  FN MR  G  PD+GTI+AL QAC  L VGKLAE+IHGL+   GL A +T
Sbjct: 243 QNGFPEVGICNFNLMRMTGFRPDDGTIVALSQACGNLGVGKLAEAIHGLVISCGLSANVT 302

Query: 262 IATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAHGLGREAIKLFESMTKKG 321
           +ATALLDLYAKLGRL+ S  VF E+   DRV+WTAMLAGYA HG G+EA++LFE+M  KG
Sbjct: 303 VATALLDLYAKLGRLNDSRKVFGELINPDRVSWTAMLAGYAVHGHGKEAVELFENMIMKG 362

Query: 322 LEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDHYSCMVDLLGRRGLLNDA 381
           ++PDHVTF+HLL+ACSHSGLV++GK+YF +MS+VYGIEPR+DHYSCMVDLLGR GLLNDA
Sbjct: 363 VQPDHVTFVHLLNACSHSGLVKQGKNYFKIMSQVYGIEPRLDHYSCMVDLLGRSGLLNDA 422

Query: 382 YDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLEPLDPRNYIMLSNMYSAS 441
           Y++I  MPMEPN+GVW AL GACRVYGNIELGKEVAE L  LEP D RNYIMLSNMYSA+
Sbjct: 423 YELITRMPMEPNSGVWGALFGACRVYGNIELGKEVAERLFALEPSDSRNYIMLSNMYSAA 482

Query: 442 RSWKDAAKVRALLKERGLKRT--PGCSSIEYRNKIHRFFVGDRSHSETGKIYSKLEELLG 501
             WKDA+KVR L+K+RGL R   PGCS IE+++KIHRF   DRSH E+ KI+ KLEEL+G
Sbjct: 483 GLWKDASKVRTLMKDRGLIRNPKPGCSFIEHKSKIHRFVADDRSHPESEKIHKKLEELIG 542

Query: 502 KIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKEGAPLIITKNIRICGD 561
           KIR+AG+ SKTE VL +VEEEVKEDMI+KHSEKLAIAFGILV+    P+IITKN+RICGD
Sbjct: 543 KIREAGFVSKTECVLHDVEEEVKEDMISKHSEKLAIAFGILVTHADMPIIITKNLRICGD 602

Query: 562 CHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           CH TAKLISLIEKRTIIIRD KRFHHF++GLCSC DYW
Sbjct: 603 CHATAKLISLIEKRTIIIRDAKRFHHFANGLCSCGDYW 640

BLAST of Bhi04G000073 vs. NCBI nr
Match: XP_008445305.2 (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g40410, mitochondrial-like [Cucumis melo])

HSP 1 Score: 1058.1 bits (2735), Expect = 1.1e-305
Identity = 521/595 (87.56%), Postives = 551/595 (92.61%), Query Frame = 0

Query: 1   SRTQYP--LLLRSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGF 60
           SRTQYP  LLLRSF L R C  +EALVS LL+AV SCTSISNCR+IHARV KS LYR+GF
Sbjct: 35  SRTQYPLLLLLRSFHLIRPCAASEALVSDLLIAVKSCTSISNCREIHARVFKSLLYRDGF 94

Query: 61  IGDQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSV 120
           IGDQLVTCYNKLGY EDAQKLFDDMP KDL+SWNSLISGFS CL +TL AFYTMKFE S+
Sbjct: 95  IGDQLVTCYNKLGYAEDAQKLFDDMPHKDLVSWNSLISGFSRCLHMTLTAFYTMKFEMSI 154

Query: 121 KPNEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLF 180
           KPNE+TILSMISACNGAL AGKYIHGFAIK+G + EV+V NSLINMYGKS DLTSACRLF
Sbjct: 155 KPNEVTILSMISACNGALDAGKYIHGFAIKVGGTLEVKVANSLINMYGKSGDLTSACRLF 214

Query: 181 EAIPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGK 240
           EAIPDPNTVSWNSIIAAQVTNGCAREGI  FNKMRRFGIE DEGTILALLQACL L VGK
Sbjct: 215 EAIPDPNTVSWNSIIAAQVTNGCAREGIXFFNKMRRFGIEQDEGTILALLQACLHLGVGK 274

Query: 241 LAESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYA 300
           LAESIH L+FC+G GAKITIATALLD YAKLGRLSASCDVFREVG ADRVAWTAMLAGYA
Sbjct: 275 LAESIHALMFCTGFGAKITIATALLDTYAKLGRLSASCDVFREVGFADRVAWTAMLAGYA 334

Query: 301 AHGLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRM 360
           AHGLGREAIKLFESM  +GLEPDHVTF HLLSACSHSGLV EGKSYF++MS+VYGIEPR+
Sbjct: 335 AHGLGREAIKLFESMVNEGLEPDHVTFTHLLSACSHSGLVNEGKSYFNVMSEVYGIEPRV 394

Query: 361 DHYSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLIN 420
           DHYSCMVDLLGR GLLNDAY+VI+NMPMEPNAGVW ALLGACRV+GN+ELGKEVAEHLIN
Sbjct: 395 DHYSCMVDLLGRCGLLNDAYEVIRNMPMEPNAGVWGALLGACRVHGNVELGKEVAEHLIN 454

Query: 421 LEPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDR 480
           LEPLDPRNYIMLSN+YSASRSWKDAAK+RALLKERGLKRTPGCSSIEY NK H FFVGDR
Sbjct: 455 LEPLDPRNYIMLSNIYSASRSWKDAAKMRALLKERGLKRTPGCSSIEYGNKNHHFFVGDR 514

Query: 481 SHSETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVS 540
           SH ET KIYSKLEELLGKI++AGYSSKTEYVLQ+VEEEVKEDMINKHSEKLAIAFG+LVS
Sbjct: 515 SHPETEKIYSKLEELLGKIKKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVS 574

Query: 541 KEGAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           KEG PLIITKN+RICGDCH+TAKLISLIEKRTIIIRDPKRFHHFSDG CSCADYW
Sbjct: 575 KEGEPLIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSDGFCSCADYW 629

BLAST of Bhi04G000073 vs. NCBI nr
Match: XP_023546177.1 (pentatricopeptide repeat-containing protein At5g40410, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo] >XP_023546178.1 pentatricopeptide repeat-containing protein At5g40410, mitochondrial isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1053.9 bits (2724), Expect = 2.0e-304
Identity = 517/593 (87.18%), Postives = 552/593 (93.09%), Query Frame = 0

Query: 1   SRTQYPLLLRSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIG 60
           SRTQYPLLL SF   RQC  AE LVSAL++AV SCTSIS+CR IHARVIKSFLYR+GFIG
Sbjct: 35  SRTQYPLLLWSFHSIRQCVAAEGLVSALVIAVKSCTSISSCRGIHARVIKSFLYRDGFIG 94

Query: 61  DQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSVKP 120
           DQLVTCYNKLGY EDAQK+FDDMP +DL+SWNSLI GFS CL +TL+AF TMKFE SVKP
Sbjct: 95  DQLVTCYNKLGYAEDAQKVFDDMPDRDLVSWNSLICGFSRCLHVTLKAFCTMKFEMSVKP 154

Query: 121 NEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEA 180
           NE+TILSMISACNGAL  G+YIHGFAIKIGVS EV+VVNSLINMYGKS DLTSACRLFEA
Sbjct: 155 NEVTILSMISACNGALDVGRYIHGFAIKIGVSLEVKVVNSLINMYGKSGDLTSACRLFEA 214

Query: 181 IPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLA 240
           IP PN VSWNSIIAA+VTNGCA EG+ CFNKMR FGIEPDEGTILALLQAC+ L VGKLA
Sbjct: 215 IPYPNIVSWNSIIAARVTNGCAGEGVHCFNKMRMFGIEPDEGTILALLQACVHLGVGKLA 274

Query: 241 ESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAH 300
           ESIHGLIFCSGLGA+ITIATALLDLYAKLGRLSAS DVF EVGCADRVAWTAMLAGYAAH
Sbjct: 275 ESIHGLIFCSGLGAQITIATALLDLYAKLGRLSASYDVFGEVGCADRVAWTAMLAGYAAH 334

Query: 301 GLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDH 360
           GLGREAIKLFESM ++GLEPDHVTF HLLSACSHSGLVREGK YF++MSKVYGIEPR+DH
Sbjct: 335 GLGREAIKLFESMAERGLEPDHVTFTHLLSACSHSGLVREGKRYFNLMSKVYGIEPRIDH 394

Query: 361 YSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLE 420
           YSCMVDLLGR GLLNDAY+VI++MPMEPNAGVW ALLGACRVYGNIELGKEVAEHLI+LE
Sbjct: 395 YSCMVDLLGRCGLLNDAYEVIRSMPMEPNAGVWGALLGACRVYGNIELGKEVAEHLIHLE 454

Query: 421 PLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSH 480
           PLDPRNYIMLSNMY+A+RSWKDAAKVRALLKERGLKRTPG SSIEY NKIH+FFVGDRSH
Sbjct: 455 PLDPRNYIMLSNMYAAARSWKDAAKVRALLKERGLKRTPGWSSIEYGNKIHQFFVGDRSH 514

Query: 481 SETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKE 540
            ET KIYSKLEELLGKIR+ GYSSKTEYVLQ+VEEE+KEDMINKHSEKLAIAFG+LVSKE
Sbjct: 515 PETEKIYSKLEELLGKIRKTGYSSKTEYVLQDVEEEIKEDMINKHSEKLAIAFGLLVSKE 574

Query: 541 GAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           G PLIITKN+RICGDCH+TAKLISLIEKRTIIIRDPKRFHHFSDG CSC+DYW
Sbjct: 575 GDPLIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSDGFCSCSDYW 627

BLAST of Bhi04G000073 vs. NCBI nr
Match: XP_004143073.2 (PREDICTED: pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucumis sativus] >KGN62278.1 hypothetical protein Csa_2G348160 [Cucumis sativus])

HSP 1 Score: 1049.3 bits (2712), Expect = 5.0e-303
Identity = 521/594 (87.71%), Postives = 547/594 (92.09%), Query Frame = 0

Query: 1   SRTQYPLLL-RSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFI 60
           SRTQYPLLL RSF L RQC T EA+VSALL+AVNSC SISNCR+IHARV KS LYR+GFI
Sbjct: 35  SRTQYPLLLHRSFHLVRQCATPEAIVSALLIAVNSCPSISNCREIHARVFKSLLYRDGFI 94

Query: 61  GDQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSVK 120
           GDQLVTCYNKLGY EDA KLFDDMP KDL+SWNSLISGFS CL ++L AFYTMKFE SVK
Sbjct: 95  GDQLVTCYNKLGYAEDALKLFDDMPHKDLVSWNSLISGFSRCLHMSLTAFYTMKFEMSVK 154

Query: 121 PNEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFE 180
           PNE+TILSMISAC+GAL AGKYIHGF IK+G + EV+V NSLINMYGKS DLTSACRLFE
Sbjct: 155 PNEVTILSMISACSGALDAGKYIHGFGIKVGGTLEVKVANSLINMYGKSGDLTSACRLFE 214

Query: 181 AIPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKL 240
           AIPDPNTVSWNSIIAAQVTNGCAREGID FNKMRR GIE DEGTILALLQACL L VGKL
Sbjct: 215 AIPDPNTVSWNSIIAAQVTNGCAREGIDYFNKMRRLGIEQDEGTILALLQACLHLGVGKL 274

Query: 241 AESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAA 300
           AESIHGL+FC+G GAKITIATALLD YAKLGRLSAS  VF EVG ADRVAWTAMLAGYAA
Sbjct: 275 AESIHGLMFCTGFGAKITIATALLDTYAKLGRLSASYGVFTEVGFADRVAWTAMLAGYAA 334

Query: 301 HGLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMD 360
           HGLGREAIKLFESM  KGLEPDHVTF HLLSACSHSGLV EGKSYF++MS+VYGIEPR+D
Sbjct: 335 HGLGREAIKLFESMANKGLEPDHVTFTHLLSACSHSGLVNEGKSYFNVMSEVYGIEPRVD 394

Query: 361 HYSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINL 420
           HYSCMVDLLGR GLLNDAY+VIQNMPMEPNAGVW ALLGACRV+GNIELGKEVAEHLIN+
Sbjct: 395 HYSCMVDLLGRCGLLNDAYEVIQNMPMEPNAGVWGALLGACRVHGNIELGKEVAEHLINM 454

Query: 421 EPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRS 480
           EPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPG SSIEY NK H FFVGDRS
Sbjct: 455 EPLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGYSSIEYGNKNHHFFVGDRS 514

Query: 481 HSETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSK 540
           H ET KIYSKLEELLGKIR+AGYSSKTEYVLQ+VEEEVKEDMINKHSEKLAIAFG+LVSK
Sbjct: 515 HPETEKIYSKLEELLGKIRKAGYSSKTEYVLQDVEEEVKEDMINKHSEKLAIAFGLLVSK 574

Query: 541 EGAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           EG  LIITKN+RICGDCH+TAKLISLIEKRTIIIRDPKRFHHFSDG CSCADYW
Sbjct: 575 EGEALIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSDGFCSCADYW 628

BLAST of Bhi04G000073 vs. NCBI nr
Match: XP_022997515.1 (pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1042.0 bits (2693), Expect = 8.0e-301
Identity = 510/593 (86.00%), Postives = 549/593 (92.58%), Query Frame = 0

Query: 1   SRTQYPLLLRSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIG 60
           SRTQYPLLLR F   R C  AEALVSAL++AV SCTSIS+CR IHARVIKSFLYR+GFIG
Sbjct: 35  SRTQYPLLLRPFHPIRHCVAAEALVSALVIAVKSCTSISSCRGIHARVIKSFLYRDGFIG 94

Query: 61  DQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSVKP 120
           DQLVTCYNKLGY EDAQK+FDDMP +DL+SWNSLI GFS CL +TL+AF TMKFE SVKP
Sbjct: 95  DQLVTCYNKLGYAEDAQKVFDDMPDRDLVSWNSLICGFSRCLHVTLKAFCTMKFEMSVKP 154

Query: 121 NEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEA 180
           NE+TILSMISACNGAL  G+YIHGFAIKIGVS EV+VVNS INMYGKS DLTSACRLFEA
Sbjct: 155 NEVTILSMISACNGALDVGRYIHGFAIKIGVSLEVKVVNSFINMYGKSGDLTSACRLFEA 214

Query: 181 IPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLA 240
           IP PN VSWNSIIAA+VTNGCA EG+ CFNKMR FG+EPDEGTILALLQAC+ L VGKLA
Sbjct: 215 IPYPNIVSWNSIIAARVTNGCAGEGVHCFNKMRMFGMEPDEGTILALLQACVHLGVGKLA 274

Query: 241 ESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAH 300
           ESIHGLIFCSGLGA+I IATALLDLYAKLGRLSAS DVF EVGCADRVAWTAMLAGYAAH
Sbjct: 275 ESIHGLIFCSGLGAQIAIATALLDLYAKLGRLSASYDVFGEVGCADRVAWTAMLAGYAAH 334

Query: 301 GLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDH 360
           GLGREAIKLFE+M ++GLEPDHVTF HLLSACSHSGLVREGK YF++MS+VYGIEPR+DH
Sbjct: 335 GLGREAIKLFENMAERGLEPDHVTFTHLLSACSHSGLVREGKRYFNLMSEVYGIEPRIDH 394

Query: 361 YSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLE 420
           YSCMVDLLGR GLLNDAY+VI++MPMEPNAGVW ALLGACRVYGNIELGKEVAEHLI+LE
Sbjct: 395 YSCMVDLLGRCGLLNDAYEVIRSMPMEPNAGVWGALLGACRVYGNIELGKEVAEHLIHLE 454

Query: 421 PLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSH 480
           PLDPRNYIMLSNMY+A+ SWKDAAKVRALLKERGLKRTPG SSIEY NKIH+FFVGDRSH
Sbjct: 455 PLDPRNYIMLSNMYAAACSWKDAAKVRALLKERGLKRTPGWSSIEYGNKIHQFFVGDRSH 514

Query: 481 SETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKE 540
            ET KIYSKLEELLGKIR+ GYSSKTEYVLQ+VEEE+KEDMINKHSEKLAIAFG+LVSKE
Sbjct: 515 PETEKIYSKLEELLGKIRKTGYSSKTEYVLQDVEEEIKEDMINKHSEKLAIAFGLLVSKE 574

Query: 541 GAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADYW 594
           G PLIITKN+RICGDCH+TAKLISLIEKRTIIIRDPKRFHHFS+G CSC+DYW
Sbjct: 575 GDPLIITKNLRICGDCHSTAKLISLIEKRTIIIRDPKRFHHFSNGFCSCSDYW 627

BLAST of Bhi04G000073 vs. NCBI nr
Match: XP_022961715.1 (pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita moschata])

HSP 1 Score: 1035.8 bits (2677), Expect = 5.7e-299
Identity = 508/592 (85.81%), Postives = 548/592 (92.57%), Query Frame = 0

Query: 1   SRTQYPLLLRSFRLTRQCPTAEALVSALLMAVNSCTSISNCRKIHARVIKSFLYREGFIG 60
           SRTQYPLLLR F   RQC  AEALVSAL++AV SCTSIS+CR IHARVIKS LYR+GFIG
Sbjct: 35  SRTQYPLLLRPFHPIRQCVAAEALVSALVIAVKSCTSISSCRGIHARVIKSSLYRDGFIG 94

Query: 61  DQLVTCYNKLGYVEDAQKLFDDMPVKDLISWNSLISGFSWCLDITLEAFYTMKFERSVKP 120
           DQLV+CYNKLGY  DAQK+FDDMP +DL+SWNSLI GFS CL +TL+AF TMKFE SVKP
Sbjct: 95  DQLVSCYNKLGYAVDAQKVFDDMPDRDLVSWNSLICGFSRCLHVTLKAFCTMKFEMSVKP 154

Query: 121 NEITILSMISACNGALGAGKYIHGFAIKIGVSFEVEVVNSLINMYGKSEDLTSACRLFEA 180
           NE+TILSMISACNGAL  G+Y+HGFAIKIGVS EV+VVNSLINMYGKS DLTSACRLFEA
Sbjct: 155 NEVTILSMISACNGALDVGRYVHGFAIKIGVSLEVKVVNSLINMYGKSGDLTSACRLFEA 214

Query: 181 IPDPNTVSWNSIIAAQVTNGCAREGIDCFNKMRRFGIEPDEGTILALLQACLQLSVGKLA 240
           IP PN VSWNSIIAA VTN CA EG+ CFNKMR FG+EPDEGTILALLQAC+ L VGKLA
Sbjct: 215 IPYPNIVSWNSIIAAHVTNDCAGEGVHCFNKMRMFGMEPDEGTILALLQACVHLGVGKLA 274

Query: 241 ESIHGLIFCSGLGAKITIATALLDLYAKLGRLSASCDVFREVGCADRVAWTAMLAGYAAH 300
           ESIHGLIFCSGLGA+ITIATALLDLYAKLGRLSAS DVF EVGCADRVAWTAMLAGYAAH
Sbjct: 275 ESIHGLIFCSGLGAQITIATALLDLYAKLGRLSASYDVFGEVGCADRVAWTAMLAGYAAH 334

Query: 301 GLGREAIKLFESMTKKGLEPDHVTFIHLLSACSHSGLVREGKSYFSMMSKVYGIEPRMDH 360
           GLGREAIKLFESM ++GLEPDHVTF HLLSACSHSGLVREGK YF++MS+VYGIEPR+DH
Sbjct: 335 GLGREAIKLFESMAERGLEPDHVTFTHLLSACSHSGLVREGKRYFNLMSEVYGIEPRIDH 394

Query: 361 YSCMVDLLGRRGLLNDAYDVIQNMPMEPNAGVWSALLGACRVYGNIELGKEVAEHLINLE 420
           YSCMVDLLGR GLLNDAY+VI++MPMEPNAGVW ALLGACRVYGNIELGKEVAEHLI+LE
Sbjct: 395 YSCMVDLLGRCGLLNDAYEVIRSMPMEPNAGVWGALLGACRVYGNIELGKEVAEHLIHLE 454

Query: 421 PLDPRNYIMLSNMYSASRSWKDAAKVRALLKERGLKRTPGCSSIEYRNKIHRFFVGDRSH 480
           PLDPRNYIMLSNMY+A+RSWKDAAKVRALLKERGLKRTPG SSIEY NKIH+FFVGDRSH
Sbjct: 455 PLDPRNYIMLSNMYAAARSWKDAAKVRALLKERGLKRTPGWSSIEYGNKIHQFFVGDRSH 514

Query: 481 SETGKIYSKLEELLGKIRQAGYSSKTEYVLQNVEEEVKEDMINKHSEKLAIAFGILVSKE 540
            ET KIYSKLEELLGKIR+ GYSSKTEYVLQ+VEEE+KEDMINKHSEKLAIAFG+LVSKE
Sbjct: 515 PETEKIYSKLEELLGKIRKTGYSSKTEYVLQDVEEEIKEDMINKHSEKLAIAFGLLVSKE 574

Query: 541 GAPLIITKNIRICGDCHTTAKLISLIEKRTIIIRDPKRFHHFSDGLCSCADY 593
           G PLIITKN+RICGDCH+TAKLISLIEKRT+IIRDPKRFHHFS+G CSC+DY
Sbjct: 575 GDPLIITKNLRICGDCHSTAKLISLIEKRTLIIRDPKRFHHFSNGFCSCSDY 626

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
sp|Q9FND6|PP411_ARATH2.8e-19658.86Pentatricopeptide repeat-containing protein At5g40410, mitochondrial OS=Arabidop... [more]
sp|Q3E6Q1|PPR32_ARATH3.0e-12639.27Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
sp|O81767|PP348_ARATH7.5e-12538.78Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX... [more]
sp|Q9SMZ2|PP347_ARATH6.5e-12138.00Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX... [more]
sp|Q9LW63|PP251_ARATH5.5e-12037.84Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
AT5G40410.11.5e-19758.86Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G11290.11.7e-12739.27Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G33990.14.1e-12638.78Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G33170.13.6e-12238.00Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G23330.13.1e-12137.84Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
tr|A0A1S3BBW7|A0A1S3BBW7_CUCME7.1e-30687.56LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g40410, mito... [more]
tr|A0A0A0LKE6|A0A0A0LKE6_CUCSA3.3e-30387.71Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G348160 PE=4 SV=1[more]
tr|A0A251MQR1|A0A251MQR1_PRUPE2.0e-22867.01Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_8G006400 PE=4 SV=1[more]
tr|M5VWG5|M5VWG5_PRUPE2.8e-22567.73Uncharacterized protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa021080m... [more]
tr|A0A2P5F861|A0A2P5F861_9ROSA2.2e-22266.09DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_103730 ... [more]
Match NameE-valueIdentityDescription
XP_008445305.21.1e-30587.56PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g... [more]
XP_023546177.12.0e-30487.18pentatricopeptide repeat-containing protein At5g40410, mitochondrial isoform X1 ... [more]
XP_004143073.25.0e-30387.71PREDICTED: pentatricopeptide repeat-containing protein At5g40410, mitochondrial ... [more]
XP_022997515.18.0e-30186.00pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita ... [more]
XP_022961715.15.7e-29985.81pentatricopeptide repeat-containing protein At5g40410, mitochondrial [Cucurbita ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR032867DYW_dom
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi04M000073Bhi04M000073mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 8..141
e-value: 8.9E-12
score: 47.0
coord: 242..529
e-value: 1.8E-36
score: 128.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 143..241
e-value: 1.4E-17
score: 65.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 286..333
e-value: 3.7E-12
score: 46.1
coord: 184..231
e-value: 4.5E-9
score: 36.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 187..221
e-value: 4.4E-6
score: 24.5
coord: 288..321
e-value: 6.9E-7
score: 27.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 360..385
e-value: 0.27
score: 11.5
coord: 158..179
e-value: 0.13
score: 12.5
coord: 63..86
e-value: 0.03
score: 14.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 286..320
score: 12.255
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 423..457
score: 7.278
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 357..387
score: 6.489
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 255..285
score: 5.305
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 389..419
score: 5.229
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 154..184
score: 6.325
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 56..90
score: 6.796
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 321..356
score: 7.684
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 185..219
score: 10.523
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 459..582
e-value: 4.5E-38
score: 129.9
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 20..514
NoneNo IPR availablePANTHERPTHR24015:SF669SUBFAMILY NOT NAMEDcoord: 20..514