Bhi04G002002 (gene) Wax gourd (B227) v1

Overview
NameBhi04G002002
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationchr4: 66775839 .. 66778763 (-)
RNA-Seq ExpressionBhi04G002002
SyntenyBhi04G002002
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAACCCCACATACCAGAATTAGCTACTCGAGTGAGCAGAGCCATACTTTCAATTTCAAATCACACAAGCCCGGCTGGATCATGGACCCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTTCGCCAAACGTTAAATCCATCTCTCGTCTCTCAAGTCATCGACCCTCATCTTCTTACCCATCACTCCCTCGCTCTCGGTTTCTTCAATTGGGCTTCTCAGCAACCTGGTTTCGCCCACAATTCCGATTCATACAAGTCGATTCTCAAGTCTCTCTCTCTTTCACGCCAATTTGGGGCCATTCATAGTCTCTTGAAACAGGTAAAAACTCAGAAAATTGGCCTCGATTTATCAGTTTATTGCTCTGTTATTGATTCCTTGATCATTGGCAAGAAGACCCATGATGCTTTTTTGGTTTTTAATGAGGTTACTGATGTTATTGGATCCGAATCATGTAATTCGCTTTTGGCTGCTCTTGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAAGTTTTCGGTGAAATGTCTCTTAAATGCATTCCTTTTAACACTCTTGGCTTTGGTGTGTTTATATGGAGGGTTTGTAGAAATACTGATGTAGTTAAAGTTTTGAACATGATAGATGATGCCAGGACCAACAATTCGGAGATCAATGGTTCTGTTATTGCCACATTGATCATTCATGGGCTCTGTGGGGCGTCTAGACTTGCAGAAGCTTCAAACATTTTGGATGAGCTTAAGAATAGGGGTTGCAAGCCCGACTTTTTGACGTATTGGATTCTTGGAGAAGCGTTTCGGTCATCAGGGAATGTGGTTGACAGGGAGAAAATCCTGAAGAAGAAGAGAAAGTTGGGGGTAGCTCCAAGGCTTAATGAATACAAGGAGTATTTATTTGCTTTAATAGCTGGAAGACGGATATGTGAAGCTAAAGAGTTAGGTGAAGTTATTGTCAAAGGAAATTTTCCTATGGATGAGGAGGTTTCTAATGTGCTGATAGGGTCAGTGGCTTCCATTGATCCTTATTCTGCTATTATTTTCTTCAAGTTCATGGTCGAGAAAGGGAGATTCCCAACTCTCTTGACTTTAAGAAATCTGAGTAGGAATTTATGTAAGCATGGAAAGATTGATGAACTGTTGGAAGTTTACCAAGTTCTGAGTAGAAATAACTACTTCAATGATTTTGATAGATATCATTTAAGAATTTCATTCTTATGCAAGGCTGGAATGGTGAAAGAGGCCTATGGTGTTTTGCAGGAGATGAAGAAAAATGGATTTACCCCTGATGTATCTTTTTACAATTCTGTCCTAGAAACATGTTGTAGAGAAGATTTGCTTCGGCCTGCTAAGAAGCTGTGGGATGAGATGTTTGCTAGTGGCTGTGATGGTAATTTGAAGACGTATAACATCCTCATTCAAAAGTTCTCAAAATCCAATCAAATCGAGGAAGCTTTGGTGCTTTACCATCATATGCTTGGAAAATCGGTTGAACCCGATATTACAATCTACATGTCCCTGCTTCAAGGGCTCTGTCAGCATTCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCTGTAGAACAGGATGTAAATCTTGCGGGAACCTTGCTGAGCACTTTTATCTTGTGTCTATGTAAAGCAGGCACGTTATTTTCCCTGGTTACATATCTGATGTTAATTCGTGTGTGTTTCCCATTTGTATATATATGCTTATTGTCTAGAGTCATTTGTTCTCACCTGACATTCCCCTACATCCTCTGCGACTTGACCCTATCATCCTTGCTTGTCTTGAAGAGCATGCCTAACTTTGGAGTTCTTATGATTGAGCCATCAAAAAGGAAGGTATATCTTGTTGGTATAGATAGTAACTTTTAATCACGGGTCCTTTTAGCATATATTGTCCTCACTCACATGTTTTCTAAGAAAATTCTCATGAGGTCACCCAACATAAGATTCCTCCAAGCTAAGTATGTTTAACTTTAGAGTTTTCATGATTGAACCACCGAAAAGGAAGGTGTACCTCGTTGGCTTGTCCTCACTCACATGCTTTCTAGGAAAATTTCTAGGAAGTCACCTAACATAGGATTGCTCCAAACGAAGCATGCTTAACTTTGAAGTTTTCATAATTGAACTACCGAAAAGGAAGATGCACTTTGTTGGTATAGGTAGTAATTATCAATTCTTTTAAACCCTTTCTTAACCTTATTTTCATACCCTCGAGATCCCTTTCATTCAAATGTGATATCGATTCATTCATGTACCCACCTAAACTCGGGTATTATACTTAACAAGTCCCTTGGTCACCGCCATAGCTGACCCTTTATAAACCTCTCTTCTATGGGCTATTTGGGGTTATCTCTTAGATTACACCATTTTTGGCCTTGTTACCTAGACACAAATCATAAACAATCAAAGCTATTGTTGACTCTGTAAGATCCTCGCTTCTATTGGCCAACTCACTCTCTTTGGAATTAATCTCTTTGACTACACCGTTTTCAGCTTTGTACCCAGACACTCATCTTGAATTAGCAGCGTTCGCATCTAGCATCTTCTTATTCTAAGGAGGGTTCTGTAGAACCAGAAAACGGTTATGTACTCAACATTTTAATTTAATGATGGAGCTTTTATTTAACTTGAATTTTAAACAATGCAGGTCATTTCCTTGCTGCATCCAAGTTACTCTGTGGTCTATCAAGCGACATTGCTCACCCAGTCACCCATGTAACTTTACTGAAAGGTTTTGCAGATGCTGGAAAGGTTCCACTGGCTAAGCAACATTTAGAATGGGTTCAAGAGACTTCTCCATCAATGTTGTCTGTTATATCCAGTGAGTTATTAGCATTTCTTCCTTCCTCCCCAAGAGCAGATCAAATTTTACAGATTCTTCAAACAATACAAGAACTATCACGTTTTAGCAATTGA

mRNA sequence

ATGAAACCCCACATACCAGAATTAGCTACTCGAGTGAGCAGAGCCATACTTTCAATTTCAAATCACACAAGCCCGGCTGGATCATGGACCCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTTCGCCAAACGTTAAATCCATCTCTCGTCTCTCAAGTCATCGACCCTCATCTTCTTACCCATCACTCCCTCGCTCTCGGTTTCTTCAATTGGGCTTCTCAGCAACCTGGTTTCGCCCACAATTCCGATTCATACAAGTCGATTCTCAAGTCTCTCTCTCTTTCACGCCAATTTGGGGCCATTCATAGTCTCTTGAAACAGGTAAAAACTCAGAAAATTGGCCTCGATTTATCAGTTTATTGCTCTGTTATTGATTCCTTGATCATTGGCAAGAAGACCCATGATGCTTTTTTGGTTTTTAATGAGGTTACTGATGTTATTGGATCCGAATCATGTAATTCGCTTTTGGCTGCTCTTGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAAGTTTTCGGTGAAATGTCTCTTAAATGCATTCCTTTTAACACTCTTGGCTTTGGTGTGTTTATATGGAGGGTTTGTAGAAATACTGATGTAGTTAAAGTTTTGAACATGATAGATGATGCCAGGACCAACAATTCGGAGATCAATGGTTCTGTTATTGCCACATTGATCATTCATGGGCTCTGTGGGGCGTCTAGACTTGCAGAAGCTTCAAACATTTTGGATGAGCTTAAGAATAGGGGTTGCAAGCCCGACTTTTTGACGTATTGGATTCTTGGAGAAGCGTTTCGGTCATCAGGGAATGTGGTTGACAGGGAGAAAATCCTGAAGAAGAAGAGAAAGTTGGGGGTAGCTCCAAGGCTTAATGAATACAAGGAGTATTTATTTGCTTTAATAGCTGGAAGACGGATATGTGAAGCTAAAGAGTTAGGTGAAGTTATTGTCAAAGGAAATTTTCCTATGGATGAGGAGGTTTCTAATGTGCTGATAGGGTCAGTGGCTTCCATTGATCCTTATTCTGCTATTATTTTCTTCAAGTTCATGGTCGAGAAAGGGAGATTCCCAACTCTCTTGACTTTAAGAAATCTGAGTAGGAATTTATGTAAGCATGGAAAGATTGATGAACTGTTGGAAGTTTACCAAGTTCTGAGTAGAAATAACTACTTCAATGATTTTGATAGATATCATTTAAGAATTTCATTCTTATGCAAGGCTGGAATGGTGAAAGAGGCCTATGGTGTTTTGCAGGAGATGAAGAAAAATGGATTTACCCCTGATGTATCTTTTTACAATTCTGTCCTAGAAACATGTTGTAGAGAAGATTTGCTTCGGCCTGCTAAGAAGCTGTGGGATGAGATGTTTGCTAGTGGCTGTGATGGTAATTTGAAGACGTATAACATCCTCATTCAAAAGTTCTCAAAATCCAATCAAATCGAGGAAGCTTTGGTGCTTTACCATCATATGCTTGGAAAATCGGTTGAACCCGATATTACAATCTACATGTCCCTGCTTCAAGGGCTCTGTCAGCATTCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCTGTAGAACAGGATGTAAATCTTGCGGGAACCTTGCTGAGCACTTTTATCTTGTGTCTATGTCATTTCCTTGCTGCATCCAAGTTACTCTGTGGTCTATCAAGCGACATTGCTCACCCAGTCACCCATGTAACTTTACTGAAAGGTTTTGCAGATGCTGGAAAGGTTCCACTGGCTAAGCAACATTTAGAATGGGTTCAAGAGACTTCTCCATCAATGTTGTCTGTTATATCCAGTGAGTTATTAGCATTTCTTCCTTCCTCCCCAAGAGCAGATCAAATTTTACAGATTCTTCAAACAATACAAGAACTATCACGTTTTAGCAATTGA

Coding sequence (CDS)

ATGAAACCCCACATACCAGAATTAGCTACTCGAGTGAGCAGAGCCATACTTTCAATTTCAAATCACACAAGCCCGGCTGGATCATGGACCCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTTCGCCAAACGTTAAATCCATCTCTCGTCTCTCAAGTCATCGACCCTCATCTTCTTACCCATCACTCCCTCGCTCTCGGTTTCTTCAATTGGGCTTCTCAGCAACCTGGTTTCGCCCACAATTCCGATTCATACAAGTCGATTCTCAAGTCTCTCTCTCTTTCACGCCAATTTGGGGCCATTCATAGTCTCTTGAAACAGGTAAAAACTCAGAAAATTGGCCTCGATTTATCAGTTTATTGCTCTGTTATTGATTCCTTGATCATTGGCAAGAAGACCCATGATGCTTTTTTGGTTTTTAATGAGGTTACTGATGTTATTGGATCCGAATCATGTAATTCGCTTTTGGCTGCTCTTGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAAGTTTTCGGTGAAATGTCTCTTAAATGCATTCCTTTTAACACTCTTGGCTTTGGTGTGTTTATATGGAGGGTTTGTAGAAATACTGATGTAGTTAAAGTTTTGAACATGATAGATGATGCCAGGACCAACAATTCGGAGATCAATGGTTCTGTTATTGCCACATTGATCATTCATGGGCTCTGTGGGGCGTCTAGACTTGCAGAAGCTTCAAACATTTTGGATGAGCTTAAGAATAGGGGTTGCAAGCCCGACTTTTTGACGTATTGGATTCTTGGAGAAGCGTTTCGGTCATCAGGGAATGTGGTTGACAGGGAGAAAATCCTGAAGAAGAAGAGAAAGTTGGGGGTAGCTCCAAGGCTTAATGAATACAAGGAGTATTTATTTGCTTTAATAGCTGGAAGACGGATATGTGAAGCTAAAGAGTTAGGTGAAGTTATTGTCAAAGGAAATTTTCCTATGGATGAGGAGGTTTCTAATGTGCTGATAGGGTCAGTGGCTTCCATTGATCCTTATTCTGCTATTATTTTCTTCAAGTTCATGGTCGAGAAAGGGAGATTCCCAACTCTCTTGACTTTAAGAAATCTGAGTAGGAATTTATGTAAGCATGGAAAGATTGATGAACTGTTGGAAGTTTACCAAGTTCTGAGTAGAAATAACTACTTCAATGATTTTGATAGATATCATTTAAGAATTTCATTCTTATGCAAGGCTGGAATGGTGAAAGAGGCCTATGGTGTTTTGCAGGAGATGAAGAAAAATGGATTTACCCCTGATGTATCTTTTTACAATTCTGTCCTAGAAACATGTTGTAGAGAAGATTTGCTTCGGCCTGCTAAGAAGCTGTGGGATGAGATGTTTGCTAGTGGCTGTGATGGTAATTTGAAGACGTATAACATCCTCATTCAAAAGTTCTCAAAATCCAATCAAATCGAGGAAGCTTTGGTGCTTTACCATCATATGCTTGGAAAATCGGTTGAACCCGATATTACAATCTACATGTCCCTGCTTCAAGGGCTCTGTCAGCATTCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCTGTAGAACAGGATGTAAATCTTGCGGGAACCTTGCTGAGCACTTTTATCTTGTGTCTATGTCATTTCCTTGCTGCATCCAAGTTACTCTGTGGTCTATCAAGCGACATTGCTCACCCAGTCACCCATGTAACTTTACTGAAAGGTTTTGCAGATGCTGGAAAGGTTCCACTGGCTAAGCAACATTTAGAATGGGTTCAAGAGACTTCTCCATCAATGTTGTCTGTTATATCCAGTGAGTTATTAGCATTTCTTCCTTCCTCCCCAAGAGCAGATCAAATTTTACAGATTCTTCAAACAATACAAGAACTATCACGTTTTAGCAATTGA

Protein sequence

MKPHIPELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCSVIDSLIIGKKTHDAFLVFNEVTDVIGSESCNSLLAALASDGFFEHAQKVFGEMSLKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCHFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQETSPSMLSVISSELLAFLPSSPRADQILQILQTIQELSRFSN
Homology
BLAST of Bhi04G002002 vs. TAIR 10
Match: AT5G14080.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 636.0 bits (1639), Expect = 3.3e-182
Identity = 318/628 (50.64%), Postives = 442/628 (70.38%), Query Frame = 0

Query: 7   ELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLA 66
           ELA R+ R +L +S  +  A  W+P +EQ+LH LGFR +++PSLV++VIDP LL HHSLA
Sbjct: 6   ELAVRIGRELLKVSGSSRAARIWSPLIEQSLHGLGFRHSISPSLVARVIDPFLLNHHSLA 65

Query: 67  LGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCSVID 126
           LGFFNWA+QQPG++H+S SY SI KSLSLSRQF A+ +L KQVK+ KI LD SVY S+ID
Sbjct: 66  LGFFNWAAQQPGYSHDSISYHSIFKSLSLSRQFSAMDALFKQVKSNKILLDSSVYRSLID 125

Query: 127 SLIIGKKTHDAFLVFNEVTDV---IGSESCNSLLAALASDGFFEHAQKVFGEMSLKCIPF 186
           +L++G+K   AF V  E       I  + CN LLA L SDG +++AQK+F +M  K +  
Sbjct: 126 TLVLGRKAQSAFWVLEEAFSTGQEIHPDVCNRLLAGLTSDGCYDYAQKLFVKMRHKGVSL 185

Query: 187 NTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAEASNI 246
           NTLGFGV+I   CR+++  ++L ++D+ +  N  INGS+IA LI+H LC  SR  +A  I
Sbjct: 186 NTLGFGVYIGWFCRSSETNQLLRLVDEVKKANLNINGSIIALLILHSLCKCSREMDAFYI 245

Query: 247 LDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIA 306
           L+EL+N  CKPDF+ Y ++ EAF  +GN+ +R+ +LKKKRKLGVAPR ++Y+ ++  LI+
Sbjct: 246 LEELRNIDCKPDFMAYRVIAEAFVVTGNLYERQVVLKKKRKLGVAPRSSDYRAFILDLIS 305

Query: 307 GRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRFPTLLTL 366
            +R+ EAKE+ EVIV G FPMD ++ + LIGSV+++DP SA+ F  +MV  G+ P + TL
Sbjct: 306 AKRLTEAKEVAEVIVSGKFPMDNDILDALIGSVSAVDPDSAVEFLVYMVSTGKLPAIRTL 365

Query: 367 RNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKK 426
             LS+NLC+H K D L++ Y++LS   YF++   Y L ISFLCKAG V+E+Y  LQEMKK
Sbjct: 366 SKLSKNLCRHDKSDHLIKAYELLSSKGYFSELQSYSLMISFLCKAGRVRESYTALQEMKK 425

Query: 427 NGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEE 486
            G  PDVS YN+++E CC+ +++RPAKKLWDEMF  GC  NL TYN+LI+K S+  + EE
Sbjct: 426 EGLAPDVSLYNALIEACCKAEMIRPAKKLWDEMFVEGCKMNLTTYNVLIRKLSEEGEAEE 485

Query: 487 ALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQD-VNLAGTLLSTFI 546
           +L L+  ML + +EPD TIYMSL++GLC+ +++EAA EVF K +E+D   +   +LS F+
Sbjct: 486 SLRLFDKMLERGIEPDETIYMSLIEGLCKETKIEAAMEVFRKCMERDHKTVTRRVLSEFV 545

Query: 547 LCLC---HFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQETSPSML 606
           L LC   H   AS+LL      + H   HV LLK  ADA +V +  +H++W++E SPS++
Sbjct: 546 LNLCSNGHSGEASQLL-REREHLEHTGAHVVLLKCVADAKEVEIGIRHMQWIKEVSPSLV 605

Query: 607 SVISSELLAFLPSSPRADQILQILQTIQ 628
             ISS+LLA   SS   D IL  ++ I+
Sbjct: 606 HTISSDLLASFCSSSDPDSILPFIRAIE 632

BLAST of Bhi04G002002 vs. TAIR 10
Match: AT5G61990.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 156.0 bits (393), Expect = 1.0e-37
Identity = 111/462 (24.03%), Postives = 207/462 (44.81%), Query Frame = 0

Query: 85  SYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCSVIDSLIIGKKTHDAFLVFNEV 144
           +Y  ++  L   ++     SLL ++ +  + LD   Y  +ID L+ G+    A  + +E+
Sbjct: 279 TYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLLIDGLLKGRNADAAKGLVHEM 338

Query: 145 TD---VIGSESCNSLLAALASDGFFEHAQKVFGEMSLKCIPFNTLGFGVFIWRVCRNTDV 204
                 I     +  +  ++ +G  E A+ +F  M    +      +   I   CR  +V
Sbjct: 339 VSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGLIPQAQAYASLIEGYCREKNV 398

Query: 205 VKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAEASNILDELKNRGCKPDFLTYWI 264
            +   ++ + +  N  I+     T ++ G+C +  L  A NI+ E+   GC+P+ + Y  
Sbjct: 399 RQGYELLVEMKKRNIVISPYTYGT-VVKGMCSSGDLDGAYNIVKEMIASGCRPNVVIYTT 458

Query: 265 LGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKE-LGEVIVKG 324
           L + F  +    D  ++LK+ ++ G+AP +  Y   +  L   +R+ EA+  L E++  G
Sbjct: 459 LIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENG 518

Query: 325 NFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELL 384
             P        + G + + +  SA  + K M E G  P  +    L    CK GK+ E  
Sbjct: 519 LKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEAC 578

Query: 385 EVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETC 444
             Y+ +       D   Y + ++ L K   V +A  + +EM+  G  PDV  Y  ++   
Sbjct: 579 SAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLINGF 638

Query: 445 CREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDI 504
            +   ++ A  ++DEM   G   N+  YN+L+  F +S +IE+A  L   M  K + P+ 
Sbjct: 639 SKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLHPNA 698

Query: 505 TIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTLLSTFI 543
             Y +++ G C+   L  AF +F      ++ L G +  +F+
Sbjct: 699 VTYCTIIDGYCKSGDLAEAFRLF-----DEMKLKGLVPDSFV 734

BLAST of Bhi04G002002 vs. TAIR 10
Match: AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 154.8 bits (390), Expect = 2.3e-37
Identity = 141/563 (25.04%), Postives = 240/563 (42.63%), Query Frame = 0

Query: 64  SLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCS 123
           S ++  F+W   Q G+ H+ D Y+ ++  L  + +F  I  LL Q+K + I    S++ S
Sbjct: 92  STSMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQMKDEGIVFKESLFIS 151

Query: 124 VIDSLIIGKKTHDAFLVFNEVTDVIGSE----SCNSLLAALASDGFFEHAQKVFGEMSLK 183
           ++              +  E+ +V   E    S N +L  L S    + A  VF +M  +
Sbjct: 152 IMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSR 211

Query: 184 CIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAE 243
            IP     FGV +   C   ++   L+++ D  T +  +  SVI   +IH L   +R+ E
Sbjct: 212 KIPPTLFTFGVVMKAFCAVNEIDSALSLLRD-MTKHGCVPNSVIYQTLIHSLSKCNRVNE 271

Query: 244 ASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLF 303
           A  +L+E+   GC PD  T+  +         + +  K++ +    G AP    Y   + 
Sbjct: 272 ALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMN 331

Query: 304 ALIAGRRICEAKEL------GEVIV----------KGNFPMDEEVSNVLI---GSVASID 363
            L    R+  AK+L       E+++           G     + V + ++   G V  + 
Sbjct: 332 GLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVC 391

Query: 364 PYSAIIF--------------FKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVYQVL 423
            Y+++I+                 M  KG  P + +   L    CK GKIDE   V   +
Sbjct: 392 TYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEM 451

Query: 424 SRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCREDLL 483
           S +    +   ++  IS  CK   + EA  + +EM + G  PDV  +NS++   C  D +
Sbjct: 452 SADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVDEI 511

Query: 484 RPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIYMSL 543
           + A  L  +M + G   N  TYN LI  F +  +I+EA  L + M+ +    D   Y SL
Sbjct: 512 KHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLVNEMVFQGSPLDEITYNSL 571

Query: 544 LQGLCQHSQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCHF--------LAASKLLCGL 582
           ++GLC+  +++ A  +F K +      +    +  I  LC               +L G 
Sbjct: 572 IKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLRGS 631

BLAST of Bhi04G002002 vs. TAIR 10
Match: AT1G06710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 152.1 bits (383), Expect = 1.5e-36
Identity = 115/498 (23.09%), Postives = 220/498 (44.18%), Query Frame = 0

Query: 42  FRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGA 101
           FR+ L+ SLV +V+   L+   S  + FF WA +Q G+ H +  Y +++  +        
Sbjct: 126 FREKLSESLVIEVL--RLIARPSAVISFFVWAGRQIGYKHTAPVYNALVDLIVRDDDEKV 185

Query: 102 IHSLLKQVKTQKIGLDLSVYCSVIDSLIIGKKTHDAFLVFNEVTDVI-------GSESCN 161
               L+Q++      D  V+   ++ L+     + +F +  E    +          + N
Sbjct: 186 PEEFLQQIRDD----DKEVFGEFLNVLVRKHCRNGSFSIALEELGRLKDFRFRPSRSTYN 245

Query: 162 SLLAALASDGFFEHAQKVFGEMSLKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTN 221
            L+ A       + A  +  EMSL  +  +      F + +C+     + L +++     
Sbjct: 246 CLIQAFLKADRLDSASLIHREMSLANLRMDGFTLRCFAYSLCKVGKWREALTLVE----T 305

Query: 222 NSEINGSVIATLIIHGLCGASRLAEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVD 281
            + +  +V  T +I GLC AS   EA + L+ ++   C P+ +TY  L     +   +  
Sbjct: 306 ENFVPDTVFYTKLISGLCEASLFEEAMDFLNRMRATSCLPNVVTYSTLLCGCLNKKQLGR 365

Query: 282 REKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIG 341
            +++L      G  P    +   + A         A +L + +VK        V N+LIG
Sbjct: 366 CKRVLNMMMMEGCYPSPKIFNSLVHAYCTSGDHSYAYKLLKKMVKCGHMPGYVVYNILIG 425

Query: 342 SV-ASIDPYSAIIF------FKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVYQVLS 401
           S+    D  +  +       +  M+  G     + + + +R LC  GK ++   V + + 
Sbjct: 426 SICGDKDSLNCDLLDLAEKAYSEMLAAGVVLNKINVSSFTRCLCSAGKYEKAFSVIREMI 485

Query: 402 RNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCREDLLR 461
              +  D   Y   +++LC A  ++ A+ + +EMK+ G   DV  Y  ++++ C+  L+ 
Sbjct: 486 GQGFIPDTSTYSKVLNYLCNASKMELAFLLFEEMKRGGLVADVYTYTIMVDSFCKAGLIE 545

Query: 462 PAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIYMSLL 521
            A+K ++EM   GC  N+ TY  LI  + K+ ++  A  L+  ML +   P+I  Y +L+
Sbjct: 546 QARKWFNEMREVGCTPNVVTYTALIHAYLKAKKVSYANELFETMLSEGCLPNIVTYSALI 605

Query: 522 QGLCQHSQLEAAFEVFSK 526
            G C+  Q+E A ++F +
Sbjct: 606 DGHCKAGQVEKACQIFER 613

BLAST of Bhi04G002002 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 146.7 bits (369), Expect = 6.1e-35
Identity = 129/537 (24.02%), Postives = 230/537 (42.83%), Query Frame = 0

Query: 64  SLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCS 123
           S AL  FN AS++P F+     Y+ IL  L  S  F  +  +L+ +K+ +  +  S +  
Sbjct: 64  SAALRLFNLASKKPNFSPEPALYEEILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLI 123

Query: 124 VIDSLIIGKKTHDAFLVFNEVTDVIG----SESCNSLLAALASDGFFEHAQKVFGEMSLK 183
           +I+S    +   +   V + + D  G    +   N +L  L      +  +    +MS+ 
Sbjct: 124 LIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMSVW 183

Query: 184 CIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDART------------------NNSEING- 243
            I  +   F V I  +CR   +   + M++D  +                     +++G 
Sbjct: 184 GIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGA 243

Query: 244 ---------------SVIATLIIHGLCGASRLAEASNILDELKNR-GCKPDFLTYWILGE 303
                          +V   +I+HG C   R+ +A N + E+ N+ G  PD  T+  L  
Sbjct: 244 LRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVN 303

Query: 304 AFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKELGEVIVKGNFPM 363
               +G+V    +I+    + G  P +  Y   +  L     + EA E+ + ++  +   
Sbjct: 304 GLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSP 363

Query: 364 DEEVSNVLIGSVASIDPY-SAIIFFKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVY 423
           +    N LI ++   +    A    + +  KG  P + T  +L + LC        +E++
Sbjct: 364 NTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELF 423

Query: 424 QVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCRE 483
           + +       D   Y++ I  LC  G + EA  +L++M+ +G    V  YN++++  C+ 
Sbjct: 424 EEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKA 483

Query: 484 DLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIY 543
           +  R A++++DEM   G   N  TYN LI    KS ++E+A  L   M+ +  +PD   Y
Sbjct: 484 NKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTY 543

Query: 544 MSLLQGLCQHSQLEAAFEVF----SKSVEQDVNLAGTLLSTFILCLC-HFLAASKLL 556
            SLL   C+   ++ A ++     S   E D+   GTL+S   LC       ASKLL
Sbjct: 544 NSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISG--LCKAGRVEVASKLL 598

BLAST of Bhi04G002002 vs. ExPASy Swiss-Prot
Match: Q9FMU2 (Pentatricopeptide repeat-containing protein At5g14080 OS=Arabidopsis thaliana OX=3702 GN=At5g14080 PE=2 SV=2)

HSP 1 Score: 636.0 bits (1639), Expect = 4.7e-181
Identity = 318/628 (50.64%), Postives = 442/628 (70.38%), Query Frame = 0

Query: 7   ELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLLTHHSLA 66
           ELA R+ R +L +S  +  A  W+P +EQ+LH LGFR +++PSLV++VIDP LL HHSLA
Sbjct: 6   ELAVRIGRELLKVSGSSRAARIWSPLIEQSLHGLGFRHSISPSLVARVIDPFLLNHHSLA 65

Query: 67  LGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCSVID 126
           LGFFNWA+QQPG++H+S SY SI KSLSLSRQF A+ +L KQVK+ KI LD SVY S+ID
Sbjct: 66  LGFFNWAAQQPGYSHDSISYHSIFKSLSLSRQFSAMDALFKQVKSNKILLDSSVYRSLID 125

Query: 127 SLIIGKKTHDAFLVFNEVTDV---IGSESCNSLLAALASDGFFEHAQKVFGEMSLKCIPF 186
           +L++G+K   AF V  E       I  + CN LLA L SDG +++AQK+F +M  K +  
Sbjct: 126 TLVLGRKAQSAFWVLEEAFSTGQEIHPDVCNRLLAGLTSDGCYDYAQKLFVKMRHKGVSL 185

Query: 187 NTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAEASNI 246
           NTLGFGV+I   CR+++  ++L ++D+ +  N  INGS+IA LI+H LC  SR  +A  I
Sbjct: 186 NTLGFGVYIGWFCRSSETNQLLRLVDEVKKANLNINGSIIALLILHSLCKCSREMDAFYI 245

Query: 247 LDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIA 306
           L+EL+N  CKPDF+ Y ++ EAF  +GN+ +R+ +LKKKRKLGVAPR ++Y+ ++  LI+
Sbjct: 246 LEELRNIDCKPDFMAYRVIAEAFVVTGNLYERQVVLKKKRKLGVAPRSSDYRAFILDLIS 305

Query: 307 GRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRFPTLLTL 366
            +R+ EAKE+ EVIV G FPMD ++ + LIGSV+++DP SA+ F  +MV  G+ P + TL
Sbjct: 306 AKRLTEAKEVAEVIVSGKFPMDNDILDALIGSVSAVDPDSAVEFLVYMVSTGKLPAIRTL 365

Query: 367 RNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKK 426
             LS+NLC+H K D L++ Y++LS   YF++   Y L ISFLCKAG V+E+Y  LQEMKK
Sbjct: 366 SKLSKNLCRHDKSDHLIKAYELLSSKGYFSELQSYSLMISFLCKAGRVRESYTALQEMKK 425

Query: 427 NGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEE 486
            G  PDVS YN+++E CC+ +++RPAKKLWDEMF  GC  NL TYN+LI+K S+  + EE
Sbjct: 426 EGLAPDVSLYNALIEACCKAEMIRPAKKLWDEMFVEGCKMNLTTYNVLIRKLSEEGEAEE 485

Query: 487 ALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQD-VNLAGTLLSTFI 546
           +L L+  ML + +EPD TIYMSL++GLC+ +++EAA EVF K +E+D   +   +LS F+
Sbjct: 486 SLRLFDKMLERGIEPDETIYMSLIEGLCKETKIEAAMEVFRKCMERDHKTVTRRVLSEFV 545

Query: 547 LCLC---HFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQETSPSML 606
           L LC   H   AS+LL      + H   HV LLK  ADA +V +  +H++W++E SPS++
Sbjct: 546 LNLCSNGHSGEASQLL-REREHLEHTGAHVVLLKCVADAKEVEIGIRHMQWIKEVSPSLV 605

Query: 607 SVISSELLAFLPSSPRADQILQILQTIQ 628
             ISS+LLA   SS   D IL  ++ I+
Sbjct: 606 HTISSDLLASFCSSSDPDSILPFIRAIE 632

BLAST of Bhi04G002002 vs. ExPASy Swiss-Prot
Match: Q9FIT7 (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g61990 PE=2 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 1.4e-36
Identity = 111/462 (24.03%), Postives = 207/462 (44.81%), Query Frame = 0

Query: 85  SYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCSVIDSLIIGKKTHDAFLVFNEV 144
           +Y  ++  L   ++     SLL ++ +  + LD   Y  +ID L+ G+    A  + +E+
Sbjct: 279 TYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLLIDGLLKGRNADAAKGLVHEM 338

Query: 145 TD---VIGSESCNSLLAALASDGFFEHAQKVFGEMSLKCIPFNTLGFGVFIWRVCRNTDV 204
                 I     +  +  ++ +G  E A+ +F  M    +      +   I   CR  +V
Sbjct: 339 VSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGLIPQAQAYASLIEGYCREKNV 398

Query: 205 VKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAEASNILDELKNRGCKPDFLTYWI 264
            +   ++ + +  N  I+     T ++ G+C +  L  A NI+ E+   GC+P+ + Y  
Sbjct: 399 RQGYELLVEMKKRNIVISPYTYGT-VVKGMCSSGDLDGAYNIVKEMIASGCRPNVVIYTT 458

Query: 265 LGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKE-LGEVIVKG 324
           L + F  +    D  ++LK+ ++ G+AP +  Y   +  L   +R+ EA+  L E++  G
Sbjct: 459 LIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENG 518

Query: 325 NFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELL 384
             P        + G + + +  SA  + K M E G  P  +    L    CK GK+ E  
Sbjct: 519 LKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEAC 578

Query: 385 EVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETC 444
             Y+ +       D   Y + ++ L K   V +A  + +EM+  G  PDV  Y  ++   
Sbjct: 579 SAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLINGF 638

Query: 445 CREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDI 504
            +   ++ A  ++DEM   G   N+  YN+L+  F +S +IE+A  L   M  K + P+ 
Sbjct: 639 SKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLHPNA 698

Query: 505 TIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTLLSTFI 543
             Y +++ G C+   L  AF +F      ++ L G +  +F+
Sbjct: 699 VTYCTIIDGYCKSGDLAEAFRLF-----DEMKLKGLVPDSFV 734

BLAST of Bhi04G002002 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 3.2e-36
Identity = 141/563 (25.04%), Postives = 240/563 (42.63%), Query Frame = 0

Query: 64  SLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCS 123
           S ++  F+W   Q G+ H+ D Y+ ++  L  + +F  I  LL Q+K + I    S++ S
Sbjct: 92  STSMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQMKDEGIVFKESLFIS 151

Query: 124 VIDSLIIGKKTHDAFLVFNEVTDVIGSE----SCNSLLAALASDGFFEHAQKVFGEMSLK 183
           ++              +  E+ +V   E    S N +L  L S    + A  VF +M  +
Sbjct: 152 IMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSR 211

Query: 184 CIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRLAE 243
            IP     FGV +   C   ++   L+++ D  T +  +  SVI   +IH L   +R+ E
Sbjct: 212 KIPPTLFTFGVVMKAFCAVNEIDSALSLLRD-MTKHGCVPNSVIYQTLIHSLSKCNRVNE 271

Query: 244 ASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLF 303
           A  +L+E+   GC PD  T+  +         + +  K++ +    G AP    Y   + 
Sbjct: 272 ALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMN 331

Query: 304 ALIAGRRICEAKEL------GEVIV----------KGNFPMDEEVSNVLI---GSVASID 363
            L    R+  AK+L       E+++           G     + V + ++   G V  + 
Sbjct: 332 GLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVC 391

Query: 364 PYSAIIF--------------FKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVYQVL 423
            Y+++I+                 M  KG  P + +   L    CK GKIDE   V   +
Sbjct: 392 TYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLNEM 451

Query: 424 SRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCREDLL 483
           S +    +   ++  IS  CK   + EA  + +EM + G  PDV  +NS++   C  D +
Sbjct: 452 SADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDVYTFNSLISGLCEVDEI 511

Query: 484 RPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIYMSL 543
           + A  L  +M + G   N  TYN LI  F +  +I+EA  L + M+ +    D   Y SL
Sbjct: 512 KHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLVNEMVFQGSPLDEITYNSL 571

Query: 544 LQGLCQHSQLEAAFEVFSKSVEQDVNLAGTLLSTFILCLCHF--------LAASKLLCGL 582
           ++GLC+  +++ A  +F K +      +    +  I  LC               +L G 
Sbjct: 572 IKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLRGS 631

BLAST of Bhi04G002002 vs. ExPASy Swiss-Prot
Match: Q9M9X9 (Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g06710 PE=3 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 2.1e-35
Identity = 115/498 (23.09%), Postives = 220/498 (44.18%), Query Frame = 0

Query: 42  FRQTLNPSLVSQVIDPHLLTHHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGA 101
           FR+ L+ SLV +V+   L+   S  + FF WA +Q G+ H +  Y +++  +        
Sbjct: 126 FREKLSESLVIEVL--RLIARPSAVISFFVWAGRQIGYKHTAPVYNALVDLIVRDDDEKV 185

Query: 102 IHSLLKQVKTQKIGLDLSVYCSVIDSLIIGKKTHDAFLVFNEVTDVI-------GSESCN 161
               L+Q++      D  V+   ++ L+     + +F +  E    +          + N
Sbjct: 186 PEEFLQQIRDD----DKEVFGEFLNVLVRKHCRNGSFSIALEELGRLKDFRFRPSRSTYN 245

Query: 162 SLLAALASDGFFEHAQKVFGEMSLKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTN 221
            L+ A       + A  +  EMSL  +  +      F + +C+     + L +++     
Sbjct: 246 CLIQAFLKADRLDSASLIHREMSLANLRMDGFTLRCFAYSLCKVGKWREALTLVE----T 305

Query: 222 NSEINGSVIATLIIHGLCGASRLAEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVD 281
            + +  +V  T +I GLC AS   EA + L+ ++   C P+ +TY  L     +   +  
Sbjct: 306 ENFVPDTVFYTKLISGLCEASLFEEAMDFLNRMRATSCLPNVVTYSTLLCGCLNKKQLGR 365

Query: 282 REKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIG 341
            +++L      G  P    +   + A         A +L + +VK        V N+LIG
Sbjct: 366 CKRVLNMMMMEGCYPSPKIFNSLVHAYCTSGDHSYAYKLLKKMVKCGHMPGYVVYNILIG 425

Query: 342 SV-ASIDPYSAIIF------FKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVYQVLS 401
           S+    D  +  +       +  M+  G     + + + +R LC  GK ++   V + + 
Sbjct: 426 SICGDKDSLNCDLLDLAEKAYSEMLAAGVVLNKINVSSFTRCLCSAGKYEKAFSVIREMI 485

Query: 402 RNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCREDLLR 461
              +  D   Y   +++LC A  ++ A+ + +EMK+ G   DV  Y  ++++ C+  L+ 
Sbjct: 486 GQGFIPDTSTYSKVLNYLCNASKMELAFLLFEEMKRGGLVADVYTYTIMVDSFCKAGLIE 545

Query: 462 PAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIYMSLL 521
            A+K ++EM   GC  N+ TY  LI  + K+ ++  A  L+  ML +   P+I  Y +L+
Sbjct: 546 QARKWFNEMREVGCTPNVVTYTALIHAYLKAKKVSYANELFETMLSEGCLPNIVTYSALI 605

Query: 522 QGLCQHSQLEAAFEVFSK 526
            G C+  Q+E A ++F +
Sbjct: 606 DGHCKAGQVEKACQIFER 613

BLAST of Bhi04G002002 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 8.6e-34
Identity = 129/537 (24.02%), Postives = 230/537 (42.83%), Query Frame = 0

Query: 64  SLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSVYCS 123
           S AL  FN AS++P F+     Y+ IL  L  S  F  +  +L+ +K+ +  +  S +  
Sbjct: 64  SAALRLFNLASKKPNFSPEPALYEEILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLI 123

Query: 124 VIDSLIIGKKTHDAFLVFNEVTDVIG----SESCNSLLAALASDGFFEHAQKVFGEMSLK 183
           +I+S    +   +   V + + D  G    +   N +L  L      +  +    +MS+ 
Sbjct: 124 LIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMSVW 183

Query: 184 CIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDART------------------NNSEING- 243
            I  +   F V I  +CR   +   + M++D  +                     +++G 
Sbjct: 184 GIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGA 243

Query: 244 ---------------SVIATLIIHGLCGASRLAEASNILDELKNR-GCKPDFLTYWILGE 303
                          +V   +I+HG C   R+ +A N + E+ N+ G  PD  T+  L  
Sbjct: 244 LRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVN 303

Query: 304 AFRSSGNVVDREKILKKKRKLGVAPRLNEYKEYLFALIAGRRICEAKELGEVIVKGNFPM 363
               +G+V    +I+    + G  P +  Y   +  L     + EA E+ + ++  +   
Sbjct: 304 GLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSP 363

Query: 364 DEEVSNVLIGSVASIDPY-SAIIFFKFMVEKGRFPTLLTLRNLSRNLCKHGKIDELLEVY 423
           +    N LI ++   +    A    + +  KG  P + T  +L + LC        +E++
Sbjct: 364 NTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELF 423

Query: 424 QVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGVLQEMKKNGFTPDVSFYNSVLETCCRE 483
           + +       D   Y++ I  LC  G + EA  +L++M+ +G    V  YN++++  C+ 
Sbjct: 424 EEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKA 483

Query: 484 DLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSKSNQIEEALVLYHHMLGKSVEPDITIY 543
           +  R A++++DEM   G   N  TYN LI    KS ++E+A  L   M+ +  +PD   Y
Sbjct: 484 NKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTY 543

Query: 544 MSLLQGLCQHSQLEAAFEVF----SKSVEQDVNLAGTLLSTFILCLC-HFLAASKLL 556
            SLL   C+   ++ A ++     S   E D+   GTL+S   LC       ASKLL
Sbjct: 544 NSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISG--LCKAGRVEVASKLL 598

BLAST of Bhi04G002002 vs. ExPASy TrEMBL
Match: A0A6J1FHE2 (pentatricopeptide repeat-containing protein At5g14080 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445400 PE=4 SV=1)

HSP 1 Score: 1134.0 bits (2932), Expect = 0.0e+00
Identity = 571/640 (89.22%), Postives = 605/640 (94.53%), Query Frame = 0

Query: 1   MKPHIPELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60
           MKPH+ ELATRVSR ILSISNHT PAGSWTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL
Sbjct: 1   MKPHLQELATRVSRTILSISNHTRPAGSWTPSLEQNLHRLGFRETLNPSLVSQVIDPHLL 60

Query: 61  THHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWASQQPGFAHNS+SYKS+LKSLSLSRQFGAIHSLLKQVKTQ+IGLDLSV
Sbjct: 61  SHHSLALGFFNWASQQPGFAHNSESYKSVLKSLSLSRQFGAIHSLLKQVKTQRIGLDLSV 120

Query: 121 YCSVIDSLIIGKKTHDAFLVFNE---VTDVIGSESCNSLLAALASDGFFEHAQKVFGEMS 180
           Y SVIDSLIIGKKTHDAFLVF E   VT VIGSE CNSLLAALASDGFFEHAQKVF EMS
Sbjct: 121 YHSVIDSLIIGKKTHDAFLVFKELTSVTRVIGSEPCNSLLAALASDGFFEHAQKVFDEMS 180

Query: 181 LKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRL 240
           LK IPFNTLGFGVFIWRVCRN DVVKVLNM+DDA TNNSEINGSV+ATLIIHGLCGASRL
Sbjct: 181 LKGIPFNTLGFGVFIWRVCRNADVVKVLNMLDDAMTNNSEINGSVVATLIIHGLCGASRL 240

Query: 241 AEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEA++S+G+VVDREK LKKKRKLGVAPRL++YKE+
Sbjct: 241 PEASNILDELKNRGCKPDFLTYWILGEAYQSAGSVVDREKTLKKKRKLGVAPRLHDYKEF 300

Query: 301 LFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRF 360
           LFALIAGRRICEAKELGEVIV+GNFPMDE+VSNVLIGSVA+IDP SAI+F K MVEK RF
Sbjct: 301 LFALIAGRRICEAKELGEVIVRGNFPMDEDVSNVLIGSVAAIDPSSAIMFLKLMVEKERF 360

Query: 361 PTLLTLRNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGV 420
           PTLLTLRNLSRNLCKHGK+DELLEVYQ+LS++NYF+D+DRYHLRISFLCKAGMVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKLDELLEVYQLLSKHNYFDDYDRYHLRISFLCKAGMVKEAYGV 420

Query: 421 LQEMKKNGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSK 480
           LQEMKKNGF PDV FYNSVLE CCREDLLRPA+KLWDEMFASGC GNLKTYNILIQKFSK
Sbjct: 421 LQEMKKNGFAPDVYFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILIQKFSK 480

Query: 481 SNQIEEALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTL 540
           SNQ+EEALVLY HMLGK VEPDITIY SLLQGLCQ SQLEAAFEVFSK VEQDV+LAGTL
Sbjct: 481 SNQMEEALVLYRHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKCVEQDVDLAGTL 540

Query: 541 LSTFILCLC---HFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQET 600
           LSTFILCLC   HFLAASKLL GL+SDIAHP +HVTLLKGFADAG+VPLAKQH+EWVQET
Sbjct: 541 LSTFILCLCKAGHFLAASKLLRGLTSDIAHPDSHVTLLKGFADAGEVPLAKQHVEWVQET 600

Query: 601 SPSMLSVISSELLAFLPSSPRADQILQILQTIQELSRFSN 635
           SPSMLSVISSELLAFLPSSP+AD ILQILQTIQELSRF+N
Sbjct: 601 SPSMLSVISSELLAFLPSSPKADPILQILQTIQELSRFNN 640

BLAST of Bhi04G002002 vs. ExPASy TrEMBL
Match: A0A6J1JUL2 (pentatricopeptide repeat-containing protein At5g14080 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489037 PE=4 SV=1)

HSP 1 Score: 1131.7 bits (2926), Expect = 0.0e+00
Identity = 569/640 (88.91%), Postives = 603/640 (94.22%), Query Frame = 0

Query: 1   MKPHIPELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60
           MKPH+ ELATRVSR +LSISNHTSPAGSWTPSLEQNLHRLGFR+TLNPSLVSQVIDPHLL
Sbjct: 1   MKPHLQELATRVSRTVLSISNHTSPAGSWTPSLEQNLHRLGFRETLNPSLVSQVIDPHLL 60

Query: 61  THHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWASQQPGFAHNS+SYKS+LKSLSLSRQFGAIH LLKQVKTQ+IGLDLSV
Sbjct: 61  SHHSLALGFFNWASQQPGFAHNSESYKSVLKSLSLSRQFGAIHCLLKQVKTQRIGLDLSV 120

Query: 121 YCSVIDSLIIGKKTHDAFLVFNEVTD---VIGSESCNSLLAALASDGFFEHAQKVFGEMS 180
           Y SVIDSLIIGKKTHDAFLVF EVT    VIGSE CNSLLAALASDGFFEHAQKVF EMS
Sbjct: 121 YHSVIDSLIIGKKTHDAFLVFKEVTSVTHVIGSEPCNSLLAALASDGFFEHAQKVFDEMS 180

Query: 181 LKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRL 240
           LK IPFNTLGFGVFIWRVCRN DVVKVLNM+DDA TNNSEINGSV+ATLIIHGLCGASRL
Sbjct: 181 LKGIPFNTLGFGVFIWRVCRNADVVKVLNMLDDAMTNNSEINGSVVATLIIHGLCGASRL 240

Query: 241 AEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEY 300
           +EASNILDELKNRGCKPDFLTYWILGEA+R +G+VVDREK LKKKRKLGVAPRL++YKE+
Sbjct: 241 SEASNILDELKNRGCKPDFLTYWILGEAYRLAGSVVDREKTLKKKRKLGVAPRLHDYKEF 300

Query: 301 LFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRF 360
           LFALIAGRRICEAKELGEVIV+ NFPMDE+VSNVLIGSVA+IDP SAI+F  FMVEK RF
Sbjct: 301 LFALIAGRRICEAKELGEVIVRANFPMDEDVSNVLIGSVAAIDPSSAIMFLMFMVEKERF 360

Query: 361 PTLLTLRNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGV 420
           PTLLTLRNLSRNLCKHGK+DELLEVYQVLS++NYF+D+DRY LRISFLCKAGMVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKLDELLEVYQVLSKHNYFDDYDRYRLRISFLCKAGMVKEAYGV 420

Query: 421 LQEMKKNGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSK 480
           LQEMKKNGF PDV FYNSVLE CCREDLLRPA+KLWDEMFASGC GNLKTYNIL+QKFSK
Sbjct: 421 LQEMKKNGFAPDVYFYNSVLEACCREDLLRPARKLWDEMFASGCGGNLKTYNILVQKFSK 480

Query: 481 SNQIEEALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTL 540
           SNQ+EEALVLY HMLGK VEPDITIY SLLQGLCQ SQLEAAFEVFSK VEQDVNLAGTL
Sbjct: 481 SNQMEEALVLYRHMLGKKVEPDITIYTSLLQGLCQESQLEAAFEVFSKCVEQDVNLAGTL 540

Query: 541 LSTFILCLC---HFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQET 600
           LSTFILCLC   HFLAASKLL GL+SDIAHP +HVTLLKGFADAG+VPLAKQH+EWVQET
Sbjct: 541 LSTFILCLCKAGHFLAASKLLRGLTSDIAHPDSHVTLLKGFADAGEVPLAKQHVEWVQET 600

Query: 601 SPSMLSVISSELLAFLPSSPRADQILQILQTIQELSRFSN 635
           SPSMLSVISSELLAFLPSSP+AD ILQILQTIQELSRF+N
Sbjct: 601 SPSMLSVISSELLAFLPSSPKADPILQILQTIQELSRFNN 640

BLAST of Bhi04G002002 vs. ExPASy TrEMBL
Match: A0A0A0LMX0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G405040 PE=4 SV=1)

HSP 1 Score: 1126.3 bits (2912), Expect = 0.0e+00
Identity = 571/640 (89.22%), Postives = 597/640 (93.28%), Query Frame = 0

Query: 1   MKPHIPELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60
           M+PH PELATR+SRAILSISN TSPAGSWTPSLEQNLHRLGFRQ LNPSLVSQVIDPHLL
Sbjct: 1   MRPHFPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60

Query: 61  THHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWASQQPGF HNSDSY SILKSLSLSR FG IHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YCSVIDSLIIGKKTHDAFLVFNEVTD---VIGSESCNSLLAALASDGFFEHAQKVFGEMS 180
           Y +VIDSLII KKTHDAFLVFNEVT    +IGSE CNSLLAALASDGFFEHAQKVF EMS
Sbjct: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180

Query: 181 LKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRL 240
           LK IPFNTLGFGVFIWR+CRNTDVVKVLNMID ARTNNS+INGSVIATLIIHGLC ASRL
Sbjct: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240

Query: 241 AEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEAF+S+ NVVDREKILKKKRKLGVAPRLN+YKEY
Sbjct: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRF 360
           LF LIAGRRI EAKELGEVIVKGNFPMDEEVSNVLIGSVAS+DPYSAI+FFKFMVEKGRF
Sbjct: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGV 420
           PTLLTLRNLSRNLCKHGK DELLEV+QVL  NNYFND DRYHLRISFLCKAG VKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSK 480
           LQEMKKNGF PDVSFYNSVLE CCREDLLRPA+KLWDEMFA GC GNLKTY+ILIQKFSK
Sbjct: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTL 540
           SNQIEEALVLY HMLGK+VEPDI IY SLLQGLCQ SQLEAAFEVFSKSVEQDVNLA TL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLC---HFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQET 600
           LSTFILCLC   HFLAASKLL GL+SD+AHP +HVTLLKGFADAG+V LAKQH+EWVQET
Sbjct: 541 LSTFILCLCKVGHFLAASKLLRGLASDVAHPDSHVTLLKGFADAGEVSLAKQHVEWVQET 600

Query: 601 SPSMLSVISSELLAFLPSSPRADQILQILQTIQELSRFSN 635
           SPSMLSVIS+ELLAFLPSSP+AD IL+ILQT+QELSRFS+
Sbjct: 601 SPSMLSVISTELLAFLPSSPKADPILEILQTVQELSRFSH 640

BLAST of Bhi04G002002 vs. ExPASy TrEMBL
Match: A0A1S3B2P5 (pentatricopeptide repeat-containing protein At5g14080 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485468 PE=4 SV=1)

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 568/640 (88.75%), Postives = 596/640 (93.12%), Query Frame = 0

Query: 1   MKPHIPELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60
           M+PH+PELATR+SRAILSISN TSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL
Sbjct: 1   MRPHLPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60

Query: 61  THHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSV 120
           +H+SLALGFFNWASQQPGF HNSDSY SILKSLSLSR FG IHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHYSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YCSVIDSLIIGKKTHDAFLVFNEVTD---VIGSESCNSLLAALASDGFFEHAQKVFGEMS 180
           Y SVIDSLII KKTHDAFLVFNEVT    +IGSE CNSLLAAL+SDGF+E A KVF EMS
Sbjct: 121 YRSVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALSSDGFYEQATKVFDEMS 180

Query: 181 LKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRL 240
           LKCIPFNTLG GVFIW+VCRNTDVVKVLNMIDD RTNNS++NGS+IATLIIHGLCGASRL
Sbjct: 181 LKCIPFNTLGLGVFIWKVCRNTDVVKVLNMIDDVRTNNSDVNGSIIATLIIHGLCGASRL 240

Query: 241 AEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEAF+S+GNVVDREKILKKKRKLGVAPRLN+YKEY
Sbjct: 241 VEASNILDELKNRGCKPDFLTYWILGEAFQSAGNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRF 360
           LFALIAG+RI EAKELGEVIVKGNFPMDEEVSNVLIGSVAS+DPYSAI+FFKFMVEKGRF
Sbjct: 301 LFALIAGKRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGV 420
           PTLLTLRNLSRNLCKHGK DELLEV+QVL   NYFND DRYHLRISFLCKAG VKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCIKNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSK 480
           LQEMKKNGF PD SFYNSVLE CCREDLLRPA+KLWDEMFASGC GNLKTY+ILIQKFSK
Sbjct: 421 LQEMKKNGFAPDASFYNSVLEACCREDLLRPARKLWDEMFASGCSGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTL 540
           SNQIEEALVLY HMLGK+VEPDI IY SLLQGLCQ SQLE AFEVFSKSVEQDVNLA TL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQGSQLETAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLC---HFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQET 600
           LSTFILCLC   HF AASKLL GL+S IAHP +HVTLLKGFADAG+VPLAKQH+EWV ET
Sbjct: 541 LSTFILCLCKVGHFHAASKLLRGLASGIAHPDSHVTLLKGFADAGEVPLAKQHVEWVHET 600

Query: 601 SPSMLSVISSELLAFLPSSPRADQILQILQTIQELSRFSN 635
           SPSMLSVIS+ELLAFLPSSP+AD ILQILQTIQELSRFSN
Sbjct: 601 SPSMLSVISTELLAFLPSSPKADPILQILQTIQELSRFSN 640

BLAST of Bhi04G002002 vs. ExPASy TrEMBL
Match: A0A5D3BUE5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1154G00230 PE=4 SV=1)

HSP 1 Score: 1121.3 bits (2899), Expect = 0.0e+00
Identity = 566/637 (88.85%), Postives = 594/637 (93.25%), Query Frame = 0

Query: 1   MKPHIPELATRVSRAILSISNHTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60
           M+PH+PELATR+SRAILSISN TSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL
Sbjct: 1   MRPHLPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60

Query: 61  THHSLALGFFNWASQQPGFAHNSDSYKSILKSLSLSRQFGAIHSLLKQVKTQKIGLDLSV 120
           +H+SLALGFFNWASQQPGF HNSDSY SILKSLSLSR FG IHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHYSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YCSVIDSLIIGKKTHDAFLVFNEVTD---VIGSESCNSLLAALASDGFFEHAQKVFGEMS 180
           Y SVIDSLII KKTHDAFLVFNEVT    +IGSE CNSLLAAL+SDGF+E A KVF EMS
Sbjct: 121 YRSVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALSSDGFYEQATKVFDEMS 180

Query: 181 LKCIPFNTLGFGVFIWRVCRNTDVVKVLNMIDDARTNNSEINGSVIATLIIHGLCGASRL 240
           LKCIPFNTLG GVFIW+VCRNTDVVKVLNMIDD RTNNS++NGS+IATLIIHGLCGASRL
Sbjct: 181 LKCIPFNTLGLGVFIWKVCRNTDVVKVLNMIDDVRTNNSDVNGSIIATLIIHGLCGASRL 240

Query: 241 AEASNILDELKNRGCKPDFLTYWILGEAFRSSGNVVDREKILKKKRKLGVAPRLNEYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEAF+S+GNVVDREKILKKKRKLGVAPRLN+YKEY
Sbjct: 241 VEASNILDELKNRGCKPDFLTYWILGEAFQSAGNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFALIAGRRICEAKELGEVIVKGNFPMDEEVSNVLIGSVASIDPYSAIIFFKFMVEKGRF 360
           LFALIAG+RI EAKELGEVIVKGNFPMDEEVSNVLIGSVAS+DPYSAI+FFKFMVEKGRF
Sbjct: 301 LFALIAGKRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKIDELLEVYQVLSRNNYFNDFDRYHLRISFLCKAGMVKEAYGV 420
           PTLLTLRNLSRNLCKHGK DELLEV+QVL   NYFND DRYHLRISFLCKAG VKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCIKNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFTPDVSFYNSVLETCCREDLLRPAKKLWDEMFASGCDGNLKTYNILIQKFSK 480
           LQEMKKNGF PD SFYNSVLE CCREDLLRPA+KLWDEMFASGC GNLKTY+ILIQKFSK
Sbjct: 421 LQEMKKNGFAPDASFYNSVLEACCREDLLRPARKLWDEMFASGCSGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYHHMLGKSVEPDITIYMSLLQGLCQHSQLEAAFEVFSKSVEQDVNLAGTL 540
           SNQIEEALVLY HMLGK+VEPDI IY SLLQGLCQ SQLE AFEVFSKSVEQDVNLA TL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQGSQLETAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCHFLAASKLLCGLSSDIAHPVTHVTLLKGFADAGKVPLAKQHLEWVQETSPS 600
           LSTFI  LCHF AASKLL GL+S IAHP +HVTLLKGFADAG+VPLAKQH+EWV ETSPS
Sbjct: 541 LSTFI--LCHFHAASKLLRGLASGIAHPDSHVTLLKGFADAGEVPLAKQHVEWVHETSPS 600

Query: 601 MLSVISSELLAFLPSSPRADQILQILQTIQELSRFSN 635
           MLSVIS+ELLAFLPSSP+AD ILQILQTIQELSRFSN
Sbjct: 601 MLSVISTELLAFLPSSPKADPILQILQTIQELSRFSN 635

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT5G14080.13.3e-18250.64Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G61990.11.0e-3724.03Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G64320.12.3e-3725.04Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G06710.11.5e-3623.09Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G53700.16.1e-3524.02Pentatricopeptide repeat (PPR) superfamily protein [more]
Match NameE-valueIdentityDescription
Q9FMU24.7e-18150.64Pentatricopeptide repeat-containing protein At5g14080 OS=Arabidopsis thaliana OX... [more]
Q9FIT71.4e-3624.03Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop... [more]
Q9FMF63.2e-3625.04Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
Q9M9X92.1e-3523.09Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidop... [more]
Q9LFF18.6e-3424.02Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1FHE20.0e+0089.22pentatricopeptide repeat-containing protein At5g14080 isoform X1 OS=Cucurbita mo... [more]
A0A6J1JUL20.0e+0088.91pentatricopeptide repeat-containing protein At5g14080 isoform X1 OS=Cucurbita ma... [more]
A0A0A0LMX00.0e+0089.22Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G405040 PE=4 SV=1[more]
A0A1S3B2P50.0e+0088.75pentatricopeptide repeat-containing protein At5g14080 isoform X1 OS=Cucumis melo... [more]
A0A5D3BUE50.0e+0088.85Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 428..476
e-value: 2.7E-10
score: 40.3
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 243..294
e-value: 0.003
score: 17.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 401..430
e-value: 3.5E-7
score: 28.0
coord: 467..500
e-value: 5.4E-7
score: 27.4
coord: 225..255
e-value: 1.1E-4
score: 20.1
coord: 433..462
e-value: 0.0023
score: 16.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 152..177
e-value: 0.0021
score: 18.2
coord: 402..426
e-value: 8.3E-5
score: 22.6
coord: 502..528
e-value: 0.011
score: 15.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 464..498
score: 11.421732
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 499..533
score: 9.13082
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 220..254
score: 9.558311
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 429..463
score: 9.996763
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 394..428
score: 10.851745
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 315..456
e-value: 1.6E-18
score: 69.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 32..146
e-value: 9.2E-10
score: 40.3
coord: 224..286
e-value: 9.0E-8
score: 33.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 458..557
e-value: 1.9E-16
score: 61.9
coord: 147..223
e-value: 3.1E-5
score: 25.4
NoneNo IPR availablePANTHERPTHR47938:SF25PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 4..629
NoneNo IPR availablePANTHERPTHR47938RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATEDcoord: 4..629

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi04M002002Bhi04M002002mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding