Bhi04G000750 (gene) Wax gourd (B227) v1

Overview
NameBhi04G000750
Typegene
OrganismBenincasa hispida (Wax gourd (B227) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationchr4: 23106809 .. 23109684 (+)
RNA-Seq ExpressionBhi04G000750
SyntenyBhi04G000750
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GATAATGTGTATGCCAGGTGACTTACCATAAATCTTATTTTTGGATAATTTTGTTATTTTTCAGAATGAAACCTTGAATTTTGCCATGCTTCAATTCTAAAAAAAAATTAAAAAAAATAAATAAATAACCCTTGAAGCCCACCGGAATTGATTGGGCCTCTAACCTATGGGGCTGGCTTTGCTTCAGTGACTCACCTTAGCAGTCCACCAGCGTCTCTTCCCCGAGGGAAAGACGACGACTGGCTCAGAAACTGGATAAGCAAAAACGCCGCTTCACCACTCTTCCACCTCCATTTTTACACACTCTGTAAGTTTCCCGCTTTCCGAATCTCTCTCCTTTTCCAAATTCCCCAATCTCTCCCCTTCCTAAACAGGAAAACCATTGCCCTCCACTTCTTTAATTTCTTCACATTCCATTAAAATGGAGTTCTAAGCATCGGAACCTTTTGTTTCCTTCCAGTTGAAAGGTGTATTATGGATATAACCTTGCCCGCGAAGCATTATACAGATGGGTTCTGTTTATTTCACCGTCGCCCCTCCAAAAATCGCGGCCGGAAAGTTAGGGCCAAGGGAAGGGTTTCTTCAGAGACCAATTCGTTAAGGCTTTATGTAGGAGAAAAAGGGAAACCTCGGTTTCTCGTTATTCCTTCTGATTGTTCTGAAGAACCCCTTGTTCGTGCGGTTCCAAGGGTCGACACATTTAGTTCCAATGGTAGATTGCAACAGGGGGAGAAAAATCTCCATACCCATTTGAATGGCTCCAGTTCTTCATCATCTTCCTCTTCAAATCATTCGCAGAGTTTTGAGGAATTCGAGAACAATAATCATCTTCGTCGACTGGTAAGAAATGGGGAGTTGGAAGAAGGATTTAAATTTCTAGAGGATATGGTTTATCGTGGTGATATTCCTGATATAATTGCTTGCACTAGTTTGATTCGTGGTTTGTGTAAAACTGGAAAAACTAGGAAAGCAACTAGGGTAATGGAAATTTTAGAAGATTCTGGGGCTGTTCCTGATGTGATAACTTACAATGTTCTTATAAGTGGTTACTGTAAATCTGGTGAAATTGGTAGTGCTTTGCAACTTTTGGATCGAATGAGTGTTTCTCCTGATGTTGTTACATACAATACAATCTTGCGTACGCTTTGTGATAGTGGGAAATTGAAGCAAGCAATGGAAGTTCTGGACAGACAACTGCAAAGGGAGTGTTATCCGGATGTAATAACTTATACTATTTTGATTGAAGCAACTTGCAAGGAGAGTGGAGTTGGGCAAGCAATGAAATTGTTGGATGAAATGAGGGACAAAGGATGTAAGCCTGATGTTGTTACTTACAATGTTCTTATAAATGGGATTTGTAAGGAAGGAAGATTAGATGAAGCTATTAAGTTCTTGAATCATATGCCTTCCTATGGTTGCCAACCTAATGTAATCACTCATAACATCATCTTGCGCAGCATGTGTAGTACAGGGAGATGGATGGATGCCGAGAAGCTGTTGGCCGAAATGGTTCGGAAGGGATGTTCACCTAGTGTTGTCACTTTCAATATCTTGATTAATTTCTTGTGTAGAAAGGGGTTGCTTGGTCGAGCAATCGATGTTTTGGAGAAGATGCCTCAGCATGGATGTACTCCAAATTCCCTGAGTTACAACCCATTGCTCCATGGATTCTGCAAAGAGAAGAAGATGGACCGGGCGATCGAGTATTTGGATATCATGGTTTCGAGAGGCTGTTACCCTGATATTGTGACCTATAATACCCTATTGACAGCATTGTGCAAAGATGGAAAGGTAGACGTTGCAGTTGAGATACTGAATCAACTTGGTAGCAAAGGTTGCTCTCCTGTTTTGATTACATACAACACAGTCATTGATGGGCTGTCAAAGGTAGGTAAAACAGAAGATGCCATAAAACTTCTAGATGAGATGAAGGGAAAAGGACTGAAACCAGATATAATTACATACTCCTCGCTAGTCGGAGGACTGAGCAGAGAAGGAAAAGTTGATGAAGCAATTGCATTTTTCCATGATTTAGAAGAAATGGGTGTGAGGCCAAATGCTATCACATACAACTCTATCATGTTAGGACTCTGTAAGGTTCGACAAACTGTTCGAGCTATTGATTTCTTGGCATACATGGTTGCCAAAGGCTGTAAACCGACCGAGGCTTCATACATGATTCTTATTGAAGGGTTGGCCTATGAAGGTTTAGCCAAGGAGGCATTGGAGTTGCTTAATGAATTGTGCTCTAGAGGAGTTGTGAAGAAGAGTTCTGCAGAGCAGGTGGTGGTTAAAAACACTTTTTGATGGGTTTTTTTTTTTTTTTTTACTTCATCTGTTGTTTGATGGCATTGGATTTCATCATTAAATTTTCATAGATGTGGGATTTATGATGATCTTGGAGCCGAATTATGTTTATTATCTTTCTCATAATGGGTAGAATCTATCAATTTTGTAAGATAAAATGCTCAACTTGAGTAGTCTTTTAGTATACAGCAGATTTGATAAAGCAATCGTCCTTTGATAATTTGCAATTTGCATTGCTTTTCAATGTAGATAATTCCTCCATTGAACTCAGCTGGATCATCATTTTTTCTACTTTCAATTAAAACTTCAATCTAATAAATTTTATAAACCCTAACAGTCCGATACTTTTAATAAATATTCAACGATAATTTTTTATAGAAATTGACTAAATAATAATAATAATTTTCAAAAAATTATATTTAATGGTTTGTATTCTCAAAATTTAGAGTGAAAGGCTAATAAATAAATAAATAATTTAAAATCCAACAATAAATTAATTTAGAAATACATAATTTTTACCCAAGATTTTTCATGAGATTTTTATACGCATTATCCAAATTTAAATTAAC

mRNA sequence

GATAATGTGTATGCCAGGTGACTTACCATAAATCTTATTTTTGGATAATTTTGTTATTTTTCAGAATGAAACCTTGAATTTTGCCATGCTTCAATTCTAAAAAAAAATTAAAAAAAATAAATAAATAACCCTTGAAGCCCACCGGAATTGATTGGGCCTCTAACCTATGGGGCTGGCTTTGCTTCAGTGACTCACCTTAGCAGTCCACCAGCGTCTCTTCCCCGAGGGAAAGACGACGACTGGCTCAGAAACTGGATAAGCAAAAACGCCGCTTCACCACTCTTCCACCTCCATTTTTACACACTCTTTGAAAGGTGTATTATGGATATAACCTTGCCCGCGAAGCATTATACAGATGGGTTCTGTTTATTTCACCGTCGCCCCTCCAAAAATCGCGGCCGGAAAGTTAGGGCCAAGGGAAGGGTTTCTTCAGAGACCAATTCGTTAAGGCTTTATGTAGGAGAAAAAGGGAAACCTCGGTTTCTCGTTATTCCTTCTGATTGTTCTGAAGAACCCCTTGTTCGTGCGGTTCCAAGGGTCGACACATTTAGTTCCAATGGTAGATTGCAACAGGGGGAGAAAAATCTCCATACCCATTTGAATGGCTCCAGTTCTTCATCATCTTCCTCTTCAAATCATTCGCAGAGTTTTGAGGAATTCGAGAACAATAATCATCTTCGTCGACTGGTAAGAAATGGGGAGTTGGAAGAAGGATTTAAATTTCTAGAGGATATGGTTTATCGTGGTGATATTCCTGATATAATTGCTTGCACTAGTTTGATTCGTGGTTTGTGTAAAACTGGAAAAACTAGGAAAGCAACTAGGGTAATGGAAATTTTAGAAGATTCTGGGGCTGTTCCTGATGTGATAACTTACAATGTTCTTATAAGTGGTTACTGTAAATCTGGTGAAATTGGTAGTGCTTTGCAACTTTTGGATCGAATGAGTGTTTCTCCTGATGTTGTTACATACAATACAATCTTGCGTACGCTTTGTGATAGTGGGAAATTGAAGCAAGCAATGGAAGTTCTGGACAGACAACTGCAAAGGGAGTGTTATCCGGATGTAATAACTTATACTATTTTGATTGAAGCAACTTGCAAGGAGAGTGGAGTTGGGCAAGCAATGAAATTGTTGGATGAAATGAGGGACAAAGGATGTAAGCCTGATGTTGTTACTTACAATGTTCTTATAAATGGGATTTGTAAGGAAGGAAGATTAGATGAAGCTATTAAGTTCTTGAATCATATGCCTTCCTATGGTTGCCAACCTAATGTAATCACTCATAACATCATCTTGCGCAGCATGTGTAGTACAGGGAGATGGATGGATGCCGAGAAGCTGTTGGCCGAAATGGTTCGGAAGGGATGTTCACCTAGTGTTGTCACTTTCAATATCTTGATTAATTTCTTGTGTAGAAAGGGGTTGCTTGGTCGAGCAATCGATGTTTTGGAGAAGATGCCTCAGCATGGATGTACTCCAAATTCCCTGAGTTACAACCCATTGCTCCATGGATTCTGCAAAGAGAAGAAGATGGACCGGGCGATCGAGTATTTGGATATCATGGTTTCGAGAGGCTGTTACCCTGATATTGTGACCTATAATACCCTATTGACAGCATTGTGCAAAGATGGAAAGGTAGACGTTGCAGTTGAGATACTGAATCAACTTGGTAGCAAAGGTTGCTCTCCTGTTTTGATTACATACAACACAGTCATTGATGGGCTGTCAAAGGTAGGTAAAACAGAAGATGCCATAAAACTTCTAGATGAGATGAAGGGAAAAGGACTGAAACCAGATATAATTACATACTCCTCGCTAGTCGGAGGACTGAGCAGAGAAGGAAAAGTTGATGAAGCAATTGCATTTTTCCATGATTTAGAAGAAATGGGTGTGAGGCCAAATGCTATCACATACAACTCTATCATGTTAGGACTCTGTAAGGTTCGACAAACTGTTCGAGCTATTGATTTCTTGGCATACATGGTTGCCAAAGGCTGTAAACCGACCGAGGCTTCATACATGATTCTTATTGAAGGGTTGGCCTATGAAGGTTTAGCCAAGGAGGCATTGGAGTTGCTTAATGAATTGTGCTCTAGAGGAGTTGTGAAGAAGAGTTCTGCAGAGCAGGTGGTGGTTAAAAACACTTTTTGATGGGTTTTTTTTTTTTTTTTTACTTCATCTGTTGTTTGATGGCATTGGATTTCATCATTAAATTTTCATAGATGTGGGATTTATGATGATCTTGGAGCCGAATTATGTTTATTATCTTTCTCATAATGGGTAGAATCTATCAATTTTGTAAGATAAAATGCTCAACTTGAGTAGTCTTTTAGTATACAGCAGATTTGATAAAGCAATCGTCCTTTGATAATTTGCAATTTGCATTGCTTTTCAATGTAGATAATTCCTCCATTGAACTCAGCTGGATCATCATTTTTTCTACTTTCAATTAAAACTTCAATCTAATAAATTTTATAAACCCTAACAGTCCGATACTTTTAATAAATATTCAACGATAATTTTTTATAGAAATTGACTAAATAATAATAATAATTTTCAAAAAATTATATTTAATGGTTTGTATTCTCAAAATTTAGAGTGAAAGGCTAATAAATAAATAAATAATTTAAAATCCAACAATAAATTAATTTAGAAATACATAATTTTTACCCAAGATTTTTCATGAGATTTTTATACGCATTATCCAAATTTAAATTAAC

Coding sequence (CDS)

ATGGATATAACCTTGCCCGCGAAGCATTATACAGATGGGTTCTGTTTATTTCACCGTCGCCCCTCCAAAAATCGCGGCCGGAAAGTTAGGGCCAAGGGAAGGGTTTCTTCAGAGACCAATTCGTTAAGGCTTTATGTAGGAGAAAAAGGGAAACCTCGGTTTCTCGTTATTCCTTCTGATTGTTCTGAAGAACCCCTTGTTCGTGCGGTTCCAAGGGTCGACACATTTAGTTCCAATGGTAGATTGCAACAGGGGGAGAAAAATCTCCATACCCATTTGAATGGCTCCAGTTCTTCATCATCTTCCTCTTCAAATCATTCGCAGAGTTTTGAGGAATTCGAGAACAATAATCATCTTCGTCGACTGGTAAGAAATGGGGAGTTGGAAGAAGGATTTAAATTTCTAGAGGATATGGTTTATCGTGGTGATATTCCTGATATAATTGCTTGCACTAGTTTGATTCGTGGTTTGTGTAAAACTGGAAAAACTAGGAAAGCAACTAGGGTAATGGAAATTTTAGAAGATTCTGGGGCTGTTCCTGATGTGATAACTTACAATGTTCTTATAAGTGGTTACTGTAAATCTGGTGAAATTGGTAGTGCTTTGCAACTTTTGGATCGAATGAGTGTTTCTCCTGATGTTGTTACATACAATACAATCTTGCGTACGCTTTGTGATAGTGGGAAATTGAAGCAAGCAATGGAAGTTCTGGACAGACAACTGCAAAGGGAGTGTTATCCGGATGTAATAACTTATACTATTTTGATTGAAGCAACTTGCAAGGAGAGTGGAGTTGGGCAAGCAATGAAATTGTTGGATGAAATGAGGGACAAAGGATGTAAGCCTGATGTTGTTACTTACAATGTTCTTATAAATGGGATTTGTAAGGAAGGAAGATTAGATGAAGCTATTAAGTTCTTGAATCATATGCCTTCCTATGGTTGCCAACCTAATGTAATCACTCATAACATCATCTTGCGCAGCATGTGTAGTACAGGGAGATGGATGGATGCCGAGAAGCTGTTGGCCGAAATGGTTCGGAAGGGATGTTCACCTAGTGTTGTCACTTTCAATATCTTGATTAATTTCTTGTGTAGAAAGGGGTTGCTTGGTCGAGCAATCGATGTTTTGGAGAAGATGCCTCAGCATGGATGTACTCCAAATTCCCTGAGTTACAACCCATTGCTCCATGGATTCTGCAAAGAGAAGAAGATGGACCGGGCGATCGAGTATTTGGATATCATGGTTTCGAGAGGCTGTTACCCTGATATTGTGACCTATAATACCCTATTGACAGCATTGTGCAAAGATGGAAAGGTAGACGTTGCAGTTGAGATACTGAATCAACTTGGTAGCAAAGGTTGCTCTCCTGTTTTGATTACATACAACACAGTCATTGATGGGCTGTCAAAGGTAGGTAAAACAGAAGATGCCATAAAACTTCTAGATGAGATGAAGGGAAAAGGACTGAAACCAGATATAATTACATACTCCTCGCTAGTCGGAGGACTGAGCAGAGAAGGAAAAGTTGATGAAGCAATTGCATTTTTCCATGATTTAGAAGAAATGGGTGTGAGGCCAAATGCTATCACATACAACTCTATCATGTTAGGACTCTGTAAGGTTCGACAAACTGTTCGAGCTATTGATTTCTTGGCATACATGGTTGCCAAAGGCTGTAAACCGACCGAGGCTTCATACATGATTCTTATTGAAGGGTTGGCCTATGAAGGTTTAGCCAAGGAGGCATTGGAGTTGCTTAATGAATTGTGCTCTAGAGGAGTTGTGAAGAAGAGTTCTGCAGAGCAGGTGGTGGTTAAAAACACTTTTTGA

Protein sequence

MDITLPAKHYTDGFCLFHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVIPSDCSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLRRLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLCKVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSSAEQVVVKNTF
Homology
BLAST of Bhi04G000750 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 892.1 bits (2304), Expect = 2.5e-259
Identity = 448/610 (73.44%), Postives = 509/610 (83.44%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCL---FHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVI 60
           MD+ +      +GFCL   FHR     RG K+    R S   +S ++ +G + + R +  
Sbjct: 1   MDLMVSTSSAQEGFCLIQQFHR--EYKRGNKLDVSCRTSGSISS-KIPLGSRKRNRLV-- 60

Query: 61  PSDCSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNN 120
                   LV A  +V++   NGR Q+ E     + N + +   SS N S + E+ E+NN
Sbjct: 61  --------LVSAASKVESSGLNGRAQKFETLSSGYSNSNGNGHYSSVNSSFALEDVESNN 120

Query: 121 HLRRLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSG 180
           HLR++VR GELEEGFKFLE+MVY G++PDII CT+LIRG C+ GKTRKA +++EILE SG
Sbjct: 121 HLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSG 180

Query: 181 AVPDVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVL 240
           AVPDVITYNV+ISGYCK+GEI +AL +LDRMSVSPDVVTYNTILR+LCDSGKLKQAMEVL
Sbjct: 181 AVPDVITYNVMISGYCKAGEINNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVL 240

Query: 241 DRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKE 300
           DR LQR+CYPDVITYTILIEATC++SGVG AMKLLDEMRD+GC PDVVTYNVL+NGICKE
Sbjct: 241 DRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKE 300

Query: 301 GRLDEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTF 360
           GRLDEAIKFLN MPS GCQPNVITHNIILRSMCSTGRWMDAEKLLA+M+RKG SPSVVTF
Sbjct: 301 GRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTF 360

Query: 361 NILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVS 420
           NILINFLCRKGLLGRAID+LEKMPQHGC PNSLSYNPLLHGFCKEKKMDRAIEYL+ MVS
Sbjct: 361 NILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVS 420

Query: 421 RGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTED 480
           RGCYPDIVTYNT+LTALCKDGKV+ AVEILNQL SKGCSPVLITYNTVIDGL+K GKT  
Sbjct: 421 RGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGK 480

Query: 481 AIKLLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIML 540
           AIKLLDEM+ K LKPD ITYSSLVGGLSREGKVDEAI FFH+ E MG+RPNA+T+NSIML
Sbjct: 481 AIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIML 540

Query: 541 GLCKVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVK 600
           GLCK RQT RAIDFL +M+ +GCKP E SY ILIEGLAYEG+AKEALELLNELC++G++K
Sbjct: 541 GLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGLMK 597

Query: 601 KSSAEQVVVK 608
           KSSAEQV  K
Sbjct: 601 KSSAEQVAGK 597

BLAST of Bhi04G000750 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 451.1 bits (1159), Expect = 1.5e-126
Identity = 234/536 (43.66%), Postives = 332/536 (61.94%), Query Frame = 0

Query: 77  SSNGR--LQQGEKNLHTHLNGSSS-SSSSSSNHSQS--FEEFENNNHLRRLVRNGELEEG 136
           + NGR     G +NL T     ++  +     HSQS  F + +      R  R+G   E 
Sbjct: 49  NDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIES 108

Query: 137 FKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNVLISG 196
              LE MV +G  PD+I CT LI+G        KA RVMEILE  G  PDV  YN LI+G
Sbjct: 109 LHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQ-PDVFAYNALING 168

Query: 197 YCKSGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRECYPD 256
           +CK   I  A ++LDRM     SPD VTYN ++ +LC  GKL  A++VL++ L   C P 
Sbjct: 169 FCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPT 228

Query: 257 VITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAIKFLN 316
           VITYTILIEAT  E GV +A+KL+DEM  +G KPD+ TYN +I G+CKEG +D A + + 
Sbjct: 229 VITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVR 288

Query: 317 HMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFLCRKG 376
           ++   GC+P+VI++NI+LR++ + G+W + EKL+ +M  + C P+VVT++ILI  LCR G
Sbjct: 289 NLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDG 348

Query: 377 LLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYN 436
            +  A+++L+ M + G TP++ SY+PL+  FC+E ++D AIE+L+ M+S GC PDIV YN
Sbjct: 349 KIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYN 408

Query: 437 TLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLLDEMKGK 496
           T+L  LCK+GK D A+EI  +LG  GCSP   +YNT+   L   G    A+ ++ EM   
Sbjct: 409 TVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSN 468

Query: 497 GLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLCKVRQTVRA 556
           G+ PD ITY+S++  L REG VDEA     D+      P+ +TYN ++LG CK  +   A
Sbjct: 469 GIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDA 528

Query: 557 IDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSSAEQV 605
           I+ L  MV  GC+P E +Y +LIEG+ + G   EA+EL N+L     + + S +++
Sbjct: 529 INVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRL 583

BLAST of Bhi04G000750 vs. TAIR 10
Match: AT1G79080.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 349.7 bits (896), Expect = 4.6e-96
Identity = 188/486 (38.68%), Postives = 290/486 (59.67%), Query Frame = 0

Query: 128 LEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNV 187
           L + F  LE +V  G  P++   T L+  LCK  + +KA RV+E++  SG +PD   Y  
Sbjct: 87  LSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTY 146

Query: 188 LISGYCKSGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRE 247
           L++  CK G +G A+QL+++M       + VTYN ++R LC  G L Q+++ ++R +Q+ 
Sbjct: 147 LVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKG 206

Query: 248 CYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAI 307
             P+  TY+ L+EA  KE G  +A+KLLDE+  KG +P++V+YNVL+ G CKEGR D+A+
Sbjct: 207 LAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAM 266

Query: 308 KFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFL 367
                +P+ G + NV+++NI+LR +C  GRW +A  LLAEM     +PSVVT+NILIN L
Sbjct: 267 ALFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSL 326

Query: 368 CRKGLLGRAIDVLEKMPQ--HGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYP 427
              G   +A+ VL++M +  H     + SYNP++   CKE K+D  ++ LD M+ R C P
Sbjct: 327 AFHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKP 386

Query: 428 DIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLL 487
           +  TYN + +    + KV  A  I+  L +K        Y +VI  L + G T  A +LL
Sbjct: 387 NEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 446

Query: 488 DEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEM-GVRPNAITYNSIMLGLCK 547
            EM   G  PD  TYS+L+ GL  EG    A+     +EE    +P    +N+++LGLCK
Sbjct: 447 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 506

Query: 548 VRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSSA 607
           +R+T  A++    MV K   P E +Y IL+EG+A+E   + A E+L+EL  R V+ +++ 
Sbjct: 507 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 566

BLAST of Bhi04G000750 vs. TAIR 10
Match: AT1G08610.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 339.7 bits (870), Expect = 4.8e-93
Identity = 187/506 (36.96%), Postives = 279/506 (55.14%), Query Frame = 0

Query: 88  NLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLRRLVRNGELEEGFKFLEDMVYRGDIPDI 147
           NL   +        SS       +E  NN  L  L  NG+L +  K +E M     +P  
Sbjct: 80  NLRARVKPMKQFGLSSDGPITENDEETNNEILHNLCSNGKLTDACKLVEVMARHNQVPHF 139

Query: 148 IACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNVLISGYCKSGEIGSALQLLDR 207
            +C++L+RGL +  +  KA  ++ ++  SG VPD ITYN++I   CK G I +AL LL+ 
Sbjct: 140 PSCSNLVRGLARIDQLDKAMCILRVMVMSGGVPDTITYNMIIGNLCKKGHIRTALVLLED 199

Query: 208 MSVS---PDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESG 267
           MS+S   PDV+TYNT++R + D G  +QA+     QLQ  C P +ITYT+L+E  C+  G
Sbjct: 200 MSLSGSPPDVITYNTVIRCMFDYGNAEQAIRFWKDQLQNGCPPFMITYTVLVELVCRYCG 259

Query: 268 VGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAIKFLNHMPSYGCQPNVITHNI 327
             +A+++L++M  +GC PD+VTYN L+N  C+ G L+E    + H+ S+G + N +T+N 
Sbjct: 260 SARAIEVLEDMAVEGCYPDIVTYNSLVNYNCRRGNLEEVASVIQHILSHGLELNTVTYNT 319

Query: 328 ILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDVLEKMPQHG 387
           +L S+CS   W + E++L  M +    P+V+T+NILIN LC+  LL RAID   +     
Sbjct: 320 LLHSLCSHEYWDEVEEILNIMYQTSYCPTVITYNILINGLCKARLLSRAIDFFYQ----- 379

Query: 388 CTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAV 447
                                         M+ + C PDIVTYNT+L A+ K+G VD A+
Sbjct: 380 ------------------------------MLEQKCLPDIVTYNTVLGAMSKEGMVDDAI 439

Query: 448 EILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLLDEMKGKGLKPDIITYSSLVGGL 507
           E+L  L +  C P LITYN+VIDGL+K G  + A++L  +M   G+ PD IT  SL+ G 
Sbjct: 440 ELLGLLKNTCCPPGLITYNSVIDGLAKKGLMKKALELYHQMLDAGIFPDDITRRSLIYGF 499

Query: 508 SREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLCKVRQTVRAIDFLAYMVAKGCKPTE 567
            R   V+EA     +    G      TY  ++ GLCK ++   AI+ +  M+  GCKP E
Sbjct: 500 CRANLVEEAGQVLKETSNRGNGIRGSTYRLVIQGLCKKKEIEMAIEVVEIMLTGGCKPDE 550

Query: 568 ASYMILIEGLAYEGLAKEALELLNEL 591
             Y  +++G+   G+  EA++L  +L
Sbjct: 560 TIYTAIVKGVEEMGMGSEAVQLQKKL 550

BLAST of Bhi04G000750 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 332.4 bits (851), Expect = 7.6e-91
Identity = 168/488 (34.43%), Postives = 276/488 (56.56%), Query Frame = 0

Query: 123 VRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATR-VMEILEDSGAVPD 182
           +  G+L+   +  E MV  G     ++   ++ G CK G+   A   + E+    G  PD
Sbjct: 235 IEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPD 294

Query: 183 VITYNVLISGYCKSGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKQAMEVLD 242
             T+N L++G CK+G +  A++++D M      PDV TYN+++  LC  G++K+A+EVLD
Sbjct: 295 QYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLD 354

Query: 243 RQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEG 302
           + + R+C P+ +TY  LI   CKE+ V +A +L   +  KG  PDV T+N LI G+C   
Sbjct: 355 QMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTR 414

Query: 303 RLDEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFN 362
               A++    M S GC+P+  T+N+++ S+CS G+  +A  +L +M   GC+ SV+T+N
Sbjct: 415 NHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYN 474

Query: 363 ILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSR 422
            LI+  C+      A ++ ++M  HG + NS++YN L+ G CK ++++ A + +D M+  
Sbjct: 475 TLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIME 534

Query: 423 GCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDA 482
           G  PD  TYN+LLT  C+ G +  A +I+  + S GC P ++TY T+I GL K G+ E A
Sbjct: 535 GQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVA 594

Query: 483 IKLLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHD-LEEMGVRPNAITYNSIML 542
            KLL  ++ KG+      Y+ ++ GL R+ K  EAI  F + LE+    P+A++Y  +  
Sbjct: 595 SKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFR 654

Query: 543 GLCKVRQTVR-AIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVV 602
           GLC     +R A+DFL  ++ KG  P  +S  +L EGL    + +  ++L+N +  +   
Sbjct: 655 GLCNGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARF 714

Query: 603 KKSSAEQV 605
            +     V
Sbjct: 715 SEEEVSMV 722

BLAST of Bhi04G000750 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 892.1 bits (2304), Expect = 3.5e-258
Identity = 448/610 (73.44%), Postives = 509/610 (83.44%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCL---FHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVI 60
           MD+ +      +GFCL   FHR     RG K+    R S   +S ++ +G + + R +  
Sbjct: 1   MDLMVSTSSAQEGFCLIQQFHR--EYKRGNKLDVSCRTSGSISS-KIPLGSRKRNRLV-- 60

Query: 61  PSDCSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNN 120
                   LV A  +V++   NGR Q+ E     + N + +   SS N S + E+ E+NN
Sbjct: 61  --------LVSAASKVESSGLNGRAQKFETLSSGYSNSNGNGHYSSVNSSFALEDVESNN 120

Query: 121 HLRRLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSG 180
           HLR++VR GELEEGFKFLE+MVY G++PDII CT+LIRG C+ GKTRKA +++EILE SG
Sbjct: 121 HLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSG 180

Query: 181 AVPDVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVL 240
           AVPDVITYNV+ISGYCK+GEI +AL +LDRMSVSPDVVTYNTILR+LCDSGKLKQAMEVL
Sbjct: 181 AVPDVITYNVMISGYCKAGEINNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAMEVL 240

Query: 241 DRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKE 300
           DR LQR+CYPDVITYTILIEATC++SGVG AMKLLDEMRD+GC PDVVTYNVL+NGICKE
Sbjct: 241 DRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKE 300

Query: 301 GRLDEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTF 360
           GRLDEAIKFLN MPS GCQPNVITHNIILRSMCSTGRWMDAEKLLA+M+RKG SPSVVTF
Sbjct: 301 GRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTF 360

Query: 361 NILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVS 420
           NILINFLCRKGLLGRAID+LEKMPQHGC PNSLSYNPLLHGFCKEKKMDRAIEYL+ MVS
Sbjct: 361 NILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVS 420

Query: 421 RGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTED 480
           RGCYPDIVTYNT+LTALCKDGKV+ AVEILNQL SKGCSPVLITYNTVIDGL+K GKT  
Sbjct: 421 RGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKTGK 480

Query: 481 AIKLLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIML 540
           AIKLLDEM+ K LKPD ITYSSLVGGLSREGKVDEAI FFH+ E MG+RPNA+T+NSIML
Sbjct: 481 AIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSIML 540

Query: 541 GLCKVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVK 600
           GLCK RQT RAIDFL +M+ +GCKP E SY ILIEGLAYEG+AKEALELLNELC++G++K
Sbjct: 541 GLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGLMK 597

Query: 601 KSSAEQVVVK 608
           KSSAEQV  K
Sbjct: 601 KSSAEQVAGK 597

BLAST of Bhi04G000750 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 2.1e-125
Identity = 234/536 (43.66%), Postives = 332/536 (61.94%), Query Frame = 0

Query: 77  SSNGR--LQQGEKNLHTHLNGSSS-SSSSSSNHSQS--FEEFENNNHLRRLVRNGELEEG 136
           + NGR     G +NL T     ++  +     HSQS  F + +      R  R+G   E 
Sbjct: 49  NDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIES 108

Query: 137 FKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNVLISG 196
              LE MV +G  PD+I CT LI+G        KA RVMEILE  G  PDV  YN LI+G
Sbjct: 109 LHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQ-PDVFAYNALING 168

Query: 197 YCKSGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRECYPD 256
           +CK   I  A ++LDRM     SPD VTYN ++ +LC  GKL  A++VL++ L   C P 
Sbjct: 169 FCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPT 228

Query: 257 VITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAIKFLN 316
           VITYTILIEAT  E GV +A+KL+DEM  +G KPD+ TYN +I G+CKEG +D A + + 
Sbjct: 229 VITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVR 288

Query: 317 HMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFLCRKG 376
           ++   GC+P+VI++NI+LR++ + G+W + EKL+ +M  + C P+VVT++ILI  LCR G
Sbjct: 289 NLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDG 348

Query: 377 LLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYN 436
            +  A+++L+ M + G TP++ SY+PL+  FC+E ++D AIE+L+ M+S GC PDIV YN
Sbjct: 349 KIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYN 408

Query: 437 TLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLLDEMKGK 496
           T+L  LCK+GK D A+EI  +LG  GCSP   +YNT+   L   G    A+ ++ EM   
Sbjct: 409 TVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSN 468

Query: 497 GLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLCKVRQTVRA 556
           G+ PD ITY+S++  L REG VDEA     D+      P+ +TYN ++LG CK  +   A
Sbjct: 469 GIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDA 528

Query: 557 IDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSSAEQV 605
           I+ L  MV  GC+P E +Y +LIEG+ + G   EA+EL N+L     + + S +++
Sbjct: 529 INVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRL 583

BLAST of Bhi04G000750 vs. ExPASy Swiss-Prot
Match: A3KPF8 (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 349.7 bits (896), Expect = 6.5e-95
Identity = 188/486 (38.68%), Postives = 290/486 (59.67%), Query Frame = 0

Query: 128 LEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNV 187
           L + F  LE +V  G  P++   T L+  LCK  + +KA RV+E++  SG +PD   Y  
Sbjct: 87  LSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTY 146

Query: 188 LISGYCKSGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRE 247
           L++  CK G +G A+QL+++M       + VTYN ++R LC  G L Q+++ ++R +Q+ 
Sbjct: 147 LVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKG 206

Query: 248 CYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAI 307
             P+  TY+ L+EA  KE G  +A+KLLDE+  KG +P++V+YNVL+ G CKEGR D+A+
Sbjct: 207 LAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAM 266

Query: 308 KFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFL 367
                +P+ G + NV+++NI+LR +C  GRW +A  LLAEM     +PSVVT+NILIN L
Sbjct: 267 ALFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSL 326

Query: 368 CRKGLLGRAIDVLEKMPQ--HGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYP 427
              G   +A+ VL++M +  H     + SYNP++   CKE K+D  ++ LD M+ R C P
Sbjct: 327 AFHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKP 386

Query: 428 DIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLL 487
           +  TYN + +    + KV  A  I+  L +K        Y +VI  L + G T  A +LL
Sbjct: 387 NEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 446

Query: 488 DEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEM-GVRPNAITYNSIMLGLCK 547
            EM   G  PD  TYS+L+ GL  EG    A+     +EE    +P    +N+++LGLCK
Sbjct: 447 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 506

Query: 548 VRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSSA 607
           +R+T  A++    MV K   P E +Y IL+EG+A+E   + A E+L+EL  R V+ +++ 
Sbjct: 507 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 566

BLAST of Bhi04G000750 vs. ExPASy Swiss-Prot
Match: Q9FRS4 (Pentatricopeptide repeat-containing protein At1g08610 OS=Arabidopsis thaliana OX=3702 GN=At1g08610 PE=2 SV=1)

HSP 1 Score: 339.7 bits (870), Expect = 6.7e-92
Identity = 187/506 (36.96%), Postives = 279/506 (55.14%), Query Frame = 0

Query: 88  NLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLRRLVRNGELEEGFKFLEDMVYRGDIPDI 147
           NL   +        SS       +E  NN  L  L  NG+L +  K +E M     +P  
Sbjct: 80  NLRARVKPMKQFGLSSDGPITENDEETNNEILHNLCSNGKLTDACKLVEVMARHNQVPHF 139

Query: 148 IACTSLIRGLCKTGKTRKATRVMEILEDSGAVPDVITYNVLISGYCKSGEIGSALQLLDR 207
            +C++L+RGL +  +  KA  ++ ++  SG VPD ITYN++I   CK G I +AL LL+ 
Sbjct: 140 PSCSNLVRGLARIDQLDKAMCILRVMVMSGGVPDTITYNMIIGNLCKKGHIRTALVLLED 199

Query: 208 MSVS---PDVVTYNTILRTLCDSGKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESG 267
           MS+S   PDV+TYNT++R + D G  +QA+     QLQ  C P +ITYT+L+E  C+  G
Sbjct: 200 MSLSGSPPDVITYNTVIRCMFDYGNAEQAIRFWKDQLQNGCPPFMITYTVLVELVCRYCG 259

Query: 268 VGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAIKFLNHMPSYGCQPNVITHNI 327
             +A+++L++M  +GC PD+VTYN L+N  C+ G L+E    + H+ S+G + N +T+N 
Sbjct: 260 SARAIEVLEDMAVEGCYPDIVTYNSLVNYNCRRGNLEEVASVIQHILSHGLELNTVTYNT 319

Query: 328 ILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDVLEKMPQHG 387
           +L S+CS   W + E++L  M +    P+V+T+NILIN LC+  LL RAID   +     
Sbjct: 320 LLHSLCSHEYWDEVEEILNIMYQTSYCPTVITYNILINGLCKARLLSRAIDFFYQ----- 379

Query: 388 CTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAV 447
                                         M+ + C PDIVTYNT+L A+ K+G VD A+
Sbjct: 380 ------------------------------MLEQKCLPDIVTYNTVLGAMSKEGMVDDAI 439

Query: 448 EILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIKLLDEMKGKGLKPDIITYSSLVGGL 507
           E+L  L +  C P LITYN+VIDGL+K G  + A++L  +M   G+ PD IT  SL+ G 
Sbjct: 440 ELLGLLKNTCCPPGLITYNSVIDGLAKKGLMKKALELYHQMLDAGIFPDDITRRSLIYGF 499

Query: 508 SREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLCKVRQTVRAIDFLAYMVAKGCKPTE 567
            R   V+EA     +    G      TY  ++ GLCK ++   AI+ +  M+  GCKP E
Sbjct: 500 CRANLVEEAGQVLKETSNRGNGIRGSTYRLVIQGLCKKKEIEMAIEVVEIMLTGGCKPDE 550

Query: 568 ASYMILIEGLAYEGLAKEALELLNEL 591
             Y  +++G+   G+  EA++L  +L
Sbjct: 560 TIYTAIVKGVEEMGMGSEAVQLQKKL 550

BLAST of Bhi04G000750 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 332.4 bits (851), Expect = 1.1e-89
Identity = 168/488 (34.43%), Postives = 276/488 (56.56%), Query Frame = 0

Query: 123 VRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATR-VMEILEDSGAVPD 182
           +  G+L+   +  E MV  G     ++   ++ G CK G+   A   + E+    G  PD
Sbjct: 235 IEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPD 294

Query: 183 VITYNVLISGYCKSGEIGSALQLLDRM---SVSPDVVTYNTILRTLCDSGKLKQAMEVLD 242
             T+N L++G CK+G +  A++++D M      PDV TYN+++  LC  G++K+A+EVLD
Sbjct: 295 QYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLD 354

Query: 243 RQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEG 302
           + + R+C P+ +TY  LI   CKE+ V +A +L   +  KG  PDV T+N LI G+C   
Sbjct: 355 QMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTR 414

Query: 303 RLDEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFN 362
               A++    M S GC+P+  T+N+++ S+CS G+  +A  +L +M   GC+ SV+T+N
Sbjct: 415 NHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYN 474

Query: 363 ILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSR 422
            LI+  C+      A ++ ++M  HG + NS++YN L+ G CK ++++ A + +D M+  
Sbjct: 475 TLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIME 534

Query: 423 GCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDA 482
           G  PD  TYN+LLT  C+ G +  A +I+  + S GC P ++TY T+I GL K G+ E A
Sbjct: 535 GQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVA 594

Query: 483 IKLLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHD-LEEMGVRPNAITYNSIML 542
            KLL  ++ KG+      Y+ ++ GL R+ K  EAI  F + LE+    P+A++Y  +  
Sbjct: 595 SKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFR 654

Query: 543 GLCKVRQTVR-AIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVV 602
           GLC     +R A+DFL  ++ KG  P  +S  +L EGL    + +  ++L+N +  +   
Sbjct: 655 GLCNGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEETLVKLVNMVMQKARF 714

Query: 603 KKSSAEQV 605
            +     V
Sbjct: 715 SEEEVSMV 722

BLAST of Bhi04G000750 vs. ExPASy TrEMBL
Match: A0A5A7VFH1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G004060 PE=4 SV=1)

HSP 1 Score: 1163.7 bits (3009), Expect = 0.0e+00
Identity = 581/610 (95.25%), Postives = 594/610 (97.38%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCLFHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVIPSD 60
           MD+TLPAKHYTDGFCLFHR  +KNR R+V AKGRVSSETNSLRL+VGEKGK RF VIPS 
Sbjct: 1   MDLTLPAKHYTDGFCLFHRHTTKNRDRRVTAKGRVSSETNSLRLHVGEKGKHRFFVIPSY 60

Query: 61  CSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLR 120
            S+E LVRAVPRVDTFSSNGRL  GEKNLHTHLNGSSSSSSSSSN+SQS EEFENNNHLR
Sbjct: 61  GSDEQLVRAVPRVDTFSSNGRLPHGEKNLHTHLNGSSSSSSSSSNYSQSSEEFENNNHLR 120

Query: 121 RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP 180
           RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP
Sbjct: 121 RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP 180

Query: 181 DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ 240
           DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ
Sbjct: 181 DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ 240

Query: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL 300
           LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL
Sbjct: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL 300

Query: 301 DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNIL 360
           DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEM+RKGCSPSVVTFNIL
Sbjct: 301 DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMIRKGCSPSVVTFNIL 360

Query: 361 INFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC 420
           INFLCRKGL+GRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC
Sbjct: 361 INFLCRKGLIGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC 420

Query: 421 YPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIK 480
           YPDIVTYNTLLTALCKDGKVD+AVEILNQLGSKGCSPVLITYNTVIDGLSKVGKT DAIK
Sbjct: 421 YPDIVTYNTLLTALCKDGKVDIAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTGDAIK 480

Query: 481 LLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC 540
           LLDEMKGKGLKPDIITYS+LVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC
Sbjct: 481 LLDEMKGKGLKPDIITYSTLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC 540

Query: 541 KVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSS 600
           K RQTVRAIDFLAYMVA+GCKPTEASYMILIEGLAYEGL KEALELL+EL SRGVV+KSS
Sbjct: 541 KARQTVRAIDFLAYMVARGCKPTEASYMILIEGLAYEGLTKEALELLDELYSRGVVRKSS 600

Query: 601 AEQVVVKNTF 611
           AEQVVVKNTF
Sbjct: 601 AEQVVVKNTF 610

BLAST of Bhi04G000750 vs. ExPASy TrEMBL
Match: A0A1S3BB35 (pentatricopeptide repeat-containing protein At1g09900 OS=Cucumis melo OX=3656 GN=LOC103488143 PE=4 SV=1)

HSP 1 Score: 1163.7 bits (3009), Expect = 0.0e+00
Identity = 581/610 (95.25%), Postives = 594/610 (97.38%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCLFHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVIPSD 60
           MD+TLPAKHYTDGFCLFHR  +KNR R+V AKGRVSSETNSLRL+VGEKGK RF VIPS 
Sbjct: 1   MDLTLPAKHYTDGFCLFHRHTTKNRDRRVTAKGRVSSETNSLRLHVGEKGKHRFFVIPSY 60

Query: 61  CSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLR 120
            S+E LVRAVPRVDTFSSNGRL  GEKNLHTHLNGSSSSSSSSSN+SQS EEFENNNHLR
Sbjct: 61  GSDEQLVRAVPRVDTFSSNGRLPHGEKNLHTHLNGSSSSSSSSSNYSQSSEEFENNNHLR 120

Query: 121 RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP 180
           RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP
Sbjct: 121 RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP 180

Query: 181 DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ 240
           DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ
Sbjct: 181 DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ 240

Query: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL 300
           LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL
Sbjct: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL 300

Query: 301 DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNIL 360
           DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEM+RKGCSPSVVTFNIL
Sbjct: 301 DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMIRKGCSPSVVTFNIL 360

Query: 361 INFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC 420
           INFLCRKGL+GRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC
Sbjct: 361 INFLCRKGLIGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC 420

Query: 421 YPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIK 480
           YPDIVTYNTLLTALCKDGKVD+AVEILNQLGSKGCSPVLITYNTVIDGLSKVGKT DAIK
Sbjct: 421 YPDIVTYNTLLTALCKDGKVDIAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTGDAIK 480

Query: 481 LLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC 540
           LLDEMKGKGLKPDIITYS+LVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC
Sbjct: 481 LLDEMKGKGLKPDIITYSTLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC 540

Query: 541 KVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSS 600
           K RQTVRAIDFLAYMVA+GCKPTEASYMILIEGLAYEGL KEALELL+EL SRGVV+KSS
Sbjct: 541 KARQTVRAIDFLAYMVARGCKPTEASYMILIEGLAYEGLTKEALELLDELYSRGVVRKSS 600

Query: 601 AEQVVVKNTF 611
           AEQVVVKNTF
Sbjct: 601 AEQVVVKNTF 610

BLAST of Bhi04G000750 vs. ExPASy TrEMBL
Match: A0A6J1BSC2 (pentatricopeptide repeat-containing protein At1g09900 OS=Momordica charantia OX=3673 GN=LOC111005101 PE=4 SV=1)

HSP 1 Score: 1135.2 bits (2935), Expect = 0.0e+00
Identity = 573/623 (91.97%), Postives = 593/623 (95.18%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCLFHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVI-PS 60
           MD+TLPAKHYTDGFCLF R+ ++NRGRKVRA GRV+SE NSLRLYVG KGK RFLV+ PS
Sbjct: 1   MDVTLPAKHYTDGFCLFQRQSTRNRGRKVRAGGRVTSEANSLRLYVGGKGKARFLVVPPS 60

Query: 61  DCS------------EEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHS 120
           DCS            +  +VRA+ RVDTFSSNGRLQ GEKNLH HLNG S+SSSS+SNHS
Sbjct: 61  DCSVYGEKSHDLISKQLSIVRAISRVDTFSSNGRLQHGEKNLHGHLNG-SNSSSSASNHS 120

Query: 121 QSFEEFENNNHLRRLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKAT 180
           QSFEEFENNNHLRRLVRNGELEEGFKF+E+MV  GDIPDIIACTSLIRGLCKTGKTRKAT
Sbjct: 121 QSFEEFENNNHLRRLVRNGELEEGFKFVENMVCHGDIPDIIACTSLIRGLCKTGKTRKAT 180

Query: 181 RVMEILEDSGAVPDVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDS 240
           RVMEILEDSGAVPDVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDS
Sbjct: 181 RVMEILEDSGAVPDVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDS 240

Query: 241 GKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTY 300
           GKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEM++KGCKPDVVT+
Sbjct: 241 GKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMKNKGCKPDVVTF 300

Query: 301 NVLINGICKEGRLDEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVR 360
           NVLINGICKEGRLDEAIKFLN+MPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVR
Sbjct: 301 NVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVR 360

Query: 361 KGCSPSVVTFNILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDR 420
           KGCSPSVVTFNILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDR
Sbjct: 361 KGCSPSVVTFNILINFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDR 420

Query: 421 AIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVID 480
           AIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQL +KGCSPVLITYNTVID
Sbjct: 421 AIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSAKGCSPVLITYNTVID 480

Query: 481 GLSKVGKTEDAIKLLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRP 540
           GLSKVGKTEDAIKLLDEMKGKGLKPDIITYSSLV GLSREGKVDEAIAFFHDLEEMGVRP
Sbjct: 481 GLSKVGKTEDAIKLLDEMKGKGLKPDIITYSSLVRGLSREGKVDEAIAFFHDLEEMGVRP 540

Query: 541 NAITYNSIMLGLCKVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELL 600
           NAITYNSIMLGLCKVRQT RAIDFLAYMVA+GCKPTEASYMILIEGLAYEGLAKEALELL
Sbjct: 541 NAITYNSIMLGLCKVRQTSRAIDFLAYMVARGCKPTEASYMILIEGLAYEGLAKEALELL 600

Query: 601 NELCSRGVVKKSSAEQVVVKNTF 611
           NELCSRGVVKKSSAEQVVVKN+F
Sbjct: 601 NELCSRGVVKKSSAEQVVVKNSF 622

BLAST of Bhi04G000750 vs. ExPASy TrEMBL
Match: A0A6J1HFE3 (pentatricopeptide repeat-containing protein At1g09900-like OS=Cucurbita moschata OX=3662 GN=LOC111462544 PE=4 SV=1)

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 539/610 (88.36%), Postives = 568/610 (93.11%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCLFHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVIPSD 60
           MDITL AKHYTDGFCLF RR +KNRGR VRA+ R + ETNS+  +VG KGKPRFLV+PSD
Sbjct: 1   MDITLHAKHYTDGFCLFQRRSTKNRGRNVRAERRATPETNSVSFFVGGKGKPRFLVVPSD 60

Query: 61  CSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLR 120
           CSEE  VRAV +  +       QQGEKN+HTHLNG SSSSS SSNHSQSFEEFENNNHLR
Sbjct: 61  CSEESFVRAVLKNPS-------QQGEKNVHTHLNG-SSSSSFSSNHSQSFEEFENNNHLR 120

Query: 121 RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP 180
           RLVRNGELEEGFKF+E MVYRGDIPD I CTSLIRGLCKTGKTRKA RVMEILEDSGAVP
Sbjct: 121 RLVRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVP 180

Query: 181 DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ 240
           DVITYNVLISGYCKSGEI +AL+LLDRMS+SPDV+TYN ILRTLCDSGKLK+AMEVL RQ
Sbjct: 181 DVITYNVLISGYCKSGEIDNALKLLDRMSISPDVITYNIILRTLCDSGKLKEAMEVLHRQ 240

Query: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL 300
           LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMR+KGCKPDVVTYNVLINGICKEGRL
Sbjct: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRL 300

Query: 301 DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNIL 360
           DEAI+FLN M SYGCQPNVITHNIILRSMCSTGRW DAEKLLAEMVRKGCSPSVVTFNIL
Sbjct: 301 DEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNIL 360

Query: 361 INFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC 420
           INFLCRKGLLGRAID+LEKMPQHGCTPNS SYNPLLHGFCKEKKM+RAIEYLDIM SRGC
Sbjct: 361 INFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGC 420

Query: 421 YPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIK 480
           YPDIVTYNTLLTALCKDGKVDVAVEILNQLG+KGCSPVLITYNTVIDGLSK GKTEDA+K
Sbjct: 421 YPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKGCSPVLITYNTVIDGLSKAGKTEDAVK 480

Query: 481 LLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC 540
           LLDEMK KGLKPDIITYSSLVGGL REGKVDEAIAFFHDLEE+GVRPN ITYNSIMLGLC
Sbjct: 481 LLDEMKEKGLKPDIITYSSLVGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLC 540

Query: 541 KVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSS 600
           KV+QTVRAIDFLA MVA+GCKP EASYMILIEGLAYEGLAKEALELL+ELCSRGV+KKSS
Sbjct: 541 KVQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSS 600

Query: 601 AEQVVVKNTF 611
           AE+VV+KN+F
Sbjct: 601 AERVVIKNSF 602

BLAST of Bhi04G000750 vs. ExPASy TrEMBL
Match: A0A6J1K404 (pentatricopeptide repeat-containing protein At1g09900-like OS=Cucurbita maxima OX=3661 GN=LOC111492114 PE=4 SV=1)

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 538/610 (88.20%), Postives = 564/610 (92.46%), Query Frame = 0

Query: 1   MDITLPAKHYTDGFCLFHRRPSKNRGRKVRAKGRVSSETNSLRLYVGEKGKPRFLVIPSD 60
           MDITL AKHYTDGFCLF RR +KN  R VRA+ R + ETNS+   VG KGKPRFLVIPSD
Sbjct: 1   MDITLHAKHYTDGFCLFQRRSTKNSSRNVRAERRATPETNSVSFSVGGKGKPRFLVIPSD 60

Query: 61  CSEEPLVRAVPRVDTFSSNGRLQQGEKNLHTHLNGSSSSSSSSSNHSQSFEEFENNNHLR 120
           CSEE  VRAV +         LQQGEKNLHTHLNG SSSSS SSNHSQSFEEFENNNHLR
Sbjct: 61  CSEESFVRAVLK-------NPLQQGEKNLHTHLNG-SSSSSFSSNHSQSFEEFENNNHLR 120

Query: 121 RLVRNGELEEGFKFLEDMVYRGDIPDIIACTSLIRGLCKTGKTRKATRVMEILEDSGAVP 180
           RLVRNGELEEGFKF+E MVYRGDIPD I CTSLIRGLCKTGKTRKA RVMEILEDSGAVP
Sbjct: 121 RLVRNGELEEGFKFIEGMVYRGDIPDAIVCTSLIRGLCKTGKTRKAARVMEILEDSGAVP 180

Query: 181 DVITYNVLISGYCKSGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKQAMEVLDRQ 240
           DVITYNVLISGYCKSGEI +AL+LLDRMS+SPDV+TYN ILRTLCDSGKLK+AMEVL RQ
Sbjct: 181 DVITYNVLISGYCKSGEIDNALKLLDRMSISPDVITYNIILRTLCDSGKLKEAMEVLHRQ 240

Query: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRL 300
           LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMR+KGCKPDVVTYNVLINGICKEGRL
Sbjct: 241 LQRECYPDVITYTILIEATCKESGVGQAMKLLDEMREKGCKPDVVTYNVLINGICKEGRL 300

Query: 301 DEAIKFLNHMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLAEMVRKGCSPSVVTFNIL 360
           DEAI+FLN M SYGCQPNVITHNIILRSMCSTGRW DAEKLLAEMVRKGCSPSVVTFNIL
Sbjct: 301 DEAIEFLNEMSSYGCQPNVITHNIILRSMCSTGRWTDAEKLLAEMVRKGCSPSVVTFNIL 360

Query: 361 INFLCRKGLLGRAIDVLEKMPQHGCTPNSLSYNPLLHGFCKEKKMDRAIEYLDIMVSRGC 420
           INFLCRKGLLGRAID+LEKMPQHGCTPNS SYNPLLHGFCKEKKM+RAIEYLDIM SRGC
Sbjct: 361 INFLCRKGLLGRAIDILEKMPQHGCTPNSSSYNPLLHGFCKEKKMERAIEYLDIMASRGC 420

Query: 421 YPDIVTYNTLLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTEDAIK 480
           YPDIVTYNTLLTALCKDGKVDVAVEILNQLG+K CSPVLITYNTVIDGLSK GKTEDA+K
Sbjct: 421 YPDIVTYNTLLTALCKDGKVDVAVEILNQLGTKSCSPVLITYNTVIDGLSKAGKTEDAVK 480

Query: 481 LLDEMKGKGLKPDIITYSSLVGGLSREGKVDEAIAFFHDLEEMGVRPNAITYNSIMLGLC 540
           LLDEMK KGLKPDIITYSSL+GGL REGKVDEAIAFFHDLEE+GVRPN ITYNSIMLGLC
Sbjct: 481 LLDEMKEKGLKPDIITYSSLIGGLCREGKVDEAIAFFHDLEEVGVRPNVITYNSIMLGLC 540

Query: 541 KVRQTVRAIDFLAYMVAKGCKPTEASYMILIEGLAYEGLAKEALELLNELCSRGVVKKSS 600
           KV+QTVRAIDFLA MVA+GCKP EASYMILIEGLAYEGLAKEALELL+ELCSRGV+KKSS
Sbjct: 541 KVQQTVRAIDFLASMVARGCKPNEASYMILIEGLAYEGLAKEALELLDELCSRGVMKKSS 600

Query: 601 AEQVVVKNTF 611
           AE+VV+KN+F
Sbjct: 601 AERVVIKNSF 602

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
AT1G09900.12.5e-25973.44Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G04760.11.5e-12643.66Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT1G79080.14.6e-9638.68Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08610.14.8e-9336.96Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G53700.17.6e-9134.43Pentatricopeptide repeat (PPR) superfamily protein [more]
Match NameE-valueIdentityDescription
Q3EDF83.5e-25873.44Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX... [more]
Q9SR002.1e-12543.66Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
A3KPF86.5e-9538.68Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop... [more]
Q9FRS46.7e-9236.96Pentatricopeptide repeat-containing protein At1g08610 OS=Arabidopsis thaliana OX... [more]
Q9LFF11.1e-8934.43Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A5A7VFH10.0e+0095.25Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BB350.0e+0095.25pentatricopeptide repeat-containing protein At1g09900 OS=Cucumis melo OX=3656 GN... [more]
A0A6J1BSC20.0e+0091.97pentatricopeptide repeat-containing protein At1g09900 OS=Momordica charantia OX=... [more]
A0A6J1HFE30.0e+0088.36pentatricopeptide repeat-containing protein At1g09900-like OS=Cucurbita moschata... [more]
A0A6J1K4040.0e+0088.20pentatricopeptide repeat-containing protein At1g09900-like OS=Cucurbita maxima O... [more]
InterPro
Analysis Name: InterPro Annotations of Wax gourd (B227) v1
Date Performed: 2021-10-22
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 320..354
e-value: 6.3E-9
score: 33.5
coord: 149..182
e-value: 6.2E-5
score: 20.9
coord: 215..249
e-value: 2.6E-6
score: 25.2
coord: 460..494
e-value: 6.1E-11
score: 39.8
coord: 391..424
e-value: 4.7E-9
score: 33.9
coord: 285..319
e-value: 1.6E-10
score: 38.5
coord: 355..388
e-value: 5.2E-8
score: 30.6
coord: 183..209
e-value: 1.9E-7
score: 28.8
coord: 250..284
e-value: 5.3E-7
score: 27.4
coord: 530..564
e-value: 1.1E-6
score: 26.4
coord: 495..529
e-value: 8.8E-8
score: 29.9
coord: 425..457
e-value: 9.1E-8
score: 29.8
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 177..209
e-value: 9.9E-13
score: 47.6
coord: 348..380
e-value: 1.6E-12
score: 46.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 527..575
e-value: 2.0E-11
score: 43.9
coord: 282..331
e-value: 4.2E-18
score: 65.3
coord: 387..436
e-value: 1.3E-15
score: 57.4
coord: 460..505
e-value: 1.6E-15
score: 57.0
coord: 212..261
e-value: 4.5E-14
score: 52.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 122..142
e-value: 0.57
score: 10.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 213..247
score: 11.750571
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 353..387
score: 12.298636
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 458..492
score: 12.83574
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 146..180
score: 10.775016
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 283..317
score: 14.293592
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 388..422
score: 12.24383
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 493..527
score: 12.92343
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 423..457
score: 13.011121
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 563..597
score: 9.032168
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 248..282
score: 12.397287
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 528..562
score: 11.213468
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 318..352
score: 12.572669
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 181..211
score: 11.257313
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 381..451
e-value: 1.3E-22
score: 82.3
coord: 523..593
e-value: 1.8E-16
score: 62.2
coord: 311..380
e-value: 8.8E-22
score: 79.5
coord: 241..310
e-value: 3.8E-26
score: 93.8
coord: 452..522
e-value: 2.3E-22
score: 81.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 74..240
e-value: 1.2E-33
score: 118.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 89..112
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 89..113
NoneNo IPR availablePANTHERPTHR47932:SF45OS02G0565400 PROTEINcoord: 63..607
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 63..607

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi04M000750Bhi04M000750mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding