Bhi02G000224 (gene) Wax gourd

NameBhi02G000224
Typegene
OrganismBenincasa hispida (Wax gourd)
DescriptionPentatricopeptide repeat-containing protein
Locationchr2 : 5953449 .. 5955299 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTGGAATCTTCCGTCGTCATTCTATCCTCTCTCCGAAACACCATCATCATCGTTTCGCATCCACCTCTCTGCTTTCTTCCAAATTTCGACAACAAAACTCAACACCTCACTTGGAAAGAGAGCTCCTGATTTCTCTCATAAAATCATGTACCCACAAATCCCAATTGCTCCAAATCCATGCCCACATCATCCGTACTTCTTCCATTCAAGACCCCATTGTTTCCCTCCGCTTCTTGACTCGCACCGCCTCTGCCCCTTTTCGCGATTTGGGCTATTCTCGACGATTCTTCTCTCTCCTGACGAACCCATTTGTTTCTCATTATAATGCGATGTTGAGAGCCTATTCTTTGAGCCCTTCACCTCTGAAGGGATTGTACATGTACAGAGATATGGAAAGGCAAGGAGTTCGCGTCGATCCCTTGTCTTCTTCCTTTGCCGTTAAGTCTTGTATAAGAATGCTTTCATTATTTAGTGGGGTTCAGATTCACGCGAAGATTTTTAGAAATGGGCATCAATCGGATAGTCTTTTGCTCACCTCCATGATGGACCTGTATTCTCATTGTGGCAAACTTGAGGATGCGTGCAAATTGTTCGACGAAATTCCTCAAAGAGATGTTATTGCTTGGAACGTTTTGATTTCTTGTCTAACTCGAAATAAACGGACTAGGGATGCTTTAGGTTTGTTTGACATCATGCAAAGTCCAACATATCTCTGCGAACCTGATAAAGTTACTTGTTTACTTCTCCTCCAAGCTTGTGCAGACTTGAATGCATTGGAATTCGGTGAAAGGATTCATAATTATGTTCAACAGCACGGTTATAATACTGAGAGTAATTTGTGTAATTCGTTGATATCGATGTATTCGCTGTGTGGGCGTGTGGACAAGGCTTATGAAGTGTTTGATAAAATGCCGGAGAAAGATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCAATGAATGGACAGGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGAAGGGTATTGAGCCTGATGATCATACTTTCACTGCAGTTCTTTCTGCTTGTAGCCACTGTGGCCTTGTTGATGAAGGAATGGCTTTTTTTGATCGTATGAGACAGGAGTTCAGGATAGTTCCCAACGTCTATCACTATGGATGTATGGTTGATCTCTTGGGTCGCACTGGAATGCTTGATCAAGCCTATAAACTCATAATGTCAATGGAGGTGAACCCAGATGCAACATTATGGAGGACCCTTCTTGGAGCTTGCAGAATTCACGGTCATGCAAACCTTGGGGAGCGCATAATTGAGCATTTGATTGAACTCAAATCTCAAGAAGCTGGAGATTATGTGTTGTTGCTGAACATTTATTCTTCGGCTGGCAACTGGGACAAGGTATCTGAATTGAGGAAATTTATGAAGGAGAAGGGTATTTATACTACACCTGGCTGCACCACAATAGAATTGAATGGGGTGGTGCATGAGTTTGCTGTGGATGATATTTCGCATCCTATGAAGGACAAGATCTACGAGAAGTTGGATGAGATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATATCTTCTGAATTACACAGATTAAAGGCAGAAGATAAGGGGTATGCGCTTTCTAACCATAGTGAAAAATTGGCCATAGCCTTTGGGGTTCTTGCAACTCCGCCAGGAAGAACCATCAGAGTGGCAAATACCGTTCATATTTGCATGGATTGTCATAACTTTGCTAAGTATATCTCCAGTGTTTATAACAGAAAAGTGGTTGTTAGGGACCGAAGTCGGTCACATCATTTCCGAGAGGGTCGGTGTTCCTGCAATGATTATTGGTAG

mRNA sequence

ATGACTGGAATCTTCCGTCGTCATTCTATCCTCTCTCCGAAACACCATCATCATCGTTTCGCATCCACCTCTCTGCTTTCTTCCAAATTTCGACAACAAAACTCAACACCTCACTTGGAAAGAGAGCTCCTGATTTCTCTCATAAAATCATGTACCCACAAATCCCAATTGCTCCAAATCCATGCCCACATCATCCGTACTTCTTCCATTCAAGACCCCATTGTTTCCCTCCGCTTCTTGACTCGCACCGCCTCTGCCCCTTTTCGCGATTTGGGCTATTCTCGACGATTCTTCTCTCTCCTGACGAACCCATTTGTTTCTCATTATAATGCGATGTTGAGAGCCTATTCTTTGAGCCCTTCACCTCTGAAGGGATTGTACATGTACAGAGATATGGAAAGGCAAGGAGTTCGCGTCGATCCCTTGTCTTCTTCCTTTGCCGTTAAGTCTTGTATAAGAATGCTTTCATTATTTAGTGGGGTTCAGATTCACGCGAAGATTTTTAGAAATGGGCATCAATCGGATAGTCTTTTGCTCACCTCCATGATGGACCTGTATTCTCATTGTGGCAAACTTGAGGATGCGTGCAAATTGTTCGACGAAATTCCTCAAAGAGATGTTATTGCTTGGAACGTTTTGATTTCTTGTCTAACTCGAAATAAACGGACTAGGGATGCTTTAGGTTTGTTTGACATCATGCAAAGTCCAACATATCTCTGCGAACCTGATAAAGTTACTTGTTTACTTCTCCTCCAAGCTTGTGCAGACTTGAATGCATTGGAATTCGGTGAAAGGATTCATAATTATGTTCAACAGCACGGTTATAATACTGAGAGTAATTTGTGTAATTCGTTGATATCGATGTATTCGCTGTGTGGGCGTGTGGACAAGGCTTATGAAGTGTTTGATAAAATGCCGGAGAAAGATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCAATGAATGGACAGGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGAAGGGTATTGAGCCTGATGATCATACTTTCACTGCAGTTCTTTCTGCTTGTAGCCACTGTGGCCTTGTTGATGAAGGAATGGCTTTTTTTGATCGTATGAGACAGGAGTTCAGGATAGTTCCCAACGTCTATCACTATGGATGTATGGTTGATCTCTTGGGTCGCACTGGAATGCTTGATCAAGCCTATAAACTCATAATGTCAATGGAGGTGAACCCAGATGCAACATTATGGAGGACCCTTCTTGGAGCTTGCAGAATTCACGGTCATGCAAACCTTGGGGAGCGCATAATTGAGCATTTGATTGAACTCAAATCTCAAGAAGCTGGAGATTATGTGTTGTTGCTGAACATTTATTCTTCGGCTGGCAACTGGGACAAGGTATCTGAATTGAGGAAATTTATGAAGGAGAAGGGTATTTATACTACACCTGGCTGCACCACAATAGAATTGAATGGGGTGGTGCATGAGTTTGCTGTGGATGATATTTCGCATCCTATGAAGGACAAGATCTACGAGAAGTTGGATGAGATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATATCTTCTGAATTACACAGATTAAAGGCAGAAGATAAGGGGTATGCGCTTTCTAACCATAGTGAAAAATTGGCCATAGCCTTTGGGGTTCTTGCAACTCCGCCAGGAAGAACCATCAGAGTGGCAAATACCGTTCATATTTGCATGGATTGTCATAACTTTGCTAAGTATATCTCCAGTGTTTATAACAGAAAAGTGGTTGTTAGGGACCGAAGTCGGTCACATCATTTCCGAGAGGGTCGGTGTTCCTGCAATGATTATTGGTAG

Coding sequence (CDS)

ATGACTGGAATCTTCCGTCGTCATTCTATCCTCTCTCCGAAACACCATCATCATCGTTTCGCATCCACCTCTCTGCTTTCTTCCAAATTTCGACAACAAAACTCAACACCTCACTTGGAAAGAGAGCTCCTGATTTCTCTCATAAAATCATGTACCCACAAATCCCAATTGCTCCAAATCCATGCCCACATCATCCGTACTTCTTCCATTCAAGACCCCATTGTTTCCCTCCGCTTCTTGACTCGCACCGCCTCTGCCCCTTTTCGCGATTTGGGCTATTCTCGACGATTCTTCTCTCTCCTGACGAACCCATTTGTTTCTCATTATAATGCGATGTTGAGAGCCTATTCTTTGAGCCCTTCACCTCTGAAGGGATTGTACATGTACAGAGATATGGAAAGGCAAGGAGTTCGCGTCGATCCCTTGTCTTCTTCCTTTGCCGTTAAGTCTTGTATAAGAATGCTTTCATTATTTAGTGGGGTTCAGATTCACGCGAAGATTTTTAGAAATGGGCATCAATCGGATAGTCTTTTGCTCACCTCCATGATGGACCTGTATTCTCATTGTGGCAAACTTGAGGATGCGTGCAAATTGTTCGACGAAATTCCTCAAAGAGATGTTATTGCTTGGAACGTTTTGATTTCTTGTCTAACTCGAAATAAACGGACTAGGGATGCTTTAGGTTTGTTTGACATCATGCAAAGTCCAACATATCTCTGCGAACCTGATAAAGTTACTTGTTTACTTCTCCTCCAAGCTTGTGCAGACTTGAATGCATTGGAATTCGGTGAAAGGATTCATAATTATGTTCAACAGCACGGTTATAATACTGAGAGTAATTTGTGTAATTCGTTGATATCGATGTATTCGCTGTGTGGGCGTGTGGACAAGGCTTATGAAGTGTTTGATAAAATGCCGGAGAAAGATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCAATGAATGGACAGGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGAAGGGTATTGAGCCTGATGATCATACTTTCACTGCAGTTCTTTCTGCTTGTAGCCACTGTGGCCTTGTTGATGAAGGAATGGCTTTTTTTGATCGTATGAGACAGGAGTTCAGGATAGTTCCCAACGTCTATCACTATGGATGTATGGTTGATCTCTTGGGTCGCACTGGAATGCTTGATCAAGCCTATAAACTCATAATGTCAATGGAGGTGAACCCAGATGCAACATTATGGAGGACCCTTCTTGGAGCTTGCAGAATTCACGGTCATGCAAACCTTGGGGAGCGCATAATTGAGCATTTGATTGAACTCAAATCTCAAGAAGCTGGAGATTATGTGTTGTTGCTGAACATTTATTCTTCGGCTGGCAACTGGGACAAGGTATCTGAATTGAGGAAATTTATGAAGGAGAAGGGTATTTATACTACACCTGGCTGCACCACAATAGAATTGAATGGGGTGGTGCATGAGTTTGCTGTGGATGATATTTCGCATCCTATGAAGGACAAGATCTACGAGAAGTTGGATGAGATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATATCTTCTGAATTACACAGATTAAAGGCAGAAGATAAGGGGTATGCGCTTTCTAACCATAGTGAAAAATTGGCCATAGCCTTTGGGGTTCTTGCAACTCCGCCAGGAAGAACCATCAGAGTGGCAAATACCGTTCATATTTGCATGGATTGTCATAACTTTGCTAAGTATATCTCCAGTGTTTATAACAGAAAAGTGGTTGTTAGGGACCGAAGTCGGTCACATCATTTCCGAGAGGGTCGGTGTTCCTGCAATGATTATTGGTAG

Protein sequence

MTGIFRRHSILSPKHHHHRFASTSLLSSKFRQQNSTPHLERELLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW
BLAST of Bhi02G000224 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 719.5 bits (1856), Expect = 3.1e-206
Identity = 354/578 (61.25%), Postives = 445/578 (76.99%), Query Frame = 0

Query: 44  LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTA-SAPFRDLGYSRRFFSLLT 103
           L+SLI S T K  L QIHA ++RTS I++  V   FL+R A S   RD+ YS R FS   
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 104 NPFVSHYNAMLRAYSLSPSPLKGLYMYRDMER-QGVRVDPLSSSFAVKSCIRMLSLFSGV 163
           NP +SH N M+RA+SLS +P +G  ++R + R   +  +PLSSSFA+K CI+   L  G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 164 QIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNK 223
           QIH KIF +G  SDSLL+T++MDLYS C    DACK+FDEIP+RD ++WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 224 RTRDALGLFDIMQSPTYLC-EPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESN 283
           RTRD L LFD M++    C +PD VTCLL LQACA+L AL+FG+++H+++ ++G +   N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 284 LCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKG 343
           L N+L+SMYS CG +DKAY+VF  M E++VVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 344 IEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQ-EFRIVPNVYHYGCMVDLLGRTGMLDQ 403
           I P++ T T +LSACSH GLV EGM FFDRMR  EF+I PN++HYGC+VDLLGR  +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 404 AYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSS 463
           AY LI SME+ PD+T+WRTLLGACR+HG   LGER+I HLIELK++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 464 AGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQ 523
            G W+KV+ELR  MKEK I+T PGC+ IEL G VHEF VDD+SHP K++IY+ L EIN+Q
Sbjct: 434 VGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQ 493

Query: 524 LKIAGYEAEISSELHRLKA-EDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMD 583
           LKIAGY AEI+SELH L++ E+KGYAL  HSEKLAIAFG+L TPPG TIRV   +  C+D
Sbjct: 494 LKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVD 553

Query: 584 CHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           CHNFAK++S VY+R V+VRDRSR HHF+ G CSCND+W
Sbjct: 554 CHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Bhi02G000224 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 432.6 bits (1111), Expect = 7.5e-120
Identity = 205/485 (42.27%), Postives = 309/485 (63.71%), Query Frame = 0

Query: 134 RQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLE 193
           +  VR D  +    V +C +  S+  G Q+H  I  +G  S+  ++ +++DLYS CG+LE
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 194 DACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQA 253
            AC LF+ +P +DVI+WN LI   T     ++AL LF  M        P+ VT L +L A
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE--TPNDVTMLSILPA 378

Query: 254 CADLNALEFGERIHNYVQQH--GYNTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKDVV 313
           CA L A++ G  IH Y+ +   G    S+L  SLI MY+ CG ++ A++VF+ +  K + 
Sbjct: 379 CAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLS 438

Query: 314 SWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRM 373
           SW+AMI G +M+G+   + + F  M+K GI+PDD TF  +LSACSH G++D G   F  M
Sbjct: 439 SWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM 498

Query: 374 RQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANL 433
            Q++++ P + HYGCM+DLLG +G+  +A ++I  ME+ PD  +W +LL AC++HG+  L
Sbjct: 499 TQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVEL 558

Query: 434 GERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNG 493
           GE   E+LI+++ +  G YVLL NIY+SAG W++V++ R  + +KG+   PGC++IE++ 
Sbjct: 559 GESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDS 618

Query: 494 VVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEK 553
           VVHEF + D  HP   +IY  L+E+   L+ AG+  + S  L  ++ E K  AL +HSEK
Sbjct: 619 VVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEK 678

Query: 554 LAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCS 613
           LAIAFG+++T PG  + +   + +C +CH   K IS +Y R+++ RDR+R HHFR+G CS
Sbjct: 679 LAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCS 738

Query: 614 CNDYW 617
           CNDYW
Sbjct: 739 CNDYW 741

BLAST of Bhi02G000224 vs. Swiss-Prot
Match: sp|B8YEK4|OGR1_ORYSJ (Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa subsp. japonica OX=39947 GN=OGR1 PE=2 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 3.2e-118
Identity = 238/583 (40.82%), Postives = 332/583 (56.95%), Query Frame = 0

Query: 44  LISLIKSCTHKSQLLQIHAHIIRTSSI-QDPIVSLRFLTRTASAPF-RDLGYSRRFFSLL 103
           L SL+         LQ HA ++ +  +   P +  RFL R A +P    L ++      L
Sbjct: 10  LESLLPRLASLRHYLQFHARLLTSGHLGAHPGLRARFLDRLALSPHPAALPHALLLLRSL 69

Query: 104 TNPFVSHYNAMLRAYSLSPSPLKGLYMY--RDMERQGVRVDPLSSSFAVKSCIRMLSLFS 163
             P  +  NA LR  + SP P + L +   R +     R D LS SFA+K+  R     +
Sbjct: 70  PTPATNDLNAALRGLAASPHPARSLLLLAGRLLPALLPRPDALSLSFALKASARCSDAHT 129

Query: 164 GVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTR 223
            VQ+HA + R G  +D  LLT+++D Y+ CG L  A K+FDE+  RDV  WN L++ L +
Sbjct: 130 TVQLHALVLRLGVAADVRLLTTLLDSYAKCGDLASARKVFDEMTVRDVATWNSLLAGLAQ 189

Query: 224 NKRTRDALGLF----DIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGY 283
                 AL LF    +  Q      EP++VT +  L ACA +  L+ G  +H + ++ G 
Sbjct: 190 GTEPNLALALFHRLANSFQELPSREEPNEVTIVAALSACAQIGLLKDGMYVHEFAKRFGL 249

Query: 284 NTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKD--VVSWSAMISGLSMNGQGREAIEAF 343
           +    +CNSLI MYS CG + +A +VF  +  +D  +VS++A I   SM+G G +A+  F
Sbjct: 250 DRNVRVCNSLIDMYSKCGSLSRALDVFHSIKPEDQTLVSYNAAIQAHSMHGHGGDALRLF 309

Query: 344 WEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGR 403
            EM  + IEPD  T+ AVL  C+H GLVD+G+  F+ M    R+ PN+ HYG +VDLLGR
Sbjct: 310 DEMPTR-IEPDGVTYLAVLCGCNHSGLVDDGLRVFNSM----RVAPNMKHYGTIVDLLGR 369

Query: 404 TGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLL 463
            G L +AY  ++SM    D  LW+TLLGA ++HG   L E     L EL S   GDYVLL
Sbjct: 370 AGRLTEAYDTVISMPFPADIVLWQTLLGAAKMHGVVELAELAANKLAELGSNVDGDYVLL 429

Query: 464 LNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKL 523
            N+Y+S   W  V  +R  M+   +   PG +  E++GV+H+F   D  HP   +IY  L
Sbjct: 430 SNVYASKARWMDVGRVRDTMRSNDVRKVPGFSYTEIDGVMHKFINGDKEHPRWQEIYRAL 489

Query: 524 DEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTV 583
           ++I  ++   GYE E S+ LH +  E+K YAL  HSEKLAIAFG++ATPPG T+RV   +
Sbjct: 490 EDIVSRISELGYEPETSNVLHDIGEEEKQYALCYHSEKLAIAFGLIATPPGETLRVIKNL 549

Query: 584 HICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
            IC DCH  AK IS  Y R +V+RDR+R H F +G+CSC DYW
Sbjct: 550 RICGDCHVVAKLISKAYGRVIVIRDRARFHRFEDGQCSCRDYW 587

BLAST of Bhi02G000224 vs. Swiss-Prot
Match: sp|Q9LXY5|PP284_ARATH (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 3.2e-118
Identity = 218/577 (37.78%), Postives = 343/577 (59.45%), Query Frame = 0

Query: 43  LLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLL- 102
           +++ +++ C    +L +IH+H+I       P +    L   A +    L +++  F    
Sbjct: 7   VIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFDHFD 66

Query: 103 TNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQGV-RVDPLSSSFAVKSCIRMLSLFSG 162
           ++P  S +N ++R +S S SPL  +  Y  M    V R D  + +FA+KSC R+ S+   
Sbjct: 67  SDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSIPKC 126

Query: 163 VQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRN 222
           ++IH  + R+G   D+++ TS++  YS  G +E A K+FDE+P RD+++WNV+I C +  
Sbjct: 127 LEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCFSHV 186

Query: 223 KRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESN 282
                AL ++  M +   +C  D  T + LL +CA ++AL  G  +H         +   
Sbjct: 187 GLHNQALSMYKRMGNEG-VC-GDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCVF 246

Query: 283 LCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKG 342
           + N+LI MY+ CG ++ A  VF+ M ++DV++W++MI G  ++G G EAI  F +M   G
Sbjct: 247 VSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVASG 306

Query: 343 IEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQA 402
           + P+  TF  +L  CSH GLV EG+  F+ M  +F + PNV HYGCMVDL GR G L+ +
Sbjct: 307 VRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLENS 366

Query: 403 YKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSA 462
            ++I +   + D  LWRTLLG+C+IH +  LGE  ++ L++L++  AGDYVL+ +IYS+A
Sbjct: 367 LEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSAA 426

Query: 463 GNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQL 522
            +    + +RK ++   + T PG + IE+   VH+F VDD  HP    IY +L E+  + 
Sbjct: 427 NDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINRA 486

Query: 523 KIAGYEAEISSE-LHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDC 582
            +AGY+ E S+     L     G A ++HSEKLAIA+G++ T  G T+R+   + +C DC
Sbjct: 487 ILAGYKPEDSNRTAPTLSDRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCRDC 546

Query: 583 HNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           H+F KY+S  +NR+++VRDR R HHF +G CSCNDYW
Sbjct: 547 HSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of Bhi02G000224 vs. Swiss-Prot
Match: sp|Q9FX24|PPR71_ARATH (Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H68 PE=2 SV=2)

HSP 1 Score: 425.6 bits (1093), Expect = 9.2e-118
Identity = 226/589 (38.37%), Postives = 343/589 (58.23%), Query Frame = 0

Query: 39  LERELLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFF 98
           + R  + ++I+ C   SQ+ Q+ +H +     Q   +  R L R A +PF DL ++ + F
Sbjct: 1   MARVYMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIF 60

Query: 99  SLLTNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQG------VRVDPLSSSFAVKSCI 158
             +  P  + +NA++R ++ S  P      YR M +Q        RVD L+ SF +K+C 
Sbjct: 61  RYIPKPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACA 120

Query: 159 RMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNV 218
           R L   +  Q+H +I R G  +DSLL T+++D YS  G L  A KLFDE+P RDV +WN 
Sbjct: 121 RALCSSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNA 180

Query: 219 LISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQ 278
           LI+ L    R  +A+ L+  M+  T      +VT +  L AC+ L  ++ GE I      
Sbjct: 181 LIAGLVSGNRASEAMELYKRME--TEGIRRSEVTVVAALGACSHLGDVKEGENIF----- 240

Query: 279 HGYNTESNL-CNSLISMYSLCGRVDKAYEVFDKMP-EKDVVSWSAMISGLSMNGQGREAI 338
           HGY+ ++ +  N+ I MYS CG VDKAY+VF++   +K VV+W+ MI+G +++G+   A+
Sbjct: 241 HGYSNDNVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRAL 300

Query: 339 EAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDL 398
           E F +++  GI+PDD ++ A L+AC H GLV+ G++ F+ M  +  +  N+ HYGC+VDL
Sbjct: 301 EIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACK-GVERNMKHYGCVVDL 360

Query: 399 LGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDY 458
           L R G L +A+ +I SM + PD  LW++LLGA  I+    + E     + E+     GD+
Sbjct: 361 LSRAGRLREAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDF 420

Query: 459 VLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIY 518
           VLL N+Y++ G W  V  +R  M+ K +   PG + IE  G +HEF   D SH    +IY
Sbjct: 421 VLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIY 480

Query: 519 EKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVL---ATPPGRTI 578
           EK+DEI  +++  GY A+    LH +  E+K  AL  HSEKLA+A+G++          +
Sbjct: 481 EKIDEIRFKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMMDGADEESPV 540

Query: 579 RVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           RV N + IC DCH   K+IS +Y R+++VRDR R H F++G CSC D+W
Sbjct: 541 RVINNLRICGDCHVVFKHISKIYKREIIVRDRVRFHRFKDGSCSCRDFW 581

BLAST of Bhi02G000224 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 719.5 bits (1856), Expect = 1.7e-207
Identity = 354/578 (61.25%), Postives = 445/578 (76.99%), Query Frame = 0

Query: 44  LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTA-SAPFRDLGYSRRFFSLLT 103
           L+SLI S T K  L QIHA ++RTS I++  V   FL+R A S   RD+ YS R FS   
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 104 NPFVSHYNAMLRAYSLSPSPLKGLYMYRDMER-QGVRVDPLSSSFAVKSCIRMLSLFSGV 163
           NP +SH N M+RA+SLS +P +G  ++R + R   +  +PLSSSFA+K CI+   L  G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 164 QIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNK 223
           QIH KIF +G  SDSLL+T++MDLYS C    DACK+FDEIP+RD ++WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 224 RTRDALGLFDIMQSPTYLC-EPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESN 283
           RTRD L LFD M++    C +PD VTCLL LQACA+L AL+FG+++H+++ ++G +   N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 284 LCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKG 343
           L N+L+SMYS CG +DKAY+VF  M E++VVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 344 IEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQ-EFRIVPNVYHYGCMVDLLGRTGMLDQ 403
           I P++ T T +LSACSH GLV EGM FFDRMR  EF+I PN++HYGC+VDLLGR  +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 404 AYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSS 463
           AY LI SME+ PD+T+WRTLLGACR+HG   LGER+I HLIELK++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 464 AGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQ 523
            G W+KV+ELR  MKEK I+T PGC+ IEL G VHEF VDD+SHP K++IY+ L EIN+Q
Sbjct: 434 VGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQ 493

Query: 524 LKIAGYEAEISSELHRLKA-EDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMD 583
           LKIAGY AEI+SELH L++ E+KGYAL  HSEKLAIAFG+L TPPG TIRV   +  C+D
Sbjct: 494 LKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVD 553

Query: 584 CHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           CHNFAK++S VY+R V+VRDRSR HHF+ G CSCND+W
Sbjct: 554 CHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of Bhi02G000224 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 432.6 bits (1111), Expect = 4.2e-121
Identity = 205/485 (42.27%), Postives = 309/485 (63.71%), Query Frame = 0

Query: 134 RQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLE 193
           +  VR D  +    V +C +  S+  G Q+H  I  +G  S+  ++ +++DLYS CG+LE
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 194 DACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQA 253
            AC LF+ +P +DVI+WN LI   T     ++AL LF  M        P+ VT L +L A
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE--TPNDVTMLSILPA 378

Query: 254 CADLNALEFGERIHNYVQQH--GYNTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKDVV 313
           CA L A++ G  IH Y+ +   G    S+L  SLI MY+ CG ++ A++VF+ +  K + 
Sbjct: 379 CAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLS 438

Query: 314 SWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRM 373
           SW+AMI G +M+G+   + + F  M+K GI+PDD TF  +LSACSH G++D G   F  M
Sbjct: 439 SWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTM 498

Query: 374 RQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANL 433
            Q++++ P + HYGCM+DLLG +G+  +A ++I  ME+ PD  +W +LL AC++HG+  L
Sbjct: 499 TQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVEL 558

Query: 434 GERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNG 493
           GE   E+LI+++ +  G YVLL NIY+SAG W++V++ R  + +KG+   PGC++IE++ 
Sbjct: 559 GESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDS 618

Query: 494 VVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEK 553
           VVHEF + D  HP   +IY  L+E+   L+ AG+  + S  L  ++ E K  AL +HSEK
Sbjct: 619 VVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEK 678

Query: 554 LAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCS 613
           LAIAFG+++T PG  + +   + +C +CH   K IS +Y R+++ RDR+R HHFR+G CS
Sbjct: 679 LAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCS 738

Query: 614 CNDYW 617
           CNDYW
Sbjct: 739 CNDYW 741

BLAST of Bhi02G000224 vs. TAIR10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 427.2 bits (1097), Expect = 1.8e-119
Identity = 218/577 (37.78%), Postives = 343/577 (59.45%), Query Frame = 0

Query: 43  LLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLL- 102
           +++ +++ C    +L +IH+H+I       P +    L   A +    L +++  F    
Sbjct: 7   VIVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFDHFD 66

Query: 103 TNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQGV-RVDPLSSSFAVKSCIRMLSLFSG 162
           ++P  S +N ++R +S S SPL  +  Y  M    V R D  + +FA+KSC R+ S+   
Sbjct: 67  SDPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSIPKC 126

Query: 163 VQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRN 222
           ++IH  + R+G   D+++ TS++  YS  G +E A K+FDE+P RD+++WNV+I C +  
Sbjct: 127 LEIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCFSHV 186

Query: 223 KRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESN 282
                AL ++  M +   +C  D  T + LL +CA ++AL  G  +H         +   
Sbjct: 187 GLHNQALSMYKRMGNEG-VC-GDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCVF 246

Query: 283 LCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKG 342
           + N+LI MY+ CG ++ A  VF+ M ++DV++W++MI G  ++G G EAI  F +M   G
Sbjct: 247 VSNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVASG 306

Query: 343 IEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQA 402
           + P+  TF  +L  CSH GLV EG+  F+ M  +F + PNV HYGCMVDL GR G L+ +
Sbjct: 307 VRPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLENS 366

Query: 403 YKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSA 462
            ++I +   + D  LWRTLLG+C+IH +  LGE  ++ L++L++  AGDYVL+ +IYS+A
Sbjct: 367 LEMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSAA 426

Query: 463 GNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQL 522
            +    + +RK ++   + T PG + IE+   VH+F VDD  HP    IY +L E+  + 
Sbjct: 427 NDAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINRA 486

Query: 523 KIAGYEAEISSE-LHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDC 582
            +AGY+ E S+     L     G A ++HSEKLAIA+G++ T  G T+R+   + +C DC
Sbjct: 487 ILAGYKPEDSNRTAPTLSDRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCRDC 546

Query: 583 HNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           H+F KY+S  +NR+++VRDR R HHF +G CSCNDYW
Sbjct: 547 HSFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of Bhi02G000224 vs. TAIR10
Match: AT1G34160.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 425.6 bits (1093), Expect = 5.1e-119
Identity = 226/589 (38.37%), Postives = 343/589 (58.23%), Query Frame = 0

Query: 39  LERELLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFF 98
           + R  + ++I+ C   SQ+ Q+ +H +     Q   +  R L R A +PF DL ++ + F
Sbjct: 1   MARVYMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIF 60

Query: 99  SLLTNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQG------VRVDPLSSSFAVKSCI 158
             +  P  + +NA++R ++ S  P      YR M +Q        RVD L+ SF +K+C 
Sbjct: 61  RYIPKPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACA 120

Query: 159 RMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNV 218
           R L   +  Q+H +I R G  +DSLL T+++D YS  G L  A KLFDE+P RDV +WN 
Sbjct: 121 RALCSSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNA 180

Query: 219 LISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQ 278
           LI+ L    R  +A+ L+  M+  T      +VT +  L AC+ L  ++ GE I      
Sbjct: 181 LIAGLVSGNRASEAMELYKRME--TEGIRRSEVTVVAALGACSHLGDVKEGENIF----- 240

Query: 279 HGYNTESNL-CNSLISMYSLCGRVDKAYEVFDKMP-EKDVVSWSAMISGLSMNGQGREAI 338
           HGY+ ++ +  N+ I MYS CG VDKAY+VF++   +K VV+W+ MI+G +++G+   A+
Sbjct: 241 HGYSNDNVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRAL 300

Query: 339 EAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDL 398
           E F +++  GI+PDD ++ A L+AC H GLV+ G++ F+ M  +  +  N+ HYGC+VDL
Sbjct: 301 EIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACK-GVERNMKHYGCVVDL 360

Query: 399 LGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDY 458
           L R G L +A+ +I SM + PD  LW++LLGA  I+    + E     + E+     GD+
Sbjct: 361 LSRAGRLREAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDF 420

Query: 459 VLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIY 518
           VLL N+Y++ G W  V  +R  M+ K +   PG + IE  G +HEF   D SH    +IY
Sbjct: 421 VLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIY 480

Query: 519 EKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVL---ATPPGRTI 578
           EK+DEI  +++  GY A+    LH +  E+K  AL  HSEKLA+A+G++          +
Sbjct: 481 EKIDEIRFKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMMDGADEESPV 540

Query: 579 RVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           RV N + IC DCH   K+IS +Y R+++VRDR R H F++G CSC D+W
Sbjct: 541 RVINNLRICGDCHVVFKHISKIYKREIIVRDRVRFHRFKDGSCSCRDFW 581

BLAST of Bhi02G000224 vs. TAIR10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 424.1 bits (1089), Expect = 1.5e-118
Identity = 218/594 (36.70%), Postives = 347/594 (58.42%), Query Frame = 0

Query: 33  QNSTPHLERELLISLIKSCTHKSQL---LQIHAHIIRTSSIQDPIVSLRFLTRTASAPFR 92
           Q S+P  +   L  LI  C H+S L   L++H HI+   S QDP     FL       + 
Sbjct: 71  QESSPSQQTYEL--LILCCGHRSSLSDALRVHRHILDNGSDQDP-----FLATKLIGMYS 130

Query: 93  DLG---YSRRFFSLLTNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSF 152
           DLG   Y+R+ F       +  +NA+ RA +L+    + L +Y  M R GV  D  + ++
Sbjct: 131 DLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTY 190

Query: 153 AVKSCI----RMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEI 212
            +K+C+     +  L  G +IHA + R G+ S   ++T+++D+Y+  G ++ A  +F  +
Sbjct: 191 VLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGM 250

Query: 213 PQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEF 272
           P R+V++W+ +I+C  +N +  +AL  F  M   T    P+ VT + +LQACA L ALE 
Sbjct: 251 PVRNVVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQ 310

Query: 273 GERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSM 332
           G+ IH Y+ + G ++   + ++L++MY  CG+++    VFD+M ++DVVSW+++IS   +
Sbjct: 311 GKLIHGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGV 370

Query: 333 NGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVY 392
           +G G++AI+ F EM   G  P   TF +VL ACSH GLV+EG   F+ M ++  I P + 
Sbjct: 371 HGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIE 430

Query: 393 HYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIEL 452
           HY CMVDLLGR   LD+A K++  M   P   +W +LLG+CRIHG+  L ER    L  L
Sbjct: 431 HYACMVDLLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFAL 490

Query: 453 KSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDIS 512
           + + AG+YVLL +IY+ A  WD+V  ++K ++ +G+   PG   +E+   ++ F   D  
Sbjct: 491 EPKNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEF 550

Query: 513 HPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVLATP 572
           +P+ ++I+  L ++ + +K  GY  +    L+ L+ E+K   +  HSEKLA+AFG++ T 
Sbjct: 551 NPLMEQIHAFLVKLAEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTS 610

Query: 573 PGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
            G  IR+   + +C DCH F K+IS    ++++VRD +R H F+ G CSC DYW
Sbjct: 611 KGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of Bhi02G000224 vs. TrEMBL
Match: tr|A0A0A0LUH9|A0A0A0LUH9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G009750 PE=4 SV=1)

HSP 1 Score: 1094.7 bits (2830), Expect = 0.0e+00
Identity = 531/616 (86.20%), Postives = 564/616 (91.56%), Query Frame = 0

Query: 1   MTGIFRRHSILSPKHHHHRFASTSLLSSKFRQQNSTPHLERELLISLIKSCTHKSQLLQI 60
           M  IFR  SILS K+HHH                S  H ERE LISLIKSCTHKSQLLQI
Sbjct: 1   MCVIFRSPSILSLKYHHHSI--------------SFSHFEREPLISLIKSCTHKSQLLQI 60

Query: 61  HAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSHYNAMLRAYSLSP 120
           HAHII TSSIQDPIVSLRFLTRTASAPFRDLGYSRR F LLTNPFVSHYNAMLRAYSLS 
Sbjct: 61  HAHIITTSSIQDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSR 120

Query: 121 SPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLT 180
           SPL+GLYMYRDMERQGVR DPLSSSFAVKSCI++LSL  G+QIHA+IF NGHQ+DSLLLT
Sbjct: 121 SPLEGLYMYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLT 180

Query: 181 SMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLC 240
           SMMDLYSHCGK E+ACKLFDE+PQ+DV+AWNVLISCLTRNKRTRDALGLF+IMQSPTYLC
Sbjct: 181 SMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLC 240

Query: 241 EPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYE 300
           +PDKVTCLLLLQACADLNALEFGERIH Y+QQHGYNTESNLCNSLISMYS CGR+DKAYE
Sbjct: 241 QPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYE 300

Query: 301 VFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGL 360
           VFDKM EK+VVSWSAMISGLSMNG GREAIEAFWEMQK G+EP DHTFTAVLSACSHCGL
Sbjct: 301 VFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGL 360

Query: 361 VDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLL 420
           VDEGMAFFDRMRQEF I PNV+HYGC+VDLLGR GMLDQAY+LIMSMEV PDAT+WRTLL
Sbjct: 361 VDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLL 420

Query: 421 GACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYT 480
           GACRIHGH NLGERI+EHLIELKSQEAGDYVLLLNIYSSAGNWDKV+ELRK MKEKGIYT
Sbjct: 421 GACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYT 480

Query: 481 TPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAED 540
           TP CTTIELNGVVH+FAVDDISHPMKDKIY++LDEINKQLKIAGYEAE+SSELHRL+ +D
Sbjct: 481 TPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKD 540

Query: 541 KGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRS 600
           KGYALSNHSEKLAIAFGVLATPPGRTIR+AN +  CMDCHNFAKYISSVYNRKVVVRDRS
Sbjct: 541 KGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRS 600

Query: 601 RSHHFREGRCSCNDYW 617
           R HHF+EGRCSCND+W
Sbjct: 601 RFHHFQEGRCSCNDFW 602

BLAST of Bhi02G000224 vs. TrEMBL
Match: tr|A0A1S3BV40|A0A1S3BV40_CUCME (pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN=LOC103493993 PE=4 SV=1)

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 515/573 (89.88%), Postives = 545/573 (95.11%), Query Frame = 0

Query: 44  LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTN 103
           LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRF  LLTN
Sbjct: 13  LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFLDLLTN 72

Query: 104 PFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQI 163
           P VSHYNAMLRAYS+S SPL+GLY+YRDMERQGVR DPLSSSFAVKSCI++LSL  G+QI
Sbjct: 73  PLVSHYNAMLRAYSVSRSPLEGLYVYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQI 132

Query: 164 HAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRT 223
           HA+IF  GHQ+DSLLLTSMMDLYSHCGK E+ACKLFDE+PQ+DV+AWNVLISCLTRNKRT
Sbjct: 133 HARIFIYGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRT 192

Query: 224 RDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCN 283
           RDALGLF+IMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH Y+QQH YNTESNLCN
Sbjct: 193 RDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHCYNTESNLCN 252

Query: 284 SLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEP 343
           SLISMYS CGRVDKAYEVFDKMPEK+VVSWSAMISGLSMNG GREAIEAFWEMQK G+EP
Sbjct: 253 SLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEP 312

Query: 344 DDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKL 403
           DDHTFTAVLSACSHCGLVDEGMAFFDRMRQE  I PNV+HYGC+VDLLGR GMLDQAY+L
Sbjct: 313 DDHTFTAVLSACSHCGLVDEGMAFFDRMRQELMIAPNVHHYGCIVDLLGRAGMLDQAYEL 372

Query: 404 IMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNW 463
           IMSMEV PDAT+WRTLLGACRIHGHANLGERI+EHLIELKSQEAGDYVLLLNIYSSAG W
Sbjct: 373 IMSMEVRPDATMWRTLLGACRIHGHANLGERIVEHLIELKSQEAGDYVLLLNIYSSAGKW 432

Query: 464 DKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIA 523
           DKV+ELRK MKEKGIYTTP CTTIELNGVVHEFAVDDISHPMKDKIY++LDEINKQLKIA
Sbjct: 433 DKVTELRKLMKEKGIYTTPCCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINKQLKIA 492

Query: 524 GYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFA 583
           GYEAE+SSELHRLK EDKGYALSNHSEKLAIAFGVLATPPGRTIRVAN +  CMDCHNFA
Sbjct: 493 GYEAEMSSELHRLKPEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANNIRTCMDCHNFA 552

Query: 584 KYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           KYISSVYNRKVV+RDRSR HHF+EGRCSCND+W
Sbjct: 553 KYISSVYNRKVVLRDRSRFHHFQEGRCSCNDFW 585

BLAST of Bhi02G000224 vs. TrEMBL
Match: tr|A0A2I4G1Y4|A0A2I4G1Y4_9ROSI (pentatricopeptide repeat-containing protein At3g47530 OS=Juglans regia OX=51240 GN=LOC109004001 PE=4 SV=1)

HSP 1 Score: 871.3 bits (2250), Expect = 1.3e-249
Identity = 415/601 (69.05%), Postives = 492/601 (81.86%), Query Frame = 0

Query: 16  HHHRFASTSLLSSKFRQQNSTPHLERELLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIV 75
           H+   A+ + L+S+  ++      ER  L SLIKSCT K+ LLQIHAH++ T  +QDP +
Sbjct: 12  HYRSLATAASLASQTLEE------ERRQLPSLIKSCTQKTHLLQIHAHLVCTGLLQDPTI 71

Query: 76  SLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQ 135
           SL FL+R A +P RD+ YSR+FF+ +++P   HYN M+RAYS+S SPL+G YMYR+M+RQ
Sbjct: 72  SLIFLSRLALSPARDVDYSRQFFTQISDPLTFHYNTMIRAYSMSNSPLEGFYMYREMKRQ 131

Query: 136 GVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDA 195
            VRV+PLSSSFA+KSCI++ SL  GVQ+HA+I  +GHQSDSLLLT++MDLYS C + ++A
Sbjct: 132 SVRVNPLSSSFAIKSCIKLSSLLGGVQVHARILTDGHQSDSLLLTNLMDLYSCCERCDEA 191

Query: 196 CKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLLLQACA 255
           CK+FD+I  RD +AWNVLISC  RN RTRDA+GLFDIMQS +  CEPD VTCLLLLQACA
Sbjct: 192 CKVFDDIRDRDTVAWNVLISCCMRNNRTRDAMGLFDIMQSGSDGCEPDDVTCLLLLQACA 251

Query: 256 DLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSA 315
            LNALEFGE+IH Y+ QHGY T   LCNSLI+MYS CG ++KAY VF  M  K+VVSWSA
Sbjct: 252 HLNALEFGEKIHAYIGQHGYGTAGKLCNSLIAMYSRCGCLEKAYGVFKGMRNKNVVSWSA 311

Query: 316 MISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEF 375
           MISGL+MNG GREAIEAFWEMQK GI PDD TFT VLSACSHCGLVDEGM FFD M +EF
Sbjct: 312 MISGLAMNGHGREAIEAFWEMQKLGIPPDDQTFTGVLSACSHCGLVDEGMMFFDLMSKEF 371

Query: 376 RIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHANLGERI 435
            I PN  HYGC+VDLLGR G+LDQAY+LI+SM V PD+T+WRTLLGACRIHGH  LGER+
Sbjct: 372 GIAPNTRHYGCVVDLLGRAGLLDQAYQLILSMSVKPDSTMWRTLLGACRIHGHVTLGERV 431

Query: 436 IEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELNGVVHE 495
           + HLIELK+QEAGDY LLLNIYSSAG WDKV E+RKFM+EK I TTPGC+TI L GVVHE
Sbjct: 432 VGHLIELKAQEAGDYALLLNIYSSAGKWDKVMEVRKFMQEKAIQTTPGCSTIVLKGVVHE 491

Query: 496 FAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSEKLAIA 555
           F VDD+SHP K +IYE L+EIN+QLKIAGY AE+S+ELH L AE+KG ALS HSEKLAIA
Sbjct: 492 FVVDDVSHPRKGEIYEMLNEINQQLKIAGYVAEVSAELHNLGAEEKGDALSYHSEKLAIA 551

Query: 556 FGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRCSCNDY 615
           FGVL+TPPG TIRVA  +  C+DCHNFAK +S VYNR+V+VRDR+R HHF+EGRCSCNDY
Sbjct: 552 FGVLSTPPGTTIRVAKNLRTCIDCHNFAKVLSGVYNREVIVRDRTRFHHFKEGRCSCNDY 606

Query: 616 W 617
           W
Sbjct: 612 W 606

BLAST of Bhi02G000224 vs. TrEMBL
Match: tr|A0A251PAT0|A0A251PAT0_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G194000 PE=4 SV=1)

HSP 1 Score: 856.7 bits (2212), Expect = 3.3e-245
Identity = 402/615 (65.37%), Postives = 490/615 (79.67%), Query Frame = 0

Query: 2   TGIFRRHSILSPKHHHHRFASTSLLSSKFRQQNSTPHLERELLISLIKSCTHKSQLLQIH 61
           T +    S LS    HH        ++    Q  TP   ++ L+ LIKSCT +S LLQIH
Sbjct: 4   TAVSHHLSSLSSSQSHHPNVPVCFTTNISHTQ--TP---KQSLLDLIKSCTRRSHLLQIH 63

Query: 62  AHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSHYNAMLRAYSLSPS 121
           AHI+RTS + +P + L+FL+    +P + + YSRRFF  +  P    YN M+RAYS+S S
Sbjct: 64  AHIVRTSLVLEPTICLQFLSLVGLSPLKSISYSRRFFDQIAKPTAFQYNTMVRAYSISDS 123

Query: 122 PLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLTS 181
           P +G  MYRD+ R+G+R D L+SSF +KSCIR+ SL  G+Q+HA+I R GH+SDS LLT+
Sbjct: 124 PEEGFSMYRDLLRRGLRADALASSFVIKSCIRVSSLLGGIQVHARILRGGHESDSRLLTT 183

Query: 182 MMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCE 241
           +MDLYS CGK ++ACKLFDE+P+RDV+AWNVLISC   N RTRDA+ LFDIM+S T+ CE
Sbjct: 184 LMDLYSICGKCDEACKLFDEMPKRDVVAWNVLISCCLHNNRTRDAVSLFDIMRSETHRCE 243

Query: 242 PDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYEV 301
           PD+VTCLL+LQAC++LNALEFGER+H Y+++HGY+  SNLCNSLI+MYS CG +DKAYEV
Sbjct: 244 PDEVTCLLMLQACSNLNALEFGERVHKYIEEHGYDGASNLCNSLIAMYSRCGCLDKAYEV 303

Query: 302 FDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLV 361
           F  M +K+VVSWSAMISGL++NG GREAIEAF EMQK G+ PDD TFT VL ACSHCGLV
Sbjct: 304 FKGMKDKNVVSWSAMISGLAVNGYGREAIEAFGEMQKMGVLPDDQTFTGVLCACSHCGLV 363

Query: 362 DEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLG 421
           DEGM FFDRM ++F +VPN++HYGCMVDLLGR G LDQAY+LI+SM++ PD+T+WRTLLG
Sbjct: 364 DEGMVFFDRMSKDFGVVPNIHHYGCMVDLLGRAGRLDQAYQLILSMDIKPDSTIWRTLLG 423

Query: 422 ACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTT 481
            CRIHGH  L E +I HLIELK+QEAGDYVLL+NIYSSAGNW+K++E+RKFMKEK I TT
Sbjct: 424 GCRIHGHDALAESVIGHLIELKAQEAGDYVLLMNIYSSAGNWEKLTEVRKFMKEKAIQTT 483

Query: 482 PGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDK 541
           PGC+TIEL GV HEF VDD+SHP KD+IY  LDEIN QLKIAGY A++SSELH L  E+K
Sbjct: 484 PGCSTIELKGVAHEFVVDDVSHPRKDEIYNMLDEINSQLKIAGYVADVSSELHNLGTEEK 543

Query: 542 GYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSR 601
           G+ALS HSEKLAIAFGVLATPPG  IRVA  + IC+DCHNFA  +S VYNR+V++RDR+R
Sbjct: 544 GHALSYHSEKLAIAFGVLATPPGTPIRVAKNLRICVDCHNFAMVLSGVYNREVIIRDRTR 603

Query: 602 SHHFREGRCSCNDYW 617
            HHFREGRCSCN YW
Sbjct: 604 FHHFREGRCSCNGYW 613

BLAST of Bhi02G000224 vs. TrEMBL
Match: tr|A0A2P5F707|A0A2P5F707_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_105670 PE=4 SV=1)

HSP 1 Score: 842.8 bits (2176), Expect = 4.9e-241
Identity = 401/606 (66.17%), Postives = 489/606 (80.69%), Query Frame = 0

Query: 12  SPKHHHHRFASTSLLSSKFRQQNSTPHLERELLISLIKSCTHKSQLLQIHAHIIRTSSIQ 71
           +P     +   T+  SS   + + +P   RELLISLI+SC+HK+ LLQIHAHI+RTS ++
Sbjct: 41  NPPESVQQATKTTTTSSFSPRTDGSP---RELLISLIRSCSHKTLLLQIHAHIVRTSLVR 100

Query: 72  DPIVSLRFLTRTA-SAPFRDLGYSRRFFSLLTNPFVSHYNAMLRAYSLSPSPLKGLYMYR 131
           DP + L FL+R+A SAPFRDL YSRRFFS ++ P V  YN M+RAYS+S SP++G Y+YR
Sbjct: 101 DPTICLEFLSRSALSAPFRDLNYSRRFFSEISRPSVFQYNVMIRAYSMSDSPVEGFYLYR 160

Query: 132 DMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLTSMMDLYSHCG 191
           DM R+G+  DPLSSSFA+KSCIR+ S   G+Q+H +I R+G QSDS LLT++MDLYS  G
Sbjct: 161 DMRRRGLSADPLSSSFALKSCIRVSSFEGGIQVHGRILRDGLQSDSRLLTTLMDLYSCSG 220

Query: 192 KLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLCEPDKVTCLLL 251
           +  +A  +FDE+ +RD +AWNVLISC  RNKRTRDAL LFD+MQS  Y CEPD+VTCLL+
Sbjct: 221 RFCEARNVFDEMSRRDTVAWNVLISCCLRNKRTRDALALFDVMQSEAYGCEPDEVTCLLM 280

Query: 252 LQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYEVFDKMPEKDV 311
           LQAC+ LNALEFGER+H Y+++ G+   +NL NSL+SMYS CG ++KAY VF  + +K+V
Sbjct: 281 LQACSSLNALEFGERVHGYIEERGFGGHTNLRNSLLSMYSKCGCLEKAYGVFKGIRDKNV 340

Query: 312 VSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGLVDEGMAFFDR 371
           +SWSAMISG ++NG GREAI+AF EMQ+  + PD  TFT +LSACSHCG VDEGM FFDR
Sbjct: 341 ISWSAMISGFAINGYGREAIDAFEEMQRMHVPPDAQTFTGILSACSHCGFVDEGMMFFDR 400

Query: 372 MRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLLGACRIHGHAN 431
           M +EF I PN +HYGC+VDLLGR G LD+AY+LI+SM++ PD  +WRTLLGACRIHGH N
Sbjct: 401 MSKEFGISPNNHHYGCLVDLLGRAGQLDRAYQLILSMDIKPDVEIWRTLLGACRIHGHVN 460

Query: 432 LGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYTTPGCTTIELN 491
           LGER+++HLIELK+QEAGDYVLLLNIYSSAGNWDKV+E+RKF+KEK I TTPGC+T+EL 
Sbjct: 461 LGERVVDHLIELKAQEAGDYVLLLNIYSSAGNWDKVTEVRKFLKEKTIQTTPGCSTVELK 520

Query: 492 GVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAEDKGYALSNHSE 551
           GVVHEF  DD+SHP K KIYE LDEIN QLKIAGY  EISSELH L AE+K  ALS HSE
Sbjct: 521 GVVHEFVADDVSHPQKGKIYEMLDEINSQLKIAGYVVEISSELHNLGAEEKHCALSYHSE 580

Query: 552 KLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRSRSHHFREGRC 611
           KLAIAFGVL+TPPG TIRVA  +  C+DCHNFAK +S VYNR+V+VRDR+R HHF EGRC
Sbjct: 581 KLAIAFGVLSTPPGTTIRVAKNLRTCIDCHNFAKVLSGVYNREVIVRDRTRFHHFWEGRC 640

Query: 612 SCNDYW 617
           SCNDYW
Sbjct: 641 SCNDYW 643

BLAST of Bhi02G000224 vs. NCBI nr
Match: XP_023515406.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 554/628 (88.22%), Postives = 583/628 (92.83%), Query Frame = 0

Query: 1   MTGIFRRHSILSPKHHHH----RFAST--------SLLSSKFRQQNSTPHLERELLISLI 60
           MT IFRR    + +H H     RFAST        SLLSSKFR+QNST   +RE LISLI
Sbjct: 1   MTVIFRRCRCSADRHLHSLRLPRFASTASLLHSPISLLSSKFREQNSTLRFDREPLISLI 60

Query: 61  KSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSH 120
           KSCTHKSQLLQIHAH+IRTS IQDPIVSLRFLTR  SAPFR+LGYSRRFFS LTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIF 180
           YN +LRAYSLS SPL+GLYMYRDMERQGV  DPLSSSFAVKSCIRMLSLFSG+QIHA+IF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFAVKSCIRMLSLFSGIQIHARIF 180

Query: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDV+AWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISM 300
           LF+IMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH+Y+QQ+ YNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSYIQQNDYNTESNLCNSLISM 300

Query: 301 YSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTF 360
           YS CGRVDKAYEVFDKMPEK+VVSWSAMISGLSMNG GREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMRQEF IVP V+HYGCMVDLLGR GMLDQAY+L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSE 480
           VNPDAT+WRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKV+E
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIY++LDEIN+QLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINQQLKIAGYEAE 540

Query: 541 ISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISS 600
           ISSELH LKAEDKGYALS HSEKLAIAFGVLATPPGRTIRVAN +  CMDCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSYHSEKLAIAFGVLATPPGRTIRVANNLRTCMDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           VYNRKVVVRDRSR HHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of Bhi02G000224 vs. NCBI nr
Match: XP_022921651.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata])

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 555/628 (88.38%), Postives = 582/628 (92.68%), Query Frame = 0

Query: 1   MTGIFRRHSILSPKHHHH----RFAST--------SLLSSKFRQQNSTPHLERELLISLI 60
           MT IFRR    + +H H      FAST        SLLSSKFR+QNST   +RE LISLI
Sbjct: 1   MTVIFRRCRCSADRHLHSLRLPHFASTASLLHSPISLLSSKFREQNSTLRFDREPLISLI 60

Query: 61  KSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSH 120
           KSCTHKSQLLQIHAH+IRTS IQDPIVSLRFLTR  SAPFR+LGYSRRFFS LTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIF 180
           YN +LRAYSLS SPL+GLYMYRDMER+GV  DPLSSSFAVKSCIRMLSLFSGVQIHA+IF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIF 180

Query: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDV+AWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISM 300
           LF+IMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH+++QQHGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTF 360
           YS CGRVDKAYEVFDKMPEK+VVSWSAMISGLSMNG GREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMRQEF IVP V+HYGCMVDLLGR GMLDQAY+L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSE 480
           VNPDAT+WRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNW KV+E
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWVKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYE+LDEINKQLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEQLDEINKQLKIAGYEAE 540

Query: 541 ISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISS 600
           ISSELH LKAEDKGYALS HSEKLAIAFGVLATPPGRTIRVAN +  CMDCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSFHSEKLAIAFGVLATPPGRTIRVANNLRTCMDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           VYNRKVVVRDRSR HHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of Bhi02G000224 vs. NCBI nr
Match: XP_022987181.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima])

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 550/628 (87.58%), Postives = 584/628 (92.99%), Query Frame = 0

Query: 1   MTGIFRRHSILSPKHHHH----RFAST--------SLLSSKFRQQNSTPHLERELLISLI 60
           MT IFRR    + +H H     RFAST        SLLSSKFRQQNST H +RE LISLI
Sbjct: 1   MTVIFRRCRCSAYRHPHSLRLPRFASTASLLHSPISLLSSKFRQQNSTLHFDREPLISLI 60

Query: 61  KSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSH 120
           KSCTHKSQLLQIHAH+IRTS IQDPIVSLRFLTR  SAPFR+LGYSRRFFS LTNPFVSH
Sbjct: 61  KSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSH 120

Query: 121 YNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIF 180
           YN +LRAYSLS SPL+GLYMYRDMERQGV  DPLSSSFA+KSCIRMLSLFSG+QIHA+IF
Sbjct: 121 YNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFALKSCIRMLSLFSGIQIHARIF 180

Query: 181 RNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALG 240
           RNGHQSDSLLLTSMMDLYSHCGKL+DACKLFDEIPQRDV+AWNVLISCLTRNKRTRDALG
Sbjct: 181 RNGHQSDSLLLTSMMDLYSHCGKLKDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALG 240

Query: 241 LFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISM 300
           LF+IMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH+++QQHGYNTESNLCNSLISM
Sbjct: 241 LFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISM 300

Query: 301 YSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTF 360
           YS CGRVDKAYEVFDKMPEK+VVSWSAMISGLSMNG GREAIEAFW MQKKG+EPDDHTF
Sbjct: 301 YSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTF 360

Query: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSME 420
           TAVLSACSHCGLVDEGMAFFDRMRQEF IVP V+HYGCMVDLLGR GMLDQAY+L+MSME
Sbjct: 361 TAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSME 420

Query: 421 VNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSE 480
           VNPDAT+WRTLLGACRIHGHANLGER+IEHL+ELKSQEAGDYVLLLNIYSSAGNWDKV+E
Sbjct: 421 VNPDATMWRTLLGACRIHGHANLGERVIEHLVELKSQEAGDYVLLLNIYSSAGNWDKVTE 480

Query: 481 LRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAE 540
           LRKFMKE+GIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIY++LDEIN+QLKIAGYEAE
Sbjct: 481 LRKFMKERGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYKQLDEINQQLKIAGYEAE 540

Query: 541 ISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISS 600
           ISSELH LKAEDKGYALS HSEKLAIAFGVLATPPG TIRVAN +  C+DCHNFAKY+SS
Sbjct: 541 ISSELHNLKAEDKGYALSFHSEKLAIAFGVLATPPGGTIRVANNLRTCLDCHNFAKYVSS 600

Query: 601 VYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           VYNRKVVVRDRS+ HHFREGRCSCNDYW
Sbjct: 601 VYNRKVVVRDRSQFHHFREGRCSCNDYW 628

BLAST of Bhi02G000224 vs. NCBI nr
Match: XP_011660092.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Cucumis sativus] >KGN63671.1 hypothetical protein Csa_1G009750 [Cucumis sativus])

HSP 1 Score: 1094.7 bits (2830), Expect = 0.0e+00
Identity = 531/616 (86.20%), Postives = 564/616 (91.56%), Query Frame = 0

Query: 1   MTGIFRRHSILSPKHHHHRFASTSLLSSKFRQQNSTPHLERELLISLIKSCTHKSQLLQI 60
           M  IFR  SILS K+HHH                S  H ERE LISLIKSCTHKSQLLQI
Sbjct: 1   MCVIFRSPSILSLKYHHHSI--------------SFSHFEREPLISLIKSCTHKSQLLQI 60

Query: 61  HAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLTNPFVSHYNAMLRAYSLSP 120
           HAHII TSSIQDPIVSLRFLTRTASAPFRDLGYSRR F LLTNPFVSHYNAMLRAYSLS 
Sbjct: 61  HAHIITTSSIQDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSR 120

Query: 121 SPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQIHAKIFRNGHQSDSLLLT 180
           SPL+GLYMYRDMERQGVR DPLSSSFAVKSCI++LSL  G+QIHA+IF NGHQ+DSLLLT
Sbjct: 121 SPLEGLYMYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLT 180

Query: 181 SMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKRTRDALGLFDIMQSPTYLC 240
           SMMDLYSHCGK E+ACKLFDE+PQ+DV+AWNVLISCLTRNKRTRDALGLF+IMQSPTYLC
Sbjct: 181 SMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLC 240

Query: 241 EPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLCNSLISMYSLCGRVDKAYE 300
           +PDKVTCLLLLQACADLNALEFGERIH Y+QQHGYNTESNLCNSLISMYS CGR+DKAYE
Sbjct: 241 QPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYE 300

Query: 301 VFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIEPDDHTFTAVLSACSHCGL 360
           VFDKM EK+VVSWSAMISGLSMNG GREAIEAFWEMQK G+EP DHTFTAVLSACSHCGL
Sbjct: 301 VFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGL 360

Query: 361 VDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYKLIMSMEVNPDATLWRTLL 420
           VDEGMAFFDRMRQEF I PNV+HYGC+VDLLGR GMLDQAY+LIMSMEV PDAT+WRTLL
Sbjct: 361 VDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLL 420

Query: 421 GACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVSELRKFMKEKGIYT 480
           GACRIHGH NLGERI+EHLIELKSQEAGDYVLLLNIYSSAGNWDKV+ELRK MKEKGIYT
Sbjct: 421 GACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYT 480

Query: 481 TPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKIAGYEAEISSELHRLKAED 540
           TP CTTIELNGVVH+FAVDDISHPMKDKIY++LDEINKQLKIAGYEAE+SSELHRL+ +D
Sbjct: 481 TPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKD 540

Query: 541 KGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNFAKYISSVYNRKVVVRDRS 600
           KGYALSNHSEKLAIAFGVLATPPGRTIR+AN +  CMDCHNFAKYISSVYNRKVVVRDRS
Sbjct: 541 KGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRS 600

Query: 601 RSHHFREGRCSCNDYW 617
           R HHF+EGRCSCND+W
Sbjct: 601 RFHHFQEGRCSCNDFW 602

BLAST of Bhi02G000224 vs. NCBI nr
Match: XP_022135228.1 (pentatricopeptide repeat-containing protein At3g47530 [Momordica charantia])

HSP 1 Score: 1078.9 bits (2789), Expect = 0.0e+00
Identity = 531/634 (83.75%), Postives = 573/634 (90.38%), Query Frame = 0

Query: 1   MTGIFRRHSI---LSPKHHHH--RFAST--------SLLSSKFRQQNSTPH-----LERE 60
           M  +FR+  +   L+P+HH    RFAST        SL+SSKFRQ NST       +ERE
Sbjct: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60

Query: 61  LLISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFFSLLT 120
            LISLIKSCTHK QLLQIHAHIIRTSSI+DPI++LRFLTR A+APFR+L YSRRFFS LT
Sbjct: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120

Query: 121 NPFVSHYNAMLRAYSLSPSPLKGLYMYRDMERQGVRVDPLSSSFAVKSCIRMLSLFSGVQ 180
           NP VSHYNAMLRAYSLS SP  GLY+YRDMERQG+R DPLSSSFA+KSCIR+ SL SGVQ
Sbjct: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180

Query: 181 IHAKIFRNGHQSDSLLLTSMMDLYSHCGKLEDACKLFDEIPQRDVIAWNVLISCLTRNKR 240
           IHA+IFRNGHQSDSLLLT+MMDLYSHCGKLE+ACKLFDEIPQRDV+AWNVLISCLTRNKR
Sbjct: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240

Query: 241 TRDALGLFDIMQSPTYLCEPDKVTCLLLLQACADLNALEFGERIHNYVQQHGYNTESNLC 300
           TRDALGLF+IMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIH+Y+Q+ GY+TESNLC
Sbjct: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300

Query: 301 NSLISMYSLCGRVDKAYEVFDKMPEKDVVSWSAMISGLSMNGQGREAIEAFWEMQKKGIE 360
           NSLISMYS CGRVDKAYEVFDKMPEK+VVSWSA+ISGLSMNG GREAIEAFWEMQK GIE
Sbjct: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360

Query: 361 PDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFRIVPNVYHYGCMVDLLGRTGMLDQAYK 420
           PDD TFT VLSACSHCGLVDEGMAFFDRMR EF+I PNV+HYGCMVDLLGR GMLDQAY+
Sbjct: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQ 420

Query: 421 LIMSMEVNPDATLWRTLLGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGN 480
           L MSME+NPDATLWRTLLGAC+IHGH NLGE II HLIE KSQEAGDYVLLLNIYSSAGN
Sbjct: 421 LAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGN 480

Query: 481 WDKVSELRKFMKEKGIYTTPGCTTIELNGVVHEFAVDDISHPMKDKIYEKLDEINKQLKI 540
           WDKV+ELRKFMKE GIYTTP CTTIELNGVVHEFAVDD+SHPMKD+IYE+LDEINKQLKI
Sbjct: 481 WDKVTELRKFMKENGIYTTPSCTTIELNGVVHEFAVDDVSHPMKDEIYEQLDEINKQLKI 540

Query: 541 AGYEAEISSELHRLKAEDKGYALSNHSEKLAIAFGVLATPPGRTIRVANTVHICMDCHNF 600
           AGYE+EISSELH LKAE+KGYALS HSEKLAIAFGVLATPPGRTIRVAN +  C+DCHNF
Sbjct: 541 AGYESEISSELHNLKAEEKGYALSCHSEKLAIAFGVLATPPGRTIRVANNIRTCVDCHNF 600

Query: 601 AKYISSVYNRKVVVRDRSRSHHFREGRCSCNDYW 617
           AKY+SSVYNRKVVVRDRSR HHFREGRCSCNDYW
Sbjct: 601 AKYVSSVYNRKVVVRDRSRFHHFREGRCSCNDYW 633

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
sp|Q9SN85|PP267_ARATH3.1e-20661.25Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX... [more]
sp|Q9LN01|PPR21_ARATH7.5e-12042.27Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
sp|B8YEK4|OGR1_ORYSJ3.2e-11840.82Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa ... [more]
sp|Q9LXY5|PP284_ARATH3.2e-11837.78Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX... [more]
sp|Q9FX24|PPR71_ARATH9.2e-11838.37Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
AT3G47530.11.7e-20761.25Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G08070.14.2e-12142.27Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G56550.11.8e-11937.78Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G34160.15.1e-11938.37Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G46790.11.5e-11836.70Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
tr|A0A0A0LUH9|A0A0A0LUH9_CUCSA0.0e+0086.20Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G009750 PE=4 SV=1[more]
tr|A0A1S3BV40|A0A1S3BV40_CUCME0.0e+0089.88pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A2I4G1Y4|A0A2I4G1Y4_9ROSI1.3e-24969.05pentatricopeptide repeat-containing protein At3g47530 OS=Juglans regia OX=51240 ... [more]
tr|A0A251PAT0|A0A251PAT0_PRUPE3.3e-24565.37Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G194000 PE=4 SV=1[more]
tr|A0A2P5F707|A0A2P5F707_9ROSA4.9e-24166.17DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_105670 ... [more]
Match NameE-valueIdentityDescription
XP_023515406.10.0e+0088.22pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pep... [more]
XP_022921651.10.0e+0088.38pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata][more]
XP_022987181.10.0e+0087.58pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima][more]
XP_011660092.10.0e+0086.20PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Cucumis sativu... [more]
XP_022135228.10.0e+0083.75pentatricopeptide repeat-containing protein At3g47530 [Momordica charantia][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO:0008270zinc ion binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR032867DYW_dom
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0008568 microtubule-severing ATPase activity
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Bhi02M000224Bhi02M000224mRNA


Analysis Name: InterPro Annotations of wax gourd
Date Performed: 2019-11-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 108..137
e-value: 1.0
score: 9.7
coord: 450..478
e-value: 0.0055
score: 16.8
coord: 383..408
e-value: 0.064
score: 13.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 209..235
e-value: 8.3E-5
score: 20.5
coord: 109..140
e-value: 0.0022
score: 16.0
coord: 281..311
e-value: 4.1E-6
score: 24.6
coord: 346..381
e-value: 8.1E-6
score: 23.7
coord: 180..206
e-value: 0.0021
score: 16.1
coord: 311..344
e-value: 5.7E-7
score: 27.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 308..356
e-value: 1.6E-14
score: 53.7
coord: 206..255
e-value: 9.7E-8
score: 32.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 210..240
score: 5.788
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 380..410
score: 7.805
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 105..139
score: 7.87
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 175..209
score: 9.471
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 309..343
score: 12.496
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 243..277
score: 6.588
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 446..480
score: 8.177
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 344..374
score: 9.602
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 412..442
score: 5.546
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 278..308
score: 9.482
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 482..606
e-value: 1.4E-34
score: 118.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 107..259
e-value: 3.5E-24
score: 87.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 260..375
e-value: 6.9E-32
score: 113.2
coord: 376..554
e-value: 1.2E-16
score: 63.1
NoneNo IPR availablePANTHERPTHR24015:SF1054SUBFAMILY NOT NAMEDcoord: 71..532
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 71..532

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Bhi02G000224Wax gourdwgowgoB121
Bhi02G000224Wax gourdwgowgoB125
Bhi02G000224Cucumber (Gy14) v1cgywgoB188
Bhi02G000224Cucumber (Gy14) v1cgywgoB585
Bhi02G000224Cucumber (Gy14) v1cgywgoB693
Bhi02G000224Cucumber (Gy14) v2cgybwgoB107
Bhi02G000224Cucurbita maxima (Rimu)cmawgoB0151
Bhi02G000224Cucurbita maxima (Rimu)cmawgoB0631
Bhi02G000224Cucurbita maxima (Rimu)cmawgoB0916
Bhi02G000224Cucurbita moschata (Rifu)cmowgoB0160
Bhi02G000224Cucurbita moschata (Rifu)cmowgoB0355
Bhi02G000224Cucurbita moschata (Rifu)cmowgoB0631
Bhi02G000224Cucurbita pepo (Zucchini)cpewgoB0737
Bhi02G000224Cucurbita moschata (Rifu)cmowgoB0917
Bhi02G000224Cucurbita pepo (Zucchini)cpewgoB0062
Bhi02G000224Cucurbita pepo (Zucchini)cpewgoB0581
Bhi02G000224Cucurbita pepo (Zucchini)cpewgoB0824
Bhi02G000224Cucurbita pepo (Zucchini)cpewgoB0883
Bhi02G000224Wild cucumber (PI 183967)cpiwgoB116
Bhi02G000224Wild cucumber (PI 183967)cpiwgoB528
Bhi02G000224Cucumber (Chinese Long) v3cucwgoB121
Bhi02G000224Cucumber (Chinese Long) v3cucwgoB125
Bhi02G000224Cucumber (Chinese Long) v3cucwgoB537
Bhi02G000224Cucumber (Chinese Long) v2cuwgoB115
Bhi02G000224Cucumber (Chinese Long) v2cuwgoB525
Bhi02G000224Bottle gourd (USVL1VR-Ls)lsiwgoB291
Bhi02G000224Bottle gourd (USVL1VR-Ls)lsiwgoB408
Bhi02G000224Melon (DHL92) v3.6.1medwgoB090
Bhi02G000224Melon (DHL92) v3.6.1medwgoB194
Bhi02G000224Melon (DHL92) v3.6.1medwgoB291
Bhi02G000224Melon (DHL92) v3.6.1medwgoB446
Bhi02G000224Melon (DHL92) v3.5.1mewgoB089
Bhi02G000224Melon (DHL92) v3.5.1mewgoB192
Bhi02G000224Melon (DHL92) v3.5.1mewgoB291
Bhi02G000224Watermelon (Charleston Gray)wcgwgoB407
Bhi02G000224Watermelon (Charleston Gray)wcgwgoB487
Bhi02G000224Watermelon (97103) v2wgowmbB644
Bhi02G000224Watermelon (97103) v2wgowmbB647
Bhi02G000224Watermelon (97103) v1wgowmB671
Bhi02G000224Watermelon (97103) v1wgowmB666
Bhi02G000224Silver-seed gourdcarwgoB0063
Bhi02G000224Silver-seed gourdcarwgoB0732