BLAST of Cp4.1LG20g05810 vs. Swiss-Prot
Match:
PPR64_ARATH (Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidopsis thaliana GN=EMB2279 PE=2 SV=1)
HSP 1 Score: 728.4 bits (1879), Expect = 9.6e-209
Identity = 414/861 (48.08%), Postives = 560/861 (65.04%), Query Frame = 1
Query: 62 GHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKM 121
G A+K S GES + + + F+ + S EY R ++ R + D+ + +
Sbjct: 172 GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231
Query: 122 KENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDSK 181
E + I D + + ++ + V + ++S +T D S + SK
Sbjct: 232 --------VEERRVQRIAKDARWSKS-RESSVAVKWSNSGESS--VTMPKDESFRRRYSK 291
Query: 182 RKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTLD 241
++ RS D +G S+ ++ + V + E +V R D R +L
Sbjct: 292 QEH-HRSSDTSRGIAR--GSKGDELELVVE---------ERRVQRIAKDVRWSKSDESLV 351
Query: 242 VKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTE-- 301
SE R G+ + + DT R + G G+ L +++ ++ E
Sbjct: 352 PVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGD-GLDLLAEERRIERLANERHEIR 411
Query: 302 -QSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCD-AEDIMDKPRVSKMEMEERIQMLSKR 361
G + G + ++ ++S +E AF D + DI+DKP S++EME+RI+ L+K
Sbjct: 412 SSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKPATSRVEMEDRIEKLAKV 471
Query: 362 LNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKS 421
LNGADI+MPEW F++ +RSAKIRY+D++++R+I LGKLGNW+RVLQVIEWLQ ++R+KS
Sbjct: 472 LNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWRRVLQVIEWLQRQDRYKS 531
Query: 422 HKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELF 481
+K+R IYTTAL+VLGK+RRPVEALNVFHAM SSYPD+VAY SIAVTLGQAG+++ELF
Sbjct: 532 NKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAYRSIAVTLGQAGHIKELF 591
Query: 482 DVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQG 541
VID+MRSPPKKKFK LEKWDPRL+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G
Sbjct: 592 YVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQRKQWEGAFWVLQQLKQRG 651
Query: 542 LQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVL 601
+PS TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV
Sbjct: 652 QKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVH 711
Query: 602 AIQTMEKRGIVGSAALYYDFARCLCSAGRCKEAL-------------------------- 661
++ ME RGIVGSAALYYD ARCLCSAGRC E L
Sbjct: 712 TVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPVVLKLIENLIYKADLVHT 771
Query: 662 --MQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKG 721
Q++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK CSPNLVTCNI+LK
Sbjct: 772 IQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKA 831
Query: 722 YLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHF 781
YL G+F+EA+ELFQ MSE+G +I SD+ RVLPD YTFNTMLD +++WDDF +
Sbjct: 832 YLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYA 891
Query: 782 YNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLA 841
Y +ML +GYHFN KRHLRM++EA+R GK+E++E TW+H+ +++R P PLIKERF L
Sbjct: 892 YREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLE 951
Query: 842 RGDYSEALSCIS----KHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLL-ARN 886
+GD+ A+S ++ K ++ FS SAW +L RF +DSV+ L+ V+ L +R+
Sbjct: 952 KGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRS 1002
BLAST of Cp4.1LG20g05810 vs. Swiss-Prot
Match:
PP451_ARATH (Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidopsis thaliana GN=DG1 PE=1 SV=2)
HSP 1 Score: 410.2 bits (1053), Expect = 5.8e-113
Identity = 221/557 (39.68%), Postives = 339/557 (60.86%), Query Frame = 1
Query: 349 ERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEW 408
E +++L RL+G +I+ W F +MM + +++++ +L+++ LG+ +WK+ V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242
Query: 409 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 468
+ ++ K + RF+YT L VLG ARRP EAL +F+ M YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302
Query: 469 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFW 528
QAG ++EL VI+ MR P K K + WDP L+PD+V+YNA+LNACV W+ W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362
Query: 529 VLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 588
V EL++ GL+P+ TYGL MEVML+ GK++ VH+FFRK++ S P A+TYKVLV LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422
Query: 589 KEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 648
+EGK +EAV A++ ME++G++G+ ++YY+ A CLC+ GR +A++++ ++ ++ N +PL
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482
Query: 649 VTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNM 708
+T+TGLI A L+ ++ + IF +MK C PN+ T N++LK Y + MF EAKELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542
Query: 709 SENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHL 768
VS ++P+ YT++ ML+AS +W+ F H Y M+L GY + +H
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602
Query: 769 RMIMEAARGGKDELLETTWKHLAQADRTLPPPL-IKERFCIMLARGDYSEALSCISKHHS 828
M++EA+R GK LLE + + + D +P PL E C A+GD+ A++ I+ +
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLE-DGEIPHPLFFTELLCHATAKGDFQRAITLINT-VA 662
Query: 829 SDEHHFSKSAWLNLLKEKR--FPKDSVIELIHKVSMLLARND-SPNPVLQNLLLSGKEFC 888
S+ W +L +E + +D+ +HK+S L D P + NL S K C
Sbjct: 663 LASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRC 722
Query: 889 RSRISVADPRLEEVVCT 900
S S A P L V T
Sbjct: 723 GSSSSSAQPLLAVDVTT 724
BLAST of Cp4.1LG20g05810 vs. Swiss-Prot
Match:
PP120_ARATH (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana GN=At1g74580 PE=3 SV=1)
HSP 1 Score: 110.9 bits (276), Expect = 7.3e-23
Identity = 98/417 (23.50%), Postives = 176/417 (42.21%), Query Frame = 1
Query: 369 MFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKLRFIYTTAL 428
MF M + +++ + VI+ LG G ++ + +V+ + MRE +H L +Y A+
Sbjct: 26 MFNSMRKEVGFKHTLSTYRSVIEKLGYYGKFEAMEEVL--VDMRENVGNHMLEGVYVGAM 85
Query: 429 DVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPK 488
G+ + EA+NVF M + + P + +Y++I L +GY + V MR
Sbjct: 86 KNYGRKGKVQEAVNVFERM-DFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMR---- 145
Query: 489 KKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLV 548
D + PD+ + + + K A +L + QG + + Y V
Sbjct: 146 -----------DRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTV 205
Query: 549 MEVMLQCGKYNLVHEFFRKVQKSSIPNAL-TYKVLVNTLWKEGKTDEAVLAIQTMEKRGI 608
+ + +E F K+ S + L T+ L+ L K+G E + + KRG+
Sbjct: 206 VGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGV 265
Query: 609 VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQA-CLDSKNLQSAV 668
+ + Y F + LC G A+ + + + KP V+TY LI C +SK ++ V
Sbjct: 266 LPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEV 325
Query: 669 YIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPD 728
Y+ + P+ T N L+ GY GM A+ + + NG +PD
Sbjct: 326 YLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNG------------FVPD 385
Query: 729 IYTFNTMLDASFAEKRWDDFSHFYNQ---------MLLYGYHFNPKRHLRMIMEAAR 775
+T+ +++D E + +N+ ++LY + MI+EAA+
Sbjct: 386 QFTYRSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQ 412
BLAST of Cp4.1LG20g05810 vs. Swiss-Prot
Match:
PPR91_ARATH (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana GN=At1g62670 PE=3 SV=2)
HSP 1 Score: 109.0 bits (271), Expect = 2.8e-22
Identity = 95/447 (21.25%), Postives = 189/447 (42.28%), Query Frame = 1
Query: 344 KMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVL 403
K + R ++ L+ +D +F +M++S S +++ + K+ + V+
Sbjct: 43 KTSYDYREKLSRNGLSELKLDDAVALFGEMVKSRPFP-SIIEFSKLLSAIAKMNKFDVVI 102
Query: 404 QVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSI 463
+ E +Q +H + Y+ ++ + + AL V M + P++V S+
Sbjct: 103 SLGEQMQNLGIPHNH---YTYSILINCFCRRSQLPLALAVLGKMMK-LGYEPNIVTLSSL 162
Query: 464 AVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNW 523
+ + E ++D M F TG QP+ V +N +++
Sbjct: 163 LNGYCHSKRISEAVALVDQM-------FVTG--------YQPNTVTFNTLIHGLFLHNKA 222
Query: 524 EGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSI-PNALTYKVL 583
A ++ + +G QP TYG+V+ + + G +L K+++ + P L Y +
Sbjct: 223 SEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTI 282
Query: 584 VNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN 643
++ L K D+A+ + ME +GI + Y CLC+ GR +A + + +
Sbjct: 283 IDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKI 342
Query: 644 KPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KAFCSPNLVTCNILLKGYLDHGMFDEAKE 703
P V T++ LI A + L A +++ M K P++VT + L+ G+ H DEAK+
Sbjct: 343 NPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQ 402
Query: 704 LFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFN 763
+F+ M PD+ T+NT++ KR ++ + +M G N
Sbjct: 403 MFEFMVSK------------HCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGN 457
Query: 764 PKRHLRMIMEAARGGKDELLETTWKHL 789
+ +I + G ++ + +K +
Sbjct: 463 TVTYNILIQGLFQAGDCDMAQEIFKEM 457
BLAST of Cp4.1LG20g05810 vs. Swiss-Prot
Match:
PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)
HSP 1 Score: 108.2 bits (269), Expect = 4.7e-22
Identity = 82/325 (25.23%), Postives = 134/325 (41.23%), Query Frame = 1
Query: 431 LGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKK 490
LGK R+ + L + E + PD++ Y+ + +AG + V+D M
Sbjct: 150 LGKTRKAAKILEIL----EGSGAVPDVITYNVMISGYCKAGEINNALSVLDRMS------ 209
Query: 491 FKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVME 550
+ PD+V YN +L + + A VL + ++ P TY +++E
Sbjct: 210 ------------VSPDVVTYNTILRSLCDSGKLKQAMEVLDRMLQRDCYPDVITYTILIE 269
Query: 551 VMLQCGKYNLVHEFFRKVQ-KSSIPNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVG 610
+ + +++ + P+ +TY VLVN + KEG+ DEA+ + M G
Sbjct: 270 ATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSSGCQP 329
Query: 611 SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIF 670
+ + R +CS GR +A + + + P VVT+ LI L A+ I
Sbjct: 330 NVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVVTFNILINFLCRKGLLGRAIDIL 389
Query: 671 NHMKAF-CSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIY 730
M C PN ++ N LL G+ D A E + M G PDI
Sbjct: 390 EKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERMVSRG------------CYPDIV 440
Query: 731 TFNTMLDASFAEKRWDDFSHFYNQM 754
T+NTML A + + +D NQ+
Sbjct: 450 TYNTMLTALCKDGKVEDAVEILNQL 440
BLAST of Cp4.1LG20g05810 vs. TrEMBL
Match:
A0A0A0LVN7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G553530 PE=4 SV=1)
HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 756/907 (83.35%), Postives = 810/907 (89.31%), Query Frame = 1
Query: 1 MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
MVGVIMAN NLCIP CE GFP LHCT NSH SFF SSVSG+ + AK+RVLRH
Sbjct: 1 MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60
Query: 61 RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
R HKCG+IKA S GESDI L SGNLLE+DFQFKPSFDEYV+VME+VR+RRYKRQ DDPNK
Sbjct: 61 RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
Query: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKKKVICVDQEDLFDNSERITRKID 180
MKEN SAKSAESTSIS I VTDVQ N+DVK VD++DLF+N+ERI + D
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180
Query: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKR 240
LSGNKFD +RK VTRS D++KGK+TPF S VNDKQH EKRN NWS+YIEP+VTRSN K
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240
Query: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWA--DDDTKRTKDVLKVGKYGVQLEGNYIPG 300
+HFKANTL+VK ES V G+SMK SEKIWA DDD K K VLK GKYG+QLE +Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300
Query: 301 DKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEER 360
DKVGRKKTEQSYRG S SGK+F EF E++SLEVEHAAFN+ DA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
Query: 361 IQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQ 420
IQMLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
Query: 481 GYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL 540
GYMRELFDVIDSMRSPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
Query: 541 QELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
QELK+Q LQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
Query: 601 KTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
KTDEAVLAI+ ME RGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
Query: 661 GLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENG 720
GLIQACLDSK+LQSAVYIFNHMKAFCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE
Sbjct: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
Query: 721 RNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIM 780
RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRMI+
Sbjct: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
Query: 781 EAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHH 840
EAARGGKDELLETTWKHLAQADRT PPPL+KERFC+ LARGDYSEALS I H+S D HH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
Query: 841 FSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVAD 898
FS+SAWLNLLKEKRFP+D+VIELIHKV M+L RN+SPNPV +NLLLS KEFCR+RIS+AD
Sbjct: 841 FSESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
BLAST of Cp4.1LG20g05810 vs. TrEMBL
Match:
M5WJN1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001195mg PE=4 SV=1)
HSP 1 Score: 916.0 bits (2366), Expect = 3.6e-263
Identity = 503/912 (55.15%), Postives = 635/912 (69.63%), Query Frame = 1
Query: 6 MANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRHRG--- 65
M NA L + + N +C+ L GFS F + GL + K ++RG
Sbjct: 1 MTNAQLGVSNFQRNDIFVANCSSKPGPLSGFSLFRRPIFCVGLYEKNVK----KNRGFGI 60
Query: 66 ---HKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSR--RYKRQSDD 125
++ I A SK SD R G +LE +F+FKPSFD+Y++VM +VR R R K+ S
Sbjct: 61 KIPNRRTVISAVSKEGSDNRSVGGEILEKEFEFKPSFDQYLKVMGTVRLRSDRDKQDSSK 120
Query: 126 PNKMKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKI---DLS 185
K N ++ + +S +GN + K + + + N E+ ++ +
Sbjct: 121 EQNPKHNLRSRGVSRSLVS------EGNEEHVK----LGESEEHSNQEKASKAAKQNEAL 180
Query: 186 GNKFD----SKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGN--WSNYIEPKVTRSN 245
GN+ SKR+GV KDE + + D + K E R+G +S +EP+
Sbjct: 181 GNRNGIMGKSKRQGVKGFKDEYDSRQSNRDEKEKKKIRGEARDGRSKYSGRLEPE----- 240
Query: 246 HDKRLHFKANTLDVKSESHGVR-YGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEG-- 305
L+F+ + ++ +R Y S+ K ++ GK GV+++G
Sbjct: 241 ----LNFRGKSTMARNVKDDLRVYKSTDKSFDR----------------GKVGVKIQGGL 300
Query: 306 --NYIPGDKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDA-EDIMDKPRV 365
N+I + + + L+KSG+ F + ++S+EVE AAF + D DIMDKPRV
Sbjct: 301 ERNHINAENATDRGFSRRSEKLTKSGRDFPKKNYDNSMEVERAAFKNFDEFGDIMDKPRV 360
Query: 366 SKMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRV 425
S+MEMEERIQ L+K LNGADIDMPEWMF++MMRSA+IR++DHSILRVIQ+LGKLGNW+RV
Sbjct: 361 SQMEMEERIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKLGNWRRV 420
Query: 426 LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHS 485
LQVIEWLQMRERFKSHKLR+IYTTALDVLGKARRPVEALNVFHAM + SSYPDLVAYHS
Sbjct: 421 LQVIEWLQMRERFKSHKLRYIYTTALDVLGKARRPVEALNVFHAMLQEMSSYPDLVAYHS 480
Query: 486 IAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN 545
IAVTLGQAG+MRELFDVID+MRSPPKKKFKTGAL KWDPRL+PDIV+++AVLNACV+RK
Sbjct: 481 IAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDIVVFHAVLNACVQRKQ 540
Query: 546 WEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVL 605
WEGAFWVLQ+L++QGLQP+ TTYGLVMEVML CGKYNLVHEFF+KVQKSSIPNALT++V+
Sbjct: 541 WEGAFWVLQQLQQQGLQPAATTYGLVMEVMLACGKYNLVHEFFKKVQKSSIPNALTFRVI 600
Query: 606 VNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN 665
VNTLW+EGK EAVL +Q ME+RGIVGSAALYYDFARCLCSAGRC+EALMQ+EKICKVAN
Sbjct: 601 VNTLWREGKVGEAVLVVQNMERRGIVGSAALYYDFARCLCSAGRCQEALMQIEKICKVAN 660
Query: 666 KPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKEL 725
KPLVVTYTGLIQACLD+ ++++ Y+F M+ FCSPNLVTCN +LKGYLDHGMF+EAKEL
Sbjct: 661 KPLVVTYTGLIQACLDAGSIKNGAYVFKQMENFCSPNLVTCNTMLKGYLDHGMFEEAKEL 720
Query: 726 FQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNP 785
F M +NG NIS+ SD + RV PD YTFNT+LDA EKRWDDF Y ML +GYHFN
Sbjct: 721 FLKMLDNGNNISSKSDCKARVKPDSYTFNTLLDACITEKRWDDFEFVYKMMLHHGYHFNA 780
Query: 786 KRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISK 845
KRHLRMI++A GK ELL+ TW HL +A R+ PPPLIKERFC L + DY+ AL+CI+
Sbjct: 781 KRHLRMILDACEAGKGELLDITWTHLTEAGRSPPPPLIKERFCTKLEKDDYAAALTCITD 840
Query: 846 HHSSD-EHHFSKSAWLNLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGK 892
+ S+ + FSK+AWL L KE ++F KD+ + L+H+ S+L+ R D NPV QNL+ +
Sbjct: 841 PNLSELQTFFSKNAWLKLFKENAEKFQKDTFVRLVHEGSILINRTDRSNPVFQNLMAACG 873
BLAST of Cp4.1LG20g05810 vs. TrEMBL
Match:
W9RFN3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_025948 PE=4 SV=1)
HSP 1 Score: 896.0 bits (2314), Expect = 3.9e-257
Identity = 488/911 (53.57%), Postives = 618/911 (67.84%), Query Frame = 1
Query: 1 MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
M G+I N L + GNG A C Q S GFS G GLN +++
Sbjct: 1 MAGMIATNGKLGVSSFHGNGVFASKCRQTSFSSCGFSLIRRPNFGIGLN--------VKN 60
Query: 61 RGHKCGAI-KASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPN 120
R CG + +A S G SD +L G+LLE +F+FKPSFD+Y++VMESVR+ R K+Q N
Sbjct: 61 RRRNCGTVTRAGSNGGSDSKLVGGSLLEKEFEFKPSFDDYLKVMESVRTVRDKKQKSTHN 120
Query: 121 KMKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFD 180
+ S + ES + + +D K + VD+++ F + + + +K +
Sbjct: 121 LRETFLSEGNEESVRLGKS----EERLDRGKALDFVDKDESFKSRDGVKKK--------E 180
Query: 181 SKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANT 240
S+RK +T K +G + + K WS + ++ + +
Sbjct: 181 SQRKKITELKGRFEGTENNWTGRGKRKPVRSLTGRKWSKQQTREEDAEANNYNIDMRREH 240
Query: 241 LDVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQL-EGNYIPGDKVGRKKT 300
D + S R + + + IW D + + G + E N I +KV K
Sbjct: 241 EDKANSS---RVLGNKRSDDSIWNDGSMAKAGVREETGVVNNKWRERNRIQDNKVIDKDI 300
Query: 301 EQSYRGLSKSGKQFHEFTEESSLEVEHAAF-NSCDAEDIMDKPRVSKMEMEERIQMLSKR 360
+ +++ + ++ SL E AAF N D DI+ KPR+ +MEM+ERIQ L+
Sbjct: 301 VPKHGRINRRTE-----VDDKSLREERAAFRNFDDYNDILGKPRLPRMEMDERIQKLAMS 360
Query: 361 LNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKS 420
LNGAD+DMPEWMF++MMRSA+I ++DHSI RVIQ+LGK GNW+RV+QVIEWLQ+RERFKS
Sbjct: 361 LNGADVDMPEWMFSKMMRSARIIFTDHSISRVIQILGKFGNWRRVVQVIEWLQIRERFKS 420
Query: 421 HKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELF 480
HKLR+IYTTAL+VLGKARRPVEALNVF+AM +H SSYPDLVAYHSIAVTLGQAGYM+ELF
Sbjct: 421 HKLRYIYTTALNVLGKARRPVEALNVFNAMLQHMSSYPDLVAYHSIAVTLGQAGYMKELF 480
Query: 481 DVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQG 540
DVID+MRSPPKKKFKTGAL KWDPR++PDI++YNAVLNACV+RK WEGAFWVLQ+LKE+
Sbjct: 481 DVIDTMRSPPKKKFKTGALGKWDPRVEPDIIMYNAVLNACVQRKQWEGAFWVLQQLKEKA 540
Query: 541 LQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVL 600
L PS TTYGLVMEVML CGKYNLVH+FFRKVQKSSIPNALTY+VL+NTL KEGK DEAVL
Sbjct: 541 LNPSVTTYGLVMEVMLVCGKYNLVHDFFRKVQKSSIPNALTYRVLLNTLSKEGKLDEAVL 600
Query: 601 AIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACL 660
A+Q MEKRGIVGSAALYYD ARCLCSAGRC+EALMQ++KICKVA+KPLVVTYTGLIQACL
Sbjct: 601 AVQNMEKRGIVGSAALYYDLARCLCSAGRCQEALMQIDKICKVASKPLVVTYTGLIQACL 660
Query: 661 DSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVS 720
DS N++ YIFNHMK FCS NLVTCNI+LKGYL HG F EAKELF+ M ++ I + +
Sbjct: 661 DSGNIEDGAYIFNHMKDFCSRNLVTCNIMLKGYLKHGKFKEAKELFEKMLQDASLIKSKA 720
Query: 721 DYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGGK 780
D++ V PDIYTFNTM DA EK+WDDF + Y +ML +GYHFN KRHL+MI+ A+R GK
Sbjct: 721 DHKALVAPDIYTFNTMFDACITEKKWDDFEYAYKKMLHHGYHFNAKRHLQMILNASRVGK 780
Query: 781 DELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAWL 840
ELL+ TW HL +ADR P LIKE+FC+ L + DY ALSCI + S+ FSK AW
Sbjct: 781 GELLDITWNHLVEADRIPPSSLIKEKFCMKLEKEDYIAALSCICNQNLSESREFSKKAWS 840
Query: 841 NLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLEE 900
LL E +RF K +++ LI ++ ++AR+D P+ VL NLL+S KE R+ + VAD L E
Sbjct: 841 KLLDENSERFRKGTLVRLIREIDNIIARSDQPDSVLVNLLVSCKELSRTCV-VADVELTE 882
Query: 901 VVCTNEFQSAA 907
T + A+
Sbjct: 901 TFTTLQTDPAS 882
BLAST of Cp4.1LG20g05810 vs. TrEMBL
Match:
B9T6B9_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0237710 PE=4 SV=1)
HSP 1 Score: 891.0 bits (2301), Expect = 1.3e-255
Identity = 475/861 (55.17%), Postives = 612/861 (71.08%), Query Frame = 1
Query: 68 IKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKMKENASA 127
IKA S G+SD RL G +LE + +FKPSFDEY++ MESV++ K+ + + K +
Sbjct: 10 IKALSSGDSDNRLVGGGILEKELEFKPSFDEYLKAMESVKTGITKKHTRKLSGNKVKDDS 69
Query: 128 KSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSE-RITRKIDLSGNKFDSKRKGVT 187
K TS+ T+ +G + K + ++L +N + I RK + S + K +G+
Sbjct: 70 KEGSRTSVGK--TEWRGKLKFK------ENDELGENEDGEIDRKDETSSKIY--KERGIR 129
Query: 188 RSKDELKGKVTPFDSQVNDKQHVEKRNGNWSN--------------YIEPKVTRSNHDKR 247
S ++ GK + + V K R+ W N ++ K T++ ++
Sbjct: 130 ESNLKVTGKESRAYANVKRKIRGATRDREWLNNGTSSMITELEDINQVKVKRTQNVQERT 189
Query: 248 LHFKA-----NTLDVKSE-SHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGN 307
L +T K E ++G + ++ K ++ D + K G +L N
Sbjct: 190 LAIDGVRRSQSTTGKKEEFAYGQNFPEMLRRKGKTHIGEE-----DGVSGNKMGGRLVRN 249
Query: 308 YIPGDKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDA-EDIMDKPRVSKM 367
Y+ DK K+ + + ++ + F ++ E EVE AAF S + + +P+ SK
Sbjct: 250 YVQIDKNTDKEFMEKKGLIRRTNQAFLDYGHEDDSEVERAAFKSLEEYNNFTGRPQNSKR 309
Query: 368 EMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQV 427
E+E+R+Q L+K LNGADIDMPEWMF++MMRSA+I+Y+DHS+LR+IQ+LGKLGNW+RVLQV
Sbjct: 310 EVEDRLQKLAKCLNGADIDMPEWMFSKMMRSARIKYTDHSVLRIIQILGKLGNWRRVLQV 369
Query: 428 IEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAV 487
IEWLQMRERFKSH+LR IYTTAL+VLGKA+RPVEALNVFH MQ+ SSYPDLVAYH IAV
Sbjct: 370 IEWLQMRERFKSHRLRNIYTTALNVLGKAQRPVEALNVFHVMQQQMSSYPDLVAYHCIAV 429
Query: 488 TLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEG 547
TLGQAG+M +LFDVIDSMRSPPKKKFK A+ KWDPRL+PDIV+YNAVLNACV+RK WEG
Sbjct: 430 TLGQAGHMEQLFDVIDSMRSPPKKKFKMAAVHKWDPRLEPDIVVYNAVLNACVQRKQWEG 489
Query: 548 AFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNT 607
AFWVLQ+LK+QGLQPSTTTYGL+MEVM CGKYNLVHEFFRKVQKSSIPNAL YKVLVNT
Sbjct: 490 AFWVLQQLKQQGLQPSTTTYGLIMEVMFACGKYNLVHEFFRKVQKSSIPNALVYKVLVNT 549
Query: 608 LWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPL 667
LW+EGKTDEAVLA++ ME+RGIVG AALYYD ARCLCSAGRC+EAL+Q+EKIC+VANKPL
Sbjct: 550 LWREGKTDEAVLAVEEMERRGIVGFAALYYDLARCLCSAGRCQEALLQIEKICRVANKPL 609
Query: 668 VVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQN 727
VVTYTGLIQACLDS N+ +AVYIFN MK FCSPNLVT N++LK Y +HG+F++AKELF
Sbjct: 610 VVTYTGLIQACLDSGNIHNAVYIFNQMKHFCSPNLVTFNVMLKAYFEHGLFEDAKELFHK 669
Query: 728 MSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRH 787
M+E+ +I DY+ RV+PDIYTFNTMLDA +EK WDDF + Y +ML +G+HFN KRH
Sbjct: 670 MTEDSNHIRGNHDYKVRVIPDIYTFNTMLDACISEKSWDDFEYVYRRMLHHGFHFNGKRH 729
Query: 788 LRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHS 847
LRMI++A+R GK E LE TWKHLA+ADR PP LIKERF IML + D AL+CI+ +
Sbjct: 730 LRMILDASRAGKVEPLEMTWKHLARADRIPPPNLIKERFRIMLEKDDCKSALACITTNPM 789
Query: 848 SDEHHFSKSAWLNLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCR 905
+ F K AWLNL KE ++ +D++I+L H+VSML+ + P+PVLQNLL S +F
Sbjct: 790 GESPAFHKVAWLNLFKENAEQIRRDTLIQLKHEVSMLV---NPPDPVLQNLLASCNDFLN 849
BLAST of Cp4.1LG20g05810 vs. TrEMBL
Match:
A0A061FSP7_THECC (Pentatricopeptide repeat-containing protein isoform 1 OS=Theobroma cacao GN=TCM_042369 PE=4 SV=1)
HSP 1 Score: 890.2 bits (2299), Expect = 2.1e-255
Identity = 473/847 (55.84%), Postives = 597/847 (70.48%), Query Frame = 1
Query: 65 CG-AIKASSKGESDIRL----ASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPN 124
CG A K SSK + L + G +LE + FKPSFDEY++ MESVR ++ +S+ N
Sbjct: 41 CGVASKNSSKKKWSFALRVVDSGGGILEKELDFKPSFDEYLKTMESVREKKQSLKSNRGN 100
Query: 125 KMKENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFD 184
++++ KS + D F E++++ ++ + K
Sbjct: 101 SIEKSNRGKSKD------------------------DSRRKFGEEEKVSKVVEHNEVKMK 160
Query: 185 SKRKGVTRSKDEL--KGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKA 244
SK TRS+ L KG+ ++ ++ ++ E G+ +P+V+R
Sbjct: 161 SKEATRTRSRKALLVKGEDDDLKAETDEYKNFE---GSNDVVDKPQVSR----------- 220
Query: 245 NTLDVKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKK 304
+K E + + K K +D+ R ++K G++ +++ + I K
Sbjct: 221 ----IKMEGRITKLANLGKYDSKSKSDEGDVR---LMKFGEFSEEVKMSKIV--KWNGVN 280
Query: 305 TEQSYRGLSKSGKQFHEFTEESSLEVEHAAF-NSCDAEDIMDKPRVSKMEMEERIQMLSK 364
T ++S K F E E+ L +E +AF N ++ D+ DKPR SKMEMEER+Q L+K
Sbjct: 281 TMNEGARRTRSRKAFLEEDEDDDLRMERSAFKNFEESNDVFDKPRASKMEMEERVQRLAK 340
Query: 365 RLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFK 424
LNGADIDMPEWMF++MMRSAKI+++D+ ILRVIQ LGKLGNW+RVLQVIEWLQMRERFK
Sbjct: 341 SLNGADIDMPEWMFSKMMRSAKIKFTDYCILRVIQALGKLGNWRRVLQVIEWLQMRERFK 400
Query: 425 SHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMREL 484
S++LR IYTTALDVLGKARRPVEALN+FH+MQ+ +SYPD+VAYHSIAVTLGQAG+MREL
Sbjct: 401 SYRLRHIYTTALDVLGKARRPVEALNIFHSMQQQMASYPDIVAYHSIAVTLGQAGHMREL 460
Query: 485 FDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQ 544
F VIDSMRSPPKKKFKT + KWDPRL+PDIV+YNAVLNAC +RK WEGAFWVLQ+LK+Q
Sbjct: 461 FHVIDSMRSPPKKKFKTRIIGKWDPRLEPDIVVYNAVLNACAQRKQWEGAFWVLQQLKQQ 520
Query: 545 GLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAV 604
LQ S TTYGLVMEVM CGKYNLVHEFFRK++KSS+PNALTY+VLVNTLWKEGK D+AV
Sbjct: 521 HLQLSATTYGLVMEVMFACGKYNLVHEFFRKIEKSSMPNALTYRVLVNTLWKEGKIDDAV 580
Query: 605 LAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQAC 664
LA+Q MEKRGIVGSAALYYD ARCLCS+GRC+EALMQ+EKICKVA+KPLVVTYTGLIQAC
Sbjct: 581 LAVQGMEKRGIVGSAALYYDLARCLCSSGRCQEALMQIEKICKVASKPLVVTYTGLIQAC 640
Query: 665 LDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAV 724
LDS N+Q+ YIFN M+ FCSPNLVTCNI+LK YLDH +FD+AK+LFQ M E+ IS+
Sbjct: 641 LDSGNIQNGAYIFNEMQNFCSPNLVTCNIMLKAYLDHRLFDQAKDLFQKMLEDANQISSK 700
Query: 725 SDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIMEAARGG 784
SDY RV+PD YTFN MLDA +KRWD+F Y +ML + +HFN KRHL MI++AAR G
Sbjct: 701 SDYLHRVIPDSYTFNIMLDACVQQKRWDEFERVYRKMLHHEFHFNAKRHLHMILDAARAG 760
Query: 785 KDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHHFSKSAW 844
K EL+ETTW+H+A+ADRT P PLIKERFC+ L + DY ALSCI+ H + FSKSAW
Sbjct: 761 KGELIETTWEHMARADRTPPLPLIKERFCMKLEKNDYISALSCITIHPLRELQAFSKSAW 820
Query: 845 LNLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVADPRLE 902
N K+ RF KD ++ L+ +V +L R+DSPNP+L NLL S KEF R+ + AD L
Sbjct: 821 SNFFKDNASRFRKDIIVGLVDEVENILGRSDSPNPILHNLLTSSKEFLRTHWTSADANLT 840
BLAST of Cp4.1LG20g05810 vs. TAIR10
Match:
AT1G30610.1 (AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein)
HSP 1 Score: 728.4 bits (1879), Expect = 5.4e-210
Identity = 414/861 (48.08%), Postives = 560/861 (65.04%), Query Frame = 1
Query: 62 GHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKM 121
G A+K S GES + + + F+ + S EY R ++ R + D+ + +
Sbjct: 172 GESSVALKLSKSGESSVTVPE----DESFRKRYSKQEYHRSSDTSRGIERGSRGDELDLV 231
Query: 122 KENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDSK 181
E + I D + + ++ + V + ++S +T D S + SK
Sbjct: 232 --------VEERRVQRIAKDARWSKS-RESSVAVKWSNSGESS--VTMPKDESFRRRYSK 291
Query: 182 RKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKRLHFKANTLD 241
++ RS D +G S+ ++ + V + E +V R D R +L
Sbjct: 292 QEH-HRSSDTSRGIAR--GSKGDELELVVE---------ERRVQRIAKDVRWSKSDESLV 351
Query: 242 VKSESHGVRYGSSMKISEKIWADDDTKRTKDVLKVGKYGVQLEGNYIPGDKVGRKKTE-- 301
SE R G+ + + DT R + G G+ L +++ ++ E
Sbjct: 352 PVSEDESFRRGNPKQEMVRYQRVSDTSRGIERGSKGD-GLDLLAEERRIERLANERHEIR 411
Query: 302 -QSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCD-AEDIMDKPRVSKMEMEERIQMLSKR 361
G + G + ++ ++S +E AF D + DI+DKP S++EME+RI+ L+K
Sbjct: 412 SSKLSGTRRIGAKRNDDDDDSLFAMETPAFRFSDESSDIVDKPATSRVEMEDRIEKLAKV 471
Query: 362 LNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKS 421
LNGADI+MPEW F++ +RSAKIRY+D++++R+I LGKLGNW+RVLQVIEWLQ ++R+KS
Sbjct: 472 LNGADINMPEWQFSKAIRSAKIRYTDYTVMRLIHFLGKLGNWRRVLQVIEWLQRQDRYKS 531
Query: 422 HKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELF 481
+K+R IYTTAL+VLGK+RRPVEALNVFHAM SSYPD+VAY SIAVTLGQAG+++ELF
Sbjct: 532 NKIRIIYTTALNVLGKSRRPVEALNVFHAMLLQISSYPDMVAYRSIAVTLGQAGHIKELF 591
Query: 482 DVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQG 541
VID+MRSPPKKKFK LEKWDPRL+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G
Sbjct: 592 YVIDTMRSPPKKKFKPTTLEKWDPRLEPDVVVYNAVLNACVQRKQWEGAFWVLQQLKQRG 651
Query: 542 LQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEGKTDEAVL 601
+PS TYGL+MEVML C KYNLVHEFFRK+QKSSIPNAL Y+VLVNTLWKEGK+DEAV
Sbjct: 652 QKPSPVTYGLIMEVMLACEKYNLVHEFFRKMQKSSIPNALAYRVLVNTLWKEGKSDEAVH 711
Query: 602 AIQTMEKRGIVGSAALYYDFARCLCSAGRCKEAL-------------------------- 661
++ ME RGIVGSAALYYD ARCLCSAGRC E L
Sbjct: 712 TVEDMESRGIVGSAALYYDLARCLCSAGRCNEGLNMVNFVNPVVLKLIENLIYKADLVHT 771
Query: 662 --MQMEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKG 721
Q++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK CSPNLVTCNI+LK
Sbjct: 772 IQFQLKKICRVANKPLVVTYTGLIQACVDSGNIKNAAYIFDQMKKVCSPNLVTCNIMLKA 831
Query: 722 YLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHF 781
YL G+F+EA+ELFQ MSE+G +I SD+ RVLPD YTFNTMLD +++WDDF +
Sbjct: 832 YLQGGLFEEARELFQKMSEDGNHIKNSSDFESRVLPDTYTFNTMLDTCAEQEKWDDFGYA 891
Query: 782 YNQMLLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLA 841
Y +ML +GYHFN KRHLRM++EA+R GK+E++E TW+H+ +++R P PLIKERF L
Sbjct: 892 YREMLRHGYHFNAKRHLRMVLEASRAGKEEVMEATWEHMRRSNRIPPSPLIKERFFRKLE 951
Query: 842 RGDYSEALSCIS----KHHSSDEHHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLL-ARN 886
+GD+ A+S ++ K ++ FS SAW +L RF +DSV+ L+ V+ L +R+
Sbjct: 952 KGDHISAISSLADLNGKIEETELRAFSTSAWSRVL--SRFEQDSVLRLMDDVNRRLGSRS 1002
BLAST of Cp4.1LG20g05810 vs. TAIR10
Match:
AT5G67570.1 (AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 410.2 bits (1053), Expect = 3.3e-114
Identity = 221/557 (39.68%), Postives = 339/557 (60.86%), Query Frame = 1
Query: 349 ERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEW 408
E +++L RL+G +I+ W F +MM + +++++ +L+++ LG+ +WK+ V+ W
Sbjct: 183 EAVRVLVDRLSGREINEKHWKFVRMMNQSGLQFTEDQMLKIVDRLGRKQSWKQASAVVHW 242
Query: 409 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 468
+ ++ K + RF+YT L VLG ARRP EAL +F+ M YPD+ AYH IAVTLG
Sbjct: 243 VYSDKKRKHLRSRFVYTKLLSVLGFARRPQEALQIFNQMLGDRQLYPDMAAYHCIAVTLG 302
Query: 469 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFW 528
QAG ++EL VI+ MR P K K + WDP L+PD+V+YNA+LNACV W+ W
Sbjct: 303 QAGLLKELLKVIERMRQKPTKLTKNLRQKNWDPVLEPDLVVYNAILNACVPTLQWKAVSW 362
Query: 529 VLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKS-SIPNALTYKVLVNTLW 588
V EL++ GL+P+ TYGL MEVML+ GK++ VH+FFRK++ S P A+TYKVLV LW
Sbjct: 363 VFVELRKNGLRPNGATYGLAMEVMLESGKFDRVHDFFRKMKSSGEAPKAITYKVLVRALW 422
Query: 589 KEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV 648
+EGK +EAV A++ ME++G++G+ ++YY+ A CLC+ GR +A++++ ++ ++ N +PL
Sbjct: 423 REGKIEEAVEAVRDMEQKGVIGTGSVYYELACCLCNNGRWCDAMLEVGRMKRLENCRPLE 482
Query: 649 VTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNM 708
+T+TGLI A L+ ++ + IF +MK C PN+ T N++LK Y + MF EAKELF+ +
Sbjct: 483 ITFTGLIAASLNGGHVDDCMAIFQYMKDKCDPNIGTANMMLKVYGRNDMFSEAKELFEEI 542
Query: 709 SENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHL 768
VS ++P+ YT++ ML+AS +W+ F H Y M+L GY + +H
Sbjct: 543 ---------VSRKETHLVPNEYTYSFMLEASARSLQWEYFEHVYQTMVLSGYQMDQTKHA 602
Query: 769 RMIMEAARGGKDELLETTWKHLAQADRTLPPPL-IKERFCIMLARGDYSEALSCISKHHS 828
M++EA+R GK LLE + + + D +P PL E C A+GD+ A++ I+ +
Sbjct: 603 SMLIEASRAGKWSLLEHAFDAVLE-DGEIPHPLFFTELLCHATAKGDFQRAITLINT-VA 662
Query: 829 SDEHHFSKSAWLNLLKEKR--FPKDSVIELIHKVSMLLARND-SPNPVLQNLLLSGKEFC 888
S+ W +L +E + +D+ +HK+S L D P + NL S K C
Sbjct: 663 LASFQISEEEWTDLFEEHQDWLTQDN----LHKLSDHLIECDYVSEPTVSNLSKSLKSRC 722
Query: 889 RSRISVADPRLEEVVCT 900
S S A P L V T
Sbjct: 723 GSSSSSAQPLLAVDVTT 724
BLAST of Cp4.1LG20g05810 vs. TAIR10
Match:
AT1G74580.1 (AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 110.9 bits (276), Expect = 4.1e-24
Identity = 98/417 (23.50%), Postives = 176/417 (42.21%), Query Frame = 1
Query: 369 MFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQMRERFKSHKLRFIYTTAL 428
MF M + +++ + VI+ LG G ++ + +V+ + MRE +H L +Y A+
Sbjct: 26 MFNSMRKEVGFKHTLSTYRSVIEKLGYYGKFEAMEEVL--VDMRENVGNHMLEGVYVGAM 85
Query: 429 DVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPK 488
G+ + EA+NVF M + + P + +Y++I L +GY + V MR
Sbjct: 86 KNYGRKGKVQEAVNVFERM-DFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMR---- 145
Query: 489 KKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLV 548
D + PD+ + + + K A +L + QG + + Y V
Sbjct: 146 -----------DRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTV 205
Query: 549 MEVMLQCGKYNLVHEFFRKVQKSSIPNAL-TYKVLVNTLWKEGKTDEAVLAIQTMEKRGI 608
+ + +E F K+ S + L T+ L+ L K+G E + + KRG+
Sbjct: 206 VGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGV 265
Query: 609 VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQA-CLDSKNLQSAV 668
+ + Y F + LC G A+ + + + KP V+TY LI C +SK ++ V
Sbjct: 266 LPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEV 325
Query: 669 YIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYRDRVLPD 728
Y+ + P+ T N L+ GY GM A+ + + NG +PD
Sbjct: 326 YLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNG------------FVPD 385
Query: 729 IYTFNTMLDASFAEKRWDDFSHFYNQ---------MLLYGYHFNPKRHLRMIMEAAR 775
+T+ +++D E + +N+ ++LY + MI+EAA+
Sbjct: 386 QFTYRSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQ 412
BLAST of Cp4.1LG20g05810 vs. TAIR10
Match:
AT1G62670.1 (AT1G62670.1 rna processing factor 2)
HSP 1 Score: 109.0 bits (271), Expect = 1.6e-23
Identity = 95/447 (21.25%), Postives = 189/447 (42.28%), Query Frame = 1
Query: 344 KMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVL 403
K + R ++ L+ +D +F +M++S S +++ + K+ + V+
Sbjct: 43 KTSYDYREKLSRNGLSELKLDDAVALFGEMVKSRPFP-SIIEFSKLLSAIAKMNKFDVVI 102
Query: 404 QVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSI 463
+ E +Q +H + Y+ ++ + + AL V M + P++V S+
Sbjct: 103 SLGEQMQNLGIPHNH---YTYSILINCFCRRSQLPLALAVLGKMMK-LGYEPNIVTLSSL 162
Query: 464 AVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNW 523
+ + E ++D M F TG QP+ V +N +++
Sbjct: 163 LNGYCHSKRISEAVALVDQM-------FVTG--------YQPNTVTFNTLIHGLFLHNKA 222
Query: 524 EGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSI-PNALTYKVL 583
A ++ + +G QP TYG+V+ + + G +L K+++ + P L Y +
Sbjct: 223 SEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTI 282
Query: 584 VNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN 643
++ L K D+A+ + ME +GI + Y CLC+ GR +A + + +
Sbjct: 283 IDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKI 342
Query: 644 KPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KAFCSPNLVTCNILLKGYLDHGMFDEAKE 703
P V T++ LI A + L A +++ M K P++VT + L+ G+ H DEAK+
Sbjct: 343 NPDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQ 402
Query: 704 LFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFN 763
+F+ M PD+ T+NT++ KR ++ + +M G N
Sbjct: 403 MFEFMVSK------------HCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGN 457
Query: 764 PKRHLRMIMEAARGGKDELLETTWKHL 789
+ +I + G ++ + +K +
Sbjct: 463 TVTYNILIQGLFQAGDCDMAQEIFKEM 457
BLAST of Cp4.1LG20g05810 vs. TAIR10
Match:
AT5G16640.1 (AT5G16640.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 108.2 bits (269), Expect = 2.7e-23
Identity = 75/314 (23.89%), Postives = 142/314 (45.22%), Query Frame = 1
Query: 423 IYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDS 482
IY T +D L K+++ AL++ + M++ PD+V Y+S+ L +G + ++
Sbjct: 188 IYNTIIDGLCKSKQVDNALDLLNRMEKDGIG-PDVVTYNSLISGLCSSGRWSDATRMVSC 247
Query: 483 MRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKEQGLQPST 542
M + PD+ +NA+++ACVK A +E+ + L P
Sbjct: 248 MTKR---------------EIYPDVFTFNALIDACVKEGRVSEAEEFYEEMIRRSLDPDI 307
Query: 543 TTYGLVMEVMLQCGKYNLVHEFFR-KVQKSSIPNALTYKVLVNTLWKEGKTDEAVLAIQT 602
TY L++ + + + E F V K P+ +TY +L+N K K + +
Sbjct: 308 VTYSLLIYGLCMYSRLDEAEEMFGFMVSKGCFPDVVTYSILINGYCKSKKVEHGMKLFCE 367
Query: 603 MEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKN 662
M +RG+V + Y + C AG+ A ++ P ++TY L+ D+
Sbjct: 368 MSQRGVVRNTVTYTILIQGYCRAGKLNVAEEIFRRMVFCGVHPNIITYNVLLHGLCDNGK 427
Query: 663 LQSAVYIFNHM-KAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENGRNISAVSDYR 722
++ A+ I M K ++VT NI+++G G +A +++ +++ G
Sbjct: 428 IEKALVILADMQKNGMDADIVTYNIIIRGMCKAGEVADAWDIYCSLNCQG---------- 473
Query: 723 DRVLPDIYTFNTML 735
++PDI+T+ TM+
Sbjct: 488 --LMPDIWTYTTMM 473
BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match:
gi|778662053|ref|XP_004135752.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus])
HSP 1 Score: 1482.6 bits (3837), Expect = 0.0e+00
Identity = 756/907 (83.35%), Postives = 810/907 (89.31%), Query Frame = 1
Query: 1 MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
MVGVIMAN NLCIP CE GFP LHCT NSH SFF SSVSG+ + AK+RVLRH
Sbjct: 1 MVGVIMANLNLCIPNCERYGFPTLHCTHNSHNSFWVSFFPSSVSGTDSSLSDAKNRVLRH 60
Query: 61 RGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNK 120
R HKCG+IKA S GESDI L SGNLLE+DFQFKPSFDEYV+VME+VR+RRYKRQ DDPNK
Sbjct: 61 RVHKCGSIKALSNGESDISLPSGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDDPNK 120
Query: 121 --MKENASAKSAESTSISNI------VTDVQGNMDVKKKVICVDQEDLFDNSERITRKID 180
MKEN SAKSAESTSIS I VTDVQ N+DVK VD++DLF+N+ERI + D
Sbjct: 121 LTMKENGSAKSAESTSISKIDNGKNKVTDVQHNVDVKNMFKRVDKKDLFNNTERIAPEKD 180
Query: 181 LSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHDKR 240
LSGNKFD +RK VTRS D++KGK+TPF S VNDKQH EKRN NWS+YIEP+VTRSN K
Sbjct: 181 LSGNKFD-RRKVVTRSNDKVKGKMTPFGSLVNDKQHEEKRNENWSSYIEPRVTRSNSKKP 240
Query: 241 LHFKANTLDVKSESHGVRYGSSMKISEKIWA--DDDTKRTKDVLKVGKYGVQLEGNYIPG 300
+HFKANTL+VK ES V G+SMK SEKIWA DDD K K VLK GKYG+QLE +Y PG
Sbjct: 241 IHFKANTLEVKKESSRVSDGNSMKTSEKIWAWGDDDAKPAKGVLKAGKYGIQLERSYNPG 300
Query: 301 DKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEMEER 360
DKVGRKKTEQSYRG S SGK+F EF E++SLEVEHAAFN+ DA DIMDKPRVSKMEMEER
Sbjct: 301 DKVGRKKTEQSYRGTSTSGKRFLEFNEKNSLEVEHAAFNNFDAFDIMDKPRVSKMEMEER 360
Query: 361 IQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEWLQ 420
IQMLSKRLNGADIDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQ+IEWLQ
Sbjct: 361 IQMLSKRLNGADIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQIIEWLQ 420
Query: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA
Sbjct: 421 MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLGQA 480
Query: 481 GYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL 540
GYMRELFDVIDSMRSPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVL
Sbjct: 481 GYMRELFDVIDSMRSPPKKKFKTGVLEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFWVL 540
Query: 541 QELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
QELK+Q LQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG
Sbjct: 541 QELKKQSLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWKEG 600
Query: 601 KTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
KTDEAVLAI+ ME RGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT
Sbjct: 601 KTDEAVLAIENMEIRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYT 660
Query: 661 GLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSENG 720
GLIQACLDSK+LQSAVYIFNHMKAFCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE
Sbjct: 661 GLIQACLDSKDLQSAVYIFNHMKAFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEQR 720
Query: 721 RNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRMIM 780
RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRMI+
Sbjct: 721 RNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRMIL 780
Query: 781 EAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDEHH 840
EAARGGKDELLETTWKHLAQADRT PPPL+KERFC+ LARGDYSEALS I H+S D HH
Sbjct: 781 EAARGGKDELLETTWKHLAQADRTPPPPLLKERFCMKLARGDYSEALSSIWSHNSGDAHH 840
Query: 841 FSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISVAD 898
FS+SAWLNLLKEKRFP+D+VIELIHKV M+L RN+SPNPV +NLLLS KEFCR+RIS+AD
Sbjct: 841 FSESAWLNLLKEKRFPRDTVIELIHKVGMVLTRNESPNPVFKNLLLSCKEFCRTRISLAD 900
BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match:
gi|659118444|ref|XP_008459122.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis melo])
HSP 1 Score: 1479.2 bits (3828), Expect = 0.0e+00
Identity = 754/913 (82.58%), Postives = 811/913 (88.83%), Query Frame = 1
Query: 1 MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSG--LNSGSAKSRVL 60
MVGVIMAN NL IP CE GFP LHCT NSH SFF SSVSG G LN AK+RVL
Sbjct: 1 MVGVIMANVNLSIPNCERYGFPTLHCTHNSHTSFWVSFFPSSVSGGGTDLNFSDAKNRVL 60
Query: 61 RHRGHKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDP 120
RHR HKCG+IKA S GESDI L +GNLLE+DFQFKPSFDEYV+VME+VR+RRYKRQ D P
Sbjct: 61 RHRIHKCGSIKALSNGESDISLPNGNLLEHDFQFKPSFDEYVKVMETVRTRRYKRQLDYP 120
Query: 121 NK--MKENASAKSAESTSISNI------VTDVQGNMDVKKKVICVDQEDLFDNSERITRK 180
NK MKEN SAKSAESTSIS I VTDVQ N++VK VD++DLF+N+ERI R+
Sbjct: 121 NKLTMKENCSAKSAESTSISKIDNGKNKVTDVQHNVEVKNMFKRVDKKDLFNNTERIARE 180
Query: 181 IDLSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGNWSNYIEPKVTRSNHD 240
LSGNKFD + KGVTRS D++KGK+TPF S VNDKQH EK+NGNWS+YIEPKVTRSN +
Sbjct: 181 KHLSGNKFD-RSKGVTRSNDKVKGKMTPFGSLVNDKQHEEKKNGNWSSYIEPKVTRSNCE 240
Query: 241 KRLHFKANTLDVKSESHGVRYGSSMKISEKIWA--DDDTKRTKDVLKVGKYGVQLEGNYI 300
K +HFKAN L+ K E V YG+SMK SEKIWA +DD K KDVLK GKYG+QLE +Y
Sbjct: 241 KPIHFKANALEFKKEGSRVSYGNSMKTSEKIWAWGEDDAKPAKDVLKAGKYGIQLERSYS 300
Query: 301 PGDKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDAEDIMDKPRVSKMEME 360
PGDKVGRKKTEQSYRG S SGK+F EFTEE+SLEVEHAAFN+ DA DIMDKPRVSKMEME
Sbjct: 301 PGDKVGRKKTEQSYRGTSTSGKRFLEFTEENSLEVEHAAFNNFDALDIMDKPRVSKMEME 360
Query: 361 ERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGNWKRVLQVIEW 420
ERIQMLSKRLNGADIDMPEWMF+QMMR AKIRYSDHSILRVIQVLGKLGNW+RVLQVIEW
Sbjct: 361 ERIQMLSKRLNGADIDMPEWMFSQMMRGAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEW 420
Query: 421 LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480
LQMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG
Sbjct: 421 LQMRERFKSHKPRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLVAYHSIAVTLG 480
Query: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFW 540
QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN EGAFW
Sbjct: 481 QAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNLEGAFW 540
Query: 541 VLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
VLQELK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK
Sbjct: 541 VLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSIPNALTYKVLVNTLWK 600
Query: 601 EGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
EGKTDEAVLAI+ ME RG+VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT
Sbjct: 601 EGKTDEAVLAIENMEMRGVVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVT 660
Query: 661 YTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDEAKELFQNMSE 720
YTGLIQACLDSK+LQSAVY+FN MKAFCSPNLVT NILLKGYL+HGMF+EA+EL QN+SE
Sbjct: 661 YTGLIQACLDSKDLQSAVYVFNQMKAFCSPNLVTYNILLKGYLEHGMFEEARELLQNLSE 720
Query: 721 NGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGYHFNPKRHLRM 780
+NIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDFS+FYNQM LYGYHFNPKRHLRM
Sbjct: 721 QRQNISTVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFSYFYNQMFLYGYHFNPKRHLRM 780
Query: 781 IMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALSCISKHHSSDE 840
I+EAAR GKDELLETTWKHLAQADRT PPPL+KERFC+ +ARGDY+EAL CIS H+S D
Sbjct: 781 ILEAARVGKDELLETTWKHLAQADRTPPPPLLKERFCMKVARGDYTEALRCISNHNSGDA 840
Query: 841 HHFSKSAWLNLLKEKRFPKDSVIELIHKVSMLLARNDSPNPVLQNLLLSGKEFCRSRISV 900
HHFS+SAWLNLLKEKRFPKD+VIELIHKV M+ A N+SPNPV +NLLLS KEFCR+RISV
Sbjct: 841 HHFSESAWLNLLKEKRFPKDTVIELIHKVGMVFATNESPNPVFKNLLLSCKEFCRTRISV 900
Query: 901 ADPRLEEVVCTNE 902
AD RLEE V TNE
Sbjct: 901 ADHRLEETVHTNE 912
BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match:
gi|645238617|ref|XP_008225762.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Prunus mume])
HSP 1 Score: 942.2 bits (2434), Expect = 6.8e-271
Identity = 515/930 (55.38%), Postives = 647/930 (69.57%), Query Frame = 1
Query: 1 MVGVIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAKSRVLRH 60
MVG+IM NA L + + N A +C L GFS F + GL + K ++
Sbjct: 1 MVGMIMTNAQLGVSNFQRNDIFAANCISKPGPLSGFSLFRRPIFCVGLYEKNVK----KN 60
Query: 61 RG------HKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSR--RYK 120
RG ++ I A SK SD R G +LE +F+FKPSFD+Y++VM +VR R R K
Sbjct: 61 RGFGIKIPNRRTVISAVSKEGSDNRSVGGEILEKEFEFKPSFDQYLKVMGTVRLRSDRDK 120
Query: 121 RQSDDPNKMKENASAKSAESTSISN------IVTDVQGNMDVKKKVICVDQEDLFDNSER 180
+ S K N ++ + +S + + +G+ + +K Q + N
Sbjct: 121 QDSSKEQNPKHNLRSRGVSRSLVSEGNEEHVKLGESEGHSNQEKASKAAKQNEALGNRNG 180
Query: 181 ITRKIDLSGNKFDSKRKGVTRSKDELKGKVTPFDSQVNDKQHVEKRNGN--WSNYIEPKV 240
I K SKR+GV KDE + + D + K E R+G +S +EP+
Sbjct: 181 IMGK---------SKRQGVKGFKDEYDSRQSNRDEKEKKKIRGEARDGRSKYSGRLEPE- 240
Query: 241 TRSNHDKRLHFKANTLDVKSESHGVR-YGSSMKISEKIWADDDTKRTKDVLKVGKYGVQL 300
L+F+ + ++ +R Y S+ K E+ GK GV++
Sbjct: 241 --------LNFRGKSTMARNMKDDLRVYKSTDKSFER----------------GKVGVKI 300
Query: 301 EG----NYIPGDKVGRKKTEQSYRGLSKSGKQFHEFTEESSLEVEHAAFNSCDA-EDIMD 360
+G N+I +K + + L+KSG+ F + ++S++VE AAF + D DIMD
Sbjct: 301 QGGLERNHINAEKATDRGFSRRSEKLTKSGRDFPKKNYDNSMKVERAAFKNFDEFGDIMD 360
Query: 361 KPRVSKMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKLGN 420
KPRVS+MEMEERIQ L+K LNGADIDMPEWMF++MMRSA+IR++DHSILRVIQ+LGKLGN
Sbjct: 361 KPRVSQMEMEERIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKLGN 420
Query: 421 WKRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPDLV 480
W+RVLQVIEWLQMRERFKSHKLR+IYTTALDVLGKARRPVEALNVFHAM + SSYPDLV
Sbjct: 421 WRRVLQVIEWLQMRERFKSHKLRYIYTTALDVLGKARRPVEALNVFHAMLQEMSSYPDLV 480
Query: 481 AYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACV 540
AYHSIAVTLGQAG+MRELFDVID+MRSPPKKKFKTGAL KWDPRL+PDIV+++AVLNACV
Sbjct: 481 AYHSIAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDIVVFHAVLNACV 540
Query: 541 KRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNALT 600
+RK WEGAFWVLQ+L++QGLQP+TTTYGLVMEVML CGKYNLVH+FF+KVQKSSIPNALT
Sbjct: 541 QRKQWEGAFWVLQQLQQQGLQPATTTYGLVMEVMLACGKYNLVHDFFKKVQKSSIPNALT 600
Query: 601 YKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEKIC 660
Y+V+VNTLW+EGK DEAVL +Q ME+RGIVGSAALYYDFARCLCSAGRC+EALMQ+EKIC
Sbjct: 601 YRVIVNTLWREGKVDEAVLVVQNMERRGIVGSAALYYDFARCLCSAGRCQEALMQIEKIC 660
Query: 661 KVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMFDE 720
KVANKPLVVTYTGLIQACLD+ ++++ Y+F M+ FCSPNLVTCN +LKGYLDHGMF+E
Sbjct: 661 KVANKPLVVTYTGLIQACLDAGSIKNGAYVFKQMENFCSPNLVTCNTMLKGYLDHGMFEE 720
Query: 721 AKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLYGY 780
AKELF M ++G NIS+ SDY+ RV+PD YTFNT+LDA EKRWDDF Y ML +GY
Sbjct: 721 AKELFLKMLDDGNNISSKSDYKVRVIPDSYTFNTLLDACIIEKRWDDFEFVYKMMLHHGY 780
Query: 781 HFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEALS 840
HFN KRHLRMI++A GK ELL+ TW HL +A R+ PPPL+KERFC L + DY+ ALS
Sbjct: 781 HFNAKRHLRMILDAREAGKGELLDITWTHLTEAGRSPPPPLVKERFCTKLEKDDYAAALS 840
Query: 841 CISKHHSSD-EHHFSKSAWLNLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVLQNLL 900
CI+ + + FSK+AWL L KE +RF KD+ + L+H+ S+L+ R D NPV QNL+
Sbjct: 841 CITNPNLGELRTFFSKNAWLKLFKENAERFQKDTFVRLVHEGSILINRTDRSNPVFQNLM 892
Query: 901 LSGKEFCRSRISVADPRLEEVVCTNEFQSA 906
+ E R+ + AD + E VCT + A
Sbjct: 901 AACGELDRTCLVGADFKPSETVCTTHTEPA 892
BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match:
gi|657999772|ref|XP_008392321.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic-like [Malus domestica])
HSP 1 Score: 924.9 bits (2389), Expect = 1.1e-265
Identity = 509/934 (54.50%), Postives = 648/934 (69.38%), Query Frame = 1
Query: 4 VIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAK-SRVLRHRG 63
++MANA + + NG A +C S L GFS F + G GLN + K +RV +
Sbjct: 4 MVMANAQPGVSNFQRNGVFATNCCPKSLPLSGFSIFRRPIFGIGLNEKNVKRNRVFGIKF 63
Query: 64 -HKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKM 123
+ I A SK S+I LE +F+FKPSFD+Y++VM +VR R + D +
Sbjct: 64 VNSRTVISAVSKEGSEI-------LEKEFEFKPSFDQYLKVMGTVRLRSDR---DRQQRS 123
Query: 124 KENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLFDNSERITRKIDLSGNKFDS- 183
KE S S +S + D K + + + N E+ ++ N+++S
Sbjct: 124 KEENPKHSVRSRGVSRRLLSEGSEEDAK-----LGEPEGNLNREKASK----FENRYESL 183
Query: 184 -KRKGVTRSKDELKGKVTPFDSQVNDKQHVEK-------RNGNWSNY---IEPKVTRSNH 243
R G T + ++G +DS+ N+K +K R+G WS Y +EP
Sbjct: 184 GNRNGSTHESERVEGFKDEYDSRQNNKDEKDKKMIRGETRDGRWSKYTGRVEPG------ 243
Query: 244 DKRLHFKANTLDVKSESHGVRYGSSMKISEKI---WADDDTKRTKDVLKV---------- 303
L FK + V++ G G + ++ +++ + +D L+V
Sbjct: 244 ---LDFKGKSTTVRNAKDGP--GVTGRLEQEVDFKGKSTMARNARDGLRVYKSRDKAVER 303
Query: 304 GKYGVQLEGNYIPGDKVGRKKTEQSY--RGLSKSGKQFHEFTEESSLEVEHAAFNSCDA- 363
GK+GV+ E D K T++ + R ++KSG+ F + E SLEVE AAF + D
Sbjct: 304 GKFGVRNEDGVERNDSNADKATDRGFVPRSVTKSGRDFPKRFNEKSLEVERAAFQNFDEF 363
Query: 364 EDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVL 423
DIMDKPRVS+MEME+RIQ L+K LNGADIDMPEWMF++MMRSA+IR++DHSILRVIQ+L
Sbjct: 364 GDIMDKPRVSQMEMEQRIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLL 423
Query: 424 GKLGNWKRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSS 483
GKLGNW+RVLQVIEWLQMRERFKSHKLR+IYTTALDVLGKARRPVEALNVFHAM E SS
Sbjct: 424 GKLGNWRRVLQVIEWLQMRERFKSHKLRYIYTTALDVLGKARRPVEALNVFHAMLEQMSS 483
Query: 484 YPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAV 543
YPDLVAYHSIAVTLGQAG+MRELFDVID+MRSPPKKKFKTGAL KWDPRL+PDIV+++AV
Sbjct: 484 YPDLVAYHSIAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDIVVFHAV 543
Query: 544 LNACVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSI 603
LNACV+RK WEGAFWVLQ+LK+QGLQP+TTTYGLVMEVML CGKYNLVHEFF+KVQKSSI
Sbjct: 544 LNACVQRKQWEGAFWVLQQLKQQGLQPATTTYGLVMEVMLACGKYNLVHEFFKKVQKSSI 603
Query: 604 PNALTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQ 663
PNALTY+V+VNTLW+EGK DEAV + ME+RGIVG AALYYDFARCLCSAGRC+EALMQ
Sbjct: 604 PNALTYRVIVNTLWREGKIDEAVSVVHNMERRGIVGYAALYYDFARCLCSAGRCQEALMQ 663
Query: 664 MEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDH 723
+EKICKVANKPLVVTYTGLIQACLD+ ++++A Y+F M+ FCSPNLVTCNI+LK YLDH
Sbjct: 664 IEKICKVANKPLVVTYTGLIQACLDTGSVENAAYVFKQMENFCSPNLVTCNIMLKAYLDH 723
Query: 724 GMFDEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQM 783
MF++AK+LF M ++G NI+ SDY+ R++PD YTFNT+LDA EKRWDDF + Y +M
Sbjct: 724 RMFEKAKDLFLRMLDDGNNITNGSDYKVRIIPDSYTFNTLLDACVTEKRWDDFEYVYRRM 783
Query: 784 LLYGYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDY 843
L +G+HFN KRHLRMI++A + G+ ELL+ TW HL +ADR PPPL+KERFC L + DY
Sbjct: 784 LHHGFHFNAKRHLRMILDACKAGRAELLDMTWMHLTEADRIPPPPLVKERFCTKLEKDDY 843
Query: 844 SEALSCISKHHSSDEHHFSKSAWLNLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVL 903
+ ALSCI+ + + FSK+AWL L KE +RF D+ + L+ + S+L+ R+D NPV
Sbjct: 844 AAALSCITTQNLGELQAFSKTAWLKLFKENAERFQNDTFVRLVDEGSILVNRSDRSNPVF 903
Query: 904 QNLLLSGKEFCRSRISVADPRLEEVVCTNEFQSA 906
QNL+ + E R R++ A E V T + + A
Sbjct: 904 QNLMAACGEVDRIRLAGAAGSTRETVSTTQTEPA 907
BLAST of Cp4.1LG20g05810 vs. NCBI nr
Match:
gi|694367514|ref|XP_009362169.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Pyrus x bretschneideri])
HSP 1 Score: 921.8 bits (2381), Expect = 9.5e-265
Identity = 504/931 (54.14%), Postives = 644/931 (69.17%), Query Frame = 1
Query: 4 VIMANANLCIPCCEGNGFPALHCTQNSHYLLGFSFFTSSVSGSGLNSGSAK-SRVLRHRG 63
++MANA + + NG A C S L GFS F + G GLN + K +RV +
Sbjct: 4 MVMANAQPGVSNFQRNGVFATDCCPKSLPLSGFSIFRRPIFGIGLNEKNVKRNRVFGIKF 63
Query: 64 -HKCGAIKASSKGESDIRLASGNLLENDFQFKPSFDEYVRVMESVRSRRYKRQSDDPNKM 123
+ I A SK S+I LE +F+FKPSFD+Y++VM +VR R + D +
Sbjct: 64 VNSRTVISAVSKEGSEI-------LEKEFEFKPSFDQYLKVMGTVRLRSDR---DKQQRS 123
Query: 124 KENASAKSAESTSISNIVTDVQGNMDVKKKVICVDQEDLF-DNSERITRKIDLSGNKFDS 183
KE S S +S + + K + + +L + + ++ + +L GN
Sbjct: 124 KEENPKHSVRSRGVSRRLLSEGSEEEAK---LGEPEGNLNREKASKVENRYELLGN---- 183
Query: 184 KRKGVTRSKDELKGKVTPFDSQVNDKQHVEK-------RNGNWSNY---IEPKVTRSNHD 243
R G T + +KG +DS+ N+K +K R+G WS Y +EP
Sbjct: 184 -RNGSTHERQRVKGFKDEYDSRQNNKDEKDKKMIRGETRDGRWSKYTGRVEPG------- 243
Query: 244 KRLHFKANTLDVKSESHG----------VRYGSSMKISEKIWADDDTKRTKD-VLKVGKY 303
L FK + V++ G V + ++ +++D ++ GK+
Sbjct: 244 --LDFKGKSTTVRNAKDGPGVTGRLEQEVDFKGKSSMARNARDGPRVYQSRDEAVERGKF 303
Query: 304 GVQLEGNYIPGDKVGRKKTEQSY--RGLSKSGKQFHEFTEESSLEVEHAAFNSCDA-EDI 363
GV+ E K T++ + R ++KSG+ F + E SLEVE AAF + D DI
Sbjct: 304 GVRNEDGVERNHSNADKATDRGFVPRSVTKSGRDFPKRFNEKSLEVERAAFRNFDEFGDI 363
Query: 364 MDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFAQMMRSAKIRYSDHSILRVIQVLGKL 423
MDKPRVS+MEME+RIQ L+K LNGADIDMPEWMF++MMRSA+IR++DHSILRVIQ+LGKL
Sbjct: 364 MDKPRVSQMEMEQRIQKLAKWLNGADIDMPEWMFSKMMRSAQIRFTDHSILRVIQLLGKL 423
Query: 424 GNWKRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQEHFSSYPD 483
GNW+RVLQVIEWLQMRERFKSHKLR+I+TTALDVLGKARRPVEALNVFHAM E SSYPD
Sbjct: 424 GNWRRVLQVIEWLQMRERFKSHKLRYIFTTALDVLGKARRPVEALNVFHAMLEQMSSYPD 483
Query: 484 LVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNA 543
LVAYHSIAVTLGQAG+MRELFDVID+MRSPPKKKFKTGAL KWDPRL+PD+V+++AVLNA
Sbjct: 484 LVAYHSIAVTLGQAGHMRELFDVIDTMRSPPKKKFKTGALGKWDPRLEPDVVVFHAVLNA 543
Query: 544 CVKRKNWEGAFWVLQELKEQGLQPSTTTYGLVMEVMLQCGKYNLVHEFFRKVQKSSIPNA 603
CV+RK WEGAFWVLQ+LK+QGLQP+TTTYGLVMEVML CGKYNLVHEFF+KVQKSSIPNA
Sbjct: 544 CVQRKQWEGAFWVLQQLKQQGLQPATTTYGLVMEVMLACGKYNLVHEFFKKVQKSSIPNA 603
Query: 604 LTYKVLVNTLWKEGKTDEAVLAIQTMEKRGIVGSAALYYDFARCLCSAGRCKEALMQMEK 663
LTY+V+VNTLW+EGK DEAV I ME+RGIVG AALYYDFARCLCSAGRC+EALMQ+EK
Sbjct: 604 LTYRVIVNTLWREGKIDEAVSVIHNMERRGIVGYAALYYDFARCLCSAGRCQEALMQIEK 663
Query: 664 ICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKAFCSPNLVTCNILLKGYLDHGMF 723
ICKVA+KPLVVTYTGLIQACLD+ ++++A Y+F M+ CSPNLVTCNI+LK YLDHGMF
Sbjct: 664 ICKVASKPLVVTYTGLIQACLDAGSVENAAYVFKQMENICSPNLVTCNIMLKAYLDHGMF 723
Query: 724 DEAKELFQNMSENGRNISAVSDYRDRVLPDIYTFNTMLDASFAEKRWDDFSHFYNQMLLY 783
++AK+LF M ++G NI++ SDY+ R++PD YTFNT+LDA AEKRWDDF + Y +ML +
Sbjct: 724 EKAKDLFLRMLDDGNNITSRSDYKVRIIPDSYTFNTLLDACVAEKRWDDFEYVYKRMLHH 783
Query: 784 GYHFNPKRHLRMIMEAARGGKDELLETTWKHLAQADRTLPPPLIKERFCIMLARGDYSEA 843
G+HFN KRHLRMI++A + K ELL+ TW HL +ADR PPPL+KERFC L + DY+ A
Sbjct: 784 GFHFNAKRHLRMILDACKAEKAELLDITWMHLTEADRIPPPPLVKERFCTKLEKNDYAAA 843
Query: 844 LSCISKHHSSDEHHFSKSAWLNLLKE--KRFPKDSVIELIHKVSMLLARNDSPNPVLQNL 903
LSC++ + + FSK+AWL L E +RF KD+ + L+ + S+L+ R+D NPV QNL
Sbjct: 844 LSCVTTQNLGEPQAFSKAAWLKLFMENAERFQKDTFVRLVDEGSILVNRSDRSNPVYQNL 903
Query: 904 LLSGKEFCRSRISVADPRLEEVVCTNEFQSA 906
+ + E R R++ A E V T + + A
Sbjct: 904 MAASGEVDRIRLTGAAVSTRETVSTTQTEPA 907
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
PPR64_ARATH | 9.6e-209 | 48.08 | Pentatricopeptide repeat-containing protein At1g30610, chloroplastic OS=Arabidop... | [more] |
PP451_ARATH | 5.8e-113 | 39.68 | Pentatricopeptide repeat-containing protein At5g67570, chloroplastic OS=Arabidop... | [more] |
PP120_ARATH | 7.3e-23 | 23.50 | Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th... | [more] |
PPR91_ARATH | 2.8e-22 | 21.25 | Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... | [more] |
PPR28_ARATH | 4.7e-22 | 25.23 | Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LVN7_CUCSA | 0.0e+00 | 83.35 | Uncharacterized protein OS=Cucumis sativus GN=Csa_1G553530 PE=4 SV=1 | [more] |
M5WJN1_PRUPE | 3.6e-263 | 55.15 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001195mg PE=4 SV=1 | [more] |
W9RFN3_9ROSA | 3.9e-257 | 53.57 | Uncharacterized protein OS=Morus notabilis GN=L484_025948 PE=4 SV=1 | [more] |
B9T6B9_RICCO | 1.3e-255 | 55.17 | Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... | [more] |
A0A061FSP7_THECC | 2.1e-255 | 55.84 | Pentatricopeptide repeat-containing protein isoform 1 OS=Theobroma cacao GN=TCM_... | [more] |