BLAST of CSPI01G02030 vs. Swiss-Prot
Match:
PPR49_ARATH (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana GN=At1g18900 PE=2 SV=1)
HSP 1 Score: 1075.8 bits (2781), Expect = 0.0e+00
Identity = 542/879 (61.66%), Postives = 671/879 (76.34%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
M+RAK I +LS++ARSFFL+GSR + DG SC +DE CVS+RQ R E ++K +
Sbjct: 1 MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGL-NT 120
+ VG ++ E K +V K D+ + Q ++ P + V YAS +
Sbjct: 61 ILPKPSVVGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVRE 120
Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGG-TFSSSKNCMVDPARSITS 180
++G+ +S I DQ+ KAGI+AVN SD N KIPS D G F K+CMVDP R I+S
Sbjct: 121 EVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPISS 180
Query: 181 VKPSKIKHLRRENISRVHSRPSV-EIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
VK S +K +RRE+ ++++ R + E V + SSN + ++ +VKG RQ VS +
Sbjct: 181 VKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQ-VSNS 240
Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
K + N + K + ++ QR + SN F S F+NSS
Sbjct: 241 VVGKSLPTTNNTYGK--RTSVLQRPHIDSNRFVP----------SGFSNSSVEM------ 300
Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
+K P+G A + + N+ ++VE+VS +L++ +WGPAAEEA+ L IDAYQANQ+LK++
Sbjct: 301 MKGPSGTALTSRQYCNSGHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQM 360
Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
+D+ ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF AINKLLD+M++DGCQPN VT
Sbjct: 361 NDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT 420
Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
YNR+IHSYGRANYL +A+NVF QMQEAGC+PDRVTYCTLIDIHAK+GFLD+AM MY++MQ
Sbjct: 421 YNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQ 480
Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
GL+PDTFTYSV+INCLGKAGHL AAH+LFC MVD+GC PNLVTYNIM+ L AKARNY+
Sbjct: 481 AGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQ 540
Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
ALKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE +F EMQ+KNW+PDEPVYGLLV
Sbjct: 541 NALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPVYGLLV 600
Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLS FLRV+++++AY+LLQ+ML GL+
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLR 660
Query: 661 PSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
PSLQTYTLLLSCCTD ++ DMGFC +LM TGHPAH FL+ +P+AGP+G+NVR+H + F
Sbjct: 661 PSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNF 720
Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
LDLMHSEDRESKRGLVDAVVDFLHKSG KEEAG VWE A QKNV+PDA++EKS YWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLIN 780
Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
LHVMS+GTAVTALSRTLAWFR+Q+L SG PSRIDIVTGWGRRS+VTG+S+VRQAV++LL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELL 840
Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
+IF PFFTE+GNSGCFVG GEPL+RWL QS+VERMHLL
Sbjct: 841 NIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHVERMHLL 860
BLAST of CSPI01G02030 vs. Swiss-Prot
Match:
PP123_ARATH (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN=At1g74750 PE=2 SV=1)
HSP 1 Score: 1069.7 bits (2765), Expect = 1.7e-311
Identity = 547/880 (62.16%), Postives = 662/880 (75.23%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
M+RAK I +LS+SARSFFLSGSR +A DG SCTC EDE+ VS+RQ R E + + K ++
Sbjct: 1 MIRAKHISNLSSSARSFFLSGSRPSAADGNSCTCAEDESGVSKRQQIRTEVVQTGKRASN 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
+A + G ++ EA K +V KT S+ + P A+ V +AS +
Sbjct: 61 LA--AGLAGSILPVEAGKPLVVPKTVEHFTRPSLLPQHVSSPALPGKADSVNHASAIIK- 120
Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
E I DQ+ KAGI VNL SD N+KIP SD K+CMVDP R I+ VK
Sbjct: 121 ---EDVGVPIGDQIFKAGIGNVNLLSDIANYKIPLSDGTEVVGLPKSCMVDPTRPISGVK 180
Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
S +K +RRE++++V+ R + +P++S P G++Q ++ +
Sbjct: 181 SSNVKVIRREHLAKVYPRSADRVPINSSP-------------------GTKQASNDVAGK 240
Query: 241 KLVVFQNISSDKCDKRNL-PQRTRVHSNSFTSHF--HSIAQTTGSDFTNSSKNF-KKFPD 300
+S++ KR + PQR S + S +S+ + +S + F K +
Sbjct: 241 SFEAHDLLSNNVSGKRKIMPQRPYTDSTRYASGGCDYSVHSSDDRTIISSVEGFGKPSRE 300
Query: 301 NLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKR 360
+K AP N VVE+VS IL++ KWG AAEEA+ +DAYQANQ+LK+
Sbjct: 301 MMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAEEALHNFGFRMDAYQANQVLKQ 360
Query: 361 VDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVV 420
+D++A ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF INKLLD+M++DGC+PN V
Sbjct: 361 MDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTV 420
Query: 421 TYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKM 480
TYNR+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLD+AM MY++M
Sbjct: 421 TYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRM 480
Query: 481 QDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNY 540
Q+AGL+PDTFTYSV+INCLGKAGHL AAHRLFC MV +GC PNLVT+NIMIAL AKARNY
Sbjct: 481 QEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNY 540
Query: 541 EIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLL 600
E ALKLYRDMQ +GF+PDKVTY IVMEVLGHCGFLEEAEG+F EMQ+KNWVPDEPVYGLL
Sbjct: 541 ETALKLYRDMQNAGFQPDKVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWVPDEPVYGLL 600
Query: 601 VDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGL 660
VDLWGK+GNV KAW+WY AML+AGL+PNVPTCNSLLS FLRVH++S+AY LLQSML GL
Sbjct: 601 VDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGL 660
Query: 661 KPSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSK 720
PSLQTYTLLLSCCTDA++N DMGFC +LM V+GHPAH FL+ +P AGP+GQ VRDH+S
Sbjct: 661 HPSLQTYTLLLSCCTDARSNFDMGFCGQLMAVSGHPAHMFLLKMPPAGPDGQKVRDHVSN 720
Query: 721 FLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLI 780
FLD MHSEDRESKRGL+DAVVDFLHKSGLKEEAG VWE A KNVYPDA++EKS YWLI
Sbjct: 721 FLDFMHSEDRESKRGLMDAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLI 780
Query: 781 NLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDL 840
NLHVMS+GTAV ALSRTLAWFR+Q+L+SG PSRIDIVTGWGRRS+VTG+S+VRQAV++L
Sbjct: 781 NLHVMSEGTAVIALSRTLAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTGTSMVRQAVEEL 840
Query: 841 LSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
L+IF+FPFFTENGNSGCFVG GEPL WL +SYVERMHLL
Sbjct: 841 LNIFNFPFFTENGNSGCFVGSGEPLKNWLLESYVERMHLL 855
BLAST of CSPI01G02030 vs. Swiss-Prot
Match:
PP178_ARATH (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana GN=At2g31400 PE=2 SV=1)
HSP 1 Score: 192.6 bits (488), Expect = 1.8e-47
Identity = 131/530 (24.72%), Postives = 240/530 (45.28%), Query Frame = 1
Query: 374 RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDA 433
R D +Y T++ + + Q ++L QM PNVV+Y+ +I + +A +A
Sbjct: 369 RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEA 428
Query: 434 VNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINC 493
+N+F +M+ G DRV+Y TL+ I+ K G + A+ + +M G+ D TY+ ++
Sbjct: 429 LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488
Query: 494 LGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPD 553
GK G + ++F M E +PNL+TY+ +I +K Y+ A++++R+ + +G D
Sbjct: 489 YGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 548
Query: 554 KVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYH 613
V Y +++ L G + A + EM K+ P+ Y ++D +G+S + ++ ++ +
Sbjct: 549 VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSN 608
Query: 614 AMLKAGLKPNVPTCNSLLSAFL-----RVHQLSDA-----------------------YQ 673
++P +S LSA RV QL +
Sbjct: 609 G-------GSLPFSSSALSALTETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILE 668
Query: 674 LLQSMLTFGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPN 733
+ + M +KP++ T++ +L+ C+ + D E +++ + + + L
Sbjct: 669 VFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMG--Q 728
Query: 734 GQNVRDHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAV 793
+NV D ++ D + +A+ D L G K A V + V+ +
Sbjct: 729 RENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVW 788
Query: 794 KEKSSCYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGS 853
+ SC ++LH+MS G A + L R + P + I+TGWG+ SKV G
Sbjct: 789 SD--SC---LDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGD 848
Query: 854 SLVRQAVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
+R+AV+ LL PF N G F G ++ WL +S ++ +L
Sbjct: 849 GALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLIL 884
BLAST of CSPI01G02030 vs. Swiss-Prot
Match:
PP132_ARATH (Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidopsis thaliana GN=EMB2217 PE=2 SV=1)
HSP 1 Score: 186.8 bits (473), Expect = 1.0e-45
Identity = 121/441 (27.44%), Postives = 218/441 (49.43%), Query Frame = 1
Query: 378 DGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVF 437
DG TY +I L ++ + A KL QM + +P+ ++ ++ S G+A L ++ V+
Sbjct: 312 DGSTYELIIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFSSLVDSMGKAGRLDTSMKVY 371
Query: 438 KQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKA 497
+MQ G P + +LID +AK+G LD A+ ++++M+ +G P+ Y+++I K+
Sbjct: 372 MEMQGFGHRPSATMFVSLIDSYAKAGKLDTALRLWDEMKKSGFRPNFGLYTMIIESHAKS 431
Query: 498 GHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTY 557
G L A +F M G +P TY+ ++ + A + + A+K+Y M +G P +Y
Sbjct: 432 GKLEVAMTVFKDMEKAGFLPTPSTYSCLLEMHAGSGQVDSAMKIYNSMTNAGLRPGLSSY 491
Query: 558 CIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLK 617
++ +L + ++ A I +EM+ + D +L+ ++ K +V A +W M
Sbjct: 492 ISLLTLLANKRLVDVAGKILLEMKAMGYSVDVCASDVLM-IYIKDASVDLALKWLRFMGS 551
Query: 618 AGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLLSCCTDAQTNDM 677
+G+K N L + ++ A LL++++ K L YT +L+ Q D
Sbjct: 552 SGIKTNNFIIRQLFESCMKNGLYDSARPLLETLVHSAGKVDLVLYTSILAHLVRCQDEDK 611
Query: 678 -GFCCELMQVTGHPAHTFLVSLPSAGP--NGQNVRDHMSKFLDLMHSEDRE-SKRGLVDA 737
++ T H AH F+ L GP Q V + +F + E E + R V+
Sbjct: 612 ERQLMSILSATKHKAHAFMCGL-FTGPEQRKQPVLTFVREFYQGIDYELEEGAARYFVNV 671
Query: 738 VVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMSDGTAVTALSRTLA 797
++++L G A CVW+ A + ++P A+ W +++ +S G A+ A+ TL
Sbjct: 672 LLNYLVLMGQINRARCVWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLH 731
Query: 798 WFRQQLLLSGVGPSRIDIVTG 815
FR+++L GV P RI +VTG
Sbjct: 732 RFRKRMLYYGVVPRRIKLVTG 750
BLAST of CSPI01G02030 vs. Swiss-Prot
Match:
PP344_ARATH (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana GN=PGR3 PE=1 SV=1)
HSP 1 Score: 179.9 bits (455), Expect = 1.2e-43
Identity = 113/395 (28.61%), Postives = 192/395 (48.61%), Query Frame = 1
Query: 274 SIAQTTGSDFTNSSKNFKKFPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAA 333
S+ SDF+ S PD S +T + P+ S S
Sbjct: 60 SVVSMKSSDFSGSMIRKSSKPDLSSSEE----VTRGLKSFPDTDSSFSYF---------- 119
Query: 334 EEAIGKLNCSIDAYQANQILK--RVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGR 393
+ G LN N +L+ RVD + + + L + + D +TY T+ L
Sbjct: 120 KSVAGNLNLVHTTETCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSV 179
Query: 394 AKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVT 453
L +M + G N +YN +IH ++ + +A+ V+++M G P T
Sbjct: 180 KGGLKQAPYALRKMREFGFVLNAYSYNGLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQT 239
Query: 454 YCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMV 513
Y +L+ K +D MG+ ++M+ GL P+ +T+++ I LG+AG +N A+ + RM
Sbjct: 240 YSSLMVGLGKRRDIDSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKINEAYEILKRMD 299
Query: 514 DEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLE 573
DEGC P++VTY ++I AR + A +++ M+ +PD+VTY +++ L+
Sbjct: 300 DEGCGPDVVTYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLD 359
Query: 574 EAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLL 633
+ + EM+K VPD + +LVD K+GN +A++ M G+ PN+ T N+L+
Sbjct: 360 SVKQFWSEMEKDGHVPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLI 419
Query: 634 SAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLL 667
LRVH+L DA +L +M + G+KP+ TY + +
Sbjct: 420 CGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFI 440
BLAST of CSPI01G02030 vs. TrEMBL
Match:
A0A0A0LRL7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G008500 PE=4 SV=1)
HSP 1 Score: 1775.4 bits (4597), Expect = 0.0e+00
Identity = 874/874 (100.00%), Postives = 874/874 (100.00%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV
Sbjct: 1 MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
Query: 61 ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL
Sbjct: 61 ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP
Sbjct: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874
BLAST of CSPI01G02030 vs. TrEMBL
Match:
M5WQI1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001256mg PE=4 SV=1)
HSP 1 Score: 1221.5 bits (3159), Expect = 0.0e+00
Identity = 603/877 (68.76%), Postives = 719/877 (81.98%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
MLRAK I +LS+SARSFFL+G RC+A +G+SCTC EDETCVS+RQ RN +Q PST+
Sbjct: 1 MLRAKHISNLSSSARSFFLNGPRCSATEGSSCTCSEDETCVSQRQQTRNGGPLAQTPSTM 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
V+ S G +I +A KV SHK ++V+ + +I+QV T P + V Y+S + V
Sbjct: 61 VSKPSAGAGTIITGDAVKVASSHKAESVEHTTNIKQVT-TAPRSFGRSATVTYSSSTDAV 120
Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
+SP + DQ +AG+ AVN SD VN K+P SD G + +NCMVDP R ++S+K
Sbjct: 121 H----SSPLVVDQFARAGVAAVNFLSDIVNGKLPLSDGLGLLNLPQNCMVDPTRPLSSIK 180
Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
PS +K ++RE+ VH +PS E SK +S+NHGS K + S+VKG V R +
Sbjct: 181 PSHVKQIKREHFISVHPKPSTETAAASK-HTSNNHGSKGKGEKPSFVKG-LNHVPYTRKE 240
Query: 241 KLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD-FTNSSKNFKKFPDNLK 300
VV SSD DKR++P++++ HSN+F ++ S QT+ ++ +K F + ++K
Sbjct: 241 NSVVAHTASSDTFDKRSMPRKSKGHSNNFIPNYSSNVQTSDAESMGRVTKGFNRPTRDMK 300
Query: 301 SPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDD 360
PTG+ PI F++ NVV++VS ILQQ++WGPAAE A+ LNCS+DAYQANQILK++ D
Sbjct: 301 MPTGITPINRQFVHTGNVVQNVSHILQQMRWGPAAEAALLNLNCSMDAYQANQILKQLQD 360
Query: 361 HAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYN 420
H+VAL FFYWLKR F+HDGHTYTTM+G+LGR++QF AINKLL+QM+K+GCQPNVVTYN
Sbjct: 361 HSVALSFFYWLKRQAGFKHDGHTYTTMVGILGRSRQFGAINKLLNQMVKEGCQPNVVTYN 420
Query: 421 RIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDA 480
R+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLDVA+ +Y+ MQ+A
Sbjct: 421 RLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDVALRLYDGMQEA 480
Query: 481 GLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIA 540
GL+PDTFTYSVMINCLGKAGHL AAHRLFC MV++GCVPNLVTYNIMIALQAKARNYE A
Sbjct: 481 GLSPDTFTYSVMINCLGKAGHLAAAHRLFCEMVNQGCVPNLVTYNIMIALQAKARNYETA 540
Query: 541 LKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDL 600
LKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE IF EM++KNWVPDEPVYGLLVDL
Sbjct: 541 LKLYRDMQGAGFEPDKVTYSIVMEVLGHCGYLEEAEAIFGEMKRKNWVPDEPVYGLLVDL 600
Query: 601 WGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPS 660
WGK+GNV KAW WY AML AGL+PNVPTCNSLLSAFLRVHQLSDAY LLQSM+ GL PS
Sbjct: 601 WGKAGNVGKAWNWYQAMLHAGLRPNVPTCNSLLSAFLRVHQLSDAYNLLQSMMGLGLNPS 660
Query: 661 LQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLD 720
LQTYTLLLSCCT+A++ DM FCCELM VTGHPAHTFL+S+PSAGP+GQNVR+HMS+FLD
Sbjct: 661 LQTYTLLLSCCTEARSPYDMDFCCELMAVTGHPAHTFLLSMPSAGPDGQNVREHMSRFLD 720
Query: 721 LMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLH 780
LMHSEDRESKRGLVDAVVDFLHKSGLKEEAG VWE A QKNVYPDA++EKSSCYWLINLH
Sbjct: 721 LMHSEDRESKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAIREKSSCYWLINLH 780
Query: 781 VMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSI 840
VMSDGTAVTALSRTLAWFRQQ+L+SG+ PSRIDIVTGWGRRS+VTGSSLVRQAV++LL++
Sbjct: 781 VMSDGTAVTALSRTLAWFRQQMLISGICPSRIDIVTGWGRRSRVTGSSLVRQAVEELLNM 840
Query: 841 FSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
FSFPFFTENGNSGCFVGCGEPL++WL QSYVERMHLL
Sbjct: 841 FSFPFFTENGNSGCFVGCGEPLNKWLLQSYVERMHLL 870
BLAST of CSPI01G02030 vs. TrEMBL
Match:
W9SZB0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_010090 PE=4 SV=1)
HSP 1 Score: 1193.3 bits (3086), Expect = 0.0e+00
Identity = 596/881 (67.65%), Postives = 711/881 (80.70%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGAS-CTCPEDETCVSERQNARNETLPSQKPST 60
MLRAKQIG+LSNSARSFFLSGSRCNA DG+S CTC EDETCVS RQN R+ + +Q PST
Sbjct: 1 MLRAKQIGNLSNSARSFFLSGSRCNAADGSSSCTCSEDETCVSRRQNLRHGGILAQNPST 60
Query: 61 LVANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNT 120
L + +S RVG LI+ +A K + + K ++ S++QV P +ECV YAS +
Sbjct: 61 LASRTSARVGTLISGDAVKAVSTEKA-SMHNPTSLKQVI-ISPKSLGRSECVSYASTVEK 120
Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSS--KNCMVDPARSIT 180
+ E +SP +DQ VKAG+ AVN SD +N+K P SD G F+++ +NCMVDPAR T
Sbjct: 121 NV--EHSSPVFSDQFVKAGVAAVNFLSDVMNYKFPLSDGIGIFNNNLPQNCMVDPARLST 180
Query: 181 SVKPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
S++ S + H++R+N S VH RPSVE V Q +S + K ++SS VKG V
Sbjct: 181 SIRSSHVNHVKRKNFSGVHPRPSVEAAV----QYNSTSSTKSKDSKSSSVKGVNN-VPNT 240
Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQ--TTGSDFTNSSKNFKKFP 300
R +++ ++ D+R +P RT+ NSF + F S + T G + +K F + P
Sbjct: 241 RNGNSWATRSVPAEARDRRAIPNRTKACLNSFKADFSSDSNQSTDGGNVGFGNKGFNRPP 300
Query: 301 DNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILK 360
+ PTG API + N NVVE VS +L L+WG AAEEA+ LN ++DA+QANQ+LK
Sbjct: 301 REMNFPTGYAPIKRPYANTANVVERVSHMLHGLRWGRAAEEALENLNYAMDAFQANQVLK 360
Query: 361 RVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNV 420
++ DH VALGFFYWLKR F+HDGHTYTTM+G+LGR+++F AINKLL +M+K+GCQPNV
Sbjct: 361 QLQDHNVALGFFYWLKRQAGFKHDGHTYTTMVGILGRSREFGAINKLLHEMVKEGCQPNV 420
Query: 421 VTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEK 480
VTYNR+IHSYGRANYL++A+NVF QMQ AGCEPDRVTYCTLIDIHAK+GFLDVA+ +Y++
Sbjct: 421 VTYNRLIHSYGRANYLKEAINVFNQMQNAGCEPDRVTYCTLIDIHAKAGFLDVALRLYDR 480
Query: 481 MQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARN 540
MQ AGL+PDTFTYSV+INCLGK GHL AAH LFC+MV EGCVPNLVTYNIMIALQAKARN
Sbjct: 481 MQQAGLSPDTFTYSVIINCLGKGGHLTAAHNLFCKMVSEGCVPNLVTYNIMIALQAKARN 540
Query: 541 YEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGL 600
YE ALKLYRDMQ +GF+PDKVTY IVMEVLGHCG+LEEAE +F+EM+ KNWVPDEPVYGL
Sbjct: 541 YETALKLYRDMQNAGFDPDKVTYSIVMEVLGHCGYLEEAEAVFVEMRHKNWVPDEPVYGL 600
Query: 601 LVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFG 660
LVDLWGKSGN++KAWEWY AML AGL+PNVPTCNSLLSAFLRVH+L++AY+LLQSM+ +G
Sbjct: 601 LVDLWGKSGNIEKAWEWYQAMLNAGLQPNVPTCNSLLSAFLRVHRLTEAYELLQSMVDWG 660
Query: 661 LKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMS 720
L PSLQTYTLLLSCCT+AQ+ DMGFCC+LM TGHPAHTFL+S+PSAGP+GQNVRDH S
Sbjct: 661 LNPSLQTYTLLLSCCTEAQSPYDMGFCCKLMATTGHPAHTFLLSMPSAGPDGQNVRDHAS 720
Query: 721 KFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWL 780
+FLDLMHSEDRE KRGLVDAVVDFLHKSGLKEEAG VWE A QKNVYPDAVKEKSSC+WL
Sbjct: 721 RFLDLMHSEDREGKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVKEKSSCHWL 780
Query: 781 INLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQD 840
INLHVMSDGTAVTALSRTLAWFR+++L+SG+ PSRIDIVTGWGRRS+VTG+SLVRQAVQ+
Sbjct: 781 INLHVMSDGTAVTALSRTLAWFRREMLISGICPSRIDIVTGWGRRSRVTGASLVRQAVQE 840
Query: 841 LLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
LL +FSFPFFTENGNSGCFVGCGEPL+RWL QSYVERMHLL
Sbjct: 841 LLRMFSFPFFTENGNSGCFVGCGEPLNRWLLQSYVERMHLL 872
BLAST of CSPI01G02030 vs. TrEMBL
Match:
A0A061FXD1_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_013776 PE=4 SV=1)
HSP 1 Score: 1180.6 bits (3053), Expect = 0.0e+00
Identity = 593/879 (67.46%), Postives = 711/879 (80.89%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
MLRAKQIG+LS+SARSF SGSRC+A DG SCTCPEDE+CVS +++ RNE L
Sbjct: 1 MLRAKQIGNLSSSARSFLFSGSRCSASDGNSCTCPEDESCVSRKRSIRNEVL-------- 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
+ SS R G L A+K + SH+ + VS + P H+ G V Y ++
Sbjct: 61 -SKSSGR-GTLALGTASKAVGSHEAERAPQLVS-----SPIPLHRSGN--VNYDVNIDAA 120
Query: 121 -LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSV 180
LDG+ ++P I+DQ VKAGI AV+ SD +N+K+P SD G SS KNC+V+ +R + ++
Sbjct: 121 QLDGQASAP-ISDQFVKAGIAAVSFLSDMMNYKLPLSDGGVMLSSPKNCVVESSRQLPNI 180
Query: 181 KPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEART 240
K +K +++EN ++V+ +PS EI K + S HG+ + + ++V+G +Q VS A +
Sbjct: 181 KSPAVKPIKKENFAKVYPKPSSEIAAGPK-STVSYHGTKDRGNKPNFVRGYKQ-VSNAAS 240
Query: 241 QKLVVFQNISSDKCDK-RNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNS-SKNFKKFPDN 300
S++ CDK + +PQR + HS+ F S+F+S + + F++S ++ FKK +
Sbjct: 241 VGSSETHRTSANTCDKGKPMPQRVKAHSHRFMSNFNSNVLPSDAKFSDSGTEGFKKSFRD 300
Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
+K PTG+ P+T +V ESVS ILQQL WGPAAE+A+ LN S+DAYQANQ+LK++
Sbjct: 301 MKMPTGVVPMTRPLAGTRHVTESVSHILQQLNWGPAAEQALENLNFSMDAYQANQVLKQI 360
Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
DH VALGFFYWLK+ F+HDGHTYTTM+G+LGRA+QF AIN+LLDQM+KDGCQPNVVT
Sbjct: 361 QDHTVALGFFYWLKQRAGFKHDGHTYTTMVGILGRARQFGAINRLLDQMVKDGCQPNVVT 420
Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
YNR+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLDVAM +Y++MQ
Sbjct: 421 YNRLIHSYGRANYLKEAINVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDVAMDLYQRMQ 480
Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
GL+PDTFTYSV+INCLGKAGHL AAHRLFC MV +GCVPNLVTYNIMIALQAKARNYE
Sbjct: 481 AVGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCVPNLVTYNIMIALQAKARNYE 540
Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
ALKLYRDMQ +GF+PDKVTY IVMEVLGH G+L+EAE IF EM+KKNWVPDEPVYGLLV
Sbjct: 541 SALKLYRDMQNAGFDPDKVTYSIVMEVLGHYGYLDEAESIFAEMKKKNWVPDEPVYGLLV 600
Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLSAFLRVH+LSDAY LLQ+M+ GL
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSAFLRVHRLSDAYNLLQNMVALGLN 660
Query: 661 PSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
PSLQTYTLLLSCCT+A++ DMGFCC+LM VTGHPAH FL+S+PSAGP+GQNVRDH+ KF
Sbjct: 661 PSLQTYTLLLSCCTEARSPYDMGFCCQLMAVTGHPAHMFLLSMPSAGPDGQNVRDHVGKF 720
Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
LD+MHSEDRESKRGLVD+VVDFLHKSGLKEEAG VWE A QKNVYPDAV+EKSSCYWLIN
Sbjct: 721 LDMMHSEDRESKRGLVDSVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAVREKSSCYWLIN 780
Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
LHVMSDGTAVTALSRTLAWFRQQ+L+SG+ PSRIDIVTGWGRRS+VTGSSLVRQAVQDLL
Sbjct: 781 LHVMSDGTAVTALSRTLAWFRQQMLVSGISPSRIDIVTGWGRRSRVTGSSLVRQAVQDLL 840
Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
SIFSFPFFTENGNSGCFVGCGEPL+RWL QSYVERMHLL
Sbjct: 841 SIFSFPFFTENGNSGCFVGCGEPLNRWLLQSYVERMHLL 859
BLAST of CSPI01G02030 vs. TrEMBL
Match:
B9R930_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1513340 PE=4 SV=1)
HSP 1 Score: 1178.7 bits (3048), Expect = 0.0e+00
Identity = 590/883 (66.82%), Postives = 712/883 (80.63%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCN-ADGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
MLRAKQ+ +LS++ARSFFLSGSRC+ +DG+SCTC EDE+C+ RQ RN + +Q+ L
Sbjct: 1 MLRAKQLSNLSSNARSFFLSGSRCSTSDGSSCTCSEDESCLPRRQQTRNNAVLAQRGPAL 60
Query: 61 VANSSPRVGPL-IAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNT 120
V +S RV + +A K++V HK ++V+ ++ QV + P R ++CV YASG++
Sbjct: 61 VPKASARVSQTSLLGDAGKLLVPHKVESVECP-TLPQVVSA-PISIRKSDCVSYASGIDA 120
Query: 121 VL-DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITS 180
V D +SP I+DQ KAGI AV+ SD VN+K+P +D G +S KNCMVDP R ++
Sbjct: 121 VENDIPYSSPPISDQFFKAGIAAVSFLSDLVNYKLPITD-GSGINSPKNCMVDPTRPQST 180
Query: 181 VKPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNC-KPAQSSYVKGSRQEVSEA 240
V+ S +K +RREN S+V+ + S E V S S+SN+ S K +SS++KGS++ VS
Sbjct: 181 VRSSNVKPIRRENCSKVYPKASPEAAVSS---STSNYDSTRDKSEKSSFIKGSKR-VSNT 240
Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNS----SKNFKK 300
V +I+SD CD+R +PQ+++ SN T++F++ QT + T +++++K
Sbjct: 241 PAGNSVKTCSIASDTCDRRIIPQKSKGQSNRSTANFNANVQTVQTSDTKYGEYVAEDYRK 300
Query: 301 FPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQI 360
P K P P T F + ++VE+V+ IL+Q++WGPAAEEA+ LN S+D YQANQ+
Sbjct: 301 PPRETKMPVVRVPSTRRFASNGHIVENVAHILRQIRWGPAAEEALANLNYSMDPYQANQV 360
Query: 361 LKRVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQP 420
LK++ DH VAL FFYWLKR P F HDGHTYTTM+G+LGRAKQF AINKLLDQM+KDGCQP
Sbjct: 361 LKQLQDHTVALNFFYWLKRQPGFNHDGHTYTTMVGILGRAKQFGAINKLLDQMVKDGCQP 420
Query: 421 NVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMY 480
NVVTYNR+IHSYGRANYL DAV+VF +MQ GCEPDRVTYCTLIDIHAK+GFLD A+ MY
Sbjct: 421 NVVTYNRLIHSYGRANYLNDAVDVFNEMQRVGCEPDRVTYCTLIDIHAKAGFLDFALEMY 480
Query: 481 EKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKA 540
++MQ AGL+PDTFTYSV+INCLGKAGHL AAH+LFC MV++GCVPNLVTYNIMIALQAKA
Sbjct: 481 QRMQAAGLSPDTFTYSVIINCLGKAGHLAAAHKLFCEMVEQGCVPNLVTYNIMIALQAKA 540
Query: 541 RNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVY 600
RNY+ ALKLYRDMQ +GF+PDKVTY IVMEVLGHCG+L+EAE +F EM++KNWVPDEPVY
Sbjct: 541 RNYQSALKLYRDMQSAGFQPDKVTYSIVMEVLGHCGYLDEAEAVFSEMKRKNWVPDEPVY 600
Query: 601 GLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLT 660
GLLVDLWGK+GNV+KAW+WY ML GL+PNVPTCNSLLSAFLRVH+L+DAY LLQSML
Sbjct: 601 GLLVDLWGKAGNVEKAWQWYQTMLNTGLRPNVPTCNSLLSAFLRVHKLADAYNLLQSMLE 660
Query: 661 FGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDH 720
GL PSLQTYTLLLSCCT+A++ DMG CELM VTGHPAH FL+SLPSAGP+GQNVRDH
Sbjct: 661 LGLNPSLQTYTLLLSCCTEARSPYDMGIYCELMAVTGHPAHMFLLSLPSAGPDGQNVRDH 720
Query: 721 MSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCY 780
SKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAG VWE A Q+NVYPDAVKEK SCY
Sbjct: 721 ASKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQRNVYPDAVKEKGSCY 780
Query: 781 WLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAV 840
WLINLHVMSDGTAVTALSRTLAWFRQQ+L+SG+ PSRIDIVTGWGRRS+VTGSS+VRQAV
Sbjct: 781 WLINLHVMSDGTAVTALSRTLAWFRQQMLVSGISPSRIDIVTGWGRRSRVTGSSMVRQAV 840
Query: 841 QDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
Q+LL IFSFPFFTENGNSGCFVGCGEPL+RWL Q YV+RMHLL
Sbjct: 841 QELLHIFSFPFFTENGNSGCFVGCGEPLNRWLLQPYVDRMHLL 876
BLAST of CSPI01G02030 vs. TAIR10
Match:
AT1G74750.1 (AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 1069.7 bits (2765), Expect = 9.6e-313
Identity = 547/880 (62.16%), Postives = 662/880 (75.23%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
M+RAK I +LS+SARSFFLSGSR +A DG SCTC EDE+ VS+RQ R E + + K ++
Sbjct: 1 MIRAKHISNLSSSARSFFLSGSRPSAADGNSCTCAEDESGVSKRQQIRTEVVQTGKRASN 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
+A + G ++ EA K +V KT S+ + P A+ V +AS +
Sbjct: 61 LA--AGLAGSILPVEAGKPLVVPKTVEHFTRPSLLPQHVSSPALPGKADSVNHASAIIK- 120
Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
E I DQ+ KAGI VNL SD N+KIP SD K+CMVDP R I+ VK
Sbjct: 121 ---EDVGVPIGDQIFKAGIGNVNLLSDIANYKIPLSDGTEVVGLPKSCMVDPTRPISGVK 180
Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
S +K +RRE++++V+ R + +P++S P G++Q ++ +
Sbjct: 181 SSNVKVIRREHLAKVYPRSADRVPINSSP-------------------GTKQASNDVAGK 240
Query: 241 KLVVFQNISSDKCDKRNL-PQRTRVHSNSFTSHF--HSIAQTTGSDFTNSSKNF-KKFPD 300
+S++ KR + PQR S + S +S+ + +S + F K +
Sbjct: 241 SFEAHDLLSNNVSGKRKIMPQRPYTDSTRYASGGCDYSVHSSDDRTIISSVEGFGKPSRE 300
Query: 301 NLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKR 360
+K AP N VVE+VS IL++ KWG AAEEA+ +DAYQANQ+LK+
Sbjct: 301 MMKVTPRTAPTPRQHCNPGYVVENVSSILRRFKWGHAAEEALHNFGFRMDAYQANQVLKQ 360
Query: 361 VDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVV 420
+D++A ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF INKLLD+M++DGC+PN V
Sbjct: 361 MDNYANALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTV 420
Query: 421 TYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKM 480
TYNR+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLD+AM MY++M
Sbjct: 421 TYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRM 480
Query: 481 QDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNY 540
Q+AGL+PDTFTYSV+INCLGKAGHL AAHRLFC MV +GC PNLVT+NIMIAL AKARNY
Sbjct: 481 QEAGLSPDTFTYSVIINCLGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNY 540
Query: 541 EIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLL 600
E ALKLYRDMQ +GF+PDKVTY IVMEVLGHCGFLEEAEG+F EMQ+KNWVPDEPVYGLL
Sbjct: 541 ETALKLYRDMQNAGFQPDKVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWVPDEPVYGLL 600
Query: 601 VDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGL 660
VDLWGK+GNV KAW+WY AML+AGL+PNVPTCNSLLS FLRVH++S+AY LLQSML GL
Sbjct: 601 VDLWGKAGNVDKAWQWYQAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGL 660
Query: 661 KPSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSK 720
PSLQTYTLLLSCCTDA++N DMGFC +LM V+GHPAH FL+ +P AGP+GQ VRDH+S
Sbjct: 661 HPSLQTYTLLLSCCTDARSNFDMGFCGQLMAVSGHPAHMFLLKMPPAGPDGQKVRDHVSN 720
Query: 721 FLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLI 780
FLD MHSEDRESKRGL+DAVVDFLHKSGLKEEAG VWE A KNVYPDA++EKS YWLI
Sbjct: 721 FLDFMHSEDRESKRGLMDAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLI 780
Query: 781 NLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDL 840
NLHVMS+GTAV ALSRTLAWFR+Q+L+SG PSRIDIVTGWGRRS+VTG+S+VRQAV++L
Sbjct: 781 NLHVMSEGTAVIALSRTLAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTGTSMVRQAVEEL 840
Query: 841 LSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
L+IF+FPFFTENGNSGCFVG GEPL WL +SYVERMHLL
Sbjct: 841 LNIFNFPFFTENGNSGCFVGSGEPLKNWLLESYVERMHLL 855
BLAST of CSPI01G02030 vs. TAIR10
Match:
AT1G18900.3 (AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 1062.0 bits (2745), Expect = 2.0e-310
Identity = 535/873 (61.28%), Postives = 665/873 (76.17%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
M+RAK I +LS++ARSFFL+GSR + DG SC +DE CVS+RQ R E ++K +
Sbjct: 1 MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGL-NT 120
+ VG ++ E K +V K D+ + Q ++ P + V YAS +
Sbjct: 61 ILPKPSVVGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVRE 120
Query: 121 VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGG-TFSSSKNCMVDPARSITS 180
++G+ +S I DQ+ KAGI+AVN SD N KIPS D G F K+CMVDP R I+S
Sbjct: 121 EVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPISS 180
Query: 181 VKPSKIKHLRRENISRVHSRPSV-EIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEA 240
VK S +K +RRE+ ++++ R + E V + SSN + ++ +VKG RQ VS +
Sbjct: 181 VKSSNVKAIRREHFAKIYPRSAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQ-VSNS 240
Query: 241 RTQKLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDN 300
K + N + K + ++ QR + SN F S F+NSS
Sbjct: 241 VVGKSLPTTNNTYGK--RTSVLQRPHIDSNRFVP----------SGFSNSSVEM------ 300
Query: 301 LKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRV 360
+K P+G A + + N+ ++VE+VS +L++ +WGPAAEEA+ L IDAYQANQ+LK++
Sbjct: 301 MKGPSGTALTSRQYCNSGHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQM 360
Query: 361 DDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVT 420
+D+ ALGFFYWLKR P F+HDGHTYTTM+G LGRAKQF AINKLLD+M++DGCQPN VT
Sbjct: 361 NDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVT 420
Query: 421 YNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQ 480
YNR+IHSYGRANYL +A+NVF QMQEAGC+PDRVTYCTLIDIHAK+GFLD+AM MY++MQ
Sbjct: 421 YNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQ 480
Query: 481 DAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYE 540
GL+PDTFTYSV+INCLGKAGHL AAH+LFC MVD+GC PNLVTYNIM+ L AKARNY+
Sbjct: 481 AGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAKARNYQ 540
Query: 541 IALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLV 600
ALKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE +F EMQ+KNW+PDEPVYGLLV
Sbjct: 541 NALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPVYGLLV 600
Query: 601 DLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLK 660
DLWGK+GNV+KAW+WY AML AGL+PNVPTCNSLLS FLRV+++++AY+LLQ+ML GL+
Sbjct: 601 DLWGKAGNVEKAWQWYQAMLHAGLRPNVPTCNSLLSTFLRVNKIAEAYELLQNMLALGLR 660
Query: 661 PSLQTYTLLLSCCTDAQTN-DMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKF 720
PSLQTYTLLLSCCTD ++ DMGFC +LM TGHPAH FL+ +P+AGP+G+NVR+H + F
Sbjct: 661 PSLQTYTLLLSCCTDGRSKLDMGFCGQLMASTGHPAHMFLLKMPAAGPDGENVRNHANNF 720
Query: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLIN 780
LDLMHSEDRESKRGLVDAVVDFLHKSG KEEAG VWE A QKNV+PDA++EKS YWLIN
Sbjct: 721 LDLMHSEDRESKRGLVDAVVDFLHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLIN 780
Query: 781 LHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLL 840
LHVMS+GTAVTALSRTLAWFR+Q+L SG PSRIDIVTGWGRRS+VTG+S+VRQAV++LL
Sbjct: 781 LHVMSEGTAVTALSRTLAWFRKQMLASGTCPSRIDIVTGWGRRSRVTGTSMVRQAVEELL 840
Query: 841 SIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYV 869
+IF PFFTE+GNSGCFVG GEPL+RWL QS++
Sbjct: 841 NIFGSPFFTESGNSGCFVGSGEPLNRWLLQSHL 854
BLAST of CSPI01G02030 vs. TAIR10
Match:
AT2G31400.1 (AT2G31400.1 genomes uncoupled 1)
HSP 1 Score: 192.6 bits (488), Expect = 1.0e-48
Identity = 131/530 (24.72%), Postives = 240/530 (45.28%), Query Frame = 1
Query: 374 RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDA 433
R D +Y T++ + + Q ++L QM PNVV+Y+ +I + +A +A
Sbjct: 369 RIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEA 428
Query: 434 VNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINC 493
+N+F +M+ G DRV+Y TL+ I+ K G + A+ + +M G+ D TY+ ++
Sbjct: 429 LNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGG 488
Query: 494 LGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPD 553
GK G + ++F M E +PNL+TY+ +I +K Y+ A++++R+ + +G D
Sbjct: 489 YGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRAD 548
Query: 554 KVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYH 613
V Y +++ L G + A + EM K+ P+ Y ++D +G+S + ++ ++ +
Sbjct: 549 VVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAFGRSATMDRSADYSN 608
Query: 614 AMLKAGLKPNVPTCNSLLSAFL-----RVHQLSDA-----------------------YQ 673
++P +S LSA RV QL +
Sbjct: 609 G-------GSLPFSSSALSALTETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILE 668
Query: 674 LLQSMLTFGLKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPN 733
+ + M +KP++ T++ +L+ C+ + D E +++ + + + L
Sbjct: 669 VFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMG--Q 728
Query: 734 GQNVRDHMSKFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAV 793
+NV D ++ D + +A+ D L G K A V + V+ +
Sbjct: 729 RENVWLQAQSLFDKVNEMDGSTASAFYNALTDMLWHFGQKRGAELVALEGRSRQVWENVW 788
Query: 794 KEKSSCYWLINLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGS 853
+ SC ++LH+MS G A + L R + P + I+TGWG+ SKV G
Sbjct: 789 SD--SC---LDLHLMSSGAARAMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGD 848
Query: 854 SLVRQAVQDLLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
+R+AV+ LL PF N G F G ++ WL +S ++ +L
Sbjct: 849 GALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVVATWLRESATLKLLIL 884
BLAST of CSPI01G02030 vs. TAIR10
Match:
AT1G79490.1 (AT1G79490.1 Pentatricopeptide repeat (PPR) superfamily protein)
HSP 1 Score: 186.8 bits (473), Expect = 5.7e-47
Identity = 121/441 (27.44%), Postives = 218/441 (49.43%), Query Frame = 1
Query: 378 DGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVF 437
DG TY +I L ++ + A KL QM + +P+ ++ ++ S G+A L ++ V+
Sbjct: 312 DGSTYELIIPSLAKSGRLDAAFKLFQQMKERKLRPSFSVFSSLVDSMGKAGRLDTSMKVY 371
Query: 438 KQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKA 497
+MQ G P + +LID +AK+G LD A+ ++++M+ +G P+ Y+++I K+
Sbjct: 372 MEMQGFGHRPSATMFVSLIDSYAKAGKLDTALRLWDEMKKSGFRPNFGLYTMIIESHAKS 431
Query: 498 GHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTY 557
G L A +F M G +P TY+ ++ + A + + A+K+Y M +G P +Y
Sbjct: 432 GKLEVAMTVFKDMEKAGFLPTPSTYSCLLEMHAGSGQVDSAMKIYNSMTNAGLRPGLSSY 491
Query: 558 CIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLK 617
++ +L + ++ A I +EM+ + D +L+ ++ K +V A +W M
Sbjct: 492 ISLLTLLANKRLVDVAGKILLEMKAMGYSVDVCASDVLM-IYIKDASVDLALKWLRFMGS 551
Query: 618 AGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLLSCCTDAQTNDM 677
+G+K N L + ++ A LL++++ K L YT +L+ Q D
Sbjct: 552 SGIKTNNFIIRQLFESCMKNGLYDSARPLLETLVHSAGKVDLVLYTSILAHLVRCQDEDK 611
Query: 678 -GFCCELMQVTGHPAHTFLVSLPSAGP--NGQNVRDHMSKFLDLMHSEDRE-SKRGLVDA 737
++ T H AH F+ L GP Q V + +F + E E + R V+
Sbjct: 612 ERQLMSILSATKHKAHAFMCGL-FTGPEQRKQPVLTFVREFYQGIDYELEEGAARYFVNV 671
Query: 738 VVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMSDGTAVTALSRTLA 797
++++L G A CVW+ A + ++P A+ W +++ +S G A+ A+ TL
Sbjct: 672 LLNYLVLMGQINRARCVWKVAYENKLFPKAIVFDQHIAWSLDVRNLSVGAALIAVVHTLH 731
Query: 798 WFRQQLLLSGVGPSRIDIVTG 815
FR+++L GV P RI +VTG
Sbjct: 732 RFRKRMLYYGVVPRRIKLVTG 750
BLAST of CSPI01G02030 vs. TAIR10
Match:
AT4G31850.1 (AT4G31850.1 proton gradient regulation 3)
HSP 1 Score: 179.9 bits (455), Expect = 6.9e-45
Identity = 113/395 (28.61%), Postives = 192/395 (48.61%), Query Frame = 1
Query: 274 SIAQTTGSDFTNSSKNFKKFPDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAA 333
S+ SDF+ S PD S +T + P+ S S
Sbjct: 60 SVVSMKSSDFSGSMIRKSSKPDLSSSEE----VTRGLKSFPDTDSSFSYF---------- 119
Query: 334 EEAIGKLNCSIDAYQANQILK--RVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGR 393
+ G LN N +L+ RVD + + + L + + D +TY T+ L
Sbjct: 120 KSVAGNLNLVHTTETCNYMLEALRVDGKLEEMAYVFDLMQKRIIKRDTNTYLTIFKSLSV 179
Query: 394 AKQFAAINKLLDQMIKDGCQPNVVTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVT 453
L +M + G N +YN +IH ++ + +A+ V+++M G P T
Sbjct: 180 KGGLKQAPYALRKMREFGFVLNAYSYNGLIHLLLKSRFCTEAMEVYRRMILEGFRPSLQT 239
Query: 454 YCTLIDIHAKSGFLDVAMGMYEKMQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMV 513
Y +L+ K +D MG+ ++M+ GL P+ +T+++ I LG+AG +N A+ + RM
Sbjct: 240 YSSLMVGLGKRRDIDSVMGLLKEMETLGLKPNVYTFTICIRVLGRAGKINEAYEILKRMD 299
Query: 514 DEGCVPNLVTYNIMIALQAKARNYEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLE 573
DEGC P++VTY ++I AR + A +++ M+ +PD+VTY +++ L+
Sbjct: 300 DEGCGPDVVTYTVLIDALCTARKLDCAKEVFEKMKTGRHKPDRVTYITLLDRFSDNRDLD 359
Query: 574 EAEGIFIEMQKKNWVPDEPVYGLLVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLL 633
+ + EM+K VPD + +LVD K+GN +A++ M G+ PN+ T N+L+
Sbjct: 360 SVKQFWSEMEKDGHVPDVVTFTILVDALCKAGNFGEAFDTLDVMRDQGILPNLHTYNTLI 419
Query: 634 SAFLRVHQLSDAYQLLQSMLTFGLKPSLQTYTLLL 667
LRVH+L DA +L +M + G+KP+ TY + +
Sbjct: 420 CGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFI 440
BLAST of CSPI01G02030 vs. NCBI nr
Match:
gi|449440748|ref|XP_004138146.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g18900 [Cucumis sativus])
HSP 1 Score: 1775.4 bits (4597), Expect = 0.0e+00
Identity = 874/874 (100.00%), Postives = 874/874 (100.00%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV
Sbjct: 1 MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
Query: 61 ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL
Sbjct: 61 ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP
Sbjct: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874
BLAST of CSPI01G02030 vs. NCBI nr
Match:
gi|659105772|ref|XP_008453170.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g18900 [Cucumis melo])
HSP 1 Score: 1741.1 bits (4508), Expect = 0.0e+00
Identity = 856/874 (97.94%), Postives = 865/874 (98.97%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSERQNARNETLPSQKPSTLV 60
MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVS+RQNARNETLPSQKPSTLV
Sbjct: 1 MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSQRQNARNETLPSQKPSTLV 60
Query: 61 ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTVL 120
ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQV NTGPNHQRGAECVRY+SGLNTVL
Sbjct: 61 ANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVTNTGPNHQRGAECVRYSSGLNTVL 120
Query: 121 DGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVKP 180
DGEC+SPRIADQVVKAGIMAVNLFSDFVNFKIP SDYGGTFSSSKNCMVDPARSITSVKP
Sbjct: 121 DGECSSPRIADQVVKAGIMAVNLFSDFVNFKIPLSDYGGTFSSSKNCMVDPARSITSVKP 180
Query: 181 SKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQK 240
SKIKHLRRENISRVHSRPSVE VDSKPQSSSNHGSNCKPAQSSYVKGSRQEVS+ARTQK
Sbjct: 181 SKIKHLRRENISRVHSRPSVETHVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSKARTQK 240
Query: 241 LVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDFTNSSKNFKKFPDNLKSP 300
VVFQ+ISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD T+SSKN KKFPDNLKSP
Sbjct: 241 SVVFQDISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDLTSSSKNLKKFPDNLKSP 300
Query: 301 TGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
TGMAPI SSFLN+PNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA
Sbjct: 301 TGMAPINSSFLNSPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDDHA 360
Query: 361 VALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
VALGFFYWLKRL RFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI
Sbjct: 361 VALGFFYWLKRLARFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYNRI 420
Query: 421 IHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
IHSYGRANYLQ+AVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL
Sbjct: 421 IHSYGRANYLQEAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDAGL 480
Query: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIALK 540
TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVD+GCVPNLVTYNIMIALQAKARNYEIALK
Sbjct: 481 TPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDQGCVPNLVTYNIMIALQAKARNYEIALK 540
Query: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG
Sbjct: 541 LYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDLWG 600
Query: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ
Sbjct: 601 KSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPSLQ 660
Query: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH
Sbjct: 661 TYTLLLSCCTDAQTNDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLDLMH 720
Query: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS
Sbjct: 721 SEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLHVMS 780
Query: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF
Sbjct: 781 DGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSIFSF 840
Query: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL
Sbjct: 841 PFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 874
BLAST of CSPI01G02030 vs. NCBI nr
Match:
gi|645268701|ref|XP_008239656.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g18900 [Prunus mume])
HSP 1 Score: 1230.3 bits (3182), Expect = 0.0e+00
Identity = 606/877 (69.10%), Postives = 723/877 (82.44%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
MLRAK I +LS+SARSFFL+G RC+A +G+SCTC EDETCVS+RQ RN L +Q PST+
Sbjct: 1 MLRAKHISNLSSSARSFFLNGPRCSATEGSSCTCSEDETCVSQRQQTRNGGLLAQTPSTM 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
V+ S G +I +A KV SHK ++V+ + +I+QV T P +E V Y++ + V
Sbjct: 61 VSKPSAGAGTIITGDAVKVASSHKAESVEHTTNIKQVT-TAPRSFGRSETVTYSNSTDAV 120
Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
+SP + DQ +AG+ AVN SD VN K+P SD G + +NCMVDP R ++ +K
Sbjct: 121 H----SSPLVVDQFARAGVAAVNFLSDIVNGKLPLSDGLGLLNLPQNCMVDPTRPLSGIK 180
Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
PS +K ++RE+ VH +PS E SK +S+NHGS K + S+VKG V RT+
Sbjct: 181 PSHVKQIKREHFISVHPKPSTETVAASK-HTSNNHGSKGKGEKPSFVKG-LNHVPYTRTE 240
Query: 241 KLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD-FTNSSKNFKKFPDNLK 300
VV SSD DKR++P++++ HSN+F ++ S QT+ ++ +K F + ++K
Sbjct: 241 NSVVAHTASSDTFDKRSMPRKSKGHSNNFIPNYSSNVQTSDAESMGRVTKGFNRRTRDMK 300
Query: 301 SPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDD 360
PTG+API F++ +VV++VS ILQQ++WGPAAE A+ LNCS+DAYQANQILK++ D
Sbjct: 301 MPTGIAPINRQFVHTGSVVQNVSHILQQMRWGPAAEAALVNLNCSMDAYQANQILKQLQD 360
Query: 361 HAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYN 420
H+VAL FFYWLKR F+HDGHTYTTM+G+LGR++QF AINKLLDQM+K+GCQPNVVTYN
Sbjct: 361 HSVALSFFYWLKRQAGFKHDGHTYTTMVGILGRSRQFGAINKLLDQMVKEGCQPNVVTYN 420
Query: 421 RIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDA 480
R+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLDVA+ +Y+ MQ+A
Sbjct: 421 RLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDVALRLYDGMQEA 480
Query: 481 GLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIA 540
GL+PDTFTYSVMINCLGKAGHL AAHRLFC MV++GCVPNLVTYNIMIALQAKARNYE A
Sbjct: 481 GLSPDTFTYSVMINCLGKAGHLAAAHRLFCEMVNQGCVPNLVTYNIMIALQAKARNYETA 540
Query: 541 LKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDL 600
LKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE IF EM++KNWVPDEPVYGLLVDL
Sbjct: 541 LKLYRDMQGAGFEPDKVTYSIVMEVLGHCGYLEEAEAIFGEMKRKNWVPDEPVYGLLVDL 600
Query: 601 WGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPS 660
WGK+GNV+KAW WY AML AGL+PNVPTCNSLLSAFLRVHQLSDAY LLQSM+ GL PS
Sbjct: 601 WGKAGNVRKAWNWYQAMLHAGLRPNVPTCNSLLSAFLRVHQLSDAYNLLQSMMGLGLNPS 660
Query: 661 LQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLD 720
LQTYTLLLSCCT+A++ DM FCCELM VTGHPAHTFL+S+PSAGP+GQNVR+HMS+FLD
Sbjct: 661 LQTYTLLLSCCTEARSPYDMDFCCELMAVTGHPAHTFLLSMPSAGPDGQNVREHMSRFLD 720
Query: 721 LMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLH 780
LMHSEDRESKRGLVDAVVDFLHKSGLKEEAG VWE A QKNVYPDA+KEKSSCYWLINLH
Sbjct: 721 LMHSEDRESKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAIKEKSSCYWLINLH 780
Query: 781 VMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSI 840
VMSDGTAVTALSRTLAWFRQQ+L+SG+ PSRIDIVTGWGRRS+VTGSSLVRQAV++LL++
Sbjct: 781 VMSDGTAVTALSRTLAWFRQQMLISGICPSRIDIVTGWGRRSRVTGSSLVRQAVEELLNM 840
Query: 841 FSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
FSFPFFTENGNSGCFVGCGEPL++WL QSYVERMHLL
Sbjct: 841 FSFPFFTENGNSGCFVGCGEPLNKWLLQSYVERMHLL 870
BLAST of CSPI01G02030 vs. NCBI nr
Match:
gi|595852784|ref|XP_007210374.1| (hypothetical protein PRUPE_ppa001256mg [Prunus persica])
HSP 1 Score: 1221.5 bits (3159), Expect = 0.0e+00
Identity = 603/877 (68.76%), Postives = 719/877 (81.98%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
MLRAK I +LS+SARSFFL+G RC+A +G+SCTC EDETCVS+RQ RN +Q PST+
Sbjct: 1 MLRAKHISNLSSSARSFFLNGPRCSATEGSSCTCSEDETCVSQRQQTRNGGPLAQTPSTM 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
V+ S G +I +A KV SHK ++V+ + +I+QV T P + V Y+S + V
Sbjct: 61 VSKPSAGAGTIITGDAVKVASSHKAESVEHTTNIKQVT-TAPRSFGRSATVTYSSSTDAV 120
Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
+SP + DQ +AG+ AVN SD VN K+P SD G + +NCMVDP R ++S+K
Sbjct: 121 H----SSPLVVDQFARAGVAAVNFLSDIVNGKLPLSDGLGLLNLPQNCMVDPTRPLSSIK 180
Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
PS +K ++RE+ VH +PS E SK +S+NHGS K + S+VKG V R +
Sbjct: 181 PSHVKQIKREHFISVHPKPSTETAAASK-HTSNNHGSKGKGEKPSFVKG-LNHVPYTRKE 240
Query: 241 KLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSD-FTNSSKNFKKFPDNLK 300
VV SSD DKR++P++++ HSN+F ++ S QT+ ++ +K F + ++K
Sbjct: 241 NSVVAHTASSDTFDKRSMPRKSKGHSNNFIPNYSSNVQTSDAESMGRVTKGFNRPTRDMK 300
Query: 301 SPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVDD 360
PTG+ PI F++ NVV++VS ILQQ++WGPAAE A+ LNCS+DAYQANQILK++ D
Sbjct: 301 MPTGITPINRQFVHTGNVVQNVSHILQQMRWGPAAEAALLNLNCSMDAYQANQILKQLQD 360
Query: 361 HAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTYN 420
H+VAL FFYWLKR F+HDGHTYTTM+G+LGR++QF AINKLL+QM+K+GCQPNVVTYN
Sbjct: 361 HSVALSFFYWLKRQAGFKHDGHTYTTMVGILGRSRQFGAINKLLNQMVKEGCQPNVVTYN 420
Query: 421 RIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEKMQDA 480
R+IHSYGRANYL++A+NVF QMQEAGCEPDRVTYCTLIDIHAK+GFLDVA+ +Y+ MQ+A
Sbjct: 421 RLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDVALRLYDGMQEA 480
Query: 481 GLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARNYEIA 540
GL+PDTFTYSVMINCLGKAGHL AAHRLFC MV++GCVPNLVTYNIMIALQAKARNYE A
Sbjct: 481 GLSPDTFTYSVMINCLGKAGHLAAAHRLFCEMVNQGCVPNLVTYNIMIALQAKARNYETA 540
Query: 541 LKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGLLVDL 600
LKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE IF EM++KNWVPDEPVYGLLVDL
Sbjct: 541 LKLYRDMQGAGFEPDKVTYSIVMEVLGHCGYLEEAEAIFGEMKRKNWVPDEPVYGLLVDL 600
Query: 601 WGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFGLKPS 660
WGK+GNV KAW WY AML AGL+PNVPTCNSLLSAFLRVHQLSDAY LLQSM+ GL PS
Sbjct: 601 WGKAGNVGKAWNWYQAMLHAGLRPNVPTCNSLLSAFLRVHQLSDAYNLLQSMMGLGLNPS 660
Query: 661 LQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMSKFLD 720
LQTYTLLLSCCT+A++ DM FCCELM VTGHPAHTFL+S+PSAGP+GQNVR+HMS+FLD
Sbjct: 661 LQTYTLLLSCCTEARSPYDMDFCCELMAVTGHPAHTFLLSMPSAGPDGQNVREHMSRFLD 720
Query: 721 LMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWLINLH 780
LMHSEDRESKRGLVDAVVDFLHKSGLKEEAG VWE A QKNVYPDA++EKSSCYWLINLH
Sbjct: 721 LMHSEDRESKRGLVDAVVDFLHKSGLKEEAGSVWEVAAQKNVYPDAIREKSSCYWLINLH 780
Query: 781 VMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQDLLSI 840
VMSDGTAVTALSRTLAWFRQQ+L+SG+ PSRIDIVTGWGRRS+VTGSSLVRQAV++LL++
Sbjct: 781 VMSDGTAVTALSRTLAWFRQQMLISGICPSRIDIVTGWGRRSRVTGSSLVRQAVEELLNM 840
Query: 841 FSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
FSFPFFTENGNSGCFVGCGEPL++WL QSYVERMHLL
Sbjct: 841 FSFPFFTENGNSGCFVGCGEPLNKWLLQSYVERMHLL 870
BLAST of CSPI01G02030 vs. NCBI nr
Match:
gi|694387210|ref|XP_009369368.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like [Pyrus x bretschneideri])
HSP 1 Score: 1201.0 bits (3106), Expect = 0.0e+00
Identity = 602/881 (68.33%), Postives = 716/881 (81.27%), Query Frame = 1
Query: 1 MLRAKQIGSLSNSARSFFLSGSRCNA-DGASCTCPEDETCVSERQNARNETLPSQKPSTL 60
MLRAKQ+ +LS+SARSFFL+G RC+A DG SC+C EDETCVS+RQ AR+ L +Q PST+
Sbjct: 1 MLRAKQLSNLSSSARSFFLTGPRCSATDGNSCSCSEDETCVSQRQLARDGGLLAQNPSTM 60
Query: 61 VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 120
V+ S R G ++ +A KV K +VD + SI QVA T P +E V YAS + V
Sbjct: 61 VSKPSARAGTILLGDAVKVAGPRKAGSVDHTASINQVA-TAPRSFGRSETVSYASSTDVV 120
Query: 121 LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 180
+SP +ADQ KAG+ AVN SD VN K+P SD G + NCMVDP R + +K
Sbjct: 121 H----SSPLVADQFAKAGVAAVNFLSDIVNGKLPLSDGLGILN---NCMVDPTRPLNGIK 180
Query: 181 PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 240
PS +K ++RE+ + +P+ E+ SKP +S+NHG K +S++VKG V RT+
Sbjct: 181 PSHVKQIKREHFTSGRPKPATEVSAASKP-TSNNHGPKGKGEKSNFVKGLNH-VPYTRTE 240
Query: 241 KLVVFQNISSDKCDKRNLPQRTRVHSNSFTSHFHSIAQTTGSDF----TNSSKNFKKFPD 300
+ SD ++R++P +++ HSN F ++ S AQT+ ++ T+++K+F +
Sbjct: 241 SSAAPHTVYSDHHEQRSMPPKSKPHSN-FIPNYSSSAQTSDAETMGHATHATKSFNRPIR 300
Query: 301 NLKSPTGMA-PITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILK 360
++K PTG+A I F + NVV +VS ILQQ +WGPAAE AI LNCS+DAYQANQILK
Sbjct: 301 DVKMPTGIARQINRQFAHTGNVVHNVSHILQQTRWGPAAEAAIENLNCSMDAYQANQILK 360
Query: 361 RVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNV 420
++ DH+VALGFFYWLKR F+HDGHTYTTM+G+LGR++QF AINKLL+QM+K+GCQPNV
Sbjct: 361 QLQDHSVALGFFYWLKRQAGFKHDGHTYTTMVGILGRSRQFGAINKLLNQMVKEGCQPNV 420
Query: 421 VTYNRIIHSYGRANYLQDAVNVFKQMQEAGCEPDRVTYCTLIDIHAKSGFLDVAMGMYEK 480
VTYNR+IHSYGRANYL++A+NVF QMQEAGCEPDRVTY TLIDIHAK+GFLDVA+G+Y+K
Sbjct: 421 VTYNRLIHSYGRANYLKEAMNVFNQMQEAGCEPDRVTYSTLIDIHAKAGFLDVALGLYDK 480
Query: 481 MQDAGLTPDTFTYSVMINCLGKAGHLNAAHRLFCRMVDEGCVPNLVTYNIMIALQAKARN 540
MQ+AGL PDTFTYSVMINCLGKAGHL+AAHRLFC MV+ GCVPNLVTYNIMIALQAKARN
Sbjct: 481 MQEAGLAPDTFTYSVMINCLGKAGHLDAAHRLFCEMVNHGCVPNLVTYNIMIALQAKARN 540
Query: 541 YEIALKLYRDMQQSGFEPDKVTYCIVMEVLGHCGFLEEAEGIFIEMQKKNWVPDEPVYGL 600
YE +LKLYRDMQ +GFEPDKVTY IVMEVLGHCG+LEEAE IF EM++KNWVPDEPVYGL
Sbjct: 541 YETSLKLYRDMQGAGFEPDKVTYSIVMEVLGHCGYLEEAEVIFHEMKRKNWVPDEPVYGL 600
Query: 601 LVDLWGKSGNVQKAWEWYHAMLKAGLKPNVPTCNSLLSAFLRVHQLSDAYQLLQSMLTFG 660
LVDLWGK+GN++KAW WY AML AGL+PNVPTCNSLLSAFLRVHQLSDAY LLQSM+ G
Sbjct: 601 LVDLWGKAGNIEKAWSWYQAMLHAGLRPNVPTCNSLLSAFLRVHQLSDAYNLLQSMMGLG 660
Query: 661 LKPSLQTYTLLLSCCTDAQT-NDMGFCCELMQVTGHPAHTFLVSLPSAGPNGQNVRDHMS 720
L PSLQTYTLLLSCCT+AQ+ DMGFC +LM VTGHPAH FL+S+PSAGP+GQNVR+HMS
Sbjct: 661 LNPSLQTYTLLLSCCTEAQSPYDMGFCGDLMAVTGHPAHKFLLSMPSAGPDGQNVREHMS 720
Query: 721 KFLDLMHSEDRESKRGLVDAVVDFLHKSGLKEEAGCVWEAAMQKNVYPDAVKEKSSCYWL 780
+FLD+MHSEDRESKRGLVDAVVDFLHK+GLKEEAG VWE A QKNVYPDA+KEKSSCYWL
Sbjct: 721 RFLDMMHSEDRESKRGLVDAVVDFLHKAGLKEEAGSVWEVAAQKNVYPDAIKEKSSCYWL 780
Query: 781 INLHVMSDGTAVTALSRTLAWFRQQLLLSGVGPSRIDIVTGWGRRSKVTGSSLVRQAVQD 840
INLHVMSDGTAV ALSRTLAWFRQQ+L+SG+ PSRIDIVTGWGRRS+VTGSSLVRQAVQ+
Sbjct: 781 INLHVMSDGTAVIALSRTLAWFRQQMLISGICPSRIDIVTGWGRRSRVTGSSLVRQAVQE 840
Query: 841 LLSIFSFPFFTENGNSGCFVGCGEPLSRWLHQSYVERMHLL 875
LL +F FPFFTENGNSGCFVGCGEPL+RWL QSYVERMHLL
Sbjct: 841 LLKVFRFPFFTENGNSGCFVGCGEPLNRWLLQSYVERMHLL 870
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
PPR49_ARATH | 0.0e+00 | 61.66 | Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana GN... | [more] |
PP123_ARATH | 1.7e-311 | 62.16 | Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN... | [more] |
PP178_ARATH | 1.8e-47 | 24.72 | Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop... | [more] |
PP132_ARATH | 1.0e-45 | 27.44 | Pentatricopeptide repeat-containing protein At1g79490, mitochondrial OS=Arabidop... | [more] |
PP344_ARATH | 1.2e-43 | 28.61 | Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LRL7_CUCSA | 0.0e+00 | 100.00 | Uncharacterized protein OS=Cucumis sativus GN=Csa_1G008500 PE=4 SV=1 | [more] |
M5WQI1_PRUPE | 0.0e+00 | 68.76 | Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001256mg PE=4 SV=1 | [more] |
W9SZB0_9ROSA | 0.0e+00 | 67.65 | Uncharacterized protein OS=Morus notabilis GN=L484_010090 PE=4 SV=1 | [more] |
A0A061FXD1_THECC | 0.0e+00 | 67.46 | Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_... | [more] |
B9R930_RICCO | 0.0e+00 | 66.82 | Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... | [more] |