CSPI01G01010 (gene) Wild cucumber (PI 183967)

NameCSPI01G01010
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationChr1 : 595182 .. 597827 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCCCTTCGGAAAAACCCTAAGAAAAACCCCAATTTGGAGCCCTTCTGCCGTCGTAGATCATTCAGGCAGCCGACGGGGGTTTCTCGGAGCAACTATGACGAGGAGAAACGCAATTTTCACTTCCCTCAGATTGGCAAATTCCTTCTTTTCGACTCGGTTTCGATATCCCCAGGTGACTCGGTTTTCACCTTCTTCTTATGTTTCTCATCAGTCCCTCGTCTCCCATTTCACAATCAACCATCCTGTTCTGTTTTTCTCGTCGAACCCCCAATCGCTTCTCCAGCTTGTTTCGACCAACGATTGGTCGGAAATGTTAGAAACTGAATTAGAGACTTTAAACCCTACGCTCACACACGAAACTGTTGTCTATGTTTTGAAAAGACTTGACAAACAACCCCAAAAGGCCTCTGAGTTCTTCAACTGGGCTTCTGGGAAGAATGGGTCTACTCAAAGTTCTTCCATTTATAGCATGTTGCTTAGGATTTTTGTTCAGAACGAGTCCATGAAATTGTTCTGGATTACGTTAAGGTTAATGAAAGAACGAGGGTTTTACCTAGATGAGGAAACTTATAAGACCATTTTAGGGGTTTTGAGAAAGTCGAAGAAGGCTGCAGATGCCACGGGTCTGACTCATTTTTACAATCGAATGCTTCAACAAAATGCCATGGACAGTGTGGTGCAGAAGGTGGTTGATATTGTTTTAGGATCTGATTGGAGTAATGATGTTCCTGGAAAACTTGAGGAGCTTGGTATTGCGTTATCAGATAACTTTGTAATTAGAGTGTTGAAGGAACTTCGAAATTCTCCATTGAAAGCCTTGAGTTTTTTTCATTGGGTTGGTTGTAGGCCGGATTATGATCATAACACAGTTTCATACAATGCAATTGCTAGGGTTCTTGGACGGGATGACTCGATCGAGGCTTTTTGGGGCGTGATTGAAGAAATGAAGCATGCTAATCATGAAATTGACATCGACACTTACATAAAGATCTCCAGGCAGTTTCAGAAGAGCAAGATGATGGGTGAGGCTGTCAAGCTTTATGAGCTTATGATGGATGGGCCATACAAGCCCTCGTTGCAGGATTGCAGCGTTCTTTTGCGGACCATCGCTGCTAGTGATAATCCAGATTTGAGCTTGGTTTACAGAGTGGCCAAGAAATTCGAGGCTACAGGGTACAGTCTCTCCAAAGCTATGTACGACGGAATCCATAGGTCATTGACAAGCACAGGAAAGTTTGATGATGCGGAGAATATCGTGAAGTCCATGAGAAATGCAGGATATGAACCTGATAATGTTACATACAGTCAGCTGGTATTTGGACTTTGCAAGGCTAGGAGACTTGAGGAAGCCCGTAAAGTGCTGGATGAGATGGAAGCACAAGGATGTATTCCTGATATCAAGACTTGGACTATTTTAATTCAAGGACATTGTAATGCCAATGAACTTGACATTGCTTTAGTTTGTTTTGCAAAGATGATAGAGAAGAACTGTGATCCAGATGCTGATCTTTTGGATGTGTTGATTAGTGGTTTCCTTAACCAGAAAAAGTTAAATGGTGCATATCAGTTGCTGATTGAGTTGACAAATAAGGCTCATGTAAGACCATGGCAGGCAACATACAAACAGTTAATTAAAAATCTTTTGGAAGTTAGGAAACTTGAGGAAGCCATTGCCCTCCTTCGTTTAATGAAGAAACAAAATTACCCACCTTTTCCAGAACCCTTTGTTCAATATATCTCCAAGTTTGGTACTGTGCAGGATGCTGATGATTTTCTGAAGGTTCTTAGTTCAAAAGAATATCCTTCCGTGTCTGCTTATCTTCATATTTTTAATTCATTTTTTAATGAAGGCAGATATTCTGAGGCCAAAGATCTGCTCTTTAAATGCCCACATCACATTCGAAAGCATAACGAAGTTTGTAAGCTCTTTGGATCTGCAGAAAGCAATACCACTGCTGCTACTCAATCTTCCTCCAATCCGATTGAAACTTAATCATCACATAGAACAAATATCAACTACACATCAGGTGCAGTTTCTTGTGACTGTGTTCCTACTTGTTTCTAGTCACGAGATTTGGTTCAGCCGTCAAGGATTAAATCAACAACAGACTGTTGAAGTATAGCTTTTCAATTTGCAAAAAACCAGTTGGCCAGTCGTTCTGACAGAAAAGAGGTGCTTCATGCTGTGGTCTTTACGCTTGTGATTCTTTGGAGCAGTTTGATTCATTGGTTCATTTACTCATCATAACTTATTTGCTTGCTAATATATGTACCACAATGACAAAATGCAAGTCTCTGGGTTTAGAAATGAGAGCAATAAAAGACTTGCCCTTCTTCCATTTTCTTTTAGAGCCAGTACTTCTGTTTACCATTTGTATTGCAATATGCCATCGTCTGATCAGGCAATGATATCTTCCTTCATCTTCATATTTGATGATAAACCCTGCATCGTGGAAGCAATGTGATGTTAATGCTTGAAGGGTTTGCCATTTATTTCTGTTCCTGTATGATTGCTTCTGCTTGTAGTTGCACTAATCAAGTTCTACAATAGCTTGAGAGAGGTCGATAAATATAATAAGCTGAAATTTATGGGCTTATGCTGTCACGAGAGCTCTCTGTAACTGTTTCTCTTTCAATTGATATCAG

mRNA sequence

ATGACGAGGAGAAACGCAATTTTCACTTCCCTCAGATTGGCAAATTCCTTCTTTTCGACTCGGTTTCGATATCCCCAGGTGACTCGGTTTTCACCTTCTTCTTATGTTTCTCATCAGTCCCTCGTCTCCCATTTCACAATCAACCATCCTGTTCTGTTTTTCTCGTCGAACCCCCAATCGCTTCTCCAGCTTGTTTCGACCAACGATTGGTCGGAAATGTTAGAAACTGAATTAGAGACTTTAAACCCTACGCTCACACACGAAACTGTTGTCTATGTTTTGAAAAGACTTGACAAACAACCCCAAAAGGCCTCTGAGTTCTTCAACTGGGCTTCTGGGAAGAATGGGTCTACTCAAAGTTCTTCCATTTATAGCATGTTGCTTAGGATTTTTGTTCAGAACGAGTCCATGAAATTGTTCTGGATTACGTTAAGGTTAATGAAAGAACGAGGGTTTTACCTAGATGAGGAAACTTATAAGACCATTTTAGGGGTTTTGAGAAAGTCGAAGAAGGCTGCAGATGCCACGGGTCTGACTCATTTTTACAATCGAATGCTTCAACAAAATGCCATGGACAGTGTGGTGCAGAAGGTGGTTGATATTGTTTTAGGATCTGATTGGAGTAATGATGTTCCTGGAAAACTTGAGGAGCTTGGTATTGCGTTATCAGATAACTTTGTAATTAGAGTGTTGAAGGAACTTCGAAATTCTCCATTGAAAGCCTTGAGTTTTTTTCATTGGGTTGGTTGTAGGCCGGATTATGATCATAACACAGTTTCATACAATGCAATTGCTAGGGTTCTTGGACGGGATGACTCGATCGAGGCTTTTTGGGGCGTGATTGAAGAAATGAAGCATGCTAATCATGAAATTGACATCGACACTTACATAAAGATCTCCAGGCAGTTTCAGAAGAGCAAGATGATGGGTGAGGCTGTCAAGCTTTATGAGCTTATGATGGATGGGCCATACAAGCCCTCGTTGCAGGATTGCAGCGTTCTTTTGCGGACCATCGCTGCTAGTGATAATCCAGATTTGAGCTTGGTTTACAGAGTGGCCAAGAAATTCGAGGCTACAGGGTACAGTCTCTCCAAAGCTATGTACGACGGAATCCATAGGTCATTGACAAGCACAGGAAAGTTTGATGATGCGGAGAATATCGTGAAGTCCATGAGAAATGCAGGATATGAACCTGATAATGTTACATACAGTCAGCTGGTATTTGGACTTTGCAAGGCTAGGAGACTTGAGGAAGCCCGTAAAGTGCTGGATGAGATGGAAGCACAAGGATGTATTCCTGATATCAAGACTTGGACTATTTTAATTCAAGGACATTGTAATGCCAATGAACTTGACATTGCTTTAGTTTGTTTTGCAAAGATGATAGAGAAGAACTGTGATCCAGATGCTGATCTTTTGGATGTGTTGATTAGTGGTTTCCTTAACCAGAAAAAGTTAAATGGTGCATATCAGTTGCTGATTGAGTTGACAAATAAGGCTCATGTAAGACCATGGCAGGCAACATACAAACAGTTAATTAAAAATCTTTTGGAAGTTAGGAAACTTGAGGAAGCCATTGCCCTCCTTCGTTTAATGAAGAAACAAAATTACCCACCTTTTCCAGAACCCTTTGTTCAATATATCTCCAAGTTTGGTACTGTGCAGGATGCTGATGATTTTCTGAAGGTTCTTAGTTCAAAAGAATATCCTTCCGTGTCTGCTTATCTTCATATTTTTAATTCATTTTTTAATGAAGGCAGATATTCTGAGGCCAAAGATCTGCTCTTTAAATGCCCACATCACATTCGAAAGCATAACGAAGTTTGTAAGCTCTTTGGATCTGCAGAAAGCAATACCACTGCTGCTACTCAATCTTCCTCCAATCCGATTGAAACTTAA

Coding sequence (CDS)

ATGACGAGGAGAAACGCAATTTTCACTTCCCTCAGATTGGCAAATTCCTTCTTTTCGACTCGGTTTCGATATCCCCAGGTGACTCGGTTTTCACCTTCTTCTTATGTTTCTCATCAGTCCCTCGTCTCCCATTTCACAATCAACCATCCTGTTCTGTTTTTCTCGTCGAACCCCCAATCGCTTCTCCAGCTTGTTTCGACCAACGATTGGTCGGAAATGTTAGAAACTGAATTAGAGACTTTAAACCCTACGCTCACACACGAAACTGTTGTCTATGTTTTGAAAAGACTTGACAAACAACCCCAAAAGGCCTCTGAGTTCTTCAACTGGGCTTCTGGGAAGAATGGGTCTACTCAAAGTTCTTCCATTTATAGCATGTTGCTTAGGATTTTTGTTCAGAACGAGTCCATGAAATTGTTCTGGATTACGTTAAGGTTAATGAAAGAACGAGGGTTTTACCTAGATGAGGAAACTTATAAGACCATTTTAGGGGTTTTGAGAAAGTCGAAGAAGGCTGCAGATGCCACGGGTCTGACTCATTTTTACAATCGAATGCTTCAACAAAATGCCATGGACAGTGTGGTGCAGAAGGTGGTTGATATTGTTTTAGGATCTGATTGGAGTAATGATGTTCCTGGAAAACTTGAGGAGCTTGGTATTGCGTTATCAGATAACTTTGTAATTAGAGTGTTGAAGGAACTTCGAAATTCTCCATTGAAAGCCTTGAGTTTTTTTCATTGGGTTGGTTGTAGGCCGGATTATGATCATAACACAGTTTCATACAATGCAATTGCTAGGGTTCTTGGACGGGATGACTCGATCGAGGCTTTTTGGGGCGTGATTGAAGAAATGAAGCATGCTAATCATGAAATTGACATCGACACTTACATAAAGATCTCCAGGCAGTTTCAGAAGAGCAAGATGATGGGTGAGGCTGTCAAGCTTTATGAGCTTATGATGGATGGGCCATACAAGCCCTCGTTGCAGGATTGCAGCGTTCTTTTGCGGACCATCGCTGCTAGTGATAATCCAGATTTGAGCTTGGTTTACAGAGTGGCCAAGAAATTCGAGGCTACAGGGTACAGTCTCTCCAAAGCTATGTACGACGGAATCCATAGGTCATTGACAAGCACAGGAAAGTTTGATGATGCGGAGAATATCGTGAAGTCCATGAGAAATGCAGGATATGAACCTGATAATGTTACATACAGTCAGCTGGTATTTGGACTTTGCAAGGCTAGGAGACTTGAGGAAGCCCGTAAAGTGCTGGATGAGATGGAAGCACAAGGATGTATTCCTGATATCAAGACTTGGACTATTTTAATTCAAGGACATTGTAATGCCAATGAACTTGACATTGCTTTAGTTTGTTTTGCAAAGATGATAGAGAAGAACTGTGATCCAGATGCTGATCTTTTGGATGTGTTGATTAGTGGTTTCCTTAACCAGAAAAAGTTAAATGGTGCATATCAGTTGCTGATTGAGTTGACAAATAAGGCTCATGTAAGACCATGGCAGGCAACATACAAACAGTTAATTAAAAATCTTTTGGAAGTTAGGAAACTTGAGGAAGCCATTGCCCTCCTTCGTTTAATGAAGAAACAAAATTACCCACCTTTTCCAGAACCCTTTGTTCAATATATCTCCAAGTTTGGTACTGTGCAGGATGCTGATGATTTTCTGAAGGTTCTTAGTTCAAAAGAATATCCTTCCGTGTCTGCTTATCTTCATATTTTTAATTCATTTTTTAATGAAGGCAGATATTCTGAGGCCAAAGATCTGCTCTTTAAATGCCCACATCACATTCGAAAGCATAACGAAGTTTGTAAGCTCTTTGGATCTGCAGAAAGCAATACCACTGCTGCTACTCAATCTTCCTCCAATCCGATTGAAACTTAA
BLAST of CSPI01G01010 vs. Swiss-Prot
Match: PP269_ARATH (Pentatricopeptide repeat-containing protein At3g48250, chloroplastic OS=Arabidopsis thaliana GN=At3g48250 PE=2 SV=1)

HSP 1 Score: 708.8 bits (1828), Expect = 5.5e-203
Identity = 361/622 (58.04%), Postives = 460/622 (73.95%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFRYPQVTRFSPSSYVSHQSLVSHFTINHPVLF----FSS 60
           M R  AI +SLR A S  STR  Y   ++   SS +S   L S   +    L+    FSS
Sbjct: 1   MYRSMAILSSLRHAYSQISTR-SYLSRSKVGFSSNLS-SPLDSFAIVPSRFLWKFRTFSS 60

Query: 61  NPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNG 120
            P S+LQLV  NDWS+ +E  L   + +LTHET +YVL++L+K P+KA  F +W    +G
Sbjct: 61  KPDSMLQLVLENDWSKEVEEGLRKPDMSLTHETAIYVLRKLEKYPEKAYYFLDWVLRDSG 120

Query: 121 STQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADAT 180
            + S+ +YS++LRI VQ  SMK FW+TLR MK+ GFYLDE+TYKTI G L K K  ADA 
Sbjct: 121 LSPSTPLYSIMLRILVQQRSMKRFWMTLREMKQGGFYLDEDTYKTIYGELSKEKSKADAV 180

Query: 181 GLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRN 240
            + HFY RML++NAM  V  +V  +V   DWS +V  +L+E+ + LSDNFVIRVLKELR 
Sbjct: 181 AVAHFYERMLKENAMSVVAGEVSAVVTKGDWSCEVERELQEMKLVLSDNFVIRVLKELRE 240

Query: 241 SPLKALSFFHWVG---CRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDI 300
            PLKAL+FFHWVG       Y H+TV+YNA  RVL R +S+  FW V++EMK A +++D+
Sbjct: 241 HPLKALAFFHWVGGGGSSSGYQHSTVTYNAALRVLARPNSVAEFWSVVDEMKTAGYDMDL 300

Query: 301 DTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVA 360
           DTYIK+SRQFQKS+MM E VKLYE MMDGP+KPS+QDCS+LLR ++ S NPDL LV+RV+
Sbjct: 301 DTYIKVSRQFQKSRMMAETVKLYEYMMDGPFKPSIQDCSLLLRYLSGSPNPDLDLVFRVS 360

Query: 361 KKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKA 420
           +K+E+TG SLSKA+YDGIHRSLTS G+FD+AE I K+MRNAGYEPDN+TYSQLVFGLCKA
Sbjct: 361 RKYESTGKSLSKAVYDGIHRSLTSVGRFDEAEEITKAMRNAGYEPDNITYSQLVFGLCKA 420

Query: 421 RRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLL 480
           +RLEEAR VLD+MEAQGC PDIKTWTILIQGHC  NELD AL CFA M+EK  D D++LL
Sbjct: 421 KRLEEARGVLDQMEAQGCFPDIKTWTILIQGHCKNNELDKALACFANMLEKGFDIDSNLL 480

Query: 481 DVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMK 540
           DVLI GF+   K  GA   L+E+   A+V+PWQ+TYK LI  LL+++K EEA+ LL++MK
Sbjct: 481 DVLIDGFVIHNKFEGASIFLMEMVKNANVKPWQSTYKLLIDKLLKIKKSEEALDLLQMMK 540

Query: 541 KQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKD 600
           KQNYP + E F  Y++KFGT++DA  FL VLSSK+ PS +AY H+  +F+ EGR ++AK+
Sbjct: 541 KQNYPAYAEAFDGYLAKFGTLEDAKKFLDVLSSKDSPSFAAYFHVIEAFYREGRLTDAKN 600

Query: 601 LLFKCPHHIRKHNEVCKLFGSA 616
           LLF CPHH + H ++ +LFG+A
Sbjct: 601 LLFICPHHFKTHPKISELFGAA 620

BLAST of CSPI01G01010 vs. Swiss-Prot
Match: PP208_ARATH (Pentatricopeptide repeat-containing protein At3g02490, mitochondrial OS=Arabidopsis thaliana GN=At3g02490 PE=2 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 3.0e-84
Identity = 179/562 (31.85%), Postives = 301/562 (53.56%), Query Frame = 1

Query: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120
           ++ + S  +  + +  EL++ +  ++HE  + VL+ L+  P  A  FF W         S
Sbjct: 84  VIDVFSRLNGKDEITKELDSNDVVISHELALRVLRELESSPDVAGRFFKWGLEAYPQKLS 143

Query: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180
           S  Y+ +LRIF  N  +  FW  +  MK++G  +       +    +K     D   L  
Sbjct: 144 SKSYNTMLRIFGVNGLVDEFWRLVDDMKKKGHGVSANVRDRVGDKFKKDGLENDLERLKE 203

Query: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240
            +      N++D V  +V  IV+   W  DV  +L +L +    + V  VL++L   P K
Sbjct: 204 LFASGSMDNSVDKVCNRVCKIVMKEVWGADVEKQLRDLKLEFKSDVVKMVLEKLDVDPRK 263

Query: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300
           AL FF W+     + H+  +YNA+ARVLG++  ++ F  +IEE++ A +E++++TY+++S
Sbjct: 264 ALLFFRWIDESGSFKHDEKTYNAMARVLGKEKFLDRFQHMIEEIRSAGYEMEMETYVRVS 323

Query: 301 RQFQKSKMMGEAVKLYELMMDGPYK--PSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEA 360
            +F ++KM+ EAV+L+E  M G     P+   CS+LL+ I  +   D+ L  R  K +  
Sbjct: 324 ARFCQTKMIKEAVELFEFAMAGSISNTPTPHCCSLLLKKIVTAKKLDMDLFTRTLKAYTG 383

Query: 361 TGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEE 420
            G  +   M   + +SL S  +F  +  ++K+M   GY P     S +  GL +  + +E
Sbjct: 384 NGNVVPDVMLQHVLKSLRSVDRFGQSNEVLKAMNEGGYVPSGDLQSVIASGLSRKGKKDE 443

Query: 421 ARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDAD-LLDVLI 480
           A ++++ MEA G   D K    L++GHC+A +L+ A  CF KMI K     A    + L+
Sbjct: 444 ANELVNFMEASGNHLDDKAMASLVEGHCDAKDLEEASECFKKMIGKEGVSYAGYAFEKLV 503

Query: 481 SGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVR-----KLEEAIALLRLM 540
             + N  +    Y+L  EL  +  ++PW +TYK +++NLL  +       EEA++LL +M
Sbjct: 504 LAYCNSFQARDVYKLFSELVKQNQLKPWHSTYKIMVRNLLMKKVARDGGFEEALSLLPMM 563

Query: 541 KKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAK 600
           +   +PPF +PF+ Y+S  GT  +A  FLK ++SK++PS S  L +F +     R+SEA+
Sbjct: 564 RNHGFPPFVDPFMDYLSNSGTSAEAFAFLKAVTSKKFPSNSMVLRVFEAMLKSARHSEAQ 623

Query: 601 DLLFKCPHHIRKHNEVCKLFGS 615
           DLL   P +IR++ EV +LF +
Sbjct: 624 DLLSMSPSYIRRNAEVLELFNT 645

BLAST of CSPI01G01010 vs. Swiss-Prot
Match: PP387_ARATH (Pentatricopeptide repeat-containing protein At5g15980, mitochondrial OS=Arabidopsis thaliana GN=At5g15980 PE=2 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 5.7e-83
Identity = 173/576 (30.03%), Postives = 306/576 (53.12%), Query Frame = 1

Query: 55  SSNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGK 114
           SS   +++ + S     + +  ELE+    ++ +  + VL++L+  P  A  FF W    
Sbjct: 84  SSAEATVIDIFSRLSGEDEIRKELESSGVVISQDLALKVLRKLESNPDVAKSFFQWIKEA 143

Query: 115 NGSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAAD 174
           +    SS  Y+M+LRI   N  +  FW  + +MK++G  L       +    +K    +D
Sbjct: 144 SPEELSSKNYNMMLRILGGNGLVDEFWGLVDVMKKKGHGLSANVRDKVGDKFQKDGLESD 203

Query: 175 ATGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKEL 234
              L   +      N+ ++V  +V  IV+  +W +DV  ++ +L +    + V  +++ L
Sbjct: 204 LLRLRKLFTSDCLDNSAENVCDRVCKIVMKEEWGDDVEKRVRDLNVEFKSDLVKMIVERL 263

Query: 235 RNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDID 294
              P KAL FF W+     + H+  +YNA+ARVLG++  ++ F  ++ EM+ A +E++I+
Sbjct: 264 DVEPRKALLFFRWIDESDLFKHDEKTYNAMARVLGKEKFLDRFQNIVVEMRSAGYEVEIE 323

Query: 295 TYIKISRQFQKSKMMGEAVKLYELMMDG---PYKPSLQDCSVLLRTIAASDNPDLSLVYR 354
           TY+++S +F ++K++ EAV L+E+ M G      P+     +LL+ I  +   D+ L  R
Sbjct: 324 TYVRVSTRFCQTKLIKEAVDLFEIAMAGSSSSNNPTPHCFCLLLKKIVTAKILDMDLFSR 383

Query: 355 VAKKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLC 414
             K +   G +L+ ++   + +SL S  + + +  ++K M+  GY P     S +   L 
Sbjct: 384 AVKVYTKNGNALTDSLLKSVLKSLRSVDRVEQSNELLKEMKRGGYVPSGDMQSMIASSLS 443

Query: 415 KARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDAD 474
           +  + +EA + +D ME+ G   D K    L++G+C++  LD ALVCF KM+       AD
Sbjct: 444 RKGKKDEADEFVDFMESSGNNLDDKAMASLVEGYCDSGNLDEALVCFEKMVGNTGVSYAD 503

Query: 475 L-LDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVR-----KLEEA 534
              + L+  + N+ ++  AY+LL     K  ++P  +TYK L+ NLL  +       EEA
Sbjct: 504 YSFEKLVLAYCNKNQVRDAYKLLSAQVTKNQLKPRHSTYKSLVTNLLTKKIARDGGFEEA 563

Query: 535 IALLRLMKKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNE 594
           ++LL +MK   +PPF +PF+ Y S  G   +A  FLK ++S  +P +S  L +F +    
Sbjct: 564 LSLLPIMKDHGFPPFIDPFMSYFSSTGKSTEALGFLKAMTSNNFPYISVVLRVFETMMKS 623

Query: 595 GRYSEAKDLLFKCPHHIRKHNEVCKLFGSAESNTTA 622
            R+SEA+DLL  CP++IR + +V +LF + + N +A
Sbjct: 624 ARHSEAQDLLSLCPNYIRNNPDVLELFNTMKPNESA 659

BLAST of CSPI01G01010 vs. Swiss-Prot
Match: PP293_ARATH (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 1.9e-30
Identity = 96/402 (23.88%), Postives = 192/402 (47.76%), Query Frame = 1

Query: 195 VQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLKALSFFHWVGCRPDY 254
           V KV+D +   D   ++   L+E+ + LS + ++ VL+  R++   A  FF W   R  +
Sbjct: 134 VCKVIDELFALD--RNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGF 193

Query: 255 DHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKISRQFQKSKMMGEAVK 314
            H++ +YN++  +L +    E    V+EEM      + ++T+    + F  +K   +AV 
Sbjct: 194 AHDSRTYNSMMSILAKTRQFETMVSVLEEMG-TKGLLTMETFTIAMKAFAAAKERKKAVG 253

Query: 315 LYELMMDGPYKPSLQDCSVLLRTIA-ASDNPDLSLVYRVAKKFEATGYSLSKAMYDGIHR 374
           ++ELM    +K  ++  + LL ++  A    +  +++   K+     ++ +   Y  +  
Sbjct: 254 IFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKE----RFTPNMMTYTVLLN 313

Query: 375 SLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEARKVLDEMEAQGCIP 434
                    +A  I   M + G +PD V ++ ++ GL ++R+  +A K+   M+++G  P
Sbjct: 314 GWCRVRNLIEAARIWNDMIDQGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCP 373

Query: 435 DIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGFLNQKKLNGAYQLL 494
           +++++TI+I+  C  + ++ A+  F  M++    PDA +   LI+GF  QKKL+  Y+LL
Sbjct: 374 NVRSYTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELL 433

Query: 495 IELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPFPEPFVQYISKFGT 554
            E+  K H  P   TY  LIK +   +  E A  +   M +    P    F   +  +  
Sbjct: 434 KEMQEKGH-PPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFM 493

Query: 555 VQDAD----DFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEA 592
            ++ +     + +++     P  ++Y  +      EG+  EA
Sbjct: 494 ARNYEMGRAVWEEMIKKGICPDDNSYTVLIRGLIGEGKSREA 527

BLAST of CSPI01G01010 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 2.5e-30
Identity = 93/386 (24.09%), Postives = 177/386 (45.85%), Query Frame = 1

Query: 215 LEELGIALSDNFVIRVLKELRNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSI 274
           L E G+ L    + RVL    ++      FF W   +P Y H+   Y ++ ++L +    
Sbjct: 104 LNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQF 163

Query: 275 EAFWGVIEEMKHANHE-IDIDTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSV 334
            A WG+IEEM+  N + I+ + ++ + ++F  + M+ +A+++ + M    ++P       
Sbjct: 164 GAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFGC 223

Query: 335 LLRTIAASDNPDLSLVYRVAKKFE--ATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSM 394
           LL  +    +     V   AK FE     + ++   +  +       GK  +A+ ++  M
Sbjct: 224 LLDALCKHGS-----VKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQM 283

Query: 395 RNAGYEPDNVTYSQLVFGLCKARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANEL 454
             AG+EPD V Y+ L+ G   A ++ +A  +L +M  +G  P+   +T+LIQ  C  + +
Sbjct: 284 NEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRM 343

Query: 455 DIALVCFAKMIEKNCDPDADLLDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQ 514
           + A+  F +M    C+ D      L+SGF    K++  Y +L ++  K  + P + TY  
Sbjct: 344 EEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKG-LMPSELTYMH 403

Query: 515 LIKNLLEVRKLEEAIALLRLMKKQNYPP---FPEPFVQYISKFGTVQDADDFLKVLSSKE 574
           ++    +    EE + L+  M++  Y P        ++   K G V++A      +    
Sbjct: 404 IMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENG 463

Query: 575 Y-PSVSAYLHIFNSFFNEGRYSEAKD 594
             P V  ++ + N   ++G   EA D
Sbjct: 464 LSPGVDTFVIMINGLASQGCLLEASD 483

BLAST of CSPI01G01010 vs. TrEMBL
Match: A0A0A0LNT7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G003530 PE=4 SV=1)

HSP 1 Score: 1255.7 bits (3248), Expect = 0.0e+00
Identity = 631/632 (99.84%), Postives = 631/632 (99.84%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFRYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS 60
           MTRRNAIFTSLRLANSFFSTR RYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS
Sbjct: 1   MTRRNAIFTSLRLANSFFSTRSRYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS 60

Query: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120
           LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS
Sbjct: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120

Query: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180
           SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH
Sbjct: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180

Query: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240
           FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK
Sbjct: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240

Query: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300
           ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS
Sbjct: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300

Query: 301 RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG 360
           RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG
Sbjct: 301 RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG 360

Query: 361 YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR 420
           YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR
Sbjct: 361 YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR 420

Query: 421 KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF 480
           KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF
Sbjct: 421 KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF 480

Query: 481 LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF 540
           LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF
Sbjct: 481 LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF 540

Query: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600
           PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH
Sbjct: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600

Query: 601 HIRKHNEVCKLFGSAESNTTAATQSSSNPIET 633
           HIRKHNEVCKLFGSAESNTTAATQSSSNPIET
Sbjct: 601 HIRKHNEVCKLFGSAESNTTAATQSSSNPIET 632

BLAST of CSPI01G01010 vs. TrEMBL
Match: B9RAT8_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1508610 PE=4 SV=1)

HSP 1 Score: 830.5 bits (2144), Expect = 1.4e-237
Identity = 411/627 (65.55%), Postives = 498/627 (79.43%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFR-----YPQVTRFSPSSYVSHQSLVSHFTINHPVLFFS 60
           M R   I  SLRL+N   STR         QVT F P       S  S +   H  L+ S
Sbjct: 1   MNRARTILVSLRLSNFLLSTRISTTRPFLTQVTHFFPCFLSREHSYTSDYVNIHKKLYSS 60

Query: 61  SNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKN 120
           S P SL++L+S NDWS  LET+LE  +P LTHETV+YVLK+LDK P KA +FFNW   +N
Sbjct: 61  SKPSSLVELLSVNDWSPELETQLENSSPLLTHETVIYVLKKLDKDPHKAWDFFNWVCDRN 120

Query: 121 GSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADA 180
           G   SS +YS++LRI V+ +SMK FWITLR MKE+GFY DEETY TILGV RK +  +DA
Sbjct: 121 GFKPSSPLYSLMLRILVKKDSMKNFWITLRKMKEQGFYTDEETYLTILGVFRKERMDSDA 180

Query: 181 TGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELR 240
               HF++RM+++NAMDSVV+ VV +V  ++WSN+V  +LE +GI L+DNFVIRVLKELR
Sbjct: 181 VAFKHFFDRMVEENAMDSVVKNVVSVVSATEWSNEVEKELEGMGILLTDNFVIRVLKELR 240

Query: 241 NSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDT 300
           N PLKAL FF+W G    Y+ NT++YNAIARVLGRDDSI  FW V+EEMK+A HE+DIDT
Sbjct: 241 NYPLKALQFFNWAGKCERYECNTITYNAIARVLGRDDSIGEFWSVVEEMKNAGHEMDIDT 300

Query: 301 YIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKK 360
           YIKISRQFQK+K+MG+AVKLYE MMDGP+KPS+QDCS+LLR+I+AS+ PDL+LV+RV  K
Sbjct: 301 YIKISRQFQKNKLMGDAVKLYEFMMDGPFKPSVQDCSMLLRSISASNYPDLNLVFRVVNK 360

Query: 361 FEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARR 420
           +EATG SLSKA+YDGIHRSLTS G FD+A  ++K M+ AGYEPDN+TYSQLVFGLCKARR
Sbjct: 361 YEATGNSLSKAVYDGIHRSLTSIGNFDEAAKMMKCMQTAGYEPDNITYSQLVFGLCKARR 420

Query: 421 LEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDV 480
           LEEA +VLDEMEA GC+PDIKTWTILIQGHC ANE+  AL+C AKM+EK+CDPDADLL V
Sbjct: 421 LEEACEVLDEMEAHGCLPDIKTWTILIQGHCVANEVGKALMCLAKMMEKHCDPDADLLAV 480

Query: 481 LISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQ 540
           LI+ FL+QK+++GAY L +++ +KA +RPWQATYK LI+ LLEVRKLEEA+ LLRLMK+ 
Sbjct: 481 LINAFLSQKRIDGAYTLFMDMVDKARLRPWQATYKLLIEKLLEVRKLEEALNLLRLMKQH 540

Query: 541 NYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLL 600
           N+PPFPEPFVQYIS+FGTV DA DFLK LS KEYPS SAYL++F SFF  GR+SEAKDLL
Sbjct: 541 NHPPFPEPFVQYISRFGTVDDAADFLKALSVKEYPSTSAYLNVFQSFFRAGRHSEAKDLL 600

Query: 601 FKCPHHIRKHNEVCKLFGSAESNTTAA 623
           FKCPHHIRKH ++ +LFGSA++    A
Sbjct: 601 FKCPHHIRKHPKISELFGSAKTEDATA 627

BLAST of CSPI01G01010 vs. TrEMBL
Match: A0A067KGQ3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11400 PE=4 SV=1)

HSP 1 Score: 821.6 bits (2121), Expect = 6.5e-235
Identity = 402/624 (64.42%), Postives = 495/624 (79.33%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFR-----YPQVTRFSP-SSYVSHQSLVSHFTINHPVLFF 60
           M R   I  SLRL+NS  STR         QV RFSP S Y   +   + F  +H +++F
Sbjct: 1   MNRARTILASLRLSNSLLSTRLSTTRPLLTQVIRFSPLSPYFPSEQSHTCFLNSHQMVYF 60

Query: 61  SSNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGK 120
           SS P SL++L+  NDWS  LE+ELET NP LTHETVVYVL++LDK P KA +FFNW S +
Sbjct: 61  SSKPGSLVELLLGNDWSTELESELETSNPRLTHETVVYVLRKLDKYPDKAWDFFNWVSER 120

Query: 121 NGSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAAD 180
           N    SS +YS++LR+ V+ + MK FWITLR MKE+GFY+DEETY TI  + RK K  +D
Sbjct: 121 NEFKLSSPLYSLMLRVLVKKDYMKKFWITLRKMKEQGFYIDEETYLTISAIFRKEKMDSD 180

Query: 181 ATGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKEL 240
                HF++RM+++NAMDS+V+ VV ++L  +W+N+V  +L  +GI L+DNFVI+VLKEL
Sbjct: 181 LVAFKHFFDRMVKENAMDSIVKNVVTVILKMEWNNEVEEELRSMGIILTDNFVIKVLKEL 240

Query: 241 RNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDID 300
           RN PLKA+ FFHW G    Y+ NTV+YNAIARV+ RDDSI  FW V+EEMK+A HE+DID
Sbjct: 241 RNYPLKAMLFFHWAGKCEGYECNTVTYNAIARVIARDDSIREFWSVVEEMKNAGHEMDID 300

Query: 301 TYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAK 360
           TYIKISRQFQK K+M +AVKLYE MMDGP+KPS+QDCS LL++I+ASD PDL+LV+RVAK
Sbjct: 301 TYIKISRQFQKMKLMEDAVKLYEFMMDGPFKPSIQDCSYLLKSISASDKPDLNLVFRVAK 360

Query: 361 KFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKAR 420
           K+E  G SLSKA+YDGIHRSLTS G FD+A NI+K M+NAG+EPDN++YSQLVFGLCKAR
Sbjct: 361 KYEVMGSSLSKAVYDGIHRSLTSAGHFDEAANIIKVMKNAGFEPDNISYSQLVFGLCKAR 420

Query: 421 RLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLD 480
           RLEEA +VLDEME  GC+PD+KTWTILIQGHC AN++D AL+CFAKM+EKNC+ DADLLD
Sbjct: 421 RLEEACEVLDEMETNGCVPDVKTWTILIQGHCAANQVDKALMCFAKMVEKNCNADADLLD 480

Query: 481 VLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKK 540
           +LI+ FL QK++ GAY LL+E+ NK H+RPWQATYK LI+ LL  RKLEEA+ LLRLMK+
Sbjct: 481 ILINAFLGQKRIEGAYTLLVEMVNKVHLRPWQATYKLLIEKLLGERKLEEAMDLLRLMKQ 540

Query: 541 QNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDL 600
            N+PPF  PFVQYISKFGTV+DA DFLK LS KEYPS SAY ++F SFF EGR+SEAKDL
Sbjct: 541 HNHPPFSGPFVQYISKFGTVEDAADFLKALSVKEYPSTSAYFNVFQSFFKEGRHSEAKDL 600

Query: 601 LFKCPHHIRKHNEVCKLFGSAESN 619
           L+KCPHHIRKH ++ +LFGSA S+
Sbjct: 601 LYKCPHHIRKHPKISELFGSARSS 624

BLAST of CSPI01G01010 vs. TrEMBL
Match: F6GTR7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09330 PE=4 SV=1)

HSP 1 Score: 805.4 bits (2079), Expect = 4.8e-230
Identity = 396/617 (64.18%), Postives = 488/617 (79.09%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFRYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS 60
           M R  AI  S+R  NSF ST+FR     R S   Y                      P S
Sbjct: 1   MNRAKAILFSIRFTNSFRSTQFR-----RASSLCY---------------------QPNS 60

Query: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120
           +++LV  NDWS+ LE+ELE  +  LTHETV+YVLK+LDK PQ+   FFNW + KNG   S
Sbjct: 61  IVELVLENDWSDELESELEKSSSVLTHETVIYVLKKLDKDPQRTWNFFNWVTEKNGFRPS 120

Query: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180
           S++YS++LR  V  ESMK FW+T+R MKE+GF +D+ETY TILGV +K K A++   LTH
Sbjct: 121 SAMYSLILRSLVHGESMKQFWVTIRKMKEQGFCIDKETYLTILGVFKKGKMASEEVALTH 180

Query: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240
           FYNRM+Q+NAMD VV+KVV++V  S WS++V  KL EL  + SDNFV+ VL+ELR  PLK
Sbjct: 181 FYNRMVQENAMDEVVKKVVELVTMSVWSSEVEKKLGELKNSFSDNFVLHVLRELRGYPLK 240

Query: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300
           AL FF WVG  P Y+H++++YN IARVLGRDDSI  FW ++EEMK   HE+DIDTYIKIS
Sbjct: 241 ALRFFQWVGECPGYEHSSITYNVIARVLGRDDSIGEFWSMVEEMKSKGHEMDIDTYIKIS 300

Query: 301 RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG 360
           RQFQK+KM+ +AVKLYE+MMDGPYKPS+QDC++LLR+I+ S NPDL+LV+RV +K+EA G
Sbjct: 301 RQFQKNKMLEDAVKLYEIMMDGPYKPSVQDCTMLLRSISLSSNPDLALVFRVTEKYEAVG 360

Query: 361 YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR 420
            SL KA+YDGIHRSLTS G+FD+A  I++SMR+AG EPDN+TYSQLV+GLCKAR+LEEA 
Sbjct: 361 NSLCKAVYDGIHRSLTSVGRFDEAGKIMESMRSAGCEPDNITYSQLVYGLCKARKLEEAC 420

Query: 421 KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF 480
           K+LDEMEA GC+PDIKTWTILIQGHC A E+D AL+CFAKM+EKNCD DADLL+VLI+GF
Sbjct: 421 KLLDEMEACGCVPDIKTWTILIQGHCAAKEVDKALICFAKMMEKNCDADADLLEVLINGF 480

Query: 481 LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF 540
           L+QK+++GAY+LL+E+ N AH+ PWQATYK +I  LL VRKLEEAI LL LMKKQNYPPF
Sbjct: 481 LSQKRIDGAYKLLVEMVNTAHLVPWQATYKLMINKLLGVRKLEEAINLLHLMKKQNYPPF 540

Query: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600
           PEPF++YISKFGTV+DA +FL  LS+K+YPS SAY+H+F SFF EGR SEAKDLL+KCPH
Sbjct: 541 PEPFIEYISKFGTVEDAGEFLNALSAKKYPSQSAYVHVFESFFQEGRESEAKDLLYKCPH 591

Query: 601 HIRKHNEVCKLFGSAES 618
           HIRKH ++CKLFGSA+S
Sbjct: 601 HIRKHPDICKLFGSAKS 591

BLAST of CSPI01G01010 vs. TrEMBL
Match: M5WD91_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002676mg PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 1.2e-228
Identity = 396/608 (65.13%), Postives = 474/608 (77.96%), Query Frame = 1

Query: 16  SFFSTRFRYPQVTRFSPSSYV-SHQSLVSHFTINHPVLFFSSNPQSLLQLVSTNDWSEML 75
           S+ +TR    QVT+F   S+  S QS   HF   H  LFFSS P S+LQLV  N WS  L
Sbjct: 39  SYVTTRPICSQVTQFPHLSHFHSDQSCGFHFLNTHQSLFFSSAPNSVLQLVLANQWSAEL 98

Query: 76  ETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQSSSIYSMLLRIFVQN 135
           E EL    P+LTH+ V+YVLK+LDK P+KA +FFNW   KNG   SS +++++L +    
Sbjct: 99  ENELSESYPSLTHDAVIYVLKKLDKDPKKAWDFFNWVCEKNGFKPSSLVFNLMLGVLGHK 158

Query: 136 ESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTHFYNRMLQQNAMDSV 195
           +SMK FWITLR MKE+GFY++ +TY  I   L+K K   D     HF+ RM++ NA D V
Sbjct: 159 DSMKQFWITLRQMKEQGFYIEVQTYSAIAERLKKGKMDNDVVAFKHFFERMMKDNATDEV 218

Query: 196 VQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLKALSFFHWVGCRPDY 255
            +KV D+V GS+WS  +  +L EL I LSDNFV+RVLKELR  P KALSFFHWVG    Y
Sbjct: 219 AKKVADVVSGSEWSAGIEKELGELKITLSDNFVVRVLKELRICPSKALSFFHWVGQSSGY 278

Query: 256 DHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKISRQFQKSKMMGEAVK 315
           +HNT++YNA+AR+L + DSI  FW VIEEMK A HE+D+DTYIKI+RQFQKSKMM +AVK
Sbjct: 279 EHNTITYNAVARILAQADSIGEFWSVIEEMKGAGHELDLDTYIKITRQFQKSKMMEDAVK 338

Query: 316 LYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATGYSLSKAMYDGIHRS 375
           LYELMMDGPYKPS QDCS+LLR+I+A+D PDL +V+RVAKKFE+ G +LSKA+YDGIHRS
Sbjct: 339 LYELMMDGPYKPSAQDCSMLLRSISANDKPDLDMVFRVAKKFESAGNTLSKAVYDGIHRS 398

Query: 376 LTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEARKVLDEMEAQGCIPD 435
           LTS G FD+AE I K MRNAGYEPDN+TYSQLVFGLCKA+RLEEA KVLDEMEA GC+PD
Sbjct: 399 LTSAGSFDEAEKITKVMRNAGYEPDNITYSQLVFGLCKAKRLEEACKVLDEMEANGCVPD 458

Query: 436 IKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGFLNQKKLNGAYQLLI 495
           I TWTILIQGHC ANE+D ALVCFAKMIEK CD DADLLDVLI+GFL Q+K+ GAY+LLI
Sbjct: 459 IMTWTILIQGHCAANEVDTALVCFAKMIEKGCDADADLLDVLINGFLKQRKIEGAYKLLI 518

Query: 496 ELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPFPEPFVQYISKFGTV 555
           E+ N   +RPWQATYK LI+NLL VRKL+EA ALL LMKKQ+YPP+P+PFVQY+SKFG+V
Sbjct: 519 EMVNMTRLRPWQATYKNLIENLLGVRKLDEAFALLHLMKKQSYPPYPDPFVQYLSKFGSV 578

Query: 556 QDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPHHIRKHNEVCKLFGS 615
           +DA +F K LS KEYPS +AY+H+F SFF EGR SEAK+LL+KCP+HIRK  E+ KLFGS
Sbjct: 579 EDAAEFFKALSVKEYPSSAAYVHVFKSFFKEGRDSEAKELLYKCPYHIRKLGEISKLFGS 638

Query: 616 AESNTTAA 623
            E   TAA
Sbjct: 639 TEGKQTAA 646

BLAST of CSPI01G01010 vs. TAIR10
Match: AT3G48250.1 (AT3G48250.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 708.8 bits (1828), Expect = 3.1e-204
Identity = 361/622 (58.04%), Postives = 460/622 (73.95%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFRYPQVTRFSPSSYVSHQSLVSHFTINHPVLF----FSS 60
           M R  AI +SLR A S  STR  Y   ++   SS +S   L S   +    L+    FSS
Sbjct: 1   MYRSMAILSSLRHAYSQISTR-SYLSRSKVGFSSNLS-SPLDSFAIVPSRFLWKFRTFSS 60

Query: 61  NPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNG 120
            P S+LQLV  NDWS+ +E  L   + +LTHET +YVL++L+K P+KA  F +W    +G
Sbjct: 61  KPDSMLQLVLENDWSKEVEEGLRKPDMSLTHETAIYVLRKLEKYPEKAYYFLDWVLRDSG 120

Query: 121 STQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADAT 180
            + S+ +YS++LRI VQ  SMK FW+TLR MK+ GFYLDE+TYKTI G L K K  ADA 
Sbjct: 121 LSPSTPLYSIMLRILVQQRSMKRFWMTLREMKQGGFYLDEDTYKTIYGELSKEKSKADAV 180

Query: 181 GLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRN 240
            + HFY RML++NAM  V  +V  +V   DWS +V  +L+E+ + LSDNFVIRVLKELR 
Sbjct: 181 AVAHFYERMLKENAMSVVAGEVSAVVTKGDWSCEVERELQEMKLVLSDNFVIRVLKELRE 240

Query: 241 SPLKALSFFHWVG---CRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDI 300
            PLKAL+FFHWVG       Y H+TV+YNA  RVL R +S+  FW V++EMK A +++D+
Sbjct: 241 HPLKALAFFHWVGGGGSSSGYQHSTVTYNAALRVLARPNSVAEFWSVVDEMKTAGYDMDL 300

Query: 301 DTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVA 360
           DTYIK+SRQFQKS+MM E VKLYE MMDGP+KPS+QDCS+LLR ++ S NPDL LV+RV+
Sbjct: 301 DTYIKVSRQFQKSRMMAETVKLYEYMMDGPFKPSIQDCSLLLRYLSGSPNPDLDLVFRVS 360

Query: 361 KKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKA 420
           +K+E+TG SLSKA+YDGIHRSLTS G+FD+AE I K+MRNAGYEPDN+TYSQLVFGLCKA
Sbjct: 361 RKYESTGKSLSKAVYDGIHRSLTSVGRFDEAEEITKAMRNAGYEPDNITYSQLVFGLCKA 420

Query: 421 RRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLL 480
           +RLEEAR VLD+MEAQGC PDIKTWTILIQGHC  NELD AL CFA M+EK  D D++LL
Sbjct: 421 KRLEEARGVLDQMEAQGCFPDIKTWTILIQGHCKNNELDKALACFANMLEKGFDIDSNLL 480

Query: 481 DVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMK 540
           DVLI GF+   K  GA   L+E+   A+V+PWQ+TYK LI  LL+++K EEA+ LL++MK
Sbjct: 481 DVLIDGFVIHNKFEGASIFLMEMVKNANVKPWQSTYKLLIDKLLKIKKSEEALDLLQMMK 540

Query: 541 KQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKD 600
           KQNYP + E F  Y++KFGT++DA  FL VLSSK+ PS +AY H+  +F+ EGR ++AK+
Sbjct: 541 KQNYPAYAEAFDGYLAKFGTLEDAKKFLDVLSSKDSPSFAAYFHVIEAFYREGRLTDAKN 600

Query: 601 LLFKCPHHIRKHNEVCKLFGSA 616
           LLF CPHH + H ++ +LFG+A
Sbjct: 601 LLFICPHHFKTHPKISELFGAA 620

BLAST of CSPI01G01010 vs. TAIR10
Match: AT3G02490.1 (AT3G02490.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 314.3 bits (804), Expect = 1.7e-85
Identity = 179/562 (31.85%), Postives = 301/562 (53.56%), Query Frame = 1

Query: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120
           ++ + S  +  + +  EL++ +  ++HE  + VL+ L+  P  A  FF W         S
Sbjct: 84  VIDVFSRLNGKDEITKELDSNDVVISHELALRVLRELESSPDVAGRFFKWGLEAYPQKLS 143

Query: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180
           S  Y+ +LRIF  N  +  FW  +  MK++G  +       +    +K     D   L  
Sbjct: 144 SKSYNTMLRIFGVNGLVDEFWRLVDDMKKKGHGVSANVRDRVGDKFKKDGLENDLERLKE 203

Query: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240
            +      N++D V  +V  IV+   W  DV  +L +L +    + V  VL++L   P K
Sbjct: 204 LFASGSMDNSVDKVCNRVCKIVMKEVWGADVEKQLRDLKLEFKSDVVKMVLEKLDVDPRK 263

Query: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300
           AL FF W+     + H+  +YNA+ARVLG++  ++ F  +IEE++ A +E++++TY+++S
Sbjct: 264 ALLFFRWIDESGSFKHDEKTYNAMARVLGKEKFLDRFQHMIEEIRSAGYEMEMETYVRVS 323

Query: 301 RQFQKSKMMGEAVKLYELMMDGPYK--PSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEA 360
            +F ++KM+ EAV+L+E  M G     P+   CS+LL+ I  +   D+ L  R  K +  
Sbjct: 324 ARFCQTKMIKEAVELFEFAMAGSISNTPTPHCCSLLLKKIVTAKKLDMDLFTRTLKAYTG 383

Query: 361 TGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEE 420
            G  +   M   + +SL S  +F  +  ++K+M   GY P     S +  GL +  + +E
Sbjct: 384 NGNVVPDVMLQHVLKSLRSVDRFGQSNEVLKAMNEGGYVPSGDLQSVIASGLSRKGKKDE 443

Query: 421 ARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDAD-LLDVLI 480
           A ++++ MEA G   D K    L++GHC+A +L+ A  CF KMI K     A    + L+
Sbjct: 444 ANELVNFMEASGNHLDDKAMASLVEGHCDAKDLEEASECFKKMIGKEGVSYAGYAFEKLV 503

Query: 481 SGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVR-----KLEEAIALLRLM 540
             + N  +    Y+L  EL  +  ++PW +TYK +++NLL  +       EEA++LL +M
Sbjct: 504 LAYCNSFQARDVYKLFSELVKQNQLKPWHSTYKIMVRNLLMKKVARDGGFEEALSLLPMM 563

Query: 541 KKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAK 600
           +   +PPF +PF+ Y+S  GT  +A  FLK ++SK++PS S  L +F +     R+SEA+
Sbjct: 564 RNHGFPPFVDPFMDYLSNSGTSAEAFAFLKAVTSKKFPSNSMVLRVFEAMLKSARHSEAQ 623

Query: 601 DLLFKCPHHIRKHNEVCKLFGS 615
           DLL   P +IR++ EV +LF +
Sbjct: 624 DLLSMSPSYIRRNAEVLELFNT 645

BLAST of CSPI01G01010 vs. TAIR10
Match: AT5G15980.1 (AT5G15980.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 310.1 bits (793), Expect = 3.2e-84
Identity = 173/576 (30.03%), Postives = 306/576 (53.12%), Query Frame = 1

Query: 55  SSNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGK 114
           SS   +++ + S     + +  ELE+    ++ +  + VL++L+  P  A  FF W    
Sbjct: 84  SSAEATVIDIFSRLSGEDEIRKELESSGVVISQDLALKVLRKLESNPDVAKSFFQWIKEA 143

Query: 115 NGSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAAD 174
           +    SS  Y+M+LRI   N  +  FW  + +MK++G  L       +    +K    +D
Sbjct: 144 SPEELSSKNYNMMLRILGGNGLVDEFWGLVDVMKKKGHGLSANVRDKVGDKFQKDGLESD 203

Query: 175 ATGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKEL 234
              L   +      N+ ++V  +V  IV+  +W +DV  ++ +L +    + V  +++ L
Sbjct: 204 LLRLRKLFTSDCLDNSAENVCDRVCKIVMKEEWGDDVEKRVRDLNVEFKSDLVKMIVERL 263

Query: 235 RNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDID 294
              P KAL FF W+     + H+  +YNA+ARVLG++  ++ F  ++ EM+ A +E++I+
Sbjct: 264 DVEPRKALLFFRWIDESDLFKHDEKTYNAMARVLGKEKFLDRFQNIVVEMRSAGYEVEIE 323

Query: 295 TYIKISRQFQKSKMMGEAVKLYELMMDG---PYKPSLQDCSVLLRTIAASDNPDLSLVYR 354
           TY+++S +F ++K++ EAV L+E+ M G      P+     +LL+ I  +   D+ L  R
Sbjct: 324 TYVRVSTRFCQTKLIKEAVDLFEIAMAGSSSSNNPTPHCFCLLLKKIVTAKILDMDLFSR 383

Query: 355 VAKKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLC 414
             K +   G +L+ ++   + +SL S  + + +  ++K M+  GY P     S +   L 
Sbjct: 384 AVKVYTKNGNALTDSLLKSVLKSLRSVDRVEQSNELLKEMKRGGYVPSGDMQSMIASSLS 443

Query: 415 KARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDAD 474
           +  + +EA + +D ME+ G   D K    L++G+C++  LD ALVCF KM+       AD
Sbjct: 444 RKGKKDEADEFVDFMESSGNNLDDKAMASLVEGYCDSGNLDEALVCFEKMVGNTGVSYAD 503

Query: 475 L-LDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVR-----KLEEA 534
              + L+  + N+ ++  AY+LL     K  ++P  +TYK L+ NLL  +       EEA
Sbjct: 504 YSFEKLVLAYCNKNQVRDAYKLLSAQVTKNQLKPRHSTYKSLVTNLLTKKIARDGGFEEA 563

Query: 535 IALLRLMKKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNE 594
           ++LL +MK   +PPF +PF+ Y S  G   +A  FLK ++S  +P +S  L +F +    
Sbjct: 564 LSLLPIMKDHGFPPFIDPFMSYFSSTGKSTEALGFLKAMTSNNFPYISVVLRVFETMMKS 623

Query: 595 GRYSEAKDLLFKCPHHIRKHNEVCKLFGSAESNTTA 622
            R+SEA+DLL  CP++IR + +V +LF + + N +A
Sbjct: 624 ARHSEAQDLLSLCPNYIRNNPDVLELFNTMKPNESA 659

BLAST of CSPI01G01010 vs. TAIR10
Match: AT3G62470.1 (AT3G62470.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 135.6 bits (340), Expect = 1.1e-31
Identity = 96/402 (23.88%), Postives = 192/402 (47.76%), Query Frame = 1

Query: 195 VQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLKALSFFHWVGCRPDY 254
           V KV+D +   D   ++   L+E+ + LS + ++ VL+  R++   A  FF W   R  +
Sbjct: 134 VCKVIDELFALD--RNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGF 193

Query: 255 DHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKISRQFQKSKMMGEAVK 314
            H++ +YN++  +L +    E    V+EEM      + ++T+    + F  +K   +AV 
Sbjct: 194 AHDSRTYNSMMSILAKTRQFETMVSVLEEMG-TKGLLTMETFTIAMKAFAAAKERKKAVG 253

Query: 315 LYELMMDGPYKPSLQDCSVLLRTIA-ASDNPDLSLVYRVAKKFEATGYSLSKAMYDGIHR 374
           ++ELM    +K  ++  + LL ++  A    +  +++   K+     ++ +   Y  +  
Sbjct: 254 IFELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLKE----RFTPNMMTYTVLLN 313

Query: 375 SLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEARKVLDEMEAQGCIP 434
                    +A  I   M + G +PD V ++ ++ GL ++R+  +A K+   M+++G  P
Sbjct: 314 GWCRVRNLIEAARIWNDMIDQGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCP 373

Query: 435 DIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGFLNQKKLNGAYQLL 494
           +++++TI+I+  C  + ++ A+  F  M++    PDA +   LI+GF  QKKL+  Y+LL
Sbjct: 374 NVRSYTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELL 433

Query: 495 IELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPFPEPFVQYISKFGT 554
            E+  K H  P   TY  LIK +   +  E A  +   M +    P    F   +  +  
Sbjct: 434 KEMQEKGH-PPDGKTYNALIKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFM 493

Query: 555 VQDAD----DFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEA 592
            ++ +     + +++     P  ++Y  +      EG+  EA
Sbjct: 494 ARNYEMGRAVWEEMIKKGICPDDNSYTVLIRGLIGEGKSREA 527

BLAST of CSPI01G01010 vs. TAIR10
Match: AT3G49730.1 (AT3G49730.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 135.2 bits (339), Expect = 1.4e-31
Identity = 99/385 (25.71%), Postives = 181/385 (47.01%), Query Frame = 1

Query: 215 LEELGIALSDNFVIRVLKELRNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSI 274
           L E GI L    +IRVL    ++      FF W   +P Y H+     ++  +L +    
Sbjct: 88  LNESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQF 147

Query: 275 EAFWGVIEEMKHANHE-IDIDTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSV 334
            A WG+IEEM+  N E I+ + ++ + R+F  + M+ +AV++ + M     +P       
Sbjct: 148 GAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGC 207

Query: 335 LLRTIAASDN-PDLSLVYR-VAKKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSM 394
           LL  +  + +  + S V+  + +KF       +  +Y          GK  +A+ ++  M
Sbjct: 208 LLDALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCRE-----GKLMEAKEVLVQM 267

Query: 395 RNAGYEPDNVTYSQLVFGLCKARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANE- 454
           + AG EPD V ++ L+ G   A ++ +A  ++++M  +G  P++  +T+LIQ  C   + 
Sbjct: 268 KEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKR 327

Query: 455 LDIALVCFAKMIEKNCDPDADLLDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYK 514
           +D A+  F +M    C+ D      LISGF     ++  Y +L ++  K  V P Q TY 
Sbjct: 328 MDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKG-VMPSQVTYM 387

Query: 515 QLIKNLLEVRKLEEAIALLRLMKKQNYPP---FPEPFVQYISKFGTVQDADDFLKVLSSK 574
           Q++    +  + EE + L+  MK++   P        ++   K G V++A      + + 
Sbjct: 388 QIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEAN 447

Query: 575 EY-PSVSAYLHIFNSFFNEGRYSEA 592
              P V  ++ + N F ++G   EA
Sbjct: 448 GLSPGVDTFVIMINGFTSQGFLIEA 466

BLAST of CSPI01G01010 vs. NCBI nr
Match: gi|449440630|ref|XP_004138087.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic [Cucumis sativus])

HSP 1 Score: 1255.7 bits (3248), Expect = 0.0e+00
Identity = 631/632 (99.84%), Postives = 631/632 (99.84%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFRYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS 60
           MTRRNAIFTSLRLANSFFSTR RYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS
Sbjct: 1   MTRRNAIFTSLRLANSFFSTRSRYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS 60

Query: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120
           LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS
Sbjct: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120

Query: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180
           SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH
Sbjct: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180

Query: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240
           FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK
Sbjct: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240

Query: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300
           ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS
Sbjct: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300

Query: 301 RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG 360
           RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG
Sbjct: 301 RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG 360

Query: 361 YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR 420
           YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR
Sbjct: 361 YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR 420

Query: 421 KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF 480
           KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF
Sbjct: 421 KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF 480

Query: 481 LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF 540
           LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF
Sbjct: 481 LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF 540

Query: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600
           PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH
Sbjct: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600

Query: 601 HIRKHNEVCKLFGSAESNTTAATQSSSNPIET 633
           HIRKHNEVCKLFGSAESNTTAATQSSSNPIET
Sbjct: 601 HIRKHNEVCKLFGSAESNTTAATQSSSNPIET 632

BLAST of CSPI01G01010 vs. NCBI nr
Match: gi|659129065|ref|XP_008464511.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic [Cucumis melo])

HSP 1 Score: 1161.4 bits (3003), Expect = 0.0e+00
Identity = 579/623 (92.94%), Postives = 606/623 (97.27%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRFRYPQVTRFSPSSYVSHQSLVSHFTINHPVLFFSSNPQS 60
           MTRRNAIFTSLRLANSFFSTR RYPQVTRFSPSSYVSHQSL+  F INHPVLFFSSNP S
Sbjct: 1   MTRRNAIFTSLRLANSFFSTRSRYPQVTRFSPSSYVSHQSLIPLFRINHPVLFFSSNPHS 60

Query: 61  LLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWASGKNGSTQS 120
           LL+LVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKAS+FFNWASGKNGSTQS
Sbjct: 61  LLELVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASDFFNWASGKNGSTQS 120

Query: 121 SSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKAADATGLTH 180
           SSIYS+LLRIFVQNESMKLFWITLRLMKERGFYLDEETY TILGVLRK+KKAADAT L H
Sbjct: 121 SSIYSILLRIFVQNESMKLFWITLRLMKERGFYLDEETYLTILGVLRKAKKAADATALAH 180

Query: 181 FYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLKELRNSPLK 240
           FYNRMLQ+NAMDSVVQKVV IVLGSDWSNDV GKLEELGIALSDNFVIRVLKELRNSPLK
Sbjct: 181 FYNRMLQENAMDSVVQKVVHIVLGSDWSNDVAGKLEELGIALSDNFVIRVLKELRNSPLK 240

Query: 241 ALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEIDIDTYIKIS 300
           ALSFF+WVGCRPDYDHNTVSYNAIARVL R+DSI+AFWGVIEEMK+A+ +IDIDTYIKIS
Sbjct: 241 ALSFFNWVGCRPDYDHNTVSYNAIARVLARNDSIKAFWGVIEEMKNASLDIDIDTYIKIS 300

Query: 301 RQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRVAKKFEATG 360
           RQFQKSKMMG+AVKLYELMMDGPYKPSL DCS+LLRTIAASDNPDLSLVYRVAKKFEA+G
Sbjct: 301 RQFQKSKMMGDAVKLYELMMDGPYKPSLPDCSILLRTIAASDNPDLSLVYRVAKKFEASG 360

Query: 361 YSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCKARRLEEAR 420
           YSLSKA+YDGIHRSLTS GKFDDAE+IVKSMRNAGYEPDNVT+SQLVFGLCKARRL+EAR
Sbjct: 361 YSLSKAIYDGIHRSLTSRGKFDDAEDIVKSMRNAGYEPDNVTFSQLVFGLCKARRLKEAR 420

Query: 421 KVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADLLDVLISGF 480
           +VLDEMEAQGCIPDIKTWT+LIQGHCNAN+LD+ALVCFAKMIEKNCDPDADLLDVLI+GF
Sbjct: 421 EVLDEMEAQGCIPDIKTWTVLIQGHCNANKLDVALVCFAKMIEKNCDPDADLLDVLINGF 480

Query: 481 LNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLMKKQNYPPF 540
           L+QKKL+GAYQLLIELTNKAHVRPWQATYK LIKNLLEVRKLEEA+ALLRLMKKQNYPPF
Sbjct: 481 LSQKKLDGAYQLLIELTNKAHVRPWQATYKHLIKNLLEVRKLEEAMALLRLMKKQNYPPF 540

Query: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600
           PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH
Sbjct: 541 PEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAKDLLFKCPH 600

Query: 601 HIRKHNEVCKLFGSAESNTTAAT 624
           HIRKHNEVCKLFGSAES TT AT
Sbjct: 601 HIRKHNEVCKLFGSAESKTTGAT 623

BLAST of CSPI01G01010 vs. NCBI nr
Match: gi|1009175856|ref|XP_015869115.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic-like [Ziziphus jujuba])

HSP 1 Score: 849.7 bits (2194), Expect = 3.2e-243
Identity = 423/630 (67.14%), Postives = 506/630 (80.32%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRF------RYPQVTRFSPSSY--VSHQSLVSHFTINHPVL 60
           M R  AI  SLRLANSF STR          QVT FS SS   +S+QS  SH    H  L
Sbjct: 1   MNRSKAILVSLRLANSFLSTRVCSTSRPLRSQVTMFSHSSLSSLSNQSYSSHLIPTHQKL 60

Query: 61  FFSSNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWAS 120
           +FSS P S+++LV  N+WS  LE EL+  +P   HETV++VLK+LDK P+KAS+FFNW  
Sbjct: 61  YFSSTPNSVVELVLANEWSIELEKELDDSHPAWNHETVIFVLKKLDKDPEKASDFFNWVC 120

Query: 121 GKNGSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKA 180
            K+G   SSS+YS++LRI    E+MK FW+TLR MK+ GFYLDEETY TI G  +K K A
Sbjct: 121 EKDGFRHSSSLYSIMLRILADKETMKQFWVTLRKMKQEGFYLDEETYFTISGKFKKEKMA 180

Query: 181 ADATGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLK 240
           +D   L+HFYNRM+Q+NAMD VV  VVD++LGS W+++V  +L EL +  SDN VIRVLK
Sbjct: 181 SDVAALSHFYNRMIQENAMDKVVGSVVDVILGSVWNDNVEKQLGELNVVFSDNVVIRVLK 240

Query: 241 ELRNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEID 300
           ELR+ P KA SFF WVG  P Y+HNTV+YNAIARVL + DSIE FW VIEEMK A+H++D
Sbjct: 241 ELRSYPSKAASFFRWVGKYPGYEHNTVTYNAIARVLCQHDSIEEFWSVIEEMKGADHDMD 300

Query: 301 IDTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRV 360
           IDTYIKISRQFQK+KMM +AVKLYELMMDGP+KPS+QDCS+LLR+I+A+DNP+L LV+RV
Sbjct: 301 IDTYIKISRQFQKNKMMEDAVKLYELMMDGPFKPSVQDCSMLLRSISANDNPNLDLVFRV 360

Query: 361 AKKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCK 420
           AKK+E+TG++LSKA+YDGIHRSLTS G+F+DAE IV  MRNAGYEPDN+TYSQ+VFGLCK
Sbjct: 361 AKKYESTGHTLSKAIYDGIHRSLTSAGRFNDAEKIVNVMRNAGYEPDNITYSQVVFGLCK 420

Query: 421 ARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADL 480
           ARRLEEA KVLDEMEA GC+PDIKTWTILIQGHC   E+D AL+CFA M+ KN D DADL
Sbjct: 421 ARRLEEACKVLDEMEALGCVPDIKTWTILIQGHCATGEVDKALMCFASMMGKNYDADADL 480

Query: 481 LDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLM 540
           +DVLI+GF++Q+K+ GAY LLIE+ N+  +RPWQATYK LI+ LL V KLEEA+ LLRLM
Sbjct: 481 VDVLINGFISQRKIEGAYNLLIEMVNRTRLRPWQATYKNLIEKLLGVMKLEEALELLRLM 540

Query: 541 KKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAK 600
           KKQNYPP+P+PFVQYISKFGTV+DA  FLK L+ KEYPS SAY+H+  SFF+EGRYSEAK
Sbjct: 541 KKQNYPPYPDPFVQYISKFGTVEDAAGFLKALTVKEYPSSSAYVHLLKSFFHEGRYSEAK 600

Query: 601 DLLFKCPHHIRKHNEVCKLFGSAESNTTAA 623
           DLLFKCPHHIRKH EVCKLFGS +S  +AA
Sbjct: 601 DLLFKCPHHIRKHAEVCKLFGSTDSTASAA 630

BLAST of CSPI01G01010 vs. NCBI nr
Match: gi|1009130716|ref|XP_015882451.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic-like [Ziziphus jujuba])

HSP 1 Score: 849.0 bits (2192), Expect = 5.4e-243
Identity = 422/630 (66.98%), Postives = 506/630 (80.32%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRF------RYPQVTRFSPSSY--VSHQSLVSHFTINHPVL 60
           M R  AI  SLRLANSF STR          QVT FS SS   +S+QS  SH    H  L
Sbjct: 1   MNRSKAILVSLRLANSFLSTRVCSTSRPLRSQVTMFSHSSLSSLSNQSYSSHLIPTHQKL 60

Query: 61  FFSSNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWAS 120
           +FSS P S+++LV  N+WS  LE EL+  +P   HETV++VLK+LDK P+KAS+FFNW  
Sbjct: 61  YFSSTPNSVVELVLANEWSIELEKELDDSHPAWNHETVIFVLKKLDKDPEKASDFFNWVC 120

Query: 121 GKNGSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKA 180
            K+G   SSS+YS++LRI    E+MK FW+TLR MK+ GFYLDEETY TI G  +K K A
Sbjct: 121 EKDGFRHSSSLYSIMLRILADKETMKQFWVTLRKMKQEGFYLDEETYFTISGKFKKEKMA 180

Query: 181 ADATGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLK 240
           +D   L+HFYNRM+Q+NAMD VV  VVD++LGS W+++V  +L EL +  SDN VIRVLK
Sbjct: 181 SDVAALSHFYNRMIQENAMDKVVGSVVDVILGSVWNDNVEKQLGELNVVFSDNVVIRVLK 240

Query: 241 ELRNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEID 300
           ELR+ P KA SFF WVG  P Y+HNTV+YNAIARVL + DSIE FW VIEEMK A+H++D
Sbjct: 241 ELRSYPSKAASFFRWVGKYPGYEHNTVTYNAIARVLCQHDSIEEFWSVIEEMKGADHDMD 300

Query: 301 IDTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRV 360
           IDTYIKISRQFQK+KMM +AVKLYELMMDGP+KPS+QDCS+LLR+I+A+DNP+L LV+RV
Sbjct: 301 IDTYIKISRQFQKNKMMEDAVKLYELMMDGPFKPSVQDCSMLLRSISANDNPNLDLVFRV 360

Query: 361 AKKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCK 420
           AKK+E+TG++LSKA+YDGIHRSLTS G+F+DAE IV  MRNAGYEPDN+TYSQ+VFGLCK
Sbjct: 361 AKKYESTGHTLSKAIYDGIHRSLTSAGRFNDAEKIVNVMRNAGYEPDNITYSQVVFGLCK 420

Query: 421 ARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADL 480
           ARRLEEA KVLDEMEA GC+PDIKTWTILIQGHC   E+D AL+CFA M+ KN D DADL
Sbjct: 421 ARRLEEACKVLDEMEALGCVPDIKTWTILIQGHCATGEVDKALMCFASMMGKNYDADADL 480

Query: 481 LDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLM 540
           +DVLI+GF++Q+K+ GAY LLIE+ N+  +RPWQATYK LI+ LL V KLEEA+ LLR+M
Sbjct: 481 VDVLINGFISQRKIEGAYNLLIEMVNRTRLRPWQATYKNLIEKLLGVMKLEEALELLRMM 540

Query: 541 KKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAK 600
           KKQNYPP+P+PFVQYISKFGTV+DA  FLK L+ KEYPS SAY+H+  SFF+EGRYSEAK
Sbjct: 541 KKQNYPPYPDPFVQYISKFGTVEDAAGFLKALTVKEYPSSSAYVHLLKSFFHEGRYSEAK 600

Query: 601 DLLFKCPHHIRKHNEVCKLFGSAESNTTAA 623
           DLLFKCPHHIRKH EVCKLFGS +S  +AA
Sbjct: 601 DLLFKCPHHIRKHAEVCKLFGSTDSTASAA 630

BLAST of CSPI01G01010 vs. NCBI nr
Match: gi|1009177171|ref|XP_015869825.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic-like [Ziziphus jujuba])

HSP 1 Score: 840.1 bits (2169), Expect = 2.5e-240
Identity = 419/630 (66.51%), Postives = 506/630 (80.32%), Query Frame = 1

Query: 1   MTRRNAIFTSLRLANSFFSTRF------RYPQVTRFSPSSY--VSHQSLVSHFTINHPVL 60
           M R  AI  SLRLANSF STR          QVT FS SS   +S+QS  SH    H  L
Sbjct: 1   MNRSKAILVSLRLANSFLSTRVCSTSRPLRSQVTMFSHSSLSSLSNQSYSSHLIPTHQKL 60

Query: 61  FFSSNPQSLLQLVSTNDWSEMLETELETLNPTLTHETVVYVLKRLDKQPQKASEFFNWAS 120
           +FSS P S+++LV  N+WS  LE EL+  +P   HETV++VLK+LDK P+KAS+FFNW  
Sbjct: 61  YFSSMPNSVVELVLANEWSIELEKELDDSHPAWNHETVIFVLKKLDKDPEKASDFFNWVC 120

Query: 121 GKNGSTQSSSIYSMLLRIFVQNESMKLFWITLRLMKERGFYLDEETYKTILGVLRKSKKA 180
            K+G   SSS+YS++LRI    E+MK FW+TLR MK+ GFYLDEETY TI G  +K K A
Sbjct: 121 EKDGFRHSSSLYSIMLRILADKETMKQFWVTLRKMKQEGFYLDEETYFTISGKFKKEKMA 180

Query: 181 ADATGLTHFYNRMLQQNAMDSVVQKVVDIVLGSDWSNDVPGKLEELGIALSDNFVIRVLK 240
           +D   L+HFYNRM+Q+NAMD VV  VVD++LGS W+++V  +L EL +  SDN VIRVLK
Sbjct: 181 SDVAALSHFYNRMIQENAMDKVVGSVVDVILGSVWNDNVEKQLGELNVVFSDNVVIRVLK 240

Query: 241 ELRNSPLKALSFFHWVGCRPDYDHNTVSYNAIARVLGRDDSIEAFWGVIEEMKHANHEID 300
           ELR+ P KA SFF  VG  P Y+HNTV+YNAIARVL + DSIE FW VIEEMK A+H++D
Sbjct: 241 ELRSYPSKAASFFRCVGKYPGYEHNTVTYNAIARVLCQHDSIEEFWSVIEEMKGADHDMD 300

Query: 301 IDTYIKISRQFQKSKMMGEAVKLYELMMDGPYKPSLQDCSVLLRTIAASDNPDLSLVYRV 360
           IDTYIKISRQFQK+KMM +AVKLYELMMDGP+KPS+QDCS+LLR+I+A+DNP+L LV+RV
Sbjct: 301 IDTYIKISRQFQKNKMMEDAVKLYELMMDGPFKPSVQDCSMLLRSISANDNPNLDLVFRV 360

Query: 361 AKKFEATGYSLSKAMYDGIHRSLTSTGKFDDAENIVKSMRNAGYEPDNVTYSQLVFGLCK 420
           AKK+E+TG++LSKA+YDGIHRSLTS G+F++AE IV  M+NAGYEPDN+TYSQ+VFGLCK
Sbjct: 361 AKKYESTGHTLSKAIYDGIHRSLTSAGRFNEAEKIVNVMQNAGYEPDNITYSQVVFGLCK 420

Query: 421 ARRLEEARKVLDEMEAQGCIPDIKTWTILIQGHCNANELDIALVCFAKMIEKNCDPDADL 480
           ARRLEEA +VLDEMEA GC+PDIKTWTILIQGHC A E+D AL+CFA M+ KN D DADL
Sbjct: 421 ARRLEEACEVLDEMEALGCVPDIKTWTILIQGHCAAGEVDKALMCFASMMGKNYDADADL 480

Query: 481 LDVLISGFLNQKKLNGAYQLLIELTNKAHVRPWQATYKQLIKNLLEVRKLEEAIALLRLM 540
           +DVLI+GF++Q+K+ GAY LLIE+ N+  +RPWQATYK LI+ LL V KLEEA+ LLR+M
Sbjct: 481 VDVLINGFISQRKIEGAYNLLIEMVNRTRLRPWQATYKNLIEKLLGVMKLEEALELLRMM 540

Query: 541 KKQNYPPFPEPFVQYISKFGTVQDADDFLKVLSSKEYPSVSAYLHIFNSFFNEGRYSEAK 600
           KKQNYPP+P+PFVQYISKFGTV+DA  FLK L+ KEYPS SAY+H+  SFF+EGRYSEAK
Sbjct: 541 KKQNYPPYPDPFVQYISKFGTVEDAAGFLKALTVKEYPSSSAYVHLLKSFFHEGRYSEAK 600

Query: 601 DLLFKCPHHIRKHNEVCKLFGSAESNTTAA 623
           DLLFKCPHHIRKH EVCKLFGS +S  +AA
Sbjct: 601 DLLFKCPHHIRKHAEVCKLFGSTDSTASAA 630

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP269_ARATH5.5e-20358.04Pentatricopeptide repeat-containing protein At3g48250, chloroplastic OS=Arabidop... [more]
PP208_ARATH3.0e-8431.85Pentatricopeptide repeat-containing protein At3g02490, mitochondrial OS=Arabidop... [more]
PP387_ARATH5.7e-8330.03Pentatricopeptide repeat-containing protein At5g15980, mitochondrial OS=Arabidop... [more]
PP293_ARATH1.9e-3023.88Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop... [more]
PP447_ARATH2.5e-3024.09Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A0A0LNT7_CUCSA0.0e+0099.84Uncharacterized protein OS=Cucumis sativus GN=Csa_1G003530 PE=4 SV=1[more]
B9RAT8_RICCO1.4e-23765.55Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A067KGQ3_JATCU6.5e-23564.42Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11400 PE=4 SV=1[more]
F6GTR7_VITVI4.8e-23064.18Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09330 PE=4 SV=... [more]
M5WD91_PRUPE1.2e-22865.13Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002676mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G48250.13.1e-20458.04 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G02490.11.7e-8531.85 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G15980.13.2e-8430.03 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G62470.11.1e-3123.88 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G49730.11.4e-3125.71 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449440630|ref|XP_004138087.1|0.0e+0099.84PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic ... [more]
gi|659129065|ref|XP_008464511.1|0.0e+0092.94PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic ... [more]
gi|1009175856|ref|XP_015869115.1|3.2e-24367.14PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic-... [more]
gi|1009130716|ref|XP_015882451.1|5.4e-24366.98PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic-... [more]
gi|1009177171|ref|XP_015869825.1|2.5e-24066.51PREDICTED: pentatricopeptide repeat-containing protein At3g48250, chloroplastic-... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008380 RNA splicing
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G01010.1CSPI01G01010.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 508..536
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 395..427
score: 1.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 433..470
score: 2.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 437..470
score: 9.4E-7coord: 401..435
score: 9.3E-10coord: 508..539
score: 0
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 434..468
score: 10.994coord: 505..539
score: 8.835coord: 120..154
score: 6.884coord: 469..504
score: 6.577coord: 292..326
score: 8.155coord: 155..192
score: 5.568coord: 571..601
score: 5.36coord: 364..398
score: 9.01coord: 257..291
score: 7.794coord: 399..433
score: 1
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 28..169
score: 1.6E-226coord: 379..608
score: 1.6E-226coord: 236..341
score: 1.6E
NoneNo IPR availablePANTHERPTHR24015:SF795SUBFAMILY NOT NAMEDcoord: 28..169
score: 1.6E-226coord: 236..341
score: 1.6E-226coord: 379..608
score: 1.6E

The following gene(s) are paralogous to this gene:

None