CSPI01G15130 (gene) Wild cucumber (PI 183967)

Name: CSPI01G15130
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing family protein
Location: Chr1 : 10722822 .. 10724684 (+)
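
The location field can be turned into numbers for downstream use. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Hypothetical helper for location strings shaped like the one on this page,
# e.g. "Chr1 : 10722822 .. 10724684 (+)". Whitespace around ":" and ".." is optional.
LOCATION_RE = re.compile(r"([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (chromosome, start, end, strand) parsed from a location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1 : 10722822 .. 10724684 (+)")
print(chrom, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 1863 bp on the plus strand of Chr1.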
The following sequences are available for this feature:

Gene sequence (with intron)

CAAAATCCTACTACGCTTCACTCAATTTTCTTTTATACTAAATGGGTAATTGATCATAAAAATCGAAACGGTTCTTAGAACGATGAGCAGCATTCCTTCCCACACAGCCACTCCTTCTCAACTCCAACTACCTCCTTTTACACCTTCTTCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCCCGCTCTCCCAACTCCCCTCATCGCAATATCTCCTCCAAATTCAACCCCAATTCTGTTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAATTATCCGAAGCCGCTGCAGAGTTTACCCGCATGCGACTCGCCGGAGTTGAGCCCAACCACATCACATTTATTACGCTTCTCTCCGCCTGTGCTGATTTTCCGTCAGAAAGCTTCTTGTTCGCCTCTTCACTTCATGGCTACGCTTGTAAATTTGGTCTGGATACTGGGCATGTAATGGTGGGGACTGCTCTCATTGATATGTATTCCAAATGTGCTCAATTGGGTCATGCTAGGAAGGTTTTTTATAACCTGGGTGTGAAAAACTCTGTCTCTTGGAACACTATGCTCAATGGTTTTATGAGGAATGGAGAGATTGAGTTGGCCATTCAACTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTAATTAACGGTCTTTTGAAACATGGTTACTCGGAACAAGCATTGGAGTGCTTCCATCAGATGCAACGCTCGGGTGTCGCTGCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCCTGTGCTGATTTGGGTGCTCTTACTTTGGGGTTGTGGGTTCATCGTTTTGTTATGCCGCAGGAGTTTAAGGATAATATTAAGATTAGTAATTCCTTGATAGATATGTATTCTAGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGTGAAAATGGCCAAACGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAACGGATTTGCAGATGAATCTTTAGAGTTTTTTTATGCGATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACAGGGGCTCTTACCGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGCCTGGAATTGTTTGATAACATGAAGAGTGTACACAAAATTACTCCTAGGATTGAGCATTATGGATGTATTGTCGATCTCTACGGTCGTGCTGGAAGGTTAGAGGATGCACTGAATATGATTGAGGAAATGCCGATGAAGCCGAATGAAGTTGTGTTGGGGTCGTTGCTTGCTGCTTGCAGGACTCATGGTGATGTGAACCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGAAGGCGATGCATATTATGTGCTCCTTTCAAACATATATGCAGCAATTGGGAAGTGGGATGGTGCTAACAATGTTAGGAGAACGATGAAAGCCCGAGGTGTGCAAAAAAAACCGGGTTATAGTTCTGTTGAAATTGATGGTAAGGTTCATGAATTTGTTGCAGGTGACAATTACCATGCTGATGCAGACAATATTTACTCAATGTTAGATTTGTTGTGTCATGAACTAAAGGTGTGTGGATATGTTCCTGGTAGTGATACCATTCTGAATACCAAAGAATCTAATAAGGACGATTGAAGCTTCTTTTGTGCTGTTAAAACCCTTATTTGATTCGTCCGTCTATGGCTTAGATCAACAGCAGTTTGAGAGTTTGTGAAATAGACAAGGTAAATGGATGCTTAAGTTAAGCGGGCTGCAAGATATATAATTTAATTTGCATTTGTCGTCACATTTGAAAATGGAAGCAAAACACTTAGTTCAAGGTACAAATAAAAGGCCAC

mRNA sequence

ATGAGCAGCATTCCTTCCCACACAGCCACTCCTTCTCAACTCCAACTACCTCCTTTTACACCTTCTTCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCCCGCTCTCCCAACTCCCCTCATCGCAATATCTCCTCCAAATTCAACCCCAATTCTGTTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAATTATCCGAAGCCGCTGCAGAGTTTACCCGCATGCGACTCGCCGGAGTTGAGCCCAACCACATCACATTTATTACGCTTCTCTCCGCCTGTGCTGATTTTCCGTCAGAAAGCTTCTTGTTCGCCTCTTCACTTCATGGCTACGCTTGTAAATTTGGTCTGGATACTGGGCATGTAATGGTGGGGACTGCTCTCATTGATATGTATTCCAAATGTGCTCAATTGGGTCATGCTAGGAAGGTTTTTTATAACCTGGGTGTGAAAAACTCTGTCTCTTGGAACACTATGCTCAATGGTTTTATGAGGAATGGAGAGATTGAGTTGGCCATTCAACTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTAATTAACGGTCTTTTGAAACATGGTTACTCGGAACAAGCATTGGAGTGCTTCCATCAGATGCAACGCTCGGGTGTCGCTGCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCCTGTGCTGATTTGGGTGCTCTTACTTTGGGGTTGTGGGTTCATCGTTTTGTTATGCCGCAGGAGTTTAAGGATAATATTAAGATTAGTAATTCCTTGATAGATATGTATTCTAGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGTGAAAATGGCCAAACGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAACGGATTTGCAGATGAATCTTTAGAGTTTTTTTATGCGATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACAGGGGCTCTTACCGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGCCTGGAATTGTTTGATAACATGAAGAGTGTACACAAAATTACTCCTAGGATTGAGCATTATGGATGTATTGTCGATCTCTACGGTCGTGCTGGAAGGTTAGAGGATGCACTGAATATGATTGAGGAAATGCCGATGAAGCCGAATGAAGTTGTGTTGGGGTCGTTGCTTGCTGCTTGCAGGACTCATGGTGATGTGAACCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGAAGGCGATGCATATTATGTGCTCCTTTCAAACATATATGCAGCAATTGGGAAGTGGGATGGTGCTAACAATGTTAGGAGAACGATGAAAGCCCGAGGTGTGCAAAAAAAACCGGGTTATAGTTCTGTTGAAATTGATGGTAAGGTTCATGAATTTGTTGCAGGTGACAATTACCATGCTGATGCAGACAATATTTACTCAATGTTAGATTTGTTGTGTCATGAACTAAAGGTGTGTGGATATGTTCCTGGTAGTGATACCATTCTGAATACCAAAGAATCTAATAAGGACGATTGA

Coding sequence (CDS)

ATGAGCAGCATTCCTTCCCACACAGCCACTCCTTCTCAACTCCAACTACCTCCTTTTACACCTTCTTCAATCCCACTTTCAAATCCAACAAAACTCAACTTCCCCCGCTCTCCCAACTCCCCTCATCGCAATATCTCCTCCAAATTCAACCCCAATTCTGTTGACCCCATTGTTCTATGGACCTCTTCTCTTGCTCGCTACTGCCGCAACGGCCAATTATCCGAAGCCGCTGCAGAGTTTACCCGCATGCGACTCGCCGGAGTTGAGCCCAACCACATCACATTTATTACGCTTCTCTCCGCCTGTGCTGATTTTCCGTCAGAAAGCTTCTTGTTCGCCTCTTCACTTCATGGCTACGCTTGTAAATTTGGTCTGGATACTGGGCATGTAATGGTGGGGACTGCTCTCATTGATATGTATTCCAAATGTGCTCAATTGGGTCATGCTAGGAAGGTTTTTTATAACCTGGGTGTGAAAAACTCTGTCTCTTGGAACACTATGCTCAATGGTTTTATGAGGAATGGAGAGATTGAGTTGGCCATTCAACTGTTTGATGAAATGCCTACAAGAGATGCGATTTCTTGGACGGCTTTAATTAACGGTCTTTTGAAACATGGTTACTCGGAACAAGCATTGGAGTGCTTCCATCAGATGCAACGCTCGGGTGTCGCTGCTGATTATGTGTCTATAATTGCTGTTCTTGCTGCCTGTGCTGATTTGGGTGCTCTTACTTTGGGGTTGTGGGTTCATCGTTTTGTTATGCCGCAGGAGTTTAAGGATAATATTAAGATTAGTAATTCCTTGATAGATATGTATTCTAGATGTGGATGTATTGAGTTTGCCCGCCAAGTGTTTGTGAAAATGGCCAAACGAACTTTGGTATCTTGGAACTCTATCATTGTGGGGTTTGCAGTTAACGGATTTGCAGATGAATCTTTAGAGTTTTTTTATGCGATGCAGAAGGAAGGATTCAAGCCAGATGGAGTAAGCTACACAGGGGCTCTTACCGCGTGTAGCCATGCTGGCTTAGTGAATAAGGGCCTGGAATTGTTTGATAACATGAAGAGTGTACACAAAATTACTCCTAGGATTGAGCATTATGGATGTATTGTCGATCTCTACGGTCGTGCTGGAAGGTTAGAGGATGCACTGAATATGATTGAGGAAATGCCGATGAAGCCGAATGAAGTTGTGTTGGGGTCGTTGCTTGCTGCTTGCAGGACTCATGGTGATGTGAACCTGGCTGAAAGGTTAATGAAACATCTCTTTAAGTTAGATCCAGAAGGCGATGCATATTATGTGCTCCTTTCAAACATATATGCAGCAATTGGGAAGTGGGATGGTGCTAACAATGTTAGGAGAACGATGAAAGCCCGAGGTGTGCAAAAAAAACCGGGTTATAGTTCTGTTGAAATTGATGGTAAGGTTCATGAATTTGTTGCAGGTGACAATTACCATGCTGATGCAGACAATATTTACTCAATGTTAGATTTGTTGTGTCATGAACTAAAGGTGTGTGGATATGTTCCTGGTAGTGATACCATTCTGAATACCAAAGAATCTAATAAGGACGATTGA
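
The protein sequence that appears as the "Query" in the alignments below can be reproduced by translating the CDS. The sketch below is illustrative only: the function name `translate` is hypothetical, and it assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene but is not stated on this page. Only the first and last codons of the CDS are used as examples.

```python
# Standard genetic code (NCBI translation table 1), with codons ordered by
# base T, C, A, G at each of the three positions. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # "X" for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons of the CDS above.
print(translate("ATGAGCAGCATTCCTTCCCACACAGCC"))  # MSSIPSHTA
# Last four codons of the CDS above, ending in the TGA stop codon.
print(translate("AAGGACGATTGA"))  # KDD
```

The first call yields the same residues that open the query line of the A0A0A0LYD6_CUCSA alignment further down (MSSIPSHTA...).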
BLAST of CSPI01G15130 vs. Swiss-Prot
Match: PPR13_ARATH (Pentatricopeptide repeat-containing protein At1g05750, chloroplastic OS=Arabidopsis thaliana GN=PDE247 PE=2 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 8.4e-165
Identity = 277/463 (59.83%), Positives = 355/463 (76.67%), Query Frame = 1

Query: 48  KFNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPS 107
           + N ++ +  V WTS +    RNG+L+EAA EF+ M LAGVEPNHITFI LLS C DF S
Sbjct: 27  RHNQSTSETTVSWTSRINLLTRNGRLAEAAKEFSDMTLAGVEPNHITFIALLSGCGDFTS 86

Query: 108 ESFLFASSLHGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTM 167
            S      LHGYACK GLD  HVMVGTA+I MYSK  +   AR VF  +  KNSV+WNTM
Sbjct: 87  GSEALGDLLHGYACKLGLDRNHVMVGTAIIGMYSKRGRFKKARLVFDYMEDKNSVTWNTM 146

Query: 168 LNGFMRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADY 227
           ++G+MR+G+++ A ++FD+MP RD ISWTA+ING +K GY E+AL  F +MQ SGV  DY
Sbjct: 147 IDGYMRSGQVDNAAKMFDKMPERDLISWTAMINGFVKKGYQEEALLWFREMQISGVKPDY 206

Query: 228 VSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVK 287
           V+IIA L AC +LGAL+ GLWVHR+V+ Q+FK+N+++SNSLID+Y RCGC+EFARQVF  
Sbjct: 207 VAIIAALNACTNLGALSFGLWVHRYVLSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFYN 266

Query: 288 MAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKG 347
           M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTACSH GLV +G
Sbjct: 267 MEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEKGFKPDAVTFTGALTACSHVGLVEEG 326

Query: 348 LELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACR 407
           L  F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC 
Sbjct: 327 LRYFQIMKCDYRISPRIEHYGCLVDLYSRAGRLEDALKLVQSMPMKPNEVVIGSLLAACS 386

Query: 408 THG-DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPG 467
            HG ++ LAERLMKHL  L+ +  + YV+LSN+YAA GKW+GA+ +RR MK  G++K+PG
Sbjct: 387 NHGNNIVLAERLMKHLTDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKQPG 446

Query: 468 YSSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYV 510
           +SS+EID  +H F+AGDN H +   I  +L+L+  +L++ G V
Sbjct: 447 FSSIEIDDCMHVFMAGDNAHVETTYIREVLELISSDLRLQGCV 489
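
The percentages on each HSP summary line are simply the match counts divided by the alignment length. The following sketch shows one way to parse such a line and recompute them; the names `parse_hsp_stats` and `percent` are hypothetical, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches "Identity = 277/463 (59.83%)" and the analogous "Positives" field.
# "Posi?tives" tolerates the misspelling "Postives" seen in some report exports.
HSP_FIELD_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)} for one HSP line."""
    stats = {}
    for label, matches, length, pct in HSP_FIELD_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(matches), int(length), float(pct))
    return stats


def percent(matches, length):
    """Recompute a percentage from the raw counts, rounded to two decimals."""
    return round(100 * matches / length, 2)


line = "Identity = 277/463 (59.83%), Positives = 355/463 (76.67%), Query Frame = 1"
stats = parse_hsp_stats(line)
for name, (matches, length, reported) in stats.items():
    print(name, matches, length, reported, percent(matches, length))
```

For the PPR13_ARATH hit above, 277/463 and 355/463 recompute to 59.83% and 76.67%, matching the reported values.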

BLAST of CSPI01G15130 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 377.5 bits (968), Expect = 2.4e-103
Identity = 184/477 (38.57%), Positives = 294/477 (61.64%), Query Frame = 1

Query: 49  FNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSE 108
           F+      + L  +  + Y R G   EA   F  M  +GV P+ I+ ++ +S+C+     
Sbjct: 294 FDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQL--R 353

Query: 109 SFLFASSLHGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTML 168
           + L+  S HGY  + G ++    +  ALIDMY KC +   A ++F  +  K  V+WN+++
Sbjct: 354 NILWGKSCHGYVLRNGFESWD-NICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIV 413

Query: 169 NGFMRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQ-RSGVAADY 228
            G++ NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ + GV AD 
Sbjct: 414 AGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADG 473

Query: 229 VSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVK 288
           V+++++ +AC  LGAL L  W++ ++     + ++++  +L+DM+SRCG  E A  +F  
Sbjct: 474 VTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNS 533

Query: 289 MAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKG 348
           +  R + +W + I   A+ G A+ ++E F  M ++G KPDGV++ GALTACSH GLV +G
Sbjct: 534 LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQG 593

Query: 349 LELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACR 408
            E+F +M  +H ++P   HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR
Sbjct: 594 KEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACR 653

Query: 409 THGDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGY 468
             G+V +A    + +  L PE    YVLLSN+YA+ G+W+    VR +MK +G++K PG 
Sbjct: 654 VQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGT 713

Query: 469 SSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPG-SDTILNTKESNK 524
           SS++I GK HEF +GD  H +  NI +MLD +       G+VP  S+ +++  E  K
Sbjct: 714 SSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPDLSNVLMDVDEKEK 767

BLAST of CSPI01G15130 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 367.5 bits (942), Expect = 2.5e-100
Identity = 171/468 (36.54%), Positives = 291/468 (62.18%), Query Frame = 1

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSL 116
           +V W S +  + + G   +A   F +M    V+ +H+T + +LSACA     +  F   +
Sbjct: 197 VVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKI--RNLEFGRQV 256

Query: 117 HGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGE 176
             Y  +  ++  ++ +  A++DMY+KC  +  A+++F  +  K++V+W TML+G+  + +
Sbjct: 257 CSYIEENRVNV-NLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISED 316

Query: 177 IELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQ-RSGVAADYVSIIAVLA 236
            E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q +  +  + +++++ L+
Sbjct: 317 YEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 376

Query: 237 ACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVS 296
           ACA +GAL LG W+H ++     + N  ++++LI MYS+CG +E +R+VF  + KR +  
Sbjct: 377 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 436

Query: 297 WNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK 356
           W+++I G A++G  +E+++ FY MQ+   KP+GV++T    ACSH GLV++   LF  M+
Sbjct: 437 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 496

Query: 357 SVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLA 416
           S + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++NLA
Sbjct: 497 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 556

Query: 417 ERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGK 476
           E     L +L+P  D  +VLLSNIYA +GKW+  + +R+ M+  G++K+PG SS+EIDG 
Sbjct: 557 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 616

Query: 477 VHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNK 524
           +HEF++GDN H  ++ +Y  L  +  +LK  GY P    +L   E  +
Sbjct: 617 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEE 661

BLAST of CSPI01G15130 vs. Swiss-Prot
Match: PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 3.1e-98
Identity = 178/454 (39.21%), Positives = 273/454 (60.13%), Query Frame = 1

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSL 116
           +V W S +  + +NG   EA   F  M  + VEP+ +T  +++SACA     +      +
Sbjct: 218 VVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASL--SAIKVGQEV 277

Query: 117 HGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGE 176
           HG   K       +++  A +DMY+KC+++  AR +F ++ ++N ++  +M++G+     
Sbjct: 278 HGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAAS 337

Query: 177 IELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAA 236
            + A  +F +M  R+ +SW ALI G  ++G +E+AL  F  ++R  V   + S   +L A
Sbjct: 338 TKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKA 397

Query: 237 CADLGALTLGLWVHRFVMPQEFK------DNIKISNSLIDMYSRCGCIEFARQVFVKMAK 296
           CADL  L LG+  H  V+   FK      D+I + NSLIDMY +CGC+E    VF KM +
Sbjct: 398 CADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMME 457

Query: 297 RTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL 356
           R  VSWN++I+GFA NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Sbjct: 458 RDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHY 517

Query: 357 FDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHG 416
           F +M     + P  +HY C+VDL GRAG LE+A +MIEEMPM+P+ V+ GSLLAAC+ H 
Sbjct: 518 FSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHR 577

Query: 417 DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSV 476
           ++ L + + + L +++P     YVLLSN+YA +GKW+   NVR++M+  GV K+PG S +
Sbjct: 578 NITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWI 637

Query: 477 EIDGKVHEFVAGDNYHADADNIYSMLDLLCHELK 505
           +I G  H F+  D  H     I+S+LD+L  E++
Sbjct: 638 KIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

BLAST of CSPI01G15130 vs. Swiss-Prot
Match: PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana GN=ELI1 PE=3 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 8.9e-98
Identity = 177/476 (37.18%), Positives = 286/476 (60.08%), Query Frame = 1

Query: 53  SVDP-IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFL 112
           ++DP + L+T+++     NG   +A   + ++  + + PN  TF +LL +C      S  
Sbjct: 90  TIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSC------STK 149

Query: 113 FASSLHGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGF 172
               +H +  KFGL      V T L+D+Y+K   +  A+KVF  +  ++ VS   M+  +
Sbjct: 150 SGKLIHTHVLKFGLGIDPY-VATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCY 209

Query: 173 MRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAA-DYVSI 232
            + G +E A  LFD M  RD +SW  +I+G  +HG+   AL  F ++   G    D +++
Sbjct: 210 AKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITV 269

Query: 233 IAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAK 292
           +A L+AC+ +GAL  G W+H FV     + N+K+   LIDMYS+CG +E A  VF    +
Sbjct: 270 VAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPR 329

Query: 293 RTLVSWNSIIVGFAVNGFADESLEFFYAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE 352
           + +V+WN++I G+A++G++ ++L  F  MQ   G +P  +++ G L AC+HAGLVN+G+ 
Sbjct: 330 KDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIR 389

Query: 353 LFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTH 412
           +F++M   + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ H
Sbjct: 390 IFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLH 449

Query: 413 GDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSS 472
           GD  L + + ++L  L+ +    YVLLSNIYA++G ++G   VR  MK +G+ K+PG S+
Sbjct: 450 GDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGIST 509

Query: 473 VEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD 526
           +EI+ KVHEF AGD  H+ +  IY+ML  +   +K  GYVP ++T+L   E  + +
Sbjct: 510 IEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEKE 558

BLAST of CSPI01G15130 vs. TrEMBL
Match: A0A0A0LYD6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G169950 PE=4 SV=1)

HSP 1 Score: 1070.5 bits (2767), Expect = 6.8e-310
Identity = 523/525 (99.62%), Positives = 524/525 (99.81%), Query Frame = 1

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW
Sbjct: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYA 120
           TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESF FASSLHGYA
Sbjct: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120

Query: 121 CKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA 180
           CK+GLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA
Sbjct: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA 180

Query: 181 IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL 240
           IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL
Sbjct: 181 IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI
Sbjct: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK
Sbjct: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480
           HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV
Sbjct: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480

Query: 481 AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD 526
           AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD
Sbjct: 481 AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD 525

BLAST of CSPI01G15130 vs. TrEMBL
Match: F6HAB7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0009g00650 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 3.8e-196
Identity = 338/509 (66.40%), Positives = 409/509 (80.35%), Query Frame = 1

Query: 3   SIPSHTAT-PSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWT 62
           S+P++TAT PS L   P   +S P S P +  FP  P+S   +++     + +DPIV WT
Sbjct: 2   SLPAYTATTPSSLVTHP---NSSPNSKPNQPTFPSRPHSTKYHLTRSHTHSPIDPIVSWT 61

Query: 63  SSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYAC 122
           SS+A +CRNGQL EAAAEF+RM++AGV PNHITF+TLLSAC DFP E   F  S+H Y  
Sbjct: 62  SSIALHCRNGQLPEAAAEFSRMQIAGVRPNHITFLTLLSACTDFPLEGLRFGGSIHAYVR 121

Query: 123 KFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELAI 182
           K GLDT +VMVGTAL+DMYSKC QL  A  +F  + V+NSVSWNTM++G MRNGE+  AI
Sbjct: 122 KLGLDTENVMVGTALVDMYSKCGQLDLAWLMFDEMHVRNSVSWNTMIDGCMRNGEVGEAI 181

Query: 183 QLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADLG 242
            LFD+M  RDAISWT++I G +K G  EQALE F +MQ +GV  DYV+II+VLAACA+LG
Sbjct: 182 VLFDQMSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPDYVTIISVLAACANLG 241

Query: 243 ALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIV 302
           AL LGLW++RFVM Q+FKDNIKISNSLIDMYSRCGCI  ARQVF +M KR+LVSWNS+IV
Sbjct: 242 ALGLGLWINRFVMKQDFKDNIKISNSLIDMYSRCGCIRLARQVFEQMPKRSLVSWNSMIV 301

Query: 303 GFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKIT 362
           GFA+NG A+E+LEFF  M+KEGF+PDGVS+TGALTACSH+GLV++GL+ FD MK   KI+
Sbjct: 302 GFALNGHAEEALEFFNLMRKEGFRPDGVSFTGALTACSHSGLVDEGLQFFDIMKRTRKIS 361

Query: 363 PRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKH 422
           PRIEHYGC+VDLY RAGRLEDALN+I  MPMKPNEVVLGSLLAACRTHGDV LAERLMK+
Sbjct: 362 PRIEHYGCLVDLYSRAGRLEDALNVIANMPMKPNEVVLGSLLAACRTHGDVGLAERLMKY 421

Query: 423 LFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVA 482
           L ++DP  D+ YVLLSNIYAA+G+WDGA+ VR+ MKA G+ KKPG+SS+E+DG +HEFVA
Sbjct: 422 LCEVDPGSDSNYVLLSNIYAAVGRWDGASKVRKKMKALGIHKKPGFSSIEMDGSIHEFVA 481

Query: 483 GDNYHADADNIYSMLDLLCHELKVCGYVP 511
           GD  H +  NIY+MLD L  EL++CGYVP
Sbjct: 482 GDKTHVETQNIYAMLDHLFLELRICGYVP 507

BLAST of CSPI01G15130 vs. TrEMBL
Match: W9SDQ7_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001219 PE=4 SV=1)

HSP 1 Score: 681.4 bits (1757), Expect = 8.7e-193
Identity = 335/510 (65.69%), Positives = 406/510 (79.61%), Query Frame = 1

Query: 3   SIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTS 62
           S+P++T TP+QL  PP  P  + L +PT+  FP      H     K     ++P+V WTS
Sbjct: 2   SLPANTVTPTQLSQPP-KPPPLSLPSPTQPFFPNQHYPSH-----KLTYKPIEPVVKWTS 61

Query: 63  SLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYACK 122
           S+AR+C+NG+ SEAAAEF+RMRL+GVEPNH+TF+TLLS CAD    +  F +S+HGYA K
Sbjct: 62  SIARHCKNGRFSEAAAEFSRMRLSGVEPNHVTFVTLLSGCAD---SNISFGASIHGYARK 121

Query: 123 FGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELAIQ 182
              DT +VMVGTAL+ MY+K   +  AR VF ++  KNSVSWNTM++G+MRNG++  A++
Sbjct: 122 LCFDTSNVMVGTALVAMYAKRGLVDVARLVFDDIKEKNSVSWNTMIDGYMRNGKVRDAVE 181

Query: 183 LFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADLGA 242
           +FDEMP RDA+SWTALI G +K    E+ALE F +MQ S V  DYV++IAVLAACADLG 
Sbjct: 182 VFDEMPERDAVSWTALIGGFVKRRRFEEALEWFREMQVSSVEPDYVTVIAVLAACADLGT 241

Query: 243 LTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVG 302
           + LGLW++RF+M ++FKDN+KISNSLIDMYSRCGCIEFARQVF +M  RTLVSWNSIIVG
Sbjct: 242 VGLGLWMNRFIMNRKFKDNVKISNSLIDMYSRCGCIEFARQVFERMPNRTLVSWNSIIVG 301

Query: 303 FAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITP 362
           FAVNG A+E+L+FF  MQ+EGFKPDGVS+TGALTACSHAGLV +GL LF+NMK VH I  
Sbjct: 302 FAVNGHAEEALKFFNLMQREGFKPDGVSFTGALTACSHAGLVEEGLLLFENMKRVHGIRH 361

Query: 363 RIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHL 422
           RIEHYGCIVDLY RAGRLEDALN+IE MPMKPNEVVLGSLLAACRTHGD+ LAERLMK+L
Sbjct: 362 RIEHYGCIVDLYSRAGRLEDALNVIEYMPMKPNEVVLGSLLAACRTHGDITLAERLMKYL 421

Query: 423 FKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAG 482
             LDP GD+ YVLL+N+YAA+GKWDGA  VR+TMKA G+QK PG+SS+EID  +HEFVAG
Sbjct: 422 SDLDPGGDSNYVLLANMYAAVGKWDGAGKVRKTMKALGIQKTPGFSSIEIDCNIHEFVAG 481

Query: 483 DNYHADADNIYSMLDLLCHELKVCGYVPGS 513
           D  H D + IYSML+LL  ELK  GYVPG+
Sbjct: 482 DKSHVDKNCIYSMLELLSSELKASGYVPGN 502

BLAST of CSPI01G15130 vs. TrEMBL
Match: A0A0B2RPQ9_GLYSO (Pentatricopeptide repeat-containing protein, chloroplastic OS=Glycine soja GN=glysoja_001227 PE=4 SV=1)

HSP 1 Score: 672.9 bits (1735), Expect = 3.1e-190
Identity = 319/522 (61.11%), Positives = 411/522 (78.74%), Query Frame = 1

Query: 4   IPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTSS 63
           +P+  ATP+QL  PP +PSS  L N T   F  +  + ++ +S +      DPIV WT+S
Sbjct: 3   LPACNATPTQLPHPPKSPSSNSLPNQTHSTFSNTNTNTNQGLSLRHTTKYNDPIVSWTTS 62

Query: 64  LARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFL-FASSLHGYACK 123
           +A YC++G L +AA++F +MR A +EPNHITFITLLSACA +PS S + F +++H +  K
Sbjct: 63  IADYCKSGHLVKAASKFVQMREAAIEPNHITFITLLSACAHYPSRSSISFGTAIHAHVRK 122

Query: 124 FGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELAIQ 183
            GLD   VMVGTALIDMY+KC ++  AR  F  +GV+N VSWNTM++G+MRNG+ E A+Q
Sbjct: 123 LGLDINDVMVGTALIDMYAKCGRVESARLAFDQMGVRNLVSWNTMIDGYMRNGKFEDALQ 182

Query: 184 LFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADLGA 243
           +FD +P ++AISWTALI G +K  Y E+ALECF +MQ SGVA DYV++IAV+AACA+LG 
Sbjct: 183 VFDGLPVKNAISWTALIGGFVKKDYHEEALECFREMQLSGVAPDYVTVIAVIAACANLGT 242

Query: 244 LTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVG 303
           L LGLWVHR VM Q+F++N+K+SNSLIDMYSRCGCI+ ARQVF +M +RTLVSWNSIIVG
Sbjct: 243 LGLGLWVHRLVMTQDFRNNVKVSNSLIDMYSRCGCIDLARQVFDRMPQRTLVSWNSIIVG 302

Query: 304 FAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITP 363
           FAVNG ADE+L +F +MQ+EGFKPDGVSYTGAL ACSHAGL+ +GL +F++MK V +I P
Sbjct: 303 FAVNGLADEALSYFNSMQEEGFKPDGVSYTGALMACSHAGLIGEGLRIFEHMKRVRRILP 362

Query: 364 RIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHL 423
           RIEHYGC+VDLY RAGRLE+ALN+++ MPMKPNEV+LGSLLAACRT G++ LAE +M +L
Sbjct: 363 RIEHYGCLVDLYSRAGRLEEALNVLKNMPMKPNEVILGSLLAACRTQGNIGLAENVMNYL 422

Query: 424 FKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAG 483
            +LD  GD+ YVLLSNIYAA+GKWDGAN VRR MK RG+QKKPG+SS+EID  +H+FV+G
Sbjct: 423 IELDSGGDSNYVLLSNIYAAVGKWDGANKVRRRMKERGIQKKPGFSSIEIDSSIHKFVSG 482

Query: 484 DNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKD 525
           D  H + D+IY+ L+ L  EL++CGY+P      + KES +D
Sbjct: 483 DKSHEEKDHIYAALEFLSFELQLCGYIPD----FSGKESYED 520

BLAST of CSPI01G15130 vs. TrEMBL
Match: K7M2Y7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_13G313200 PE=4 SV=1)

HSP 1 Score: 672.9 bits (1735), Expect = 3.1e-190
Identity = 319/522 (61.11%), Positives = 411/522 (78.74%), Query Frame = 1

Query: 4   IPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTSS 63
           +P+  ATP+QL  PP +PSS  L N T   F  +  + ++ +S +      DPIV WT+S
Sbjct: 3   LPACNATPTQLPHPPKSPSSNSLPNQTHSTFSNTNTNTNQGLSLRHTTKYNDPIVSWTTS 62

Query: 64  LARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFL-FASSLHGYACK 123
           +A YC++G L +AA++F +MR A +EPNHITFITLLSACA +PS S + F +++H +  K
Sbjct: 63  IADYCKSGHLVKAASKFVQMREAAIEPNHITFITLLSACAHYPSRSSISFGTAIHAHVRK 122

Query: 124 FGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELAIQ 183
            GLD   VMVGTALIDMY+KC ++  AR  F  +GV+N VSWNTM++G+MRNG+ E A+Q
Sbjct: 123 LGLDINDVMVGTALIDMYAKCGRVESARLAFDQMGVRNLVSWNTMIDGYMRNGKFEDALQ 182

Query: 184 LFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADLGA 243
           +FD +P ++AISWTALI G +K  Y E+ALECF +MQ SGVA DYV++IAV+AACA+LG 
Sbjct: 183 VFDGLPVKNAISWTALIGGFVKKDYHEEALECFREMQLSGVAPDYVTVIAVIAACANLGT 242

Query: 244 LTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVG 303
           L LGLWVHR VM Q+F++N+K+SNSLIDMYSRCGCI+ ARQVF +M +RTLVSWNSIIVG
Sbjct: 243 LGLGLWVHRLVMTQDFRNNVKVSNSLIDMYSRCGCIDLARQVFDRMPQRTLVSWNSIIVG 302

Query: 304 FAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITP 363
           FAVNG ADE+L +F +MQ+EGFKPDGVSYTGAL ACSHAGL+ +GL +F++MK V +I P
Sbjct: 303 FAVNGLADEALSYFNSMQEEGFKPDGVSYTGALMACSHAGLIGEGLRIFEHMKRVRRILP 362

Query: 364 RIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHL 423
           RIEHYGC+VDLY RAGRLE+ALN+++ MPMKPNEV+LGSLLAACRT G++ LAE +M +L
Sbjct: 363 RIEHYGCLVDLYSRAGRLEEALNVLKNMPMKPNEVILGSLLAACRTQGNIGLAENVMNYL 422

Query: 424 FKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAG 483
            +LD  GD+ YVLLSNIYAA+GKWDGAN VRR MK RG+QKKPG+SS+EID  +H+FV+G
Sbjct: 423 IELDSGGDSNYVLLSNIYAAVGKWDGANKVRRRMKERGIQKKPGFSSIEIDSSIHKFVSG 482

Query: 484 DNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKD 525
           D  H + D+IY+ L+ L  EL++CGY+P      + KES +D
Sbjct: 483 DKSHEEKDHIYAALEFLSFELQLCGYIPD----FSGKESYED 520

BLAST of CSPI01G15130 vs. TAIR10
Match: AT1G05750.1 (AT1G05750.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 581.6 bits (1498), Expect = 4.7e-166
Identity = 277/463 (59.83%), Positives = 355/463 (76.67%), Query Frame = 1

Query: 48  KFNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPS 107
           + N ++ +  V WTS +    RNG+L+EAA EF+ M LAGVEPNHITFI LLS C DF S
Sbjct: 27  RHNQSTSETTVSWTSRINLLTRNGRLAEAAKEFSDMTLAGVEPNHITFIALLSGCGDFTS 86

Query: 108 ESFLFASSLHGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTM 167
            S      LHGYACK GLD  HVMVGTA+I MYSK  +   AR VF  +  KNSV+WNTM
Sbjct: 87  GSEALGDLLHGYACKLGLDRNHVMVGTAIIGMYSKRGRFKKARLVFDYMEDKNSVTWNTM 146

Query: 168 LNGFMRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADY 227
           ++G+MR+G+++ A ++FD+MP RD ISWTA+ING +K GY E+AL  F +MQ SGV  DY
Sbjct: 147 IDGYMRSGQVDNAAKMFDKMPERDLISWTAMINGFVKKGYQEEALLWFREMQISGVKPDY 206

Query: 228 VSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVK 287
           V+IIA L AC +LGAL+ GLWVHR+V+ Q+FK+N+++SNSLID+Y RCGC+EFARQVF  
Sbjct: 207 VAIIAALNACTNLGALSFGLWVHRYVLSQDFKNNVRVSNSLIDLYCRCGCVEFARQVFYN 266

Query: 288 MAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKG 347
           M KRT+VSWNS+IVGFA NG A ESL +F  MQ++GFKPD V++TGALTACSH GLV +G
Sbjct: 267 MEKRTVVSWNSVIVGFAANGNAHESLVYFRKMQEKGFKPDAVTFTGALTACSHVGLVEEG 326

Query: 348 LELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACR 407
           L  F  MK  ++I+PRIEHYGC+VDLY RAGRLEDAL +++ MPMKPNEVV+GSLLAAC 
Sbjct: 327 LRYFQIMKCDYRISPRIEHYGCLVDLYSRAGRLEDALKLVQSMPMKPNEVVIGSLLAACS 386

Query: 408 THG-DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPG 467
            HG ++ LAERLMKHL  L+ +  + YV+LSN+YAA GKW+GA+ +RR MK  G++K+PG
Sbjct: 387 NHGNNIVLAERLMKHLTDLNVKSHSNYVILSNMYAADGKWEGASKMRRKMKGLGLKKQPG 446

Query: 468 YSSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYV 510
           +SS+EID  +H F+AGDN H +   I  +L+L+  +L++ G V
Sbjct: 447 FSSIEIDDCMHVFMAGDNAHVETTYIREVLELISSDLRLQGCV 489
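
The Arabidopsis hits appear in both the Swiss-Prot and the TAIR10 searches with identical bit scores but different E-values. In standard BLAST statistics the E-value for a given bit score scales with the size of the search space (E = m * n * 2^(-S'), with m and n the effective query and database lengths), so for the same query and bit score the ratio of two E-values approximates the ratio of the database sizes. The sketch below only illustrates that relationship with the E-values printed on this page; the function name is hypothetical, and the result is approximate because the reported E-values are rounded and BLAST applies effective-length corrections.

```python
# (Swiss-Prot E-value, TAIR10 E-value) for hits that share the same bit score
# in both searches, copied from the HSP lines on this page.
EVALUE_PAIRS = {
    "At1g05750": (8.4e-165, 4.7e-166),
    "At3g22690": (2.4e-103, 1.4e-104),
    "At2g29760": (2.5e-100, 1.4e-101),
    "At2g13600": (3.1e-98, 1.7e-99),
}


def search_space_ratio(evalue_a, evalue_b):
    """Approximate ratio of search-space sizes for two hits with equal bit scores."""
    return evalue_a / evalue_b


for locus, (swissprot_e, tair_e) in EVALUE_PAIRS.items():
    print(locus, round(search_space_ratio(swissprot_e, tair_e), 1))
```

All four pairs give a similar ratio, which is what one would expect if the difference in E-values comes from database size rather than from the alignments themselves.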

BLAST of CSPI01G15130 vs. TAIR10
Match: AT3G22690.1 (AT3G22690.1 Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885))

HSP 1 Score: 377.5 bits (968), Expect = 1.4e-104
Identity = 184/477 (38.57%), Positives = 294/477 (61.64%), Query Frame = 1

Query: 49  FNPNSVDPIVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSE 108
           F+      + L  +  + Y R G   EA   F  M  +GV P+ I+ ++ +S+C+     
Sbjct: 294 FDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQL--R 353

Query: 109 SFLFASSLHGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTML 168
           + L+  S HGY  + G ++    +  ALIDMY KC +   A ++F  +  K  V+WN+++
Sbjct: 354 NILWGKSCHGYVLRNGFESWD-NICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIV 413

Query: 169 NGFMRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQ-RSGVAADY 228
            G++ NGE++ A + F+ MP ++ +SW  +I+GL++    E+A+E F  MQ + GV AD 
Sbjct: 414 AGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADG 473

Query: 229 VSIIAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVK 288
           V+++++ +AC  LGAL L  W++ ++     + ++++  +L+DM+SRCG  E A  +F  
Sbjct: 474 VTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNS 533

Query: 289 MAKRTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKG 348
           +  R + +W + I   A+ G A+ ++E F  M ++G KPDGV++ GALTACSH GLV +G
Sbjct: 534 LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQG 593

Query: 349 LELFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACR 408
            E+F +M  +H ++P   HYGC+VDL GRAG LE+A+ +IE+MPM+PN+V+  SLLAACR
Sbjct: 594 KEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACR 653

Query: 409 THGDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGY 468
             G+V +A    + +  L PE    YVLLSN+YA+ G+W+    VR +MK +G++K PG 
Sbjct: 654 VQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGT 713

Query: 469 SSVEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPG-SDTILNTKESNK 524
           SS++I GK HEF +GD  H +  NI +MLD +       G+VP  S+ +++  E  K
Sbjct: 714 SSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHVPDLSNVLMDVDEKEK 767

BLAST of CSPI01G15130 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 367.5 bits (942), Expect = 1.4e-101
Identity = 171/468 (36.54%), Positives = 291/468 (62.18%), Query Frame = 1

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSL 116
           +V W S +  + + G   +A   F +M    V+ +H+T + +LSACA     +  F   +
Sbjct: 197 VVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKI--RNLEFGRQV 256

Query: 117 HGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGE 176
             Y  +  ++  ++ +  A++DMY+KC  +  A+++F  +  K++V+W TML+G+  + +
Sbjct: 257 CSYIEENRVNV-NLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISED 316

Query: 177 IELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQ-RSGVAADYVSIIAVLA 236
            E A ++ + MP +D ++W ALI+   ++G   +AL  FH++Q +  +  + +++++ L+
Sbjct: 317 YEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 376

Query: 237 ACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVS 296
           ACA +GAL LG W+H ++     + N  ++++LI MYS+CG +E +R+VF  + KR +  
Sbjct: 377 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 436

Query: 297 WNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK 356
           W+++I G A++G  +E+++ FY MQ+   KP+GV++T    ACSH GLV++   LF  M+
Sbjct: 437 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 496

Query: 357 SVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLA 416
           S + I P  +HY CIVD+ GR+G LE A+  IE MP+ P+  V G+LL AC+ H ++NLA
Sbjct: 497 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 556

Query: 417 ERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGK 476
           E     L +L+P  D  +VLLSNIYA +GKW+  + +R+ M+  G++K+PG SS+EIDG 
Sbjct: 557 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 616

Query: 477 VHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNK 524
           +HEF++GDN H  ++ +Y  L  +  +LK  GY P    +L   E  +
Sbjct: 617 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEE 661
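
The percentages in each HSP header are the matched counts divided by the alignment length. The following is a minimal Python sketch for re-deriving them from a header line, assuming the "Identity = a/b (p%), Positives = c/d (q%)" layout used in this report. The helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool.

```python
import re

# Assumed header layout, as printed in this report. "Posi?tives" also
# tolerates the misspelling "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Hypothetical helper: recompute identity/positive percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Recomputed from the raw counts, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }

stats = parse_hsp_stats(
    "Identity = 171/468 (36.54%), Positives = 291/468 (62.18%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the AT2G29760.1 hit above, the recomputed values agree with the reported 36.54% identity and 62.18% positives.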

BLAST of CSPI01G15130 vs. TAIR10
Match: AT2G13600.1 (AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 360.5 bits (924), Expect = 1.7e-99
Identity = 178/454 (39.21%), Positives = 273/454 (60.13%), Query Frame = 1

Query: 57  IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSL 116
           +V W S +  + +NG   EA   F  M  + VEP+ +T  +++SACA     +      +
Sbjct: 218 VVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASL--SAIKVGQEV 277

Query: 117 HGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGE 176
           HG   K       +++  A +DMY+KC+++  AR +F ++ ++N ++  +M++G+     
Sbjct: 278 HGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAAS 337

Query: 177 IELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAA 236
            + A  +F +M  R+ +SW ALI G  ++G +E+AL  F  ++R  V   + S   +L A
Sbjct: 338 TKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKA 397

Query: 237 CADLGALTLGLWVHRFVMPQEFK------DNIKISNSLIDMYSRCGCIEFARQVFVKMAK 296
           CADL  L LG+  H  V+   FK      D+I + NSLIDMY +CGC+E    VF KM +
Sbjct: 398 CADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMME 457

Query: 297 RTLVSWNSIIVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLEL 356
           R  VSWN++I+GFA NG+ +E+LE F  M + G KPD ++  G L+AC HAG V +G   
Sbjct: 458 RDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHY 517

Query: 357 FDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHG 416
           F +M     + P  +HY C+VDL GRAG LE+A +MIEEMPM+P+ V+ GSLLAAC+ H 
Sbjct: 518 FSSMTRDFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHR 577

Query: 417 DVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSV 476
           ++ L + + + L +++P     YVLLSN+YA +GKW+   NVR++M+  GV K+PG S +
Sbjct: 578 NITLGKYVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWI 637

Query: 477 EIDGKVHEFVAGDNYHADADNIYSMLDLLCHELK 505
           +I G  H F+  D  H     I+S+LD+L  E++
Sbjct: 638 KIQGHDHVFMVKDKSHPRKKQIHSLLDILIAEMR 669

BLAST of CSPI01G15130 vs. TAIR10
Match: AT4G37380.1 (AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 359.0 bits (920), Expect = 5.0e-99
Identity = 177/476 (37.18%), Positives = 286/476 (60.08%), Query Frame = 1

Query: 53  SVDP-IVLWTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFL 112
           ++DP + L+T+++     NG   +A   + ++  + + PN  TF +LL +C      S  
Sbjct: 90  TIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSC------STK 149

Query: 113 FASSLHGYACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGF 172
               +H +  KFGL      V T L+D+Y+K   +  A+KVF  +  ++ VS   M+  +
Sbjct: 150 SGKLIHTHVLKFGLGIDPY-VATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCY 209

Query: 173 MRNGEIELAIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAA-DYVSI 232
            + G +E A  LFD M  RD +SW  +I+G  +HG+   AL  F ++   G    D +++
Sbjct: 210 AKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITV 269

Query: 233 IAVLAACADLGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAK 292
           +A L+AC+ +GAL  G W+H FV     + N+K+   LIDMYS+CG +E A  VF    +
Sbjct: 270 VAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPR 329

Query: 293 RTLVSWNSIIVGFAVNGFADESLEFFYAMQK-EGFKPDGVSYTGALTACSHAGLVNKGLE 352
           + +V+WN++I G+A++G++ ++L  F  MQ   G +P  +++ G L AC+HAGLVN+G+ 
Sbjct: 330 KDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIR 389

Query: 353 LFDNMKSVHKITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTH 412
           +F++M   + I P+IEHYGC+V L GRAG+L+ A   I+ M M  + V+  S+L +C+ H
Sbjct: 390 IFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLH 449

Query: 413 GDVNLAERLMKHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSS 472
           GD  L + + ++L  L+ +    YVLLSNIYA++G ++G   VR  MK +G+ K+PG S+
Sbjct: 450 GDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGIST 509

Query: 473 VEIDGKVHEFVAGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD 526
           +EI+ KVHEF AGD  H+ +  IY+ML  +   +K  GYVP ++T+L   E  + +
Sbjct: 510 IEIENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEKE 558

BLAST of CSPI01G15130 vs. NCBI nr
Match: gi|449443656|ref|XP_004139593.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis sativus])

HSP 1 Score: 1070.5 bits (2767), Expect = 9.8e-310
Identity = 523/525 (99.62%), Positives = 524/525 (99.81%), Query Frame = 1

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW
Sbjct: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYA 120
           TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESF FASSLHGYA
Sbjct: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFFFASSLHGYA 120

Query: 121 CKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA 180
           CK+GLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA
Sbjct: 121 CKYGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA 180

Query: 181 IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL 240
           IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL
Sbjct: 181 IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI
Sbjct: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK
Sbjct: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480
           HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV
Sbjct: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480

Query: 481 AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD 526
           AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD
Sbjct: 481 AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKDD 525

BLAST of CSPI01G15130 vs. NCBI nr
Match: gi|659118080|ref|XP_008458940.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Cucumis melo])

HSP 1 Score: 950.7 bits (2456), Expect = 1.1e-273
Identity = 474/524 (90.46%), Positives = 488/524 (93.13%), Query Frame = 1

Query: 1   MSSIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLW 60
           MSSIPSH A+PSQLQ PP   SSIPLSNPTK+NFPRSP SPH NI SKF  NSV PIV W
Sbjct: 1   MSSIPSHIASPSQLQQPP--SSSIPLSNPTKVNFPRSPKSPHCNIFSKFTANSVHPIVQW 60

Query: 61  TSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYA 120
           TSS+ARYC NGQL EAAAEFTRMRLAGVEPNHITFITLLS CADFPSESF FASSLHGYA
Sbjct: 61  TSSIARYCGNGQLPEAAAEFTRMRLAGVEPNHITFITLLSGCADFPSESF-FASSLHGYA 120

Query: 121 CKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELA 180
           CKFGLDTGHVMVGTALIDMYSKC+QLG A+KVF  LGVKNSVSWNTMLNGFMRNGEIELA
Sbjct: 121 CKFGLDTGHVMVGTALIDMYSKCSQLGLAKKVFDYLGVKNSVSWNTMLNGFMRNGEIELA 180

Query: 181 IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADL 240
           IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGV ADYVSIIAVLAACADL
Sbjct: 181 IQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVVADYVSIIAVLAACADL 240

Query: 241 GALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300
           GALT GLWV+RFVM QEFKDN++ISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII
Sbjct: 241 GALTSGLWVNRFVMQQEFKDNVRISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSII 300

Query: 301 VGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKI 360
           VGFA NGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMK VHKI
Sbjct: 301 VGFAFNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKRVHKI 360

Query: 361 TPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMK 420
           TP IEHYGCIVDLYGRAGRLEDA N+IEEMPMKPNEVVLGSLLAACRTHGDV LAERLMK
Sbjct: 361 TPGIEHYGCIVDLYGRAGRLEDASNVIEEMPMKPNEVVLGSLLAACRTHGDVRLAERLMK 420

Query: 421 HLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFV 480
           H+FKLD  GD+ YVLLSNIYAAIGKW+GAN VRRTMKARGVQKK GYSSVEIDGKVHEFV
Sbjct: 421 HIFKLDSVGDSNYVLLSNIYAAIGKWEGANKVRRTMKARGVQKKRGYSSVEIDGKVHEFV 480

Query: 481 AGDNYHADADNIYSMLDLLCHELKVCGYVPGSDTILNTKESNKD 525
           AGD YHADADNIYSMLDLL HELKVCGYVP +D ILNTK+SNKD
Sbjct: 481 AGDKYHADADNIYSMLDLLFHELKVCGYVPDTDIILNTKDSNKD 521

BLAST of CSPI01G15130 vs. NCBI nr
Match: gi|1009113399|ref|XP_015873124.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 708.0 bits (1826), Expect = 1.2e-200
Identity = 340/511 (66.54%), Positives = 420/511 (82.19%), Query Frame = 1

Query: 3   SIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHR---NISSKFNPNSVDPIVL 62
           S+P++T  P+  Q  P  P ++P SNPT    P SPN+P R   ++S K     +DP V 
Sbjct: 2   SVPANTLPPTLPQ--PAKPLTLPPSNPTIR--PTSPNTPRREKRSVSLKQTHKQIDPTVS 61

Query: 63  WTSSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGY 122
           WTSS+AR+CRNG+LSEAAAEF RMRL GVEPNHIT ITLLS CADFP +   F +S+HGY
Sbjct: 62  WTSSIARHCRNGRLSEAAAEFARMRLTGVEPNHITLITLLSGCADFPLDILCFGASVHGY 121

Query: 123 ACKFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIEL 182
           A K GLD  +VMVGTA++DMY+KC ++  +R  F +LGVKN+V+WNT+++G+MRNGE+E 
Sbjct: 122 ARKSGLDRDNVMVGTAIVDMYAKCGRMDFSRLAFDDLGVKNTVTWNTLIDGYMRNGEVEC 181

Query: 183 AIQLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACAD 242
           A+++F+EMP RDAISWTALI G +K G  E++L+ F QMQ SGV  DYV++IAVL ACA+
Sbjct: 182 AVEMFEEMPDRDAISWTALIGGFIKRGRLEESLKWFRQMQISGVKPDYVTMIAVLDACAE 241

Query: 243 LGALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSI 302
           LG L LGLW ++++M +++KDNI+++NSLIDMYSRCGCI+FARQVF KM +RTLVSWNSI
Sbjct: 242 LGTLGLGLWTNKYIMNKDYKDNIRMNNSLIDMYSRCGCIQFARQVFEKMPERTLVSWNSI 301

Query: 303 IVGFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHK 362
           IVGFA+NG A+E+LEFF  MQKEGFKPDGVS+TGALTACSH+GLV++GL  F+NMK VHK
Sbjct: 302 IVGFAINGHAEEALEFFDLMQKEGFKPDGVSFTGALTACSHSGLVDEGLSFFNNMKRVHK 361

Query: 363 ITPRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLM 422
           I PRIEHYGC+VDLY RAGRLEDAL++IE+MPMKPNEVV+GSLLAACRTHGDV+LAERLM
Sbjct: 362 IKPRIEHYGCMVDLYSRAGRLEDALHVIEKMPMKPNEVVVGSLLAACRTHGDVSLAERLM 421

Query: 423 KHLFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEF 482
           K+LF+LDP GD+ YVLL+NIYAA+G+WDGA  VR+TMKA GVQK PG SS+EID  +HEF
Sbjct: 422 KYLFELDPGGDSNYVLLANIYAAVGRWDGAGKVRKTMKALGVQKTPGLSSIEIDCNIHEF 481

Query: 483 VAGDNYHADADNIYSMLDLLCHELKVCGYVP 511
           VAGD  H D + IY ML+LL  ELK CGY+P
Sbjct: 482 VAGDKSHVDTECIYEMLELLSLELKACGYIP 508

BLAST of CSPI01G15130 vs. NCBI nr
Match: gi|359479098|ref|XP_002274209.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic [Vitis vinifera])

HSP 1 Score: 692.6 bits (1786), Expect = 5.4e-196
Identity = 338/509 (66.40%), Positives = 409/509 (80.35%), Query Frame = 1

Query: 3   SIPSHTAT-PSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWT 62
           S+P++TAT PS L   P   +S P S P +  FP  P+S   +++     + +DPIV WT
Sbjct: 2   SLPAYTATTPSSLVTHP---NSSPNSKPNQPTFPSRPHSTKYHLTRSHTHSPIDPIVSWT 61

Query: 63  SSLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYAC 122
           SS+A +CRNGQL EAAAEF+RM++AGV PNHITF+TLLSAC DFP E   F  S+H Y  
Sbjct: 62  SSIALHCRNGQLPEAAAEFSRMQIAGVRPNHITFLTLLSACTDFPLEGLRFGGSIHAYVR 121

Query: 123 KFGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELAI 182
           K GLDT +VMVGTAL+DMYSKC QL  A  +F  + V+NSVSWNTM++G MRNGE+  AI
Sbjct: 122 KLGLDTENVMVGTALVDMYSKCGQLDLAWLMFDEMHVRNSVSWNTMIDGCMRNGEVGEAI 181

Query: 183 QLFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADLG 242
            LFD+M  RDAISWT++I G +K G  EQALE F +MQ +GV  DYV+II+VLAACA+LG
Sbjct: 182 VLFDQMSERDAISWTSMIGGFVKKGCFEQALEWFREMQLAGVEPDYVTIISVLAACANLG 241

Query: 243 ALTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIV 302
           AL LGLW++RFVM Q+FKDNIKISNSLIDMYSRCGCI  ARQVF +M KR+LVSWNS+IV
Sbjct: 242 ALGLGLWINRFVMKQDFKDNIKISNSLIDMYSRCGCIRLARQVFEQMPKRSLVSWNSMIV 301

Query: 303 GFAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKIT 362
           GFA+NG A+E+LEFF  M+KEGF+PDGVS+TGALTACSH+GLV++GL+ FD MK   KI+
Sbjct: 302 GFALNGHAEEALEFFNLMRKEGFRPDGVSFTGALTACSHSGLVDEGLQFFDIMKRTRKIS 361

Query: 363 PRIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKH 422
           PRIEHYGC+VDLY RAGRLEDALN+I  MPMKPNEVVLGSLLAACRTHGDV LAERLMK+
Sbjct: 362 PRIEHYGCLVDLYSRAGRLEDALNVIANMPMKPNEVVLGSLLAACRTHGDVGLAERLMKY 421

Query: 423 LFKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVA 482
           L ++DP  D+ YVLLSNIYAA+G+WDGA+ VR+ MKA G+ KKPG+SS+E+DG +HEFVA
Sbjct: 422 LCEVDPGSDSNYVLLSNIYAAVGRWDGASKVRKKMKALGIHKKPGFSSIEMDGSIHEFVA 481

Query: 483 GDNYHADADNIYSMLDLLCHELKVCGYVP 511
           GD  H +  NIY+MLD L  EL++CGYVP
Sbjct: 482 GDKTHVETQNIYAMLDHLFLELRICGYVP 507

BLAST of CSPI01G15130 vs. NCBI nr
Match: gi|703084743|ref|XP_010092553.1| (hypothetical protein L484_001219 [Morus notabilis])

HSP 1 Score: 681.4 bits (1757), Expect = 1.2e-192
Identity = 335/510 (65.69%), Positives = 406/510 (79.61%), Query Frame = 1

Query: 3   SIPSHTATPSQLQLPPFTPSSIPLSNPTKLNFPRSPNSPHRNISSKFNPNSVDPIVLWTS 62
           S+P++T TP+QL  PP  P  + L +PT+  FP      H     K     ++P+V WTS
Sbjct: 2   SLPANTVTPTQLSQPP-KPPPLSLPSPTQPFFPNQHYPSH-----KLTYKPIEPVVKWTS 61

Query: 63  SLARYCRNGQLSEAAAEFTRMRLAGVEPNHITFITLLSACADFPSESFLFASSLHGYACK 122
           S+AR+C+NG+ SEAAAEF+RMRL+GVEPNH+TF+TLLS CAD    +  F +S+HGYA K
Sbjct: 62  SIARHCKNGRFSEAAAEFSRMRLSGVEPNHVTFVTLLSGCAD---SNISFGASIHGYARK 121

Query: 123 FGLDTGHVMVGTALIDMYSKCAQLGHARKVFYNLGVKNSVSWNTMLNGFMRNGEIELAIQ 182
              DT +VMVGTAL+ MY+K   +  AR VF ++  KNSVSWNTM++G+MRNG++  A++
Sbjct: 122 LCFDTSNVMVGTALVAMYAKRGLVDVARLVFDDIKEKNSVSWNTMIDGYMRNGKVRDAVE 181

Query: 183 LFDEMPTRDAISWTALINGLLKHGYSEQALECFHQMQRSGVAADYVSIIAVLAACADLGA 242
           +FDEMP RDA+SWTALI G +K    E+ALE F +MQ S V  DYV++IAVLAACADLG 
Sbjct: 182 VFDEMPERDAVSWTALIGGFVKRRRFEEALEWFREMQVSSVEPDYVTVIAVLAACADLGT 241

Query: 243 LTLGLWVHRFVMPQEFKDNIKISNSLIDMYSRCGCIEFARQVFVKMAKRTLVSWNSIIVG 302
           + LGLW++RF+M ++FKDN+KISNSLIDMYSRCGCIEFARQVF +M  RTLVSWNSIIVG
Sbjct: 242 VGLGLWMNRFIMNRKFKDNVKISNSLIDMYSRCGCIEFARQVFERMPNRTLVSWNSIIVG 301

Query: 303 FAVNGFADESLEFFYAMQKEGFKPDGVSYTGALTACSHAGLVNKGLELFDNMKSVHKITP 362
           FAVNG A+E+L+FF  MQ+EGFKPDGVS+TGALTACSHAGLV +GL LF+NMK VH I  
Sbjct: 302 FAVNGHAEEALKFFNLMQREGFKPDGVSFTGALTACSHAGLVEEGLLLFENMKRVHGIRH 361

Query: 363 RIEHYGCIVDLYGRAGRLEDALNMIEEMPMKPNEVVLGSLLAACRTHGDVNLAERLMKHL 422
           RIEHYGCIVDLY RAGRLEDALN+IE MPMKPNEVVLGSLLAACRTHGD+ LAERLMK+L
Sbjct: 362 RIEHYGCIVDLYSRAGRLEDALNVIEYMPMKPNEVVLGSLLAACRTHGDITLAERLMKYL 421

Query: 423 FKLDPEGDAYYVLLSNIYAAIGKWDGANNVRRTMKARGVQKKPGYSSVEIDGKVHEFVAG 482
             LDP GD+ YVLL+N+YAA+GKWDGA  VR+TMKA G+QK PG+SS+EID  +HEFVAG
Sbjct: 422 SDLDPGGDSNYVLLANMYAAVGKWDGAGKVRKTMKALGIQKTPGFSSIEIDCNIHEFVAG 481

Query: 483 DNYHADADNIYSMLDLLCHELKVCGYVPGS 513
           D  H D + IYSML+LL  ELK  GYVPG+
Sbjct: 482 DKSHVDKNCIYSMLELLSSELKASGYVPGN 502

The following BLAST results are available for this feature:
Match Name    E-value    Identity   Description
PPR13_ARATH   8.4e-165   59.83      Pentatricopeptide repeat-containing protein At1g05750, chloroplastic OS=Arabidop...
PP249_ARATH   2.4e-103   38.57      Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN...
PP175_ARATH   2.5e-100   36.54      Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP151_ARATH   3.1e-98    39.21      Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN...
PP354_ARATH   8.9e-98    37.18      Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t...
Match Name         E-value    Identity   Description
A0A0A0LYD6_CUCSA   6.8e-310   99.62      Uncharacterized protein OS=Cucumis sativus GN=Csa_1G169950 PE=4 SV=1
F6HAB7_VITVI       3.8e-196   66.40      Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0009g00650 PE=4 SV=...
W9SDQ7_9ROSA       8.7e-193   65.69      Uncharacterized protein OS=Morus notabilis GN=L484_001219 PE=4 SV=1
A0A0B2RPQ9_GLYSO   3.1e-190   61.11      Pentatricopeptide repeat-containing protein, chloroplastic OS=Glycine soja GN=gl...
K7M2Y7_SOYBN       3.1e-190   61.11      Uncharacterized protein OS=Glycine max GN=GLYMA_13G313200 PE=4 SV=1
Match Name    E-value    Identity   Description
AT1G05750.1   4.7e-166   59.83      Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1   1.4e-104   38.57      Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatrico...
AT2G29760.1   1.4e-101   36.54      Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G13600.1   1.7e-99    39.21      Pentatricopeptide repeat (PPR) superfamily protein
AT4G37380.1   5.0e-99    37.18      Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                          E-value    Identity   Description
gi|449443656|ref|XP_004139593.1|    9.8e-310   99.62      PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic ...
gi|659118080|ref|XP_008458940.1|    1.1e-273   90.46      PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic ...
gi|1009113399|ref|XP_015873124.1|   1.2e-200   66.54      PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic ...
gi|359479098|ref|XP_002274209.2|    5.4e-196   66.40      PREDICTED: pentatricopeptide repeat-containing protein At1g05750, chloroplastic ...
gi|703084743|ref|XP_010092553.1|    1.2e-192   65.69      hypothetical protein L484_001219 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0009451       RNA modification
biological_process   GO:0008152       metabolic process
biological_process   GO:0008150       biological_process
cellular_component   GO:0009507       chloroplast
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0016021       integral component of membrane
molecular_function   GO:0016787       hydrolase activity
molecular_function   GO:0005515       protein binding
molecular_function   GO:0008568       microtubule-severing ATPase activity
molecular_function   GO:0008270       zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI01G15130.1   CSPI01G15130.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description                         Source     Source Term        Source Description    Alignment
IPR002885   Pentatricopeptide repeat                PFAM       PF01535            PPR
    coord: 294..324   score: 1.3E-5
    coord: 329..356   score: 0.0061
    coord: 266..291   score: 0.0012
    coord: 366..391   score: 0.
IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2
    coord: 57..103    score: 1.3E-7
    coord: 191..237   score: 1.4E-8
    coord: 159..190   score: 9.
IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756
    coord: 193..225   score: 1.1E-7
    coord: 329..356   score: 0.0029
    coord: 367..390   score: 7.8E-4
    coord: 294..327   score: 1.6E-5
    coord: 162..190   score: 9.
IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR
    coord: 363..397   score: 8.166
    coord: 160..190   score: 11.071
    coord: 56..90     score: 9.668
    coord: 191..225   score: 11.564
    coord: 429..463   score: 6.862
    coord: 292..326   score: 11.082
    coord: 327..357   score: 7.859
    coord: 261..291   score: 8
IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10
    coord: 67..105    score: 2.0E-9
    coord: 163..222   score: 2.0E-9
    coord: 361..449   score: 2.
IPR011990   Tetratricopeptide-like helical domain   unknown    SSF48452           TPR-like
    coord: 171..215   score: 8.54E-6
    coord: 378..450   score: 8.5
None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 1..470     score: 6.6E
None        No IPR available                        PANTHER    PTHR24015:SF778    SUBFAMILY NOT NAMED
    coord: 1..470     score: 6.6E
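
Each entry in the Alignment column is a `coord: start..end` range on the protein followed by a `score:` value. The following minimal Python sketch pulls those pairs out of the text and computes each match length, assuming the coord/score layout shown above. The function name `parse_alignment` is hypothetical. Scores are kept as raw strings because some values in this export are cut off (for example `0.`), so converting them to numbers would misrepresent them.

```python
import re

# Assumed layout: "coord: 294..324" followed by "score: 1.3E-5".
# The score pattern also accepts cut-off values such as "0." or "6.6E".
PAIR_RE = re.compile(
    r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([\d.]+(?:E-?\d*)?)"
)

def parse_alignment(text):
    """Hypothetical helper: return match spans sorted by start position."""
    spans = []
    for start, end, raw_score in PAIR_RE.findall(text):
        start, end = int(start), int(end)
        spans.append({
            "start": start,
            "end": end,
            "length": end - start + 1,   # coordinates are inclusive
            "raw_score": raw_score,      # left unparsed on purpose
        })
    return sorted(spans, key=lambda s: s["start"])

# PF01535 matches, copied from the table above.
example = (
    "coord: 294..324\nscore: 1.3E-5coord: 329..356\n"
    "score: 0.0061coord: 266..291\nscore: 0.0012coord: 366..391\nscore: 0."
)
for span in parse_alignment(example):
    print(span["start"], span["end"], span["length"], span["raw_score"])
```

Sorting by start position makes it easier to see how the individual PPR matches tile along the protein.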