CSPI01G15330 (gene) Wild cucumber (PI 183967)

Name: CSPI01G15330
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein
Location: Chr1 : 10934919 .. 10937419 (+)
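
The location field can be turned into a genomic span. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the page itself does not state this). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr1 : 10934919 .. 10937419 (+)'.

    Assumption: start and end are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
    m = re.match(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 10934919 .. 10937419 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr1 + 2501
```

Under that assumption the gene spans 2501 bp on the plus strand of Chr1.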
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
ATAGAATTCTTGCTTTTTAAAAAAACGTAAAAAAGAAAAGTTATATTTTTCGCACAAAAAGACCAATCAATTTGAAATGGGATATTTTAATGTAAAGGCCTGCAGCCGATAAGGCGGGGACGAGCCATTCAACCGGAAGCCCAGAAGTGCAAGCCGGAAAGGGTCACACAGTCGCCGGAATGGCGTTCCAGCTCTGCTATTCGCCGCCCACCTTCTTTACCGAACACCATTTCCTTTCCAATTCTCTCACTCCTCAACGTAAAACAACTCTGTCCAACTCCTCTCCTCTTTTCAAGCTCAGTCCCATTCCTCGTCACTCAAAACCGTTCCTCCAAATTACCAATGTCTCGCTACAGGAACACGCTCCTCAAGATACCCAAAATACAATTCCCTCTGCTGATGAAATCTCCAAATACCCAGATTCGAAATCCGGTTCCTCCTCGAACAGCTCCGTTTGGGTCAATCCTAGAAGCCCCAGAGCTTCCAAACTTCGGAAGCAGTCGTACGAAGCAAGATATGCTTCTCTTATTAGAGTATCGGAGTCTTTAGACTCTTCTAATCCATGTGAGGTAGATGTTGCTGATGTCTTGAAGGTGATAGGTAATAACATTTTAGAACGGGACGCTATTTTAGTGCTGAATAACATGTCAAATTCCCAAACTGCGTTGCTTGCTCTTCGCTACTTCCAGGATATGCTGAAATCAAGTAAACAGACAATTTTTTATAACGTGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGAAGAAATGATTAACAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATTATTAGTTGTGCTAGGTTGTGTTCTTTACCAAGTAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTTACTTACTCTACGATGATTGATGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGTTTGTATGACCGTGCAAGAACGGAAAATTGGCGAATTGATCCTGCGACATTCTCGACAATGATCAAAATTCATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTTTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACTGCTTGTTGGATGCTATGGGTAGGGCTAAAAGACCTTGGCAGATCAAAACCATTTACAAAGAGATGATTAAAAATGGATTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTATGGAAGAGCCAGGTATGGTGAGGATGCTCTCATTGTTTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAACGTAATTCTCTACAATACGCTTTTAGCTATGTGTGCTGATGTTGGCTATGTTAATGAGGCTGTTGAAATTTTTCAAGATATGAAGAGTTCTGGCACTTGCTCACCTGACAGTTGGACATTTTCTTCCATGATCACTATATATTCCTGCGGTGGAAAAGTATCAGAGGCTGAAGAAATGTTGAACGATATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTAACATCACTAATCCAGTGCTATGGGAAAGCTAAACGTGTTGATGATGTAGTGAGGACATTCAATCAATTGATAGAGTTAGGATTAACTCCAGACGATCGATTCTGTGGCTGCCTTCTCAATGTGATTACCCAGACACCAAAAGGGGAACTTGGTAAGCTGATTGATTGTGTTGTGAGAGCTAATCCAAAACTCGGGTTTGTGGTTGAACTCTTGCTAGGGGAGCAAGACAAGGAAGGAAATTTCAGAACTGAAGCCTCAGAACTCTTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCTTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGATAAGGCATGTGAACTACTGGATTTGGGGCTTACACTTCAGATATATAAAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTTTAT
CTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGATAAAGGACTTAACAAAGGTACTTGAATCTGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAGAATTCAGATAAGGGTTTGGCAAGCGTCTTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGATGGTTTTTGACAACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAAGTTCCCCTGAATTGGTTGCAGCATAGGTTGTACCTGAAAGGTCCAATAATATCTGTTGTGATATCTTTCATTTAAACCCTTTTCTCCCCTTTCTTTTAGACGGCCATATCTTCTTGAAAGACAGCTTTATGGTAAAGGATGCAATGAAGAGTTCTGCTATCTTTGTTCTTTCATCGTATTATAATTCATTGGAATGTCATATCAATCTTTAATCACTGTTTTTTTGCTTACTG

mRNA sequence

ATGGCGTTCCAGCTCTGCTATTCGCCGCCCACCTTCTTTACCGAACACCATTTCCTTTCCAATTCTCTCACTCCTCAACGTAAAACAACTCTGTCCAACTCCTCTCCTCTTTTCAAGCTCAGTCCCATTCCTCGTCACTCAAAACCGTTCCTCCAAATTACCAATGTCTCGCTACAGGAACACGCTCCTCAAGATACCCAAAATACAATTCCCTCTGCTGATGAAATCTCCAAATACCCAGATTCGAAATCCGGTTCCTCCTCGAACAGCTCCGTTTGGGTCAATCCTAGAAGCCCCAGAGCTTCCAAACTTCGGAAGCAGTCGTACGAAGCAAGATATGCTTCTCTTATTAGAGTATCGGAGTCTTTAGACTCTTCTAATCCATGTGAGGTAGATGTTGCTGATGTCTTGAAGGTGATAGGTAATAACATTTTAGAACGGGACGCTATTTTAGTGCTGAATAACATGTCAAATTCCCAAACTGCGTTGCTTGCTCTTCGCTACTTCCAGGATATGCTGAAATCAAGTAAACAGACAATTTTTTATAACGTGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGAAGAAATGATTAACAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATTATTAGTTGTGCTAGGTTGTGTTCTTTACCAAGTAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTTACTTACTCTACGATGATTGATGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGTTTGTATGACCGTGCAAGAACGGAAAATTGGCGAATTGATCCTGCGACATTCTCGACAATGATCAAAATTCATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTTTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACTGCTTGTTGGATGCTATGGGTAGGGCTAAAAGACCTTGGCAGATCAAAACCATTTACAAAGAGATGATTAAAAATGGATTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTATGGAAGAGCCAGGTATGGTGAGGATGCTCTCATTGTTTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAACGTAATTCTCTACAATACGCTTTTAGCTATGTGTGCTGATGTTGGCTATGTTAATGAGGCTGTTGAAATTTTTCAAGATATGAAGAGTTCTGGCACTTGCTCACCTGACAGTTGGACATTTTCTTCCATGATCACTATATATTCCTGCGGTGGAAAAGTATCAGAGGCTGAAGAAATGTTGAACGATATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTAACATCACTAATCCAGTGCTATGGGAAAGCTAAACGTGTTGATGATGTAGTGAGGACATTCAATCAATTGATAGAGTTAGGATTAACTCCAGACGATCGATTCTGTGGCTGCCTTCTCAATGTGATTACCCAGACACCAAAAGGGGAACTTGGTAAGCTGATTGATTGTGTTGTGAGAGCTAATCCAAAACTCGGGTTTGTGGTTGAACTCTTGCTAGGGGAGCAAGACAAGGAAGGAAATTTCAGAACTGAAGCCTCAGAACTCTTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCTTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGATAAGGCATGTGAACTACTGGATTTGGGGCTTACACTTCAGATATATAAAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTTTATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGATAAAGGACTTAACAAAGGTACTTGAATCTGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAGAATTCAGATAAGGGTTTGGCAAGCGTCTTTGAATCACATTTAAAGGA
ATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGATGGTTTTTGACAACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAAGTTCCCCTGAATTGGTTGCAGCATAG

Coding sequence (CDS)

ATGGCGTTCCAGCTCTGCTATTCGCCGCCCACCTTCTTTACCGAACACCATTTCCTTTCCAATTCTCTCACTCCTCAACGTAAAACAACTCTGTCCAACTCCTCTCCTCTTTTCAAGCTCAGTCCCATTCCTCGTCACTCAAAACCGTTCCTCCAAATTACCAATGTCTCGCTACAGGAACACGCTCCTCAAGATACCCAAAATACAATTCCCTCTGCTGATGAAATCTCCAAATACCCAGATTCGAAATCCGGTTCCTCCTCGAACAGCTCCGTTTGGGTCAATCCTAGAAGCCCCAGAGCTTCCAAACTTCGGAAGCAGTCGTACGAAGCAAGATATGCTTCTCTTATTAGAGTATCGGAGTCTTTAGACTCTTCTAATCCATGTGAGGTAGATGTTGCTGATGTCTTGAAGGTGATAGGTAATAACATTTTAGAACGGGACGCTATTTTAGTGCTGAATAACATGTCAAATTCCCAAACTGCGTTGCTTGCTCTTCGCTACTTCCAGGATATGCTGAAATCAAGTAAACAGACAATTTTTTATAACGTGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGAAGAAATGATTAACAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATTATTAGTTGTGCTAGGTTGTGTTCTTTACCAAGTAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTTACTTACTCTACGATGATTGATGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGTTTGTATGACCGTGCAAGAACGGAAAATTGGCGAATTGATCCTGCGACATTCTCGACAATGATCAAAATTCATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTTTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACTGCTTGTTGGATGCTATGGGTAGGGCTAAAAGACCTTGGCAGATCAAAACCATTTACAAAGAGATGATTAAAAATGGATTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTATGGAAGAGCCAGGTATGGTGAGGATGCTCTCATTGTTTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAACGTAATTCTCTACAATACGCTTTTAGCTATGTGTGCTGATGTTGGCTATGTTAATGAGGCTGTTGAAATTTTTCAAGATATGAAGAGTTCTGGCACTTGCTCACCTGACAGTTGGACATTTTCTTCCATGATCACTATATATTCCTGCGGTGGAAAAGTATCAGAGGCTGAAGAAATGTTGAACGATATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTAACATCACTAATCCAGTGCTATGGGAAAGCTAAACGTGTTGATGATGTAGTGAGGACATTCAATCAATTGATAGAGTTAGGATTAACTCCAGACGATCGATTCTGTGGCTGCCTTCTCAATGTGATTACCCAGACACCAAAAGGGGAACTTGGTAAGCTGATTGATTGTGTTGTGAGAGCTAATCCAAAACTCGGGTTTGTGGTTGAACTCTTGCTAGGGGAGCAAGACAAGGAAGGAAATTTCAGAACTGAAGCCTCAGAACTCTTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCTTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGATAAGGCATGTGAACTACTGGATTTGGGGCTTACACTTCAGATATATAAAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTTTATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGATAAAGGACTTAACAAAGGTACTTGAATCTGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAGAATTCAGATAAGGGTTTGGCAAGCGTCTTTGAATCACATTTAAAGGA
ATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGATGGTTTTTGACAACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAAGTTCCCCTGAATTGGTTGCAGCATAG
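
As a quick consistency check, the coding sequence can be translated and compared with the query protein shown in the BLAST alignments. The sketch below assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any database tool. Only the first 33 and last 9 bases of the CDS above are used.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first; unknown codons map to 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 11 codons of the CDS listed above.
print(translate("ATGGCGTTCCAGCTCTGCTATTCGCCGCCCACC"))  # MAFQLCYSPPT
# Last 3 codons of the CDS, ending in the TAG stop codon.
print(translate("GCAGCATAG"))  # AA
```

The translated prefix `MAFQLCYSPPT` agrees with the start of the query sequence in the alignments, and the final codons give the C-terminal `AA` followed by a stop.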
BLAST of CSPI01G15330 vs. Swiss-Prot
Match: PP314_ARATH (Pentatricopeptide repeat-containing protein At4g16390, chloroplastic OS=Arabidopsis thaliana GN=P67 PE=1 SV=3)

HSP 1 Score: 916.8 bits (2368), Expect = 1.5e-265
Identity = 452/702 (64.39%), Positives = 560/702 (79.77%), Query Frame = 1

Query: 5   LCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQEHAPQ 64
           LC SP +   +   L N L+   K+T  +    +  +    HS+  LQ T+VS+QE  PQ
Sbjct: 6   LCSSPSSLLHDPLPLCNLLSVYPKSTPRSFLSSYNPNSSHFHSRNLLQATHVSVQEAIPQ 65

Query: 65  DTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVSESLD 124
             ++ +   D     P     ++S S VWVNP+SPRAS+LR++SY++RY+SLI+++ESLD
Sbjct: 66  SEKSKLVDVDLPIPEP-----TASKSYVWVNPKSPRASQLRRKSYDSRYSSLIKLAESLD 125

Query: 125 SSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTIFYNV 184
           +  P E DV DV+   G  + E+DA++ LNNM+N +TA L L    + +K S++ I YNV
Sbjct: 126 ACKPNEADVCDVITGFGGKLFEQDAVVTLNNMTNPETAPLVLNNLLETMKPSREVILYNV 185

Query: 185 TLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKMPSFD 244
           T+KVFRK +D+E +EKLF+EM+ RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF 
Sbjct: 186 TMKVFRKSKDLEKSEKLFDEMLERGIKPDNATFTTIISCARQNGVPKRAVEWFEKMSSFG 245

Query: 245 CNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL 304
           C PD+VT + MIDAYGRAGNVDMA SLYDRARTE WRID  TFST+I+I+GV+GNYDGCL
Sbjct: 246 CEPDNVTMAAMIDAYGRAGNVDMALSLYDRARTEKWRIDAVTFSTLIRIYGVSGNYDGCL 305

Query: 305 NVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY 364
           N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAY
Sbjct: 306 NIYEEMKALGVKPNLVIYNRLIDSMGRAKRPWQAKIIYKDLITNGFTPNWSTYAALVRAY 365

Query: 365 GRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGTCSPD 424
           GRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  YV+EA EIFQDMK+  TC PD
Sbjct: 366 GRARYGDDALAIYREMKEKGLSLTVILYNTLLSMCADNRYVDEAFEIFQDMKNCETCDPD 425

Query: 425 SWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN 484
           SWTFSS+IT+Y+C G+VSEAE  L  M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF+
Sbjct: 426 SWTFSSLITVYACSGRVSEAEAALLQMREAGFEPTLFVLTSVIQCYGKAKQVDDVVRTFD 485

Query: 485 QLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQD-KEGN 544
           Q++ELG+TPDDRFCGCLLNV+TQTP  E+GKLI CV +A PKLG VV++L+ EQ+ +EG 
Sbjct: 486 QVLELGITPDDRFCGCLLNVMTQTPSEEIGKLIGCVEKAKPKLGQVVKMLVEEQNCEEGV 545

Query: 545 FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ 604
           F+ EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IY  LQS+S TQ
Sbjct: 546 FKKEASELIDSIGSDVKKAYLNCLIDLCVNLNKLERACEILQLGLEYDIYTGLQSKSATQ 605

Query: 605 WSLYLKGLSLGAALTALHVWIKDLTK-VLESGEELPPLLGINTGHGKHKNSDKGLASVFE 664
           WSL+LK LSLGAALTALHVW+ DL++  LESGEE PPLLGINTGHGKHK SDKGLA+VFE
Sbjct: 606 WSLHLKSLSLGAALTALHVWMNDLSEAALESGEEFPPLLGINTGHGKHKYSDKGLAAVFE 665

Query: 665 SHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           SHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V+A
Sbjct: 666 SHLKELNAPFHEAPDKVGWFLTTSVAAKAWLESRRSAGGVSA 702
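
The percentages in each HSP header follow directly from the reported counts (matches divided by alignment length). The sketch below recomputes them from a header line; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the `name = n/m` layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Extract 'name = n/m' pairs from an HSP header line.

    Returns a dict mapping the lower-cased name to a tuple
    (matches, alignment_length, percentage rounded to two decimals).
    """
    stats = {}
    for name, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        num, den = int(num), int(den)
        stats[name.lower()] = (num, den, round(100.0 * num / den, 2))
    return stats

header = "Identity = 452/702 (64.39%), Positives = 560/702 (79.77%), Query Frame = 1"
stats = parse_hsp_stats(header)
print(stats["identity"])   # (452, 702, 64.39)
print(stats["positives"])  # (560, 702, 79.77)
```

Fields without a fraction, such as the query frame, are ignored by this parser.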

BLAST of CSPI01G15330 vs. Swiss-Prot
Match: PP420_ARATH (Pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Arabidopsis thaliana GN=At5g46580 PE=2 SV=1)

HSP 1 Score: 469.5 bits (1207), Expect = 6.3e-131
Identity = 268/709 (37.80%), Positives = 407/709 (57.40%), Query Frame = 1

Query: 2   AFQLCYSPPTFFTEHH--FLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQ 61
           A  +C++P    T+ H  FL  SL  Q ++   N S      P     +P    T    +
Sbjct: 8   AIDVCFNPQNSDTKKHSLFLKPSLFRQSRSRKLNISCSSLKQPKTLEEEPITTKTPSLSE 67

Query: 62  EHAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQ-------SYEAR 121
           +  P           +I   P          SVWVNP  P+ S L  Q       SY  +
Sbjct: 68  QLKPLSATTLRQEQTQILSKP---------KSVWVNPTRPKRSVLSLQRQKRSAYSYNPQ 127

Query: 122 YASLIRVSESLDSSNPCEV-DVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQD 181
              L   +  L+SS   E  +   +L  I +     +A+LVLN++   Q       + + 
Sbjct: 128 IKDLRAFALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWVKS 187

Query: 182 MLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPS 241
                 +TIFYNVT+K  R  R  +  E++  EM+  GV+ DN+T+STII+CA+ C+L +
Sbjct: 188 KSLFPMETIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNLYN 247

Query: 242 KAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMI 301
           KA+EWFE+M      PD+VTYS ++D Y ++G V+   SLY+RA    W+ D   FS + 
Sbjct: 248 KAIEWFERMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSVLG 307

Query: 302 KIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFS 361
           K+ G AG+YDG   V +EMK++ +KPN+V+YN LL+AMGRA +P   ++++ EM++ G +
Sbjct: 308 KMFGEAGDYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAGLT 367

Query: 362 PSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEI 421
           P+  T  +L++ YG+AR+  DAL +++EMK K   ++ ILYNTLL MCAD+G   EA  +
Sbjct: 368 PNEKTLTALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGLEEEAERL 427

Query: 422 FQDMKSSGTCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYG 481
           F DMK S  C PD++++++M+ IY  GGK  +A E+  +M++AG   N+   T L+QC G
Sbjct: 428 FNDMKESVQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMGCTCLVQCLG 487

Query: 482 KAKRVDDVVRTFNQLIELGLTPDDRFCGCLLNVITQTPKGE-LGKLIDCVVRANPKLGFV 541
           KAKR+DDVV  F+  I+ G+ PDDR CGCLL+V+      E   K++ C+ RAN KL   
Sbjct: 488 KAKRIDDVVYVFDLSIKRGVKPDDRLCGCLLSVMALCESSEDAEKVMACLERANKKLVTF 547

Query: 542 VELLLGEQDKEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTL 601
           V L++ E+ +    + E   + +    + R+ +CNCLID+C   +  ++A ELL LG   
Sbjct: 548 VNLIVDEKTEYETVKEEFKLVINATQVEARRPFCNCLIDICRGNNRHERAHELLYLGTLF 607

Query: 602 QIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKH 661
            +Y  L +++  +WSL ++ LS+GAA TAL  W++ L  +++  EELP L    TG G H
Sbjct: 608 GLYPGLHNKTIKEWSLDVRSLSVGAAETALEEWMRTLANIIKRQEELPELFLAQTGTGTH 667

Query: 662 KNSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSP 700
           + S +GLA+ F  HL++L+APF ++ ++ G F+ TK    SWLES+  P
Sbjct: 668 RFS-QGLANSFALHLQQLSAPFRQS-DRPGIFVATKEDLVSWLESKFPP 705

BLAST of CSPI01G15330 vs. Swiss-Prot
Match: PP178_ARATH (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 203.4 bits (516), Expect = 8.4e-51
Identity = 148/567 (26.10%), Positives = 272/567 (47.97%), Query Frame = 1

Query: 167 RYFQDMLKSSKQT--IFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCA 226
           ++F +M ++  Q   I +N  L V  +    E A  LF+EM NR ++ D  +++T++   
Sbjct: 325 KFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVFSYNTLLDAI 384

Query: 227 RLCSLPSKAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDP 286
                   A E   +MP     P+ V+YST+ID + +AG  D A +L+   R     +D 
Sbjct: 385 CKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMRYLGIALDR 444

Query: 287 ATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKE 346
            +++T++ I+   G  +  L++  EM ++GIK ++V YN LL   G+  +  ++K ++ E
Sbjct: 445 VSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYDEVKKVFTE 504

Query: 347 MIKNGFSPSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGY 406
           M +    P+  TY++L+  Y +    ++A+ +++E K  GL+ +V+LY+ L+      G 
Sbjct: 505 MKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVLYSALIDALCKNGL 564

Query: 407 VNEAVEIFQDMKSSGTCSPDSWTFSSMITI------------YSCGGKVSEAEEMLNDMV 466
           V  AV +  +M   G  SP+  T++S+I              YS GG +  +   L+ + 
Sbjct: 565 VGSAVSLIDEMTKEG-ISPNVVTYNSIIDAFGRSATMDRSADYSNGGSLPFSSSALSALT 624

Query: 467 EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNQLIELGLTPDDRFCGCLLN 526
           E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN
Sbjct: 625 ETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILEVFRKMHQLEIKPNVVTFSAILN 684

Query: 527 VITQTPKGELGKLIDCVVRA--NPKLGFVVELLLGEQDKEGNFRTEASELFSVVS---AD 586
             ++    E   ++   +R   N   G V  LL+G+++   N   +A  LF  V+     
Sbjct: 685 ACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQRE---NVWLQAQSLFDKVNEMDGS 744

Query: 587 VRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYKDLQSRSPTQWSLYLKGLSLGAAL 646
              A+ N L D+  +     +  EL+ L G + Q+++++ S S     L L  +S GAA 
Sbjct: 745 TASAFYNALTDMLWHFG-QKRGAELVALEGRSRQVWENVWSDS----CLDLHLMSSGAAR 804

Query: 647 TALHVWIKDLTKVLESGEELPPLLGINTGHGKHKN--SDKGLASVFESHLKELNAPFHEA 703
             +H W+ ++  ++  G ELP +L I TG GKH     D  L    E  L+ ++APFH +
Sbjct: 805 AMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALRRAVEVLLRGMDAPFHLS 864

BLAST of CSPI01G15330 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 9.3e-42
Identity = 91/332 (27.41%), Positives = 172/332 (51.81%), Query Frame = 1

Query: 165 ALRYFQDMLKS--SKQTIFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIIS 224
           A + F++M  +  S   + YN  L V+ K    + A K+  EM+  G  P  VT++++IS
Sbjct: 298 AAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLIS 357

Query: 225 CARLCSLPSKAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRI 284
                 +  +A+E   +M      PD  TY+T++  + RAG V+ A S+++  R    + 
Sbjct: 358 AYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKP 417

Query: 285 DPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIY 344
           +  TF+  IK++G  G +   + +++E+   G+ P++V +N LL   G+     ++  ++
Sbjct: 418 NICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVF 477

Query: 345 KEMIKNGFSPSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADV 404
           KEM + GF P   T+ +L+ AY R    E A+ VY+ M + G+  ++  YNT+LA  A  
Sbjct: 478 KEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARG 537

Query: 405 GYVNEAVEIFQDMKSSGTCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFV 464
           G   ++ ++  +M+  G C P+  T+ S++  Y+ G ++     +  ++     +P   +
Sbjct: 538 GMWEQSEKVLAEME-DGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVL 597

Query: 465 LTSLIQCYGKAKRVDDVVRTFNQLIELGLTPD 495
           L +L+    K   + +  R F++L E G +PD
Sbjct: 598 LKTLVLVCSKCDLLPEAERAFSELKERGFSPD 628

BLAST of CSPI01G15330 vs. Swiss-Prot
Match: PP123_ARATH (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 166.8 bits (421), Expect = 8.7e-40
Identity = 113/495 (22.83%), Positives = 227/495 (45.86%), Query Frame = 1

Query: 209 GVKPDNVTFSTIISCARLCSLPSKAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMA 268
           G K D  T++T++          +  +  ++M    C P+ VTY+ +I +YGRA  +  A
Sbjct: 354 GFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTVTYNRLIHSYGRANYLKEA 413

Query: 269 FSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDA 328
            +++++ +      D  T+ T+I IH  AG  D  +++Y+ M+  G+ P+   Y+ +++ 
Sbjct: 414 MNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINC 473

Query: 329 MGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLN 388
           +G+A        ++ EM+  G +P+  T+  ++  + +AR  E AL +Y++M+  G Q +
Sbjct: 474 LGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNYETALKLYRDMQNAGFQPD 533

Query: 389 VILYNTLLAMCADVGYVNEAVEIFQDMKSSGTCSPDSWTFSSMITIYSCGGKVSEAEEML 448
            + Y+ ++ +    G++ EA  +F +M+      PD   +  ++ ++   G V +A +  
Sbjct: 534 KVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWV-PDEPVYGLLVDLWGKAGNVDKAWQWY 593

Query: 449 NDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLIELGLTPDDRFCGCLLNVITQT 508
             M++AG  PN+    SL+  + +  R+ +       ++ LGL P  +    LL+  T  
Sbjct: 594 QAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGLHPSLQTYTLLLSCCTDA 653

Query: 509 ----PKGELGKLIDCVVRANPKLGFVVELLLGEQDKEGNFRTEASELFSVVSADVR---K 568
                 G  G+L+   V  +P   F++++     D +   R   S     + ++ R   +
Sbjct: 654 RSNFDMGFCGQLM--AVSGHPAHMFLLKMPPAGPDGQ-KVRDHVSNFLDFMHSEDRESKR 713

Query: 569 AYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKD-LQSRSPTQWSLYLKGLSLGAALTAL 628
              + ++D      L ++A  + ++     +Y D L+ +S + W + L  +S G A+ AL
Sbjct: 714 GLMDAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIAL 773

Query: 629 HVWIKDLTKVLESGEELPPLLGINTGHGKHK--NSDKGLASVFESHLKELNAPFHEAPEK 688
              +    K +    + P  + I TG G+         +    E  L   N PF      
Sbjct: 774 SRTLAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFNFPFFTENGN 833

Query: 689 VGWFLTTKVAAKSWL 694
            G F+ +    K+WL
Sbjct: 834 SGCFVGSGEPLKNWL 844

BLAST of CSPI01G15330 vs. TrEMBL
Match: A0A0A0LVP1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G173140 PE=4 SV=1)

HSP 1 Score: 1410.2 bits (3649), Expect = 0.0e+00
Identity = 703/704 (99.86%), Positives = 703/704 (99.86%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE 60
           MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE
Sbjct: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE 60

Query: 61  HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS 120
           HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS
Sbjct: 61  HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS 120

Query: 121 ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI 180
           ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI
Sbjct: 121 ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI 180

Query: 181 FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM 240
           FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM
Sbjct: 181 FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM 240

Query: 241 PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY 300
           PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Sbjct: 241 PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY 300

Query: 301 DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL 360
           DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL
Sbjct: 301 DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL 360

Query: 361 LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT 420
           LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT
Sbjct: 361 LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT 420

Query: 421 CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV 480
           CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV
Sbjct: 421 CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV 480

Query: 481 RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK 540
           RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK
Sbjct: 481 RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK 540

Query: 541 EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS 600
           EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
Sbjct: 541 EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS 600

Query: 601 PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLASV 660
           PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHK SDKGLASV
Sbjct: 601 PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV 660

Query: 661 FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA
Sbjct: 661 FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 704

BLAST of CSPI01G15330 vs. TrEMBL
Match: F6HCW3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0194g00270 PE=4 SV=1)

HSP 1 Score: 1050.8 bits (2716), Expect = 7.3e-304
Identity = 513/704 (72.87%), Positives = 601/704 (85.37%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTL-SNSSPLFKLSPIPRHSKPFLQITNVSLQ 60
           MA+ LC SP +   +HH+L NSL+  RK+ L S +S  FK + +  HS+ FLQIT+VSL+
Sbjct: 1   MAYHLCSSPSSLCHDHHYLHNSLSFSRKSRLRSFNSFSFKPNSLSLHSRTFLQITHVSLE 60

Query: 61  EHAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRV 120
           +  PQ+TQ    S    S+ PD K+     S +WVNPRSPRASKLR+ SY+ARYASL+++
Sbjct: 61  DPIPQETQKADASNPPNSQDPDRKT----KSYIWVNPRSPRASKLRQHSYDARYASLVKI 120

Query: 121 SESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQT 180
           +ESLDS    E DV+ VL+ +G+ ILE+DA++VLNNM+N +TALLA  +F+  LK S++ 
Sbjct: 121 AESLDSCEATEEDVSQVLRCLGDKILEQDAVIVLNNMTNPETALLAFGFFRKRLKPSREV 180

Query: 181 IFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEK 240
           I YNVTLKVFRKCR+++ AEKLF+EM+ RGVKPDN+TFSTIISCAR+ SLP+KAVEWFEK
Sbjct: 181 ILYNVTLKVFRKCRNLDRAEKLFDEMLERGVKPDNITFSTIISCARVSSLPNKAVEWFEK 240

Query: 241 MPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN 300
           MP F C+PDDVTYS MIDAYGRAGNVDMA  LYDRARTE WRIDP TFST+I+I+G++GN
Sbjct: 241 MPEFGCHPDDVTYSAMIDAYGRAGNVDMALKLYDRARTEKWRIDPVTFSTLIRIYGMSGN 300

Query: 301 YDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYAS 360
           +DGCLNVYEEMKA+G+KPNLVIYN LLDAMGRAKRPWQ K IYKEM  NG  PSW TYA+
Sbjct: 301 FDGCLNVYEEMKALGVKPNLVIYNTLLDAMGRAKRPWQAKNIYKEMTNNGLQPSWGTYAA 360

Query: 361 LLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSG 420
           LLRAYGRARY EDALIVYKEMKEKGL+L+V+LYNTLLAMCADVGY  EA  IF+DMKSSG
Sbjct: 361 LLRAYGRARYAEDALIVYKEMKEKGLELSVVLYNTLLAMCADVGYTEEAAAIFEDMKSSG 420

Query: 421 TCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDV 480
            C PDSWTFSS+ITIYSC GKVSEAE MLN M+EAGF+PNIFVLTSLIQCYGKA R D+V
Sbjct: 421 NCMPDSWTFSSLITIYSCSGKVSEAEAMLNAMLEAGFEPNIFVLTSLIQCYGKANRTDEV 480

Query: 481 VRTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQD 540
           VRTF++L+EL +TPDDRFCGC+LNV+TQ+PK ELGKLIDC+ +ANPKLG VV+LLL EQ+
Sbjct: 481 VRTFDRLLELDITPDDRFCGCMLNVMTQSPKEELGKLIDCIDKANPKLGNVVKLLLEEQN 540

Query: 541 KEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR 600
            EG FR EASELF  +SADV+KAYCNCLIDLCVNL+LL+KACEL DLGLTL+IY D+QS+
Sbjct: 541 GEGTFRKEASELFDSISADVKKAYCNCLIDLCVNLNLLEKACELFDLGLTLEIYIDIQSK 600

Query: 601 SPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLAS 660
           SPTQWSL+LK LSLGAALTALH+W+ DL+K +E GEELP +LGINTGHGKHK SDKGLAS
Sbjct: 601 SPTQWSLHLKSLSLGAALTALHIWMNDLSKAVEVGEELPAVLGINTGHGKHKYSDKGLAS 660

Query: 661 VFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVA 704
           VFESHLKELNAPFHEAP+KVGWFLTTKVAA SWLESRS+PELVA
Sbjct: 661 VFESHLKELNAPFHEAPDKVGWFLTTKVAATSWLESRSAPELVA 700

BLAST of CSPI01G15330 vs. TrEMBL
Match: A0A061GPA6_THECC (Pentatricopeptide (PPR) repeat-containing protein OS=Theobroma cacao GN=TCM_030422 PE=4 SV=1)

HSP 1 Score: 1049.3 bits (2712), Expect = 2.1e-303
Identity = 512/701 (73.04%), Positives = 612/701 (87.30%), Query Frame = 1

Query: 5   LCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQEHAPQ 64
           LC SP + F + H LS S  P+      +++P  +L      SK  +QI++VSLQ+   Q
Sbjct: 6   LCSSPSSVFHDRHTLSASPKPR---PARSTAPSLRLVSCSFQSKSSIQISHVSLQDPITQ 65

Query: 65  DTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVSESLD 124
            T+NT   ++  S+ PD K+GSSS S VWVNPRSPRAS+LR+ SY++RY+SL++V+E+LD
Sbjct: 66  -TKNTPKHSN--SQSPDGKTGSSSKSYVWVNPRSPRASRLRQLSYDSRYSSLVKVAETLD 125

Query: 125 SSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLK-SSKQTIFYN 184
           S NP E DV  VL  +GN++LE+DA++VLNNMSN  TALLAL +FQ +LK +S++ I YN
Sbjct: 126 SCNPNEHDVLSVLSRLGNDVLEQDAVVVLNNMSNPHTALLALNHFQRILKKTSREVILYN 185

Query: 185 VTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKMPSF 244
           VT+KVFRK +D++GAEKLF+EM+ +GVKPDNVTFST+ISCAR+C+LP KAVEWFEKMP +
Sbjct: 186 VTMKVFRKSKDLDGAEKLFDEMLQKGVKPDNVTFSTLISCARVCALPDKAVEWFEKMPIY 245

Query: 245 DCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGC 304
            C+PDDVTYS MIDAYGRAGNVDMAF+LYDRARTE WRIDP TFST+IKI+G++GNYDGC
Sbjct: 246 GCDPDDVTYSAMIDAYGRAGNVDMAFNLYDRARTEKWRIDPVTFSTLIKIYGISGNYDGC 305

Query: 305 LNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRA 364
           LNVYEEMKA+G KPN+VIYN LLDAMGRAKRPWQ KTIYKEM  NGFSP+WATYA+LLRA
Sbjct: 306 LNVYEEMKALGAKPNVVIYNTLLDAMGRAKRPWQAKTIYKEMTNNGFSPNWATYAALLRA 365

Query: 365 YGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGTCSP 424
           YGRARYGEDAL +YKEMK+KGL+L VILYNTLLAMCADVGY +EAVEIF+DMK+SGTC P
Sbjct: 366 YGRARYGEDALNIYKEMKDKGLELTVILYNTLLAMCADVGYADEAVEIFEDMKNSGTCKP 425

Query: 425 DSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF 484
           DSWT+SS+ITIYSC GKVSEAE ++++M+EAGF+PNIFVLTSLIQCYGKA+  DDVVRTF
Sbjct: 426 DSWTYSSLITIYSCSGKVSEAEGIVDEMLEAGFEPNIFVLTSLIQCYGKAQHTDDVVRTF 485

Query: 485 NQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDKEGN 544
           N+++ELG+TPDDRFCGCLLNV+TQTP+ EL KL DC+ +ANPKLG VV+LL+ EQD +GN
Sbjct: 486 NRVLELGITPDDRFCGCLLNVMTQTPREELAKLTDCIKKANPKLGHVVKLLVEEQDGQGN 545

Query: 545 FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ 604
           F+ EASELF+ + +DV+KAYCNCLIDLCVNLDLL++ACELL+LGL+L+IY D+QSRSPTQ
Sbjct: 546 FKNEASELFNCIGSDVKKAYCNCLIDLCVNLDLLERACELLELGLSLEIYADVQSRSPTQ 605

Query: 605 WSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLASVFES 664
           WSL LK LSLGAALT+LHVWI DLTKVLESGEELPPLLGINTGHGKHK SDKGLA+VFES
Sbjct: 606 WSLNLKSLSLGAALTSLHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLATVFES 665

Query: 665 HLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           HLKEL+APFHEAP+KVGWFLTT+VAAKSWLESRSSP+LVAA
Sbjct: 666 HLKELDAPFHEAPDKVGWFLTTQVAAKSWLESRSSPDLVAA 700

BLAST of CSPI01G15330 vs. TrEMBL
Match: A5B4A6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_001456 PE=4 SV=1)

HSP 1 Score: 1043.5 bits (2697), Expect = 1.2e-301
Identity = 511/704 (72.59%), Positives = 598/704 (84.94%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTL-SNSSPLFKLSPIPRHSKPFLQITNVSLQ 60
           MA+ LC SP +   +HH+L NSL+  RK+ L S +S  FK + +  HS+ FLQIT+VSL+
Sbjct: 1   MAYHLCSSPSSLCHDHHYLHNSLSFSRKSRLRSFNSFSFKPNSLSLHSRTFLQITHVSLE 60

Query: 61  EHAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRV 120
           +  PQ+TQ    S    S+ PD K+     S +WVNPRSPRASKLR+ SY+ARYASL+++
Sbjct: 61  DPIPQETQKADASNPPNSQDPDRKT----KSYIWVNPRSPRASKLRQHSYDARYASLVKI 120

Query: 121 SESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQT 180
           +ESLDS    E DV+ VL+ +G+ ILE+DA++VLNNM+N +TALLA  +F+  LK S++ 
Sbjct: 121 AESLDSCEATEEDVSQVLRCLGDKILEQDAVIVLNNMTNPETALLAFGFFRKRLKPSREV 180

Query: 181 IFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEK 240
           I YNVTLKVFRKCR+++ AEKLF+EM+ RGVKPDN+TFSTIISCAR+ SLP+KAVEWFEK
Sbjct: 181 ILYNVTLKVFRKCRNLDXAEKLFDEMLERGVKPDNITFSTIISCARVSSLPNKAVEWFEK 240

Query: 241 MPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN 300
           MP F C+PDDVTYS MIDAYGRAGNVDMA  LYDRARTE WRIDP TFST+I+I+G++GN
Sbjct: 241 MPEFGCHPDDVTYSAMIDAYGRAGNVDMALKLYDRARTEKWRIDPVTFSTLIRIYGMSGN 300

Query: 301 YDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYAS 360
           +DGCLNVYEEMKA+G+KPNLVIYN LLDAMGRAKRPWQ K IYKEM  NG   SW TYA+
Sbjct: 301 FDGCLNVYEEMKALGVKPNLVIYNTLLDAMGRAKRPWQAKNIYKEMTNNGLQLSWGTYAA 360

Query: 361 LLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSG 420
           LLRAYGRARY EDALIVYKEMKEKGL+L+V+LYNTLLAMCADVGY  EA  IF+DMKSSG
Sbjct: 361 LLRAYGRARYAEDALIVYKEMKEKGLELSVVLYNTLLAMCADVGYTEEAAAIFEDMKSSG 420

Query: 421 TCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDV 480
            C PDSWTFSS+ITIYSC GKVSEAE MLN M+EAGF+PNIFVLTSLIQCYGKA R D+V
Sbjct: 421 NCMPDSWTFSSLITIYSCSGKVSEAEAMLNAMLEAGFEPNIFVLTSLIQCYGKANRTDEV 480

Query: 481 VRTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQD 540
           VRTF++L+EL +TPDDRFCGC+LNV+TQ+PK ELGKLIDC+ +ANPKLG VV+LLL EQ+
Sbjct: 481 VRTFDRLLELDITPDDRFCGCMLNVMTQSPKEELGKLIDCIDKANPKLGNVVKLLLEEQN 540

Query: 541 KEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR 600
            EG FR EASELF  +SADV KAYCNCLIDLCVNL+LL+KACEL DLGLTL+IY D+QS+
Sbjct: 541 GEGTFRKEASELFDSISADVXKAYCNCLIDLCVNLNLLEKACELFDLGLTLEIYIDIQSK 600

Query: 601 SPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLAS 660
           SPTQWSL+LK LSLGAALTALH+W+ DL+K +E GEELP +LGINTGHGKHK SDKGLAS
Sbjct: 601 SPTQWSLHLKSLSLGAALTALHIWMNDLSKAVEVGEELPAVLGINTGHGKHKYSDKGLAS 660

Query: 661 VFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVA 704
           VFESHLKELNAPFHEAP+KV WFLTTKVAA SWLESRS+PELVA
Sbjct: 661 VFESHLKELNAPFHEAPDKVXWFLTTKVAATSWLESRSAPELVA 700

BLAST of CSPI01G15330 vs. TrEMBL
Match: M5WLZ8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002169mg PE=4 SV=1)

HSP 1 Score: 1023.1 bits (2644), Expect = 1.6e-295
Identity = 510/707 (72.14%), Positives = 591/707 (83.59%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSS-PLFKLSPIPRHSKPFLQITNVSLQ 60
           MA+ LC SP + F      S+SL   R   L +S     KLS    H++  LQI +VSLQ
Sbjct: 1   MAYHLCSSPSSLFPNRQTPSHSLPSPRGFRLGSSGLRTLKLSFPSLHARTSLQINHVSLQ 60

Query: 61  EHAPQDTQN--TIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLI 120
           E   Q+TQ    +P  +   +  +  SGS S S +WVNP SPRAS+LR++SY++RYASL+
Sbjct: 61  EPVAQETQTPTNVPEVESPQRQ-NRNSGSLSKSYIWVNPSSPRASQLRQKSYDSRYASLV 120

Query: 121 RVSESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSK 180
           +V+E L+S +P E DV + LK +G+ ILE+DA++VLNNM+N + ALLAL+YFQ  LK  +
Sbjct: 121 KVAEYLNSCSPSENDVFEALKGLGDRILEQDAVVVLNNMTNPENALLALKYFQQNLKPKR 180

Query: 181 QTIFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWF 240
           + I YNVTLKV RK +D++ AEKLF+E++ RGV+PDNVTFST+ISCAR+ SLP KAVEWF
Sbjct: 181 EVILYNVTLKVCRKGKDLDRAEKLFDELLKRGVQPDNVTFSTMISCARMSSLPDKAVEWF 240

Query: 241 EKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVA 300
           EKMPSF CNPDDVTYS MIDAYGR+G VDMAFSLYDRART  WRIDP TFST+IKIHG +
Sbjct: 241 EKMPSFGCNPDDVTYSAMIDAYGRSGKVDMAFSLYDRARTSKWRIDPVTFSTLIKIHGQS 300

Query: 301 GNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATY 360
           GN+DGCLNVYEEMKAIG KPNLVIYN LLDAMGRAKRPWQ K IY+EMI   FSP+W TY
Sbjct: 301 GNFDGCLNVYEEMKAIGAKPNLVIYNTLLDAMGRAKRPWQAKKIYREMINKEFSPNWVTY 360

Query: 361 ASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKS 420
           A+LLRAYGRARYG+DAL VY+EMKEKG++LNVILYNTLLAMCADVGY +EAVEIF+DMKS
Sbjct: 361 AALLRAYGRARYGDDALNVYREMKEKGMELNVILYNTLLAMCADVGYADEAVEIFKDMKS 420

Query: 421 SGTCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVD 480
           S T  PDSWTFSSMITIYSC GKV+EAE MLN+M+EAGF PNIF+LTSLIQCYGKAKR D
Sbjct: 421 SETWKPDSWTFSSMITIYSCSGKVTEAETMLNEMLEAGFQPNIFILTSLIQCYGKAKRTD 480

Query: 481 DVVRTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGE 540
           DVVR FNQL+ELG+TPD+RFCGCLLNV+TQTPK EL KL +C+ RA+ KLG+VV LL+ +
Sbjct: 481 DVVRIFNQLLELGITPDERFCGCLLNVMTQTPKEELCKLANCIERADEKLGYVVRLLVEK 540

Query: 541 QDKEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQ 600
           QD   NF+ EASELF+ + +DV+KAYCNCLIDLCVNLDLL++ACELLDLGLTLQIY D+Q
Sbjct: 541 QDNSVNFKKEASELFNSIGSDVKKAYCNCLIDLCVNLDLLERACELLDLGLTLQIYIDIQ 600

Query: 601 SRSPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGL 660
           SRS TQWSLYLKGLSLGAALTALHVWI DL++VLESGEELPPLLGINTGHGKHK SDKGL
Sbjct: 601 SRSQTQWSLYLKGLSLGAALTALHVWINDLSRVLESGEELPPLLGINTGHGKHKYSDKGL 660

Query: 661 ASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           ASVFESHLKELNAPFHEAP+K GWFLTTKVA KSWLESRSS ELVAA
Sbjct: 661 ASVFESHLKELNAPFHEAPDKAGWFLTTKVAVKSWLESRSSSELVAA 706
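
The percentages on each HSP summary line follow directly from the counts beside them: identities (or positives) divided by the alignment length. The sketch below is not part of the original report. It assumes the summary line has the layout shown in these reports, and the function name parse_hsp_stats is hypothetical. It parses one such line and recomputes both percentages so they can be checked against the printed values.

```python
import re

# Assumed layout: "Identity = 510/707 (72.14%), Positives = 591/707 (83.59%)".
# "Pos\w*tives" also accepts the "Postives" spelling that some report
# templates emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, reported_id, positives, pos_length, reported_pos = match.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Recomputed from the counts, rounded like the report (2 decimals).
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / pos_length, 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(reported_id),
        "reported_positive_pct": float(reported_pos),
    }


if __name__ == "__main__":
    line = "Identity = 510/707 (72.14%), Positives = 591/707 (83.59%), Query Frame = 1"
    stats = parse_hsp_stats(line)
    print(stats["identity_pct"], stats["reported_identity_pct"])
    print(stats["positive_pct"], stats["reported_positive_pct"])
```

The alignment length in the denominator counts gap columns as well, which is why it can differ from the length of either sequence.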

BLAST of CSPI01G15330 vs. TAIR10
Match: AT4G16390.1 (AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 916.8 bits (2368), Expect = 8.3e-267
Identity = 452/702 (64.39%), Positives = 560/702 (79.77%), Query Frame = 1

Query: 5   LCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQEHAPQ 64
           LC SP +   +   L N L+   K+T  +    +  +    HS+  LQ T+VS+QE  PQ
Sbjct: 6   LCSSPSSLLHDPLPLCNLLSVYPKSTPRSFLSSYNPNSSHFHSRNLLQATHVSVQEAIPQ 65

Query: 65  DTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVSESLD 124
             ++ +   D     P     ++S S VWVNP+SPRAS+LR++SY++RY+SLI+++ESLD
Sbjct: 66  SEKSKLVDVDLPIPEP-----TASKSYVWVNPKSPRASQLRRKSYDSRYSSLIKLAESLD 125

Query: 125 SSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTIFYNV 184
           +  P E DV DV+   G  + E+DA++ LNNM+N +TA L L    + +K S++ I YNV
Sbjct: 126 ACKPNEADVCDVITGFGGKLFEQDAVVTLNNMTNPETAPLVLNNLLETMKPSREVILYNV 185

Query: 185 TLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKMPSFD 244
           T+KVFRK +D+E +EKLF+EM+ RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF 
Sbjct: 186 TMKVFRKSKDLEKSEKLFDEMLERGIKPDNATFTTIISCARQNGVPKRAVEWFEKMSSFG 245

Query: 245 CNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL 304
           C PD+VT + MIDAYGRAGNVDMA SLYDRARTE WRID  TFST+I+I+GV+GNYDGCL
Sbjct: 246 CEPDNVTMAAMIDAYGRAGNVDMALSLYDRARTEKWRIDAVTFSTLIRIYGVSGNYDGCL 305

Query: 305 NVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY 364
           N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAY
Sbjct: 306 NIYEEMKALGVKPNLVIYNRLIDSMGRAKRPWQAKIIYKDLITNGFTPNWSTYAALVRAY 365

Query: 365 GRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGTCSPD 424
           GRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  YV+EA EIFQDMK+  TC PD
Sbjct: 366 GRARYGDDALAIYREMKEKGLSLTVILYNTLLSMCADNRYVDEAFEIFQDMKNCETCDPD 425

Query: 425 SWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN 484
           SWTFSS+IT+Y+C G+VSEAE  L  M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF+
Sbjct: 426 SWTFSSLITVYACSGRVSEAEAALLQMREAGFEPTLFVLTSVIQCYGKAKQVDDVVRTFD 485

Query: 485 QLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQD-KEGN 544
           Q++ELG+TPDDRFCGCLLNV+TQTP  E+GKLI CV +A PKLG VV++L+ EQ+ +EG 
Sbjct: 486 QVLELGITPDDRFCGCLLNVMTQTPSEEIGKLIGCVEKAKPKLGQVVKMLVEEQNCEEGV 545

Query: 545 FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ 604
           F+ EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IY  LQS+S TQ
Sbjct: 546 FKKEASELIDSIGSDVKKAYLNCLIDLCVNLNKLERACEILQLGLEYDIYTGLQSKSATQ 605

Query: 605 WSLYLKGLSLGAALTALHVWIKDLTK-VLESGEELPPLLGINTGHGKHKNSDKGLASVFE 664
           WSL+LK LSLGAALTALHVW+ DL++  LESGEE PPLLGINTGHGKHK SDKGLA+VFE
Sbjct: 606 WSLHLKSLSLGAALTALHVWMNDLSEAALESGEEFPPLLGINTGHGKHKYSDKGLAAVFE 665

Query: 665 SHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           SHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V+A
Sbjct: 666 SHLKELNAPFHEAPDKVGWFLTTSVAAKAWLESRRSAGGVSA 702

BLAST of CSPI01G15330 vs. TAIR10
Match: AT5G46580.1 (AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 469.5 bits (1207), Expect = 3.5e-132
Identity = 268/709 (37.80%), Positives = 407/709 (57.40%), Query Frame = 1

Query: 2   AFQLCYSPPTFFTEHH--FLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQ 61
           A  +C++P    T+ H  FL  SL  Q ++   N S      P     +P    T    +
Sbjct: 8   AIDVCFNPQNSDTKKHSLFLKPSLFRQSRSRKLNISCSSLKQPKTLEEEPITTKTPSLSE 67

Query: 62  EHAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQ-------SYEAR 121
           +  P           +I   P          SVWVNP  P+ S L  Q       SY  +
Sbjct: 68  QLKPLSATTLRQEQTQILSKP---------KSVWVNPTRPKRSVLSLQRQKRSAYSYNPQ 127

Query: 122 YASLIRVSESLDSSNPCEV-DVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQD 181
              L   +  L+SS   E  +   +L  I +     +A+LVLN++   Q       + + 
Sbjct: 128 IKDLRAFALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWVKS 187

Query: 182 MLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPS 241
                 +TIFYNVT+K  R  R  +  E++  EM+  GV+ DN+T+STII+CA+ C+L +
Sbjct: 188 KSLFPMETIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNLYN 247

Query: 242 KAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMI 301
           KA+EWFE+M      PD+VTYS ++D Y ++G V+   SLY+RA    W+ D   FS + 
Sbjct: 248 KAIEWFERMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSVLG 307

Query: 302 KIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFS 361
           K+ G AG+YDG   V +EMK++ +KPN+V+YN LL+AMGRA +P   ++++ EM++ G +
Sbjct: 308 KMFGEAGDYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAGLT 367

Query: 362 PSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEI 421
           P+  T  +L++ YG+AR+  DAL +++EMK K   ++ ILYNTLL MCAD+G   EA  +
Sbjct: 368 PNEKTLTALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGLEEEAERL 427

Query: 422 FQDMKSSGTCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYG 481
           F DMK S  C PD++++++M+ IY  GGK  +A E+  +M++AG   N+   T L+QC G
Sbjct: 428 FNDMKESVQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMGCTCLVQCLG 487

Query: 482 KAKRVDDVVRTFNQLIELGLTPDDRFCGCLLNVITQTPKGE-LGKLIDCVVRANPKLGFV 541
           KAKR+DDVV  F+  I+ G+ PDDR CGCLL+V+      E   K++ C+ RAN KL   
Sbjct: 488 KAKRIDDVVYVFDLSIKRGVKPDDRLCGCLLSVMALCESSEDAEKVMACLERANKKLVTF 547

Query: 542 VELLLGEQDKEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTL 601
           V L++ E+ +    + E   + +    + R+ +CNCLID+C   +  ++A ELL LG   
Sbjct: 548 VNLIVDEKTEYETVKEEFKLVINATQVEARRPFCNCLIDICRGNNRHERAHELLYLGTLF 607

Query: 602 QIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKH 661
            +Y  L +++  +WSL ++ LS+GAA TAL  W++ L  +++  EELP L    TG G H
Sbjct: 608 GLYPGLHNKTIKEWSLDVRSLSVGAAETALEEWMRTLANIIKRQEELPELFLAQTGTGTH 667

Query: 662 KNSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSP 700
           + S +GLA+ F  HL++L+APF ++ ++ G F+ TK    SWLES+  P
Sbjct: 668 RFS-QGLANSFALHLQQLSAPFRQS-DRPGIFVATKEDLVSWLESKFPP 705

BLAST of CSPI01G15330 vs. TAIR10
Match: AT2G31400.1 (AT2G31400.1 genomes uncoupled 1)

HSP 1 Score: 203.4 bits (516), Expect = 4.7e-52
Identity = 148/567 (26.10%), Positives = 272/567 (47.97%), Query Frame = 1

Query: 167 RYFQDMLKSSKQT--IFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCA 226
           ++F +M ++  Q   I +N  L V  +    E A  LF+EM NR ++ D  +++T++   
Sbjct: 325 KFFDEMQRNGVQPDRITFNSLLAVCSRGGLWEAARNLFDEMTNRRIEQDVFSYNTLLDAI 384

Query: 227 RLCSLPSKAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDP 286
                   A E   +MP     P+ V+YST+ID + +AG  D A +L+   R     +D 
Sbjct: 385 CKGGQMDLAFEILAQMPVKRIMPNVVSYSTVIDGFAKAGRFDEALNLFGEMRYLGIALDR 444

Query: 287 ATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKE 346
            +++T++ I+   G  +  L++  EM ++GIK ++V YN LL   G+  +  ++K ++ E
Sbjct: 445 VSYNTLLSIYTKVGRSEEALDILREMASVGIKKDVVTYNALLGGYGKQGKYDEVKKVFTE 504

Query: 347 MIKNGFSPSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGY 406
           M +    P+  TY++L+  Y +    ++A+ +++E K  GL+ +V+LY+ L+      G 
Sbjct: 505 MKREHVLPNLLTYSTLIDGYSKGGLYKEAMEIFREFKSAGLRADVVLYSALIDALCKNGL 564

Query: 407 VNEAVEIFQDMKSSGTCSPDSWTFSSMITI------------YSCGGKVSEAEEMLNDMV 466
           V  AV +  +M   G  SP+  T++S+I              YS GG +  +   L+ + 
Sbjct: 565 VGSAVSLIDEMTKEG-ISPNVVTYNSIIDAFGRSATMDRSADYSNGGSLPFSSSALSALT 624

Query: 467 EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNQLIELGLTPDDRFCGCLLN 526
           E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN
Sbjct: 625 ETEGNRVIQLFGQLTTESNNRTTKDCEEGMQELSCILEVFRKMHQLEIKPNVVTFSAILN 684

Query: 527 VITQTPKGELGKLIDCVVRA--NPKLGFVVELLLGEQDKEGNFRTEASELFSVVS---AD 586
             ++    E   ++   +R   N   G V  LL+G+++   N   +A  LF  V+     
Sbjct: 685 ACSRCNSFEDASMLLEELRLFDNKVYGVVHGLLMGQRE---NVWLQAQSLFDKVNEMDGS 744

Query: 587 VRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYKDLQSRSPTQWSLYLKGLSLGAAL 646
              A+ N L D+  +     +  EL+ L G + Q+++++ S S     L L  +S GAA 
Sbjct: 745 TASAFYNALTDMLWHFG-QKRGAELVALEGRSRQVWENVWSDS----CLDLHLMSSGAAR 804

Query: 647 TALHVWIKDLTKVLESGEELPPLLGINTGHGKHKN--SDKGLASVFESHLKELNAPFHEA 703
             +H W+ ++  ++  G ELP +L I TG GKH     D  L    E  L+ ++APFH +
Sbjct: 805 AMVHAWLLNIRSIVYEGHELPKVLSILTGWGKHSKVVGDGALRRAVEVLLRGMDAPFHLS 864

BLAST of CSPI01G15330 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 173.3 bits (438), Expect = 5.2e-43
Identity = 91/332 (27.41%), Positives = 172/332 (51.81%), Query Frame = 1

Query: 165 ALRYFQDMLKS--SKQTIFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIIS 224
           A + F++M  +  S   + YN  L V+ K    + A K+  EM+  G  P  VT++++IS
Sbjct: 298 AAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLIS 357

Query: 225 CARLCSLPSKAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRI 284
                 +  +A+E   +M      PD  TY+T++  + RAG V+ A S+++  R    + 
Sbjct: 358 AYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKP 417

Query: 285 DPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIY 344
           +  TF+  IK++G  G +   + +++E+   G+ P++V +N LL   G+     ++  ++
Sbjct: 418 NICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVF 477

Query: 345 KEMIKNGFSPSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADV 404
           KEM + GF P   T+ +L+ AY R    E A+ VY+ M + G+  ++  YNT+LA  A  
Sbjct: 478 KEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARG 537

Query: 405 GYVNEAVEIFQDMKSSGTCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFV 464
           G   ++ ++  +M+  G C P+  T+ S++  Y+ G ++     +  ++     +P   +
Sbjct: 538 GMWEQSEKVLAEME-DGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVL 597

Query: 465 LTSLIQCYGKAKRVDDVVRTFNQLIELGLTPD 495
           L +L+    K   + +  R F++L E G +PD
Sbjct: 598 LKTLVLVCSKCDLLPEAERAFSELKERGFSPD 628

BLAST of CSPI01G15330 vs. TAIR10
Match: AT1G74750.1 (AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 166.8 bits (421), Expect = 4.9e-41
Identity = 113/495 (22.83%), Positives = 227/495 (45.86%), Query Frame = 1

Query: 209 GVKPDNVTFSTIISCARLCSLPSKAVEWFEKMPSFDCNPDDVTYSTMIDAYGRAGNVDMA 268
           G K D  T++T++          +  +  ++M    C P+ VTY+ +I +YGRA  +  A
Sbjct: 354 GFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTVTYNRLIHSYGRANYLKEA 413

Query: 269 FSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNCLLDA 328
            +++++ +      D  T+ T+I IH  AG  D  +++Y+ M+  G+ P+   Y+ +++ 
Sbjct: 414 MNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINC 473

Query: 329 MGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALIVYKEMKEKGLQLN 388
           +G+A        ++ EM+  G +P+  T+  ++  + +AR  E AL +Y++M+  G Q +
Sbjct: 474 LGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNYETALKLYRDMQNAGFQPD 533

Query: 389 VILYNTLLAMCADVGYVNEAVEIFQDMKSSGTCSPDSWTFSSMITIYSCGGKVSEAEEML 448
            + Y+ ++ +    G++ EA  +F +M+      PD   +  ++ ++   G V +A +  
Sbjct: 534 KVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWV-PDEPVYGLLVDLWGKAGNVDKAWQWY 593

Query: 449 NDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLIELGLTPDDRFCGCLLNVITQT 508
             M++AG  PN+    SL+  + +  R+ +       ++ LGL P  +    LL+  T  
Sbjct: 594 QAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGLHPSLQTYTLLLSCCTDA 653

Query: 509 ----PKGELGKLIDCVVRANPKLGFVVELLLGEQDKEGNFRTEASELFSVVSADVR---K 568
                 G  G+L+   V  +P   F++++     D +   R   S     + ++ R   +
Sbjct: 654 RSNFDMGFCGQLM--AVSGHPAHMFLLKMPPAGPDGQ-KVRDHVSNFLDFMHSEDRESKR 713

Query: 569 AYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKD-LQSRSPTQWSLYLKGLSLGAALTAL 628
              + ++D      L ++A  + ++     +Y D L+ +S + W + L  +S G A+ AL
Sbjct: 714 GLMDAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIAL 773

Query: 629 HVWIKDLTKVLESGEELPPLLGINTGHGKHK--NSDKGLASVFESHLKELNAPFHEAPEK 688
              +    K +    + P  + I TG G+         +    E  L   N PF      
Sbjct: 774 SRTLAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTGTSMVRQAVEELLNIFNFPFFTENGN 833

Query: 689 VGWFLTTKVAAKSWL 694
            G F+ +    K+WL
Sbjct: 834 SGCFVGSGEPLKNWL 844

BLAST of CSPI01G15330 vs. NCBI nr
Match: gi|449443502|ref|XP_004139516.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus])

HSP 1 Score: 1410.2 bits (3649), Expect = 0.0e+00
Identity = 703/704 (99.86%), Positives = 703/704 (99.86%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE 60
           MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE
Sbjct: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE 60

Query: 61  HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS 120
           HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS
Sbjct: 61  HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS 120

Query: 121 ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI 180
           ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI
Sbjct: 121 ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI 180

Query: 181 FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM 240
           FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM
Sbjct: 181 FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM 240

Query: 241 PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY 300
           PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Sbjct: 241 PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY 300

Query: 301 DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL 360
           DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL
Sbjct: 301 DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL 360

Query: 361 LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT 420
           LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT
Sbjct: 361 LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT 420

Query: 421 CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV 480
           CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV
Sbjct: 421 CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV 480

Query: 481 RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK 540
           RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK
Sbjct: 481 RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK 540

Query: 541 EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS 600
           EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
Sbjct: 541 EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS 600

Query: 601 PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLASV 660
           PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHK SDKGLASV
Sbjct: 601 PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV 660

Query: 661 FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA
Sbjct: 661 FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 704

BLAST of CSPI01G15330 vs. NCBI nr
Match: gi|659128601|ref|XP_008464281.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo])

HSP 1 Score: 1357.8 bits (3513), Expect = 0.0e+00
Identity = 673/704 (95.60%), Positives = 690/704 (98.01%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQE 60
           MAFQLC+SPPTFFT HH LSNSLTPQRKTTLSNSSPLFKL+PIPRHS PFLQITN+SLQE
Sbjct: 1   MAFQLCHSPPTFFTYHHSLSNSLTPQRKTTLSNSSPLFKLNPIPRHSTPFLQITNISLQE 60

Query: 61  HAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVS 120
           H+PQ+T NTIPS DEISKY D+KSGSSS SSVWVNPRSPRASKLRKQSYEARYASL+R+S
Sbjct: 61  HSPQETHNTIPSDDEISKYSDAKSGSSSKSSVWVNPRSPRASKLRKQSYEARYASLVRIS 120

Query: 121 ESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQTI 180
           ESLDS NPCEVDVADVLKVIGNNILE+DA++VLNNMSNSQTALLALRYFQDMLKSSKQTI
Sbjct: 121 ESLDSCNPCEVDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQDMLKSSKQTI 180

Query: 181 FYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM 240
           FYNVTLKVFRKCRDMEGAE+LFEEM+NRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM
Sbjct: 181 FYNVTLKVFRKCRDMEGAEELFEEMLNRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKM 240

Query: 241 PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY 300
           PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Sbjct: 241 PSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY 300

Query: 301 DGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASL 360
           DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASL
Sbjct: 301 DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKSGFSPSWATYASL 360

Query: 361 LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGT 420
           LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMK+SGT
Sbjct: 361 LRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKNSGT 420

Query: 421 CSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV 480
           CSPDSWTFSSMITIYSC GKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV
Sbjct: 421 CSPDSWTFSSMITIYSCSGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVV 480

Query: 481 RTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDK 540
           RTFNQLIELGLTPDDRFCGCLLNVITQTPK E+ KLIDCVVRANPKLGFVVELLLGEQDK
Sbjct: 481 RTFNQLIELGLTPDDRFCGCLLNVITQTPKEEISKLIDCVVRANPKLGFVVELLLGEQDK 540

Query: 541 EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS 600
           EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIYKDLQSRS
Sbjct: 541 EGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLNLGLTLQIYKDLQSRS 600

Query: 601 PTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLASV 660
           PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHK SDKGLASV
Sbjct: 601 PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV 660

Query: 661 FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA
Sbjct: 661 FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 704

BLAST of CSPI01G15330 vs. NCBI nr
Match: gi|359495626|ref|XP_002269600.2| (PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Vitis vinifera])

HSP 1 Score: 1050.8 bits (2716), Expect = 1.0e-303
Identity = 513/704 (72.87%), Positives = 601/704 (85.37%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTL-SNSSPLFKLSPIPRHSKPFLQITNVSLQ 60
           MA+ LC SP +   +HH+L NSL+  RK+ L S +S  FK + +  HS+ FLQIT+VSL+
Sbjct: 1   MAYHLCSSPSSLCHDHHYLHNSLSFSRKSRLRSFNSFSFKPNSLSLHSRTFLQITHVSLE 60

Query: 61  EHAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRV 120
           +  PQ+TQ    S    S+ PD K+     S +WVNPRSPRASKLR+ SY+ARYASL+++
Sbjct: 61  DPIPQETQKADASNPPNSQDPDRKT----KSYIWVNPRSPRASKLRQHSYDARYASLVKI 120

Query: 121 SESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQT 180
           +ESLDS    E DV+ VL+ +G+ ILE+DA++VLNNM+N +TALLA  +F+  LK S++ 
Sbjct: 121 AESLDSCEATEEDVSQVLRCLGDKILEQDAVIVLNNMTNPETALLAFGFFRKRLKPSREV 180

Query: 181 IFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEK 240
           I YNVTLKVFRKCR+++ AEKLF+EM+ RGVKPDN+TFSTIISCAR+ SLP+KAVEWFEK
Sbjct: 181 ILYNVTLKVFRKCRNLDRAEKLFDEMLERGVKPDNITFSTIISCARVSSLPNKAVEWFEK 240

Query: 241 MPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN 300
           MP F C+PDDVTYS MIDAYGRAGNVDMA  LYDRARTE WRIDP TFST+I+I+G++GN
Sbjct: 241 MPEFGCHPDDVTYSAMIDAYGRAGNVDMALKLYDRARTEKWRIDPVTFSTLIRIYGMSGN 300

Query: 301 YDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYAS 360
           +DGCLNVYEEMKA+G+KPNLVIYN LLDAMGRAKRPWQ K IYKEM  NG  PSW TYA+
Sbjct: 301 FDGCLNVYEEMKALGVKPNLVIYNTLLDAMGRAKRPWQAKNIYKEMTNNGLQPSWGTYAA 360

Query: 361 LLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSG 420
           LLRAYGRARY EDALIVYKEMKEKGL+L+V+LYNTLLAMCADVGY  EA  IF+DMKSSG
Sbjct: 361 LLRAYGRARYAEDALIVYKEMKEKGLELSVVLYNTLLAMCADVGYTEEAAAIFEDMKSSG 420

Query: 421 TCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDV 480
            C PDSWTFSS+ITIYSC GKVSEAE MLN M+EAGF+PNIFVLTSLIQCYGKA R D+V
Sbjct: 421 NCMPDSWTFSSLITIYSCSGKVSEAEAMLNAMLEAGFEPNIFVLTSLIQCYGKANRTDEV 480

Query: 481 VRTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQD 540
           VRTF++L+EL +TPDDRFCGC+LNV+TQ+PK ELGKLIDC+ +ANPKLG VV+LLL EQ+
Sbjct: 481 VRTFDRLLELDITPDDRFCGCMLNVMTQSPKEELGKLIDCIDKANPKLGNVVKLLLEEQN 540

Query: 541 KEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR 600
            EG FR EASELF  +SADV+KAYCNCLIDLCVNL+LL+KACEL DLGLTL+IY D+QS+
Sbjct: 541 GEGTFRKEASELFDSISADVKKAYCNCLIDLCVNLNLLEKACELFDLGLTLEIYIDIQSK 600

Query: 601 SPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLAS 660
           SPTQWSL+LK LSLGAALTALH+W+ DL+K +E GEELP +LGINTGHGKHK SDKGLAS
Sbjct: 601 SPTQWSLHLKSLSLGAALTALHIWMNDLSKAVEVGEELPAVLGINTGHGKHKYSDKGLAS 660

Query: 661 VFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVA 704
           VFESHLKELNAPFHEAP+KVGWFLTTKVAA SWLESRS+PELVA
Sbjct: 661 VFESHLKELNAPFHEAPDKVGWFLTTKVAATSWLESRSAPELVA 700

BLAST of CSPI01G15330 vs. NCBI nr
Match: gi|590627062|ref|XP_007026347.1| (Pentatricopeptide (PPR) repeat-containing protein [Theobroma cacao])

HSP 1 Score: 1049.3 bits (2712), Expect = 3.1e-303
Identity = 512/701 (73.04%), Positives = 612/701 (87.30%), Query Frame = 1

Query: 5   LCYSPPTFFTEHHFLSNSLTPQRKTTLSNSSPLFKLSPIPRHSKPFLQITNVSLQEHAPQ 64
           LC SP + F + H LS S  P+      +++P  +L      SK  +QI++VSLQ+   Q
Sbjct: 6   LCSSPSSVFHDRHTLSASPKPR---PARSTAPSLRLVSCSFQSKSSIQISHVSLQDPITQ 65

Query: 65  DTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRVSESLD 124
            T+NT   ++  S+ PD K+GSSS S VWVNPRSPRAS+LR+ SY++RY+SL++V+E+LD
Sbjct: 66  -TKNTPKHSN--SQSPDGKTGSSSKSYVWVNPRSPRASRLRQLSYDSRYSSLVKVAETLD 125

Query: 125 SSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLK-SSKQTIFYN 184
           S NP E DV  VL  +GN++LE+DA++VLNNMSN  TALLAL +FQ +LK +S++ I YN
Sbjct: 126 SCNPNEHDVLSVLSRLGNDVLEQDAVVVLNNMSNPHTALLALNHFQRILKKTSREVILYN 185

Query: 185 VTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEKMPSF 244
           VT+KVFRK +D++GAEKLF+EM+ +GVKPDNVTFST+ISCAR+C+LP KAVEWFEKMP +
Sbjct: 186 VTMKVFRKSKDLDGAEKLFDEMLQKGVKPDNVTFSTLISCARVCALPDKAVEWFEKMPIY 245

Query: 245 DCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGC 304
            C+PDDVTYS MIDAYGRAGNVDMAF+LYDRARTE WRIDP TFST+IKI+G++GNYDGC
Sbjct: 246 GCDPDDVTYSAMIDAYGRAGNVDMAFNLYDRARTEKWRIDPVTFSTLIKIYGISGNYDGC 305

Query: 305 LNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRA 364
           LNVYEEMKA+G KPN+VIYN LLDAMGRAKRPWQ KTIYKEM  NGFSP+WATYA+LLRA
Sbjct: 306 LNVYEEMKALGAKPNVVIYNTLLDAMGRAKRPWQAKTIYKEMTNNGFSPNWATYAALLRA 365

Query: 365 YGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSGTCSP 424
           YGRARYGEDAL +YKEMK+KGL+L VILYNTLLAMCADVGY +EAVEIF+DMK+SGTC P
Sbjct: 366 YGRARYGEDALNIYKEMKDKGLELTVILYNTLLAMCADVGYADEAVEIFEDMKNSGTCKP 425

Query: 425 DSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF 484
           DSWT+SS+ITIYSC GKVSEAE ++++M+EAGF+PNIFVLTSLIQCYGKA+  DDVVRTF
Sbjct: 426 DSWTYSSLITIYSCSGKVSEAEGIVDEMLEAGFEPNIFVLTSLIQCYGKAQHTDDVVRTF 485

Query: 485 NQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQDKEGN 544
           N+++ELG+TPDDRFCGCLLNV+TQTP+ EL KL DC+ +ANPKLG VV+LL+ EQD +GN
Sbjct: 486 NRVLELGITPDDRFCGCLLNVMTQTPREELAKLTDCIKKANPKLGHVVKLLVEEQDGQGN 545

Query: 545 FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ 604
           F+ EASELF+ + +DV+KAYCNCLIDLCVNLDLL++ACELL+LGL+L+IY D+QSRSPTQ
Sbjct: 546 FKNEASELFNCIGSDVKKAYCNCLIDLCVNLDLLERACELLELGLSLEIYADVQSRSPTQ 605

Query: 605 WSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLASVFES 664
           WSL LK LSLGAALT+LHVWI DLTKVLESGEELPPLLGINTGHGKHK SDKGLA+VFES
Sbjct: 606 WSLNLKSLSLGAALTSLHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLATVFES 665

Query: 665 HLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA 705
           HLKEL+APFHEAP+KVGWFLTT+VAAKSWLESRSSP+LVAA
Sbjct: 666 HLKELDAPFHEAPDKVGWFLTTQVAAKSWLESRSSPDLVAA 700

BLAST of CSPI01G15330 vs. NCBI nr
Match: gi|147841962|emb|CAN63129.1| (hypothetical protein VITISV_001456 [Vitis vinifera])

HSP 1 Score: 1043.5 bits (2697), Expect = 1.7e-301
Identity = 511/704 (72.59%), Positives = 598/704 (84.94%), Query Frame = 1

Query: 1   MAFQLCYSPPTFFTEHHFLSNSLTPQRKTTL-SNSSPLFKLSPIPRHSKPFLQITNVSLQ 60
           MA+ LC SP +   +HH+L NSL+  RK+ L S +S  FK + +  HS+ FLQIT+VSL+
Sbjct: 1   MAYHLCSSPSSLCHDHHYLHNSLSFSRKSRLRSFNSFSFKPNSLSLHSRTFLQITHVSLE 60

Query: 61  EHAPQDTQNTIPSADEISKYPDSKSGSSSNSSVWVNPRSPRASKLRKQSYEARYASLIRV 120
           +  PQ+TQ    S    S+ PD K+     S +WVNPRSPRASKLR+ SY+ARYASL+++
Sbjct: 61  DPIPQETQKADASNPPNSQDPDRKT----KSYIWVNPRSPRASKLRQHSYDARYASLVKI 120

Query: 121 SESLDSSNPCEVDVADVLKVIGNNILERDAILVLNNMSNSQTALLALRYFQDMLKSSKQT 180
           +ESLDS    E DV+ VL+ +G+ ILE+DA++VLNNM+N +TALLA  +F+  LK S++ 
Sbjct: 121 AESLDSCEATEEDVSQVLRCLGDKILEQDAVIVLNNMTNPETALLAFGFFRKRLKPSREV 180

Query: 181 IFYNVTLKVFRKCRDMEGAEKLFEEMINRGVKPDNVTFSTIISCARLCSLPSKAVEWFEK 240
           I YNVTLKVFRKCR+++ AEKLF+EM+ RGVKPDN+TFSTIISCAR+ SLP+KAVEWFEK
Sbjct: 181 ILYNVTLKVFRKCRNLDXAEKLFDEMLERGVKPDNITFSTIISCARVSSLPNKAVEWFEK 240

Query: 241 MPSFDCNPDDVTYSTMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN 300
           MP F C+PDDVTYS MIDAYGRAGNVDMA  LYDRARTE WRIDP TFST+I+I+G++GN
Sbjct: 241 MPEFGCHPDDVTYSAMIDAYGRAGNVDMALKLYDRARTEKWRIDPVTFSTLIRIYGMSGN 300

Query: 301 YDGCLNVYEEMKAIGIKPNLVIYNCLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYAS 360
           +DGCLNVYEEMKA+G+KPNLVIYN LLDAMGRAKRPWQ K IYKEM  NG   SW TYA+
Sbjct: 301 FDGCLNVYEEMKALGVKPNLVIYNTLLDAMGRAKRPWQAKNIYKEMTNNGLQLSWGTYAA 360

Query: 361 LLRAYGRARYGEDALIVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFQDMKSSG 420
           LLRAYGRARY EDALIVYKEMKEKGL+L+V+LYNTLLAMCADVGY  EA  IF+DMKSSG
Sbjct: 361 LLRAYGRARYAEDALIVYKEMKEKGLELSVVLYNTLLAMCADVGYTEEAAAIFEDMKSSG 420

Query: 421 TCSPDSWTFSSMITIYSCGGKVSEAEEMLNDMVEAGFDPNIFVLTSLIQCYGKAKRVDDV 480
            C PDSWTFSS+ITIYSC GKVSEAE MLN M+EAGF+PNIFVLTSLIQCYGKA R D+V
Sbjct: 421 NCMPDSWTFSSLITIYSCSGKVSEAEAMLNAMLEAGFEPNIFVLTSLIQCYGKANRTDEV 480

Query: 481 VRTFNQLIELGLTPDDRFCGCLLNVITQTPKGELGKLIDCVVRANPKLGFVVELLLGEQD 540
           VRTF++L+EL +TPDDRFCGC+LNV+TQ+PK ELGKLIDC+ +ANPKLG VV+LLL EQ+
Sbjct: 481 VRTFDRLLELDITPDDRFCGCMLNVMTQSPKEELGKLIDCIDKANPKLGNVVKLLLEEQN 540

Query: 541 KEGNFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR 600
            EG FR EASELF  +SADV KAYCNCLIDLCVNL+LL+KACEL DLGLTL+IY D+QS+
Sbjct: 541 GEGTFRKEASELFDSISADVXKAYCNCLIDLCVNLNLLEKACELFDLGLTLEIYIDIQSK 600

Query: 601 SPTQWSLYLKGLSLGAALTALHVWIKDLTKVLESGEELPPLLGINTGHGKHKNSDKGLAS 660
           SPTQWSL+LK LSLGAALTALH+W+ DL+K +E GEELP +LGINTGHGKHK SDKGLAS
Sbjct: 601 SPTQWSLHLKSLSLGAALTALHIWMNDLSKAVEVGEELPAVLGINTGHGKHKYSDKGLAS 660

Query: 661 VFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVA 704
           VFESHLKELNAPFHEAP+KV WFLTTKVAA SWLESRS+PELVA
Sbjct: 661 VFESHLKELNAPFHEAPDKVXWFLTTKVAATSWLESRSAPELVA 700

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP314_ARATH  1.5e-265  64.39     Pentatricopeptide repeat-containing protein At4g16390, chloroplastic OS=Arabidop...
PP420_ARATH  6.3e-131  37.80     Pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Arabidop...
PP178_ARATH  8.4e-51   26.10     Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop...
PP362_ARATH  9.3e-42   27.41     Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN...
PP123_ARATH  8.7e-40   22.83     Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0LVP1_CUCSA  0.0e+00   99.86     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G173140 PE=4 SV=1
F6HCW3_VITVI      7.3e-304  72.87     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0194g00270 PE=4 SV=...
A0A061GPA6_THECC  2.1e-303  73.04     Pentatricopeptide (PPR) repeat-containing protein OS=Theobroma cacao GN=TCM_0304...
A5B4A6_VITVI      1.2e-301  72.59     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_001456 PE=4 SV=1
M5WLZ8_PRUPE      1.6e-295  72.14     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002169mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT4G16390.1  8.3e-267  64.39     pentatricopeptide (PPR) repeat-containing protein
AT5G46580.1  3.5e-132  37.80     pentatricopeptide (PPR) repeat-containing protein
AT2G31400.1  4.7e-52   26.10     genomes uncoupled 1
AT5G02860.1  5.2e-43   27.41     Pentatricopeptide repeat (PPR) superfamily protein
AT1G74750.1  4.9e-41   22.83     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|449443502|ref|XP_004139516.1|  0.0e+00   99.86     PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic ...
gi|659128601|ref|XP_008464281.1|  0.0e+00   95.60     PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic ...
gi|359495626|ref|XP_002269600.2|  1.0e-303  72.87     PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic ...
gi|590627062|ref|XP_007026347.1|  3.1e-303  73.04     Pentatricopeptide (PPR) repeat-containing protein [Theobroma cacao]
gi|147841962|emb|CAN63129.1|      1.7e-301  72.59     hypothetical protein VITISV_001456 [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002625  Smr_dom
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
biological_process GO:0031425 chloroplast RNA processing
biological_process GO:0045727 positive regulation of translation
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G15330.1  CSPI01G15330.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002625  Smr domain  SMART  SM00463  SMR_2
    coord: 603..693  score: 3.6
IPR002625  Smr domain  PROFILE  PS50828  SMR
    coord: 606..690  score: 16
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 179..223  score: 2.8E-11
    coord: 247..292  score: 8.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 308..365  score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 250..283  score: 6.9E-8
    coord: 356..388  score: 7.2E-6
    coord: 463..494  score: 5.8E-6
    coord: 320..354  score: 1.4E-5
    coord: 427..459  score: 1.0E-6
    coord: 182..213  score: 1.7E-5
    coord: 286..318  score: 4.2E-5
    coord: 215..248  score: 0.0014
    coord: 391..424  score: 6.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 353..387  score: 10.49
    coord: 459..493  score: 10.424
    coord: 178..212  score: 10.786
    coord: 424..458  score: 11.422
    coord: 283..317  score: 10.961
    coord: 388..422  score: 10.687
    coord: 318..352  score: 10.764
    coord: 248..282  score: 11.016
    coord: 213..247  score: 8
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 266..489  score: 1.6E-9
    coord: 165..265  score: 4.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 32..499  score: 5.0E-203
    coord: 566..585  score: 5.0E
None  No IPR available  PANTHER  PTHR24015:SF437  SUBFAMILY NOT NAMED
    coord: 566..585  score: 5.0E-203
    coord: 32..499  score: 5.0E
None  No IPR available  unknown  SSF81901  HCP-like
    coord: 293..492  score: 1.3
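
Each "coord: a..b" entry in the InterPro annotations gives the first and last residue of a hit on the protein, so a hit spans b - a + 1 residues. The sketch below is an illustration, not part of the original annotation pipeline, and the helper names hit_ranges and hit_lengths are hypothetical. It extracts the coordinate pairs from the PS51375 (PPR profile) entries listed above and reports the length of each hit.

```python
import re

# Matches entries of the form "coord: 353..387" wherever they occur in the text.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def hit_ranges(text):
    """Return sorted (start, end) pairs for every 'coord: a..b' in text."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


def hit_lengths(ranges):
    """Length of each hit in residues; both end coordinates are inclusive."""
    return [end - start + 1 for start, end in ranges]


# Coordinates of the PS51375 (PPR profile) hits, copied from the annotation.
PPR_PROFILE_HITS = (
    "coord: 353..387 coord: 459..493 coord: 178..212 "
    "coord: 424..458 coord: 283..317 coord: 388..422 "
    "coord: 318..352 coord: 248..282 coord: 213..247"
)

if __name__ == "__main__":
    ranges = hit_ranges(PPR_PROFILE_HITS)
    for (start, end), length in zip(ranges, hit_lengths(ranges)):
        print(start, end, length)
```

Each of the nine PS51375 hits spans 35 residues, and sorted by start position they run from residue 178 to residue 493.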

The following gene(s) are paralogous to this gene:

None