CSPI05G20220 (gene) Wild cucumber (PI 183967)

Name: CSPI05G20220
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein
Location: Chr5 : 21373026 .. 21375389 (+)
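The span of the feature can be worked out from the location string. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc):
    """Split a 'Chr : start .. end (strand)' string into its parts.

    Hypothetical helper; the format is inferred from this page only.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr5 : 21373026 .. 21375389 (+)")

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
length = end - start + 1
print(chrom, strand, length)  # Chr5 + 2364
```

Under that assumption the feature spans 2364 bp, a multiple of 3.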
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAATCTCAGCATTAGGAACTGGTTTAGTTTCTTTAACTAATAGAGTCTTTAAATTCCATCCATCTTTTGAACGCTTCTTATCTTATTCTTGTAATATCTCAATTGGTAGAGACCCCAAAACCATTGCCACTGCTCTCTCTCTATCTGAAAATACAAAATCATTGATTTTAGGTGCTCAAGTACATGGTCATATGTGTAAGTTGGGGTTCGATTATGATACTTTCTCCATGAATAATCTGCTTAAGATGTACTGTAGATGTGGGTTTATGTGTGAAGGCTTTAAGGTGTTTGAAGAAATGCCTCAGAGAAATGTAGTGTCTTGGAGTTTGATCATTTCAAGTTTATCTGAGAATGGTGAGTTTGAATTGTGCTTGGAGAGTTTTTTGGAGATGATGAGGGATGGGTTGATGCCTACTGAGTTTGCTTTTGGTAGTGTTATGAAGGCGTGTGCGGATGTTGAAGCCTATGGATTTGGTTCGGGTGTTCATTGTCTTTCTTGGAAAATTGGGATGGAGCAGAATGTCTTTGTTGGTGGTTCAACTTTGAGCATGTATGCAAGGCTTGGGGATATTACTTCAGCTGAGTTGGTTTTTGAATGGATGGAGAAAGTAGATGTTGGCTGTTGGAATGCCATGATTGGAGGCTATACTAACTGCGGTCTTAGCTTGGAAGCCCTGAGTGCTGTATCTTTGTTAAACAGCGAGGGTATAAAGATGGACAAGTTCACCATTGTTAGTGCTATCAAAGCATGCTCGTTAATTCAGGATTTAGATTCTGGAAAAGAGCTTCATGGGTTCATCCTTCGGCGAGGATTAATATCCACTGCAGCAATGAATGCTCTCATGGATATGTACTTAATAAGTGACAGGAAGAACTCTGTTCTAAAAATCTTTAACAGTATGCAAACCAGAGACATTATATCATGGAACACAGTATTTGGAGGCTCCTCCAATGAAAAAGAAATCGTGGACTTGTTTGGCAAGTTCGTGATAGAAGGCATGAAGCCTAACCATATCACGTTCTCAGTGCTATTTCGGCAATGTGGAGTACTACTTGATTCCAGACTTGGGTTTCAGTTCTTTTCTCTTGCAGTACATTTAGGTTGTCTTGATGAAACTAGGGTGTTGAGCTCAATTATTAGTATGTTTTCTCAATTTGGGTTAATGGAGATGGTACACTCAGTATTTGACTCTCTAGTTTTCAAACCTGTATCTGCTTGGAATCAGTTTATTTTGGCATATAGTTTGAATTCTTTTGAAATGGAAGCCTTCAGAACCTTTTCCAGTTTATTGAGATATGGTGTTGTAGCAAATGAGTATACTTTTTCCATCATTATAGAGACTGCCTGCAAATTTGAGAACCCATGGATGTGCAGACAACTTCATTGTGCTTCATTGAAGGCTGGTTTTGGTTCTCACAAGTATGTGTCCTGTTCATTGATAAAATGCTATATCTTAATAGGATCTCTTGAAAGTTCCTTTGAGATCTTTAATCAACTTGAGATTGTAGACATGGCGACCTACGGAGCTGTGATATCTACCTTGGTTCACCAAAATCACATGTATGAAGCCATTATGTTTCTGAATATTCTAATGGAATCTGGCAAGAAGCCAGACGAATTTACCTTCGGCAGCATATTGAATGGCTGCTCTAGCAGGGCGGCTTATCACCAAACAAAAGCAATCCATTCACTTGTAGAAAAGATGGGATTTGGCTTCCATGTGCATGTTGCTAGTGCAATTATAGATGCATATGCAAAATGTGGCGATATAGGAAGTGCACAAGGAGCATTCGAACAGTCATGTCAGTCCAATGACGTTATTGTATATAATTCTATGATGATGGCATATGCTCATCATGGTCTTGCTTGGGAAGCGATCCAAACTTTTGAGAAAATGAGGATAGCTAAAGTACAGCCTAGTCAAGCCTCATTTGTCTCAGTTATATCAGCCTGTCGTCACATGGGTCTTGTAGAACAAGGTCGTTCTCTGTTTCA
AACAATGAAGTCGGATTATAATATGACACCATCTCGTGACAATTACGGTTGCTTAGTCGATATGCTGTCAAGGAATGGATTCCTTTATGATGCTCGATATATAATTGAGTCAATGCCATTTTCACCTTGGCCTGCCATATTGAGATCTTTGCTCAGTGGATGTAGGATCTATGGAAATAGAGAATTGGGGCAATGGACTGCTGAAAAATTACTTTCACTGGCTCCACAAAATCTTGCAACCCATGTATTATTATCAAAGGTTTATTCTGAAGGGAATAGTTGGGAAGATGCTGCAAATATAAGAAAGGAGATGACGGATGGAGGGGTTCTGAAAGACCCAGGATATAGCAGGGTTGAGATATAA

mRNA sequence

ATGAAAATCTCAGCATTAGGAACTGGTTTAGTTTCTTTAACTAATAGAGTCTTTAAATTCCATCCATCTTTTGAACGCTTCTTATCTTATTCTTGTAATATCTCAATTGGTAGAGACCCCAAAACCATTGCCACTGCTCTCTCTCTATCTGAAAATACAAAATCATTGATTTTAGGTGCTCAAGTACATGGTCATATGTGTAAGTTGGGGTTCGATTATGATACTTTCTCCATGAATAATCTGCTTAAGATGTACTGTAGATGTGGGTTTATGTGTGAAGGCTTTAAGGTGTTTGAAGAAATGCCTCAGAGAAATGTAGTGTCTTGGAGTTTGATCATTTCAAGTTTATCTGAGAATGGTGAGTTTGAATTGTGCTTGGAGAGTTTTTTGGAGATGATGAGGGATGGGTTGATGCCTACTGAGTTTGCTTTTGGTAGTGTTATGAAGGCGTGTGCGGATGTTGAAGCCTATGGATTTGGTTCGGGTGTTCATTGTCTTTCTTGGAAAATTGGGATGGAGCAGAATGTCTTTGTTGGTGGTTCAACTTTGAGCATGTATGCAAGGCTTGGGGATATTACTTCAGCTGAGTTGGTTTTTGAATGGATGGAGAAAGTAGATGTTGGCTGTTGGAATGCCATGATTGGAGGCTATACTAACTGCGGTCTTAGCTTGGAAGCCCTGAGTGCTGTATCTTTGTTAAACAGCGAGGGTATAAAGATGGACAAGTTCACCATTGTTAGTGCTATCAAAGCATGCTCGTTAATTCAGGATTTAGATTCTGGAAAAGAGCTTCATGGGTTCATCCTTCGGCGAGGATTAATATCCACTGCAGCAATGAATGCTCTCATGGATATGTACTTAATAAGTGACAGGAAGAACTCTGTTCTAAAAATCTTTAACAGTATGCAAACCAGAGACATTATATCATGGAACACAGTATTTGGAGGCTCCTCCAATGAAAAAGAAATCGTGGACTTGTTTGGCAAGTTCGTGATAGAAGGCATGAAGCCTAACCATATCACGTTCTCAGTGCTATTTCGGCAATGTGGAGTACTACTTGATTCCAGACTTGGGTTTCAGTTCTTTTCTCTTGCAGTACATTTAGGTTGTCTTGATGAAACTAGGGTGTTGAGCTCAATTATTAGTATGTTTTCTCAATTTGGGTTAATGGAGATGGTACACTCAGTATTTGACTCTCTAGTTTTCAAACCTGTATCTGCTTGGAATCAGTTTATTTTGGCATATAGTTTGAATTCTTTTGAAATGGAAGCCTTCAGAACCTTTTCCAGTTTATTGAGATATGGTGTTGTAGCAAATGAGTATACTTTTTCCATCATTATAGAGACTGCCTGCAAATTTGAGAACCCATGGATGTGCAGACAACTTCATTGTGCTTCATTGAAGGCTGGTTTTGGTTCTCACAAGTATGTGTCCTGTTCATTGATAAAATGCTATATCTTAATAGGATCTCTTGAAAGTTCCTTTGAGATCTTTAATCAACTTGAGATTGTAGACATGGCGACCTACGGAGCTGTGATATCTACCTTGGTTCACCAAAATCACATGTATGAAGCCATTATGTTTCTGAATATTCTAATGGAATCTGGCAAGAAGCCAGACGAATTTACCTTCGGCAGCATATTGAATGGCTGCTCTAGCAGGGCGGCTTATCACCAAACAAAAGCAATCCATTCACTTGTAGAAAAGATGGGATTTGGCTTCCATGTGCATGTTGCTAGTGCAATTATAGATGCATATGCAAAATGTGGCGATATAGGAAGTGCACAAGGAGCATTCGAACAGTCATGTCAGTCCAATGACGTTATTGTATATAATTCTATGATGATGGCATATGCTCATCATGGTCTTGCTTGGGAAGCGATCCAAACTTTTGAGAAAATGAGGATAGCTAAAGTACAGCCTAGTCAAGCCTCATTTGTCTCAGTTATATCAGCCTGTCGTCACATGGGTCTTGTAGAACAAGGTCGTTCTCTGTTTCA
AACAATGAAGTCGGATTATAATATGACACCATCTCGTGACAATTACGGTTGCTTAGTCGATATGCTGTCAAGGAATGGATTCCTTTATGATGCTCGATATATAATTGAGTCAATGCCATTTTCACCTTGGCCTGCCATATTGAGATCTTTGCTCAGTGGATGTAGGATCTATGGAAATAGAGAATTGGGGCAATGGACTGCTGAAAAATTACTTTCACTGGCTCCACAAAATCTTGCAACCCATGTATTATTATCAAAGGTTTATTCTGAAGGGAATAGTTGGGAAGATGCTGCAAATATAAGAAAGGAGATGACGGATGGAGGGGTTCTGAAAGACCCAGGATATAGCAGGGTTGAGATATAA

Coding sequence (CDS)

ATGAAAATCTCAGCATTAGGAACTGGTTTAGTTTCTTTAACTAATAGAGTCTTTAAATTCCATCCATCTTTTGAACGCTTCTTATCTTATTCTTGTAATATCTCAATTGGTAGAGACCCCAAAACCATTGCCACTGCTCTCTCTCTATCTGAAAATACAAAATCATTGATTTTAGGTGCTCAAGTACATGGTCATATGTGTAAGTTGGGGTTCGATTATGATACTTTCTCCATGAATAATCTGCTTAAGATGTACTGTAGATGTGGGTTTATGTGTGAAGGCTTTAAGGTGTTTGAAGAAATGCCTCAGAGAAATGTAGTGTCTTGGAGTTTGATCATTTCAAGTTTATCTGAGAATGGTGAGTTTGAATTGTGCTTGGAGAGTTTTTTGGAGATGATGAGGGATGGGTTGATGCCTACTGAGTTTGCTTTTGGTAGTGTTATGAAGGCGTGTGCGGATGTTGAAGCCTATGGATTTGGTTCGGGTGTTCATTGTCTTTCTTGGAAAATTGGGATGGAGCAGAATGTCTTTGTTGGTGGTTCAACTTTGAGCATGTATGCAAGGCTTGGGGATATTACTTCAGCTGAGTTGGTTTTTGAATGGATGGAGAAAGTAGATGTTGGCTGTTGGAATGCCATGATTGGAGGCTATACTAACTGCGGTCTTAGCTTGGAAGCCCTGAGTGCTGTATCTTTGTTAAACAGCGAGGGTATAAAGATGGACAAGTTCACCATTGTTAGTGCTATCAAAGCATGCTCGTTAATTCAGGATTTAGATTCTGGAAAAGAGCTTCATGGGTTCATCCTTCGGCGAGGATTAATATCCACTGCAGCAATGAATGCTCTCATGGATATGTACTTAATAAGTGACAGGAAGAACTCTGTTCTAAAAATCTTTAACAGTATGCAAACCAGAGACATTATATCATGGAACACAGTATTTGGAGGCTCCTCCAATGAAAAAGAAATCGTGGACTTGTTTGGCAAGTTCGTGATAGAAGGCATGAAGCCTAACCATATCACGTTCTCAGTGCTATTTCGGCAATGTGGAGTACTACTTGATTCCAGACTTGGGTTTCAGTTCTTTTCTCTTGCAGTACATTTAGGTTGTCTTGATGAAACTAGGGTGTTGAGCTCAATTATTAGTATGTTTTCTCAATTTGGGTTAATGGAGATGGTACACTCAGTATTTGACTCTCTAGTTTTCAAACCTGTATCTGCTTGGAATCAGTTTATTTTGGCATATAGTTTGAATTCTTTTGAAATGGAAGCCTTCAGAACCTTTTCCAGTTTATTGAGATATGGTGTTGTAGCAAATGAGTATACTTTTTCCATCATTATAGAGACTGCCTGCAAATTTGAGAACCCATGGATGTGCAGACAACTTCATTGTGCTTCATTGAAGGCTGGTTTTGGTTCTCACAAGTATGTGTCCTGTTCATTGATAAAATGCTATATCTTAATAGGATCTCTTGAAAGTTCCTTTGAGATCTTTAATCAACTTGAGATTGTAGACATGGCGACCTACGGAGCTGTGATATCTACCTTGGTTCACCAAAATCACATGTATGAAGCCATTATGTTTCTGAATATTCTAATGGAATCTGGCAAGAAGCCAGACGAATTTACCTTCGGCAGCATATTGAATGGCTGCTCTAGCAGGGCGGCTTATCACCAAACAAAAGCAATCCATTCACTTGTAGAAAAGATGGGATTTGGCTTCCATGTGCATGTTGCTAGTGCAATTATAGATGCATATGCAAAATGTGGCGATATAGGAAGTGCACAAGGAGCATTCGAACAGTCATGTCAGTCCAATGACGTTATTGTATATAATTCTATGATGATGGCATATGCTCATCATGGTCTTGCTTGGGAAGCGATCCAAACTTTTGAGAAAATGAGGATAGCTAAAGTACAGCCTAGTCAAGCCTCATTTGTCTCAGTTATATCAGCCTGTCGTCACATGGGTCTTGTAGAACAAGGTCGTTCTCTGTTTCA
AACAATGAAGTCGGATTATAATATGACACCATCTCGTGACAATTACGGTTGCTTAGTCGATATGCTGTCAAGGAATGGATTCCTTTATGATGCTCGATATATAATTGAGTCAATGCCATTTTCACCTTGGCCTGCCATATTGAGATCTTTGCTCAGTGGATGTAGGATCTATGGAAATAGAGAATTGGGGCAATGGACTGCTGAAAAATTACTTTCACTGGCTCCACAAAATCTTGCAACCCATGTATTATTATCAAAGGTTTATTCTGAAGGGAATAGTTGGGAAGATGCTGCAAATATAAGAAAGGAGATGACGGATGGAGGGGTTCTGAAAGACCCAGGATATAGCAGGGTTGAGATATAA
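As a quick sanity check, the start of the coding sequence above can be translated and compared with the first residues of the query protein shown in the BLAST alignments. This is a minimal sketch that assumes the standard genetic code; the function name `translate` is illustrative and not part of any tool used to build this page.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 60 nt of the CDS listed above.
cds_start = "ATGAAAATCTCAGCATTAGGAACTGGTTTAGTTTCTTTAACTAATAGAGTCTTTAAATTC"
print(translate(cds_start))  # MKISALGTGLVSLTNRVFKF
```

The result matches the first 20 residues of the query in the TrEMBL alignment (MKISALGTGLVSLTNRVFKF).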
BLAST of CSPI05G20220 vs. Swiss-Prot
Match: PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 387.1 bits (993), Expect = 4.6e-106
Identity = 219/731 (29.96%), Positives = 378/731 (51.71%), Query Frame = 1

Query: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENG 120
           Q+H  +   G    T   N L+ +Y R GF+    +VF+ +  ++  SW  +IS LS+N 
Sbjct: 208 QIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMISGLSKNE 267

Query: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180
                +  F +M   G+MPT +AF SV+ AC  +E+   G  +H L  K+G   + +V  
Sbjct: 268 CEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSSDTYVCN 327

Query: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240
           + +S+Y  LG++ SAE +F  M + D   +N +I G + CG   +A+     ++ +G++ 
Sbjct: 328 ALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEP 387

Query: 241 DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMN-ALMDMYLISDRKNSVLKIF 300
           D  T+ S + ACS    L  G++LH +  + G  S   +  AL+++Y       + L  F
Sbjct: 388 DSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYF 447

Query: 301 NSMQTRDIISWNTV---FGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSR 360
              +  +++ WN +   +G   + +    +F +  IE + PN  T+  + + C  L D  
Sbjct: 448 LETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLE 507

Query: 361 LGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYS 420
           LG Q  S  +         V S +I M+++ G ++    +      K V +W   I  Y+
Sbjct: 508 LGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYT 567

Query: 421 LNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKY 480
             +F+ +A  TF  +L  G+ ++E   +  +      +     +Q+H  +  +GF S   
Sbjct: 568 QYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLP 627

Query: 481 VSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESG 540
              +L+  Y   G +E S+  F Q E  D   + A++S      +  EA+     +   G
Sbjct: 628 FQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREG 687

Query: 541 KKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQ 600
              + FTFGS +   S  A   Q K +H+++ K G+     V +A+I  YAKCG I  A+
Sbjct: 688 IDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAE 747

Query: 601 GAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHM 660
             F +    N+V  +N+++ AY+ HG   EA+ +F++M  + V+P+  + V V+SAC H+
Sbjct: 748 KQFLEVSTKNEVS-WNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHI 807

Query: 661 GLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRS 720
           GLV++G + F++M S+Y ++P  ++Y C+VDML+R G L  A+  I+ MP  P   + R+
Sbjct: 808 GLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRT 867

Query: 721 LLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGV 780
           LLS C ++ N E+G++ A  LL L P++ AT+VLLS +Y+    W+     R++M + GV
Sbjct: 868 LLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGV 927

Query: 781 LKDPGYSRVEI 788
            K+PG S +E+
Sbjct: 928 KKEPGQSWIEV 937
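The percentages in an HSP header follow directly from the match counts and the alignment length. The sketch below recomputes them for the hit above; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption based on the header layout used on this page.

```python
import re

def parse_hsp_stats(header):
    """Return {label: (matches, aligned_length, percent)} for each
    'Label = n/m' pair found in an HSP header line."""
    stats = {}
    for label, n, m in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        n, m = int(n), int(m)
        stats[label] = (n, m, round(100.0 * n / m, 2))
    return stats

header = "Identity = 219/731 (29.96%), Positives = 378/731 (51.71%), Query Frame = 1"
stats = parse_hsp_stats(header)
print(stats["Identity"])   # (219, 731, 29.96)
print(stats["Positives"])  # (378, 731, 51.71)
```

Identity counts exact residue matches; positives also count conservative substitutions (the `+` marks in the midline), so the second figure is never lower than the first.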

BLAST of CSPI05G20220 vs. Swiss-Prot
Match: PP220_ARATH (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 7.8e-106
Identity = 238/757 (31.44%), Positives = 383/757 (50.59%), Query Frame = 1

Query: 39  DPKTIATALSLSENTKSLIL--GAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFK 98
           DP T+      S   K+ +      V   M   G   D  +   ++  Y R G + +   
Sbjct: 223 DPNTVCWTCLFSGYVKAGLPEEAVLVFERMRDEGHRPDHLAFVTVINTYIRLGKLKDARL 282

Query: 99  VFEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEA 158
           +F EM   +VV+W+++IS   + G   + +E F  M +  +  T    GSV+ A   V  
Sbjct: 283 LFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVAN 342

Query: 159 YGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGG 218
              G  VH  + K+G+  N++VG S +SMY++   + +A  VFE +E+ +   WNAMI G
Sbjct: 343 LDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRG 402

Query: 219 YTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGLIST 278
           Y + G S + +     + S G  +D FT  S +  C+   DL+ G + H  I+++ L   
Sbjct: 403 YAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKN 462

Query: 279 AAM-NALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGG---SSNEKEIVDLFGKFVI 338
             + NAL+DMY          +IF  M  RD ++WNT+ G      NE E  DLF +  +
Sbjct: 463 LFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNL 522

Query: 339 EGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEM 398
            G+  +    +   + C  +     G Q   L+V  G   +    SS+I M+S+ G+++ 
Sbjct: 523 CGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKD 582

Query: 399 VHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACK 458
              VF SL    V + N  I  YS N+ E EA   F  +L  GV  +E TF+ I+E   K
Sbjct: 583 ARKVFSSLPEWSVVSMNALIAGYSQNNLE-EAVVLFQEMLTRGVNPSEITFATIVEACHK 642

Query: 459 FENPWMCRQLHCASLKAGFGSH-KYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGA 518
            E+  +  Q H    K GF S  +Y+  SL+  Y+    +  +  +F++L          
Sbjct: 643 PESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWT 702

Query: 519 VISTLVHQNHMYE-AIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKM 578
            + +   QN  YE A+ F   +   G  PD+ TF ++L  CS  ++  + +AIHSL+  +
Sbjct: 703 GMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHL 762

Query: 579 GFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQT 638
                   ++ +ID YAKCGD+  +   F++  + ++V+ +NS++  YA +G A +A++ 
Sbjct: 763 AHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKI 822

Query: 639 FEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLS 698
           F+ MR + + P + +F+ V++AC H G V  GR +F+ M   Y +    D+  C+VD+L 
Sbjct: 823 FDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLG 882

Query: 699 RNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVL 758
           R G+L +A   IE+    P   +  SLL  CRI+G+   G+ +AEKL+ L PQN + +VL
Sbjct: 883 RWGYLQEADDFIEAQNLKPDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVL 942

Query: 759 LSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
           LS +Y+    WE A  +RK M D GV K PGYS +++
Sbjct: 943 LSNIYASQGCWEKANALRKVMRDRGVKKVPGYSWIDV 978

BLAST of CSPI05G20220 vs. Swiss-Prot
Match: PP357_ARATH (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 377.1 bits (967), Expect = 4.7e-103
Identity = 233/760 (30.66%), Positives = 386/760 (50.79%), Query Frame = 1

Query: 38  RDPKTIATALSLSENTKSLILGAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKV 97
           R  +  A  L L  +   L     VHG +   G + DT+  N L+ +Y R G M    KV
Sbjct: 42  RGRREFARLLQLRASDDLLHYQNVVHGQIIVWGLELDTYLSNILINLYSRAGGMVYARKV 101

Query: 98  FEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGL-MPTEFAFGSVMKACADVEA 157
           FE+MP+RN+VSWS ++S+ + +G +E  L  FLE  R     P E+   S ++AC+ ++ 
Sbjct: 102 FEKMPERNLVSWSTMVSACNHHGIYEESLVVFLEFWRTRKDSPNEYILSSFIQACSGLDG 161

Query: 158 YGFGSGVHCLSW--KIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMI 217
            G        S+  K G +++V+VG   +  Y + G+I  A LVF+ + +     W  MI
Sbjct: 162 RGRWMVFQLQSFLVKSGFDRDVYVGTLLIDFYLKDGNIDYARLVFDALPEKSTVTWTTMI 221

Query: 218 GGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGL- 277
            G    G S  +L     L  + +  D + + + + ACS++  L+ GK++H  ILR GL 
Sbjct: 222 SGCVKMGRSYVSLQLFYQLMEDNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLE 281

Query: 278 ISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGG---SSNEKEIVDLFGKF 337
           +  + MN L+D Y+   R  +  K+FN M  ++IISW T+  G   ++  KE ++LF   
Sbjct: 282 MDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSM 341

Query: 338 VIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLM 397
              G+KP+    S +   C  L     G Q  +  +     +++ V +S+I M+++   +
Sbjct: 342 SKFGLKPDMYACSSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCL 401

Query: 398 EMVHSVFDSLVFKPVSAWNQFILAYSL--NSFEM-EAFRTFSSLLRYGVVANEYTFSIII 457
                VFD      V  +N  I  YS     +E+ EA   F  +    +  +  TF  ++
Sbjct: 402 TDARKVFDIFAAADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLL 461

Query: 458 ETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA 517
             +    +  + +Q+H    K G     +   +LI  Y     L+ S  +F+++++ D+ 
Sbjct: 462 RASASLTSLGLSKQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLV 521

Query: 518 TYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV 577
            + ++ +  V Q+   EA+     L  S ++PDEFTF +++    + A+    +  H  +
Sbjct: 522 IWNSMFAGYVQQSENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQL 581

Query: 578 EKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEA 637
            K G   + ++ +A++D YAKCG    A  AF+ S  S DV+ +NS++ +YA+HG   +A
Sbjct: 582 LKRGLECNPYITNALLDMYAKCGSPEDAHKAFD-SAASRDVVCWNSVISSYANHGEGKKA 641

Query: 638 IQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVD 697
           +Q  EKM    ++P+  +FV V+SAC H GLVE G   F+ M   + + P  ++Y C+V 
Sbjct: 642 LQMLEKMMSEGIEPNYITFVGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEHYVCMVS 701

Query: 698 MLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLAT 757
           +L R G L  AR +IE MP  P   + RSLLSGC   GN EL +  AE  +   P++  +
Sbjct: 702 LLGRAGRLNKARELIEKMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGS 761

Query: 758 HVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
             +LS +Y+    W +A  +R+ M   GV+K+PG S + I
Sbjct: 762 FTMLSNIYASKGMWTEAKKVRERMKVEGVVKEPGRSWIGI 799

BLAST of CSPI05G20220 vs. Swiss-Prot
Match: PP210_ARATH (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 367.9 bits (943), Expect = 2.9e-100
Identity = 224/749 (29.91%), Positives = 378/749 (50.47%), Query Frame = 1

Query: 43  IATALSLSENTKSLILGAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEM- 102
           I+ ALS S N   L    ++H  +  LG D   F    L+  Y           VF  + 
Sbjct: 10  ISRALSSSSNLNEL---RRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVS 69

Query: 103 PQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGS 162
           P +NV  W+ II + S+NG F   LE + ++    + P ++ F SV+KACA +     G 
Sbjct: 70  PAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGD 129

Query: 163 GVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCG 222
            V+     +G E ++FVG + + MY+R+G +T A  VF+ M   D+  WN++I GY++ G
Sbjct: 130 LVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHG 189

Query: 223 LSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMN- 282
              EAL     L +  I  D FT+ S + A   +  +  G+ LHGF L+ G+ S   +N 
Sbjct: 190 YYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNN 249

Query: 283 ALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNEKEIVDLFGKFV--IEGMKPN 342
            L+ MYL   R     ++F+ M  RD +S+NT+  G    + + +    F+  ++  KP+
Sbjct: 250 GLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLDQFKPD 309

Query: 343 HITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFD 402
            +T S + R CG L D  L    ++  +  G + E+ V + +I ++++ G M     VF+
Sbjct: 310 LLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFN 369

Query: 403 SLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWM 462
           S+  K   +WN  I  Y  +   MEA + F  ++     A+  T+ ++I  + +  +   
Sbjct: 370 SMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKF 429

Query: 463 CRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVH 522
            + LH   +K+G      VS +LI  Y   G +  S +IF+ +   D  T+  VIS  V 
Sbjct: 430 GKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVR 489

Query: 523 QNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHV 582
                  +     + +S   PD  TF   L  C+S AA    K IH  + + G+   + +
Sbjct: 490 FGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQI 549

Query: 583 ASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAK 642
            +A+I+ Y+KCG + ++   FE+  +  DV+ +  M+ AY  +G   +A++TF  M  + 
Sbjct: 550 GNALIEMYSKCGCLENSSRVFERMSR-RDVVTWTGMIYAYGMYGEGEKALETFADMEKSG 609

Query: 643 VQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDA 702
           + P    F+++I AC H GLV++G + F+ MK+ Y + P  ++Y C+VD+LSR+  +  A
Sbjct: 610 IVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKA 669

Query: 703 RYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEG 762
              I++MP  P  +I  S+L  CR  G+ E  +  + +++ L P +    +L S  Y+  
Sbjct: 670 EEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAAL 729

Query: 763 NSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
             W+  + IRK + D  + K+PGYS +E+
Sbjct: 730 RKWDKVSLIRKSLKDKHITKNPGYSWIEV 754

BLAST of CSPI05G20220 vs. Swiss-Prot
Match: PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 350.5 bits (898), Expect = 4.7e-95
Identity = 208/755 (27.55%), Positives = 378/755 (50.07%), Query Frame = 1

Query: 41  KTIATALSLSENTKSLILGAQVHGHMCKL--GFDYDTFSMNNLLKMYCRCGFMCEGFKVF 100
           +  A  L L    +++  G Q+H  + K    F+ D F    L+ MY +CG + +  KVF
Sbjct: 81  EAFAYVLELCGKRRAVSQGRQLHSRIFKTFPSFELD-FLAGKLVFMYGKCGSLDDAEKVF 140

Query: 101 EEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYG 160
           +EMP R   +W+ +I +   NGE    L  +  M  +G+     +F +++KACA +    
Sbjct: 141 DEMPDRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIR 200

Query: 161 FGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFE-WMEKVDVGCWNAMIGGY 220
            GS +H L  K+G     F+  + +SMYA+  D+++A  +F+ + EK D   WN+++  Y
Sbjct: 201 SGSELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSY 260

Query: 221 TNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTA 280
           +  G SLE L     ++  G   + +TIVSA+ AC        GKE+H  +L+    S+ 
Sbjct: 261 STSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSE 320

Query: 281 --AMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNE---KEIVDLFGKFVI 340
               NAL+ MY    +     +I   M   D+++WN++  G       KE ++ F   + 
Sbjct: 321 LYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIA 380

Query: 341 EGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEM 400
            G K + ++ + +    G L +   G +  +  +  G     +V +++I M+S+  L   
Sbjct: 381 AGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCY 440

Query: 401 VHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACK 460
           +   F  +  K + +W   I  Y+ N   +EA   F  + +  +  +E     I+  +  
Sbjct: 441 MGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSV 500

Query: 461 FENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAV 520
            ++  + +++HC  L+ G      +   L+  Y    ++  +  +F  ++  D+ ++ ++
Sbjct: 501 LKSMLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSM 560

Query: 521 ISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGF 580
           IS+     +  EA+     ++E+G   D      IL+  +S +A ++ + IH  + + GF
Sbjct: 561 ISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGF 620

Query: 581 GFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFE 640
                +A A++D YA CGD+ SA+  F++  +   ++ Y SM+ AY  HG    A++ F+
Sbjct: 621 CLEGSIAVAVVDMYACCGDLQSAKAVFDR-IERKGLLQYTSMINAYGMHGCGKAAVELFD 680

Query: 641 KMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRN 700
           KMR   V P   SF++++ AC H GL+++GR   + M+ +Y + P  ++Y CLVDML R 
Sbjct: 681 KMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRA 740

Query: 701 GFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLS 760
             + +A   ++ M   P   +  +LL+ CR +  +E+G+  A++LL L P+N    VL+S
Sbjct: 741 NCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVS 800

Query: 761 KVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
            V++E   W D   +R +M   G+ K PG S +E+
Sbjct: 801 NVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEM 832

BLAST of CSPI05G20220 vs. TrEMBL
Match: A0A0A0KV18_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G577950 PE=4 SV=1)

HSP 1 Score: 1569.3 bits (4062), Expect = 0.0e+00
Identity = 780/787 (99.11%), Positives = 782/787 (99.36%), Query Frame = 1

Query: 1   MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA 60
           MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA
Sbjct: 1   MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA 60

Query: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENG 120
           QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLI SSLS+NG
Sbjct: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLITSSLSKNG 120

Query: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180
           EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG
Sbjct: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180

Query: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240
           STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM
Sbjct: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240

Query: 241 DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN 300
           D FTIVSA+KACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN
Sbjct: 241 DNFTIVSAVKACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN 300

Query: 301 SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ 360
           SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ
Sbjct: 301 SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ 360

Query: 361 FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF 420
           FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF
Sbjct: 361 FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF 420

Query: 421 EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS 480
           EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS
Sbjct: 421 EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS 480

Query: 481 LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD 540
           LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD
Sbjct: 481 LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD 540

Query: 541 EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE 600
           EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE
Sbjct: 541 EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE 600

Query: 601 QSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE 660
           QSCQSNDVIVYNSMMMAYAHHGLA EAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE
Sbjct: 601 QSCQSNDVIVYNSMMMAYAHHGLACEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE 660

Query: 661 QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG 720
           QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG
Sbjct: 661 QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG 720

Query: 721 CRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDP 780
           CRIYGN ELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTD GVLKDP
Sbjct: 721 CRIYGNVELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDP 780

Query: 781 GYSRVEI 788
           GYSRVEI
Sbjct: 781 GYSRVEI 787

BLAST of CSPI05G20220 vs. TrEMBL
Match: A0A0A0KRW0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174670 PE=4 SV=1)

HSP 1 Score: 1320.1 bits (3415), Expect = 0.0e+00
Identity = 662/710 (93.24%), Positives = 668/710 (94.08%), Query Frame = 1

Query: 78  MNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGL 137
           MNNLLKMY RCGFMCEGFKVFEEMPQRNVVSWSLI SSLS+NGEFE CLESFLEMMRDGL
Sbjct: 1   MNNLLKMYFRCGFMCEGFKVFEEMPQRNVVSWSLITSSLSKNGEFEFCLESFLEMMRDGL 60

Query: 138 MPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAEL 197
           MPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAEL
Sbjct: 61  MPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAEL 120

Query: 198 VFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQD 257
           VFEWMEKVDVGCWNAMIGGYT+CGL LEALSAVSLLNSEGIKMD FTIVSA+KACSLIQD
Sbjct: 121 VFEWMEKVDVGCWNAMIGGYTHCGLGLEALSAVSLLNSEGIKMDNFTIVSAVKACSLIQD 180

Query: 258 LDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGS 317
           LDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGS
Sbjct: 181 LDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGS 240

Query: 318 SNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVL 377
           SNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVL
Sbjct: 241 SNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVL 300

Query: 378 SSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVV 437
           SSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAY                      
Sbjct: 301 SSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAY---------------------- 360

Query: 438 ANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEI 497
                     +TACKFENPWMCRQLHCAS+KAGFGSHKYVSCSLIKCYILIGSLESSFEI
Sbjct: 361 ----------KTACKFENPWMCRQLHCASMKAGFGSHKYVSCSLIKCYILIGSLESSFEI 420

Query: 498 FNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAY 557
           FNQLEIVDMATYGAVISTLVHQN+MYEAIMFLN LMESGKKPDEFTFGSILNGCSSRAAY
Sbjct: 421 FNQLEIVDMATYGAVISTLVHQNYMYEAIMFLNFLMESGKKPDEFTFGSILNGCSSRAAY 480

Query: 558 HQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMA 617
           HQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMA
Sbjct: 481 HQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMA 540

Query: 618 YAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTP 677
           YAHHGLA EAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTP
Sbjct: 541 YAHHGLACEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTP 600

Query: 678 SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKL 737
           SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGN ELGQWTAEKL
Sbjct: 601 SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNVELGQWTAEKL 660

Query: 738 LSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
           LSLAPQN ATHVLLSKVYSEGNSWEDAANIRKEMTD GVLKDPGYSRVEI
Sbjct: 661 LSLAPQNDATHVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDPGYSRVEI 678

BLAST of CSPI05G20220 vs. TrEMBL
Match: B9HPK8_POPTR (Uncharacterized protein (Fragment) OS=Populus trichocarpa GN=POPTR_0009s03250g PE=4 SV=2)

HSP 1 Score: 823.9 bits (2127), Expect = 1.6e-235
Identity = 414/746 (55.50%), Positives = 532/746 (71.31%), Query Frame = 1

Query: 46  ALSLSENTKSLILGAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRN 105
           ALS  EN+KS ILG Q+HG++ KLGF  D F  NNL+K Y +   +  GF VF+ M +RN
Sbjct: 13  ALSFCENSKSFILGTQIHGYIIKLGFSSDVFVSNNLIKFYAKGAVLRYGFNVFDGMLERN 72

Query: 106 VVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACAD-VEAYGFGSGVH 165
           VVSW+L++    +  E EL LE FLEM+RDG +P EF  GSVMKAC + VE   FG  VH
Sbjct: 73  VVSWTLMVCGAIQCEEVELGLEVFLEMIRDGFVPNEFGLGSVMKACGNSVEGRVFGLCVH 132

Query: 166 CLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSL 225
           C + KIGME+N FV  S LS YA+LGDI +AE VFE +E+VDVGCWNAMIGGY  CG   
Sbjct: 133 CFALKIGMERNPFVSCSVLSFYAKLGDIGAAERVFESLEEVDVGCWNAMIGGYAQCGYGF 192

Query: 226 EALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGL-ISTAAMNALM 285
           EA+   SL+  +GI MDK+T ++ I+ CSL+ DL+ G+++HG I+R  L +S   MNALM
Sbjct: 193 EAIVTASLMRRKGIFMDKYTFINVIQGCSLLGDLNFGRQIHGLIIRSELELSAPVMNALM 252

Query: 286 DMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNE---KEIVDLFGKFVIEGMKPNHI 345
           DMY  +    S L +F  M  RD+++WNTVFG  S     K+I  LF  F++  M+PNHI
Sbjct: 253 DMYFKNGGMKSGLVVFKKMHDRDVVTWNTVFGSFSQHEDPKDIASLFHSFLLTSMRPNHI 312

Query: 346 TFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSL 405
           TFS+LFR+CG LL+  LG QF  LA+H G  DE  + S++I+MFS+ G MEM H VF S 
Sbjct: 313 TFSILFRECGKLLNLDLGLQFCCLALHFGLFDEANITSALINMFSRCGKMEMAHLVFKSK 372

Query: 406 VFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCR 465
           V + +  WN+ I  Y LN  + EA +TF  LL+ GV ANEYTFS ++ET  + EN  M R
Sbjct: 373 VSENIIIWNELISGYKLNCCDAEALKTFYDLLQLGVEANEYTFSNVLETCSRSENQLMNR 432

Query: 466 QLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQN 525
           Q+H  + K+GF SH YV  SLIK YI  G L+ S ++FN L+  DMA +G +IS  VHQ 
Sbjct: 433 QIHGVAFKSGFASHGYVCSSLIKGYIKCGLLDDSLKVFNMLDRPDMAAWGTMISAFVHQG 492

Query: 526 HMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVAS 585
              EAI  LN+L+E+G+KPDEF  GSIL+ C+S  AY QTK++HSL+ K+GF  HV VAS
Sbjct: 493 WDCEAIRSLNLLIEAGEKPDEFILGSILSSCASTVAYCQTKSVHSLIIKLGFEGHVFVAS 552

Query: 586 AIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQ 645
           A++DAYAKCGDI SA+ AF QSC+S+DV++YN+M++AYAHHG   EA+ T++KM++A +Q
Sbjct: 553 AVLDAYAKCGDIQSAKMAFNQSCKSSDVVIYNAMIIAYAHHGRVVEALDTYDKMKLANLQ 612

Query: 646 PSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARY 705
           PSQA+FVSVI+AC H+G VE+G  LF++M   Y M PS D YGCLVDM SRNG+L DA+ 
Sbjct: 613 PSQATFVSVIAACGHIGHVEKGCRLFKSMDL-YGMEPSPDIYGCLVDMFSRNGYLEDAKQ 672

Query: 706 IIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNS 765
           IIES+P+  WPAILRSLLSGCR+YGNRELG+W A+KLL L P N A H LL KVYSE  +
Sbjct: 673 IIESLPYPAWPAILRSLLSGCRMYGNRELGEWAAKKLLQLVPHNDAAHALLFKVYSELGN 732

Query: 766 WEDAANIRKEMTDGGVLKDPGYSRVE 787
           WEDAA +R+EM + G+ KDPG+S +E
Sbjct: 733 WEDAAKMRREMAERGLRKDPGHSWIE 757

BLAST of CSPI05G20220 vs. TrEMBL
Match: F6HIN1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0042g00130 PE=4 SV=1)

HSP 1 Score: 819.7 bits (2116), Expect = 3.1e-234
Identity = 411/769 (53.45%), Positives = 540/769 (70.22%), Query Frame = 1

Query: 27  FLSYSCN----ISIGRDPKTIATALSLSENTKSLILGAQVHGHMCKLGFDYDTFSMNNLL 86
           + S SCN    +S   DP  ++TAL+ S N+K ++LG+Q+H  + KLGF  D FS NNL+
Sbjct: 42  YTSKSCNCSSSLSFRNDPTALSTALTHSANSKCILLGSQIHAQIIKLGFCNDIFSQNNLI 101

Query: 87  KMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEF 146
           +MY +CGF+  G KVF EMP +N+VSW+L++S   +NGEFE+ L  +LEM+R GL+P EF
Sbjct: 102 RMYTKCGFLAGGLKVFGEMPMKNLVSWTLVVSGAVQNGEFEMGLGVYLEMIRTGLVPNEF 161

Query: 147 AFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWM 206
           A G V KACA +     G  VHC + K+GME+N FVG S L+MYA+LGDI  AE VFE M
Sbjct: 162 ALGCVTKACAALGGKELGLCVHCFALKVGMEKNPFVGSSILNMYAKLGDIEDAERVFECM 221

Query: 207 EKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGK 266
           + + VGCWNAMIGGY  C    E+L  VS++  +GI MD FT ++A+K C ++ +L+ G+
Sbjct: 222 DNLVVGCWNAMIGGYAQCSYGFESLKIVSVMQYKGISMDAFTFINALKGCLVVGNLNFGR 281

Query: 267 ELHGFILRRGL-ISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSS--- 326
           ++HG I++  +  STA MN+LMDMY  +      LK+F+ +Q +DIISWNTVF G S   
Sbjct: 282 QIHGLIIQSEVGFSTAVMNSLMDMYFKNGGGLYALKVFDRLQDKDIISWNTVFAGLSQGD 341

Query: 327 NEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLS 386
           + +EI   F K ++ G+KPN +TFS+LFR CG  LD   G QF  LA   G  DE  V S
Sbjct: 342 DAREIGRFFHKLMLTGLKPNCVTFSILFRFCGEALDLVSGLQFHCLAFRFGISDEASVTS 401

Query: 387 SIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVA 446
           S+I+MFS+ G M M   VFDS  FK +   N+ I  Y+LN    EA   F +L   G+ A
Sbjct: 402 SLINMFSRCGAMRMACLVFDSAPFKSIHTCNEMISGYNLNCHNAEALNLFCNLNGLGLEA 461

Query: 447 NEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIF 506
           +E TFS  +E   + EN  + RQ+H   +K+GF S  YV  SL+KCY+  G L+ SFE F
Sbjct: 462 DECTFSSALEACFRTENQKLGRQMHGTIVKSGFASQGYVCSSLLKCYVGFGLLDDSFEFF 521

Query: 507 NQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYH 566
           N +E +D+ ++GA+IS LVH+ +  EAI  LN L E+G KPDEF FGSI N C+  AAY 
Sbjct: 522 NGVERLDLVSWGAMISALVHKGYSSEAIGLLNRLKEAGGKPDEFIFGSIFNCCAGIAAYR 581

Query: 567 QTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAY 626
           QTK++HSLV KMG+  HV VASA+IDAYAKCGDI +A+  F+Q+ +  DVI++N+M+MAY
Sbjct: 582 QTKSVHSLVVKMGYEAHVFVASAVIDAYAKCGDIENARRVFDQTSRFRDVILFNTMVMAY 641

Query: 627 AHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPS 686
           AHHGL  EA++TFEKM++A ++PSQA+FVSVISAC H+GLVEQG   F++M  DY M PS
Sbjct: 642 AHHGLVREAVETFEKMKLATLEPSQATFVSVISACSHLGLVEQGDIFFKSMNLDYGMDPS 701

Query: 687 RDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLL 746
            DNYGCLVD+ SRNGFL DA++IIE+MPF PWPAI RSLL+GCRI+GN+ELG+W A+KLL
Sbjct: 702 PDNYGCLVDLFSRNGFLEDAKHIIETMPFPPWPAIWRSLLNGCRIHGNKELGEWAAKKLL 761

Query: 747 SLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
            L P+N A +VLLSKVYSE  SW DAA +RK M + G+ KDPG S +EI
Sbjct: 762 QLVPENDAAYVLLSKVYSEEGSWSDAAKVRKGMIERGLWKDPGCSWIEI 810

BLAST of CSPI05G20220 vs. TrEMBL
Match: M5XJ55_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa021912mg PE=4 SV=1)

HSP 1 Score: 784.3 bits (2024), Expect = 1.4e-223
Identity = 392/711 (55.13%), Positives = 516/711 (72.57%), Query Frame = 1

Query: 84  MYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFA 143
           MY +CG + +GF+VF++MP RN+V+W+L+IS+  ++G+FE  LE +L ++R GL P EF 
Sbjct: 1   MYAKCGLVGDGFRVFDKMPDRNLVTWTLMISAAVQDGQFEWGLEIYLGLIRSGLRPNEFT 60

Query: 144 FGSVMKACADV---EAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFE 203
            GSV+K CA+    +AY FG  VHC + K+G+EQN +VGGS LSMYA+L DI SA+ VFE
Sbjct: 61  IGSVLKGCAECTSSKAYEFGMSVHCFALKVGIEQNCYVGGSILSMYAKLEDIESAKGVFE 120

Query: 204 WMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDS 263
            M  +D   WN MIGGY  CG  LEAL  VSL+   GI MD+FT V+A+K CS++ +LD 
Sbjct: 121 SMSNLDTAGWNTMIGGYAQCGYGLEALKVVSLMVWRGISMDQFTFVNALKGCSVMGNLDF 180

Query: 264 GKELHGFILRRGL-ISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSN 323
           GK+LHG I++  +  ST+ MNAL DMY  + +K++ LK+FN +Q +D+ISWNT FG  S 
Sbjct: 181 GKQLHGLIIQSEMEFSTSVMNALSDMYSRNGKKDAALKVFNRIQAKDVISWNTAFGVFSE 240

Query: 324 EK---EIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRV 383
           +K   EI  L  +F++  MKPNH+TFS+LFRQCG +LD  LG QF+SLA+  G  +E  V
Sbjct: 241 DKNTREIAKLVHEFMLANMKPNHVTFSILFRQCGEILDLNLGLQFYSLALQFGFWNEANV 300

Query: 384 LSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGV 443
            SSII+MFS+ G M+M    FDSL+ K +++WN+ I  Y+ N    EA + F  L   GV
Sbjct: 301 RSSIINMFSRCGAMDMARLFFDSLLDKNLTSWNELISGYNSNHCYTEARKIFCDLWDLGV 360

Query: 444 VANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFE 503
            A+E TFS I+E+  K E+  M RQ+H A +K+GF  H YV   LIKCY+  G L+ SFE
Sbjct: 361 EASEVTFSSILESCYKDEHQEMIRQIHGAIVKSGFSVHGYVCSFLIKCYVKFGLLDDSFE 420

Query: 504 IFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAA 563
            FN  E +D+ ++G +IS LVHQ H++EAI FL  L E+G KPDEF  GSILN C+  A 
Sbjct: 421 FFNGFETLDVESWGTMISALVHQGHLFEAIKFLKSLREAGGKPDEFILGSILNSCADNAG 480

Query: 564 YHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMM 623
           YH TK++HS+V KMGF   V V SA+IDAYAKCGDIGSA+  F QS +S DV+++N+M+M
Sbjct: 481 YHLTKSVHSVVIKMGFHSQVFVVSAVIDAYAKCGDIGSARMTFSQSFRSGDVVIHNAMIM 540

Query: 624 AYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMT 683
           A AHHGL  EA+  FEKM++A+++PSQA++VSVI+AC H+G V+ GR LF++M SD  M 
Sbjct: 541 ACAHHGLDKEAMGIFEKMKLARIKPSQATYVSVIAACAHVGQVDLGRLLFESMNSDSKME 600

Query: 684 P-SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAE 743
           P S D YGCLVDMLSR+G+L DAR +IE MP++PWPAILRSLLSGCRI+GN ELG+WTA+
Sbjct: 601 PISEDIYGCLVDMLSRSGYLEDARQMIEGMPYTPWPAILRSLLSGCRIHGNIELGEWTAK 660

Query: 744 KLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVE 787
           KL+ LAP+N   +VLLSKVYSE  SWEDA  IR+EM + GVLK+ GYS +E
Sbjct: 661 KLVQLAPENDVPYVLLSKVYSEEGSWEDATKIRREMIERGVLKNTGYSWIE 711

BLAST of CSPI05G20220 vs. TAIR10
Match: AT4G13650.1 (AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 387.1 bits (993), Expect = 2.6e-107
Identity = 219/731 (29.96%), Positives = 378/731 (51.71%), Query Frame = 1

Query: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENG 120
           Q+H  +   G    T   N L+ +Y R GF+    +VF+ +  ++  SW  +IS LS+N 
Sbjct: 208 QIHARILYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMISGLSKNE 267

Query: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180
                +  F +M   G+MPT +AF SV+ AC  +E+   G  +H L  K+G   + +V  
Sbjct: 268 CEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSSDTYVCN 327

Query: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240
           + +S+Y  LG++ SAE +F  M + D   +N +I G + CG   +A+     ++ +G++ 
Sbjct: 328 ALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEP 387

Query: 241 DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMN-ALMDMYLISDRKNSVLKIF 300
           D  T+ S + ACS    L  G++LH +  + G  S   +  AL+++Y       + L  F
Sbjct: 388 DSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYF 447

Query: 301 NSMQTRDIISWNTV---FGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSR 360
              +  +++ WN +   +G   + +    +F +  IE + PN  T+  + + C  L D  
Sbjct: 448 LETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLE 507

Query: 361 LGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYS 420
           LG Q  S  +         V S +I M+++ G ++    +      K V +W   I  Y+
Sbjct: 508 LGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYT 567

Query: 421 LNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKY 480
             +F+ +A  TF  +L  G+ ++E   +  +      +     +Q+H  +  +GF S   
Sbjct: 568 QYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLP 627

Query: 481 VSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESG 540
              +L+  Y   G +E S+  F Q E  D   + A++S      +  EA+     +   G
Sbjct: 628 FQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREG 687

Query: 541 KKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQ 600
              + FTFGS +   S  A   Q K +H+++ K G+     V +A+I  YAKCG I  A+
Sbjct: 688 IDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAE 747

Query: 601 GAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHM 660
             F +    N+V  +N+++ AY+ HG   EA+ +F++M  + V+P+  + V V+SAC H+
Sbjct: 748 KQFLEVSTKNEVS-WNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHI 807

Query: 661 GLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRS 720
           GLV++G + F++M S+Y ++P  ++Y C+VDML+R G L  A+  I+ MP  P   + R+
Sbjct: 808 GLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRT 867

Query: 721 LLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGV 780
           LLS C ++ N E+G++ A  LL L P++ AT+VLLS +Y+    W+     R++M + GV
Sbjct: 868 LLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGV 927

Query: 781 LKDPGYSRVEI 788
            K+PG S +E+
Sbjct: 928 KKEPGQSWIEV 937

BLAST of CSPI05G20220 vs. TAIR10
Match: AT3G09040.1 (AT3G09040.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 386.3 bits (991), Expect = 4.4e-107
Identity = 238/757 (31.44%), Positives = 383/757 (50.59%), Query Frame = 1

Query: 39  DPKTIATALSLSENTKSLIL--GAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFK 98
           DP T+      S   K+ +      V   M   G   D  +   ++  Y R G + +   
Sbjct: 223 DPNTVCWTCLFSGYVKAGLPEEAVLVFERMRDEGHRPDHLAFVTVINTYIRLGKLKDARL 282

Query: 99  VFEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEA 158
           +F EM   +VV+W+++IS   + G   + +E F  M +  +  T    GSV+ A   V  
Sbjct: 283 LFGEMSSPDVVAWNVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVAN 342

Query: 159 YGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGG 218
              G  VH  + K+G+  N++VG S +SMY++   + +A  VFE +E+ +   WNAMI G
Sbjct: 343 LDLGLVVHAEAIKLGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRG 402

Query: 219 YTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGLIST 278
           Y + G S + +     + S G  +D FT  S +  C+   DL+ G + H  I+++ L   
Sbjct: 403 YAHNGESHKVMELFMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKN 462

Query: 279 AAM-NALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGG---SSNEKEIVDLFGKFVI 338
             + NAL+DMY          +IF  M  RD ++WNT+ G      NE E  DLF +  +
Sbjct: 463 LFVGNALVDMYAKCGALEDARQIFERMCDRDNVTWNTIIGSYVQDENESEAFDLFKRMNL 522

Query: 339 EGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEM 398
            G+  +    +   + C  +     G Q   L+V  G   +    SS+I M+S+ G+++ 
Sbjct: 523 CGIVSDGACLASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKD 582

Query: 399 VHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACK 458
              VF SL    V + N  I  YS N+ E EA   F  +L  GV  +E TF+ I+E   K
Sbjct: 583 ARKVFSSLPEWSVVSMNALIAGYSQNNLE-EAVVLFQEMLTRGVNPSEITFATIVEACHK 642

Query: 459 FENPWMCRQLHCASLKAGFGSH-KYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGA 518
            E+  +  Q H    K GF S  +Y+  SL+  Y+    +  +  +F++L          
Sbjct: 643 PESLTLGTQFHGQITKRGFSSEGEYLGISLLGMYMNSRGMTEACALFSELSSPKSIVLWT 702

Query: 519 VISTLVHQNHMYE-AIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKM 578
            + +   QN  YE A+ F   +   G  PD+ TF ++L  CS  ++  + +AIHSL+  +
Sbjct: 703 GMMSGHSQNGFYEEALKFYKEMRHDGVLPDQATFVTVLRVCSVLSSLREGRAIHSLIFHL 762

Query: 579 GFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQT 638
                   ++ +ID YAKCGD+  +   F++  + ++V+ +NS++  YA +G A +A++ 
Sbjct: 763 AHDLDELTSNTLIDMYAKCGDMKGSSQVFDEMRRRSNVVSWNSLINGYAKNGYAEDALKI 822

Query: 639 FEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLS 698
           F+ MR + + P + +F+ V++AC H G V  GR +F+ M   Y +    D+  C+VD+L 
Sbjct: 823 FDSMRQSHIMPDEITFLGVLTACSHAGKVSDGRKIFEMMIGQYGIEARVDHVACMVDLLG 882

Query: 699 RNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVL 758
           R G+L +A   IE+    P   +  SLL  CRI+G+   G+ +AEKL+ L PQN + +VL
Sbjct: 883 RWGYLQEADDFIEAQNLKPDARLWSSLLGACRIHGDDIRGEISAEKLIELEPQNSSAYVL 942

Query: 759 LSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
           LS +Y+    WE A  +RK M D GV K PGYS +++
Sbjct: 943 LSNIYASQGCWEKANALRKVMRDRGVKKVPGYSWIDV 978

BLAST of CSPI05G20220 vs. TAIR10
Match: AT4G39530.1 (AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 377.1 bits (967), Expect = 2.7e-104
Identity = 233/760 (30.66%), Positives = 386/760 (50.79%), Query Frame = 1

Query: 38  RDPKTIATALSLSENTKSLILGAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKV 97
           R  +  A  L L  +   L     VHG +   G + DT+  N L+ +Y R G M    KV
Sbjct: 42  RGRREFARLLQLRASDDLLHYQNVVHGQIIVWGLELDTYLSNILINLYSRAGGMVYARKV 101

Query: 98  FEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGL-MPTEFAFGSVMKACADVEA 157
           FE+MP+RN+VSWS ++S+ + +G +E  L  FLE  R     P E+   S ++AC+ ++ 
Sbjct: 102 FEKMPERNLVSWSTMVSACNHHGIYEESLVVFLEFWRTRKDSPNEYILSSFIQACSGLDG 161

Query: 158 YGFGSGVHCLSW--KIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMI 217
            G        S+  K G +++V+VG   +  Y + G+I  A LVF+ + +     W  MI
Sbjct: 162 RGRWMVFQLQSFLVKSGFDRDVYVGTLLIDFYLKDGNIDYARLVFDALPEKSTVTWTTMI 221

Query: 218 GGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGL- 277
            G    G S  +L     L  + +  D + + + + ACS++  L+ GK++H  ILR GL 
Sbjct: 222 SGCVKMGRSYVSLQLFYQLMEDNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLE 281

Query: 278 ISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGG---SSNEKEIVDLFGKF 337
           +  + MN L+D Y+   R  +  K+FN M  ++IISW T+  G   ++  KE ++LF   
Sbjct: 282 MDASLMNVLIDSYVKCGRVIAAHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSM 341

Query: 338 VIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLM 397
              G+KP+    S +   C  L     G Q  +  +     +++ V +S+I M+++   +
Sbjct: 342 SKFGLKPDMYACSSILTSCASLHALGFGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCL 401

Query: 398 EMVHSVFDSLVFKPVSAWNQFILAYSL--NSFEM-EAFRTFSSLLRYGVVANEYTFSIII 457
                VFD      V  +N  I  YS     +E+ EA   F  +    +  +  TF  ++
Sbjct: 402 TDARKVFDIFAAADVVLFNAMIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLL 461

Query: 458 ETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA 517
             +    +  + +Q+H    K G     +   +LI  Y     L+ S  +F+++++ D+ 
Sbjct: 462 RASASLTSLGLSKQIHGLMFKYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLV 521

Query: 518 TYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV 577
            + ++ +  V Q+   EA+     L  S ++PDEFTF +++    + A+    +  H  +
Sbjct: 522 IWNSMFAGYVQQSENEEALNLFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQL 581

Query: 578 EKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEA 637
            K G   + ++ +A++D YAKCG    A  AF+ S  S DV+ +NS++ +YA+HG   +A
Sbjct: 582 LKRGLECNPYITNALLDMYAKCGSPEDAHKAFD-SAASRDVVCWNSVISSYANHGEGKKA 641

Query: 638 IQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVD 697
           +Q  EKM    ++P+  +FV V+SAC H GLVE G   F+ M   + + P  ++Y C+V 
Sbjct: 642 LQMLEKMMSEGIEPNYITFVGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEHYVCMVS 701

Query: 698 MLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLAT 757
           +L R G L  AR +IE MP  P   + RSLLSGC   GN EL +  AE  +   P++  +
Sbjct: 702 LLGRAGRLNKARELIEKMPTKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGS 761

Query: 758 HVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
             +LS +Y+    W +A  +R+ M   GV+K+PG S + I
Sbjct: 762 FTMLSNIYASKGMWTEAKKVRERMKVEGVVKEPGRSWIGI 799

BLAST of CSPI05G20220 vs. TAIR10
Match: AT3G03580.1 (AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 367.9 bits (943), Expect = 1.6e-101
Identity = 224/749 (29.91%), Positives = 378/749 (50.47%), Query Frame = 1

Query: 43  IATALSLSENTKSLILGAQVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEM- 102
           I+ ALS S N   L    ++H  +  LG D   F    L+  Y           VF  + 
Sbjct: 10  ISRALSSSSNLNEL---RRIHALVISLGLDSSDFFSGKLIDKYSHFREPASSLSVFRRVS 69

Query: 103 PQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGS 162
           P +NV  W+ II + S+NG F   LE + ++    + P ++ F SV+KACA +     G 
Sbjct: 70  PAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGD 129

Query: 163 GVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCG 222
            V+     +G E ++FVG + + MY+R+G +T A  VF+ M   D+  WN++I GY++ G
Sbjct: 130 LVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHG 189

Query: 223 LSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMN- 282
              EAL     L +  I  D FT+ S + A   +  +  G+ LHGF L+ G+ S   +N 
Sbjct: 190 YYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNN 249

Query: 283 ALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNEKEIVDLFGKFV--IEGMKPN 342
            L+ MYL   R     ++F+ M  RD +S+NT+  G    + + +    F+  ++  KP+
Sbjct: 250 GLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLENLDQFKPD 309

Query: 343 HITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFD 402
            +T S + R CG L D  L    ++  +  G + E+ V + +I ++++ G M     VF+
Sbjct: 310 LLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFN 369

Query: 403 SLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWM 462
           S+  K   +WN  I  Y  +   MEA + F  ++     A+  T+ ++I  + +  +   
Sbjct: 370 SMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKF 429

Query: 463 CRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVH 522
            + LH   +K+G      VS +LI  Y   G +  S +IF+ +   D  T+  VIS  V 
Sbjct: 430 GKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVR 489

Query: 523 QNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHV 582
                  +     + +S   PD  TF   L  C+S AA    K IH  + + G+   + +
Sbjct: 490 FGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQI 549

Query: 583 ASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAK 642
            +A+I+ Y+KCG + ++   FE+  +  DV+ +  M+ AY  +G   +A++TF  M  + 
Sbjct: 550 GNALIEMYSKCGCLENSSRVFERMSR-RDVVTWTGMIYAYGMYGEGEKALETFADMEKSG 609

Query: 643 VQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDA 702
           + P    F+++I AC H GLV++G + F+ MK+ Y + P  ++Y C+VD+LSR+  +  A
Sbjct: 610 IVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKA 669

Query: 703 RYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEG 762
              I++MP  P  +I  S+L  CR  G+ E  +  + +++ L P +    +L S  Y+  
Sbjct: 670 EEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAAL 729

Query: 763 NSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
             W+  + IRK + D  + K+PGYS +E+
Sbjct: 730 RKWDKVSLIRKSLKDKHITKNPGYSWIEV 754

BLAST of CSPI05G20220 vs. TAIR10
Match: AT3G63370.1 (AT3G63370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.5 bits (898), Expect = 2.7e-96
Identity = 208/755 (27.55%), Positives = 378/755 (50.07%), Query Frame = 1

Query: 41  KTIATALSLSENTKSLILGAQVHGHMCKL--GFDYDTFSMNNLLKMYCRCGFMCEGFKVF 100
           +  A  L L    +++  G Q+H  + K    F+ D F    L+ MY +CG + +  KVF
Sbjct: 81  EAFAYVLELCGKRRAVSQGRQLHSRIFKTFPSFELD-FLAGKLVFMYGKCGSLDDAEKVF 140

Query: 101 EEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYG 160
           +EMP R   +W+ +I +   NGE    L  +  M  +G+     +F +++KACA +    
Sbjct: 141 DEMPDRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIR 200

Query: 161 FGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFE-WMEKVDVGCWNAMIGGY 220
            GS +H L  K+G     F+  + +SMYA+  D+++A  +F+ + EK D   WN+++  Y
Sbjct: 201 SGSELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSY 260

Query: 221 TNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTA 280
           +  G SLE L     ++  G   + +TIVSA+ AC        GKE+H  +L+    S+ 
Sbjct: 261 STSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSE 320

Query: 281 --AMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNE---KEIVDLFGKFVI 340
               NAL+ MY    +     +I   M   D+++WN++  G       KE ++ F   + 
Sbjct: 321 LYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIA 380

Query: 341 EGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQFGLMEM 400
            G K + ++ + +    G L +   G +  +  +  G     +V +++I M+S+  L   
Sbjct: 381 AGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCY 440

Query: 401 VHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIIIETACK 460
           +   F  +  K + +W   I  Y+ N   +EA   F  + +  +  +E     I+  +  
Sbjct: 441 MGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSV 500

Query: 461 FENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMATYGAV 520
            ++  + +++HC  L+ G      +   L+  Y    ++  +  +F  ++  D+ ++ ++
Sbjct: 501 LKSMLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSM 560

Query: 521 ISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLVEKMGF 580
           IS+     +  EA+     ++E+G   D      IL+  +S +A ++ + IH  + + GF
Sbjct: 561 ISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKGF 620

Query: 581 GFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFE 640
                +A A++D YA CGD+ SA+  F++  +   ++ Y SM+ AY  HG    A++ F+
Sbjct: 621 CLEGSIAVAVVDMYACCGDLQSAKAVFDR-IERKGLLQYTSMINAYGMHGCGKAAVELFD 680

Query: 641 KMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRN 700
           KMR   V P   SF++++ AC H GL+++GR   + M+ +Y + P  ++Y CLVDML R 
Sbjct: 681 KMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRA 740

Query: 701 GFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLATHVLLS 760
             + +A   ++ M   P   +  +LL+ CR +  +E+G+  A++LL L P+N    VL+S
Sbjct: 741 NCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVS 800

Query: 761 KVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
            V++E   W D   +R +M   G+ K PG S +E+
Sbjct: 801 NVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEM 832

BLAST of CSPI05G20220 vs. NCBI nr
Match: gi|778704196|ref|XP_011655491.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like isoform X1 [Cucumis sativus])

HSP 1 Score: 1569.3 bits (4062), Expect = 0.0e+00
Identity = 780/787 (99.11%), Positives = 782/787 (99.36%), Query Frame = 1

Query: 1   MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA 60
           MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA
Sbjct: 1   MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA 60

Query: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENG 120
           QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLI SSLS+NG
Sbjct: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLITSSLSKNG 120

Query: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180
           EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG
Sbjct: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180

Query: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240
           STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM
Sbjct: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240

Query: 241 DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN 300
           D FTIVSA+KACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN
Sbjct: 241 DNFTIVSAVKACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN 300

Query: 301 SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ 360
           SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ
Sbjct: 301 SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ 360

Query: 361 FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF 420
           FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF
Sbjct: 361 FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF 420

Query: 421 EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS 480
           EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS
Sbjct: 421 EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS 480

Query: 481 LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD 540
           LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD
Sbjct: 481 LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD 540

Query: 541 EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE 600
           EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE
Sbjct: 541 EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE 600

Query: 601 QSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE 660
           QSCQSNDVIVYNSMMMAYAHHGLA EAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE
Sbjct: 601 QSCQSNDVIVYNSMMMAYAHHGLACEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE 660

Query: 661 QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG 720
           QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG
Sbjct: 661 QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG 720

Query: 721 CRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDP 780
           CRIYGN ELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTD GVLKDP
Sbjct: 721 CRIYGNVELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDP 780

Query: 781 GYSRVEI 788
           GYSRVEI
Sbjct: 781 GYSRVEI 787

BLAST of CSPI05G20220 vs. NCBI nr
Match: gi|659090199|ref|XP_008445887.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like isoform X1 [Cucumis melo])

HSP 1 Score: 1502.3 bits (3888), Expect = 0.0e+00
Identity = 743/786 (94.53%), Positives = 761/786 (96.82%), Query Frame = 1

Query: 1   MKISALGTGLVSLTNRVFKFHPSFERFLSYSCNISIGRDPKTIATALSLSENTKSLILGA 60
           MKISALGTG V LTN+  KFHP FERFLSYSCNIS+GRDPKTIA+ALSLSENTKSLILGA
Sbjct: 1   MKISALGTGFVLLTNKALKFHPFFERFLSYSCNISVGRDPKTIASALSLSENTKSLILGA 60

Query: 61  QVHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENG 120
           Q+HGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSL ENG
Sbjct: 61  QIHGHMCKLGFDYDTFSMNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLPENG 120

Query: 121 EFELCLESFLEMMRDGLMPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGG 180
           EFELCLESFLEMMRDGLMP EF FGSVMKACADVEAYGFGSGVHCLSWK+G+EQNVFVGG
Sbjct: 121 EFELCLESFLEMMRDGLMPNEFTFGSVMKACADVEAYGFGSGVHCLSWKLGIEQNVFVGG 180

Query: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKM 240
           STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGL L+ALSAVSLLN +GIKM
Sbjct: 181 STLSMYARLGDITSAELVFEWMEKVDVGCWNAMIGGYTNCGLGLKALSAVSLLNCKGIKM 240

Query: 241 DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFN 300
           DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTA MNALMDMY ISDRKNS LK FN
Sbjct: 241 DKFTIVSAIKACSLIQDLDSGKELHGFILRRGLISTAVMNALMDMYFISDRKNSALKTFN 300

Query: 301 SMQTRDIISWNTVFGGSSNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQ 360
           SMQTRDIISWNTVF GSSNE EIVDLFGKF+IEGMKPNHITFSVLFRQCGVLLDSRLGFQ
Sbjct: 301 SMQTRDIISWNTVFVGSSNENEIVDLFGKFMIEGMKPNHITFSVLFRQCGVLLDSRLGFQ 360

Query: 361 FFSLAVHLGCLDETRVLSSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSF 420
           FFSLAVHLG LDETRVLSSIISMFSQ GLMEMVHSVFDSLVFKPVSAWNQ ILAYSLNSF
Sbjct: 361 FFSLAVHLGFLDETRVLSSIISMFSQIGLMEMVHSVFDSLVFKPVSAWNQLILAYSLNSF 420

Query: 421 EMEAFRTFSSLLRYGVVANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCS 480
           EMEAFRTFSSLLRYGVVANEYT+SII+ETACK ENP +CRQLHCASLKAGFGSHKYVSCS
Sbjct: 421 EMEAFRTFSSLLRYGVVANEYTYSIIVETACKSENPRICRQLHCASLKAGFGSHKYVSCS 480

Query: 481 LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPD 540
           LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNH+YEAIMFLNILMESGKKPD
Sbjct: 481 LIKCYILIGSLESSFEIFNQLEIVDMATYGAVISTLVHQNHIYEAIMFLNILMESGKKPD 540

Query: 541 EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFE 600
           EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFG HVHVASAIIDAYAKCGDIGSAQGAFE
Sbjct: 541 EFTFGSILNGCSSRAAYHQTKAIHSLVEKMGFGVHVHVASAIIDAYAKCGDIGSAQGAFE 600

Query: 601 QSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVE 660
           QSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISAC H+GLVE
Sbjct: 601 QSCQSNDVIVYNSMMMAYAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACGHIGLVE 660

Query: 661 QGRSLFQTMKSDYNMTPSRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSG 720
           QGRSLFQTMKSDY+MTPSRDNYGCLVDML+RNGFLYDARYIIESMPFSPWPAILRSLLSG
Sbjct: 661 QGRSLFQTMKSDYSMTPSRDNYGCLVDMLARNGFLYDARYIIESMPFSPWPAILRSLLSG 720

Query: 721 CRIYGNRELGQWTAEKLLSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDP 780
           CRIYGNRELGQWTAEKLLS+APQN AT+VLLSKVYSEGNSWEDAANIRKEMTD GVLKDP
Sbjct: 721 CRIYGNRELGQWTAEKLLSMAPQNDATYVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDP 780

Query: 781 GYSRVE 787
           GYSRVE
Sbjct: 781 GYSRVE 786

BLAST of CSPI05G20220 vs. NCBI nr
Match: gi|700195264|gb|KGN50441.1| (hypothetical protein Csa_5G174670 [Cucumis sativus])

HSP 1 Score: 1320.1 bits (3415), Expect = 0.0e+00
Identity = 662/710 (93.24%), Positives = 668/710 (94.08%), Query Frame = 1

Query: 78  MNNLLKMYCRCGFMCEGFKVFEEMPQRNVVSWSLIISSLSENGEFELCLESFLEMMRDGL 137
           MNNLLKMY RCGFMCEGFKVFEEMPQRNVVSWSLI SSLS+NGEFE CLESFLEMMRDGL
Sbjct: 1   MNNLLKMYFRCGFMCEGFKVFEEMPQRNVVSWSLITSSLSKNGEFEFCLESFLEMMRDGL 60

Query: 138 MPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAEL 197
           MPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAEL
Sbjct: 61  MPTEFAFGSVMKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAEL 120

Query: 198 VFEWMEKVDVGCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQD 257
           VFEWMEKVDVGCWNAMIGGYT+CGL LEALSAVSLLNSEGIKMD FTIVSA+KACSLIQD
Sbjct: 121 VFEWMEKVDVGCWNAMIGGYTHCGLGLEALSAVSLLNSEGIKMDNFTIVSAVKACSLIQD 180

Query: 258 LDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGS 317
           LDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGS
Sbjct: 181 LDSGKELHGFILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGS 240

Query: 318 SNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVL 377
           SNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVL
Sbjct: 241 SNEKEIVDLFGKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVL 300

Query: 378 SSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVV 437
           SSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAY                      
Sbjct: 301 SSIISMFSQFGLMEMVHSVFDSLVFKPVSAWNQFILAY---------------------- 360

Query: 438 ANEYTFSIIIETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEI 497
                     +TACKFENPWMCRQLHCAS+KAGFGSHKYVSCSLIKCYILIGSLESSFEI
Sbjct: 361 ----------KTACKFENPWMCRQLHCASMKAGFGSHKYVSCSLIKCYILIGSLESSFEI 420

Query: 498 FNQLEIVDMATYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAY 557
           FNQLEIVDMATYGAVISTLVHQN+MYEAIMFLN LMESGKKPDEFTFGSILNGCSSRAAY
Sbjct: 421 FNQLEIVDMATYGAVISTLVHQNYMYEAIMFLNFLMESGKKPDEFTFGSILNGCSSRAAY 480

Query: 558 HQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMA 617
           HQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMA
Sbjct: 481 HQTKAIHSLVEKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMA 540

Query: 618 YAHHGLAWEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTP 677
           YAHHGLA EAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTP
Sbjct: 541 YAHHGLACEAIQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTP 600

Query: 678 SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKL 737
           SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGN ELGQWTAEKL
Sbjct: 601 SRDNYGCLVDMLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNVELGQWTAEKL 660

Query: 738 LSLAPQNLATHVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
           LSLAPQN ATHVLLSKVYSEGNSWEDAANIRKEMTD GVLKDPGYSRVEI
Sbjct: 661 LSLAPQNDATHVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDPGYSRVEI 678

BLAST of CSPI05G20220 vs. NCBI nr
Match: gi|778704206|ref|XP_011655494.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g09040, mitochondrial-like isoform X2 [Cucumis sativus])

HSP 1 Score: 1273.5 bits (3294), Expect = 0.0e+00
Identity = 635/640 (99.22%), Positives = 636/640 (99.38%), Query Frame = 1

Query: 148 MKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDV 207
           MKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDV
Sbjct: 1   MKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDV 60

Query: 208 GCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGF 267
           GCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMD FTIVSA+KACSLIQDLDSGKELHGF
Sbjct: 61  GCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDNFTIVSAVKACSLIQDLDSGKELHGF 120

Query: 268 ILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNEKEIVDLF 327
           ILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNEKEIVDLF
Sbjct: 121 ILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNEKEIVDLF 180

Query: 328 GKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQF 387
           GKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQF
Sbjct: 181 GKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQF 240

Query: 388 GLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIII 447
           GLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIII
Sbjct: 241 GLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIII 300

Query: 448 ETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA 507
           ETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA
Sbjct: 301 ETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA 360

Query: 508 TYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV 567
           TYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV
Sbjct: 361 TYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV 420

Query: 568 EKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEA 627
           EKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLA EA
Sbjct: 421 EKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLACEA 480

Query: 628 IQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVD 687
           IQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVD
Sbjct: 481 IQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVD 540

Query: 688 MLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLAT 747
           MLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGN ELGQWTAEKLLSLAPQNLAT
Sbjct: 541 MLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNVELGQWTAEKLLSLAPQNLAT 600

Query: 748 HVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVEI 788
           HVLLSKVYSEGNSWEDAANIRKEMTD GVLKDPGYSRVEI
Sbjct: 601 HVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDPGYSRVEI 640
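
The identity and positive-match counts in an HSP header can be recomputed from the alignment midline. The following is a minimal sketch, assuming the usual BLASTP midline convention (a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap). The names `hsp_stats` and `percent` are illustrative only and do not belong to any BLAST tool. The example uses the 208..267 block of the alignment above.

```python
def hsp_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumption: letters in the midline are identities, '+' marks a
    conservative substitution, and spaces are mismatches or gaps.
    """
    # Trailing spaces are often stripped from scraped text, so pad the midline.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent(part: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * part / total, 2)


# Block 208..267 of the alignment against XP_011655494.1.
query = "GCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGF"
midline = "GCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMD FTIVSA+KACSLIQDLDSGKELHGF"

stats = hsp_stats(query, midline)
print(stats)              # (58, 59, 60)
print(percent(635, 640))  # 99.22, the header value for the full HSP
```

Summing the per-block counts over every block of an HSP should give the totals reported in its header.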

BLAST of CSPI05G20220 vs. NCBI nr
Match: gi|659090201|ref|XP_008445888.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g09040, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 608/639 (95.15%), Positives = 622/639 (97.34%), Query Frame = 1

Query: 148 MKACADVEAYGFGSGVHCLSWKIGMEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDV 207
           MKACADVEAYGFGSGVHCLSWK+G+EQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDV
Sbjct: 1   MKACADVEAYGFGSGVHCLSWKLGIEQNVFVGGSTLSMYARLGDITSAELVFEWMEKVDV 60

Query: 208 GCWNAMIGGYTNCGLSLEALSAVSLLNSEGIKMDKFTIVSAIKACSLIQDLDSGKELHGF 267
           GCWNAMIGGYTNCGL L+ALSAVSLLN +GIKMDKFTIVSAIKACSLIQDLDSGKELHGF
Sbjct: 61  GCWNAMIGGYTNCGLGLKALSAVSLLNCKGIKMDKFTIVSAIKACSLIQDLDSGKELHGF 120

Query: 268 ILRRGLISTAAMNALMDMYLISDRKNSVLKIFNSMQTRDIISWNTVFGGSSNEKEIVDLF 327
           ILRRGLISTA MNALMDMY ISDRKNS LK FNSMQTRDIISWNTVF GSSNE EIVDLF
Sbjct: 121 ILRRGLISTAVMNALMDMYFISDRKNSALKTFNSMQTRDIISWNTVFVGSSNENEIVDLF 180

Query: 328 GKFVIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGCLDETRVLSSIISMFSQF 387
           GKF+IEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLG LDETRVLSSIISMFSQ 
Sbjct: 181 GKFMIEGMKPNHITFSVLFRQCGVLLDSRLGFQFFSLAVHLGFLDETRVLSSIISMFSQI 240

Query: 388 GLMEMVHSVFDSLVFKPVSAWNQFILAYSLNSFEMEAFRTFSSLLRYGVVANEYTFSIII 447
           GLMEMVHSVFDSLVFKPVSAWNQ ILAYSLNSFEMEAFRTFSSLLRYGVVANEYT+SII+
Sbjct: 241 GLMEMVHSVFDSLVFKPVSAWNQLILAYSLNSFEMEAFRTFSSLLRYGVVANEYTYSIIV 300

Query: 448 ETACKFENPWMCRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA 507
           ETACK ENP +CRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA
Sbjct: 301 ETACKSENPRICRQLHCASLKAGFGSHKYVSCSLIKCYILIGSLESSFEIFNQLEIVDMA 360

Query: 508 TYGAVISTLVHQNHMYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV 567
           TYGAVISTLVHQNH+YEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV
Sbjct: 361 TYGAVISTLVHQNHIYEAIMFLNILMESGKKPDEFTFGSILNGCSSRAAYHQTKAIHSLV 420

Query: 568 EKMGFGFHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEA 627
           EKMGFG HVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEA
Sbjct: 421 EKMGFGVHVHVASAIIDAYAKCGDIGSAQGAFEQSCQSNDVIVYNSMMMAYAHHGLAWEA 480

Query: 628 IQTFEKMRIAKVQPSQASFVSVISACRHMGLVEQGRSLFQTMKSDYNMTPSRDNYGCLVD 687
           IQTFEKMRIAKVQPSQASFVSVISAC H+GLVEQGRSLFQTMKSDY+MTPSRDNYGCLVD
Sbjct: 481 IQTFEKMRIAKVQPSQASFVSVISACGHIGLVEQGRSLFQTMKSDYSMTPSRDNYGCLVD 540

Query: 688 MLSRNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSLAPQNLAT 747
           ML+RNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLS+APQN AT
Sbjct: 541 MLARNGFLYDARYIIESMPFSPWPAILRSLLSGCRIYGNRELGQWTAEKLLSMAPQNDAT 600

Query: 748 HVLLSKVYSEGNSWEDAANIRKEMTDGGVLKDPGYSRVE 787
           +VLLSKVYSEGNSWEDAANIRKEMTD GVLKDPGYSRVE
Sbjct: 601 YVLLSKVYSEGNSWEDAANIRKEMTDRGVLKDPGYSRVE 639

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP307_ARATH  4.6e-106  29.96         Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN...
PP220_ARATH  7.8e-106  31.44         Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop...
PP357_ARATH  4.7e-103  30.66         Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana GN...
PP210_ARATH  2.9e-100  29.91         Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN...
PP296_ARATH  4.7e-95   27.55         Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...

Match Name        E-value   Identity (%)  Description
A0A0A0KV18_CUCSA  0.0e+00   99.11         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G577950 PE=4 SV=1
A0A0A0KRW0_CUCSA  0.0e+00   93.24         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174670 PE=4 SV=1
B9HPK8_POPTR      1.6e-235  55.50         Uncharacterized protein (Fragment) OS=Populus trichocarpa GN=POPTR_0009s03250g P...
F6HIN1_VITVI      3.1e-234  53.45         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0042g00130 PE=4 SV=...
M5XJ55_PRUPE      1.4e-223  55.13         Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa021912mg PE=4 S...

Match Name   E-value   Identity (%)  Description
AT4G13650.1  2.6e-107  29.96         Pentatricopeptide repeat (PPR) superfamily protein
AT3G09040.1  4.4e-107  31.44         Pentatricopeptide repeat (PPR) superfamily protein
AT4G39530.1  2.7e-104  30.66         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1  1.6e-101  29.91         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G63370.1  2.7e-96   27.55         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value  Identity (%)  Description
gi|778704196|ref|XP_011655491.1|  0.0e+00  99.11         PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like isoform X1...
gi|659090199|ref|XP_008445887.1|  0.0e+00  94.53         PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like isoform X1...
gi|700195264|gb|KGN50441.1|       0.0e+00  93.24         hypothetical protein Csa_5G174670 [Cucumis sativus]
gi|778704206|ref|XP_011655494.1|  0.0e+00  99.22         PREDICTED: pentatricopeptide repeat-containing protein At3g09040, mitochondrial-...
gi|659090201|ref|XP_008445888.1|  0.0e+00  95.15         PREDICTED: pentatricopeptide repeat-containing protein At3g09040, mitochondrial-...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI05G20220.1  CSPI05G20220.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 107..137  score: 8.1E-4
    coord: 77..106   score: 0.0021
    coord: 209..237  score: 0

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 607..653  score: 1.

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 107..140  score: 4.

IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 439..473  score: 5.875
    coord: 74..104   score: 9.701
    coord: 373..403  score: 5.459
    coord: 206..240  score: 8.407
    coord: 678..712  score: 6.336
    coord: 275..309  score: 6.982
    coord: 575..605  score: 5.81
    coord: 404..438  score: 6.423
    coord: 642..672  score: 7.18
    coord: 175..205  score: 6.303
    coord: 607..641  score: 10.852
    coord: 505..539  score: 8.473
    coord: 105..139  score: 10.786
    coord: 540..574  score: 6.621
    coord: 744..778  score: 7

IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 477..501  score: 1.7E-5
    coord: 576..634  score: 1.7E-5
    coord: 540..541  score: 1.7E-5
    coord: 738..765  score: 1.

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 475..785  score: 2.3E-277
    coord: 30..338   score: 2.3E

None  No IPR available  PANTHER  PTHR24015:SF715  SUBFAMILY NOT NAMED
    coord: 475..785  score: 2.3E-277
    coord: 30..338   score: 2.3E