CSPI07G13960 (gene) Wild cucumber (PI 183967)

NameCSPI07G13960
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing family protein
LocationChr7 : 12497955 .. 12500123 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAATTGCAAAAACATCAGCAAGAATTGTGGAGATACTCCACACTCCACAGGCAAAGCCAATCCATTGAATCCTCCCTTGCTGATCATCTTTCTTTCTGTTCATTGCTCTACTCCAAACCCAAACCTAATCTTCACAACTTTCCATGAATTCCACAATTTACAACCCAAACAGGACCCTCAATTTCTCACCTCAACCCCCATTAATTCTCTCAAAACCATTTACATCATGCAAAACACCCAGAGACTTGAAGCAACTCCACGCCATTTTCATCAAAACCGGTCAAATTCAAGACCCTCTCACGGCTGCTGAAGTTATTAAATTCTGTGCATTTTCATCCCGCGATATCGACTATGCCCGCGCCGTATTCCGCCAGATGCCGGAGCCCAATTGCTTCTGTTGGAACACCATTTTGAGAATTCTTGCTGAGACCAACGACGAACATCTCCAATCGGAGGCCTTGATGCTCTTTTCCGCAATGTTGTGCGATGGGCGTGTGAAGCCCAATCGGTTCACTTTCCCTTCCGTGTTGAAGGCCTGCGCTAGAGCTTCGAGGCTTCGAGAGGGGAAACAGATACATGGGTTGATTGTGAAGTTTGGATTTCATGAGGATGAGTTTGTAATTAGTAATCTCGTTCGGATGTATGTTATGTGTGCGGTCATGGAAGATGCGTATTCTCTGTTTTGTAAGAATGTTGTTGATTTTGATGGGTCTTGTCAAATGGAGTTAGATAAGAGGAAACAAGATGGTAATGTTGTTCTGTGGAATATAATGATTGATGGGCAGGTTAGACTTGGGGATATAAAGAGTGCTAAAAACTTGTTCGACGAAATGCCTCAAAGAAGTGTAGTGTCGTGGAATGTGATGATCTCGGGGTATGCACAAAATGGGCACTTCATAGAAGCCATAAACTTGTTTCAAGAAATGCAAAGTTCCAATATTGATCCAAACTACGTGACGTTAGTGAGTGTTTTGCCCGCCATTGCACGAATTGGTGCACTAGAATTGGGGAAATGGATCCATTTATATGCGGGAAATAATAAGATTGAGATTGATGATGTGCTTGGTTCTGCTCTGGTGGACATGTATTCCAAATGTGGTAGCATTGATAAGGCACTCCAGGTCTTTGAGACGCTGCCTAAGAGAAATGCCATTACTTGGAGTGCCATAATTGGTGCATTTGCTATGCACGGTCGCGCAGAGGATGCAATCATTCACTTCCATTTGATGGGAAAAGCTGGTGTGACGCCCAATGATGTTGCTTATATTGGAATTTTGAGTGCCTGCAGTCATGCAGGGCTTGTGGAAGAGGGTAGATCATTCTTTAGCCACATGGTTAAGGTGGTTGGGCTACAACCAAGAATTGAACACTATGGATGCATGGTCGATTTACTAGGCCGTGCTGGGCATCTGGAAGAGGCTGAGGAACTTATCAGAAACATGCCAATCGAACCAGATGATGTAATATGGAAAGCTTTGCTCGGTGCTTGTAAAATGCACAAGAACCTAAAAATGGGTGAGCGTGTAGCTGAGACTTTAATGGAGTTGGCTCCTCACGACAGTGGATCCTATGTTGCTCTGTCAAATTTGTATGCTTCTTTGGGAAACTGGGAGGCGGTTGCAAGGGTGAGGTTGAAGATGAAAGGGATGGACATTAGAAAAGACCCTGGATGTAGTTGGATTGAGATCCATGGAATAATCCATGAGTTTCTTGTAGAAGACGATTCACATTCTAAAGCTAAAGAAATCCAGGCCATGTTGGGAGAAATGTCCATGAAGTTAAGGTCGAATGGCTACAGACCAAACACCTTAGAAGTGTTTCTCAATACAGATGAACAAGAGAGGGCAAGGGCCTTGCAATATCATAGTGAGAAGATTGCAGTTGCCTTTGGACTAATCAGTACAGCCCCACAACATCCACTTAAGATTGTTAAGAACCTACGTATATGTGAAGACTGTCATGCTTCACTAAAACTTATTTCCTTGATTTACAAGCGTCAAATCATTGTGCGAGACCGAAAGCGATTCCATCAGTTTGAACATGGGTCATGCTCATGCATGGATTATTGGTAACTTCTGAGTCATGGCCGTTGTCTTTGCACAGAACTGTGAAGATTTAGTAGTCATTGTGAAGATTATAAGGAAAACTCAGCTCGTTTTTGCCCTGTC

mRNA sequence

ATGAATTCCACAATTTACAACCCAAACAGGACCCTCAATTTCTCACCTCAACCCCCATTAATTCTCTCAAAACCATTTACATCATGCAAAACACCCAGAGACTTGAAGCAACTCCACGCCATTTTCATCAAAACCGGTCAAATTCAAGACCCTCTCACGGCTGCTGAAGTTATTAAATTCTGTGCATTTTCATCCCGCGATATCGACTATGCCCGCGCCGTATTCCGCCAGATGCCGGAGCCCAATTGCTTCTGTTGGAACACCATTTTGAGAATTCTTGCTGAGACCAACGACGAACATCTCCAATCGGAGGCCTTGATGCTCTTTTCCGCAATGTTGTGCGATGGGCGTGTGAAGCCCAATCGGTTCACTTTCCCTTCCGTGTTGAAGGCCTGCGCTAGAGCTTCGAGGCTTCGAGAGGGGAAACAGATACATGGGTTGATTGTGAAGTTTGGATTTCATGAGGATGAGTTTGTAATTAGTAATCTCGTTCGGATGTATGTTATGTGTGCGGTCATGGAAGATGCGTATTCTCTGTTTTGTAAGAATGTTGTTGATTTTGATGGGTCTTGTCAAATGGAGTTAGATAAGAGGAAACAAGATGGTAATGTTGTTCTGTGGAATATAATGATTGATGGGCAGGTTAGACTTGGGGATATAAAGAGTGCTAAAAACTTGTTCGACGAAATGCCTCAAAGAAGTGTAGTGTCGTGGAATGTGATGATCTCGGGGTATGCACAAAATGGGCACTTCATAGAAGCCATAAACTTGTTTCAAGAAATGCAAAGTTCCAATATTGATCCAAACTACGTGACGTTAGTGAGTGTTTTGCCCGCCATTGCACGAATTGGTGCACTAGAATTGGGGAAATGGATCCATTTATATGCGGGAAATAATAAGATTGAGATTGATGATGTGCTTGGTTCTGCTCTGGTGGACATGTATTCCAAATGTGGTAGCATTGATAAGGCACTCCAGGTCTTTGAGACGCTGCCTAAGAGAAATGCCATTACTTGGAGTGCCATAATTGGTGCATTTGCTATGCACGGTCGCGCAGAGGATGCAATCATTCACTTCCATTTGATGGGAAAAGCTGGTGTGACGCCCAATGATGTTGCTTATATTGGAATTTTGAGTGCCTGCAGTCATGCAGGGCTTGTGGAAGAGGGTAGATCATTCTTTAGCCACATGGTTAAGGTGGTTGGGCTACAACCAAGAATTGAACACTATGGATGCATGGTCGATTTACTAGGCCGTGCTGGGCATCTGGAAGAGGCTGAGGAACTTATCAGAAACATGCCAATCGAACCAGATGATGTAATATGGAAAGCTTTGCTCGGTGCTTGTAAAATGCACAAGAACCTAAAAATGGGTGAGCGTGTAGCTGAGACTTTAATGGAGTTGGCTCCTCACGACAGTGGATCCTATGTTGCTCTGTCAAATTTGTATGCTTCTTTGGGAAACTGGGAGGCGGTTGCAAGGGTGAGGTTGAAGATGAAAGGGATGGACATTAGAAAAGACCCTGGATGTAGTTGGATTGAGATCCATGGAATAATCCATGAGTTTCTTGTAGAAGACGATTCACATTCTAAAGCTAAAGAAATCCAGGCCATGTTGGGAGAAATGTCCATGAAGTTAAGGTCGAATGGCTACAGACCAAACACCTTAGAAGTGTTTCTCAATACAGATGAACAAGAGAGGGCAAGGGCCTTGCAATATCATAGTGAGAAGATTGCAGTTGCCTTTGGACTAATCAGTACAGCCCCACAACATCCACTTAAGATTGTTAAGAACCTACGTATATGTGAAGACTGTCATGCTTCACTAAAACTTATTTCCTTGATTTACAAGCGTCAAATCATTGTGCGAGACCGAAAGCGATTCCATCAGTTTGAACATGGGTCATGCTCATGCATGGATTATTGGTAA

Coding sequence (CDS)

ATGAATTCCACAATTTACAACCCAAACAGGACCCTCAATTTCTCACCTCAACCCCCATTAATTCTCTCAAAACCATTTACATCATGCAAAACACCCAGAGACTTGAAGCAACTCCACGCCATTTTCATCAAAACCGGTCAAATTCAAGACCCTCTCACGGCTGCTGAAGTTATTAAATTCTGTGCATTTTCATCCCGCGATATCGACTATGCCCGCGCCGTATTCCGCCAGATGCCGGAGCCCAATTGCTTCTGTTGGAACACCATTTTGAGAATTCTTGCTGAGACCAACGACGAACATCTCCAATCGGAGGCCTTGATGCTCTTTTCCGCAATGTTGTGCGATGGGCGTGTGAAGCCCAATCGGTTCACTTTCCCTTCCGTGTTGAAGGCCTGCGCTAGAGCTTCGAGGCTTCGAGAGGGGAAACAGATACATGGGTTGATTGTGAAGTTTGGATTTCATGAGGATGAGTTTGTAATTAGTAATCTCGTTCGGATGTATGTTATGTGTGCGGTCATGGAAGATGCGTATTCTCTGTTTTGTAAGAATGTTGTTGATTTTGATGGGTCTTGTCAAATGGAGTTAGATAAGAGGAAACAAGATGGTAATGTTGTTCTGTGGAATATAATGATTGATGGGCAGGTTAGACTTGGGGATATAAAGAGTGCTAAAAACTTGTTCGACGAAATGCCTCAAAGAAGTGTAGTGTCGTGGAATGTGATGATCTCGGGGTATGCACAAAATGGGCACTTCATAGAAGCCATAAACTTGTTTCAAGAAATGCAAAGTTCCAATATTGATCCAAACTACGTGACGTTAGTGAGTGTTTTGCCCGCCATTGCACGAATTGGTGCACTAGAATTGGGGAAATGGATCCATTTATATGCGGGAAATAATAAGATTGAGATTGATGATGTGCTTGGTTCTGCTCTGGTGGACATGTATTCCAAATGTGGTAGCATTGATAAGGCACTCCAGGTCTTTGAGACGCTGCCTAAGAGAAATGCCATTACTTGGAGTGCCATAATTGGTGCATTTGCTATGCACGGTCGCGCAGAGGATGCAATCATTCACTTCCATTTGATGGGAAAAGCTGGTGTGACGCCCAATGATGTTGCTTATATTGGAATTTTGAGTGCCTGCAGTCATGCAGGGCTTGTGGAAGAGGGTAGATCATTCTTTAGCCACATGGTTAAGGTGGTTGGGCTACAACCAAGAATTGAACACTATGGATGCATGGTCGATTTACTAGGCCGTGCTGGGCATCTGGAAGAGGCTGAGGAACTTATCAGAAACATGCCAATCGAACCAGATGATGTAATATGGAAAGCTTTGCTCGGTGCTTGTAAAATGCACAAGAACCTAAAAATGGGTGAGCGTGTAGCTGAGACTTTAATGGAGTTGGCTCCTCACGACAGTGGATCCTATGTTGCTCTGTCAAATTTGTATGCTTCTTTGGGAAACTGGGAGGCGGTTGCAAGGGTGAGGTTGAAGATGAAAGGGATGGACATTAGAAAAGACCCTGGATGTAGTTGGATTGAGATCCATGGAATAATCCATGAGTTTCTTGTAGAAGACGATTCACATTCTAAAGCTAAAGAAATCCAGGCCATGTTGGGAGAAATGTCCATGAAGTTAAGGTCGAATGGCTACAGACCAAACACCTTAGAAGTGTTTCTCAATACAGATGAACAAGAGAGGGCAAGGGCCTTGCAATATCATAGTGAGAAGATTGCAGTTGCCTTTGGACTAATCAGTACAGCCCCACAACATCCACTTAAGATTGTTAAGAACCTACGTATATGTGAAGACTGTCATGCTTCACTAAAACTTATTTCCTTGATTTACAAGCGTCAAATCATTGTGCGAGACCGAAAGCGATTCCATCAGTTTGAACATGGGTCATGCTCATGCATGGATTATTGGTAA
BLAST of CSPI07G13960 vs. Swiss-Prot
Match: PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 1.7e-244
Identity = 410/649 (63.17%), Postives = 510/649 (78.58%), Query Frame = 1

Query: 7   NPNRTLNFSP----------QPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAE 66
           NP +TL FSP            P  L     +C+T RDL Q+HA+FIK+GQ++D L AAE
Sbjct: 2   NPTQTL-FSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAE 61

Query: 67  VIKFCAFSS---RDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAML 126
           +++FCA S    RD+DYA  +F QMP+ NCF WNTI+R  +E+ DE     A+ LF  M+
Sbjct: 62  ILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSES-DEDKALIAITLFYEMM 121

Query: 127 CDGRVKPNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVM 186
            D  V+PNRFTFPSVLKACA+  +++EGKQIHGL +K+GF  DEFV+SNLVRMYVMC  M
Sbjct: 122 SDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFM 181

Query: 187 EDAYSLFCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQR 246
           +DA  LF KN+++ D    +  D+RK+DG +VLWN+MIDG +RLGD K+A+ LFD+M QR
Sbjct: 182 KDARVLFYKNIIEKD--MVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQR 241

Query: 247 SVVSWNVMISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIH 306
           SVVSWN MISGY+ NG F +A+ +F+EM+  +I PNYVTLVSVLPAI+R+G+LELG+W+H
Sbjct: 242 SVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLH 301

Query: 307 LYAGNNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAE 366
           LYA ++ I IDDVLGSAL+DMYSKCG I+KA+ VFE LP+ N ITWSA+I  FA+HG+A 
Sbjct: 302 LYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAG 361

Query: 367 DAIIHFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCM 426
           DAI  F  M +AGV P+DVAYI +L+ACSH GLVEEGR +FS MV V GL+PRIEHYGCM
Sbjct: 362 DAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCM 421

Query: 427 VDLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDS 486
           VDLLGR+G L+EAEE I NMPI+PDDVIWKALLGAC+M  N++MG+RVA  LM++ PHDS
Sbjct: 422 VDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDS 481

Query: 487 GSYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAK 546
           G+YVALSN+YAS GNW  V+ +RL+MK  DIRKDPGCS I+I G++HEF+VEDDSH KAK
Sbjct: 482 GAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAK 541

Query: 547 EIQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPL 606
           EI +ML E+S KLR  GYRP T +V LN +E+++   L YHSEKIA AFGLIST+P  P+
Sbjct: 542 EINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPI 601

Query: 607 KIVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           +IVKNLRICEDCH+S+KLIS +YKR+I VRDRKRFH F+ GSCSCMDYW
Sbjct: 602 RIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CSPI07G13960 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 525.8 bits (1353), Expect = 6.7e-148
Identity = 258/617 (41.82%), Postives = 374/617 (60.62%), Query Frame = 1

Query: 29  CKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFC--AFSSRDIDYARAVFRQMPEPNCFCW 88
           C    +LKQ+HA  +KTG +QD     + + FC  + SS  + YA+ VF     P+ F W
Sbjct: 24  CSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDTFLW 83

Query: 89  NTILRILAETNDEHLQSEALMLFSAMLCDGRVKPNRFTFPSVLKACARASRLREGKQIHG 148
           N ++R  + +++      +L+L+  MLC      N +TFPS+LKAC+  S   E  QIH 
Sbjct: 84  NLMIRGFSCSDEPE---RSLLLYQRMLCSS-APHNAYTFPSLLKACSNLSAFEETTQIHA 143

Query: 149 LIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDGSCQMELDKRKQDGNVVL 208
            I K G+  D + +++L+  Y +    + A+ LF                 R  + + V 
Sbjct: 144 QITKLGYENDVYAVNSLINSYAVTGNFKLAHLLF----------------DRIPEPDDVS 203

Query: 209 WNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQNGHFIEAINLFQEMQSSNI 268
           WN +I G V+ G +  A  LF +M +++ +SW  MISGY Q     EA+ LF EMQ+S++
Sbjct: 204 WNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDV 263

Query: 269 DPNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEIDDVLGSALVDMYSKCGSIDKALQ 328
           +P+ V+L + L A A++GALE GKWIH Y    +I +D VLG  L+DMY+KCG +++AL+
Sbjct: 264 EPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALE 323

Query: 329 VFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPNDVAYIGILSACSHAGL 388
           VF+ + K++   W+A+I  +A HG   +AI  F  M K G+ PN + +  +L+ACS+ GL
Sbjct: 324 VFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGL 383

Query: 389 VEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEAEELIRNMPIEPDDVIWKALL 448
           VEEG+  F  M +   L+P IEHYGC+VDLLGRAG L+EA+  I+ MP++P+ VIW ALL
Sbjct: 384 VEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALL 443

Query: 449 GACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYASLGNWEAVARVRLKMKGMDIRK 508
            AC++HKN+++GE + E L+ + P+  G YV  +N++A    W+  A  R  MK   + K
Sbjct: 444 KACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAK 503

Query: 509 DPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKLRSNGYRPNTLEVFLN-TDEQ 568
            PGCS I + G  HEFL  D SH + ++IQ+    M  KL  NGY P   E+ L+  D+ 
Sbjct: 504 VPGCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDD 563

Query: 569 ERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICEDCHASLKLISLIYKRQIIVRDR 628
           ER   +  HSEK+A+ +GLI T P   ++I+KNLR+C+DCH   KLIS IYKR I++RDR
Sbjct: 564 EREAIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDR 620

Query: 629 KRFHQFEHGSCSCMDYW 643
            RFH F  G CSC DYW
Sbjct: 624 TRFHHFRDGKCSCGDYW 620

BLAST of CSPI07G13960 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 7.4e-139
Identity = 257/639 (40.22%), Postives = 387/639 (60.56%), Query Frame = 1

Query: 11  TLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFCAFSSRD--- 70
           TL F   P L L +   SC +  DLK +H   ++T  I D   A+ ++  C   S     
Sbjct: 8   TLRFK-HPKLALLQ---SCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKP 67

Query: 71  ---IDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKPNRFT 130
              + YA  +F Q+  PN F +N ++R  +   +    S+A   ++ ML   R+ P+  T
Sbjct: 68  TNLLGYAYGIFSQIQNPNLFVFNLLIRCFSTGAEP---SKAFGFYTQML-KSRIWPDNIT 127

Query: 131 FPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNV 190
           FP ++KA +    +  G+Q H  IV+FGF  D +V ++LV MY  C  +  A  +F    
Sbjct: 128 FPFLIKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIF---- 187

Query: 191 VDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISG 250
                  QM         +VV W  M+ G  + G +++A+ +FDEMP R++ +W++MI+G
Sbjct: 188 ------GQMGF------RDVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMING 247

Query: 251 YAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEID 310
           YA+N  F +AI+LF+ M+   +  N   +VSV+ + A +GALE G+  + Y   + + ++
Sbjct: 248 YAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVN 307

Query: 311 DVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGK 370
            +LG+ALVDM+ +CG I+KA+ VFE LP+ ++++WS+II   A+HG A  A+ +F  M  
Sbjct: 308 LILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMIS 367

Query: 371 AGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLE 430
            G  P DV +  +LSACSH GLVE+G   + +M K  G++PR+EHYGC+VD+LGRAG L 
Sbjct: 368 LGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLA 427

Query: 431 EAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYA 490
           EAE  I  M ++P+  I  ALLGACK++KN ++ ERV   L+++ P  SG YV LSN+YA
Sbjct: 428 EAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYA 487

Query: 491 SLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDD-SHSKAKEIQAMLGEMS 550
             G W+ +  +R  MK   ++K PG S IEI G I++F + DD  H +  +I+    E+ 
Sbjct: 488 CAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEIL 547

Query: 551 MKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICE 610
            K+R  GY+ NT + F + DE+E+  ++  HSEK+A+A+G++ T P   ++IVKNLR+CE
Sbjct: 548 GKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCE 607

Query: 611 DCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           DCH   KLIS +Y R++IVRDR RFH F +G CSC DYW
Sbjct: 608 DCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSCRDYW 622

BLAST of CSPI07G13960 vs. Swiss-Prot
Match: PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana GN=ELI1 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 8.2e-138
Identity = 258/636 (40.57%), Postives = 395/636 (62.11%), Query Frame = 1

Query: 11  TLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFC-AFSSRD-I 70
           T  F   PP  L+      ++  ++ Q+HA  ++   +  P      +K   A++S   I
Sbjct: 21  TARFRLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKI 80

Query: 71  DYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKPNRFTFPSV 130
            ++ A+F Q  +P+ F +   +   +      L+ +A +L+  +L    + PN FTF S+
Sbjct: 81  RHSLALFHQTIDPDLFLFTAAINTASING---LKDQAFLLYVQLL-SSEINPNEFTFSSL 140

Query: 131 LKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFD 190
           LK+C+  S    GK IH  ++KFG   D +V + LV +Y     +  A  +F        
Sbjct: 141 LKSCSTKS----GKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVF-------- 200

Query: 191 GSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQN 250
                    R  + ++V    MI    + G++++A+ LFD M +R +VSWNVMI GYAQ+
Sbjct: 201 --------DRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQH 260

Query: 251 GHFIEAINLFQEMQSSNID-PNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEIDDVL 310
           G   +A+ LFQ++ +     P+ +T+V+ L A ++IGALE G+WIH++  +++I ++  +
Sbjct: 261 GFPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKV 320

Query: 311 GSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLM-GKAG 370
            + L+DMYSKCGS+++A+ VF   P+++ + W+A+I  +AMHG ++DA+  F+ M G  G
Sbjct: 321 CTGLIDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITG 380

Query: 371 VTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEA 430
           + P D+ +IG L AC+HAGLV EG   F  M +  G++P+IEHYGC+V LLGRAG L+ A
Sbjct: 381 LQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRA 440

Query: 431 EELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYASL 490
            E I+NM ++ D V+W ++LG+CK+H +  +G+ +AE L+ L   +SG YV LSN+YAS+
Sbjct: 441 YETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASV 500

Query: 491 GNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKL 550
           G++E VA+VR  MK   I K+PG S IEI   +HEF   D  HSK+KEI  ML ++S ++
Sbjct: 501 GDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERI 560

Query: 551 RSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICEDCH 610
           +S+GY PNT  V  + +E E+ ++LQ HSE++A+A+GLIST P  PLKI KNLR+C DCH
Sbjct: 561 KSHGYVPNTNTVLQDLEETEKEQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCH 620

Query: 611 ASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
              KLIS I  R+I++RDR RFH F  GSCSC D+W
Sbjct: 621 TVTKLISKITGRKIVMRDRNRFHHFTDGSCSCGDFW 632

BLAST of CSPI07G13960 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 486.9 bits (1252), Expect = 3.5e-136
Identity = 248/609 (40.72%), Postives = 364/609 (59.77%), Query Frame = 1

Query: 36  KQLHAIFIKTGQIQDPLTAAEVIKFCAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAE 95
           + LH + +K+    D   A  +I  C FS  D+D A  VF  + E +   WN+++    +
Sbjct: 151 QSLHGMAVKSAVGSDVFVANSLIH-CYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 210

Query: 96  TNDEHLQSEALMLFSAMLCDGRVKPNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHE 155
                   +AL LF  M  +  VK +  T   VL ACA+   L  G+Q+   I +   + 
Sbjct: 211 KGSP---DKALELFKKMESED-VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNV 270

Query: 156 DEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQV 215
           +  + + ++ MY  C  +EDA  LF               D  ++  NV  W  M+DG  
Sbjct: 271 NLTLANAMLDMYTKCGSIEDAKRLF---------------DAMEEKDNVT-WTTMLDGYA 330

Query: 216 RLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQNGHFIEAINLFQEMQ-SSNIDPNYVTLV 275
              D ++A+ + + MPQ+ +V+WN +IS Y QNG   EA+ +F E+Q   N+  N +TLV
Sbjct: 331 ISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLV 390

Query: 276 SVLPAIARIGALELGKWIHLYAGNNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKR 335
           S L A A++GALELG+WIH Y   + I ++  + SAL+ MYSKCG ++K+ +VF ++ KR
Sbjct: 391 STLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKR 450

Query: 336 NAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFF 395
           +   WSA+IG  AMHG   +A+  F+ M +A V PN V +  +  ACSH GLV+E  S F
Sbjct: 451 DVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLF 510

Query: 396 SHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKN 455
             M    G+ P  +HY C+VD+LGR+G+LE+A + I  MPI P   +W ALLGACK+H N
Sbjct: 511 HQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHAN 570

Query: 456 LKMGERVAETLMELAPHDSGSYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIE 515
           L + E     L+EL P + G++V LSN+YA LG WE V+ +R  M+   ++K+PGCS IE
Sbjct: 571 LNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIE 630

Query: 516 IHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQE-RARALQY 575
           I G+IHEFL  D++H  ++++   L E+  KL+SNGY P   +V    +E+E + ++L  
Sbjct: 631 IDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNL 690

Query: 576 HSEKIAVAFGLISTAPQHPLKIVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEH 635
           HSEK+A+ +GLIST     ++++KNLR+C DCH+  KLIS +Y R+IIVRDR RFH F +
Sbjct: 691 HSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRN 738

Query: 636 GSCSCMDYW 643
           G CSC D+W
Sbjct: 751 GQCSCNDFW 738

BLAST of CSPI07G13960 vs. TrEMBL
Match: A0A0A0K543_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G336520 PE=4 SV=1)

HSP 1 Score: 964.5 bits (2492), Expect = 6.3e-278
Identity = 498/648 (76.85%), Postives = 526/648 (81.17%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTIYNPNRT NFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF
Sbjct: 1   MNSTIYNPNRTFNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60

Query: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKP 120
           CAFSSRDIDYARAVFRQMPEPNCFCWNTILR+LAETNDEHLQSEALMLFSAMLCDGRVKP
Sbjct: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRVLAETNDEHLQSEALMLFSAMLCDGRVKP 120

Query: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180
           NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF
Sbjct: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180

Query: 181 CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSV----V 240
           CKNVVDFDGSCQMELDKRKQD                     A NLF EM   ++    V
Sbjct: 181 CKNVVDFDGSCQMELDKRKQD--------------------EAINLFQEMQSSNIDPNYV 240

Query: 241 SWNVMISGYAQNG--HFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHL 300
           +   ++   A+ G     + I+L+       ID   V   +++   ++ G+++      L
Sbjct: 241 TLVSVLPAIARIGALELGKWIHLYAGKNKIEIDD--VLGSALVDMYSKCGSIDEA----L 300

Query: 301 YAGNNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAED 360
                  + + +  SA++  ++  G  + A+  F  + K                     
Sbjct: 301 QVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKA-------------------- 360

Query: 361 AIIHFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMV 420
                      GVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMV
Sbjct: 361 -----------GVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMV 420

Query: 421 DLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSG 480
           DLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSG
Sbjct: 421 DLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSG 480

Query: 481 SYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKE 540
           SYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKE
Sbjct: 481 SYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKE 540

Query: 541 IQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLK 600
           IQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLK
Sbjct: 541 IQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLK 591

Query: 601 IVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           IVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW
Sbjct: 601 IVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 591

BLAST of CSPI07G13960 vs. TrEMBL
Match: V4TTU6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019256mg PE=4 SV=1)

HSP 1 Score: 911.4 bits (2354), Expect = 6.3e-262
Identity = 439/641 (68.49%), Postives = 526/641 (82.06%), Query Frame = 1

Query: 3   STIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFCA 62
           +TIY P  T N SP P     +  T CK+ R+L Q+HA FIKTGQI+DPL AAE+++FCA
Sbjct: 4   ATIYKPTTT-NSSPHPSSQFPE-ITKCKSMRELTQVHAHFIKTGQIRDPLAAAEILRFCA 63

Query: 63  FSSR-DIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKPN 122
            S   D++YA  VF Q+ EPNCF +NTI+R  +E  D+     AL++F  M+ DG V PN
Sbjct: 64  VSDLGDLEYAHKVFTQIREPNCFSYNTIIRAFSECKDDDDSLHALLVFYQMVSDGLVLPN 123

Query: 123 RFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFC 182
           +FTFPSVLKACA+ +RLREGKQ+HGLIVKFG   DEFV+SNLVRMYVMC  M++A+ LF 
Sbjct: 124 KFTFPSVLKACAKTARLREGKQVHGLIVKFGLVYDEFVVSNLVRMYVMCGDMDNAHRLFY 183

Query: 183 KNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVM 242
           K+VV+F  +  +  D R+Q+G V+LWN+MIDG VRLG+ ++++ LFDEMPQRSVVSWNVM
Sbjct: 184 KSVVEFGNNGLLLRDTRRQEGYVILWNVMIDGYVRLGNFRASRALFDEMPQRSVVSWNVM 243

Query: 243 ISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNKI 302
           ISGYAQNG F EAI +F EMQ+ ++ PNYVTLVSVLPAI+R+GALELGKW+HLYA  N I
Sbjct: 244 ISGYAQNGQFREAIEMFLEMQNGDVCPNYVTLVSVLPAISRLGALELGKWVHLYAEKNAI 303

Query: 303 EIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHL 362
           EI+D+LGSAL+DMYSKCGSI+ A+QVFE +P+RNAI WSA+IG FAMHGRA+DA+  F  
Sbjct: 304 EINDILGSALIDMYSKCGSIENAIQVFERIPQRNAIAWSAMIGGFAMHGRAQDALDCFSR 363

Query: 363 MGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAG 422
           M +AGV P+DV YIG+LSACSHAGLVEEGR  F+HMV V GL+PRIEHYGCMVDLLGRAG
Sbjct: 364 MEQAGVKPSDVVYIGLLSACSHAGLVEEGRLMFNHMVNVTGLEPRIEHYGCMVDLLGRAG 423

Query: 423 HLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALSN 482
            LEEAEEL+ NMPIEPDDVIWKALLGACK H N++MGERVA+ LM+LAP+DSG+YV +SN
Sbjct: 424 LLEEAEELVLNMPIEPDDVIWKALLGACKTHGNIEMGERVAKVLMKLAPNDSGAYVGISN 483

Query: 483 LYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLGE 542
           +YAS GNWE VA VRLKMK MDIRKDPGCSWIE++G+IHEFLVEDDSH KAKEI +ML E
Sbjct: 484 IYASSGNWEGVAEVRLKMKQMDIRKDPGCSWIELNGMIHEFLVEDDSHPKAKEIHSMLEE 543

Query: 543 MSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRI 602
           +S +L  +GYRP T +V LN +E+ +  AL YHSEKIA+ FGLIST P   L+IVKNLRI
Sbjct: 544 ISSRLSLSGYRPKTTQVLLNMEEEGKESALHYHSEKIAIGFGLISTNPGTTLRIVKNLRI 603

Query: 603 CEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           CEDCH+SLKLIS IYKR+IIVRDRKRFH FE+GSCSCMDYW
Sbjct: 604 CEDCHSSLKLISKIYKRKIIVRDRKRFHHFENGSCSCMDYW 642

BLAST of CSPI07G13960 vs. TrEMBL
Match: M5VIK6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002774mg PE=4 SV=1)

HSP 1 Score: 909.1 bits (2348), Expect = 3.1e-261
Identity = 447/643 (69.52%), Postives = 524/643 (81.49%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTI+ P       P  P  L    T+CKT RDL Q+HA+FIKT QI DPL AAE+++F
Sbjct: 1   MNSTIFKPTTP---PPSHPSSLFPQITACKTIRDLHQVHALFIKTRQIHDPLAAAEILRF 60

Query: 61  CAFSS-RDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVK 120
            A S+ R+I++ARAVF  M  PNCF WNTI+R LAE++ +    EAL+LFS M+  G V 
Sbjct: 61  YALSAHRNIEWARAVFNHMQRPNCFSWNTIIRALAESSVDEHPLEALLLFSQMVSYGFVG 120

Query: 121 PNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSL 180
           PNRFTFPSVLKACA+   L  GK +HG++VKFG   DEFV+SNLVRMYVMC VMEDA+ L
Sbjct: 121 PNRFTFPSVLKACAKMGNLGVGKCVHGMVVKFGLDTDEFVVSNLVRMYVMCKVMEDAHLL 180

Query: 181 FCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWN 240
           F ++VV         L++RKQ+GNVVLWN+++DG VR+GD+++A+ LFD+MPQRSVVSWN
Sbjct: 181 FSRSVVVCG-----HLNERKQEGNVVLWNVIVDGYVRVGDVRAARVLFDKMPQRSVVSWN 240

Query: 241 VMISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNN 300
           VMISGYAQNG F EAI+LF++MQ  N+ PNYVTLVSVLPAI+R+GALELGKWIHLYAG N
Sbjct: 241 VMISGYAQNGFFREAIDLFRDMQIENVYPNYVTLVSVLPAISRLGALELGKWIHLYAGKN 300

Query: 301 KIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHF 360
           +IEIDDVLGSALVDMYSKCGSI+KAL VFE LPKRN ITW+AII   AMHGR EDA+ +F
Sbjct: 301 RIEIDDVLGSALVDMYSKCGSIEKALLVFEKLPKRNVITWNAIISGLAMHGRVEDALDYF 360

Query: 361 HLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGR 420
             M  AGV P+DV YIGILSACSHAGLVE+GRSFF+ MV V+ L+PRIEHYGCMVDLLGR
Sbjct: 361 KKMEPAGVVPSDVTYIGILSACSHAGLVEQGRSFFNRMVNVISLEPRIEHYGCMVDLLGR 420

Query: 421 AGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVAL 480
           AG LEEAEELI NMPI+PDDV WKALLGACK   N+ MG+RVAE LM+LAPHDSGSYVAL
Sbjct: 421 AGLLEEAEELILNMPIQPDDVTWKALLGACKKQGNIDMGKRVAEVLMDLAPHDSGSYVAL 480

Query: 481 SNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAML 540
           SN+YASLGNWEAVA+VRL+MK MDIRKDPG S IE+ G+IHEF+VED+SH +A+EI +ML
Sbjct: 481 SNMYASLGNWEAVAKVRLQMKDMDIRKDPGGSSIELDGVIHEFVVEDESHPRAREIHSML 540

Query: 541 GEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNL 600
            E+S +L   G+RP+T +V LN DE+E+   L YHSEKIA AFGLISTAPQ PL+IVKNL
Sbjct: 541 EEISNQLSLEGHRPDTTQVLLNMDEEEKQSVLHYHSEKIATAFGLISTAPQTPLRIVKNL 600

Query: 601 RICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           RICEDCH+SLKLIS IY+R IIVRDRKRFH FE G CSCMDYW
Sbjct: 601 RICEDCHSSLKLISKIYERMIIVRDRKRFHHFEQGLCSCMDYW 635

BLAST of CSPI07G13960 vs. TrEMBL
Match: A0A061FRW3_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_036393 PE=4 SV=1)

HSP 1 Score: 903.3 bits (2333), Expect = 1.7e-259
Identity = 431/644 (66.93%), Postives = 523/644 (81.21%), Query Frame = 1

Query: 3   STIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFCA 62
           +TIYN N     S  P  +  +    CKT RDL Q+HAI +KTGQI DPL AAE++KFC+
Sbjct: 4   TTIYNSNAAAAASTHPSSLFPQ-IQRCKTMRDLHQVHAIVLKTGQIHDPLAAAEILKFCS 63

Query: 63  FSS-RDIDYARAVFRQMPEPNCFCWNTILRILAET---NDEHLQSEALMLFSAMLCDGRV 122
             + RDIDYAR VFRQM EPNCF WNTI+R L E+   N+ +   EAL LF+ M+ DG V
Sbjct: 64  LGTHRDIDYARKVFRQMGEPNCFSWNTIIRALTESDESNETNEPLEALFLFTEMVADGNV 123

Query: 123 KPNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYS 182
            PNRFTFPSVLKACAR  +L EG+Q+HGL+VKFGF +DEFV SNLVR+YVMC  ME+A+ 
Sbjct: 124 LPNRFTFPSVLKACARTGKLPEGEQVHGLVVKFGFEKDEFVASNLVRVYVMCGAMEEAHI 183

Query: 183 LFCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSW 242
           L  K +V+F+   ++  DKR+ +GN+VLWN+MIDG VR+GD+++A+ LFD+M  RSV+SW
Sbjct: 184 LLNKMMVEFENGGKLVRDKRRIEGNIVLWNVMIDGYVRIGDLRTARELFDKMSLRSVISW 243

Query: 243 NVMISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGN 302
           NVMISGYAQNG+F EAI +F+ MQ   + PNYVTLVSVLPAI+R+GALELGKW+HLYA  
Sbjct: 244 NVMISGYAQNGYFKEAIEMFRLMQIGEVRPNYVTLVSVLPAISRLGALELGKWVHLYAEK 303

Query: 303 NKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIH 362
           N+IEIDDVLGSAL+DMYSKCGSIDKA+QVFE + K N ITWSA+IG  AMHGRAE A+ +
Sbjct: 304 NEIEIDDVLGSALIDMYSKCGSIDKAVQVFERISKPNTITWSAMIGGLAMHGRAEGALDY 363

Query: 363 FHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLG 422
           F  M   GVTP+DV YIG+LSACSHAG VEEGR FF+HMV VVG +PR+EHYGCMVDLLG
Sbjct: 364 FSRMELEGVTPSDVVYIGVLSACSHAGFVEEGRLFFNHMVNVVGFEPRLEHYGCMVDLLG 423

Query: 423 RAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVA 482
           RAG L+EAEE I NMPIEPDDVIWKALLGACKMH N++MG+ VA  LM +AP DSG+YVA
Sbjct: 424 RAGLLKEAEEFILNMPIEPDDVIWKALLGACKMHGNIEMGDHVAGILMNMAPRDSGAYVA 483

Query: 483 LSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAM 542
           LSN+YAS  +WE+VARVRLKMK MD+RKDPGCS+IE+ G++HEFLVEDDSH +AKEI +M
Sbjct: 484 LSNIYASSRDWESVARVRLKMKEMDVRKDPGCSFIELDGVVHEFLVEDDSHPRAKEIHSM 543

Query: 543 LGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKN 602
           L E++ ++R  GY+P+T  V LN DE+E+   L YHSE+IA+AFGLIST+P   L+IVKN
Sbjct: 544 LEEIAEQMRLVGYKPDTRPVLLNIDEEEKESTLYYHSERIAIAFGLISTSPGTTLRIVKN 603

Query: 603 LRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           LR+CEDCH+S+KLIS IYKR+IIVRD+KRFH FE+GSCSCMDYW
Sbjct: 604 LRVCEDCHSSIKLISKIYKRKIIVRDQKRFHHFENGSCSCMDYW 646

BLAST of CSPI07G13960 vs. TrEMBL
Match: A0A067K2C4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17578 PE=4 SV=1)

HSP 1 Score: 899.8 bits (2324), Expect = 1.9e-258
Identity = 431/632 (68.20%), Postives = 515/632 (81.49%), Query Frame = 1

Query: 16  PQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFCAFSSR-DIDYARAV 75
           P  P  L      CK+ + LKQ+HA FIKTG I DPL AAE++KF + S R D+ YAR  
Sbjct: 13  PTHPSSLFPQIARCKSIKQLKQIHAHFIKTGLIGDPLAAAEILKFLSVSDRRDLKYARKF 72

Query: 76  FRQMPEPNCFCWNTILRILAETNDEHLQS--EALMLFSAMLCDGRVKPNRFTFPSVLKAC 135
           F QM  PNCF WNTI+R  AET+D+  ++  EAL  F  M  +G V+PNRFTFPSVLKAC
Sbjct: 73  FTQMNNPNCFSWNTIIRAFAETDDDDYKNPLEALGFFGQMCSEGLVEPNRFTFPSVLKAC 132

Query: 136 ARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDG-SC 195
           A+  R++EGK+IHG +VK G   DEFV SNLVRMY MC VMEDAY LF   V  FD  S 
Sbjct: 133 AKMGRIQEGKEIHGFVVKLGLDNDEFVASNLVRMYAMCGVMEDAYLLFSNYVSHFDNNST 192

Query: 196 QMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQNGHF 255
           ++  +KR Q+G VVLWN+MIDG VRLGDI +++ LF++MPQRSVVSWNVMISGYAQNG F
Sbjct: 193 KLVRNKRMQEGVVVLWNVMIDGFVRLGDIGASRKLFNKMPQRSVVSWNVMISGYAQNGFF 252

Query: 256 IEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEIDDVLGSAL 315
            EA+++F +MQ  ++ PNY+TLVSVLPAI+R+GALELGKW+HLYA  N+IEIDDVLGSA+
Sbjct: 253 KEAMDVFHDMQMGDVSPNYITLVSVLPAISRLGALELGKWVHLYAEKNEIEIDDVLGSAV 312

Query: 316 VDMYSKCGSIDKALQVFETLP-KRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPN 375
           +DMY+KCGS++KA+QVFE +  K+NAITWSAIIG  AMHGRA DA+ ++  M +AGVTP 
Sbjct: 313 IDMYAKCGSVEKAIQVFEKIENKKNAITWSAIIGGLAMHGRANDALDYYRKMQQAGVTPT 372

Query: 376 DVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEAEELI 435
           DV YIG+LSACSHAGL+EEGRS F+HMVKVVG++PR+EHYGCMVDLLGRAG LEEAE+L+
Sbjct: 373 DVVYIGLLSACSHAGLIEEGRSLFNHMVKVVGIEPRVEHYGCMVDLLGRAGLLEEAEQLV 432

Query: 436 RNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYASLGNWE 495
            NMPI PDDVIWKALLGACKMH N+KMGERVA TLM+L PHDSGSYVALSN++AS GNW 
Sbjct: 433 LNMPIRPDDVIWKALLGACKMHGNVKMGERVARTLMKLFPHDSGSYVALSNIFASRGNWV 492

Query: 496 AVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKLRSNG 555
            V  VRLKMK MD+RKDPGCSWIEI G+IHEFLVED+SH +AKEI++ML E+S ++RS G
Sbjct: 493 GVVEVRLKMKEMDVRKDPGCSWIEIDGVIHEFLVEDESHPRAKEIRSMLEEISNRIRSAG 552

Query: 556 YRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICEDCHASLK 615
           YRPN  +V LN DE+++  AL YHSE+IA+AFGLIST PQ PL+IVKNLR+CEDCH+S+K
Sbjct: 553 YRPNITQVLLNMDEEKKESALHYHSERIAIAFGLISTRPQTPLRIVKNLRVCEDCHSSIK 612

Query: 616 LISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           LIS IYKR+IIVRDRKRFH FE G CSCMDYW
Sbjct: 613 LISEIYKRKIIVRDRKRFHHFEKGVCSCMDYW 644

BLAST of CSPI07G13960 vs. TAIR10
Match: AT5G48910.1 (AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 846.7 bits (2186), Expect = 9.7e-246
Identity = 410/649 (63.17%), Postives = 510/649 (78.58%), Query Frame = 1

Query: 7   NPNRTLNFSP----------QPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAE 66
           NP +TL FSP            P  L     +C+T RDL Q+HA+FIK+GQ++D L AAE
Sbjct: 2   NPTQTL-FSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAE 61

Query: 67  VIKFCAFSS---RDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAML 126
           +++FCA S    RD+DYA  +F QMP+ NCF WNTI+R  +E+ DE     A+ LF  M+
Sbjct: 62  ILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSES-DEDKALIAITLFYEMM 121

Query: 127 CDGRVKPNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVM 186
            D  V+PNRFTFPSVLKACA+  +++EGKQIHGL +K+GF  DEFV+SNLVRMYVMC  M
Sbjct: 122 SDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFM 181

Query: 187 EDAYSLFCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQR 246
           +DA  LF KN+++ D    +  D+RK+DG +VLWN+MIDG +RLGD K+A+ LFD+M QR
Sbjct: 182 KDARVLFYKNIIEKD--MVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQR 241

Query: 247 SVVSWNVMISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIH 306
           SVVSWN MISGY+ NG F +A+ +F+EM+  +I PNYVTLVSVLPAI+R+G+LELG+W+H
Sbjct: 242 SVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLH 301

Query: 307 LYAGNNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAE 366
           LYA ++ I IDDVLGSAL+DMYSKCG I+KA+ VFE LP+ N ITWSA+I  FA+HG+A 
Sbjct: 302 LYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAG 361

Query: 367 DAIIHFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCM 426
           DAI  F  M +AGV P+DVAYI +L+ACSH GLVEEGR +FS MV V GL+PRIEHYGCM
Sbjct: 362 DAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCM 421

Query: 427 VDLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDS 486
           VDLLGR+G L+EAEE I NMPI+PDDVIWKALLGAC+M  N++MG+RVA  LM++ PHDS
Sbjct: 422 VDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDS 481

Query: 487 GSYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAK 546
           G+YVALSN+YAS GNW  V+ +RL+MK  DIRKDPGCS I+I G++HEF+VEDDSH KAK
Sbjct: 482 GAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAK 541

Query: 547 EIQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPL 606
           EI +ML E+S KLR  GYRP T +V LN +E+++   L YHSEKIA AFGLIST+P  P+
Sbjct: 542 EINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPI 601

Query: 607 KIVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           +IVKNLRICEDCH+S+KLIS +YKR+I VRDRKRFH F+ GSCSCMDYW
Sbjct: 602 RIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CSPI07G13960 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 525.8 bits (1353), Expect = 3.8e-149
Identity = 258/617 (41.82%), Postives = 374/617 (60.62%), Query Frame = 1

Query: 29  CKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFC--AFSSRDIDYARAVFRQMPEPNCFCW 88
           C    +LKQ+HA  +KTG +QD     + + FC  + SS  + YA+ VF     P+ F W
Sbjct: 24  CSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDTFLW 83

Query: 89  NTILRILAETNDEHLQSEALMLFSAMLCDGRVKPNRFTFPSVLKACARASRLREGKQIHG 148
           N ++R  + +++      +L+L+  MLC      N +TFPS+LKAC+  S   E  QIH 
Sbjct: 84  NLMIRGFSCSDEPE---RSLLLYQRMLCSS-APHNAYTFPSLLKACSNLSAFEETTQIHA 143

Query: 149 LIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDGSCQMELDKRKQDGNVVL 208
            I K G+  D + +++L+  Y +    + A+ LF                 R  + + V 
Sbjct: 144 QITKLGYENDVYAVNSLINSYAVTGNFKLAHLLF----------------DRIPEPDDVS 203

Query: 209 WNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQNGHFIEAINLFQEMQSSNI 268
           WN +I G V+ G +  A  LF +M +++ +SW  MISGY Q     EA+ LF EMQ+S++
Sbjct: 204 WNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDV 263

Query: 269 DPNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEIDDVLGSALVDMYSKCGSIDKALQ 328
           +P+ V+L + L A A++GALE GKWIH Y    +I +D VLG  L+DMY+KCG +++AL+
Sbjct: 264 EPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALE 323

Query: 329 VFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPNDVAYIGILSACSHAGL 388
           VF+ + K++   W+A+I  +A HG   +AI  F  M K G+ PN + +  +L+ACS+ GL
Sbjct: 324 VFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGL 383

Query: 389 VEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEAEELIRNMPIEPDDVIWKALL 448
           VEEG+  F  M +   L+P IEHYGC+VDLLGRAG L+EA+  I+ MP++P+ VIW ALL
Sbjct: 384 VEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALL 443

Query: 449 GACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYASLGNWEAVARVRLKMKGMDIRK 508
            AC++HKN+++GE + E L+ + P+  G YV  +N++A    W+  A  R  MK   + K
Sbjct: 444 KACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAK 503

Query: 509 DPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKLRSNGYRPNTLEVFLN-TDEQ 568
            PGCS I + G  HEFL  D SH + ++IQ+    M  KL  NGY P   E+ L+  D+ 
Sbjct: 504 VPGCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDD 563

Query: 569 ERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICEDCHASLKLISLIYKRQIIVRDR 628
           ER   +  HSEK+A+ +GLI T P   ++I+KNLR+C+DCH   KLIS IYKR I++RDR
Sbjct: 564 EREAIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDR 620

Query: 629 KRFHQFEHGSCSCMDYW 643
            RFH F  G CSC DYW
Sbjct: 624 TRFHHFRDGKCSCGDYW 620

BLAST of CSPI07G13960 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 495.7 bits (1275), Expect = 4.2e-140
Identity = 257/639 (40.22%), Postives = 387/639 (60.56%), Query Frame = 1

Query: 11  TLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFCAFSSRD--- 70
           TL F   P L L +   SC +  DLK +H   ++T  I D   A+ ++  C   S     
Sbjct: 8   TLRFK-HPKLALLQ---SCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKP 67

Query: 71  ---IDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKPNRFT 130
              + YA  +F Q+  PN F +N ++R  +   +    S+A   ++ ML   R+ P+  T
Sbjct: 68  TNLLGYAYGIFSQIQNPNLFVFNLLIRCFSTGAEP---SKAFGFYTQML-KSRIWPDNIT 127

Query: 131 FPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNV 190
           FP ++KA +    +  G+Q H  IV+FGF  D +V ++LV MY  C  +  A  +F    
Sbjct: 128 FPFLIKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIF---- 187

Query: 191 VDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISG 250
                  QM         +VV W  M+ G  + G +++A+ +FDEMP R++ +W++MI+G
Sbjct: 188 ------GQMGF------RDVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMING 247

Query: 251 YAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEID 310
           YA+N  F +AI+LF+ M+   +  N   +VSV+ + A +GALE G+  + Y   + + ++
Sbjct: 248 YAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVN 307

Query: 311 DVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGK 370
            +LG+ALVDM+ +CG I+KA+ VFE LP+ ++++WS+II   A+HG A  A+ +F  M  
Sbjct: 308 LILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMIS 367

Query: 371 AGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLE 430
            G  P DV +  +LSACSH GLVE+G   + +M K  G++PR+EHYGC+VD+LGRAG L 
Sbjct: 368 LGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLA 427

Query: 431 EAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYA 490
           EAE  I  M ++P+  I  ALLGACK++KN ++ ERV   L+++ P  SG YV LSN+YA
Sbjct: 428 EAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYA 487

Query: 491 SLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDD-SHSKAKEIQAMLGEMS 550
             G W+ +  +R  MK   ++K PG S IEI G I++F + DD  H +  +I+    E+ 
Sbjct: 488 CAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEIL 547

Query: 551 MKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICE 610
            K+R  GY+ NT + F + DE+E+  ++  HSEK+A+A+G++ T P   ++IVKNLR+CE
Sbjct: 548 GKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCE 607

Query: 611 DCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           DCH   KLIS +Y R++IVRDR RFH F +G CSC DYW
Sbjct: 608 DCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSCRDYW 622

BLAST of CSPI07G13960 vs. TAIR10
Match: AT4G37380.1 (AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 492.3 bits (1266), Expect = 4.6e-139
Identity = 258/636 (40.57%), Postives = 395/636 (62.11%), Query Frame = 1

Query: 11  TLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKFC-AFSSRD-I 70
           T  F   PP  L+      ++  ++ Q+HA  ++   +  P      +K   A++S   I
Sbjct: 21  TARFRLPPPEKLAVLIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKI 80

Query: 71  DYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKPNRFTFPSV 130
            ++ A+F Q  +P+ F +   +   +      L+ +A +L+  +L    + PN FTF S+
Sbjct: 81  RHSLALFHQTIDPDLFLFTAAINTASING---LKDQAFLLYVQLL-SSEINPNEFTFSSL 140

Query: 131 LKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFD 190
           LK+C+  S    GK IH  ++KFG   D +V + LV +Y     +  A  +F        
Sbjct: 141 LKSCSTKS----GKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVF-------- 200

Query: 191 GSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQN 250
                    R  + ++V    MI    + G++++A+ LFD M +R +VSWNVMI GYAQ+
Sbjct: 201 --------DRMPERSLVSSTAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQH 260

Query: 251 GHFIEAINLFQEMQSSNID-PNYVTLVSVLPAIARIGALELGKWIHLYAGNNKIEIDDVL 310
           G   +A+ LFQ++ +     P+ +T+V+ L A ++IGALE G+WIH++  +++I ++  +
Sbjct: 261 GFPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKV 320

Query: 311 GSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLM-GKAG 370
            + L+DMYSKCGS+++A+ VF   P+++ + W+A+I  +AMHG ++DA+  F+ M G  G
Sbjct: 321 CTGLIDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITG 380

Query: 371 VTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEA 430
           + P D+ +IG L AC+HAGLV EG   F  M +  G++P+IEHYGC+V LLGRAG L+ A
Sbjct: 381 LQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRA 440

Query: 431 EELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALSNLYASL 490
            E I+NM ++ D V+W ++LG+CK+H +  +G+ +AE L+ L   +SG YV LSN+YAS+
Sbjct: 441 YETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASV 500

Query: 491 GNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKL 550
           G++E VA+VR  MK   I K+PG S IEI   +HEF   D  HSK+KEI  ML ++S ++
Sbjct: 501 GDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGDREHSKSKEIYTMLRKISERI 560

Query: 551 RSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLRICEDCH 610
           +S+GY PNT  V  + +E E+ ++LQ HSE++A+A+GLIST P  PLKI KNLR+C DCH
Sbjct: 561 KSHGYVPNTNTVLQDLEETEKEQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCH 620

Query: 611 ASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
              KLIS I  R+I++RDR RFH F  GSCSC D+W
Sbjct: 621 TVTKLISKITGRKIVMRDRNRFHHFTDGSCSCGDFW 632

BLAST of CSPI07G13960 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 486.9 bits (1252), Expect = 1.9e-137
Identity = 248/609 (40.72%), Postives = 364/609 (59.77%), Query Frame = 1

Query: 36  KQLHAIFIKTGQIQDPLTAAEVIKFCAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAE 95
           + LH + +K+    D   A  +I  C FS  D+D A  VF  + E +   WN+++    +
Sbjct: 151 QSLHGMAVKSAVGSDVFVANSLIH-CYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQ 210

Query: 96  TNDEHLQSEALMLFSAMLCDGRVKPNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHE 155
                   +AL LF  M  +  VK +  T   VL ACA+   L  G+Q+   I +   + 
Sbjct: 211 KGSP---DKALELFKKMESED-VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNV 270

Query: 156 DEFVISNLVRMYVMCAVMEDAYSLFCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQV 215
           +  + + ++ MY  C  +EDA  LF               D  ++  NV  W  M+DG  
Sbjct: 271 NLTLANAMLDMYTKCGSIEDAKRLF---------------DAMEEKDNVT-WTTMLDGYA 330

Query: 216 RLGDIKSAKNLFDEMPQRSVVSWNVMISGYAQNGHFIEAINLFQEMQ-SSNIDPNYVTLV 275
              D ++A+ + + MPQ+ +V+WN +IS Y QNG   EA+ +F E+Q   N+  N +TLV
Sbjct: 331 ISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLV 390

Query: 276 SVLPAIARIGALELGKWIHLYAGNNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKR 335
           S L A A++GALELG+WIH Y   + I ++  + SAL+ MYSKCG ++K+ +VF ++ KR
Sbjct: 391 STLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKR 450

Query: 336 NAITWSAIIGAFAMHGRAEDAIIHFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFF 395
           +   WSA+IG  AMHG   +A+  F+ M +A V PN V +  +  ACSH GLV+E  S F
Sbjct: 451 DVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLF 510

Query: 396 SHMVKVVGLQPRIEHYGCMVDLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKN 455
             M    G+ P  +HY C+VD+LGR+G+LE+A + I  MPI P   +W ALLGACK+H N
Sbjct: 511 HQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHAN 570

Query: 456 LKMGERVAETLMELAPHDSGSYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIE 515
           L + E     L+EL P + G++V LSN+YA LG WE V+ +R  M+   ++K+PGCS IE
Sbjct: 571 LNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIE 630

Query: 516 IHGIIHEFLVEDDSHSKAKEIQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQE-RARALQY 575
           I G+IHEFL  D++H  ++++   L E+  KL+SNGY P   +V    +E+E + ++L  
Sbjct: 631 IDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNL 690

Query: 576 HSEKIAVAFGLISTAPQHPLKIVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEH 635
           HSEK+A+ +GLIST     ++++KNLR+C DCH+  KLIS +Y R+IIVRDR RFH F +
Sbjct: 691 HSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRN 738

Query: 636 GSCSCMDYW 643
           G CSC D+W
Sbjct: 751 GQCSCNDFW 738

BLAST of CSPI07G13960 vs. NCBI nr
Match: gi|449443909|ref|XP_004139718.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48910 isoform X2 [Cucumis sativus])

HSP 1 Score: 1310.8 bits (3391), Expect = 0.0e+00
Identity = 638/642 (99.38%), Postives = 640/642 (99.69%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTIYNPNRT NFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF
Sbjct: 1   MNSTIYNPNRTFNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60

Query: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKP 120
           CAFSSRDIDYARAVFRQMPEPNCFCWNTILR+LAETNDEHLQSEALMLFSAMLCDGRVKP
Sbjct: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRVLAETNDEHLQSEALMLFSAMLCDGRVKP 120

Query: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180
           NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF
Sbjct: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180

Query: 181 CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV 240
           CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV
Sbjct: 181 CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV 240

Query: 241 MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNK 300
           MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAG NK
Sbjct: 241 MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGKNK 300

Query: 301 IEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH 360
           IEIDDVLGSALVDMYSKCGSID+ALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH
Sbjct: 301 IEIDDVLGSALVDMYSKCGSIDEALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH 360

Query: 361 LMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRA 420
           LMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRA
Sbjct: 361 LMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRA 420

Query: 421 GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS 480
           GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS
Sbjct: 421 GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS 480

Query: 481 NLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLG 540
           NLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLG
Sbjct: 481 NLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLG 540

Query: 541 EMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLR 600
           EMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLR
Sbjct: 541 EMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLR 600

Query: 601 ICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           ICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW
Sbjct: 601 ICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 642

BLAST of CSPI07G13960 vs. NCBI nr
Match: gi|659116224|ref|XP_008457972.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48910 isoform X2 [Cucumis melo])

HSP 1 Score: 1264.6 bits (3271), Expect = 0.0e+00
Identity = 616/642 (95.95%), Postives = 628/642 (97.82%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTIYNPNRTLN S QP LILSK  TSCKTPRDLKQLHA FIKTGQIQDPLTAAEVIKF
Sbjct: 1   MNSTIYNPNRTLNSSAQPRLILSKALTSCKTPRDLKQLHASFIKTGQIQDPLTAAEVIKF 60

Query: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKP 120
           CAFSSRDIDYARA+F QMPEPNCFCWNTILR+LAETNDEH QSEALMLFSAMLCDGRVKP
Sbjct: 61  CAFSSRDIDYARAIFCQMPEPNCFCWNTILRVLAETNDEHHQSEALMLFSAMLCDGRVKP 120

Query: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180
           NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEF+IS+LVRMYVMCAV+EDAYSLF
Sbjct: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFIISSLVRMYVMCAVIEDAYSLF 180

Query: 181 CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV 240
            KNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV
Sbjct: 181 SKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV 240

Query: 241 MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNK 300
           MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAG NK
Sbjct: 241 MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGKNK 300

Query: 301 IEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH 360
           IEIDDVLGSALVDMYSKCGSI+KALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH
Sbjct: 301 IEIDDVLGSALVDMYSKCGSIEKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH 360

Query: 361 LMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRA 420
           L+GKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQP+IEHYGCMVDLLGRA
Sbjct: 361 LLGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPKIEHYGCMVDLLGRA 420

Query: 421 GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS 480
           GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS
Sbjct: 421 GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS 480

Query: 481 NLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLG 540
           NLYASLGNWEAVARVRLKMK MDIRKDPGCSWIEIHGIIHEFL EDDSHS+AKEIQAMLG
Sbjct: 481 NLYASLGNWEAVARVRLKMKEMDIRKDPGCSWIEIHGIIHEFLAEDDSHSQAKEIQAMLG 540

Query: 541 EMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLR 600
           EMSMKLR NGYRPNTL+VFLNTDEQE+ARALQYHSEKIAVAFGLISTAP+HPLKIVKNLR
Sbjct: 541 EMSMKLRLNGYRPNTLDVFLNTDEQEKARALQYHSEKIAVAFGLISTAPKHPLKIVKNLR 600

Query: 601 ICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           ICEDCHASLKLISLIYKRQIIVRDRKRFH FEHGSCSCMDYW
Sbjct: 601 ICEDCHASLKLISLIYKRQIIVRDRKRFHHFEHGSCSCMDYW 642

BLAST of CSPI07G13960 vs. NCBI nr
Match: gi|659116220|ref|XP_008457970.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48910 isoform X1 [Cucumis melo])

HSP 1 Score: 1253.0 bits (3241), Expect = 0.0e+00
Identity = 612/638 (95.92%), Postives = 624/638 (97.81%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTIYNPNRTLN S QP LILSK  TSCKTPRDLKQLHA FIKTGQIQDPLTAAEVIKF
Sbjct: 1   MNSTIYNPNRTLNSSAQPRLILSKALTSCKTPRDLKQLHASFIKTGQIQDPLTAAEVIKF 60

Query: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKP 120
           CAFSSRDIDYARA+F QMPEPNCFCWNTILR+LAETNDEH QSEALMLFSAMLCDGRVKP
Sbjct: 61  CAFSSRDIDYARAIFCQMPEPNCFCWNTILRVLAETNDEHHQSEALMLFSAMLCDGRVKP 120

Query: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180
           NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEF+IS+LVRMYVMCAV+EDAYSLF
Sbjct: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFIISSLVRMYVMCAVIEDAYSLF 180

Query: 181 CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV 240
            KNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV
Sbjct: 181 SKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVSWNV 240

Query: 241 MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGNNK 300
           MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAG NK
Sbjct: 241 MISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAGKNK 300

Query: 301 IEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH 360
           IEIDDVLGSALVDMYSKCGSI+KALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH
Sbjct: 301 IEIDDVLGSALVDMYSKCGSIEKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFH 360

Query: 361 LMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLLGRA 420
           L+GKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQP+IEHYGCMVDLLGRA
Sbjct: 361 LLGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPKIEHYGCMVDLLGRA 420

Query: 421 GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS 480
           GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS
Sbjct: 421 GHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYVALS 480

Query: 481 NLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQAMLG 540
           NLYASLGNWEAVARVRLKMK MDIRKDPGCSWIEIHGIIHEFL EDDSHS+AKEIQAMLG
Sbjct: 481 NLYASLGNWEAVARVRLKMKEMDIRKDPGCSWIEIHGIIHEFLAEDDSHSQAKEIQAMLG 540

Query: 541 EMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVKNLR 600
           EMSMKLR NGYRPNTL+VFLNTDEQE+ARALQYHSEKIAVAFGLISTAP+HPLKIVKNLR
Sbjct: 541 EMSMKLRLNGYRPNTLDVFLNTDEQEKARALQYHSEKIAVAFGLISTAPKHPLKIVKNLR 600

Query: 601 ICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSC 639
           ICEDCHASLKLISLIYKRQIIVRDRKRFH FEHGSCSC
Sbjct: 601 ICEDCHASLKLISLIYKRQIIVRDRKRFHHFEHGSCSC 638

BLAST of CSPI07G13960 vs. NCBI nr
Match: gi|700189340|gb|KGN44573.1| (hypothetical protein Csa_7G336520 [Cucumis sativus])

HSP 1 Score: 964.5 bits (2492), Expect = 9.0e-278
Identity = 498/648 (76.85%), Postives = 526/648 (81.17%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTIYNPNRT NFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF
Sbjct: 1   MNSTIYNPNRTFNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60

Query: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRILAETNDEHLQSEALMLFSAMLCDGRVKP 120
           CAFSSRDIDYARAVFRQMPEPNCFCWNTILR+LAETNDEHLQSEALMLFSAMLCDGRVKP
Sbjct: 61  CAFSSRDIDYARAVFRQMPEPNCFCWNTILRVLAETNDEHLQSEALMLFSAMLCDGRVKP 120

Query: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180
           NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF
Sbjct: 121 NRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAYSLF 180

Query: 181 CKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSV----V 240
           CKNVVDFDGSCQMELDKRKQD                     A NLF EM   ++    V
Sbjct: 181 CKNVVDFDGSCQMELDKRKQD--------------------EAINLFQEMQSSNIDPNYV 240

Query: 241 SWNVMISGYAQNG--HFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHL 300
           +   ++   A+ G     + I+L+       ID   V   +++   ++ G+++      L
Sbjct: 241 TLVSVLPAIARIGALELGKWIHLYAGKNKIEIDD--VLGSALVDMYSKCGSIDEA----L 300

Query: 301 YAGNNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAED 360
                  + + +  SA++  ++  G  + A+  F  + K                     
Sbjct: 301 QVFETLPKRNAITWSAIIGAFAMHGRAEDAIIHFHLMGKA-------------------- 360

Query: 361 AIIHFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMV 420
                      GVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMV
Sbjct: 361 -----------GVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMV 420

Query: 421 DLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSG 480
           DLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSG
Sbjct: 421 DLLGRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSG 480

Query: 481 SYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKE 540
           SYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKE
Sbjct: 481 SYVALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKE 540

Query: 541 IQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLK 600
           IQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLK
Sbjct: 541 IQAMLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLK 591

Query: 601 IVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           IVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW
Sbjct: 601 IVKNLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 591

BLAST of CSPI07G13960 vs. NCBI nr
Match: gi|1009149030|ref|XP_015892260.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48910 [Ziziphus jujuba])

HSP 1 Score: 932.2 bits (2408), Expect = 5.0e-268
Identity = 446/645 (69.15%), Postives = 534/645 (82.79%), Query Frame = 1

Query: 1   MNSTIYNPNRTLNFSPQPPLILSKPFTSCKTPRDLKQLHAIFIKTGQIQDPLTAAEVIKF 60
           MNSTIY P  T   S  P L+  +   +CKT RDL Q+HA+FIKTGQI DPL AAE++KF
Sbjct: 18  MNSTIYKP-ATPPLSSHPSLLFPQ-IAACKTMRDLNQVHALFIKTGQIHDPLAAAEILKF 77

Query: 61  CAFSSR-DIDYARAVFRQMPEPNCFCWNTILRILAETN--DEHLQSEALMLFSAMLCDGR 120
           CA SSR D +YAR+VF QM EPNCF WNTI+R LAE++  D+    EAL+LF  M+ +G 
Sbjct: 78  CALSSRRDTEYARSVFLQMREPNCFSWNTIIRALAESDVDDDQPMEEALLLFCQMVSNGF 137

Query: 121 VKPNRFTFPSVLKACARASRLREGKQIHGLIVKFGFHEDEFVISNLVRMYVMCAVMEDAY 180
           V+PNRFTFPSVLKACA+   +  GKQ+HG+++KFG   DEFV+SNLVRMYVMC V +DA+
Sbjct: 138 VEPNRFTFPSVLKACAKIGNVEMGKQVHGMVLKFGLERDEFVVSNLVRMYVMCGVTKDAH 197

Query: 181 SLFCKNVVDFDGSCQMELDKRKQDGNVVLWNIMIDGQVRLGDIKSAKNLFDEMPQRSVVS 240
            LF +NV+D     ++  D R+Q+GNVVL N+M+DG VRLGD K+A+ LFD MPQRSVVS
Sbjct: 198 FLFSRNVIDSGNMHKVIRDSRRQEGNVVLCNVMVDGYVRLGDFKAARELFDRMPQRSVVS 257

Query: 241 WNVMISGYAQNGHFIEAINLFQEMQSSNIDPNYVTLVSVLPAIARIGALELGKWIHLYAG 300
           WNVMISGYAQNG F EAI +F++MQ   + PNYVTLVSVLPAI+R+GALELGKW+HLYA 
Sbjct: 258 WNVMISGYAQNGLFKEAIEMFRDMQLGEVRPNYVTLVSVLPAISRLGALELGKWVHLYAE 317

Query: 301 NNKIEIDDVLGSALVDMYSKCGSIDKALQVFETLPKRNAITWSAIIGAFAMHGRAEDAII 360
            NK EI+DVLGSALVDMYSKCGSI+KALQVFETLPK N ITW+AII   AMHGRA+DA+ 
Sbjct: 318 KNKFEINDVLGSALVDMYSKCGSIEKALQVFETLPKENPITWNAIISGLAMHGRAKDALH 377

Query: 361 HFHLMGKAGVTPNDVAYIGILSACSHAGLVEEGRSFFSHMVKVVGLQPRIEHYGCMVDLL 420
           +F  M +AGV P+DVAYI +LSACSHAGLVEEGR+FFSHMV V G +PRIEHYGCMVDLL
Sbjct: 378 YFSRMEQAGVVPSDVAYIAVLSACSHAGLVEEGRTFFSHMVTVAGFEPRIEHYGCMVDLL 437

Query: 421 GRAGHLEEAEELIRNMPIEPDDVIWKALLGACKMHKNLKMGERVAETLMELAPHDSGSYV 480
           GRAGHL+EAEELI NMPI  D+VIWKALLGACKMH N++MG+RVA+ LM +APHDSGSYV
Sbjct: 438 GRAGHLKEAEELILNMPIRQDEVIWKALLGACKMHGNIEMGKRVAKVLMGMAPHDSGSYV 497

Query: 481 ALSNLYASLGNWEAVARVRLKMKGMDIRKDPGCSWIEIHGIIHEFLVEDDSHSKAKEIQA 540
           ALSNLYAS G+W+ V+ +RL M+ MDIRKDPGCSWIE+ G+IHEFLVED+SH +AKEI  
Sbjct: 498 ALSNLYASSGDWKGVSEMRLMMEDMDIRKDPGCSWIELDGVIHEFLVEDESHPRAKEIHL 557

Query: 541 MLGEMSMKLRSNGYRPNTLEVFLNTDEQERARALQYHSEKIAVAFGLISTAPQHPLKIVK 600
           ML E+S +LR+ GY+P+T +V LN DE+E+  +L YHSEKIA+AFGLI+T+PQ PL+IVK
Sbjct: 558 MLEEISNQLRTAGYKPDTTQVLLNMDEEEKETSLHYHSEKIAIAFGLIATSPQTPLRIVK 617

Query: 601 NLRICEDCHASLKLISLIYKRQIIVRDRKRFHQFEHGSCSCMDYW 643
           NLRICEDCH+S+K+IS IYKR+IIVRDRKRFH FEHG+CSCMDYW
Sbjct: 618 NLRICEDCHSSIKVISKIYKRKIIVRDRKRFHHFEHGTCSCMDYW 660

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP425_ARATH1.7e-24463.17Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN... [more]
PP449_ARATH6.7e-14841.82Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN... [more]
PP367_ARATH7.4e-13940.22Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN... [more]
PP354_ARATH8.2e-13840.57Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
PP175_ARATH3.5e-13640.72Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0K543_CUCSA6.3e-27876.85Uncharacterized protein OS=Cucumis sativus GN=Csa_7G336520 PE=4 SV=1[more]
V4TTU6_9ROSI6.3e-26268.49Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019256mg PE=4 SV=1[more]
M5VIK6_PRUPE3.1e-26169.52Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002774mg PE=4 SV=1[more]
A0A061FRW3_THECC1.7e-25966.93Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_036... [more]
A0A067K2C4_JATCU1.9e-25868.20Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17578 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G48910.19.7e-24663.17 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G66520.13.8e-14941.82 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G06540.14.2e-14040.22 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G37380.14.6e-13940.57 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G29760.11.9e-13740.72 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449443909|ref|XP_004139718.1|0.0e+0099.38PREDICTED: pentatricopeptide repeat-containing protein At5g48910 isoform X2 [Cuc... [more]
gi|659116224|ref|XP_008457972.1|0.0e+0095.95PREDICTED: pentatricopeptide repeat-containing protein At5g48910 isoform X2 [Cuc... [more]
gi|659116220|ref|XP_008457970.1|0.0e+0095.92PREDICTED: pentatricopeptide repeat-containing protein At5g48910 isoform X1 [Cuc... [more]
gi|700189340|gb|KGN44573.1|9.0e-27876.85hypothetical protein Csa_7G336520 [Cucumis sativus][more]
gi|1009149030|ref|XP_015892260.1|5.0e-26869.15PREDICTED: pentatricopeptide repeat-containing protein At5g48910 [Ziziphus jujub... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007020 microtubule nucleation
biological_process GO:0009451 RNA modification
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G13960.1CSPI07G13960.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 409..433
score: 0.0049coord: 337..367
score: 0.0063coord: 372..399
score: 0.11coord: 309..334
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 234..279
score: 1.8
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 205..235
score: 3.5E-5coord: 236..269
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 370..405
score: 8.188coord: 121..155
score: 8.068coord: 234..268
score: 13.088coord: 304..334
score: 8.013coord: 335..369
score: 10.26coord: 203..233
score: 9.756coord: 472..506
score: 7.366coord: 406..436
score: 7.563coord: 50..85
score: 6.533coord: 156..187
score: 5.042coord: 438..468
score:
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 202..377
score: 1.1E-8coord: 414..490
score: 1.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 7..513
score: 4.6E
NoneNo IPR availablePANTHERPTHR24015:SF291SUBFAMILY NOT NAMEDcoord: 7..513
score: 4.6E