CSPI01G09080 (gene) Wild cucumber (PI 183967)

Name: CSPI01G09080
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr1 : 5713255 .. 5717083 (+)
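The Location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed is given here, assuming the coordinates are 1-based and inclusive (the record does not state this) and using a hypothetical helper name, parse_location:

```python
import re

# Assumed layout: "<seqid> : <start> .. <end> (<strand>)", as in the Location field.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Span length under the 1-based, inclusive assumption.
        "length": end - start + 1,
    }


loc = parse_location("Chr1 : 5713255 .. 5717083 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Chr1 + 3829
```

Under that assumption the feature spans 3829 bases on the forward strand of Chr1.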
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CTCCGTTGAAGCCGAGGCGTTCCGCCAAGCCGTGTCCTCCGTTCGTGCTTCTATTCGGCCTTCCCTACTATTGGGCTCACTAGCTCAGCTTTGGCTACATATGGAATCGTTAGCGCGATAAGCTCAAGCCGTAAAGTTTCCGCCTTTCTAAACCTTTACTGTACCCATCACTGAGCTCCAGCTGCTGCAATTTATCCGGAAAACAATCTCCCATGGCGCTTGTACAACAACACCATCTCACATACCCATTTCTTTCCATTGCCGGGGCCAATCTGAAACAAAATACTTCCAATTCTTTTTCATTTTTTCAATCCAATACCCAGAAGCTCGCCTGCTGCTTATGTGCAGCATCCCCGAACCCCTCCACTCAATCTCCATCCCCCATTTTCCTTCATTTTTTCGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGTTCCTTCTAAGGAAGGTCATGGAGGTAACAAGACGGAAGAGGATTGGAACGACCCATTATTCAGATTTTTCAAATCCCAAACTTCAACGACGCAAGACCCATCACGTGAAAGCAAATTGCCCCTCCAAAAGAACCGCCGTTCGTCCTGGCATCTTGCCTCCGATGTTGAATTTTTCAATGAAGCTGAAGTTACACCGGAGGAAGACAAGGAACAATTGCGTTCTGCGAGTCGGAATTCTAGGGTCTTACCAGATGGTCCTGTCGGAGAAATAGTGGGAATTGCGAGGAATTTGTCGCAAAATATGACTCTGGGGGAGGCTTTGGGAGAATTTGAAGGAAGAATTAGCGAGAAGGAATGTTGGGAGGTGCTGCGTTTGTTGGGTGAGGAGAATCTTGTGGTATGTTGTTTGTATTTCTTTGAATGGATGGGTTTGCAGGAGACTTCGCTTGTTACATCTCGTGCCTATTCTCTTCTATTTCCATTGTTGGGAAGAGCTGGAATGGGAGAGAAAATTATGGTCTTGTTCAAGAACCTTCCACTCAAGAAGGAATTTCAGGATGTTCATGTCTATAACTCTGCAATTTCTGGACTTATGGTCTGTAAGAGGTACGAGTTTGGTTTTAATTTATTGCATTTTCCTCTTTATTTTTGCTGTTATAATTTTCTTGCTGCTTGATTTAGTTCTGGAACTTATATTTGAGAAATTTTTATTGAATTTTGAGCTGTGTTGAATTGGTAAGGCCAATTTGATTATTGTTTAGTTTTTGGTTTTTAAAATTTAAGCTTATAAGCAGTATACAAATTTATTTTGTTATCTACTTTTTAAGATGCTTCCAAAATCCGAGCTATATTTTGAAAAATAAAAAATAGTTTTGTAGTTTTGTTCTTGAAATTTCGTTAAGAATTCAAACGTTGGCTTCAAAGGCCACAGTAAAAAAATGGTGAGAAAACGGGCATAGTTTTATTAACTTGAGGAATGACATTTGAGACCTCACAAGTGTCTTGTTATGTTTTCTAATTGGTGTCATGGCCACTGCCCCTCACTAATCTTCTTAAGTTTTCTTAATGGTGAACGATTCAAATGATGTAAAACAAATCATCCTTTCCTCATTTTAGCTTATCCTTCTTAGTCGATAATCGGCTCTCTCTTCTATATGCTTCATTTATCCAAAAGGAGAAAAAGAAAAGAAAGAACGAAAAAGAAAACCAACCAACCAACAACTTTGGTGGCTGACAGCTCCTGGAGGCAATACGAAATTTTAATCCGTGATAGCATGTATTAATAGCTATTTCTTATTAGAATTGAGTAAAACTACAGATTGGCGGTAATGATCCTCAATGCTTGTTCTGTTTTCCTTGTTAAAAACCTATAGTTTATGGGGAGATAAAGAAAGACGCGGAACTCTATCTGTGATCCTTTTCCAGTCCAAGGAAGATAATACTTAAACAAATGTATTGGTGTATTGGAAGCAGTAAACTTCAAATTGGTCAAAGTCGTACTGTTTTGATGGGAAAATAGCATCGTTGCCTTATGTCTTCCAAAGTGGCCGTGGAAGCTTAACTTTGA
TTATCCTTTATTGTTTACATATTCTAAATTATTATTTCCAGGTATGATGATGCTTGCAAGGTGTACGAGGCCATGGAAACAAATAATGTTAATCCAGATCATGTGACATGTTCTATAATGATTACAGTTATGAGAAAAATTGGCCGCAGTGCAAAGGATTCATGGGATTACTTTGAGAAAATGAACCAAAAAGGAGTAAAATGGAGTTCTGAAGTTTTGGGTGCTCTGATTAAATCGTTCTGCGATGAGGGGCTGAAGAGTCAAGCACTTATCCTACAATTGGAGATGGAGAAGAAAGGGGTTGCTTCGAACGTGATCATGTATAATACGATCATGGATGCTTTTAGTAAATCGAATCAAATCGAGGAAGCTGAAGGTGTCTTTGCTGAAATGAAATCTAAAGGAGTGAAACCAACGAGTGCAAGTTTTAACATCTTGATGAATGCATACAGTAGGAGGATGCAACCTGAGATTGTTGAGAAGCTTCTGGTTGAAATGAAGGATATGGGATTGGAACCTAATGTAAAGTCATACACTTGCTTGATTAGTGCTTATGGGAGGCAGAAGAAAATGAGTGACATGGCTGCAGATGCATTTTTGAGAATGAAAAAAAATGGTATTAGGCCAACCTCTCATTCATATACAGCTCTGATTCATGCTTATTCTGTTAGCGGTTGGCATGAGAAAGCTTACTCAGCCTTTGAGAACATGTTGCGTGAAGGTTTAAAGCCATCCATTGAAACTTACACGACTCTACTCGATGCGTTTAGGCGTGCTGGTGATACAGTGTCGTTGATGAAAATATGGAAGTTAATGATTAGAGAAAAAGTACTAGGGACAAGAGTAACTTTTAACACATTGCTAGATGGGTTTGCAAAACACGGTCATTATGTTGAAGCAAGAGATGTGATCTCTGAGTTTGATAAGATCGGGTTACAACCAACTGTTATGACATACAACATGTTGATGAATGCATATGCTAGGGGAGGTCAACATTTAAAGCTGCCACAGCTGCTGCAAGAGATGGCTGCTCGGGACCTAAAACCCGACTCCGTTACTTATTCTACCATGATTTATGCCTTTGTACGTGTTCGCGATTTCAAAAGAGCTTTCTTCTATCACAAGAAGATGGTAAAAAGTGGACAAGTGCCTGATGTAAAGTCATACCAGAAACTTAAATCGATCTTGGATGTAAAACTTGCTACAAAAAACAGGAAAGACAAGAGTGCCATTCTTGGTATAATAAACAGCAAAATGGGTATGGTGAAAGCTAAGAAGCAGGGCAAGAAAGATGAGTTTTGGAAGACCAAGAGAAGGCATGTAAGAACTCAAGACAGTTTCTCCCGGTGAACAAACAAAAATGAAGACAGAAAGTTTAATTGACCATCCACTTGGGCTTTTGATGAATGGATGGTCATCTCTTATCCATCATAAGATAAGCTTAAGAGGTCCAATATTTGGCAGATGGGCAGGGCGTTTGTTCAAGAAATTGAGATAAGGAAGTGTCAAAAGGGTTTATAACAAAACATTGGCCCAATGGAGGAAAAGGAGGAGAAAGTCAAAGCAGCTTTTGAAGAATCAAGATATTTGTGCATTGGCTTGGACAGATTAATAATCAATGATGAAGAATATTGTACTTTAGGTAAAAGGAGGTCAGAACTCTTCTTTTTACACGTTGTAAAGAGGTTTGTTCTCTACAATTTCCGATTAGAAAATGGAAAGTGGACTATCTAGGTTATGTGGGCTTGGTTTTTAAACTCCTTGTATTTATTTCTCAATGAAAGTTGGTCTGTTGTAAAGAAATATATATAAATTTATATTTTGTAGAG

mRNA sequence

ATGGCGCTTGTACAACAACACCATCTCACATACCCATTTCTTTCCATTGCCGGGGCCAATCTGAAACAAAATACTTCCAATTCTTTTTCATTTTTTCAATCCAATACCCAGAAGCTCGCCTGCTGCTTATGTGCAGCATCCCCGAACCCCTCCACTCAATCTCCATCCCCCATTTTCCTTCATTTTTTCGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGTTCCTTCTAAGGAAGGTCATGGAGGTAACAAGACGGAAGAGGATTGGAACGACCCATTATTCAGATTTTTCAAATCCCAAACTTCAACGACGCAAGACCCATCACGTGAAAGCAAATTGCCCCTCCAAAAGAACCGCCGTTCGTCCTGGCATCTTGCCTCCGATGTTGAATTTTTCAATGAAGCTGAAGTTACACCGGAGGAAGACAAGGAACAATTGCGTTCTGCGAGTCGGAATTCTAGGGTCTTACCAGATGGTCCTGTCGGAGAAATAGTGGGAATTGCGAGGAATTTGTCGCAAAATATGACTCTGGGGGAGGCTTTGGGAGAATTTGAAGGAAGAATTAGCGAGAAGGAATGTTGGGAGGTGCTGCGTTTGTTGGGTGAGGAGAATCTTGTGGTATGTTGTTTGTATTTCTTTGAATGGATGGGTTTGCAGGAGACTTCGCTTGTTACATCTCGTGCCTATTCTCTTCTATTTCCATTGTTGGGAAGAGCTGGAATGGGAGAGAAAATTATGGTCTTGTTCAAGAACCTTCCACTCAAGAAGGAATTTCAGGATGTTCATGTCTATAACTCTGCAATTTCTGGACTTATGGTCTGTAAGAGGTATGATGATGCTTGCAAGGTGTACGAGGCCATGGAAACAAATAATGTTAATCCAGATCATGTGACATGTTCTATAATGATTACAGTTATGAGAAAAATTGGCCGCAGTGCAAAGGATTCATGGGATTACTTTGAGAAAATGAACCAAAAAGGAGTAAAATGGAGTTCTGAAGTTTTGGGTGCTCTGATTAAATCGTTCTGCGATGAGGGGCTGAAGAGTCAAGCACTTATCCTACAATTGGAGATGGAGAAGAAAGGGGTTGCTTCGAACGTGATCATGTATAATACGATCATGGATGCTTTTAGTAAATCGAATCAAATCGAGGAAGCTGAAGGTGTCTTTGCTGAAATGAAATCTAAAGGAGTGAAACCAACGAGTGCAAGTTTTAACATCTTGATGAATGCATACAGTAGGAGGATGCAACCTGAGATTGTTGAGAAGCTTCTGGTTGAAATGAAGGATATGGGATTGGAACCTAATGTAAAGTCATACACTTGCTTGATTAGTGCTTATGGGAGGCAGAAGAAAATGAGTGACATGGCTGCAGATGCATTTTTGAGAATGAAAAAAAATGGTATTAGGCCAACCTCTCATTCATATACAGCTCTGATTCATGCTTATTCTGTTAGCGGTTGGCATGAGAAAGCTTACTCAGCCTTTGAGAACATGTTGCGTGAAGGTTTAAAGCCATCCATTGAAACTTACACGACTCTACTCGATGCGTTTAGGCGTGCTGGTGATACAGTGTCGTTGATGAAAATATGGAAGTTAATGATTAGAGAAAAAGTACTAGGGACAAGAGTAACTTTTAACACATTGCTAGATGGGTTTGCAAAACACGGTCATTATGTTGAAGCAAGAGATGTGATCTCTGAGTTTGATAAGATCGGGTTACAACCAACTGTTATGACATACAACATGTTGATGAATGCATATGCTAGGGGAGGTCAACATTTAAAGCTGCCACAGCTGCTGCAAGAGATGGCTGCTCGGGACCTAAAACCCGACTCCGTTACTTATTCTACCATGATTTATGCCTTTGTACGTGTTCGCGATTTCAAAAGAGCTTTCTTCTATCACAAGAAGATGGTAAAAAGTGGACAAGTGCCTGATGTAAAGTCATACCAGAAACTTAAATCGATCTTGGATGTAAAACTTGCTACAAAAAACAG
GAAAGACAAGAGTGCCATTCTTGGTATAATAAACAGCAAAATGGGTATGGTGAAAGCTAAGAAGCAGGGCAAGAAAGATGAGTTTTGGAAGACCAAGAGAAGGCATGTAAGAACTCAAGACAGTTTCTCCCGGTGA

Coding sequence (CDS)

ATGGCGCTTGTACAACAACACCATCTCACATACCCATTTCTTTCCATTGCCGGGGCCAATCTGAAACAAAATACTTCCAATTCTTTTTCATTTTTTCAATCCAATACCCAGAAGCTCGCCTGCTGCTTATGTGCAGCATCCCCGAACCCCTCCACTCAATCTCCATCCCCCATTTTCCTTCATTTTTTCGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGTTCCTTCTAAGGAAGGTCATGGAGGTAACAAGACGGAAGAGGATTGGAACGACCCATTATTCAGATTTTTCAAATCCCAAACTTCAACGACGCAAGACCCATCACGTGAAAGCAAATTGCCCCTCCAAAAGAACCGCCGTTCGTCCTGGCATCTTGCCTCCGATGTTGAATTTTTCAATGAAGCTGAAGTTACACCGGAGGAAGACAAGGAACAATTGCGTTCTGCGAGTCGGAATTCTAGGGTCTTACCAGATGGTCCTGTCGGAGAAATAGTGGGAATTGCGAGGAATTTGTCGCAAAATATGACTCTGGGGGAGGCTTTGGGAGAATTTGAAGGAAGAATTAGCGAGAAGGAATGTTGGGAGGTGCTGCGTTTGTTGGGTGAGGAGAATCTTGTGGTATGTTGTTTGTATTTCTTTGAATGGATGGGTTTGCAGGAGACTTCGCTTGTTACATCTCGTGCCTATTCTCTTCTATTTCCATTGTTGGGAAGAGCTGGAATGGGAGAGAAAATTATGGTCTTGTTCAAGAACCTTCCACTCAAGAAGGAATTTCAGGATGTTCATGTCTATAACTCTGCAATTTCTGGACTTATGGTCTGTAAGAGGTATGATGATGCTTGCAAGGTGTACGAGGCCATGGAAACAAATAATGTTAATCCAGATCATGTGACATGTTCTATAATGATTACAGTTATGAGAAAAATTGGCCGCAGTGCAAAGGATTCATGGGATTACTTTGAGAAAATGAACCAAAAAGGAGTAAAATGGAGTTCTGAAGTTTTGGGTGCTCTGATTAAATCGTTCTGCGATGAGGGGCTGAAGAGTCAAGCACTTATCCTACAATTGGAGATGGAGAAGAAAGGGGTTGCTTCGAACGTGATCATGTATAATACGATCATGGATGCTTTTAGTAAATCGAATCAAATCGAGGAAGCTGAAGGTGTCTTTGCTGAAATGAAATCTAAAGGAGTGAAACCAACGAGTGCAAGTTTTAACATCTTGATGAATGCATACAGTAGGAGGATGCAACCTGAGATTGTTGAGAAGCTTCTGGTTGAAATGAAGGATATGGGATTGGAACCTAATGTAAAGTCATACACTTGCTTGATTAGTGCTTATGGGAGGCAGAAGAAAATGAGTGACATGGCTGCAGATGCATTTTTGAGAATGAAAAAAAATGGTATTAGGCCAACCTCTCATTCATATACAGCTCTGATTCATGCTTATTCTGTTAGCGGTTGGCATGAGAAAGCTTACTCAGCCTTTGAGAACATGTTGCGTGAAGGTTTAAAGCCATCCATTGAAACTTACACGACTCTACTCGATGCGTTTAGGCGTGCTGGTGATACAGTGTCGTTGATGAAAATATGGAAGTTAATGATTAGAGAAAAAGTACTAGGGACAAGAGTAACTTTTAACACATTGCTAGATGGGTTTGCAAAACACGGTCATTATGTTGAAGCAAGAGATGTGATCTCTGAGTTTGATAAGATCGGGTTACAACCAACTGTTATGACATACAACATGTTGATGAATGCATATGCTAGGGGAGGTCAACATTTAAAGCTGCCACAGCTGCTGCAAGAGATGGCTGCTCGGGACCTAAAACCCGACTCCGTTACTTATTCTACCATGATTTATGCCTTTGTACGTGTTCGCGATTTCAAAAGAGCTTTCTTCTATCACAAGAAGATGGTAAAAAGTGGACAAGTGCCTGATGTAAAGTCATACCAGAAACTTAAATCGATCTTGGATGTAAAACTTGCTACAAAAAACAG
GAAAGACAAGAGTGCCATTCTTGGTATAATAAACAGCAAAATGGGTATGGTGAAAGCTAAGAAGCAGGGCAAGAAAGATGAGTTTTGGAAGACCAAGAGAAGGCATGTAAGAACTCAAGACAGTTTCTCCCGGTGA
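As a quick consistency check between the coding sequence and the protein used as the BLAST query, the first and last codons of the CDS can be translated by hand. The sketch below assumes the standard genetic code (the record does not name a translation table) and uses a hypothetical helper name, translate:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 27 and last 9 bases of the CDS listed above.
print(translate("ATGGCGCTTGTACAACAACACCATCTC"))  # MALVQQHHL
print(translate("TCCCGGTGA"))                    # SR*
```

The translated start (MALVQQHHL) and end (SR followed by a stop) agree with the first and last residues of the query shown in the alignments.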
BLAST of CSPI01G09080 vs. Swiss-Prot
Match: PP426_ARATH (Pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Arabidopsis thaliana GN=EMB1006 PE=2 SV=1)

HSP 1 Score: 839.0 bits (2166), Expect = 4.0e-242
Identity = 430/670 (64.18%), Positives = 530/670 (79.10%), Query Frame = 1

Query: 43  LCAASPNPSTQSPSPIFLHFFEEE-----EEEEEEEVPSKEGHGGNKTEE---DWNDPLF 102
           L A SP+ S+ SPS IFL  F++      ++ E   + S+E     + +E   D+ DP+ 
Sbjct: 43  LSATSPSSSSSSPS-IFLSCFDDALPDKIQQPENSTINSEESECEEEDDEEGDDFTDPIL 102

Query: 103 RFFKSQTST---TQDPSRESKLPLQKNRRSSWHLASD-VEFFNEAEVTPEEDKEQLRSAS 162
           +FFKS+T T   T DP+RESK  LQKNRR+SWHLA D  +   E E  PEE        +
Sbjct: 103 KFFKSRTLTSESTADPARESKFSLQKNRRTSWHLAPDFADPETEIESKPEESVFVTNQQT 162

Query: 163 RNSRV-LPDGPVGEIVGIARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCC 222
               +    G   EI+ +A+NL +N TLGE L  FE R+S+ EC E L ++GE   V  C
Sbjct: 163 LGVHIPFESGVAREILELAKNLKENQTLGEMLSGFERRVSDTECVEALVMMGESGFVKSC 222

Query: 223 LYFFEWMGLQETSLVTSRAYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAIS 282
           LYF+EWM LQE SL + RA S+LF LLGR  M + I++L  NLP K+EF+DV +YN+AIS
Sbjct: 223 LYFYEWMSLQEPSLASPRACSVLFTLLGRERMADYILLLLSNLPDKEEFRDVRLYNAAIS 282

Query: 283 GLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVK 342
           GL   +RYDDA +VYEAM+  NV PD+VTC+I+IT +RK GRSAK+ W+ FEKM++KGVK
Sbjct: 283 GLSASQRYDDAWEVYEAMDKINVYPDNVTCAILITTLRKAGRSAKEVWEIFEKMSEKGVK 342

Query: 343 WSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGV 402
           WS +V G L+KSFCDEGLK +AL++Q EMEKKG+ SN I+YNT+MDA++KSN IEE EG+
Sbjct: 343 WSQDVFGGLVKSFCDEGLKEEALVIQTEMEKKGIRSNTIVYNTLMDAYNKSNHIEEVEGL 402

Query: 403 FAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGR 462
           F EM+ KG+KP++A++NILM+AY+RRMQP+IVE LL EM+D+GLEPNVKSYTCLISAYGR
Sbjct: 403 FTEMRDKGLKPSAATYNILMDAYARRMQPDIVETLLREMEDLGLEPNVKSYTCLISAYGR 462

Query: 463 QKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIE 522
            KKMSDMAADAFLRMKK G++P+SHSYTALIHAYSVSGWHEKAY++FE M +EG+KPS+E
Sbjct: 463 TKKMSDMAADAFLRMKKVGLKPSSHSYTALIHAYSVSGWHEKAYASFEEMCKEGIKPSVE 522

Query: 523 TYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEF 582
           TYT++LDAFRR+GDT  LM+IWKLM+REK+ GTR+T+NTLLDGFAK G Y+EARDV+SEF
Sbjct: 523 TYTSVLDAFRRSGDTGKLMEIWKLMLREKIKGTRITYNTLLDGFAKQGLYIEARDVVSEF 582

Query: 583 DKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDF 642
            K+GLQP+VMTYNMLMNAYARGGQ  KLPQLL+EMAA +LKPDS+TYSTMIYAFVRVRDF
Sbjct: 583 SKMGLQPSVMTYNMLMNAYARGGQDAKLPQLLKEMAALNLKPDSITYSTMIYAFVRVRDF 642

Query: 643 KRAFFYHKKMVKSGQVPDVKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQ 700
           KRAFFYHK MVKSGQVPD +SY+KL++IL+ K  TKNRKDK+AILGIINSK G VKAK +
Sbjct: 643 KRAFFYHKMMVKSGQVPDPRSYEKLRAILEDKAKTKNRKDKTAILGIINSKFGRVKAKTK 702
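The percentages on each HSP summary line are simply the match counts divided by the alignment length. A minimal sketch that recomputes them from the line for this hit is shown here; the helper name hsp_percentages and the regular expression are illustrative assumptions, not part of any BLAST tool:

```python
import re

# Matches "Identity = a/b ... <positives label> = c/d"; the label is matched
# loosely because its spelling can vary between exports.
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 430/670 (64.18%), Positives = 530/670 (79.10%), Query Frame = 1"
print(hsp_percentages(line))  # (64.18, 79.1)
```

Here 430 identical and 530 similar positions over 670 aligned columns give 64.18% and 79.10%, matching the reported values.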

BLAST of CSPI01G09080 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 2.2e-43
Identity = 120/442 (27.15%), Positives = 210/442 (47.51%), Query Frame = 1

Query: 189 SEKECWEVLRLLGEENLVVCCLYFFEWMGLQET--SLVTSRAYSLLFPLLGRAGMGEKIM 248
           +  E    L+ LG        L  F+W   Q+   S++ +   +++  +LG+ G      
Sbjct: 134 TSSELLAFLKGLGFHKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAA 193

Query: 249 VLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVM 308
            +F  L       DV+ Y S IS      RY +A  V++ ME +   P  +T ++++ V 
Sbjct: 194 NMFNGLQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVF 253

Query: 309 RKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASN 368
            K+G          EKM   G+   +     LI       L  +A  +  EM+  G + +
Sbjct: 254 GKMGTPWNKITSLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYD 313

Query: 369 VIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLV 428
            + YN ++D + KS++ +EA  V  EM   G  P+  ++N L++AY+R    +   +L  
Sbjct: 314 KVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKN 373

Query: 429 EMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVS 488
           +M + G +P+V +YT L+S + R  K+ + A   F  M+  G +P   ++ A I  Y   
Sbjct: 374 QMAEKGTKPDVFTYTTLLSGFERAGKV-ESAMSIFEEMRNAGCKPNICTFNAFIKMYGNR 433

Query: 489 GWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTF 548
           G   +    F+ +   GL P I T+ TLL  F + G    +  ++K M R   +  R TF
Sbjct: 434 GKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETF 493

Query: 549 NTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAA 608
           NTL+  +++ G + +A  V       G+ P + TYN ++ A ARGG   +  ++L EM  
Sbjct: 494 NTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMED 553

Query: 609 RDLKPDSVTYSTMIYAFVRVRD 629
              KP+ +TY ++++A+   ++
Sbjct: 554 GRCKPNELTYCSLLHAYANGKE 574

BLAST of CSPI01G09080 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 5.5e-42
Identity = 114/410 (27.80%), Positives = 197/410 (48.05%), Query Frame = 1

Query: 248 LFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMR 307
           +FK +   +   +V  YN  I G       D A  +++ MET    P+ VT + +I    
Sbjct: 192 VFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYC 251

Query: 308 KIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNV 367
           K+ R   D +     M  KG++ +      +I   C EG   +   +  EM ++G + + 
Sbjct: 252 KL-RKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDE 311

Query: 368 IMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVE 427
           + YNT++  + K     +A  + AEM   G+ P+  ++  L+++  +        + L +
Sbjct: 312 VTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQ 371

Query: 428 MKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSG 487
           M+  GL PN ++YT L+  + ++  M++ A      M  NG  P+  +Y ALI+ + V+G
Sbjct: 372 MRVRGLCPNERTYTTLVDGFSQKGYMNE-AYRVLREMNDNGFSPSVVTYNALINGHCVTG 431

Query: 488 WHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFN 547
             E A +  E+M  +GL P + +Y+T+L  F R+ D    +++ + M+ + +    +T++
Sbjct: 432 KMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYS 491

Query: 548 TLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAAR 607
           +L+ GF +     EA D+  E  ++GL P   TY  L+NAY   G   K  QL  EM  +
Sbjct: 492 SLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEK 551

Query: 608 DLKPDSVTYSTMIYAF---VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 655
            + PD VTYS +I       R R+ KR      K+     VP   +Y  L
Sbjct: 552 GVLPDVVTYSVLINGLNKQSRTREAKRLLL---KLFYEESVPSDVTYHTL 596

BLAST of CSPI01G09080 vs. Swiss-Prot
Match: RF1_ORYSI (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 3.6e-41
Identity = 109/392 (27.81%), Positives = 188/392 (47.96%), Query Frame = 1

Query: 260 DVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDY 319
           DV  Y + I+G       D A   Y  M    + PD VT + +I  + K  ++   + + 
Sbjct: 195 DVVSYTTVINGFFKEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCK-AQAMDKAMEV 254

Query: 320 FEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNVIMYNTIMDAFSK 379
              M + GV        +++  +C  G   +A+    +M   GV  +V+ Y+ +MD   K
Sbjct: 255 LNTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCK 314

Query: 380 SNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKS 439
           + +  EA  +F  M  +G+KP   ++  L+  Y+ +     +  LL  M   G+ P+   
Sbjct: 315 NGRCMEARKIFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYV 374

Query: 440 YTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYSAFENM 499
           ++ LI AY +Q K+ D A   F +M++ G+ P + +Y A+I     SG  E A   FE M
Sbjct: 375 FSILICAYAKQGKV-DQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQM 434

Query: 500 LREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDGFAKHGHY 559
           + EGL P    Y +L+             ++   M+   +    + FN+++D   K G  
Sbjct: 435 IDEGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRV 494

Query: 560 VEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAARDLKPDSVTYSTM 619
           +E+  +     +IG++P V+TYN L+N Y   G+  +  +LL  M +  LKP++VTYST+
Sbjct: 495 IESEKLFELMVRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTL 554

Query: 620 IYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSY 652
           I  + ++   + A    K+M  SG  PD+ +Y
Sbjct: 555 INGYCKISRMEDALVLFKEMESSGVSPDIITY 584

BLAST of CSPI01G09080 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 170.6 bits (431), Expect = 6.1e-41
Identity = 109/427 (25.53%), Positives = 206/427 (48.24%), Query Frame = 1

Query: 230 SLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMET 289
           +++   L + G  EK+      +  K  + D+  YN+ IS        ++A ++  AM  
Sbjct: 239 NIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPG 298

Query: 290 NNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKS 349
              +P   T + +I  + K G+  +   + F +M + G+   S    +L+   C +G   
Sbjct: 299 KGFSPGVYTYNTVINGLCKHGKYERAK-EVFAEMLRSGLSPDSTTYRSLLMEACKKGDVV 358

Query: 350 QALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILM 409
           +   +  +M  + V  +++ ++++M  F++S  +++A   F  +K  G+ P +  + IL+
Sbjct: 359 ETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILI 418

Query: 410 NAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGI 469
             Y R+    +   L  EM   G   +V +Y  ++    ++K + + A   F  M +  +
Sbjct: 419 QGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGE-ADKLFNEMTERAL 478

Query: 470 RPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMK 529
            P S++ T LI  +   G  + A   F+ M  + ++  + TY TLLD F + GD  +  +
Sbjct: 479 FPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKE 538

Query: 530 IWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYA 589
           IW  M+ +++L T ++++ L++     GH  EA  V  E     ++PTVM  N ++  Y 
Sbjct: 539 IWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYC 598

Query: 590 RGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKM--VKSGQVPD 649
           R G        L++M +    PD ++Y+T+IY FVR  +  +AF   KKM   + G VPD
Sbjct: 599 RSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQGGLVPD 658

Query: 650 VKSYQKL 655
           V +Y  +
Sbjct: 659 VFTYNSI 663

BLAST of CSPI01G09080 vs. TrEMBL
Match: A0A0A0LWH7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G050000 PE=4 SV=1)

HSP 1 Score: 1394.8 bits (3609), Expect = 0.0e+00
Identity = 708/711 (99.58%), Positives = 708/711 (99.58%), Query Frame = 1

Query: 1   MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL 60
           MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL
Sbjct: 1   MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL 60

Query: 61  HFFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRR 120
           H FEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRR
Sbjct: 61  HLFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRR 120

Query: 121 SSWHLASDVEFFNEAEVTPEEDKEQLRSASRNSRVLPDGPVGEIVGIARNLSQNMTLGEA 180
           SSWHLASDVEFFNEAEVT EEDKEQLRSASRNSRVLP GPVGEIVGIARNLSQNMTLGEA
Sbjct: 121 SSWHLASDVEFFNEAEVTLEEDKEQLRSASRNSRVLPGGPVGEIVGIARNLSQNMTLGEA 180

Query: 181 LGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAG 240
           LGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAG
Sbjct: 181 LGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAG 240

Query: 241 MGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCS 300
           MGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCS
Sbjct: 241 MGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCS 300

Query: 301 IMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEK 360
           IMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEK
Sbjct: 301 IMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEK 360

Query: 361 KGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEI 420
           KGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEI
Sbjct: 361 KGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEI 420

Query: 421 VEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALI 480
           VEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALI
Sbjct: 421 VEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALI 480

Query: 481 HAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVL 540
           HAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVL
Sbjct: 481 HAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVL 540

Query: 541 GTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQL 600
           GTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQL
Sbjct: 541 GTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQL 600

Query: 601 LQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDV 660
           LQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDV
Sbjct: 601 LQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDV 660

Query: 661 KLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQDSFSR 712
           KLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQDSFSR
Sbjct: 661 KLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQDSFSR 711

BLAST of CSPI01G09080 vs. TrEMBL
Match: A0A061G5M6_THECC (Pentatricopeptide repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_014542 PE=4 SV=1)

HSP 1 Score: 949.5 bits (2453), Expect = 2.3e-273
Identity = 494/695 (71.08%), Positives = 572/695 (82.30%), Query Frame = 1

Query: 24  NTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFLHFFEEEEEEE-EEEVPSKEGHGG 83
           +TS SF  F          + A  P P+  S SPIFL F +E +++E E E P  +  G 
Sbjct: 33  STSKSFPSFS---------ISATPPPPTPHSSSPIFLPFLQEPQQQELETENPKSQELG- 92

Query: 84  NKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRRSSWHLASDVEFFN--EAEVTPE 143
            K E+D  DP+ RFFKS+ ST  DP R+ K  LQKNRRSSWHLA D+      E++  PE
Sbjct: 93  -KEEDDVKDPIIRFFKSRPSTP-DPPRQGKFSLQKNRRSSWHLAPDIRSLPDPESDSEPE 152

Query: 144 ED--------KEQLRSASRNSRVLPDGPVGEIVGIARNLSQNMTLGEALGEFEGRISEKE 203
            D        K+ L S   +   LP G VG+IV IA+NL +N TLGE LG ++G++S+KE
Sbjct: 153 PDGENIFSEAKQHLDSTPEDYTELPVGIVGDIVRIAKNLPENSTLGELLGGYQGKVSQKE 212

Query: 204 CWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAGMGEKIMVLFKNL 263
           C EVL L+G+E LV+ CLYFFEWMGLQE  LVT RA S+LFP+LGRAGMG+K+MVLF+NL
Sbjct: 213 CLEVLVLMGKEGLVLGCLYFFEWMGLQEPLLVTPRACSVLFPVLGRAGMGDKLMVLFRNL 272

Query: 264 PLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRS 323
           P  + F+DVHVYN+ ISGL+  KRYDDA KVYEAME NNV PDHVTCSI+IT+MRK GRS
Sbjct: 273 PQSRVFRDVHVYNATISGLLCSKRYDDAWKVYEAMEANNVQPDHVTCSIVITIMRKTGRS 332

Query: 324 AKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNVIMYNT 383
           AKD+W++FE+MN+KGVKWS EVLGA+IKSFCDEGLK +ALI+Q EMEKKGV SN I+YNT
Sbjct: 333 AKDAWEFFERMNRKGVKWSPEVLGAIIKSFCDEGLKHEALIIQSEMEKKGVPSNAIVYNT 392

Query: 384 IMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVEMKDMG 443
           +MDA+SKSNQIEE EG+FAEMK+KG+ PTSA+FNILM+AYSRRMQPEIVE LL+EM+DMG
Sbjct: 393 LMDAYSKSNQIEEVEGLFAEMKAKGLVPTSATFNILMDAYSRRMQPEIVENLLLEMQDMG 452

Query: 444 LEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKA 503
           L+P+ KSYTCLISAYGRQKKMSD AADAFLRMKK G++PTSHSYT+LIHAYS+SGWHEKA
Sbjct: 453 LKPDAKSYTCLISAYGRQKKMSDKAADAFLRMKKVGVKPTSHSYTSLIHAYSISGWHEKA 512

Query: 504 YSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDG 563
           Y+AFENMLREGLK SIETYTTLLDAFRRAGDT  LMKIWKLMI EKV GTRVTFN LLDG
Sbjct: 513 YTAFENMLREGLKLSIETYTTLLDAFRRAGDTQILMKIWKLMISEKVEGTRVTFNILLDG 572

Query: 564 FAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAARDLKPD 623
           FAK G Y+EARDVISEF KIGLQPT+MTYNMLMNAYARGGQH KLPQLL+EMAA +LKPD
Sbjct: 573 FAKQGQYIEARDVISEFGKIGLQPTLMTYNMLMNAYARGGQHQKLPQLLKEMAALNLKPD 632

Query: 624 SVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDVKLATKNRKDKSA 683
           SVTYSTMIYAFVRVRDFKRAF+YHK+MVKSGQVPDVKSY+KLK+ILDVK A KN+KD+SA
Sbjct: 633 SVTYSTMIYAFVRVRDFKRAFYYHKQMVKSGQVPDVKSYEKLKAILDVKAAKKNKKDRSA 692

Query: 684 ILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQD 708
           ILGIINSKMGMVKAK++ KKDE WK K+RH +T D
Sbjct: 693 ILGIINSKMGMVKAKRKTKKDELWKNKKRHHKTPD 715

BLAST of CSPI01G09080 vs. TrEMBL
Match: W9S5W3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008195 PE=4 SV=1)

HSP 1 Score: 938.7 bits (2425), Expect = 4.1e-270
Identity = 483/673 (71.77%), Positives = 561/673 (83.36%), Query Frame = 1

Query: 39  LACCLCAASPNPSTQSPSPIFLHFFEEEEEEEEEEVPSKEGHGGNKTE-EDWNDPLFRFF 98
           L+  LC+AS      S S IFL F +EEEEEEE EV + E       E E+  DPL +FF
Sbjct: 43  LSTPLCSAS-----LSSSSIFLPFLQEEEEEEENEVINNEEQESKPCEKEEEEDPLVKFF 102

Query: 99  KSQTSTTQDPSRESKLPLQKNRRSSWHLASDVEFFNEAEVTPE----EDKEQLRSASRNS 158
           KS+  TTQDP RE +L LQKNRRSSWHLA D EF +E E   +    E  E+ +   +  
Sbjct: 103 KSRP-TTQDPQREGRLSLQKNRRSSWHLAPDSEFADEPETESDSNIAESLEKEQRKKQEF 162

Query: 159 RVLPDGPVGEIVGIARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFF 218
             +P+G  GEI+ IARNL QN+TLGEAL  FEGR+  +EC EVL L+GEE L + CLYFF
Sbjct: 163 EQIPEGIAGEILRIARNLPQNLTLGEALEGFEGRVGARECVEVLGLMGEEGLFMGCLYFF 222

Query: 219 EWMGLQETSLVTSRAYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMV 278
           EWMGLQE SLVT RA S+LFPLLGRAG+G+K+MVLF+NLP+KKEF+DVHVYN+AISGLM 
Sbjct: 223 EWMGLQEPSLVTPRACSVLFPLLGRAGLGDKLMVLFENLPMKKEFRDVHVYNAAISGLMC 282

Query: 279 CKRYDDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSE 338
            KRY DA KVYEAME NN+ PDHVTCSIMIT+MRKIGRSAK++W++FE+MN+KGVKWS E
Sbjct: 283 SKRYGDAWKVYEAMEANNIRPDHVTCSIMITIMRKIGRSAKEAWEFFERMNRKGVKWSPE 342

Query: 339 VLGALIKSFCDEGLKSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEM 398
           VLGALIK+FCDEGLKS+AL++Q+EM KKGV  N I+YNTIMDAF KSNQ+EEAEG+FAEM
Sbjct: 343 VLGALIKAFCDEGLKSEALVIQIEMAKKGVFPNAIVYNTIMDAFCKSNQVEEAEGLFAEM 402

Query: 399 KSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKM 458
           K KG+KPTSA+FN+LM+AYSRR+QP++VEKLL EM+D+GL+PN KSYTCLISAY RQ KM
Sbjct: 403 KLKGIKPTSATFNVLMDAYSRRIQPDVVEKLLEEMQDLGLDPNAKSYTCLISAYARQ-KM 462

Query: 459 SDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTT 518
           SDMAADA LRMKK GI PTSHSYTALIHAYSV+GWHEKAY AFENM +E LKPSIETYT 
Sbjct: 463 SDMAADALLRMKKVGINPTSHSYTALIHAYSVTGWHEKAYIAFENMRKERLKPSIETYTA 522

Query: 519 LLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIG 578
           LLDAFRRAGDT  LMKIWK+M++EK+ GTRVTFNTL+DGFAK G Y EARDVIS F KIG
Sbjct: 523 LLDAFRRAGDTEMLMKIWKMMLKEKIEGTRVTFNTLVDGFAKQGRYTEARDVISVFGKIG 582

Query: 579 LQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAF 638
           LQPT+MTYNML+NAYARGGQ  KLPQLL+EM+  DLKPDSVTYSTMIYA+VR+RDFKRAF
Sbjct: 583 LQPTLMTYNMLINAYARGGQGSKLPQLLKEMSVLDLKPDSVTYSTMIYAYVRIRDFKRAF 642

Query: 639 FYHKKMVKSGQVPDVKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKD 698
           FYHK+MVKSGQVPD KSY+KL+SILDVK A KN+KDK AILGIINSKMG++KAKK+GKKD
Sbjct: 643 FYHKQMVKSGQVPDAKSYEKLRSILDVKAARKNKKDKKAILGIINSKMGLLKAKKKGKKD 702

Query: 699 EFWKTKRRHVRTQ 707
           EFWK ++ H  T+
Sbjct: 703 EFWKNRKMHDSTR 708

BLAST of CSPI01G09080 vs. TrEMBL
Match: A5BHI6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010606 PE=4 SV=1)

HSP 1 Score: 932.6 bits (2409), Expect = 2.9e-268
Identity = 476/659 (72.23%), Positives = 556/659 (84.37%), Query Frame = 1

Query: 48  PNPSTQSPSPIFLHFFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDP 107
           P PS+ S SPIFL F +E++   + +   KE     + +ED NDP+ RFFKS+TST QDP
Sbjct: 38  PTPSSHSSSPIFLPFLQEQDRTLQHQRQQKE-----EEDEDPNDPILRFFKSRTST-QDP 97

Query: 108 SRESKLPLQKNRRSSWHLASDVEFFNEAEVTPEEDKEQLRSASRNSRVLPDGPVGEIVGI 167
             ESK  LQKNRR SW LAS  +  ++AE   EE+KEQ+ S S  S     G  GEI+  
Sbjct: 98  RFESKFSLQKNRRPSWRLASTTDPESDAEFDVEEEKEQVVSDSCTSL---QGISGEILHF 157

Query: 168 ARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSR 227
           ARNL +N TLGE LG + GR+ E+EC EVL L+ EE+LV+ CLYFFEWMGLQE SLVT+R
Sbjct: 158 ARNLPENSTLGEVLGPYVGRVGERECVEVLGLMCEEDLVMGCLYFFEWMGLQEPSLVTAR 217

Query: 228 AYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAM 287
           A SLLFP+LGRAGMG+ +MVL +NLP  ++F+DV +YNSAISGL  C RYDDA KVY+ M
Sbjct: 218 ACSLLFPMLGRAGMGDDLMVLLRNLPKTRQFRDVRIYNSAISGLSSCGRYDDAWKVYDEM 277

Query: 288 ETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGL 347
           ETNN+ PDHVTCSIMITVMRK G SAKD+W++F++MN+KGVKWS EVLGALIKSFCDEGL
Sbjct: 278 ETNNIRPDHVTCSIMITVMRKDGHSAKDAWEFFQRMNRKGVKWSLEVLGALIKSFCDEGL 337

Query: 348 KSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNI 407
           K++ALI+Q EMEKKG++SN I+YNT+MDA+SKSN++EEAEG+F EMK+KGV PTSA++NI
Sbjct: 338 KNEALIIQSEMEKKGISSNAIVYNTLMDAYSKSNRVEEAEGLFGEMKAKGVMPTSATYNI 397

Query: 408 LMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKN 467
           LM+AYSRRMQPEI+E LL+EM+DMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKK 
Sbjct: 398 LMDAYSRRMQPEIIENLLLEMQDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKV 457

Query: 468 GIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSL 527
           GI+PTSHSYTALIHAYSV GWHEKAY+AFENM REG+KPSIETYT LLDAFRRAGDT +L
Sbjct: 458 GIKPTSHSYTALIHAYSVGGWHEKAYTAFENMKREGIKPSIETYTALLDAFRRAGDTQTL 517

Query: 528 MKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNA 587
           MKIWKLM+ +K+ GTRVTFN LLDGFAK GHY+EARDVI EF KIG QPTVMTYNMLMNA
Sbjct: 518 MKIWKLMLSDKIEGTRVTFNILLDGFAKQGHYMEARDVIFEFGKIGFQPTVMTYNMLMNA 577

Query: 588 YARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPD 647
           YARGGQH +LPQLL+EM + +LKPDS+TYSTMIYA+VRVRDFKRAFFYHK+MVKSGQVPD
Sbjct: 578 YARGGQHSRLPQLLKEMTSLNLKPDSITYSTMIYAYVRVRDFKRAFFYHKQMVKSGQVPD 637

Query: 648 VKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQ 707
            +SYQKL+SILDVK ATKNRKD+SAILGI+NS MG++K KK GKKDEFWK K+   R Q
Sbjct: 638 PQSYQKLRSILDVKAATKNRKDRSAILGIVNSNMGLLKPKK-GKKDEFWKNKKGQRRIQ 686

BLAST of CSPI01G09080 vs. TrEMBL
Match: D7SHD5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09860 PE=4 SV=1)

HSP 1 Score: 932.6 bits (2409), Expect = 2.9e-268
Identity = 476/659 (72.23%), Positives = 556/659 (84.37%), Query Frame = 1

Query: 48  PNPSTQSPSPIFLHFFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDP 107
           P PS+ S SPIFL F +E++   + +   KE     + +ED NDP+ RFFKS+TST QDP
Sbjct: 38  PTPSSHSSSPIFLPFLQEQDRTLQHQRQQKE-----EEDEDPNDPILRFFKSRTST-QDP 97

Query: 108 SRESKLPLQKNRRSSWHLASDVEFFNEAEVTPEEDKEQLRSASRNSRVLPDGPVGEIVGI 167
             ESK  LQKNRR SW LAS  +  ++AE   EE+KEQ+ S S  S     G  GEI+  
Sbjct: 98  RFESKFSLQKNRRPSWRLASTTDPESDAEFDVEEEKEQVVSDSCTSL---QGISGEILHF 157

Query: 168 ARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSR 227
           ARNL +N TLGE LG + GR+ E+EC EVL L+ EE+LV+ CLYFFEWMGLQE SLVT+R
Sbjct: 158 ARNLPENSTLGEVLGPYVGRVGERECVEVLGLMCEEDLVMGCLYFFEWMGLQEPSLVTAR 217

Query: 228 AYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAM 287
           A SLLFP+LGRAGMG+ +MVL +NLP  ++F+DV +YNSAISGL  C RYDDA KVY+ M
Sbjct: 218 ACSLLFPMLGRAGMGDDLMVLLRNLPKTRQFRDVRIYNSAISGLSSCGRYDDAWKVYDEM 277

Query: 288 ETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGL 347
           ETNN+ PDHVTCSIMITVMRK G SAKD+W++F++MN+KGVKWS EVLGALIKSFCDEGL
Sbjct: 278 ETNNIRPDHVTCSIMITVMRKDGHSAKDAWEFFQRMNRKGVKWSLEVLGALIKSFCDEGL 337

Query: 348 KSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNI 407
           K++ALI+Q EMEKKG++SN I+YNT+MDA+SKSN++EEAEG+F EMK+KGV PTSA++NI
Sbjct: 338 KNEALIIQSEMEKKGISSNAIVYNTLMDAYSKSNRVEEAEGLFGEMKAKGVMPTSATYNI 397

Query: 408 LMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKN 467
           LM+AYSRRMQPEI+E LL+EM+DMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKK 
Sbjct: 398 LMDAYSRRMQPEIIENLLLEMQDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKV 457

Query: 468 GIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSL 527
           GI+PTSHSYTALIHAYSV GWHEKAY+AFENM REG+KPSIETYT LLDAFRRAGDT +L
Sbjct: 458 GIKPTSHSYTALIHAYSVGGWHEKAYTAFENMKREGIKPSIETYTALLDAFRRAGDTQTL 517

Query: 528 MKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNA 587
           MKIWKLM+ +K+ GTRVTFN LLDGFAK GHY+EARDVI EF KIG QPTVMTYNMLMNA
Sbjct: 518 MKIWKLMLSDKIEGTRVTFNILLDGFAKQGHYMEARDVIFEFGKIGFQPTVMTYNMLMNA 577

Query: 588 YARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPD 647
           YARGGQH +LPQLL+EM + +LKPDS+TYSTMIYA+VRVRDFKRAFFYHK+MVKSGQVPD
Sbjct: 578 YARGGQHSRLPQLLKEMTSLNLKPDSITYSTMIYAYVRVRDFKRAFFYHKQMVKSGQVPD 637

Query: 648 VKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQ 707
            +SYQKL+SILDVK ATKNRKD+SAILGI+NS MG++K KK GKKDEFWK K+   R Q
Sbjct: 638 PQSYQKLRSILDVKAATKNRKDRSAILGIVNSNMGLLKPKK-GKKDEFWKNKKGQRRIQ 686
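
The HSP header lines in these reports give identities and positives as fractions together with a rounded percentage. The following is a minimal sketch in Python, not part of the original record, of how such a line could be parsed and the percentages recomputed from the counts. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions. The label for positives is matched loosely because it may appear as "Postives" or "Positives".

```python
import re

# Hypothetical pattern for the statistics line of an HSP header.
# "Pos\w*tives" accepts both "Postives" and "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and percentages found on one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
        # Percentages recomputed from the raw counts, for cross-checking.
        "identity_pct_calc": round(100.0 * int(ident) / int(ident_len), 2),
        "positive_pct_calc": round(100.0 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 476/659 (72.23%), Postives = 556/659 (84.37%), Query Frame = 1"
)
print(stats["identity_pct_calc"], stats["positive_pct_calc"])
```

For the Vitis vinifera hit above, the recomputed values agree with the reported 72.23% identity and 84.37% positives.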

BLAST of CSPI01G09080 vs. TAIR10
Match: AT5G50280.1 (AT5G50280.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 839.0 bits (2166), Expect = 2.2e-243
Identity = 430/670 (64.18%), Positives = 530/670 (79.10%), Query Frame = 1

Query: 43  LCAASPNPSTQSPSPIFLHFFEEE-----EEEEEEEVPSKEGHGGNKTEE---DWNDPLF 102
           L A SP+ S+ SPS IFL  F++      ++ E   + S+E     + +E   D+ DP+ 
Sbjct: 43  LSATSPSSSSSSPS-IFLSCFDDALPDKIQQPENSTINSEESECEEEDDEEGDDFTDPIL 102

Query: 103 RFFKSQTST---TQDPSRESKLPLQKNRRSSWHLASD-VEFFNEAEVTPEEDKEQLRSAS 162
           +FFKS+T T   T DP+RESK  LQKNRR+SWHLA D  +   E E  PEE        +
Sbjct: 103 KFFKSRTLTSESTADPARESKFSLQKNRRTSWHLAPDFADPETEIESKPEESVFVTNQQT 162

Query: 163 RNSRV-LPDGPVGEIVGIARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCC 222
               +    G   EI+ +A+NL +N TLGE L  FE R+S+ EC E L ++GE   V  C
Sbjct: 163 LGVHIPFESGVAREILELAKNLKENQTLGEMLSGFERRVSDTECVEALVMMGESGFVKSC 222

Query: 223 LYFFEWMGLQETSLVTSRAYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAIS 282
           LYF+EWM LQE SL + RA S+LF LLGR  M + I++L  NLP K+EF+DV +YN+AIS
Sbjct: 223 LYFYEWMSLQEPSLASPRACSVLFTLLGRERMADYILLLLSNLPDKEEFRDVRLYNAAIS 282

Query: 283 GLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVK 342
           GL   +RYDDA +VYEAM+  NV PD+VTC+I+IT +RK GRSAK+ W+ FEKM++KGVK
Sbjct: 283 GLSASQRYDDAWEVYEAMDKINVYPDNVTCAILITTLRKAGRSAKEVWEIFEKMSEKGVK 342

Query: 343 WSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGV 402
           WS +V G L+KSFCDEGLK +AL++Q EMEKKG+ SN I+YNT+MDA++KSN IEE EG+
Sbjct: 343 WSQDVFGGLVKSFCDEGLKEEALVIQTEMEKKGIRSNTIVYNTLMDAYNKSNHIEEVEGL 402

Query: 403 FAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGR 462
           F EM+ KG+KP++A++NILM+AY+RRMQP+IVE LL EM+D+GLEPNVKSYTCLISAYGR
Sbjct: 403 FTEMRDKGLKPSAATYNILMDAYARRMQPDIVETLLREMEDLGLEPNVKSYTCLISAYGR 462

Query: 463 QKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIE 522
            KKMSDMAADAFLRMKK G++P+SHSYTALIHAYSVSGWHEKAY++FE M +EG+KPS+E
Sbjct: 463 TKKMSDMAADAFLRMKKVGLKPSSHSYTALIHAYSVSGWHEKAYASFEEMCKEGIKPSVE 522

Query: 523 TYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEF 582
           TYT++LDAFRR+GDT  LM+IWKLM+REK+ GTR+T+NTLLDGFAK G Y+EARDV+SEF
Sbjct: 523 TYTSVLDAFRRSGDTGKLMEIWKLMLREKIKGTRITYNTLLDGFAKQGLYIEARDVVSEF 582

Query: 583 DKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDF 642
            K+GLQP+VMTYNMLMNAYARGGQ  KLPQLL+EMAA +LKPDS+TYSTMIYAFVRVRDF
Sbjct: 583 SKMGLQPSVMTYNMLMNAYARGGQDAKLPQLLKEMAALNLKPDSITYSTMIYAFVRVRDF 642

Query: 643 KRAFFYHKKMVKSGQVPDVKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQ 700
           KRAFFYHK MVKSGQVPD +SY+KL++IL+ K  TKNRKDK+AILGIINSK G VKAK +
Sbjct: 643 KRAFFYHKMMVKSGQVPDPRSYEKLRAILEDKAKTKNRKDKTAILGIINSKFGRVKAKTK 702

BLAST of CSPI01G09080 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 178.7 bits (452), Expect = 1.3e-44
Identity = 120/442 (27.15%), Positives = 210/442 (47.51%), Query Frame = 1

Query: 189 SEKECWEVLRLLGEENLVVCCLYFFEWMGLQET--SLVTSRAYSLLFPLLGRAGMGEKIM 248
           +  E    L+ LG        L  F+W   Q+   S++ +   +++  +LG+ G      
Sbjct: 134 TSSELLAFLKGLGFHKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAA 193

Query: 249 VLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVM 308
            +F  L       DV+ Y S IS      RY +A  V++ ME +   P  +T ++++ V 
Sbjct: 194 NMFNGLQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVF 253

Query: 309 RKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASN 368
            K+G          EKM   G+   +     LI       L  +A  +  EM+  G + +
Sbjct: 254 GKMGTPWNKITSLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYD 313

Query: 369 VIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLV 428
            + YN ++D + KS++ +EA  V  EM   G  P+  ++N L++AY+R    +   +L  
Sbjct: 314 KVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKN 373

Query: 429 EMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVS 488
           +M + G +P+V +YT L+S + R  K+ + A   F  M+  G +P   ++ A I  Y   
Sbjct: 374 QMAEKGTKPDVFTYTTLLSGFERAGKV-ESAMSIFEEMRNAGCKPNICTFNAFIKMYGNR 433

Query: 489 GWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTF 548
           G   +    F+ +   GL P I T+ TLL  F + G    +  ++K M R   +  R TF
Sbjct: 434 GKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETF 493

Query: 549 NTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAA 608
           NTL+  +++ G + +A  V       G+ P + TYN ++ A ARGG   +  ++L EM  
Sbjct: 494 NTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMED 553

Query: 609 RDLKPDSVTYSTMIYAFVRVRD 629
              KP+ +TY ++++A+   ++
Sbjct: 554 GRCKPNELTYCSLLHAYANGKE 574

BLAST of CSPI01G09080 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 174.1 bits (440), Expect = 3.1e-43
Identity = 114/410 (27.80%), Positives = 197/410 (48.05%), Query Frame = 1

Query: 248 LFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMR 307
           +FK +   +   +V  YN  I G       D A  +++ MET    P+ VT + +I    
Sbjct: 192 VFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYC 251

Query: 308 KIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNV 367
           K+ R   D +     M  KG++ +      +I   C EG   +   +  EM ++G + + 
Sbjct: 252 KL-RKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDE 311

Query: 368 IMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVE 427
           + YNT++  + K     +A  + AEM   G+ P+  ++  L+++  +        + L +
Sbjct: 312 VTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQ 371

Query: 428 MKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSG 487
           M+  GL PN ++YT L+  + ++  M++ A      M  NG  P+  +Y ALI+ + V+G
Sbjct: 372 MRVRGLCPNERTYTTLVDGFSQKGYMNE-AYRVLREMNDNGFSPSVVTYNALINGHCVTG 431

Query: 488 WHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFN 547
             E A +  E+M  +GL P + +Y+T+L  F R+ D    +++ + M+ + +    +T++
Sbjct: 432 KMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYS 491

Query: 548 TLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAAR 607
           +L+ GF +     EA D+  E  ++GL P   TY  L+NAY   G   K  QL  EM  +
Sbjct: 492 SLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEK 551

Query: 608 DLKPDSVTYSTMIYAF---VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 655
            + PD VTYS +I       R R+ KR      K+     VP   +Y  L
Sbjct: 552 GVLPDVVTYSVLINGLNKQSRTREAKRLLL---KLFYEESVPSDVTYHTL 596

BLAST of CSPI01G09080 vs. TAIR10
Match: AT5G01110.1 (AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 170.6 bits (431), Expect = 3.4e-42
Identity = 109/427 (25.53%), Positives = 206/427 (48.24%), Query Frame = 1

Query: 230 SLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMET 289
           +++   L + G  EK+      +  K  + D+  YN+ IS        ++A ++  AM  
Sbjct: 239 NIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPG 298

Query: 290 NNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKS 349
              +P   T + +I  + K G+  +   + F +M + G+   S    +L+   C +G   
Sbjct: 299 KGFSPGVYTYNTVINGLCKHGKYERAK-EVFAEMLRSGLSPDSTTYRSLLMEACKKGDVV 358

Query: 350 QALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILM 409
           +   +  +M  + V  +++ ++++M  F++S  +++A   F  +K  G+ P +  + IL+
Sbjct: 359 ETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILI 418

Query: 410 NAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGI 469
             Y R+    +   L  EM   G   +V +Y  ++    ++K + + A   F  M +  +
Sbjct: 419 QGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGE-ADKLFNEMTERAL 478

Query: 470 RPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMK 529
            P S++ T LI  +   G  + A   F+ M  + ++  + TY TLLD F + GD  +  +
Sbjct: 479 FPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKE 538

Query: 530 IWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYA 589
           IW  M+ +++L T ++++ L++     GH  EA  V  E     ++PTVM  N ++  Y 
Sbjct: 539 IWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYC 598

Query: 590 RGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKM--VKSGQVPD 649
           R G        L++M +    PD ++Y+T+IY FVR  +  +AF   KKM   + G VPD
Sbjct: 599 RSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQGGLVPD 658

Query: 650 VKSYQKL 655
           V +Y  +
Sbjct: 659 VFTYNSI 663

BLAST of CSPI01G09080 vs. TAIR10
Match: AT2G18940.1 (AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 164.5 bits (415), Expect = 2.4e-40
Identity = 114/465 (24.52%), Positives = 219/465 (47.10%), Query Frame = 1

Query: 197 LRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAGMGEKIMVLFKNLPLKK 256
           +R+LG E+         + + LQE  L+  RAY+ +     R G  EK + LF+ +    
Sbjct: 182 VRILGRESQYSVAAKLLDKIPLQEY-LLDVRAYTTILHAYSRTGKYEKAIDLFERMKEMG 241

Query: 257 EFQDVHVYNSAISGL-MVCKRYDDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKD 316
               +  YN  +     + + +     V + M +  +  D  TCS +++   + G   ++
Sbjct: 242 PSPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACAREGL-LRE 301

Query: 317 SWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEKKGVASNVIMYNTIMD 376
           + ++F ++   G +  +    AL++ F   G+ ++AL +  EME+    ++ + YN ++ 
Sbjct: 302 AKEFFAELKSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVA 361

Query: 377 AFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEIVEKLLVEMKDMGLEP 436
           A+ ++   +EA GV   M  KGV P + ++  +++AY +  + +   KL   MK+ G  P
Sbjct: 362 AYVRAGFSKEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVP 421

Query: 437 NVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYSA 496
           N  +Y  ++S  G++ + ++M       MK NG  P   ++  ++      G  +     
Sbjct: 422 NTCTYNAVLSLLGKKSRSNEM-IKMLCDMKSNGCSPNRATWNTMLALCGNKGMDKFVNRV 481

Query: 497 FENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDGFAK 556
           F  M   G +P  +T+ TL+ A+ R G  V   K++  M R        T+N LL+  A+
Sbjct: 482 FREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAGFNACVTTYNALLNALAR 541

Query: 557 HGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQLLQEMAARDLKPDSVT 616
            G +    +VIS+    G +PT  +Y++++  YA+GG +L + ++   +    + P  + 
Sbjct: 542 KGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIENRIKEGQIFPSWML 601

Query: 617 YSTMIYAFVRVRDF---KRAFFYHKKMVKSGQVPDVKSYQKLKSI 658
             T++ A  + R     +RAF   K   K G  PD+  +  + SI
Sbjct: 602 LRTLLLANFKCRALAGSERAFTLFK---KHGYKPDMVIFNSMLSI 640

BLAST of CSPI01G09080 vs. NCBI nr
Match: gi|778657971|ref|XP_004152584.2| (PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Cucumis sativus])

HSP 1 Score: 1394.8 bits (3609), Expect = 0.0e+00
Identity = 708/711 (99.58%), Positives = 708/711 (99.58%), Query Frame = 1

Query: 1   MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL 60
           MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL
Sbjct: 1   MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL 60

Query: 61  HFFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRR 120
           H FEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRR
Sbjct: 61  HLFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRR 120

Query: 121 SSWHLASDVEFFNEAEVTPEEDKEQLRSASRNSRVLPDGPVGEIVGIARNLSQNMTLGEA 180
           SSWHLASDVEFFNEAEVT EEDKEQLRSASRNSRVLP GPVGEIVGIARNLSQNMTLGEA
Sbjct: 121 SSWHLASDVEFFNEAEVTLEEDKEQLRSASRNSRVLPGGPVGEIVGIARNLSQNMTLGEA 180

Query: 181 LGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAG 240
           LGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAG
Sbjct: 181 LGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFPLLGRAG 240

Query: 241 MGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCS 300
           MGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCS
Sbjct: 241 MGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNPDHVTCS 300

Query: 301 IMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEK 360
           IMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEK
Sbjct: 301 IMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALILQLEMEK 360

Query: 361 KGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEI 420
           KGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEI
Sbjct: 361 KGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSRRMQPEI 420

Query: 421 VEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALI 480
           VEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALI
Sbjct: 421 VEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALI 480

Query: 481 HAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVL 540
           HAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVL
Sbjct: 481 HAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLMIREKVL 540

Query: 541 GTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQL 600
           GTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQL
Sbjct: 541 GTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHLKLPQL 600

Query: 601 LQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDV 660
           LQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDV
Sbjct: 601 LQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLKSILDV 660

Query: 661 KLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQDSFSR 712
           KLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQDSFSR
Sbjct: 661 KLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRTQDSFSR 711

BLAST of CSPI01G09080 vs. NCBI nr
Match: gi|659067377|ref|XP_008439140.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Cucumis melo])

HSP 1 Score: 1328.9 bits (3438), Expect = 0.0e+00
Identity = 678/711 (95.36%), Positives = 691/711 (97.19%), Query Frame = 1

Query: 1   MALVQQHHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFL 60
           MALV QHHLT+PFLSI  ANLKQNTSNS SFFQSNTQKLACCLCAASPNP+TQSPSPIFL
Sbjct: 1   MALVHQHHLTFPFLSIGKANLKQNTSNSCSFFQSNTQKLACCLCAASPNPTTQSPSPIFL 60

Query: 61  HFFEEEEEEEEEE------VPSKEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLP 120
           HF +EEEEEEEEE      VPSKE HGGNKTEEDWNDPLFRFFKS+TSTTQDPSRESKL 
Sbjct: 61  HFLQEEEEEEEEEEVEEEEVPSKEVHGGNKTEEDWNDPLFRFFKSRTSTTQDPSRESKLS 120

Query: 121 LQKNRRSSWHLASDVEFFNEAEVTPEEDKEQLRSASRNSRVLPDGPVGEIVGIARNLSQN 180
           LQKNRRSSWHLASDVEFFNEAEVT EEDKEQL SASRNSRVLPDG VGEIVGIARNLSQN
Sbjct: 121 LQKNRRSSWHLASDVEFFNEAEVTLEEDKEQLGSASRNSRVLPDGLVGEIVGIARNLSQN 180

Query: 181 MTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFP 240
           MTLGEALGEFEGRISEKEC EVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFP
Sbjct: 181 MTLGEALGEFEGRISEKECLEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFP 240

Query: 241 LLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNP 300
           LLGRAGMGEKIMVLFKNLPL+KEFQDVHVYNSA+SGLMVCKRYDDACKVYEAMETNNVNP
Sbjct: 241 LLGRAGMGEKIMVLFKNLPLRKEFQDVHVYNSAMSGLMVCKRYDDACKVYEAMETNNVNP 300

Query: 301 DHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALIL 360
           DHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALI+
Sbjct: 301 DHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALII 360

Query: 361 QLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSR 420
           QLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSR
Sbjct: 361 QLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSR 420

Query: 421 RMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH 480
           RMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH
Sbjct: 421 RMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH 480

Query: 481 SYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLM 540
           SYTALIHAYSVSGWHEKAYS FENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLM
Sbjct: 481 SYTALIHAYSVSGWHEKAYSIFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLM 540

Query: 541 IREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQH 600
           IREK++GTRVTFN LLDGFAK GHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQH
Sbjct: 541 IREKIVGTRVTFNILLDGFAKQGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQH 600

Query: 601 LKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 660
           LK+PQLLQEMAAR+LKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL
Sbjct: 601 LKMPQLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 660

Query: 661 KSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTKRRHVRT 706
           KSILDVKLATKNRKDKSAILGIINSKMGMVKAK++GKKDEFWKTKRRHVRT
Sbjct: 661 KSILDVKLATKNRKDKSAILGIINSKMGMVKAKQKGKKDEFWKTKRRHVRT 711

BLAST of CSPI01G09080 vs. NCBI nr
Match: gi|470130284|ref|XP_004301033.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Fragaria vesca subsp. vesca])

HSP 1 Score: 967.6 bits (2500), Expect = 1.2e-278
Identity = 497/670 (74.18%), Positives = 573/670 (85.52%), Query Frame = 1

Query: 43  LCAASPNPSTQSPSPIFLHFFEEEEEEEEEEVPSKEGHGGNKTEEDWNDPLFRFFKSQTS 102
           L +A P P++ S S IFL F EEEEE+ EEE   +     ++ EED +DP+ RFFKS+TS
Sbjct: 43  LYSAPPIPTSNSSSSIFLPFLEEEEEDHEEEEGLESV--ADEKEEDPDDPIARFFKSRTS 102

Query: 103 TTQDPSRESKLPLQKNRRSSWHLASDV---EFFNEAEVTPEEDKEQLRSASRNSRVLPDG 162
           T QDP RE KL LQKNRRSSWHLA D+   E  +  +  PE  ++QL   S +S  L DG
Sbjct: 103 T-QDPQREGKLSLQKNRRSSWHLADDLDDSEPDSGVDPVPEVQEQQLGPVSSDSIPLADG 162

Query: 163 PVGEIVGIARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQ 222
            VG+I+  ARNL QN+TLGE LG FEGR+ EKEC EVL L+GEE L++ CLYFFEWMGLQ
Sbjct: 163 IVGQILQKARNLGQNLTLGEELGGFEGRVGEKECVEVLELMGEEGLLMGCLYFFEWMGLQ 222

Query: 223 ETSLVTSRAYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDD 282
           E  LVT RA S+LFP+LGRAGMG+K++VLFKNLP  KEF+DVHVYN+AISGLM  KRYDD
Sbjct: 223 EPCLVTPRACSVLFPILGRAGMGDKLVVLFKNLP-GKEFRDVHVYNAAISGLMCSKRYDD 282

Query: 283 ACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALI 342
           A KVYE ME NN+ PDHVTCSIMIT+MRKIGRSAKDSWD+FE+MN+KGVKWS EVLGALI
Sbjct: 283 AWKVYETMEANNILPDHVTCSIMITIMRKIGRSAKDSWDFFERMNRKGVKWSQEVLGALI 342

Query: 343 KSFCDEGLKSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVK 402
           KSFCDEGLKS+ALI+Q+EMEKKG++SN I+YNT+M AF  SN++EEAEG+F EMKS+G+K
Sbjct: 343 KSFCDEGLKSEALIIQIEMEKKGISSNAIVYNTLMTAFCDSNRVEEAEGLFTEMKSRGIK 402

Query: 403 PTSASFNILMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAAD 462
           PTS +FNILM+AYSRRMQPEIVEKLLVEM++MGL+PNVKSYTCL+SAYGRQK MSDMAAD
Sbjct: 403 PTSPTFNILMDAYSRRMQPEIVEKLLVEMQEMGLDPNVKSYTCLVSAYGRQKNMSDMAAD 462

Query: 463 AFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFR 522
           AFLRMKK GI PTSH+YTALIHAYSVSGWHEKAY AFENM REGLKPSIETYT LLDAFR
Sbjct: 463 AFLRMKKVGICPTSHTYTALIHAYSVSGWHEKAYIAFENMKREGLKPSIETYTALLDAFR 522

Query: 523 RAGDTVSLMKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVM 582
           RAGDT  LM+IWKLMI+EKV GT+VTFNTLLDGF+K GHY+EARDV+SEF  +GLQPTVM
Sbjct: 523 RAGDTEMLMRIWKLMIKEKVQGTKVTFNTLLDGFSKQGHYLEARDVVSEFGNMGLQPTVM 582

Query: 583 TYNMLMNAYARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKM 642
           TYNMLMNAYARGGQH KLPQLL+EM   +LKPDSVTYSTMIYA++RVRDF RAFFYHKKM
Sbjct: 583 TYNMLMNAYARGGQHSKLPQLLKEMEVLNLKPDSVTYSTMIYAYIRVRDFSRAFFYHKKM 642

Query: 643 VKSGQVPDVKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKT- 702
           VKSGQVPD +SY+KL++ILDVKLA KN+KDKSAILGIINSKMGM+K KK+GKKDEFWK  
Sbjct: 643 VKSGQVPDARSYEKLRAILDVKLAKKNKKDKSAILGIINSKMGMLKIKKKGKKDEFWKNK 702

Query: 703 KRRHVRTQDS 709
           K+R+VR  ++
Sbjct: 703 KKRYVRADNA 708

BLAST of CSPI01G09080 vs. NCBI nr
Match: gi|657965383|ref|XP_008374346.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Malus domestica])

HSP 1 Score: 966.1 bits (2496), Expect = 3.4e-278
Identity = 500/715 (69.93%), Positives = 582/715 (81.40%), Query Frame = 1

Query: 7   HHLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAASPNPSTQSPSPIFLHFFEEE 66
           HH   P L +   NL+      FS +            +A P PS+ S +PIFL F + +
Sbjct: 25  HHFPKPCLFVFSKNLRL-----FSLY------------SAPPTPSSLSSAPIFLPFLQNQ 84

Query: 67  EEEEEEEVPSKEGHG-----GNKTEEDWNDPLFRFFKSQTSTTQDPSRESKLPLQKNRRS 126
           EEE+E+E   +           + EED +DP+ RFFKS+TST QDP RE K  LQKNRRS
Sbjct: 85  EEEDEDETEEETEEPPALEEDEEEEEDPDDPILRFFKSRTST-QDPEREGKFSLQKNRRS 144

Query: 127 SWHLASDVEFFNEAEVTPEE-------DKEQLRSASRNSRVLPDGPVGEIVGIARNLSQN 186
           +W LA      +E+E  PE        +++Q+      S   P+G V EI+  AR L QN
Sbjct: 145 AWRLADGTHLADESE--PETGVKKLLGEQKQVGPIKFGSNASPEGIVQEILQKARTLPQN 204

Query: 187 MTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFP 246
           +TLGEALG FEGR+ EKEC ++L ++GEE L+V CLYFFEWMGLQE SLVT RA S+LFP
Sbjct: 205 LTLGEALGGFEGRVGEKECVKILEVMGEEGLLVGCLYFFEWMGLQEPSLVTPRACSVLFP 264

Query: 247 LLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAMETNNVNP 306
           +LGRAGMG+K+M+LF+NLP KKEF DVHVYN+AISGLM  KRYDDA KVYE ME NN  P
Sbjct: 265 MLGRAGMGDKLMILFRNLPAKKEFWDVHVYNAAISGLMCSKRYDDAWKVYETMEANNTLP 324

Query: 307 DHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALIL 366
           DHVTCSIMITVMRK+GRSAKDSW +FE+MN+KGV+WS EVLGALIKSFCDEGLK +ALI+
Sbjct: 325 DHVTCSIMITVMRKVGRSAKDSWQFFERMNRKGVRWSQEVLGALIKSFCDEGLKREALII 384

Query: 367 QLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSR 426
           Q+EMEKKGV+SN I+YNT+MDAF  SNQ+EEAEG+FAEMKSKG+KPT+A+FN+LM+AYSR
Sbjct: 385 QVEMEKKGVSSNAIVYNTLMDAFCNSNQVEEAEGLFAEMKSKGIKPTAATFNVLMSAYSR 444

Query: 427 RMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH 486
           +M+PEIVEKLLVEM DMGL+PNVKSYTCLISAYGRQKKMSDMAADAFLRMKK GIRPTSH
Sbjct: 445 KMEPEIVEKLLVEMXDMGLKPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKVGIRPTSH 504

Query: 487 SYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLM 546
           SYTALIHA+SVSGWHEKAY AFENM +EGLKPSIETYT LLDAFRRAGD   LMKIWKLM
Sbjct: 505 SYTALIHAFSVSGWHEKAYIAFENMQKEGLKPSIETYTALLDAFRRAGDAQMLMKIWKLM 564

Query: 547 IREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQH 606
           I+EK++GT+VT+NTLLDGFAK GHYVEARDVISEF  +GLQPTVMTYNMLMNAYARGGQH
Sbjct: 565 IKEKIVGTKVTYNTLLDGFAKQGHYVEARDVISEFGNVGLQPTVMTYNMLMNAYARGGQH 624

Query: 607 LKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 666
            KLPQLL+EMAA  LKPDSVTYSTMIYA+VRVRDF+RAFFYHK+MVK+GQVPD +SY+KL
Sbjct: 625 SKLPQLLKEMAALKLKPDSVTYSTMIYAYVRVRDFRRAFFYHKQMVKNGQVPDARSYEKL 684

Query: 667 KSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKTK-RRHVRTQDS 709
           +SILDVK A KN+KDKSAILGIINSKMG++K KK+GKKDEFWK K +R+VRT +S
Sbjct: 685 RSILDVKAARKNKKDKSAILGIINSKMGLLKVKKKGKKDEFWKNKNKRYVRTDNS 719

BLAST of CSPI01G09080 vs. NCBI nr
Match: gi|645268105|ref|XP_008239377.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Prunus mume])

HSP 1 Score: 965.3 bits (2494), Expect = 5.9e-278
Identity = 505/722 (69.94%), Positives = 591/722 (81.86%), Query Frame = 1

Query: 1   MALVQQH-HLTYPFLSIAGANLKQNTSNSFSFFQSNTQKLACCLCAAS-----PNPSTQS 60
           MAL+ QH  L  PF  +   +     S    F  S   +L     A+S     P P++ S
Sbjct: 1   MALISQHLTLPPPFPFLHNHSPNSPFSKPCLFVFSKNLRLFSLHAASSASTPPPTPTSHS 60

Query: 61  PSPIFLHFFEEEEEEEEEEVPS-KEGHGGNKTEEDWNDPLFRFFKSQTSTTQDPSRESKL 120
            +PIFL F  ++++EEEEE    +E     + EED +DP+ RFFKS++ST QDP RE KL
Sbjct: 61  SNPIFLPFLRDDDDEEEEEPEDLQELEEEEEDEEDPDDPILRFFKSRSST-QDPQREGKL 120

Query: 121 PLQKNRRSSWHLASDVEFFNEAEVTP------EEDKEQLRSASRNSRVLPDGPVGEIVGI 180
            LQKNRRSSW LA D +  +E+E         E+ KEQ R  + +SR L +  V EI+  
Sbjct: 121 SLQKNRRSSWRLADDTQLVDESETDSGIEGVLEQQKEQARQLNFDSRALSEEIVEEILQK 180

Query: 181 ARNLSQNMTLGEALGEFEGRISEKECWEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSR 240
           AR L QN+TLGE LG FEGR+ EKE  +VL L+G+E L++ CLYF+EWMGLQETSLVT R
Sbjct: 181 ARTLPQNLTLGEVLGGFEGRVGEKESVKVLELMGKEGLLMGCLYFYEWMGLQETSLVTPR 240

Query: 241 AYSLLFPLLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAISGLMVCKRYDDACKVYEAM 300
           A S+LFP+LGRAGMG+K+M+LF+NLP K EF+DVHVYN+AISGLM  KRYDDA +VYEAM
Sbjct: 241 ACSVLFPMLGRAGMGDKLMILFRNLPAKNEFRDVHVYNAAISGLMCSKRYDDAWEVYEAM 300

Query: 301 ETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGL 360
           E NN  PDHVTCSIMITVMRK+GRSAKDSW +FE+MN+KGVKWS EVLGALIKSFCDEGL
Sbjct: 301 EANNTLPDHVTCSIMITVMRKVGRSAKDSWQFFERMNRKGVKWSQEVLGALIKSFCDEGL 360

Query: 361 KSQALILQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNI 420
           KS+ALI+Q+EMEKKGV+SN I+YNT+MDAF  SNQ+EEAEG+FAEMKS+G+KPT+A+FNI
Sbjct: 361 KSEALIIQVEMEKKGVSSNAIVYNTLMDAFCNSNQVEEAEGLFAEMKSRGIKPTAATFNI 420

Query: 421 LMNAYSRRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKN 480
           LM+AYSR+MQ EIVEKLLVEM+DMGLEPNVKSYTCLISAYGRQKKMSDMAA+AFLRMKK 
Sbjct: 421 LMSAYSRKMQTEIVEKLLVEMQDMGLEPNVKSYTCLISAYGRQKKMSDMAANAFLRMKKA 480

Query: 481 GIRPTSHSYTALIHAYSVSGWHEKAYSAFENMLREGLKPSIETYTTLLDAFRRAGDTVSL 540
           GI PTSHSYTALIHA+SVSGWHEKAY AFENM +EGLKPSIETYT LLDAFRRAGD   L
Sbjct: 481 GISPTSHSYTALIHAFSVSGWHEKAYIAFENMQKEGLKPSIETYTALLDAFRRAGDAQML 540

Query: 541 MKIWKLMIREKVLGTRVTFNTLLDGFAKHGHYVEARDVISEFDKIGLQPTVMTYNMLMNA 600
           MKIWKLMI+EK+ GT+VTFNTLLDGFAK GHY EARDVISEF  IGLQPTVMTYNMLMNA
Sbjct: 541 MKIWKLMIKEKIEGTKVTFNTLLDGFAKQGHYTEARDVISEFGNIGLQPTVMTYNMLMNA 600

Query: 601 YARGGQHLKLPQLLQEMAARDLKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPD 660
           YARGGQH KLPQLL+EMAA +LKPDSVTYSTMIYA+VRVRDFKRAFFYHK+MVKSG++PD
Sbjct: 601 YARGGQHSKLPQLLKEMAALNLKPDSVTYSTMIYAYVRVRDFKRAFFYHKQMVKSGEMPD 660

Query: 661 VKSYQKLKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKQGKKDEFWKT-KRRHVRTQ 709
            +SY+KL++ILDVK A KN+KD+SAILGIINSKMG++K KK+GKKDE WK  K+R+VRT 
Sbjct: 661 ARSYEKLRAILDVKAARKNKKDRSAILGIINSKMGLLKIKKKGKKDELWKNKKKRYVRTD 720

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP426_ARATH  4.0e-242  64.18         Pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Arabidop...
PP362_ARATH  2.2e-43   27.15         Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN...
PP407_ARATH  5.5e-42   27.80         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
RF1_ORYSI    3.6e-41   27.81         Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1
PP360_ARATH  6.1e-41   25.53         Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN...
Match Name        E-value   Identity (%)  Description
A0A0A0LWH7_CUCSA  0.0e+00   99.58         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G050000 PE=4 SV=1
A0A061G5M6_THECC  2.3e-273  71.08         Pentatricopeptide repeat superfamily protein isoform 1 OS=Theobroma cacao GN=TCM...
W9S5W3_9ROSA      4.1e-270  71.77         Uncharacterized protein OS=Morus notabilis GN=L484_008195 PE=4 SV=1
A5BHI6_VITVI      2.9e-268  72.23         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010606 PE=4 SV=1
D7SHD5_VITVI      2.9e-268  72.23         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09860 PE=4 SV=...
Match Name   E-value   Identity (%)  Description
AT5G50280.1  2.2e-243  64.18         Pentatricopeptide repeat (PPR) superfamily protein
AT5G02860.1  1.3e-44   27.15         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  3.1e-43   27.80         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G01110.1  3.4e-42   25.53         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G18940.1  2.4e-40   24.52         Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity (%)  Description
gi|778657971|ref|XP_004152584.2|  0.0e+00   99.58         PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic ...
gi|659067377|ref|XP_008439140.1|  0.0e+00   95.36         PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic ...
gi|470130284|ref|XP_004301033.1|  1.2e-278  74.18         PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic ...
gi|657965383|ref|XP_008374346.1|  3.4e-278  69.93         PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic ...
gi|645268105|ref|XP_008239377.1|  5.9e-278  69.94         PREDICTED: pentatricopeptide repeat-containing protein At5g50280, chloroplastic ...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G09080.1  CSPI01G09080.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                        Source    Source Term          Source Description
(alignment coordinates and scores are listed beneath each entry)

IPR002885  Pentatricopeptide repeat               PFAM      PF13041              PPR_2
    coord: 366..413  score: 7.5E-15
    coord: 260..305  score: 5.8E-11
    coord: 576..624  score: 5.6

IPR002885  Pentatricopeptide repeat               PFAM      PF13812              PPR_3
    coord: 423..483  score: 8.1E-12
    coord: 496..551  score: 4.1

IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756            TIGR00756
    coord: 404..437  score: 9.6E-6
    coord: 474..507  score: 7.7E-7
    coord: 580..613  score: 1.2E-6
    coord: 510..539  score: 2.3E-4
    coord: 439..472  score: 1.8E-6
    coord: 614..648  score: 2.2E-6
    coord: 544..573  score: 8.9E-5
    coord: 368..401  score: 1.0E-9
    coord: 263..295  score: 1.

IPR002885  Pentatricopeptide repeat               PROFILE   PS51375              PPR
    coord: 331..365  score: 7.936
    coord: 507..541  score: 8.287
    coord: 612..646  score: 11.049
    coord: 366..400  score: 13.439
    coord: 401..435  score: 10.665
    coord: 436..471  score: 11.268
    coord: 577..611  score: 10.457
    coord: 472..506  score: 11.52
    coord: 542..576  score: 10.83
    coord: 295..330  score: 8.374
    coord: 225..255  score: 5.163
    coord: 260..294  score: 10

IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 366..523  score: 9.0E-6
    coord: 559..642  score: 9.

None       No IPR available                       PANTHER   PTHR24015            FAMILY NOT NAMED
    coord: 86..654   score: 4.8E

None       No IPR available                       PANTHER   PTHR24015:SF606      SUBFAMILY NOT NAMED
    coord: 86..654   score: 4.8E
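
The `coord: start..end` entries above give the residue ranges of each predicted repeat or domain. As a rough sketch, not part of the original annotation, the ranges can be pulled out of the text and merged to see which stretches of the protein are covered. The function names `parse_coords`, `merge_ranges` and `covered_length` are hypothetical, and treating directly adjacent ranges as one continuous stretch is an assumption made for this illustration.

```python
import re

# Matches entries such as "coord: 366..413" (inclusive residue positions).
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract (start, end) residue ranges from annotation text."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(ranges):
    """Number of residues covered by at least one range."""
    return sum(end - start + 1 for start, end in merge_ranges(ranges))


# PROFILE PS51375 (PPR) coordinates as listed in the table above.
ppr_text = (
    "coord: 331..365 coord: 507..541 coord: 612..646 coord: 366..400 "
    "coord: 401..435 coord: 436..471 coord: 577..611 coord: 472..506 "
    "coord: 542..576 coord: 295..330 coord: 225..255 coord: 260..294"
)
ppr_ranges = parse_coords(ppr_text)
print(merge_ranges(ppr_ranges))    # [(225, 255), (260, 646)]
print(covered_length(ppr_ranges))  # 418
```

Under these assumptions, the twelve PS51375 matches collapse into two stretches, residues 225..255 and 260..646.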

The following gene(s) are paralogous to this gene:

None