CSPI05G08820 (gene) Wild cucumber (PI 183967)

Name: CSPI05G08820
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr5 : 7501116 .. 7503791 (-)
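
The location string can be turned into numeric coordinates for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (common for GFF-style annotations, but not stated in this record); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr5 : 7501116 .. 7503791 (-)'.

    Returns (chromosome, start, end, strand).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr5 : 7501116 .. 7503791 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2676 positions on the minus strand of Chr5.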
The following sequences are available for this feature:

Gene sequence (with intron)

CCCTCTCAAATAGGACTATTTTCCTTAAACCCCGAATTTATATTTGTAGCCATTTTGAAAAGTAGCCCTTTTAGTTCTTCTTCTTCTTCTTCTCCTCTCTCTCTCTCTCAGGTTCTTGATGGCAAAACCCTAGTTGCCATTATTGGGTGGTCATGGAAACTCCTTCAACTTCACACCAGGGACCAATCCAGGAAGAGAATCTAGAACAGCAACTGATTCAAACCATAACAACGATCCTCAATGCCACCAAACCATCTCTCAGTGCACTTGCCCCATATGCCGCACACCTTTCTCCTTCCTTGATATCCTCCATTTTCGCCTCCAAAGCTCTAAGTTCTCATCCCTCCGTTCTTTTAAATGTCTTCAAATGGGCTCAGAAACATGTCCCCTCTTTCTCTTCCCCACCCAATAATTCTCTCTCTTCTCTTCTTACCCTCTTACCTTCTCTGTTTCGCCATTACATGTTCTCCGATGCCAAATCCCTCCTTATTTCCTTCATTTCTTCCGATCGCCAACACGAACTCCATAAATTGATTCTCCATCCCACTCGTGATCTTCCCGAGCCGTCCAAAGAGCTACTGGATACCTCCATTGGCGCCTACGTGCAAATGGACCAGCCCCATCTTGCTACTCAGATCTTCAACAAGATGAAGCGGCTTAACTACCGGCCGAATTTGCTTACCTGCAACACATTGATGAATTCCTTGGTAAGATACCCGTCTTCGAGTTCTATTCTATTGGCTAGACAAGTATTAAAGGATTCGATTAAACTCGGCGTGGTACCGAATACTAATAGCTTCAATATTTTGATATATGGGTATTGCTTAGAGAGTAAAGTTAAGGATGCACTGGATTGGGTGAATAAAATGAGTGAGTTTGGTTGTGTACCGGATACTGTGAGTTATAATACGATATTGGATGCATTGTTAAAGAGGAGACTGTTACAAGAGGCCCGGGACTTGCTGTTGGACATGAAAAGTAAAGGGCTGTCGCCAAATAAGCATACGTATAATATGTTGGTTTGTGGATATTGCAGACTGGGGTTACTGAAGGAGGCTACCAAGGTGATCGAAATAATGACGCGTAATAATTTGTTGCCTACTGTTTGGACTTATAATATGTTGGTTAATGGGTTTTGTAATGATGGTAAGATTGATGAGGCTTTTAGGATAAGAGATGAGATGGAGAAAATGAATGTCTTGCCTGACGTGGTTACCTATAACACATTGATTGATGGGTGTTCCCAGTGGCGGGATAGTTCTGAGGTATATAGTTTGATTGAAGAAATGGATAAGAAAGGAGTGAAGTGTAATGCAGTTACTTACAATATAATACTGAAATGGATGTGTAAGAAAGGGAATATGACTGAAGCAACCACTACTCTTGATAAGATGGAAGAAAATGGACTCTCCCCTGATTGTGTGACGTACAATACTCTAATAGGTGCTTATTGTAAGGCTGGAAAAATGGGAAAAGCGTTTAGAATGATGGATGAAATGACTAGTAAAGGTTTGAAAATTGATACTTGGACCTTGAATACGATTCTCCATTGTCTCAGTGTGGAGAAAAAACTTGATGAGGCATACAACTTATTATGCAGTGCTAGTAAGCGGGGCTATATTCTTGATGAGGTTAGCTATGGTATTCTGATTTTGGGTTACTTCAAAGATGAAAAGGGAGACAGAGCCTTGAATCTTTGGGATGAAATGAAGGAAAGACAGATTATGCCAAGCACCATCACCTATAATTCTGTGATTGGAGGACTATGTCAGTCTAGGAAAGTTGATCAAGCTATAGATAAGCTGAATGAGATGCTTGAGAATGGATTAGTTCCTGATGAAACTACTTACAACATAATTATCCATGGCTTTTGTTTGGAAGGGAATGTGGAAAAAGCATTCCAATTCCACAACGAAATGATTAAGAATTTATTCAAGCCAGATGTCTATACTTGTAATATTCTTCTTCGTGGGTTATGCAGAGAGGGTATGCTAGAGAAG
GCTCTTAAGCTGTTCAATACTTTGGTTTCTAAAGGCAAAGACATTGATGTAGTTACGTATAATACCATAATATCTAGTCTGTGCAAAGAAGGGAAATTCGAGAATGCTTATGATCTTCTTACTGAAATGGAAGCGAAAAAGTTAGGTCCTGATCAATATACTTACAAAGTGATTATTGCTGCTCTAACAGATGCGGGAAGGATTAAGGAGGCGGAGGAATTTACATTGAAAATGGTTGAATCGGGAATAGTGCATGATCAGAATTTGAAATTGGGCAAAGGGCAGAATGTGCTAACCTCTGAAGTTTCGGAACATTTTGATTTCAAGTCTATAGCTTACTCGGATCAGATCAATGAACTATGTAATCAACATAAGTATAAGGATGCAATGCACCTATTTGTCGAAGTTACAAAGGAAGGTGTTGCTTTAAACAAATACACTTATCTAAATTTGATGGAGGGGCTGATTAAGAGGCGTAAAAGCACATCAAAGGCCAGCCGGTGATTATTAAATCTTTACAACTTCACCAATGTCAAATAGGACCAGCCTCTCTTCAGGTTCTACTCAAATCCAAAGGGGTTTTTCGCTCTAGGATGTAGGAATGTTGGAACAAGGTTACTTTACTAGACATGGGTTTACTGTGTACTTTGATGATCATGTGGTGTGATGTTAACTA

mRNA sequence

ATGGAAACTCCTTCAACTTCACACCAGGGACCAATCCAGGAAGAGAATCTAGAACAGCAACTGATTCAAACCATAACAACGATCCTCAATGCCACCAAACCATCTCTCAGTGCACTTGCCCCATATGCCGCACACCTTTCTCCTTCCTTGATATCCTCCATTTTCGCCTCCAAAGCTCTAAGTTCTCATCCCTCCGTTCTTTTAAATGTCTTCAAATGGGCTCAGAAACATGTCCCCTCTTTCTCTTCCCCACCCAATAATTCTCTCTCTTCTCTTCTTACCCTCTTACCTTCTCTGTTTCGCCATTACATGTTCTCCGATGCCAAATCCCTCCTTATTTCCTTCATTTCTTCCGATCGCCAACACGAACTCCATAAATTGATTCTCCATCCCACTCGTGATCTTCCCGAGCCGTCCAAAGAGCTACTGGATACCTCCATTGGCGCCTACGTGCAAATGGACCAGCCCCATCTTGCTACTCAGATCTTCAACAAGATGAAGCGGCTTAACTACCGGCCGAATTTGCTTACCTGCAACACATTGATGAATTCCTTGGTAAGATACCCGTCTTCGAGTTCTATTCTATTGGCTAGACAAGTATTAAAGGATTCGATTAAACTCGGCGTGGTACCGAATACTAATAGCTTCAATATTTTGATATATGGGTATTGCTTAGAGAGTAAAGTTAAGGATGCACTGGATTGGGTGAATAAAATGAGTGAGTTTGGTTGTGTACCGGATACTGTGAGTTATAATACGATATTGGATGCATTGTTAAAGAGGAGACTGTTACAAGAGGCCCGGGACTTGCTGTTGGACATGAAAAGTAAAGGGCTGTCGCCAAATAAGCATACGTATAATATGTTGGTTTGTGGATATTGCAGACTGGGGTTACTGAAGGAGGCTACCAAGGTGATCGAAATAATGACGCGTAATAATTTGTTGCCTACTGTTTGGACTTATAATATGTTGGTTAATGGGTTTTGTAATGATGGTAAGATTGATGAGGCTTTTAGGATAAGAGATGAGATGGAGAAAATGAATGTCTTGCCTGACGTGGTTACCTATAACACATTGATTGATGGGTGTTCCCAGTGGCGGGATAGTTCTGAGGTATATAGTTTGATTGAAGAAATGGATAAGAAAGGAGTGAAGTGTAATGCAGTTACTTACAATATAATACTGAAATGGATGTGTAAGAAAGGGAATATGACTGAAGCAACCACTACTCTTGATAAGATGGAAGAAAATGGACTCTCCCCTGATTGTGTGACGTACAATACTCTAATAGGTGCTTATTGTAAGGCTGGAAAAATGGGAAAAGCGTTTAGAATGATGGATGAAATGACTAGTAAAGGTTTGAAAATTGATACTTGGACCTTGAATACGATTCTCCATTGTCTCAGTGTGGAGAAAAAACTTGATGAGGCATACAACTTATTATGCAGTGCTAGTAAGCGGGGCTATATTCTTGATGAGGTTAGCTATGGTATTCTGATTTTGGGTTACTTCAAAGATGAAAAGGGAGACAGAGCCTTGAATCTTTGGGATGAAATGAAGGAAAGACAGATTATGCCAAGCACCATCACCTATAATTCTGTGATTGGAGGACTATGTCAGTCTAGGAAAGTTGATCAAGCTATAGATAAGCTGAATGAGATGCTTGAGAATGGATTAGTTCCTGATGAAACTACTTACAACATAATTATCCATGGCTTTTGTTTGGAAGGGAATGTGGAAAAAGCATTCCAATTCCACAACGAAATGATTAAGAATTTATTCAAGCCAGATGTCTATACTTGTAATATTCTTCTTCGTGGGTTATGCAGAGAGGGTATGCTAGAGAAGGCTCTTAAGCTGTTCAATACTTTGGTTTCTAAAGGCAAAGACATTGATGTAGTTACGTATAATACCATAATATCTAGTCTGTGCAAAGAAGGGAAATTCGAGAATGCTTATGATCTTCTTACTGAAATGGAAGCGAAAAAGTTAGGTCCTGA
TCAATATACTTACAAAGTGATTATTGCTGCTCTAACAGATGCGGGAAGGATTAAGGAGGCGGAGGAATTTACATTGAAAATGGTTGAATCGGGAATAGTGCATGATCAGAATTTGAAATTGGGCAAAGGGCAGAATGTGCTAACCTCTGAAGTTTCGGAACATTTTGATTTCAAGTCTATAGCTTACTCGGATCAGATCAATGAACTATGTAATCAACATAAGTATAAGGATGCAATGCACCTATTTGTCGAAGTTACAAAGGAAGGTGTTGCTTTAAACAAATACACTTATCTAAATTTGATGGAGGGGCTGATTAAGAGGCGTAAAAGCACATCAAAGGCCAGCCGGTGA

Coding sequence (CDS)

ATGGAAACTCCTTCAACTTCACACCAGGGACCAATCCAGGAAGAGAATCTAGAACAGCAACTGATTCAAACCATAACAACGATCCTCAATGCCACCAAACCATCTCTCAGTGCACTTGCCCCATATGCCGCACACCTTTCTCCTTCCTTGATATCCTCCATTTTCGCCTCCAAAGCTCTAAGTTCTCATCCCTCCGTTCTTTTAAATGTCTTCAAATGGGCTCAGAAACATGTCCCCTCTTTCTCTTCCCCACCCAATAATTCTCTCTCTTCTCTTCTTACCCTCTTACCTTCTCTGTTTCGCCATTACATGTTCTCCGATGCCAAATCCCTCCTTATTTCCTTCATTTCTTCCGATCGCCAACACGAACTCCATAAATTGATTCTCCATCCCACTCGTGATCTTCCCGAGCCGTCCAAAGAGCTACTGGATACCTCCATTGGCGCCTACGTGCAAATGGACCAGCCCCATCTTGCTACTCAGATCTTCAACAAGATGAAGCGGCTTAACTACCGGCCGAATTTGCTTACCTGCAACACATTGATGAATTCCTTGGTAAGATACCCGTCTTCGAGTTCTATTCTATTGGCTAGACAAGTATTAAAGGATTCGATTAAACTCGGCGTGGTACCGAATACTAATAGCTTCAATATTTTGATATATGGGTATTGCTTAGAGAGTAAAGTTAAGGATGCACTGGATTGGGTGAATAAAATGAGTGAGTTTGGTTGTGTACCGGATACTGTGAGTTATAATACGATATTGGATGCATTGTTAAAGAGGAGACTGTTACAAGAGGCCCGGGACTTGCTGTTGGACATGAAAAGTAAAGGGCTGTCGCCAAATAAGCATACGTATAATATGTTGGTTTGTGGATATTGCAGACTGGGGTTACTGAAGGAGGCTACCAAGGTGATCGAAATAATGACGCGTAATAATTTGTTGCCTACTGTTTGGACTTATAATATGTTGGTTAATGGGTTTTGTAATGATGGTAAGATTGATGAGGCTTTTAGGATAAGAGATGAGATGGAGAAAATGAATGTCTTGCCTGACGTGGTTACCTATAACACATTGATTGATGGGTGTTCCCAGTGGCGGGATAGTTCTGAGGTATATAGTTTGATTGAAGAAATGGATAAGAAAGGAGTGAAGTGTAATGCAGTTACTTACAATATAATACTGAAATGGATGTGTAAGAAAGGGAATATGACTGAAGCAACCACTACTCTTGATAAGATGGAAGAAAATGGACTCTCCCCTGATTGTGTGACGTACAATACTCTAATAGGTGCTTATTGTAAGGCTGGAAAAATGGGAAAAGCGTTTAGAATGATGGATGAAATGACTAGTAAAGGTTTGAAAATTGATACTTGGACCTTGAATACGATTCTCCATTGTCTCAGTGTGGAGAAAAAACTTGATGAGGCATACAACTTATTATGCAGTGCTAGTAAGCGGGGCTATATTCTTGATGAGGTTAGCTATGGTATTCTGATTTTGGGTTACTTCAAAGATGAAAAGGGAGACAGAGCCTTGAATCTTTGGGATGAAATGAAGGAAAGACAGATTATGCCAAGCACCATCACCTATAATTCTGTGATTGGAGGACTATGTCAGTCTAGGAAAGTTGATCAAGCTATAGATAAGCTGAATGAGATGCTTGAGAATGGATTAGTTCCTGATGAAACTACTTACAACATAATTATCCATGGCTTTTGTTTGGAAGGGAATGTGGAAAAAGCATTCCAATTCCACAACGAAATGATTAAGAATTTATTCAAGCCAGATGTCTATACTTGTAATATTCTTCTTCGTGGGTTATGCAGAGAGGGTATGCTAGAGAAGGCTCTTAAGCTGTTCAATACTTTGGTTTCTAAAGGCAAAGACATTGATGTAGTTACGTATAATACCATAATATCTAGTCTGTGCAAAGAAGGGAAATTCGAGAATGCTTATGATCTTCTTACTGAAATGGAAGCGAAAAAGTTAGGTCCTGA
TCAATATACTTACAAAGTGATTATTGCTGCTCTAACAGATGCGGGAAGGATTAAGGAGGCGGAGGAATTTACATTGAAAATGGTTGAATCGGGAATAGTGCATGATCAGAATTTGAAATTGGGCAAAGGGCAGAATGTGCTAACCTCTGAAGTTTCGGAACATTTTGATTTCAAGTCTATAGCTTACTCGGATCAGATCAATGAACTATGTAATCAACATAAGTATAAGGATGCAATGCACCTATTTGTCGAAGTTACAAAGGAAGGTGTTGCTTTAAACAAATACACTTATCTAAATTTGATGGAGGGGCTGATTAAGAGGCGTAAAAGCACATCAAAGGCCAGCCGGTGA
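
A quick way to sanity-check a coding sequence is to translate it and compare the result with the query residues shown in the BLAST alignments. The sketch below uses the standard genetic code (NCBI translation table 1), which is an assumption for this nuclear gene; `translate` is a hypothetical helper, and only the first and last few codons of the CDS above are used as input.

```python
# Standard genetic code (NCBI table 1), codons ordered by base index in "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First nine codons of the CDS above.
print(translate("ATGGAAACTCCTTCAACTTCACACCAG"))
# Last four codons of the CDS above, ending in the TGA stop codon.
print(translate("GCCAGCCGGTGA"))
```

The first call yields the residues that open the query in the alignments (METPSTSHQ), and the second ends in a stop codon, as expected for a complete CDS.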
BLAST of CSPI05G08820 vs. Swiss-Prot
Match: PP156_ARATH (Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana GN=At2g16880 PE=2 SV=1)

HSP 1 Score: 701.8 bits (1810), Expect = 8.3e-201
Identity = 361/732 (49.32%), Positives = 509/732 (69.54%), Query Frame = 1

Query: 18  EQQLIQTITTILNATKPS-LSALAPYAAHLSPSLISSIFASKALSSHPSVLLNVFKWAQK 77
           E QL++T+T+IL + K   L  L PY   ++  L++S+ +S +L+  P  L++ F+WAQ 
Sbjct: 8   ESQLLKTLTSILTSEKTHFLETLNPYIPQITQPLLTSLLSSPSLAKKPETLVSFFQWAQT 67

Query: 78  HVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFI-SSDRQHELHKLILHPTRDL 137
            +P   + P++S   L++++ SL  H+ F+DAKSLL+S+I +SD    L   +LHP   L
Sbjct: 68  SIPE--AFPSDSPLPLISVVRSLLSHHKFADAKSLLVSYIRTSDASLSLCNSLLHPNLHL 127

Query: 138 -PEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSI 197
            P PSK L D ++ AY+   +PH+A QIF KM RL  +PNLLTCNTL+  LVRYPSS SI
Sbjct: 128 SPPPSKALFDIALSAYLHEGKPHVALQIFQKMIRLKLKPNLLTCNTLLIGLVRYPSSFSI 187

Query: 198 LLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKM-SEFGCVPDTVSYNT 257
             AR+V  D +K+GV  N  +FN+L+ GYCLE K++DAL  + +M SEF   PD V+YNT
Sbjct: 188 SSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKLEDALGMLERMVSEFKVNPDNVTYNT 247

Query: 258 ILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNN 317
           IL A+ K+  L + ++LLLDMK  GL PN+ TYN LV GYC+LG LKEA +++E+M + N
Sbjct: 248 ILKAMSKKGRLSDLKELLLDMKKNGLVPNRVTYNNLVYGYCKLGSLKEAFQIVELMKQTN 307

Query: 318 LLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVY 377
           +LP + TYN+L+NG CN G + E   + D M+ + + PDVVTYNTLIDGC +   S E  
Sbjct: 308 VLPDLCTYNILINGLCNAGSMREGLELMDAMKSLKLQPDVVTYNTLIDGCFELGLSLEAR 367

Query: 378 SLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEE-NGLSPDCVTYNTLIGA 437
            L+E+M+  GVK N VT+NI LKW+CK+      T  + ++ + +G SPD VTY+TLI A
Sbjct: 368 KLMEQMENDGVKANQVTHNISLKWLCKEEKREAVTRKVKELVDMHGFSPDIVTYHTLIKA 427

Query: 438 YCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILD 497
           Y K G +  A  MM EM  KG+K++T TLNTIL  L  E+KLDEA+NLL SA KRG+I+D
Sbjct: 428 YLKVGDLSGALEMMREMGQKGIKMNTITLNTILDALCKERKLDEAHNLLNSAHKRGFIVD 487

Query: 498 EVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLN 557
           EV+YG LI+G+F++EK ++AL +WDEMK+ +I P+  T+NS+IGGLC   K + A++K +
Sbjct: 488 EVTYGTLIMGFFREEKVEKALEMWDEMKKVKITPTVSTFNSLIGGLCHHGKTELAMEKFD 547

Query: 558 EMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREG 617
           E+ E+GL+PD++T+N II G+C EG VEKAF+F+NE IK+ FKPD YTCNILL GLC+EG
Sbjct: 548 ELAESGLLPDDSTFNSIILGYCKEGRVEKAFEFYNESIKHSFKPDNYTCNILLNGLCKEG 607

Query: 618 MLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYK 677
           M EKAL  FNTL+ + +++D VTYNT+IS+ CK+ K + AYDLL+EME K L PD++TY 
Sbjct: 608 MTEKALNFFNTLIEE-REVDTVTYNTMISAFCKDKKLKEAYDLLSEMEEKGLEPDRFTYN 667

Query: 678 VIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEHFDFKSIAYSDQ 737
             I+ L + G++ E +E   K         ++L++   +N  TSE  E  + ++IAYSD 
Sbjct: 668 SFISLLMEDGKLSETDELLKKFSGKFGSMKRDLQVETEKNPATSESKEELNTEAIAYSDV 727

Query: 738 INELCNQHKYKD 745
           I+ELC++ + K+
Sbjct: 728 IDELCSRGRLKE 736
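
The HSP summary lines follow a fixed layout, so the statistics can be extracted and the percentages recomputed from the raw counts. This is a minimal sketch, assuming the two-line header format shown in this record; `parse_hsp` and the dictionary keys are hypothetical names, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Patterns for the two HSP header lines as they appear in this record.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENT = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp(score_line, identity_line):
    """Extract HSP statistics and recompute percentages from the counts."""
    score = HSP_SCORE.search(score_line)
    ident = HSP_IDENT.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognized HSP header lines")
    identical, length, positives, _ = (int(x) for x in ident.groups())
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 701.8 bits (1810), Expect = 8.3e-201",
    "Identity = 361/732 (49.32%), Postives = 509/732 (69.54%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing from the counts (361/732 and 509/732) reproduces the percentages reported for the first Swiss-Prot hit.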

BLAST of CSPI05G08820 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 1.6e-82
Identity = 176/564 (31.21%), Positives = 301/564 (53.37%), Query Frame = 1

Query: 216 FNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLK-RRLLQEARDLLLDM 275
           F++++  Y   S +  AL  V+     G +P  +SYN +LDA ++ +R +  A ++  +M
Sbjct: 137 FDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEM 196

Query: 276 KSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKI 335
               +SPN  TYN+L+ G+C  G +  A  + + M     LP V TYN L++G+C   KI
Sbjct: 197 LESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKI 256

Query: 336 DEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNII 395
           D+ F++   M    + P++++YN +I+G  +     EV  ++ EM+++G   + VTYN +
Sbjct: 257 DDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTL 316

Query: 396 LKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGL 455
           +K  CK+GN  +A     +M  +GL+P  +TY +LI + CKAG M +A   +D+M  +GL
Sbjct: 317 IKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGL 376

Query: 456 KIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALN 515
             +  T  T++   S +  ++EAY +L   +  G+    V+Y  LI G+    K + A+ 
Sbjct: 377 CPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIA 436

Query: 516 LWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFC 575
           + ++MKE+ + P  ++Y++V+ G C+S  VD+A+    EM+E G+ PD  TY+ +I GFC
Sbjct: 437 VLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFC 496

Query: 576 LEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVV 635
            +   ++A   + EM++    PD +T   L+   C EG LEKAL+L N +V KG   DVV
Sbjct: 497 EQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVV 556

Query: 636 TYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKM 695
           TY+ +I+ L K+ +   A  LL ++  ++  P   TY  +         I+       K 
Sbjct: 557 TYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTL---------IENCSNIEFKS 616

Query: 696 VESGIVHDQNLK--LGKGQNVLTSEVSEHFDFKSIAYSDQINELCNQHKYKDAMHLFVEV 755
           V S ++    +K  + +   V  S + ++      AY+  I+  C     + A  L+ E+
Sbjct: 617 VVS-LIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEM 676

Query: 756 TKEGVALNKYTYLNLMEGLIKRRK 777
            K G  L+  T + L++ L K  K
Sbjct: 677 VKSGFLLHTVTVIALVKALHKEGK 690

BLAST of CSPI05G08820 vs. Swiss-Prot
Match: PPR99_ARATH (Pentatricopeptide repeat-containing protein At1g63130, mitochondrial OS=Arabidopsis thaliana GN=At1g63130 PE=2 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 3.9e-81
Identity = 164/531 (30.89%), Positives = 293/531 (55.18%), Query Frame = 1

Query: 159 ATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNI 218
           A  +F  M +    P+++  + L++++ +      ++   + +++   LG+  N  +++I
Sbjct: 65  AVNLFGDMVKSRPFPSIVEFSKLLSAIAKMNKFDLVISLGEQMQN---LGISHNLYTYSI 124

Query: 219 LIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKG 278
           LI  +C  S++  AL  + KM + G  PD V+ N++L+       + +A  L+  M   G
Sbjct: 125 LINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLNSLLNGFCHGNRISDAVSLVGQMVEMG 184

Query: 279 LSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAF 338
             P+  T+N L+ G  R     EA  +++ M      P + TY ++VNG C  G ID A 
Sbjct: 185 YQPDSFTFNTLIHGLFRHNRASEAVALVDRMVVKGCQPDLVTYGIVVNGLCKRGDIDLAL 244

Query: 339 RIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWM 398
            +  +ME+  + P VV YNT+ID    +++ ++  +L  EMD KG++ N VTYN +++ +
Sbjct: 245 SLLKKMEQGKIEPGVVIYNTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCL 304

Query: 399 CKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDT 458
           C  G  ++A+  L  M E  ++P+ VT++ LI A+ K GK+ +A ++ DEM  + +  D 
Sbjct: 305 CNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDI 364

Query: 459 WTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDE 518
           +T +++++   +  +LDEA ++      +    + V+Y  LI G+ K ++ D  + L+ E
Sbjct: 365 FTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVDEGMELFRE 424

Query: 519 MKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGN 578
           M +R ++ +T+TY ++I G  Q+R+ D A     +M+ +G++PD  TY+I++ G C  G 
Sbjct: 425 MSQRGLVGNTVTYTTLIHGFFQARECDNAQIVFKQMVSDGVLPDIMTYSILLDGLCNNGK 484

Query: 579 VEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNT 638
           VE A      + ++  +PD+YT NI++ G+C+ G +E    LF +L  KG   +VVTY T
Sbjct: 485 VETALVVFEYLQRSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVVTYTT 544

Query: 639 IISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIAA-LTDAGRIKEAE 689
           ++S  C++G  E A  L  EM+ +   PD  TY  +I A L D  +   AE
Sbjct: 545 MMSGFCRKGLKEEADALFREMKEEGPLPDSGTYNTLIRAHLRDGDKAASAE 592

BLAST of CSPI05G08820 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 1.6e-79
Identity = 204/700 (29.14%), Positives = 349/700 (49.86%), Query Frame = 1

Query: 29  LNATKPS--LSALAPYAAHLSPSLISSIFASKALSSHP--SVLLNVFKWAQKHVPSFSSP 88
           LN T PS  +S  +P++A LS + +  +    +L S P  S  L +F  A K  P+FS  
Sbjct: 27  LNLTPPSSTISFASPHSAALSSTDVKLL---DSLRSQPDDSAALRLFNLASKK-PNFSPE 86

Query: 89  PNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQHELHKLILHPTRDLPEPSKELLD 148
           P    +    +L  L R   F D K +L    SS  +      ++               
Sbjct: 87  P----ALYEEILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLI--------------- 146

Query: 149 TSIGAYVQMD-QPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILLARQVLKD 208
             I +Y Q + Q  + + +   +     +P+    N ++N LV    +S  L+     K 
Sbjct: 147 -LIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLV--DGNSLKLVEISHAKM 206

Query: 209 SIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLKRRL 268
           S+  G+ P+ ++FN+LI   C   +++ A+  +  M  +G VPD  ++ T++   ++   
Sbjct: 207 SV-WGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGD 266

Query: 269 LQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMT-RNNLLPTVWTYN 328
           L  A  +   M   G S +  + N++V G+C+ G +++A   I+ M+ ++   P  +T+N
Sbjct: 267 LDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFN 326

Query: 329 MLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKK 388
            LVNG C  G +  A  I D M +    PDV TYN++I G  +  +  E   ++++M  +
Sbjct: 327 TLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITR 386

Query: 389 GVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKA 448
               N VTYN ++  +CK+  + EAT     +   G+ PD  T+N+LI   C       A
Sbjct: 387 DCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVA 446

Query: 449 FRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILG 508
             + +EM SKG + D +T N ++  L  + KLDEA N+L      G     ++Y  LI G
Sbjct: 447 MELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDG 506

Query: 509 YFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPD 568
           + K  K   A  ++DEM+   +  +++TYN++I GLC+SR+V+ A   +++M+  G  PD
Sbjct: 507 FCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPD 566

Query: 569 ETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFN 628
           + TYN ++  FC  G+++KA      M  N  +PD+ T   L+ GLC+ G +E A KL  
Sbjct: 567 KYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLR 626

Query: 629 TLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEM-EAKKLGPDQYTYKVIIAALTD- 688
           ++  KG ++    YN +I  L ++ K   A +L  EM E  +  PD  +Y+++   L + 
Sbjct: 627 SIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNG 686

Query: 689 AGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSE 721
            G I+EA +F ++++E G V + +      + +LT  + E
Sbjct: 687 GGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEE 699

BLAST of CSPI05G08820 vs. Swiss-Prot
Match: PPR96_ARATH (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 296.2 bits (757), Expect = 1.1e-78
Identity = 166/544 (30.51%), Positives = 292/544 (53.68%), Query Frame = 1

Query: 159 ATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNI 218
           A  +F +M +    P+++  N L++++ +      ++   + +++   L +  +  S+NI
Sbjct: 64  AVDLFGEMVQSRPLPSIVEFNKLLSAIAKMNKFDLVISLGERMQN---LRISYDLYSYNI 123

Query: 219 LIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKG 278
           LI  +C  S++  AL  + KM + G  PD V+ +++L+     + + EA  L+  M    
Sbjct: 124 LINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRISEAVALVDQMFVME 183

Query: 279 LSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAF 338
             PN  T+N L+ G        EA  +I+ M      P ++TY  +VNG C  G ID A 
Sbjct: 184 YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL 243

Query: 339 RIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWM 398
            +  +MEK  +  DVV Y T+ID    +++ ++  +L  EMD KG++ N VTYN +++ +
Sbjct: 244 SLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCL 303

Query: 399 CKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDT 458
           C  G  ++A+  L  M E  ++P+ VT++ LI A+ K GK+ +A ++ DEM  + +  D 
Sbjct: 304 CNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDI 363

Query: 459 WTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDE 518
           +T +++++   +  +LDEA ++      +    + V+Y  LI G+ K ++ +  + L+ E
Sbjct: 364 FTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMELFRE 423

Query: 519 MKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGN 578
           M +R ++ +T+TYN++I GL Q+   D A     +M+ +G+ PD  TY+I++ G C  G 
Sbjct: 424 MSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGK 483

Query: 579 VEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNT 638
           +EKA      + K+  +PD+YT NI++ G+C+ G +E    LF +L  KG   +V+ Y T
Sbjct: 484 LEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTT 543

Query: 639 IISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESG 698
           +IS  C++G  E A  L  EM+     P+  TY  +I A    G    + E   +M   G
Sbjct: 544 MISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIKEMRSCG 603

Query: 699 IVHD 703
            V D
Sbjct: 604 FVGD 604

BLAST of CSPI05G08820 vs. TrEMBL
Match: A5AMQ4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021300 PE=4 SV=1)

HSP 1 Score: 1015.0 bits (2623), Expect = 4.9e-293
Identity = 499/782 (63.81%), Positives = 620/782 (79.28%), Query Frame = 1

Query: 3   TPSTSHQGPIQEENLE-QQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKALS 62
           TP  S   P+    L  Q+LIQTITTIL +    L AL  Y   L+P L+ SI +SK L 
Sbjct: 4   TPPESSPPPLPPAPLPPQELIQTITTILASNNMPLQALNTYIPQLTPPLVLSILSSKTLI 63

Query: 63  SHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQ 122
           S P++L++ FKWAQ ++P+F   P+NSL SLL+LLPSLF H  FSDAKSLL+ FI++DR+
Sbjct: 64  SRPNILISFFKWAQTNLPTF---PHNSLPSLLSLLPSLFSHRKFSDAKSLLLGFIATDRR 123

Query: 123 HELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTL 182
           H+LH  IL     L  PSK LLDT+IGAYVQ  QPH A QIF KMKRL  RPNLLTCNTL
Sbjct: 124 HDLHLSILR----LTSPSKALLDTAIGAYVQSGQPHHAFQIFKKMKRLRLRPNLLTCNTL 183

Query: 183 MNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSE 242
           +NSLVRYPSS S+  +R+   D+IKLG+VPN N+FNI+IYGYCLE+K KDA++++N M +
Sbjct: 184 LNSLVRYPSSHSVSFSREAFNDAIKLGIVPNVNTFNIVIYGYCLENKFKDAVEFLNVMGK 243

Query: 243 FGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKE 302
           + C PD V+YNTILDAL K+  L +ARDLL+DMKS+GL PN++TYN+LV GYC++G LKE
Sbjct: 244 YNCSPDNVTYNTILDALCKKGRLGDARDLLMDMKSRGLLPNRNTYNILVYGYCKMGWLKE 303

Query: 303 ATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLID 362
           A  VIE+MT+NNLLP VWTYNML+NG CN+G+I+EAF++RDEME + +LPDVV+YNTLI+
Sbjct: 304 AANVIELMTQNNLLPDVWTYNMLINGLCNEGRIEEAFKLRDEMENLKLLPDVVSYNTLIN 363

Query: 363 GCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSP 422
           GC +W   SE + L+EEM +KGVK NAVT+NI++KW CK+G M +A+ T+ KMEE+G SP
Sbjct: 364 GCLEWSKISEAFKLLEEMSEKGVKPNAVTHNIMVKWYCKEGKMDDASNTITKMEESGFSP 423

Query: 423 DCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLL 482
           DCVTYNTLI  YCKAG MG+AFR MDEM  K +K+D+ TLNTIL  L  EKKL+EAY LL
Sbjct: 424 DCVTYNTLINGYCKAGNMGEAFRTMDEMGRKNMKMDSVTLNTILRTLCREKKLEEAYKLL 483

Query: 483 CSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQS 542
            SA KRGY +DEVSYG LI+GYFKD   DRAL LWDEMKE++I+PST+TYN +IGGLCQ 
Sbjct: 484 SSARKRGYFIDEVSYGTLIVGYFKDGNVDRALKLWDEMKEKEIIPSTVTYNCIIGGLCQC 543

Query: 543 RKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTC 602
            K +QAI KLNE+LE+GL+PDETTYN I+HG+C EG+VEKAFQFHN+M++N FKPDV+TC
Sbjct: 544 GKTEQAISKLNELLESGLLPDETTYNTILHGYCREGDVEKAFQFHNKMVENSFKPDVFTC 603

Query: 603 NILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEA 662
           NILLRGLC EGMLEKALKLFNT VSKGK ID VTYNT+I+SLCKEG+ ++A++LL+EME 
Sbjct: 604 NILLRGLCMEGMLEKALKLFNTWVSKGKAIDTVTYNTLITSLCKEGRLDDAFNLLSEMEE 663

Query: 663 KKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEH 722
           K+LGPD YTY  II ALTD+GRI+EAEEF  KM+E G +  Q L+L   + V+TSE SE 
Sbjct: 664 KELGPDHYTYNAIITALTDSGRIREAEEFMSKMLEKGXLPXQVLQLDXNETVVTSETSEE 723

Query: 723 FDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSKA 782
            D  S+AYS+ I ELC + KYKDAM +F E  ++G+ ++K TY+NLM+GLIKRRKS SK 
Sbjct: 724 SDSSSVAYSEWIKELCTEGKYKDAMRIFGESKQKGITVDKSTYINLMDGLIKRRKSISKE 778

Query: 783 SR 784
           +R
Sbjct: 784 AR 778

BLAST of CSPI05G08820 vs. TrEMBL
Match: A0A067JIV6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01815 PE=4 SV=1)

HSP 1 Score: 957.6 bits (2474), Expect = 9.4e-276
Identity = 478/769 (62.16%), Positives = 599/769 (77.89%), Query Frame = 1

Query: 18  EQQLIQTITTILNATKPSLSALAPYAAHLS--PSLISSIFASKALSSHPSVLLNVFKWAQ 77
           E QL++++T IL + K  L AL PY  + S  P+L+ S+ +SK+LS  P+ LL+ FKW Q
Sbjct: 16  ETQLLKSLTKILTSDKFPLQALRPYIPNFSSNPNLLISLLSSKSLSHRPTTLLSFFKWMQ 75

Query: 78  KHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQHELHKLILHPTRDL 137
            H+P   S   +S   LL+LL  L  H+ FSDAKSLL SFI++D+ + LH  ILHP   +
Sbjct: 76  SHLPPSIS---HSPLPLLSLLSPLLSHHKFSDAKSLLTSFIANDKSNILHHHILHPPAVV 135

Query: 138 PEPS--KELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSS 197
                 K LLDTSIGAYV   +PH A +IF+KMKRL+  PNLLTCNTL+N+LV+YPS  S
Sbjct: 136 ENRRTLKSLLDTSIGAYVASGKPHHAAEIFHKMKRLHLTPNLLTCNTLLNALVKYPSKHS 195

Query: 198 ILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNT 257
           + LA+ +  D IKLGV  NT++FNILIYG CLESK+ +A+  + KM EFGC+PD VSYNT
Sbjct: 196 VCLAKDIFNDVIKLGVKVNTSTFNILIYGCCLESKLGEAIGLIGKMKEFGCLPDNVSYNT 255

Query: 258 ILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNN 317
           ILD L K+  L EARD+LLDMK+KGL+PNK T+N+LVCGYC+LG LKEAT+VIE+M++NN
Sbjct: 256 ILDVLCKKGKLNEARDMLLDMKNKGLTPNKSTFNILVCGYCKLGWLKEATRVIELMSQNN 315

Query: 318 LLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVY 377
           +LP VWTYNML+ GFC +G+IDEAF +RDEME + + PDV+TYNTLI+GC +   SS  +
Sbjct: 316 VLPDVWTYNMLIGGFCKEGRIDEAFGLRDEMENLKLFPDVITYNTLINGCFECGSSSRAF 375

Query: 378 SLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAY 437
            LI+EM+ KGVK NA+T+NI++KW  K+G M +A  T+ KMEE+G SPD VTYNTLI AY
Sbjct: 376 GLIQEMEGKGVKPNAITHNILVKWYVKEGKMDDAGKTIRKMEEDGFSPDTVTYNTLINAY 435

Query: 438 CKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDE 497
           CKAGK+G+AFRMMDEM  KGLK+ + TLNTIL+ L  EKKLDEAY LL SAS+RGY +DE
Sbjct: 436 CKAGKLGEAFRMMDEMGRKGLKMSSVTLNTILYTLCEEKKLDEAYRLLSSASRRGYFVDE 495

Query: 498 VSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNE 557
           VSYG LI+GYFKD+   +AL LW EMKE+QI+PS ITYNS+IGGLCQ  K DQAIDKLNE
Sbjct: 496 VSYGTLIMGYFKDKNSTKALKLWCEMKEKQIIPSIITYNSMIGGLCQLGKTDQAIDKLNE 555

Query: 558 MLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGM 617
           +LE+GLVPDETTYN II+G+C E  +EKAFQF+N+M++N  KPD+YTCNILL  LC+EGM
Sbjct: 556 LLESGLVPDETTYNTIINGYCREREIEKAFQFYNKMVENSLKPDIYTCNILLFELCKEGM 615

Query: 618 LEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKV 677
           LEKALK FNT +SKGK ID VTYNTI+S LC+EG+FE A+DLL EM+ KKLGPD YTY  
Sbjct: 616 LEKALKFFNTWISKGKQIDAVTYNTILSGLCREGRFEEAFDLLEEMKGKKLGPDSYTYNG 675

Query: 678 IIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEHFDFKSIAYSDQI 737
           I+ AL DAGR+KEAEEF LK+VE G +  Q L L KG+NV  SE ++ +D  S A+S+QI
Sbjct: 676 ILGALADAGRVKEAEEFLLKIVEMGELQGQTLLLDKGENV-NSEKTQKYDLNSNAFSEQI 735

Query: 738 NELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSKAS 783
           NELC Q KYKDAM +F E +++G+ALNK  Y++LMEGL+KRRKS S+A+
Sbjct: 736 NELCAQGKYKDAMQIFQESSQKGIALNKSAYISLMEGLVKRRKSISRAT 780

BLAST of CSPI05G08820 vs. TrEMBL
Match: B9SD26_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1067950 PE=4 SV=1)

HSP 1 Score: 948.7 bits (2451), Expect = 4.4e-273
Identity = 474/786 (60.31%), Positives = 603/786 (76.72%), Query Frame = 1

Query: 1   METPSTSHQGPIQEENL--EQQLIQTITTILNATKPS-LSALAPYAAHLS--PSLISSIF 60
           ME+ +T  + P Q++N+  E QL++T+T IL +   + L  L PY AHLS  P+L+ S+ 
Sbjct: 1   MESKTT--ETPNQQKNVAEESQLLKTLTRILTSEDGNTLEKLKPYTAHLSAKPNLLISVL 60

Query: 61  ASKALSSHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISF 120
           +S +LS+ P+ LL+ FKW+Q H+   S  P      L++LL  L  H+ FSDAKSLL +F
Sbjct: 61  SSNSLSNKPNTLLSFFKWSQTHLSVTSLSP----LPLISLLSPLLSHHKFSDAKSLLTAF 120

Query: 121 ISSDRQHELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNL 180
           IS+DR H LH  +LH      +  + +LDTSIGAYV  ++PH A QIFN+MKRL+ +PNL
Sbjct: 121 ISADRTHLLHHHLLHSPFKKVQSLRVILDTSIGAYVACNRPHHAAQIFNRMKRLHLKPNL 180

Query: 181 LTCNTLMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDW 240
           LTCNTL+N+LVRYPS  S+ L++ +  D IKLGV  NTN+FNILIYG C+E+K+ +A+  
Sbjct: 181 LTCNTLINALVRYPSKPSVYLSKAIFSDVIKLGVKVNTNTFNILIYGCCIENKLSEAIGL 240

Query: 241 VNKMSEFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCR 300
           + KM +F C PD VSYNTILD L K+  L EARDLLLDMK+ GL PN++T+N+LV GYC+
Sbjct: 241 IGKMKDFSCFPDNVSYNTILDVLCKKGKLNEARDLLLDMKNNGLLPNRNTFNILVSGYCK 300

Query: 301 LGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVT 360
           LG LKEA +VI++M +NN+LP VWTYNML+ G C DGKIDEAFR++DEME + +LPDVVT
Sbjct: 301 LGWLKEAAQVIDLMAQNNVLPDVWTYNMLIGGLCKDGKIDEAFRLKDEMENLKLLPDVVT 360

Query: 361 YNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKME 420
           YNTLI+GC     S + + LI++M+ KGVK NAVTYN+++KW  K+G M  A   L KME
Sbjct: 361 YNTLINGCFDCSSSLKGFELIDKMEGKGVKPNAVTYNVVVKWYVKEGKMDNAGNELRKME 420

Query: 421 ENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLD 480
           E+G SPDCVT+NTLI  YCKAG++ +AFRMMDEM+ KGLK+++ TLNTILH L  E+KLD
Sbjct: 421 ESGFSPDCVTFNTLINGYCKAGRLSEAFRMMDEMSRKGLKMNSVTLNTILHTLCGERKLD 480

Query: 481 EAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVI 540
           +AY LL SASKRGY +DEVSYG LI+GYFKD K   A+ LWDEMKE++I+PS ITYN++I
Sbjct: 481 DAYKLLSSASKRGYFVDEVSYGTLIMGYFKDGKSVEAMKLWDEMKEKEIIPSIITYNTMI 540

Query: 541 GGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFK 600
           GGLC S K DQ+IDKLNE+LE+GLVPDETTYN II G+C EG VEKAFQFHN+M+K  FK
Sbjct: 541 GGLCHSGKTDQSIDKLNELLESGLVPDETTYNTIILGYCREGQVEKAFQFHNKMVKKSFK 600

Query: 601 PDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDL 660
           PD++TCNILLRGLC EGML+KALKLFNT +SKGK ID VTYNTIIS LCKE +FE A+DL
Sbjct: 601 PDLFTCNILLRGLCTEGMLDKALKLFNTWISKGKAIDAVTYNTIISGLCKEDRFEEAFDL 660

Query: 661 LTEMEAKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLT 720
           L EME KKLGPD YTY  I++AL DAGR+KEAEEF  ++VE G + DQ + L K +   +
Sbjct: 661 LAEMEEKKLGPDCYTYNAILSALADAGRMKEAEEFMSRIVEQGKLQDQTISLNKRKIESS 720

Query: 721 SEVSEHFDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRR 780
           SE S+  D  S+ +S+QINELC Q KYKDAMH+  E T++G+ L+K TY++LMEGLIKRR
Sbjct: 721 SETSQESDPNSVTFSEQINELCTQGKYKDAMHMVQESTQKGITLHKSTYISLMEGLIKRR 780

Query: 781 KSTSKA 782
           KS S++
Sbjct: 781 KSISRS 780

BLAST of CSPI05G08820 vs. TrEMBL
Match: A0A067LFW0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06674 PE=4 SV=1)

HSP 1 Score: 946.8 bits (2446), Expect = 1.7e-272
Identity = 478/769 (62.16%), Positives = 591/769 (76.85%), Query Frame = 1

Query: 18  EQQLIQTITTILNATKPSLSALAPYAAHLSPS--LISSIFASKALSSHPSVLLNVFKWAQ 77
           EQQL+QT+T IL + K  L  L PY ++ S +  L+ S+ +SK+LS  P+ LL+ FKWAQ
Sbjct: 8   EQQLLQTLTKILTSDKFPLQTLNPYISNFSSNSNLLISLLSSKSLSHRPTALLSFFKWAQ 67

Query: 78  KHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQHELHKLILHPTRDL 137
            H+P   S   +S   LL+LL  L  H+ FSDAKSLL SFI++D+ + LH+ ILHP   +
Sbjct: 68  SHLPPSIS---HSPLPLLSLLSPLLSHHKFSDAKSLLTSFIAADKSNILHQHILHPPAVV 127

Query: 138 PE--PSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSS 197
            +    K LLDTSIGAYV   +PH A QIFNKMKRL   PNLLTCNTL+N+LVR+PS  S
Sbjct: 128 EKRRTMKALLDTSIGAYVASGRPHHAAQIFNKMKRLRLTPNLLTCNTLLNALVRHPSKHS 187

Query: 198 ILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNT 257
           + L++ +  D IKLGV  NTN+FNILIYG C+E+K+ +A+  V KM EFGC+PD VSYNT
Sbjct: 188 VYLSKAIFNDVIKLGVKVNTNTFNILIYGCCMENKLGEAIALVGKMEEFGCLPDNVSYNT 247

Query: 258 ILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNN 317
           ILD L K+  L +ARDLLLD K+KGL PN++T+N+LVCGYC LG LKEA  V E+M +NN
Sbjct: 248 ILDLLCKKGKLNDARDLLLDRKNKGLRPNRNTFNILVCGYCTLGWLKEAAHVSELMAQNN 307

Query: 318 LLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVY 377
           +LP VWTYNML+ G C +G+IDEAFR+RDEM+ + +LPDVVTYNTLI+GC +   SS   
Sbjct: 308 VLPDVWTYNMLIAGLCKEGRIDEAFRLRDEMDNLKLLPDVVTYNTLINGCFECGSSSRAL 367

Query: 378 SLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAY 437
            LI+EM +KGVK N+VT+N ++KW  K G M +A  T+ K+ E+G SPD VTYNTLI  Y
Sbjct: 368 GLIQEMAEKGVKPNSVTHNTLVKWYVKDGKMDDAAKTIRKLGESGFSPDSVTYNTLINGY 427

Query: 438 CKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDE 497
           CKAGK+G+AFR+MDEM  KGLK+D+ TLNTILH L  EKKLDEAY LL S++KRGY +DE
Sbjct: 428 CKAGKLGEAFRIMDEMGRKGLKMDSVTLNTILHTLCGEKKLDEAYELLNSSTKRGYFIDE 487

Query: 498 VSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNE 557
           VSYG LI+GYFK+E   +AL LW EMKE++I+PS ITYNS+I GLCQS + DQAIDKLNE
Sbjct: 488 VSYGTLIMGYFKEENSVKALKLWCEMKEKEIIPSIITYNSMIKGLCQSGQTDQAIDKLNE 547

Query: 558 MLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGM 617
           +LE+GLVPD TTYN IIHG+C EG V+KAFQFHN+M++N FKPDV+TCNILLRGLCREGM
Sbjct: 548 LLESGLVPDGTTYNTIIHGYCYEGKVDKAFQFHNKMVENSFKPDVFTCNILLRGLCREGM 607

Query: 618 LEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKV 677
           LEKALKLFNT +SKGK ID VTYNTIIS+LCK+G+FE A+DLL EM+ KKLGPD YTY  
Sbjct: 608 LEKALKLFNTWISKGKQIDAVTYNTIISNLCKQGRFEEAFDLLEEMKEKKLGPDCYTYNA 667

Query: 678 IIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEHFDFKSIAYSDQI 737
           I+ AL DAG++KEAEEF  K+ E G + DQ+L L KG+NV   E  +  D  SI +S QI
Sbjct: 668 ILGALADAGKVKEAEEFMSKIAEMGQLKDQDLPLDKGKNV-NCETPQGSDPNSIVFSQQI 727

Query: 738 NELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSKAS 783
           N+LC Q KYKDAM +F E +++G+ LNK  Y+NLMEGLIKRRKS SK S
Sbjct: 728 NDLCTQGKYKDAMQIFQESSQKGITLNKSAYINLMEGLIKRRKSISKDS 772

BLAST of CSPI05G08820 vs. TrEMBL
Match: W9R773_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_009849 PE=4 SV=1)

HSP 1 Score: 946.4 bits (2445), Expect = 2.2e-272
Identity = 465/761 (61.10%), Positives = 588/761 (77.27%), Query Frame = 1

Query: 18  EQQLIQTITTILNATKPS-LSALAPYAAHLSPSLISSIFASKALSSHPSVLLNVFKWAQK 77
           + QL+QTIT IL ++KP  L AL PY   ++ SL+ SI +S+ LSS P+ L++ FKW Q 
Sbjct: 5   QPQLLQTITNILTSSKPPPLHALTPYIPQITDSLLISILSSEPLSSEPTTLISFFKWLQS 64

Query: 78  HVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQHELHKLILHPTRDLP 137
           H P  +  PN     LL LLPSL     FSDAK LL+SFI+SDRQ+ LH+ +LHPTR+LP
Sbjct: 65  HTPPLTQSPN----PLLALLPSLLCRNNFSDAKFLLVSFIASDRQNHLHQALLHPTRNLP 124

Query: 138 EPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILL 197
            PSK LLD SI +YV   +PHLA QIF  MKR    P+L+TCNTL+N+LVR PS+S+I +
Sbjct: 125 RPSKVLLDISIASYVDSGKPHLAAQIFGMMKRHGLCPSLITCNTLLNALVRLPSTSAISM 184

Query: 198 ARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILD 257
           ++++ KD + LG+ P+T++FNILI GYC E+K +DAL  +  M EFGC PD +SYNTILD
Sbjct: 185 SKRIFKDMVGLGIRPSTSTFNILIRGYCNENKFEDALALLTSMREFGCFPDNLSYNTILD 244

Query: 258 ALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLP 317
           AL K+R L EAR LLLDMK++G+  N++TYN+LVCGYC+LG LKEA K+IE+M +N+LLP
Sbjct: 245 ALCKKRQLAEARKLLLDMKNQGVMLNRNTYNILVCGYCKLGWLKEAGKIIELMKQNSLLP 304

Query: 318 TVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLI 377
            VWTYNML+ GFC +GKI+EAF +RDEM  + +LPDV+TYNTL+DGC +WR S+E + LI
Sbjct: 305 DVWTYNMLIGGFCKEGKIEEAFGLRDEMGSLKLLPDVITYNTLVDGCFKWRSSTEAFGLI 364

Query: 378 EEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKA 437
            EM KKGVK NA+T+NI+ KW  K+G M +A  T+ KMEE+G   DCVTYNTLI  YCKA
Sbjct: 365 AEMFKKGVKPNAITHNILAKWFSKEGKMDKACDTVRKMEESGHLSDCVTYNTLINGYCKA 424

Query: 438 GKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSY 497
           GKM +AF MMD M  KGLK+D  TLN +LH L  EKKLDEAY LL SA +RGYI+DEVS+
Sbjct: 425 GKMAEAFGMMDMMGRKGLKMDACTLNIVLHTLCGEKKLDEAYKLLNSAIRRGYIVDEVSF 484

Query: 498 GILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLE 557
           G L++GY K+EK DRAL LWDEMKE+ ++PS +TYN +IGGLCQ  K DQA DKLNE+LE
Sbjct: 485 GTLMMGYIKNEKVDRALELWDEMKEKHVIPSIVTYNGIIGGLCQFGKTDQAKDKLNELLE 544

Query: 558 NGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEK 617
            GLVPDE T+N II+G+CLEG VEKAFQF+N M++ LFKPDV+TCNILL GLC+ GMLEK
Sbjct: 545 CGLVPDEITFNTIINGYCLEGEVEKAFQFYNTMVEKLFKPDVFTCNILLNGLCKGGMLEK 604

Query: 618 ALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIA 677
           ALKLFNT +SKGKDIDVVTYNT+IS LCKEG+FE A+DLL  ME KKL PDQYTY  I +
Sbjct: 605 ALKLFNTWISKGKDIDVVTYNTLISGLCKEGRFEEAFDLLANMEKKKLVPDQYTYNPIRS 664

Query: 678 ALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEHFDFKSIAYSDQINEL 737
            L DAG++++A+EF  K++ESG + +Q L++ KGQ+V+T E+S   D  S AYS++IN+L
Sbjct: 665 RLIDAGKVEQAQEFLSKVIESGKLPEQFLEMAKGQDVVTHEISVESDSVSAAYSERINQL 724

Query: 738 CNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKS 778
           C + KYKDA+H F E  ++G+ LNK  Y  LM+GLI+RRKS
Sbjct: 725 CIEGKYKDALHAFGESKQKGIVLNKSVYAKLMDGLIRRRKS 761

BLAST of CSPI05G08820 vs. TAIR10
Match: AT2G16880.1 (AT2G16880.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 701.8 bits (1810), Expect = 4.7e-202
Identity = 361/732 (49.32%), Positives = 509/732 (69.54%), Query Frame = 1

Query: 18  EQQLIQTITTILNATKPS-LSALAPYAAHLSPSLISSIFASKALSSHPSVLLNVFKWAQK 77
           E QL++T+T+IL + K   L  L PY   ++  L++S+ +S +L+  P  L++ F+WAQ 
Sbjct: 8   ESQLLKTLTSILTSEKTHFLETLNPYIPQITQPLLTSLLSSPSLAKKPETLVSFFQWAQT 67

Query: 78  HVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFI-SSDRQHELHKLILHPTRDL 137
            +P   + P++S   L++++ SL  H+ F+DAKSLL+S+I +SD    L   +LHP   L
Sbjct: 68  SIPE--AFPSDSPLPLISVVRSLLSHHKFADAKSLLVSYIRTSDASLSLCNSLLHPNLHL 127

Query: 138 -PEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSI 197
            P PSK L D ++ AY+   +PH+A QIF KM RL  +PNLLTCNTL+  LVRYPSS SI
Sbjct: 128 SPPPSKALFDIALSAYLHEGKPHVALQIFQKMIRLKLKPNLLTCNTLLIGLVRYPSSFSI 187

Query: 198 LLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKM-SEFGCVPDTVSYNT 257
             AR+V  D +K+GV  N  +FN+L+ GYCLE K++DAL  + +M SEF   PD V+YNT
Sbjct: 188 SSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKLEDALGMLERMVSEFKVNPDNVTYNT 247

Query: 258 ILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNN 317
           IL A+ K+  L + ++LLLDMK  GL PN+ TYN LV GYC+LG LKEA +++E+M + N
Sbjct: 248 ILKAMSKKGRLSDLKELLLDMKKNGLVPNRVTYNNLVYGYCKLGSLKEAFQIVELMKQTN 307

Query: 318 LLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVY 377
           +LP + TYN+L+NG CN G + E   + D M+ + + PDVVTYNTLIDGC +   S E  
Sbjct: 308 VLPDLCTYNILINGLCNAGSMREGLELMDAMKSLKLQPDVVTYNTLIDGCFELGLSLEAR 367

Query: 378 SLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEE-NGLSPDCVTYNTLIGA 437
            L+E+M+  GVK N VT+NI LKW+CK+      T  + ++ + +G SPD VTY+TLI A
Sbjct: 368 KLMEQMENDGVKANQVTHNISLKWLCKEEKREAVTRKVKELVDMHGFSPDIVTYHTLIKA 427

Query: 438 YCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILD 497
           Y K G +  A  MM EM  KG+K++T TLNTIL  L  E+KLDEA+NLL SA KRG+I+D
Sbjct: 428 YLKVGDLSGALEMMREMGQKGIKMNTITLNTILDALCKERKLDEAHNLLNSAHKRGFIVD 487

Query: 498 EVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLN 557
           EV+YG LI+G+F++EK ++AL +WDEMK+ +I P+  T+NS+IGGLC   K + A++K +
Sbjct: 488 EVTYGTLIMGFFREEKVEKALEMWDEMKKVKITPTVSTFNSLIGGLCHHGKTELAMEKFD 547

Query: 558 EMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREG 617
           E+ E+GL+PD++T+N II G+C EG VEKAF+F+NE IK+ FKPD YTCNILL GLC+EG
Sbjct: 548 ELAESGLLPDDSTFNSIILGYCKEGRVEKAFEFYNESIKHSFKPDNYTCNILLNGLCKEG 607

Query: 618 MLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYK 677
           M EKAL  FNTL+ + +++D VTYNT+IS+ CK+ K + AYDLL+EME K L PD++TY 
Sbjct: 608 MTEKALNFFNTLIEE-REVDTVTYNTMISAFCKDKKLKEAYDLLSEMEEKGLEPDRFTYN 667

Query: 678 VIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEHFDFKSIAYSDQ 737
             I+ L + G++ E +E   K         ++L++   +N  TSE  E  + ++IAYSD 
Sbjct: 668 SFISLLMEDGKLSETDELLKKFSGKFGSMKRDLQVETEKNPATSESKEELNTEAIAYSDV 727

Query: 738 INELCNQHKYKD 745
           I+ELC++ + K+
Sbjct: 728 IDELCSRGRLKE 736

BLAST of CSPI05G08820 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 308.9 bits (790), Expect = 8.9e-84
Identity = 176/564 (31.21%), Positives = 301/564 (53.37%), Query Frame = 1

Query: 216 FNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLK-RRLLQEARDLLLDM 275
           F++++  Y   S +  AL  V+     G +P  +SYN +LDA ++ +R +  A ++  +M
Sbjct: 137 FDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEM 196

Query: 276 KSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKI 335
               +SPN  TYN+L+ G+C  G +  A  + + M     LP V TYN L++G+C   KI
Sbjct: 197 LESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKI 256

Query: 336 DEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNII 395
           D+ F++   M    + P++++YN +I+G  +     EV  ++ EM+++G   + VTYN +
Sbjct: 257 DDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTL 316

Query: 396 LKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGL 455
           +K  CK+GN  +A     +M  +GL+P  +TY +LI + CKAG M +A   +D+M  +GL
Sbjct: 317 IKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGL 376

Query: 456 KIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALN 515
             +  T  T++   S +  ++EAY +L   +  G+    V+Y  LI G+    K + A+ 
Sbjct: 377 CPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIA 436

Query: 516 LWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFC 575
           + ++MKE+ + P  ++Y++V+ G C+S  VD+A+    EM+E G+ PD  TY+ +I GFC
Sbjct: 437 VLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFC 496

Query: 576 LEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVV 635
            +   ++A   + EM++    PD +T   L+   C EG LEKAL+L N +V KG   DVV
Sbjct: 497 EQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVV 556

Query: 636 TYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKM 695
           TY+ +I+ L K+ +   A  LL ++  ++  P   TY  +         I+       K 
Sbjct: 557 TYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTL---------IENCSNIEFKS 616

Query: 696 VESGIVHDQNLK--LGKGQNVLTSEVSEHFDFKSIAYSDQINELCNQHKYKDAMHLFVEV 755
           V S ++    +K  + +   V  S + ++      AY+  I+  C     + A  L+ E+
Sbjct: 617 VVS-LIKGFCMKGMMTEADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEM 676

Query: 756 TKEGVALNKYTYLNLMEGLIKRRK 777
            K G  L+  T + L++ L K  K
Sbjct: 677 VKSGFLLHTVTVIALVKALHKEGK 690

BLAST of CSPI05G08820 vs. TAIR10
Match: AT1G63130.1 (AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 304.3 bits (778), Expect = 2.2e-82
Identity = 164/531 (30.89%), Positives = 293/531 (55.18%), Query Frame = 1

Query: 159 ATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNI 218
           A  +F  M +    P+++  + L++++ +      ++   + +++   LG+  N  +++I
Sbjct: 65  AVNLFGDMVKSRPFPSIVEFSKLLSAIAKMNKFDLVISLGEQMQN---LGISHNLYTYSI 124

Query: 219 LIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKG 278
           LI  +C  S++  AL  + KM + G  PD V+ N++L+       + +A  L+  M   G
Sbjct: 125 LINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLNSLLNGFCHGNRISDAVSLVGQMVEMG 184

Query: 279 LSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAF 338
             P+  T+N L+ G  R     EA  +++ M      P + TY ++VNG C  G ID A 
Sbjct: 185 YQPDSFTFNTLIHGLFRHNRASEAVALVDRMVVKGCQPDLVTYGIVVNGLCKRGDIDLAL 244

Query: 339 RIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWM 398
            +  +ME+  + P VV YNT+ID    +++ ++  +L  EMD KG++ N VTYN +++ +
Sbjct: 245 SLLKKMEQGKIEPGVVIYNTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCL 304

Query: 399 CKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDT 458
           C  G  ++A+  L  M E  ++P+ VT++ LI A+ K GK+ +A ++ DEM  + +  D 
Sbjct: 305 CNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDI 364

Query: 459 WTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDE 518
           +T +++++   +  +LDEA ++      +    + V+Y  LI G+ K ++ D  + L+ E
Sbjct: 365 FTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVDEGMELFRE 424

Query: 519 MKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGN 578
           M +R ++ +T+TY ++I G  Q+R+ D A     +M+ +G++PD  TY+I++ G C  G 
Sbjct: 425 MSQRGLVGNTVTYTTLIHGFFQARECDNAQIVFKQMVSDGVLPDIMTYSILLDGLCNNGK 484

Query: 579 VEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNT 638
           VE A      + ++  +PD+YT NI++ G+C+ G +E    LF +L  KG   +VVTY T
Sbjct: 485 VETALVVFEYLQRSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVVTYTT 544

Query: 639 IISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIAA-LTDAGRIKEAE 689
           ++S  C++G  E A  L  EM+ +   PD  TY  +I A L D  +   AE
Sbjct: 545 MMSGFCRKGLKEEADALFREMKEEGPLPDSGTYNTLIRAHLRDGDKAASAE 592

BLAST of CSPI05G08820 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 298.9 bits (764), Expect = 9.2e-81
Identity = 204/700 (29.14%), Positives = 349/700 (49.86%), Query Frame = 1

Query: 29  LNATKPS--LSALAPYAAHLSPSLISSIFASKALSSHP--SVLLNVFKWAQKHVPSFSSP 88
           LN T PS  +S  +P++A LS + +  +    +L S P  S  L +F  A K  P+FS  
Sbjct: 27  LNLTPPSSTISFASPHSAALSSTDVKLL---DSLRSQPDDSAALRLFNLASKK-PNFSPE 86

Query: 89  PNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQHELHKLILHPTRDLPEPSKELLD 148
           P    +    +L  L R   F D K +L    SS  +      ++               
Sbjct: 87  P----ALYEEILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLI--------------- 146

Query: 149 TSIGAYVQMD-QPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILLARQVLKD 208
             I +Y Q + Q  + + +   +     +P+    N ++N LV    +S  L+     K 
Sbjct: 147 -LIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLV--DGNSLKLVEISHAKM 206

Query: 209 SIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLKRRL 268
           S+  G+ P+ ++FN+LI   C   +++ A+  +  M  +G VPD  ++ T++   ++   
Sbjct: 207 SV-WGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGD 266

Query: 269 LQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMT-RNNLLPTVWTYN 328
           L  A  +   M   G S +  + N++V G+C+ G +++A   I+ M+ ++   P  +T+N
Sbjct: 267 LDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFN 326

Query: 329 MLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKK 388
            LVNG C  G +  A  I D M +    PDV TYN++I G  +  +  E   ++++M  +
Sbjct: 327 TLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITR 386

Query: 389 GVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKA 448
               N VTYN ++  +CK+  + EAT     +   G+ PD  T+N+LI   C       A
Sbjct: 387 DCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVA 446

Query: 449 FRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILG 508
             + +EM SKG + D +T N ++  L  + KLDEA N+L      G     ++Y  LI G
Sbjct: 447 MELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDG 506

Query: 509 YFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPD 568
           + K  K   A  ++DEM+   +  +++TYN++I GLC+SR+V+ A   +++M+  G  PD
Sbjct: 507 FCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPD 566

Query: 569 ETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFN 628
           + TYN ++  FC  G+++KA      M  N  +PD+ T   L+ GLC+ G +E A KL  
Sbjct: 567 KYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLR 626

Query: 629 TLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEM-EAKKLGPDQYTYKVIIAALTD- 688
           ++  KG ++    YN +I  L ++ K   A +L  EM E  +  PD  +Y+++   L + 
Sbjct: 627 SIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNG 686

Query: 689 AGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSE 721
            G I+EA +F ++++E G V + +      + +LT  + E
Sbjct: 687 GGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEE 699

BLAST of CSPI05G08820 vs. TAIR10
Match: AT1G62930.1 (AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 296.2 bits (757), Expect = 5.9e-80
Identity = 166/544 (30.51%), Positives = 292/544 (53.68%), Query Frame = 1

Query: 159 ATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNI 218
           A  +F +M +    P+++  N L++++ +      ++   + +++   L +  +  S+NI
Sbjct: 64  AVDLFGEMVQSRPLPSIVEFNKLLSAIAKMNKFDLVISLGERMQN---LRISYDLYSYNI 123

Query: 219 LIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKG 278
           LI  +C  S++  AL  + KM + G  PD V+ +++L+     + + EA  L+  M    
Sbjct: 124 LINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRISEAVALVDQMFVME 183

Query: 279 LSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAF 338
             PN  T+N L+ G        EA  +I+ M      P ++TY  +VNG C  G ID A 
Sbjct: 184 YQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLAL 243

Query: 339 RIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWM 398
            +  +MEK  +  DVV Y T+ID    +++ ++  +L  EMD KG++ N VTYN +++ +
Sbjct: 244 SLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCL 303

Query: 399 CKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDT 458
           C  G  ++A+  L  M E  ++P+ VT++ LI A+ K GK+ +A ++ DEM  + +  D 
Sbjct: 304 CNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDI 363

Query: 459 WTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDE 518
           +T +++++   +  +LDEA ++      +    + V+Y  LI G+ K ++ +  + L+ E
Sbjct: 364 FTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGMELFRE 423

Query: 519 MKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGN 578
           M +R ++ +T+TYN++I GL Q+   D A     +M+ +G+ PD  TY+I++ G C  G 
Sbjct: 424 MSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGK 483

Query: 579 VEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNT 638
           +EKA      + K+  +PD+YT NI++ G+C+ G +E    LF +L  KG   +V+ Y T
Sbjct: 484 LEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTT 543

Query: 639 IISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESG 698
           +IS  C++G  E A  L  EM+     P+  TY  +I A    G    + E   +M   G
Sbjct: 544 MISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRARLRDGDKAASAELIKEMRSCG 603

Query: 699 IVHD 703
            V D
Sbjct: 604 FVGD 604

BLAST of CSPI05G08820 vs. NCBI nr
Match: gi|449451888|ref|XP_004143692.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g16880 [Cucumis sativus])

HSP 1 Score: 1562.4 bits (4044), Expect = 0.0e+00
Identity = 781/783 (99.74%), Positives = 782/783 (99.87%), Query Frame = 1

Query: 1   METPSTSHQGPIQEENLEQQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKAL 60
           METPSTSHQGPIQEENLEQQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKAL
Sbjct: 1   METPSTSHQGPIQEENLEQQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKAL 60

Query: 61  SSHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDR 120
           SSHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDR
Sbjct: 61  SSHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDR 120

Query: 121 QHELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNT 180
           QHELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNT
Sbjct: 121 QHELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNT 180

Query: 181 LMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMS 240
           LMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMS
Sbjct: 181 LMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMS 240

Query: 241 EFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLK 300
           EFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLK
Sbjct: 241 EFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLK 300

Query: 301 EATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLI 360
           EATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLI
Sbjct: 301 EATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLI 360

Query: 361 DGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLS 420
           DGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLS
Sbjct: 361 DGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLS 420

Query: 421 PDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNL 480
           PDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCL VEKKLDEAYNL
Sbjct: 421 PDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLCVEKKLDEAYNL 480

Query: 481 LCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQ 540
           LCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQ
Sbjct: 481 LCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQ 540

Query: 541 SRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYT 600
           SRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMI+NLFKPDVYT
Sbjct: 541 SRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIENLFKPDVYT 600

Query: 601 CNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEME 660
           CNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEME
Sbjct: 601 CNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEME 660

Query: 661 AKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSE 720
           AKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSE
Sbjct: 661 AKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSE 720

Query: 721 HFDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSK 780
           HFDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSK
Sbjct: 721 HFDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSK 780

Query: 781 ASR 784
           ASR
Sbjct: 781 ASR 783

BLAST of CSPI05G08820 vs. NCBI nr
Match: gi|659090138|ref|XP_008445857.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g16880 [Cucumis melo])

HSP 1 Score: 1417.9 bits (3669), Expect = 0.0e+00
Identity = 711/783 (90.80%), Positives = 741/783 (94.64%), Query Frame = 1

Query: 1   METPSTSHQGPIQEENLEQQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKAL 60
           M+TPSTS++GP Q+EN EQQLIQTITTIL++TKPS SALAPYAAHLSPSLISSIFAS+AL
Sbjct: 1   MKTPSTSYKGPTQQENQEQQLIQTITTILSSTKPSFSALAPYAAHLSPSLISSIFASEAL 60

Query: 61  SSHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDR 120
           SS PSVL++VFKWAQKHVPSFSSPP NSLSSLLTLLPSLFRHYMF DAKSLLISFISSDR
Sbjct: 61  SSRPSVLIHVFKWAQKHVPSFSSPPINSLSSLLTLLPSLFRHYMFYDAKSLLISFISSDR 120

Query: 121 QHELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNT 180
           QHELHKLILHPTRDLPEPSK L+DTSIGAYVQM QPHLATQIFNKMKRLNYRPNLLTC T
Sbjct: 121 QHELHKLILHPTRDLPEPSKALMDTSIGAYVQMRQPHLATQIFNKMKRLNYRPNLLTCKT 180

Query: 181 LMNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMS 240
           LMNSLVRYPSSSSILLARQV KDSIKLGVVP+TNS NILIYGYCLESKVKDALD VNKMS
Sbjct: 181 LMNSLVRYPSSSSILLARQVFKDSIKLGVVPDTNSVNILIYGYCLESKVKDALDLVNKMS 240

Query: 241 EFGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLK 300
           EFGCVPDTVSYNTILDAL K+RLL EARDLLLDM +KGL PNK TYN+LV GYCRLGLLK
Sbjct: 241 EFGCVPDTVSYNTILDALFKKRLLHEARDLLLDMTNKGLLPNKRTYNILVWGYCRLGLLK 300

Query: 301 EATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLI 360
           EATKVIEIMT  NLLP +WTYN+L+NGFCNDGKIDEAFR+RDEMEKM VLPDVVTYNTLI
Sbjct: 301 EATKVIEIMTHKNLLPNIWTYNILINGFCNDGKIDEAFRLRDEMEKMKVLPDVVTYNTLI 360

Query: 361 DGCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLS 420
           DGCS+ R SSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKK NMTEATTTL KMEENGLS
Sbjct: 361 DGCSERRGSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKENMTEATTTLQKMEENGLS 420

Query: 421 PDCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNL 480
           PDCVTYNTLI  YCKAGKMG+AFRMMDEM SKGLKIDTWTLNTILH L VEKKLDEAYNL
Sbjct: 421 PDCVTYNTLIAGYCKAGKMGEAFRMMDEMISKGLKIDTWTLNTILHSLCVEKKLDEAYNL 480

Query: 481 LCSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQ 540
           LCSASKRGYILDEVSYGILI+G+FKDEKGDRALNLWDEMKERQI+PSTITYNSVI GLCQ
Sbjct: 481 LCSASKRGYILDEVSYGILIMGHFKDEKGDRALNLWDEMKERQIIPSTITYNSVIRGLCQ 540

Query: 541 SRKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYT 600
           S K DQA DKLNEMLENG+VPDETTYNIIIHG+CLEGNVEKAFQFHN+MI+NLFKPDVYT
Sbjct: 541 STKTDQATDKLNEMLENGIVPDETTYNIIIHGYCLEGNVEKAFQFHNKMIENLFKPDVYT 600

Query: 601 CNILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEME 660
            NILLRGLCREGMLEKALKLFNT VS GK +DVVTYNTIISSLCKEGKFENAYDLLTEME
Sbjct: 601 RNILLRGLCREGMLEKALKLFNTWVSDGKGVDVVTYNTIISSLCKEGKFENAYDLLTEME 660

Query: 661 AKKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSE 720
           AKKLGPDQYTYK IIAALTDAGRI+EAEEF LKMVESGIVHDQNLKLGKGQNVLTSEVSE
Sbjct: 661 AKKLGPDQYTYKTIIAALTDAGRIEEAEEFILKMVESGIVHDQNLKLGKGQNVLTSEVSE 720

Query: 721 HFDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSK 780
           HFD KSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYL+LMEGLIKRRKSTSK
Sbjct: 721 HFDSKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLSLMEGLIKRRKSTSK 780

Query: 781 ASR 784
           ASR
Sbjct: 781 ASR 783

BLAST of CSPI05G08820 vs. NCBI nr
Match: gi|731412654|ref|XP_010658442.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g16880 [Vitis vinifera])

HSP 1 Score: 1019.2 bits (2634), Expect = 3.8e-294
Identity = 500/782 (63.94%), Positives = 622/782 (79.54%), Query Frame = 1

Query: 3   TPSTSHQGPIQEENLE-QQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKALS 62
           TP  S   P+    L  Q+LIQTITTIL +    L AL  Y  HL+P L+ SI +SK L 
Sbjct: 4   TPPESSPPPLPPAPLPPQELIQTITTILASNNMPLQALNTYIPHLTPPLVLSILSSKTLI 63

Query: 63  SHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQ 122
           S P++L++ FKWAQ ++P+F   P+NSL SLL+LLPSLF H  FSDAKSLL+ FI++DR+
Sbjct: 64  SRPNILISFFKWAQTNLPTF---PHNSLPSLLSLLPSLFSHRKFSDAKSLLLGFIATDRR 123

Query: 123 HELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTL 182
           H+LH  IL     L  PSK LLDT+IGAYVQ  QPH A QIF KMKRL  RPNLLTCNTL
Sbjct: 124 HDLHLSILR----LTSPSKALLDTAIGAYVQSGQPHHAFQIFKKMKRLRLRPNLLTCNTL 183

Query: 183 MNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSE 242
           +NSLVRYPSS S+  +R+   D+IKLG+VPN N+FNI+IYGYCLE+K KDA++++N M +
Sbjct: 184 LNSLVRYPSSHSVSFSREAFNDAIKLGIVPNVNTFNIVIYGYCLENKFKDAVEFLNVMGK 243

Query: 243 FGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKE 302
           + C PD V+YNTILD L K+  L +ARDLL+DMKS+GL PN++TYN+LV GYC++G LKE
Sbjct: 244 YNCSPDNVTYNTILDTLCKKGRLGDARDLLMDMKSRGLLPNRNTYNILVYGYCKMGWLKE 303

Query: 303 ATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLID 362
           A  VIE+MT+NNLLP VWTYNML+NG CN+G+I+EAF++RDEME + +LPDVV+YNTLI+
Sbjct: 304 AANVIELMTQNNLLPDVWTYNMLINGLCNEGRIEEAFKLRDEMENLKLLPDVVSYNTLIN 363

Query: 363 GCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSP 422
           GC +W   SE + L+EEM +KGVK NAVT+NI++KW CK+G M +A+ T+ KMEE+G SP
Sbjct: 364 GCLEWSKISEAFKLLEEMSEKGVKPNAVTHNIMVKWYCKEGKMDDASNTITKMEESGFSP 423

Query: 423 DCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLL 482
           DCVTYNTLI  YCKAG MG+AFR MDEM  K +K+D+ TLNTIL  L  EKKL+EAY LL
Sbjct: 424 DCVTYNTLINGYCKAGNMGEAFRTMDEMGRKNMKMDSVTLNTILRTLCREKKLEEAYKLL 483

Query: 483 CSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQS 542
            SA KRGY +DEVSYG LI+GYFKD   DRAL LWDEMKE++I+PST+TYN +IGGLCQ 
Sbjct: 484 SSARKRGYFIDEVSYGTLIVGYFKDGNVDRALKLWDEMKEKEIIPSTVTYNCIIGGLCQC 543

Query: 543 RKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTC 602
            K +QAI KLNE+LE+GL+PDETTYN I+HG+C EG+VEKAFQFHN+M++N FKPDV+TC
Sbjct: 544 GKTEQAISKLNELLESGLLPDETTYNTILHGYCREGDVEKAFQFHNKMVENSFKPDVFTC 603

Query: 603 NILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEA 662
           NILLRGLC EG+LEKALKLFNT VSKGK ID VTYNT+I+SLCKEG+ ++A++LL+EME 
Sbjct: 604 NILLRGLCMEGVLEKALKLFNTWVSKGKAIDTVTYNTLITSLCKEGRLDDAFNLLSEMEE 663

Query: 663 KKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEH 722
           K+LGPD YTY  II ALTD+GRI+EAEEF  KM+E G + DQ L+L K + V+TSE SE 
Sbjct: 664 KELGPDHYTYNAIITALTDSGRIREAEEFMSKMLEKGNLPDQVLQLDKNETVVTSETSEE 723

Query: 723 FDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSKA 782
            D  S+AYS+ I ELC + KYKDAM +F E  ++G+ ++K TY+NLM+GLIKRRKS SK 
Sbjct: 724 SDSSSVAYSEWIKELCTEGKYKDAMRIFGESKQKGITVDKSTYINLMDGLIKRRKSISKE 778

Query: 783 SR 784
           +R
Sbjct: 784 AR 778

BLAST of CSPI05G08820 vs. NCBI nr
Match: gi|147819144|emb|CAN78081.1| (hypothetical protein VITISV_021300 [Vitis vinifera])

HSP 1 Score: 1015.0 bits (2623), Expect = 7.1e-293
Identity = 499/782 (63.81%), Positives = 620/782 (79.28%), Query Frame = 1

Query: 3   TPSTSHQGPIQEENLE-QQLIQTITTILNATKPSLSALAPYAAHLSPSLISSIFASKALS 62
           TP  S   P+    L  Q+LIQTITTIL +    L AL  Y   L+P L+ SI +SK L 
Sbjct: 4   TPPESSPPPLPPAPLPPQELIQTITTILASNNMPLQALNTYIPQLTPPLVLSILSSKTLI 63

Query: 63  SHPSVLLNVFKWAQKHVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQ 122
           S P++L++ FKWAQ ++P+F   P+NSL SLL+LLPSLF H  FSDAKSLL+ FI++DR+
Sbjct: 64  SRPNILISFFKWAQTNLPTF---PHNSLPSLLSLLPSLFSHRKFSDAKSLLLGFIATDRR 123

Query: 123 HELHKLILHPTRDLPEPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTL 182
           H+LH  IL     L  PSK LLDT+IGAYVQ  QPH A QIF KMKRL  RPNLLTCNTL
Sbjct: 124 HDLHLSILR----LTSPSKALLDTAIGAYVQSGQPHHAFQIFKKMKRLRLRPNLLTCNTL 183

Query: 183 MNSLVRYPSSSSILLARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSE 242
           +NSLVRYPSS S+  +R+   D+IKLG+VPN N+FNI+IYGYCLE+K KDA++++N M +
Sbjct: 184 LNSLVRYPSSHSVSFSREAFNDAIKLGIVPNVNTFNIVIYGYCLENKFKDAVEFLNVMGK 243

Query: 243 FGCVPDTVSYNTILDALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKE 302
           + C PD V+YNTILDAL K+  L +ARDLL+DMKS+GL PN++TYN+LV GYC++G LKE
Sbjct: 244 YNCSPDNVTYNTILDALCKKGRLGDARDLLMDMKSRGLLPNRNTYNILVYGYCKMGWLKE 303

Query: 303 ATKVIEIMTRNNLLPTVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLID 362
           A  VIE+MT+NNLLP VWTYNML+NG CN+G+I+EAF++RDEME + +LPDVV+YNTLI+
Sbjct: 304 AANVIELMTQNNLLPDVWTYNMLINGLCNEGRIEEAFKLRDEMENLKLLPDVVSYNTLIN 363

Query: 363 GCSQWRDSSEVYSLIEEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSP 422
           GC +W   SE + L+EEM +KGVK NAVT+NI++KW CK+G M +A+ T+ KMEE+G SP
Sbjct: 364 GCLEWSKISEAFKLLEEMSEKGVKPNAVTHNIMVKWYCKEGKMDDASNTITKMEESGFSP 423

Query: 423 DCVTYNTLIGAYCKAGKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLL 482
           DCVTYNTLI  YCKAG MG+AFR MDEM  K +K+D+ TLNTIL  L  EKKL+EAY LL
Sbjct: 424 DCVTYNTLINGYCKAGNMGEAFRTMDEMGRKNMKMDSVTLNTILRTLCREKKLEEAYKLL 483

Query: 483 CSASKRGYILDEVSYGILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQS 542
            SA KRGY +DEVSYG LI+GYFKD   DRAL LWDEMKE++I+PST+TYN +IGGLCQ 
Sbjct: 484 SSARKRGYFIDEVSYGTLIVGYFKDGNVDRALKLWDEMKEKEIIPSTVTYNCIIGGLCQC 543

Query: 543 RKVDQAIDKLNEMLENGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTC 602
            K +QAI KLNE+LE+GL+PDETTYN I+HG+C EG+VEKAFQFHN+M++N FKPDV+TC
Sbjct: 544 GKTEQAISKLNELLESGLLPDETTYNTILHGYCREGDVEKAFQFHNKMVENSFKPDVFTC 603

Query: 603 NILLRGLCREGMLEKALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEA 662
           NILLRGLC EGMLEKALKLFNT VSKGK ID VTYNT+I+SLCKEG+ ++A++LL+EME 
Sbjct: 604 NILLRGLCMEGMLEKALKLFNTWVSKGKAIDTVTYNTLITSLCKEGRLDDAFNLLSEMEE 663

Query: 663 KKLGPDQYTYKVIIAALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEH 722
           K+LGPD YTY  II ALTD+GRI+EAEEF  KM+E G +  Q L+L   + V+TSE SE 
Sbjct: 664 KELGPDHYTYNAIITALTDSGRIREAEEFMSKMLEKGXLPXQVLQLDXNETVVTSETSEE 723

Query: 723 FDFKSIAYSDQINELCNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSKA 782
            D  S+AYS+ I ELC + KYKDAM +F E  ++G+ ++K TY+NLM+GLIKRRKS SK 
Sbjct: 724 SDSSSVAYSEWIKELCTEGKYKDAMRIFGESKQKGITVDKSTYINLMDGLIKRRKSISKE 778

Query: 783 SR 784
           +R
Sbjct: 784 AR 778

BLAST of CSPI05G08820 vs. NCBI nr
Match: gi|1009145677|ref|XP_015890461.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g16880 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 979.2 bits (2530), Expect = 4.3e-282
Identity = 475/764 (62.17%), Positives = 607/764 (79.45%), Query Frame = 1

Query: 18  EQQLIQTITTILNATK-PSLSALAPYAAHLSPSLISSIFASKALSSHPSVLLNVFKWAQK 77
           E  L+Q+I TIL ++K P L +L  +  HL+  L+ SI +SK L+S+P+ LL+ +KW+Q 
Sbjct: 13  ESHLLQSIITILTSSKNPPLHSLNTFIPHLNQPLLLSILSSKTLASNPATLLSFYKWSQS 72

Query: 78  HVPSFSSPPNNSLSSLLTLLPSLFRHYMFSDAKSLLISFISSDRQHELHKLILHPTRDLP 137
           H PS +  P      LLTLLP LF H  FSDAKSLL+SFI+SDRQ+ LH+LILHP R LP
Sbjct: 73  HTPSLTQFPQ----PLLTLLPVLFSHNKFSDAKSLLVSFIASDRQNHLHRLILHPQRVLP 132

Query: 138 EPSKELLDTSIGAYVQMDQPHLATQIFNKMKRLNYRPNLLTCNTLMNSLVRYPSSSSILL 197
            PSK LLDTSIGAYV   +PHLA +IFNKMKR  ++PNLLTCNTL+N+LVRYPSS SI L
Sbjct: 133 RPSKALLDTSIGAYVHSGKPHLAAEIFNKMKRYCFKPNLLTCNTLLNALVRYPSSHSISL 192

Query: 198 ARQVLKDSIKLGVVPNTNSFNILIYGYCLESKVKDALDWVNKMSEFGCVPDTVSYNTILD 257
           ++ V +D I+LGV PNTN+FNILI+GYCLE++ KD  + + +MSEFGC+PD VSYNTILD
Sbjct: 193 SKGVFEDVIRLGVSPNTNTFNILIHGYCLENRFKDGFELLRRMSEFGCLPDNVSYNTILD 252

Query: 258 ALLKRRLLQEARDLLLDMKSKGLSPNKHTYNMLVCGYCRLGLLKEATKVIEIMTRNNLLP 317
            L K+  L EAR+LL DMK++GL PN++TYN+LV GYC+LG LKEA +VIE+M +N + P
Sbjct: 253 GLCKKGQLVEARELLSDMKNRGLVPNRNTYNILVSGYCKLGWLKEAAQVIELMVQNKVFP 312

Query: 318 TVWTYNMLVNGFCNDGKIDEAFRIRDEMEKMNVLPDVVTYNTLIDGCSQWRDSSEVYSLI 377
            +WTYNML+NG CN+GKI+EA R+R EM+   +  DVVTYNTLI+GC +WR S+E   LI
Sbjct: 313 DIWTYNMLINGLCNEGKIEEALRLRCEMKNWKLSEDVVTYNTLINGCFEWRRSAEALRLI 372

Query: 378 EEMDKKGVKCNAVTYNIILKWMCKKGNMTEATTTLDKMEENGLSPDCVTYNTLIGAYCKA 437
           +EMD+KGVK NAVT+NI+LKW+CK+G M EA+  + KMEE+G  PDCVTYNTLI AYCK+
Sbjct: 373 DEMDEKGVKSNAVTHNIMLKWLCKEGKMDEASHNVRKMEESGFYPDCVTYNTLIDAYCKS 432

Query: 438 GKMGKAFRMMDEMTSKGLKIDTWTLNTILHCLSVEKKLDEAYNLLCSASKRGYILDEVSY 497
           GKM +AFRMMDEM  KGLK+D +TLNT+LH L  EKKLDEAY LL SA KRGY+LDEVS+
Sbjct: 433 GKMAEAFRMMDEMNRKGLKMDNFTLNTVLHTLCGEKKLDEAYELLNSAKKRGYMLDEVSF 492

Query: 498 GILILGYFKDEKGDRALNLWDEMKERQIMPSTITYNSVIGGLCQSRKVDQAIDKLNEMLE 557
           G L++GYFK+EK D AL LWDEMKE +++P+ +TYNS+IGGLCQ  K ++AI KLNE+LE
Sbjct: 493 GTLMMGYFKNEKVDEALKLWDEMKESKVIPTVVTYNSIIGGLCQYGKTEEAICKLNELLE 552

Query: 558 NGLVPDETTYNIIIHGFCLEGNVEKAFQFHNEMIKNLFKPDVYTCNILLRGLCREGMLEK 617
           +GLVP+ETTYN IIHG+C EG++ KA QFHN+M++  FKPDV+T NILL+GLC EG LEK
Sbjct: 553 SGLVPNETTYNTIIHGYCREGDIGKAIQFHNKMVEKAFKPDVFTSNILLKGLCSEGKLEK 612

Query: 618 ALKLFNTLVSKGKDIDVVTYNTIISSLCKEGKFENAYDLLTEMEAKKLGPDQYTYKVIIA 677
           ALKLFN+ +S+GKD+D VTYNT+ISSLCKE +FE+A+DLL +ME K L PDQ+TY  I +
Sbjct: 613 ALKLFNSCLSRGKDVDAVTYNTLISSLCKERRFEDAFDLLADMEDKNLQPDQFTYHAIFS 672

Query: 678 ALTDAGRIKEAEEFTLKMVESGIVHDQNLKLGKGQNVLTSEVSEHFDFKSIAYSDQINEL 737
            L DAGR++EAE F  K VE G + DQ+L++ K Q+V+T + +E +D  SIAYS+QINEL
Sbjct: 673 NLADAGRVEEAEAFATKNVELGKLSDQSLQMAKVQDVVTHKSTEEYDSSSIAYSEQINEL 732

Query: 738 CNQHKYKDAMHLFVEVTKEGVALNKYTYLNLMEGLIKRRKSTSK 781
           C + KYKDA+H+F E T +G+ L++  Y++LM GLIKRRK+ S+
Sbjct: 733 CLEGKYKDALHIFKESTLKGIVLSRTVYISLMNGLIKRRKAISR 772
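
The percentages in an HSP summary line are the matched positions divided by the alignment length. The minimal sketch below recomputes them for the Ziziphus jujuba hit above; `hsp_fractions` is a hypothetical helper written for this illustration and is not part of any BLAST tool.

```python
import re

# HSP summary line for the Ziziphus jujuba hit, copied from the report above.
HSP_LINE = "Identity = 475/764 (62.17%), Positives = 607/764 (79.45%), Query Frame = 1"


def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'x/y' field.

    Hypothetical helper: only fields written as 'label = x/y' are picked up,
    so 'Query Frame = 1' is ignored.
    """
    result = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        result[label] = (matches, length, round(100 * matches / length, 2))
    return result


fractions = hsp_fractions(HSP_LINE)
for label, (matches, length, pct) in fractions.items():
    print(f"{label}: {matches}/{length} = {pct}%")
```

The recomputed values agree with the percentages printed in the report, which is a quick way to check that a summary line was copied without damage.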

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
PP156_ARATH   8.3e-201  49.32         Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana GN...
PP407_ARATH   1.6e-82   31.21         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PPR99_ARATH   3.9e-81   30.89         Pentatricopeptide repeat-containing protein At1g63130, mitochondrial OS=Arabidop...
PP281_ARATH   1.6e-79   29.14         Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
PPR96_ARATH   1.1e-78   30.51         Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop...

Match Name         E-value   Identity (%)  Description
A5AMQ4_VITVI       4.9e-293  63.81         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_021300 PE=4 SV=1
A0A067JIV6_JATCU   9.4e-276  62.16         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01815 PE=4 SV=1
B9SD26_RICCO       4.4e-273  60.31         Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A067LFW0_JATCU   1.7e-272  62.16         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06674 PE=4 SV=1
W9R773_9ROSA       2.2e-272  61.10         Uncharacterized protein OS=Morus notabilis GN=L484_009849 PE=4 SV=1

Match Name    E-value   Identity (%)  Description
AT2G16880.1   4.7e-202  49.32         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1   8.9e-84   31.21         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G63130.1   2.2e-82   30.89         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53700.1   9.2e-81   29.14         Pentatricopeptide repeat (PPR) superfamily protein
AT1G62930.1   5.9e-80   30.51         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                          E-value   Identity (%)  Description
gi|449451888|ref|XP_004143692.1|    0.0e+00   99.74         PREDICTED: pentatricopeptide repeat-containing protein At2g16880 [Cucumis sativu...
gi|659090138|ref|XP_008445857.1|    0.0e+00   90.80         PREDICTED: pentatricopeptide repeat-containing protein At2g16880 [Cucumis melo]
gi|731412654|ref|XP_010658442.1|    3.8e-294  63.94         PREDICTED: pentatricopeptide repeat-containing protein At2g16880 [Vitis vinifera...
gi|147819144|emb|CAN78081.1|        7.1e-293  63.81         hypothetical protein VITISV_021300 [Vitis vinifera]
gi|1009145677|ref|XP_015890461.1|   4.3e-282  62.17         PREDICTED: pentatricopeptide repeat-containing protein At2g16880 isoform X2 [Ziz...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI05G08820.1  CSPI05G08820.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                        Source    Source Term       Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 460..488  score:
IPR002885  Pentatricopeptide repeat               PFAM      PF12854           PPR_1
    coord: 382..415  score: 5.5E-8
    coord: 417..450  score: 2.7E-15
    coord: 208..239  score: 2.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 561..610  score: 9.8E-18
    coord: 492..540  score: 9.4E-14
    coord: 246..295  score: 9.6E-17
    coord: 316..364  score: 4.6E-17
    coord: 632..678  score: 9.8
IPR002885  Pentatricopeptide repeat               PFAM      PF13812           PPR_3
    coord: 148..184  score: 0.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 669..702  score: 8.7E-4
    coord: 424..458  score: 1.1E-10
    coord: 284..317  score: 3.8E-6
    coord: 494..527  score: 1.7E-4
    coord: 634..667  score: 4.3E-9
    coord: 565..598  score: 1.7E-8
    coord: 529..563  score: 3.7E-9
    coord: 320..353  score: 7.0E-9
    coord: 389..423  score: 1.3E-8
    coord: 249..282  score: 1.9E-7
    coord: 215..248  score: 8.6E-9
    coord: 354..388  score: 6.5E-8
    coord: 599..628  score: 6.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 139..173  score: 7.837
    coord: 282..316  score: 11.904
    coord: 725..759  score: 7.706
    coord: 387..421  score: 12.057
    coord: 527..561  score: 12.364
    coord: 317..351  score: 12.726
    coord: 212..246  score: 11.027
    coord: 760..783  score: 5.294
    coord: 422..456  score: 13.263
    coord: 492..526  score: 10.972
    coord: 352..386  score: 11.312
    coord: 174..211  score: 6.851
    coord: 632..666  score: 13.143
    coord: 457..491  score: 8.396
    coord: 247..281  score: 12.025
    coord: 562..596  score: 12.419
    coord: 597..631  score: 11.685
    coord: 667..701  score: 9
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 398..485  score: 1.1E-7
    coord: 530..695  score: 1.
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 59..90    score: 3.5E-253
    coord: 217..766  score: 3.5E-253
    coord: 126..178  score: 3.5E
None       No IPR available                       PANTHER   PTHR24015:SF871   SUBFAMILY NOT NAMED
    coord: 217..766  score: 3.5E-253
    coord: 59..90    score: 3.5E-253
    coord: 126..178  score: 3.5E
None       No IPR available                       unknown   SSF81901          HCP-like
    coord: 470..657  score: 3.
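
Each alignment entry in the InterPro table gives a `coord: start..end` range on the protein. The sketch below shows one way to turn such ranges into repeat lengths, using three PROFILE PS51375 hits from the table as sample input. It assumes the coordinates are 1-based and inclusive, which the page itself does not state, and `repeat_spans` is a hypothetical helper written for this illustration.

```python
import re

# Three PROFILE PS51375 (PPR) hits copied from the InterPro table above.
SAMPLE = """
coord: 139..173 score: 7.837
coord: 282..316 score: 11.904
coord: 725..759 score: 7.706
"""


def repeat_spans(text):
    """Return (start, end, length) for every 'coord: start..end' entry.

    Assumption: coordinates are 1-based and inclusive, so the length is
    end - start + 1. Only the 'coord:' fields are read, so the layout of
    the surrounding score text does not matter.
    """
    spans = []
    for start, end in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text):
        s, e = int(start), int(end)
        spans.append((s, e, e - s + 1))
    return spans


spans = repeat_spans(SAMPLE)
for start, end, length in spans:
    print(f"{start}..{end}: {length} residues")
```

Under the inclusive-coordinate assumption, all three sample hits span 35 residues, which matches the expected length of a single pentatricopeptide repeat.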

The following gene(s) are paralogous to this gene:

None