CSPI01G04250 (gene) Wild cucumber (PI 183967)

Name: CSPI01G04250
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein
Location: Chr1 : 2647399 .. 2650788 (+)
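The location above encodes a chromosome, a start and end coordinate, and a strand. The following is a minimal sketch of how such a string could be parsed and the span length derived. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the function name `parse_location` is hypothetical rather than part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr1 : 2647399 .. 2650788 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 2647399 .. 2650788 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr1 + 3390
```

Under the inclusive-coordinate assumption the feature spans 3390 bases on the plus strand of Chr1.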
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGCAAGCGCGCGGGAACTGCCGTTTCAGCCTGTTTATTGGCAGCTCTTCACGATCGATTCAAACTGAATCCATCACCAATAAGCTTCGAGCTTCTTCAACTTCTTCTCCGTCGAAGAAAACATGGACCCAAAAACTCGAATCCAAAAACTCTGACTCCACAATTGTGGATTCCGATATAGTCAAATGGAACAGGAAAATCAGCGCCTACATGCGCAAAGGCCAATGCGAGTCCGCTCTAAGTGTTTTCAATGGTATGCGTCGTCGGAGTACTGTCACCTACAACGCTATGATATCTGGGTATTTGAGTAACAATAAATTTGACTGTGCACGGAAGGTGTTTGAGAAAATGCCCGATAGAGATTTGATATCTTGGAATGTTATGCTTAGTGGTTATGTGAAGAATGGTAATCTTAGTGCGGCTCGGGCTTTGTTTAATCAGATCCCTGAGAAGGATGTTGTTTCTTGGAATGCTATGTTATCTGGGTTTGCTCAGAATGGGTTTGTGGAGGAGGCTAGGAAGATATTTGATCAAATGTTGGTTAAGAATGAAATTTCGTGGAATGGGTTACTGTCCGCGTATGTTCAGAATGGGAGGATTGAGGATGCTAGAAGATTGTTTGATTCGAAAATGGATTGGGAGATCGTTTCTTGGAATTGTTTGATGGGTGGGTATGTAAGGAAAAAGAGGCTAGATGATGCAAGGAGTCTTTTTGACCGTATGCCAGTTAGAGATAAAATCTCTTGGAATATAATGATTACAGGTTATGCTCAGAATGGGCTGCTTTCAGAAGCTCGGAGATTGTTTGAAGAGTTACCAATTCGAGATGTGTTTGCTTGGACGGCCATGGTTTCTGGTTTTGTGCAGAATGGGATGTTGGATGAAGCAACGAGAATTTTTGAAGAAATGCCTGAGAAGAATGAAGTTTCGTGGAATGCAATGATTGCAGGTTATGTGCAGAGCCAGCAAATAGAAAAGGCAAGGGAATTATTTGATCAAATGCCTTCCCGGAATACCAGTTCTTGGAATACAATGGTAACTGGGTATGCTCAGTGTGGCAATATTGATCAAGCCAAGATTTTGTTTGATGAGATGCCTCAACGTGATTGTATATCATGGGCGGCAATGATTTCTGGCTATGCCCAAAGCGGCCAGAGTGAAGAGGCTTTGCACCTTTTTATTAAGATGAAAAGAGATGGGGGAATTTTGAATAGATCTGCGTTAGCATGTGCTTTGAGCTCGTGTGCAGAGATTGCTGCTTTGGAGTTAGGGAAGCAACTACATGGACGACTAGTTAAGGCAGGATTCCAAACTGGGTATATTGCTGGAAATGCACTTCTAGCCATGTATGGCAAATGCGGAAGTATAGAAGAGGCATTTGATGTATTTGAAGATATAACAGAGAAGGATATTGTCTCCTGGAACACAATGATTGCAGGTTATGCAAGGCATGGGTTTGGTAAAGAGGCTTTAGCTCTTTTCGAGTCAATGAAGATGACAATCAAACCTGATGATGTCACTCTGGTATGGAAGTTCTCTATTGACTTTTTTTTTTTTTTTCTAATTTGATCTCCATGTTGTAACTTTTAACAGATAGAGTTGCAGTTACATCAAATTACTAATTTATTTCTGGATAGCAGGAGGGAAAGCTAGGATAGAAAAGAAAGAAAATAATAGACATTAATGTAGCAATTTAATTAGTAATCTTATGAAGGTTCCTGGAAGATGATGAAACTTAATTTTAAAAGCTAAGGGAGGAAACTAAGTCTTTTAGTCTGCGTGAATCAACCTGTTAAGCATGTCTTTCACAAATTTGTTTTTGCAGCTACTTGATTTCCATAGCTTGTTGATTCTCTACCATAGCTACGATCTTACATAATTCTTGACCTTGGCTGAATAAAATACTAAAACTGCATCAAATGAATTGTACTCTCAATATTTTAGCTAAAAAAATCCTAGGAAATGCATGATATTCTTTTTCTTCTCATCTTTTTACATCT
AACTTTCTGGTTTTTATCTCATTTCTACCTATTTTGTCGAACATCCTAGGTCGGTGTTCTATCTGCTTGCAGCCATACTGGCTTGGTAGATAAAGGCATGGAATATTTTAACTCAATGTATCAGAACTACGGCATAACAGCAAATGCCAAACATTACACTTGCATGATCGATCTACTTGGTCGTGCGGGTCGGCTAGATGAGGCTCTGAATTTAATGAAGAGCATGCCATTTTACCCAGACGCAGCAACTTGGGGTGCTTTACTCGGAGCCAGCCGAATTCATGGTGACACAGAATTAGGTGAGAAGGCTGCTGAGAAGGTATTTGAGATGGAGCCTGATAATTCAGGGATGTACGTTCTTCTCTCAAATCTATACGCAGCTTCAGGGAGATGGAGGGAAGTTCGTGAGATGAGATCGAAAATGAGAGACAAAGGAGTGAAAAAAGTTCCTGGATATAGTTGGGTTGGAATACAAAACAAGACTCACATCTTCACTGTTGGAGATTGTTCACATCCAGAAGCAGAAAGGATATATGCTTATTTAGAAGAGTTGGATTTAGAATTGAAGAAGGATGGGTTTGTTTCTTCTACTAAATTGGTGTTACATGACGTGGAAGAGGAGGAGAAGGAACACATGCTCAAGTATCATAGTGAAAAATTAGCGGTTGCATTTGGGATTTTATCTATACCACCTGGGAGACCAATCCGAGTAATTAAAAACTTGAGGGTATGTGAAGACTGTCACAATGCCATCAAACACATATCCAAAATTACGCAAAGACAAATAATCGTGAGGGATTCTAATCGCTTTCACCACTTCAGCGAAGGTTCATGTTCTTGTGGAGATTATTGGTGATATTTCTTATCGTCATCTTCCTTTGGCGGATATGAACATTCAATTAAGATGATTAGAAATTGATGAATTGGAAAATCATTAAACCGTAAGCCGCTATCCAGAAGAGGCTTCATAGGTGAGGTGAACTTGGATCAAGATTTTTTTTTTTGAGTCCTTCGGATGGAACGTTCAATCTCTGAATGTTAGAGTCACAAAAAAGTTTTGATCCTTGGAACCTTTTCAGAAATATGAGAATTTCAAAGAACTAAATATTTGGAGGTAAACAAAAATCGAGATGAAGGTTTGGTAATAGTACTTTGACAAGGACAACTGTAGTAATGGTGTCAAATCCTGAAGCTGCTGCTCTTGGAGGATACCACTACTGCTAGATTATGCAACTGCTACATTGGTGATTCGGAAGACGCAGGCATATTGTTCATATTTGAGAATGTGATTGTGTTCTTTGTAAGTATATAACTTGGGTGCCTTACACCTAGACTTGCTGCAGCAGCACAACAATTTGACGAACTTGACGAACCAACATTTTATGGGAAG

mRNA sequence

ATGCAAGCGCGCGGGAACTGCCGTTTCAGCCTGTTTATTGGCAGCTCTTCACGATCGATTCAAACTGAATCCATCACCAATAAGCTTCGAGCTTCTTCAACTTCTTCTCCGTCGAAGAAAACATGGACCCAAAAACTCGAATCCAAAAACTCTGACTCCACAATTGTGGATTCCGATATAGTCAAATGGAACAGGAAAATCAGCGCCTACATGCGCAAAGGCCAATGCGAGTCCGCTCTAAGTGTTTTCAATGGTATGCGTCGTCGGAGTACTGTCACCTACAACGCTATGATATCTGGGTATTTGAGTAACAATAAATTTGACTGTGCACGGAAGGTGTTTGAGAAAATGCCCGATAGAGATTTGATATCTTGGAATGTTATGCTTAGTGGTTATGTGAAGAATGGTAATCTTAGTGCGGCTCGGGCTTTGTTTAATCAGATCCCTGAGAAGGATGTTGTTTCTTGGAATGCTATGTTATCTGGGTTTGCTCAGAATGGGTTTGTGGAGGAGGCTAGGAAGATATTTGATCAAATGTTGGTTAAGAATGAAATTTCGTGGAATGGGTTACTGTCCGCGTATGTTCAGAATGGGAGGATTGAGGATGCTAGAAGATTGTTTGATTCGAAAATGGATTGGGAGATCGTTTCTTGGAATTGTTTGATGGGTGGGTATGTAAGGAAAAAGAGGCTAGATGATGCAAGGAGTCTTTTTGACCGTATGCCAGTTAGAGATAAAATCTCTTGGAATATAATGATTACAGGTTATGCTCAGAATGGGCTGCTTTCAGAAGCTCGGAGATTGTTTGAAGAGTTACCAATTCGAGATGTGTTTGCTTGGACGGCCATGGTTTCTGGTTTTGTGCAGAATGGGATGTTGGATGAAGCAACGAGAATTTTTGAAGAAATGCCTGAGAAGAATGAAGTTTCGTGGAATGCAATGATTGCAGGTTATGTGCAGAGCCAGCAAATAGAAAAGGCAAGGGAATTATTTGATCAAATGCCTTCCCGGAATACCAGTTCTTGGAATACAATGGTAACTGGGTATGCTCAGTGTGGCAATATTGATCAAGCCAAGATTTTGTTTGATGAGATGCCTCAACGTGATTGTATATCATGGGCGGCAATGATTTCTGGCTATGCCCAAAGCGGCCAGAGTGAAGAGGCTTTGCACCTTTTTATTAAGATGAAAAGAGATGGGGGAATTTTGAATAGATCTGCGTTAGCATGTGCTTTGAGCTCGTGTGCAGAGATTGCTGCTTTGGAGTTAGGGAAGCAACTACATGGACGACTAGTTAAGGCAGGATTCCAAACTGGGTATATTGCTGGAAATGCACTTCTAGCCATGTATGGCAAATGCGGAAGTATAGAAGAGGCATTTGATGTATTTGAAGATATAACAGAGAAGGATATTGTCTCCTGGAACACAATGATTGCAGGTTATGCAAGGCATGGGTTTGGTAAAGAGGCTTTAGCTCTTTTCGAGTCAATGAAGATGACAATCAAACCTGATGATGTCACTCTGGTCGGTGTTCTATCTGCTTGCAGCCATACTGGCTTGGTAGATAAAGGCATGGAATATTTTAACTCAATGTATCAGAACTACGGCATAACAGCAAATGCCAAACATTACACTTGCATGATCGATCTACTTGGTCGTGCGGGTCGGCTAGATGAGGCTCTGAATTTAATGAAGAGCATGCCATTTTACCCAGACGCAGCAACTTGGGGTGCTTTACTCGGAGCCAGCCGAATTCATGGTGACACAGAATTAGGTGAGAAGGCTGCTGAGAAGGTATTTGAGATGGAGCCTGATAATTCAGGGATGTACGTTCTTCTCTCAAATCTATACGCAGCTTCAGGGAGATGGAGGGAAGTTCGTGAGATGAGATCGAAAATGAGAGACAAAGGAGTGAAAAAAGTTCCTGGATATAGTTGGGTTGGAATACAAAACAAGACTCACATCTTCACTGTTGGAGATTGTTCACATCCAGAAGCAGA
AAGGATATATGCTTATTTAGAAGAGTTGGATTTAGAATTGAAGAAGGATGGGTTTGTTTCTTCTACTAAATTGGTGTTACATGACGTGGAAGAGGAGGAGAAGGAACACATGCTCAAGTATCATAGTGAAAAATTAGCGGTTGCATTTGGGATTTTATCTATACCACCTGGGAGACCAATCCGAGTAATTAAAAACTTGAGGGTATGTGAAGACTGTCACAATGCCATCAAACACATATCCAAAATTACGCAAAGACAAATAATCGTGAGGGATTCTAATCGCTTTCACCACTTCAGCGAAGGTTCATGTTCTTGTGGAGATTATTGGTGA

Coding sequence (CDS)

ATGCAAGCGCGCGGGAACTGCCGTTTCAGCCTGTTTATTGGCAGCTCTTCACGATCGATTCAAACTGAATCCATCACCAATAAGCTTCGAGCTTCTTCAACTTCTTCTCCGTCGAAGAAAACATGGACCCAAAAACTCGAATCCAAAAACTCTGACTCCACAATTGTGGATTCCGATATAGTCAAATGGAACAGGAAAATCAGCGCCTACATGCGCAAAGGCCAATGCGAGTCCGCTCTAAGTGTTTTCAATGGTATGCGTCGTCGGAGTACTGTCACCTACAACGCTATGATATCTGGGTATTTGAGTAACAATAAATTTGACTGTGCACGGAAGGTGTTTGAGAAAATGCCCGATAGAGATTTGATATCTTGGAATGTTATGCTTAGTGGTTATGTGAAGAATGGTAATCTTAGTGCGGCTCGGGCTTTGTTTAATCAGATCCCTGAGAAGGATGTTGTTTCTTGGAATGCTATGTTATCTGGGTTTGCTCAGAATGGGTTTGTGGAGGAGGCTAGGAAGATATTTGATCAAATGTTGGTTAAGAATGAAATTTCGTGGAATGGGTTACTGTCCGCGTATGTTCAGAATGGGAGGATTGAGGATGCTAGAAGATTGTTTGATTCGAAAATGGATTGGGAGATCGTTTCTTGGAATTGTTTGATGGGTGGGTATGTAAGGAAAAAGAGGCTAGATGATGCAAGGAGTCTTTTTGACCGTATGCCAGTTAGAGATAAAATCTCTTGGAATATAATGATTACAGGTTATGCTCAGAATGGGCTGCTTTCAGAAGCTCGGAGATTGTTTGAAGAGTTACCAATTCGAGATGTGTTTGCTTGGACGGCCATGGTTTCTGGTTTTGTGCAGAATGGGATGTTGGATGAAGCAACGAGAATTTTTGAAGAAATGCCTGAGAAGAATGAAGTTTCGTGGAATGCAATGATTGCAGGTTATGTGCAGAGCCAGCAAATAGAAAAGGCAAGGGAATTATTTGATCAAATGCCTTCCCGGAATACCAGTTCTTGGAATACAATGGTAACTGGGTATGCTCAGTGTGGCAATATTGATCAAGCCAAGATTTTGTTTGATGAGATGCCTCAACGTGATTGTATATCATGGGCGGCAATGATTTCTGGCTATGCCCAAAGCGGCCAGAGTGAAGAGGCTTTGCACCTTTTTATTAAGATGAAAAGAGATGGGGGAATTTTGAATAGATCTGCGTTAGCATGTGCTTTGAGCTCGTGTGCAGAGATTGCTGCTTTGGAGTTAGGGAAGCAACTACATGGACGACTAGTTAAGGCAGGATTCCAAACTGGGTATATTGCTGGAAATGCACTTCTAGCCATGTATGGCAAATGCGGAAGTATAGAAGAGGCATTTGATGTATTTGAAGATATAACAGAGAAGGATATTGTCTCCTGGAACACAATGATTGCAGGTTATGCAAGGCATGGGTTTGGTAAAGAGGCTTTAGCTCTTTTCGAGTCAATGAAGATGACAATCAAACCTGATGATGTCACTCTGGTCGGTGTTCTATCTGCTTGCAGCCATACTGGCTTGGTAGATAAAGGCATGGAATATTTTAACTCAATGTATCAGAACTACGGCATAACAGCAAATGCCAAACATTACACTTGCATGATCGATCTACTTGGTCGTGCGGGTCGGCTAGATGAGGCTCTGAATTTAATGAAGAGCATGCCATTTTACCCAGACGCAGCAACTTGGGGTGCTTTACTCGGAGCCAGCCGAATTCATGGTGACACAGAATTAGGTGAGAAGGCTGCTGAGAAGGTATTTGAGATGGAGCCTGATAATTCAGGGATGTACGTTCTTCTCTCAAATCTATACGCAGCTTCAGGGAGATGGAGGGAAGTTCGTGAGATGAGATCGAAAATGAGAGACAAAGGAGTGAAAAAAGTTCCTGGATATAGTTGGGTTGGAATACAAAACAAGACTCACATCTTCACTGTTGGAGATTGTTCACATCCAGAAGCAGA
AAGGATATATGCTTATTTAGAAGAGTTGGATTTAGAATTGAAGAAGGATGGGTTTGTTTCTTCTACTAAATTGGTGTTACATGACGTGGAAGAGGAGGAGAAGGAACACATGCTCAAGTATCATAGTGAAAAATTAGCGGTTGCATTTGGGATTTTATCTATACCACCTGGGAGACCAATCCGAGTAATTAAAAACTTGAGGGTATGTGAAGACTGTCACAATGCCATCAAACACATATCCAAAATTACGCAAAGACAAATAATCGTGAGGGATTCTAATCGCTTTCACCACTTCAGCGAAGGTTCATGTTCTTGTGGAGATTATTGGTGA
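The coding sequence can be related to the protein used as the BLAST query by translating it codon by codon. The sketch below builds the standard genetic code table and translates two short fragments copied from the CDS above: its first 36 bases and its last 15 bases. It assumes the standard nuclear code applies and that the last 15 bases are in frame (that is, that the CDS length is a multiple of three); the helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

head = "ATGCAAGCGCGCGGGAACTGCCGTTTCAGCCTGTTT"  # first 36 bases of the CDS
tail = "GGAGATTATTGGTGA"                        # last 15 bases of the CDS

print(translate(head))  # MQARGNCRFSLF
print(translate(tail))  # GDYW*
```

The first fragment gives the same residues as the start of the query in the Cucumis sativus TrEMBL alignment further down, and the second ends in the GDYW motif followed by a stop codon.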
BLAST of CSPI01G04250 vs. Swiss-Prot
Match: PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 1055.4 bits (2728), Expect = 2.9e-307
Identity = 481/721 (66.71%), Positives = 605/721 (83.91%), Query Frame = 1

Query: 57  DSDIVKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEK 116
           DSDI +WN  IS+YMR G+C  AL VF  M R S+V+YN MISGYL N +F+ ARK+F++
Sbjct: 61  DSDIKEWNVAISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDE 120

Query: 117 MPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIF 176
           MP+RDL+SWNVM+ GYV+N NL  AR LF  +PE+DV SWN MLSG+AQNG V++AR +F
Sbjct: 121 MPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVF 180

Query: 177 DQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARS 236
           D+M  KN++SWN LLSAYVQN ++E+A  LF S+ +W +VSWNCL+GG+V+KK++ +AR 
Sbjct: 181 DRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQ 240

Query: 237 LFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEA 296
            FD M VRD +SWN +ITGYAQ+G + EAR+LF+E P++DVF WTAMVSG++QN M++EA
Sbjct: 241 FFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQDVFTWTAMVSGYIQNRMVEEA 300

Query: 297 TRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNID 356
             +F++MPE+NEVSWNAM+AGYVQ +++E A+ELFD MP RN S+WNTM+TGYAQCG I 
Sbjct: 301 RELFDKMPERNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKIS 360

Query: 357 QAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCA 416
           +AK LFD+MP+RD +SWAAMI+GY+QSG S EAL LF++M+R+GG LNRS+ + ALS+CA
Sbjct: 361 EAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCA 420

Query: 417 EIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNT 476
           ++ ALELGKQLHGRLVK G++TG   GNALL MY KCGSIEEA D+F+++  KDIVSWNT
Sbjct: 421 DVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNT 480

Query: 477 MIAGYARHGFGKEALALFESMKMT-IKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNY 536
           MIAGY+RHGFG+ AL  FESMK   +KPDD T+V VLSACSHTGLVDKG +YF +M Q+Y
Sbjct: 481 MIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDY 540

Query: 537 GITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKA 596
           G+  N++HY CM+DLLGRAG L++A NLMK+MPF PDAA WG LLGASR+HG+TEL E A
Sbjct: 541 GVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETA 600

Query: 597 AEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHI 656
           A+K+F MEP+NSGMYVLLSNLYA+SGRW +V ++R +MRDKGVKKVPGYSW+ IQNKTH 
Sbjct: 601 ADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHT 660

Query: 657 FTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVA 716
           F+VGD  HPE + I+A+LEELDL +KK G+VS T +VLHDVEEEEKE M++YHSE+LAVA
Sbjct: 661 FSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDVEEEEKERMVRYHSERLAVA 720

Query: 717 FGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDY 776
           +GI+ +  GRPIRVIKNLRVCEDCHNAIK++++IT R II+RD+NRFHHF +GSCSCGDY
Sbjct: 721 YGIMRVSSGRPIRVIKNLRVCEDCHNAIKYMARITGRLIILRDNNRFHHFKDGSCSCGDY 780
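Each hit's summary line reports match counts as a fraction of the alignment length, with the percentage in parentheses. As a rough sketch, the fractions can be extracted and the percentages recomputed as a consistency check. The regular expression below is an assumption about the `Label = matched/length (percent%)` layout seen in this report, and `parse_hsp_summary` is a hypothetical helper name.

```python
import re

def parse_hsp_summary(line):
    """Return {label: (matched, length, recomputed_pct, reported_pct)}."""
    stats = {}
    for m in re.finditer(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)", line):
        matched, length = int(m.group(2)), int(m.group(3))
        recomputed = round(100 * matched / length, 2)
        stats[m.group(1)] = (matched, length, recomputed, float(m.group(4)))
    return stats

line = "Identity = 481/721 (66.71%), Positives = 605/721 (83.91%), Query Frame = 1"
for label, (matched, length, recomputed, reported) in parse_hsp_summary(line).items():
    print(label, matched, length, recomputed, reported)
```

For the PP301_ARATH hit, both recomputed percentages agree with the reported ones to two decimal places.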

BLAST of CSPI01G04250 vs. Swiss-Prot
Match: PPR25_ARATH (Pentatricopeptide repeat-containing protein At1g09410 OS=Arabidopsis thaliana GN=PCMP-H18 PE=2 SV=2)

HSP 1 Score: 652.5 bits (1682), Expect = 5.7e-186
Identity = 308/687 (44.83%), Positives = 449/687 (65.36%), Query Frame = 1

Query: 93  TYNAMISGYLSNNKFDCARKVFEKMPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKD 152
           T N  I+      K   ARK+F+    + + SWN M++GY  N     AR LF+++P+++
Sbjct: 19  TANVRITHLSRIGKIHEARKLFDSCDSKSISSWNSMVAGYFANLMPRDARKLFDEMPDRN 78

Query: 153 VVSWNAMLSGFAQNGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMD 212
           ++SWN ++SG+ +NG ++EARK+FD M  +N +SW  L+  YV NG+++ A  LF    +
Sbjct: 79  IISWNGLVSGYMKNGEIDEARKVFDLMPERNVVSWTALVKGYVHNGKVDVAESLFWKMPE 138

Query: 213 WEIVSWNCLMGGYVRKKRLDDARSLFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEEL 272
              VSW  ++ G+++  R+DDA  L++ +P +D I+   MI G  + G + EAR +F+E+
Sbjct: 139 KNKVSWTVMLIGFLQDGRIDDACKLYEMIPDKDNIARTSMIHGLCKEGRVDEAREIFDEM 198

Query: 273 PIRDVFAWTAMVSGFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFD 332
             R V  WT MV+G+ QN  +D+A +IF+ MPEK EVSW +M+ GYVQ+ +IE A ELF+
Sbjct: 199 SERSVITWTTMVTGYGQNNRVDDARKIFDVMPEKTEVSWTSMLMGYVQNGRIEDAEELFE 258

Query: 333 QMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHL 392
            MP +   + N M++G  Q G I +A+ +FD M +R+  SW  +I  + ++G   EAL L
Sbjct: 259 VMPVKPVIACNAMISGLGQKGEIAKARRVFDSMKERNDASWQTVIKIHERNGFELEALDL 318

Query: 393 FIKMKRDGGILNRSALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGK 452
           FI M++ G       L   LS CA +A+L  GKQ+H +LV+  F       + L+ MY K
Sbjct: 319 FILMQKQGVRPTFPTLISILSVCASLASLHHGKQVHAQLVRCQFDVDVYVASVLMTMYIK 378

Query: 453 CGSIEEAFDVFEDITEKDIVSWNTMIAGYARHGFGKEALALFESMKM--TIKPDDVTLVG 512
           CG + ++  +F+    KDI+ WN++I+GYA HG G+EAL +F  M +  + KP++VT V 
Sbjct: 379 CGELVKSKLIFDRFPSKDIIMWNSIISGYASHGLGEEALKVFCEMPLSGSTKPNEVTFVA 438

Query: 513 VLSACSHTGLVDKGMEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFY 572
            LSACS+ G+V++G++ + SM   +G+     HY CM+D+LGRAGR +EA+ ++ SM   
Sbjct: 439 TLSACSYAGMVEEGLKIYESMESVFGVKPITAHYACMVDMLGRAGRFNEAMEMIDSMTVE 498

Query: 573 PDAATWGALLGASRIHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMR 632
           PDAA WG+LLGA R H   ++ E  A+K+ E+EP+NSG Y+LLSN+YA+ GRW +V E+R
Sbjct: 499 PDAAVWGSLLGACRTHSQLDVAEFCAKKLIEIEPENSGTYILLSNMYASQGRWADVAELR 558

Query: 633 SKMRDKGVKKVPGYSWVGIQNKTHIFTVGDC-SHPEAERIYAYLEELDLELKKDGFVSST 692
             M+ + V+K PG SW  ++NK H FT G   SHPE E I   L+ELD  L++ G+    
Sbjct: 559 KLMKTRLVRKSPGCSWTEVENKVHAFTRGGINSHPEQESILKILDELDGLLREAGYNPDC 618

Query: 693 KLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKI 752
              LHDV+EEEK + LKYHSE+LAVA+ +L +  G PIRV+KNLRVC DCH AIK ISK+
Sbjct: 619 SYALHDVDEEEKVNSLKYHSERLAVAYALLKLSEGIPIRVMKNLRVCSDCHTAIKIISKV 678

Query: 753 TQRQIIVRDSNRFHHFSEGSCSCGDYW 777
            +R+II+RD+NRFHHF  G CSC DYW
Sbjct: 679 KEREIILRDANRFHHFRNGECSCKDYW 705
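In each alignment block, the middle line shows a letter where query and subject residues are identical and a `+` where they differ but score as similar. The identity count can be reproduced by comparing the two aligned rows position by position. The sketch below does this for the final block of the PPR25_ARATH alignment above; it counts identities only, since counting positives would need the scoring matrix, which this page does not name. The function name `count_identities` is illustrative.

```python
def count_identities(query, subject):
    """Count aligned columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Gap characters never count as identities.
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

query = "TQRQIIVRDSNRFHHFSEGSCSCGDYW"
sbjct = "KEREIILRDANRFHHFRNGECSCKDYW"

n = count_identities(query, sbjct)
print(n, len(query))  # 18 27
```

The 18 identical positions correspond to the 18 letters on the middle line of that block.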

BLAST of CSPI01G04250 vs. Swiss-Prot
Match: PPR84_ARATH (Pentatricopeptide repeat-containing protein At1g56690, mitochondrial OS=Arabidopsis thaliana GN=PCMP-H69 PE=2 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 1.6e-180
Identity = 299/673 (44.43%), Positives = 435/673 (64.64%), Query Frame = 1

Query: 106 KFDCARKVFEKMPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQ 165
           K + ARK F+ +  + + SWN ++SGY  NG    AR LF+++ E++VVSWN ++SG+ +
Sbjct: 32  KINEARKFFDSLQFKAIGSWNSIVSGYFSNGLPKEARQLFDEMSERNVVSWNGLVSGYIK 91

Query: 166 NGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGY 225
           N  + EAR +F+ M  +N +SW  ++  Y+Q G + +A  LF    +   VSW  + GG 
Sbjct: 92  NRMIVEARNVFELMPERNVVSWTAMVKGYMQEGMVGEAESLFWRMPERNEVSWTVMFGGL 151

Query: 226 VRKKRLDDARSLFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVS 285
           +   R+D AR L+D MPV+D ++   MI G  + G + EAR +F+E+  R+V  WT M++
Sbjct: 152 IDDGRIDKARKLYDMMPVKDVVASTNMIGGLCREGRVDEARLIFDEMRERNVVTWTTMIT 211

Query: 286 GFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTM 345
           G+ QN  +D A ++FE MPEK EVSW +M+ GY  S +IE A E F+ MP +   + N M
Sbjct: 212 GYRQNNRVDVARKLFEVMPEKTEVSWTSMLLGYTLSGRIEDAEEFFEVMPMKPVIACNAM 271

Query: 346 VTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNR 405
           + G+ + G I +A+ +FD M  RD  +W  MI  Y + G   EAL LF +M++ G   + 
Sbjct: 272 IVGFGEVGEISKARRVFDLMEDRDNATWRGMIKAYERKGFELEALDLFAQMQKQGVRPSF 331

Query: 406 SALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFED 465
            +L   LS CA +A+L+ G+Q+H  LV+  F       + L+ MY KCG + +A  VF+ 
Sbjct: 332 PSLISILSVCATLASLQYGRQVHAHLVRCQFDDDVYVASVLMTMYVKCGELVKAKLVFDR 391

Query: 466 ITEKDIVSWNTMIAGYARHGFGKEALALFESMKMT-IKPDDVTLVGVLSACSHTGLVDKG 525
            + KDI+ WN++I+GYA HG G+EAL +F  M  +   P+ VTL+ +L+ACS+ G +++G
Sbjct: 392 FSSKDIIMWNSIISGYASHGLGEEALKIFHEMPSSGTMPNKVTLIAILTACSYAGKLEEG 451

Query: 526 MEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASR 585
           +E F SM   + +T   +HY+C +D+LGRAG++D+A+ L++SM   PDA  WGALLGA +
Sbjct: 452 LEIFESMESKFCVTPTVEHYSCTVDMLGRAGQVDKAMELIESMTIKPDATVWGALLGACK 511

Query: 586 IHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGY 645
            H   +L E AA+K+FE EPDN+G YVLLS++ A+  +W +V  +R  MR   V K PG 
Sbjct: 512 THSRLDLAEVAAKKLFENEPDNAGTYVLLSSINASRSKWGDVAVVRKNMRTNNVSKFPGC 571

Query: 646 SWVGIQNKTHIFTVGDC-SHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEH 705
           SW+ +  K H+FT G   +HPE   I   LE+ D  L++ G+      VLHDV+EEEK  
Sbjct: 572 SWIEVGKKVHMFTRGGIKNHPEQAMILMMLEKTDGLLREAGYSPDCSHVLHDVDEEEKVD 631

Query: 706 MLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFH 765
            L  HSE+LAVA+G+L +P G PIRV+KNLRVC DCH AIK ISK+T+R+II+RD+NRFH
Sbjct: 632 SLSRHSERLAVAYGLLKLPEGVPIRVMKNLRVCGDCHAAIKLISKVTEREIILRDANRFH 691

Query: 766 HFSEGSCSCGDYW 777
           HF+ G CSC DYW
Sbjct: 692 HFNNGECSCRDYW 704

BLAST of CSPI01G04250 vs. Swiss-Prot
Match: PP316_ARATH (Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidopsis thaliana GN=DYW10 PE=2 SV=3)

HSP 1 Score: 569.3 bits (1466), Expect = 6.4e-161
Identity = 266/592 (44.93%), Positives = 392/592 (66.22%), Query Frame = 1

Query: 188 NGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRK-KRLDDARSLFDRMPVRDK 247
           N +++  V++G I+ A R+F        ++WN L+ G  +   R+ +A  LFD +P  D 
Sbjct: 65  NKIIARCVRSGDIDGALRVFHGMRAKNTITWNSLLIGISKDPSRMMEAHQLFDEIPEPDT 124

Query: 248 ISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIFEEMPEK 307
            S+NIM++ Y +N    +A+  F+ +P +D  +W  M++G+ + G +++A  +F  M EK
Sbjct: 125 FSYNIMLSCYVRNVNFEKAQSFFDRMPFKDAASWNTMITGYARRGEMEKARELFYSMMEK 184

Query: 308 NEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMP 367
           NEVSWNAMI+GY++   +EKA   F   P R   +W  M+TGY +   ++ A+ +F +M 
Sbjct: 185 NEVSWNAMISGYIECGDLEKASHFFKVAPVRGVVAWTAMITGYMKAKKVELAEAMFKDMT 244

Query: 368 -QRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAALELGK 427
             ++ ++W AMISGY ++ + E+ L LF  M  +G   N S L+ AL  C+E++AL+LG+
Sbjct: 245 VNKNLVTWNAMISGYVENSRPEDGLKLFRAMLEEGIRPNSSGLSSALLGCSELSALQLGR 304

Query: 428 QLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAGYARHG 487
           Q+H  + K+       A  +L++MY KCG + +A+ +FE + +KD+V+WN MI+GYA+HG
Sbjct: 305 QIHQIVSKSTLCNDVTALTSLISMYCKCGELGDAWKLFEVMKKKDVVAWNAMISGYAQHG 364

Query: 488 FGKEALALFESM-KMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITANAKHY 547
              +AL LF  M    I+PD +T V VL AC+H GLV+ GM YF SM ++Y +     HY
Sbjct: 365 NADKALCLFREMIDNKIRPDWITFVAVLLACNHAGLVNIGMAYFESMVRDYKVEPQPDHY 424

Query: 548 TCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVFEMEP 607
           TCM+DLLGRAG+L+EAL L++SMPF P AA +G LLGA R+H + EL E AAEK+ ++  
Sbjct: 425 TCMVDLLGRAGKLEEALKLIRSMPFRPHAAVFGTLLGACRVHKNVELAEFAAEKLLQLNS 484

Query: 608 DNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGDCSHP 667
            N+  YV L+N+YA+  RW +V  +R +M++  V KVPGYSW+ I+NK H F   D  HP
Sbjct: 485 QNAAGYVQLANIYASKNRWEDVARVRKRMKESNVVKVPGYSWIEIRNKVHHFRSSDRIHP 544

Query: 668 EAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPPG 727
           E + I+  L+EL+ ++K  G+    +  LH+VEEE+KE +L +HSEKLAVAFG + +P G
Sbjct: 545 ELDSIHKKLKELEKKMKLAGYKPELEFALHNVEEEQKEKLLLWHSEKLAVAFGCIKLPQG 604

Query: 728 RPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
             I+V KNLR+C DCH AIK IS+I +R+IIVRD+ RFHHF +GSCSCGDYW
Sbjct: 605 SQIQVFKNLRICGDCHKAIKFISEIEKREIIVRDTTRFHHFKDGSCSCGDYW 656

BLAST of CSPI01G04250 vs. Swiss-Prot
Match: PPR57_ARATH (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 536.2 bits (1380), Expect = 6.0e-151
Identity = 294/747 (39.36%), Positives = 432/747 (57.83%), Query Frame = 1

Query: 55  IVDSDIVKWNRKISAYMRKGQCESALSVFNG--MRRRSTVTYNAMISGYLSNNKFDCARK 114
           I + D +     +S Y   G    A  VF    +  R TV YNAMI+G+  NN    A  
Sbjct: 75  ISEPDKIARTTMVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNNDGYSAIN 134

Query: 115 VFEKMPDR----DLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSW-----NAMLSGF 174
           +F KM       D  ++  +L+G     +       F+    K    +     NA++S +
Sbjct: 135 LFCKMKHEGFKPDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVSNALVSVY 194

Query: 175 AQ----NGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMD-WEIVSW 234
           ++       +  ARK+FD++L K+E SW  +++ YV+NG  +    L +   D  ++V++
Sbjct: 195 SKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGMDDNMKLVAY 254

Query: 235 NCLMGGYVRKKRLDDARSLFDRMPVR----DKISWNIMITGYAQNGLLSEARRLFEELPI 294
           N ++ GYV +    +A  +  RM       D+ ++  +I   A  GLL   +++   +  
Sbjct: 255 NAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLGKQVHAYVLR 314

Query: 295 RDVFAW---TAMVSGFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELF 354
           R+ F++    ++VS + + G  DEA  IFE+MP K+ VSWNA+++GYV S          
Sbjct: 315 REDFSFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVSS---------- 374

Query: 355 DQMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALH 414
                                G+I +AK++F EM +++ +SW  MISG A++G  EE L 
Sbjct: 375 ---------------------GHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLK 434

Query: 415 LFIKMKRDGGILNRSALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYG 474
           LF  MKR+G      A + A+ SCA + A   G+Q H +L+K GF +   AGNAL+ MY 
Sbjct: 435 LFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYA 494

Query: 475 KCGSIEEAFDVFEDITEKDIVSWNTMIAGYARHGFGKEALALFESM-KMTIKPDDVTLVG 534
           KCG +EEA  VF  +   D VSWN +IA   +HG G EA+ ++E M K  I+PD +TL+ 
Sbjct: 495 KCGVVEEARQVFRTMPCLDSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLT 554

Query: 535 VLSACSHTGLVDKGMEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFY 594
           VL+ACSH GLVD+G +YF+SM   Y I   A HY  +IDLL R+G+  +A ++++S+PF 
Sbjct: 555 VLTACSHAGLVDQGRKYFDSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFK 614

Query: 595 PDAATWGALLGASRIHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMR 654
           P A  W ALL   R+HG+ ELG  AA+K+F + P++ G Y+LLSN++AA+G+W EV  +R
Sbjct: 615 PTAEIWEALLSGCRVHGNMELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVR 674

Query: 655 SKMRDKGVKKVPGYSWVGIQNKTHIFTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTK 714
             MRD+GVKK    SW+ ++ + H F V D SHPEAE +Y YL++L  E+++ G+V  T 
Sbjct: 675 KLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHPEAEAVYIYLQDLGKEMRRLGYVPDTS 734

Query: 715 LVLHDVEEE-EKEHMLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKI 774
            VLHDVE +  KE ML  HSEK+AVAFG++ +PPG  IR+ KNLR C DCHN  + +S +
Sbjct: 735 FVLHDVESDGHKEDMLTTHSEKIAVAFGLMKLPPGTTIRIFKNLRTCGDCHNFFRFLSWV 790

Query: 775 TQRQIIVRDSNRFHHFSEGSCSCGDYW 777
            QR II+RD  RFHHF  G CSCG++W
Sbjct: 795 VQRDIILRDRKRFHHFRNGECSCGNFW 790

BLAST of CSPI01G04250 vs. TrEMBL
Match: A0A0A0LV20_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G024980 PE=4 SV=1)

HSP 1 Score: 1575.5 bits (4078), Expect = 0.0e+00
Identity = 774/776 (99.74%), Positives = 775/776 (99.87%), Query Frame = 1

Query: 1   MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI 60
           MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI
Sbjct: 1   MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI 60

Query: 61  VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR 120
           VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR
Sbjct: 61  VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR 120

Query: 121 DLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIFDQML 180
           DLISWNVMLSGYVKNGNLSAARALFNQ+PEKDVVSWNAMLSGFAQNGFVEEARKIFDQML
Sbjct: 121 DLISWNVMLSGYVKNGNLSAARALFNQMPEKDVVSWNAMLSGFAQNGFVEEARKIFDQML 180

Query: 181 VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR 240
           VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR
Sbjct: 181 VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR 240

Query: 241 MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF 300
           MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF
Sbjct: 241 MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF 300

Query: 301 EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI 360
           EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI
Sbjct: 301 EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI 360

Query: 361 LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA 420
           LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA
Sbjct: 361 LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA 420

Query: 421 LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG 480
           LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG
Sbjct: 421 LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG 480

Query: 481 YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN 540
           YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN
Sbjct: 481 YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN 540

Query: 541 AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF 600
           AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF
Sbjct: 541 AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF 600

Query: 601 EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGD 660
           EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWV IQNKTHIFTVGD
Sbjct: 601 EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVEIQNKTHIFTVGD 660

Query: 661 CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS 720
           CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS
Sbjct: 661 CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS 720

Query: 721 IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW
Sbjct: 721 IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 776

BLAST of CSPI01G04250 vs. TrEMBL
Match: W9SFH3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000356 PE=4 SV=1)

HSP 1 Score: 1162.1 bits (3005), Expect = 0.0e+00
Identity = 565/775 (72.90%), Positives = 659/775 (85.03%), Query Frame = 1

Query: 4   RGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLE-SKNSDSTIVDSDIVK 63
           RG+ RF  F  S   S+QT++I  KL       PSKKT  +K    KN  S I DSDIV+
Sbjct: 2   RGSHRFRQFHSSCFCSLQTQTINGKL---PNPIPSKKTLIEKHNPKKNKKSNIADSDIVQ 61

Query: 64  WNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDRDL 123
           WN  I+++MR G C++AL VFN M RRS V+YNAMISGYL+N++FD AR +FE+MP+RDL
Sbjct: 62  WNMDITSHMRNGHCKAALRVFNDMSRRSVVSYNAMISGYLANDRFDLARDMFERMPERDL 121

Query: 124 ISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIFDQMLVK 183
           +SWNVMLSGYV+N  L AAR LF+++PE+DVVSWN+MLSG+AQ G+V+EA KIF+ M  K
Sbjct: 122 VSWNVMLSGYVRNRKLGAARMLFDRMPERDVVSWNSMLSGYAQYGYVDEAMKIFEMMPDK 181

Query: 184 NEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDRMP 243
           NEISWN LLSAYVQNGRI+DARRLF+SK DWE+VSWNCLMGGYVRKKRL DAR LFD+MP
Sbjct: 182 NEISWNSLLSAYVQNGRIDDARRLFESKADWEVVSWNCLMGGYVRKKRLVDARKLFDQMP 241

Query: 244 VRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIFEE 303
           +RD +SWN MIT YAQN  L+E+RRLFEE PIRDVFAWTAM+SG+VQ+GMLDEA RIF+E
Sbjct: 242 IRDAVSWNTMITCYAQNSELAESRRLFEESPIRDVFAWTAMMSGYVQHGMLDEARRIFDE 301

Query: 304 MPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKILF 363
           MP KN VSWNA+IAGYV+ ++++ ARELF+ MP RN SSWNTM+T YAQ G+I QA+ +F
Sbjct: 302 MPVKNPVSWNAIIAGYVRCKRMDIARELFEVMPCRNVSSWNTMLTAYAQSGDIAQARFIF 361

Query: 364 DEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAALE 423
           D MPQRD ISWAA+I+GYAQ+G  EEAL LF++MK++G  L RS   CALS+CAEIAALE
Sbjct: 362 DRMPQRDSISWAAIIAGYAQNGYGEEALRLFMEMKKEGERLTRSCYTCALSTCAEIAALE 421

Query: 424 LGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAGYA 483
           LGKQLHGRLVKAGF+TG   GNALL MY KCGSIEEA++VF+DI  KDIVSWNTMIAGYA
Sbjct: 422 LGKQLHGRLVKAGFETGCYVGNALLVMYSKCGSIEEAYNVFKDIEVKDIVSWNTMIAGYA 481

Query: 484 RHGFGKEALALFESMK-MTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITANA 543
           RHGFGKEAL +FESMK M I PDDVTLVGVLSACSHTGLV++G +YF SM Q+YGIT N+
Sbjct: 482 RHGFGKEALMIFESMKAMGIIPDDVTLVGVLSACSHTGLVERGKQYFYSMNQDYGITPNS 541

Query: 544 KHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVFE 603
           KHYTCMIDLLGRAG LDEA +LM++MPF PDAATWGALLGASRIHG+TELGEKAA+ +FE
Sbjct: 542 KHYTCMIDLLGRAGCLDEAQDLMRNMPFEPDAATWGALLGASRIHGNTELGEKAAKIIFE 601

Query: 604 MEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGDC 663
           +EP+N+GMYVLLSNLYAASGRW +VR+MR KMRD GVKKVPGYSWV +QNK H F+VGD 
Sbjct: 602 LEPENAGMYVLLSNLYAASGRWTDVRKMRLKMRDTGVKKVPGYSWVEVQNKVHTFSVGDS 661

Query: 664 SHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSI 723
            HPE +RIYA+LEELDL++K++G+VSSTKLVLHDVEEEEKE+MLKYHSEKLAVAF ILS 
Sbjct: 662 VHPEKDRIYAFLEELDLKMKREGYVSSTKLVLHDVEEEEKENMLKYHSEKLAVAFAILST 721

Query: 724 PPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           PPGRPIRV+KNLRVCEDCH+A K ISKI  R II+RDS RFHHFS GSCSCGDYW
Sbjct: 722 PPGRPIRVMKNLRVCEDCHSAFKIISKIVGRLIILRDSYRFHHFSGGSCSCGDYW 773

BLAST of CSPI01G04250 vs. TrEMBL
Match: M5WCR9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002162mg PE=4 SV=1)

HSP 1 Score: 1138.3 bits (2943), Expect = 0.0e+00
Identity = 537/707 (75.95%), Positives = 624/707 (88.26%), Query Frame = 1

Query: 71  MRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDRDLISWNVMLS 130
           MR G+CE+AL VFN M RRS V+YNAMISGYL+N KFD A+ +FEKMP+RDL+SWNVMLS
Sbjct: 1   MRNGRCEAALRVFNVMPRRSPVSYNAMISGYLANGKFDLAKDMFEKMPERDLVSWNVMLS 60

Query: 131 GYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIFDQMLVKNEISWNGL 190
           GYV+N +L AA ALF ++PEKDVVSWNAMLSG+AQNG+V+EARK+F++M  KNEISWNGL
Sbjct: 61  GYVRNRDLGAAHALFERMPEKDVVSWNAMLSGYAQNGYVDEARKVFERMPNKNEISWNGL 120

Query: 191 LSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDRMPVRDKISWN 250
           L+AYVQNGRIEDARRLF+SK +WE VSWNCLMGG V++KRL  AR LFDRMPVRD++SWN
Sbjct: 121 LAAYVQNGRIEDARRLFESKANWEAVSWNCLMGGLVKQKRLVHARQLFDRMPVRDEVSWN 180

Query: 251 IMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIFEEMPEKNEVS 310
            MITGYAQNG +SEARRLF E PIRDVFAWT+M+SG+VQNGMLDE  R+F+EMPEKN VS
Sbjct: 181 TMITGYAQNGEMSEARRLFGESPIRDVFAWTSMLSGYVQNGMLDEGRRMFDEMPEKNSVS 240

Query: 311 WNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMPQRDC 370
           WNAMIAGYVQ ++++ A +LF  MP RN SSWNT++TGYAQ G+ID A+ +FD MP+RD 
Sbjct: 241 WNAMIAGYVQCKRMDMAMKLFGAMPFRNASSWNTILTGYAQSGDIDNARKIFDSMPRRDS 300

Query: 371 ISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAALELGKQLHGR 430
           ISWAA+I+GYAQ+G SEEAL LF++MKRDG  L RS+  C LS+CAEIAALELGKQLHGR
Sbjct: 301 ISWAAIIAGYAQNGYSEEALCLFVEMKRDGERLTRSSFTCTLSTCAEIAALELGKQLHGR 360

Query: 431 LVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAGYARHGFGKEA 490
           + KAG++TG   GNALL MY KCGSIEEA+DVF+ I EKD+VSWNTMI GYARHGFG +A
Sbjct: 361 VTKAGYETGCYVGNALLVMYCKCGSIEEAYDVFQGIAEKDVVSWNTMIYGYARHGFGSKA 420

Query: 491 LALFESMKMT-IKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITANAKHYTCMID 550
           L +FESMK   IKPDDVT+VGVLSACSHTGLVD+G EYF SM Q+YGITAN+KHYTCMID
Sbjct: 421 LMVFESMKAAGIKPDDVTMVGVLSACSHTGLVDRGTEYFYSMNQDYGITANSKHYTCMID 480

Query: 551 LLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVFEMEPDNSGM 610
           LLGRAGRL+EA NLM+ MPF PDAATWGALLGASRIHG+TELGEKAA+ +FEMEP+N+GM
Sbjct: 481 LLGRAGRLEEAQNLMRDMPFEPDAATWGALLGASRIHGNTELGEKAAQIIFEMEPENAGM 540

Query: 611 YVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGDCSHPEAERI 670
           YVLLSNLYAASGRW EV +MR KM+DKGV+KVPGYSWV +QNK H F+VGD  HP+ ++I
Sbjct: 541 YVLLSNLYAASGRWGEVGKMRLKMKDKGVRKVPGYSWVEVQNKIHTFSVGDSIHPDKDKI 600

Query: 671 YAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPPGRPIRV 730
           YA+LEELDL++K++G++SSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIP GRPIRV
Sbjct: 601 YAFLEELDLKMKREGYISSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPAGRPIRV 660

Query: 731 IKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           IKNLRVC DCHNAIK+ISKI  R II+RDS+RFHHFS G+CSCGDYW
Sbjct: 661 IKNLRVCGDCHNAIKYISKIVGRTIILRDSHRFHHFSGGNCSCGDYW 707

BLAST of CSPI01G04250 vs. TrEMBL
Match: A0A0D2T8Y5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G289000 PE=4 SV=1)

HSP 1 Score: 1127.1 bits (2914), Expect = 0.0e+00
Identity = 533/750 (71.07%), Positives = 635/750 (84.67%), Query Frame = 1

Query: 28  KLRASSTSSPSKKTWTQKLESKNSDSTIVDSDIVKWNRKISAYMRKGQCESALSVFNGMR 87
           +LR +  S PSKKT TQ+  + +    + DSDI +WN  IS YMR  Q +SAL  FN M 
Sbjct: 21  ELRTNPNSYPSKKTLTQRRHN-DKPRPVGDSDIKQWNMAISYYMRNSQLDSALHFFNLMP 80

Query: 88  RRSTVTYNAMISGYLSNNKFDCARKVFEKMPDRDLISWNVMLSGYVKNGNLSAARALFNQ 147
           RRS+V+YNAMISGYL N +FD AR +F++MP+RDL+SWNVM+SG V+N N++AAR LF +
Sbjct: 81  RRSSVSYNAMISGYLMNGRFDLARNLFDEMPERDLVSWNVMISGCVRNNNVAAARELFEE 140

Query: 148 IPEKDVVSWNAMLSGFAQNGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLF 207
           +PE+DVVSWNAMLSG+AQNG ++EARKIFD+M  KN ISWN LL+ YVQNGR+E+A RLF
Sbjct: 141 MPERDVVSWNAMLSGYAQNGCIDEARKIFDRMPCKNSISWNALLATYVQNGRMEEACRLF 200

Query: 208 DSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDRMPVRDKISWNIMITGYAQNGLLSEARR 267
           +SK+DW++VSWNCLMGG+V+KK L DAR +FDR+P RDKISWN +ITGYAQNG + EARR
Sbjct: 201 ESKLDWDLVSWNCLMGGFVKKKMLVDARRVFDRIPFRDKISWNTIITGYAQNGEIEEARR 260

Query: 268 LFEELPIRDVFAWTAMVSGFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKA 327
           LF E P+RDVF WTAMVSGFVQNG++DEA   FE+MP+KN VSWNAMIAGYVQ ++++ A
Sbjct: 261 LFNESPVRDVFTWTAMVSGFVQNGLVDEARETFEQMPQKNAVSWNAMIAGYVQCKRMDMA 320

Query: 328 RELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSE 387
           R+LFD+MP R+ ++WNTM+TGYAQ G I  A+  FD MP+ D +SWAAMI+GYAQSG SE
Sbjct: 321 RKLFDKMPFRDVTTWNTMITGYAQSGEIAHARDFFDRMPRHDPVSWAAMIAGYAQSGYSE 380

Query: 388 EALHLFIKMKRDGGILNRSALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALL 447
           EAL LF+ MKRDG  LNRS+ ACALS+CA IAALELG QLHGRLVKAG+++G   GNALL
Sbjct: 381 EALRLFVDMKRDGERLNRSSFACALSTCAHIAALELGMQLHGRLVKAGYESGSFVGNALL 440

Query: 448 AMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAGYARHGFGKEALALFESMKMT-IKPDDV 507
            MY KCG IEEA   FE+I EKDIVSWNTMIAGYARHGFGKEAL +FESMK   +KP+D 
Sbjct: 441 LMYCKCGGIEEACSAFEEIMEKDIVSWNTMIAGYARHGFGKEALKIFESMKAAGVKPNDT 500

Query: 508 TLVGVLSACSHTGLVDKGMEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKS 567
           T+VGVLSACSH GLVD+GMEYF SM Q+YGITAN++HYTCM+DLLGRAGRLDEA  L+++
Sbjct: 501 TMVGVLSACSHAGLVDRGMEYFYSMNQDYGITANSRHYTCMVDLLGRAGRLDEAQKLIRN 560

Query: 568 MPFYPDAATWGALLGASRIHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREV 627
           MPF PDAATWGALLGASRIHG+T+L E AAE +FEMEP+N+GMYVLLSNLYAASGRW +V
Sbjct: 561 MPFEPDAATWGALLGASRIHGNTKLAEMAAELIFEMEPENAGMYVLLSNLYAASGRWADV 620

Query: 628 REMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGDCSHPEAERIYAYLEELDLELKKDGFV 687
            +MR KMRD GVKKVPG SW+ +QNK H F+VGD  HP+ ++IYAYLEELDL++K++G+V
Sbjct: 621 SKMRLKMRDTGVKKVPGCSWLEVQNKIHTFSVGDSCHPDRDKIYAYLEELDLKMKREGYV 680

Query: 688 SSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHI 747
           SS KL+LHDV+EEEKEHMLKYHSEKLAVA+GILSIP GRPIRV+KNLRVCEDCHNAIK+I
Sbjct: 681 SSIKLILHDVDEEEKEHMLKYHSEKLAVAYGILSIPAGRPIRVMKNLRVCEDCHNAIKYI 740

Query: 748 SKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           SKI  R II+RDSNRFHHF EGSCSCGDYW
Sbjct: 741 SKIVGRLIILRDSNRFHHFREGSCSCGDYW 769

BLAST of CSPI01G04250 vs. TrEMBL
Match: Q1SN04_MEDTR (Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_8g106950 PE=4 SV=1)

HSP 1 Score: 1124.8 bits (2908), Expect = 0.0e+00
Identity = 528/739 (71.45%), Positives = 635/739 (85.93%), Query Frame = 1

Query: 41  TWTQKLES--KNSDSTIVDSDIVKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMI 100
           T T++ ES   N+   + D DI+KWN+ IS +MR G C+SAL VFN M RRS+V+YNAMI
Sbjct: 28  TSTRRSESVTNNNKPRVKDPDILKWNKAISTHMRNGHCDSALHVFNTMPRRSSVSYNAMI 87

Query: 101 SGYLSNNKFDCARKVFEKMPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNA 160
           SGYL N+KF+ AR +F++MP+RDL SWNVML+GYV+N  L  AR LF+ +PEKDVVSWN+
Sbjct: 88  SGYLRNSKFNLARNLFDQMPERDLFSWNVMLTGYVRNCRLGDARRLFDLMPEKDVVSWNS 147

Query: 161 MLSGFAQNGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSW 220
           +LSG+AQNG+V+EAR++FD M  KN ISWNGLL+AYV NGRIE+A  LF+SK DW+++SW
Sbjct: 148 LLSGYAQNGYVDEAREVFDNMPEKNSISWNGLLAAYVHNGRIEEACLLFESKSDWDLISW 207

Query: 221 NCLMGGYVRKKRLDDARSLFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVF 280
           NCLMGG+VRKK+L DAR LFD+MPVRD ISWN MI+GYAQ G LS+ARRLF+E P RDVF
Sbjct: 208 NCLMGGFVRKKKLGDARWLFDKMPVRDAISWNTMISGYAQGGGLSQARRLFDESPTRDVF 267

Query: 281 AWTAMVSGFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRN 340
            WTAMVSG+VQNGMLDEA   F+EMPEKNEVS+NAMIAGYVQ+++++ ARELF+ MP RN
Sbjct: 268 TWTAMVSGYVQNGMLDEAKTFFDEMPEKNEVSYNAMIAGYVQTKKMDIARELFESMPCRN 327

Query: 341 TSSWNTMVTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKR 400
            SSWNTM+TGY Q G+I QA+  FD MPQRDC+SWAA+I+GYAQSG  EEAL++F+++K+
Sbjct: 328 ISSWNTMITGYGQIGDIAQARKFFDMMPQRDCVSWAAIIAGYAQSGHYEEALNMFVEIKQ 387

Query: 401 DGGILNRSALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEE 460
           DG  LNR+   CALS+CA+IAALELGKQ+HG+ VK G+ TG   GNALLAMY KCGSI+E
Sbjct: 388 DGESLNRATFGCALSTCADIAALELGKQIHGQAVKMGYGTGCFVGNALLAMYFKCGSIDE 447

Query: 461 AFDVFEDITEKDIVSWNTMIAGYARHGFGKEALALFESMKMT-IKPDDVTLVGVLSACSH 520
           A D FE I EKD+VSWNTM+AGYARHGFG++AL +FESMK   +KPD++T+VGVLSACSH
Sbjct: 448 ANDTFEGIEEKDVVSWNTMLAGYARHGFGRQALTVFESMKTAGVKPDEITMVGVLSACSH 507

Query: 521 TGLVDKGMEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWG 580
           TGL+D+G EYF SM ++YG+   +KHYTCMIDLLGRAGRL+EA +L+++MPF P AA+WG
Sbjct: 508 TGLLDRGTEYFYSMTKDYGVIPTSKHYTCMIDLLGRAGRLEEAQDLIRNMPFQPGAASWG 567

Query: 581 ALLGASRIHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKG 640
           ALLGASRIHG+TELGEKAAE VF+MEP NSGMYVLLSNLYAASGRW +  +MRSKMRD G
Sbjct: 568 ALLGASRIHGNTELGEKAAEMVFKMEPQNSGMYVLLSNLYAASGRWVDADKMRSKMRDIG 627

Query: 641 VKKVPGYSWVGIQNKTHIFTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVE 700
           V+KVPGYSWV +QNK H F+VGDCSHPE ERIYAYLEELDL+++++G+VS TKLVLHDVE
Sbjct: 628 VQKVPGYSWVEVQNKIHTFSVGDCSHPEKERIYAYLEELDLKMREEGYVSLTKLVLHDVE 687

Query: 701 EEEKEHMLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVR 760
           EEEKEHMLKYHSEKLAVAFGIL+IP GRPIRV+KNLRVCEDCH+AIKHISKI  R II+R
Sbjct: 688 EEEKEHMLKYHSEKLAVAFGILTIPGGRPIRVMKNLRVCEDCHSAIKHISKIVGRLIILR 747

Query: 761 DSNRFHHFSEGSCSCGDYW 777
           DS+RFHHF+EG CSCGDYW
Sbjct: 748 DSHRFHHFNEGFCSCGDYW 766

BLAST of CSPI01G04250 vs. TAIR10
Match: AT4G02750.1 (AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 1055.4 bits (2728), Expect = 1.7e-308
Identity = 481/721 (66.71%), Positives = 605/721 (83.91%), Query Frame = 1

Query: 57  DSDIVKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEK 116
           DSDI +WN  IS+YMR G+C  AL VF  M R S+V+YN MISGYL N +F+ ARK+F++
Sbjct: 61  DSDIKEWNVAISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDE 120

Query: 117 MPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIF 176
           MP+RDL+SWNVM+ GYV+N NL  AR LF  +PE+DV SWN MLSG+AQNG V++AR +F
Sbjct: 121 MPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVF 180

Query: 177 DQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARS 236
           D+M  KN++SWN LLSAYVQN ++E+A  LF S+ +W +VSWNCL+GG+V+KK++ +AR 
Sbjct: 181 DRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEARQ 240

Query: 237 LFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEA 296
            FD M VRD +SWN +ITGYAQ+G + EAR+LF+E P++DVF WTAMVSG++QN M++EA
Sbjct: 241 FFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQDVFTWTAMVSGYIQNRMVEEA 300

Query: 297 TRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNID 356
             +F++MPE+NEVSWNAM+AGYVQ +++E A+ELFD MP RN S+WNTM+TGYAQCG I 
Sbjct: 301 RELFDKMPERNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKIS 360

Query: 357 QAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCA 416
           +AK LFD+MP+RD +SWAAMI+GY+QSG S EAL LF++M+R+GG LNRS+ + ALS+CA
Sbjct: 361 EAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCA 420

Query: 417 EIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNT 476
           ++ ALELGKQLHGRLVK G++TG   GNALL MY KCGSIEEA D+F+++  KDIVSWNT
Sbjct: 421 DVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNT 480

Query: 477 MIAGYARHGFGKEALALFESMKMT-IKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNY 536
           MIAGY+RHGFG+ AL  FESMK   +KPDD T+V VLSACSHTGLVDKG +YF +M Q+Y
Sbjct: 481 MIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDY 540

Query: 537 GITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKA 596
           G+  N++HY CM+DLLGRAG L++A NLMK+MPF PDAA WG LLGASR+HG+TEL E A
Sbjct: 541 GVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETA 600

Query: 597 AEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHI 656
           A+K+F MEP+NSGMYVLLSNLYA+SGRW +V ++R +MRDKGVKKVPGYSW+ IQNKTH 
Sbjct: 601 ADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHT 660

Query: 657 FTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVA 716
           F+VGD  HPE + I+A+LEELDL +KK G+VS T +VLHDVEEEEKE M++YHSE+LAVA
Sbjct: 661 FSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDVEEEEKERMVRYHSERLAVA 720

Query: 717 FGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDY 776
           +GI+ +  GRPIRVIKNLRVCEDCHNAIK++++IT R II+RD+NRFHHF +GSCSCGDY
Sbjct: 721 YGIMRVSSGRPIRVIKNLRVCEDCHNAIKYMARITGRLIILRDNNRFHHFKDGSCSCGDY 780

BLAST of CSPI01G04250 vs. TAIR10
Match: AT1G09410.1 (AT1G09410.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 652.5 bits (1682), Expect = 3.2e-187
Identity = 308/687 (44.83%), Positives = 449/687 (65.36%), Query Frame = 1

Query: 93  TYNAMISGYLSNNKFDCARKVFEKMPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKD 152
           T N  I+      K   ARK+F+    + + SWN M++GY  N     AR LF+++P+++
Sbjct: 19  TANVRITHLSRIGKIHEARKLFDSCDSKSISSWNSMVAGYFANLMPRDARKLFDEMPDRN 78

Query: 153 VVSWNAMLSGFAQNGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMD 212
           ++SWN ++SG+ +NG ++EARK+FD M  +N +SW  L+  YV NG+++ A  LF    +
Sbjct: 79  IISWNGLVSGYMKNGEIDEARKVFDLMPERNVVSWTALVKGYVHNGKVDVAESLFWKMPE 138

Query: 213 WEIVSWNCLMGGYVRKKRLDDARSLFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEEL 272
              VSW  ++ G+++  R+DDA  L++ +P +D I+   MI G  + G + EAR +F+E+
Sbjct: 139 KNKVSWTVMLIGFLQDGRIDDACKLYEMIPDKDNIARTSMIHGLCKEGRVDEAREIFDEM 198

Query: 273 PIRDVFAWTAMVSGFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFD 332
             R V  WT MV+G+ QN  +D+A +IF+ MPEK EVSW +M+ GYVQ+ +IE A ELF+
Sbjct: 199 SERSVITWTTMVTGYGQNNRVDDARKIFDVMPEKTEVSWTSMLMGYVQNGRIEDAEELFE 258

Query: 333 QMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHL 392
            MP +   + N M++G  Q G I +A+ +FD M +R+  SW  +I  + ++G   EAL L
Sbjct: 259 VMPVKPVIACNAMISGLGQKGEIAKARRVFDSMKERNDASWQTVIKIHERNGFELEALDL 318

Query: 393 FIKMKRDGGILNRSALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGK 452
           FI M++ G       L   LS CA +A+L  GKQ+H +LV+  F       + L+ MY K
Sbjct: 319 FILMQKQGVRPTFPTLISILSVCASLASLHHGKQVHAQLVRCQFDVDVYVASVLMTMYIK 378

Query: 453 CGSIEEAFDVFEDITEKDIVSWNTMIAGYARHGFGKEALALFESMKM--TIKPDDVTLVG 512
           CG + ++  +F+    KDI+ WN++I+GYA HG G+EAL +F  M +  + KP++VT V 
Sbjct: 379 CGELVKSKLIFDRFPSKDIIMWNSIISGYASHGLGEEALKVFCEMPLSGSTKPNEVTFVA 438

Query: 513 VLSACSHTGLVDKGMEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFY 572
            LSACS+ G+V++G++ + SM   +G+     HY CM+D+LGRAGR +EA+ ++ SM   
Sbjct: 439 TLSACSYAGMVEEGLKIYESMESVFGVKPITAHYACMVDMLGRAGRFNEAMEMIDSMTVE 498

Query: 573 PDAATWGALLGASRIHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMR 632
           PDAA WG+LLGA R H   ++ E  A+K+ E+EP+NSG Y+LLSN+YA+ GRW +V E+R
Sbjct: 499 PDAAVWGSLLGACRTHSQLDVAEFCAKKLIEIEPENSGTYILLSNMYASQGRWADVAELR 558

Query: 633 SKMRDKGVKKVPGYSWVGIQNKTHIFTVGDC-SHPEAERIYAYLEELDLELKKDGFVSST 692
             M+ + V+K PG SW  ++NK H FT G   SHPE E I   L+ELD  L++ G+    
Sbjct: 559 KLMKTRLVRKSPGCSWTEVENKVHAFTRGGINSHPEQESILKILDELDGLLREAGYNPDC 618

Query: 693 KLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKI 752
              LHDV+EEEK + LKYHSE+LAVA+ +L +  G PIRV+KNLRVC DCH AIK ISK+
Sbjct: 619 SYALHDVDEEEKVNSLKYHSERLAVAYALLKLSEGIPIRVMKNLRVCSDCHTAIKIISKV 678

Query: 753 TQRQIIVRDSNRFHHFSEGSCSCGDYW 777
            +R+II+RD+NRFHHF  G CSC DYW
Sbjct: 679 KEREIILRDANRFHHFRNGECSCKDYW 705

BLAST of CSPI01G04250 vs. TAIR10
Match: AT1G56690.1 (AT1G56690.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 634.4 bits (1635), Expect = 9.1e-182
Identity = 299/673 (44.43%), Positives = 435/673 (64.64%), Query Frame = 1

Query: 106 KFDCARKVFEKMPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQ 165
           K + ARK F+ +  + + SWN ++SGY  NG    AR LF+++ E++VVSWN ++SG+ +
Sbjct: 32  KINEARKFFDSLQFKAIGSWNSIVSGYFSNGLPKEARQLFDEMSERNVVSWNGLVSGYIK 91

Query: 166 NGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGY 225
           N  + EAR +F+ M  +N +SW  ++  Y+Q G + +A  LF    +   VSW  + GG 
Sbjct: 92  NRMIVEARNVFELMPERNVVSWTAMVKGYMQEGMVGEAESLFWRMPERNEVSWTVMFGGL 151

Query: 226 VRKKRLDDARSLFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVS 285
           +   R+D AR L+D MPV+D ++   MI G  + G + EAR +F+E+  R+V  WT M++
Sbjct: 152 IDDGRIDKARKLYDMMPVKDVVASTNMIGGLCREGRVDEARLIFDEMRERNVVTWTTMIT 211

Query: 286 GFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTM 345
           G+ QN  +D A ++FE MPEK EVSW +M+ GY  S +IE A E F+ MP +   + N M
Sbjct: 212 GYRQNNRVDVARKLFEVMPEKTEVSWTSMLLGYTLSGRIEDAEEFFEVMPMKPVIACNAM 271

Query: 346 VTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNR 405
           + G+ + G I +A+ +FD M  RD  +W  MI  Y + G   EAL LF +M++ G   + 
Sbjct: 272 IVGFGEVGEISKARRVFDLMEDRDNATWRGMIKAYERKGFELEALDLFAQMQKQGVRPSF 331

Query: 406 SALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFED 465
            +L   LS CA +A+L+ G+Q+H  LV+  F       + L+ MY KCG + +A  VF+ 
Sbjct: 332 PSLISILSVCATLASLQYGRQVHAHLVRCQFDDDVYVASVLMTMYVKCGELVKAKLVFDR 391

Query: 466 ITEKDIVSWNTMIAGYARHGFGKEALALFESMKMT-IKPDDVTLVGVLSACSHTGLVDKG 525
            + KDI+ WN++I+GYA HG G+EAL +F  M  +   P+ VTL+ +L+ACS+ G +++G
Sbjct: 392 FSSKDIIMWNSIISGYASHGLGEEALKIFHEMPSSGTMPNKVTLIAILTACSYAGKLEEG 451

Query: 526 MEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASR 585
           +E F SM   + +T   +HY+C +D+LGRAG++D+A+ L++SM   PDA  WGALLGA +
Sbjct: 452 LEIFESMESKFCVTPTVEHYSCTVDMLGRAGQVDKAMELIESMTIKPDATVWGALLGACK 511

Query: 586 IHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGY 645
            H   +L E AA+K+FE EPDN+G YVLLS++ A+  +W +V  +R  MR   V K PG 
Sbjct: 512 THSRLDLAEVAAKKLFENEPDNAGTYVLLSSINASRSKWGDVAVVRKNMRTNNVSKFPGC 571

Query: 646 SWVGIQNKTHIFTVGDC-SHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEH 705
           SW+ +  K H+FT G   +HPE   I   LE+ D  L++ G+      VLHDV+EEEK  
Sbjct: 572 SWIEVGKKVHMFTRGGIKNHPEQAMILMMLEKTDGLLREAGYSPDCSHVLHDVDEEEKVD 631

Query: 706 MLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFH 765
            L  HSE+LAVA+G+L +P G PIRV+KNLRVC DCH AIK ISK+T+R+II+RD+NRFH
Sbjct: 632 SLSRHSERLAVAYGLLKLPEGVPIRVMKNLRVCGDCHAAIKLISKVTEREIILRDANRFH 691

Query: 766 HFSEGSCSCGDYW 777
           HF+ G CSC DYW
Sbjct: 692 HFNNGECSCRDYW 704

BLAST of CSPI01G04250 vs. TAIR10
Match: AT4G16835.1 (AT4G16835.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 569.3 bits (1466), Expect = 3.6e-162
Identity = 266/592 (44.93%), Positives = 392/592 (66.22%), Query Frame = 1

Query: 188 NGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRK-KRLDDARSLFDRMPVRDK 247
           N +++  V++G I+ A R+F        ++WN L+ G  +   R+ +A  LFD +P  D 
Sbjct: 65  NKIIARCVRSGDIDGALRVFHGMRAKNTITWNSLLIGISKDPSRMMEAHQLFDEIPEPDT 124

Query: 248 ISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIFEEMPEK 307
            S+NIM++ Y +N    +A+  F+ +P +D  +W  M++G+ + G +++A  +F  M EK
Sbjct: 125 FSYNIMLSCYVRNVNFEKAQSFFDRMPFKDAASWNTMITGYARRGEMEKARELFYSMMEK 184

Query: 308 NEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMP 367
           NEVSWNAMI+GY++   +EKA   F   P R   +W  M+TGY +   ++ A+ +F +M 
Sbjct: 185 NEVSWNAMISGYIECGDLEKASHFFKVAPVRGVVAWTAMITGYMKAKKVELAEAMFKDMT 244

Query: 368 -QRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAALELGK 427
             ++ ++W AMISGY ++ + E+ L LF  M  +G   N S L+ AL  C+E++AL+LG+
Sbjct: 245 VNKNLVTWNAMISGYVENSRPEDGLKLFRAMLEEGIRPNSSGLSSALLGCSELSALQLGR 304

Query: 428 QLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAGYARHG 487
           Q+H  + K+       A  +L++MY KCG + +A+ +FE + +KD+V+WN MI+GYA+HG
Sbjct: 305 QIHQIVSKSTLCNDVTALTSLISMYCKCGELGDAWKLFEVMKKKDVVAWNAMISGYAQHG 364

Query: 488 FGKEALALFESM-KMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITANAKHY 547
              +AL LF  M    I+PD +T V VL AC+H GLV+ GM YF SM ++Y +     HY
Sbjct: 365 NADKALCLFREMIDNKIRPDWITFVAVLLACNHAGLVNIGMAYFESMVRDYKVEPQPDHY 424

Query: 548 TCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVFEMEP 607
           TCM+DLLGRAG+L+EAL L++SMPF P AA +G LLGA R+H + EL E AAEK+ ++  
Sbjct: 425 TCMVDLLGRAGKLEEALKLIRSMPFRPHAAVFGTLLGACRVHKNVELAEFAAEKLLQLNS 484

Query: 608 DNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGDCSHP 667
            N+  YV L+N+YA+  RW +V  +R +M++  V KVPGYSW+ I+NK H F   D  HP
Sbjct: 485 QNAAGYVQLANIYASKNRWEDVARVRKRMKESNVVKVPGYSWIEIRNKVHHFRSSDRIHP 544

Query: 668 EAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSIPPG 727
           E + I+  L+EL+ ++K  G+    +  LH+VEEE+KE +L +HSEKLAVAFG + +P G
Sbjct: 545 ELDSIHKKLKELEKKMKLAGYKPELEFALHNVEEEQKEKLLLWHSEKLAVAFGCIKLPQG 604

Query: 728 RPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
             I+V KNLR+C DCH AIK IS+I +R+IIVRD+ RFHHF +GSCSCGDYW
Sbjct: 605 SQIQVFKNLRICGDCHKAIKFISEIEKREIIVRDTTRFHHFKDGSCSCGDYW 656

BLAST of CSPI01G04250 vs. TAIR10
Match: AT1G25360.1 (AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 536.2 bits (1380), Expect = 3.4e-152
Identity = 294/747 (39.36%), Positives = 432/747 (57.83%), Query Frame = 1

Query: 55  IVDSDIVKWNRKISAYMRKGQCESALSVFNG--MRRRSTVTYNAMISGYLSNNKFDCARK 114
           I + D +     +S Y   G    A  VF    +  R TV YNAMI+G+  NN    A  
Sbjct: 75  ISEPDKIARTTMVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNNDGYSAIN 134

Query: 115 VFEKMPDR----DLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSW-----NAMLSGF 174
           +F KM       D  ++  +L+G     +       F+    K    +     NA++S +
Sbjct: 135 LFCKMKHEGFKPDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVSNALVSVY 194

Query: 175 AQ----NGFVEEARKIFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMD-WEIVSW 234
           ++       +  ARK+FD++L K+E SW  +++ YV+NG  +    L +   D  ++V++
Sbjct: 195 SKCASSPSLLHSARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGMDDNMKLVAY 254

Query: 235 NCLMGGYVRKKRLDDARSLFDRMPVR----DKISWNIMITGYAQNGLLSEARRLFEELPI 294
           N ++ GYV +    +A  +  RM       D+ ++  +I   A  GLL   +++   +  
Sbjct: 255 NAMISGYVNRGFYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLGKQVHAYVLR 314

Query: 295 RDVFAW---TAMVSGFVQNGMLDEATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELF 354
           R+ F++    ++VS + + G  DEA  IFE+MP K+ VSWNA+++GYV S          
Sbjct: 315 REDFSFHFDNSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVSS---------- 374

Query: 355 DQMPSRNTSSWNTMVTGYAQCGNIDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALH 414
                                G+I +AK++F EM +++ +SW  MISG A++G  EE L 
Sbjct: 375 ---------------------GHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLK 434

Query: 415 LFIKMKRDGGILNRSALACALSSCAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYG 474
           LF  MKR+G      A + A+ SCA + A   G+Q H +L+K GF +   AGNAL+ MY 
Sbjct: 435 LFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITMYA 494

Query: 475 KCGSIEEAFDVFEDITEKDIVSWNTMIAGYARHGFGKEALALFESM-KMTIKPDDVTLVG 534
           KCG +EEA  VF  +   D VSWN +IA   +HG G EA+ ++E M K  I+PD +TL+ 
Sbjct: 495 KCGVVEEARQVFRTMPCLDSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRITLLT 554

Query: 535 VLSACSHTGLVDKGMEYFNSMYQNYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFY 594
           VL+ACSH GLVD+G +YF+SM   Y I   A HY  +IDLL R+G+  +A ++++S+PF 
Sbjct: 555 VLTACSHAGLVDQGRKYFDSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIESLPFK 614

Query: 595 PDAATWGALLGASRIHGDTELGEKAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMR 654
           P A  W ALL   R+HG+ ELG  AA+K+F + P++ G Y+LLSN++AA+G+W EV  +R
Sbjct: 615 PTAEIWEALLSGCRVHGNMELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEVARVR 674

Query: 655 SKMRDKGVKKVPGYSWVGIQNKTHIFTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTK 714
             MRD+GVKK    SW+ ++ + H F V D SHPEAE +Y YL++L  E+++ G+V  T 
Sbjct: 675 KLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHPEAEAVYIYLQDLGKEMRRLGYVPDTS 734

Query: 715 LVLHDVEEE-EKEHMLKYHSEKLAVAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKI 774
            VLHDVE +  KE ML  HSEK+AVAFG++ +PPG  IR+ KNLR C DCHN  + +S +
Sbjct: 735 FVLHDVESDGHKEDMLTTHSEKIAVAFGLMKLPPGTTIRIFKNLRTCGDCHNFFRFLSWV 790

Query: 775 TQRQIIVRDSNRFHHFSEGSCSCGDYW 777
            QR II+RD  RFHHF  G CSCG++W
Sbjct: 795 VQRDIILRDRKRFHHFRNGECSCGNFW 790

BLAST of CSPI01G04250 vs. NCBI nr
Match: gi|778656470|ref|XP_004137551.2| (PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Cucumis sativus])

HSP 1 Score: 1575.5 bits (4078), Expect = 0.0e+00
Identity = 774/776 (99.74%), Positives = 775/776 (99.87%), Query Frame = 1

Query: 1   MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI 60
           MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI
Sbjct: 1   MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI 60

Query: 61  VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR 120
           VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR
Sbjct: 61  VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR 120

Query: 121 DLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIFDQML 180
           DLISWNVMLSGYVKNGNLSAARALFNQ+PEKDVVSWNAMLSGFAQNGFVEEARKIFDQML
Sbjct: 121 DLISWNVMLSGYVKNGNLSAARALFNQMPEKDVVSWNAMLSGFAQNGFVEEARKIFDQML 180

Query: 181 VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR 240
           VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR
Sbjct: 181 VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR 240

Query: 241 MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF 300
           MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF
Sbjct: 241 MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF 300

Query: 301 EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI 360
           EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI
Sbjct: 301 EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI 360

Query: 361 LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA 420
           LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA
Sbjct: 361 LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA 420

Query: 421 LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG 480
           LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG
Sbjct: 421 LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG 480

Query: 481 YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN 540
           YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN
Sbjct: 481 YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN 540

Query: 541 AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF 600
           AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF
Sbjct: 541 AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF 600

Query: 601 EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGD 660
           EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWV IQNKTHIFTVGD
Sbjct: 601 EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVEIQNKTHIFTVGD 660

Query: 661 CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS 720
           CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS
Sbjct: 661 CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS 720

Query: 721 IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW
Sbjct: 721 IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 776

BLAST of CSPI01G04250 vs. NCBI nr
Match: gi|659106935|ref|XP_008453471.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Cucumis melo])

HSP 1 Score: 1537.3 bits (3979), Expect = 0.0e+00
Identity = 756/776 (97.42%), Positives = 764/776 (98.45%), Query Frame = 1

Query: 1   MQARGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLESKNSDSTIVDSDI 60
           MQARGNCRFSLFIGSSSRSIQTESITNKLRASS SSPSKKTWTQKLESKN+D TIVDSDI
Sbjct: 1   MQARGNCRFSLFIGSSSRSIQTESITNKLRASSNSSPSKKTWTQKLESKNTDPTIVDSDI 60

Query: 61  VKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDR 120
           VKWNRKISAYMRKGQCESALSVFNGM RRSTVTYNAMISGYLSN KFDCARKVFEKMP R
Sbjct: 61  VKWNRKISAYMRKGQCESALSVFNGMPRRSTVTYNAMISGYLSNYKFDCARKVFEKMPHR 120

Query: 121 DLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIFDQML 180
           DLISWNVMLSGYVKNGNLSAARALFNQ+PEKDVVSWNAMLSGFAQNGFVEEARKIFDQML
Sbjct: 121 DLISWNVMLSGYVKNGNLSAARALFNQMPEKDVVSWNAMLSGFAQNGFVEEARKIFDQML 180

Query: 181 VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDR 240
           VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYV+KKRLDDARSLFDR
Sbjct: 181 VKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVKKKRLDDARSLFDR 240

Query: 241 MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF 300
           MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF
Sbjct: 241 MPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIF 300

Query: 301 EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKI 360
           EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRN SSWNTMVTGYAQCGNIDQAKI
Sbjct: 301 EEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNISSWNTMVTGYAQCGNIDQAKI 360

Query: 361 LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAA 420
           LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFI+MKRDGGILNRSAL  ALSSCAEIAA
Sbjct: 361 LFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIEMKRDGGILNRSALVGALSSCAEIAA 420

Query: 421 LELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAG 480
           LELGKQLHGRLVKAGF+TGY AGNALLAMY KCGSIEEAFDVFEDITEKDIVSWNTMIAG
Sbjct: 421 LELGKQLHGRLVKAGFETGYYAGNALLAMYCKCGSIEEAFDVFEDITEKDIVSWNTMIAG 480

Query: 481 YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN 540
           YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN
Sbjct: 481 YARHGFGKEALALFESMKMTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITAN 540

Query: 541 AKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVF 600
           +KHYTCMIDLLGRAGRLDEALNLMKSMPF+PDAATWGALLGASRIHGDTELGEKAAEKVF
Sbjct: 541 SKHYTCMIDLLGRAGRLDEALNLMKSMPFHPDAATWGALLGASRIHGDTELGEKAAEKVF 600

Query: 601 EMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGD 660
           EMEPDNSGMYVLLSNLYAASGRWREVREMR KMRDKGVKKVPGYSWV IQNKTHIFTVGD
Sbjct: 601 EMEPDNSGMYVLLSNLYAASGRWREVREMRLKMRDKGVKKVPGYSWVEIQNKTHIFTVGD 660

Query: 661 CSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS 720
           CSHPEAERIYAYLEELDLELKK+GFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS
Sbjct: 661 CSHPEAERIYAYLEELDLELKKEGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILS 720

Query: 721 IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW
Sbjct: 721 IPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 776

BLAST of CSPI01G04250 vs. NCBI nr
Match: gi|645269813|ref|XP_008240171.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Prunus mume])

HSP 1 Score: 1165.6 bits (3014), Expect = 0.0e+00
Identity = 564/781 (72.22%), Positives = 663/781 (84.89%), Query Frame = 1

Query: 4   RGNCRFSLFIGSSS--RSIQTESITNKLRASSTSS-----PSKKTWTQKLESKNSDSTIV 63
           RG+ RF     SS   RS++T   +N    +   +     PSKKT TQK  S N  S   
Sbjct: 2   RGSYRFRQLHSSSFSLRSLETRQPSNIKHQTGNPTCPNPIPSKKTLTQKRRSMNKLSNAS 61

Query: 64  DSDIVKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEK 123
           DS+IVK N+ I+  MR G+CE+AL VFN M RRS V+YNAMISGYL+N KFD A+ +FEK
Sbjct: 62  DSEIVKLNKDITTQMRNGRCEAALRVFNVMPRRSPVSYNAMISGYLANGKFDLAKDMFEK 121

Query: 124 MPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIF 183
           MP RDL+SWNVMLSGYV+N +L AA ALF ++PEKDVVSWNAMLSG+AQNG+V+EARK+F
Sbjct: 122 MPVRDLVSWNVMLSGYVRNRDLGAAHALFERMPEKDVVSWNAMLSGYAQNGYVDEARKVF 181

Query: 184 DQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARS 243
           ++M  KNEISWNGLL+AYVQNGRIEDARRLF+SK +WE VSWNCLMGG+V++KRL  AR 
Sbjct: 182 ERMPDKNEISWNGLLAAYVQNGRIEDARRLFESKANWEAVSWNCLMGGFVKQKRLVHARQ 241

Query: 244 LFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEA 303
           +FDRMPVRD++SWN MITGYAQNG +SEARRLF E PIRDVFAWT+M+SG+VQNGMLDE 
Sbjct: 242 IFDRMPVRDEVSWNTMITGYAQNGEMSEARRLFGESPIRDVFAWTSMLSGYVQNGMLDEG 301

Query: 304 TRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNID 363
            R+F+EMPEKN VSWNAMIAGYVQ ++++ A +LF+ MP RN SSWNT++TGYAQ G+ID
Sbjct: 302 RRMFDEMPEKNSVSWNAMIAGYVQCKRMDMAMKLFEAMPFRNASSWNTILTGYAQSGDID 361

Query: 364 QAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCA 423
            A+ +FD MP+RD ISWAA+I+GYAQ+G SEEAL LF++MKRDG  L RS+  CALS+CA
Sbjct: 362 SARKIFDSMPRRDSISWAAIIAGYAQNGYSEEALCLFVEMKRDGERLTRSSFTCALSTCA 421

Query: 424 EIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNT 483
           EIAALELGKQLHGR+ KAG++TG   GNALL MY KCGSIEEA+DVF+ I EKD+VSWNT
Sbjct: 422 EIAALELGKQLHGRMTKAGYETGCYVGNALLVMYCKCGSIEEAYDVFQGIAEKDVVSWNT 481

Query: 484 MIAGYARHGFGKEALALFESMKMT-IKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNY 543
           MI GYARHGFG +AL +FESMK   IKPDDVT+VGVLSACSHTGLVD+G EYF SM Q+Y
Sbjct: 482 MIYGYARHGFGSKALMVFESMKAAGIKPDDVTMVGVLSACSHTGLVDRGTEYFYSMNQDY 541

Query: 544 GITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKA 603
           GITAN+KHYTCMIDLLGRAGRL+EA NLM+ MPF PDAATWGALLGASRIHG+TELGEKA
Sbjct: 542 GITANSKHYTCMIDLLGRAGRLEEAQNLMRDMPFEPDAATWGALLGASRIHGNTELGEKA 601

Query: 604 AEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHI 663
           A+ +FEMEP+N+GMYVLLSNLYAASGRW EV +MR KM+DKGV+KVPGYSWV +QNK H 
Sbjct: 602 AQIIFEMEPENAGMYVLLSNLYAASGRWGEVGKMRLKMKDKGVRKVPGYSWVEVQNKIHT 661

Query: 664 FTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVA 723
           F+VGD  HP+ ++IYA+LEELDL++K++G++SSTKLVLHDVEEEEKEHMLKYHSEKLAVA
Sbjct: 662 FSVGDSIHPDKDKIYAFLEELDLKMKREGYISSTKLVLHDVEEEEKEHMLKYHSEKLAVA 721

Query: 724 FGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDY 777
           FGILSIP GRPIRVIKNLRVC DCHNAIK+ISKI  R +I+RDS+RFHHFS G+CSCGDY
Sbjct: 722 FGILSIPAGRPIRVIKNLRVCGDCHNAIKYISKIVGRTVILRDSHRFHHFSGGNCSCGDY 781

BLAST of CSPI01G04250 vs. NCBI nr
Match: gi|657966297|ref|XP_008374833.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Malus domestica])

HSP 1 Score: 1162.9 bits (3007), Expect = 0.0e+00
Identity = 569/783 (72.67%), Positives = 667/783 (85.19%), Query Frame = 1

Query: 4   RGNCRF-SLFIGSS---SRSIQTESITN-KLRASSTSS----PSKKTWTQKLESKNSDST 63
           RG+ RF  L  GSS   SRS +++   N K R  + +S    PSKKT TQK  S N  S 
Sbjct: 2   RGSYRFRQLHSGSSNFCSRSFKSQQPINIKDRTGNPTSQNPIPSKKTLTQKRRSMNKLSN 61

Query: 64  IVDSDIVKWNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVF 123
             DS+IVK N  I+  MR G+CE+AL VFN M RRS V+YNAM+SGYL+N KFD A+ +F
Sbjct: 62  SSDSEIVKSNMDITTQMRNGRCEAALLVFNAMPRRSPVSYNAMVSGYLANGKFDLAKDMF 121

Query: 124 EKMPDRDLISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARK 183
           EKMP+RDL+SWNVMLSGYV+N +L AARALF ++PEKDVVSWNAMLSG+AQNG+V+EAR 
Sbjct: 122 EKMPERDLVSWNVMLSGYVRNRDLGAARALFERMPEKDVVSWNAMLSGYAQNGYVDEART 181

Query: 184 IFDQMLVKNEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDA 243
           IF +M  KNEISWNGLL+AYVQNGR+EDA RLF+SK DWE VSWNCLMGG+V++KRL +A
Sbjct: 182 IFQRMPDKNEISWNGLLAAYVQNGRVEDACRLFESKADWEAVSWNCLMGGFVKQKRLVNA 241

Query: 244 RSLFDRMPVRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLD 303
           R LFDRMPVRD++SWN MITGYAQNG +SEARRLFEE PIRDVFAWT+M+SG+VQNGMLD
Sbjct: 242 RQLFDRMPVRDEVSWNTMITGYAQNGQMSEARRLFEECPIRDVFAWTSMLSGYVQNGMLD 301

Query: 304 EATRIFEEMPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGN 363
           EA  IF+EMPEKN VSWNAMIAGYVQS++++ A +LF+ MPSRN SSWNT++TGYAQ G+
Sbjct: 302 EARSIFDEMPEKNSVSWNAMIAGYVQSKRMDMATKLFEAMPSRNASSWNTILTGYAQSGD 361

Query: 364 IDQAKILFDEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSS 423
           I  AK +FD MP+RD ISWAA+I+GYAQ+G SEEAL LF++MKRDG  L RS+  CALS+
Sbjct: 362 IVCAKEIFDSMPRRDSISWAAIIAGYAQNGYSEEALQLFVEMKRDGERLTRSSFTCALST 421

Query: 424 CAEIAALELGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSW 483
           CAEIAALELGKQLHGR+ KAG++TG   GNALL MY KCGSIEEA+DVF+ I EKD+VSW
Sbjct: 422 CAEIAALELGKQLHGRMTKAGYETGCYVGNALLVMYCKCGSIEEAYDVFQGIAEKDVVSW 481

Query: 484 NTMIAGYARHGFGKEALALFESMK-MTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQ 543
           NTMI GYARHGFG +AL +F+SMK + IKPDDVT+VGVLSACSHTGLVD+G EYF SM Q
Sbjct: 482 NTMIYGYARHGFGLKALKVFDSMKAVGIKPDDVTMVGVLSACSHTGLVDRGTEYFYSMNQ 541

Query: 544 NYGITANAKHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGE 603
           +YGIT N+KHYTCMIDLLGRAGRL+EA NLM+ M F PDAA WGALLGASRIHG+T+LGE
Sbjct: 542 DYGITENSKHYTCMIDLLGRAGRLEEAQNLMRDMSFEPDAAMWGALLGASRIHGNTKLGE 601

Query: 604 KAAEKVFEMEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKT 663
           KAA  +FEMEP+N+GMYVLLSNLYAASGRW +V +MR KMRDKGV+KVPGYSWV +QNKT
Sbjct: 602 KAARIIFEMEPENAGMYVLLSNLYAASGRWGDVDKMRLKMRDKGVRKVPGYSWVEVQNKT 661

Query: 664 HIFTVGDCSHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLA 723
           H F+VGD  HP+ ++IYA+LEELDL++K +G+VSSTKLVLHDVEEEEKEHMLKYHSEKLA
Sbjct: 662 HTFSVGDTIHPDKDKIYAFLEELDLKMKLEGYVSSTKLVLHDVEEEEKEHMLKYHSEKLA 721

Query: 724 VAFGILSIPPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCG 777
           VAFGILSIP GRP+RVIKNLRVCEDCHNAIK+IS+I  R II+RDS+RFHHF+ G+CSCG
Sbjct: 722 VAFGILSIPAGRPVRVIKNLRVCEDCHNAIKYISRIVARTIILRDSHRFHHFTGGNCSCG 781

BLAST of CSPI01G04250 vs. NCBI nr
Match: gi|703063824|ref|XP_010087050.1| (hypothetical protein L484_012294 [Morus notabilis])

HSP 1 Score: 1162.1 bits (3005), Expect = 0.0e+00
Identity = 565/775 (72.90%), Positives = 659/775 (85.03%), Query Frame = 1

Query: 4   RGNCRFSLFIGSSSRSIQTESITNKLRASSTSSPSKKTWTQKLE-SKNSDSTIVDSDIVK 63
           RG+ RF  F  S   S+QT++I  KL       PSKKT  +K    KN  S I DSDIV+
Sbjct: 2   RGSHRFRQFHSSCFCSLQTQTINGKL---PNPIPSKKTLIEKHNPKKNKKSNIADSDIVQ 61

Query: 64  WNRKISAYMRKGQCESALSVFNGMRRRSTVTYNAMISGYLSNNKFDCARKVFEKMPDRDL 123
           WN  I+++MR G C++AL VFN M RRS V+YNAMISGYL+N++FD AR +FE+MP+RDL
Sbjct: 62  WNMDITSHMRNGHCKAALRVFNDMSRRSVVSYNAMISGYLANDRFDLARDMFERMPERDL 121

Query: 124 ISWNVMLSGYVKNGNLSAARALFNQIPEKDVVSWNAMLSGFAQNGFVEEARKIFDQMLVK 183
           +SWNVMLSGYV+N  L AAR LF+++PE+DVVSWN+MLSG+AQ G+V+EA KIF+ M  K
Sbjct: 122 VSWNVMLSGYVRNRKLGAARMLFDRMPERDVVSWNSMLSGYAQYGYVDEAMKIFEMMPDK 181

Query: 184 NEISWNGLLSAYVQNGRIEDARRLFDSKMDWEIVSWNCLMGGYVRKKRLDDARSLFDRMP 243
           NEISWN LLSAYVQNGRI+DARRLF+SK DWE+VSWNCLMGGYVRKKRL DAR LFD+MP
Sbjct: 182 NEISWNSLLSAYVQNGRIDDARRLFESKADWEVVSWNCLMGGYVRKKRLVDARKLFDQMP 241

Query: 244 VRDKISWNIMITGYAQNGLLSEARRLFEELPIRDVFAWTAMVSGFVQNGMLDEATRIFEE 303
           +RD +SWN MIT YAQN  L+E+RRLFEE PIRDVFAWTAM+SG+VQ+GMLDEA RIF+E
Sbjct: 242 IRDAVSWNTMITCYAQNSELAESRRLFEESPIRDVFAWTAMMSGYVQHGMLDEARRIFDE 301

Query: 304 MPEKNEVSWNAMIAGYVQSQQIEKARELFDQMPSRNTSSWNTMVTGYAQCGNIDQAKILF 363
           MP KN VSWNA+IAGYV+ ++++ ARELF+ MP RN SSWNTM+T YAQ G+I QA+ +F
Sbjct: 302 MPVKNPVSWNAIIAGYVRCKRMDIARELFEVMPCRNVSSWNTMLTAYAQSGDIAQARFIF 361

Query: 364 DEMPQRDCISWAAMISGYAQSGQSEEALHLFIKMKRDGGILNRSALACALSSCAEIAALE 423
           D MPQRD ISWAA+I+GYAQ+G  EEAL LF++MK++G  L RS   CALS+CAEIAALE
Sbjct: 362 DRMPQRDSISWAAIIAGYAQNGYGEEALRLFMEMKKEGERLTRSCYTCALSTCAEIAALE 421

Query: 424 LGKQLHGRLVKAGFQTGYIAGNALLAMYGKCGSIEEAFDVFEDITEKDIVSWNTMIAGYA 483
           LGKQLHGRLVKAGF+TG   GNALL MY KCGSIEEA++VF+DI  KDIVSWNTMIAGYA
Sbjct: 422 LGKQLHGRLVKAGFETGCYVGNALLVMYSKCGSIEEAYNVFKDIEVKDIVSWNTMIAGYA 481

Query: 484 RHGFGKEALALFESMK-MTIKPDDVTLVGVLSACSHTGLVDKGMEYFNSMYQNYGITANA 543
           RHGFGKEAL +FESMK M I PDDVTLVGVLSACSHTGLV++G +YF SM Q+YGIT N+
Sbjct: 482 RHGFGKEALMIFESMKAMGIIPDDVTLVGVLSACSHTGLVERGKQYFYSMNQDYGITPNS 541

Query: 544 KHYTCMIDLLGRAGRLDEALNLMKSMPFYPDAATWGALLGASRIHGDTELGEKAAEKVFE 603
           KHYTCMIDLLGRAG LDEA +LM++MPF PDAATWGALLGASRIHG+TELGEKAA+ +FE
Sbjct: 542 KHYTCMIDLLGRAGCLDEAQDLMRNMPFEPDAATWGALLGASRIHGNTELGEKAAKIIFE 601

Query: 604 MEPDNSGMYVLLSNLYAASGRWREVREMRSKMRDKGVKKVPGYSWVGIQNKTHIFTVGDC 663
           +EP+N+GMYVLLSNLYAASGRW +VR+MR KMRD GVKKVPGYSWV +QNK H F+VGD 
Sbjct: 602 LEPENAGMYVLLSNLYAASGRWTDVRKMRLKMRDTGVKKVPGYSWVEVQNKVHTFSVGDS 661

Query: 664 SHPEAERIYAYLEELDLELKKDGFVSSTKLVLHDVEEEEKEHMLKYHSEKLAVAFGILSI 723
            HPE +RIYA+LEELDL++K++G+VSSTKLVLHDVEEEEKE+MLKYHSEKLAVAF ILS 
Sbjct: 662 VHPEKDRIYAFLEELDLKMKREGYVSSTKLVLHDVEEEEKENMLKYHSEKLAVAFAILST 721

Query: 724 PPGRPIRVIKNLRVCEDCHNAIKHISKITQRQIIVRDSNRFHHFSEGSCSCGDYW 777
           PPGRPIRV+KNLRVCEDCH+A K ISKI  R II+RDS RFHHFS GSCSCGDYW
Sbjct: 722 PPGRPIRVMKNLRVCEDCHSAFKIISKIVGRLIILRDSYRFHHFSGGSCSCGDYW 773
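The percentages in the HSP summary lines above are the matched counts divided by the alignment length (for example 569/783 for identity). As a minimal sketch, assuming the summary line keeps the "Identity = a/b (...), Positives = c/d (...)" layout shown in this record, they can be recomputed as follows; the function name `hsp_percentages` is hypothetical and is not part of any BLAST tool.

```python
import re

# Matches the two count/length pairs in an HSP summary line.
# "Pos\w*" tolerates spelling variants of the "Positives" label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


if __name__ == "__main__":
    summary = (
        "Identity = 569/783 (72.67%), "
        "Positives = 667/783 (85.19%), Query Frame = 1"
    )
    print(hsp_percentages(summary))
```

Run on the Malus domestica hit, this reproduces the 72.67% identity and 85.19% positives reported in the record.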

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PP301_ARATH	2.9e-307	66.71	Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN...
PPR25_ARATH	5.7e-186	44.83	Pentatricopeptide repeat-containing protein At1g09410 OS=Arabidopsis thaliana GN...
PPR84_ARATH	1.6e-180	44.43	Pentatricopeptide repeat-containing protein At1g56690, mitochondrial OS=Arabidop...
PP316_ARATH	6.4e-161	44.93	Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidop...
PPR57_ARATH	6.0e-151	39.36	Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana GN...

Match Name	E-value	Identity	Description
A0A0A0LV20_CUCSA	0.0e+00	99.74	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G024980 PE=4 SV=1
W9SFH3_9ROSA	0.0e+00	72.90	Uncharacterized protein OS=Morus notabilis GN=L484_000356 PE=4 SV=1
M5WCR9_PRUPE	0.0e+00	75.95	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002162mg PE=4 SV=1
A0A0D2T8Y5_GOSRA	0.0e+00	71.07	Uncharacterized protein OS=Gossypium raimondii GN=B456_008G289000 PE=4 SV=1
Q1SN04_MEDTR	0.0e+00	71.45	Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_8g106950 PE...

Match Name	E-value	Identity	Description
AT4G02750.1	1.7e-308	66.71	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G09410.1	3.2e-187	44.83	pentatricopeptide (PPR) repeat-containing protein
AT1G56690.1	9.1e-182	44.43	Pentatricopeptide repeat (PPR) superfamily protein
AT4G16835.1	3.6e-162	44.93	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G25360.1	3.4e-152	39.36	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity	Description
gi|778656470|ref|XP_004137551.2|	0.0e+00	99.74	PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Cucumis sativu...
gi|659106935|ref|XP_008453471.1|	0.0e+00	97.42	PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Cucumis melo]
gi|645269813|ref|XP_008240171.1|	0.0e+00	72.22	PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Prunus mume]
gi|657966297|ref|XP_008374833.1|	0.0e+00	72.67	PREDICTED: pentatricopeptide repeat-containing protein At4g02750 [Malus domestic...
gi|703063824|ref|XP_010087050.1|	0.0e+00	72.90	hypothetical protein L484_012294 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI01G04250.1	CSPI01G04250.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 216..242, score: 1.8E-6
					coord: 371..400, score: 1.0E-7
					coord: 154..182, score: 2.0E-8
					coord: 279..307, score: 1.2E-7
					coord: 341..370, score: 3.1E-7
					coord: 247..272, score: 3.8E-7
					coord: 609..638, score: 0.11
					coord: 67..89, score: 0.0042
					coord: 309..338, score: 3.5E-7
					coord: 123..152, score: 4.7E-6
					coord: 185..208, score: 1.
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 536..567, score: 2.
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 469..515, score: 3.5E-9
					coord: 90..121, score: 1.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 154..182, score: 3.9E-7
					coord: 472..498, score: 5.6E-6
					coord: 185..208, score: 5.1E-5
					coord: 444..472, score: 7.4E-4
					coord: 247..272, score: 2.5E-5
					coord: 92..121, score: 3.9E-7
					coord: 371..400, score: 1.0E-5
					coord: 341..370, score: 5.4E-5
					coord: 279..307, score: 4.0E-6
					coord: 544..567, score: 1.0E-4
					coord: 123..154, score: 3.4E-6
					coord: 216..245, score: 5.4E-7
					coord: 309..338, score: 3.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 606..640, score: 9.065
					coord: 214..244, score: 9.821
					coord: 338..372, score: 11.685
					coord: 187..213, score: 5.996
					coord: 404..438, score: 5.645
					coord: 540..570, score: 8.616
					coord: 470..500, score: 10.413
					coord: 276..310, score: 11.663
					coord: 311..337, score: 7.432
					coord: 504..534, score: 7.026
					coord: 90..124, score: 11.137
					coord: 373..403, score: 8.046
					coord: 439..469, score: 7.41
					coord: 59..89, score: 8.747
					coord: 152..186, score: 11.509
					coord: 245..275, score: 10.095
					coord: 125..151, score: 6
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 63..264, score: 2.4E-9
					coord: 561..628, score: 1.9E-11
					coord: 265..389, score: 1.9
IPR011990	Tetratricopeptide-like helical domain	unknown	SSF48452	TPR-like	coord: 218..365, score: 1.8E-5
					coord: 316..396, score: 2.52E-5
					coord: 512..629, score: 2.52E-5
					coord: 66..210, score: 6.7
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 242..647, score: 0.0
					coord: 15..213, score:
None	No IPR available	PANTHER	PTHR24015:SF424	SUBFAMILY NOT NAMED	coord: 15..213, score: 0.0
					coord: 242..647, score:
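
Each alignment entry in the InterPro table pairs a residue range (`coord: start..end`) with a score. As a rough sketch, assuming that pairing holds throughout the dump, the hits can be pulled out with a regular expression. The helper name `parse_hits` is hypothetical. Scores are kept as text rather than converted to numbers, because several values in this record are cut off (for example `1.`) and one is missing entirely.

```python
import re

# One hit: "coord: start..end" followed by "score:" and an optional value.
# The value group is optional so that an empty "score:" still yields a hit.
HIT = re.compile(
    r"coord:\s*(\d+)\.\.(\d+)\s*,?\s*score:[ \t]*"
    r"([0-9.]+(?:[Ee][+-]?\d+)?)?"
)


def parse_hits(text):
    """Return a list of (start, end, score_text_or_None) tuples."""
    hits = []
    for start, end, score in HIT.findall(text):
        hits.append((int(start), int(end), score or None))
    return hits


if __name__ == "__main__":
    # Sample taken from the PF01535 row of this record.
    sample = "coord: 216..242\nscore: 1.8E-6coord: 371..400\nscore: 1.0E-7"
    for start, end, score in parse_hits(sample):
        print(start, end, score)
```

The pattern accepts both a line break and a comma between the range and its score, so it works on the raw dump as well as on a one-hit-per-line layout.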

The following gene(s) are paralogous to this gene:

None