CSPI01G34460 (gene) Wild cucumber (PI 183967)

NameCSPI01G34460
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationChr1 : 29363979 .. 29366400 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTTGATGAACTCAACAAATCCTCGACAATTAGCATTGCATGGAGGGAGAGGTCCTGCAGTTTTTATACCTTTGATGCAGTTCTTGGCGTCTCTCCGAATTCTGAGGCCCCATGGATTTCTCCAGAAATTATGTTCCTTTCAACAGGGATCTTCAGCTTCTGCCTCCCTCGCATTTTTCTCCTCAACTCATTTTGATTCCATCTCTTCGCCGCACCATGATTTTTCTTCTTCTTCTTCGTTGCAGTCCCCTCTGAAAAAGATTTGTTCATTAGTTCTCGACACTTATTTACGTCAACCCCATCTGAGATTCTCTCCATCTAAGCTGAATCTTGATATGGATGCTGCCTCTTTGACTCATGAGCAAGCCATTTCTGCCGTTGCTTTGCTTGCTAGCGAGGAGGGTTCAATGGTCGCGCTGAGTTTCTTTTACTGGGCAGTTGGGTTCCCCAAATTCCGGTATTTCATGCGGCTCTACATAGTTTGTACGATGTCATTGGTTGGCAAATGTAATCTAGAGCGAGCTCATGAAGTTGTGGAGTGTATGGTAGGTGTTTTTGCAGAAATTGGGAAGTTGAAGGAGGCGGTGGATATGATCCTTGACATGAGAAATCAGGGACTTGTGTTGACCACCAGGGTAATGAATCGTATTATCCTGGTGGCTGCTGAAATGAAGCTGGTTGAATATGCAGGCAATGTGTTCGACGAAATGTCTGCAAGAGGTGTGTATCCTGATTCTTGCACTTATAAGTATATAATTGTAGGTTACTGTAGAAATGGTAATGTTTTGGAAGCAGATAGGTGGATATGTGAGATGATGGAGAGAGGTTTTGTGGTTGATAATGCCACACTGACTTTGATTATTACAGCATTTTGTGAAAAAAGTTTAGTAAACAGGGCAGTTTGGTTTTTCCATAAGGTTACTAAGATGGGTTTATCACCAAATTTGATAAACTATTCATCTATGATTAGTGGGTTGTGCAAGAGGGGTAGTGTTAAGCAAGCATTTGAGTTATTGGAAGAGATGGTTAAAAATGGATGGAAACCCAATGTGTATACCCACACATCATTAATTCACGGCCTTTGCAAGAAGGGATGGACAGAGAGAGCTTTTAGACTGTTTCTTAAACTTATTAGAAGTGATAATTACAAGCCAAATGTGCACACTTACACAGCCATGATAAGTGGGTACTGCAAAGAGGAGAAGTTGAGTAGAGCTGAAATGTTGTTTGAAAGAATGAAAGAACAGGGACTGGTTCCAAACACCAACACTTATACAACACTTATTGATGGGCACTGTAAGGCTGGGAATTTCAGTAAAGCGTATGAATTGATGGAGTTGATGTCTAATGAAGGTTTCTTCCCTAATACATGTACCTACAATTCAATTGTTGATGGTCTCTGCAAAAGAGGGAGAGCTGAAGAGGCTTTCAAGCTGTTAAATACAGGATTTCAAAATCAAATTGAAGCTGACGGTGTCACATACACCATTCTGATATCTGAGCAGTGTAAGCGAGCCGATATGAACCAAGCCCTTGTGTTTTTAAATAAGATGTTTAAAGTTGGTTTCCAGCCTGACATTCATTTATATACCACTTTGATTGCTGCATTCTGCAGGCAAAATATGATGAAGGATAGTGAAAAGTTGTTTGATGAAGTTATTAAGCTTGGGTTGGCTCCAACAAAGGAAACTTATACATCCATGATATGTGGCTATTGTAGGGAGAAAAGGGTTAGCTTAGCAGTCAAGTTTTTCCAGAAGATGAGTGACCATGGTTGTGCACCAGATAGCATTAGTTATGGTGCTTTAATCAGTGGCCTTTGTAAAGAGTCGAGGCTGGATGAGGCTCGCCAATTATATGATACCATGATAGACAAAGGGCTGTCTCCTTGCGAAGTTACACGGGTGACATTGACTTATGAGTATTGCAAAACAGAAGACTTTGCTTCGGCCATGGTTATATTGGAACGGCTGAACAAGAAGCTTTGGATACGCACAGTTCATACGCTAATAAGGAAGCTTTGTTGCGAGAAAAAAGTCGCCTTGGCAGCTCTGTTCTTTCATAAGTTACTGGATAAGGAGGTCAATGTCGACCGTGTGACTTTGGCTGCATTCAACACTGCCTGTATTGAAAGCAATAAGTATGCTCTTGTTTCGGACTTATCCGAGAGGATTTCAAAAGGTATCGGCTAACCTTCTAAATACTGGAATGATAAATGAATTTATCGTGTTGAAGAGAAGAGCAATGGAGGATAATGATCAAAGATAGATTTAGTTCTCTTGATTTGATTCCAAGCCAGCTGCATCTCAGGGATGTCTTCTAAGATGATAAAGCTTGGCCTGCACTTCAAGAACAGTGCAGGTAATTCAAGTTTTGATTCATAATCTCTCTTTTTTGAAGGAATATATAATCTCTTTTGTTA

mRNA sequence

ATGAACTCAACAAATCCTCGACAATTAGCATTGCATGGAGGGAGAGGTCCTGCAGTTTTTATACCTTTGATGCAGTTCTTGGCGTCTCTCCGAATTCTGAGGCCCCATGGATTTCTCCAGAAATTATGTTCCTTTCAACAGGGATCTTCAGCTTCTGCCTCCCTCGCATTTTTCTCCTCAACTCATTTTGATTCCATCTCTTCGCCGCACCATGATTTTTCTTCTTCTTCTTCGTTGCAGTCCCCTCTGAAAAAGATTTGTTCATTAGTTCTCGACACTTATTTACGTCAACCCCATCTGAGATTCTCTCCATCTAAGCTGAATCTTGATATGGATGCTGCCTCTTTGACTCATGAGCAAGCCATTTCTGCCGTTGCTTTGCTTGCTAGCGAGGAGGGTTCAATGGTCGCGCTGAGTTTCTTTTACTGGGCAGTTGGGTTCCCCAAATTCCGGTATTTCATGCGGCTCTACATAGTTTGTACGATGTCATTGGTTGGCAAATGTAATCTAGAGCGAGCTCATGAAGTTGTGGAGTGTATGGTAGGTGTTTTTGCAGAAATTGGGAAGTTGAAGGAGGCGGTGGATATGATCCTTGACATGAGAAATCAGGGACTTGTGTTGACCACCAGGGTAATGAATCGTATTATCCTGGTGGCTGCTGAAATGAAGCTGGTTGAATATGCAGGCAATGTGTTCGACGAAATGTCTGCAAGAGGTGTGTATCCTGATTCTTGCACTTATAAGTATATAATTGTAGGTTACTGTAGAAATGGTAATGTTTTGGAAGCAGATAGGTGGATATGTGAGATGATGGAGAGAGGTTTTGTGGTTGATAATGCCACACTGACTTTGATTATTACAGCATTTTGTGAAAAAAGTTTAGTAAACAGGGCAGTTTGGTTTTTCCATAAGGTTACTAAGATGGGTTTATCACCAAATTTGATAAACTATTCATCTATGATTAGTGGGTTGTGCAAGAGGGGTAGTGTTAAGCAAGCATTTGAGTTATTGGAAGAGATGGTTAAAAATGGATGGAAACCCAATGTGTATACCCACACATCATTAATTCACGGCCTTTGCAAGAAGGGATGGACAGAGAGAGCTTTTAGACTGTTTCTTAAACTTATTAGAAGTGATAATTACAAGCCAAATGTGCACACTTACACAGCCATGATAAGTGGGTACTGCAAAGAGGAGAAGTTGAGTAGAGCTGAAATGTTGTTTGAAAGAATGAAAGAACAGGGACTGGTTCCAAACACCAACACTTATACAACACTTATTGATGGGCACTGTAAGGCTGGGAATTTCAGTAAAGCGTATGAATTGATGGAGTTGATGTCTAATGAAGGTTTCTTCCCTAATACATGTACCTACAATTCAATTGTTGATGGTCTCTGCAAAAGAGGGAGAGCTGAAGAGGCTTTCAAGCTGTTAAATACAGGATTTCAAAATCAAATTGAAGCTGACGGTGTCACATACACCATTCTGATATCTGAGCAGTGTAAGCGAGCCGATATGAACCAAGCCCTTGTGTTTTTAAATAAGATGTTTAAAGTTGGTTTCCAGCCTGACATTCATTTATATACCACTTTGATTGCTGCATTCTGCAGGCAAAATATGATGAAGGATAGTGAAAAGTTGTTTGATGAAGTTATTAAGCTTGGGTTGGCTCCAACAAAGGAAACTTATACATCCATGATATGTGGCTATTGTAGGGAGAAAAGGGTTAGCTTAGCAGTCAAGTTTTTCCAGAAGATGAGTGACCATGGTTGTGCACCAGATAGCATTAGTTATGGTGCTTTAATCAGTGGCCTTTGTAAAGAGTCGAGGCTGGATGAGGCTCGCCAATTATATGATACCATGATAGACAAAGGGCTGTCTCCTTGCGAAGTTACACGGGTGACATTGACTTATGAGTATTGCAAAACAGAAGACTTTGCTTCGGCCATGGTTATATTGGAACGGCTGAACAAGAAGCTTTGGATACGCACAGTTCATACGCTAATAAGGAAGCTTTGTTGCGAGAAAAAAGTCGCCTTGGCAGCTCTGTTCTTTCATAAGTTACTGGATAAGGAGGTCAATGTCGACCGTGTGACTTTGGCTGCATTCAACACTGCCTGTATTGAAAGCAATAAGTATGCTCTTGTTTCGGACTTATCCGAGAGGATTTCAAAAGGTATCGGCTAA

Coding sequence (CDS)

ATGAACTCAACAAATCCTCGACAATTAGCATTGCATGGAGGGAGAGGTCCTGCAGTTTTTATACCTTTGATGCAGTTCTTGGCGTCTCTCCGAATTCTGAGGCCCCATGGATTTCTCCAGAAATTATGTTCCTTTCAACAGGGATCTTCAGCTTCTGCCTCCCTCGCATTTTTCTCCTCAACTCATTTTGATTCCATCTCTTCGCCGCACCATGATTTTTCTTCTTCTTCTTCGTTGCAGTCCCCTCTGAAAAAGATTTGTTCATTAGTTCTCGACACTTATTTACGTCAACCCCATCTGAGATTCTCTCCATCTAAGCTGAATCTTGATATGGATGCTGCCTCTTTGACTCATGAGCAAGCCATTTCTGCCGTTGCTTTGCTTGCTAGCGAGGAGGGTTCAATGGTCGCGCTGAGTTTCTTTTACTGGGCAGTTGGGTTCCCCAAATTCCGGTATTTCATGCGGCTCTACATAGTTTGTACGATGTCATTGGTTGGCAAATGTAATCTAGAGCGAGCTCATGAAGTTGTGGAGTGTATGGTAGGTGTTTTTGCAGAAATTGGGAAGTTGAAGGAGGCGGTGGATATGATCCTTGACATGAGAAATCAGGGACTTGTGTTGACCACCAGGGTAATGAATCGTATTATCCTGGTGGCTGCTGAAATGAAGCTGGTTGAATATGCAGGCAATGTGTTCGACGAAATGTCTGCAAGAGGTGTGTATCCTGATTCTTGCACTTATAAGTATATAATTGTAGGTTACTGTAGAAATGGTAATGTTTTGGAAGCAGATAGGTGGATATGTGAGATGATGGAGAGAGGTTTTGTGGTTGATAATGCCACACTGACTTTGATTATTACAGCATTTTGTGAAAAAAGTTTAGTAAACAGGGCAGTTTGGTTTTTCCATAAGGTTACTAAGATGGGTTTATCACCAAATTTGATAAACTATTCATCTATGATTAGTGGGTTGTGCAAGAGGGGTAGTGTTAAGCAAGCATTTGAGTTATTGGAAGAGATGGTTAAAAATGGATGGAAACCCAATGTGTATACCCACACATCATTAATTCACGGCCTTTGCAAGAAGGGATGGACAGAGAGAGCTTTTAGACTGTTTCTTAAACTTATTAGAAGTGATAATTACAAGCCAAATGTGCACACTTACACAGCCATGATAAGTGGGTACTGCAAAGAGGAGAAGTTGAGTAGAGCTGAAATGTTGTTTGAAAGAATGAAAGAACAGGGACTGGTTCCAAACACCAACACTTATACAACACTTATTGATGGGCACTGTAAGGCTGGGAATTTCAGTAAAGCGTATGAATTGATGGAGTTGATGTCTAATGAAGGTTTCTTCCCTAATACATGTACCTACAATTCAATTGTTGATGGTCTCTGCAAAAGAGGGAGAGCTGAAGAGGCTTTCAAGCTGTTAAATACAGGATTTCAAAATCAAATTGAAGCTGACGGTGTCACATACACCATTCTGATATCTGAGCAGTGTAAGCGAGCCGATATGAACCAAGCCCTTGTGTTTTTAAATAAGATGTTTAAAGTTGGTTTCCAGCCTGACATTCATTTATATACCACTTTGATTGCTGCATTCTGCAGGCAAAATATGATGAAGGATAGTGAAAAGTTGTTTGATGAAGTTATTAAGCTTGGGTTGGCTCCAACAAAGGAAACTTATACATCCATGATATGTGGCTATTGTAGGGAGAAAAGGGTTAGCTTAGCAGTCAAGTTTTTCCAGAAGATGAGTGACCATGGTTGTGCACCAGATAGCATTAGTTATGGTGCTTTAATCAGTGGCCTTTGTAAAGAGTCGAGGCTGGATGAGGCTCGCCAATTATATGATACCATGATAGACAAAGGGCTGTCTCCTTGCGAAGTTACACGGGTGACATTGACTTATGAGTATTGCAAAACAGAAGACTTTGCTTCGGCCATGGTTATATTGGAACGGCTGAACAAGAAGCTTTGGATACGCACAGTTCATACGCTAATAAGGAAGCTTTGTTGCGAGAAAAAAGTCGCCTTGGCAGCTCTGTTCTTTCATAAGTTACTGGATAAGGAGGTCAATGTCGACCGTGTGACTTTGGCTGCATTCAACACTGCCTGTATTGAAAGCAATAAGTATGCTCTTGTTTCGGACTTATCCGAGAGGATTTCAAAAGGTATCGGCTAA
BLAST of CSPI01G34460 vs. Swiss-Prot
Match: PP326_ARATH (Pentatricopeptide repeat-containing protein At4g19890 OS=Arabidopsis thaliana GN=At4g19890 PE=2 SV=1)

HSP 1 Score: 892.1 bits (2304), Expect = 4.0e-258
Identity = 438/678 (64.60%), Postives = 535/678 (78.91%), Query Frame = 1

Query: 54  SLAFF---SSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLD 113
           SL FF   SS H  S  S     SSS S Q  +K +CSLV  +YLRQ H+  SP ++NLD
Sbjct: 25  SLFFFRLISSDHESSDLSLPSSPSSSPS-QCLVKSVCSLVCTSYLRQNHVVSSPHRVNLD 84

Query: 114 MDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNL 173
            DA SLTHEQAI+ VA LASE GSMVAL FFYWAVGF KFR+FMRLY+V   SL+   NL
Sbjct: 85  FDANSLTHEQAITVVASLASESGSMVALCFFYWAVGFEKFRHFMRLYLVTADSLLANGNL 144

Query: 174 ERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGN 233
           ++AHEV+ CM+  F+EIG+L EAV M++DM+NQGL  ++  MN ++ +A E+ L+EYA N
Sbjct: 145 QKAHEVMRCMLRNFSEIGRLNEAVGMVMDMQNQGLTPSSITMNCVLEIAVELGLIEYAEN 204

Query: 234 VFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFC 293
           VFDEMS RGV PDS +YK +++G  R+G + EADRW+  M++RGF+ DNAT TLI+TA C
Sbjct: 205 VFDEMSVRGVVPDSSSYKLMVIGCFRDGKIQEADRWLTGMIQRGFIPDNATCTLILTALC 264

Query: 294 EKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVY 353
           E  LVNRA+W+F K+  +G  PNLIN++S+I GLCK+GS+KQAFE+LEEMV+NGWKPNVY
Sbjct: 265 ENGLVNRAIWYFRKMIDLGFKPNLINFTSLIDGLCKKGSIKQAFEMLEEMVRNGWKPNVY 324

Query: 354 THTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFER 413
           THT+LI GLCK+GWTE+AFRLFLKL+RSD YKPNVHTYT+MI GYCKE+KL+RAEMLF R
Sbjct: 325 THTALIDGLCKRGWTEKAFRLFLKLVRSDTYKPNVHTYTSMIGGYCKEDKLNRAEMLFSR 384

Query: 414 MKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGR 473
           MKEQGL PN NTYTTLI+GHCKAG+F +AYELM LM +EGF PN  TYN+ +D LCK+ R
Sbjct: 385 MKEQGLFPNVNTYTTLINGHCKAGSFGRAYELMNLMGDEGFMPNIYTYNAAIDSLCKKSR 444

Query: 474 AEEAFKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTT 533
           A EA++LLN  F   +EADGVTYTILI EQCK+ D+NQAL F  +M K GF+ D+ L   
Sbjct: 445 APEAYELLNKAFSCGLEADGVTYTILIQEQCKQNDINQALAFFCRMNKTGFEADMRLNNI 504

Query: 534 LIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHG 593
           LIAAFCRQ  MK+SE+LF  V+ LGL PTKETYTSMI  YC+E  + LA+K+F  M  HG
Sbjct: 505 LIAAFCRQKKMKESERLFQLVVSLGLIPTKETYTSMISCYCKEGDIDLALKYFHNMKRHG 564

Query: 594 CAPDSISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAM 653
           C PDS +YG+LISGLCK+S +DEA +LY+ MID+GLSP EVTRVTL YEYCK  D A+AM
Sbjct: 565 CVPDSFTYGSLISGLCKKSMVDEACKLYEAMIDRGLSPPEVTRVTLAYEYCKRNDSANAM 624

Query: 654 VILERLNKKLWIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIES 713
           ++LE L+KKLWIRTV TL+RKLC EKKV +AALFF KLL+K+ + DRVTLAAF TAC ES
Sbjct: 625 ILLEPLDKKLWIRTVRTLVRKLCSEKKVGVAALFFQKLLEKDSSADRVTLAAFTTACSES 684

Query: 714 NKYALVSDLSERISKGIG 729
            K  LV+DL+ERIS+G+G
Sbjct: 685 GKNNLVTDLTERISRGVG 701

BLAST of CSPI01G34460 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 6.8e-72
Identity = 147/480 (30.63%), Postives = 266/480 (55.42%), Query Frame = 1

Query: 176 VVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKL-VEYAGNVFDE 235
           V + +V  ++ +  + +A+ ++   +  G +      N ++      K  + +A NVF E
Sbjct: 136 VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKE 195

Query: 236 MSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSL 295
           M    V P+  TY  +I G+C  GN+  A     +M  +G + +  T   +I  +C+   
Sbjct: 196 MLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRK 255

Query: 296 VNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTS 355
           ++        +   GL PNLI+Y+ +I+GLC+ G +K+   +L EM + G+  +  T+ +
Sbjct: 256 IDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNT 315

Query: 356 LIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQ 415
           LI G CK+G   +A  +  +++R     P+V TYT++I   CK   ++RA    ++M+ +
Sbjct: 316 LIKGYCKEGNFHQALVMHAEMLRH-GLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVR 375

Query: 416 GLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEA 475
           GL PN  TYTTL+DG  + G  ++AY ++  M++ GF P+  TYN++++G C  G+ E+A
Sbjct: 376 GLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDA 435

Query: 476 FKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAA 535
             +L    +  +  D V+Y+ ++S  C+  D+++AL    +M + G +PD   Y++LI  
Sbjct: 436 IAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQG 495

Query: 536 FCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPD 595
           FC Q   K++  L++E++++GL P + TYT++I  YC E  +  A++   +M + G  PD
Sbjct: 496 FCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPD 555

Query: 596 SISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILE 655
            ++Y  LI+GL K+SR  EA++L   +  +   P +VT  TL  E C   +F S + +++
Sbjct: 556 VVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTL-IENCSNIEFKSVVSLIK 613

BLAST of CSPI01G34460 vs. Swiss-Prot
Match: PP445_ARATH (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 1.5e-71
Identity = 189/665 (28.42%), Postives = 318/665 (47.82%), Query Frame = 1

Query: 46  QQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPS 105
           +Q  S S  L        DS+S PH                    L + L +P+   SPS
Sbjct: 40  RQFCSVSPLLRNLPEEESDSMSVPHR-------------------LLSILSKPNWHKSPS 99

Query: 106 KLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLV 165
             ++ + A S +H  ++ ++ L         AL+F +W    P++++ +  Y      L+
Sbjct: 100 LKSM-VSAISPSHVSSLFSLDL-----DPKTALNFSHWISQNPRYKHSVYSYASLLTLLI 159

Query: 166 GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM-RNQGLVLTTRVM----NRIILVAA 225
               +    ++   M+     +G     +D+   M +++   L  +++    N ++   A
Sbjct: 160 NNGYVGVVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLA 219

Query: 226 EMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNA 285
              LV+    V+ EM    V P+  TY  ++ GYC+ GNV EA++++ +++E G   D  
Sbjct: 220 RFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFF 279

Query: 286 TLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEM 345
           T T +I  +C++  ++ A   F+++   G   N + Y+ +I GLC    + +A +L  +M
Sbjct: 280 TYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKM 339

Query: 346 VKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK 405
             +   P V T+T LI  LC       A  L +K +     KPN+HTYT +I   C + K
Sbjct: 340 KDDECFPTVRTYTVLIKSLCGSERKSEALNL-VKEMEETGIKPNIHTYTVLIDSLCSQCK 399

Query: 406 LSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNS 465
             +A  L  +M E+GL+PN  TY  LI+G+CK G    A +++ELM +    PNT TYN 
Sbjct: 400 FEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNE 459

Query: 466 IVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVG 525
           ++ G CK     +A  +LN   + ++  D VTY  LI  QC+  + + A   L+ M   G
Sbjct: 460 LIKGYCK-SNVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRG 519

Query: 526 FQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAV 585
             PD   YT++I + C+   ++++  LFD + + G+ P    YT++I GYC+  +V  A 
Sbjct: 520 LVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAH 579

Query: 586 KFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEY 645
              +KM    C P+S+++ ALI GLC + +L EA  L + M+  GL P   T   L +  
Sbjct: 580 LMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRL 639

Query: 646 CKTEDFASAMVILERL---NKKLWIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDR 703
            K  DF  A    +++     K    T  T I+  C E ++  A     K+ +  V+ D 
Sbjct: 640 LKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDL 677

BLAST of CSPI01G34460 vs. Swiss-Prot
Match: PP306_ARATH (Pentatricopeptide repeat-containing protein At4g11690 OS=Arabidopsis thaliana GN=At4g11690 PE=2 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.9e-69
Identity = 143/451 (31.71%), Postives = 237/451 (52.55%), Query Frame = 1

Query: 178 ECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSA 237
           E ++  + +   L  ++    +M + G V  +   N ++             + F+E  +
Sbjct: 98  EVIINSYVQSQSLNLSISYFNEMVDNGFVPGSNCFNYLLTFVVGSSSFNQWWSFFNENKS 157

Query: 238 RGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNR 297
           + V  D  ++  +I G C  G + ++   + E+ E GF  +    T +I   C+K  + +
Sbjct: 158 KVVL-DVYSFGILIKGCCEAGEIEKSFDLLIELTEFGFSPNVVIYTTLIDGCCKKGEIEK 217

Query: 298 AVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIH 357
           A   F ++ K+GL  N   Y+ +I+GL K G  KQ FE+ E+M ++G  PN+YT+  +++
Sbjct: 218 AKDLFFEMGKLGLVANERTYTVLINGLFKNGVKKQGFEMYEKMQEDGVFPNLYTYNCVMN 277

Query: 358 GLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLV 417
            LCK G T+ AF++F ++ R      N+ TY  +I G C+E KL+ A  + ++MK  G+ 
Sbjct: 278 QLCKDGRTKDAFQVFDEM-RERGVSCNIVTYNTLIGGLCREMKLNEANKVVDQMKSDGIN 337

Query: 418 PNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKL 477
           PN  TY TLIDG C  G   KA  L   + + G  P+  TYN +V G C++G    A K+
Sbjct: 338 PNLITYNTLIDGFCGVGKLGKALSLCRDLKSRGLSPSLVTYNILVSGFCRKGDTSGAAKM 397

Query: 478 LNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCR 537
           +    +  I+   VTYTILI    +  +M +A+     M ++G  PD+H Y+ LI  FC 
Sbjct: 398 VKEMEERGIKPSKVTYTILIDTFARSDNMEKAIQLRLSMEELGLVPDVHTYSVLIHGFCI 457

Query: 538 QNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSIS 597
           +  M ++ +LF  +++    P +  Y +MI GYC+E     A+K  ++M +   AP+  S
Sbjct: 458 KGQMNEASRLFKSMVEKNCEPNEVIYNTMILGYCKEGSSYRALKLLKEMEEKELAPNVAS 517

Query: 598 YGALISGLCKESRLDEARQLYDTMIDKGLSP 629
           Y  +I  LCKE +  EA +L + MID G+ P
Sbjct: 518 YRYMIEVLCKERKSKEAERLVEKMIDSGIDP 546

BLAST of CSPI01G34460 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 262.7 bits (670), Expect = 1.2e-68
Identity = 148/471 (31.42%), Postives = 234/471 (49.68%), Query Frame = 1

Query: 189 KLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYK 248
           K   A+ +  +    G+       N +I    ++  ++ A ++   M  +G  PD  +Y 
Sbjct: 226 KTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 285

Query: 249 YIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKM 308
            ++ GYCR G + +  + I  M  +G   ++     II   C    +  A   F ++ + 
Sbjct: 286 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 345

Query: 309 GLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERA 368
           G+ P+ + Y+++I G CKRG ++ A +   EM      P+V T+T++I G C+ G    A
Sbjct: 346 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 405

Query: 369 FRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLID 428
            +LF ++      +P+  T+T +I+GYCK   +  A  +   M + G  PN  TYTTLID
Sbjct: 406 GKLFHEMF-CKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLID 465

Query: 429 GHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEA 488
           G CK G+   A EL+  M   G  PN  TYNSIV+GLCK G  EEA KL+       + A
Sbjct: 466 GLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNA 525

Query: 489 DGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLF 548
           D VTYT L+   CK  +M++A   L +M   G QP I  +  L+  FC   M++D EKL 
Sbjct: 526 DTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLL 585

Query: 549 DEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKE 608
           + ++  G+AP   T+ S++  YC    +  A   ++ M   G  PD  +Y  L+ G CK 
Sbjct: 586 NWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKA 645

Query: 609 SRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKK 660
             + EA  L+  M  KG S    T   L   + K + F  A  + +++ ++
Sbjct: 646 RNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRRE 695

BLAST of CSPI01G34460 vs. TrEMBL
Match: A0A0A0LYL9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G701980 PE=4 SV=1)

HSP 1 Score: 1458.0 bits (3773), Expect = 0.0e+00
Identity = 725/728 (99.59%), Postives = 728/728 (100.00%), Query Frame = 1

Query: 1   MNSTNPRQLALHGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSS 60
           MNSTNPRQLAL+GGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSS
Sbjct: 1   MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSS 60

Query: 61  THFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ 120
           THFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ
Sbjct: 61  THFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ 120

Query: 121 AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECM 180
           AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECM
Sbjct: 121 AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECM 180

Query: 181 VGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGV 240
           VGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEM+LVEYAGNVFDEMSARGV
Sbjct: 181 VGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGV 240

Query: 241 YPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW 300
           YPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW
Sbjct: 241 YPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW 300

Query: 301 FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLC 360
           FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLC
Sbjct: 301 FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLC 360

Query: 361 KKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNT 420
           KKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNT
Sbjct: 361 KKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNT 420

Query: 421 NTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNT 480
           NTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNT
Sbjct: 421 NTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNT 480

Query: 481 GFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNM 540
           GFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNM
Sbjct: 481 GFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNM 540

Query: 541 MKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGA 600
           MKDSEKLFDEVIKLGLAPTKETYTSMICGYCREK+VSLAVKFFQKMSDHGCAPDSISYGA
Sbjct: 541 MKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKKVSLAVKFFQKMSDHGCAPDSISYGA 600

Query: 601 LISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKL 660
           LISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKL
Sbjct: 601 LISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKL 660

Query: 661 WIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLS 720
           WIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLS
Sbjct: 661 WIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLS 720

Query: 721 ERISKGIG 729
           ERISKGIG
Sbjct: 721 ERISKGIG 728

BLAST of CSPI01G34460 vs. TrEMBL
Match: M5WK57_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015022mg PE=4 SV=1)

HSP 1 Score: 944.9 bits (2441), Expect = 5.8e-272
Identity = 465/698 (66.62%), Postives = 554/698 (79.37%), Query Frame = 1

Query: 27  LASLRILR-PHGFLQKLCSFQQGSSASA----SLAFFSSTHFD--------SISSPHHDF 86
           + SLRILR  H   QKL S      +      SL   S TH+D        + ++     
Sbjct: 1   MVSLRILRRTHELQQKLLSPASNPISIFYTLFSLRTLSYTHYDDPYSTTTITTATSTTST 60

Query: 87  SSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEG 146
           SSSS  QS ++ IC+LV  +Y  Q HLR SP KLNLD++A SLT+EQAIS VA LA E G
Sbjct: 61  SSSSQSQSLVRTICALVCQSYSPQTHLRSSPPKLNLDLNADSLTNEQAISVVASLAEEAG 120

Query: 147 SMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEA 206
           SMVALSFFYWA+GFPKFRYFMRLYI C MSL G  NLERAHEVV CMV  FAEIG+LKEA
Sbjct: 121 SMVALSFFYWAIGFPKFRYFMRLYIFCAMSLFGNGNLERAHEVVHCMVRNFAEIGRLKEA 180

Query: 207 VDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVG 266
            DM+ +M+NQGL+L+TR +N ++ +A ++ LVEYA N+F+EM  RGV PDS +YK ++VG
Sbjct: 181 ADMVFEMQNQGLMLSTRTLNCVLGIACDLGLVEYAENLFEEMCVRGVSPDSLSYKSMVVG 240

Query: 267 YCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPN 326
           YCRN  VLE DRW+ +M+ERGFV+DN T TLII+ FCEKSL+ R          MG+ PN
Sbjct: 241 YCRNRRVLEVDRWLSKMLERGFVLDNVTFTLIISLFCEKSLMIR----------MGVKPN 300

Query: 327 LINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFL 386
           LIN++S+I GLC+RGS+KQAFE+LEEMV+ GWKPNVYTHT LI GLCKKGWTERAFRLFL
Sbjct: 301 LINFTSLIHGLCQRGSIKQAFEMLEEMVRKGWKPNVYTHTGLIDGLCKKGWTERAFRLFL 360

Query: 387 KLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKA 446
           KL+RSDNYKPNVHTYTAMI GYC+E+K+SRAEML  RMKEQGL+PNTNTYTTL+ GHCKA
Sbjct: 361 KLVRSDNYKPNVHTYTAMIRGYCEEDKMSRAEMLLSRMKEQGLIPNTNTYTTLVSGHCKA 420

Query: 447 GNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTY 506
           GNF +AYELM++M  EGF PN CTYN++ D LCK+GR +EA+KL+  GF+  +EAD VTY
Sbjct: 421 GNFDRAYELMDIMGKEGFAPNICTYNAVFDSLCKKGRVQEAYKLIKKGFRRGLEADRVTY 480

Query: 507 TILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIK 566
           TI ISE CKR D+N ALVF NKM KVG QPD+H YTTLIAAFCRQ  MK+SEK F+  ++
Sbjct: 481 TIFISEHCKRGDINGALVFFNKMLKVGLQPDMHSYTTLIAAFCRQKKMKESEKFFELSVR 540

Query: 567 LGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDE 626
           LG  PTKETYTSMICGYCR++ ++LA+KFF +M DHGCAPDS +YGALISGLCKE +L+E
Sbjct: 541 LGSIPTKETYTSMICGYCRDENIALAIKFFHRMGDHGCAPDSFTYGALISGLCKEEKLEE 600

Query: 627 ARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLC 686
           AR+LYDTM+DKGLSPCEVTR+TL Y+YCK +D A+AMV+LERL KKLWIRTV+TL+RKLC
Sbjct: 601 ARRLYDTMMDKGLSPCEVTRLTLAYKYCKKDDSAAAMVLLERLEKKLWIRTVNTLVRKLC 660

Query: 687 CEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESN 712
            EKKV +A LFFHKL+DK+ NVDRVTLAAF TAC ESN
Sbjct: 661 SEKKVGIATLFFHKLVDKDQNVDRVTLAAFKTACYESN 688

BLAST of CSPI01G34460 vs. TrEMBL
Match: K7LFT8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_09G242500 PE=4 SV=1)

HSP 1 Score: 933.3 bits (2411), Expect = 1.8e-268
Identity = 441/653 (67.53%), Postives = 539/653 (82.54%), Query Frame = 1

Query: 74  SSSSSLQSPLKKICSLVLDTYLRQ-PHLRFSPSKLNLDMDAASLTHEQAISAVALLASEE 133
           +S S +QS + ++CSLV D+Y     H RFSP  L+LD+D  SLTH+QA++ VA LAS+ 
Sbjct: 31  TSPSCVQSTVTRVCSLVYDSYHHHYNHARFSPPTLHLDVDPNSLTHDQAVTIVASLASDA 90

Query: 134 GSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKE 193
           GSMVALSFF WA+   KFR+F RLYI C  SL+   N E+AHEV++CMV  FAEIG++KE
Sbjct: 91  GSMVALSFFNWAIASSKFRHFTRLYIACAASLISNKNFEKAHEVMQCMVKSFAEIGRVKE 150

Query: 194 AVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIV 253
           A++M+++M NQGL  +T+ +N ++ +  EM LVEYA N+FDEM ARGV P+  +Y+ ++V
Sbjct: 151 AIEMVIEMHNQGLAPSTKTLNWVVKIVTEMGLVEYAENLFDEMCARGVQPNCVSYRVMVV 210

Query: 254 GYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSP 313
           GYC+ GNVLE+DRW+  M+ERGFVVDNATL+LI+  FCEK  V RA+W+F +  +MGL P
Sbjct: 211 GYCKLGNVLESDRWLGGMIERGFVVDNATLSLIVREFCEKGFVTRALWYFRRFCEMGLRP 270

Query: 314 NLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLF 373
           NLIN++ MI GLCKRGSVKQAFE+LEEMV  GWKPNVYTHT+LI GLCKKGWTE+AFRLF
Sbjct: 271 NLINFTCMIEGLCKRGSVKQAFEMLEEMVGRGWKPNVYTHTALIDGLCKKGWTEKAFRLF 330

Query: 374 LKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCK 433
           LKL+RS+N+KPNV TYTAMISGYC++EK++RAEML  RMKEQGL PNTNTYTTLIDGHCK
Sbjct: 331 LKLVRSENHKPNVLTYTAMISGYCRDEKMNRAEMLLSRMKEQGLAPNTNTYTTLIDGHCK 390

Query: 434 AGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVT 493
           AGNF +AYELM +M+ EGF PN CTYN+IVDGLCK+GR +EA+K+L +GF+N ++AD VT
Sbjct: 391 AGNFERAYELMNVMNEEGFSPNVCTYNAIVDGLCKKGRVQEAYKVLKSGFRNGLDADKVT 450

Query: 494 YTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVI 553
           YTILISE CK+A++ QALV  NKM K G QPDIH YTTLIA FCR+  MK+SE  F+E +
Sbjct: 451 YTILISEHCKQAEIKQALVLFNKMVKSGIQPDIHSYTTLIAVFCREKRMKESEMFFEEAV 510

Query: 554 KLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLD 613
           + GL PT +TYTSMICGYCRE  + LA+KFF +MSDHGCA DSI+YGALISGLCK+S+LD
Sbjct: 511 RFGLVPTNKTYTSMICGYCREGNLRLALKFFHRMSDHGCASDSITYGALISGLCKQSKLD 570

Query: 614 EARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKL 673
           EAR LYD MI+KGL+PCEVTRVTL YEYCK +D  SAMV+LERL KKLW+RTV+TL+RKL
Sbjct: 571 EARCLYDAMIEKGLTPCEVTRVTLAYEYCKIDDGCSAMVVLERLEKKLWVRTVNTLVRKL 630

Query: 674 CCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISK 726
           C E+KV +AALFFHKLLDK+ NV+RVT+AAF TAC ESNKY LVSDLS RI K
Sbjct: 631 CSERKVGMAALFFHKLLDKDPNVNRVTIAAFMTACYESNKYDLVSDLSARIYK 683

BLAST of CSPI01G34460 vs. TrEMBL
Match: W9RA33_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008796 PE=4 SV=1)

HSP 1 Score: 932.9 bits (2410), Expect = 2.3e-268
Identity = 463/723 (64.04%), Postives = 567/723 (78.42%), Query Frame = 1

Query: 27  LASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSP-------HHDF------ 86
           + S   LR HG        QQ  SA +SL  F  T   S+ SP       +HD+      
Sbjct: 1   MLSTLFLRSHGV-----GLQQKLSAISSLNSFLYTSLHSLFSPKTFSSNSYHDYLSAGPS 60

Query: 87  ------SSSSSL---QSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISA 146
                 SSSSSL   QS ++ +CSLV ++Y +  H R SP KL L++D  SLTHEQAI+ 
Sbjct: 61  SSSSSSSSSSSLSSSQSLIRTVCSLVFESYYQHGHGRQSPPKLILNVDTDSLTHEQAITV 120

Query: 147 VALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVF 206
           VA LA E GSMVALSFFYWA+ F KFR+FMRLYIVC MSL+G  NLERAHEV++CM+G F
Sbjct: 121 VASLADEGGSMVALSFFYWAIEFSKFRHFMRLYIVCAMSLIGNGNLERAHEVMQCMLGSF 180

Query: 207 AEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDS 266
           AEIG+LKEA DMILD++NQGL+LTT ++N ++ +A EM  +EYA  +F+EM  R V PD 
Sbjct: 181 AEIGRLKEAGDMILDLQNQGLMLTTHILNSVVRIAWEMNSIEYAEEMFEEMCQREVSPDP 240

Query: 267 CTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHK 326
            +YK ++VGYCR G VLEAD+W+ EM+++GF VDNATLTLII+ FC+K   N A+WFF+K
Sbjct: 241 SSYKSMVVGYCRIGRVLEADKWLSEMLDKGFAVDNATLTLIISTFCKKGFANHALWFFNK 300

Query: 327 VTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGW 386
           +  MGLSPNLINY+S+I+GLC+RGSVK+ FE+LEEMV  GW+PNVYTHT+LI GLCKKGW
Sbjct: 301 MIGMGLSPNLINYTSLINGLCRRGSVKKGFEMLEEMVSKGWRPNVYTHTALIDGLCKKGW 360

Query: 387 TERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYT 446
           TE+AFRLFLKL+RSDNYKPNVHTYT+MISGYC+EEK++RAEMLF +MKEQGLVPNTNTYT
Sbjct: 361 TEKAFRLFLKLVRSDNYKPNVHTYTSMISGYCREEKMNRAEMLFSKMKEQGLVPNTNTYT 420

Query: 447 TLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQN 506
           TLIDGHCKAGNF  AY+LM+ M  +GF PN  TYN ++DGL K+GR  +A KL+     +
Sbjct: 421 TLIDGHCKAGNFKTAYQLMDSMRVDGFAPNIYTYNVVMDGLLKKGRIPDAHKLMKKASWD 480

Query: 507 QIEADGVTYTILISEQCKRADMNQ--ALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMK 566
            + +D VTYTILISE CK+ +     AL+  NKM KVG QPDIHLYT+LIA FCRQ  M 
Sbjct: 481 GVRSDIVTYTILISEHCKKGETTDTGALMLFNKMVKVGIQPDIHLYTSLIAFFCRQKRMA 540

Query: 567 DSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALI 626
           +SE+ F++ I+ GL PTKETYTSMICGYCR++ V++A KFF++M+ HGC PDSI+YGALI
Sbjct: 541 ESERFFEDAIRYGLEPTKETYTSMICGYCRDENVAMASKFFRRMTGHGCIPDSIAYGALI 600

Query: 627 SGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWI 686
           SGLCK+ RLD+AR+LYDTM+DKGLSPCEVTRVTL YEYCK E+F++AM ILERL+K+LWI
Sbjct: 601 SGLCKDERLDDARRLYDTMVDKGLSPCEVTRVTLAYEYCKKENFSAAMAILERLDKRLWI 660

Query: 687 RTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSER 726
           RTV+TLIRKLC  KKV +AALFFH+L+ K+ NVDRVTLAAF TAC ESNKYALVS+L+ER
Sbjct: 661 RTVNTLIRKLCNNKKVGMAALFFHELVGKDRNVDRVTLAAFTTACYESNKYALVSELTER 718

BLAST of CSPI01G34460 vs. TrEMBL
Match: A0A072TVK4_MEDTR (PPR containing plant-like protein OS=Medicago truncatula GN=MTR_7g405940 PE=4 SV=1)

HSP 1 Score: 924.5 bits (2388), Expect = 8.2e-266
Identity = 435/643 (67.65%), Postives = 532/643 (82.74%), Query Frame = 1

Query: 83  LKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFY 142
           ++++CSLV ++Y +  H+R S  +L+  ++   LTHEQA+S VA LAS+ GSMVALSFF+
Sbjct: 2   VQRVCSLVCESYNQHAHMRVSSQRLHFGIEVDFLTHEQAVSVVASLASDAGSMVALSFFH 61

Query: 143 WAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRN 202
           WA+G+PKFR+FMRLYIVC  SL+G  N E+A EV+ CMV  F+E+G+LKEAV+M+++M N
Sbjct: 62  WAIGYPKFRHFMRLYIVCATSLIGNRNSEKACEVMRCMVENFSEVGRLKEAVEMVIEMHN 121

Query: 203 QGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLE 262
           QGLV  TR +N II V +EM LVEYA  +F+EM  RGV PDS +Y+ ++V YC+ GN+LE
Sbjct: 122 QGLVPNTRTLNLIIKVTSEMGLVEYAELLFEEMCVRGVQPDSVSYRVMVVMYCKIGNILE 181

Query: 263 ADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMIS 322
           ADRW+  M+ERGFVVDNAT TLII+ FCEK    RA+W+F ++  MGL PNLIN++ MI 
Sbjct: 182 ADRWLSAMLERGFVVDNATFTLIISRFCEKGYATRALWYFRRLVDMGLEPNLINFTCMIE 241

Query: 323 GLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYK 382
           GLCKRGS+KQAFE+LEEMV  GWKPNVYTHTSLI GLCKKGWTE+AFRLFLKL+RS+N+K
Sbjct: 242 GLCKRGSIKQAFEMLEEMVGKGWKPNVYTHTSLIDGLCKKGWTEKAFRLFLKLVRSENHK 301

Query: 383 PNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYEL 442
           PNV TYTAMISGYC+E+KL+RAEML  RMKEQGLVPNTNTYTTLIDGHCKAGNF +AY+L
Sbjct: 302 PNVLTYTAMISGYCREDKLNRAEMLLSRMKEQGLVPNTNTYTTLIDGHCKAGNFERAYDL 361

Query: 443 MELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCK 502
           M LMS+EGF PN CTYN+IV+GLCKRGR +EA+K+L  GFQN ++ D  TY IL+SE CK
Sbjct: 362 MNLMSSEGFSPNVCTYNAIVNGLCKRGRVQEAYKMLEDGFQNGLKPDRFTYNILMSEHCK 421

Query: 503 RADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKET 562
           +A++ QALV  NKM K G QPDIH YTTLIA FCR+N MK+SE  F+E +++G+ PT  T
Sbjct: 422 QANIRQALVLFNKMVKSGIQPDIHSYTTLIAVFCRENRMKESEMFFEEAVRIGIIPTNRT 481

Query: 563 YTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMI 622
           YTSMICGYCRE  ++LA+KFF ++SDHGCAPDSI+YGA+ISGLCK+S+LDEAR LYD+MI
Sbjct: 482 YTSMICGYCREGNLTLAMKFFHRLSDHGCAPDSITYGAIISGLCKQSKLDEARGLYDSMI 541

Query: 623 DKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAA 682
           +KGL PCEVTR+TL YEYCK +D  SAMVILERL KKLWIRT  TL+RKLC EKKV +AA
Sbjct: 542 EKGLVPCEVTRITLAYEYCKVDDCLSAMVILERLEKKLWIRTATTLVRKLCSEKKVGMAA 601

Query: 683 LFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISK 726
           LFF+KLLD +++V RV LAAF TAC E+N YALVSDLS RI K
Sbjct: 602 LFFNKLLDMDLHVYRVILAAFMTACYETNNYALVSDLSARIHK 644

BLAST of CSPI01G34460 vs. TAIR10
Match: AT4G19890.1 (AT4G19890.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 892.1 bits (2304), Expect = 2.3e-259
Identity = 438/678 (64.60%), Postives = 535/678 (78.91%), Query Frame = 1

Query: 54  SLAFF---SSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLD 113
           SL FF   SS H  S  S     SSS S Q  +K +CSLV  +YLRQ H+  SP ++NLD
Sbjct: 25  SLFFFRLISSDHESSDLSLPSSPSSSPS-QCLVKSVCSLVCTSYLRQNHVVSSPHRVNLD 84

Query: 114 MDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNL 173
            DA SLTHEQAI+ VA LASE GSMVAL FFYWAVGF KFR+FMRLY+V   SL+   NL
Sbjct: 85  FDANSLTHEQAITVVASLASESGSMVALCFFYWAVGFEKFRHFMRLYLVTADSLLANGNL 144

Query: 174 ERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGN 233
           ++AHEV+ CM+  F+EIG+L EAV M++DM+NQGL  ++  MN ++ +A E+ L+EYA N
Sbjct: 145 QKAHEVMRCMLRNFSEIGRLNEAVGMVMDMQNQGLTPSSITMNCVLEIAVELGLIEYAEN 204

Query: 234 VFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFC 293
           VFDEMS RGV PDS +YK +++G  R+G + EADRW+  M++RGF+ DNAT TLI+TA C
Sbjct: 205 VFDEMSVRGVVPDSSSYKLMVIGCFRDGKIQEADRWLTGMIQRGFIPDNATCTLILTALC 264

Query: 294 EKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVY 353
           E  LVNRA+W+F K+  +G  PNLIN++S+I GLCK+GS+KQAFE+LEEMV+NGWKPNVY
Sbjct: 265 ENGLVNRAIWYFRKMIDLGFKPNLINFTSLIDGLCKKGSIKQAFEMLEEMVRNGWKPNVY 324

Query: 354 THTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFER 413
           THT+LI GLCK+GWTE+AFRLFLKL+RSD YKPNVHTYT+MI GYCKE+KL+RAEMLF R
Sbjct: 325 THTALIDGLCKRGWTEKAFRLFLKLVRSDTYKPNVHTYTSMIGGYCKEDKLNRAEMLFSR 384

Query: 414 MKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGR 473
           MKEQGL PN NTYTTLI+GHCKAG+F +AYELM LM +EGF PN  TYN+ +D LCK+ R
Sbjct: 385 MKEQGLFPNVNTYTTLINGHCKAGSFGRAYELMNLMGDEGFMPNIYTYNAAIDSLCKKSR 444

Query: 474 AEEAFKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTT 533
           A EA++LLN  F   +EADGVTYTILI EQCK+ D+NQAL F  +M K GF+ D+ L   
Sbjct: 445 APEAYELLNKAFSCGLEADGVTYTILIQEQCKQNDINQALAFFCRMNKTGFEADMRLNNI 504

Query: 534 LIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHG 593
           LIAAFCRQ  MK+SE+LF  V+ LGL PTKETYTSMI  YC+E  + LA+K+F  M  HG
Sbjct: 505 LIAAFCRQKKMKESERLFQLVVSLGLIPTKETYTSMISCYCKEGDIDLALKYFHNMKRHG 564

Query: 594 CAPDSISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAM 653
           C PDS +YG+LISGLCK+S +DEA +LY+ MID+GLSP EVTRVTL YEYCK  D A+AM
Sbjct: 565 CVPDSFTYGSLISGLCKKSMVDEACKLYEAMIDRGLSPPEVTRVTLAYEYCKRNDSANAM 624

Query: 654 VILERLNKKLWIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIES 713
           ++LE L+KKLWIRTV TL+RKLC EKKV +AALFF KLL+K+ + DRVTLAAF TAC ES
Sbjct: 625 ILLEPLDKKLWIRTVRTLVRKLCSEKKVGVAALFFQKLLEKDSSADRVTLAAFTTACSES 684

Query: 714 NKYALVSDLSERISKGIG 729
            K  LV+DL+ERIS+G+G
Sbjct: 685 GKNNLVTDLTERISRGVG 701

BLAST of CSPI01G34460 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 273.5 bits (698), Expect = 3.8e-73
Identity = 147/480 (30.63%), Postives = 266/480 (55.42%), Query Frame = 1

Query: 176 VVECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKL-VEYAGNVFDE 235
           V + +V  ++ +  + +A+ ++   +  G +      N ++      K  + +A NVF E
Sbjct: 136 VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKE 195

Query: 236 MSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSL 295
           M    V P+  TY  +I G+C  GN+  A     +M  +G + +  T   +I  +C+   
Sbjct: 196 MLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRK 255

Query: 296 VNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTS 355
           ++        +   GL PNLI+Y+ +I+GLC+ G +K+   +L EM + G+  +  T+ +
Sbjct: 256 IDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNT 315

Query: 356 LIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQ 415
           LI G CK+G   +A  +  +++R     P+V TYT++I   CK   ++RA    ++M+ +
Sbjct: 316 LIKGYCKEGNFHQALVMHAEMLRH-GLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVR 375

Query: 416 GLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEA 475
           GL PN  TYTTL+DG  + G  ++AY ++  M++ GF P+  TYN++++G C  G+ E+A
Sbjct: 376 GLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDA 435

Query: 476 FKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAA 535
             +L    +  +  D V+Y+ ++S  C+  D+++AL    +M + G +PD   Y++LI  
Sbjct: 436 IAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQG 495

Query: 536 FCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPD 595
           FC Q   K++  L++E++++GL P + TYT++I  YC E  +  A++   +M + G  PD
Sbjct: 496 FCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPD 555

Query: 596 SISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILE 655
            ++Y  LI+GL K+SR  EA++L   +  +   P +VT  TL  E C   +F S + +++
Sbjct: 556 VVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTL-IENCSNIEFKSVVSLIK 613

BLAST of CSPI01G34460 vs. TAIR10
Match: AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 272.3 bits (695), Expect = 8.5e-73
Identity = 189/665 (28.42%), Postives = 318/665 (47.82%), Query Frame = 1

Query: 46  QQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPS 105
           +Q  S S  L        DS+S PH                    L + L +P+   SPS
Sbjct: 40  RQFCSVSPLLRNLPEEESDSMSVPHR-------------------LLSILSKPNWHKSPS 99

Query: 106 KLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLV 165
             ++ + A S +H  ++ ++ L         AL+F +W    P++++ +  Y      L+
Sbjct: 100 LKSM-VSAISPSHVSSLFSLDL-----DPKTALNFSHWISQNPRYKHSVYSYASLLTLLI 159

Query: 166 GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDM-RNQGLVLTTRVM----NRIILVAA 225
               +    ++   M+     +G     +D+   M +++   L  +++    N ++   A
Sbjct: 160 NNGYVGVVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLA 219

Query: 226 EMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNA 285
              LV+    V+ EM    V P+  TY  ++ GYC+ GNV EA++++ +++E G   D  
Sbjct: 220 RFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFF 279

Query: 286 TLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEM 345
           T T +I  +C++  ++ A   F+++   G   N + Y+ +I GLC    + +A +L  +M
Sbjct: 280 TYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKM 339

Query: 346 VKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEK 405
             +   P V T+T LI  LC       A  L +K +     KPN+HTYT +I   C + K
Sbjct: 340 KDDECFPTVRTYTVLIKSLCGSERKSEALNL-VKEMEETGIKPNIHTYTVLIDSLCSQCK 399

Query: 406 LSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNS 465
             +A  L  +M E+GL+PN  TY  LI+G+CK G    A +++ELM +    PNT TYN 
Sbjct: 400 FEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNE 459

Query: 466 IVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVG 525
           ++ G CK     +A  +LN   + ++  D VTY  LI  QC+  + + A   L+ M   G
Sbjct: 460 LIKGYCK-SNVHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRG 519

Query: 526 FQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAV 585
             PD   YT++I + C+   ++++  LFD + + G+ P    YT++I GYC+  +V  A 
Sbjct: 520 LVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAH 579

Query: 586 KFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEY 645
              +KM    C P+S+++ ALI GLC + +L EA  L + M+  GL P   T   L +  
Sbjct: 580 LMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRL 639

Query: 646 CKTEDFASAMVILERL---NKKLWIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDR 703
            K  DF  A    +++     K    T  T I+  C E ++  A     K+ +  V+ D 
Sbjct: 640 LKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDL 677

BLAST of CSPI01G34460 vs. TAIR10
Match: AT4G11690.1 (AT4G11690.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 265.4 bits (677), Expect = 1.0e-70
Identity = 143/451 (31.71%), Postives = 237/451 (52.55%), Query Frame = 1

Query: 178 ECMVGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSA 237
           E ++  + +   L  ++    +M + G V  +   N ++             + F+E  +
Sbjct: 98  EVIINSYVQSQSLNLSISYFNEMVDNGFVPGSNCFNYLLTFVVGSSSFNQWWSFFNENKS 157

Query: 238 RGVYPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNR 297
           + V  D  ++  +I G C  G + ++   + E+ E GF  +    T +I   C+K  + +
Sbjct: 158 KVVL-DVYSFGILIKGCCEAGEIEKSFDLLIELTEFGFSPNVVIYTTLIDGCCKKGEIEK 217

Query: 298 AVWFFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIH 357
           A   F ++ K+GL  N   Y+ +I+GL K G  KQ FE+ E+M ++G  PN+YT+  +++
Sbjct: 218 AKDLFFEMGKLGLVANERTYTVLINGLFKNGVKKQGFEMYEKMQEDGVFPNLYTYNCVMN 277

Query: 358 GLCKKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLV 417
            LCK G T+ AF++F ++ R      N+ TY  +I G C+E KL+ A  + ++MK  G+ 
Sbjct: 278 QLCKDGRTKDAFQVFDEM-RERGVSCNIVTYNTLIGGLCREMKLNEANKVVDQMKSDGIN 337

Query: 418 PNTNTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKL 477
           PN  TY TLIDG C  G   KA  L   + + G  P+  TYN +V G C++G    A K+
Sbjct: 338 PNLITYNTLIDGFCGVGKLGKALSLCRDLKSRGLSPSLVTYNILVSGFCRKGDTSGAAKM 397

Query: 478 LNTGFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCR 537
           +    +  I+   VTYTILI    +  +M +A+     M ++G  PD+H Y+ LI  FC 
Sbjct: 398 VKEMEERGIKPSKVTYTILIDTFARSDNMEKAIQLRLSMEELGLVPDVHTYSVLIHGFCI 457

Query: 538 QNMMKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSIS 597
           +  M ++ +LF  +++    P +  Y +MI GYC+E     A+K  ++M +   AP+  S
Sbjct: 458 KGQMNEASRLFKSMVEKNCEPNEVIYNTMILGYCKEGSSYRALKLLKEMEEKELAPNVAS 517

Query: 598 YGALISGLCKESRLDEARQLYDTMIDKGLSP 629
           Y  +I  LCKE +  EA +L + MID G+ P
Sbjct: 518 YRYMIEVLCKERKSKEAERLVEKMIDSGIDP 546

BLAST of CSPI01G34460 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 262.7 bits (670), Expect = 6.8e-70
Identity = 148/471 (31.42%), Postives = 234/471 (49.68%), Query Frame = 1

Query: 189 KLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYK 248
           K   A+ +  +    G+       N +I    ++  ++ A ++   M  +G  PD  +Y 
Sbjct: 226 KTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 285

Query: 249 YIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKM 308
            ++ GYCR G + +  + I  M  +G   ++     II   C    +  A   F ++ + 
Sbjct: 286 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 345

Query: 309 GLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERA 368
           G+ P+ + Y+++I G CKRG ++ A +   EM      P+V T+T++I G C+ G    A
Sbjct: 346 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 405

Query: 369 FRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLID 428
            +LF ++      +P+  T+T +I+GYCK   +  A  +   M + G  PN  TYTTLID
Sbjct: 406 GKLFHEMF-CKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLID 465

Query: 429 GHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEA 488
           G CK G+   A EL+  M   G  PN  TYNSIV+GLCK G  EEA KL+       + A
Sbjct: 466 GLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNA 525

Query: 489 DGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLF 548
           D VTYT L+   CK  +M++A   L +M   G QP I  +  L+  FC   M++D EKL 
Sbjct: 526 DTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLL 585

Query: 549 DEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKE 608
           + ++  G+AP   T+ S++  YC    +  A   ++ M   G  PD  +Y  L+ G CK 
Sbjct: 586 NWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKA 645

Query: 609 SRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKK 660
             + EA  L+  M  KG S    T   L   + K + F  A  + +++ ++
Sbjct: 646 RNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRRE 695

BLAST of CSPI01G34460 vs. NCBI nr
Match: gi|700211777|gb|KGN66873.1| (hypothetical protein Csa_1G701980 [Cucumis sativus])

HSP 1 Score: 1458.0 bits (3773), Expect = 0.0e+00
Identity = 725/728 (99.59%), Postives = 728/728 (100.00%), Query Frame = 1

Query: 1   MNSTNPRQLALHGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSS 60
           MNSTNPRQLAL+GGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSS
Sbjct: 1   MNSTNPRQLALNGGRGPAVFIPLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSS 60

Query: 61  THFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ 120
           THFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ
Sbjct: 61  THFDSISSPHHDFSSSSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQ 120

Query: 121 AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECM 180
           AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECM
Sbjct: 121 AISAVALLASEEGSMVALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECM 180

Query: 181 VGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGV 240
           VGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEM+LVEYAGNVFDEMSARGV
Sbjct: 181 VGVFAEIGKLKEAVDMILDMRNQGLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGV 240

Query: 241 YPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW 300
           YPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW
Sbjct: 241 YPDSCTYKYIIVGYCRNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVW 300

Query: 301 FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLC 360
           FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLC
Sbjct: 301 FFHKVTKMGLSPNLINYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLC 360

Query: 361 KKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNT 420
           KKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNT
Sbjct: 361 KKGWTERAFRLFLKLIRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNT 420

Query: 421 NTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNT 480
           NTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNT
Sbjct: 421 NTYTTLIDGHCKAGNFSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNT 480

Query: 481 GFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNM 540
           GFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNM
Sbjct: 481 GFQNQIEADGVTYTILISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNM 540

Query: 541 MKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGA 600
           MKDSEKLFDEVIKLGLAPTKETYTSMICGYCREK+VSLAVKFFQKMSDHGCAPDSISYGA
Sbjct: 541 MKDSEKLFDEVIKLGLAPTKETYTSMICGYCREKKVSLAVKFFQKMSDHGCAPDSISYGA 600

Query: 601 LISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKL 660
           LISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKL
Sbjct: 601 LISGLCKESRLDEARQLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKL 660

Query: 661 WIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLS 720
           WIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLS
Sbjct: 661 WIRTVHTLIRKLCCEKKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLS 720

Query: 721 ERISKGIG 729
           ERISKGIG
Sbjct: 721 ERISKGIG 728

BLAST of CSPI01G34460 vs. NCBI nr
Match: gi|778664547|ref|XP_004145475.2| (PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Cucumis sativus])

HSP 1 Score: 1412.5 bits (3655), Expect = 0.0e+00
Identity = 703/705 (99.72%), Postives = 705/705 (100.00%), Query Frame = 1

Query: 24  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPL 83
           MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPL
Sbjct: 1   MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPL 60

Query: 84  KKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYW 143
           KKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYW
Sbjct: 61  KKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYW 120

Query: 144 AVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQ 203
           AVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQ
Sbjct: 121 AVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQ 180

Query: 204 GLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEA 263
           GLVLTTRVMNRIILVAAEM+LVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEA
Sbjct: 181 GLVLTTRVMNRIILVAAEMRLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEA 240

Query: 264 DRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG 323
           DRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
Sbjct: 241 DRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG 300

Query: 324 LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKP 383
           LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKP
Sbjct: 301 LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKP 360

Query: 384 NVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELM 443
           NVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELM
Sbjct: 361 NVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELM 420

Query: 444 ELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKR 503
           ELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKR
Sbjct: 421 ELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKR 480

Query: 504 ADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETY 563
           ADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETY
Sbjct: 481 ADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETY 540

Query: 564 TSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMID 623
           TSMICGYCREK+VSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMID
Sbjct: 541 TSMICGYCREKKVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMID 600

Query: 624 KGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAAL 683
           KGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAAL
Sbjct: 601 KGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAAL 660

Query: 684 FFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISKGIG 729
           FFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISKGIG
Sbjct: 661 FFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISKGIG 705

BLAST of CSPI01G34460 vs. NCBI nr
Match: gi|659118286|ref|XP_008459042.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Cucumis melo])

HSP 1 Score: 1337.4 bits (3460), Expect = 0.0e+00
Identity = 666/705 (94.47%), Postives = 681/705 (96.60%), Query Frame = 1

Query: 24  MQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDSISSPHHDFSSSSSLQSPL 83
           MQFLAS RILR HGFLQKLCS Q GSS SAS+AFFSSTHFDSISSPHHDF SSSSLQSP+
Sbjct: 1   MQFLASHRILRTHGFLQKLCSLQHGSSVSASIAFFSSTHFDSISSPHHDF-SSSSLQSPV 60

Query: 84  KKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVALSFFYW 143
           +K CSLVL+ YLRQPHLRFSPSKLNLDMDA SLTHEQAISAVA LASEEGSMVALSFFYW
Sbjct: 61  QKTCSLVLEAYLRQPHLRFSPSKLNLDMDADSLTHEQAISAVASLASEEGSMVALSFFYW 120

Query: 144 AVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQ 203
           A+GFPKFRYFMRLYIVCTMSL+GKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQ
Sbjct: 121 AIGFPKFRYFMRLYIVCTMSLIGKCNLERAHEVVECMVGVFAEIGKLKEAVDMILDMRNQ 180

Query: 204 GLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNGNVLEA 263
           GLVLTTRVMNRIILVAA M LVEYAGNVFDEMSARGVYPDSCTYK IIVGYCRNG+VLEA
Sbjct: 181 GLVLTTRVMNRIILVAAGMGLVEYAGNVFDEMSARGVYPDSCTYKSIIVGYCRNGDVLEA 240

Query: 264 DRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG 323
           DRWICEMMERGFVVDNATLTLII AFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG
Sbjct: 241 DRWICEMMERGFVVDNATLTLIIKAFCEKSLVNRAVWFFHKVTKMGLSPNLINYSSMISG 300

Query: 324 LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRSDNYKP 383
           LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL+RSDNYKP
Sbjct: 301 LCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLVRSDNYKP 360

Query: 384 NVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELM 443
           NVHTYTAMISGYCKE+KLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELM
Sbjct: 361 NVHTYTAMISGYCKEDKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSKAYELM 420

Query: 444 ELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILISEQCKR 503
           ELMSNEGFFPN CTYN+IVDGLCKRGRAEEAF+LL+ GFQNQIEADGVTYTILISEQCKR
Sbjct: 421 ELMSNEGFFPNICTYNAIVDGLCKRGRAEEAFELLSKGFQNQIEADGVTYTILISEQCKR 480

Query: 504 ADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAPTKETY 563
           ADMN ALVFLNKMFKVGFQPDIHLYTTLIAAFCRQ MMKDSEKLFDEV+KLGLAPTKETY
Sbjct: 481 ADMNHALVFLNKMFKVGFQPDIHLYTTLIAAFCRQRMMKDSEKLFDEVVKLGLAPTKETY 540

Query: 564 TSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLYDTMID 623
           TSMICGYCREK +SLAVKFFQKMSD GCAPDSISYGALISGLCKESRLDEARQLYDTMID
Sbjct: 541 TSMICGYCREKNISLAVKFFQKMSDQGCAPDSISYGALISGLCKESRLDEARQLYDTMID 600

Query: 624 KGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAAL 683
           KGLSPCEVTRV+L YEYCKTED ASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAAL
Sbjct: 601 KGLSPCEVTRVSLAYEYCKTEDCASAMVILERLNKKLWIRTVHTLIRKLCCEKKVALAAL 660

Query: 684 FFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISKGIG 729
           FFHKLLDKEVNVDRVTLAAF TAC ESNKYALVSDLSERISKGIG
Sbjct: 661 FFHKLLDKEVNVDRVTLAAFITACTESNKYALVSDLSERISKGIG 704

BLAST of CSPI01G34460 vs. NCBI nr
Match: gi|645238747|ref|XP_008225821.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Prunus mume])

HSP 1 Score: 991.9 bits (2563), Expect = 6.0e-286
Identity = 485/713 (68.02%), Postives = 573/713 (80.36%), Query Frame = 1

Query: 27  LASLRILRPHGFLQKLCSFQQGSSASASLAFFSS------THFDSISSPHH-----DFSS 86
           + SLRILR    LQ+       +  S     FS       TH+D   S          SS
Sbjct: 1   MVSLRILRRSHELQRKLLSPTSNPISLFYTLFSLRTVSSYTHYDDPYSTTTITTTTSSSS 60

Query: 87  SSSLQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSM 146
           SS  QS ++ IC+LV  +Y  Q H R SP KLNLD++  SLTHEQAIS VA LA E GSM
Sbjct: 61  SSQSQSLVRTICALVCQSYSPQTHPRSSPPKLNLDLNVDSLTHEQAISVVASLAEEAGSM 120

Query: 147 VALSFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVD 206
           VALSFFYWA+GFPKFRYFMRLYI C MSL G  NLERAHEVV CMV  FAEI +LKEA D
Sbjct: 121 VALSFFYWAIGFPKFRYFMRLYIFCAMSLFGNGNLERAHEVVHCMVRNFAEIERLKEAAD 180

Query: 207 MILDMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYC 266
           M+ +M+NQGL+L+TR +N ++ +A ++ LVEYA N+F+EM  RGV PDS +YK ++VGYC
Sbjct: 181 MVFEMQNQGLMLSTRTLNCVLGIACDLGLVEYAENLFEEMCVRGVSPDSLSYKSMVVGYC 240

Query: 267 RNGNVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLI 326
           R+  VLE DRW+ +M+ERGFV+DNAT TLI + FCEKSLV+RA W F K+ +MG+ PNLI
Sbjct: 241 RSSRVLEVDRWLSKMLERGFVLDNATFTLITSLFCEKSLVSRASWCFDKMIRMGVKPNLI 300

Query: 327 NYSSMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKL 386
           N++S+I GLC+RGS+KQAFE+LEEMV+ GWKPNVYTHT+LI GLCKKGWTERAFRLFLKL
Sbjct: 301 NFTSLIHGLCQRGSIKQAFEMLEEMVRKGWKPNVYTHTALIDGLCKKGWTERAFRLFLKL 360

Query: 387 IRSDNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGN 446
           +RSDNYKPNVHTYTAMI GYC+E+K+SRAEML  RMKEQGLVPNTNTYTTL+ GHCKAGN
Sbjct: 361 VRSDNYKPNVHTYTAMIRGYCEEDKMSRAEMLLSRMKEQGLVPNTNTYTTLVSGHCKAGN 420

Query: 447 FSKAYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTI 506
           F +AYELM++M  EGF PN CTYN++ D LCK+GR +EA+KL+  GF+  +EAD VTYTI
Sbjct: 421 FDRAYELMDIMGKEGFTPNICTYNAVFDSLCKKGRVQEAYKLIKKGFRRGLEADRVTYTI 480

Query: 507 LISEQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLG 566
            ISE CKR D+N ALVF NKM KVG QPD+H YTTLIAAFCRQ  MK+SEK F+  ++LG
Sbjct: 481 FISEHCKRGDINGALVFFNKMLKVGLQPDMHSYTTLIAAFCRQKKMKESEKFFELSLRLG 540

Query: 567 LAPTKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEAR 626
           L PTKETYTSMICGYCR++ ++LAVKFF +M DHGCAPDS +YGALISGLCKE +LDEAR
Sbjct: 541 LIPTKETYTSMICGYCRDENIALAVKFFHRMGDHGCAPDSFTYGALISGLCKEEKLDEAR 600

Query: 627 QLYDTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCE 686
           +LYDTM+DKGLSPCEVTR+TL Y+YCK +D A+AMV+LERL KKLWIRTV+TL+RKLC E
Sbjct: 601 RLYDTMMDKGLSPCEVTRLTLAYKYCKKDDSAAAMVLLERLEKKLWIRTVNTLVRKLCSE 660

Query: 687 KKVALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISKGIG 729
           KKV + ALFFHKL+DK+ NVDRVTLAAF TAC ESNKYALVSDL+ERISKGIG
Sbjct: 661 KKVGIGALFFHKLVDKDQNVDRVTLAAFKTACYESNKYALVSDLTERISKGIG 713

BLAST of CSPI01G34460 vs. NCBI nr
Match: gi|764554220|ref|XP_004293756.2| (PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Fragaria vesca subsp. vesca])

HSP 1 Score: 976.1 bits (2522), Expect = 3.4e-281
Identity = 475/710 (66.90%), Postives = 566/710 (79.72%), Query Frame = 1

Query: 22  PLMQFLASLRILRPHGFLQKLCSFQQGSSASASLAFFSSTHFDS---ISSPHHDFSSSSS 81
           P    + SLRILR HG   KL S      +        S++ DS    SS      S S 
Sbjct: 37  PPKTLMLSLRILRNHGLHHKLLSPTSTHISHPYTLRTLSSYTDSDEPSSSASTTSDSQSE 96

Query: 82  LQSPLKKICSLVLDTYLRQPHLRFSPSKLNLDMDAASLTHEQAISAVALLASEEGSMVAL 141
             S + +ICS+V  +Y  Q H + SP  LNLD++  SLTHE AIS VA LA E GSMVAL
Sbjct: 97  SHSLVTQICSMVYKSYSPQTHFKSSPPILNLDLNPDSLTHEHAISVVASLAGEAGSMVAL 156

Query: 142 SFFYWAVGFPKFRYFMRLYIVCTMSLVGKCNLERAHEVVECMVGVFAEIGKLKEAVDMIL 201
           SFFYWAVGF KFRYFMRLYI C MS+ G  NLER HEVV+CMV  FAEIG+ KEA DM+ 
Sbjct: 157 SFFYWAVGFTKFRYFMRLYIFCAMSIFGNGNLERTHEVVQCMVRSFAEIGRFKEAADMVF 216

Query: 202 DMRNQGLVLTTRVMNRIILVAAEMKLVEYAGNVFDEMSARGVYPDSCTYKYIIVGYCRNG 261
           DM+NQGLVL+TR +N ++ +A EM L+EYA NVFDEMS RGV PD  ++K ++VGYCR G
Sbjct: 217 DMQNQGLVLSTRTLNCVVGIACEMGLMEYAENVFDEMSVRGVCPDGLSFKCMVVGYCRKG 276

Query: 262 NVLEADRWICEMMERGFVVDNATLTLIITAFCEKSLVNRAVWFFHKVTKMGLSPNLINYS 321
            V+E DRW+  M+ERGFV+DNA+ TLI++ FCEK  V+RA W F K++KMG+ PNL+N++
Sbjct: 277 AVMEVDRWLSRMIERGFVLDNASFTLIVSVFCEKGFVSRASWCFDKMSKMGVKPNLVNFT 336

Query: 322 SMISGLCKRGSVKQAFELLEEMVKNGWKPNVYTHTSLIHGLCKKGWTERAFRLFLKLIRS 381
           S+I GLCKRGSVKQAFE+LEEMV+ GWKPNVYTHT+LI GLCKKGWTERAFRLFLKL+RS
Sbjct: 337 SLIHGLCKRGSVKQAFEMLEEMVRRGWKPNVYTHTALIDGLCKKGWTERAFRLFLKLVRS 396

Query: 382 DNYKPNVHTYTAMISGYCKEEKLSRAEMLFERMKEQGLVPNTNTYTTLIDGHCKAGNFSK 441
           DNYKPNVHTYTAMISGYCKEEK+SRAEML  RMKEQ LVPN  TYTTL+ GHCKAGNF K
Sbjct: 397 DNYKPNVHTYTAMISGYCKEEKMSRAEMLLSRMKEQELVPNAYTYTTLVYGHCKAGNFEK 456

Query: 442 AYELMELMSNEGFFPNTCTYNSIVDGLCKRGRAEEAFKLLNTGFQNQIEADGVTYTILIS 501
           AY+LM++MS EGF PN CTYN+++D LCK+ R +EA+KL+  GF+  ++AD VTYTI IS
Sbjct: 457 AYQLMDVMSEEGFAPNICTYNAVMDCLCKKERVQEAYKLIKKGFRRGLQADRVTYTIFIS 516

Query: 502 EQCKRADMNQALVFLNKMFKVGFQPDIHLYTTLIAAFCRQNMMKDSEKLFDEVIKLGLAP 561
           E CK+AD+  A  F NKM K G +PD+H YTTLIAAFCRQ  MK+SEKLF+  ++LGL P
Sbjct: 517 EHCKQADIKGAQAFFNKMVKAGLEPDMHSYTTLIAAFCRQKKMKESEKLFEVAVRLGLIP 576

Query: 562 TKETYTSMICGYCREKRVSLAVKFFQKMSDHGCAPDSISYGALISGLCKESRLDEARQLY 621
           TKETYTSMICGYCR+  + LAVKFF +MSDHGC+PDS +YGALISGLCKE +LDEAR+LY
Sbjct: 577 TKETYTSMICGYCRDGNIVLAVKFFHRMSDHGCSPDSFTYGALISGLCKEEKLDEARKLY 636

Query: 622 DTMIDKGLSPCEVTRVTLTYEYCKTEDFASAMVILERLNKKLWIRTVHTLIRKLCCEKKV 681
           DTM+DKGLSPCEVTR+TLT++YC+ +D+A+AMVIL+RL KK WIRTV+TL+RKLCCEKKV
Sbjct: 637 DTMMDKGLSPCEVTRLTLTHKYCQKDDYATAMVILDRLEKKYWIRTVNTLVRKLCCEKKV 696

Query: 682 ALAALFFHKLLDKEVNVDRVTLAAFNTACIESNKYALVSDLSERISKGIG 729
            +AALFFHKL+DK+ NVDRVTL AF TAC ESNKYAL+SDL+ERISKGIG
Sbjct: 697 GIAALFFHKLVDKDQNVDRVTLQAFTTACYESNKYALLSDLTERISKGIG 746

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP326_ARATH4.0e-25864.60Pentatricopeptide repeat-containing protein At4g19890 OS=Arabidopsis thaliana GN... [more]
PP407_ARATH6.8e-7230.63Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
PP445_ARATH1.5e-7128.42Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN... [more]
PP306_ARATH1.9e-6931.71Pentatricopeptide repeat-containing protein At4g11690 OS=Arabidopsis thaliana GN... [more]
PPR12_ARATH1.2e-6831.42Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LYL9_CUCSA0.0e+0099.59Uncharacterized protein OS=Cucumis sativus GN=Csa_1G701980 PE=4 SV=1[more]
M5WK57_PRUPE5.8e-27266.62Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa015022mg PE=4 S... [more]
K7LFT8_SOYBN1.8e-26867.53Uncharacterized protein OS=Glycine max GN=GLYMA_09G242500 PE=4 SV=1[more]
W9RA33_9ROSA2.3e-26864.04Uncharacterized protein OS=Morus notabilis GN=L484_008796 PE=4 SV=1[more]
A0A072TVK4_MEDTR8.2e-26667.65PPR containing plant-like protein OS=Medicago truncatula GN=MTR_7g405940 PE=4 SV... [more]
Match NameE-valueIdentityDescription
AT4G19890.12.3e-25964.60 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT5G39710.13.8e-7330.63 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65560.18.5e-7328.42 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G11690.11.0e-7031.71 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G05670.16.8e-7031.42 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|700211777|gb|KGN66873.1|0.0e+0099.59hypothetical protein Csa_1G701980 [Cucumis sativus][more]
gi|778664547|ref|XP_004145475.2|0.0e+0099.72PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Cucumis sativu... [more]
gi|659118286|ref|XP_008459042.1|0.0e+0094.47PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Cucumis melo][more]
gi|645238747|ref|XP_008225821.1|6.0e-28668.02PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Prunus mume][more]
gi|764554220|ref|XP_004293756.2|3.4e-28166.90PREDICTED: pentatricopeptide repeat-containing protein At4g19890 [Fragaria vesca... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G34460.1CSPI01G34460.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 179..205
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 520..550
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 453..502
score: 1.6E-12coord: 312..361
score: 3.9E-18coord: 383..432
score: 6.7E-20coord: 211..256
score: 1.8E-7coord: 562..607
score: 5.6
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 245..278
score: 1.4E-7coord: 350..385
score: 2.1E-6coord: 456..479
score: 2.8E-5coord: 491..525
score: 8.2E-4coord: 596..628
score: 2.4E-8coord: 527..559
score: 6.8E-6coord: 386..420
score: 7.8E-11coord: 562..595
score: 9.1E-11coord: 317..349
score: 5.0E-10coord: 422..455
score: 1.3
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 243..277
score: 10.709coord: 489..523
score: 10.019coord: 173..207
score: 7.059coord: 454..488
score: 9.964coord: 348..383
score: 10.874coord: 661..695
score: 5.229coord: 419..453
score: 12.726coord: 524..558
score: 11.663coord: 313..347
score: 13.318coord: 594..628
score: 13.537coord: 559..593
score: 12.529coord: 278..312
score: 9.153coord: 629..659
score: 5.064coord: 208..242
score: 7.377coord: 384..418
score: 1
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 325..484
score: 1.7E-6coord: 522..620
score: 1.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 178..670
score: 6.6E-275coord: 67..73
score: 6.6E-275coord: 23..37
score: 6.6E-275coord: 95..153
score: 6.6E
NoneNo IPR availablePANTHERPTHR24015:SF325SUBFAMILY NOT NAMEDcoord: 178..670
score: 6.6E-275coord: 67..73
score: 6.6E-275coord: 95..153
score: 6.6E-275coord: 23..37
score: 6.6E
NoneNo IPR availableunknownSSF81901HCP-likecoord: 360..591
score: 2.62