CSPI01G33810 (gene) Wild cucumber (PI 183967)

NameCSPI01G33810
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein
LocationChr1 : 28765217 .. 28767696 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GACGCGGAACACGCCCTTAATCCAATCGGGCTACATCGTGGGAGTTCTACGCTTTAAGCCATCCACAACAGTACAACACCGTGACTATCGAGCCCAATACTACCAATTCGTAGGCTTCAGTTCTTCCTTTTCTGGTTTGATGACGATCATACAAAACTTCTTGTGATTTCGTTCCCAAATGCTCCATTTCAAAAATTTCGCAACAGGGCTAATACGGTTCCGGTACATTTTTCTCAACCGCCATTTCAGTAACTCAAATTCTTTGGTGAATGGCTCCACCGCGCCTTCGAAAGACGACTATTTCGCTGCAATCCACCATATCTCCCACATTGTCCGCCGAGACTTCTACATGGAGCGCACTCTCAACAAGCTCCGAATCTCCAACCTCAATTCCGAGCTCGTTTTCAGAGTCCTTCGCGCTTGCTCCAACTCTGGTACCGAGTCCTTCCGTTTCTTCAACTGGGCTTGCTCTCACAACCCCTCTTACCAACCCACTACCCTTGAATTTGAAGAGCTCGTCAAAACCCTAGCTCGGACCAGAAAGTATACCACGATGTGGAAAGTTCTTCTTCAGATGAAGACTCAGAATCTCAAAATTTCACCGGAAACGATATCGTTCATAATTCAAGAGTATGGTAAGCAGGGCCTTGTAGATAATGCGGTTACCATATTCAATCAATGTTCCAAATCTATCGATTGTCCACAAACAGTTGAGGTCTATAACGCGTTACTTTTTGCGCTTTGTGAGGTTAAAATGTTTCATGGAGCTTATGCGTTGATTAGGAGGATGATTAGAAAAGGGGTAACTCCTGATAAAAAGACTTATGGAACTCTTGTAACTGGATGGTGCTCAGCGGGGAAGATGAAGGAAGCTCAGGAGTTCTTGGAGGAAATGAGCCAGAAGGGGTTCAATCCTCCTTTGCGAGGTCGTGATCTTTTGGTTGAAGGATTGCTTAATGCAGGGTATTTAGAATCTGCTAAGGATATGGTTAGAAAAATGACTAAAGAAGGATCTGTGCCTGATATAGGAACTTTTAATTCTCTGATTGATGTTATATGCAACTCTGGAGAAGTTGATTTTTGCATTAATATTTTTCATGAGGTGTGCAAGTTAGGGCTTTGTCCTGATATAAATACTTACAAGATTTTGATTCCAGCAACTTCGAAAGTAGGTAGGATTGATGAAGCATTCAGGCTTTTGCATTGCTGTATTGAGGATGGACACGTACCGTTTCCAAGTCTTTATGGACCAATACTTAAAGGAATGTGTAAAAGGGGTCAGTTCGATGATGCATTTTGCTTTTTTGGTGATATGAAACATAAGGGGCATCCACCAAATCGACCAGTGTACACAATGTTGATAACAATGTGTGGACGTGGAGGGAGATTTGTTGATGCTGCTAATTACTTGATGGAAATGGCTGAACTTGGTTTACCTCCAATTTCAAGGTGCTTTGATATGGTTACTGATGGATTGAAGAACTGTGGAAAACATGATTTAGCAAAGAAGATTGAACAGCTTGAAGTTTCTATTCGAGGCATTTGATCCTATAGTAAAGTAATAAGCAAGTACTTATTCTGCTTGAAGGATGTGGTTGAAATGAATATGAAAAACGAAAAACTACAGAAGTCGGAAAAAGGACAAACAAAGAAGCACTTCTCTTAGGAAATTCCTTCTAATCACTCTTACACGTGGAGAGAGTTCAGAGAAAGGAAGTCTCAAGTTGAGGTTATCATCCAGCGAACGATCAACATTTTCAGATGCACTGTATGGATTGCCTAGGATTGTGACTGAGGCTACTGCTATCTAGGTTTCATATACAGCTTCTAATGCTTAATTGCTAATATTATCTAACTGTTTAGTGCATGATTTGCTTGGTCATTTAATCCTGTCTCATGAAAACGTATATTTCCAATGTGATATGGCCATTAAAATATTTGCTTTGGGTGCTGCCAACTATTGAGCTATGAACAGTATCCATTGAGTTGATAAGACCACACGGTCGATTCTTACAGTTTTCCGCTTTGTTTTTCTAAAAGTTAGAGTTACTATTTGGCTTGAGAAAGCTGAGAGGTGATTTCTACAGGGAAGTGCTGAATTCTTTGAGTTCAAAGGTGAATCGTTTTCTTGCTTCAAGGTGGCGTTCTCTGTCAAAGGATTGCAGTAGCCTTTTCTTCAAATCTATATCAATTGGATTCTTATAATGTCTTCTGATTCCGGAAGGAAAGACGACTACTTCCAAGTCGGTTTGAATTGGGGAGAGATTGTGCTCTTATAAAAGGGGGTTAACTGTAGACTCTCCTTAGCTTTCTATATTCTCATGTATTTCAATCCTGCGATCATTCAATCTTCAATTATATGCTTGTTTTTTCTTTGGTGAGTGATGTAGATATGGTATATGTTGGATATTATATTAAACCGGATGTCCTGGCTTTAAGCTCTTGTTGTCTCTCAATTTAATAATGATTTTCACTTGTTT

mRNA sequence

ATGCTCCATTTCAAAAATTTCGCAACAGGGCTAATACGGTTCCGGTACATTTTTCTCAACCGCCATTTCAGTAACTCAAATTCTTTGGTGAATGGCTCCACCGCGCCTTCGAAAGACGACTATTTCGCTGCAATCCACCATATCTCCCACATTGTCCGCCGAGACTTCTACATGGAGCGCACTCTCAACAAGCTCCGAATCTCCAACCTCAATTCCGAGCTCGTTTTCAGAGTCCTTCGCGCTTGCTCCAACTCTGGTACCGAGTCCTTCCGTTTCTTCAACTGGGCTTGCTCTCACAACCCCTCTTACCAACCCACTACCCTTGAATTTGAAGAGCTCGTCAAAACCCTAGCTCGGACCAGAAAGTATACCACGATGTGGAAAGTTCTTCTTCAGATGAAGACTCAGAATCTCAAAATTTCACCGGAAACGATATCGTTCATAATTCAAGAGTATGGTAAGCAGGGCCTTGTAGATAATGCGGTTACCATATTCAATCAATGTTCCAAATCTATCGATTGTCCACAAACAGTTGAGGTCTATAACGCGTTACTTTTTGCGCTTTGTGAGGTTAAAATGTTTCATGGAGCTTATGCGTTGATTAGGAGGATGATTAGAAAAGGGGTAACTCCTGATAAAAAGACTTATGGAACTCTTGTAACTGGATGGTGCTCAGCGGGGAAGATGAAGGAAGCTCAGGAGTTCTTGGAGGAAATGAGCCAGAAGGGGTTCAATCCTCCTTTGCGAGGTCGTGATCTTTTGGTTGAAGGATTGCTTAATGCAGGGTATTTAGAATCTGCTAAGGATATGGTTAGAAAAATGACTAAAGAAGGATCTGTGCCTGATATAGGAACTTTTAATTCTCTGATTGATGTTATATGCAACTCTGGAGAAGTTGATTTTTGCATTAATATTTTTCATGAGGTGTGCAAGTTAGGGCTTTGTCCTGATATAAATACTTACAAGATTTTGATTCCAGCAACTTCGAAAGTAGGTAGGATTGATGAAGCATTCAGGCTTTTGCATTGCTGTATTGAGGATGGACACGTACCGTTTCCAAGTCTTTATGGACCAATACTTAAAGGAATGTGTAAAAGGGGTCAGTTCGATGATGCATTTTGCTTTTTTGGTGATATGAAACATAAGGGGCATCCACCAAATCGACCAGTGTACACAATGTTGATAACAATGTGTGGACGTGGAGGGAGATTTGTTGATGCTGCTAATTACTTGATGGAAATGGCTGAACTTGGTTTACCTCCAATTTCAAGGTGCTTTGATATGGTTACTGATGGATTGAAGAACTGTGGAAAACATGATTTAGCAAAGAAGATTGAACAGCTTGAAGTTTCTATTCGAGGCATTTGA

Coding sequence (CDS)

ATGCTCCATTTCAAAAATTTCGCAACAGGGCTAATACGGTTCCGGTACATTTTTCTCAACCGCCATTTCAGTAACTCAAATTCTTTGGTGAATGGCTCCACCGCGCCTTCGAAAGACGACTATTTCGCTGCAATCCACCATATCTCCCACATTGTCCGCCGAGACTTCTACATGGAGCGCACTCTCAACAAGCTCCGAATCTCCAACCTCAATTCCGAGCTCGTTTTCAGAGTCCTTCGCGCTTGCTCCAACTCTGGTACCGAGTCCTTCCGTTTCTTCAACTGGGCTTGCTCTCACAACCCCTCTTACCAACCCACTACCCTTGAATTTGAAGAGCTCGTCAAAACCCTAGCTCGGACCAGAAAGTATACCACGATGTGGAAAGTTCTTCTTCAGATGAAGACTCAGAATCTCAAAATTTCACCGGAAACGATATCGTTCATAATTCAAGAGTATGGTAAGCAGGGCCTTGTAGATAATGCGGTTACCATATTCAATCAATGTTCCAAATCTATCGATTGTCCACAAACAGTTGAGGTCTATAACGCGTTACTTTTTGCGCTTTGTGAGGTTAAAATGTTTCATGGAGCTTATGCGTTGATTAGGAGGATGATTAGAAAAGGGGTAACTCCTGATAAAAAGACTTATGGAACTCTTGTAACTGGATGGTGCTCAGCGGGGAAGATGAAGGAAGCTCAGGAGTTCTTGGAGGAAATGAGCCAGAAGGGGTTCAATCCTCCTTTGCGAGGTCGTGATCTTTTGGTTGAAGGATTGCTTAATGCAGGGTATTTAGAATCTGCTAAGGATATGGTTAGAAAAATGACTAAAGAAGGATCTGTGCCTGATATAGGAACTTTTAATTCTCTGATTGATGTTATATGCAACTCTGGAGAAGTTGATTTTTGCATTAATATTTTTCATGAGGTGTGCAAGTTAGGGCTTTGTCCTGATATAAATACTTACAAGATTTTGATTCCAGCAACTTCGAAAGTAGGTAGGATTGATGAAGCATTCAGGCTTTTGCATTGCTGTATTGAGGATGGACACGTACCGTTTCCAAGTCTTTATGGACCAATACTTAAAGGAATGTGTAAAAGGGGTCAGTTCGATGATGCATTTTGCTTTTTTGGTGATATGAAACATAAGGGGCATCCACCAAATCGACCAGTGTACACAATGTTGATAACAATGTGTGGACGTGGAGGGAGATTTGTTGATGCTGCTAATTACTTGATGGAAATGGCTGAACTTGGTTTACCTCCAATTTCAAGGTGCTTTGATATGGTTACTGATGGATTGAAGAACTGTGGAAAACATGATTTAGCAAAGAAGATTGAACAGCTTGAAGTTTCTATTCGAGGCATTTGA
BLAST of CSPI01G33810 vs. Swiss-Prot
Match: PP391_ARATH (Pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Arabidopsis thaliana GN=At5g18390 PE=2 SV=2)

HSP 1 Score: 634.8 bits (1636), Expect = 7.3e-181
Identity = 302/435 (69.43%), Postives = 358/435 (82.30%), Query Frame = 1

Query: 21  RHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLR 80
           RHF++   L +  + P+K DYFAAI+H+ +IVRR+ + ER+LN LR+  + SE VFRVLR
Sbjct: 27  RHFNSLEPLQSSDSTPTKGDYFAAINHVVNIVRREIHPERSLNSLRLP-VTSEFVFRVLR 86

Query: 81  ACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKI 140
           A S S  +S RFFNWA S NPSY PT++E+EEL K+LA  +KY +MWK+L QMK  +L I
Sbjct: 87  ATSRSSNDSLRFFNWARS-NPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDI 146

Query: 141 SPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYAL 200
           S ET+ FII++YGK G VD AV +FN   K++ C QTV+VYN+LL ALC+VKMFHGAYAL
Sbjct: 147 SGETLCFIIEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYAL 206

Query: 201 IRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLN 260
           IRRMIRKG+ PDK+TY  LV GWCSAGKMKEAQEFL+EMS++GFNPP RGRDLL+EGLLN
Sbjct: 207 IRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLLN 266

Query: 261 AGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINT 320
           AGYLESAK+MV KMTK G VPDI TFN LI+ I  SGEV+FCI +++  CKLGLC DI+T
Sbjct: 267 AGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDT 326

Query: 321 YKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMK 380
           YK LIPA SK+G+IDEAFRLL+ C+EDGH PFPSLY PI+KGMC+ G FDDAF FF DMK
Sbjct: 327 YKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMK 386

Query: 381 HKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHD 440
            K HPPNRPVYTMLITMCGRGG+FVDAANYL+EM E+GL PISRCFDMVTDGLKN GKHD
Sbjct: 387 VKAHPPNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNGGKHD 446

Query: 441 LAKKIEQLEVSIRGI 456
           LA +IEQLEV +RG+
Sbjct: 447 LAMRIEQLEVQLRGV 459

BLAST of CSPI01G33810 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 6.6e-41
Identity = 114/413 (27.60%), Postives = 199/413 (48.18%), Query Frame = 1

Query: 31  NGSTAPSKDDYFAAIHHISHIVRRDFY-----MERTLNKLRISNLNSELVFRVLRACSNS 90
           N  T  SK D FA+    S+ + R F+     +E  LN+  +  L   L+ RVL  C ++
Sbjct: 68  NDRTKNSKYDEFASDVEKSYRILRKFHSRVPKLELALNESGVE-LRPGLIERVLNRCGDA 127

Query: 91  GTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLK-ISPET 150
           G   +RFF WA    P Y  +   ++ +VK L++ R++  +W ++ +M+ +N + I PE 
Sbjct: 128 GNLGYRFFVWAAKQ-PRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPEL 187

Query: 151 ISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRM 210
              ++Q +    +V  A+ + ++  K    P    V+  LL ALC+      A  L   M
Sbjct: 188 FVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEY-VFGCLLDALCKHGSVKDAAKLFEDM 247

Query: 211 IRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYL 270
            R     + + + +L+ GWC  GKM EA+  L +M++ GF P +     L+ G  NAG +
Sbjct: 248 -RMRFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKM 307

Query: 271 ESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKIL 330
             A D++R M + G  P+   +  LI  +C    ++  + +F E+ +     D+ TY  L
Sbjct: 308 ADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTAL 367

Query: 331 IPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGH 390
           +    K G+ID+ + +L   I+ G +P    Y  I+    K+  F++       M+   +
Sbjct: 368 VSGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEY 427

Query: 391 PPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCG 438
            P+  +Y ++I +  + G   +A     EM E GL P    F ++ +GL + G
Sbjct: 428 HPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQG 476

BLAST of CSPI01G33810 vs. Swiss-Prot
Match: PP213_ARATH (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 1.3e-36
Identity = 89/318 (27.99%), Postives = 150/318 (47.17%), Query Frame = 1

Query: 104 QPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVT 163
           QP    +  L+    +  +     +VL +M++++      T + +I     +G +D A+ 
Sbjct: 155 QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALK 214

Query: 164 IFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGW 223
           + NQ   S +C  TV  Y  L+ A         A  L+  M+ +G+ PD  TY T++ G 
Sbjct: 215 VLNQLL-SDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM 274

Query: 224 CSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDI 283
           C  G +  A E +  +  KG  P +   ++L+  LLN G  E  + ++ KM  E   P++
Sbjct: 275 CKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNV 334

Query: 284 GTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHC 343
            T++ LI  +C  G+++  +N+   + + GL PD  +Y  LI A  + GR+D A   L  
Sbjct: 335 VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 394

Query: 344 CIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGR 403
            I DG +P    Y  +L  +CK G+ D A   FG +   G  PN   Y  + +     G 
Sbjct: 395 MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 454

Query: 404 FVDAANYLMEMAELGLPP 422
            + A + ++EM   G+ P
Sbjct: 455 KIRALHMILEMMSNGIDP 471

BLAST of CSPI01G33810 vs. Swiss-Prot
Match: PPR96_ARATH (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 155.2 bits (391), Expect = 1.7e-36
Identity = 94/338 (27.81%), Postives = 164/338 (48.52%), Query Frame = 1

Query: 100 NPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVD 159
           N   +P  + +  L++ L    +++   ++L  M  + +  +  T S +I  + K+G + 
Sbjct: 283 NKGIRPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLV 342

Query: 160 NAVTIFNQCSK-SIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGT 219
            A  ++++  K SID    +  Y++L+   C       A  +   MI K   P+  TY T
Sbjct: 343 EAEKLYDEMIKRSID--PDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNT 402

Query: 220 LVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEG 279
           L+ G+C A +++E  E   EMSQ+G        + L++GL  AG  + A+ + +KM  +G
Sbjct: 403 LIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDG 462

Query: 280 SVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAF 339
             PDI T++ L+D +C  G+++  + +F  + K  + PDI TY I+I    K G++++ +
Sbjct: 463 VPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGW 522

Query: 340 RLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMC 399
            L       G  P   +Y  ++ G C++G  ++A   F +MK  G  PN   Y  LI   
Sbjct: 523 DLFCSLSLKGVKPNVIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRAR 582

Query: 400 GRGGRFVDAANYLMEMAELGL----PPISRCFDMVTDG 433
            R G    +A  + EM   G       IS   +M+ DG
Sbjct: 583 LRDGDKAASAELIKEMRSCGFVGDASTISMVINMLHDG 618

BLAST of CSPI01G33810 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 2.2e-36
Identity = 94/344 (27.33%), Postives = 164/344 (47.67%), Query Frame = 1

Query: 105 PTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGL---VDNA 164
           P  + +  L+    + RK    +K+L  M  + L+ +  + + +I    ++G    V   
Sbjct: 238 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 297

Query: 165 VTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVT 224
           +T  N+   S+D       YN L+   C+   FH A  +   M+R G+TP   TY +L+ 
Sbjct: 298 LTEMNRRGYSLD----EVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIH 357

Query: 225 GWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVP 284
             C AG M  A EFL++M  +G  P  R    LV+G    GY+  A  ++R+M   G  P
Sbjct: 358 SMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSP 417

Query: 285 DIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 344
            + T+N+LI+  C +G+++  I +  ++ + GL PD+ +Y  ++    +   +DEA R+ 
Sbjct: 418 SVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVK 477

Query: 345 HCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRG 404
              +E G  P    Y  +++G C++ +  +A   + +M   G PP+   YT LI      
Sbjct: 478 REMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCME 537

Query: 405 GRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHDLAKKI 446
           G    A     EM E G+ P    + ++ +GL    +   AK++
Sbjct: 538 GDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 577

BLAST of CSPI01G33810 vs. TrEMBL
Match: A0A0A0M105_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G695420 PE=4 SV=1)

HSP 1 Score: 945.3 bits (2442), Expect = 2.8e-272
Identity = 455/455 (100.00%), Postives = 455/455 (100.00%), Query Frame = 1

Query: 1   MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER 60
           MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER
Sbjct: 1   MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER 60

Query: 61  TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART 120
           TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART
Sbjct: 61  TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART 120

Query: 121 RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV 180
           RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV
Sbjct: 121 RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV 180

Query: 181 YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS 240
           YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS
Sbjct: 181 YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS 240

Query: 241 QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD 300
           QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD
Sbjct: 241 QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD 300

Query: 301 FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL 360
           FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL
Sbjct: 301 FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL 360

Query: 361 KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP 420
           KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP
Sbjct: 361 KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP 420

Query: 421 PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI 456
           PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI
Sbjct: 421 PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI 455

BLAST of CSPI01G33810 vs. TrEMBL
Match: W9RHW0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024459 PE=4 SV=1)

HSP 1 Score: 690.3 bits (1780), Expect = 1.6e-195
Identity = 323/435 (74.25%), Postives = 374/435 (85.98%), Query Frame = 1

Query: 22  HFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLRA 81
           H+ NS        + SKD+YFAAIHHIS+IV+RDFYMERTLNKLRI+ ++S+LVFRVLRA
Sbjct: 41  HYQNSTK-----PSSSKDNYFAAIHHISNIVQRDFYMERTLNKLRIAAVDSDLVFRVLRA 100

Query: 82  CSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQN-LKI 141
           C   G ES RFFNWA SH PSY+PT++E EEL K LART+KY +MWK+L QMKT N L I
Sbjct: 101 CHKFGPESLRFFNWARSHQPSYRPTSVELEELAKNLARTKKYESMWKILQQMKTNNNLII 160

Query: 142 SPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYAL 201
           S ET+ FII+EYGKQGLVD A  +FN+  K  +C QTVEVYN+LLFALCEVK+FHGAYAL
Sbjct: 161 SSETLCFIIEEYGKQGLVDQAAEVFNRVPKIFNCSQTVEVYNSLLFALCEVKLFHGAYAL 220

Query: 202 IRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLN 261
           +RRMIRK V PDK+TY  LV  WCSAGKM+EAQ FL EMS+KGFNPP+RGRDLL+EGLLN
Sbjct: 221 VRRMIRKEVVPDKRTYSILVNAWCSAGKMREAQNFLSEMSKKGFNPPVRGRDLLIEGLLN 280

Query: 262 AGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINT 321
           AGY+ESAK+MVRKM KEG +PD+ TFNSL++VIC S EV+FCI+++H+VC LGLCPDINT
Sbjct: 281 AGYIESAKEMVRKMVKEGFLPDVSTFNSLVEVICKSEEVEFCIDLYHQVCGLGLCPDINT 340

Query: 322 YKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMK 381
           YK+LIPA SK G+IDEAFRLLH  IEDGH PFPSLY PI+KGMC++GQFDDA CFFG+MK
Sbjct: 341 YKVLIPAVSKAGQIDEAFRLLHSSIEDGHKPFPSLYAPIIKGMCRKGQFDDALCFFGEMK 400

Query: 382 HKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHD 441
            KGHPPNRPVYTMLITMCGRGGRFVDAANYL+EM E+GL PISRCFD+VTDGLKNCGKHD
Sbjct: 401 VKGHPPNRPVYTMLITMCGRGGRFVDAANYLVEMTEIGLTPISRCFDLVTDGLKNCGKHD 460

Query: 442 LAKKIEQLEVSIRGI 456
           LA++IEQLEVS RG+
Sbjct: 461 LARRIEQLEVSARGM 470

BLAST of CSPI01G33810 vs. TrEMBL
Match: A0A061F291_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_026106 PE=4 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 8.3e-192
Identity = 329/435 (75.63%), Postives = 377/435 (86.67%), Query Frame = 1

Query: 24  SNSNSLVNGS---TAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLR 83
           +++NSL   S   TA SKDDYFAAIHHIS+ VRR+ + ERTLN++ IS +NSELVFRVLR
Sbjct: 26  TSANSLQIASVSTTAVSKDDYFAAIHHISNTVRREVHPERTLNRMNIS-VNSELVFRVLR 85

Query: 84  ACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKI 143
           +CSNS TES RFF+WA +H   Y PT++EFEELVK L R RKY +MWK + QM+ QNL +
Sbjct: 86  SCSNSPTESLRFFSWARAH---YVPTSVEFEELVKILIRHRKYESMWKTIQQMQKQNLSL 145

Query: 144 SPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYAL 203
           S +T+SFII+EYGK GLVD AV +FN+ S S+ C QTV VYN+LLFALCEVKMFHGAYAL
Sbjct: 146 SCDTLSFIIEEYGKNGLVDQAVEVFNK-STSLGCKQTVSVYNSLLFALCEVKMFHGAYAL 205

Query: 204 IRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLN 263
           IRRMIRKG  PDK+TY  LV GWCS GKM+EAQEFLEEMS+ GFNPP+RGRDLLVEGLLN
Sbjct: 206 IRRMIRKGEVPDKRTYAILVNGWCSGGKMREAQEFLEEMSKMGFNPPVRGRDLLVEGLLN 265

Query: 264 AGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINT 323
           AGYLESAK+MVR+MTKEG VPDIGTFNSL++ IC+SGEVDFCIN++H VCKLGLCPDINT
Sbjct: 266 AGYLESAKEMVRRMTKEGFVPDIGTFNSLVETICSSGEVDFCINMYHSVCKLGLCPDINT 325

Query: 324 YKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMK 383
           YKILIPA SKVGRIDEAFRLL+  +EDG+ PFPSLY PI+K MC++GQFDDAF FFG+MK
Sbjct: 326 YKILIPAASKVGRIDEAFRLLNNSVEDGYRPFPSLYAPIIKAMCRKGQFDDAFSFFGEMK 385

Query: 384 HKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHD 443
            KGH PNRPVYTMLITMCGRGGRFV+AANYL+EM ELGL PISRCFDMV DGLKNCGKHD
Sbjct: 386 VKGHSPNRPVYTMLITMCGRGGRFVEAANYLVEMTELGLAPISRCFDMVIDGLKNCGKHD 445

Query: 444 LAKKIEQLEVSIRGI 456
           LAK+IEQLEVS+RG+
Sbjct: 446 LAKRIEQLEVSLRGV 455

BLAST of CSPI01G33810 vs. TrEMBL
Match: M5WME7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025361mg PE=4 SV=1)

HSP 1 Score: 667.5 bits (1721), Expect = 1.1e-188
Identity = 313/437 (71.62%), Postives = 372/437 (85.13%), Query Frame = 1

Query: 19  LNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRV 78
           L RH +  N+       P+KDDYF+AI HI++IVRRD +MERTLNKLRI+ ++SELV+RV
Sbjct: 24  LLRHLATVNAAPQNRVVPTKDDYFSAIQHITNIVRRDHFMERTLNKLRIT-VDSELVYRV 83

Query: 79  LRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQN- 138
           LRACS +GTES RFFNWA +H+P+Y PTTLE EELVKTLART+KY +MWK+L  M+T + 
Sbjct: 84  LRACSAAGTESLRFFNWARTHHPTYHPTTLELEELVKTLARTKKYESMWKLLQSMQTHHG 143

Query: 139 LKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGA 198
           L +S E++ F+I+EYG  GLVD AV +FN+  K+ +C QTVEVYNALLF+LC+ K+FH A
Sbjct: 144 LTLSQESLCFVIEEYGNHGLVDQAVELFNRAPKTFNCLQTVEVYNALLFSLCQAKLFHAA 203

Query: 199 YALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEG 258
           YAL+RRMIRKG+ PDK+TY  LV  WCS GKM+EAQ FLEEMS KGFNPP+RGRDLLVEG
Sbjct: 204 YALVRRMIRKGLVPDKRTYSILVNAWCSNGKMREAQLFLEEMSSKGFNPPVRGRDLLVEG 263

Query: 259 LLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPD 318
           LLNAGY+E+AK+MVRKM KEG VPD+ TFNSL++ IC  GEV+FCI+++ E   LGLCPD
Sbjct: 264 LLNAGYIEAAKEMVRKMVKEGFVPDVSTFNSLMEAICKCGEVEFCIDLYWEANGLGLCPD 323

Query: 319 INTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFG 378
           INTYK+LIPA SKVGRID+AFRLLH  IEDGH PFPSLY PI+KGMC+RGQFDDAFCFF 
Sbjct: 324 INTYKVLIPAVSKVGRIDDAFRLLHNSIEDGHRPFPSLYAPIIKGMCRRGQFDDAFCFFS 383

Query: 379 DMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCG 438
           +MK KGHPPNRPVYTMLITM GRGGRFV+AANYL+EM E+GL PISRCFD+VTDGLKNCG
Sbjct: 384 EMKVKGHPPNRPVYTMLITMSGRGGRFVEAANYLVEMTEMGLMPISRCFDLVTDGLKNCG 443

Query: 439 KHDLAKKIEQLEVSIRG 455
           KHD+AK+IEQLEVS+RG
Sbjct: 444 KHDMAKRIEQLEVSLRG 459

BLAST of CSPI01G33810 vs. TrEMBL
Match: D7TDU0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0043g00770 PE=4 SV=1)

HSP 1 Score: 661.8 bits (1706), Expect = 6.2e-187
Identity = 308/418 (73.68%), Postives = 364/418 (87.08%), Query Frame = 1

Query: 38  KDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWAC 97
           KDDYFA +HHIS IVRRDFY+ERTLNKL IS + S+LV+RVLR+C NSGTES RFFNWA 
Sbjct: 43  KDDYFAVVHHISAIVRRDFYLERTLNKLPIS-VTSDLVYRVLRSCPNSGTESLRFFNWAR 102

Query: 98  SHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGL 157
           SH  SYQPTTLE+EEL+KTLART+++  MWK+  QM+T    +SP  +S II+E+GK GL
Sbjct: 103 SH-LSYQPTTLEYEELLKTLARTKQFQPMWKIAHQMQT----LSPTVVSSIIEEFGKHGL 162

Query: 158 VDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYG 217
           VD AV +FN+   +++CPQT+EVYN+LLFALCEVK FHGAYALIRRMIRKGVTP+K+TY 
Sbjct: 163 VDQAVEVFNKAKSALNCPQTIEVYNSLLFALCEVKYFHGAYALIRRMIRKGVTPNKQTYS 222

Query: 218 TLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKE 277
            LV GWC+AGKMKEAQ+FLEEMS+KGFNPP+RGRDLLV+GLLNAGYLE+AK+MVRKMTKE
Sbjct: 223 VLVNGWCAAGKMKEAQDFLEEMSRKGFNPPVRGRDLLVDGLLNAGYLEAAKEMVRKMTKE 282

Query: 278 GSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEA 337
           G  PD+ T NS+++ IC +GE +FCI+I+++VC+LG+ P++ TYKI+IPA  K GRIDEA
Sbjct: 283 GCAPDVETLNSMLEAICKAGEAEFCIDIYNDVCRLGVSPNVGTYKIMIPAACKEGRIDEA 342

Query: 338 FRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITM 397
           FR+LH  IEDGH PFPSLY PI+K +C+ GQFDDAFCFF DMK KGHPPNRPVYTMLITM
Sbjct: 343 FRILHRSIEDGHRPFPSLYAPIIKALCRNGQFDDAFCFFSDMKVKGHPPNRPVYTMLITM 402

Query: 398 CGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI 456
           CGRGGRFVDAANYL+EM EL L PISRCFDMVTDGLKNCGKHDLA+KIEQLEVS+RG+
Sbjct: 403 CGRGGRFVDAANYLVEMTELNLTPISRCFDMVTDGLKNCGKHDLARKIEQLEVSLRGV 454

BLAST of CSPI01G33810 vs. TAIR10
Match: AT5G18390.1 (AT5G18390.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 634.8 bits (1636), Expect = 4.1e-182
Identity = 302/435 (69.43%), Postives = 358/435 (82.30%), Query Frame = 1

Query: 21  RHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLR 80
           RHF++   L +  + P+K DYFAAI+H+ +IVRR+ + ER+LN LR+  + SE VFRVLR
Sbjct: 27  RHFNSLEPLQSSDSTPTKGDYFAAINHVVNIVRREIHPERSLNSLRLP-VTSEFVFRVLR 86

Query: 81  ACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKI 140
           A S S  +S RFFNWA S NPSY PT++E+EEL K+LA  +KY +MWK+L QMK  +L I
Sbjct: 87  ATSRSSNDSLRFFNWARS-NPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDI 146

Query: 141 SPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYAL 200
           S ET+ FII++YGK G VD AV +FN   K++ C QTV+VYN+LL ALC+VKMFHGAYAL
Sbjct: 147 SGETLCFIIEQYGKNGHVDQAVELFNGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYAL 206

Query: 201 IRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLN 260
           IRRMIRKG+ PDK+TY  LV GWCSAGKMKEAQEFL+EMS++GFNPP RGRDLL+EGLLN
Sbjct: 207 IRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLLN 266

Query: 261 AGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINT 320
           AGYLESAK+MV KMTK G VPDI TFN LI+ I  SGEV+FCI +++  CKLGLC DI+T
Sbjct: 267 AGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDT 326

Query: 321 YKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMK 380
           YK LIPA SK+G+IDEAFRLL+ C+EDGH PFPSLY PI+KGMC+ G FDDAF FF DMK
Sbjct: 327 YKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMK 386

Query: 381 HKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHD 440
            K HPPNRPVYTMLITMCGRGG+FVDAANYL+EM E+GL PISRCFDMVTDGLKN GKHD
Sbjct: 387 VKAHPPNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVPISRCFDMVTDGLKNGGKHD 446

Query: 441 LAKKIEQLEVSIRGI 456
           LA +IEQLEV +RG+
Sbjct: 447 LAMRIEQLEVQLRGV 459

BLAST of CSPI01G33810 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 169.9 bits (429), Expect = 3.7e-42
Identity = 114/413 (27.60%), Postives = 199/413 (48.18%), Query Frame = 1

Query: 31  NGSTAPSKDDYFAAIHHISHIVRRDFY-----MERTLNKLRISNLNSELVFRVLRACSNS 90
           N  T  SK D FA+    S+ + R F+     +E  LN+  +  L   L+ RVL  C ++
Sbjct: 68  NDRTKNSKYDEFASDVEKSYRILRKFHSRVPKLELALNESGVE-LRPGLIERVLNRCGDA 127

Query: 91  GTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLK-ISPET 150
           G   +RFF WA    P Y  +   ++ +VK L++ R++  +W ++ +M+ +N + I PE 
Sbjct: 128 GNLGYRFFVWAAKQ-PRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPEL 187

Query: 151 ISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRM 210
              ++Q +    +V  A+ + ++  K    P    V+  LL ALC+      A  L   M
Sbjct: 188 FVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEY-VFGCLLDALCKHGSVKDAAKLFEDM 247

Query: 211 IRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYL 270
            R     + + + +L+ GWC  GKM EA+  L +M++ GF P +     L+ G  NAG +
Sbjct: 248 -RMRFPVNLRYFTSLLYGWCRVGKMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKM 307

Query: 271 ESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKIL 330
             A D++R M + G  P+   +  LI  +C    ++  + +F E+ +     D+ TY  L
Sbjct: 308 ADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTAL 367

Query: 331 IPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGH 390
           +    K G+ID+ + +L   I+ G +P    Y  I+    K+  F++       M+   +
Sbjct: 368 VSGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEY 427

Query: 391 PPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCG 438
            P+  +Y ++I +  + G   +A     EM E GL P    F ++ +GL + G
Sbjct: 428 HPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQG 476

BLAST of CSPI01G33810 vs. TAIR10
Match: AT3G04760.1 (AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 155.6 bits (392), Expect = 7.3e-38
Identity = 89/318 (27.99%), Postives = 150/318 (47.17%), Query Frame = 1

Query: 104 QPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVT 163
           QP    +  L+    +  +     +VL +M++++      T + +I     +G +D A+ 
Sbjct: 155 QPDVFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALK 214

Query: 164 IFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGW 223
           + NQ   S +C  TV  Y  L+ A         A  L+  M+ +G+ PD  TY T++ G 
Sbjct: 215 VLNQLL-SDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGM 274

Query: 224 CSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDI 283
           C  G +  A E +  +  KG  P +   ++L+  LLN G  E  + ++ KM  E   P++
Sbjct: 275 CKEGMVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNV 334

Query: 284 GTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHC 343
            T++ LI  +C  G+++  +N+   + + GL PD  +Y  LI A  + GR+D A   L  
Sbjct: 335 VTYSILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLET 394

Query: 344 CIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGR 403
            I DG +P    Y  +L  +CK G+ D A   FG +   G  PN   Y  + +     G 
Sbjct: 395 MISDGCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGD 454

Query: 404 FVDAANYLMEMAELGLPP 422
            + A + ++EM   G+ P
Sbjct: 455 KIRALHMILEMMSNGIDP 471

BLAST of CSPI01G33810 vs. TAIR10
Match: AT1G62930.1 (AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 155.2 bits (391), Expect = 9.5e-38
Identity = 94/338 (27.81%), Postives = 164/338 (48.52%), Query Frame = 1

Query: 100 NPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVD 159
           N   +P  + +  L++ L    +++   ++L  M  + +  +  T S +I  + K+G + 
Sbjct: 283 NKGIRPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLV 342

Query: 160 NAVTIFNQCSK-SIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGT 219
            A  ++++  K SID    +  Y++L+   C       A  +   MI K   P+  TY T
Sbjct: 343 EAEKLYDEMIKRSID--PDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNT 402

Query: 220 LVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEG 279
           L+ G+C A +++E  E   EMSQ+G        + L++GL  AG  + A+ + +KM  +G
Sbjct: 403 LIKGFCKAKRVEEGMELFREMSQRGLVGNTVTYNTLIQGLFQAGDCDMAQKIFKKMVSDG 462

Query: 280 SVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAF 339
             PDI T++ L+D +C  G+++  + +F  + K  + PDI TY I+I    K G++++ +
Sbjct: 463 VPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKAGKVEDGW 522

Query: 340 RLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMC 399
            L       G  P   +Y  ++ G C++G  ++A   F +MK  G  PN   Y  LI   
Sbjct: 523 DLFCSLSLKGVKPNVIIYTTMISGFCRKGLKEEADALFREMKEDGTLPNSGTYNTLIRAR 582

Query: 400 GRGGRFVDAANYLMEMAELGL----PPISRCFDMVTDG 433
            R G    +A  + EM   G       IS   +M+ DG
Sbjct: 583 LRDGDKAASAELIKEMRSCGFVGDASTISMVINMLHDG 618

BLAST of CSPI01G33810 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 154.8 bits (390), Expect = 1.2e-37
Identity = 94/344 (27.33%), Postives = 164/344 (47.67%), Query Frame = 1

Query: 105 PTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGL---VDNA 164
           P  + +  L+    + RK    +K+L  M  + L+ +  + + +I    ++G    V   
Sbjct: 238 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 297

Query: 165 VTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVT 224
           +T  N+   S+D       YN L+   C+   FH A  +   M+R G+TP   TY +L+ 
Sbjct: 298 LTEMNRRGYSLD----EVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIH 357

Query: 225 GWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVP 284
             C AG M  A EFL++M  +G  P  R    LV+G    GY+  A  ++R+M   G  P
Sbjct: 358 SMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSP 417

Query: 285 DIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL 344
            + T+N+LI+  C +G+++  I +  ++ + GL PD+ +Y  ++    +   +DEA R+ 
Sbjct: 418 SVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVK 477

Query: 345 HCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRG 404
              +E G  P    Y  +++G C++ +  +A   + +M   G PP+   YT LI      
Sbjct: 478 REMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCME 537

Query: 405 GRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHDLAKKI 446
           G    A     EM E G+ P    + ++ +GL    +   AK++
Sbjct: 538 GDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 577

BLAST of CSPI01G33810 vs. NCBI nr
Match: gi|449449535|ref|XP_004142520.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucumis sativus])

HSP 1 Score: 945.3 bits (2442), Expect = 4.0e-272
Identity = 455/455 (100.00%), Postives = 455/455 (100.00%), Query Frame = 1

Query: 1   MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER 60
           MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER
Sbjct: 1   MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER 60

Query: 61  TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART 120
           TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART
Sbjct: 61  TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART 120

Query: 121 RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV 180
           RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV
Sbjct: 121 RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV 180

Query: 181 YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS 240
           YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS
Sbjct: 181 YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS 240

Query: 241 QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD 300
           QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD
Sbjct: 241 QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD 300

Query: 301 FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL 360
           FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL
Sbjct: 301 FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL 360

Query: 361 KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP 420
           KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP
Sbjct: 361 KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP 420

Query: 421 PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI 456
           PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI
Sbjct: 421 PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI 455

BLAST of CSPI01G33810 vs. NCBI nr
Match: gi|659125518|ref|XP_008462724.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Cucumis melo])

HSP 1 Score: 911.0 bits (2353), Expect = 8.4e-262
Identity = 440/455 (96.70%), Postives = 445/455 (97.80%), Query Frame = 1

Query: 1   MLHFKNFATGLIRFRYIFLNRHFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMER 60
           MLHFKNFATG IRF YI L R FSNSNS VNGSTAPSKDDYFAAIHHISHIVRRDFYMER
Sbjct: 1   MLHFKNFATGAIRFHYISLIRCFSNSNSSVNGSTAPSKDDYFAAIHHISHIVRRDFYMER 60

Query: 61  TLNKLRISNLNSELVFRVLRACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLART 120
           TLNKLRIS LNSELVFRVLRACSN GTESFRFFNWACSHNPSYQPTTLE EELVKTLART
Sbjct: 61  TLNKLRISYLNSELVFRVLRACSNCGTESFRFFNWACSHNPSYQPTTLELEELVKTLART 120

Query: 121 RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV 180
           RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV
Sbjct: 121 RKYTTMWKVLLQMKTQNLKISPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEV 180

Query: 181 YNALLFALCEVKMFHGAYALIRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS 240
           YNALLFALCEVKMFHGAYALIRRMI+KGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS
Sbjct: 181 YNALLFALCEVKMFHGAYALIRRMIKKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMS 240

Query: 241 QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVD 300
           QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEG VPDIGTFNSLIDVICNSGEVD
Sbjct: 241 QKGFNPPLRGRDLLVEGLLNAGYLESAKDMVRKMTKEGCVPDIGTFNSLIDVICNSGEVD 300

Query: 301 FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPIL 360
           FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLL+CCIEDGHVPFPSLYGPI+
Sbjct: 301 FCINIFHEVCKLGLCPDINTYKILIPATSKVGRIDEAFRLLNCCIEDGHVPFPSLYGPII 360

Query: 361 KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP 420
           KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP
Sbjct: 361 KGMCKRGQFDDAFCFFGDMKHKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLP 420

Query: 421 PISRCFDMVTDGLKNCGKHDLAKKIEQLEVSIRGI 456
           PISRCFDMVTDGLK+CGKHDLAKKIE+LEVSIRGI
Sbjct: 421 PISRCFDMVTDGLKSCGKHDLAKKIEKLEVSIRGI 455

BLAST of CSPI01G33810 vs. NCBI nr
Match: gi|1009129954|ref|XP_015882041.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial [Ziziphus jujuba])

HSP 1 Score: 691.0 bits (1782), Expect = 1.4e-195
Identity = 326/431 (75.64%), Postives = 374/431 (86.77%), Query Frame = 1

Query: 26  SNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLRACSNS 85
           + + +NG   PSKDDYFAAIHHIS+IVRRD YMERTLNK+RIS +NSELV+RVLR+CS S
Sbjct: 33  TTTTINGGK-PSKDDYFAAIHHISNIVRRDIYMERTLNKMRIS-VNSELVYRVLRSCSTS 92

Query: 86  GTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQ-NLKISPET 145
            TE+ RFFNWA +H+PSY PTT+E EELV+TLART+KY +MWK+L QM T  NL +SP+T
Sbjct: 93  ATEALRFFNWAQTHHPSYHPTTVECEELVRTLARTKKYESMWKILNQMHTHHNLSVSPDT 152

Query: 146 ISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYALIRRM 205
           + FI++EYG+ GLVD AV IFNQ  K  +C QTV +YN+LLFALCEVK+FHGAYALIRRM
Sbjct: 153 LCFIVEEYGRHGLVDQAVEIFNQAPKIFNCKQTVNLYNSLLFALCEVKLFHGAYALIRRM 212

Query: 206 IRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLNAGYL 265
           IRKGV PDK+TY  LV  WCSAGK +EAQ+FLEEMS+KGFNPP+RGRDLLVEGLLNAGY+
Sbjct: 213 IRKGVVPDKRTYSILVNAWCSAGKFREAQQFLEEMSKKGFNPPVRGRDLLVEGLLNAGYV 272

Query: 266 ESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINTYKIL 325
           ESAK+MV+KM KEG VPD+ TFNSL++ I   GEV+FCI+++ EVC LGL PDINTYK+L
Sbjct: 273 ESAKEMVKKMIKEGFVPDVSTFNSLLEAISKHGEVEFCIDLYREVCILGLVPDINTYKVL 332

Query: 326 IPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMKHKGH 385
           IP  SKVGRIDEAFRLLH  IEDGH PFPSLY PI+KGMCK GQFDDA CFF +MK KGH
Sbjct: 333 IPGVSKVGRIDEAFRLLHDSIEDGHKPFPSLYAPIIKGMCKNGQFDDALCFFSEMKVKGH 392

Query: 386 PPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHDLAKK 445
           PPNRPVYTMLITMCGRGGRFV+AANYLMEM ELGL PISRCFD+VTDGLKNCGKHDLAKK
Sbjct: 393 PPNRPVYTMLITMCGRGGRFVEAANYLMEMTELGLTPISRCFDLVTDGLKNCGKHDLAKK 452

Query: 446 IEQLEVSIRGI 456
           IEQLEVS+RG+
Sbjct: 453 IEQLEVSLRGV 461

BLAST of CSPI01G33810 vs. NCBI nr
Match: gi|703120790|ref|XP_010102179.1| (hypothetical protein L484_024459 [Morus notabilis])

HSP 1 Score: 690.3 bits (1780), Expect = 2.3e-195
Identity = 323/435 (74.25%), Postives = 374/435 (85.98%), Query Frame = 1

Query: 22  HFSNSNSLVNGSTAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLRA 81
           H+ NS        + SKD+YFAAIHHIS+IV+RDFYMERTLNKLRI+ ++S+LVFRVLRA
Sbjct: 41  HYQNSTK-----PSSSKDNYFAAIHHISNIVQRDFYMERTLNKLRIAAVDSDLVFRVLRA 100

Query: 82  CSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQN-LKI 141
           C   G ES RFFNWA SH PSY+PT++E EEL K LART+KY +MWK+L QMKT N L I
Sbjct: 101 CHKFGPESLRFFNWARSHQPSYRPTSVELEELAKNLARTKKYESMWKILQQMKTNNNLII 160

Query: 142 SPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYAL 201
           S ET+ FII+EYGKQGLVD A  +FN+  K  +C QTVEVYN+LLFALCEVK+FHGAYAL
Sbjct: 161 SSETLCFIIEEYGKQGLVDQAAEVFNRVPKIFNCSQTVEVYNSLLFALCEVKLFHGAYAL 220

Query: 202 IRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLN 261
           +RRMIRK V PDK+TY  LV  WCSAGKM+EAQ FL EMS+KGFNPP+RGRDLL+EGLLN
Sbjct: 221 VRRMIRKEVVPDKRTYSILVNAWCSAGKMREAQNFLSEMSKKGFNPPVRGRDLLIEGLLN 280

Query: 262 AGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINT 321
           AGY+ESAK+MVRKM KEG +PD+ TFNSL++VIC S EV+FCI+++H+VC LGLCPDINT
Sbjct: 281 AGYIESAKEMVRKMVKEGFLPDVSTFNSLVEVICKSEEVEFCIDLYHQVCGLGLCPDINT 340

Query: 322 YKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMK 381
           YK+LIPA SK G+IDEAFRLLH  IEDGH PFPSLY PI+KGMC++GQFDDA CFFG+MK
Sbjct: 341 YKVLIPAVSKAGQIDEAFRLLHSSIEDGHKPFPSLYAPIIKGMCRKGQFDDALCFFGEMK 400

Query: 382 HKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHD 441
            KGHPPNRPVYTMLITMCGRGGRFVDAANYL+EM E+GL PISRCFD+VTDGLKNCGKHD
Sbjct: 401 VKGHPPNRPVYTMLITMCGRGGRFVDAANYLVEMTEIGLTPISRCFDLVTDGLKNCGKHD 460

Query: 442 LAKKIEQLEVSIRGI 456
           LA++IEQLEVS RG+
Sbjct: 461 LARRIEQLEVSARGM 470

BLAST of CSPI01G33810 vs. NCBI nr
Match: gi|590641647|ref|XP_007030290.1| (Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao])

HSP 1 Score: 677.9 bits (1748), Expect = 1.2e-191
Identity = 329/435 (75.63%), Postives = 377/435 (86.67%), Query Frame = 1

Query: 24  SNSNSLVNGS---TAPSKDDYFAAIHHISHIVRRDFYMERTLNKLRISNLNSELVFRVLR 83
           +++NSL   S   TA SKDDYFAAIHHIS+ VRR+ + ERTLN++ IS +NSELVFRVLR
Sbjct: 26  TSANSLQIASVSTTAVSKDDYFAAIHHISNTVRREVHPERTLNRMNIS-VNSELVFRVLR 85

Query: 84  ACSNSGTESFRFFNWACSHNPSYQPTTLEFEELVKTLARTRKYTTMWKVLLQMKTQNLKI 143
           +CSNS TES RFF+WA +H   Y PT++EFEELVK L R RKY +MWK + QM+ QNL +
Sbjct: 86  SCSNSPTESLRFFSWARAH---YVPTSVEFEELVKILIRHRKYESMWKTIQQMQKQNLSL 145

Query: 144 SPETISFIIQEYGKQGLVDNAVTIFNQCSKSIDCPQTVEVYNALLFALCEVKMFHGAYAL 203
           S +T+SFII+EYGK GLVD AV +FN+ S S+ C QTV VYN+LLFALCEVKMFHGAYAL
Sbjct: 146 SCDTLSFIIEEYGKNGLVDQAVEVFNK-STSLGCKQTVSVYNSLLFALCEVKMFHGAYAL 205

Query: 204 IRRMIRKGVTPDKKTYGTLVTGWCSAGKMKEAQEFLEEMSQKGFNPPLRGRDLLVEGLLN 263
           IRRMIRKG  PDK+TY  LV GWCS GKM+EAQEFLEEMS+ GFNPP+RGRDLLVEGLLN
Sbjct: 206 IRRMIRKGEVPDKRTYAILVNGWCSGGKMREAQEFLEEMSKMGFNPPVRGRDLLVEGLLN 265

Query: 264 AGYLESAKDMVRKMTKEGSVPDIGTFNSLIDVICNSGEVDFCINIFHEVCKLGLCPDINT 323
           AGYLESAK+MVR+MTKEG VPDIGTFNSL++ IC+SGEVDFCIN++H VCKLGLCPDINT
Sbjct: 266 AGYLESAKEMVRRMTKEGFVPDIGTFNSLVETICSSGEVDFCINMYHSVCKLGLCPDINT 325

Query: 324 YKILIPATSKVGRIDEAFRLLHCCIEDGHVPFPSLYGPILKGMCKRGQFDDAFCFFGDMK 383
           YKILIPA SKVGRIDEAFRLL+  +EDG+ PFPSLY PI+K MC++GQFDDAF FFG+MK
Sbjct: 326 YKILIPAASKVGRIDEAFRLLNNSVEDGYRPFPSLYAPIIKAMCRKGQFDDAFSFFGEMK 385

Query: 384 HKGHPPNRPVYTMLITMCGRGGRFVDAANYLMEMAELGLPPISRCFDMVTDGLKNCGKHD 443
            KGH PNRPVYTMLITMCGRGGRFV+AANYL+EM ELGL PISRCFDMV DGLKNCGKHD
Sbjct: 386 VKGHSPNRPVYTMLITMCGRGGRFVEAANYLVEMTELGLAPISRCFDMVIDGLKNCGKHD 445

Query: 444 LAKKIEQLEVSIRGI 456
           LAK+IEQLEVS+RG+
Sbjct: 446 LAKRIEQLEVSLRGV 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP391_ARATH7.3e-18169.43Pentatricopeptide repeat-containing protein At5g18390, mitochondrial OS=Arabidop... [more]
PP447_ARATH6.6e-4127.60Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
PP213_ARATH1.3e-3627.99Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
PPR96_ARATH1.7e-3627.81Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop... [more]
PP407_ARATH2.2e-3627.33Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0M105_CUCSA2.8e-272100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G695420 PE=4 SV=1[more]
W9RHW0_9ROSA1.6e-19574.25Uncharacterized protein OS=Morus notabilis GN=L484_024459 PE=4 SV=1[more]
A0A061F291_THECC8.3e-19275.63Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_026... [more]
M5WME7_PRUPE1.1e-18871.62Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025361mg PE=4 SV=1[more]
D7TDU0_VITVI6.2e-18773.68Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0043g00770 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT5G18390.14.1e-18269.43 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G65820.13.7e-4227.60 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G04760.17.3e-3827.99 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G62930.19.5e-3827.81 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G39710.11.2e-3727.33 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449449535|ref|XP_004142520.1|4.0e-272100.00PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial ... [more]
gi|659125518|ref|XP_008462724.1|8.4e-26296.70PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial ... [more]
gi|1009129954|ref|XP_015882041.1|1.4e-19575.64PREDICTED: pentatricopeptide repeat-containing protein At5g18390, mitochondrial ... [more]
gi|703120790|ref|XP_010102179.1|2.3e-19574.25hypothetical protein L484_024459 [Morus notabilis][more]
gi|590641647|ref|XP_007030290.1|1.2e-19175.63Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G33810.1CSPI01G33810.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 254..278
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 207..240
score: 1.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 359..398
score: 3.1E-9coord: 281..327
score: 4.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 390..421
score: 5.6E-4coord: 285..318
score: 7.1E-5coord: 254..283
score: 0.0022coord: 180..212
score: 3.6E-6coord: 215..246
score: 1.2E-4coord: 356..387
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 352..386
score: 10.424coord: 141..175
score: 6.588coord: 177..211
score: 10.841coord: 387..421
score: 10.052coord: 247..281
score: 7.366coord: 317..351
score: 9.701coord: 282..316
score: 10.161coord: 212..246
score: 12.737coord: 106..140
score: 6
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 150..348
score: 3.9E-5coord: 385..416
score: 3.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 9..455
score: 1.7E
NoneNo IPR availablePANTHERPTHR24015:SF579SUBFAMILY NOT NAMEDcoord: 9..455
score: 1.7E