CSPI05G08030 (gene) Wild cucumber (PI 183967)

NameCSPI05G08030
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationChr5 : 6820888 .. 6822267 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTTCCAAATATTCTACCAAGTTGTTGACTATTTGTGTCGCCTCATTTTGTAAATCTCACCAATTTCAGAAAGCTGAAATTGCTATTAAAGATGCCATTCGTTTAGGAGTTGTTCCCGACATACTAACTTACAATATTTTGATTAATGGGTATTGCCAATTTAGCGACATGGATGCTGCTTATTCTGTCTTTCATAAAATGAGGGACGCTGGCATTACTCCAAATGTGTTTATTTATACTTCTTTGATGGCTGCTGCATCCAGAAATTCTTCATTAGAACAATGCCTTAATCTGTTTCATGAAATGCTTCAATTAGGTATCACTCCTGATGTATGGAGTTACACCACTTTGACCCACTGTTTCCTTAAAGTAGGAAAGCCAGATGAAGCTAATAGCATTTTTTTGGATTTTATACTTAAAGACCACTCCCCTAATCCTGCTACGTTTGATGTGATGATTTTCGGCTTTTGCAGTTGTGGATATACAAGTAATGCTATTACGTTGTTTAGAAATTTGCAGAGCCATGGACTTGTTCCTAAATTTGTTACATATGACATTCTTCTTACTGGACTTTGCAAGGCGGCTAGGTTGAATGTCTCTGTAAGTATGTTCAACGAGGCTATGAGTATGTTCAATGAGGCCATCGATTCAGGTTTTGAGCCCGATTCTACTACATACATAGCGTTGATGAAATGTTGCTTCAAATTTAGAGAGTATCAACATGGGTTCGAGATATTCTTTGAAATGAAAAACAAGGGCCTTGCTTTTAATGGTTTTGGTTACTACACTGCTATTGGTGCTTTACTTCGGTTAGGTAGGCTTGAAGAGGCAAAATTTTTTATGGTAGAGATGATAAAGAATGGAGTGGTATTTAATTTAGTTTTTTATAACACAGTTGTTAATTTGTACTGTAAACATGGTAAATTGGAGGCTGCACATAAGATGTTGGATAAGATAGAGTCACGGGGATTACAATGCAACGATTACACACATGCTATAATCACTGATGGGTTATGTAAGGTTGGCAATTTTGAGGGGGCTCGACGACATTTGAATTATATCTATCCATCAGGTTTTACTAATTCAAACGTGGTAGCCTCGAGTTGTCTAATTGATAGGTTATGTAAGGCTGGACAAATTGACCAAGCAATGCAACTGTTTGAATTAATGGAAACAAAGGATCCTTATGTCTACACCTCTTTGATGCACAATCTTTGCAAGGCAAGGAGATTCCTTTGTGCATCAAAGTTATTGCTTGACTGCTTAAGAAGTGGCATCAGTGTTTTTCGGTCCACACAATGTGCAGTTATCTTCGGTCTTTGTTCTTTTGGATTTACAAGTGAAGCAAGGAAGCTCAAACCTTTCATTCATTTATCA

mRNA sequence

ATGGTTTCCAAATATTCTACCAAGTTGTTGACTATTTGTGTCGCCTCATTTTGTAAATCTCACCAATTTCAGAAAGCTGAAATTGCTATTAAAGATGCCATTCGTTTAGGAGTTGTTCCCGACATACTAACTTACAATATTTTGATTAATGGGTATTGCCAATTTAGCGACATGGATGCTGCTTATTCTGTCTTTCATAAAATGAGGGACGCTGGCATTACTCCAAATGTGTTTATTTATACTTCTTTGATGGCTGCTGCATCCAGAAATTCTTCATTAGAACAATGCCTTAATCTGTTTCATGAAATGCTTCAATTAGGTATCACTCCTGATGTATGGAGTTACACCACTTTGACCCACTGTTTCCTTAAAGTAGGAAAGCCAGATGAAGCTAATAGCATTTTTTTGGATTTTATACTTAAAGACCACTCCCCTAATCCTGCTACGTTTGATGTGATGATTTTCGGCTTTTGCAGTTGTGGATATACAAGTAATGCTATTACGTTGTTTAGAAATTTGCAGAGCCATGGACTTGTTCCTAAATTTGTTACATATGACATTCTTCTTACTGGACTTTGCAAGGCGGCTAGGTTGAATGTCTCTGTAAGTATGTTCAACGAGGCTATGAGTATGTTCAATGAGGCCATCGATTCAGGTTTTGAGCCCGATTCTACTACATACATAGCGTTGATGAAATGTTGCTTCAAATTTAGAGAGTATCAACATGGGTTCGAGATATTCTTTGAAATGAAAAACAAGGGCCTTGCTTTTAATGGTTTTGGTTACTACACTGCTATTGGTGCTTTACTTCGGTTAGGTAGGCTTGAAGAGGCAAAATTTTTTATGGTAGAGATGATAAAGAATGGAGTGGTATTTAATTTAGTTTTTTATAACACAGTTGTTAATTTGTACTGTAAACATGGTAAATTGGAGGCTGCACATAAGATGTTGGATAAGATAGAGTCACGGGGATTACAATGCAACGATTACACACATGCTATAATCACTGATGGGTTATGTAAGGTTGGCAATTTTGAGGGGGCTCGACGACATTTGAATTATATCTATCCATCAGGTTTTACTAATTCAAACGTGGTAGCCTCGAGTTGTCTAATTGATAGGTTATGTAAGGCTGGACAAATTGACCAAGCAATGCAACTGTTTGAATTAATGGAAACAAAGGATCCTTATGTCTACACCTCTTTGATGCACAATCTTTGCAAGGCAAGGAGATTCCTTTGTGCATCAAAGTTATTGCTTGACTGCTTAAGAAGTGGCATCAGTGTTTTTCGGTCCACACAATGTGCAGTTATCTTCGGTCTTTGTTCTTTTGGATTTACAAGTGAAGCAAGGAAGCTCAAACCTTTCATTCATTTATCA

Coding sequence (CDS)

ATGGTTTCCAAATATTCTACCAAGTTGTTGACTATTTGTGTCGCCTCATTTTGTAAATCTCACCAATTTCAGAAAGCTGAAATTGCTATTAAAGATGCCATTCGTTTAGGAGTTGTTCCCGACATACTAACTTACAATATTTTGATTAATGGGTATTGCCAATTTAGCGACATGGATGCTGCTTATTCTGTCTTTCATAAAATGAGGGACGCTGGCATTACTCCAAATGTGTTTATTTATACTTCTTTGATGGCTGCTGCATCCAGAAATTCTTCATTAGAACAATGCCTTAATCTGTTTCATGAAATGCTTCAATTAGGTATCACTCCTGATGTATGGAGTTACACCACTTTGACCCACTGTTTCCTTAAAGTAGGAAAGCCAGATGAAGCTAATAGCATTTTTTTGGATTTTATACTTAAAGACCACTCCCCTAATCCTGCTACGTTTGATGTGATGATTTTCGGCTTTTGCAGTTGTGGATATACAAGTAATGCTATTACGTTGTTTAGAAATTTGCAGAGCCATGGACTTGTTCCTAAATTTGTTACATATGACATTCTTCTTACTGGACTTTGCAAGGCGGCTAGGTTGAATGTCTCTGTAAGTATGTTCAACGAGGCTATGAGTATGTTCAATGAGGCCATCGATTCAGGTTTTGAGCCCGATTCTACTACATACATAGCGTTGATGAAATGTTGCTTCAAATTTAGAGAGTATCAACATGGGTTCGAGATATTCTTTGAAATGAAAAACAAGGGCCTTGCTTTTAATGGTTTTGGTTACTACACTGCTATTGGTGCTTTACTTCGGTTAGGTAGGCTTGAAGAGGCAAAATTTTTTATGGTAGAGATGATAAAGAATGGAGTGGTATTTAATTTAGTTTTTTATAACACAGTTGTTAATTTGTACTGTAAACATGGTAAATTGGAGGCTGCACATAAGATGTTGGATAAGATAGAGTCACGGGGATTACAATGCAACGATTACACACATGCTATAATCACTGATGGGTTATGTAAGGTTGGCAATTTTGAGGGGGCTCGACGACATTTGAATTATATCTATCCATCAGGTTTTACTAATTCAAACGTGGTAGCCTCGAGTTGTCTAATTGATAGGTTATGTAAGGCTGGACAAATTGACCAAGCAATGCAACTGTTTGAATTAATGGAAACAAAGGATCCTTATGTCTACACCTCTTTGATGCACAATCTTTGCAAGGCAAGGAGATTCCTTTGTGCATCAAAGTTATTGCTTGACTGCTTAAGAAGTGGCATCAGTGTTTTTCGGTCCACACAATGTGCAGTTATCTTCGGTCTTTGTTCTTTTGGATTTACAAGTGAAGCAAGGAAGCTCAAACCTTTCATTCATTTATCA
BLAST of CSPI05G08030 vs. Swiss-Prot
Match: PP318_ARATH (Putative pentatricopeptide repeat-containing protein At4g17915 OS=Arabidopsis thaliana GN=At4g17915 PE=3 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 1.8e-110
Identity = 204/456 (44.74%), Postives = 289/456 (63.38%), Query Frame = 1

Query: 6   STKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVF 65
           ST+LL ICV S CK  + +KAE  I D IRLGV PD++TYN LI+GYC+F  ++ AY+V 
Sbjct: 12  STRLLNICVDSLCKFRKLEKAESLIIDGIRLGVDPDVVTYNTLISGYCRFVGIEEAYAVT 71

Query: 66  HKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKV 125
            +MRDAGI P+V  Y SL+A A+R   L+  L LF EML+ GI PD+WSY TL  C+ K+
Sbjct: 72  RRMRDAGIRPDVATYNSLIAGAARRLMLDHVLYLFDEMLEWGIYPDLWSYNTLMCCYFKL 131

Query: 126 GKPDEA-NSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVT 185
           GK +EA   ++ D  L   +P P T++V++   C CGY  NA+ LF+ +QS    P+ +T
Sbjct: 132 GKHEEAFRVLYKDLQLAGLNPGPDTYNVLLDALCKCGYIDNALELFKEMQSR-FKPELMT 191

Query: 186 YDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGF 245
           Y+IL+ GLCK+ R+         A  M  E   SG+ P++ TY  ++K  FK R  + G 
Sbjct: 192 YNILINGLCKSRRVGT-------AKWMLTELKKSGYTPNAVTYTTILKLYFKTRRIRRGL 251

Query: 246 EIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNLY 305
           ++F EMK +G  ++G+ Y+  + AL++ GR +EA  +M E+++ G   ++V YNT++NLY
Sbjct: 252 QLFLEMKREGYTYDGYAYFAVVSALIKTGRTKEAYEYMQELVRKGRRHDIVSYNTLLNLY 311

Query: 306 CKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNSN 365
            K G L+A   +L +IE RG++ ++YTH II +GL + G    A  H   +   G    N
Sbjct: 312 FKDGNLDAVDDLLGEIERRGMKADEYTHTIIVNGLLRTGQTRRAEEHFVSMGEMGI-GLN 371

Query: 366 VVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLLDCLR 425
           +V  +CL+D LCKAG +D+AM+ FE ME KD Y YTS++HNLCK  RF+CASKLLL C  
Sbjct: 372 LVTCNCLVDGLCKAGHVDRAMRYFESMEVKDEYTYTSVVHNLCKDMRFVCASKLLLSCYN 431

Query: 426 SGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
            GI +  S + AV+ GL   G   EARK K  + L+
Sbjct: 432 KGIKIPTSARRAVLSGLRMSGCYGEARKAKAEMKLT 458

BLAST of CSPI05G08030 vs. Swiss-Prot
Match: PP421_ARATH (Pentatricopeptide repeat-containing protein At5g46680 OS=Arabidopsis thaliana GN=At5g46680 PE=2 SV=2)

HSP 1 Score: 367.1 bits (941), Expect = 2.9e-100
Identity = 196/449 (43.65%), Postives = 282/449 (62.81%), Query Frame = 1

Query: 6   STKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVF 65
           STKLL I V S CK    ++AE  + D IRLGV+PD++TYN LI GY +F  +D AY+V 
Sbjct: 12  STKLLNISVNSLCKFRNLERAETLLIDGIRLGVLPDVITYNTLIKGYTRFIGIDEAYAVT 71

Query: 66  HKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKV 125
            +MR+AGI P+V  Y SL++ A++N  L + L LF EML  G++PD+WSY TL  C+ K+
Sbjct: 72  RRMREAGIEPDVTTYNSLISGAAKNLMLNRVLQLFDEMLHSGLSPDMWSYNTLMSCYFKL 131

Query: 126 GKPDEANSIF-LDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVT 185
           G+  EA  I   D  L    P   T+++++   C  G+T NAI LF++L+S  + P+ +T
Sbjct: 132 GRHGEAFKILHEDIHLAGLVPGIDTYNILLDALCKSGHTDNAIELFKHLKSR-VKPELMT 191

Query: 186 YDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGF 245
           Y+IL+ GLCK+ R+  SV        M  E   SG+ P++ TY  ++K  FK +  + G 
Sbjct: 192 YNILINGLCKSRRVG-SVDW------MMRELKKSGYTPNAVTYTTMLKMYFKTKRIEKGL 251

Query: 246 EIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGV-VFNLVFYNTVVNL 305
           ++F +MK +G  F+GF     + AL++ GR EEA   M E++++G    ++V YNT++NL
Sbjct: 252 QLFLKMKKEGYTFDGFANCAVVSALIKTGRAEEAYECMHELVRSGTRSQDIVSYNTLLNL 311

Query: 306 YCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNS 365
           Y K G L+A   +L++IE +GL+ +DYTH II +GL  +GN  GA +HL  I   G   S
Sbjct: 312 YFKDGNLDAVDDLLEEIEMKGLKPDDYTHTIIVNGLLNIGNTGGAEKHLACIGEMGMQPS 371

Query: 366 NVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLLDCL 425
            VV  +CLID LCKAG +D+AM+LF  ME +D + YTS++HNLCK  R +CASKLLL C 
Sbjct: 372 -VVTCNCLIDGLCKAGHVDRAMRLFASMEVRDEFTYTSVVHNLCKDGRLVCASKLLLSCY 431

Query: 426 RSGISVFRSTQCAVIFGLCSFGFTSEARK 453
             G+ +  S + AV+ G+        ARK
Sbjct: 432 NKGMKIPSSARRAVLSGIRETVSYQAARK 451

BLAST of CSPI05G08030 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 196.4 bits (498), Expect = 6.7e-49
Identity = 124/421 (29.45%), Postives = 200/421 (47.51%), Query Frame = 1

Query: 37  GVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQC 96
           G  PD+++Y+ ++NGYC+F ++D  + +   M+  G+ PN +IY S++    R   L + 
Sbjct: 276 GYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEA 335

Query: 97  LNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFG 156
              F EM++ GI PD   YTTL   F K G    A+  F +   +D +P+  T+  +I G
Sbjct: 336 EEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISG 395

Query: 157 FCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAI 216
           FC  G    A  LF  +   GL P  VT+  L+ G CKA  +        +A  + N  I
Sbjct: 396 FCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHM-------KDAFRVHNHMI 455

Query: 217 DSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLE 276
            +G  P+  TY  L+    K  +     E+  EM   GL  N F Y + +  L + G +E
Sbjct: 456 QAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIE 515

Query: 277 EAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIIT 336
           EA   + E    G+  + V Y T+++ YCK G+++ A ++L ++  +GLQ    T  ++ 
Sbjct: 516 EAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLM 575

Query: 337 DGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETK-- 396
           +G C  G  E   + LN++   G    N    + L+ + C    +  A  +++ M ++  
Sbjct: 576 NGFCLHGMLEDGEKLLNWMLAKGIA-PNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGV 635

Query: 397 --DPYVYTSLMHNLCKARRFLCASKLLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARK 454
             D   Y +L+   CKAR    A  L  +    G SV  ST   +I G        EAR+
Sbjct: 636 GPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEARE 688

BLAST of CSPI05G08030 vs. Swiss-Prot
Match: PPR91_ARATH (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 194.9 bits (494), Expect = 1.9e-48
Identity = 114/386 (29.53%), Postives = 190/386 (49.22%), Query Frame = 1

Query: 25  KAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGITPNVFIYTSLM 84
           +A   I   +  G  PD++TY +++NG C+  D D A+++ +KM    + P V IY +++
Sbjct: 204 EAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTII 263

Query: 85  AAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANSIFLDFILKDHS 144
               +   ++  LNLF EM   GI P+V +Y++L  C    G+  +A+ +  D I +  +
Sbjct: 264 DGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKIN 323

Query: 145 PNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLCKAARLNVSVSM 204
           P+  TF  +I  F   G    A  L+  +    + P  VTY  L+ G C   RL      
Sbjct: 324 PDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRL------ 383

Query: 205 FNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNKGLAFNGFGYYT 264
            +EA  MF   +     PD  TY  L+K   K++  + G E+F EM  +GL  N   Y  
Sbjct: 384 -DEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYNI 443

Query: 265 AIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAAHKMLDKIESRG 324
            I  L + G  + A+    EM+ +GV  N++ YNT+++  CK+GKLE A  + + ++   
Sbjct: 444 LIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSK 503

Query: 325 LQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLIDRLCKAGQIDQA 384
           ++   YT+ I+ +G+CK G  E        +   G    +VVA + +I   C+ G  ++A
Sbjct: 504 MEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKG-VKPDVVAYNTMISGFCRKGSKEEA 563

Query: 385 MQLFELMETKDPYVYTSLMHNLCKAR 411
             LF+ M+       +   + L +AR
Sbjct: 564 DALFKEMKEDGTLPNSGCYNTLIRAR 581

BLAST of CSPI05G08030 vs. Swiss-Prot
Match: PP442_ARATH (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana GN=At5g61990 PE=2 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 1.8e-46
Identity = 103/375 (27.47%), Postives = 185/375 (49.33%), Query Frame = 1

Query: 14  VASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGI 73
           +  +C+    ++    + +  +  +V    TY  ++ G C   D+D AY++  +M  +G 
Sbjct: 389 IEGYCREKNVRQGYELLVEMKKRNIVISPYTYGTVVKGMCSSGDLDGAYNIVKEMIASGC 448

Query: 74  TPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANS 133
            PNV IYT+L+    +NS     + +  EM + GI PD++ Y +L     K  + DEA S
Sbjct: 449 RPNVVIYTTLIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARS 508

Query: 134 IFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLC 193
             ++ +     PN  T+   I G+      ++A    + ++  G++P  V    L+   C
Sbjct: 509 FLVEMVENGLKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYC 568

Query: 194 KAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNK 253
           K  ++        EA S +   +D G   D+ TY  LM   FK  +     EIF EM+ K
Sbjct: 569 KKGKVI-------EACSAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGK 628

Query: 254 GLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAA 313
           G+A + F Y   I    +LG +++A     EM++ G+  N++ YN ++  +C+ G++E A
Sbjct: 629 GIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKA 688

Query: 314 HKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLID 373
            ++LD++  +GL  N  T+  I DG CK G+   A R  + +   G    + V ++ L+D
Sbjct: 689 KELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKLKGLVPDSFVYTT-LVD 748

Query: 374 RLCKAGQIDQAMQLF 389
             C+   +++A+ +F
Sbjct: 749 GCCRLNDVERAITIF 755

BLAST of CSPI05G08030 vs. TrEMBL
Match: A0A0A0KL08_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171145 PE=4 SV=1)

HSP 1 Score: 825.9 bits (2132), Expect = 2.5e-236
Identity = 402/403 (99.75%), Postives = 403/403 (100.00%), Query Frame = 1

Query: 58  MDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTT 117
           MDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTT
Sbjct: 1   MDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTT 60

Query: 118 LTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHG 177
           LTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHG
Sbjct: 61  LTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHG 120

Query: 178 LVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKF 237
           LVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKF
Sbjct: 121 LVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKF 180

Query: 238 REYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFY 297
           REYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFY
Sbjct: 181 REYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFY 240

Query: 298 NTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP 357
           NTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP
Sbjct: 241 NTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP 300

Query: 358 SGFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASK 417
           +GFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASK
Sbjct: 301 TGFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASK 360

Query: 418 LLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
           LLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS
Sbjct: 361 LLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 403

BLAST of CSPI05G08030 vs. TrEMBL
Match: A0A0A0KYL3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G481190 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 5.9e-161
Identity = 278/433 (64.20%), Postives = 339/433 (78.29%), Query Frame = 1

Query: 22  QFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGITPNVFIYT 81
           Q QKAE  I D IR+GV+PD++TYN LI+GYC+FS MDAAYSV ++MR+AGI+P+V  Y 
Sbjct: 12  QMQKAEAVIIDGIRIGVLPDVVTYNTLIDGYCRFSGMDAAYSVLYRMREAGISPDVITYN 71

Query: 82  SLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANSIFLDFILK 141
           SL+A A+RN SLEQ L+LF EMLQ GITPD+WSY TL HCF  +GKPDEA  +F D ILK
Sbjct: 72  SLIAGATRNFSLEQSLDLFEEMLQSGITPDIWSYNTLMHCFFILGKPDEAYRVFKDIILK 131

Query: 142 DHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLCKAARLNVS 201
           D SP+P TF+ MI G C  GYTSNAI LFRNLQ HG +P+ VTY+IL+ GLCK  RL  +
Sbjct: 132 DLSPHPVTFNTMINGLCKHGYTSNAIMLFRNLQRHGFIPQLVTYNILINGLCKVDRLRAA 191

Query: 202 VSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNKGLAFNGFG 261
           + M NEAM       DSG EP++ TY  LMK CF+ R+Y+ GFEIF +MKNKG AF+GF 
Sbjct: 192 IRMLNEAM-------DSGLEPNAVTYTTLMKSCFRSRQYERGFEIFSKMKNKGYAFDGFA 251

Query: 262 YYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAAHKMLDKIE 321
           Y T  GA L+LGR EEAKF M +MIKN V  ++ FYNT +NLYCK GKLEAA+K+ D+IE
Sbjct: 252 YCTVSGAFLKLGRFEEAKFCMEQMIKNDVGIDITFYNTFINLYCKEGKLEAAYKLFDEIE 311

Query: 322 SRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLIDRLCKAGQI 381
            RGL+C+ YTH+IIT+GLC+VGN EGA +HLN +Y +GF  SN+VA +CLIDRLCKAGQI
Sbjct: 312 PRGLECDVYTHSIITNGLCRVGNIEGAMQHLNCVYTTGFA-SNLVALNCLIDRLCKAGQI 371

Query: 382 DQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLLDCLRSGISVFRSTQCAVIFGL 441
           D+A++LFE MET+D + YTSL+HNLCKARRF CASKLL+ C R G+ V ++T+ AVI GL
Sbjct: 372 DRAIRLFESMETRDSFTYTSLVHNLCKARRFRCASKLLISCSRDGMKVLKATRRAVIDGL 431

Query: 442 CSFGFTSEARKLK 455
           CS GFTSEARKLK
Sbjct: 432 CSSGFTSEARKLK 436

BLAST of CSPI05G08030 vs. TrEMBL
Match: A0A061DSD0_THECC (Pentatricopeptide repeat superfamily protein, putative isoform 4 OS=Theobroma cacao GN=TCM_001742 PE=4 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 2.5e-143
Identity = 243/460 (52.83%), Postives = 333/460 (72.39%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MV + ST+LL +C+ASFCK+H+ +KAE  I D IRLGV+PD +TYNILI+ YC+   +DA
Sbjct: 34  MVRRLSTRLLNVCIASFCKAHKLEKAESVIIDGIRLGVLPDGVTYNILIDAYCRLVGIDA 93

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
            YSV H+M +AGI+P+V  Y SL+A A+RN  + +  NLF EM+Q GI PD+WSY TL H
Sbjct: 94  GYSVLHRMGEAGISPDVISYNSLIAGATRNCQISRSFNLFDEMIQRGIAPDIWSYNTLMH 153

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
            F K+GKPDEAN +F D IL +H P  ATF++M+ G C  GYT NA  LFRNLQ HG VP
Sbjct: 154 GFFKLGKPDEANRVFRDIILVEHLPCVATFNIMMNGLCKNGYTENAFMLFRNLQRHGFVP 213

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           + +TY+IL++GLCK  RL +       AM +F E ++SG  P++ TY  +++CCF+ R++
Sbjct: 214 ELMTYNILVSGLCKIGRLGL-------AMRIFKEIVESGHVPNAITYTPVLRCCFRKRKF 273

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           + G E+  EMK+KG  F+GF Y T IGAL+++G+++EA  FMVEM++ G+  ++V YNT+
Sbjct: 274 EEGLELLSEMKSKGYTFDGFAYCTVIGALIKIGKMKEATEFMVEMMRTGIELDIVSYNTL 333

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           +N+YCK  KLE A+K+LD IE +GL+C+ YTH I+ D LCK GN EGA RHL Y+   GF
Sbjct: 334 INMYCKDNKLEEAYKLLDDIEKKGLECDKYTHTIMIDALCKAGNIEGAARHLKYMSTMGF 393

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
            +SN+ A +C ID LCK GQID AM++F+ ME +D + Y+SL+HNLC+A R+  ASKLLL
Sbjct: 394 -DSNLAAYNCFIDGLCKVGQIDNAMKVFKSMEVRDSFTYSSLVHNLCRAGRYRSASKLLL 453

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
            CLRSG+ + +S Q AV+ GL   GF  EARK++  I ++
Sbjct: 454 SCLRSGMKILKSAQRAVLSGLRYSGFPGEARKVQSKIRIA 485

BLAST of CSPI05G08030 vs. TrEMBL
Match: A0A061DJT8_THECC (Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_001742 PE=4 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 2.5e-143
Identity = 243/460 (52.83%), Postives = 333/460 (72.39%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MV + ST+LL +C+ASFCK+H+ +KAE  I D IRLGV+PD +TYNILI+ YC+   +DA
Sbjct: 49  MVRRLSTRLLNVCIASFCKAHKLEKAESVIIDGIRLGVLPDGVTYNILIDAYCRLVGIDA 108

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
            YSV H+M +AGI+P+V  Y SL+A A+RN  + +  NLF EM+Q GI PD+WSY TL H
Sbjct: 109 GYSVLHRMGEAGISPDVISYNSLIAGATRNCQISRSFNLFDEMIQRGIAPDIWSYNTLMH 168

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
            F K+GKPDEAN +F D IL +H P  ATF++M+ G C  GYT NA  LFRNLQ HG VP
Sbjct: 169 GFFKLGKPDEANRVFRDIILVEHLPCVATFNIMMNGLCKNGYTENAFMLFRNLQRHGFVP 228

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           + +TY+IL++GLCK  RL +       AM +F E ++SG  P++ TY  +++CCF+ R++
Sbjct: 229 ELMTYNILVSGLCKIGRLGL-------AMRIFKEIVESGHVPNAITYTPVLRCCFRKRKF 288

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           + G E+  EMK+KG  F+GF Y T IGAL+++G+++EA  FMVEM++ G+  ++V YNT+
Sbjct: 289 EEGLELLSEMKSKGYTFDGFAYCTVIGALIKIGKMKEATEFMVEMMRTGIELDIVSYNTL 348

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           +N+YCK  KLE A+K+LD IE +GL+C+ YTH I+ D LCK GN EGA RHL Y+   GF
Sbjct: 349 INMYCKDNKLEEAYKLLDDIEKKGLECDKYTHTIMIDALCKAGNIEGAARHLKYMSTMGF 408

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
            +SN+ A +C ID LCK GQID AM++F+ ME +D + Y+SL+HNLC+A R+  ASKLLL
Sbjct: 409 -DSNLAAYNCFIDGLCKVGQIDNAMKVFKSMEVRDSFTYSSLVHNLCRAGRYRSASKLLL 468

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
            CLRSG+ + +S Q AV+ GL   GF  EARK++  I ++
Sbjct: 469 SCLRSGMKILKSAQRAVLSGLRYSGFPGEARKVQSKIRIA 500

BLAST of CSPI05G08030 vs. TrEMBL
Match: A0A061DKD7_THECC (Pentatricopeptide repeat superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_001742 PE=4 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 2.5e-143
Identity = 243/460 (52.83%), Postives = 333/460 (72.39%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MV + ST+LL +C+ASFCK+H+ +KAE  I D IRLGV+PD +TYNILI+ YC+   +DA
Sbjct: 1   MVRRLSTRLLNVCIASFCKAHKLEKAESVIIDGIRLGVLPDGVTYNILIDAYCRLVGIDA 60

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
            YSV H+M +AGI+P+V  Y SL+A A+RN  + +  NLF EM+Q GI PD+WSY TL H
Sbjct: 61  GYSVLHRMGEAGISPDVISYNSLIAGATRNCQISRSFNLFDEMIQRGIAPDIWSYNTLMH 120

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
            F K+GKPDEAN +F D IL +H P  ATF++M+ G C  GYT NA  LFRNLQ HG VP
Sbjct: 121 GFFKLGKPDEANRVFRDIILVEHLPCVATFNIMMNGLCKNGYTENAFMLFRNLQRHGFVP 180

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           + +TY+IL++GLCK  RL +       AM +F E ++SG  P++ TY  +++CCF+ R++
Sbjct: 181 ELMTYNILVSGLCKIGRLGL-------AMRIFKEIVESGHVPNAITYTPVLRCCFRKRKF 240

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           + G E+  EMK+KG  F+GF Y T IGAL+++G+++EA  FMVEM++ G+  ++V YNT+
Sbjct: 241 EEGLELLSEMKSKGYTFDGFAYCTVIGALIKIGKMKEATEFMVEMMRTGIELDIVSYNTL 300

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           +N+YCK  KLE A+K+LD IE +GL+C+ YTH I+ D LCK GN EGA RHL Y+   GF
Sbjct: 301 INMYCKDNKLEEAYKLLDDIEKKGLECDKYTHTIMIDALCKAGNIEGAARHLKYMSTMGF 360

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
            +SN+ A +C ID LCK GQID AM++F+ ME +D + Y+SL+HNLC+A R+  ASKLLL
Sbjct: 361 -DSNLAAYNCFIDGLCKVGQIDNAMKVFKSMEVRDSFTYSSLVHNLCRAGRYRSASKLLL 420

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
            CLRSG+ + +S Q AV+ GL   GF  EARK++  I ++
Sbjct: 421 SCLRSGMKILKSAQRAVLSGLRYSGFPGEARKVQSKIRIA 452

BLAST of CSPI05G08030 vs. TAIR10
Match: AT5G46680.1 (AT5G46680.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 367.1 bits (941), Expect = 1.6e-101
Identity = 196/449 (43.65%), Postives = 282/449 (62.81%), Query Frame = 1

Query: 6   STKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVF 65
           STKLL I V S CK    ++AE  + D IRLGV+PD++TYN LI GY +F  +D AY+V 
Sbjct: 12  STKLLNISVNSLCKFRNLERAETLLIDGIRLGVLPDVITYNTLIKGYTRFIGIDEAYAVT 71

Query: 66  HKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKV 125
            +MR+AGI P+V  Y SL++ A++N  L + L LF EML  G++PD+WSY TL  C+ K+
Sbjct: 72  RRMREAGIEPDVTTYNSLISGAAKNLMLNRVLQLFDEMLHSGLSPDMWSYNTLMSCYFKL 131

Query: 126 GKPDEANSIF-LDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVT 185
           G+  EA  I   D  L    P   T+++++   C  G+T NAI LF++L+S  + P+ +T
Sbjct: 132 GRHGEAFKILHEDIHLAGLVPGIDTYNILLDALCKSGHTDNAIELFKHLKSR-VKPELMT 191

Query: 186 YDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGF 245
           Y+IL+ GLCK+ R+  SV        M  E   SG+ P++ TY  ++K  FK +  + G 
Sbjct: 192 YNILINGLCKSRRVG-SVDW------MMRELKKSGYTPNAVTYTTMLKMYFKTKRIEKGL 251

Query: 246 EIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGV-VFNLVFYNTVVNL 305
           ++F +MK +G  F+GF     + AL++ GR EEA   M E++++G    ++V YNT++NL
Sbjct: 252 QLFLKMKKEGYTFDGFANCAVVSALIKTGRAEEAYECMHELVRSGTRSQDIVSYNTLLNL 311

Query: 306 YCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNS 365
           Y K G L+A   +L++IE +GL+ +DYTH II +GL  +GN  GA +HL  I   G   S
Sbjct: 312 YFKDGNLDAVDDLLEEIEMKGLKPDDYTHTIIVNGLLNIGNTGGAEKHLACIGEMGMQPS 371

Query: 366 NVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLLDCL 425
            VV  +CLID LCKAG +D+AM+LF  ME +D + YTS++HNLCK  R +CASKLLL C 
Sbjct: 372 -VVTCNCLIDGLCKAGHVDRAMRLFASMEVRDEFTYTSVVHNLCKDGRLVCASKLLLSCY 431

Query: 426 RSGISVFRSTQCAVIFGLCSFGFTSEARK 453
             G+ +  S + AV+ G+        ARK
Sbjct: 432 NKGMKIPSSARRAVLSGIRETVSYQAARK 451

BLAST of CSPI05G08030 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 196.4 bits (498), Expect = 3.8e-50
Identity = 124/421 (29.45%), Postives = 200/421 (47.51%), Query Frame = 1

Query: 37  GVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQC 96
           G  PD+++Y+ ++NGYC+F ++D  + +   M+  G+ PN +IY S++    R   L + 
Sbjct: 276 GYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEA 335

Query: 97  LNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFG 156
              F EM++ GI PD   YTTL   F K G    A+  F +   +D +P+  T+  +I G
Sbjct: 336 EEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISG 395

Query: 157 FCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAI 216
           FC  G    A  LF  +   GL P  VT+  L+ G CKA  +        +A  + N  I
Sbjct: 396 FCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHM-------KDAFRVHNHMI 455

Query: 217 DSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLE 276
            +G  P+  TY  L+    K  +     E+  EM   GL  N F Y + +  L + G +E
Sbjct: 456 QAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIE 515

Query: 277 EAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIIT 336
           EA   + E    G+  + V Y T+++ YCK G+++ A ++L ++  +GLQ    T  ++ 
Sbjct: 516 EAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLM 575

Query: 337 DGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETK-- 396
           +G C  G  E   + LN++   G    N    + L+ + C    +  A  +++ M ++  
Sbjct: 576 NGFCLHGMLEDGEKLLNWMLAKGIA-PNATTFNSLVKQYCIRNNLKAATAIYKDMCSRGV 635

Query: 397 --DPYVYTSLMHNLCKARRFLCASKLLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARK 454
             D   Y +L+   CKAR    A  L  +    G SV  ST   +I G        EAR+
Sbjct: 636 GPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEARE 688

BLAST of CSPI05G08030 vs. TAIR10
Match: AT1G62670.1 (AT1G62670.1 rna processing factor 2)

HSP 1 Score: 194.9 bits (494), Expect = 1.1e-49
Identity = 114/386 (29.53%), Postives = 190/386 (49.22%), Query Frame = 1

Query: 25  KAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGITPNVFIYTSLM 84
           +A   I   +  G  PD++TY +++NG C+  D D A+++ +KM    + P V IY +++
Sbjct: 204 EAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTII 263

Query: 85  AAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANSIFLDFILKDHS 144
               +   ++  LNLF EM   GI P+V +Y++L  C    G+  +A+ +  D I +  +
Sbjct: 264 DGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKIN 323

Query: 145 PNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLCKAARLNVSVSM 204
           P+  TF  +I  F   G    A  L+  +    + P  VTY  L+ G C   RL      
Sbjct: 324 PDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRL------ 383

Query: 205 FNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNKGLAFNGFGYYT 264
            +EA  MF   +     PD  TY  L+K   K++  + G E+F EM  +GL  N   Y  
Sbjct: 384 -DEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYNI 443

Query: 265 AIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAAHKMLDKIESRG 324
            I  L + G  + A+    EM+ +GV  N++ YNT+++  CK+GKLE A  + + ++   
Sbjct: 444 LIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSK 503

Query: 325 LQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLIDRLCKAGQIDQA 384
           ++   YT+ I+ +G+CK G  E        +   G    +VVA + +I   C+ G  ++A
Sbjct: 504 MEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKG-VKPDVVAYNTMISGFCRKGSKEEA 563

Query: 385 MQLFELMETKDPYVYTSLMHNLCKAR 411
             LF+ M+       +   + L +AR
Sbjct: 564 DALFKEMKEDGTLPNSGCYNTLIRAR 581

BLAST of CSPI05G08030 vs. TAIR10
Match: AT5G61990.1 (AT5G61990.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 188.3 bits (477), Expect = 1.0e-47
Identity = 103/375 (27.47%), Postives = 185/375 (49.33%), Query Frame = 1

Query: 14  VASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYSVFHKMRDAGI 73
           +  +C+    ++    + +  +  +V    TY  ++ G C   D+D AY++  +M  +G 
Sbjct: 389 IEGYCREKNVRQGYELLVEMKKRNIVISPYTYGTVVKGMCSSGDLDGAYNIVKEMIASGC 448

Query: 74  TPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFLKVGKPDEANS 133
            PNV IYT+L+    +NS     + +  EM + GI PD++ Y +L     K  + DEA S
Sbjct: 449 RPNVVIYTTLIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARS 508

Query: 134 IFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFVTYDILLTGLC 193
             ++ +     PN  T+   I G+      ++A    + ++  G++P  V    L+   C
Sbjct: 509 FLVEMVENGLKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYC 568

Query: 194 KAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHGFEIFFEMKNK 253
           K  ++        EA S +   +D G   D+ TY  LM   FK  +     EIF EM+ K
Sbjct: 569 KKGKVI-------EACSAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGK 628

Query: 254 GLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNLYCKHGKLEAA 313
           G+A + F Y   I    +LG +++A     EM++ G+  N++ YN ++  +C+ G++E A
Sbjct: 629 GIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKA 688

Query: 314 HKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNSNVVASSCLID 373
            ++LD++  +GL  N  T+  I DG CK G+   A R  + +   G    + V ++ L+D
Sbjct: 689 KELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKLKGLVPDSFVYTT-LVD 748

Query: 374 RLCKAGQIDQAMQLF 389
             C+   +++A+ +F
Sbjct: 749 GCCRLNDVERAITIF 755

BLAST of CSPI05G08030 vs. TAIR10
Match: AT1G62910.1 (AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 187.6 bits (475), Expect = 1.7e-47
Identity = 114/407 (28.01%), Postives = 192/407 (47.17%), Query Frame = 1

Query: 4   KYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDAAYS 63
           K  T   T  +      ++  +A   +   ++ G  PD++TY  ++NG C+  D+D A S
Sbjct: 185 KPDTFTFTTLIHGLFLHNKASEAVALVDQMVQRGCQPDLVTYGTVVNGLCKRGDIDLALS 244

Query: 64  VFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTHCFL 123
           +  KM    I  +V IY +++    +   ++  LNLF EM   GI PDV++Y++L  C  
Sbjct: 245 LLKKMEKGKIEADVVIYNTIIDGLCKYKHMDDALNLFTEMDNKGIRPDVFTYSSLISCLC 304

Query: 124 KVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVPKFV 183
             G+  +A+ +  D I +  +PN  TF  +I  F   G    A  L+  +    + P   
Sbjct: 305 NYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIF 364

Query: 184 TYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREYQHG 243
           TY  L+ G C   RL       +EA  MF   I     P+  TY  L+K   K +  + G
Sbjct: 365 TYSSLINGFCMHDRL-------DEAKHMFELMISKDCFPNVVTYSTLIKGFCKAKRVEEG 424

Query: 244 FEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTVVNL 303
            E+F EM  +GL  N   Y T I    +    + A+    +M+  GV  N++ YN +++ 
Sbjct: 425 MELFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSVGVHPNILTYNILLDG 484

Query: 304 YCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGFTNS 363
            CK+GKL  A  + + ++   ++ + YT+ I+ +G+CK G  E        +   G  + 
Sbjct: 485 LCKNGKLAKAMVVFEYLQRSTMEPDIYTYNIMIEGMCKAGKVEDGWELFCNLSLKG-VSP 544

Query: 364 NVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKAR 411
           NV+A + +I   C+ G  ++A  L + M+   P   +   + L +AR
Sbjct: 545 NVIAYNTMISGFCRKGSKEEADSLLKKMKEDGPLPNSGTYNTLIRAR 583

BLAST of CSPI05G08030 vs. NCBI nr
Match: gi|449451938|ref|XP_004143717.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucumis sativus])

HSP 1 Score: 938.3 bits (2424), Expect = 5.0e-270
Identity = 459/460 (99.78%), Postives = 460/460 (100.00%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA
Sbjct: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
           AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH
Sbjct: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
           CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP
Sbjct: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY
Sbjct: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV
Sbjct: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP+GF
Sbjct: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPTGF 360

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
           TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL
Sbjct: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
           DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS
Sbjct: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 460

BLAST of CSPI05G08030 vs. NCBI nr
Match: gi|659073130|ref|XP_008467270.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucumis melo])

HSP 1 Score: 880.6 bits (2274), Expect = 1.2e-252
Identity = 429/458 (93.67%), Postives = 442/458 (96.51%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MVSKYSTKLLTICVASFCKS QFQKAEIAIKDAIRLGVVPDILTYNIL+NGYCQFSDMDA
Sbjct: 1   MVSKYSTKLLTICVASFCKSQQFQKAEIAIKDAIRLGVVPDILTYNILVNGYCQFSDMDA 60

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
           AYSV +KMR+AGITPNVFIYTSLMAAASRNSSLEQCLNLF EMLQL I PD+WSYTTL H
Sbjct: 61  AYSVLYKMREAGITPNVFIYTSLMAAASRNSSLEQCLNLFDEMLQLRINPDIWSYTTLIH 120

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
           CFLKVGKP+EANSIFLD ILKDH PNPATFDVMIFG CSCGYTSNAI LFRNLQSHG VP
Sbjct: 121 CFLKVGKPNEANSIFLDIILKDHCPNPATFDVMIFGLCSCGYTSNAIALFRNLQSHGFVP 180

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           KFVTY+ILLTGLCKAARLNVS+SMFNEAMSMF+EAID GFEPD TTYIALMKCCFKFREY
Sbjct: 181 KFVTYNILLTGLCKAARLNVSMSMFNEAMSMFDEAIDLGFEPDYTTYIALMKCCFKFREY 240

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEA FFMVEMIKNGVVF+LVFYNTV
Sbjct: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEANFFMVEMIKNGVVFDLVFYNTV 300

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           VNLYCKHGKLEAAHKMLDKIESRGLQC+DYTHAIITDGLCKVGNFEGARRHLNY+YP+GF
Sbjct: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCDDYTHAIITDGLCKVGNFEGARRHLNYMYPTGF 360

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
           TNSNVVASSCLIDRLCKAGQIDQAM+LFELMETKDPY YTSLMHNLCKARRFLCASKLLL
Sbjct: 361 TNSNVVASSCLIDRLCKAGQIDQAMKLFELMETKDPYTYTSLMHNLCKARRFLCASKLLL 420

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIH 459
           DCLRSGISVFRSTQCAVIFGLCS GFTSEARKLKPFIH
Sbjct: 421 DCLRSGISVFRSTQCAVIFGLCSLGFTSEARKLKPFIH 458

BLAST of CSPI05G08030 vs. NCBI nr
Match: gi|700195203|gb|KGN50380.1| (hypothetical protein Csa_5G171145 [Cucumis sativus])

HSP 1 Score: 825.9 bits (2132), Expect = 3.6e-236
Identity = 402/403 (99.75%), Postives = 403/403 (100.00%), Query Frame = 1

Query: 58  MDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTT 117
           MDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTT
Sbjct: 1   MDAAYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTT 60

Query: 118 LTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHG 177
           LTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHG
Sbjct: 61  LTHCFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHG 120

Query: 178 LVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKF 237
           LVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKF
Sbjct: 121 LVPKFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKF 180

Query: 238 REYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFY 297
           REYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFY
Sbjct: 181 REYQHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFY 240

Query: 298 NTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP 357
           NTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP
Sbjct: 241 NTVVNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYP 300

Query: 358 SGFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASK 417
           +GFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASK
Sbjct: 301 TGFTNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASK 360

Query: 418 LLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
           LLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS
Sbjct: 361 LLLDCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 403

BLAST of CSPI05G08030 vs. NCBI nr
Match: gi|659083068|ref|XP_008442168.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucumis melo])

HSP 1 Score: 610.9 bits (1574), Expect = 1.8e-171
Identity = 296/460 (64.35%), Postives = 363/460 (78.91%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MV KYSTK L ICVASFCKS Q QKAE  I D IR+GV+P+++TYN LI+GYC+FS MDA
Sbjct: 1   MVCKYSTKFLNICVASFCKSQQMQKAEAVIIDGIRIGVLPNVVTYNTLIDGYCRFSGMDA 60

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
           AYSV ++MR+AGI+P+V  Y SL+A A+RN SLEQ L+LF EMLQ GITPD+WSY TL H
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGATRNFSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
           CF  +GKPDEA  +F D ILKD SP+P TF+ MI G C  GYTSNA+ LFRNLQ HG +P
Sbjct: 121 CFFILGKPDEAYRVFKDIILKDLSPHPVTFNTMINGLCKHGYTSNAVMLFRNLQRHGFIP 180

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           + VTY+IL+ GLCK +RL         A+ M NEA+DSG EP++ TY  LMK CF+ R+Y
Sbjct: 181 QLVTYNILINGLCKVSRLRA-------AIRMLNEAVDSGLEPNAVTYTTLMKSCFRSRQY 240

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           +HGFEIF +MK+KG AF+GF Y T IGA L+LGR EEA     +MIKN V  ++ FYNT+
Sbjct: 241 EHGFEIFSKMKSKGYAFDGFAYCTVIGAFLKLGRFEEANSCTEQMIKNDVGIDMTFYNTL 300

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           +NLYCK GKLEAA+K+LD+IESRGL+C+DYTH+IIT+GLC+VGN EGA +HLN +Y +GF
Sbjct: 301 INLYCKEGKLEAAYKLLDQIESRGLECDDYTHSIITNGLCRVGNIEGAMQHLNCVYTTGF 360

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
             SN+VA +CLIDRLCKAGQID+A++LFE MET+D + YTSL+HNLCKARRF CASKLL+
Sbjct: 361 A-SNLVALNCLIDRLCKAGQIDRAIRLFESMETRDSFTYTSLVHNLCKARRFRCASKLLI 420

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLKPFIHLS 461
            C R GI + R+T+ AVI GL S GFTSEARKLK  +HL+
Sbjct: 421 SCSRGGIKILRATRRAVIDGLYSSGFTSEARKLKFKLHLA 452

BLAST of CSPI05G08030 vs. NCBI nr
Match: gi|778694864|ref|XP_011653883.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucumis sativus])

HSP 1 Score: 607.8 bits (1566), Expect = 1.5e-170
Identity = 295/454 (64.98%), Postives = 356/454 (78.41%), Query Frame = 1

Query: 1   MVSKYSTKLLTICVASFCKSHQFQKAEIAIKDAIRLGVVPDILTYNILINGYCQFSDMDA 60
           MV KYSTK L ICVASFCKS Q QKAE  I D IR+GV+PD++TYN LI+GYC+FS MDA
Sbjct: 1   MVCKYSTKFLNICVASFCKSQQMQKAEAVIIDGIRIGVLPDVVTYNTLIDGYCRFSGMDA 60

Query: 61  AYSVFHKMRDAGITPNVFIYTSLMAAASRNSSLEQCLNLFHEMLQLGITPDVWSYTTLTH 120
           AYSV ++MR+AGI+P+V  Y SL+A A+RN SLEQ L+LF EMLQ GITPD+WSY TL H
Sbjct: 61  AYSVLYRMREAGISPDVITYNSLIAGATRNFSLEQSLDLFEEMLQSGITPDIWSYNTLMH 120

Query: 121 CFLKVGKPDEANSIFLDFILKDHSPNPATFDVMIFGFCSCGYTSNAITLFRNLQSHGLVP 180
           CF  +GKPDEA  +F D ILKD SP+P TF+ MI G C  GYTSNAI LFRNLQ HG +P
Sbjct: 121 CFFILGKPDEAYRVFKDIILKDLSPHPVTFNTMINGLCKHGYTSNAIMLFRNLQRHGFIP 180

Query: 181 KFVTYDILLTGLCKAARLNVSVSMFNEAMSMFNEAIDSGFEPDSTTYIALMKCCFKFREY 240
           + VTY+IL+ GLCK  RL  ++ M NEAM       DSG EP++ TY  LMK CF+ R+Y
Sbjct: 181 QLVTYNILINGLCKVDRLRAAIRMLNEAM-------DSGLEPNAVTYTTLMKSCFRSRQY 240

Query: 241 QHGFEIFFEMKNKGLAFNGFGYYTAIGALLRLGRLEEAKFFMVEMIKNGVVFNLVFYNTV 300
           + GFEIF +MKNKG AF+GF Y T  GA L+LGR EEAKF M +MIKN V  ++ FYNT 
Sbjct: 241 ERGFEIFSKMKNKGYAFDGFAYCTVSGAFLKLGRFEEAKFCMEQMIKNDVGIDITFYNTF 300

Query: 301 VNLYCKHGKLEAAHKMLDKIESRGLQCNDYTHAIITDGLCKVGNFEGARRHLNYIYPSGF 360
           +NLYCK GKLEAA+K+ D+IE RGL+C+ YTH+IIT+GLC+VGN EGA +HLN +Y +GF
Sbjct: 301 INLYCKEGKLEAAYKLFDEIEPRGLECDVYTHSIITNGLCRVGNIEGAMQHLNCVYTTGF 360

Query: 361 TNSNVVASSCLIDRLCKAGQIDQAMQLFELMETKDPYVYTSLMHNLCKARRFLCASKLLL 420
             SN+VA +CLIDRLCKAGQID+A++LFE MET+D + YTSL+HNLCKARRF CASKLL+
Sbjct: 361 A-SNLVALNCLIDRLCKAGQIDRAIRLFESMETRDSFTYTSLVHNLCKARRFRCASKLLI 420

Query: 421 DCLRSGISVFRSTQCAVIFGLCSFGFTSEARKLK 455
            C R G+ V ++T+ AVI GLCS GFTSEARKLK
Sbjct: 421 SCSRDGMKVLKATRRAVIDGLCSSGFTSEARKLK 446

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP318_ARATH1.8e-11044.74Putative pentatricopeptide repeat-containing protein At4g17915 OS=Arabidopsis th... [more]
PP421_ARATH2.9e-10043.65Pentatricopeptide repeat-containing protein At5g46680 OS=Arabidopsis thaliana GN... [more]
PPR12_ARATH6.7e-4929.45Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
PPR91_ARATH1.9e-4829.53Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
PP442_ARATH1.8e-4627.47Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KL08_CUCSA2.5e-23699.75Uncharacterized protein OS=Cucumis sativus GN=Csa_5G171145 PE=4 SV=1[more]
A0A0A0KYL3_CUCSA5.9e-16164.20Uncharacterized protein OS=Cucumis sativus GN=Csa_4G481190 PE=4 SV=1[more]
A0A061DSD0_THECC2.5e-14352.83Pentatricopeptide repeat superfamily protein, putative isoform 4 OS=Theobroma ca... [more]
A0A061DJT8_THECC2.5e-14352.83Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma ca... [more]
A0A061DKD7_THECC2.5e-14352.83Pentatricopeptide repeat superfamily protein, putative isoform 2 OS=Theobroma ca... [more]
Match NameE-valueIdentityDescription
AT5G46680.11.6e-10143.65 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G05670.13.8e-5029.45 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G62670.11.1e-4929.53 rna processing factor 2[more]
AT5G61990.11.0e-4727.47 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G62910.11.7e-4728.01 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449451938|ref|XP_004143717.1|5.0e-27099.78PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucum... [more]
gi|659073130|ref|XP_008467270.1|1.2e-25293.67PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucum... [more]
gi|700195203|gb|KGN50380.1|3.6e-23699.75hypothetical protein Csa_5G171145 [Cucumis sativus][more]
gi|659083068|ref|XP_008442168.1|1.8e-17164.35PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucum... [more]
gi|778694864|ref|XP_011653883.1|1.5e-17064.98PREDICTED: putative pentatricopeptide repeat-containing protein At4g17915 [Cucum... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI05G08030.1CSPI05G08030.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 264..290
score: 0.028coord: 398..427
score: 0.19coord: 114..137
score: 0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 369..392
score: 2.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 145..194
score: 2.7E-8coord: 293..341
score: 7.8E-13coord: 40..84
score: 1.9
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 211..256
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 296..328
score: 2.5E-7coord: 369..394
score: 3.4E-5coord: 44..77
score: 1.5E-10coord: 79..112
score: 1.1E-5coord: 183..223
score: 0.0027coord: 226..255
score: 2.1E-4coord: 114..146
score: 0.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 399..429
score: 5.448coord: 181..211
score: 7.125coord: 146..180
score: 10.457coord: 364..398
score: 9.427coord: 328..362
score: 7.18coord: 41..75
score: 13.493coord: 223..257
score: 9.624coord: 111..145
score: 9.175coord: 76..110
score: 10.643coord: 293..327
score: 11.181coord: 6..40
score: 6.95coord: 258..292
score: 8
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 375..404
score: 1.9E-6coord: 191..236
score: 1.9E-6coord: 20..128
score: 1.9E-6coord: 272..342
score: 1.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 196..319
score: 2.09E-7coord: 21..152
score: 2.0
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 9..454
score: 1.0E
NoneNo IPR availablePANTHERPTHR24015:SF489SUBFAMILY NOT NAMEDcoord: 9..454
score: 1.0E