CSPI04G22400 (gene) Wild cucumber (PI 183967)

NameCSPI04G22400
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing family protein
LocationChr4 : 20710811 .. 20712427 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCGCCAGAAAGCATGGGGATGAAGAAAACAAGCTTCAAGCTTCCCTTGGCCTTGATCGTTTCTTTTGGATGTAACAATGGCCGCTCTCTCCCTTTCCCCCTCTCTCAACCATTCACAATCCTTTCCCCCCATTTGTACCCCAAATCTTTTTGTTCTCAGTCGGTGAACGGCAAATCGAGGAACAGTACTGGCCACCGGGAAACCAACGACACCCAAATGAAGACCCAAAAGCCAATTTCTCCTTTTCGATTGTCTTCACTTCTTCGTCTCCAAAAGGACCCAACGCTCGCTCTTCAACTCTTCCTCAACCCTAATCCCAGCTCTTCAGAACCCCCCAAACCCTTTCGTTACTCTTTACTTTCCTACGATCTCATCATCTCTAAGCTCGGACGTGCCAAAATGTTCGATGAAATGGAAGAAATCCTCCAACAGCTCAAGCAAGAGACTCGTTTTGCTCCTCACGAGGTCATTTTCTGTAATGTCATTGCCTTCTATGGCCGAGCCCACCTCCCTGATCGTGCCTTCCAAGTGTTCGAAAGAATTCCATCGTTCCGTTGCAAGCGGACGGTGAAATCTGTAAATTCTTTGTTGGCTGCATTGTTGAAGAATCGGCAACTTGAGAAAATGACGCAAGTTTTTGTGGACATTAGTAACTATGGCTCCCCTGATGCTTGTACTTTTAATATTTTGATTCATGCTGCTTGTTTATGTGGTGATTTGGATGCTGTATGGGGAGTGTTTGATGAAATGCAAAAGAGAGGTGTAAAACCGAACGTGGTTACTTTTGGGACTCTGATTTATGGGCTTTCTCTGAATTCTAAGTTGAAAGAAGCATTGAGATTGAAAGAGGATATGGTGAAAGTGTACATGATTAAACCTAATGCGTCAATATATACTACTTTGATCAAAGGATTTTGTGGGGTTGGAGAATTGAATTTCGCTTTCAAGCTAAAGGAGGAGATGGTTACTAGCAACGTAAAATTGGTCTCGGCAGTTTACTCTACTTTGATCAGTGCGCTCTTCAAGCATGGTAGGAAAGAAGAAGTTTCTGACATTTTAAGAGAAATGGGAGAGAATGGGTGCAAACCTGATACCGTTACCTACAATGCCATTATCAATGGACATTGTAAAGAAAACGATCTCGAATCTGCACATAGAGTTATGGATGAAATGGTGGAGAAAGGGTGTAAGCCAGATGTGTTCAGTTTCAACACAATCATTGGATGGTTATGTAAGGAGGGGAAATTAGATAAAGCAATGGACTTGCTTGAAGATATGCCGAGACGAGGTTGCCCTCCTGATGTGTTATCATACAGGATAATTTTTGATGGGCTGTGTGAAATGATGCAGCTAAAGGAAGCAACTTCCATACTGGACGAGATGATCTTCAAGGGTTATGTGCCTCGTAATGAAAGCATAAACAAACTCGTAGACAGGTTGTGCCAGGAATGCAATATGGAGCTGTTGTGGATGGTCTTAAACAGTCTGGGAAGAGGAAATCGCATGAATATGGACATGTGGGCTAGAGTTGTTGCTTTTGTTTATAAGGAGAACCTATCGGAATCCTCCAACTTAATTGACTCATTGATTAGTTGATTTCATAGCTCTATCAGGGAC

mRNA sequence

ATGGGGATGAAGAAAACAAGCTTCAAGCTTCCCTTGGCCTTGATCGTTTCTTTTGGATGTAACAATGGCCGCTCTCTCCCTTTCCCCCTCTCTCAACCATTCACAATCCTTTCCCCCCATTTGTACCCCAAATCTTTTTGTTCTCAGTCGGTGAACGGCAAATCGAGGAACAGTACTGGCCACCGGGAAACCAACGACACCCAAATGAAGACCCAAAAGCCAATTTCTCCTTTTCGATTGTCTTCACTTCTTCGTCTCCAAAAGGACCCAACGCTCGCTCTTCAACTCTTCCTCAACCCTAATCCCAGCTCTTCAGAACCCCCCAAACCCTTTCGTTACTCTTTACTTTCCTACGATCTCATCATCTCTAAGCTCGGACGTGCCAAAATGTTCGATGAAATGGAAGAAATCCTCCAACAGCTCAAGCAAGAGACTCGTTTTGCTCCTCACGAGGTCATTTTCTGTAATGTCATTGCCTTCTATGGCCGAGCCCACCTCCCTGATCGTGCCTTCCAAGTGTTCGAAAGAATTCCATCGTTCCGTTGCAAGCGGACGGTGAAATCTGTAAATTCTTTGTTGGCTGCATTGTTGAAGAATCGGCAACTTGAGAAAATGACGCAAGTTTTTGTGGACATTAGTAACTATGGCTCCCCTGATGCTTGTACTTTTAATATTTTGATTCATGCTGCTTGTTTATGTGGTGATTTGGATGCTGTATGGGGAGTGTTTGATGAAATGCAAAAGAGAGGTGTAAAACCGAACGTGGTTACTTTTGGGACTCTGATTTATGGGCTTTCTCTGAATTCTAAGTTGAAAGAAGCATTGAGATTGAAAGAGGATATGGTGAAAGTGTACATGATTAAACCTAATGCGTCAATATATACTACTTTGATCAAAGGATTTTGTGGGGTTGGAGAATTGAATTTCGCTTTCAAGCTAAAGGAGGAGATGGTTACTAGCAACGTAAAATTGGTCTCGGCAGTTTACTCTACTTTGATCAGTGCGCTCTTCAAGCATGGTAGGAAAGAAGAAGTTTCTGACATTTTAAGAGAAATGGGAGAGAATGGGTGCAAACCTGATACCGTTACCTACAATGCCATTATCAATGGACATTGTAAAGAAAACGATCTCGAATCTGCACATAGAGTTATGGATGAAATGGTGGAGAAAGGGTGTAAGCCAGATGTGTTCAGTTTCAACACAATCATTGGATGGTTATGTAAGGAGGGGAAATTAGATAAAGCAATGGACTTGCTTGAAGATATGCCGAGACGAGGTTGCCCTCCTGATGTGTTATCATACAGGATAATTTTTGATGGGCTGTGTGAAATGATGCAGCTAAAGGAAGCAACTTCCATACTGGACGAGATGATCTTCAAGGGTTATGTGCCTCGTAATGAAAGCATAAACAAACTCGTAGACAGGTTGTGCCAGGAATGCAATATGGAGCTGTTGTGGATGGTCTTAAACAGTCTGGGAAGAGGAAATCGCATGAATATGGACATGTGGGCTAGAGTTGTTGCTTTTGTTTATAAGGAGAACCTATCGGAATCCTCCAACTTAATTGACTCATTGATTAGTTGA

Coding sequence (CDS)

ATGGGGATGAAGAAAACAAGCTTCAAGCTTCCCTTGGCCTTGATCGTTTCTTTTGGATGTAACAATGGCCGCTCTCTCCCTTTCCCCCTCTCTCAACCATTCACAATCCTTTCCCCCCATTTGTACCCCAAATCTTTTTGTTCTCAGTCGGTGAACGGCAAATCGAGGAACAGTACTGGCCACCGGGAAACCAACGACACCCAAATGAAGACCCAAAAGCCAATTTCTCCTTTTCGATTGTCTTCACTTCTTCGTCTCCAAAAGGACCCAACGCTCGCTCTTCAACTCTTCCTCAACCCTAATCCCAGCTCTTCAGAACCCCCCAAACCCTTTCGTTACTCTTTACTTTCCTACGATCTCATCATCTCTAAGCTCGGACGTGCCAAAATGTTCGATGAAATGGAAGAAATCCTCCAACAGCTCAAGCAAGAGACTCGTTTTGCTCCTCACGAGGTCATTTTCTGTAATGTCATTGCCTTCTATGGCCGAGCCCACCTCCCTGATCGTGCCTTCCAAGTGTTCGAAAGAATTCCATCGTTCCGTTGCAAGCGGACGGTGAAATCTGTAAATTCTTTGTTGGCTGCATTGTTGAAGAATCGGCAACTTGAGAAAATGACGCAAGTTTTTGTGGACATTAGTAACTATGGCTCCCCTGATGCTTGTACTTTTAATATTTTGATTCATGCTGCTTGTTTATGTGGTGATTTGGATGCTGTATGGGGAGTGTTTGATGAAATGCAAAAGAGAGGTGTAAAACCGAACGTGGTTACTTTTGGGACTCTGATTTATGGGCTTTCTCTGAATTCTAAGTTGAAAGAAGCATTGAGATTGAAAGAGGATATGGTGAAAGTGTACATGATTAAACCTAATGCGTCAATATATACTACTTTGATCAAAGGATTTTGTGGGGTTGGAGAATTGAATTTCGCTTTCAAGCTAAAGGAGGAGATGGTTACTAGCAACGTAAAATTGGTCTCGGCAGTTTACTCTACTTTGATCAGTGCGCTCTTCAAGCATGGTAGGAAAGAAGAAGTTTCTGACATTTTAAGAGAAATGGGAGAGAATGGGTGCAAACCTGATACCGTTACCTACAATGCCATTATCAATGGACATTGTAAAGAAAACGATCTCGAATCTGCACATAGAGTTATGGATGAAATGGTGGAGAAAGGGTGTAAGCCAGATGTGTTCAGTTTCAACACAATCATTGGATGGTTATGTAAGGAGGGGAAATTAGATAAAGCAATGGACTTGCTTGAAGATATGCCGAGACGAGGTTGCCCTCCTGATGTGTTATCATACAGGATAATTTTTGATGGGCTGTGTGAAATGATGCAGCTAAAGGAAGCAACTTCCATACTGGACGAGATGATCTTCAAGGGTTATGTGCCTCGTAATGAAAGCATAAACAAACTCGTAGACAGGTTGTGCCAGGAATGCAATATGGAGCTGTTGTGGATGGTCTTAAACAGTCTGGGAAGAGGAAATCGCATGAATATGGACATGTGGGCTAGAGTTGTTGCTTTTGTTTATAAGGAGAACCTATCGGAATCCTCCAACTTAATTGACTCATTGATTAGTTGA
BLAST of CSPI04G22400 vs. Swiss-Prot
Match: PPR79_ARATH (Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis thaliana GN=At1g53330 PE=3 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 3.7e-144
Identity = 250/455 (54.95%), Postives = 331/455 (72.75%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           M   K +S FRL+SLLR + DP+ A++LF NP+P S+ P +PFRYSLL YD+II+KLG +
Sbjct: 1   MSAVKSVSSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGS 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           KMFDE++++L  LK +TR  P E+IFCNVI F+GR  LP RA  +F+ +P +RC+RTVKS
Sbjct: 61  KMFDELDQVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           +NSLL+ALLK  +LEKM +    I  +G PDACT+NILIH     G  D    +FDEM K
Sbjct: 121 LNSLLSALLKCGELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKLFDEMVK 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           + VKP  VTFGTLI+GL  +S++KEAL++K DM+KVY ++P   IY +LIK  C +GEL+
Sbjct: 181 KKVKPTGVTFGTLIHGLCKDSRVKEALKMKHDMLKVYGVRPTVHIYASLIKALCQIGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
           FAFKLK+E     +K+ +A+YSTLIS+L K GR  EVS IL EM E GCKPDTVTYN +I
Sbjct: 241 FAFKLKDEAYEGKIKVDAAIYSTLISSLIKAGRSNEVSMILEEMSEKGCKPDTVTYNVLI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG C END ESA+RV+DEMVEKG KPDV S+N I+G   +  K ++A  L EDMPRRGC 
Sbjct: 301 NGFCVENDSESANRVLDEMVEKGLKPDVISYNMILGVFFRIKKWEEATYLFEDMPRRGCS 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PD LSYRI+FDGLCE +Q +EA  ILDEM+FKGY PR + +   + +LC+   +E+L  V
Sbjct: 361 PDTLSYRIVFDGLCEGLQFEEAAVILDEMLFKGYKPRRDRLEGFLQKLCESGKLEILSKV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLI 523
           ++SL RG   + D+W+ ++  + KE  +S+S +L+
Sbjct: 421 ISSLHRGIAGDADVWSVMIPTMCKEPVISDSIDLL 455

BLAST of CSPI04G22400 vs. Swiss-Prot
Match: PP392_ARATH (Pentatricopeptide repeat-containing protein At5g18475 OS=Arabidopsis thaliana GN=At5g18475 PE=2 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 1.6e-51
Identity = 142/469 (30.28%), Postives = 245/469 (52.24%), Query Frame = 1

Query: 66  DTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKL 125
           +T  KT K IS     SL++ ++DP   L +F   N +S +  K F ++  +Y +++  L
Sbjct: 46  ETNPKT-KFISHESAVSLMKRERDPQGVLDIF---NKASQQ--KGFNHNNATYSVLLDNL 105

Query: 126 GRAKMFDEMEEILQQLKQET-RFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF-RCK 185
            R K F  ++ IL Q+K ET RF   E +F N++  + R+ L D+  ++F  I    R K
Sbjct: 106 VRHKKFLAVDAILHQMKYETCRF--QESLFLNLMRHFSRSDLHDKVMEMFNLIQVIARVK 165

Query: 186 RTVKSVNSLLAALLKNRQLEKMTQVFVDIS-NYG-SPDACTFNILIHAACLCGDLDAVWG 245
            ++ ++++ L  L+ + ++    ++ +    N G  P+ C FNIL+   C  GD++  + 
Sbjct: 166 PSLNAISTCLNLLIDSGEVNLSRKLLLYAKHNLGLQPNTCIFNILVKHHCKNGDINFAFL 225

Query: 246 VFDEMQKRGVK-PNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 305
           V +EM++ G+  PN +T+ TL+  L  +S+ KEA+ L EDM+    I P+   +  +I G
Sbjct: 226 VVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISKEGISPDPVTFNVMING 285

Query: 306 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 365
           FC  GE+  A K+ + M  +        YS L++   K G+ +E      E+ + G K D
Sbjct: 286 FCRAGEVERAKKILDFMKKNGCNPNVYNYSALMNGFCKVGKIQEAKQTFDEVKKTGLKLD 345

Query: 366 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 425
           TV Y  ++N  C+  + + A +++ EM    C+ D  ++N I+  L  EG+ ++A+ +L+
Sbjct: 346 TVGYTTLMNCFCRNGETDEAMKLLGEMKASRCRADTLTYNVILRGLSSEGRSEEALQMLD 405

Query: 426 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 485
                G   +  SYRII + LC   +L++A   L  M  +G  P + + N+LV RLC+  
Sbjct: 406 QWGSEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSERGIWPHHATWNELVVRLCESG 465

Query: 486 NMEL-LWMVLNSLGRGNRMNMDMWARVVAFVYKE-NLSESSNLIDSLIS 528
             E+ + +++  L  G       W  VV  + KE  L     L+DSL+S
Sbjct: 466 YTEIGVRVLIGFLRIGLIPGPKSWGAVVESICKERKLVHVFELLDSLVS 506

BLAST of CSPI04G22400 vs. Swiss-Prot
Match: PP388_ARATH (Pentatricopeptide repeat-containing protein At5g16420, mitochondrial OS=Arabidopsis thaliana GN=At5g16420 PE=2 SV=1)

HSP 1 Score: 203.8 bits (517), Expect = 4.8e-51
Identity = 125/429 (29.14%), Postives = 233/429 (54.31%), Query Frame = 1

Query: 68  QMKTQKP--------ISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYD 127
           Q  T+KP        + P RL S++  Q++  LALQ+FL    S       F ++  +Y 
Sbjct: 32  QYCTEKPPIKPWPQRLFPKRLVSMITQQQNIDLALQIFLYAGKSHPG----FTHNYDTYH 91

Query: 128 LIISKLGRAKMFDEMEEILQQLKQETRFAP---HEVIFCNVIAFYGRAHLPDRAFQVFER 187
            I+ KL RA+ FD +E ++  L+    + P    E +F +++  YG A   + + ++F R
Sbjct: 92  SILFKLSRARAFDPVESLMADLRNS--YPPIKCGENLFIDLLRNYGLAGRYESSMRIFLR 151

Query: 188 IPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDIS-NYG-SPDACTFNILIHAACLCG 247
           IP F  KR+V+S+N+LL  L++N++ + +  +F +   ++G +P+  T N+L+ A C   
Sbjct: 152 IPDFGVKRSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKN 211

Query: 248 DLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIY 307
           D+++ + V DE+   G+ PN+VT+ T++ G      ++ A R+ E+M+      P+A+ Y
Sbjct: 212 DIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWY-PDATTY 271

Query: 308 TTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGE 367
           T L+ G+C +G  + A  + ++M  + ++     Y  +I AL K  +  E  ++  EM E
Sbjct: 272 TVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSGEARNMFDEMLE 331

Query: 368 NGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDK 427
               PD+     +I+  C+++ ++ A  +  +M++  C PD    +T+I WLCKEG++ +
Sbjct: 332 RSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTE 391

Query: 428 AMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVD 484
           A  L ++   +G  P +L+Y  +  G+CE  +L EA  + D+M  +   P   + N L++
Sbjct: 392 ARKLFDEF-EKGSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIE 451

BLAST of CSPI04G22400 vs. Swiss-Prot
Match: PPR20_ARATH (Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidopsis thaliana GN=At1g07740 PE=2 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 4.5e-49
Identity = 139/413 (33.66%), Postives = 209/413 (50.61%), Query Frame = 1

Query: 67  TQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLG 126
           T   T+KP       + L+  +DP  AL LF             FR+   SY  +I KL 
Sbjct: 39  THKFTRKPWEEVPFLTDLKEIEDPEEALSLFHQYQEMG------FRHDYPSYSSLIYKLA 98

Query: 127 RAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTV 186
           +++ FD +++IL+ ++        E +F  +I  YG+A   D+A  VF +I SF C RT+
Sbjct: 99  KSRNFDAVDQILRLVRYRN-VRCRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTI 158

Query: 187 KSVNSLLAALLKNRQLEKMTQVFVDISNYG-SPDACTFNILIHAACLCGDLDAVWGVFDE 246
           +S+N+L+  L+ N +LEK    F    +    P++ +FNILI       D +A   VFDE
Sbjct: 159 QSLNTLINVLVDNGELEKAKSFFDGAKDMRLRPNSVSFNILIKGFLDKCDWEAACKVFDE 218

Query: 247 MQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVG 306
           M +  V+P+VVT+ +LI  L  N  + +A  L EDM+K   I+PNA  +  L+KG C  G
Sbjct: 219 MLEMEVQPSVVTYNSLIGFLCRNDDMGKAKSLLEDMIK-KRIRPNAVTFGLLMKGLCCKG 278

Query: 307 ELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYN 366
           E N A KL  +M     K     Y  L+S L K GR +E   +L EM +   KPD V YN
Sbjct: 279 EYNEAKKLMFDMEYRGCKPGLVNYGILMSDLGKRGRIDEAKLLLGEMKKRRIKPDVVIYN 338

Query: 367 AIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDM-PR 426
            ++N  C E  +  A+RV+ EM  KGCKP+  ++  +I   C+    D  +++L  M   
Sbjct: 339 ILVNHLCTECRVPEAYRVLTEMQMKGCKPNAATYRMMIDGFCRIEDFDSGLNVLNAMLAS 398

Query: 427 RGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLC 478
           R CP    ++  +  GL +   L  A  +L+ M  K     + +   L+  LC
Sbjct: 399 RHCPTPA-TFVCMVAGLIKGGNLDHACFVLEVMGKKNLSFGSGAWQNLLSDLC 442

BLAST of CSPI04G22400 vs. Swiss-Prot
Match: PP444_ARATH (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 3.8e-48
Identity = 122/408 (29.90%), Postives = 209/408 (51.23%), Query Frame = 1

Query: 75  ISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRAKMFDEM 134
           I+PF+L  LL L  + + +++LF     S +     +R+S   Y ++I KLG    F  +
Sbjct: 76  ITPFQLYKLLELPLNVSTSMELF-----SWTGSQNGYRHSFDVYQVLIGKLGANGEFKTI 135

Query: 135 EEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPS-FRCKRTVKSVNSLL 194
           + +L Q+K E      E +F +++  Y +A  P +  ++   + + + C+ T KS N +L
Sbjct: 136 DRLLIQMKDEG-IVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVL 195

Query: 195 AALLKNRQLEKMTQVFVD-ISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQKRGVK 254
             L+     +    VF D +S    P   TF +++ A C   ++D+   +  +M K G  
Sbjct: 196 EILVSGNCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCV 255

Query: 255 PNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELNFAFK 314
           PN V + TLI+ LS  +++ EAL+L E+M  +  + P+A  +  +I G C    +N A K
Sbjct: 256 PNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCV-PDAETFNDVILGLCKFDRINEAAK 315

Query: 315 LKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAIINGHC 374
           +   M+          Y  L++ L K GR +   D+   +     KP+ V +N +I+G  
Sbjct: 316 MVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIP----KPEIVIFNTLIHGFV 375

Query: 375 KENDLESAHRVMDEMVEK-GCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCPPDV 434
               L+ A  V+ +MV   G  PDV ++N++I    KEG +  A+++L DM  +GC P+V
Sbjct: 376 THGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNV 435

Query: 435 LSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQE 480
            SY I+ DG C++ ++ EA ++L+EM   G  P     N L+   C+E
Sbjct: 436 YSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKE 472

BLAST of CSPI04G22400 vs. TrEMBL
Match: A0A0A0KZF4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G622790 PE=4 SV=1)

HSP 1 Score: 980.7 bits (2534), Expect = 7.0e-283
Identity = 493/527 (93.55%), Postives = 494/527 (93.74%), Query Frame = 1

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPH YPKSFCSQSVNGKSRNSTG
Sbjct: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHFYPKSFCSQSVNGKSRNSTG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDA  
Sbjct: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDA-- 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
                                         LKEALRLKEDMVKVYMIKPNASIYTTLIKG
Sbjct: 241 ------------------------------LKEALRLKEDMVKVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE
Sbjct: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWM+LNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS
Sbjct: 481 NMELLWMILNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 495

BLAST of CSPI04G22400 vs. TrEMBL
Match: M5WZC0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022285mg PE=4 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 7.5e-160
Identity = 286/460 (62.17%), Postives = 350/460 (76.09%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSS-EPPKPFRYSLLSYDLIISKLGR 128
           M   K ISPFRLSSLLR QKDP LALQLF NPN  S+ +  +PFRYSLLSYDLII+KLGR
Sbjct: 1   MNASKSISPFRLSSLLRRQKDPILALQLFQNPNSDSTPQLKRPFRYSLLSYDLIITKLGR 60

Query: 129 AKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVK 188
           AKMF +M++IL QLKQ+TRFAP E+IFCNVI+FYGRAHLPDRA Q+F+ IP+FRC RTVK
Sbjct: 61  AKMFHQMDQILHQLKQDTRFAPPEIIFCNVISFYGRAHLPDRALQMFDEIPTFRCHRTVK 120

Query: 189 SVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQ 248
           S+NSLL ALLK  + EKM + FV I  Y +PDACT+NIL+ A C    LD  W V DEM 
Sbjct: 121 SLNSLLNALLKCGEFEKMWKFFVGIDKYATPDACTYNILMKACCSNEYLDDAWKVLDEMS 180

Query: 249 KRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGEL 308
           ++G+ PN VT  +LIY L  N KLKEA  LKEDM +VY + P   +YT+L+KG C +GE+
Sbjct: 181 RKGIPPNSVTMASLIYCLCSNLKLKEAFTLKEDMARVYGVPPTIFVYTSLMKGLCKIGEM 240

Query: 309 NFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAI 368
           + AF+LKEEM+   +K  +AVYSTLIS LFK GRK EV  +L EM E GCK +TVTYNA+
Sbjct: 241 SLAFRLKEEMIMRKIKPDAAVYSTLISGLFKLGRKGEVFGLLEEMSEYGCKLNTVTYNAM 300

Query: 369 INGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGC 428
           ING CKE D E+A++V+DEMVEKGC+PDV S+N I+G LCKEGK  +A DL ED+PRRGC
Sbjct: 301 INGFCKEKDFEAAYKVLDEMVEKGCEPDVISYNVILGGLCKEGKWSEANDLFEDLPRRGC 360

Query: 429 PPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWM 488
            PDV+SYRI+F GLC+  Q +EA  ILDE+IFKGY P   S +KLV+ LC+E +MELL  
Sbjct: 361 TPDVVSYRIMFTGLCDCRQFREAAFILDELIFKGYAPHCLSAHKLVEGLCREGDMELLRT 420

Query: 489 VLNSLGRGNRMNMDMWARVVAFV-YKENLSESSNLIDSLI 527
           VL SLG GN +++D WA V++ V  KE LS  S L+D+L+
Sbjct: 421 VLTSLGNGNVLHVDTWAMVISMVCKKEKLSNVSELVDTLL 460

BLAST of CSPI04G22400 vs. TrEMBL
Match: A0A061F8G6_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_031884 PE=4 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 7.2e-155
Identity = 272/459 (59.26%), Postives = 344/459 (74.95%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           MKT KPISPFRLSSLLR +KDPTLA  LF NPNP      KPFRYS LSYDLII+KLGRA
Sbjct: 1   MKTPKPISPFRLSSLLRSEKDPTLAFNLFKNPNPDPKPAGKPFRYSPLSYDLIITKLGRA 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           +MFDEME++L Q K +TR  P E+IFCN + FYGRA L +RA Q+FE +P++RC+RTVKS
Sbjct: 61  RMFDEMEQVLHQHKNDTRLVPQEIIFCNAMKFYGRACLHERALQLFEEMPAYRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           VNSLL ALL + + ++M QVF  +  Y  PDACT+NILI A CL G LD    +FDEMQ+
Sbjct: 121 VNSLLNALLLSEKFDEMKQVFFGMEKYARPDACTYNILIRACCLSGCLDDASNLFDEMQR 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           +GVKPNVVTFGTLI GL +  K+ EA +LK DMV+++ + PN   Y+ +IKG C +GEL+
Sbjct: 181 KGVKPNVVTFGTLIRGLCMEMKVNEAFKLKADMVRLHGLCPNPCTYSMMIKGLCRIGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
            A +LKEEMV + +K+ S++YSTLIS  F  GR++E   I  EM  N CKPDTVTYN  I
Sbjct: 241 LAIRLKEEMVGNKIKVDSSIYSTLISGHFNIGRQDEALGIFEEMALNECKPDTVTYNETI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG CK  D E+A+RV+++M +K CKPDV S+N +I  LCKEGK  +A DL EDMPR+GC 
Sbjct: 301 NGFCKVKDFEAAYRVLEDMAKKQCKPDVISYNILIDGLCKEGKWSEANDLFEDMPRQGCK 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PDV+SYR++FDGLC  +Q KEA  ILDEMIFKGYVP   SI+K V  LCQ+ + +LL MV
Sbjct: 361 PDVVSYRLLFDGLCGGLQFKEAAFILDEMIFKGYVPHCASIHKFVSGLCQKADKKLLLMV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLIDSLI 527
           LNSL +GN ++ D W  V++ VY+E+ LS SS+++D+L+
Sbjct: 421 LNSLAKGNAIDQDTWLMVISKVYQEDKLSISSDILDALM 459

BLAST of CSPI04G22400 vs. TrEMBL
Match: A0A061F7L0_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_031884 PE=4 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 6.1e-154
Identity = 271/459 (59.04%), Postives = 343/459 (74.73%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           MKT K ISPFRLSSLLR +KDPTLA  LF NPNP      KPFRYS LSYDLII+KLGRA
Sbjct: 1   MKTPKSISPFRLSSLLRSEKDPTLAFNLFKNPNPDPKPAGKPFRYSPLSYDLIITKLGRA 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           +MFDEME++L Q K +TR  P E+IFCN + FYGRA L +RA Q+FE +P++RC+RTVKS
Sbjct: 61  RMFDEMEQVLHQHKNDTRLVPQEIIFCNAMKFYGRACLHERALQLFEEMPAYRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           VNSLL ALL + + ++M QVF  +  Y  PDACT+NILI A CL G LD    +FDEMQ+
Sbjct: 121 VNSLLNALLLSEKFDEMKQVFFGMEKYARPDACTYNILIRACCLSGCLDDASNLFDEMQR 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           +GVKPNVVTFGTLI GL +  K+ EA +LK DMV+++ + PN   Y+ +IKG C +GEL+
Sbjct: 181 KGVKPNVVTFGTLIRGLCMEMKVNEAFKLKADMVRLHGLCPNPCTYSMMIKGLCRIGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
            A +LKEEMV + +K+ S++YSTLIS  F  GR++E   I  EM  N CKPDTVTYN  I
Sbjct: 241 LAIRLKEEMVGNKIKVDSSIYSTLISGHFNIGRQDEALGIFEEMALNECKPDTVTYNETI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG CK  D E+A+RV+++M +K CKPDV S+N +I  LCKEGK  +A DL EDMPR+GC 
Sbjct: 301 NGFCKVKDFEAAYRVLEDMAKKQCKPDVISYNILIDGLCKEGKWSEANDLFEDMPRQGCK 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PDV+SYR++FDGLC  +Q KEA  ILDEMIFKGYVP   SI+K V  LCQ+ + +LL MV
Sbjct: 361 PDVVSYRLLFDGLCGGLQFKEAAFILDEMIFKGYVPHCASIHKFVSGLCQKADKKLLLMV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLIDSLI 527
           LNSL +GN ++ D W  V++ VY+E+ LS SS+++D+L+
Sbjct: 421 LNSLAKGNAIDQDTWLMVISKVYQEDKLSISSDILDALM 459

BLAST of CSPI04G22400 vs. TrEMBL
Match: B9I1S4_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0011s10980g PE=4 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 1.8e-153
Identity = 276/459 (60.13%), Postives = 340/459 (74.07%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           M   +PI+PFRL+SLLRLQKDP LALQLF NPNP +  P KPFRYSLLSYDLII+KLGRA
Sbjct: 1   MNPSRPITPFRLASLLRLQKDPKLALQLFKNPNPKT--PSKPFRYSLLSYDLIITKLGRA 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           KMF+EM+EIL QLK+ET F P E +FC++I FYGRA LP+ A ++   +PSFR +RTVKS
Sbjct: 61  KMFNEMQEILAQLKEETLFTPKEALFCDIINFYGRARLPENALKLLVELPSFRVQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
            NSLL+  L  +  +KM ++FV I   G  DACT+N+LI   C  G LD    VFDEM  
Sbjct: 121 YNSLLSVFLMCKDFDKMRELFVGIEKLGKADACTYNLLIRGFCASGRLDDASKVFDEMTN 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           RGV PNV+TFG LIYG  L+ +LKEA +LK DMVKVY + PNA IY +LIKG C  GEL+
Sbjct: 181 RGVSPNVITFGNLIYGFCLHLRLKEAFKLKTDMVKVYRVYPNAYIYASLIKGVCKNGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
            AF+LK+EM+ + ++L  A+YSTLIS LFK GRKEE   +  +M E G KPDTVTYN II
Sbjct: 241 LAFRLKKEMIRNKIELDPAIYSTLISGLFKAGRKEEALGVWEDMKERGYKPDTVTYNVII 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           N  CK+ D E+A+R++DEMVEKGCKPDV S+N I+  L +EGK  +A DL EDMPRRGC 
Sbjct: 301 NLFCKDKDFEAAYRLLDEMVEKGCKPDVISYNVILRELFEEGKRGEANDLFEDMPRRGCA 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PDV+SYRI+FDG C  MQ KEA  ILDEMIFKG+VP + SI K V+RLC+  N +LL   
Sbjct: 361 PDVVSYRILFDGFCNGMQFKEAAFILDEMIFKGFVPCSASICKFVNRLCEGKNEDLLRSA 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLIDSLI 527
            N+L +G  +N+D+W   VA V+K++ LS S NL+DSLI
Sbjct: 421 FNTLEKGKLVNVDLWRMAVAMVFKDDKLSSSFNLVDSLI 457

BLAST of CSPI04G22400 vs. TAIR10
Match: AT1G53330.1 (AT1G53330.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 513.1 bits (1320), Expect = 2.1e-145
Identity = 250/455 (54.95%), Postives = 331/455 (72.75%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           M   K +S FRL+SLLR + DP+ A++LF NP+P S+ P +PFRYSLL YD+II+KLG +
Sbjct: 1   MSAVKSVSSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGS 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           KMFDE++++L  LK +TR  P E+IFCNVI F+GR  LP RA  +F+ +P +RC+RTVKS
Sbjct: 61  KMFDELDQVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           +NSLL+ALLK  +LEKM +    I  +G PDACT+NILIH     G  D    +FDEM K
Sbjct: 121 LNSLLSALLKCGELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKLFDEMVK 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           + VKP  VTFGTLI+GL  +S++KEAL++K DM+KVY ++P   IY +LIK  C +GEL+
Sbjct: 181 KKVKPTGVTFGTLIHGLCKDSRVKEALKMKHDMLKVYGVRPTVHIYASLIKALCQIGELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
           FAFKLK+E     +K+ +A+YSTLIS+L K GR  EVS IL EM E GCKPDTVTYN +I
Sbjct: 241 FAFKLKDEAYEGKIKVDAAIYSTLISSLIKAGRSNEVSMILEEMSEKGCKPDTVTYNVLI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           NG C END ESA+RV+DEMVEKG KPDV S+N I+G   +  K ++A  L EDMPRRGC 
Sbjct: 301 NGFCVENDSESANRVLDEMVEKGLKPDVISYNMILGVFFRIKKWEEATYLFEDMPRRGCS 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PD LSYRI+FDGLCE +Q +EA  ILDEM+FKGY PR + +   + +LC+   +E+L  V
Sbjct: 361 PDTLSYRIVFDGLCEGLQFEEAAVILDEMLFKGYKPRRDRLEGFLQKLCESGKLEILSKV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFVYKEN-LSESSNLI 523
           ++SL RG   + D+W+ ++  + KE  +S+S +L+
Sbjct: 421 ISSLHRGIAGDADVWSVMIPTMCKEPVISDSIDLL 455

BLAST of CSPI04G22400 vs. TAIR10
Match: AT5G18475.1 (AT5G18475.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 205.3 bits (521), Expect = 9.3e-53
Identity = 142/469 (30.28%), Postives = 245/469 (52.24%), Query Frame = 1

Query: 66  DTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKL 125
           +T  KT K IS     SL++ ++DP   L +F   N +S +  K F ++  +Y +++  L
Sbjct: 46  ETNPKT-KFISHESAVSLMKRERDPQGVLDIF---NKASQQ--KGFNHNNATYSVLLDNL 105

Query: 126 GRAKMFDEMEEILQQLKQET-RFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF-RCK 185
            R K F  ++ IL Q+K ET RF   E +F N++  + R+ L D+  ++F  I    R K
Sbjct: 106 VRHKKFLAVDAILHQMKYETCRF--QESLFLNLMRHFSRSDLHDKVMEMFNLIQVIARVK 165

Query: 186 RTVKSVNSLLAALLKNRQLEKMTQVFVDIS-NYG-SPDACTFNILIHAACLCGDLDAVWG 245
            ++ ++++ L  L+ + ++    ++ +    N G  P+ C FNIL+   C  GD++  + 
Sbjct: 166 PSLNAISTCLNLLIDSGEVNLSRKLLLYAKHNLGLQPNTCIFNILVKHHCKNGDINFAFL 225

Query: 246 VFDEMQKRGVK-PNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 305
           V +EM++ G+  PN +T+ TL+  L  +S+ KEA+ L EDM+    I P+   +  +I G
Sbjct: 226 VVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISKEGISPDPVTFNVMING 285

Query: 306 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 365
           FC  GE+  A K+ + M  +        YS L++   K G+ +E      E+ + G K D
Sbjct: 286 FCRAGEVERAKKILDFMKKNGCNPNVYNYSALMNGFCKVGKIQEAKQTFDEVKKTGLKLD 345

Query: 366 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 425
           TV Y  ++N  C+  + + A +++ EM    C+ D  ++N I+  L  EG+ ++A+ +L+
Sbjct: 346 TVGYTTLMNCFCRNGETDEAMKLLGEMKASRCRADTLTYNVILRGLSSEGRSEEALQMLD 405

Query: 426 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 485
                G   +  SYRII + LC   +L++A   L  M  +G  P + + N+LV RLC+  
Sbjct: 406 QWGSEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSERGIWPHHATWNELVVRLCESG 465

Query: 486 NMEL-LWMVLNSLGRGNRMNMDMWARVVAFVYKE-NLSESSNLIDSLIS 528
             E+ + +++  L  G       W  VV  + KE  L     L+DSL+S
Sbjct: 466 YTEIGVRVLIGFLRIGLIPGPKSWGAVVESICKERKLVHVFELLDSLVS 506

BLAST of CSPI04G22400 vs. TAIR10
Match: AT5G16420.1 (AT5G16420.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 203.8 bits (517), Expect = 2.7e-52
Identity = 125/429 (29.14%), Postives = 233/429 (54.31%), Query Frame = 1

Query: 68  QMKTQKP--------ISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYD 127
           Q  T+KP        + P RL S++  Q++  LALQ+FL    S       F ++  +Y 
Sbjct: 32  QYCTEKPPIKPWPQRLFPKRLVSMITQQQNIDLALQIFLYAGKSHPG----FTHNYDTYH 91

Query: 128 LIISKLGRAKMFDEMEEILQQLKQETRFAP---HEVIFCNVIAFYGRAHLPDRAFQVFER 187
            I+ KL RA+ FD +E ++  L+    + P    E +F +++  YG A   + + ++F R
Sbjct: 92  SILFKLSRARAFDPVESLMADLRNS--YPPIKCGENLFIDLLRNYGLAGRYESSMRIFLR 151

Query: 188 IPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDIS-NYG-SPDACTFNILIHAACLCG 247
           IP F  KR+V+S+N+LL  L++N++ + +  +F +   ++G +P+  T N+L+ A C   
Sbjct: 152 IPDFGVKRSVRSLNTLLNVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKN 211

Query: 248 DLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIY 307
           D+++ + V DE+   G+ PN+VT+ T++ G      ++ A R+ E+M+      P+A+ Y
Sbjct: 212 DIESAYKVLDEIPSMGLVPNLVTYTTILGGYVARGDMESAKRVLEEMLDRGWY-PDATTY 271

Query: 308 TTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGE 367
           T L+ G+C +G  + A  + ++M  + ++     Y  +I AL K  +  E  ++  EM E
Sbjct: 272 TVLMDGYCKLGRFSEAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSGEARNMFDEMLE 331

Query: 368 NGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDK 427
               PD+     +I+  C+++ ++ A  +  +M++  C PD    +T+I WLCKEG++ +
Sbjct: 332 RSFMPDSSLCCKVIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTE 391

Query: 428 AMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVD 484
           A  L ++   +G  P +L+Y  +  G+CE  +L EA  + D+M  +   P   + N L++
Sbjct: 392 ARKLFDEF-EKGSIPSLLTYNTLIAGMCEKGELTEAGRLWDDMYERKCKPNAFTYNVLIE 451

BLAST of CSPI04G22400 vs. TAIR10
Match: AT1G07740.1 (AT1G07740.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 197.2 bits (500), Expect = 2.5e-50
Identity = 139/413 (33.66%), Postives = 209/413 (50.61%), Query Frame = 1

Query: 67  TQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLG 126
           T   T+KP       + L+  +DP  AL LF             FR+   SY  +I KL 
Sbjct: 39  THKFTRKPWEEVPFLTDLKEIEDPEEALSLFHQYQEMG------FRHDYPSYSSLIYKLA 98

Query: 127 RAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTV 186
           +++ FD +++IL+ ++        E +F  +I  YG+A   D+A  VF +I SF C RT+
Sbjct: 99  KSRNFDAVDQILRLVRYRN-VRCRESLFMGLIQHYGKAGSVDKAIDVFHKITSFDCVRTI 158

Query: 187 KSVNSLLAALLKNRQLEKMTQVFVDISNYG-SPDACTFNILIHAACLCGDLDAVWGVFDE 246
           +S+N+L+  L+ N +LEK    F    +    P++ +FNILI       D +A   VFDE
Sbjct: 159 QSLNTLINVLVDNGELEKAKSFFDGAKDMRLRPNSVSFNILIKGFLDKCDWEAACKVFDE 218

Query: 247 MQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVG 306
           M +  V+P+VVT+ +LI  L  N  + +A  L EDM+K   I+PNA  +  L+KG C  G
Sbjct: 219 MLEMEVQPSVVTYNSLIGFLCRNDDMGKAKSLLEDMIK-KRIRPNAVTFGLLMKGLCCKG 278

Query: 307 ELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYN 366
           E N A KL  +M     K     Y  L+S L K GR +E   +L EM +   KPD V YN
Sbjct: 279 EYNEAKKLMFDMEYRGCKPGLVNYGILMSDLGKRGRIDEAKLLLGEMKKRRIKPDVVIYN 338

Query: 367 AIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDM-PR 426
            ++N  C E  +  A+RV+ EM  KGCKP+  ++  +I   C+    D  +++L  M   
Sbjct: 339 ILVNHLCTECRVPEAYRVLTEMQMKGCKPNAATYRMMIDGFCRIEDFDSGLNVLNAMLAS 398

Query: 427 RGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLC 478
           R CP    ++  +  GL +   L  A  +L+ M  K     + +   L+  LC
Sbjct: 399 RHCPTPA-TFVCMVAGLIKGGNLDHACFVLEVMGKKNLSFGSGAWQNLLSDLC 442

BLAST of CSPI04G22400 vs. TAIR10
Match: AT5G64320.1 (AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 194.1 bits (492), Expect = 2.1e-49
Identity = 122/408 (29.90%), Postives = 209/408 (51.23%), Query Frame = 1

Query: 75  ISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRAKMFDEM 134
           I+PF+L  LL L  + + +++LF     S +     +R+S   Y ++I KLG    F  +
Sbjct: 76  ITPFQLYKLLELPLNVSTSMELF-----SWTGSQNGYRHSFDVYQVLIGKLGANGEFKTI 135

Query: 135 EEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPS-FRCKRTVKSVNSLL 194
           + +L Q+K E      E +F +++  Y +A  P +  ++   + + + C+ T KS N +L
Sbjct: 136 DRLLIQMKDEG-IVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVL 195

Query: 195 AALLKNRQLEKMTQVFVD-ISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQKRGVK 254
             L+     +    VF D +S    P   TF +++ A C   ++D+   +  +M K G  
Sbjct: 196 EILVSGNCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCV 255

Query: 255 PNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELNFAFK 314
           PN V + TLI+ LS  +++ EAL+L E+M  +  + P+A  +  +I G C    +N A K
Sbjct: 256 PNSVIYQTLIHSLSKCNRVNEALQLLEEMFLMGCV-PDAETFNDVILGLCKFDRINEAAK 315

Query: 315 LKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAIINGHC 374
           +   M+          Y  L++ L K GR +   D+   +     KP+ V +N +I+G  
Sbjct: 316 MVNRMLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIP----KPEIVIFNTLIHGFV 375

Query: 375 KENDLESAHRVMDEMVEK-GCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCPPDV 434
               L+ A  V+ +MV   G  PDV ++N++I    KEG +  A+++L DM  +GC P+V
Sbjct: 376 THGRLDDAKAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNV 435

Query: 435 LSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQE 480
            SY I+ DG C++ ++ EA ++L+EM   G  P     N L+   C+E
Sbjct: 436 YSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKE 472

BLAST of CSPI04G22400 vs. NCBI nr
Match: gi|449456681|ref|XP_004146077.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Cucumis sativus])

HSP 1 Score: 1063.9 bits (2750), Expect = 9.0e-308
Identity = 525/527 (99.62%), Postives = 526/527 (99.81%), Query Frame = 1

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPH YPKSFCSQSVNGKSRNSTG
Sbjct: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHFYPKSFCSQSVNGKSRNSTG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW
Sbjct: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
           GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG
Sbjct: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE
Sbjct: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWM+LNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS
Sbjct: 481 NMELLWMILNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 527

BLAST of CSPI04G22400 vs. NCBI nr
Match: gi|700199862|gb|KGN55020.1| (hypothetical protein Csa_4G622790 [Cucumis sativus])

HSP 1 Score: 980.7 bits (2534), Expect = 1.0e-282
Identity = 493/527 (93.55%), Postives = 494/527 (93.74%), Query Frame = 1

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPH YPKSFCSQSVNGKSRNSTG
Sbjct: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHFYPKSFCSQSVNGKSRNSTG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDA  
Sbjct: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDA-- 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
                                         LKEALRLKEDMVKVYMIKPNASIYTTLIKG
Sbjct: 241 ------------------------------LKEALRLKEDMVKVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE
Sbjct: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWM+LNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS
Sbjct: 481 NMELLWMILNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 495

BLAST of CSPI04G22400 vs. NCBI nr
Match: gi|659103014|ref|XP_008452430.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Cucumis melo])

HSP 1 Score: 971.5 bits (2510), Expect = 6.1e-280
Identity = 481/527 (91.27%), Postives = 499/527 (94.69%), Query Frame = 1

Query: 1   MGMKKTSFKLPLALIVSFGCNNGRSLPFPLSQPFTILSPHLYPKSFCSQSVNGKSRNSTG 60
           MGMKKTSF+LP  LIVSFGC NG SLPF LS PFT L PH YPKSFCS S+NGKSRNS+G
Sbjct: 1   MGMKKTSFELP--LIVSFGCKNGGSLPFLLSHPFTFLFPHFYPKSFCSHSMNGKSRNSSG 60

Query: 61  HRETNDTQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120
           HRETN TQMKTQKPISPFRLSSLLRL+KDP LALQLFLNPNPSSSEPPKPFRYSLLSYDL
Sbjct: 61  HRETNGTQMKTQKPISPFRLSSLLRLEKDPKLALQLFLNPNPSSSEPPKPFRYSLLSYDL 120

Query: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSF 180
           IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRA LPDRAFQVFERIP+F
Sbjct: 121 IISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRARLPDRAFQVFERIPTF 180

Query: 181 RCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVW 240
           RCKRTVKSVNSLL ALLK+R+  KM +V V I N+GSPDACTFNILIHAACLCGDLDAVW
Sbjct: 181 RCKRTVKSVNSLLDALLKSREFGKMMEVLVGIGNHGSPDACTFNILIHAACLCGDLDAVW 240

Query: 241 GVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKG 300
           GVFDEM+KRGV+PNVVTFGTLIYGLSLNSKLKEALRLKEDMV+VYMIKPNASIYTTLIKG
Sbjct: 241 GVFDEMRKRGVQPNVVTFGTLIYGLSLNSKLKEALRLKEDMVRVYMIKPNASIYTTLIKG 300

Query: 301 FCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPD 360
           FCGVGELNFAFKLKEEMVTS VKL S +YSTLISALFKHGRKEEVSDILREMGENGCKPD
Sbjct: 301 FCGVGELNFAFKLKEEMVTSKVKLDSKIYSTLISALFKHGRKEEVSDILREMGENGCKPD 360

Query: 361 TVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLE 420
           T TYNA+INGHCKENDLESA+RV+DEMVEKGCKPDV SFNTIIG LCKEGKLDKAMDLLE
Sbjct: 361 TATYNAMINGHCKENDLESANRVVDEMVEKGCKPDVISFNTIIGGLCKEGKLDKAMDLLE 420

Query: 421 DMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480
           DMPRRGCPPDVLSYRI+FDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC
Sbjct: 421 DMPRRGCPPDVLSYRIVFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQEC 480

Query: 481 NMELLWMVLNSLGRGNRMNMDMWARVVAFVYKENLSESSNLIDSLIS 528
           NMELLWMVLNSLGRGNRMNMD WARVVAF+YKENL ESSNLIDSLIS
Sbjct: 481 NMELLWMVLNSLGRGNRMNMDTWARVVAFLYKENLLESSNLIDSLIS 525

BLAST of CSPI04G22400 vs. NCBI nr
Match: gi|1009131812|ref|XP_015883043.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Ziziphus jujuba])

HSP 1 Score: 610.5 bits (1573), Expect = 2.7e-171
Identity = 309/476 (64.92%), Postives = 364/476 (76.47%), Query Frame = 1

Query: 57  NSTGHRETND-----TQMKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPF 116
           NS  +   ND     TQMKT KPIS FRLSSLLRLQKDP LALQLF NPNP   +  KPF
Sbjct: 22  NSVQYSIPNDNLKEKTQMKTMKPISSFRLSSLLRLQKDPILALQLFQNPNPQHPKRHKPF 81

Query: 117 RYSLLSYDLIISKLGRAKMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAF 176
           RYSLL+YDLIISKLGRAKMFD ME ILQQLKQETRFAP E+IFCNVI FYGRA LP RA 
Sbjct: 82  RYSLLAYDLIISKLGRAKMFDPMELILQQLKQETRFAPPEIIFCNVICFYGRARLPQRAL 141

Query: 177 QVFERIPSFRCKRTVKSVNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAAC 236
           QVF+ IPSFRC+RTVKS NSLL AL K  +L+K+ ++F+DI NY  PDACT+NILI A C
Sbjct: 142 QVFDEIPSFRCQRTVKSFNSLLDALCKCSELQKVREIFMDIENYVYPDACTYNILIKACC 201

Query: 237 LCGDLDAVWGVFDEMQKRGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNA 296
             G LD  W VFDEM+ +G  PNVVTFG LIYGL  + KLKEA  LKEDM +VY +KPNA
Sbjct: 202 TNGSLDDAWVVFDEMRSKGFCPNVVTFGNLIYGLCTDLKLKEAFNLKEDMTRVYGVKPNA 261

Query: 297 SIYTTLIKGFCGVGELNFAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILRE 356
            +YT+LIKG C + +L+ AFKLKEEMV + +KL SAVYSTLI ALFK GRK+EVS IL E
Sbjct: 262 FVYTSLIKGLCKIDDLSLAFKLKEEMVNNKIKLDSAVYSTLIGALFKVGRKDEVSGILEE 321

Query: 357 MGENGCKPDTVTYNAIINGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGK 416
           M  NGC+PDTVTYNA+ING CKE D E+A+ V+DEMV+K CKPD+ S+N IIG LCKEGK
Sbjct: 322 MTVNGCEPDTVTYNAMINGFCKEKDFEAAYGVLDEMVKKNCKPDIISYNVIIGGLCKEGK 381

Query: 417 LDKAMDLLEDMPRRGCPPDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINK 476
             +  DL ED+PRRGC PDV+SYR IFDG C   + KEA  IL+EM+FKGY P + SI+K
Sbjct: 382 WIEGNDLFEDLPRRGCKPDVVSYRTIFDGFCNWRRFKEAMFILEEMVFKGYSPCSASIHK 441

Query: 477 LVDRLCQECNMELLWMVLNSLGRGNRMNMDMWARVVAFV-YKENLSESSNLIDSLI 527
           L+D LCQE N+E +  VLNS+G  N +++  W  V++ V  K+  S+S NLID+L+
Sbjct: 442 LLDGLCQEGNVEFISKVLNSIGNANVVDLGTWGSVISVVCKKDKQSDSINLIDTLL 497

BLAST of CSPI04G22400 vs. NCBI nr
Match: gi|225461712|ref|XP_002283237.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Vitis vinifera])

HSP 1 Score: 578.2 bits (1489), Expect = 1.5e-161
Identity = 289/456 (63.38%), Postives = 348/456 (76.32%), Query Frame = 1

Query: 69  MKTQKPISPFRLSSLLRLQKDPTLALQLFLNPNPSSSEPPKPFRYSLLSYDLIISKLGRA 128
           M   KPISPFRLSSLLRLQ DP LALQLF NPNP     PKPFRY+ LSYDLII+KLGR+
Sbjct: 1   MNPPKPISPFRLSSLLRLQNDPKLALQLFQNPNPD----PKPFRYTHLSYDLIITKLGRS 60

Query: 129 KMFDEMEEILQQLKQETRFAPHEVIFCNVIAFYGRAHLPDRAFQVFERIPSFRCKRTVKS 188
           +MF EME+IL QL++ETRF+P E+IFCNVI+FYGRA LPDRA Q FE IP FRC+RTVKS
Sbjct: 61  RMFHEMEQILSQLRRETRFSPKEIIFCNVISFYGRARLPDRAIQTFESIPEFRCQRTVKS 120

Query: 189 VNSLLAALLKNRQLEKMTQVFVDISNYGSPDACTFNILIHAACLCGDLDAVWGVFDEMQK 248
           +NSLL ALLK ++ EK   +   I  + +PD CT+N+LI+A C  G L   W VFDEM +
Sbjct: 121 LNSLLNALLKCKEFEKFDGILSGIDKFATPDVCTYNVLINACCSSGSLGDAWNVFDEMLR 180

Query: 249 RGVKPNVVTFGTLIYGLSLNSKLKEALRLKEDMVKVYMIKPNASIYTTLIKGFCGVGELN 308
           + V PNVVTFGTLI GL  +S+L EA RLKEDMVKV+ +KPNA +Y +L+KG C V EL+
Sbjct: 181 KHVCPNVVTFGTLISGLCGDSRLDEAFRLKEDMVKVFNVKPNAFVYASLMKGLCRVNELS 240

Query: 309 FAFKLKEEMVTSNVKLVSAVYSTLISALFKHGRKEEVSDILREMGENGCKPDTVTYNAII 368
            AF+LK+EMV + ++L S +YSTLI+ALFK GRK+EV  +L EM ENGCKPDTVTYNA+I
Sbjct: 241 LAFELKKEMVANKLRLDSGIYSTLIAALFKVGRKDEVFVVLEEMRENGCKPDTVTYNAMI 300

Query: 369 NGHCKENDLESAHRVMDEMVEKGCKPDVFSFNTIIGWLCKEGKLDKAMDLLEDMPRRGCP 428
           +G C E D E+A+ V++EMV KGCKPDV S+N II  LCKEGK  +A DL EDMPRRGC 
Sbjct: 301 SGFCNEKDFEAAYGVLEEMVAKGCKPDVISYNVIISGLCKEGKWREANDLFEDMPRRGCT 360

Query: 429 PDVLSYRIIFDGLCEMMQLKEATSILDEMIFKGYVPRNESINKLVDRLCQECNMELLWMV 488
           PDV SYRI+FDGLCE MQ  EA  ILDEM+FKGY P++ S  K V+ LCQE N+ELL  V
Sbjct: 361 PDVGSYRILFDGLCEGMQFNEAAFILDEMVFKGYAPKSASKTKFVEALCQEGNLELLCKV 420

Query: 489 LNSLGRGNRMNMDMWARVVAFV-YKENLSESSNLID 524
           LNSL +GN ++ D W+  V+ V  KE LS  S L+D
Sbjct: 421 LNSLVKGNVIDGDAWSLAVSKVCKKEKLSNGSELVD 452

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR79_ARATH3.7e-14454.95Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis th... [more]
PP392_ARATH1.6e-5130.28Pentatricopeptide repeat-containing protein At5g18475 OS=Arabidopsis thaliana GN... [more]
PP388_ARATH4.8e-5129.14Pentatricopeptide repeat-containing protein At5g16420, mitochondrial OS=Arabidop... [more]
PPR20_ARATH4.5e-4933.66Pentatricopeptide repeat-containing protein At1g07740, mitochondrial OS=Arabidop... [more]
PP444_ARATH3.8e-4829.90Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KZF4_CUCSA7.0e-28393.55Uncharacterized protein OS=Cucumis sativus GN=Csa_4G622790 PE=4 SV=1[more]
M5WZC0_PRUPE7.5e-16062.17Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa022285mg PE=4 SV=1[more]
A0A061F8G6_THECC7.2e-15559.26Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 2 OS=Theobr... [more]
A0A061F7L0_THECC6.1e-15459.04Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobr... [more]
B9I1S4_POPTR1.8e-15360.13Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT1G53330.12.1e-14554.95 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G18475.19.3e-5330.28 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G16420.12.7e-5229.14 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G07740.12.5e-5033.66 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G64320.12.1e-4929.90 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449456681|ref|XP_004146077.1|9.0e-30899.62PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Cucum... [more]
gi|700199862|gb|KGN55020.1|1.0e-28293.55hypothetical protein Csa_4G622790 [Cucumis sativus][more]
gi|659103014|ref|XP_008452430.1|6.1e-28091.27PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Cucum... [more]
gi|1009131812|ref|XP_015883043.1|2.7e-17164.92PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Zizip... [more]
gi|225461712|ref|XP_002283237.1|1.5e-16163.38PREDICTED: putative pentatricopeptide repeat-containing protein At1g53330 [Vitis... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009790 embryo development
biological_process GO:0010154 fruit development
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G22400.1CSPI04G22400.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 117..143
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 355..388
score: 2.5E-14coord: 286..318
score: 3.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 218..265
score: 6.8E-15coord: 394..442
score: 3.8
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 221..255
score: 1.7E-9coord: 293..323
score: 2.3E-4coord: 362..396
score: 2.1E-12coord: 328..361
score: 4.5E-8coord: 397..431
score: 1.2E-10coord: 433..464
score: 5.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 430..464
score: 10.435coord: 325..359
score: 11.06coord: 114..144
score: 7.574coord: 150..184
score: 7.004coord: 290..324
score: 9.931coord: 254..284
score: 7.903coord: 185..215
score: 6.029coord: 219..253
score: 12.693coord: 360..394
score: 14.37coord: 395..429
score: 13
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 27..483
score: 1.2E
NoneNo IPR availablePROFILEPS51257PROKAR_LIPOPROTEINcoord: 1..20
score: