CSPI01G12040 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI01G12040
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationChr1: 7593671 .. 7595977 (-)
RNA-Seq ExpressionCSPI01G12040
SyntenyCSPI01G12040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCCTCCGTTGGGTATATGATTTCTCACTCTGCTTCTCTTCTTCTTCTTCGTTCTAAGAGGAGCTCTCTTCGGTTCTCATTCATGGATCAGAAGCTCTTCTCCAAAGCTTTAACACGCTATGCTCTAGCTAGCCGATCTTACCACACGACTCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGAATCAGCGTGGAGTCGGAACTTGACGGTTGGGTCCAGGAGGGGAAGAAGCTACGAGTCGCTGAGCTTCAGAGAATCATTCGCGACTTTCGCAAGCGCAATCGTTTTAGCCAAGCTCTTGAGGTTCGAATTATTCTTTTCTTCTGGTTTGAGTATTTAGATGTGGAGAAAGAAATGAACTGTATAAAAATGATGCATCGTGTGAGAAGAATGTGCCTTTTAGACAATTCTTACTTCGAATCTCGAGATTACGGTGAATGGGGAATGGGGTTGTTTCAAATTATCGAAAGATTTAGTTTGGAGAAATGGAAAGCACGTAGGTTTTTTATGCTAAGAAATCCGAAGCTCTTACCTTACGTTCTTTGATTAAGTGAGCTTGTTCGAGTAACTTCGTTAGTTCACAGAATATGGACTTTTCAGAAATGGAATTATGTCATTTCTCCCCTACATGATCAGGAATTTGATTTCACTCGCAAGGAATTTAACTCTTTGTAGATAATGATATATGAGCTAGAAGTATGTGGGTTTGAACATGTTTTTCGTGGAGTTTCACTGTAATTGAAGATGAGTATTACTGCAGGTGTCCGAATGGATGAAAAAAAGTGGTGCCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATTTGATTGGCCGAGTACGAGGTTCTCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTATTAAAACATATGGTGCTCTTCTGAATTGCTATGTTCGACAGCGGCAAGTGGACAAATCCCTCTCCCATTTTCAAAAAATGAAAGAGTTGGGTTTTGCAACTTCAGAGCTCACTTACAATGACATCATGTGTTTGTACACAAGAGTTGGCCAGCATGAGAAGGTCCCTGAGGTGCTAGCAGAGATGAAAGAGAATAATGTTTCTCCCGACAACTTTAGTTATAGAATCTGCATCAGTTCGTATGGTGCAAGAAAAGATATTGAGGGGATGGAGAATGTATTGAAAGAGATGGAATCTCAACCTCATATTGTAATGGACTGGAACACATATGCAGTAGTTGCAAACTTCTTTATAAAAGCTGCTCTTACTGATAAGGCAGTTGATGCCTTGAAAAAATCAGAAGAGAAACTGAAAAGCAGTAACGATAGAATCGGCCATAACCAGCTGATCTCGCTTTATGCAACCTTAGGTAACAAGGAAAAGGTGCTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGAATCATCAATAGGGACTACATCACGATGCTTGAATCTTTGGTGAGACTAGGTGAACTTGAAGAAGCTGAGAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACATTGTCATTATTGGATATATTGACAAGGGAATGTGTGAGAGAGGTGAAACACTTCTTGAAAACTTGAAGCAGAATGAAAAGGCTACCACACCAAACAGTTGGGGTGCTCTGGCTGTTAAGTATCTGGACCTGGGTGAGACCGAAAAAGCCTTAGAGTGTATGATGACAGCCCTTTCTGTAAACATTGGTAAAGGATGGAAGCCTAATCTTCGGGTGATCACAGGACTATTGAATTGGCTTGGTGATAAGGGCATTGTAGAAGAAGTAGAAGCTTTTGTAAGCGCATTGAGGTCTGTCACTCCAGTGAATAGAGAGATGTATCATGCCTTGTTAAAGGTTTATATAAGAGCTGATAAAGAAGTAAAGGAGGTGTTAAACAACATGAAGGCTGATAAAATAGATGAAGATGAAGAAACCAAGAAAATTCTTGGCACTTAGGAAGAAACAACTGAACGTAAGAGCGTTGCCTCATTATTTTTGTTATTGTATCTCATTATTGATGGTGTCCATCTGCCAAATCTTGCTAGGGTTTCAACAATGCAGACGCTTGTTTATCAAATTCTTCACTTGAAATGAGTAGAAGTAGCTGATTTCATTGCCAAGTTTGCTGCTTTTGCTTCGCTTAGTTTTTAATTAAAGTTTTGTGGGCTAATAAAGTGAGGTTGCATGCAGGGATGATTGTGTATTTCATACGTAGGGGGCAAAAACGTTGAAATGGTGGTTTTCAATTAAGA

mRNA sequence

GCCTCCGTTGGGTATATGATTTCTCACTCTGCTTCTCTTCTTCTTCTTCGTTCTAAGAGGAGCTCTCTTCGGTTCTCATTCATGGATCAGAAGCTCTTCTCCAAAGCTTTAACACGCTATGCTCTAGCTAGCCGATCTTACCACACGACTCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGAATCAGCGTGGAGTCGGAACTTGACGGTTGGGTCCAGGAGGGGAAGAAGCTACGAGTCGCTGAGCTTCAGAGAATCATTCGCGACTTTCGCAAGCGCAATCGTTTTAGCCAAGCTCTTGAGGTGTCCGAATGGATGAAAAAAAGTGGTGCCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATTTGATTGGCCGAGTACGAGGTTCTCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTATTAAAACATATGGTGCTCTTCTGAATTGCTATGTTCGACAGCGGCAAGTGGACAAATCCCTCTCCCATTTTCAAAAAATGAAAGAGTTGGGTTTTGCAACTTCAGAGCTCACTTACAATGACATCATGTGTTTGTACACAAGAGTTGGCCAGCATGAGAAGGTCCCTGAGGTGCTAGCAGAGATGAAAGAGAATAATGTTTCTCCCGACAACTTTAGTTATAGAATCTGCATCAGTTCGTATGGTGCAAGAAAAGATATTGAGGGGATGGAGAATGTATTGAAAGAGATGGAATCTCAACCTCATATTGTAATGGACTGGAACACATATGCAGTAGTTGCAAACTTCTTTATAAAAGCTGCTCTTACTGATAAGGCAGTTGATGCCTTGAAAAAATCAGAAGAGAAACTGAAAAGCAGTAACGATAGAATCGGCCATAACCAGCTGATCTCGCTTTATGCAACCTTAGGTAACAAGGAAAAGGTGCTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGAATCATCAATAGGGACTACATCACGATGCTTGAATCTTTGGTGAGACTAGGTGAACTTGAAGAAGCTGAGAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACATTGTCATTATTGGATATATTGACAAGGGAATGTGTGAGAGAGGTGAAACACTTCTTGAAAACTTGAAGCAGAATGAAAAGGCTACCACACCAAACAGTTGGGGTGCTCTGGCTGTTAAGTATCTGGACCTGGGTGAGACCGAAAAAGCCTTAGAGTGTATGATGACAGCCCTTTCTGTAAACATTGGTAAAGGATGGAAGCCTAATCTTCGGGTGATCACAGGACTATTGAATTGGCTTGGTGATAAGGGCATTGTAGAAGAAGTAGAAGCTTTTGTAAGCGCATTGAGGTCTGTCACTCCAGTGAATAGAGAGATGTATCATGCCTTGTTAAAGGTTTATATAAGAGCTGATAAAGAAGTAAAGGAGGTGTTAAACAACATGAAGGCTGATAAAATAGATGAAGATGAAGAAACCAAGAAAATTCTTGGCACTTAGGAAGAAACAACTGAACGTAAGAGCGTTGCCTCATTATTTTTGTTATTGTATCTCATTATTGATGGTGTCCATCTGCCAAATCTTGCTAGGGTTTCAACAATGCAGACGCTTGTTTATCAAATTCTTCACTTGAAATGAGTAGAAGTAGCTGATTTCATTGCCAAGTTTGCTGCTTTTGCTTCGCTTAGTTTTTAATTAAAGTTTTGTGGGCTAATAAAGTGAGGTTGCATGCAGGGATGATTGTGTATTTCATACGTAGGGGGCAAAAACGTTGAAATGGTGGTTTTCAATTAAGA

Coding sequence (CDS)

ATGATTTCTCACTCTGCTTCTCTTCTTCTTCTTCGTTCTAAGAGGAGCTCTCTTCGGTTCTCATTCATGGATCAGAAGCTCTTCTCCAAAGCTTTAACACGCTATGCTCTAGCTAGCCGATCTTACCACACGACTCGGTTGAAAAAGGCGACTCTATATGCCAAAATTAGTCCACTGGGCGATCCTAGAATCAGCGTGGAGTCGGAACTTGACGGTTGGGTCCAGGAGGGGAAGAAGCTACGAGTCGCTGAGCTTCAGAGAATCATTCGCGACTTTCGCAAGCGCAATCGTTTTAGCCAAGCTCTTGAGGTGTCCGAATGGATGAAAAAAAGTGGTGCCTGCATATTTTCACCAACCGAGCATGCGGTGCAATTGGATTTGATTGGCCGAGTACGAGGTTCTCTTTCTGCTGAAAACTATTTCAATCAGTTGAAGGAGCAAGACCAGACTATTAAAACATATGGTGCTCTTCTGAATTGCTATGTTCGACAGCGGCAAGTGGACAAATCCCTCTCCCATTTTCAAAAAATGAAAGAGTTGGGTTTTGCAACTTCAGAGCTCACTTACAATGACATCATGTGTTTGTACACAAGAGTTGGCCAGCATGAGAAGGTCCCTGAGGTGCTAGCAGAGATGAAAGAGAATAATGTTTCTCCCGACAACTTTAGTTATAGAATCTGCATCAGTTCGTATGGTGCAAGAAAAGATATTGAGGGGATGGAGAATGTATTGAAAGAGATGGAATCTCAACCTCATATTGTAATGGACTGGAACACATATGCAGTAGTTGCAAACTTCTTTATAAAAGCTGCTCTTACTGATAAGGCAGTTGATGCCTTGAAAAAATCAGAAGAGAAACTGAAAAGCAGTAACGATAGAATCGGCCATAACCAGCTGATCTCGCTTTATGCAACCTTAGGTAACAAGGAAAAGGTGCTGAGATTGTGGAATCTGGATAAAACTGCTACTACGAGAATCATCAATAGGGACTACATCACGATGCTTGAATCTTTGGTGAGACTAGGTGAACTTGAAGAAGCTGAGAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACATTGTCATTATTGGATATATTGACAAGGGAATGTGTGAGAGAGGTGAAACACTTCTTGAAAACTTGAAGCAGAATGAAAAGGCTACCACACCAAACAGTTGGGGTGCTCTGGCTGTTAAGTATCTGGACCTGGGTGAGACCGAAAAAGCCTTAGAGTGTATGATGACAGCCCTTTCTGTAAACATTGGTAAAGGATGGAAGCCTAATCTTCGGGTGATCACAGGACTATTGAATTGGCTTGGTGATAAGGGCATTGTAGAAGAAGTAGAAGCTTTTGTAAGCGCATTGAGGTCTGTCACTCCAGTGAATAGAGAGATGTATCATGCCTTGTTAAAGGTTTATATAAGAGCTGATAAAGAAGTAAAGGAGGTGTTAAACAACATGAAGGCTGATAAAATAGATGAAGATGAAGAAACCAAGAAAATTCTTGGCACTTAG

Protein sequence

MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETKKILGT*
Homology
BLAST of CSPI01G12040 vs. ExPASy Swiss-Prot
Match: Q84JR3 (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 7.0e-159
Identity = 279/478 (58.37%), Postives = 358/478 (74.90%), Query Frame = 0

Query: 37  LASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRN 96
           +ASR Y+T R+KK TLY+KISPLGDP+ SV  EL  WVQ GKK+ VAEL RI+ D R+R 
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 97  RFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGA 156
           RF  ALEVS+WM ++G C+FSPTEHAV LDLIGRV G ++AE YF  LKEQ +  KTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 157 LLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENN 216
           LLNCYVRQ+ V+KSL HF+KMKE+GF TS LTYN+IMCLYT +GQHEKVP+VL EMKE N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 217 VSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKA 276
           V+PDN+SYRICI+++GA  D+E +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 277 VDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLE 336
           V+ LK SE +L+   D  G+N LI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+
Sbjct: 252 VELLKMSENRLE-KKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 337 SLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKAT 396
           SLV++  L EAE+VL EW+SSGNCYDFRVPN VI GYI K M E+ E +LE+L +  KAT
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 397 TPNSWGALAVKYLDLGETEKALECMMTALSVNIG-KGWKPNLRVITGLLNWLGDKGIVEE 456
           TP SW  +A  Y + G  E A +CM TAL V +G + W+P L ++T +L+W+GD+G ++E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 457 VEAFVSALRSVTPVNREMYHALLKVYIR-ADKEVKEVLNNMKADKIDEDEETKKILGT 513
           VE+FV++LR+   VN++MYHAL+K  IR   + +  +L  MK DKI+ DEET  IL T
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of CSPI01G12040 vs. ExPASy Swiss-Prot
Match: Q9SKU6 (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 290.0 bits (741), Expect = 5.1e-77
Identity = 152/428 (35.51%), Postives = 249/428 (58.18%), Query Frame = 0

Query: 51  TLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKK 110
           TL  +++  GDP  S+   LDGW+ +G  ++ +EL  II+  RK +RFS AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 111 SGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 170
                 S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 171 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 230
              FQ+MKELGF    L YN ++ LY R G++  V ++L EM++  V PD F+    + +
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 231 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 290
           Y    D+EGME  L   E+   + +DW TYA  AN +IKA LT+KA++ L+KSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 291 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 350
             +  +  L+S Y   G KE+V RLW+L K       N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 351 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 410
           ++EWE+  + +D R+P+++I GY  KGM E+ E ++  L Q  +    ++W  LA+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 411 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 470
            G+ EKA+E    A+ V+   GW+P+  V+   +++L  +    ++E     LR ++   
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYLEGQ---RDMEGLRKILRLLSERG 458

Query: 471 REMYHALL 479
              Y  LL
Sbjct: 459 HISYDQLL 461

BLAST of CSPI01G12040 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 277.3 bits (708), Expect = 3.4e-73
Identity = 154/453 (34.00%), Postives = 256/453 (56.51%), Query Frame = 0

Query: 52  LYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKS 111
           +Y KIS +  P +   S L+ W + G+KL   EL R++++ RK  R +QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 112 GACI-FSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 171
           G     S ++ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  +K+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 172 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 231
            +    M++ G+A   L +N +M LY  + +++KV  ++ EMK+ ++  D +SY I +SS
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 232 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 291
            G+   +E ME V ++M+S   I  +W T++ +A  +IK   T+KA DAL+K E ++   
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 292 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 351
           N RI ++ L+SLY +LGNK+++ R+W++ K+    I N  Y  ++ SLVR+G++E AEKV
Sbjct: 309 N-RIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKV 368

Query: 352 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 411
            +EW    + YD R+PN+++  Y+     E  E L +++ +     + ++W  LAV +  
Sbjct: 369 YEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTR 428

Query: 412 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 471
                +AL C+  A S      W+P + +++G      ++  V   EA +  LR    + 
Sbjct: 429 KRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLE 488

Query: 472 REMYHALLKVYIRADKEVKEVLNNMKADKIDED 504
            + Y AL+      D +    +NN + D  + D
Sbjct: 489 DKSYLALI------DVDENRTVNNSEIDAHETD 514

BLAST of CSPI01G12040 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 7.9e-70
Identity = 146/470 (31.06%), Postives = 254/470 (54.04%), Query Frame = 0

Query: 44  TTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALE 103
           T +  +  LY ++   G   + V  +L+ +++  K +   E+   I+  R R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 104 VSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVR 163
           +SE M++ G    + ++ A+ LDL+ + R   + ENYF  L E  +T  TYG+LLNCY +
Sbjct: 77  LSEVMEERG-MNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 164 QRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFS 223
           +   +K+     KMKEL    S ++YN +M LYT+ G+ EKVP ++ E+K  NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 224 YRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKS 283
           Y + + +  A  DI G+E V++EM     +  DW TY+ +A+ ++ A L+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 284 EEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGE 343
           E K  +  D   +  LI+LY  LG   +V R+W   + A  +  N  Y+ M++ LV+L +
Sbjct: 257 EMK-NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLND 316

Query: 344 LEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGA 403
           L  AE + KEW+++ + YD R+ N++I  Y  +G+ ++   L E   +        +W  
Sbjct: 317 LPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEI 376

Query: 404 LAVKYLDLGETEKALECMMTALSVNIGKG--WKPNLRVITGLLNWLGDKGIVEEVEAFVS 463
               Y+  G+  +ALECM  A+S+  G G  W P+   +  L+++   K  V   E  + 
Sbjct: 377 FMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLE 436

Query: 464 ALRSVTP-VNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETKKIL 511
            L++ T  +  E++  L++ Y  A K    +   +K + ++ +E TKK+L
Sbjct: 437 ILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of CSPI01G12040 vs. ExPASy Swiss-Prot
Match: Q9SY07 (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 3.3e-68
Identity = 150/439 (34.17%), Postives = 250/439 (56.95%), Query Frame = 0

Query: 73  WVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVR 132
           W +EG  +R  EL RI+R+ RK  R+  ALE+ EWM           ++AV LDLI ++R
Sbjct: 84  WKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIR 143

Query: 133 GSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDI 192
           G  SAE +F  + +Q +      +LL+ YV+ +  DK+ + F+KM E GF  S L YN +
Sbjct: 144 GLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHM 203

Query: 193 MCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENV-LKEMESQP 252
           + +Y   GQ EKVP ++ E+K    SPD  +Y + ++++ +  D+EG E V LK  E + 
Sbjct: 204 LSMYISRGQFEKVPVLIKELK-IRTSPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEK- 263

Query: 253 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 312
            +  DW TY+V+ N + K    +KA  ALK+  EKL S  +R+ +  LISL+A LG+K+ 
Sbjct: 264 -LNPDWVTYSVLTNLYAKTDNVEKARLALKEM-EKLVSKKNRVAYASLISLHANLGDKDG 323

Query: 313 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 372
           V   W   K++  ++ + +Y++M+ ++V+LGE E+A+ +  EWES     D R+PN+++ 
Sbjct: 324 VNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILA 383

Query: 373 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGK 432
            Y+++     GE   E + +     + ++W  L   YL   + EK L+C   A  ++  K
Sbjct: 384 EYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGKA--IDSVK 443

Query: 433 GWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEV 492
            W  N+R++ G    L ++G V+  E  ++ L+    VN ++Y++LL+ Y +A +    V
Sbjct: 444 KWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIV 503

Query: 493 LNNMKADKIDEDEETKKIL 511
              M  D ++ DEETK+++
Sbjct: 504 EERMAKDNVELDEETKELI 516

BLAST of CSPI01G12040 vs. ExPASy TrEMBL
Match: A0A0A0LXD9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073780 PE=4 SV=1)

HSP 1 Score: 1008.1 bits (2605), Expect = 1.4e-290
Identity = 512/512 (100.00%), Postives = 512/512 (100.00%), Query Frame = 0

Query: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60
           MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG
Sbjct: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60

Query: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120
           DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE
Sbjct: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120

Query: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180
           HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL
Sbjct: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180

Query: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240
           GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM
Sbjct: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240

Query: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300
           ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI
Sbjct: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300

Query: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360
           SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC
Sbjct: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360

Query: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420
           YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC
Sbjct: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420

Query: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480
           MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV
Sbjct: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480

Query: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 513
           YIRADKEVKEVLNNMKADKIDEDEETKKILGT
Sbjct: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 512

BLAST of CSPI01G12040 vs. ExPASy TrEMBL
Match: A0A5A7UM45 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00740 PE=4 SV=1)

HSP 1 Score: 903.7 bits (2334), Expect = 3.6e-259
Identity = 455/490 (92.86%), Postives = 467/490 (95.31%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQKLFSKALT YALASRSYHTTRLKKATLYAKISPLGDP ISVESELDGWVQEGKK+RV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRIIRDFRKR+RFSQAL+VSEWMKKSGACIFSPTEHAVQLDLIGRVRG LSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQ IKTYGALLNCYVRQ+QVDKSLSH QKMKELGFATSELTYNDIMCLYTRVGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVPEVLAEMK NNVSPDNFSYRICI+SYGARKD+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA LTDKAVDAL+KSEEKLK S DRIGHN LISLYATLGNKEKVLR+WNLDKTA
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLK-SKDRIGHNHLISLYATLGNKEKVLRVWNLDKTA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           ETLLENL QNEKATTPNSWGA+AVKYLD GETEKALECM  ALSVN  KGWKPN RVITG
Sbjct: 361 ETLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLGDKGIVEEVEAFVSALRSV PVNREMYHALLKVYIRADKEV EVLN MKADKI+E
Sbjct: 421 VLNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 489

BLAST of CSPI01G12040 vs. ExPASy TrEMBL
Match: A0A1S3B5N2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103486296 PE=4 SV=1)

HSP 1 Score: 903.7 bits (2334), Expect = 3.6e-259
Identity = 455/490 (92.86%), Postives = 467/490 (95.31%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQKLFSKALT YALASRSYHTTRLKKATLYAKISPLGDP ISVESELDGWVQEGKK+RV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRIIRDFRKR+RFSQAL+VSEWMKKSGACIFSPTEHAVQLDLIGRVRG LSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQ IKTYGALLNCYVRQ+QVDKSLSH QKMKELGFATSELTYNDIMCLYTRVGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVPEVLAEMK NNVSPDNFSYRICI+SYGARKD+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA LTDKAVDAL+KSEEKLK S DRIGHN LISLYATLGNKEKVLR+WNLDKTA
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLK-SKDRIGHNHLISLYATLGNKEKVLRVWNLDKTA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           ETLLENL QNEKATTPNSWGA+AVKYLD GETEKALECM  ALSVN  KGWKPN RVITG
Sbjct: 361 ETLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLGDKGIVEEVEAFVSALRSV PVNREMYHALLKVYIRADKEV EVLN MKADKI+E
Sbjct: 421 VLNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 489

BLAST of CSPI01G12040 vs. ExPASy TrEMBL
Match: A0A6J1K124 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490120 PE=4 SV=1)

HSP 1 Score: 829.3 bits (2141), Expect = 8.6e-237
Identity = 409/490 (83.47%), Postives = 445/490 (90.82%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQ  FSKALTRYALA R YHT RLKKATLYAKISPLGDP +SVE ELDGWV+EGKK+R+
Sbjct: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RF+QALEVSEWMKK+G CIFSP+EHAVQLDLIGRVRG LSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQT KTYGALLNCYVRQRQV+KSLSH QKMKE+GFATSELTYND+MCLYT VGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           +KVPEVLAEMKE NVSPDNFSYRICI+SYGAR+D+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA L DKAVDALKK+EE+LK S DRIGHN LISLY TLGNKEKVLRLWNLDKT 
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLK-SKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTD 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           E LLE+L +  K TTPNSWGA+AV+Y+D GETEK++ECM  AL++N+ KGWKPNLRVITG
Sbjct: 361 EALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLG+   +EEVEAFV +LRS  PVNREMYHAL+KV+IR  KEV E+LN MK+DKIDE
Sbjct: 421 ILNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 488

BLAST of CSPI01G12040 vs. ExPASy TrEMBL
Match: A0A6J1GTN5 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457015 PE=4 SV=1)

HSP 1 Score: 824.3 bits (2128), Expect = 2.8e-235
Identity = 407/490 (83.06%), Postives = 444/490 (90.61%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQ  FSKALTRYALA R YHT RLKKATLYAKISPLGDP +SVE  LDGWV+EGKK+R+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RF+QALEVSEWMKK+G CIFSP+EHAVQLDLIGRVRG LSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQT KTYGALLNCYVRQRQV+KSLSH QKMKE+GFATSELTYND+MCLYT VGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           +KVPEVLAEMKE NVSPDNFSYRICI+SYGAR+D+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA L DKAVDALKK+EE+LK S DRIGHN LISLYATLGNKEKVLRLWNLDKT 
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLK-SKDRIGHNHLISLYATLGNKEKVLRLWNLDKTD 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTR+INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           E LLE+L +  K TTPN WGA+AV+Y+D  ETEK++ECM  AL++N+ KGWKPNLRVITG
Sbjct: 361 EALLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLG+   +EEVEAFV +LRSV PVNREMYHAL+K +IR  KEV E+LN MK+DKIDE
Sbjct: 421 ILNWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 488

BLAST of CSPI01G12040 vs. NCBI nr
Match: XP_011653147.2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Cucumis sativus] >KGN64671.1 hypothetical protein Csa_013825 [Cucumis sativus])

HSP 1 Score: 1008.1 bits (2605), Expect = 2.8e-290
Identity = 512/512 (100.00%), Postives = 512/512 (100.00%), Query Frame = 0

Query: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60
           MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG
Sbjct: 1   MISHSASLLLLRSKRSSLRFSFMDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLG 60

Query: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120
           DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE
Sbjct: 61  DPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTE 120

Query: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180
           HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL
Sbjct: 121 HAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKEL 180

Query: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240
           GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM
Sbjct: 181 GFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGM 240

Query: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300
           ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI
Sbjct: 241 ENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLI 300

Query: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360
           SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC
Sbjct: 301 SLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNC 360

Query: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420
           YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC
Sbjct: 361 YDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALEC 420

Query: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480
           MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV
Sbjct: 421 MMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKV 480

Query: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 513
           YIRADKEVKEVLNNMKADKIDEDEETKKILGT
Sbjct: 481 YIRADKEVKEVLNNMKADKIDEDEETKKILGT 512

BLAST of CSPI01G12040 vs. NCBI nr
Match: XP_008442422.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Cucumis melo] >KAA0056983.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26410.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 903.7 bits (2334), Expect = 7.4e-259
Identity = 455/490 (92.86%), Postives = 467/490 (95.31%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQKLFSKALT YALASRSYHTTRLKKATLYAKISPLGDP ISVESELDGWVQEGKK+RV
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRIIRDFRKR+RFSQAL+VSEWMKKSGACIFSPTEHAVQLDLIGRVRG LSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQ IKTYGALLNCYVRQ+QVDKSLSH QKMKELGFATSELTYNDIMCLYTRVGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           EKVPEVLAEMK NNVSPDNFSYRICI+SYGARKD+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA LTDKAVDAL+KSEEKLK S DRIGHN LISLYATLGNKEKVLR+WNLDKTA
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLK-SKDRIGHNHLISLYATLGNKEKVLRVWNLDKTA 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           ETLLENL QNEKATTPNSWGA+AVKYLD GETEKALECM  ALSVN  KGWKPN RVITG
Sbjct: 361 ETLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLGDKGIVEEVEAFVSALRSV PVNREMYHALLKVYIRADKEV EVLN MKADKI+E
Sbjct: 421 VLNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 489

BLAST of CSPI01G12040 vs. NCBI nr
Match: XP_038893646.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Benincasa hispida] >XP_038893647.1 pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Benincasa hispida])

HSP 1 Score: 853.6 bits (2204), Expect = 8.8e-244
Identity = 424/490 (86.53%), Postives = 453/490 (92.45%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQKL SK LTRYA+ASRSYHT RLKKATLYAKISPLGDP ISVE ELD WVQEGKK+RV
Sbjct: 1   MDQKLLSKVLTRYAIASRSYHTNRLKKATLYAKISPLGDPSISVEPELDCWVQEGKKVRV 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RF+QALEVSEWMKK+G CIFSPTEHAVQLDLIGRVRG LSAE+YF+
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKNGVCIFSPTEHAVQLDLIGRVRGYLSAESYFS 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QL EQDQT KTYGALLNCYVRQRQV+KSLSH QKMKE+GFATS+LTYNDIMCLYT VGQH
Sbjct: 121 QLNEQDQTGKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSQLTYNDIMCLYTNVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           +KVPEVLAEMKE NVSPDNFSYRICI+SYGAR D+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARHDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA LTDKAV+AL+KSEE+LK S DRIGHN LISLYATLGNKEKVLRLWNLDKT 
Sbjct: 241 VANFFIKAGLTDKAVNALRKSEERLK-SKDRIGHNHLISLYATLGNKEKVLRLWNLDKTG 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           ETLLE+L + EKATTPNSWGA+AVKYLD GE +KA+ECM  ALS+N+ KGWKPNLRVIT 
Sbjct: 361 ETLLEDLMEKEKATTPNSWGAVAVKYLDQGENKKAVECMKAALSLNMDKGWKPNLRVITS 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLGDKGI+EEVEAFVSALRSV PVNREMYHAL+KVYIRA KEV E+LN MK+DKIDE
Sbjct: 421 VLNWLGDKGIIEEVEAFVSALRSVIPVNREMYHALIKVYIRAGKEVNELLNQMKSDKIDE 480

Query: 503 DEETKKILGT 513
           DEET+KILGT
Sbjct: 481 DEETQKILGT 489

BLAST of CSPI01G12040 vs. NCBI nr
Match: XP_022994385.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita maxima])

HSP 1 Score: 829.3 bits (2141), Expect = 1.8e-236
Identity = 409/490 (83.47%), Postives = 445/490 (90.82%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQ  FSKALTRYALA R YHT RLKKATLYAKISPLGDP +SVE ELDGWV+EGKK+R+
Sbjct: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RF+QALEVSEWMKK+G CIFSP+EHAVQLDLIGRVRG LSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQT KTYGALLNCYVRQRQV+KSLSH QKMKE+GFATSELTYND+MCLYT VGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           +KVPEVLAEMKE NVSPDNFSYRICI+SYGAR+D+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA L DKAVDALKK+EE+LK S DRIGHN LISLY TLGNKEKVLRLWNLDKT 
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLK-SKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTD 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           E LLE+L +  K TTPNSWGA+AV+Y+D GETEK++ECM  AL++N+ KGWKPNLRVITG
Sbjct: 361 EALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLG+   +EEVEAFV +LRS  PVNREMYHAL+KV+IR  KEV E+LN MK+DKIDE
Sbjct: 421 ILNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 488

BLAST of CSPI01G12040 vs. NCBI nr
Match: KAG7012430.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 827.8 bits (2137), Expect = 5.2e-236
Identity = 408/490 (83.27%), Postives = 446/490 (91.02%), Query Frame = 0

Query: 23  MDQKLFSKALTRYALASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRV 82
           MDQ  FSKALTRYALA R YHT RLKKATLYAKISPLGDP +SVE  LDGWV+EGKK+R+
Sbjct: 1   MDQ-FFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRI 60

Query: 83  AELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFN 142
           AELQRII D RKR RF+QALEVSEWMKK+G CIFSP+EHAVQLDLIGRVRG LSAE+YFN
Sbjct: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120

Query: 143 QLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQH 202
           QLKEQDQT KTYGALLNCYVRQRQV+KSLSH QKMKE+GFATSELTYND+MCLYT VGQH
Sbjct: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180

Query: 203 EKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAV 262
           +KVPEVLAEMKE NVSPDNFSYRICI+SYGAR+D+EGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 263 VANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTA 322
           VANFFIKA L DKAVDALKK+EE+LK S DRIGHN LISLYATLGNKEKVLRLWNLDKT 
Sbjct: 241 VANFFIKADLADKAVDALKKAEERLK-SKDRIGHNHLISLYATLGNKEKVLRLWNLDKTD 300

Query: 323 TTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERG 382
           TTR+INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPN VI+GYIDKGMCER 
Sbjct: 301 TTRLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERA 360

Query: 383 ETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGKGWKPNLRVITG 442
           E LLE+L +  K TTPNSWGA+AV+Y+D  ETEK++ECM  AL++N+ KGWKPNLRVITG
Sbjct: 361 EALLEDLMEKGKTTTPNSWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITG 420

Query: 443 LLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEVLNNMKADKIDE 502
           +LNWLG+ G +EEVEAFV +LRSV PVNREMYHAL+K +IR  KEV E+LN MK+DK+DE
Sbjct: 421 ILNWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDE 480

Query: 503 DEETKKILGT 513
           DEETKKILGT
Sbjct: 481 DEETKKILGT 488

BLAST of CSPI01G12040 vs. TAIR 10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 562.0 bits (1447), Expect = 4.9e-160
Identity = 279/478 (58.37%), Postives = 358/478 (74.90%), Query Frame = 0

Query: 37  LASRSYHTTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRN 96
           +ASR Y+T R+KK TLY+KISPLGDP+ SV  EL  WVQ GKK+ VAEL RI+ D R+R 
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 97  RFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGA 156
           RF  ALEVS+WM ++G C+FSPTEHAV LDLIGRV G ++AE YF  LKEQ +  KTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 157 LLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENN 216
           LLNCYVRQ+ V+KSL HF+KMKE+GF TS LTYN+IMCLYT +GQHEKVP+VL EMKE N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 217 VSPDNFSYRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKA 276
           V+PDN+SYRICI+++GA  D+E +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 277 VDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLE 336
           V+ LK SE +L+   D  G+N LI+LYA LG K +VLRLW+L+K    R IN+DY+T+L+
Sbjct: 252 VELLKMSENRLE-KKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 337 SLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKAT 396
           SLV++  L EAE+VL EW+SSGNCYDFRVPN VI GYI K M E+ E +LE+L +  KAT
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 397 TPNSWGALAVKYLDLGETEKALECMMTALSVNIG-KGWKPNLRVITGLLNWLGDKGIVEE 456
           TP SW  +A  Y + G  E A +CM TAL V +G + W+P L ++T +L+W+GD+G ++E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 457 VEAFVSALRSVTPVNREMYHALLKVYIR-ADKEVKEVLNNMKADKIDEDEETKKILGT 513
           VE+FV++LR+   VN++MYHAL+K  IR   + +  +L  MK DKI+ DEET  IL T
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of CSPI01G12040 vs. TAIR 10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 290.0 bits (741), Expect = 3.6e-78
Identity = 152/428 (35.51%), Postives = 249/428 (58.18%), Query Frame = 0

Query: 51  TLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKK 110
           TL  +++  GDP  S+   LDGW+ +G  ++ +EL  II+  RK +RFS AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 111 SGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 170
                 S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 171 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 230
              FQ+MKELGF    L YN ++ LY R G++  V ++L EM++  V PD F+    + +
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 231 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 290
           Y    D+EGME  L   E+   + +DW TYA  AN +IKA LT+KA++ L+KSE+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 291 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 350
             +  +  L+S Y   G KE+V RLW+L K       N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 351 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 410
           ++EWE+  + +D R+P+++I GY  KGM E+ E ++  L Q  +    ++W  LA+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 411 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 470
            G+ EKA+E    A+ V+   GW+P+  V+   +++L  +    ++E     LR ++   
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYLEGQ---RDMEGLRKILRLLSERG 458

Query: 471 REMYHALL 479
              Y  LL
Sbjct: 459 HISYDQLL 461

BLAST of CSPI01G12040 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 277.3 bits (708), Expect = 2.4e-74
Identity = 154/453 (34.00%), Postives = 256/453 (56.51%), Query Frame = 0

Query: 52  LYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKS 111
           +Y KIS +  P +   S L+ W + G+KL   EL R++++ RK  R +QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 112 GACI-FSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKS 171
           G     S ++ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  +K+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 172 LSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISS 231
            +    M++ G+A   L +N +M LY  + +++KV  ++ EMK+ ++  D +SY I +SS
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 232 YGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSS 291
            G+   +E ME V ++M+S   I  +W T++ +A  +IK   T+KA DAL+K E ++   
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 292 NDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKV 351
           N RI ++ L+SLY +LGNK+++ R+W++ K+    I N  Y  ++ SLVR+G++E AEKV
Sbjct: 309 N-RIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKV 368

Query: 352 LKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLD 411
            +EW    + YD R+PN+++  Y+     E  E L +++ +     + ++W  LAV +  
Sbjct: 369 YEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTR 428

Query: 412 LGETEKALECMMTALSVNIGKGWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVN 471
                +AL C+  A S      W+P + +++G      ++  V   EA +  LR    + 
Sbjct: 429 KRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLE 488

Query: 472 REMYHALLKVYIRADKEVKEVLNNMKADKIDED 504
            + Y AL+      D +    +NN + D  + D
Sbjct: 489 DKSYLALI------DVDENRTVNNSEIDAHETD 514

BLAST of CSPI01G12040 vs. TAIR 10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 266.2 bits (679), Expect = 5.6e-71
Identity = 146/470 (31.06%), Postives = 254/470 (54.04%), Query Frame = 0

Query: 44  TTRLKKATLYAKISPLGDPRISVESELDGWVQEGKKLRVAELQRIIRDFRKRNRFSQALE 103
           T +  +  LY ++   G   + V  +L+ +++  K +   E+   I+  R R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 104 VSEWMKKSGACIFSPTEHAVQLDLIGRVRGSLSAENYFNQLKEQDQTIKTYGALLNCYVR 163
           +SE M++ G    + ++ A+ LDL+ + R   + ENYF  L E  +T  TYG+LLNCY +
Sbjct: 77  LSEVMEERG-MNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 164 QRQVDKSLSHFQKMKELGFATSELTYNDIMCLYTRVGQHEKVPEVLAEMKENNVSPDNFS 223
           +   +K+     KMKEL    S ++YN +M LYT+ G+ EKVP ++ E+K  NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 224 YRICISSYGARKDIEGMENVLKEMESQPHIVMDWNTYAVVANFFIKAALTDKAVDALKKS 283
           Y + + +  A  DI G+E V++EM     +  DW TY+ +A+ ++ A L+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 284 EEKLKSSNDRIGHNQLISLYATLGNKEKVLRLWNLDKTATTRIINRDYITMLESLVRLGE 343
           E K  +  D   +  LI+LY  LG   +V R+W   + A  +  N  Y+ M++ LV+L +
Sbjct: 257 EMK-NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLND 316

Query: 344 LEEAEKVLKEWESSGNCYDFRVPNIVIIGYIDKGMCERGETLLENLKQNEKATTPNSWGA 403
           L  AE + KEW+++ + YD R+ N++I  Y  +G+ ++   L E   +        +W  
Sbjct: 317 LPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEI 376

Query: 404 LAVKYLDLGETEKALECMMTALSVNIGKG--WKPNLRVITGLLNWLGDKGIVEEVEAFVS 463
               Y+  G+  +ALECM  A+S+  G G  W P+   +  L+++   K  V   E  + 
Sbjct: 377 FMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLE 436

Query: 464 ALRSVTP-VNREMYHALLKVYIRADKEVKEVLNNMKADKIDEDEETKKIL 511
            L++ T  +  E++  L++ Y  A K    +   +K + ++ +E TKK+L
Sbjct: 437 ILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of CSPI01G12040 vs. TAIR 10
Match: AT4G02820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 260.8 bits (665), Expect = 2.4e-69
Identity = 150/439 (34.17%), Postives = 250/439 (56.95%), Query Frame = 0

Query: 73  WVQEGKKLRVAELQRIIRDFRKRNRFSQALEVSEWMKKSGACIFSPTEHAVQLDLIGRVR 132
           W +EG  +R  EL RI+R+ RK  R+  ALE+ EWM           ++AV LDLI ++R
Sbjct: 84  WKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIR 143

Query: 133 GSLSAENYFNQLKEQDQTIKTYGALLNCYVRQRQVDKSLSHFQKMKELGFATSELTYNDI 192
           G  SAE +F  + +Q +      +LL+ YV+ +  DK+ + F+KM E GF  S L YN +
Sbjct: 144 GLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHM 203

Query: 193 MCLYTRVGQHEKVPEVLAEMKENNVSPDNFSYRICISSYGARKDIEGMENV-LKEMESQP 252
           + +Y   GQ EKVP ++ E+K    SPD  +Y + ++++ +  D+EG E V LK  E + 
Sbjct: 204 LSMYISRGQFEKVPVLIKELK-IRTSPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEK- 263

Query: 253 HIVMDWNTYAVVANFFIKAALTDKAVDALKKSEEKLKSSNDRIGHNQLISLYATLGNKEK 312
            +  DW TY+V+ N + K    +KA  ALK+  EKL S  +R+ +  LISL+A LG+K+ 
Sbjct: 264 -LNPDWVTYSVLTNLYAKTDNVEKARLALKEM-EKLVSKKNRVAYASLISLHANLGDKDG 323

Query: 313 VLRLWNLDKTATTRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNIVII 372
           V   W   K++  ++ + +Y++M+ ++V+LGE E+A+ +  EWES     D R+PN+++ 
Sbjct: 324 VNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILA 383

Query: 373 GYIDKGMCERGETLLENLKQNEKATTPNSWGALAVKYLDLGETEKALECMMTALSVNIGK 432
            Y+++     GE   E + +     + ++W  L   YL   + EK L+C   A  ++  K
Sbjct: 384 EYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRKDMEKVLDCFGKA--IDSVK 443

Query: 433 GWKPNLRVITGLLNWLGDKGIVEEVEAFVSALRSVTPVNREMYHALLKVYIRADKEVKEV 492
            W  N+R++ G    L ++G V+  E  ++ L+    VN ++Y++LL+ Y +A +    V
Sbjct: 444 KWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIV 503

Query: 493 LNNMKADKIDEDEETKKIL 511
              M  D ++ DEETK+++
Sbjct: 504 EERMAKDNVELDEETKELI 516

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84JR37.0e-15958.37Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
Q9SKU65.1e-7735.51Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop... [more]
Q8LPS63.4e-7334.00Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX... [more]
O227147.9e-7031.06Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX... [more]
Q9SY073.3e-6834.17Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LXD91.4e-290100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073780 PE=4 SV=1[more]
A0A5A7UM453.6e-25992.86Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B5N23.6e-25992.86pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis ... [more]
A0A6J1K1248.6e-23783.47pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
A0A6J1GTN52.8e-23583.06pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
Match NameE-valueIdentityDescription
XP_011653147.22.8e-290100.00pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Cucumis sa... [more]
XP_008442422.17.4e-25992.86PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ... [more]
XP_038893646.18.8e-24486.53pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Benincasa ... [more]
XP_022994385.11.8e-23683.47pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
KAG7012430.15.2e-23683.27Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
AT4G21705.14.9e-16058.37Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G20710.13.6e-7835.51Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G02150.12.4e-7434.00Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G60770.15.6e-7131.06Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G02820.12.4e-6934.17Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 187..230
e-value: 1.7E-8
score: 34.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 188..220
e-value: 3.6E-6
score: 24.8
coord: 222..256
e-value: 0.0014
score: 16.6
coord: 331..361
e-value: 3.9E-4
score: 18.4
coord: 153..182
e-value: 2.6E-5
score: 22.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 331..358
e-value: 0.0025
score: 18.0
coord: 153..182
e-value: 4.5E-5
score: 23.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 185..219
score: 10.325603
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 150..184
score: 10.106377
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 82..298
e-value: 1.2E-27
score: 99.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 317..430
e-value: 1.4E-8
score: 36.2
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 334..427
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 143..428
NoneNo IPR availablePANTHERPTHR45717OS12G0527900 PROTEINcoord: 36..511
NoneNo IPR availablePANTHERPTHR45717:SF20OS07G0598500 PROTEINcoord: 36..511

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G12040.1CSPI01G12040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding