CSPI07G04560 (gene) Wild cucumber (PI 183967)

Name: CSPI07G04560
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr7 : 3406500 .. 3409024 (+)
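
The location string can be turned into a genomic span with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, not something this page states); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr7 : 3406500 .. 3409024 (+)'.

    Returns (chromosome, start, end, strand) with integer coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr7 : 3406500 .. 3409024 (+)")
# Assumption: coordinates are inclusive, so both endpoints are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under the inclusive-coordinate assumption the feature spans 2525 bases on the plus strand of Chr7.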
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GTTTATAGCCATTTTTGTTCGGAGTTCCGACTGAGTTGCTAAACGAAAGCTCTTTTCATTCTCAATTTGATTTTACATATATATCATTCATCAATTCTCATGGTCTTTTATCTTTGAATCAAGAAGATGATAAGGAAGCTCCGAAGCTGGAACAACAACCTCATTTCCAATCTCCTTATTCAAACTTCTAAAACCCTTTCCCTCCCCTTCTCTTCCACTCCGCCCCAATTAGCCATTCTCCGCCAAAAAATCATAAACATTCGAGCCCCTAAAATTTCAGTTGTTCCGGTACTGGAAAAGTGGGTTGGCGACGGCAGAGCTATTGGAAAACCGGAACTTCAATATCTTGTTCACCTCATGAAGGACTCTCGCCGTTTCAATCACGCTTTAGAGGTTCGCTTTTCCTTCTCTCACTTCCTCTAATTTCTTTAACTGAATTCTTAGTGTGTTGTATTGCAATACAGCTTTCTATGTTGAAGGAATTGGCTCTTTTCTAGCTTTGAAATGCGTTTTGGTGATCTAGCTGAGATGGTTTTGTAGTATTTGGTTATCTTTTCTTGCTCTAATTTTGGAAATGGGGTTTGTATAGTGAAGTTCCTGATGGTTTGTTTGAGAAATTGTTAGCAAAACTGTTTATAGGATTTGGGATTGGATGAATTCTAAAGATATTGTTGTAAAACGGATACTTGATTTTGAATGTGCAATATGATTACAGCTGCTGAAAGATAAGGACACTGAAAGTGAAGTTCGTTTAATTACCATTTTGTTTTCTAAAATTAAGCTTATAAACACTTGTTACATCTCTAGAATTTTGTTTTATGCTCTACTTTTTAAGGATTGATTTCAAAATCCAAGCATGTTTTGAAACTAACGGTAAGAAAGTTTTTTATTTTACAATTTCTTAACCATGATTTTCATCTTTTAAAAGCATATTTATTTATTTCAAAAATAGTTTTGTTTTTGAATTTGACTAAGAAGCCATATGTTTACTTGGAAAAGATGAAAATCATGGTTAAGAACTTGTGACAAAGCCAACACAATTTTTAAAAAGCACAAACTAAAACAAAAAACAATATGGTTACTAAACAGGGCCTAAGTCATCTTCAAACAAAATGTAAATGGTTTTTGAGTGGATCCTTAAGACAGATCTCTTTAACAGTTTTCAAGCAGGTTCTGAATTTCTTCGTTTCTTGATTTTGTCCAGATATCTCAGTGGATGACTGATCGAAGATACTTGAGTTTATCGCCGAGCGATGCAGCAGTCAGGCTGGATTTAATCCATAGTGTTCATGGTCTGGAACACGCAGAGAATTACTTCAACAGCATATCTATTCGGTTAAAAACTTCTAATGTTTATGGTGCTCTTCTCGGTTGTTATGTGCGAGAGAAATCACTTGAGAAAGCTGAAGCCATCATGCAAGAAATGAGAAAGATGGGCATTGCTACTACGTCCTTTGCTTACAATGTGCTAATTAACCTCTACGCTCAGATTGGGCAGCATGATAAGATTGATCTACTGATTGAAGAAATGAAAACGAAGAGAATACCTCAAGACATTTACTCAATTAGAAATCTTTGTGCAGCTTATGTTGCTAAGGCAGATATTTCTGGTATGGAAAAGATTCTCAAAAGGATCGAGGAGGATTCTGAACTCAAAGCTGATTGGACAATTTATTCAATTGCTGCTAATGGGTATCTTACAGCTGGGTTGGAAACAGAGGCTCTTTCCATGCTAAAGAAAACGGAGGAGAAAGTTCGGCCTAATACAAATAAATTCGCATTTAAGTTTCTTCTGTCCCTTTATGAACGAACAGGTCATAAGAACGAAGTTTACAGGGTTTGGAATACCTTCAAACCATTAACTAAAGAAACATGTGTTCCATATGCTTTAATGATCACATCTCTAGCCAAGCTTGATGATATTGAAGGGGCTGAAAGAATATTCCAGGAGTGGGAATCAAAGTGTACTGTATACGACTTTCGGGTGTTGAATCGACTTC
TGGTTGCTTATTGCAGGAAAGGTCTTTTGGATAAGGCGGAATCAGTTGTTAACCAAGCAGTGGTTGAAAGAACTCCATTCCGCAGCACGTGGAGCATATTAGCCACGGGATATGCAGAATACGGACACATGAGCAAAGCCGTTGAGATGTTGAAGAAAGCTATTTTAGTCGGAAGGCAAAATTGGAAACCAAAGCAGGGTGACATTTTGGAAGCTTGTCTGGATTACTTGGAAAAACAAGGAGATGCAGAAACAATGGATGAAATAGTACGATTATGCAAAAGCTCAGGTACAGTAATGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGCAGGGGGTAAACCAGTTATTAGCATTCTTGAACAGATGAAGATGGATGGTTTTGCAGCAGATGAAGAGGTAGACAAAATCCTGGGATCTAAGACTAACTTGTAGTTAGTAAAAAAATATTTAGTTTTTCTTAAATTTTTTGTTCTATGAAATATAGAATGGCATTTTTTACATACATCCTTTAAGCCTC

mRNA sequence

ATGATAAGGAAGCTCCGAAGCTGGAACAACAACCTCATTTCCAATCTCCTTATTCAAACTTCTAAAACCCTTTCCCTCCCCTTCTCTTCCACTCCGCCCCAATTAGCCATTCTCCGCCAAAAAATCATAAACATTCGAGCCCCTAAAATTTCAGTTGTTCCGGTACTGGAAAAGTGGGTTGGCGACGGCAGAGCTATTGGAAAACCGGAACTTCAATATCTTGTTCACCTCATGAAGGACTCTCGCCGTTTCAATCACGCTTTAGAGATATCTCAGTGGATGACTGATCGAAGATACTTGAGTTTATCGCCGAGCGATGCAGCAGTCAGGCTGGATTTAATCCATAGTGTTCATGGTCTGGAACACGCAGAGAATTACTTCAACAGCATATCTATTCGGTTAAAAACTTCTAATGTTTATGGTGCTCTTCTCGGTTGTTATGTGCGAGAGAAATCACTTGAGAAAGCTGAAGCCATCATGCAAGAAATGAGAAAGATGGGCATTGCTACTACGTCCTTTGCTTACAATGTGCTAATTAACCTCTACGCTCAGATTGGGCAGCATGATAAGATTGATCTACTGATTGAAGAAATGAAAACGAAGAGAATACCTCAAGACATTTACTCAATTAGAAATCTTTGTGCAGCTTATGTTGCTAAGGCAGATATTTCTGGTATGGAAAAGATTCTCAAAAGGATCGAGGAGGATTCTGAACTCAAAGCTGATTGGACAATTTATTCAATTGCTGCTAATGGGTATCTTACAGCTGGGTTGGAAACAGAGGCTCTTTCCATGCTAAAGAAAACGGAGGAGAAAGTTCGGCCTAATACAAATAAATTCGCATTTAAGTTTCTTCTGTCCCTTTATGAACGAACAGGTCATAAGAACGAAGTTTACAGGGTTTGGAATACCTTCAAACCATTAACTAAAGAAACATGTGTTCCATATGCTTTAATGATCACATCTCTAGCCAAGCTTGATGATATTGAAGGGGCTGAAAGAATATTCCAGGAGTGGGAATCAAAGTGTACTGTATACGACTTTCGGGTGTTGAATCGACTTCTGGTTGCTTATTGCAGGAAAGGTCTTTTGGATAAGGCGGAATCAGTTGTTAACCAAGCAGTGGTTGAAAGAACTCCATTCCGCAGCACGTGGAGCATATTAGCCACGGGATATGCAGAATACGGACACATGAGCAAAGCCGTTGAGATGTTGAAGAAAGCTATTTTAGTCGGAAGGCAAAATTGGAAACCAAAGCAGGGTGACATTTTGGAAGCTTGTCTGGATTACTTGGAAAAACAAGGAGATGCAGAAACAATGGATGAAATAGTACGATTATGCAAAAGCTCAGGTACAGTAATGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGCAGGGGGTAAACCAGTTATTAGCATTCTTGAACAGATGAAGATGGATGGTTTTGCAGCAGATGAAGAGGTAGACAAAATCCTGGGATCTAAGACTAACTTGTAG

Coding sequence (CDS)

ATGATAAGGAAGCTCCGAAGCTGGAACAACAACCTCATTTCCAATCTCCTTATTCAAACTTCTAAAACCCTTTCCCTCCCCTTCTCTTCCACTCCGCCCCAATTAGCCATTCTCCGCCAAAAAATCATAAACATTCGAGCCCCTAAAATTTCAGTTGTTCCGGTACTGGAAAAGTGGGTTGGCGACGGCAGAGCTATTGGAAAACCGGAACTTCAATATCTTGTTCACCTCATGAAGGACTCTCGCCGTTTCAATCACGCTTTAGAGATATCTCAGTGGATGACTGATCGAAGATACTTGAGTTTATCGCCGAGCGATGCAGCAGTCAGGCTGGATTTAATCCATAGTGTTCATGGTCTGGAACACGCAGAGAATTACTTCAACAGCATATCTATTCGGTTAAAAACTTCTAATGTTTATGGTGCTCTTCTCGGTTGTTATGTGCGAGAGAAATCACTTGAGAAAGCTGAAGCCATCATGCAAGAAATGAGAAAGATGGGCATTGCTACTACGTCCTTTGCTTACAATGTGCTAATTAACCTCTACGCTCAGATTGGGCAGCATGATAAGATTGATCTACTGATTGAAGAAATGAAAACGAAGAGAATACCTCAAGACATTTACTCAATTAGAAATCTTTGTGCAGCTTATGTTGCTAAGGCAGATATTTCTGGTATGGAAAAGATTCTCAAAAGGATCGAGGAGGATTCTGAACTCAAAGCTGATTGGACAATTTATTCAATTGCTGCTAATGGGTATCTTACAGCTGGGTTGGAAACAGAGGCTCTTTCCATGCTAAAGAAAACGGAGGAGAAAGTTCGGCCTAATACAAATAAATTCGCATTTAAGTTTCTTCTGTCCCTTTATGAACGAACAGGTCATAAGAACGAAGTTTACAGGGTTTGGAATACCTTCAAACCATTAACTAAAGAAACATGTGTTCCATATGCTTTAATGATCACATCTCTAGCCAAGCTTGATGATATTGAAGGGGCTGAAAGAATATTCCAGGAGTGGGAATCAAAGTGTACTGTATACGACTTTCGGGTGTTGAATCGACTTCTGGTTGCTTATTGCAGGAAAGGTCTTTTGGATAAGGCGGAATCAGTTGTTAACCAAGCAGTGGTTGAAAGAACTCCATTCCGCAGCACGTGGAGCATATTAGCCACGGGATATGCAGAATACGGACACATGAGCAAAGCCGTTGAGATGTTGAAGAAAGCTATTTTAGTCGGAAGGCAAAATTGGAAACCAAAGCAGGGTGACATTTTGGAAGCTTGTCTGGATTACTTGGAAAAACAAGGAGATGCAGAAACAATGGATGAAATAGTACGATTATGCAAAAGCTCAGGTACAGTAATGAAGGAGATGTACTACAGATTGCTGAGAACTTCCATAGCAGGGGGTAAACCAGTTATTAGCATTCTTGAACAGATGAAGATGGATGGTTTTGCAGCAGATGAAGAGGTAGACAAAATCCTGGGATCTAAGACTAACTTGTAG
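
The protein used as the BLAST query is the conceptual translation of the coding sequence above. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 12 codons of the CDS listed above.
print(translate("ATGATAAGGAAGCTCCGAAGCTGGAACAACAACCTC"))  # MIRKLRSWNNNL
```

Residues 8 to 11 of this prefix (WNNN) agree with the start of the query line in the B9RW08_RICCO alignment further down, which begins at query position 8.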
BLAST of CSPI07G04560 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 334.7 bits (857), Expect = 1.7e-90
Identity = 166/417 (39.81%), Positives = 253/417 (60.67%), Query Frame = 1

Query: 38  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR 97
           L++++     P  S++ VL+ W+  G  +   EL  ++ +++   RF+HAL+IS WM++ 
Sbjct: 40  LQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEH 99

Query: 98  RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAE 157
           R   +S  D A+RLDLI  V GL  AE +F +I +  +  ++YGALL CY  +K L KAE
Sbjct: 100 RVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAE 159

Query: 158 AIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAY 217
            + QEM+++G       YNV++NLY + G++  ++ L+ EM+ + +  DI+++     AY
Sbjct: 160 QVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAY 219

Query: 218 VAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNT 277
              +D+ GMEK L R E D  L  DW  Y+  ANGY+ AGL  +AL ML+K+E+ V    
Sbjct: 220 SVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQK 279

Query: 278 NKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQ 337
            K A++ L+S Y   G K EVYR+W+ +K L       Y  +I++L K+DDIE  E+I +
Sbjct: 280 RKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIME 339

Query: 338 EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGYAEYG 397
           EWE+  +++D R+ + L+  YC+KG+++KAE VVN  V + R    STW  LA GY   G
Sbjct: 340 EWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAG 399

Query: 398 HMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 454
            M KAVE  K+AI V +  W+P Q  +L +C+DYLE Q D E + +I+RL    G +
Sbjct: 400 KMEKAVEKWKRAIEVSKPGWRPHQ-VVLMSCVDYLEGQRDMEGLRKILRLLSERGHI 455
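
The percentages in an HSP summary line are simply the match counts divided by the alignment length. A minimal sketch that recomputes them from the text follows; the regular expression and the helper name `check_hsp_stats` are assumptions made for illustration, and the pattern accepts any label beginning with "Pos".

```python
import re

# Matches fragments like "Identity = 166/417 (39.81%)".
HSP_STATS = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_hsp_stats(line):
    """Return {label: (recomputed_percent, reported_percent)} for one summary line."""
    results = {}
    for label, hits, length, reported in HSP_STATS.findall(line):
        computed = round(100 * int(hits) / int(length), 2)
        results[label] = (computed, float(reported))
    return results

line = "Identity = 166/417 (39.81%), Positives = 253/417 (60.67%), Query Frame = 1"
for label, (computed, reported) in check_hsp_stats(line).items():
    print(label, computed, reported)
```

For the PP166_ARATH hit, 166 identical and 253 similar positions over 417 aligned columns reproduce the reported 39.81% and 60.67%.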

BLAST of CSPI07G04560 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 2.9e-74
Identity = 157/466 (33.69%), Positives = 261/466 (56.01%), Query Frame = 1

Query: 38  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR 97
           L  KI  +  PK SV P L+ WV  G+ +   EL  +VH ++  +RF HALE+S+WM + 
Sbjct: 27  LYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRKRFLHALEVSKWMNET 86

Query: 98  RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAE 157
                SP++ AV LDLI  V+G   AE YF ++  + K    YGALL CYVR++++EK+ 
Sbjct: 87  GVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGALLNCYVRQQNVEKSL 146

Query: 158 AIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAY 217
              ++M++MG  T+S  YN ++ LY  IGQH+K+  ++EEMK + +  D YS R    A+
Sbjct: 147 LHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEENVAPDNYSYRICINAF 206

Query: 218 VAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNT 277
            A  D+  +   L+ +E   ++  DW  Y++AA  Y+  G    A+ +LK +E ++    
Sbjct: 207 GAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRAVELLKMSENRLEKKD 266

Query: 278 NKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETC-VPYALMITSLAKLDDIEGAERIF 337
            +  +  L++LY R G K EV R+W+  K + K      Y  ++ SL K+D +  AE + 
Sbjct: 267 GE-GYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQSLVKIDALVEAEEVL 326

Query: 338 QEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQ-AVVERTPFRSTWSILATGYAEY 397
            EW+S    YDFRV N ++  Y  K + +KAE+++   A   +     +W ++AT YAE 
Sbjct: 327 TEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATTPESWELVATAYAEK 386

Query: 398 GHMSKAVEMLKKA--ILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 457
           G +  A + +K A  + VG + W+P    ++ + L ++  +G  + ++  V   ++   V
Sbjct: 387 GTLENAFKCMKTALGVEVGSRKWRPGL-TLVTSVLSWVGDEGSLKEVESFVASLRNCIGV 446

Query: 458 MKEMYYRLLRTSI-AGGKPVISILEQMKMDGFAADEEVDKILGSKT 499
            K+MY+ L++  I  GG+ + ++L++MK D    DEE   IL +++
Sbjct: 447 NKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILSTRS 490

BLAST of CSPI07G04560 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 252.3 bits (643), Expect = 1.1e-65
Identity = 145/427 (33.96%), Positives = 234/427 (54.80%), Query Frame = 1

Query: 40  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR-R 99
           +KI  +  P++    VL +W   GR + K EL  +V  ++  +R N ALE+  WM +R  
Sbjct: 71  KKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGE 130

Query: 100 YLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEA 159
              LS SDAA++LDLI  V G+  AE +F  +    K   VYG+LL  YVR KS EKAEA
Sbjct: 131 RFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEA 190

Query: 160 IMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYV 219
           ++  MR  G A     +NV++ LY  + ++DK+D ++ EMK K I  DIYS     ++  
Sbjct: 191 LLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCG 250

Query: 220 AKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTN 279
           +   +  ME + ++++ D  +  +WT +S  A  Y+  G   +A   L+K E ++    N
Sbjct: 251 SLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARI-TGRN 310

Query: 280 KFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKE-TCVPYALMITSLAKLDDIEGAERIFQ 339
           +  + +LLSLY   G+K E+YRVW+ +K +      + Y  +++SL ++ DIEGAE++++
Sbjct: 311 RIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYE 370

Query: 340 EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAV-VERTPFRSTWSILATGYAEYG 399
           EW    + YD R+ N L+ AY +   L+ AE + +  V +   P  STW ILA G+    
Sbjct: 371 EWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKR 430

Query: 400 HMSKAVEMLKKAILV-GRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMK 459
            +S+A+  L+ A    G  NW+PK   +L       E++ D  + + ++ L + SG +  
Sbjct: 431 CISEALTCLRNAFSAEGSSNWRPKV-LMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 490

Query: 460 EMYYRLL 463
           + Y  L+
Sbjct: 491 KSYLALI 495

BLAST of CSPI07G04560 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 3.2e-65
Identity = 141/454 (31.06%), Positives = 255/454 (56.17%), Query Frame = 1

Query: 49  KISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAA 108
           ++ V   L +++   + + K E+   +  +++   +  AL++S+ M +R  ++ + SD A
Sbjct: 36  EVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVMEERG-MNKTVSDQA 95

Query: 109 VRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGI 168
           + LDL+     +   ENYF  +    KT   YG+LL CY +E   EKAE ++ +M+++ I
Sbjct: 96  IHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNI 155

Query: 169 ATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEK 228
             +S +YN L+ LY + G+ +K+  +I+E+K + +  D Y+      A  A  DISG+E+
Sbjct: 156 TPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVWMRALAATNDISGVER 215

Query: 229 ILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNK--FAFKFLL 288
           +++ +  D  +  DWT YS  A+ Y+ AGL  +A   L++ E K   NT +   A++FL+
Sbjct: 216 VIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELEMK---NTQRDFTAYQFLI 275

Query: 289 SLYERTGHKNEVYRVWNTFK-PLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTV 348
           +LY R G   EVYR+W + +  + K + V Y  MI  L KL+D+ GAE +F+EW++ C+ 
Sbjct: 276 TLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGAETLFKEWQANCST 335

Query: 349 YDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRS-TWSILATGYAEYGHMSKAVEM 408
           YD R++N L+ AY ++GL+ KA  +  +A        + TW I    Y + G M++A+E 
Sbjct: 336 YDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDYYVKSGDMARALEC 395

Query: 409 LKKAILVGRQN---WKPKQGDILEACLDYLEKQGDAETMDEIVRLCKS-SGTVMKEMYYR 468
           + KA+ +G+ +   W P   + + A + Y E++ D    + ++ + K+ +  +  E++  
Sbjct: 396 MSKAVSIGKGDGGKWLPSP-ETVRALMSYFEQKKDVNGAENLLEILKNGTDNIGAEIFEP 455

Query: 469 LLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL 495
           L+RT  A GK   ++  ++KM+    +E   K+L
Sbjct: 456 LIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of CSPI07G04560 vs. Swiss-Prot
Match: PP400_ARATH (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 250.8 bits (639), Expect = 3.2e-65
Identity = 147/452 (32.52%), Positives = 251/452 (55.53%), Query Frame = 1

Query: 40  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRY 99
           ++I+    P+ SV  +L++ +  G A+   EL+ +   +  S R++ AL++ +WM +++ 
Sbjct: 42  KEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWMENQKD 101

Query: 100 LSLSPSDAAVRLDLIHSVHGLEHAENYF-----NSISIRLKTSNVYGALLGCYVREKSLE 159
           +  S  D A+RLDLI   HGL+  E YF     +S+S+R+  S  Y  LL  YV+ K ++
Sbjct: 102 IEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKS-AYLPLLRAYVKNKMVK 161

Query: 160 KAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLC 219
           +AEA+M+++  +G   T   +N ++ LY   GQ++K+ +++  MK  +IP+++ S     
Sbjct: 162 EAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYNLWM 221

Query: 220 AAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVR 279
            A    + ++ +E + K +  D  ++  W+     AN Y+ +G + +A  +L+   EK+ 
Sbjct: 222 NACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDA-EKML 281

Query: 280 PNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLT-KETCVPYALMITSLAKLDDIEGAE 339
             +N+  + FL++LY   G+K  V R+W   K +  + +CV Y  +++SL K  D+E AE
Sbjct: 282 NRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEEAE 341

Query: 340 RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVER--TPFRSTWSILATG 399
           R+F EWE++C  YD RV N LL AY R G + KAES ++  V+ER  TP   TW IL  G
Sbjct: 342 RVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAES-LHGCVLERGGTPNYKTWEILMEG 401

Query: 400 YAEYGHMSKAVEMLKKA-ILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSS 459
           + +  +M KA++ + +  +L+ R +W+P   +I+ A  +Y EK+   E     VR     
Sbjct: 402 WVKCENMEKAIDAMHQVFVLMRRCHWRPSH-NIVMAIAEYFEKEEKIEEATAYVRDLHRL 461

Query: 460 GTVMKEMYYRLLRTSIAGGKPVISILEQMKMD 483
           G     +Y  LLR      +P   I E MK+D
Sbjct: 462 GLASLPLYRLLLRMHEHAKRPAYDIYEMMKLD 489

BLAST of CSPI07G04560 vs. TrEMBL
Match: A0A0A0K2B7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G047430 PE=4 SV=1)

HSP 1 Score: 806.2 bits (2081), Expect = 2.2e-230
Identity = 406/407 (99.75%), Positives = 406/407 (99.75%), Query Frame = 1

Query: 94  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL 153
           MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL
Sbjct: 1   MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL 60

Query: 154 EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNL 213
           EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTK IPQDIYSIRNL
Sbjct: 61  EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKGIPQDIYSIRNL 120

Query: 214 CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV 273
           CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV
Sbjct: 121 CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV 180

Query: 274 RPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAE 333
           RPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAE
Sbjct: 181 RPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAE 240

Query: 334 RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA 393
           RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA
Sbjct: 241 RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA 300

Query: 394 EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 453
           EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV
Sbjct: 301 EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 360

Query: 454 MKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL 501
           MKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL
Sbjct: 361 MKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL 407

BLAST of CSPI07G04560 vs. TrEMBL
Match: B9RW08_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1175330 PE=4 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 2.9e-145
Identity = 270/492 (54.88%), Positives = 348/492 (70.73%), Query Frame = 1

Query: 8   WNNN---LISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGR 67
           WN N     S+L   T        SS+  + + L  +I  +R PK S++PVL +WV +G 
Sbjct: 20  WNPNPQFYFSSLFFSTRSQTQ---SSSSSESSKLYDRIQIVRDPKESIIPVLNQWVSEGH 79

Query: 68  AIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAE 127
            +GK  LQ LVHLMK  +RFNHALE+S WMTD RY SLSPSD AVRL+LI+ V+G  HAE
Sbjct: 80  TVGKALLQSLVHLMKGYKRFNHALEMSHWMTDCRYFSLSPSDVAVRLELIYRVYGSAHAE 139

Query: 128 NYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQ 187
            YF  IS +LK+ NVYGALL  YVRE S++KAEA++QEMR+ GIAT+SF YN++INLYAQ
Sbjct: 140 MYFEKISDKLKSGNVYGALLSGYVRENSVQKAEAVLQEMREKGIATSSFPYNIMINLYAQ 199

Query: 188 IGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWT 247
            G  +KID+L EEM+   IPQD Y++RNL AAYVA +DISGME+IL ++E   +L   W 
Sbjct: 200 NGAFEKIDILKEEMERNGIPQDKYTMRNLMAAYVAASDISGMERILNQLETHPQLGHGWQ 259

Query: 248 IYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNT 307
            YS+AA+GYL  GL  +AL ML+K EE +       AF +LL+LY +TG K+E+YRVWN+
Sbjct: 260 AYSVAASGYLKVGLIEKALKMLRKMEETMPIGKKTSAFNYLLTLYAKTGRKDELYRVWNS 319

Query: 308 FKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLL 367
           +KPL +     +  MI+SL K+DDIEGAERIF+EWES+C +YDFRVLN+LL+AYCRKGL 
Sbjct: 320 YKPLAEVKETQFCCMISSLEKVDDIEGAERIFEEWESQCMMYDFRVLNKLLLAYCRKGLY 379

Query: 368 DKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDIL 427
            KAE+   +A   RTP+ STW  +A  Y     MSKAVEMLKKAI V R+ WKP     L
Sbjct: 380 TKAEAAFKKAAEGRTPYASTWITMAMSYIGQNQMSKAVEMLKKAISVSRKGWKPNP-ITL 439

Query: 428 EACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGF 487
             CLDYLE+QGD E ++EIV+  KS+ ++ +++Y+RL+RT  A G PV  ++++MKMD  
Sbjct: 440 TTCLDYLEEQGDVEGIEEIVKSLKSTESLTRDIYHRLVRTYTAAGIPVTKVIDKMKMDNI 499

Query: 488 AADEEVDKILGS 497
           AADEE  KIL S
Sbjct: 500 AADEETHKILES 507

BLAST of CSPI07G04560 vs. TrEMBL
Match: E0CTC4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g03880 PE=4 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 3.0e-142
Identity = 261/443 (58.92%), Positives = 332/443 (74.94%), Query Frame = 1

Query: 34  QLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQW 93
           Q++ L  +I  +R PK S+ P+L +W+ +G+ + KP+LQ LV +MKD RRF+HALEISQW
Sbjct: 20  QISSLYDRIQAVRDPKASISPLLNQWIEEGQTVSKPQLQSLVRIMKDFRRFHHALEISQW 79

Query: 94  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL 153
           MTDRRY +L+PSDAA+RLDLI  VHG   AE+YFN+I   LKTS+ YGALL  YVREKS+
Sbjct: 80  MTDRRYFTLTPSDAAIRLDLISMVHGRVQAESYFNNIPNNLKTSSAYGALLSGYVREKSV 139

Query: 154 EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNL 213
           EKAEA MQ+MR+M  AT+SF YN+LINLY+Q G H KI+ LI+EM++K IP D +++RNL
Sbjct: 140 EKAEATMQKMREMDFATSSFPYNMLINLYSQTGNHGKIEALIQEMQSKAIPCDAFTVRNL 199

Query: 214 CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV 273
             AYVA +DIS MEK L R+EED  +  DW IYS+AA+GYL  GL  +AL MLKK E   
Sbjct: 200 MVAYVAASDISAMEKFLNRMEEDPHISVDWNIYSVAASGYLKVGLIDKALEMLKKIESN- 259

Query: 274 RPNTNKF-AFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGA 333
           RP+  +  AFKFLLSLY RTGHK E+YRVWN +KP + E    Y+ MIT L KLDDIEGA
Sbjct: 260 RPHLERLSAFKFLLSLYARTGHKQELYRVWNLYKP-SYEYPEAYSCMITCLTKLDDIEGA 319

Query: 334 ERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGY 393
           E+IFQEWE +CT+YDFRVLNRLL AYC++ L DKAES+VN+ + ER P+ STW+ILA GY
Sbjct: 320 EKIFQEWECECTMYDFRVLNRLLSAYCKRCLFDKAESLVNKVIEERMPYASTWNILAKGY 379

Query: 394 AEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGT 453
            E   M KAVEMLKKAI VGR+ W+P    IL+AC++YLE QG+ E ++EI RLCK+ G 
Sbjct: 380 VEDKQMPKAVEMLKKAISVGRKGWRP-NSIILDACIEYLEGQGNLEEIEEIARLCKNLGI 439

Query: 454 VMKEMYYRLLRTSIAGGKPVISI 476
              ++++R   +S+   K VI +
Sbjct: 440 PDGDIHHRF--SSLNFNKSVIDL 457

BLAST of CSPI07G04560 vs. TrEMBL
Match: A5AZ95_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032243 PE=4 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 4.3e-133
Identity = 252/443 (56.88%), Positives = 318/443 (71.78%), Query Frame = 1

Query: 34  QLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQW 93
           Q++ L  +I  +R PK S+ P+L +W+ +G+ + KP+LQ LV +MKD RRF+HALEISQW
Sbjct: 20  QISSLYDRIQAVRDPKASISPLLNQWIEEGQTVSKPQLQSLVRIMKDFRRFHHALEISQW 79

Query: 94  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL 153
           MTDRRY +L+PSDAA+RLDLI  V       +              YGALL  YVREKS+
Sbjct: 80  MTDRRYFTLTPSDAAIRLDLISMVPWTXAGXD-------------AYGALLSGYVREKSV 139

Query: 154 EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNL 213
           EKAEA MQ+MR+M  AT+SF YN+LINLY+Q G H KI+ LI+EM+ K IP D +++ NL
Sbjct: 140 EKAEATMQKMREMDFATSSFPYNMLINLYSQTGNHGKIEALIQEMQXKAIPCDAFTVXNL 199

Query: 214 CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV 273
             AYVA +DIS MEK L R+EED  +  DW IYS+AA+GYL  GL  +AL MLKK E   
Sbjct: 200 MVAYVAASDISAMEKXLNRMEEDPHISVDWNIYSVAASGYLKVGLIDKALEMLKKIESN- 259

Query: 274 RPNTNKF-AFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGA 333
           RP+  +  AFK LLSLY RT HK E+YRVWN +KP + E    Y+ MIT L KLDDIEGA
Sbjct: 260 RPHLERXSAFKXLLSLYARTXHKQELYRVWNLYKP-SYEYPEAYSCMITCLTKLDDIEGA 319

Query: 334 ERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGY 393
           E+IFQEWE +CT+YDFRVLNRLL AYC++ L DKAES+VN+ + ER P+ STW+ILA GY
Sbjct: 320 EKIFQEWECECTMYDFRVLNRLLSAYCKRCLFDKAESLVNKVIEERMPYASTWNILAKGY 379

Query: 394 AEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGT 453
            E   M KAVEMLKKAI VGR+ W+P    IL+AC++YLE QG+ E ++EI RLCK+ G 
Sbjct: 380 VEDKQMPKAVEMLKKAISVGRKGWRP-NSIILDACIEYLEGQGNLEEIEEIARLCKNLGI 439

Query: 454 VMKEMYYRLLRTSIAGGKPVISI 476
              ++++RLLRTS AG K V +I
Sbjct: 440 PDGDIHHRLLRTSAAGEKSVSAI 446

BLAST of CSPI07G04560 vs. TrEMBL
Match: A0A067JJJ5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25798 PE=4 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 1.8e-131
Identity = 241/415 (58.07%), Positives = 305/415 (73.49%), Query Frame = 1

Query: 78  MKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTS 137
           MKD RRF HALEIS WMTDRRY +LSPSDAA RL+L+H V+G  HAE YFN +S +LKT 
Sbjct: 1   MKDFRRFKHALEISHWMTDRRYFTLSPSDAASRLNLVHRVYGSAHAEKYFNELSGKLKTF 60

Query: 138 NVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEE 197
           +VYGALL  YV+E  ++KAEAIMQEMR+ G+AT++F YN+LINLY + G  +KID LI E
Sbjct: 61  HVYGALLNVYVQENFVQKAEAIMQEMREKGMATSTFPYNILINLYWKTGDFEKIDALIHE 120

Query: 198 MKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAG 257
           M+   I  D Y++ NL AAY A ++ISGME+IL ++E++  +   W IYS+AA+GYL  G
Sbjct: 121 MERNGIHGDKYTMMNLMAAYAATSNISGMERILNQVEKNPHIDHGWKIYSVAASGYLKVG 180

Query: 258 LETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYA 317
               AL+ML+K EE +       AF FLLS Y +TG K+E+YRVWN +K L     + + 
Sbjct: 181 SIETALTMLRKMEEMMPRQRKTSAFNFLLSHYGQTGKKDELYRVWNKYKSLYGLRTIVFC 240

Query: 318 LMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE 377
            MI SL+KLDDIEGAE+I +EWESKCTVYDFRVLN LLVAYC+KGL ++AE+ V +A   
Sbjct: 241 CMIESLSKLDDIEGAEQILEEWESKCTVYDFRVLNSLLVAYCKKGLFERAEAAVEKAAQG 300

Query: 378 RTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDA 437
           R P+ STW+ILATGYA+   MSKAVEMLK+AILV R+ WKP    IL ACLDYLE QGD 
Sbjct: 301 RKPYASTWTILATGYAQENQMSKAVEMLKRAILVSRKGWKPNP-TILTACLDYLEVQGDV 360

Query: 438 ETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDK 493
           E M+EIV+  KS   + +++Y+RL RT IA GK    +L+QMK D F ADE+++K
Sbjct: 361 EEMEEIVKSLKSLEPLTRDLYHRLKRTYIAAGKSTTDVLDQMKKDNFPADEDMEK 414

BLAST of CSPI07G04560 vs. TAIR10
Match: AT2G20710.1 (AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 334.7 bits (857), Expect = 9.6e-92
Identity = 166/417 (39.81%), Positives = 253/417 (60.67%), Query Frame = 1

Query: 38  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR 97
           L++++     P  S++ VL+ W+  G  +   EL  ++ +++   RF+HAL+IS WM++ 
Sbjct: 40  LQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEH 99

Query: 98  RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAE 157
           R   +S  D A+RLDLI  V GL  AE +F +I +  +  ++YGALL CY  +K L KAE
Sbjct: 100 RVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAE 159

Query: 158 AIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAY 217
            + QEM+++G       YNV++NLY + G++  ++ L+ EM+ + +  DI+++     AY
Sbjct: 160 QVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAY 219

Query: 218 VAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNT 277
              +D+ GMEK L R E D  L  DW  Y+  ANGY+ AGL  +AL ML+K+E+ V    
Sbjct: 220 SVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQK 279

Query: 278 NKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQ 337
            K A++ L+S Y   G K EVYR+W+ +K L       Y  +I++L K+DDIE  E+I +
Sbjct: 280 RKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIME 339

Query: 338 EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVE-RTPFRSTWSILATGYAEYG 397
           EWE+  +++D R+ + L+  YC+KG+++KAE VVN  V + R    STW  LA GY   G
Sbjct: 340 EWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAG 399

Query: 398 HMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 454
            M KAVE  K+AI V +  W+P Q  +L +C+DYLE Q D E + +I+RL    G +
Sbjct: 400 KMEKAVEKWKRAIEVSKPGWRPHQ-VVLMSCVDYLEGQRDMEGLRKILRLLSERGHI 455
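
The AT2G20710 alignment has the same bit score (334.7) here as in the Swiss-Prot search, yet a different E-value (9.6e-92 versus 1.7e-90). In the Karlin-Altschul model the expected number of chance hits is E = N * 2^(-S'), where S' is the bit score and N is the effective search space (query length times database length), so for a fixed score the E-value scales with the size of the database searched. The sketch below inverts that relation; it ignores the edge-effect length corrections that BLAST applies, so the numbers are rough estimates only, and the function name is hypothetical.

```python
def implied_search_space(bit_score, e_value):
    """Solve E = N * 2**(-S') for the effective search space N."""
    return e_value * 2.0 ** bit_score

# Same HSP (bit score 334.7) reported against two databases on this page.
swissprot_space = implied_search_space(334.7, 1.7e-90)
tair10_space = implied_search_space(334.7, 9.6e-92)

# Ratio of search spaces equals the ratio of E-values at equal bit score.
ratio = swissprot_space / tair10_space
print(f"Swiss-Prot search space is about {ratio:.1f}x the TAIR10 one")
```

The ratio of roughly 18 is consistent with Swiss-Prot being a much larger database than the single-species TAIR10 protein set; the alignment itself is identical in both reports.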

BLAST of CSPI07G04560 vs. TAIR10
Match: AT4G21705.1 (AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 280.8 bits (717), Expect = 1.6e-75
Identity = 157/466 (33.69%), Positives = 261/466 (56.01%), Query Frame = 1

Query: 38  LRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR 97
           L  KI  +  PK SV P L+ WV  G+ +   EL  +VH ++  +RF HALE+S+WM + 
Sbjct: 27  LYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRKRFLHALEVSKWMNET 86

Query: 98  RYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAE 157
                SP++ AV LDLI  V+G   AE YF ++  + K    YGALL CYVR++++EK+ 
Sbjct: 87  GVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGALLNCYVRQQNVEKSL 146

Query: 158 AIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAY 217
              ++M++MG  T+S  YN ++ LY  IGQH+K+  ++EEMK + +  D YS R    A+
Sbjct: 147 LHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEENVAPDNYSYRICINAF 206

Query: 218 VAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNT 277
            A  D+  +   L+ +E   ++  DW  Y++AA  Y+  G    A+ +LK +E ++    
Sbjct: 207 GAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRAVELLKMSENRLEKKD 266

Query: 278 NKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETC-VPYALMITSLAKLDDIEGAERIF 337
            +  +  L++LY R G K EV R+W+  K + K      Y  ++ SL K+D +  AE + 
Sbjct: 267 GE-GYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQSLVKIDALVEAEEVL 326

Query: 338 QEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQ-AVVERTPFRSTWSILATGYAEY 397
            EW+S    YDFRV N ++  Y  K + +KAE+++   A   +     +W ++AT YAE 
Sbjct: 327 TEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATTPESWELVATAYAEK 386

Query: 398 GHMSKAVEMLKKA--ILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 457
           G +  A + +K A  + VG + W+P    ++ + L ++  +G  + ++  V   ++   V
Sbjct: 387 GTLENAFKCMKTALGVEVGSRKWRPGL-TLVTSVLSWVGDEGSLKEVESFVASLRNCIGV 446

Query: 458 MKEMYYRLLRTSI-AGGKPVISILEQMKMDGFAADEEVDKILGSKT 499
            K+MY+ L++  I  GG+ + ++L++MK D    DEE   IL +++
Sbjct: 447 NKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILSTRS 490

BLAST of CSPI07G04560 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 252.3 bits (643), Expect = 6.3e-67
Identity = 145/427 (33.96%), Positives = 234/427 (54.80%), Query Frame = 1

Query: 40  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDR-R 99
           +KI  +  P++    VL +W   GR + K EL  +V  ++  +R N ALE+  WM +R  
Sbjct: 71  KKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGE 130

Query: 100 YLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEA 159
              LS SDAA++LDLI  V G+  AE +F  +    K   VYG+LL  YVR KS EKAEA
Sbjct: 131 RFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEA 190

Query: 160 IMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYV 219
           ++  MR  G A     +NV++ LY  + ++DK+D ++ EMK K I  DIYS     ++  
Sbjct: 191 LLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCG 250

Query: 220 AKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTN 279
           +   +  ME + ++++ D  +  +WT +S  A  Y+  G   +A   L+K E ++    N
Sbjct: 251 SLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARI-TGRN 310

Query: 280 KFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKE-TCVPYALMITSLAKLDDIEGAERIFQ 339
           +  + +LLSLY   G+K E+YRVW+ +K +      + Y  +++SL ++ DIEGAE++++
Sbjct: 311 RIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYE 370

Query: 340 EWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAV-VERTPFRSTWSILATGYAEYG 399
           EW    + YD R+ N L+ AY +   L+ AE + +  V +   P  STW ILA G+    
Sbjct: 371 EWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKR 430

Query: 400 HMSKAVEMLKKAILV-GRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMK 459
            +S+A+  L+ A    G  NW+PK   +L       E++ D  + + ++ L + SG +  
Sbjct: 431 CISEALTCLRNAFSAEGSSNWRPKV-LMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 490

Query: 460 EMYYRLL 463
           + Y  L+
Sbjct: 491 KSYLALI 495

BLAST of CSPI07G04560 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 250.8 bits (639), Expect = 1.8e-66
Identity = 141/454 (31.06%), Positives = 255/454 (56.17%), Query Frame = 1

Query: 49  KISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAA 108
           ++ V   L +++   + + K E+   +  +++   +  AL++S+ M +R  ++ + SD A
Sbjct: 36  EVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVMEERG-MNKTVSDQA 95

Query: 109 VRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGI 168
           + LDL+     +   ENYF  +    KT   YG+LL CY +E   EKAE ++ +M+++ I
Sbjct: 96  IHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNI 155

Query: 169 ATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEK 228
             +S +YN L+ LY + G+ +K+  +I+E+K + +  D Y+      A  A  DISG+E+
Sbjct: 156 TPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVWMRALAATNDISGVER 215

Query: 229 ILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNK--FAFKFLL 288
           +++ +  D  +  DWT YS  A+ Y+ AGL  +A   L++ E K   NT +   A++FL+
Sbjct: 216 VIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELEMK---NTQRDFTAYQFLI 275

Query: 289 SLYERTGHKNEVYRVWNTFK-PLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTV 348
           +LY R G   EVYR+W + +  + K + V Y  MI  L KL+D+ GAE +F+EW++ C+ 
Sbjct: 276 TLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGAETLFKEWQANCST 335

Query: 349 YDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRS-TWSILATGYAEYGHMSKAVEM 408
           YD R++N L+ AY ++GL+ KA  +  +A        + TW I    Y + G M++A+E 
Sbjct: 336 YDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDYYVKSGDMARALEC 395

Query: 409 LKKAILVGRQN---WKPKQGDILEACLDYLEKQGDAETMDEIVRLCKS-SGTVMKEMYYR 468
           + KA+ +G+ +   W P   + + A + Y E++ D    + ++ + K+ +  +  E++  
Sbjct: 396 MSKAVSIGKGDGGKWLPSP-ETVRALMSYFEQKKDVNGAENLLEILKNGTDNIGAEIFEP 455

Query: 469 LLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL 495
           L+RT  A GK   ++  ++KM+    +E   K+L
Sbjct: 456 LIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484
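
Each HSP summary line reports identities and positives as counts over the alignment length, followed by the percentage in parentheses. The following is a minimal sketch of how such a line could be parsed and the percentages rechecked. The helper names `parse_hsp_stats` and `percent` are hypothetical and are not part of the tooling that produced this page. The regular expression accepts the misspelling "Postives" as well, since that spelling appears in some report output.

```python
import re

# Hypothetical helper for illustration only; not part of the database's tooling.
# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" also accepts the misspelled form "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Summary line of the AT1G60770.1 hit shown above.
line = "Identity = 141/454 (31.06%), Positives = 255/454 (56.17%), Query Frame = 1"
stats = parse_hsp_stats(line)
identity_check = percent(stats["identities"], stats["alignment_length"])
positive_check = percent(stats["positives"], stats["alignment_length"])
print(identity_check, positive_check)
```

For this hit the recomputed values agree with the reported ones: 141 of 454 aligned positions are identical and 255 of 454 are scored as positive.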

BLAST of CSPI07G04560 vs. TAIR10
Match: AT5G27460.1 (AT5G27460.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 250.8 bits (639), Expect = 1.8e-66
Identity = 147/452 (32.52%), Positives = 251/452 (55.53%), Query Frame = 1

Query: 40  QKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRY 99
           ++I+    P+ SV  +L++ +  G A+   EL+ +   +  S R++ AL++ +WM +++ 
Sbjct: 42  KEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWMENQKD 101

Query: 100 LSLSPSDAAVRLDLIHSVHGLEHAENYF-----NSISIRLKTSNVYGALLGCYVREKSLE 159
           +  S  D A+RLDLI   HGL+  E YF     +S+S+R+  S  Y  LL  YV+ K ++
Sbjct: 102 IEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKS-AYLPLLRAYVKNKMVK 161

Query: 160 KAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLC 219
           +AEA+M+++  +G   T   +N ++ LY   GQ++K+ +++  MK  +IP+++ S     
Sbjct: 162 EAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYNLWM 221

Query: 220 AAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVR 279
            A    + ++ +E + K +  D  ++  W+     AN Y+ +G + +A  +L+   EK+ 
Sbjct: 222 NACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDA-EKML 281

Query: 280 PNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLT-KETCVPYALMITSLAKLDDIEGAE 339
             +N+  + FL++LY   G+K  V R+W   K +  + +CV Y  +++SL K  D+E AE
Sbjct: 282 NRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEEAE 341

Query: 340 RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVER--TPFRSTWSILATG 399
           R+F EWE++C  YD RV N LL AY R G + KAES ++  V+ER  TP   TW IL  G
Sbjct: 342 RVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAES-LHGCVLERGGTPNYKTWEILMEG 401

Query: 400 YAEYGHMSKAVEMLKKA-ILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSS 459
           + +  +M KA++ + +  +L+ R +W+P   +I+ A  +Y EK+   E     VR     
Sbjct: 402 WVKCENMEKAIDAMHQVFVLMRRCHWRPSH-NIVMAIAEYFEKEEKIEEATAYVRDLHRL 461

Query: 460 GTVMKEMYYRLLRTSIAGGKPVISILEQMKMD 483
           G     +Y  LLR      +P   I E MK+D
Sbjct: 462 GLASLPLYRLLLRMHEHAKRPAYDIYEMMKLD 489

BLAST of CSPI07G04560 vs. NCBI nr
Match: gi|778730131|ref|XP_011659707.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g74580 [Cucumis sativus])

HSP 1 Score: 986.9 bits (2550), Expect = 1.3e-284
Identity = 498/500 (99.60%), Positives = 499/500 (99.80%), Query Frame = 1

Query: 1   MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWV 60
           MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKI+NIRAPKISVVPVLEKWV
Sbjct: 1   MIRKLRSWNNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIVNIRAPKISVVPVLEKWV 60

Query: 61  GDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGL 120
           GDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGL
Sbjct: 61  GDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGL 120

Query: 121 EHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLIN 180
           EHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLIN
Sbjct: 121 EHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLIN 180

Query: 181 LYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELK 240
           LYAQIGQHDKIDLLIEEMKTK IPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELK
Sbjct: 181 LYAQIGQHDKIDLLIEEMKTKGIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELK 240

Query: 241 ADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR 300
           ADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR
Sbjct: 241 ADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYR 300

Query: 301 VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCR 360
           VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCR
Sbjct: 301 VWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCR 360

Query: 361 KGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQ 420
           KGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQ
Sbjct: 361 KGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQ 420

Query: 421 GDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMK 480
           GDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMK
Sbjct: 421 GDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMK 480

Query: 481 MDGFAADEEVDKILGSKTNL 501
           MDGFAADEEVDKILGSKTNL
Sbjct: 481 MDGFAADEEVDKILGSKTNL 500

BLAST of CSPI07G04560 vs. NCBI nr
Match: gi|659111027|ref|XP_008455539.1| (PREDICTED: uncharacterized protein LOC103495690 [Cucumis melo])

HSP 1 Score: 889.4 bits (2297), Expect = 2.9e-255
Identity = 453/511 (88.65%), Positives = 474/511 (92.76%), Query Frame = 1

Query: 1   MIRKLRSWNNNLISNLLIQT----------SKTLSLPFSSTPP-QLAILRQKIINIRAPK 60
           M+RKLRSWNNNLI NLLIQT          +KTLSLPFSSTPP Q  ILR +II+IR PK
Sbjct: 1   MVRKLRSWNNNLIPNLLIQTFKPQSNSFFCTKTLSLPFSSTPPPQSTILRNQIIDIRDPK 60

Query: 61  ISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAV 120
           ISV+PVLEKWVGDGRAI KPELQYLV+L K+ RRFNHALEISQWMTDRRYLSLS SDAA+
Sbjct: 61  ISVIPVLEKWVGDGRAIWKPELQYLVYLTKNFRRFNHALEISQWMTDRRYLSLSASDAAL 120

Query: 121 RLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIA 180
           RLDLIHSVHGLEHAENYFNSIS RLKTSNVYG+LL CYVREKS+EKAEAIMQEMRKMGIA
Sbjct: 121 RLDLIHSVHGLEHAENYFNSISTRLKTSNVYGSLLSCYVREKSVEKAEAIMQEMRKMGIA 180

Query: 181 TTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKI 240
            TSFAYNVLINLYAQIGQH+KIDLLIEEMK K IPQDIYSIRNLCAAYVAK DISGMEKI
Sbjct: 181 NTSFAYNVLINLYAQIGQHEKIDLLIEEMKMKGIPQDIYSIRNLCAAYVAKTDISGMEKI 240

Query: 241 LKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLY 300
           LKRIEEDSE KADW IYSIAANGYLTAGLETEALSML K E+K+RPNTNK AF+FLLSLY
Sbjct: 241 LKRIEEDSEFKADWRIYSIAANGYLTAGLETEALSMLNKMEKKIRPNTNKLAFEFLLSLY 300

Query: 301 ERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFR 360
           ERTGHKNEVYRVWNTFKPLT++T VPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFR
Sbjct: 301 ERTGHKNEVYRVWNTFKPLTRQTRVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFR 360

Query: 361 VLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAI 420
           VLNRLLVAYCRKGLLDKAE VVNQAVV RTPF STWS+LATGYAEYGHMSKAVEMLKKA+
Sbjct: 361 VLNRLLVAYCRKGLLDKAEWVVNQAVVGRTPFASTWSLLATGYAEYGHMSKAVEMLKKAM 420

Query: 421 LVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGG 480
           LVGRQNWKPK+ DILEACLDYLEKQGDAETM+EIVRLCKSSGTV KEMYYRLLRTSIAGG
Sbjct: 421 LVGRQNWKPKRRDILEACLDYLEKQGDAETMEEIVRLCKSSGTVAKEMYYRLLRTSIAGG 480

Query: 481 KPVISILEQMKMDGFAADEEVDKILGSKTNL 501
           KPV+SILEQMKMDGFAADEEVDKILGSKTNL
Sbjct: 481 KPVLSILEQMKMDGFAADEEVDKILGSKTNL 511

BLAST of CSPI07G04560 vs. NCBI nr
Match: gi|700188375|gb|KGN43608.1| (hypothetical protein Csa_7G047430 [Cucumis sativus])

HSP 1 Score: 806.2 bits (2081), Expect = 3.2e-230
Identity = 406/407 (99.75%), Positives = 406/407 (99.75%), Query Frame = 1

Query: 94  MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL 153
           MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL
Sbjct: 1   MTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYVREKSL 60

Query: 154 EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIYSIRNL 213
           EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTK IPQDIYSIRNL
Sbjct: 61  EKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKGIPQDIYSIRNL 120

Query: 214 CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV 273
           CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV
Sbjct: 121 CAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKKTEEKV 180

Query: 274 RPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAE 333
           RPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAE
Sbjct: 181 RPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDDIEGAE 240

Query: 334 RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA 393
           RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA
Sbjct: 241 RIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSILATGYA 300

Query: 394 EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 453
           EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV
Sbjct: 301 EYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCKSSGTV 360

Query: 454 MKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL 501
           MKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL
Sbjct: 361 MKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKILGSKTNL 407

BLAST of CSPI07G04560 vs. NCBI nr
Match: gi|658004682|ref|XP_008337472.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial [Malus domestica])

HSP 1 Score: 562.8 bits (1449), Expect = 6.2e-157
Identity = 284/466 (60.94%), Positives = 355/466 (76.18%), Query Frame = 1

Query: 29  SSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGKPELQYLVHLMKDSRRFNHAL 88
           SS+P     L  +I  IR PK SV+PVLE+WV +G+A+ K +LQ LV L+KD RRFNHAL
Sbjct: 36  SSSPSWSNSLHDRIKVIRDPKASVLPVLEQWVSEGQAVEKQQLQSLVRLLKDFRRFNHAL 95

Query: 89  EISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFNSISIRLKTSNVYGALLGCYV 148
           EISQWMTDRRY  LSPSDAA RL+LIH VHGLEHAENYFN++S  LK+ N YGALL  YV
Sbjct: 96  EISQWMTDRRYFDLSPSDAAARLNLIHRVHGLEHAENYFNNLSKSLKSLNAYGALLCXYV 155

Query: 149 REKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQHDKIDLLIEEMKTKRIPQDIY 208
           +E+S+EKAEA MQ+M+KMG+A TSF YN+LINLY+Q GQ++KI++L++EM+   IP D Y
Sbjct: 156 QERSVEKAEATMQKMKKMGMAKTSFPYNMLINLYSQNGQYEKINILMQEMEENGIPIDKY 215

Query: 209 SIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSIAANGYLTAGLETEALSMLKK 268
           ++RN   AY+A +D  GME IL R+EED  L  DW IYS+AANGYL  GL  +A+SMLK 
Sbjct: 216 TLRNRMMAYIAASDXPGMEAILNRMEEDPNLIVDWKIYSMAANGYLKVGLTEKAISMLKM 275

Query: 269 TEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPLTKETCVPYALMITSLAKLDD 328
            E  + P   K + +FLL+LY  TG+K E+YRVW+T+KP  +   VPY  MI+SLAKLDD
Sbjct: 276 LEG-LMPLQGKKSVEFLLTLYASTGNKEELYRVWDTYKPSNEPVDVPYGCMISSLAKLDD 335

Query: 329 IEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAESVVNQAVVERTPFRSTWSIL 388
           IEGAE IF+EWES+C +YDFRVLNRLLVAYC++GL DKAESVVN+AV  R P+ STW++L
Sbjct: 336 IEGAEGIFEEWESQCKIYDFRVLNRLLVAYCKRGLFDKAESVVNKAVEGRIPYASTWNVL 395

Query: 389 ATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACLDYLEKQGDAETMDEIVRLCK 448
           A GY E   M KAVEMLKKA+ VGR+ W P     L ACLDYLE QGD E ++EI+ L K
Sbjct: 396 AIGYTEKQQMPKAVEMLKKALSVGRRGWVP-HSPTLTACLDYLEGQGDIEGIEEIIXLLK 455

Query: 449 SSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADEEVDKIL 495
           + G + +++Y+RLLR S+A GK V  IL+QMK+DGF ADEE  K++
Sbjct: 456 NLGPLSEDLYHRLLRASVAAGKSVAIILDQMKVDGFTADEEXYKVI 499

BLAST of CSPI07G04560 vs. NCBI nr
Match: gi|694352436|ref|XP_009357845.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 546.6 bits (1407), Expect = 4.6e-152
Identity = 280/486 (57.61%), Positives = 360/486 (74.07%), Query Frame = 1

Query: 9   NNNLISNLLIQTSKTLSLPFSSTPPQLAILRQKIINIRAPKISVVPVLEKWVGDGRAIGK 68
           N  L +++   +S + S   SS+P     L  +I  IR PK SV+PVLE+WV +G+A+ K
Sbjct: 20  NYTLFNSIRPSSSSSSS---SSSPSWSNSLHDRIKVIRDPKASVLPVLEQWVSEGQAVEK 79

Query: 69  PELQYLVHLMKDSRRFNHALEISQWMTDRRYLSLSPSDAAVRLDLIHSVHGLEHAENYFN 128
            +LQ LV L+KD RRFNHALEISQWMTDRRY  LSPSDAA RL+LIH VHGL+HAENYFN
Sbjct: 80  QQLQSLVRLLKDFRRFNHALEISQWMTDRRYFDLSPSDAAARLNLIHRVHGLDHAENYFN 139

Query: 129 SISIRLKTSNVYGALLGCYVREKSLEKAEAIMQEMRKMGIATTSFAYNVLINLYAQIGQH 188
           ++   LK+ N YGALL  YV+E+S+EKAEA MQ+M+KM +A TSF YN+LINLY+Q GQ+
Sbjct: 140 NLPKSLKSLNAYGALLCIYVQERSVEKAEATMQKMKKMDMAKTSFPYNMLINLYSQNGQY 199

Query: 189 DKIDLLIEEMKTKRIPQDIYSIRNLCAAYVAKADISGMEKILKRIEEDSELKADWTIYSI 248
           +KI++L++EM+   IP D Y++RN   AY+A +D+ GME IL R+EED  L  DW I S+
Sbjct: 200 EKINILMQEMEENGIPIDKYTLRNRMMAYIAASDVPGMEAILNRMEEDPNLIVDWKICSM 259

Query: 249 AANGYLTAGLETEALSMLKKTEEKVRPNTNKFAFKFLLSLYERTGHKNEVYRVWNTFKPL 308
           AANGYL  GL  +A+SMLK   E + P   + + +FLL+LY  TG+K E+YRVW+T+KP 
Sbjct: 260 AANGYLKVGLTEKAISMLKML-EGLMPLQGRKSVEFLLTLYASTGNKEELYRVWDTYKPS 319

Query: 309 TKETCVPYALMITSLAKLDDIEGAERIFQEWESKCTVYDFRVLNRLLVAYCRKGLLDKAE 368
            +   VPY  MI++LAKLDDIEGAE IF+EWES+C +YDFRVLNRLLVAYC++GL DKAE
Sbjct: 320 NEPVDVPYGCMISALAKLDDIEGAEGIFEEWESQCKIYDFRVLNRLLVAYCKRGLFDKAE 379

Query: 369 SVVNQAVVERTPFRSTWSILATGYAEYGHMSKAVEMLKKAILVGRQNWKPKQGDILEACL 428
           SVVN+A   R P+ STW++LA GY E   M KAVEMLKKA+ VGR+ W       L ACL
Sbjct: 380 SVVNKATEGRIPYASTWNVLAIGYTEKQQMPKAVEMLKKALSVGRRGW-VLHSPTLTACL 439

Query: 429 DYLEKQGDAETMDEIVRLCKSSGTVMKEMYYRLLRTSIAGGKPVISILEQMKMDGFAADE 488
           DYLE QGD E ++EI+ L K+ G + +++Y+RLLR S+A GK V  IL+QMK+DGF ADE
Sbjct: 440 DYLEGQGDIEGIEEIISLLKNLGPLSEDLYHRLLRASVAAGKSVAIILDQMKVDGFTADE 499

Query: 489 EVDKIL 495
           E  K++
Sbjct: 500 EAYKVI 500

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
PP166_ARATH  1.7e-90  39.81     Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
PP334_ARATH  2.9e-74  33.69     Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
PPR3_ARATH   1.1e-65  33.96     Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN...
PPR86_ARATH  3.2e-65  31.06     Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...
PP400_ARATH  3.2e-65  32.52     Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0K2B7_CUCSA  2.2e-230  99.75     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G047430 PE=4 SV=1
B9RW08_RICCO      2.9e-145  54.88     Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
E0CTC4_VITVI      3.0e-142  58.92     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g03880 PE=4 SV=...
A5AZ95_VITVI      4.3e-133  56.88     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032243 PE=4 SV=1
A0A067JJJ5_JATCU  1.8e-131  58.07     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25798 PE=4 SV=1

Match Name   E-value  Identity  Description
AT2G20710.1  9.6e-92  39.81     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21705.1  1.6e-75  33.69     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1  6.3e-67  33.96     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1  1.8e-66  31.06     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27460.1  1.8e-66  32.52     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity  Description
gi|778730131|ref|XP_011659707.1|  1.3e-284  99.60     PREDICTED: putative pentatricopeptide repeat-containing protein At1g74580 [Cucum...
gi|659111027|ref|XP_008455539.1|  2.9e-255  88.65     PREDICTED: uncharacterized protein LOC103495690 [Cucumis melo]
gi|700188375|gb|KGN43608.1|       3.2e-230  99.75     hypothetical protein Csa_7G047430 [Cucumis sativus]
gi|658004682|ref|XP_008337472.1|  6.2e-157  60.94     PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial ...
gi|694352436|ref|XP_009357845.1|  4.6e-152  57.61     PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI07G04560.1  CSPI07G04560.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 383..407, score: 0.047; coord: 316..342, score: 0.099; coord: 352..372, score: 0
IPR002885  Pentatricopeptide repeat               PFAM      PF13812           PPR_3                coord: 138..182, score: 3.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 139..168, score: 2.8E-6; coord: 174..204, score: 6.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 347..381, score: 7.574; coord: 242..272, score: 5.853; coord: 136..170, score: 9.854; coord: 312..342, score: 6.96; coord: 278..308, score: 5.053; coord: 171..205, score: 9
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10                       coord: 260..437, score: 1.0E-7; coord: 98..259, score: 1.
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like             coord: 173..411, score: 2.31E-5; coord: 337..436, score: 8.3
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED     coord: 35..495, score: 1.5E
None       No IPR available                       PANTHER   PTHR24015:SF704   SUBFAMILY NOT NAMED  coord: 35..495, score: 1.5E
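
The `coord: start..end` entries in the Alignment column give the residue range of each domain hit on the protein. As a minimal sketch, the ranges for one signature can be collected, sorted along the sequence, and converted to lengths. The helper name `parse_coords` is hypothetical and the input string is simply the PS51375 coordinates copied from the table above, with the scores left out.

```python
import re

# Hypothetical helper for illustration only; not part of InterProScan or this database.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) residue ranges found in text, sorted by start."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


# PS51375 (PPR profile) coordinates as listed for this gene.
ps51375 = (
    "coord: 347..381 coord: 242..272 coord: 136..170 "
    "coord: 312..342 coord: 278..308 coord: 171..205"
)

hits = parse_coords(ps51375)
# Ranges are inclusive, so the length is end - start + 1.
lengths = [end - start + 1 for start, end in hits]

for (start, end), length in zip(hits, lengths):
    print("%d..%d\t%d aa" % (start, end, length))
```

Sorted this way, the six profile hits run from residue 136 to residue 381, and each spans 31 or 35 residues.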

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                      Block
CSPI07G04560  Cla002278        Watermelon (97103) v1         cpiwmB604
CSPI07G04560  Csa7G047430      Cucumber (Chinese Long) v2    cpicuB354
CSPI07G04560  MELO3C018741     Melon (DHL92) v3.5.1          cpimeB538
CSPI07G04560  ClCG07G002970    Watermelon (Charleston Gray)  cpiwcgB596
CSPI07G04560  Lsi02G007450     Bottle gourd (USVL1VR-Ls)     cpilsiB521
CSPI07G04560  Lsi11G015470     Bottle gourd (USVL1VR-Ls)     cpilsiB505
CSPI07G04560  MELO3C002597.2   Melon (DHL92) v3.6.1          cpimedB511
CSPI07G04560  MELO3C018741.2   Melon (DHL92) v3.6.1          cpimedB523
CSPI07G04560  CsaV3_7G004370   Cucumber (Chinese Long) v3    cpicucB415
CSPI07G04560  Cla97C07G131300  Watermelon (97103) v2         cpiwmbB577
CSPI07G04560  Bhi05G001611     Wax gourd                     cpiwgoB673
CSPI07G04560  Cucsa.126610     Cucumber (Gy14) v1            cgycpiB176
CSPI07G04560  Cucsa.147710     Cucumber (Gy14) v1            cgycpiB220
CSPI07G04560  CmaCh18G003650   Cucurbita maxima (Rimu)       cmacpiB429
CSPI07G04560  CmaCh19G009840   Cucurbita maxima (Rimu)       cmacpiB533
CSPI07G04560  CmaCh11G018730   Cucurbita maxima (Rimu)       cmacpiB144
CSPI07G04560  CmoCh19G010190   Cucurbita moschata (Rifu)     cmocpiB524
CSPI07G04560  CmoCh18G003440   Cucurbita moschata (Rifu)     cmocpiB419
CSPI07G04560  CmoCh11G019500   Cucurbita moschata (Rifu)     cmocpiB131
CSPI07G04560  Cp4.1LG04g03080  Cucurbita pepo (Zucchini)     cpecpiB692
CSPI07G04560  Cp4.1LG15g07880  Cucurbita pepo (Zucchini)     cpecpiB257
CSPI07G04560  CsGy7G004150     Cucumber (Gy14) v2            cgybcpiB332
CSPI07G04560  CsGy1G012060     Cucumber (Gy14) v2            cgybcpiB044
CSPI07G04560  Carg19118        Silver-seed gourd             carcpiB0755
CSPI07G04560  Carg19233        Silver-seed gourd             carcpiB0911
CSPI07G04560  Carg14631        Silver-seed gourd             carcpiB0176
The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism                   Block
CSPI07G04560  CSPI01G12040  Wild cucumber (PI 183967)  cpicpiB051