CSPI01G12050 (gene) Wild cucumber (PI 183967)

Name: CSPI01G12050
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr1 : 7597739 .. 7600255 (+)
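The location string can be turned into a span length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF convention), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a 'Chr : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1 : 7597739 .. 7600255 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2517 bp on the plus strand of Chr1.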
The following sequences are available for this feature:

Gene sequence (with intron)

CAGCTGAAGAGCTCACGTACTTGGGCAGCGCCTAGTTTCTCCATTTTTTCTCCTCTCTTTTGCCTTTCCAAATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGAGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGGTTTGATTTCTTATTCATATCTTCGTCCAAGTATGATGGTTTTCATTTGCTCTTATGCGAAATGCTTTTGTTGATTGATGCAAATGGGTCCCAACGATTTGGCATTGTTGATTTTTTAAGATGAATAATGTGTTCATCTATGTTCCAGCATGGTTTCTTCTCCTATGAAATTGCTAAGCCGCCAGGATTTTTTATTTGCTTTGTTGAAATAGACAGTATGTGATAATTGGCGTGGAGTTGCATTGTTTCATGGATAAATGTCTTTTCGTAATTAACCAACTTCGAGAAGGGATATGGCTATTCAAATGGTGATGAATAGTCTACACATATGTCTTTCTTCCCAAATTTTGTTCGCCGGTAAAACATTCTGCCTTTTGTAAATTTTGTTGTTTGTTTATTTTCACAAAGTATCATCCATTGAATCTTGTGTACAATGTGCCGCCTCCAAAAAAAAGTAATTAATTTTTTCTTGAACACAATTACTCATGAAACTTCGACATTGGAGTGAGTAACATTGTTTCTAATGTGAACCAAGTTTTTTATTTCCTGTTTTTTTGGTAATATTATCCATATAACAGCTCTAACCAATGAATTAATGAAATTAGTTCGTCAGGAAACAACTCTAGACTCTTAAGGGAGATGGTTTTATTTCTATTTCCTCGTAGATTACATGTCATACTTGATTTGATCTCAAATTCATGTCTGATAACTATTTTTTGTCATTGATAGTTTGTCTTGCTGTTCTGCCAATGCTTTTTTTTTCTACTGTAGAGTTCACTTTCATGGAACTAGCTTGCTTTTCCTCTCAATTCATAACTGGCAAGAATAACGAATACATTGGAGAGCATTTGCTTTCCATGTTGTTCTTTCTGACATATCAAAGAAGCCACATTGTGTAGAGTTCTTGGCTTATTGATTTCATATTAAATTGAACTGTTCATTGTGTTTTACCCAGTATTATTGTTTTGACCCATCTCTTCAAACCCCCTTTCATCTTCAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCCCTTCTTAACTGCTATGCGCATGAAAAGTGTGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAACGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCAT
CAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACTGAATCTGCCTAATTTAACTTTTATTCTAAAAGAACCCTTAAGTACTTAAAGGGATATTTAGTAAGCAGGGTG

mRNA sequence

ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGAGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCCCTTCTTAACTGCTATGCGCATGAAAAGTGTGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAACGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACTGAATCTGCCTAA

Coding sequence (CDS)

ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGAGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCCCTTCTTAACTGCTATGCGCATGAAAAGTGTGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAACGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACTGAATCTGCCTAA
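As a quick sanity check, the start of the CDS can be translated and compared with the query residues shown in the BLAST alignments. The sketch below assumes the standard nuclear genetic code; the function name `translate` is hypothetical, and only the first 60 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the CDS listed above.
cds_start = "ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGAGAATTCGTG"
print(translate(cds_start))
```

The result, MKLLQSLKPINLIAFRREFV, matches residues 1 to 20 of the query in the alignments that start at position 1.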
BLAST of CSPI01G12050 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 1.2e-90
Identity = 169/416 (40.62%), Positives = 267/416 (64.18%), Query Frame = 1

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           D L RR++  GDP+ S+  +LD W+ +G LV+  EL  IIK LR + RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           + R   +S  D+AIR++LI +V GL + E +F+ +P + + Y ++ ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRIS 208
           A    Q++KE+GF    LPYN+M+NLY + G++  ++ LL+EM++  V  D FT + R+ 
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 AYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
           AY+  SD  G+EK + + E++  + LDW  Y   AN Y K GL +K++ ML+KSE ++ N
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV-N 277

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK-EKIFNKGFISMITSLLILDDIKGAER 328
            +K+  A+ V +  Y   GKK+E++R+W+LYK+ +  +N G+IS+I++LL +DDI+  E+
Sbjct: 278 AQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I +EWE     +D+RIP+LL+  YC+ G+MEKAE ++N +V   R     +W  LA GY 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYK 397

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETEKVVNLLREK 441
              ++ +AVE  K A  V        + +L +   +L+G++D+E   K++ LL E+
Sbjct: 398 MAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSER 452
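The identity and positive-match percentages in an HSP header are the match counts divided by the aligned length. A minimal sketch, using the counts reported for the hit above (169 identical and 267 positive positions out of 416 aligned); the function name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the aligned length."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * matches / aligned_length


identity = percent(169, 416)   # identical residues / alignment length
positives = percent(267, 416)  # identical or conservatively substituted
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The aligned length (416) includes gap columns, which is why it can exceed the number of query residues covered (positions 29 to 441, i.e. 413 residues).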

BLAST of CSPI01G12050 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 278.9 bits (712), Expect = 1.0e-73
Identity = 157/407 (38.57%), Positives = 235/407 (57.74%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           M +L+ + P NLIA R  + N    V K  LY +ISP+GDP  SV P L  WV  G+ V 
Sbjct: 1   MNILRRI-PANLIASRYYYTN---RVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVS 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             EL  I+ +LR  KRF HALE+SKWM++      S  + A+ ++LI RV+G    E+YF
Sbjct: 61  VAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           +N+  Q K  + + ALLNCY  ++ V+K+    +K+KEMGF  S L YN +M LY  IG+
Sbjct: 121 ENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQ 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
            E++  +L+EMKE  V  D ++Y I I+A+ A  D   I   +  ME    I +DWN Y 
Sbjct: 181 HEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYA 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           +AA  Y   G  D+++ +LK SE  L   KK G  +N  + LYAR GKK E+ R+W+L K
Sbjct: 241 VAAKFYIDGGDCDRAVELLKMSENRLE--KKDGEGYNHLITLYARLGKKIEVLRLWDLEK 300

Query: 301 K--EKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 360
              ++  N+ +++++ SL+ +D +  AE +  EW++    YD R+PN ++  Y    + E
Sbjct: 301 DVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEE 360

Query: 361 KAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASV 406
           KAE +L ++    +  + ESW  +A+ Y +K  L  A + +K A  V
Sbjct: 361 KAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGV 401

BLAST of CSPI01G12050 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 254.2 bits (648), Expect = 2.7e-66
Identity = 143/420 (34.05%), Positives = 246/420 (58.57%), Query Frame = 1

Query: 31  LYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDK 90
           +Y++IS +  P +    +L+QW   GR + + EL  ++KELR YKR   ALE+  WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 91  -RYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKA 150
              F LS +D AI+++LI +V G+   E++F  +P   K  +V+ +LLN Y   K  +KA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 151 NAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRISA 210
            A +  +++ G+A  PLP+N+MM LY  + E++++D+++ EMK++ +  D ++Y+I +S+
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 211 YAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANV 270
             +      +E + +QM+S+ SI  +W  +   A  Y K+G  +K+   L+K E  +   
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITG- 308

Query: 271 KKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKGAER 330
            +    ++  L LY   G K E++R+W++YK     I N G+ ++++SL+ + DI+GAE+
Sbjct: 309 -RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEK 368

Query: 331 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 390
           +Y+EW   K SYD RIPNLL++AY +   +E AE L + MV +  K S  +W  LA G+ 
Sbjct: 369 VYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHT 428

Query: 391 QKDQLPQAVETLKLAASVCPSRLNYVKEILA-----AFLDGKQDVEETEKVVNLLREKDD 443
           +K  + +A+  L+ A S   S  N+  ++L         + + DV   E V+ LLR+  D
Sbjct: 429 RKRCISEALTCLRNAFSAEGSS-NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD 485

BLAST of CSPI01G12050 vs. Swiss-Prot
Match: PPR61_ARATH (Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis thaliana GN=At1g28020 PE=3 SV=2)

HSP 1 Score: 221.5 bits (563), Expect = 1.9e-56
Identity = 126/345 (36.52%), Positives = 197/345 (57.10%), Query Frame = 1

Query: 34  RISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYF 93
           RI+     N  + P+L+QW  +G  V    +R IIK+LR   +   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAH-EKCVDKANAF 153
            L   D A R++LI  V GLE+ E +F+++P   +   V+ +LLN YA  +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 MQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRISAYAA 213
            QK++++G    P+PYN MM+LY  +   E+++ LL EMK+  V  D  T +  +  Y+A
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 ASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKK 273
             D   +EK + + E    I L+W+  +  A AY +     K++ ML+ +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GFAFNVYLKLYARNGKKDEIHRIWNLYKKE--KIFNKGFISMITSLLILDDIKGAERIYK 333
             A++  +KLY   G ++E+ R+W LYK +  +  N G+ ++I SLL +DDI GAE IYK
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFS 376
            WE+  L +D RIP +L   Y   G+ EKAE L+N   I  R+ +
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTIKDRRMN 384

BLAST of CSPI01G12050 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.3e-52
Identity = 125/437 (28.60%), Positives = 232/437 (53.09%), Query Frame = 1

Query: 27  VKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKW 86
           +++ LY R+   G   + V   L+Q++   + V + E+   IK+LR    +  AL++S+ 
Sbjct: 21  IEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEV 80

Query: 87  MSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCV 146
           M ++R    + +D AI ++L+ +   +   E+YF ++P   K    + +LLNCY  E   
Sbjct: 81  M-EERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLT 140

Query: 147 DKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIR 206
           +KA   + K+KE+    S + YN +M LY + GE E++ ++++E+K   V  D +TY++ 
Sbjct: 141 EKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVW 200

Query: 207 ISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 266
           + A AA +D  G+E+++E+M  +  +  DW  Y   A+ Y   GL  K+   L++ E  +
Sbjct: 201 MRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELE--M 260

Query: 267 ANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKG 326
            N ++   A+   + LY R GK  E++RIW   +    K  N  +++MI  L+ L+D+ G
Sbjct: 261 KNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPG 320

Query: 327 AERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLAS 386
           AE ++KEW+    +YD+RI N+L+ AY + GL++KA  L  +      K + ++W     
Sbjct: 321 AETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMD 380

Query: 387 GYLQKDQLPQAVETLKLAASV--------CPSRLNYVKEILAAFLDGKQDVEETEKVVNL 446
            Y++   + +A+E +  A S+         PS        L ++ + K+DV   E ++ +
Sbjct: 381 YYVKSGDMARALECMSKAVSIGKGDGGKWLPS--PETVRALMSYFEQKKDVNGAENLLEI 440

Query: 447 LREKDDSHPARAHDYIV 454
           L+   D+  A   + ++
Sbjct: 441 LKNGTDNIGAEIFEPLI 452

BLAST of CSPI01G12050 vs. TrEMBL
Match: A0A0A0LV44_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073790 PE=4 SV=1)

HSP 1 Score: 915.2 bits (2364), Expect = 3.1e-263
Identity = 458/461 (99.35%), Positives = 459/461 (99.57%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFR EFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
           FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV
Sbjct: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 461

BLAST of CSPI01G12050 vs. TrEMBL
Match: A0A0A0LSC2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073800 PE=4 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 2.7e-145
Identity = 278/367 (75.75%), Positives = 289/367 (78.75%), Query Frame = 1

Query: 87  MSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCV 146
           MSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF+NMPSQLKR QVHIALLNCYAHEK  
Sbjct: 1   MSDKRYFPLSTADIATRMNLILRVHGLEQVEDYFNNMPSQLKRCQVHIALLNCYAHEKYA 60

Query: 147 DKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIR 206
           DKANA +QKIKEMGFA + LPYNI MNLYHQIGEFERLDS LKE     V +D+FTY+ R
Sbjct: 61  DKANAVLQKIKEMGFAKTSLPYNITMNLYHQIGEFERLDSPLKETD---VDHDQFTYTTR 120

Query: 207 ISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 266
                                      L+WNCYVIAANAYNKVGLIDKSISMLKKSEGLL
Sbjct: 121 ---------------------------LNWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 180

Query: 267 ANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMITSLLILDDIKGAE 326
           ANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYKKEKIFNKGFISMITSL +LDDIKGAE
Sbjct: 181 ANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYKKEKIFNKGFISMITSLFVLDDIKGAE 240

Query: 327 RIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGY 386
           RIYKEWET+KLSYDLRIPNLLVDAYCRA                            ASGY
Sbjct: 241 RIYKEWETQKLSYDLRIPNLLVDAYCRA----------------------------ASGY 300

Query: 387 LQKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQDVEETEKVVNLLREKDDSHPA 446
           LQKDQLPQAVETLK AAS+CPS LNY KEILAAFLDGKQD EETEKVVNLLREKDDSHPA
Sbjct: 301 LQKDQLPQAVETLKKAASLCPSELNYAKEILAAFLDGKQDEEETEKVVNLLREKDDSHPA 309

Query: 447 RAHDYIV 454
           RAHD +V
Sbjct: 361 RAHDILV 309

BLAST of CSPI01G12050 vs. TrEMBL
Match: W9RYV8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_014527 PE=4 SV=1)

HSP 1 Score: 453.4 bits (1165), Expect = 3.4e-124
Identity = 223/435 (51.26%), Positives = 317/435 (72.87%), Query Frame = 1

Query: 15  FRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVY 74
           F     N  +T +++ LYRRISPVG+PN+SV P+L+QWV EG+ V Q EL+ IIKELR++
Sbjct: 43  FYSSISNPTNTNIEERLYRRISPVGNPNVSVVPILEQWVQEGKPVSQIELQRIIKELRIF 102

Query: 75  KRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHI 134
           KRF HALEIS+WMSDKRY  LST D+A R++LI +VHGL++  +YF+++P+ LK ++V+ 
Sbjct: 103 KRFHHALEISQWMSDKRYMHLSTKDVAARLDLISKVHGLDEAVNYFNDIPAALKIFEVYS 162

Query: 135 ALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKER 194
            LLNCYA+EK V+KA   MQ++++M    +P+ YNIMMNLY+   ++++LDSL+ EM+E+
Sbjct: 163 TLLNCYANEKSVEKAEEIMQQMRDMWDHKTPICYNIMMNLYYHTEDYDKLDSLMSEMEEK 222

Query: 195 GVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDK 254
           G+ +D +T+SIR+SAY A SD  G+ KIME++ES P +VLDWN Y +AA+++  VGL+DK
Sbjct: 223 GIPFDTYTFSIRMSAYVAISDVEGVNKIMEKVESYPGLVLDWNFYSVAASSHLNVGLVDK 282

Query: 255 SISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMIT 314
           ++ MLKK E  L  V +K FAF+  LK YA  G +DE++RIW+LYKKEK+FNKG+ SMI 
Sbjct: 283 AVEMLKKLEDRLPTVNRKSFAFDALLKSYALIGNRDELYRIWDLYKKEKLFNKGYKSMIC 342

Query: 315 SLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKF 374
           SLL +DD++GA +IY+EWE+R L +D  IP+L++D Y R GL+EKAE L+++ +    + 
Sbjct: 343 SLLRIDDVEGAAKIYEEWESRGLPFDFLIPDLMIDTYFRKGLLEKAEALVDKAIAKGDES 402

Query: 375 SVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETE 434
           SVE W YL    L+ +Q+ +AVE LK A SV        KE L+    +LDGK D+E  +
Sbjct: 403 SVELWYYLVIRSLEHNQISKAVEALKKAISVWSPGSKPSKETLSVCLEYLDGKVDIEGAQ 462

Query: 435 KVVNLLREKDDSHPA 447
             +NLLR +  S+ A
Sbjct: 463 MSINLLRTEGISYAA 477

BLAST of CSPI01G12050 vs. TrEMBL
Match: A0A068TUA9_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00029177001 PE=4 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 6.8e-117
Identity = 207/415 (49.88%), Positives = 301/415 (72.53%), Query Frame = 1

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           +NLY+RISP+GDP ISV P+LDQW  EGR V +  L  I+KEL+ YKR+KHALE+S+WM+
Sbjct: 40  NNLYKRISPLGDPKISVVPVLDQWAAEGRPVHKQYLESIVKELKAYKRYKHALEVSRWMT 99

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           +KRY PL   D++I++NLI RVHGL++ E+ F+N+ S+LK +  HIALLNCY HEK V+K
Sbjct: 100 EKRYMPLRELDVSIQINLIHRVHGLKEAENCFNNVSSKLKGFNAHIALLNCYVHEKSVEK 159

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRIS 208
           A A MQK++EMG+ANSPLPYN+MMNL++ +G +++LD L+ EM+ RG+ +D FT +IR+S
Sbjct: 160 AEALMQKMREMGYANSPLPYNLMMNLHYGLGNYKKLDDLMNEMEGRGIKFDPFTLTIRLS 219

Query: 209 AYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
           AYAAASD  G++KI + ME +P IV D++ Y + A  Y KVG +DK++ +LKK E L   
Sbjct: 220 AYAAASDAEGVDKIAKMMEIDPLIVPDFSVYAVVAQGYLKVGQLDKALPILKKMEELAVT 279

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK-KEKIFNKGFISMITSLLILDDIKGAER 328
            +K  F ++  LKLYA   ++D++ RIW +YK K+KI NKG+++M++SLL   D+ G E 
Sbjct: 280 TRKGKFPYDFLLKLYAGMQRRDDVLRIWEMYKQKQKINNKGYMTMMSSLLSFGDVGGIED 339

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I+KEWE+R LSYD R+PN+L+ AYCR G +EKAE L+++ +    +    +W Y+A GY+
Sbjct: 340 IFKEWESRGLSYDFRVPNVLIHAYCRNGELEKAEALIDKGLSEGGEPFATTWFYMALGYI 399

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQ--DVEETEKVVNLLREK 441
           + +Q+ +AVE LK A   CP       E L   L+  +  DVE++E+ + L++++
Sbjct: 400 KDNQISKAVEALKKAILKCPPDHKPNTETLNTCLEHMERGDVEKSEEFIKLIKKE 454

BLAST of CSPI01G12050 vs. TrEMBL
Match: M1CWK5_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400029687 PE=4 SV=1)

HSP 1 Score: 415.6 bits (1067), Expect = 7.8e-113
Identity = 201/450 (44.67%), Positives = 313/450 (69.56%), Query Frame = 1

Query: 6   SLKPINLIAFRREFVN-FYSTVV--------KDNLYRRISPVGDPNISVTPLLDQWVLEG 65
           +L+  N I F+R  +N FY T          +D ++ RI+P+G P++S+ P+L+QWV EG
Sbjct: 8   ALRKQNPIHFQRTIINRFYGTTSTKQDKNQKRDWVFARIAPLGSPDVSMVPVLEQWVEEG 67

Query: 66  RLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQV 125
           + V + EL+ IIK L  YKR+KHALE+S WM+D+RY+PL  AD+A R+NL+ +V GLE+V
Sbjct: 68  KTVVKSELQWIIKRLNSYKRYKHALEVSHWMTDRRYYPLQPADVAARINLMNKVKGLEEV 127

Query: 126 EDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYH 185
           E YF+++P  L+R +V+ ALLNCY +EK V+KA A MQ++++MGFA   L YN MMNLY+
Sbjct: 128 EKYFNSIPQMLRRPEVYTALLNCYTNEKSVEKAEAIMQQLRDMGFAKGTLCYNHMMNLYY 187

Query: 186 QIGEFERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDW 245
           + G +E++D+++ EM+++GV +D FT +IR++AYAAA D  G++KI+  MES+  I+L W
Sbjct: 188 KTGTWEKMDNMMNEMEQKGVNFDEFTLTIRLTAYAAAGDSEGMDKILAIMESDKQIILHW 247

Query: 246 NCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIW 305
           + Y IAA  Y KVG ++K++ +L K E ++ N +K   A+N  LKLYA  GKK+E+HR+W
Sbjct: 248 DTYSIAAELYLKVGQVEKALELLSKLESMILNHEKSNGAYNCLLKLYAEAGKKEEVHRVW 307

Query: 306 NLYKKE-KIFNKGFISMITSLLIL-DDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRA 365
           +LYK+  +I NKG+I+++++L+   D  +G E+I++EWE+  LSYD R+P++L+ +YCR 
Sbjct: 308 DLYKQNMRILNKGYITVMSALMKFGDTTEGVEKIFEEWESEALSYDFRVPDVLIRSYCRN 367

Query: 366 GLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVK 425
           GL+EKA+ L+++ +       V +WC+LA+GY+ +D +P+AVE LK A S+CP      K
Sbjct: 368 GLLEKAKALMDKGISKGGVPWVTTWCHLANGYIHEDLVPEAVEALKKAISICPPNYKPSK 427

Query: 426 EILAAFLDGKQDVEETEKVVNLLREKDDSH 445
           E LA  +   +     +   + +R  +  H
Sbjct: 428 ETLATCVKYWEKQGNVDNAADFVRCLEQDH 457

BLAST of CSPI01G12050 vs. TAIR10
Match: AT2G20710.1 (AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 335.1 bits (858), Expect = 6.8e-92
Identity = 169/416 (40.62%), Positives = 267/416 (64.18%), Query Frame = 1

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           D L RR++  GDP+ S+  +LD W+ +G LV+  EL  IIK LR + RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           + R   +S  D+AIR++LI +V GL + E +F+ +P + + Y ++ ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRIS 208
           A    Q++KE+GF    LPYN+M+NLY + G++  ++ LL+EM++  V  D FT + R+ 
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 AYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
           AY+  SD  G+EK + + E++  + LDW  Y   AN Y K GL +K++ ML+KSE ++ N
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV-N 277

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK-EKIFNKGFISMITSLLILDDIKGAER 328
            +K+  A+ V +  Y   GKK+E++R+W+LYK+ +  +N G+IS+I++LL +DDI+  E+
Sbjct: 278 AQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I +EWE     +D+RIP+LL+  YC+ G+MEKAE ++N +V   R     +W  LA GY 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYK 397

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETEKVVNLLREK 441
              ++ +AVE  K A  V        + +L +   +L+G++D+E   K++ LL E+
Sbjct: 398 MAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSER 452

BLAST of CSPI01G12050 vs. TAIR10
Match: AT4G21705.1 (AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 278.9 bits (712), Expect = 5.8e-75
Identity = 157/407 (38.57%), Positives = 235/407 (57.74%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           M +L+ + P NLIA R  + N    V K  LY +ISP+GDP  SV P L  WV  G+ V 
Sbjct: 1   MNILRRI-PANLIASRYYYTN---RVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVS 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             EL  I+ +LR  KRF HALE+SKWM++      S  + A+ ++LI RV+G    E+YF
Sbjct: 61  VAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           +N+  Q K  + + ALLNCY  ++ V+K+    +K+KEMGF  S L YN +M LY  IG+
Sbjct: 121 ENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQ 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
            E++  +L+EMKE  V  D ++Y I I+A+ A  D   I   +  ME    I +DWN Y 
Sbjct: 181 HEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYA 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           +AA  Y   G  D+++ +LK SE  L   KK G  +N  + LYAR GKK E+ R+W+L K
Sbjct: 241 VAAKFYIDGGDCDRAVELLKMSENRLE--KKDGEGYNHLITLYARLGKKIEVLRLWDLEK 300

Query: 301 K--EKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 360
              ++  N+ +++++ SL+ +D +  AE +  EW++    YD R+PN ++  Y    + E
Sbjct: 301 DVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEE 360

Query: 361 KAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASV 406
           KAE +L ++    +  + ESW  +A+ Y +K  L  A + +K A  V
Sbjct: 361 KAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGV 401

BLAST of CSPI01G12050 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 254.2 bits (648), Expect = 1.5e-67
Identity = 143/420 (34.05%), Positives = 246/420 (58.57%), Query Frame = 1

Query: 31  LYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDK 90
           +Y++IS +  P +    +L+QW   GR + + EL  ++KELR YKR   ALE+  WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 91  -RYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKA 150
              F LS +D AI+++LI +V G+   E++F  +P   K  +V+ +LLN Y   K  +KA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 151 NAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRISA 210
            A +  +++ G+A  PLP+N+MM LY  + E++++D+++ EMK++ +  D ++Y+I +S+
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 211 YAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANV 270
             +      +E + +QM+S+ SI  +W  +   A  Y K+G  +K+   L+K E  +   
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITG- 308

Query: 271 KKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKGAER 330
            +    ++  L LY   G K E++R+W++YK     I N G+ ++++SL+ + DI+GAE+
Sbjct: 309 -RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEK 368

Query: 331 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 390
           +Y+EW   K SYD RIPNLL++AY +   +E AE L + MV +  K S  +W  LA G+ 
Sbjct: 369 VYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHT 428

Query: 391 QKDQLPQAVETLKLAASVCPSRLNYVKEILA-----AFLDGKQDVEETEKVVNLLREKDD 443
           +K  + +A+  L+ A S   S  N+  ++L         + + DV   E V+ LLR+  D
Sbjct: 429 RKRCISEALTCLRNAFSAEGSS-NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD 485

BLAST of CSPI01G12050 vs. TAIR10
Match: AT1G28020.1 (AT1G28020.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 221.5 bits (563), Expect = 1.1e-57
Identity = 126/345 (36.52%), Positives = 197/345 (57.10%), Query Frame = 1

Query: 34  RISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYF 93
           RI+     N  + P+L+QW  +G  V    +R IIK+LR   +   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAH-EKCVDKANAF 153
            L   D A R++LI  V GLE+ E +F+++P   +   V+ +LLN YA  +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 MQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRISAYAA 213
            QK++++G    P+PYN MM+LY  +   E+++ LL EMK+  V  D  T +  +  Y+A
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 ASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKK 273
             D   +EK + + E    I L+W+  +  A AY +     K++ ML+ +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GFAFNVYLKLYARNGKKDEIHRIWNLYKKE--KIFNKGFISMITSLLILDDIKGAERIYK 333
             A++  +KLY   G ++E+ R+W LYK +  +  N G+ ++I SLL +DDI GAE IYK
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFS 376
            WE+  L +D RIP +L   Y   G+ EKAE L+N   I  R+ +
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTIKDRRMN 384

BLAST of CSPI01G12050 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 208.8 bits (530), Expect = 7.3e-54
Identity = 125/437 (28.60%), Positives = 232/437 (53.09%), Query Frame = 1

Query: 27  VKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKW 86
           +++ LY R+   G   + V   L+Q++   + V + E+   IK+LR    +  AL++S+ 
Sbjct: 21  IEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEV 80

Query: 87  MSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCV 146
           M ++R    + +D AI ++L+ +   +   E+YF ++P   K    + +LLNCY  E   
Sbjct: 81  M-EERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLT 140

Query: 147 DKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIR 206
           +KA   + K+KE+    S + YN +M LY + GE E++ ++++E+K   V  D +TY++ 
Sbjct: 141 EKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVW 200

Query: 207 ISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 266
           + A AA +D  G+E+++E+M  +  +  DW  Y   A+ Y   GL  K+   L++ E  +
Sbjct: 201 MRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELE--M 260

Query: 267 ANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKG 326
            N ++   A+   + LY R GK  E++RIW   +    K  N  +++MI  L+ L+D+ G
Sbjct: 261 KNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPG 320

Query: 327 AERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLAS 386
           AE ++KEW+    +YD+RI N+L+ AY + GL++KA  L  +      K + ++W     
Sbjct: 321 AETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMD 380

Query: 387 GYLQKDQLPQAVETLKLAASV--------CPSRLNYVKEILAAFLDGKQDVEETEKVVNL 446
            Y++   + +A+E +  A S+         PS        L ++ + K+DV   E ++ +
Sbjct: 381 YYVKSGDMARALECMSKAVSIGKGDGGKWLPS--PETVRALMSYFEQKKDVNGAENLLEI 440

Query: 447 LREKDDSHPARAHDYIV 454
           L+   D+  A   + ++
Sbjct: 441 LKNGTDNIGAEIFEPLI 452
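
The percentages in an HSP summary line are simply the identity and positive counts divided by the alignment length. A minimal sketch of that arithmetic follows; `parse_hsp_stats` is a hypothetical helper (not part of any BLAST toolkit), and it assumes the summary line has the layout shown on this page.

```python
import re

# Hypothetical pattern for the HSP summary line as printed on this page.
# "Pos\w*" is deliberately loose because the label spelling varies between exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos\w* = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    identities, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percentage = matching columns / aligned columns * 100
        "identity_pct": round(100 * identities / ident_len, 2),
        "positive_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 125/437 (28.60%), Positives = 232/437 (53.09%), Query Frame = 1"
    print(parse_hsp_stats(line))
```

Recomputing the values this way is a quick consistency check between the counts and the percentages reported for each hit.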

BLAST of CSPI01G12050 vs. NCBI nr
Match: gi|778658728|ref|XP_011653157.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 915.2 bits (2364), Expect = 4.5e-263
Identity = 458/461 (99.35%), Positives = 459/461 (99.57%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFR EFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
           FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV
Sbjct: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 461

BLAST of CSPI01G12050 vs. NCBI nr
Match: gi|659068040|ref|XP_008442434.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 797.3 bits (2058), Expect = 1.4e-227
Identity = 397/453 (87.64%), Positives = 422/453 (93.16%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+KP NLIA RR  VNFYST VKDNLYRRISPVGDPNISV P+LDQWVLEGR+VQ
Sbjct: 1   MKLLQSVKPTNLIALRRGLVNFYSTFVKDNLYRRISPVGDPNISVIPVLDQWVLEGRVVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELRVYKRFKHALEISKWMSDKRY PLST D+A RMNLILRVHGLEQVEDYF
Sbjct: 61  KEELQKIIKELRVYKRFKHALEISKWMSDKRYLPLSTDDVATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           +NMPSQLKRY VHIALLNCYAHEKCVDKANAF+QKIKEMG+A S LPYNIMMNLYHQIGE
Sbjct: 121 NNMPSQLKRYHVHIALLNCYAHEKCVDKANAFLQKIKEMGYAKSTLPYNIMMNLYHQIGE 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
           FERLDSLLKEMKE+GVYYDRFTYSIR+SAYAAASD  GIEK+MEQMESN SIVLDWNCYV
Sbjct: 181 FERLDSLLKEMKEKGVYYDRFTYSIRLSAYAAASDCTGIEKMMEQMESNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKS+SMLKKSEG LA  KKKG AFNVYLKLYARNGKKDE+HRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSVSMLKKSEGRLATDKKKGHAFNVYLKLYARNGKKDEVHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMI SLLILDDI+GAE IYKEWET+KLSYD+RIPNLLVDAYCRAGL+EKA
Sbjct: 301 KEKIFNKGFISMIRSLLILDDIRGAEDIYKEWETQKLSYDVRIPNLLVDAYCRAGLIEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+NE+V VR KFSVESWCYLASGYLQKDQLPQAVETLK AAS+CPS LNYVKEILAAF
Sbjct: 361 EELVNEIVNVRGKFSVESWCYLASGYLQKDQLPQAVETLKKAASLCPSELNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
            DGKQDVEE EKVVNLLREKD+ +PARAHD +V
Sbjct: 421 SDGKQDVEEAEKVVNLLREKDNLNPARAHDILV 453

BLAST of CSPI01G12050 vs. NCBI nr
Match: gi|659068042|ref|XP_008442448.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 761.9 bits (1966), Expect = 6.4e-217
Identity = 386/467 (82.66%), Positives = 421/467 (90.15%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+K INLIAFRRE VNFYST V D+LYRR+SPVGDPNIS+ P+LDQWV EGR VQ
Sbjct: 1   MKLLQSVKSINLIAFRRELVNFYSTFVNDDLYRRLSPVGDPNISIVPILDQWVSEGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             ELR IIKELRVYKR+KHALE+SKWMSDK   PLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  IVELRLIIKELRVYKRYKHALEMSKWMSDKVCLPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           +NMPS+LKRYQVHIALLNCYAHEKCVDKANA +QKIKE+GFA +P PYNIMMNLYHQIGE
Sbjct: 121 NNMPSKLKRYQVHIALLNCYAHEKCVDKANALLQKIKELGFATTPHPYNIMMNLYHQIGE 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
           FERLDSL+KEMKERG+YYDRFTYSIR+SAYA ASD  GIEKI EQMESN SIVLDWNCYV
Sbjct: 181 FERLDSLMKEMKERGLYYDRFTYSIRLSAYATASDCAGIEKITEQMESNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLA-NVKKKGFAFNVYLKLYARNGKKDEIHRIWNLY 300
           +AA+AY KVGLIDKSISMLKKSE LLA   + K  AFN+YL LYA+NGKKDE +RIWNLY
Sbjct: 241 VAADAYYKVGLIDKSISMLKKSEELLAKTAENKCHAFNIYLTLYAKNGKKDETYRIWNLY 300

Query: 301 KKEKIFNKGFISMITSLLILDDIKGAERIYKE-----WETRKLSYDLRIPNLLVDAYCRA 360
           KKEK+FNKGFISMITSLLILDDIKGA RI +E     WET+KLSYDLRIPNLLVDAYCRA
Sbjct: 301 KKEKVFNKGFISMITSLLILDDIKGARRICEEWETQVWETQKLSYDLRIPNLLVDAYCRA 360

Query: 361 GLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVK 420
           GLME+AEVL+ EM+ VRRKFSV+SWCY+ASGYLQKDQLP+AVETLK+AAS+CPS+L+YVK
Sbjct: 361 GLMEEAEVLVYEMMTVRRKFSVKSWCYIASGYLQKDQLPEAVETLKIAASLCPSKLDYVK 420

Query: 421 EILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           EILAAFLDGKQDVEE EKVVNLLREKD+SHPAR HDY   AIMTESA
Sbjct: 421 EILAAFLDGKQDVEEVEKVVNLLREKDNSHPARGHDY---AIMTESA 464

BLAST of CSPI01G12050 vs. NCBI nr
Match: gi|778658725|ref|XP_011653151.1| (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 640.2 bits (1650), Expect = 2.8e-180
Identity = 322/358 (89.94%), Positives = 333/358 (93.02%), Query Frame = 1

Query: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFRREFVNFYSTVVKD+LYRRISPVGDPNISVTPLLDQWVLE  LVQ
Sbjct: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDSLYRRISPVGDPNISVTPLLDQWVLESGLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGE 180
           +NMPSQLKR QVHIALLNCYAHEK  DKANA +QKIKEMGFA + LPYNI MNLYHQIGE
Sbjct: 121 NNMPSQLKRCQVHIALLNCYAHEKYADKANAVLQKIKEMGFAKTSLPYNITMNLYHQIGE 180

Query: 181 FERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYV 240
           FERLDS    +KE  V +D+FTY+ R+SAYA A DF GIEKIMEQME N SIVLDWNCYV
Sbjct: 181 FERLDS---PLKETDVDHDQFTYTTRLSAYATAFDFTGIEKIMEQMEXNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 359
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWET+KLSYDLRIPNLLVDAYCRAGLME
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETQKLSYDLRIPNLLVDAYCRAGLME 355

BLAST of CSPI01G12050 vs. NCBI nr
Match: gi|700209577|gb|KGN64673.1| (hypothetical protein Csa_1G073800 [Cucumis sativus])

HSP 1 Score: 523.5 bits (1347), Expect = 3.8e-145
Identity = 278/367 (75.75%), Positives = 289/367 (78.75%), Query Frame = 1

Query: 87  MSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCV 146
           MSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF+NMPSQLKR QVHIALLNCYAHEK  
Sbjct: 1   MSDKRYFPLSTADIATRMNLILRVHGLEQVEDYFNNMPSQLKRCQVHIALLNCYAHEKYA 60

Query: 147 DKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIR 206
           DKANA +QKIKEMGFA + LPYNI MNLYHQIGEFERLDS LKE     V +D+FTY+ R
Sbjct: 61  DKANAVLQKIKEMGFAKTSLPYNITMNLYHQIGEFERLDSPLKETD---VDHDQFTYTTR 120

Query: 207 ISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 266
                                      L+WNCYVIAANAYNKVGLIDKSISMLKKSEGLL
Sbjct: 121 ---------------------------LNWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 180

Query: 267 ANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMITSLLILDDIKGAE 326
           ANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYKKEKIFNKGFISMITSL +LDDIKGAE
Sbjct: 181 ANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYKKEKIFNKGFISMITSLFVLDDIKGAE 240

Query: 327 RIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGY 386
           RIYKEWET+KLSYDLRIPNLLVDAYCRA                            ASGY
Sbjct: 241 RIYKEWETQKLSYDLRIPNLLVDAYCRA----------------------------ASGY 300

Query: 387 LQKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQDVEETEKVVNLLREKDDSHPA 446
           LQKDQLPQAVETLK AAS+CPS LNY KEILAAFLDGKQD EETEKVVNLLREKDDSHPA
Sbjct: 301 LQKDQLPQAVETLKKAASLCPSELNYAKEILAAFLDGKQDEEETEKVVNLLREKDDSHPA 309

Query: 447 RAHDYIV 454
           RAHD +V
Sbjct: 361 RAHDILV 309

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
PP166_ARATH   1.2e-90   40.63         Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
PP334_ARATH   1.0e-73   38.57         Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
PPR3_ARATH    2.7e-66   34.05         Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN...
PPR61_ARATH   1.9e-56   36.52         Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis th...
PPR86_ARATH   1.3e-52   28.60         Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...

Match Name         E-value    Identity (%)  Description
A0A0A0LV44_CUCSA   3.1e-263   99.35         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073790 PE=4 SV=1
A0A0A0LSC2_CUCSA   2.7e-145   75.75         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G073800 PE=4 SV=1
W9RYV8_9ROSA       3.4e-124   51.26         Uncharacterized protein OS=Morus notabilis GN=L484_014527 PE=4 SV=1
A0A068TUA9_COFCA   6.8e-117   49.88         Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00029177001 PE=4 SV=1
M1CWK5_SOLTU       7.8e-113   44.67         Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400029687 PE=4 SV=1

Match Name    E-value   Identity (%)  Description
AT2G20710.1   6.8e-92   40.63         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21705.1   5.8e-75   38.57         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1   1.5e-67   34.05         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G28020.1   1.1e-57   36.52         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1   7.3e-54   28.60         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                         E-value    Identity (%)  Description
gi|778658728|ref|XP_011653157.1|   4.5e-263   99.35         PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
gi|659068040|ref|XP_008442434.1|   1.4e-227   87.64         PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
gi|659068042|ref|XP_008442448.1|   6.4e-217   82.66         PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
gi|778658725|ref|XP_011653151.1|   2.8e-180   89.94         PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g...
gi|700209577|gb|KGN64673.1|        3.8e-145   75.75         hypothetical protein Csa_1G073800 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0009058       biosynthetic process
biological_process   GO:0055114       oxidation-reduction process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0005515       protein binding
molecular_function   GO:0005506       iron ion binding
molecular_function   GO:0016491       oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI01G12050.1   CSPI01G12050.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description                         Source     Source Term        Source Description                            Alignment
IPR002885   Pentatricopeptide repeat                PFAM       PF01535            PPR                                           coord: 135..161, score:
IPR002885   Pentatricopeptide repeat                PFAM       PF12854            PPR_1                                         coord: 345..368, score: 1.
IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2                                         coord: 168..211, score: 4.
IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756                                     coord: 168..199, score: 1.7E-4; coord: 345..368, score: 1.
IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR                                           coord: 199..229, score: 7.706; coord: 340..374, score: 8.342; coord: 129..163, score: 6.358; coord: 164..198, score: 9.24; coord: 272..306, score: 6
IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10   -                                             coord: 130..271, score: 1.6E-8; coord: 338..440, score: 1.
IPR011990   Tetratricopeptide-like helical domain   unknown    SSF48452           TPR-like                                      coord: 312..418, score: 5.3
None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED                              coord: 28..441, score: 8.5E
None        No IPR available                        PANTHER    PTHR24015:SF644    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN   coord: 28..441, score: 8.5E
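
The PROFILE (PS51375) rows above list five PPR hits, three of which sit back to back. The sketch below shows one way to compute the length of each hit and the total span they cover. It assumes the `coord` values are 1-based, inclusive residue positions on the protein, which the page does not state explicitly; `span_length` and `merge_ranges` are illustrative helper names.

```python
# PS51375 (PPR profile) coordinates copied from the InterPro rows above.
# Assumption: 1-based, inclusive residue positions on the predicted protein.
ppr_profile_hits = [(199, 229), (340, 374), (129, 163), (164, 198), (272, 306)]


def span_length(start, end):
    """Length of an inclusive residue range."""
    return end - start + 1


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous block when this hit touches or overlaps it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


blocks = merge_ranges(ppr_profile_hits)
covered = sum(span_length(start, end) for start, end in blocks)

if __name__ == "__main__":
    for start, end in sorted(ppr_profile_hits):
        print(f"{start}..{end}: {span_length(start, end)} residues")
    print("merged blocks:", blocks)
    print("residues covered by PPR profile hits:", covered)
```

Treating adjacent hits as one block makes the tandem arrangement of the repeats easier to see than the unsorted list in the table.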

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene           Organism                       Block
CSPI01G12050   Melon (DHL92) v3.6.1           cpimedB023
CSPI01G12050   Melon (DHL92) v3.6.1           cpimedB076
CSPI01G12050   Cucumber (Gy14) v2             cgybcpiB161
CSPI01G12050   Silver-seed gourd              carcpiB0169
CSPI01G12050   Silver-seed gourd              carcpiB0193
CSPI01G12050   Silver-seed gourd              carcpiB0670
CSPI01G12050   Silver-seed gourd              carcpiB0986
CSPI01G12050   Cucumber (Chinese Long) v3     cpicucB039
CSPI01G12050   Cucumber (Chinese Long) v3     cpicucB051
CSPI01G12050   Watermelon (97103) v2          cpiwmbB001
CSPI01G12050   Watermelon (97103) v2          cpiwmbB023
CSPI01G12050   Watermelon (97103) v2          cpiwmbB032
CSPI01G12050   Wax gourd                      cpiwgoB038
CSPI01G12050   Wax gourd                      cpiwgoB023
CSPI01G12050   Wax gourd                      cpiwgoB075
CSPI01G12050   Wild cucumber (PI 183967)      cpicpiB026
CSPI01G12050   Wild cucumber (PI 183967)      cpicpiB038
CSPI01G12050   Cucurbita pepo (Zucchini)      cpecpiB025
CSPI01G12050   Wild cucumber (PI 183967)      cpicpiB051
CSPI01G12050   Cucumber (Gy14) v1             cgycpiB016
CSPI01G12050   Cucumber (Gy14) v1             cgycpiB162
CSPI01G12050   Cucurbita maxima (Rimu)        cmacpiB009
CSPI01G12050   Cucurbita maxima (Rimu)        cmacpiB219
CSPI01G12050   Cucurbita maxima (Rimu)        cmacpiB395
CSPI01G12050   Cucurbita maxima (Rimu)        cmacpiB865
CSPI01G12050   Cucurbita moschata (Rifu)      cmocpiB000
CSPI01G12050   Cucurbita moschata (Rifu)      cmocpiB205
CSPI01G12050   Cucurbita moschata (Rifu)      cmocpiB383
CSPI01G12050   Cucurbita moschata (Rifu)      cmocpiB423
CSPI01G12050   Cucumber (Chinese Long) v2     cpicuB034
CSPI01G12050   Cucumber (Chinese Long) v2     cpicuB044
CSPI01G12050   Cucumber (Chinese Long) v2     cpicuB051
CSPI01G12050   Melon (DHL92) v3.5.1           cpimeB086
CSPI01G12050   Watermelon (Charleston Gray)   cpiwcgB011
CSPI01G12050   Watermelon (Charleston Gray)   cpiwcgB016
CSPI01G12050   Watermelon (97103) v1          cpiwmB043
CSPI01G12050   Watermelon (97103) v1          cpiwmB082
CSPI01G12050   Cucurbita pepo (Zucchini)      cpecpiB475
CSPI01G12050   Cucurbita pepo (Zucchini)      cpecpiB506
CSPI01G12050   Cucurbita pepo (Zucchini)      cpecpiB771
CSPI01G12050   Bottle gourd (USVL1VR-Ls)      cpilsiB018
CSPI01G12050   Bottle gourd (USVL1VR-Ls)      cpilsiB021
CSPI01G12050   Bottle gourd (USVL1VR-Ls)      cpilsiB051