CSPI07G20080 (gene) Wild cucumber (PI 183967)

NameCSPI07G20080
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationChr7 : 17539216 .. 17542225 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAGGTTTATAAACACAATTGAGCTTCACTTTCCCCAATATATATATTAAGGGATTGTAATTCAAATTTTACCGCGCAGCGAAACTCCCTCTCTGTATCTTCAGTCGTTTTTCATGGTAATCCATGGCAAGCCTTTCTTCTATTGCCAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCATCTCCTTTATCTCAACCGTTTACGTATCCCCGATTCCCCCTTTCAGGCATTTCATCAAACATTTAGTCTGCATTCTTTCTTCGCTCGTCAATTCTCAGCTCTTCCATCTTTTTCTCAAAAACTTGGCGACCCATTTCTGTTTGACACAGGAAGATTCCAAAACTATCGCCAGAGTGACGCGTGTAATGCCCGATTCATCGAATTGTTCAAACGGGTCGCTCTTTTGCAATCGGAAGTGGAGGCTGTTGCTGCATTGGACGAGTTTGATGTCAAGGCAGATTTGGATTTGGTTTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCCCTTCTAGCATTCAAATGGGGTGAGAAAGGGGGAGCCATTGATGAAGAGATTTGTAATTTGATGATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTCTTTGATCAGAGAATTGCACGGATCCTTGCTGAATTCGATGCAAGCGATGCTTGTCATGATTGATAGGTGAAGTTTATTTTCGCATATCTTCATTTATTCTGACACTTGATTGTTGATATTCTGATATGCCATTTGTAACTATTGTCAAAGTCGTTTTGCATTTTGACTTTATTCGATAGCTTAGTTTATATGCTTAAGTGTAGAAAGCTAAGACGTATACTACATGACGGCATGCTATATTTTAAAAGTCTAGGAATGACACGACAAGGAAACTTTTATCAAAAGATCTTTTTTTAATGCATTCTGTTAATTGGTTTCTTACTGATTTATGTAGTAAATGAGAAAATGAAGGGAGATTACATCCATAGTTATTTTTTCGATTTATATGCTGCTAACAAGGAAGTTCCTGGTTTAAGGTATGCATATGCAAATGAGGCAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAGTTCAGATTGACACCGGATCAAGAGGCTTTTCACGTGCTTCTCAATTCTCTCTGTAAATATGGGAACATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGGAACAGAAAGCTTTAATATTATTCTCAACGGTTGGTGCAATGTAACTGTTGATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATGTCTAAATGTTGCATTTTACCAGATTCAACTTCTTATACCCACATGATTTCCTGTTTTTCGAAGAATGGGAACCTTTTCGACTCGCTTAGATTCTATGATCAGATGAAGAAAAGGGATTGGATTCCAAGCGTCGAAGTCTATAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAATGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAGTGGGCTTGCGGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGGCGGGAAAGCTGGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCAGTCCGACGATCGAAACCTACCATTCTTTTATTCAGGCTGCAGATTCTAAAATGAGCTTTGAACTTCTTAAGCGGATGAGACAAGATGGTTTGGGTCCTACAGAGGCTACCTTTCTTATCATGTTTAATAAGTCATTTGAATTAGAAGAACCGGAGTATGCATTGAATGTGTGGGTAGAAATGAAGCGGTACGAGGTATTTCCGAGTTGTGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATGTGGTCACTTAAAAAAGGCCAGGGAAATATATGACGAAATGATATTACATGGATTTATCGCACATCCAAAGATTAAAACGCTTCTGAAGGAACCAGATTTAGGTAGCATTGACGAAGCAAGGCAGCAAGTGAGACACAACAACAAAGGTAAGTTCATTCCTCATAGGAAAGGGAGAACGATGAGGTGGAAATCACATAAACAACGATCTAAAGGGGCTGCATCATTTGAATAGGTAATTTATCTGTATACGTGTTACTCGAATAAATTGAATCATATTCTATGTTCTGCTAGTATTGTATAGGTCCTATAGGAGATGAGCTAGATATCAATCTGCTCATTCTGCTCGATACTCCTTTCACAATCTTTCAATCAGCACCCTTTAGTCCATAATTTTATGCTTGTGTAGAAAAAAATCATTAGAGGTTCATGGGTGCCTTCATGTGAGGGTTACAATTCAGGTTTTCGATTAGATCTCGTCAGGATGTGAACACAGTTCCACTTAAGTTAGAAATATACAAAATCCTTGGTATAAAAATAAGGAGAACTTCTTGTTAGGAATAGGATGAATCCTTGTCCTAACAAATTAATTATTCACTCTTCCATTGAATGCAGGATGACTTAATAATCTTGCAGCCCATATAAAAGTTAGATAGATTTCTGATTGAATGATGGGGGAGGTATTTTTTAAGATTGAGATTTAAATCTCTGATTTACTATTTGAACTATGTTTAAGTTGAGTAAGATCTTACATATTTGCTTTTTGGATGAACTGTTGACTGTTTGTCTTTTATCTGAATCCTTTGTCTGATAAGGAACAAGAACTCATTGTGGAAGCAGGTGAAAAAGCATTCCTTCCTTCGCTTTTGCACAAAGCTCGAAGAGAACACTCGCATGGGCTAGAGAAAAGATACGACATCGTTTTCCTTTTTTTCTTTTTCCTCTTTTGAAGGTATAAGCTGTTTGGCCTGGCAGTTGGATGATCGAGAAAGTTTAATTCTCAGTTGTCCTTTTTGTAGCTTCATTTGTTGGAATTGGCCACACTATAAAAGGTTATAAATTCTCACCATAAGGGCAACCTTTTCAAGCATGGACACAGTTTTAAGGATTTTTCTTTTCATTGGAAAGAT

mRNA sequence

ATGGCAAGCCTTTCTTCTATTGCCAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCATCTCCTTTATCTCAACCGTTTACGTATCCCCGATTCCCCCTTTCAGGCATTTCATCAAACATTTAGTCTGCATTCTTTCTTCGCTCGTCAATTCTCAGCTCTTCCATCTTTTTCTCAAAAACTTGGCGACCCATTTCTGTTTGACACAGGAAGATTCCAAAACTATCGCCAGAGTGACGCGTGTAATGCCCGATTCATCGAATTGTTCAAACGGGTCGCTCTTTTGCAATCGGAAGTGGAGGCTGTTGCTGCATTGGACGAGTTTGATGTCAAGGCAGATTTGGATTTGGTTTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCCCTTCTAGCATTCAAATGGGGTGAGAAAGGGGGAGCCATTGATGAAGAGATTTGTAATTTGATGATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTCTTTGATCAGAGAATTGCACGGATCCTTGCTGAATTCGATGCAAGCGATGCTTGTCATGATTGATAGGTATGCATATGCAAATGAGGCAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAGTTCAGATTGACACCGGATCAAGAGGCTTTTCACGTGCTTCTCAATTCTCTCTGTAAATATGGGAACATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGGAACAGAAAGCTTTAATATTATTCTCAACGGTTGGTGCAATGTAACTGTTGATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATGTCTAAATGTTGCATTTTACCAGATTCAACTTCTTATACCCACATGATTTCCTGTTTTTCGAAGAATGGGAACCTTTTCGACTCGCTTAGATTCTATGATCAGATGAAGAAAAGGGATTGGATTCCAAGCGTCGAAGTCTATAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAATGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAGTGGGCTTGCGGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGGCGGGAAAGCTGGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCAGTCCGACGATCGAAACCTACCATTCTTTTATTCAGGCTGCAGATTCTAAAATGAGCTTTGAACTTCTTAAGCGGATGAGACAAGATGGTTTGGGTCCTACAGAGGCTACCTTTCTTATCATGTTTAATAAGTCATTTGAATTAGAAGAACCGGAGTATGCATTGAATGTGTGGGTAGAAATGAAGCGGTACGAGGTATTTCCGAGTTGTGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATGTGGTCACTTAAAAAAGGCCAGGGAAATATATGACGAAATGATATTACATGGATTTATCGCACATCCAAAGATTAAAACGCTTCTGAAGGAACCAGATTTAGGTAGCATTGACGAAGCAAGGCAGCAAGTGAGACACAACAACAAAGGTAAGTTCATTCCTCATAGGAAAGGGAGAACGATGAGGTGGAAATCACATAAACAACGATCTAAAGGGGCTGCATCATTTGAATAG

Coding sequence (CDS)

ATGGCAAGCCTTTCTTCTATTGCCAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCATCTCCTTTATCTCAACCGTTTACGTATCCCCGATTCCCCCTTTCAGGCATTTCATCAAACATTTAGTCTGCATTCTTTCTTCGCTCGTCAATTCTCAGCTCTTCCATCTTTTTCTCAAAAACTTGGCGACCCATTTCTGTTTGACACAGGAAGATTCCAAAACTATCGCCAGAGTGACGCGTGTAATGCCCGATTCATCGAATTGTTCAAACGGGTCGCTCTTTTGCAATCGGAAGTGGAGGCTGTTGCTGCATTGGACGAGTTTGATGTCAAGGCAGATTTGGATTTGGTTTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCCCTTCTAGCATTCAAATGGGGTGAGAAAGGGGGAGCCATTGATGAAGAGATTTGTAATTTGATGATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTCTTTGATCAGAGAATTGCACGGATCCTTGCTGAATTCGATGCAAGCGATGCTTGTCATGATTGATAGGTATGCATATGCAAATGAGGCAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAGTTCAGATTGACACCGGATCAAGAGGCTTTTCACGTGCTTCTCAATTCTCTCTGTAAATATGGGAACATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGGAACAGAAAGCTTTAATATTATTCTCAACGGTTGGTGCAATGTAACTGTTGATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATGTCTAAATGTTGCATTTTACCAGATTCAACTTCTTATACCCACATGATTTCCTGTTTTTCGAAGAATGGGAACCTTTTCGACTCGCTTAGATTCTATGATCAGATGAAGAAAAGGGATTGGATTCCAAGCGTCGAAGTCTATAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAATGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAGTGGGCTTGCGGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGGCGGGAAAGCTGGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCAGTCCGACGATCGAAACCTACCATTCTTTTATTCAGGCTGCAGATTCTAAAATGAGCTTTGAACTTCTTAAGCGGATGAGACAAGATGGTTTGGGTCCTACAGAGGCTACCTTTCTTATCATGTTTAATAAGTCATTTGAATTAGAAGAACCGGAGTATGCATTGAATGTGTGGGTAGAAATGAAGCGGTACGAGGTATTTCCGAGTTGTGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATGTGGTCACTTAAAAAAGGCCAGGGAAATATATGACGAAATGATATTACATGGATTTATCGCACATCCAAAGATTAAAACGCTTCTGAAGGAACCAGATTTAGGTAGCATTGACGAAGCAAGGCAGCAAGTGAGACACAACAACAAAGGTAAGTTCATTCCTCATAGGAAAGGGAGAACGATGAGGTGGAAATCACATAAACAACGATCTAAAGGGGCTGCATCATTTGAATAG
BLAST of CSPI07G20080 vs. Swiss-Prot
Match: PP137_ARATH (Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidopsis thaliana GN=At1g80880 PE=2 SV=1)

HSP 1 Score: 494.2 bits (1271), Expect = 1.8e-138
Identity = 239/477 (50.10%), Postives = 337/477 (70.65%), Query Frame = 1

Query: 37  AFHQTFSLHSFFARQFSALPSF--SQKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVA 96
           AFH+   +HS   +  S LP F  S +     + +T    N           I+L ++V+
Sbjct: 47  AFHRAGHVHS---QVLSYLPHFASSNRFSTKTISETFDI-NLTALAPLEKGLIDLIRQVS 106

Query: 97  LLQSEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMI 156
            L+SE +A+A+L++     + D  YS IW LRD+W+ + LAFKWGEK G  D++ C+LMI
Sbjct: 107 ELESEADAMASLEDSSFDLNHDSFYSLIWELRDEWRLAFLAFKWGEKRGCDDQKSCDLMI 166

Query: 157 WVLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTP 216
           WVLGNH+KF+ AW LIR++     ++ +AM +M+DRYA AN+ S+AI+TF +M+KF+ TP
Sbjct: 167 WVLGNHQKFNIAWCLIRDMFNVSKDTRKAMFLMMDRYAAANDTSQAIRTFDIMDKFKHTP 226

Query: 217 DQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIW 276
             EAF  LL +LC++G+IE+AEEFM  +KKLFP+  E FN+ILNGWCN+  DV EAKRIW
Sbjct: 227 YDEAFQGLLCALCRHGHIEKAEEFMLASKKLFPVDVEGFNVILNGWCNIWTDVTEAKRIW 286

Query: 277 REMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRE 336
           REM   CI P+  SY+HMISCFSK GNLFDSLR YD+MKKR   P +EVYNSL YVLTRE
Sbjct: 287 REMGNYCITPNKDSYSHMISCFSKVGNLFDSLRLYDEMKKRGLAPGIEVYNSLVYVLTRE 346

Query: 337 NCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETY 396
           +CF+EA+K+++K+ E GL+PDS TYNS+I PLCEAGKLD A++VL  M  +N+SPT++T+
Sbjct: 347 DCFDEAMKLMKKLNEEGLKPDSVTYNSMIRPLCEAGKLDVARNVLATMISENLSPTVDTF 406

Query: 397 HSFIQAADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVF 456
           H+F++A + + + E+L +M+   LGPTE TFL++  K F+ ++PE AL +W EM R+E+ 
Sbjct: 407 HAFLEAVNFEKTLEVLGQMKISDLGPTEETFLLILGKLFKGKQPENALKIWAEMDRFEIV 466

Query: 457 PSCEHYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQ 512
            +   Y   IQGL +CG L+KAREIY EM   GF+ +P ++ LL+E  +  + ++++
Sbjct: 467 ANPALYLATIQGLLSCGWLEKAREIYSEMKSKGFVGNPMLQKLLEEQKVKGVRKSKR 519

BLAST of CSPI07G20080 vs. Swiss-Prot
Match: PP383_ARATH (Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidopsis thaliana GN=At5g15010 PE=2 SV=2)

HSP 1 Score: 275.0 bits (702), Expect = 1.8e-72
Identity = 150/401 (37.41%), Postives = 232/401 (57.86%), Query Frame = 1

Query: 106 LDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEI--CNLMIWVLGNHKKF 165
           L+E DVK   +LV   +  +R+DW+++   F W  K       +   + MI +LG  +KF
Sbjct: 118 LEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKF 177

Query: 166 STAWSLIRELHG---SLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH 225
            TAW+LI E+     SL+NS Q +L+MI +Y   ++  KAI TFH  ++F+L    + F 
Sbjct: 178 DTAWTLIDEMRKFSPSLVNS-QTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQ 237

Query: 226 VLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKC 285
            LL++LC+Y N+ +A   +F NK  +P   +SFNI+LNGWCNV     EA+R+W EM   
Sbjct: 238 SLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNV 297

Query: 286 CILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEA 345
            +  D  SY+ MISC+SK G+L   L+ +D+MKK    P  +VYN++ + L + +  +EA
Sbjct: 298 GVKHDVVSYSSMISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEA 357

Query: 346 LKILEKI-KEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQ 405
             +++ + +E G+ P+  TYNSLI PLC+A K +EAK V   M E  + PTI TYH+F++
Sbjct: 358 RNLMKTMEEEKGIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMR 417

Query: 406 -AADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCE 465
                +  FELL +MR+ G  PT  T++++  K     + +  L +W EMK   V P   
Sbjct: 418 ILRTGEEVFELLAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLS 477

Query: 466 HYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTLLK 500
            Y V+I GL   G +++A   Y EM   G   +  ++ +++
Sbjct: 478 SYIVMIHGLFLNGKIEEAYGYYKEMKDKGMRPNENVEDMIQ 517

BLAST of CSPI07G20080 vs. Swiss-Prot
Match: PP275_ARATH (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.5e-44
Identity = 106/371 (28.57%), Postives = 189/371 (50.94%), Query Frame = 1

Query: 136 FKWGEK--GGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSL--LNSMQAMLVMIDRY 195
           F W  K  G     E+C  M+ +L   ++F   W LI E+  +   L   +  +V++ R+
Sbjct: 118 FLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRF 177

Query: 196 AYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTE 255
           A AN   KA++    M K+ L PD+  F  LL++LCK G+++EA +     ++ FP    
Sbjct: 178 ASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPPNLR 237

Query: 256 SFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQ 315
            F  +L GWC     + EAK +  +M +  + PD   +T+++S ++  G + D+    + 
Sbjct: 238 YFTSLLYGWCR-EGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMND 297

Query: 316 MKKRDWIPSVEVYNSLAYVLTR-ENCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAG 375
           M+KR + P+V  Y  L   L R E   +EA+++  +++  G   D  TY +LIS  C+ G
Sbjct: 298 MRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWG 357

Query: 376 KLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSF----ELLKRMRQDGLGPTEATFL 435
            +D+   VL  M +  + P+  TY   + A + K  F    EL+++M++ G  P    + 
Sbjct: 358 MIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYN 417

Query: 436 IMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIYDEMILH 495
           ++   + +L E + A+ +W EM+   + P  + + ++I G  + G L +A   + EM+  
Sbjct: 418 VVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSR 477

Query: 496 GFIAHPKIKTL 498
           G  + P+  TL
Sbjct: 478 GIFSAPQYGTL 487

BLAST of CSPI07G20080 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 2.9e-43
Identity = 101/356 (28.37%), Postives = 183/356 (51.40%), Query Frame = 1

Query: 148 EICNLMIWVLGNHKKFSTAWSLIRELH--GSLLNSMQAMLVMIDRYAYANEASKAIKTFH 207
           E+   M+ +L   ++F   W LI E+      L   +  +V++ R+A A+   KAI+   
Sbjct: 148 EVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLD 207

Query: 208 MMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTV 267
            M KF   PD+  F  LL++LCK+G++++A +     +  FP+    F  +L GWC V  
Sbjct: 208 EMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVG- 267

Query: 268 DVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYN 327
            + EAK +  +M++    PD   YT+++S ++  G + D+      M++R + P+   Y 
Sbjct: 268 KMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYT 327

Query: 328 SLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTED 387
            L   L + +   EA+K+  +++      D  TY +L+S  C+ GK+D+   VL  M + 
Sbjct: 328 VLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKK 387

Query: 388 NISPTIETYHSFIQAADSKMSF----ELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYA 447
            + P+  TY   + A + K SF    EL+++MRQ    P    + ++   + +L E + A
Sbjct: 388 GLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEA 447

Query: 448 LNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTL 498
           + +W EM+   + P  + + ++I GLA+ G L +A + + EM+  G  +  +  TL
Sbjct: 448 VRLWNEMEENGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTL 502

BLAST of CSPI07G20080 vs. Swiss-Prot
Match: PP112_ARATH (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 8.8e-40
Identity = 115/394 (29.19%), Postives = 194/394 (49.24%), Query Frame = 1

Query: 98  SEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGE--KGGAIDEEICNLMIW 157
           S+VE +  L+E  VK    L+   +  L +    +L  FKW E  KG        N +I 
Sbjct: 79  SKVETL--LNEASVKLSPALIEEVLKKLSNAGVLALSVFKWAENQKGFKHTTSNYNALIE 138

Query: 158 VLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPD 217
            LG  K+F   WSL+ ++    L S +   ++  RYA A +  +AI  FH ME+F    +
Sbjct: 139 SLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKME 198

Query: 218 QEAFHVLLNSLCKYGNIEEAEE-FMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIW 277
              F+ +L++L K  N+ +A++ F  + KK F    +S+ I+L GW    +++     + 
Sbjct: 199 SSDFNRMLDTLSKSRNVGDAQKVFDKMKKKRFEPDIKSYTILLEGW-GQELNLLRVDEVN 258

Query: 278 REMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRE 337
           REM      PD  +Y  +I+   K     +++RF+++M++R+  PS  ++ SL   L  E
Sbjct: 259 REMKDEGFEPDVVAYGIIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSE 318

Query: 338 NCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETY 397
              N+AL+  E+ K  G   ++ TYN+L+   C + ++++A   +  M    + P   TY
Sbjct: 319 KKLNDALEFFERSKSSGFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTY 378

Query: 398 ----HSFIQAADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKR 457
               H  I+   SK ++E+ + M  +   PT +T+ IM       E  + A+ +W EMK 
Sbjct: 379 DIILHHLIRMQRSKEAYEVYQTMSCE---PTVSTYEIMVRMFCNKERLDMAIKIWDEMKG 438

Query: 458 YEVFPSCEHYSVLIQGLATCGHLKKAREIYDEMI 485
             V P    +S LI  L     L +A E ++EM+
Sbjct: 439 KGVLPGMHMFSSLITALCHENKLDEACEYFNEML 466

BLAST of CSPI07G20080 vs. TrEMBL
Match: A0A0A0K678_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431960 PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 542/546 (99.27%), Postives = 543/546 (99.45%), Query Frame = 1

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ
Sbjct: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAALDEFDVKADLDLVYS 120
           KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALL SEVEAVAALDEFDVKADLDLVYS
Sbjct: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSSLLAFKWGEK GAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
           MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF
Sbjct: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360
           NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN
Sbjct: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360

Query: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420
           SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP
Sbjct: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIY 480
           TE TFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARE+Y
Sbjct: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540

Query: 541 GAASFE 547
           GAASFE
Sbjct: 541 GAASFE 546

BLAST of CSPI07G20080 vs. TrEMBL
Match: U5GND9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s03510g PE=4 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 4.4e-163
Identity = 298/544 (54.78%), Postives = 379/544 (69.67%), Query Frame = 1

Query: 3   SLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQ----------AFHQTFSLHSFFARQF 62
           SL +IARRL   H      LLY      P SP             FH+T S+ +     F
Sbjct: 2   SLLTIARRLQISHSRLLFPLLYSITYPHPPSPSNNSNYPIFFSVEFHRTLSIPTRSPHHF 61

Query: 63  SALPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAALDEFDV 122
           S   SFS Q L   F       Q +   +      ++  K  A L SE EA+A+LDE  +
Sbjct: 62  STSQSFSTQYLNVSFELIQQGIQTH---EPLQMGLLQSLKMAAHLPSEAEAMASLDESGI 121

Query: 123 KADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFSTAWSLIR 182
           +A+ +LVYS IW LR++W+ + LAFKWG+K G +DE+ C LM+WVLGNH+KF+TAW LIR
Sbjct: 122 RANQNLVYSVIWELREEWRLAFLAFKWGDKWGCVDEKACELMVWVLGNHRKFNTAWILIR 181

Query: 183 ELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGN 242
           +LH SL+++ +AML+MIDRYA AN   KAI  F +M+KFR+TPD+EAF+ LLN LCK GN
Sbjct: 182 DLHRSLMSTRKAMLIMIDRYAAANVPGKAIYAFRIMDKFRMTPDEEAFYFLLNVLCKNGN 241

Query: 243 IEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTH 302
           IEEAEEFM VNKK FPL  E FNIILNGWCN+ VDVFEAKRIWREMSK CI PD+T+YTH
Sbjct: 242 IEEAEEFMLVNKKFFPLEVEGFNIILNGWCNICVDVFEAKRIWREMSKYCIDPDATTYTH 301

Query: 303 MISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVG 362
           MISCFSK GNLFDSLR YD MKKR W+P +EVYNSL Y+LTRENCF EALKIL+K+KE G
Sbjct: 302 MISCFSKVGNLFDSLRLYDGMKKRGWVPGIEVYNSLVYILTRENCFKEALKILDKMKETG 361

Query: 363 LRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLK 422
           L+ DS TYNS+I PLCEA KL++A+ ++  M E+N+SPTIETYH+F+Q    + +FE+L 
Sbjct: 362 LQRDSATYNSMIRPLCEAKKLEDARSLMAAMIEENVSPTIETYHAFLQGIVFEETFEVLD 421

Query: 423 RMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCG 482
           RM+  GLGPTE TFL++  K F+LE+PE AL +WVEMK+YEV  +  HY+V+++GLA CG
Sbjct: 422 RMKIAGLGPTEDTFLLLLAKFFKLEQPENALKIWVEMKQYEVASNLTHYTVMVEGLARCG 481

Query: 483 HLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMR 536
            L KARE Y EM  +G+   PKI+ +LK P     D+ ++      + + + H+KG  +R
Sbjct: 482 LLTKAREYYAEMRSNGYSDDPKIQKMLKVPVQDKNDKRKKLGGQFKRNQHVSHKKGSMVR 541

BLAST of CSPI07G20080 vs. TrEMBL
Match: V4UN86_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025296mg PE=4 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 3.6e-157
Identity = 284/509 (55.80%), Postives = 371/509 (72.89%), Query Frame = 1

Query: 37  AFHQTFSLHS---FFARQFSALPSFSQK-LGDPFLFDTGRFQNYRQSDACNARFIELFKR 96
           AFHQTFS  S    F   FS L SF  + L DPF F      +    D      ++  KR
Sbjct: 45  AFHQTFSKQSRLSHFTSHFSTLQSFPTRILTDPFAFSP---PSTHARDPREQILLDSIKR 104

Query: 97  VALLQSEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNL 156
           VA   SE +A+A+LDE  V+AD++LVYS IW LR++W+ + LAFK GE+ G++DE++  L
Sbjct: 105 VAHFDSETQAMASLDEAGVEADVNLVYSVIWALREEWRLAFLAFKLGERQGSLDEKVSEL 164

Query: 157 MIWVLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRL 216
           M+WVLGN KKF+ AW LIR+++ S  ++ +AMLVMIDRYA AN+  +AI+TF +MEKFR+
Sbjct: 165 MVWVLGNCKKFNIAWCLIRDMYKSSFSTRRAMLVMIDRYAAANDPCEAIRTFSIMEKFRI 224

Query: 217 TPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKR 276
           TPD+EAF  LLN+LC++GN+EEAEEFM VNKK+FPL   SFNIILNGWCN+ VDVFE+KR
Sbjct: 225 TPDEEAFLFLLNALCQHGNVEEAEEFMLVNKKVFPLAANSFNIILNGWCNIFVDVFESKR 284

Query: 277 IWREMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLT 336
           +WREMS  CI P++TSY HMISCFSK GNLFDSLR YD+MKKR W+P +EVYNSL +VLT
Sbjct: 285 VWREMSNYCITPNATSYAHMISCFSKVGNLFDSLRLYDEMKKRGWVPDLEVYNSLIFVLT 344

Query: 337 RENCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIE 396
           RENCF EALK+L+K+K  GL+PDS TYNS+I PLC+ GKL++A++VL  M  +N+S +  
Sbjct: 345 RENCFKEALKMLDKMKATGLQPDSATYNSMILPLCDEGKLEDARNVLATMIGENLSLSTG 404

Query: 397 TYHSFIQAADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYE 456
           TYH+F++ A  + + E+L RMR  GLGP++ TFL++ +K F+LE+PE AL VWVEMK+YE
Sbjct: 405 TYHAFLKCAGFEETLEILDRMRIAGLGPSKDTFLLILHKFFKLEQPENALRVWVEMKKYE 464

Query: 457 VFPSCEHYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQ 516
           +     HY+VL++GLA+CG L KAR+ Y EM  +G +  PK+K L+KEP  G+  E RQQ
Sbjct: 465 IDADSTHYTVLVEGLASCGLLIKARKFYVEMRSNGHLDDPKLKKLVKEPVQGNKHERRQQ 524

Query: 517 VRHNNKGKFIPHRKGRTMRW-KSHKQRSK 541
           V    + K +   KG  M   KS KQ  K
Sbjct: 525 VSKVKRNKPVGLWKGSAMEHKKSRKQLRK 550

BLAST of CSPI07G20080 vs. TrEMBL
Match: B9RGM1_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1442060 PE=4 SV=1)

HSP 1 Score: 561.6 bits (1446), Expect = 1.0e-156
Identity = 290/559 (51.88%), Postives = 377/559 (67.44%), Query Frame = 1

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQ---------------AFHQTFSLH 60
           M+SL SI  RL R HP  F+ LL     R   SPF                AFH+T S+ 
Sbjct: 1   MSSLISIGIRLRRSHPKLFYPLL-----RSTASPFSFYPLKLNPFVSLFYAAFHRTASVP 60

Query: 61  SFFARQFSALPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVA 120
              + +FS   SFS Q    PF     R  N+   D      +E  KR A    E EA+A
Sbjct: 61  FLNSLRFSGSQSFSSQNQKYPFELFEHRIYNH---DPLTQGLLETLKRAAYFPGEAEAMA 120

Query: 121 ALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFS 180
            +D   VKA+++LVYS IW LR DWK + L FKWGEK G IDE+ C L++W+LGNH+KF+
Sbjct: 121 CIDGSGVKANINLVYSVIWELRKDWKLAFLGFKWGEKWGCIDEKSCELIVWILGNHRKFN 180

Query: 181 TAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLN 240
            AW +IR++H   +N  Q ML+MIDRYA A+   KAI+ F++MEKF++ PD+EAF+ L+N
Sbjct: 181 NAWIVIRDMHQLSMNIQQTMLIMIDRYAAADNPGKAIEVFNIMEKFKMAPDEEAFYSLMN 240

Query: 241 SLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILP 300
           +LC +G IEEAEEFM VNKKLFPL TE FN+ILNGWC++ V++ EAKR+WREMSKCCI P
Sbjct: 241 ALCNHGYIEEAEEFMVVNKKLFPLETEGFNVILNGWCSICVNLLEAKRVWREMSKCCITP 300

Query: 301 DSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKIL 360
           ++TSYTHMISCFSK GNLFDSLR YD+MKKR W+P +EVYNSL YVLTRENCF EAL+ L
Sbjct: 301 NATSYTHMISCFSKVGNLFDSLRLYDEMKKRGWLPGMEVYNSLIYVLTRENCFKEALRFL 360

Query: 361 EKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQA--AD 420
           +K+KEVGL+PDSTTYNS+I PLCE  KL EA+ VL  M E+NISPT+ETYH+ ++   +D
Sbjct: 361 DKMKEVGLQPDSTTYNSMIRPLCEGKKLVEARSVLATMIEENISPTMETYHALLEVENSD 420

Query: 421 SKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSV 480
            + + E+L RM   GL PT+ TFL++  K F+LE+ E AL +W+EMK+YEV P+  HY +
Sbjct: 421 FEATLEVLNRMTVAGLAPTDDTFLLVLAKFFKLEQAENALKMWIEMKQYEVTPNLTHYKI 480

Query: 481 LIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFI 540
           L++GL  CG L KARE Y +M  +GF   PK++ +LKEP  G   + +       +   +
Sbjct: 481 LVEGLVRCGLLAKARECYADMRSNGFTDDPKLQKMLKEPVRGQNSKEKLCKGQVKRDGHV 540

BLAST of CSPI07G20080 vs. TrEMBL
Match: A0A061G9G9_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma cacao GN=TCM_027468 PE=4 SV=1)

HSP 1 Score: 560.5 bits (1443), Expect = 2.3e-156
Identity = 291/555 (52.43%), Postives = 369/555 (66.49%), Query Frame = 1

Query: 1   MASLSSIARRLCRIH-----PLPFHHLLYLNRLRIPDS------PFQAFHQTFSLHS--- 60
           + +L S+ARRL R H     P   HH   ++      S      P  AFHQT  + S   
Sbjct: 2   LMALLSLARRLQRTHSQILFPFLLHHPAAISSPSPSPSTQQLLFPSLAFHQTLFISSRTL 61

Query: 61  FFARQFSALPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAA 120
            F  +FS L S S Q L  PF F           D+     + L KRVA   SE EA+A+
Sbjct: 62  LFTSRFSTLQSLSTQTLNYPFEFTPPAIHG---PDSQEQALLHLLKRVAHFSSEAEAMAS 121

Query: 121 LDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFST 180
           LDE  +KA  DLVYS I  LR++W+ + LAFKWGEK G   E    LMIWVLGNH+KF+ 
Sbjct: 122 LDESGIKATQDLVYSVIGTLREEWRLAFLAFKWGEKCGNTGENTYELMIWVLGNHRKFNM 181

Query: 181 AWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNS 240
           AW LIR+L  S +++ +AM +MIDRYA A++  KAI+TFH MEKFR+TPD+EAF  LLN+
Sbjct: 182 AWCLIRDLFRSSMDTRRAMFIMIDRYAAASDPCKAIQTFHTMEKFRMTPDEEAFRTLLNA 241

Query: 241 LCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPD 300
           LC+YG +EEAEE M  NKKLFPL T+ FNI+LNGWCN+ VDV EAKR+WREMSK CI+P+
Sbjct: 242 LCRYGYVEEAEEIMLQNKKLFPLETDGFNIVLNGWCNILVDVVEAKRVWREMSKYCIMPN 301

Query: 301 STSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILE 360
            TSYTHMISCFSK+GNLFDSLR Y++MKKR W P +EVY SL YVLTRENC NEA  IL+
Sbjct: 302 GTSYTHMISCFSKDGNLFDSLRLYNEMKKRGWDPGIEVYKSLVYVLTRENCLNEAQNILK 361

Query: 361 KIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKM 420
           K+KE GL+PDS TYNS+I PLCEA KL+EA+++L+ M E+N+SPTIETYH+F+     + 
Sbjct: 362 KMKETGLQPDSATYNSMIRPLCEAEKLEEARNILSTMKEENLSPTIETYHAFLHGVGFEG 421

Query: 421 SFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQ 480
           + E+  RM+   LGPT  TFL++  K F++E+PE AL +W EMK +EV P   HY  L++
Sbjct: 422 TLEVFNRMKVANLGPTRDTFLLVLGKFFKMEQPEQALKIWAEMKHFEVLPDSSHYIALVE 481

Query: 481 GLATCGHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHR 540
           GL T G L KARE YDEM  +GF   PK+K  L+EP   S  + ++  R   + K +   
Sbjct: 482 GLVTSGWLDKAREYYDEMRSYGFWDDPKLKKQLEEPKQCSGSKRQRGPREGKRSKKVNLW 541

BLAST of CSPI07G20080 vs. TAIR10
Match: AT1G80880.1 (AT1G80880.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 494.2 bits (1271), Expect = 1.0e-139
Identity = 239/477 (50.10%), Postives = 337/477 (70.65%), Query Frame = 1

Query: 37  AFHQTFSLHSFFARQFSALPSF--SQKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVA 96
           AFH+   +HS   +  S LP F  S +     + +T    N           I+L ++V+
Sbjct: 47  AFHRAGHVHS---QVLSYLPHFASSNRFSTKTISETFDI-NLTALAPLEKGLIDLIRQVS 106

Query: 97  LLQSEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMI 156
            L+SE +A+A+L++     + D  YS IW LRD+W+ + LAFKWGEK G  D++ C+LMI
Sbjct: 107 ELESEADAMASLEDSSFDLNHDSFYSLIWELRDEWRLAFLAFKWGEKRGCDDQKSCDLMI 166

Query: 157 WVLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTP 216
           WVLGNH+KF+ AW LIR++     ++ +AM +M+DRYA AN+ S+AI+TF +M+KF+ TP
Sbjct: 167 WVLGNHQKFNIAWCLIRDMFNVSKDTRKAMFLMMDRYAAANDTSQAIRTFDIMDKFKHTP 226

Query: 217 DQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIW 276
             EAF  LL +LC++G+IE+AEEFM  +KKLFP+  E FN+ILNGWCN+  DV EAKRIW
Sbjct: 227 YDEAFQGLLCALCRHGHIEKAEEFMLASKKLFPVDVEGFNVILNGWCNIWTDVTEAKRIW 286

Query: 277 REMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRE 336
           REM   CI P+  SY+HMISCFSK GNLFDSLR YD+MKKR   P +EVYNSL YVLTRE
Sbjct: 287 REMGNYCITPNKDSYSHMISCFSKVGNLFDSLRLYDEMKKRGLAPGIEVYNSLVYVLTRE 346

Query: 337 NCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETY 396
           +CF+EA+K+++K+ E GL+PDS TYNS+I PLCEAGKLD A++VL  M  +N+SPT++T+
Sbjct: 347 DCFDEAMKLMKKLNEEGLKPDSVTYNSMIRPLCEAGKLDVARNVLATMISENLSPTVDTF 406

Query: 397 HSFIQAADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVF 456
           H+F++A + + + E+L +M+   LGPTE TFL++  K F+ ++PE AL +W EM R+E+ 
Sbjct: 407 HAFLEAVNFEKTLEVLGQMKISDLGPTEETFLLILGKLFKGKQPENALKIWAEMDRFEIV 466

Query: 457 PSCEHYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQ 512
            +   Y   IQGL +CG L+KAREIY EM   GF+ +P ++ LL+E  +  + ++++
Sbjct: 467 ANPALYLATIQGLLSCGWLEKAREIYSEMKSKGFVGNPMLQKLLEEQKVKGVRKSKR 519

BLAST of CSPI07G20080 vs. TAIR10
Match: AT5G15010.1 (AT5G15010.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 275.0 bits (702), Expect = 9.9e-74
Identity = 150/401 (37.41%), Postives = 232/401 (57.86%), Query Frame = 1

Query: 106 LDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEI--CNLMIWVLGNHKKF 165
           L+E DVK   +LV   +  +R+DW+++   F W  K       +   + MI +LG  +KF
Sbjct: 118 LEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKF 177

Query: 166 STAWSLIRELHG---SLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH 225
            TAW+LI E+     SL+NS Q +L+MI +Y   ++  KAI TFH  ++F+L    + F 
Sbjct: 178 DTAWTLIDEMRKFSPSLVNS-QTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQ 237

Query: 226 VLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKC 285
            LL++LC+Y N+ +A   +F NK  +P   +SFNI+LNGWCNV     EA+R+W EM   
Sbjct: 238 SLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNV 297

Query: 286 CILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEA 345
            +  D  SY+ MISC+SK G+L   L+ +D+MKK    P  +VYN++ + L + +  +EA
Sbjct: 298 GVKHDVVSYSSMISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEA 357

Query: 346 LKILEKI-KEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQ 405
             +++ + +E G+ P+  TYNSLI PLC+A K +EAK V   M E  + PTI TYH+F++
Sbjct: 358 RNLMKTMEEEKGIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMR 417

Query: 406 -AADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCE 465
                +  FELL +MR+ G  PT  T++++  K     + +  L +W EMK   V P   
Sbjct: 418 ILRTGEEVFELLAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLS 477

Query: 466 HYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTLLK 500
            Y V+I GL   G +++A   Y EM   G   +  ++ +++
Sbjct: 478 SYIVMIHGLFLNGKIEEAYGYYKEMKDKGMRPNENVEDMIQ 517

BLAST of CSPI07G20080 vs. TAIR10
Match: AT3G49730.1 (AT3G49730.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 182.2 bits (461), Expect = 8.7e-46
Identity = 106/371 (28.57%), Postives = 189/371 (50.94%), Query Frame = 1

Query: 136 FKWGEK--GGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSL--LNSMQAMLVMIDRY 195
           F W  K  G     E+C  M+ +L   ++F   W LI E+  +   L   +  +V++ R+
Sbjct: 118 FLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRF 177

Query: 196 AYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTE 255
           A AN   KA++    M K+ L PD+  F  LL++LCK G+++EA +     ++ FP    
Sbjct: 178 ASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPPNLR 237

Query: 256 SFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQ 315
            F  +L GWC     + EAK +  +M +  + PD   +T+++S ++  G + D+    + 
Sbjct: 238 YFTSLLYGWCR-EGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMND 297

Query: 316 MKKRDWIPSVEVYNSLAYVLTR-ENCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAG 375
           M+KR + P+V  Y  L   L R E   +EA+++  +++  G   D  TY +LIS  C+ G
Sbjct: 298 MRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWG 357

Query: 376 KLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSF----ELLKRMRQDGLGPTEATFL 435
            +D+   VL  M +  + P+  TY   + A + K  F    EL+++M++ G  P    + 
Sbjct: 358 MIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYN 417

Query: 436 IMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIYDEMILH 495
           ++   + +L E + A+ +W EM+   + P  + + ++I G  + G L +A   + EM+  
Sbjct: 418 VVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSR 477

Query: 496 GFIAHPKIKTL 498
           G  + P+  TL
Sbjct: 478 GIFSAPQYGTL 487

BLAST of CSPI07G20080 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 177.9 bits (450), Expect = 1.6e-44
Identity = 101/356 (28.37%), Postives = 183/356 (51.40%), Query Frame = 1

Query: 148 EICNLMIWVLGNHKKFSTAWSLIRELH--GSLLNSMQAMLVMIDRYAYANEASKAIKTFH 207
           E+   M+ +L   ++F   W LI E+      L   +  +V++ R+A A+   KAI+   
Sbjct: 148 EVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLD 207

Query: 208 MMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTV 267
            M KF   PD+  F  LL++LCK+G++++A +     +  FP+    F  +L GWC V  
Sbjct: 208 EMPKFGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVG- 267

Query: 268 DVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYN 327
            + EAK +  +M++    PD   YT+++S ++  G + D+      M++R + P+   Y 
Sbjct: 268 KMMEAKYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYT 327

Query: 328 SLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTED 387
            L   L + +   EA+K+  +++      D  TY +L+S  C+ GK+D+   VL  M + 
Sbjct: 328 VLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKK 387

Query: 388 NISPTIETYHSFIQAADSKMSF----ELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYA 447
            + P+  TY   + A + K SF    EL+++MRQ    P    + ++   + +L E + A
Sbjct: 388 GLMPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEA 447

Query: 448 LNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIYDEMILHGFIAHPKIKTL 498
           + +W EM+   + P  + + ++I GLA+ G L +A + + EM+  G  +  +  TL
Sbjct: 448 VRLWNEMEENGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTL 502

BLAST of CSPI07G20080 vs. TAIR10
Match: AT1G71060.1 (AT1G71060.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 166.4 bits (420), Expect = 4.9e-41
Identity = 115/394 (29.19%), Postives = 194/394 (49.24%), Query Frame = 1

Query: 98  SEVEAVAALDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGE--KGGAIDEEICNLMIW 157
           S+VE +  L+E  VK    L+   +  L +    +L  FKW E  KG        N +I 
Sbjct: 79  SKVETL--LNEASVKLSPALIEEVLKKLSNAGVLALSVFKWAENQKGFKHTTSNYNALIE 138

Query: 158 VLGNHKKFSTAWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPD 217
            LG  K+F   WSL+ ++    L S +   ++  RYA A +  +AI  FH ME+F    +
Sbjct: 139 SLGKIKQFKLIWSLVDDMKAKKLLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKME 198

Query: 218 QEAFHVLLNSLCKYGNIEEAEE-FMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIW 277
              F+ +L++L K  N+ +A++ F  + KK F    +S+ I+L GW    +++     + 
Sbjct: 199 SSDFNRMLDTLSKSRNVGDAQKVFDKMKKKRFEPDIKSYTILLEGW-GQELNLLRVDEVN 258

Query: 278 REMSKCCILPDSTSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRE 337
           REM      PD  +Y  +I+   K     +++RF+++M++R+  PS  ++ SL   L  E
Sbjct: 259 REMKDEGFEPDVVAYGIIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSE 318

Query: 338 NCFNEALKILEKIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETY 397
              N+AL+  E+ K  G   ++ TYN+L+   C + ++++A   +  M    + P   TY
Sbjct: 319 KKLNDALEFFERSKSSGFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTY 378

Query: 398 ----HSFIQAADSKMSFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKR 457
               H  I+   SK ++E+ + M  +   PT +T+ IM       E  + A+ +W EMK 
Sbjct: 379 DIILHHLIRMQRSKEAYEVYQTMSCE---PTVSTYEIMVRMFCNKERLDMAIKIWDEMKG 438

Query: 458 YEVFPSCEHYSVLIQGLATCGHLKKAREIYDEMI 485
             V P    +S LI  L     L +A E ++EM+
Sbjct: 439 KGVLPGMHMFSSLITALCHENKLDEACEYFNEML 466

BLAST of CSPI07G20080 vs. NCBI nr
Match: gi|778728914|ref|XP_011659500.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis sativus])

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 542/546 (99.27%), Postives = 543/546 (99.45%), Query Frame = 1

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ
Sbjct: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAALDEFDVKADLDLVYS 120
           KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALL SEVEAVAALDEFDVKADLDLVYS
Sbjct: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSSLLAFKWGEK GAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
           MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF
Sbjct: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360
           NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN
Sbjct: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360

Query: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420
           SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP
Sbjct: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIY 480
           TE TFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARE+Y
Sbjct: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540

Query: 541 GAASFE 547
           GAASFE
Sbjct: 541 GAASFE 546

BLAST of CSPI07G20080 vs. NCBI nr
Match: gi|659122515|ref|XP_008461183.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis melo])

HSP 1 Score: 1025.0 bits (2649), Expect = 4.8e-296
Identity = 505/546 (92.49%), Postives = 514/546 (94.14%), Query Frame = 1

Query: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60
           MA LSSIARRLCRIHPLPFHHLLYLNRL I DSPFQAF QT  L S FA QFSALPSFSQ
Sbjct: 1   MACLSSIARRLCRIHPLPFHHLLYLNRLSIRDSPFQAFRQTLCLRSLFAHQFSALPSFSQ 60

Query: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAALDEFDVKADLDLVYS 120
           K+GD F FDTGRF+NYRQSDACNARFIELFKRVALL SEVEAVAALDEFDV+AD DLVYS
Sbjct: 61  KVGDQFQFDTGRFKNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVQADSDLVYS 120

Query: 121 AIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180
           AIWVLRDDWKSS LAFKWGEK GAIDEEICNLMIWVLGNHKKFSTAW LIRELHGSLLNS
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGAIDEEICNLMIWVLGNHKKFSTAWCLIRELHGSLLNS 180

Query: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFMF
Sbjct: 181 RQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300
           VNKKLFPL TESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360
           NLFDSLRFYDQMKKRDWIPS+EVYNSLAY L RENCFNEALKILEKIKEVGLRPDSTTYN
Sbjct: 301 NLFDSLRFYDQMKKRDWIPSLEVYNSLAYALMRENCFNEALKILEKIKEVGLRPDSTTYN 360

Query: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420
           SLISPLCEAGKLDEAKDVLTMMTEDNI PTIETYHSFIQAADSKMSFELLKRMRQDGLGP
Sbjct: 361 SLISPLCEAGKLDEAKDVLTMMTEDNIIPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKAREIY 480
            E TFLIMFNKSFELE+PEYALN WVEMKRY+VFPS EHYSVLIQGLATCGHLKKARE+Y
Sbjct: 421 IEVTFLIMFNKSFELEQPEYALNAWVEMKRYKVFPSSEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540
           DEMILHGFIAHPKIKTLLKEPD GSIDEARQQVRHN KGKF+ HRKG TMRWKSHKQ+SK
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDSGSIDEARQQVRHNKKGKFLSHRKGSTMRWKSHKQQSK 540

Query: 541 GAASFE 547
             ASFE
Sbjct: 541 RDASFE 546

BLAST of CSPI07G20080 vs. NCBI nr
Match: gi|694326575|ref|XP_009354198.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Pyrus x bretschneideri])

HSP 1 Score: 607.8 bits (1566), Expect = 1.8e-170
Identity = 306/552 (55.43%), Postives = 392/552 (71.01%), Query Frame = 1

Query: 3   SLSSIARRLCRIHPLPFHHLLYLNRLRIP---DSPF-------------QAFHQTFSLHS 62
           +L +IARRL        H  L+L+   +P   DSP              Q FH+T  +  
Sbjct: 2   ALPAIARRL-----RSKHSQLFLSHTLLPSISDSPSPPRSSQLLLRDLCQTFHRTLLVLP 61

Query: 63  FFARQFSALPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAA 122
              R F  L  FS Q L DP  F+  RF      D+   +F+EL KR A   SE E +A 
Sbjct: 62  QTPRHFCTLQPFSAQTLNDPLGFNGQRFNKNHPRDSGFTQFLELLKRAADFASEAETMAF 121

Query: 123 LDEFDVKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFST 182
           LDE  +K D + V  AIW LR DWKS+ LAF+WGEK G  +EE C+LM+W+LG+HKKFST
Sbjct: 122 LDESGIKVDREAVLLAIWELRQDWKSAFLAFQWGEKWGCCNEEACSLMVWILGSHKKFST 181

Query: 183 AWSLIRELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNS 242
           AW LIR+LH +L+++ +AML+MIDRYA  N+  KAIKTFH+M+KFRLTPDQEAFH+LLN+
Sbjct: 182 AWCLIRDLHRALMDTRRAMLIMIDRYASVNDPCKAIKTFHVMDKFRLTPDQEAFHILLNA 241

Query: 243 LCKYGNIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPD 302
           LCKYGNIEEAEEFM VNKKLFPL TESFNIILNGWCN++VDVFEAKR+WREMSKCC+ PD
Sbjct: 242 LCKYGNIEEAEEFMLVNKKLFPLETESFNIILNGWCNISVDVFEAKRVWREMSKCCVTPD 301

Query: 303 STSYTHMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILE 362
           +TSYTH+ISCFSK G LFDSLR YD+MKKR WIP + VYNSL YVLT ENCF EALKIL+
Sbjct: 302 ATSYTHLISCFSKVGKLFDSLRLYDEMKKRGWIPGISVYNSLIYVLTCENCFKEALKILD 361

Query: 363 KIKEVGLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKM 422
           K+KE GL+ D+TTYNS+I PLCE+ KL+EA+ +L+ M  DN+SPT ETY++F+Q+   + 
Sbjct: 362 KLKEEGLQADATTYNSMICPLCESEKLEEARQMLSAMIADNLSPTTETYNAFLQSTGLEG 421

Query: 423 SFELLKRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQ 482
           + E+L RM++  LGP+ +TFL++  K F LE+PE AL +W EMK+Y V P   HY+V++Q
Sbjct: 422 TLEILNRMKKANLGPSSSTFLMILGKFFRLEQPEMALKMWTEMKQYGVVPDSAHYTVMVQ 481

Query: 483 GLATCGHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHR 538
           GLA C  L KA+EI+ EM   GF+  PK+K LL+EP  GS  + +++ +   +   +  +
Sbjct: 482 GLAACRLLIKAKEIFSEMKTDGFVEDPKLKRLLEEP--GSRVKGKRRPKPVKQATKVNQK 541

BLAST of CSPI07G20080 vs. NCBI nr
Match: gi|645245269|ref|XP_008228803.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Prunus mume])

HSP 1 Score: 605.1 bits (1559), Expect = 1.2e-169
Identity = 307/556 (55.22%), Postives = 390/556 (70.14%), Query Frame = 1

Query: 3   SLSSIARRLCRIHPLPF-HHLLYLNRLRIPDSPF----------QAFHQTFSLHSFFARQ 62
           +L SIARRL   HP  F  H L  +    P  P           Q FH+T  +     R 
Sbjct: 2   ALPSIARRLRSKHPQLFLSHTLLPSISNSPSPPHSSHPRPQHLSQTFHRTLLVPFQTPRH 61

Query: 63  FSALPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAALDEFD 122
           FS L  FS Q L DP  F+  RF+      +   +F+EL K  A   SE EA+A LD+  
Sbjct: 62  FSTLQPFSAQTLNDPLGFNAQRFKTNDPYKSGLTQFLELLKHAAHFASEAEAMAFLDKSG 121

Query: 123 VKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFSTAWSLI 182
           ++ + D V  AIW L++DWK + LAFKWGEK G  DEE C+ M+WVLG+H+KFSTAW LI
Sbjct: 122 IEVNGDTVLLAIWELKEDWKLAFLAFKWGEKLGCCDEEACSWMVWVLGSHRKFSTAWCLI 181

Query: 183 RELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYG 242
           R+LH +L+++ +A+L+MI+RYA  N+  KAIKTF MM+KFRLTPDQEAFH LLN+LCKYG
Sbjct: 182 RDLHRALMDTRRALLIMIERYASVNDPCKAIKTFQMMDKFRLTPDQEAFHTLLNALCKYG 241

Query: 243 NIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYT 302
           NIEEAEEFM VNKKLFPL T  FNIILNGWCN++VDVFEAKR+WREMSKCCI PD+TSYT
Sbjct: 242 NIEEAEEFMLVNKKLFPLETVGFNIILNGWCNISVDVFEAKRVWREMSKCCITPDATSYT 301

Query: 303 HMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEV 362
           H+ISC SK G LFDSLRFYDQMKKR ++P ++VYNSL YVLT ENCFNEALK+L   K++
Sbjct: 302 HLISCLSKVGKLFDSLRFYDQMKKRGFVPGLKVYNSLIYVLTCENCFNEALKMLGNSKQM 361

Query: 363 GLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELL 422
           GL+PDSTTYNS+I PLCE+ K +EA+ +L+ M EDN+SPTIETYH+F+Q+   + + E+L
Sbjct: 362 GLQPDSTTYNSMICPLCESKKPEEARQMLSAMIEDNVSPTIETYHAFLQSTGLEGTLEIL 421

Query: 423 KRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATC 482
            RM++  LGP   TFL++  K F LE+PE AL +W EMK Y V P   HY+V++QGLA C
Sbjct: 422 NRMKKANLGPNGNTFLMILGKFFRLEQPEMALKIWTEMKHYGVVPDSTHYTVMVQGLAAC 481

Query: 483 GHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTM 542
             L KARE++ EM  +GF+  PK++ LLKEP  GS  + +Q++R   +   +  ++GR  
Sbjct: 482 RLLMKARELFAEMRSNGFLGDPKLEKLLKEPVGGSSVKGKQRLRPVKQATRVNQKQGRMK 541

Query: 543 RWKSHKQRSKGAASFE 547
           RWKS  Q  +   S E
Sbjct: 542 RWKSPHQSREEKRSIE 557

BLAST of CSPI07G20080 vs. NCBI nr
Match: gi|658014970|ref|XP_008342813.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Malus domestica])

HSP 1 Score: 604.4 bits (1557), Expect = 2.0e-169
Identity = 308/556 (55.40%), Postives = 395/556 (71.04%), Query Frame = 1

Query: 3   SLSSIARRLCRIHPLPF-HHLLYLNRLRIPDSPF----------QAFHQTFSLHSFFARQ 62
           +L +IARRL   H   F  H L  +    P SP           Q FH+T  +     R 
Sbjct: 2   ALPAIARRLRSKHSQLFLSHTLLPSISYSPSSPCSSQPLLRDLCQTFHRTLIVLPQTPRH 61

Query: 63  FSALPSFS-QKLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLQSEVEAVAALDEFD 122
           FS L  FS Q L DP      RF      D+   +F+EL KR A   SE EAVA LDE  
Sbjct: 62  FSTLQPFSAQTLNDPL---GQRFNTDHPRDSRFTQFLELLKRAADFASEAEAVAFLDESG 121

Query: 123 VKADLDLVYSAIWVLRDDWKSSLLAFKWGEKGGAIDEEICNLMIWVLGNHKKFSTAWSLI 182
           ++ D + V  AIW LR+DWKS+ LAF+WGEK G  +EE C+LM+W+LG+HKKFSTAW LI
Sbjct: 122 IEVDRETVLLAIWELREDWKSAFLAFQWGEKWGCCNEEACSLMVWILGSHKKFSTAWCLI 181

Query: 183 RELHGSLLNSMQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYG 242
           R+LH +L+++ +AML+MIDRYA AN+  KAIKTFH+M+KFRLTPDQEAFH+LLN+LCKYG
Sbjct: 182 RDLHRALMDTRRAMLIMIDRYASANDPCKAIKTFHVMDKFRLTPDQEAFHILLNALCKYG 241

Query: 243 NIEEAEEFMFVNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYT 302
           NIEEAEEFM VNKKLFPL TESFNIILNGWCN++VDVFEAKR+WREMSKCC+ PD+TSYT
Sbjct: 242 NIEEAEEFMLVNKKLFPLETESFNIILNGWCNISVDVFEAKRVWREMSKCCVTPDATSYT 301

Query: 303 HMISCFSKNGNLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEV 362
           H+ISCFSK G LFDSLR YD+MKKR WIP + VYNSL YVLT ENCF EALKIL+K+KE 
Sbjct: 302 HLISCFSKVGKLFDSLRLYDEMKKRGWIPGISVYNSLIYVLTCENCFKEALKILDKLKEE 361

Query: 363 GLRPDSTTYNSLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELL 422
           GL+ D+TTYNS+I PLCE+ KL+EA+ +L+ M  +N+SPT ETY++F+Q+   + + E+L
Sbjct: 362 GLQADATTYNSMICPLCESEKLEEARQMLSAMIANNLSPTTETYNAFLQSTGLEGTLEIL 421

Query: 423 KRMRQDGLGPTEATFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATC 482
            RM++  LGP+ +TFL++  K F LE+PE AL +W EMK+Y V P   HY+V++QGLA C
Sbjct: 422 NRMKKASLGPSSSTFLMILGKFFRLEQPEMALKIWTEMKQYGVVPDSAHYTVMVQGLAAC 481

Query: 483 GHLKKAREIYDEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTM 542
             L KA+EI+ EM  +GF+  PK+K LL+ P  GS  + +++     +   +  ++G T 
Sbjct: 482 RLLIKAKEIFSEMKTNGFVEDPKLKRLLEVP--GSRVKGKRRRIPVKQATKVNQKQGSTK 541

Query: 543 RWKSHKQRSKGAASFE 547
           RW S ++  K  AS +
Sbjct: 542 RWNSLRKSRKKKASIK 552

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP137_ARATH1.8e-13850.10Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidop... [more]
PP383_ARATH1.8e-7237.41Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidop... [more]
PP275_ARATH1.5e-4428.57Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN... [more]
PP447_ARATH2.9e-4328.37Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
PP112_ARATH8.8e-4029.19Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0K678_CUCSA0.0e+0099.27Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431960 PE=4 SV=1[more]
U5GND9_POPTR4.4e-16354.78Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s03510g PE=4 SV=1[more]
V4UN86_9ROSI3.6e-15755.80Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025296mg PE=4 SV=1[more]
B9RGM1_RICCO1.0e-15651.88Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A061G9G9_THECC2.3e-15652.43Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma c... [more]
Match NameE-valueIdentityDescription
AT1G80880.11.0e-13950.10 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G15010.19.9e-7437.41 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G49730.18.7e-4628.57 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65820.11.6e-4428.37 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G71060.14.9e-4129.19 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778728914|ref|XP_011659500.1|0.0e+0099.27PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|659122515|ref|XP_008461183.1|4.8e-29692.49PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|694326575|ref|XP_009354198.1|1.8e-17055.43PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|645245269|ref|XP_008228803.1|1.2e-16955.22PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|658014970|ref|XP_008342813.1|2.0e-16955.40PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G20080.1CSPI07G20080.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 459..488
score: 8.1E-5coord: 219..238
score: 0.12coord: 288..315
score: 9.
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 342..400
score: 7.7
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 358..390
score: 1.5E-6coord: 323..356
score: 1.9E-4coord: 460..488
score: 1.8E-4coord: 288..320
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 355..389
score: 12.682coord: 456..490
score: 9.909coord: 249..284
score: 8.309coord: 180..214
score: 7.289coord: 146..176
score: 6.347coord: 320..354
score: 9.898coord: 285..319
score: 11.323coord: 421..455
score: 6.763coord: 215..245
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 214..232
score: 7.4E-6coord: 288..391
score: 7.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 148..511
score: 6.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 126..532
score: 3.0E-225coord: 16..92
score: 3.0E
NoneNo IPR availablePANTHERPTHR24015:SF602SUBFAMILY NOT NAMEDcoord: 126..532
score: 3.0E-225coord: 16..92
score: 3.0E

The following gene(s) are paralogous to this gene:

None