Lsi02G026640 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi02G026640
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionPentatricopeptide repeat-containing protein, putative
Locationchr02 : 33119017 .. 33121940 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTGACCTTCACTTTCCCCAATATATAAGGTCTTGTAATTCAAACTGTAGCTCAGATTTTGCCGCGAAGCCAAGCTTCCCCTGTATCTTCACAGTCGTTTTCATGGTAATCCATGGCATGTCTTTCTTCTATTGCAAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCAACTCCTTCAGCTCATCCATCTCCAAATCCCCGATTCACCCTTTCAGGCATTTCATCAAACACTTCGTCTGCATTCTCAGTCCGCTCGTCAATTCTCAGCTCTTCGATCAGTTGGCCACCCATTTCACTTTGACACAGGAAGATTCCAAAACCATCGCCCAAACGACGCGCGTAGTGCCCAATTCGTCGAATTGCTCAAACGGGTCGCTCGGTTGCCATCGGAAGTGGAGGCTGTTGCTGCGTTGGATGAGTTTGATGTTCAGGCAGATCCAGATTTGGTGTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCATTTCTAGCATTCAAATGGGGTGAGAAATGGGGATCTATTGATGAAGAGATTTGTAATTTGATTATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTGTTTGATTAGAGAATTGCATGGATCCTTGATGGATTCAAGGCAAGCGATGCTTGTCATGATTGATAGGTAAAGTTTATTTTCGCATATCTCCATTTACTCTTGACATTTGTTTGTTCACATTCTGATATGCCATTTGTTATGACTGCCAAAGACGTTTTGCATTTTGATTTTACTCGATAACTTAGTGCATAGGCCAAGTAATGTTGAAAGCAAGTTTGTGAACTCAGATAGCAAATGGCTTGCTTCTACAAGCTTGTATGTAAACGAGTCTTTTTCTATGTGTGTGTGTGAGTTTTTTCCATAAGCAAGAAGCTTTTGGTTGAAATTGGAGTTGCATTTCATTTTTCATTTGTGTTTTTTCTTGGATTCAAGGATGGCCAACATATTTCCGTGAGATGTGGAAAGGATCTTCTTTTATGCAATCTGTTAATTGGGTTCTTACTGTTACTGGTTTATGTAGTAAATGAGAAAATGAAGGGAGATTACATTACATCCATAGTTATTTTTGAAATTTGTATGCTGCTAACAAGGTTGTTCCTGGTTTTAGGTATGCATATGCCAATGAGGTAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAATTCAGATTGACTCCGGATCAAGAGGCTTTTCACGCGCTTCTTAATTCTCTCTGTAAATATGGGAACATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGAAACAGAAAGCTTCAATATTATTCTCAACGGCTGGTGCAATGTATCTGTTAATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATCTCTAAATGTTGCATTTTGCCAGATTCAACTTCTTACACCCACATGATTTCCTGTTTTTCGAAGACTGGGAACCTTTTCGACTCGCTTAGACTCTATGATGAGATGAAGAAAAGGGATTGGGTTCCAAGCCTTGAAGTCTTTAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAGTGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAATAGGTTTGCAGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGATGGGACAGCTTGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCGGTCCAACAATCGAAACCTACCATTCTTTTATTCAGGGTGCAGATTCTGAAATGAGCTTTGAACTTCTTAAGCAGATGAGACAGGATGGTTTAGGTCCTACAGAGGCTACGTTTGTTATAATGTTTAACAAGTCATTTGAACTAGAACAACCAGATTATGCATTGAAGGCGTGGGTAGAAATGAAGCGGTACGAGGTAGTGCCTAATTCCGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATATGGTCGGTTAAAACAGGCCAGAGAATTGTATGACGAAATGACATCACATGGATTTATCGCACATCCAAAGATTAAACTGCTCCTGAAGGAACCAGATTTAGGTAGCATTGAAGAAGCAAGACAGCAAGTGAGACATAACAAGAAAGGTAAGTTCTTTTCTCATAGGAAAGGGAGCATGATGAAATGGAAATCATATAAACAACAATCTACAGAGGATGCATCATTTGAGTAGGTAATTTATCTGTATATGTGCTACTTGAATAGATTTCAATTGAATCATATTCTATATTCTGCTAGTATTGTATAGGAAATAAGGACAACATCTTGTTAGGATGAATCATTGTCCTAACAAATTAGTTATTCACCTTACTTCCTATTTACCCATATAAAAACAGGATTTCTGCTTGAATTTTAGGGGAGATTCTTTTGGTCATTCAGATGTTATGATGTTTTTTTATTTTTCATTTTAAGTTCAACTATATGTGAGGTTAAGATTTAAATCGGCGACCTTTTGTTCGTCGATATAATGTATTAACTAGTTGAACTATGTTTTTTTTTTTTTTTTTTTTTTTTAAAAGTATTCAGAGTTTGAAACTATAGACTGTTGGTCTTATATGAATCCTTGGTCTGATAAGGAATAAAAACTCATTGTGGAAGCTGCTGAAAAAGCATTCCTTTCTTCCATTTGGCACAAAGCTCGAAGAGAACACTCGCATGGTCTACTAAAGAGAAAAGATACGACACCGTTTCTATTTTATGTACTTTTTTGTATTTTCTTTTTCCTCTTTTGAAGGTATAGGCTATTTAGTGGTTAGATGACGAGAAAAGTTAATTCCCAGTTGTCCTTTTTGTAGCTTCATTTGTTGGAATTCGCCATGATAAAAGCTCATAAATTCACTAGAAGGGCAACCTTTTCAAGCATGGATAGATTTATA

mRNA sequence

CTGACCTTCACTTTCCCCAATATATAAGGTCTTGTAATTCAAACTGTAGCTCAGATTTTGCCGCGAAGCCAAGCTTCCCCTGTATCTTCACAGTCGTTTTCATGGTAATCCATGGCATGTCTTTCTTCTATTGCAAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCAACTCCTTCAGCTCATCCATCTCCAAATCCCCGATTCACCCTTTCAGGCATTTCATCAAACACTTCGTCTGCATTCTCAGTCCGCTCGTCAATTCTCAGCTCTTCGATCAGTTGGCCACCCATTTCACTTTGACACAGGAAGATTCCAAAACCATCGCCCAAACGACGCGCGTAGTGCCCAATTCGTCGAATTGCTCAAACGGGTCGCTCGGTTGCCATCGGAAGTGGAGGCTGTTGCTGCGTTGGATGAGTTTGATGTTCAGGCAGATCCAGATTTGGTGTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCATTTCTAGCATTCAAATGGGGTGAGAAATGGGGATCTATTGATGAAGAGATTTGTAATTTGATTATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTGTTTGATTAGAGAATTGCATGGATCCTTGATGGATTCAAGGCAAGCGATGCTTGTCATGATTGATAGGTATGCATATGCCAATGAGGTAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAATTCAGATTGACTCCGGATCAAGAGGCTTTTCACGCGCTTCTTAATTCTCTCTGTAAATATGGGAACATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGAAACAGAAAGCTTCAATATTATTCTCAACGGCTGGTGCAATGTATCTGTTAATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATCTCTAAATGTTGCATTTTGCCAGATTCAACTTCTTACACCCACATGATTTCCTGTTTTTCGAAGACTGGGAACCTTTTCGACTCGCTTAGACTCTATGATGAGATGAAGAAAAGGGATTGGGTTCCAAGCCTTGAAGTCTTTAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAGTGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAATAGGTTTGCAGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGATGGGACAGCTTGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCGGTCCAACAATCGAAACCTACCATTCTTTTATTCAGGGTGCAGATTCTGAAATGAGCTTTGAACTTCTTAAGCAGATGAGACAGGATGGTTTAGGTCCTACAGAGGCTACGTTTGTTATAATGTTTAACAAGTCATTTGAACTAGAACAACCAGATTATGCATTGAAGGCGTGGGTAGAAATGAAGCGGTACGAGGTAGTGCCTAATTCCGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATATGGTCGGTTAAAACAGGCCAGAGAATTGTATGACGAAATGACATCACATGGATTTATCGCACATCCAAAGATTAAACTGCTCCTGAAGGAACCAGATTTAGGTAGCATTGAAGAAGCAAGACAGCAAGTGAGACATAACAAGAAAGGTAAGTTCTTTTCTCATAGGAAAGGGAGCATGATGAAATGGAAATCATATAAACAACAATCTACAGAGGATGCATCATTTGAGTAGGAATAAAAACTCATTGTGGAAGCTGCTGAAAAAGCATTCCTTTCTTCCATTTGGCACAAAGCTCGAAGAGAACACTCGCATGGTCTACTAAAGAGAAAAGATACGACACCGTTTCTATTTTATGTACTTTTTTGTATTTTCTTTTTCCTCTTTTGAAGGTATAGGCTATTTAGTGGTTAGATGACGAGAAAAGTTAATTCCCAGTTGTCCTTTTTGTAGCTTCATTTGTTGGAATTCGCCATGATAAAAGCTCATAAATTCACTAGAAGGGCAACCTTTTCAAGCATGGATAGATTTATA

Coding sequence (CDS)

ATGGCATGTCTTTCTTCTATTGCAAGAAGGCTCTGCAGAATCCATCCATTGCCATTCCATCAACTCCTTCAGCTCATCCATCTCCAAATCCCCGATTCACCCTTTCAGGCATTTCATCAAACACTTCGTCTGCATTCTCAGTCCGCTCGTCAATTCTCAGCTCTTCGATCAGTTGGCCACCCATTTCACTTTGACACAGGAAGATTCCAAAACCATCGCCCAAACGACGCGCGTAGTGCCCAATTCGTCGAATTGCTCAAACGGGTCGCTCGGTTGCCATCGGAAGTGGAGGCTGTTGCTGCGTTGGATGAGTTTGATGTTCAGGCAGATCCAGATTTGGTGTACTCGGCAATTTGGGTGTTGAGGGATGATTGGAAATCGTCATTTCTAGCATTCAAATGGGGTGAGAAATGGGGATCTATTGATGAAGAGATTTGTAATTTGATTATATGGGTGTTGGGCAATCATAAGAAATTCAGTACTGCTTGGTGTTTGATTAGAGAATTGCATGGATCCTTGATGGATTCAAGGCAAGCGATGCTTGTCATGATTGATAGGTATGCATATGCCAATGAGGTAAGTAAGGCTATTAAGACATTCCACATGATGGAGAAATTCAGATTGACTCCGGATCAAGAGGCTTTTCACGCGCTTCTTAATTCTCTCTGTAAATATGGGAACATCGAAGAAGCTGAAGAGTTTATGTTTGTAAACAAGAAGCTTTTTCCTTTGGAAACAGAAAGCTTCAATATTATTCTCAACGGCTGGTGCAATGTATCTGTTAATGTGTTTGAAGCAAAGAGAATTTGGAGAGAAATCTCTAAATGTTGCATTTTGCCAGATTCAACTTCTTACACCCACATGATTTCCTGTTTTTCGAAGACTGGGAACCTTTTCGACTCGCTTAGACTCTATGATGAGATGAAGAAAAGGGATTGGGTTCCAAGCCTTGAAGTCTTTAATTCTTTAGCTTACGTGTTGACCCGGGAGAATTGCTTCAGTGAAGCTCTCAAAATCCTTGAGAAAATAAAAGAAATAGGTTTGCAGCCAGACTCCACTACATACAACTCACTTATAAGTCCTCTGTGTGAGATGGGACAGCTTGACGAAGCAAAAGATGTACTGACCATGATGACTGAGGACAATATCGGTCCAACAATCGAAACCTACCATTCTTTTATTCAGGGTGCAGATTCTGAAATGAGCTTTGAACTTCTTAAGCAGATGAGACAGGATGGTTTAGGTCCTACAGAGGCTACGTTTGTTATAATGTTTAACAAGTCATTTGAACTAGAACAACCAGATTATGCATTGAAGGCGTGGGTAGAAATGAAGCGGTACGAGGTAGTGCCTAATTCCGAACATTACTCAGTCTTGATACAAGGCCTTGCAACATATGGTCGGTTAAAACAGGCCAGAGAATTGTATGACGAAATGACATCACATGGATTTATCGCACATCCAAAGATTAAACTGCTCCTGAAGGAACCAGATTTAGGTAGCATTGAAGAAGCAAGACAGCAAGTGAGACATAACAAGAAAGGTAAGTTCTTTTCTCATAGGAAAGGGAGCATGATGAAATGGAAATCATATAAACAACAATCTACAGAGGATGCATCATTTGAGTAG

Protein sequence

MACLSSIARRLCRIHPLPFHQLLQLIHLQIPDSPFQAFHQTLRLHSQSARQFSALRSVGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCLIRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKGSMMKWKSYKQQSTEDASFE
BLAST of Lsi02G026640 vs. Swiss-Prot
Match: PP137_ARATH (Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidopsis thaliana GN=At1g80880 PE=2 SV=1)

HSP 1 Score: 499.2 bits (1284), Expect = 5.7e-140
Identity = 226/425 (53.18%), Postives = 321/425 (75.53%), Query Frame = 1

Query: 83  VELLKRVARLPSEVEAVAALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSID 142
           ++L+++V+ L SE +A+A+L++     + D  YS IW LRD+W+ +FLAFKWGEK G  D
Sbjct: 95  IDLIRQVSELESEADAMASLEDSSFDLNHDSFYSLIWELRDEWRLAFLAFKWGEKRGCDD 154

Query: 143 EEICNLIIWVLGNHKKFSTAWCLIRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHM 202
           ++ C+L+IWVLGNH+KF+ AWCLIR++     D+R+AM +M+DRYA AN+ S+AI+TF +
Sbjct: 155 QKSCDLMIWVLGNHQKFNIAWCLIRDMFNVSKDTRKAMFLMMDRYAAANDTSQAIRTFDI 214

Query: 203 MEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVN 262
           M+KF+ TP  EAF  LL +LC++G+IE+AEEFM  +KKLFP++ E FN+ILNGWCN+  +
Sbjct: 215 MDKFKHTPYDEAFQGLLCALCRHGHIEKAEEFMLASKKLFPVDVEGFNVILNGWCNIWTD 274

Query: 263 VFEAKRIWREISKCCILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNS 322
           V EAKRIWRE+   CI P+  SY+HMISCFSK GNLFDSLRLYDEMKKR   P +EV+NS
Sbjct: 275 VTEAKRIWREMGNYCITPNKDSYSHMISCFSKVGNLFDSLRLYDEMKKRGLAPGIEVYNS 334

Query: 323 LAYVLTRENCFSEALKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDN 382
           L YVLTRE+CF EA+K+++K+ E GL+PDS TYNS+I PLCE G+LD A++VL  M  +N
Sbjct: 335 LVYVLTREDCFDEAMKLMKKLNEEGLKPDSVTYNSMIRPLCEAGKLDVARNVLATMISEN 394

Query: 383 IGPTIETYHSFIQGADSEMSFELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWV 442
           + PT++T+H+F++  + E + E+L QM+   LGPTE TF+++  K F+ +QP+ ALK W 
Sbjct: 395 LSPTVDTFHAFLEAVNFEKTLEVLGQMKISDLGPTEETFLLILGKLFKGKQPENALKIWA 454

Query: 443 EMKRYEVVPNSEHYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSI 502
           EM R+E+V N   Y   IQGL + G L++ARE+Y EM S GF+ +P ++ LL+E  +  +
Sbjct: 455 EMDRFEIVANPALYLATIQGLLSCGWLEKAREIYSEMKSKGFVGNPMLQKLLEEQKVKGV 514

Query: 503 EEARQ 508
            ++++
Sbjct: 515 RKSKR 519

BLAST of Lsi02G026640 vs. Swiss-Prot
Match: PP383_ARATH (Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidopsis thaliana GN=At5g15010 PE=2 SV=2)

HSP 1 Score: 272.7 bits (696), Expect = 8.6e-72
Identity = 159/441 (36.05%), Postives = 258/441 (58.50%), Query Frame = 1

Query: 102 LDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEI--CNLIIWVLGNHKKF 161
           L+E DV+   +LV   +  +R+DW+++F  F W  K       +   + +I +LG  +KF
Sbjct: 118 LEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKF 177

Query: 162 STAWCLIRELHG---SLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFH 221
            TAW LI E+     SL++S Q +L+MI +Y   ++V KAI TFH  ++F+L    + F 
Sbjct: 178 DTAWTLIDEMRKFSPSLVNS-QTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQ 237

Query: 222 ALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKC 281
           +LL++LC+Y N+ +A   +F NK  +P + +SFNI+LNGWCNV  +  EA+R+W E+   
Sbjct: 238 SLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNV 297

Query: 282 CILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEA 341
            +  D  SY+ MISC+SK G+L   L+L+D MKK    P  +V+N++ + L + +  SEA
Sbjct: 298 GVKHDVVSYSSMISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEA 357

Query: 342 LKILEKI-KEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQ 401
             +++ + +E G++P+  TYNSLI PLC+  + +EAK V   M E  + PTI TYH+F++
Sbjct: 358 RNLMKTMEEEKGIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMR 417

Query: 402 -GADSEMSFELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSE 461
                E  FELL +MR+ G  PT  T++++  K       D  L  W EMK   V P+  
Sbjct: 418 ILRTGEEVFELLAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLS 477

Query: 462 HYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKK 521
            Y V+I GL   G++++A   Y EM   G   +  ++ +++    G  + A Q++  + K
Sbjct: 478 SYIVMIHGLFLNGKIEEAYGYYKEMKDKGMRPNENVEDMIQSWFSGK-QYAEQRIT-DSK 537

Query: 522 GKFFSHRKGSMMKWKSYKQQS 536
           G+     KG+++K KS ++++
Sbjct: 538 GEV---NKGAIVK-KSEREKN 551

BLAST of Lsi02G026640 vs. Swiss-Prot
Match: PP275_ARATH (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 9.9e-44
Identity = 104/398 (26.13%), Postives = 198/398 (49.75%), Query Frame = 1

Query: 101 ALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSI--DEEICNLIIWVLGNHKK 160
           AL+E  +   P L+   +    D     +  F W  K        E+C  ++ +L   ++
Sbjct: 87  ALNESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQ 146

Query: 161 FSTAWCLIRELHGSLMD--SRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFH 220
           F   W LI E+  +  +    +  +V++ R+A AN V KA++    M K+ L PD+  F 
Sbjct: 147 FGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFG 206

Query: 221 ALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKC 280
            LL++LCK G+++EA +     ++ FP     F  +L GWC     + EAK +  ++ + 
Sbjct: 207 CLLDALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREG-KLMEAKEVLVQMKEA 266

Query: 281 CILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTR-ENCFSE 340
            + PD   +T+++S ++  G + D+  L ++M+KR + P++  +  L   L R E    E
Sbjct: 267 GLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDE 326

Query: 341 ALKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQ 400
           A+++  +++  G + D  TY +LIS  C+ G +D+   VL  M +  + P+  TY   + 
Sbjct: 327 AMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMV 386

Query: 401 GADSEMSF----ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVP 460
             + +  F    EL+++M++ G  P    + ++   + +L +   A++ W EM+   + P
Sbjct: 387 AHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSP 446

Query: 461 NSEHYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPK 490
             + + ++I G  + G L +A   + EM S G  + P+
Sbjct: 447 GVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQ 483

BLAST of Lsi02G026640 vs. Swiss-Prot
Match: PP233_ARATH (Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis thaliana GN=At3g15200 PE=3 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 2.2e-43
Identity = 125/450 (27.78%), Postives = 224/450 (49.78%), Query Frame = 1

Query: 55  LRSVGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQADPDLV 114
           L S+G P  F   RF + +  D +SA  V  + +  R  S  +    LD+  +    +LV
Sbjct: 56  LHSLGAPDKFPN-RFNDDK--DKQSALDVHNIIKHHRGSSPEKIKRILDKCGIDLTEELV 115

Query: 115 YSAIWVLRDDWKSSFLAFKWGEKWGS--IDEEICNLIIWVLGNHKKFSTAWCLIRELHGS 174
              +   R DWK +++  +   K         + N I+ VLG  ++F     +  E+  S
Sbjct: 116 LEVVNRNRSDWKPAYILSQLVVKQSVHLSSSMLYNEILDVLGKMRRFEEFHQVFDEM--S 175

Query: 175 LMD---SRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIE 234
             D   + +   V+++RYA A++V +A+  F   ++F +  D  AFH LL  LC+Y ++E
Sbjct: 176 KRDGFVNEKTYEVLLNRYAAAHKVDEAVGVFERRKEFGIDDDLVAFHGLLMWLCRYKHVE 235

Query: 235 EAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHMI 294
            AE      ++ F  + ++ N+ILNGWC V  NV EAKR W++I      PD  SY  MI
Sbjct: 236 FAETLFCSRRREFGCDIKAMNMILNGWC-VLGNVHEAKRFWKDIIASKCRPDVVSYGTMI 295

Query: 295 SCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGLQ 354
           +  +K G L  ++ LY  M      P +++ N++   L  +    EAL++  +I E G  
Sbjct: 296 NALTKKGKLGKAMELYRAMWDTRRNPDVKICNNVIDALCFKKRIPEALEVFREISEKGPD 355

Query: 355 PDSTTYNSLISPLCEMGQLDEAKDVLTMM--TEDNIGPTIETYHSFIQGADSEMSFEL-L 414
           P+  TYNSL+  LC++ + ++  +++  M     +  P   T+   ++ +      ++ L
Sbjct: 356 PNVVTYNSLLKHLCKIRRTEKVWELVEEMELKGGSCSPNDVTFSYLLKYSQRSKDVDIVL 415

Query: 415 KQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATY 474
           ++M ++    T   + +MF    + ++ +   + W EM+R  + P+   Y++ I GL T 
Sbjct: 416 ERMAKNKCEMTSDLYNLMFRLYVQWDKEEKVREIWSEMERSGLGPDQRTYTIRIHGLHTK 475

Query: 475 GRLKQARELYDEMTSHGFIAHPKIKLLLKE 497
           G++ +A   + EM S G +  P+ ++LL +
Sbjct: 476 GKIGEALSYFQEMMSKGMVPEPRTEMLLNQ 499

BLAST of Lsi02G026640 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 4.9e-43
Identity = 107/405 (26.42%), Postives = 203/405 (50.12%), Query Frame = 1

Query: 101 ALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDE--EICNLIIWVLGNHKK 160
           AL+E  V+  P L+   +    D     +  F W  K        E+   ++ +L   ++
Sbjct: 103 ALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQ 162

Query: 161 FSTAWCLIRELH--GSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFH 220
           F   W LI E+      +   +  +V++ R+A A+ V KAI+    M KF   PD+  F 
Sbjct: 163 FGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFG 222

Query: 221 ALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKC 280
            LL++LCK+G++++A +     +  FP+    F  +L GWC V   + EAK +  ++++ 
Sbjct: 223 CLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVG-KMMEAKYVLVQMNEA 282

Query: 281 CILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEA 340
              PD   YT+++S ++  G + D+  L  +M++R + P+   +  L   L + +   EA
Sbjct: 283 GFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEA 342

Query: 341 LKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQG 400
           +K+  +++    + D  TY +L+S  C+ G++D+   VL  M +  + P+  TY   +  
Sbjct: 343 MKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVA 402

Query: 401 ADSEMSF----ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPN 460
            + + SF    EL+++MRQ    P    + ++   + +L +   A++ W EM+   + P 
Sbjct: 403 HEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPG 462

Query: 461 SEHYSVLIQGLATYGRLKQARELYDEMTSHGFIA---HPKIKLLL 495
            + + ++I GLA+ G L +A + + EM + G  +   +  +KLLL
Sbjct: 463 VDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLL 506

BLAST of Lsi02G026640 vs. TrEMBL
Match: A0A0A0K678_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431960 PE=4 SV=1)

HSP 1 Score: 932.9 bits (2410), Expect = 1.7e-268
Identity = 461/546 (84.43%), Postives = 495/546 (90.66%), Query Frame = 1

Query: 1   MACLSSIARRLCRIHPLPFHQLLQLIHLQIPDSPFQAFHQTLRLHSQSARQFSALRS--- 60
           MA LSSIARRLCRIHPLPFH LL L  L+IPDSPFQAFHQT  LHS  ARQFSAL S   
Sbjct: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60

Query: 61  -VGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQADPDLVYS 120
            +G PF FDTGRFQN+R +DA +A+F+EL KRVA LPSEVEAVAALDEFDV+AD DLVYS
Sbjct: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120

Query: 121 AIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCLIRELHGSLMDS 180
           AIWVLRDDWKSS LAFKWGEK G+IDEEICNL+IWVLGNHKKFSTAW LIRELHGSL++S
Sbjct: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180

Query: 181 RQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYAYANE SKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFMF
Sbjct: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHMISCFSKTG 300
           VNKKLFPL TESFNIILNGWCNV+V+VFEAKRIWRE+SKCCILPDSTSYTHMISCFSK G
Sbjct: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 NLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGLQPDSTTYN 360
           NLFDSLR YD+MKKRDW+PS+EV+NSLAYVLTRENCF+EALKILEKIKE+GL+PDSTTYN
Sbjct: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360

Query: 361 SLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFELLKQMRQDGLGP 420
           SLISPLCE G+LDEAKDVLTMMTEDNI PTIETYHSFIQ ADS+MSFELLK+MRQDGLGP
Sbjct: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATYGRLKQARELY 480
           TE TF+IMFNKSFELE+P+YAL  WVEMKRYEV P+ EHYSVLIQGLAT G LK+ARELY
Sbjct: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKGSMMKWKSYKQQST 540
           DEM  HGFIAHPKIK LLKEPDLGSI+EARQQVRHN KGKF  HRKG  M+WKS+KQ+S 
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540

Query: 541 EDASFE 543
             ASFE
Sbjct: 541 GAASFE 546

BLAST of Lsi02G026640 vs. TrEMBL
Match: U5GND9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s03510g PE=4 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 1.4e-166
Identity = 301/543 (55.43%), Postives = 386/543 (71.09%), Query Frame = 1

Query: 4   LSSIARRLCRIHPLPFHQLLQLIHLQIPDSPFQ----------AFHQTLRLHSQSARQFS 63
           L +IARRL   H      LL  I    P SP             FH+TL + ++S   FS
Sbjct: 3   LLTIARRLQISHSRLLFPLLYSITYPHPPSPSNNSNYPIFFSVEFHRTLSIPTRSPHHFS 62

Query: 64  ALRS-----VGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQ 123
             +S     +   F       Q H P        ++ LK  A LPSE EA+A+LDE  ++
Sbjct: 63  TSQSFSTQYLNVSFELIQQGIQTHEP---LQMGLLQSLKMAAHLPSEAEAMASLDESGIR 122

Query: 124 ADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCLIRE 183
           A+ +LVYS IW LR++W+ +FLAFKWG+KWG +DE+ C L++WVLGNH+KF+TAW LIR+
Sbjct: 123 ANQNLVYSVIWELREEWRLAFLAFKWGDKWGCVDEKACELMVWVLGNHRKFNTAWILIRD 182

Query: 184 LHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNI 243
           LH SLM +R+AML+MIDRYA AN   KAI  F +M+KFR+TPD+EAF+ LLN LCK GNI
Sbjct: 183 LHRSLMSTRKAMLIMIDRYAAANVPGKAIYAFRIMDKFRMTPDEEAFYFLLNVLCKNGNI 242

Query: 244 EEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHM 303
           EEAEEFM VNKK FPLE E FNIILNGWCN+ V+VFEAKRIWRE+SK CI PD+T+YTHM
Sbjct: 243 EEAEEFMLVNKKFFPLEVEGFNIILNGWCNICVDVFEAKRIWREMSKYCIDPDATTYTHM 302

Query: 304 ISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGL 363
           ISCFSK GNLFDSLRLYD MKKR WVP +EV+NSL Y+LTRENCF EALKIL+K+KE GL
Sbjct: 303 ISCFSKVGNLFDSLRLYDGMKKRGWVPGIEVYNSLVYILTRENCFKEALKILDKMKETGL 362

Query: 364 QPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFELLKQ 423
           Q DS TYNS+I PLCE  +L++A+ ++  M E+N+ PTIETYH+F+QG   E +FE+L +
Sbjct: 363 QRDSATYNSMIRPLCEAKKLEDARSLMAAMIEENVSPTIETYHAFLQGIVFEETFEVLDR 422

Query: 424 MRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATYGR 483
           M+  GLGPTE TF+++  K F+LEQP+ ALK WVEMK+YEV  N  HY+V+++GLA  G 
Sbjct: 423 MKIAGLGPTEDTFLLLLAKFFKLEQPENALKIWVEMKQYEVASNLTHYTVMVEGLARCGL 482

Query: 484 LKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKGSMMKW 532
           L +ARE Y EM S+G+   PKI+ +LK P     ++ ++     K+ +  SH+KGSM++ 
Sbjct: 483 LTKAREYYAEMRSNGYSDDPKIQKMLKVPVQDKNDKRKKLGGQFKRNQHVSHKKGSMVRR 542

BLAST of Lsi02G026640 vs. TrEMBL
Match: A0A067LCX9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26977 PE=4 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 9.0e-161
Identity = 288/509 (56.58%), Postives = 370/509 (72.69%), Query Frame = 1

Query: 4   LSSIARRLCRIHPLPFHQLLQLIHLQIPDSPFQ---------AFHQTLRLHSQSARQFSA 63
           L +IARRL R +P+    L Q I    P SP +         AFH+T+ +   ++ +FS 
Sbjct: 3   LITIARRLQRSYPIHVLPLFQSI--ASPSSPSRLPLKSFFLAAFHRTVSIPDSNSLRFST 62

Query: 64  LRSVGHPFHFDTGRF--------QNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFD 123
            +      +F T  F        Q    N+A     ++ LKR A  P+E EAVA +DE  
Sbjct: 63  SQ------YFPTQNFKEPFDLIQQRIHVNEALEPSLLDSLKRAAHSPTEAEAVAFVDESG 122

Query: 124 VQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCLI 183
           + A+ +LVYS IW LR++W+ ++LA+KWG+KWG +DE+ C L++WVLG+HKKF+ AW LI
Sbjct: 123 ITANQNLVYSLIWNLREEWRQAYLAYKWGQKWGCVDEKCCELMVWVLGSHKKFNIAWILI 182

Query: 184 RELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYG 243
           R+L+ SLM+ RQAMLVMIDRYA AN   KAI+ F +MEKFRL PD+EAF+ LLN+LCK+G
Sbjct: 183 RDLYRSLMNPRQAMLVMIDRYAAANCPGKAIEAFDVMEKFRLAPDEEAFYTLLNALCKHG 242

Query: 244 NIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYT 303
           NIEEAEEFM +NKKLFPL TE FNIILNGWCN+ V+V EAKR+WRE+SKCCI PDSTSYT
Sbjct: 243 NIEEAEEFMLINKKLFPLGTEGFNIILNGWCNICVDVLEAKRVWREMSKCCITPDSTSYT 302

Query: 304 HMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEI 363
           HMI+CFSK GNLFDSLRLYDEMKKR +VP + V+N L YVLT ENCF  ALK++EK+KE 
Sbjct: 303 HMITCFSKVGNLFDSLRLYDEMKKRGFVPGIVVYNCLIYVLTHENCFEAALKVVEKMKEH 362

Query: 364 GLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFELL 423
           GLQPDSTT+NS+I  LCE  +L EA+++L MM E+NI PT+ETYH+ +QG   E + E+L
Sbjct: 363 GLQPDSTTFNSIIHSLCERQRLAEARNILAMMVEENICPTMETYHAVLQGTGFEETLEIL 422

Query: 424 KQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATY 483
            QM   GL PT+ TF+++  K F+LEQPD ALK WVEMK+YEV PN+ HY VL++GLA Y
Sbjct: 423 DQMIISGLAPTKDTFLLILVKFFKLEQPDNALKIWVEMKQYEVTPNATHYRVLVEGLARY 482

Query: 484 GRLKQARELYDEMTSHGFIAHPKIKLLLK 496
           G L +A+E Y +M S+GF   PK++ +LK
Sbjct: 483 GLLTKAQEYYADMKSNGFSDDPKLQKILK 503

BLAST of Lsi02G026640 vs. TrEMBL
Match: W9QV67_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_018066 PE=4 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 9.0e-161
Identity = 293/514 (57.00%), Postives = 375/514 (72.96%), Query Frame = 1

Query: 6   SIARRLCRIHPLPFHQLLQ---LIHLQIPDSPF----------QAFHQTLRLHSQSAR-- 65
           ++A+RL R H    HQLL    L HL    S            QAFH+T  +  +S+   
Sbjct: 5   TLAKRLQRTH---LHQLLSPFILHHLHSTSSSTLSQQSHHHISQAFHRTHFIPFRSSPPQ 64

Query: 66  -QFSAL-----RSVGHPFHFDTGRFQNHRPNDARSA-QFVELLKRVARLPSEVEAVAALD 125
             FS L     ++   P  F+ G FQ H   D       V+ L++VA  P+E EA+++L+
Sbjct: 65  LHFSTLQHQPPKTNPDPLDFNAGTFQTHDLQDHPGLLHVVKSLEKVAHFPTEAEAMSSLE 124

Query: 126 EFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAW 185
           E  V+  P+LV SAIWVLR+DW+ +FLAFKWGEKW   DEE   L++WVLG+H+KF+TAW
Sbjct: 125 ESCVEVSPELVRSAIWVLREDWRVAFLAFKWGEKWDCCDEEAWILMVWVLGSHRKFNTAW 184

Query: 186 CLIRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLC 245
           CLIR+LH S MD+R+AMLVMIDRYA AN+  KAI  FH MEKF LTPDQEAF   LN+LC
Sbjct: 185 CLIRDLHRSSMDTRRAMLVMIDRYACANDPDKAIWAFHFMEKFSLTPDQEAFCIALNALC 244

Query: 246 KYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDST 305
           K+G IE+AEEFM VNKKLFPLETE FNIILNGWCN+SV++ EAKR+WRE+SKCCI P++T
Sbjct: 245 KHGYIEKAEEFMLVNKKLFPLETEGFNIILNGWCNISVDLSEAKRVWREMSKCCIEPNAT 304

Query: 306 SYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKI 365
           SYTHMI C SK GNLFDSLRLYDEMKKR W+PS++V+NSL +VLTRENCF EALKIL+K+
Sbjct: 305 SYTHMIHCLSKDGNLFDSLRLYDEMKKRGWLPSIKVYNSLIFVLTRENCFKEALKILQKL 364

Query: 366 KEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSF 425
           +E GLQPD+TTYNS+ISPLC+  +LDEA+++L  M  +NIGPT ETYH+F++  + E + 
Sbjct: 365 RESGLQPDATTYNSMISPLCKARKLDEARNMLVTMLGENIGPTTETYHAFLEIVEFEETL 424

Query: 426 ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGL 485
           E+L +M++  LGP+  TF ++  K F++EQ + ALK W EMKRY+VVP+S HY++++QGL
Sbjct: 425 EVLSRMKKAKLGPSRETFGMVLEKFFKVEQAENALKIWEEMKRYDVVPDSAHYTIMLQGL 484

Query: 486 ATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEP 498
           AT G   +ARE    M S GFI  PK+K L+KEP
Sbjct: 485 ATCGLFTKAREFLAAMRSDGFIEDPKVKKLVKEP 515

BLAST of Lsi02G026640 vs. TrEMBL
Match: B9RGM1_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1442060 PE=4 SV=1)

HSP 1 Score: 572.4 bits (1474), Expect = 5.9e-160
Identity = 286/547 (52.29%), Postives = 381/547 (69.65%), Query Frame = 1

Query: 1   MACLSSIARRLCRIHPLPFHQLLQLIHLQIPDSP----------FQAFHQTLRLHSQSAR 60
           M+ L SI  RL R HP  F+ LL+         P          + AFH+T  +   ++ 
Sbjct: 1   MSSLISIGIRLRRSHPKLFYPLLRSTASPFSFYPLKLNPFVSLFYAAFHRTASVPFLNSL 60

Query: 61  QFSALRSVG-----HPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEF 120
           +FS  +S       +PF     R  NH P    +   +E LKR A  P E EA+A +D  
Sbjct: 61  RFSGSQSFSSQNQKYPFELFEHRIYNHDP---LTQGLLETLKRAAYFPGEAEAMACIDGS 120

Query: 121 DVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCL 180
            V+A+ +LVYS IW LR DWK +FL FKWGEKWG IDE+ C LI+W+LGNH+KF+ AW +
Sbjct: 121 GVKANINLVYSVIWELRKDWKLAFLGFKWGEKWGCIDEKSCELIVWILGNHRKFNNAWIV 180

Query: 181 IRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKY 240
           IR++H   M+ +Q ML+MIDRYA A+   KAI+ F++MEKF++ PD+EAF++L+N+LC +
Sbjct: 181 IRDMHQLSMNIQQTMLIMIDRYAAADNPGKAIEVFNIMEKFKMAPDEEAFYSLMNALCNH 240

Query: 241 GNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSY 300
           G IEEAEEFM VNKKLFPLETE FN+ILNGWC++ VN+ EAKR+WRE+SKCCI P++TSY
Sbjct: 241 GYIEEAEEFMVVNKKLFPLETEGFNVILNGWCSICVNLLEAKRVWREMSKCCITPNATSY 300

Query: 301 THMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKE 360
           THMISCFSK GNLFDSLRLYDEMKKR W+P +EV+NSL YVLTRENCF EAL+ L+K+KE
Sbjct: 301 THMISCFSKVGNLFDSLRLYDEMKKRGWLPGMEVYNSLIYVLTRENCFKEALRFLDKMKE 360

Query: 361 IGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSF--IQGADSEMSF 420
           +GLQPDSTTYNS+I PLCE  +L EA+ VL  M E+NI PT+ETYH+   ++ +D E + 
Sbjct: 361 VGLQPDSTTYNSMIRPLCEGKKLVEARSVLATMIEENISPTMETYHALLEVENSDFEATL 420

Query: 421 ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGL 480
           E+L +M   GL PT+ TF+++  K F+LEQ + ALK W+EMK+YEV PN  HY +L++GL
Sbjct: 421 EVLNRMTVAGLAPTDDTFLLVLAKFFKLEQAENALKMWIEMKQYEVTPNLTHYKILVEGL 480

Query: 481 ATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKG 531
              G L +ARE Y +M S+GF   PK++ +LKEP  G   + +      K+    +H++ 
Sbjct: 481 VRCGLLAKARECYADMRSNGFTDDPKLQKMLKEPVRGQNSKEKLCKGQVKRDGHVNHKRR 540

BLAST of Lsi02G026640 vs. TAIR10
Match: AT1G80880.1 (AT1G80880.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 499.2 bits (1284), Expect = 3.2e-141
Identity = 226/425 (53.18%), Postives = 321/425 (75.53%), Query Frame = 1

Query: 83  VELLKRVARLPSEVEAVAALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSID 142
           ++L+++V+ L SE +A+A+L++     + D  YS IW LRD+W+ +FLAFKWGEK G  D
Sbjct: 95  IDLIRQVSELESEADAMASLEDSSFDLNHDSFYSLIWELRDEWRLAFLAFKWGEKRGCDD 154

Query: 143 EEICNLIIWVLGNHKKFSTAWCLIRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHM 202
           ++ C+L+IWVLGNH+KF+ AWCLIR++     D+R+AM +M+DRYA AN+ S+AI+TF +
Sbjct: 155 QKSCDLMIWVLGNHQKFNIAWCLIRDMFNVSKDTRKAMFLMMDRYAAANDTSQAIRTFDI 214

Query: 203 MEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVN 262
           M+KF+ TP  EAF  LL +LC++G+IE+AEEFM  +KKLFP++ E FN+ILNGWCN+  +
Sbjct: 215 MDKFKHTPYDEAFQGLLCALCRHGHIEKAEEFMLASKKLFPVDVEGFNVILNGWCNIWTD 274

Query: 263 VFEAKRIWREISKCCILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNS 322
           V EAKRIWRE+   CI P+  SY+HMISCFSK GNLFDSLRLYDEMKKR   P +EV+NS
Sbjct: 275 VTEAKRIWREMGNYCITPNKDSYSHMISCFSKVGNLFDSLRLYDEMKKRGLAPGIEVYNS 334

Query: 323 LAYVLTRENCFSEALKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDN 382
           L YVLTRE+CF EA+K+++K+ E GL+PDS TYNS+I PLCE G+LD A++VL  M  +N
Sbjct: 335 LVYVLTREDCFDEAMKLMKKLNEEGLKPDSVTYNSMIRPLCEAGKLDVARNVLATMISEN 394

Query: 383 IGPTIETYHSFIQGADSEMSFELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWV 442
           + PT++T+H+F++  + E + E+L QM+   LGPTE TF+++  K F+ +QP+ ALK W 
Sbjct: 395 LSPTVDTFHAFLEAVNFEKTLEVLGQMKISDLGPTEETFLLILGKLFKGKQPENALKIWA 454

Query: 443 EMKRYEVVPNSEHYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSI 502
           EM R+E+V N   Y   IQGL + G L++ARE+Y EM S GF+ +P ++ LL+E  +  +
Sbjct: 455 EMDRFEIVANPALYLATIQGLLSCGWLEKAREIYSEMKSKGFVGNPMLQKLLEEQKVKGV 514

Query: 503 EEARQ 508
            ++++
Sbjct: 515 RKSKR 519

BLAST of Lsi02G026640 vs. TAIR10
Match: AT5G15010.1 (AT5G15010.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 272.7 bits (696), Expect = 4.9e-73
Identity = 159/441 (36.05%), Postives = 258/441 (58.50%), Query Frame = 1

Query: 102 LDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEI--CNLIIWVLGNHKKF 161
           L+E DV+   +LV   +  +R+DW+++F  F W  K       +   + +I +LG  +KF
Sbjct: 118 LEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKF 177

Query: 162 STAWCLIRELHG---SLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFH 221
            TAW LI E+     SL++S Q +L+MI +Y   ++V KAI TFH  ++F+L    + F 
Sbjct: 178 DTAWTLIDEMRKFSPSLVNS-QTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQ 237

Query: 222 ALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKC 281
           +LL++LC+Y N+ +A   +F NK  +P + +SFNI+LNGWCNV  +  EA+R+W E+   
Sbjct: 238 SLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNV 297

Query: 282 CILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEA 341
            +  D  SY+ MISC+SK G+L   L+L+D MKK    P  +V+N++ + L + +  SEA
Sbjct: 298 GVKHDVVSYSSMISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEA 357

Query: 342 LKILEKI-KEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQ 401
             +++ + +E G++P+  TYNSLI PLC+  + +EAK V   M E  + PTI TYH+F++
Sbjct: 358 RNLMKTMEEEKGIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMR 417

Query: 402 -GADSEMSFELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSE 461
                E  FELL +MR+ G  PT  T++++  K       D  L  W EMK   V P+  
Sbjct: 418 ILRTGEEVFELLAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLS 477

Query: 462 HYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKK 521
            Y V+I GL   G++++A   Y EM   G   +  ++ +++    G  + A Q++  + K
Sbjct: 478 SYIVMIHGLFLNGKIEEAYGYYKEMKDKGMRPNENVEDMIQSWFSGK-QYAEQRIT-DSK 537

Query: 522 GKFFSHRKGSMMKWKSYKQQS 536
           G+     KG+++K KS ++++
Sbjct: 538 GEV---NKGAIVK-KSEREKN 551

BLAST of Lsi02G026640 vs. TAIR10
Match: AT3G49730.1 (AT3G49730.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 179.5 bits (454), Expect = 5.6e-45
Identity = 104/398 (26.13%), Postives = 198/398 (49.75%), Query Frame = 1

Query: 101 ALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSI--DEEICNLIIWVLGNHKK 160
           AL+E  +   P L+   +    D     +  F W  K        E+C  ++ +L   ++
Sbjct: 87  ALNESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQ 146

Query: 161 FSTAWCLIRELHGSLMD--SRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFH 220
           F   W LI E+  +  +    +  +V++ R+A AN V KA++    M K+ L PD+  F 
Sbjct: 147 FGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFG 206

Query: 221 ALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKC 280
            LL++LCK G+++EA +     ++ FP     F  +L GWC     + EAK +  ++ + 
Sbjct: 207 CLLDALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREG-KLMEAKEVLVQMKEA 266

Query: 281 CILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTR-ENCFSE 340
            + PD   +T+++S ++  G + D+  L ++M+KR + P++  +  L   L R E    E
Sbjct: 267 GLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDE 326

Query: 341 ALKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQ 400
           A+++  +++  G + D  TY +LIS  C+ G +D+   VL  M +  + P+  TY   + 
Sbjct: 327 AMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMV 386

Query: 401 GADSEMSF----ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVP 460
             + +  F    EL+++M++ G  P    + ++   + +L +   A++ W EM+   + P
Sbjct: 387 AHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSP 446

Query: 461 NSEHYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPK 490
             + + ++I G  + G L +A   + EM S G  + P+
Sbjct: 447 GVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQ 483

BLAST of Lsi02G026640 vs. TAIR10
Match: AT3G15200.1 (AT3G15200.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 178.3 bits (451), Expect = 1.2e-44
Identity = 125/450 (27.78%), Postives = 224/450 (49.78%), Query Frame = 1

Query: 55  LRSVGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQADPDLV 114
           L S+G P  F   RF + +  D +SA  V  + +  R  S  +    LD+  +    +LV
Sbjct: 56  LHSLGAPDKFPN-RFNDDK--DKQSALDVHNIIKHHRGSSPEKIKRILDKCGIDLTEELV 115

Query: 115 YSAIWVLRDDWKSSFLAFKWGEKWGS--IDEEICNLIIWVLGNHKKFSTAWCLIRELHGS 174
              +   R DWK +++  +   K         + N I+ VLG  ++F     +  E+  S
Sbjct: 116 LEVVNRNRSDWKPAYILSQLVVKQSVHLSSSMLYNEILDVLGKMRRFEEFHQVFDEM--S 175

Query: 175 LMD---SRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIE 234
             D   + +   V+++RYA A++V +A+  F   ++F +  D  AFH LL  LC+Y ++E
Sbjct: 176 KRDGFVNEKTYEVLLNRYAAAHKVDEAVGVFERRKEFGIDDDLVAFHGLLMWLCRYKHVE 235

Query: 235 EAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHMI 294
            AE      ++ F  + ++ N+ILNGWC V  NV EAKR W++I      PD  SY  MI
Sbjct: 236 FAETLFCSRRREFGCDIKAMNMILNGWC-VLGNVHEAKRFWKDIIASKCRPDVVSYGTMI 295

Query: 295 SCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGLQ 354
           +  +K G L  ++ LY  M      P +++ N++   L  +    EAL++  +I E G  
Sbjct: 296 NALTKKGKLGKAMELYRAMWDTRRNPDVKICNNVIDALCFKKRIPEALEVFREISEKGPD 355

Query: 355 PDSTTYNSLISPLCEMGQLDEAKDVLTMM--TEDNIGPTIETYHSFIQGADSEMSFEL-L 414
           P+  TYNSL+  LC++ + ++  +++  M     +  P   T+   ++ +      ++ L
Sbjct: 356 PNVVTYNSLLKHLCKIRRTEKVWELVEEMELKGGSCSPNDVTFSYLLKYSQRSKDVDIVL 415

Query: 415 KQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATY 474
           ++M ++    T   + +MF    + ++ +   + W EM+R  + P+   Y++ I GL T 
Sbjct: 416 ERMAKNKCEMTSDLYNLMFRLYVQWDKEEKVREIWSEMERSGLGPDQRTYTIRIHGLHTK 475

Query: 475 GRLKQARELYDEMTSHGFIAHPKIKLLLKE 497
           G++ +A   + EM S G +  P+ ++LL +
Sbjct: 476 GKIGEALSYFQEMMSKGMVPEPRTEMLLNQ 499

BLAST of Lsi02G026640 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 177.2 bits (448), Expect = 2.8e-44
Identity = 107/405 (26.42%), Postives = 203/405 (50.12%), Query Frame = 1

Query: 101 ALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDE--EICNLIIWVLGNHKK 160
           AL+E  V+  P L+   +    D     +  F W  K        E+   ++ +L   ++
Sbjct: 103 ALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQ 162

Query: 161 FSTAWCLIRELH--GSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFH 220
           F   W LI E+      +   +  +V++ R+A A+ V KAI+    M KF   PD+  F 
Sbjct: 163 FGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEPDEYVFG 222

Query: 221 ALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKC 280
            LL++LCK+G++++A +     +  FP+    F  +L GWC V   + EAK +  ++++ 
Sbjct: 223 CLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVG-KMMEAKYVLVQMNEA 282

Query: 281 CILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEA 340
              PD   YT+++S ++  G + D+  L  +M++R + P+   +  L   L + +   EA
Sbjct: 283 GFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEA 342

Query: 341 LKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQG 400
           +K+  +++    + D  TY +L+S  C+ G++D+   VL  M +  + P+  TY   +  
Sbjct: 343 MKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVA 402

Query: 401 ADSEMSF----ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPN 460
            + + SF    EL+++MRQ    P    + ++   + +L +   A++ W EM+   + P 
Sbjct: 403 HEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPG 462

Query: 461 SEHYSVLIQGLATYGRLKQARELYDEMTSHGFIA---HPKIKLLL 495
            + + ++I GLA+ G L +A + + EM + G  +   +  +KLLL
Sbjct: 463 VDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLL 506

BLAST of Lsi02G026640 vs. NCBI nr
Match: gi|659122515|ref|XP_008461183.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis melo])

HSP 1 Score: 949.5 bits (2453), Expect = 2.5e-273
Identity = 469/546 (85.90%), Postives = 499/546 (91.39%), Query Frame = 1

Query: 1   MACLSSIARRLCRIHPLPFHQLLQLIHLQIPDSPFQAFHQTLRLHSQSARQFSALRS--- 60
           MACLSSIARRLCRIHPLPFH LL L  L I DSPFQAF QTL L S  A QFSAL S   
Sbjct: 1   MACLSSIARRLCRIHPLPFHHLLYLNRLSIRDSPFQAFRQTLCLRSLFAHQFSALPSFSQ 60

Query: 61  -VGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQADPDLVYS 120
            VG  F FDTGRF+N+R +DA +A+F+EL KRVA LPSEVEAVAALDEFDVQAD DLVYS
Sbjct: 61  KVGDQFQFDTGRFKNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVQADSDLVYS 120

Query: 121 AIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCLIRELHGSLMDS 180
           AIWVLRDDWKSSFLAFKWGEKWG+IDEEICNL+IWVLGNHKKFSTAWCLIRELHGSL++S
Sbjct: 121 AIWVLRDDWKSSFLAFKWGEKWGAIDEEICNLMIWVLGNHKKFSTAWCLIRELHGSLLNS 180

Query: 181 RQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240
           RQAMLVMIDRYAYANE SKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF
Sbjct: 181 RQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHMISCFSKTG 300
           VNKKLFPLETESFNIILNGWCNV+V+VFEAKRIWRE+SKCCILPDSTSYTHMISCFSK G
Sbjct: 241 VNKKLFPLETESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 NLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGLQPDSTTYN 360
           NLFDSLR YD+MKKRDW+PSLEV+NSLAY L RENCF+EALKILEKIKE+GL+PDSTTYN
Sbjct: 301 NLFDSLRFYDQMKKRDWIPSLEVYNSLAYALMRENCFNEALKILEKIKEVGLRPDSTTYN 360

Query: 361 SLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFELLKQMRQDGLGP 420
           SLISPLCE G+LDEAKDVLTMMTEDNI PTIETYHSFIQ ADS+MSFELLK+MRQDGLGP
Sbjct: 361 SLISPLCEAGKLDEAKDVLTMMTEDNIIPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATYGRLKQARELY 480
            E TF+IMFNKSFELEQP+YAL AWVEMKRY+V P+SEHYSVLIQGLAT G LK+ARELY
Sbjct: 421 IEVTFLIMFNKSFELEQPEYALNAWVEMKRYKVFPSSEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKGSMMKWKSYKQQST 540
           DEM  HGFIAHPKIK LLKEPD GSI+EARQQVRHNKKGKF SHRKGS M+WKS+KQQS 
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDSGSIDEARQQVRHNKKGKFLSHRKGSTMRWKSHKQQSK 540

Query: 541 EDASFE 543
            DASFE
Sbjct: 541 RDASFE 546

BLAST of Lsi02G026640 vs. NCBI nr
Match: gi|778728914|ref|XP_011659500.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Cucumis sativus])

HSP 1 Score: 932.9 bits (2410), Expect = 2.5e-268
Identity = 461/546 (84.43%), Postives = 495/546 (90.66%), Query Frame = 1

Query: 1   MACLSSIARRLCRIHPLPFHQLLQLIHLQIPDSPFQAFHQTLRLHSQSARQFSALRS--- 60
           MA LSSIARRLCRIHPLPFH LL L  L+IPDSPFQAFHQT  LHS  ARQFSAL S   
Sbjct: 1   MASLSSIARRLCRIHPLPFHHLLYLNRLRIPDSPFQAFHQTFSLHSFFARQFSALPSFSQ 60

Query: 61  -VGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEFDVQADPDLVYS 120
            +G PF FDTGRFQN+R +DA +A+F+EL KRVA LPSEVEAVAALDEFDV+AD DLVYS
Sbjct: 61  KLGDPFLFDTGRFQNYRQSDACNARFIELFKRVALLPSEVEAVAALDEFDVKADLDLVYS 120

Query: 121 AIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCLIRELHGSLMDS 180
           AIWVLRDDWKSS LAFKWGEK G+IDEEICNL+IWVLGNHKKFSTAW LIRELHGSL++S
Sbjct: 121 AIWVLRDDWKSSLLAFKWGEKVGAIDEEICNLMIWVLGNHKKFSTAWSLIRELHGSLLNS 180

Query: 181 RQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKYGNIEEAEEFMF 240
            QAMLVMIDRYAYANE SKAIKTFHMMEKFRLTPDQEAFH LLNSLCKYGNIEEAEEFMF
Sbjct: 181 MQAMLVMIDRYAYANEASKAIKTFHMMEKFRLTPDQEAFHVLLNSLCKYGNIEEAEEFMF 240

Query: 241 VNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSYTHMISCFSKTG 300
           VNKKLFPL TESFNIILNGWCNV+V+VFEAKRIWRE+SKCCILPDSTSYTHMISCFSK G
Sbjct: 241 VNKKLFPLGTESFNIILNGWCNVTVDVFEAKRIWREMSKCCILPDSTSYTHMISCFSKNG 300

Query: 301 NLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKEIGLQPDSTTYN 360
           NLFDSLR YD+MKKRDW+PS+EV+NSLAYVLTRENCF+EALKILEKIKE+GL+PDSTTYN
Sbjct: 301 NLFDSLRFYDQMKKRDWIPSVEVYNSLAYVLTRENCFNEALKILEKIKEVGLRPDSTTYN 360

Query: 361 SLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFELLKQMRQDGLGP 420
           SLISPLCE G+LDEAKDVLTMMTEDNI PTIETYHSFIQ ADS+MSFELLK+MRQDGLGP
Sbjct: 361 SLISPLCEAGKLDEAKDVLTMMTEDNISPTIETYHSFIQAADSKMSFELLKRMRQDGLGP 420

Query: 421 TEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLATYGRLKQARELY 480
           TE TF+IMFNKSFELE+P+YAL  WVEMKRYEV P+ EHYSVLIQGLAT G LK+ARELY
Sbjct: 421 TEGTFLIMFNKSFELEEPEYALNVWVEMKRYEVFPSCEHYSVLIQGLATCGHLKKARELY 480

Query: 481 DEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKGSMMKWKSYKQQST 540
           DEM  HGFIAHPKIK LLKEPDLGSI+EARQQVRHN KGKF  HRKG  M+WKS+KQ+S 
Sbjct: 481 DEMILHGFIAHPKIKTLLKEPDLGSIDEARQQVRHNNKGKFIPHRKGRTMRWKSHKQRSK 540

Query: 541 EDASFE 543
             ASFE
Sbjct: 541 GAASFE 546

BLAST of Lsi02G026640 vs. NCBI nr
Match: gi|694326575|ref|XP_009354198.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Pyrus x bretschneideri])

HSP 1 Score: 623.6 bits (1607), Expect = 3.2e-175
Identity = 310/550 (56.36%), Postives = 400/550 (72.73%), Query Frame = 1

Query: 4   LSSIARRLCRIHPLPF--HQLLQLIHLQIPDSPF-------------QAFHQTLRLHSQS 63
           L +IARRL   H   F  H LL      I DSP              Q FH+TL +  Q+
Sbjct: 3   LPAIARRLRSKHSQLFLSHTLLP----SISDSPSPPRSSQLLLRDLCQTFHRTLLVLPQT 62

Query: 64  ARQFSALR-----SVGHPFHFDTGRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALD 123
            R F  L+     ++  P  F+  RF  + P D+   QF+ELLKR A   SE E +A LD
Sbjct: 63  PRHFCTLQPFSAQTLNDPLGFNGQRFNKNHPRDSGFTQFLELLKRAADFASEAETMAFLD 122

Query: 124 EFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAW 183
           E  ++ D + V  AIW LR DWKS+FLAF+WGEKWG  +EE C+L++W+LG+HKKFSTAW
Sbjct: 123 ESGIKVDREAVLLAIWELRQDWKSAFLAFQWGEKWGCCNEEACSLMVWILGSHKKFSTAW 182

Query: 184 CLIRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLC 243
           CLIR+LH +LMD+R+AML+MIDRYA  N+  KAIKTFH+M+KFRLTPDQEAFH LLN+LC
Sbjct: 183 CLIRDLHRALMDTRRAMLIMIDRYASVNDPCKAIKTFHVMDKFRLTPDQEAFHILLNALC 242

Query: 244 KYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDST 303
           KYGNIEEAEEFM VNKKLFPLETESFNIILNGWCN+SV+VFEAKR+WRE+SKCC+ PD+T
Sbjct: 243 KYGNIEEAEEFMLVNKKLFPLETESFNIILNGWCNISVDVFEAKRVWREMSKCCVTPDAT 302

Query: 304 SYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKI 363
           SYTH+ISCFSK G LFDSLRLYDEMKKR W+P + V+NSL YVLT ENCF EALKIL+K+
Sbjct: 303 SYTHLISCFSKVGKLFDSLRLYDEMKKRGWIPGISVYNSLIYVLTCENCFKEALKILDKL 362

Query: 364 KEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSF 423
           KE GLQ D+TTYNS+I PLCE  +L+EA+ +L+ M  DN+ PT ETY++F+Q    E + 
Sbjct: 363 KEEGLQADATTYNSMICPLCESEKLEEARQMLSAMIADNLSPTTETYNAFLQSTGLEGTL 422

Query: 424 ELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGL 483
           E+L +M++  LGP+ +TF+++  K F LEQP+ ALK W EMK+Y VVP+S HY+V++QGL
Sbjct: 423 EILNRMKKANLGPSSSTFLMILGKFFRLEQPEMALKMWTEMKQYGVVPDSAHYTVMVQGL 482

Query: 484 ATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKG 534
           A    L +A+E++ EM + GF+  PK+K LL+EP  GS  + +++ +  K+    + ++G
Sbjct: 483 AACRLLIKAKEIFSEMKTDGFVEDPKLKRLLEEP--GSRVKGKRRPKPVKQATKVNQKQG 542

BLAST of Lsi02G026640 vs. NCBI nr
Match: gi|658014970|ref|XP_008342813.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Malus domestica])

HSP 1 Score: 623.2 bits (1606), Expect = 4.2e-175
Identity = 315/557 (56.55%), Postives = 405/557 (72.71%), Query Frame = 1

Query: 4   LSSIARRLCRIHPLPF--HQLLQLIHLQIPDSPF----------QAFHQTLRLHSQSARQ 63
           L +IARRL   H   F  H LL  I    P SP           Q FH+TL +  Q+ R 
Sbjct: 3   LPAIARRLRSKHSQLFLSHTLLPSISYS-PSSPCSSQPLLRDLCQTFHRTLIVLPQTPRH 62

Query: 64  FSALRSVGHPFHFDT------GRFQNHRPNDARSAQFVELLKRVARLPSEVEAVAALDEF 123
           FS L+    PF   T       RF    P D+R  QF+ELLKR A   SE EAVA LDE 
Sbjct: 63  FSTLQ----PFSAQTLNDPLGQRFNTDHPRDSRFTQFLELLKRAADFASEAEAVAFLDES 122

Query: 124 DVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEICNLIIWVLGNHKKFSTAWCL 183
            ++ D + V  AIW LR+DWKS+FLAF+WGEKWG  +EE C+L++W+LG+HKKFSTAWCL
Sbjct: 123 GIEVDRETVLLAIWELREDWKSAFLAFQWGEKWGCCNEEACSLMVWILGSHKKFSTAWCL 182

Query: 184 IRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEKFRLTPDQEAFHALLNSLCKY 243
           IR+LH +LMD+R+AML+MIDRYA AN+  KAIKTFH+M+KFRLTPDQEAFH LLN+LCKY
Sbjct: 183 IRDLHRALMDTRRAMLIMIDRYASANDPCKAIKTFHVMDKFRLTPDQEAFHILLNALCKY 242

Query: 244 GNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFEAKRIWREISKCCILPDSTSY 303
           GNIEEAEEFM VNKKLFPLETESFNIILNGWCN+SV+VFEAKR+WRE+SKCC+ PD+TSY
Sbjct: 243 GNIEEAEEFMLVNKKLFPLETESFNIILNGWCNISVDVFEAKRVWREMSKCCVTPDATSY 302

Query: 304 THMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAYVLTRENCFSEALKILEKIKE 363
           TH+ISCFSK G LFDSLRLYDEMKKR W+P + V+NSL YVLT ENCF EALKIL+K+KE
Sbjct: 303 THLISCFSKVGKLFDSLRLYDEMKKRGWIPGISVYNSLIYVLTCENCFKEALKILDKLKE 362

Query: 364 IGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGPTIETYHSFIQGADSEMSFEL 423
            GLQ D+TTYNS+I PLCE  +L+EA+ +L+ M  +N+ PT ETY++F+Q    E + E+
Sbjct: 363 EGLQADATTYNSMICPLCESEKLEEARQMLSAMIANNLSPTTETYNAFLQSTGLEGTLEI 422

Query: 424 LKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMKRYEVVPNSEHYSVLIQGLAT 483
           L +M++  LGP+ +TF+++  K F LEQP+ ALK W EMK+Y VVP+S HY+V++QGLA 
Sbjct: 423 LNRMKKASLGPSSSTFLMILGKFFRLEQPEMALKIWTEMKQYGVVPDSAHYTVMVQGLAA 482

Query: 484 YGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEARQQVRHNKKGKFFSHRKGSM 543
              L +A+E++ EM ++GF+  PK+K LL+ P  GS  + +++    K+    + ++GS 
Sbjct: 483 CRLLIKAKEIFSEMKTNGFVEDPKLKRLLEVP--GSRVKGKRRRIPVKQATKVNQKQGST 542

BLAST of Lsi02G026640 vs. NCBI nr
Match: gi|1009112398|ref|XP_015868080.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial [Ziziphus jujuba])

HSP 1 Score: 617.8 bits (1592), Expect = 1.7e-173
Identity = 303/508 (59.65%), Postives = 390/508 (76.77%), Query Frame = 1

Query: 36  QAFHQTLRL----------HSQSARQFSALRSVGHPFHFDTGRFQNHRPNDARSAQFVEL 95
           QAF QT+ +          H  S + FSA +++ +PF F    F+ +  +D    +FVE+
Sbjct: 44  QAFRQTILVRFRSSSLQTPHVSSLQLFSA-QNLNYPFDFHGLGFRIYDLHDPGVLEFVEI 103

Query: 96  LKRVARLPSEVEAVAALDEFDVQADPDLVYSAIWVLRDDWKSSFLAFKWGEKWGSIDEEI 155
           LKR A LP+E EA+ ++ E  ++A  DLV SAIW LR++WK +FLAFKWGEK G  DEE 
Sbjct: 104 LKRAADLPTETEAMLSIQESGIEATQDLVCSAIWGLREEWKLAFLAFKWGEKCGCSDEET 163

Query: 156 CNLIIWVLGNHKKFSTAWCLIRELHGSLMDSRQAMLVMIDRYAYANEVSKAIKTFHMMEK 215
           CNL+IWVLGNH+KF+TAWCLIR+LH S MD+R+AML+MIDRYA+A++ SKAI TF +MEK
Sbjct: 164 CNLLIWVLGNHRKFNTAWCLIRDLHRSSMDTRRAMLIMIDRYAFADDPSKAIWTFDIMEK 223

Query: 216 FRLTPDQEAFHALLNSLCKYGNIEEAEEFMFVNKKLFPLETESFNIILNGWCNVSVNVFE 275
           F L  DQEA+  LLN+LC +G IEEAEEFM VNKK FPLETESFNIILNGWCN+SV+VFE
Sbjct: 224 FSLAHDQEAYRFLLNALCAHGYIEEAEEFMLVNKKPFPLETESFNIILNGWCNMSVDVFE 283

Query: 276 AKRIWREISKCCILPDSTSYTHMISCFSKTGNLFDSLRLYDEMKKRDWVPSLEVFNSLAY 335
           AKR+WRE+SKCCILPD+TSYTHM+SCFSK GNLFDSLRLYDEMKKR WVP LEV+NSL Y
Sbjct: 284 AKRVWREMSKCCILPDATSYTHMVSCFSKVGNLFDSLRLYDEMKKRGWVPGLEVYNSLIY 343

Query: 336 VLTRENCFSEALKILEKIKEIGLQPDSTTYNSLISPLCEMGQLDEAKDVLTMMTEDNIGP 395
           VLT ENC  EALK+L+K+KE+GL+PDSTTYNS+I PLCE  +L+EA+ +LT M E+NI P
Sbjct: 344 VLTHENCLGEALKMLDKVKELGLRPDSTTYNSMIRPLCEAQKLEEARKILTTMIEENICP 403

Query: 396 TIETYHSFIQGADSEMSFELLKQMRQDGLGPTEATFVIMFNKSFELEQPDYALKAWVEMK 455
           T ETYH+F+     E + E++K+M++  LGP+  TF+I+  K F+LEQP+ A+K W EM+
Sbjct: 404 TSETYHAFLGTVGFEGTLEVVKRMKKANLGPSSETFLIILRKFFKLEQPENAVKMWEEMR 463

Query: 456 RYEVVPNSEHYSVLIQGLATYGRLKQARELYDEMTSHGFIAHPKIKLLLKEPDLGSIEEA 515
           +YEVVP+S HY+V+IQGLAT+G L +A+    EM S GF+  PK+K LLKEP    I++ 
Sbjct: 464 QYEVVPDSTHYTVMIQGLATFGLLNKAKTFLSEMRSDGFVEDPKLKKLLKEPKSNRIDKR 523

Query: 516 RQQVRHNKKGKFFSHRKGSMMKWKSYKQ 534
           ++Q+R  K  K  SH+KG+ MK +++ Q
Sbjct: 524 KRQLRQLKGEKRVSHKKGNTMKGENHHQ 550

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP137_ARATH5.7e-14053.18Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidop... [more]
PP383_ARATH8.6e-7236.05Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidop... [more]
PP275_ARATH9.9e-4426.13Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN... [more]
PP233_ARATH2.2e-4327.78Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis th... [more]
PP447_ARATH4.9e-4326.42Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A0A0K678_CUCSA1.7e-26884.43Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431960 PE=4 SV=1[more]
U5GND9_POPTR1.4e-16655.43Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s03510g PE=4 SV=1[more]
A0A067LCX9_JATCU9.0e-16156.58Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26977 PE=4 SV=1[more]
W9QV67_9ROSA9.0e-16157.00Uncharacterized protein OS=Morus notabilis GN=L484_018066 PE=4 SV=1[more]
B9RGM1_RICCO5.9e-16052.29Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
Match NameE-valueIdentityDescription
AT1G80880.13.2e-14153.18 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G15010.14.9e-7336.05 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G49730.15.6e-4526.13 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G15200.11.2e-4427.78 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65820.12.8e-4426.42 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659122515|ref|XP_008461183.1|2.5e-27385.90PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|778728914|ref|XP_011659500.1|2.5e-26884.43PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|694326575|ref|XP_009354198.1|3.2e-17556.36PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|658014970|ref|XP_008342813.1|4.2e-17556.55PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
gi|1009112398|ref|XP_015868080.1|1.7e-17359.65PREDICTED: pentatricopeptide repeat-containing protein At1g80880, mitochondrial ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi02G026640.1Lsi02G026640.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 215..234
score: 0.088coord: 182..206
score: 0.4coord: 284..311
score: 3.6E-6coord: 455..484
score: 2.
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 347..378
score: 5.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 284..316
score: 6.2E-8coord: 456..484
score: 1.5E-4coord: 182..211
score: 0.0029coord: 354..386
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 245..280
score: 7.837coord: 211..241
score: 7.158coord: 176..210
score: 7.563coord: 281..315
score: 11.663coord: 316..350
score: 9.251coord: 452..486
score: 10.786coord: 351..385
score: 12.156coord: 417..451
score: 6.993coord: 142..172
score: 5
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 284..378
score: 2.6E-6coord: 211..228
score: 2.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 122..528
score: 2.5E-224coord: 16..88
score: 2.5E
NoneNo IPR availablePANTHERPTHR24015:SF602SUBFAMILY NOT NAMEDcoord: 122..528
score: 2.5E-224coord: 16..88
score: 2.5E

The following gene(s) are paralogous to this gene:

None