Lsi05G001010 (gene) Bottle gourd (USVL1VR-Ls)

NameLsi05G001010
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls))
DescriptionPentatricopeptide repeat-containing protein
Locationchr05 : 1764796 .. 1766661 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTTATCGAAGCCGGCCTTCTACACGCACCTCAAAACCCTAACCGGGTCCCACCATTTACTCCGGCGCCAAGCTCCGGCACTTCCCATCGTCACCCTCCGATTTCTCTCTTTTGCATCGCCGGAGGAAGCTGCCGCCGAACGACGCCGTCGAAAGCGCCGCCTCCGTATCGAACCCCCGCTCTCATCTTCCTCTGCCGCTCGCCCACAATCGCAGCCTCCTAGACCTCAAACCCCACAAAACCCTAATGCCCCCAAACTCCCTGAGCATATCTCTGCCCTCTCTGGTAATCGTCTTAACCTCCACAACCGCATTCTCACTCTCATTCGTGAAAATGATCTTGAAGAAGCCGCGCTTTTCACCCGCCATTCTATTTACTCCAATTGTCGTCCCACCATCTTCACCGTCAATGCTGTTCTCAATGCACAGCTTCGCCAATCGAAATATGCCGATTTGCTTTCACTTCACCGGTTTATTACACAGGCTGGTGTCGCCCCCAATATAATCACTCACAATTTGATTTTTCAGACGTATTTGGATTGTCGTAAGCCTGATACGGCAATGGAACATTACAAGCAGTTGATCAATGATGCGCCTTTCAACCCCTCGCCGACGACTTACAGGATCTTGTTTAAAGGGTTGGTAGATAATAACAAGTTGGAGAGGGCAATGGAGCTTAAAGAGGAAATGACTGTGAAGGGTTTTGTTCCTGACCCTCTTATTTATCATTATTTGATGGTGGGATGTGTGAGAAGATCGGATCCTGATGGAGTTTTTAAGCTTTTTGAAGAGTTGAAAGAGAAGTTAGGAGGGACCGTGGAAGATGGAGTTGTTTATGGGAGCTTGATGAAAGGGTATTTTATGAAAGAAATGGAAGAGGAAGCAATGAGGTGTTTTGAGGAGACTGTGGGTGAACATTCCGTGGTGAAGATGAGCGCCATTGCATACAATTCTGTGCTTGATGCATTATGCAGGAATGGGAAGTTTGGTGAGGCCTTGATGTTGTTTGATAGGATGACAAAGGAGCATAGTCCACCCAGGCGTGTGGCAGTGAACTTGGGAAGCTTTAATGTCATGGTTGATGGGTACTGCATTGAAGGGAGGTTCAAAGATGCCATTGAAGTATTCGAGAAGATGGGTGATTATAGGTGTAGCCCAGATACATTATCATTCAATAATTTGATCGAACAATTATGTAATAATGGAATGTTGGCTGAAGCTGAGAAGCTTTATGGAACAATGGGCGATAAGGGAGTGAACCCAGACGAGTTTACTTATGGTTTGTTGATGGATTCTTGCTTTAAAGAGAACAGGGCAGATGATGCAGCTGGATATTTTAGAAAAATGGTACAGTCTGGGCTCAGACCCAATATAGCAGTTTATAATAGATTAGTGGATGAGTTGGTTAAATTAGGGAAGATTAACGATGCAAAATCTTTCTTCGACTTGATGGTGAAGAAGCTCAAAATGGATGCCTCAGGCTATCAGTTTATAATGAAGGCATTGAGTGATTCGGGGAAACTGGATGAAATATTAAATGTGGTTGATACCCTTCTGGATGATGATGGGATTGAATTTTCTGAAGAGCTGCAGGAGTTTGTAAGAGGTGAGCTAAGGAAGGAAGACAGGGAAGAAGATCTAGCTAAACTTGTGGAAGAGAAAGAAAGACTGAAAGCTGAAGCTAAGGCAAAGGAGGCTGAGGCAGCAGAGGCACAGAAGAGAAGTGCTAAAGCTGCTGTCTCTTCTTTACTGTCGTCCAAGTTGTTCGGGAACAAGGAAGGTGAGAAAGAATCTGTAGTGAACGAAACTCAATCTGGTGAAGAAGAAAGTGGAAAAACTGAACTTGCAGAATCTAGTCCTTGA

mRNA sequence

ATGGCGTTATCGAAGCCGGCCTTCTACACGCACCTCAAAACCCTAACCGGGTCCCACCATTTACTCCGGCGCCAAGCTCCGGCACTTCCCATCGTCACCCTCCGATTTCTCTCTTTTGCATCGCCGGAGGAAGCTGCCGCCGAACGACGCCGTCGAAAGCGCCGCCTCCGTATCGAACCCCCGCTCTCATCTTCCTCTGCCGCTCGCCCACAATCGCAGCCTCCTAGACCTCAAACCCCACAAAACCCTAATGCCCCCAAACTCCCTGAGCATATCTCTGCCCTCTCTGGTAATCGTCTTAACCTCCACAACCGCATTCTCACTCTCATTCGTGAAAATGATCTTGAAGAAGCCGCGCTTTTCACCCGCCATTCTATTTACTCCAATTGTCGTCCCACCATCTTCACCGTCAATGCTGTTCTCAATGCACAGCTTCGCCAATCGAAATATGCCGATTTGCTTTCACTTCACCGGTTTATTACACAGGCTGGTGTCGCCCCCAATATAATCACTCACAATTTGATTTTTCAGACGTATTTGGATTGTCGTAAGCCTGATACGGCAATGGAACATTACAAGCAGTTGATCAATGATGCGCCTTTCAACCCCTCGCCGACGACTTACAGGATCTTGTTTAAAGGGTTGGTAGATAATAACAAGTTGGAGAGGGCAATGGAGCTTAAAGAGGAAATGACTGTGAAGGGTTTTGTTCCTGACCCTCTTATTTATCATTATTTGATGGTGGGATGTGTGAGAAGATCGGATCCTGATGGAGTTTTTAAGCTTTTTGAAGAGTTGAAAGAGAAGTTAGGAGGGACCGTGGAAGATGGAGTTGTTTATGGGAGCTTGATGAAAGGGTATTTTATGAAAGAAATGGAAGAGGAAGCAATGAGGTGTTTTGAGGAGACTGTGGGTGAACATTCCGTGGTGAAGATGAGCGCCATTGCATACAATTCTGTGCTTGATGCATTATGCAGGAATGGGAAGTTTGGTGAGGCCTTGATGTTGTTTGATAGGATGACAAAGGAGCATAGTCCACCCAGGCGTGTGGCAGTGAACTTGGGAAGCTTTAATGTCATGGTTGATGGGTACTGCATTGAAGGGAGGTTCAAAGATGCCATTGAAGTATTCGAGAAGATGGGTGATTATAGGTGTAGCCCAGATACATTATCATTCAATAATTTGATCGAACAATTATGTAATAATGGAATGTTGGCTGAAGCTGAGAAGCTTTATGGAACAATGGGCGATAAGGGAGTGAACCCAGACGAGTTTACTTATGGTTTGTTGATGGATTCTTGCTTTAAAGAGAACAGGGCAGATGATGCAGCTGGATATTTTAGAAAAATGGTACAGTCTGGGCTCAGACCCAATATAGCAGTTTATAATAGATTAGTGGATGAGTTGGTTAAATTAGGGAAGATTAACGATGCAAAATCTTTCTTCGACTTGATGGTGAAGAAGCTCAAAATGGATGCCTCAGGCTATCAGTTTATAATGAAGGCATTGAGTGATTCGGGGAAACTGGATGAAATATTAAATGTGGTTGATACCCTTCTGGATGATGATGGGATTGAATTTTCTGAAGAGCTGCAGGAGTTTGTAAGAGGTGAGCTAAGGAAGGAAGACAGGGAAGAAGATCTAGCTAAACTTGTGGAAGAGAAAGAAAGACTGAAAGCTGAAGCTAAGGCAAAGGAGGCTGAGGCAGCAGAGGCACAGAAGAGAAGTGCTAAAGCTGCTGTCTCTTCTTTACTGTCGTCCAAGTTGTTCGGGAACAAGGAAGGTGAGAAAGAATCTGTAGTGAACGAAACTCAATCTGGTGAAGAAGAAAGTGGAAAAACTGAACTTGCAGAATCTAGTCCTTGA

Coding sequence (CDS)

ATGGCGTTATCGAAGCCGGCCTTCTACACGCACCTCAAAACCCTAACCGGGTCCCACCATTTACTCCGGCGCCAAGCTCCGGCACTTCCCATCGTCACCCTCCGATTTCTCTCTTTTGCATCGCCGGAGGAAGCTGCCGCCGAACGACGCCGTCGAAAGCGCCGCCTCCGTATCGAACCCCCGCTCTCATCTTCCTCTGCCGCTCGCCCACAATCGCAGCCTCCTAGACCTCAAACCCCACAAAACCCTAATGCCCCCAAACTCCCTGAGCATATCTCTGCCCTCTCTGGTAATCGTCTTAACCTCCACAACCGCATTCTCACTCTCATTCGTGAAAATGATCTTGAAGAAGCCGCGCTTTTCACCCGCCATTCTATTTACTCCAATTGTCGTCCCACCATCTTCACCGTCAATGCTGTTCTCAATGCACAGCTTCGCCAATCGAAATATGCCGATTTGCTTTCACTTCACCGGTTTATTACACAGGCTGGTGTCGCCCCCAATATAATCACTCACAATTTGATTTTTCAGACGTATTTGGATTGTCGTAAGCCTGATACGGCAATGGAACATTACAAGCAGTTGATCAATGATGCGCCTTTCAACCCCTCGCCGACGACTTACAGGATCTTGTTTAAAGGGTTGGTAGATAATAACAAGTTGGAGAGGGCAATGGAGCTTAAAGAGGAAATGACTGTGAAGGGTTTTGTTCCTGACCCTCTTATTTATCATTATTTGATGGTGGGATGTGTGAGAAGATCGGATCCTGATGGAGTTTTTAAGCTTTTTGAAGAGTTGAAAGAGAAGTTAGGAGGGACCGTGGAAGATGGAGTTGTTTATGGGAGCTTGATGAAAGGGTATTTTATGAAAGAAATGGAAGAGGAAGCAATGAGGTGTTTTGAGGAGACTGTGGGTGAACATTCCGTGGTGAAGATGAGCGCCATTGCATACAATTCTGTGCTTGATGCATTATGCAGGAATGGGAAGTTTGGTGAGGCCTTGATGTTGTTTGATAGGATGACAAAGGAGCATAGTCCACCCAGGCGTGTGGCAGTGAACTTGGGAAGCTTTAATGTCATGGTTGATGGGTACTGCATTGAAGGGAGGTTCAAAGATGCCATTGAAGTATTCGAGAAGATGGGTGATTATAGGTGTAGCCCAGATACATTATCATTCAATAATTTGATCGAACAATTATGTAATAATGGAATGTTGGCTGAAGCTGAGAAGCTTTATGGAACAATGGGCGATAAGGGAGTGAACCCAGACGAGTTTACTTATGGTTTGTTGATGGATTCTTGCTTTAAAGAGAACAGGGCAGATGATGCAGCTGGATATTTTAGAAAAATGGTACAGTCTGGGCTCAGACCCAATATAGCAGTTTATAATAGATTAGTGGATGAGTTGGTTAAATTAGGGAAGATTAACGATGCAAAATCTTTCTTCGACTTGATGGTGAAGAAGCTCAAAATGGATGCCTCAGGCTATCAGTTTATAATGAAGGCATTGAGTGATTCGGGGAAACTGGATGAAATATTAAATGTGGTTGATACCCTTCTGGATGATGATGGGATTGAATTTTCTGAAGAGCTGCAGGAGTTTGTAAGAGGTGAGCTAAGGAAGGAAGACAGGGAAGAAGATCTAGCTAAACTTGTGGAAGAGAAAGAAAGACTGAAAGCTGAAGCTAAGGCAAAGGAGGCTGAGGCAGCAGAGGCACAGAAGAGAAGTGCTAAAGCTGCTGTCTCTTCTTTACTGTCGTCCAAGTTGTTCGGGAACAAGGAAGGTGAGAAAGAATCTGTAGTGAACGAAACTCAATCTGGTGAAGAAGAAAGTGGAAAAACTGAACTTGCAGAATCTAGTCCTTGA

Protein sequence

MALSKPAFYTHLKTLTGSHHLLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLRIEPPLSSSSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAKSFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGELRKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKESVVNETQSGEEESGKTELAESSP
BLAST of Lsi05G001010 vs. Swiss-Prot
Match: PP273_ARATH (Pentatricopeptide repeat-containing protein At3g49240 OS=Arabidopsis thaliana GN=EMB1796 PE=2 SV=1)

HSP 1 Score: 769.2 bits (1985), Expect = 3.4e-221
Identity = 395/619 (63.81%), Postives = 494/619 (79.81%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLRIEP 60
           M++SK AF  HL+TL+ S+   R +    P + +R++SFA+ EEAAAERRRRKRRLR+EP
Sbjct: 1   MSISKAAFLNHLQTLSRSY---RHRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEP 60

Query: 61  PLSS-SSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           P++S + + + QSQ PRP   QNPN PKLPE +SAL G RL+LHN IL LIRENDLEEAA
Sbjct: 61  PVNSFNRSQQQQSQIPRPI--QNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           L+TRHS+YSNCRPTIFTVN VL AQLRQ+KY  LL LH FI QAG+APNIIT+NLIFQ Y
Sbjct: 121 LYTRHSVYSNCRPTIFTVNTVLAAQLRQAKYGALLQLHGFINQAGIAPNIITYNLIFQAY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LD RKP+ A+EHYK  I++AP NPS  T+RIL KGLV N+ LE+AME+KE+M VKGFV D
Sbjct: 181 LDVRKPEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           P++Y YLM+GCV+ SD DGV KL++ELKEKLGG V+DGVVYG LMKGYFMKEME+EAM C
Sbjct: 241 PVVYSYLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMEC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EE VGE+S V+MSA+AYN VL+AL  NGKF EAL LFD + KEH+PPR +AVNLG+FNV
Sbjct: 301 YEEAVGENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           MV+GYC  G+F++A+EVF +MGD++CSPDTLSFNNL+ QLC+N +LAEAEKLYG M +K 
Sbjct: 361 MVNGYCAGGKFEEAMEVFRQMGDFKCSPDTLSFNNLMNQLCDNELLAEAEKLYGEMEEKN 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           V PDE+TYGLLMD+CFKE + D+ A Y++ MV+S LRPN+AVYNRL D+L+K GK++DAK
Sbjct: 421 VKPDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SFFD+MV KLKMD   Y+FIM+ALS++G+LDE+L +VD +LDDD +  SEELQEFV+ EL
Sbjct: 481 SFFDMMVSKLKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RK  RE DL KL+EEKERLKAEAKAKE   AE +K++    +++L+  K    K+   + 
Sbjct: 541 RKGGREGDLEKLMEEKERLKAEAKAKELADAEEKKKAQSINIAALIPPKAVEEKKETAKL 600

Query: 601 VVNETQSGEEESGKTELAE 619
           +      G EE+   E+A+
Sbjct: 601 LWENEAGGVEEADVVEMAK 614

BLAST of Lsi05G001010 vs. Swiss-Prot
Match: PPR29_ARATH (Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana GN=GRP23 PE=1 SV=1)

HSP 1 Score: 301.2 bits (770), Expect = 2.6e-80
Identity = 187/534 (35.02%), Postives = 293/534 (54.87%), Query Frame = 1

Query: 30  PIVTLRFLSFASPEEAAAERRRRKRRLRIEPPLSSSSAARPQSQPPRPQTPQNPNAPKLP 89
           P +  R ++F+S EEAAAERRRRKRRLRIEPPL +      +  P  P   ++PNAP+LP
Sbjct: 81  PPIPHRTMAFSSAEEAAAERRRRKRRLRIEPPLHAL-----RRDPSAPPPKRDPNAPRLP 140

Query: 90  EHISALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSK 149
           +  SAL G RLNLHNR+ +LIR +DL+ A+   R S++SN RPT+FT NA++ A  R  +
Sbjct: 141 DSTSALVGQRLNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIAAMYRAKR 200

Query: 150 YADLLSLHR-FITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTY 209
           Y++ +SL + F  Q+ + PN++++N I   + D    D A+E Y+ ++ +APF PS  TY
Sbjct: 201 YSESISLFQYFFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPFAPSSVTY 260

Query: 210 RILFKGLVDNNKLERAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKE 269
           R L KGLV   ++  A  L  EM  KG   D  +Y+ L+ G +   D D   + F+ELK 
Sbjct: 261 RHLTKGLVQAGRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVEFFDELKS 320

Query: 270 KLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNG 329
           K   TV DG+V  + M+ +F K  ++EAM  +   + +    +M     N +L+   + G
Sbjct: 321 KC--TVYDGIVNATFMEYWFEKGNDKEAMESYRSLLDKK--FRMHPPTGNVLLEVFLKFG 380

Query: 330 KFGEALMLFDRMTKEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSP- 389
           K  EA  LF+ M   H+PP  ++VN  +  +MV+     G F +AI  F+K+G    S  
Sbjct: 381 KKDEAWALFNEMLDNHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVGSKVTSKP 440

Query: 390 ---DTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAA 449
              D L + N++ + C  GML EAE+ +     + +  D  ++  ++D+  K  R DDA 
Sbjct: 441 FVMDYLGYCNIVTRFCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKAERIDDAV 500

Query: 450 GYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAKSFFDLM-VKKLKMDASGYQFIMKAL 509
               +MV   LR       R+  EL+K GK+ ++      M  ++ K D S Y  +++ L
Sbjct: 501 KMLDRMVDVNLRVVADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIYDVVVRGL 560

Query: 510 SDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGELRKEDREEDLAKLVEEKER 558
            D   LD+  ++V  ++  + +  +  L+EF+     K  R E++ K++    R
Sbjct: 561 CDGDALDQAKDIVGEMIRHN-VGVTTVLREFIIEVFEKAGRREEIEKILNSVAR 604

BLAST of Lsi05G001010 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 7.4e-43
Identity = 107/413 (25.91%), Postives = 196/413 (47.46%), Query Frame = 1

Query: 103 HNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQ 162
           +N +  ++R  +LEE   F  + +Y    P I     ++    R  K      +   +  
Sbjct: 106 NNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEG 165

Query: 163 AGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLE 222
           +G  P++IT+N++   Y    + + A+     +++    +P   TY  + + L D+ KL+
Sbjct: 166 SGAVPDVITYNVMISGYCKAGEINNALS----VLDRMSVSPDVVTYNTILRSLCDSGKLK 225

Query: 223 RAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGS 282
           +AME+ + M  +   PD + Y  L+    R S      KL +E++++  G   D V Y  
Sbjct: 226 QAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDR--GCTPDVVTYNV 285

Query: 283 LMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEA-LMLFDRMT 342
           L+ G   +   +EA++   +     S  + + I +N +L ++C  G++ +A  +L D + 
Sbjct: 286 LVNGICKEGRLDEAIKFLNDMPS--SGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLR 345

Query: 343 KEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCN 402
           K  SP      ++ +FN++++  C +G    AI++ EKM  + C P++LS+N L+   C 
Sbjct: 346 KGFSP------SVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCK 405

Query: 403 NGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAV 462
              +  A +    M  +G  PD  TY  ++ +  K+ + +DA     ++   G  P +  
Sbjct: 406 EKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLIT 465

Query: 463 YNRLVDELVKLGKINDAKSFFDLM-VKKLKMDASGYQFIMKALSDSGKLDEIL 514
           YN ++D L K GK   A    D M  K LK D   Y  ++  LS  GK+DE +
Sbjct: 466 YNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAI 504

BLAST of Lsi05G001010 vs. Swiss-Prot
Match: PPR36_ARATH (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 165.6 bits (418), Expect = 1.7e-39
Identity = 107/419 (25.54%), Postives = 196/419 (46.78%), Query Frame = 1

Query: 93  SALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYAD 152
           SA S   L+   R+ + + +   ++A    R  I+S   PT+   + + +A  +  +Y  
Sbjct: 47  SAFSDRNLSYRERLRSGLVDIKADDAIDLFRDMIHSRPLPTVIDFSRLFSAIAKTKQYDL 106

Query: 153 LLSLHRFITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILF 212
           +L+L + +   G+A N+ T +++   +  CRK   A     ++I    + P+  T+  L 
Sbjct: 107 VLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIK-LGYEPNTITFSTLI 166

Query: 213 KGLVDNNKLERAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGG 272
            GL    ++  A+EL + M   G  PD +  + L+ G            L +++ E   G
Sbjct: 167 NGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEY--G 226

Query: 273 TVEDGVVYGSLMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGE 332
              + V YG ++           AM    +   E   +K+ A+ Y+ ++D LC++G    
Sbjct: 227 CQPNAVTYGPVLNVMCKSGQTALAMELLRKM--EERNIKLDAVKYSIIIDGLCKHGSLDN 286

Query: 333 ALMLFDRMTKEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSF 392
           A  LF+ M       + +  N+ ++N+++ G+C  GR+ D  ++   M   + +P+ ++F
Sbjct: 287 AFNLFNEMEM-----KGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTF 346

Query: 393 NNLIEQLCNNGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQ 452
           + LI+     G L EAE+L+  M  +G+ PD  TY  L+D   KEN  D A      MV 
Sbjct: 347 SVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVS 406

Query: 453 SGLRPNIAVYNRLVDELVKLGKINDAKSFFDLM-VKKLKMDASGYQFIMKALSDSGKLD 511
            G  PNI  +N L++   K  +I+D    F  M ++ +  D   Y  +++   + GKL+
Sbjct: 407 KGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLN 455

BLAST of Lsi05G001010 vs. Swiss-Prot
Match: PPR37_ARATH (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 2.1e-37
Identity = 115/451 (25.50%), Postives = 202/451 (44.79%), Query Frame = 1

Query: 109 LIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPN 168
           L  E  + EA       +    +PT+ T+NA++N      K +D + L   + + G  PN
Sbjct: 152 LCLEGRVSEALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVLLIDRMVETGFQPN 211

Query: 169 IITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELK 228
            +T+  + +      +   AME  +++  +         Y I+  GL  +  L+ A  L 
Sbjct: 212 EVTYGPVLKVMCKSGQTALAMELLRKM-EERKIKLDAVKYSIIIDGLCKDGSLDNAFNLF 271

Query: 229 EEMTVKGFVPDPLIYHYLMVG-CVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGY 288
            EM +KGF  D +IY  L+ G C      DG   L + +K K+     D V + +L+  +
Sbjct: 272 NEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKI---TPDVVAFSALIDCF 331

Query: 289 FMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPP 348
             +    EA    +E +     +    + Y S++D  C+  +  +A  + D M  +   P
Sbjct: 332 VKEGKLREAEELHKEMIQRG--ISPDTVTYTSLIDGFCKENQLDKANHMLDLMVSKGCGP 391

Query: 349 RRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAE 408
                N+ +FN++++GYC      D +E+F KM       DT+++N LI+  C  G L  
Sbjct: 392 -----NIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLEV 451

Query: 409 AEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVD 468
           A++L+  M  + V PD  +Y +L+D        + A   F K+ +S +  +I +YN ++ 
Sbjct: 452 AKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIFEKIEKSKMELDIGIYNIIIH 511

Query: 469 ELVKLGKINDAKSFF-DLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIE 528
            +    K++DA   F  L +K +K D   Y  ++  L   G L E   +   + +D    
Sbjct: 512 GMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKKGSLSEADLLFRKMEEDGHSP 571

Query: 529 FSEELQEFVRGELRKEDREEDLAKLVEEKER 558
                   +R  L + D  +  AKL+EE +R
Sbjct: 572 NGCTYNILIRAHLGEGDATKS-AKLIEEIKR 590

BLAST of Lsi05G001010 vs. TrEMBL
Match: A0A0A0L7B2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G239860 PE=4 SV=1)

HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 575/624 (92.15%), Postives = 593/624 (95.03%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPA-LPIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKPAF+THLKTLTGSHHLL+RQA A  PIVTLRFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQALAPFPIVTLRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARP +QPPR QTPQNPNAPK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPLTQPPRSQTPQNPNAPKIPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRIL KGLVDNNKLERAMELK+EM  KGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKDEMIEKGFAPD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIYHYLM GCVR  DPDGVFKLFEELKEKLG TVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMGGCVRSLDPDGVFKLFEELKEKLGATVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EETVG++SVVKMSAIAYNSVLDALCRNGKFGEAL LFDRMTKEH PPR +AVNLGSFNV
Sbjct: 301 YEETVGDNSVVKMSAIAYNSVLDALCRNGKFGEALTLFDRMTKEHRPPRHLAVNLGSFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           MVDGYCIEGRFK+AIEVFEKMGDYRC PDTLSFNNLIEQLCNNGMLAEAE LYGTM DKG
Sbjct: 361 MVDGYCIEGRFKEAIEVFEKMGDYRCCPDTLSFNNLIEQLCNNGMLAEAEMLYGTMDDKG 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           VNPDEFTYGLLMDSCFK+NRADDAA YFRKMV SGLRPNIAVYN LVDELVKLGKI+DAK
Sbjct: 421 VNPDEFTYGLLMDSCFKKNRADDAAAYFRKMVDSGLRPNIAVYNILVDELVKLGKIDDAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SFFDLMVKKLKMDAS YQFIMKALS+SGK+DEILNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASSYQFIMKALSESGKMDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKE+REEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKENREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 VVNETQSGEEE--SGKTELAESSP 622
           VVNE QS E+E  SGKTELAESSP
Sbjct: 601 VVNEMQSVEQEDDSGKTELAESSP 624

BLAST of Lsi05G001010 vs. TrEMBL
Match: E5GB98_CUCME (Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1092.4 bits (2824), Expect = 0.0e+00
Identity = 565/624 (90.54%), Postives = 592/624 (94.87%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPA-LPIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKPAF+THLKTLTGSHHLL+RQAPA LPIVT RFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQAPAPLPIVTFRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARPQSQP R QTPQNPN PK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPQSQPSRSQTPQNPNTPKVPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRIL KGLVDN KLERAMELKEEM VKGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNKKLERAMELKEEMIVKGFAPD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIYHYLM GCVR SDPDGVFKLFEELKEKLGGTVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMAGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EETVG++ VVKMSAIAYNSVLDALC++GKF EAL LFDRMTKEH PPR +AVNLG+FNV
Sbjct: 301 YEETVGDNPVVKMSAIAYNSVLDALCKHGKFSEALTLFDRMTKEHRPPRHLAVNLGTFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           MVDGYCI+GRFK+AI VFE+MGDYRCSPDTLSFNNLIEQLCNNGMLAEAE LYGTMG+KG
Sbjct: 361 MVDGYCIKGRFKEAIGVFEEMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEMLYGTMGEKG 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           VNPDEFTYGLLM SCF++NRADDAA YFRKMV SGLRPNIAVYN LV ELVKLGK+++AK
Sbjct: 421 VNPDEFTYGLLMHSCFQKNRADDAAAYFRKMVDSGLRPNIAVYNILVGELVKLGKVDEAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SFFDLMVKKLKMDAS YQFIMKALS+SGK+DE+LNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASNYQFIMKALSESGKMDEVLNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 VVNETQSG--EEESGKTELAESSP 622
           VVNE QSG  E++ GKTELAES+P
Sbjct: 601 VVNEMQSGQQEDDGGKTELAESNP 624

BLAST of Lsi05G001010 vs. TrEMBL
Match: M5X3R7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002582mg PE=4 SV=1)

HSP 1 Score: 931.0 bits (2405), Expect = 7.4e-268
Identity = 483/628 (76.91%), Postives = 544/628 (86.62%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLT---GSHHLLRRQAPALP--IVTLRFLSFASPEEAAAERRRRKRR 60
           MALSKP F THL+TL      HH      P  P   ++LRFLSFA+PEEAAAERRRRKRR
Sbjct: 1   MALSKPTFLTHLRTLAKPPNCHH------PTTPPSFISLRFLSFATPEEAAAERRRRKRR 60

Query: 61  LRIEPPLSS---SSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRE 120
           LRIEPPLSS   +   + Q Q P+PQ  QNPNAPKLPE +SALSGNRLNLHNRILTL+R+
Sbjct: 61  LRIEPPLSSLHRNQQQQQQQQSPKPQ--QNPNAPKLPEPVSALSGNRLNLHNRILTLVRQ 120

Query: 121 NDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITH 180
           NDLEEAAL+TRHSIYSNCRPTIFTVN+VL AQLRQSKY+DLLSLHRFITQAGVAPNIITH
Sbjct: 121 NDLEEAALYTRHSIYSNCRPTIFTVNSVLTAQLRQSKYSDLLSLHRFITQAGVAPNIITH 180

Query: 181 NLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMT 240
           NLIFQTYLDCRKPDTAME+YKQLINDAPFNPSPTTYRIL KGLVDNNKL+RAMELKEE+ 
Sbjct: 181 NLIFQTYLDCRKPDTAMENYKQLINDAPFNPSPTTYRILIKGLVDNNKLDRAMELKEEID 240

Query: 241 VKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEM 300
            KGF PDP++YHYLMVGCV+ SD DGVF+L+EELKEKLGG VEDG+VYG+LMKGYFM+ M
Sbjct: 241 AKGFAPDPVVYHYLMVGCVKNSDSDGVFRLYEELKEKLGGVVEDGIVYGNLMKGYFMRGM 300

Query: 301 EEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAV 360
           E+EAM C+EE+ GE S VK SA+AYNSVLDAL +NGKF EAL LFDRM  EH+PPRR+AV
Sbjct: 301 EKEAMECYEESFGESSKVKTSAVAYNSVLDALSKNGKFDEALRLFDRMVAEHNPPRRLAV 360

Query: 361 NLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLY 420
           NLGSFNVM DGYC++GRFK+AIEVF KMGDYRCSPDTLSFNNLIEQLC NGML+EAE+LY
Sbjct: 361 NLGSFNVMADGYCVQGRFKEAIEVFRKMGDYRCSPDTLSFNNLIEQLCKNGMLSEAEELY 420

Query: 421 GTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKL 480
           G M DKGV PDEFTY LLMD+CF+ENRADDAA YFRKMV + LRPN+AVYNRLVD L+K+
Sbjct: 421 GEMSDKGVYPDEFTYVLLMDTCFEENRADDAAEYFRKMVDAKLRPNLAVYNRLVDGLIKV 480

Query: 481 GKINDAKSFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQ 540
           GK+++AKSFFDLMVKKLKMD   YQFIMK LS++GKLDE+LNVVDT+LDDDG+EF+EELQ
Sbjct: 481 GKVDEAKSFFDLMVKKLKMDIPSYQFIMKTLSEAGKLDEVLNVVDTMLDDDGVEFNEELQ 540

Query: 541 EFVRGELRKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGN 600
           EFV+GELRKE RE+++ KL+EEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGN
Sbjct: 541 EFVKGELRKEGREDEVGKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGN 600

Query: 601 KEGEKESVVNETQSGEEESGKTELAESS 621
           KE E  S      +GE  S  T+ AE++
Sbjct: 601 KESETGSTQATENAGEAAS--TQPAEAA 618

BLAST of Lsi05G001010 vs. TrEMBL
Match: W9RP26_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006813 PE=4 SV=1)

HSP 1 Score: 909.1 bits (2348), Expect = 3.0e-261
Identity = 481/645 (74.57%), Postives = 537/645 (83.26%), Query Frame = 1

Query: 1   MALSKP-AFYTHLKTLTGSHH--LLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLR 60
           MALSKP AF THLKTL    H   L    P    V+LRFLSFA+PE+AAAERRRRKRRLR
Sbjct: 1   MALSKPNAFLTHLKTLAKPPHRRFLSPPPPPPSFVSLRFLSFATPEDAAAERRRRKRRLR 60

Query: 61  IEPPLSSSSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEE 120
           IEPPLSS    + Q Q   P   +NPNAPKLP+H+SAL+GNRLNLHN+ILTLIRENDLEE
Sbjct: 61  IEPPLSSLHRNQQQQQQSPPPPQRNPNAPKLPDHVSALTGNRLNLHNKILTLIRENDLEE 120

Query: 121 AALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQ 180
           AAL+TRHSIYSNCRPTIFTVN+VLNA LRQSKY+DLLSLHRFITQAGVAPNIITHNL+FQ
Sbjct: 121 AALYTRHSIYSNCRPTIFTVNSVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLVFQ 180

Query: 181 TYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFV 240
           TYLDCRKPDTAMEHYKQLINDAPF+PSPTTYRIL KGLVDNN+LERA+ELKEEM+ KG  
Sbjct: 181 TYLDCRKPDTAMEHYKQLINDAPFSPSPTTYRILVKGLVDNNRLERALELKEEMSEKGLA 240

Query: 241 PDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAM 300
           PDP +YHYLM GCVR SD D VF L+EELK KLGG VEDGVVYGSLMK YF+K ME+EAM
Sbjct: 241 PDPTVYHYLMAGCVRNSDVDKVFDLYEELKGKLGGFVEDGVVYGSLMKAYFLKGMEKEAM 300

Query: 301 RCFEETVG---------------------EHSVVKMSAIAYNSVLDALCRNGKFGEALML 360
             FEE VG                     E+S VKMSA+AYNSVLDAL +NGKF EAL L
Sbjct: 301 EIFEEAVGAGYFLKGIKKESMETFEEALAENSSVKMSAVAYNSVLDALSKNGKFDEALKL 360

Query: 361 FDRMTKEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLI 420
           FDRM KEH+PPRR+AVNLG+FNV+ +GYC +GRF+DAIEVF  MGDYRCSPDTLSFN LI
Sbjct: 361 FDRMKKEHNPPRRLAVNLGTFNVIAEGYCAQGRFRDAIEVFRTMGDYRCSPDTLSFNVLI 420

Query: 421 EQLCNNGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLR 480
           EQLCNNGML EAE LYG MG+KGVNPDEFT+GLLMD+CFKENR DDAAGYFRKMV S LR
Sbjct: 421 EQLCNNGMLGEAEALYGEMGEKGVNPDEFTFGLLMDTCFKENRPDDAAGYFRKMVDSKLR 480

Query: 481 PNIAVYNRLVDELVKLGKINDAKSFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVV 540
           PN+AVYNRLVD LVK+GK+++AKSFFDLMVKKLKMD   Y+FIMKALS+SGKLDE+LNVV
Sbjct: 481 PNLAVYNRLVDGLVKVGKVDEAKSFFDLMVKKLKMDVPSYKFIMKALSESGKLDEVLNVV 540

Query: 541 DTLLDDDGIEFSEELQEFVRGELRKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRS 600
           DT+LDDDG+EF+EE+QEFV+GELRKE RE++LAKL+EEKER KAEAKAKEAEAAEA KRS
Sbjct: 541 DTMLDDDGVEFNEEVQEFVKGELRKEGREDELAKLIEEKERQKAEAKAKEAEAAEAAKRS 600

Query: 601 AKAAVSSLLSSKLFGNKEGEKESVVNETQSGE-EESGKTELAESS 621
           A+AAVSSLL SKLFG+KE         T+SG  E +G   + E+S
Sbjct: 601 ARAAVSSLLPSKLFGSKE--------STESGSAEANGSPTVGEAS 637

BLAST of Lsi05G001010 vs. TrEMBL
Match: A0A061G122_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_015413 PE=4 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 7.7e-249
Identity = 439/617 (71.15%), Postives = 515/617 (83.47%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLRIEP 60
           MALSKP F THL+ L   HH   R  P+   +T R LSF +PEEAAAERRRRKRRLR+EP
Sbjct: 1   MALSKPTFLTHLQNLAKRHH---RSPPSF--ITFRHLSFNTPEEAAAERRRRKRRLRVEP 60

Query: 61  PLSSSSAARPQSQPPRPQTP-QNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PLSS+  ++ Q+Q   P  P QNPNAPK+PE ++ L+GNRLNLHN+IL LIRENDLEEAA
Sbjct: 61  PLSSAHRSKQQAQQVAPSKPIQNPNAPKIPEPVTVLTGNRLNLHNKILKLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           L+TRHS+YSNCRPT++TVNAVLNAQLRQSKYADLLSLHRFIT AG+APN+ITHNLIFQTY
Sbjct: 121 LYTRHSVYSNCRPTVYTVNAVLNAQLRQSKYADLLSLHRFITLAGIAPNVITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LDC+KPDTA+EHYKQ  N++P NPSPTTYRIL KGLVDN KLE+A+E+KEEM  KG  PD
Sbjct: 181 LDCKKPDTALEHYKQFSNESPVNPSPTTYRILVKGLVDNGKLEKALEMKEEMVEKGLAPD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           P++Y YL++GC +  D DG+FKLFEELKEK  G +EDGV+YG LMKGYFM+ ME+EAM C
Sbjct: 241 PVVYSYLILGCAKSGDSDGIFKLFEELKEKKDGVLEDGVIYGGLMKGYFMRGMEKEAMEC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EE  GE+S VKMSA+AYN VLDAL +NGKF EAL LFDRM  EHSPPRR+AVNLGSFNV
Sbjct: 301 YEEACGENSKVKMSAVAYNYVLDALSKNGKFDEALRLFDRMKNEHSPPRRLAVNLGSFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           + DGYC EG+FK+A+E F  MGDYRCSPDTLSFNNLI+QLC NG+L EAE LYG MGDKG
Sbjct: 361 IADGYCAEGKFKEAMEAFRLMGDYRCSPDTLSFNNLIDQLCQNGLLGEAEDLYGEMGDKG 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           VNPDE+TY LLMD+CFK +R DD A YFRKMV+SGLRPN+AVYNRLVDELVK+GK+++AK
Sbjct: 421 VNPDEYTYVLLMDACFKVDRIDDGASYFRKMVESGLRPNLAVYNRLVDELVKVGKVDEAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SF+D MVKKLKMD + Y+F++KALSD GKLD +L +VD +LDD+ ++F+EELQEFV+ EL
Sbjct: 481 SFYDTMVKKLKMDDASYKFMIKALSDVGKLDVVLKMVDEMLDDESVDFNEELQEFVKEEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           R E REEDL KL+EEKERLKAEAKA+E EAAEA KRSAKAAVSSLL SKLFG KE E +S
Sbjct: 541 RNEGREEDLTKLMEEKERLKAEAKAREIEAAEAAKRSAKAAVSSLLPSKLFGKKEDESQS 600

Query: 601 -VVNETQSGEEESGKTE 616
              NE+       G+ +
Sbjct: 601 TAANESTIEAASEGEVQ 612

BLAST of Lsi05G001010 vs. TAIR10
Match: AT3G49240.1 (AT3G49240.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 769.2 bits (1985), Expect = 1.9e-222
Identity = 395/619 (63.81%), Postives = 494/619 (79.81%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLRIEP 60
           M++SK AF  HL+TL+ S+   R +    P + +R++SFA+ EEAAAERRRRKRRLR+EP
Sbjct: 1   MSISKAAFLNHLQTLSRSY---RHRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEP 60

Query: 61  PLSS-SSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           P++S + + + QSQ PRP   QNPN PKLPE +SAL G RL+LHN IL LIRENDLEEAA
Sbjct: 61  PVNSFNRSQQQQSQIPRPI--QNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           L+TRHS+YSNCRPTIFTVN VL AQLRQ+KY  LL LH FI QAG+APNIIT+NLIFQ Y
Sbjct: 121 LYTRHSVYSNCRPTIFTVNTVLAAQLRQAKYGALLQLHGFINQAGIAPNIITYNLIFQAY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LD RKP+ A+EHYK  I++AP NPS  T+RIL KGLV N+ LE+AME+KE+M VKGFV D
Sbjct: 181 LDVRKPEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           P++Y YLM+GCV+ SD DGV KL++ELKEKLGG V+DGVVYG LMKGYFMKEME+EAM C
Sbjct: 241 PVVYSYLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMEC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EE VGE+S V+MSA+AYN VL+AL  NGKF EAL LFD + KEH+PPR +AVNLG+FNV
Sbjct: 301 YEEAVGENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           MV+GYC  G+F++A+EVF +MGD++CSPDTLSFNNL+ QLC+N +LAEAEKLYG M +K 
Sbjct: 361 MVNGYCAGGKFEEAMEVFRQMGDFKCSPDTLSFNNLMNQLCDNELLAEAEKLYGEMEEKN 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           V PDE+TYGLLMD+CFKE + D+ A Y++ MV+S LRPN+AVYNRL D+L+K GK++DAK
Sbjct: 421 VKPDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SFFD+MV KLKMD   Y+FIM+ALS++G+LDE+L +VD +LDDD +  SEELQEFV+ EL
Sbjct: 481 SFFDMMVSKLKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RK  RE DL KL+EEKERLKAEAKAKE   AE +K++    +++L+  K    K+   + 
Sbjct: 541 RKGGREGDLEKLMEEKERLKAEAKAKELADAEEKKKAQSINIAALIPPKAVEEKKETAKL 600

Query: 601 VVNETQSGEEESGKTELAE 619
           +      G EE+   E+A+
Sbjct: 601 LWENEAGGVEEADVVEMAK 614

BLAST of Lsi05G001010 vs. TAIR10
Match: AT1G10270.1 (AT1G10270.1 glutamine-rich protein 23)

HSP 1 Score: 301.2 bits (770), Expect = 1.5e-81
Identity = 187/534 (35.02%), Postives = 293/534 (54.87%), Query Frame = 1

Query: 30  PIVTLRFLSFASPEEAAAERRRRKRRLRIEPPLSSSSAARPQSQPPRPQTPQNPNAPKLP 89
           P +  R ++F+S EEAAAERRRRKRRLRIEPPL +      +  P  P   ++PNAP+LP
Sbjct: 81  PPIPHRTMAFSSAEEAAAERRRRKRRLRIEPPLHAL-----RRDPSAPPPKRDPNAPRLP 140

Query: 90  EHISALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSK 149
           +  SAL G RLNLHNR+ +LIR +DL+ A+   R S++SN RPT+FT NA++ A  R  +
Sbjct: 141 DSTSALVGQRLNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIAAMYRAKR 200

Query: 150 YADLLSLHR-FITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTY 209
           Y++ +SL + F  Q+ + PN++++N I   + D    D A+E Y+ ++ +APF PS  TY
Sbjct: 201 YSESISLFQYFFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPFAPSSVTY 260

Query: 210 RILFKGLVDNNKLERAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKE 269
           R L KGLV   ++  A  L  EM  KG   D  +Y+ L+ G +   D D   + F+ELK 
Sbjct: 261 RHLTKGLVQAGRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVEFFDELKS 320

Query: 270 KLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNG 329
           K   TV DG+V  + M+ +F K  ++EAM  +   + +    +M     N +L+   + G
Sbjct: 321 KC--TVYDGIVNATFMEYWFEKGNDKEAMESYRSLLDKK--FRMHPPTGNVLLEVFLKFG 380

Query: 330 KFGEALMLFDRMTKEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSP- 389
           K  EA  LF+ M   H+PP  ++VN  +  +MV+     G F +AI  F+K+G    S  
Sbjct: 381 KKDEAWALFNEMLDNHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVGSKVTSKP 440

Query: 390 ---DTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAA 449
              D L + N++ + C  GML EAE+ +     + +  D  ++  ++D+  K  R DDA 
Sbjct: 441 FVMDYLGYCNIVTRFCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKAERIDDAV 500

Query: 450 GYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAKSFFDLM-VKKLKMDASGYQFIMKAL 509
               +MV   LR       R+  EL+K GK+ ++      M  ++ K D S Y  +++ L
Sbjct: 501 KMLDRMVDVNLRVVADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIYDVVVRGL 560

Query: 510 SDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGELRKEDREEDLAKLVEEKER 558
            D   LD+  ++V  ++  + +  +  L+EF+     K  R E++ K++    R
Sbjct: 561 CDGDALDQAKDIVGEMIRHN-VGVTTVLREFIIEVFEKAGRREEIEKILNSVAR 604

BLAST of Lsi05G001010 vs. TAIR10
Match: AT1G09900.1 (AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 176.8 bits (447), Expect = 4.2e-44
Identity = 107/413 (25.91%), Postives = 196/413 (47.46%), Query Frame = 1

Query: 103 HNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQ 162
           +N +  ++R  +LEE   F  + +Y    P I     ++    R  K      +   +  
Sbjct: 106 NNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEG 165

Query: 163 AGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLE 222
           +G  P++IT+N++   Y    + + A+     +++    +P   TY  + + L D+ KL+
Sbjct: 166 SGAVPDVITYNVMISGYCKAGEINNALS----VLDRMSVSPDVVTYNTILRSLCDSGKLK 225

Query: 223 RAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGS 282
           +AME+ + M  +   PD + Y  L+    R S      KL +E++++  G   D V Y  
Sbjct: 226 QAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDR--GCTPDVVTYNV 285

Query: 283 LMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEA-LMLFDRMT 342
           L+ G   +   +EA++   +     S  + + I +N +L ++C  G++ +A  +L D + 
Sbjct: 286 LVNGICKEGRLDEAIKFLNDMPS--SGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLR 345

Query: 343 KEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCN 402
           K  SP      ++ +FN++++  C +G    AI++ EKM  + C P++LS+N L+   C 
Sbjct: 346 KGFSP------SVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCK 405

Query: 403 NGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAV 462
              +  A +    M  +G  PD  TY  ++ +  K+ + +DA     ++   G  P +  
Sbjct: 406 EKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLIT 465

Query: 463 YNRLVDELVKLGKINDAKSFFDLM-VKKLKMDASGYQFIMKALSDSGKLDEIL 514
           YN ++D L K GK   A    D M  K LK D   Y  ++  LS  GK+DE +
Sbjct: 466 YNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAI 504

BLAST of Lsi05G001010 vs. TAIR10
Match: AT1G12300.1 (AT1G12300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 165.6 bits (418), Expect = 9.6e-41
Identity = 107/419 (25.54%), Postives = 196/419 (46.78%), Query Frame = 1

Query: 93  SALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYAD 152
           SA S   L+   R+ + + +   ++A    R  I+S   PT+   + + +A  +  +Y  
Sbjct: 47  SAFSDRNLSYRERLRSGLVDIKADDAIDLFRDMIHSRPLPTVIDFSRLFSAIAKTKQYDL 106

Query: 153 LLSLHRFITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILF 212
           +L+L + +   G+A N+ T +++   +  CRK   A     ++I    + P+  T+  L 
Sbjct: 107 VLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIK-LGYEPNTITFSTLI 166

Query: 213 KGLVDNNKLERAMELKEEMTVKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGG 272
            GL    ++  A+EL + M   G  PD +  + L+ G            L +++ E   G
Sbjct: 167 NGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEY--G 226

Query: 273 TVEDGVVYGSLMKGYFMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGE 332
              + V YG ++           AM    +   E   +K+ A+ Y+ ++D LC++G    
Sbjct: 227 CQPNAVTYGPVLNVMCKSGQTALAMELLRKM--EERNIKLDAVKYSIIIDGLCKHGSLDN 286

Query: 333 ALMLFDRMTKEHSPPRRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSF 392
           A  LF+ M       + +  N+ ++N+++ G+C  GR+ D  ++   M   + +P+ ++F
Sbjct: 287 AFNLFNEMEM-----KGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTF 346

Query: 393 NNLIEQLCNNGMLAEAEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQ 452
           + LI+     G L EAE+L+  M  +G+ PD  TY  L+D   KEN  D A      MV 
Sbjct: 347 SVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVS 406

Query: 453 SGLRPNIAVYNRLVDELVKLGKINDAKSFFDLM-VKKLKMDASGYQFIMKALSDSGKLD 511
            G  PNI  +N L++   K  +I+D    F  M ++ +  D   Y  +++   + GKL+
Sbjct: 407 KGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLN 455

BLAST of Lsi05G001010 vs. TAIR10
Match: AT1G12620.1 (AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 158.7 bits (400), Expect = 1.2e-38
Identity = 115/451 (25.50%), Postives = 202/451 (44.79%), Query Frame = 1

Query: 109 LIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPN 168
           L  E  + EA       +    +PT+ T+NA++N      K +D + L   + + G  PN
Sbjct: 152 LCLEGRVSEALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVLLIDRMVETGFQPN 211

Query: 169 IITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELK 228
            +T+  + +      +   AME  +++  +         Y I+  GL  +  L+ A  L 
Sbjct: 212 EVTYGPVLKVMCKSGQTALAMELLRKM-EERKIKLDAVKYSIIIDGLCKDGSLDNAFNLF 271

Query: 229 EEMTVKGFVPDPLIYHYLMVG-CVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGY 288
            EM +KGF  D +IY  L+ G C      DG   L + +K K+     D V + +L+  +
Sbjct: 272 NEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKI---TPDVVAFSALIDCF 331

Query: 289 FMKEMEEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPP 348
             +    EA    +E +     +    + Y S++D  C+  +  +A  + D M  +   P
Sbjct: 332 VKEGKLREAEELHKEMIQRG--ISPDTVTYTSLIDGFCKENQLDKANHMLDLMVSKGCGP 391

Query: 349 RRVAVNLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAE 408
                N+ +FN++++GYC      D +E+F KM       DT+++N LI+  C  G L  
Sbjct: 392 -----NIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLEV 451

Query: 409 AEKLYGTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVD 468
           A++L+  M  + V PD  +Y +L+D        + A   F K+ +S +  +I +YN ++ 
Sbjct: 452 AKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIFEKIEKSKMELDIGIYNIIIH 511

Query: 469 ELVKLGKINDAKSFF-DLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIE 528
            +    K++DA   F  L +K +K D   Y  ++  L   G L E   +   + +D    
Sbjct: 512 GMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKKGSLSEADLLFRKMEEDGHSP 571

Query: 529 FSEELQEFVRGELRKEDREEDLAKLVEEKER 558
                   +R  L + D  +  AKL+EE +R
Sbjct: 572 NGCTYNILIRAHLGEGDATKS-AKLIEEIKR 590

BLAST of Lsi05G001010 vs. NCBI nr
Match: gi|449456969|ref|XP_004146221.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis sativus])

HSP 1 Score: 1101.7 bits (2848), Expect = 0.0e+00
Identity = 575/624 (92.15%), Postives = 593/624 (95.03%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPA-LPIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKPAF+THLKTLTGSHHLL+RQA A  PIVTLRFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQALAPFPIVTLRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARP +QPPR QTPQNPNAPK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPLTQPPRSQTPQNPNAPKIPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRIL KGLVDNNKLERAMELK+EM  KGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKDEMIEKGFAPD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIYHYLM GCVR  DPDGVFKLFEELKEKLG TVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMGGCVRSLDPDGVFKLFEELKEKLGATVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EETVG++SVVKMSAIAYNSVLDALCRNGKFGEAL LFDRMTKEH PPR +AVNLGSFNV
Sbjct: 301 YEETVGDNSVVKMSAIAYNSVLDALCRNGKFGEALTLFDRMTKEHRPPRHLAVNLGSFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           MVDGYCIEGRFK+AIEVFEKMGDYRC PDTLSFNNLIEQLCNNGMLAEAE LYGTM DKG
Sbjct: 361 MVDGYCIEGRFKEAIEVFEKMGDYRCCPDTLSFNNLIEQLCNNGMLAEAEMLYGTMDDKG 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           VNPDEFTYGLLMDSCFK+NRADDAA YFRKMV SGLRPNIAVYN LVDELVKLGKI+DAK
Sbjct: 421 VNPDEFTYGLLMDSCFKKNRADDAAAYFRKMVDSGLRPNIAVYNILVDELVKLGKIDDAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SFFDLMVKKLKMDAS YQFIMKALS+SGK+DEILNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASSYQFIMKALSESGKMDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKE+REEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKENREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 VVNETQSGEEE--SGKTELAESSP 622
           VVNE QS E+E  SGKTELAESSP
Sbjct: 601 VVNEMQSVEQEDDSGKTELAESSP 624

BLAST of Lsi05G001010 vs. NCBI nr
Match: gi|659133624|ref|XP_008466825.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis melo])

HSP 1 Score: 1092.4 bits (2824), Expect = 0.0e+00
Identity = 565/624 (90.54%), Postives = 592/624 (94.87%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSHHLLRRQAPA-LPIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKPAF+THLKTLTGSHHLL+RQAPA LPIVT RFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQAPAPLPIVTFRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARPQSQP R QTPQNPN PK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPQSQPSRSQTPQNPNTPKVPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRIL KGLVDN KLERAMELKEEM VKGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNKKLERAMELKEEMIVKGFAPD 240

Query: 241 PLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIYHYLM GCVR SDPDGVFKLFEELKEKLGGTVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMAGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 FEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSFNV 360
           +EETVG++ VVKMSAIAYNSVLDALC++GKF EAL LFDRMTKEH PPR +AVNLG+FNV
Sbjct: 301 YEETVGDNPVVKMSAIAYNSVLDALCKHGKFSEALTLFDRMTKEHRPPRHLAVNLGTFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGDKG 420
           MVDGYCI+GRFK+AI VFE+MGDYRCSPDTLSFNNLIEQLCNNGMLAEAE LYGTMG+KG
Sbjct: 361 MVDGYCIKGRFKEAIGVFEEMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEMLYGTMGEKG 420

Query: 421 VNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKINDAK 480
           VNPDEFTYGLLM SCF++NRADDAA YFRKMV SGLRPNIAVYN LV ELVKLGK+++AK
Sbjct: 421 VNPDEFTYGLLMHSCFQKNRADDAAAYFRKMVDSGLRPNIAVYNILVGELVKLGKVDEAK 480

Query: 481 SFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540
           SFFDLMVKKLKMDAS YQFIMKALS+SGK+DE+LNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASNYQFIMKALSESGKMDEVLNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 VVNETQSG--EEESGKTELAESSP 622
           VVNE QSG  E++ GKTELAES+P
Sbjct: 601 VVNEMQSGQQEDDGGKTELAESNP 624

BLAST of Lsi05G001010 vs. NCBI nr
Match: gi|645258146|ref|XP_008234749.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Prunus mume])

HSP 1 Score: 931.0 bits (2405), Expect = 1.1e-267
Identity = 482/627 (76.87%), Postives = 544/627 (86.76%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLT---GSHHLLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLR 60
           MALSKP F THL+TL      HH      P    ++LRFLSFA+PEEAAAERRRRKRRLR
Sbjct: 1   MALSKPTFLTHLRTLAKPPNCHH----PTPPPSFISLRFLSFATPEEAAAERRRRKRRLR 60

Query: 61  IEPPLSS----SSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIREN 120
           IEPPLSS        + Q Q P+PQ  QNPNAPKLPE +SALSGNRLNLHNRILTL+R+N
Sbjct: 61  IEPPLSSLHRNQQQQQQQQQSPKPQ--QNPNAPKLPEPVSALSGNRLNLHNRILTLVRQN 120

Query: 121 DLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHN 180
           DLEEAAL+TRHSIYSNCRPTIFTVN+VL AQLRQSKY+DLLSLHRFITQAGVAPNIITHN
Sbjct: 121 DLEEAALYTRHSIYSNCRPTIFTVNSVLTAQLRQSKYSDLLSLHRFITQAGVAPNIITHN 180

Query: 181 LIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTV 240
           LIFQTYLDCRKPDTAME+YKQLINDAPFNPSPTTYRIL KGLVDNNKL+RAMELKEE+ V
Sbjct: 181 LIFQTYLDCRKPDTAMENYKQLINDAPFNPSPTTYRILIKGLVDNNKLDRAMELKEEIDV 240

Query: 241 KGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEME 300
           KGF PDP++YHYLMVGCV+ SD DGVFKL+EELKEKLGG VEDG+VYG+LMKGYFM+ ME
Sbjct: 241 KGFAPDPVVYHYLMVGCVKNSDSDGVFKLYEELKEKLGGVVEDGIVYGNLMKGYFMRGME 300

Query: 301 EEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVN 360
           +EAM C+EE++ E S VKMSA+AYNSVLDAL +NGKF EAL LFDRM  EH+PPRR+AVN
Sbjct: 301 KEAMECYEESLRESSKVKMSAVAYNSVLDALSKNGKFDEALRLFDRMVAEHNPPRRLAVN 360

Query: 361 LGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYG 420
           LGSFNVM DGYC EGRFK+AIEVF KMGDYRCSPDTLSFNNLIEQLC NGML+EAE+LYG
Sbjct: 361 LGSFNVMADGYCAEGRFKEAIEVFRKMGDYRCSPDTLSFNNLIEQLCKNGMLSEAEELYG 420

Query: 421 TMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLG 480
            M DKGVN DE+TY LLMD+CF+ENRADDAA YFRKMV + LRPN+AVYNRLVD L+K+G
Sbjct: 421 EMSDKGVNADEYTYVLLMDTCFEENRADDAAEYFRKMVDAKLRPNLAVYNRLVDGLIKVG 480

Query: 481 KINDAKSFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQE 540
           K+++AKSFFDLMVKKLKMD   YQFIMK LS++GKLDE+LNVV+T+LDDDG+EF+EELQE
Sbjct: 481 KVDEAKSFFDLMVKKLKMDIPSYQFIMKTLSEAGKLDEVLNVVNTMLDDDGVEFNEELQE 540

Query: 541 FVRGELRKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNK 600
           FV+GE+RKE RE+++ KL+EEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGNK
Sbjct: 541 FVKGEMRKEGREDEVGKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGNK 600

Query: 601 EGEKESVVNETQSGEEESGKTELAESS 621
           E E  S  +   +GE  S  T+ AE++
Sbjct: 601 ESETGSTQSTENAGEAAS--TQPAEAA 619

BLAST of Lsi05G001010 vs. NCBI nr
Match: gi|596020758|ref|XP_007218947.1| (hypothetical protein PRUPE_ppa002582mg [Prunus persica])

HSP 1 Score: 931.0 bits (2405), Expect = 1.1e-267
Identity = 483/628 (76.91%), Postives = 544/628 (86.62%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLT---GSHHLLRRQAPALP--IVTLRFLSFASPEEAAAERRRRKRR 60
           MALSKP F THL+TL      HH      P  P   ++LRFLSFA+PEEAAAERRRRKRR
Sbjct: 1   MALSKPTFLTHLRTLAKPPNCHH------PTTPPSFISLRFLSFATPEEAAAERRRRKRR 60

Query: 61  LRIEPPLSS---SSAARPQSQPPRPQTPQNPNAPKLPEHISALSGNRLNLHNRILTLIRE 120
           LRIEPPLSS   +   + Q Q P+PQ  QNPNAPKLPE +SALSGNRLNLHNRILTL+R+
Sbjct: 61  LRIEPPLSSLHRNQQQQQQQQSPKPQ--QNPNAPKLPEPVSALSGNRLNLHNRILTLVRQ 120

Query: 121 NDLEEAALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITH 180
           NDLEEAAL+TRHSIYSNCRPTIFTVN+VL AQLRQSKY+DLLSLHRFITQAGVAPNIITH
Sbjct: 121 NDLEEAALYTRHSIYSNCRPTIFTVNSVLTAQLRQSKYSDLLSLHRFITQAGVAPNIITH 180

Query: 181 NLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMT 240
           NLIFQTYLDCRKPDTAME+YKQLINDAPFNPSPTTYRIL KGLVDNNKL+RAMELKEE+ 
Sbjct: 181 NLIFQTYLDCRKPDTAMENYKQLINDAPFNPSPTTYRILIKGLVDNNKLDRAMELKEEID 240

Query: 241 VKGFVPDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEM 300
            KGF PDP++YHYLMVGCV+ SD DGVF+L+EELKEKLGG VEDG+VYG+LMKGYFM+ M
Sbjct: 241 AKGFAPDPVVYHYLMVGCVKNSDSDGVFRLYEELKEKLGGVVEDGIVYGNLMKGYFMRGM 300

Query: 301 EEEAMRCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAV 360
           E+EAM C+EE+ GE S VK SA+AYNSVLDAL +NGKF EAL LFDRM  EH+PPRR+AV
Sbjct: 301 EKEAMECYEESFGESSKVKTSAVAYNSVLDALSKNGKFDEALRLFDRMVAEHNPPRRLAV 360

Query: 361 NLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLY 420
           NLGSFNVM DGYC++GRFK+AIEVF KMGDYRCSPDTLSFNNLIEQLC NGML+EAE+LY
Sbjct: 361 NLGSFNVMADGYCVQGRFKEAIEVFRKMGDYRCSPDTLSFNNLIEQLCKNGMLSEAEELY 420

Query: 421 GTMGDKGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKL 480
           G M DKGV PDEFTY LLMD+CF+ENRADDAA YFRKMV + LRPN+AVYNRLVD L+K+
Sbjct: 421 GEMSDKGVYPDEFTYVLLMDTCFEENRADDAAEYFRKMVDAKLRPNLAVYNRLVDGLIKV 480

Query: 481 GKINDAKSFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQ 540
           GK+++AKSFFDLMVKKLKMD   YQFIMK LS++GKLDE+LNVVDT+LDDDG+EF+EELQ
Sbjct: 481 GKVDEAKSFFDLMVKKLKMDIPSYQFIMKTLSEAGKLDEVLNVVDTMLDDDGVEFNEELQ 540

Query: 541 EFVRGELRKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGN 600
           EFV+GELRKE RE+++ KL+EEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGN
Sbjct: 541 EFVKGELRKEGREDEVGKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGN 600

Query: 601 KEGEKESVVNETQSGEEESGKTELAESS 621
           KE E  S      +GE  S  T+ AE++
Sbjct: 601 KESETGSTQATENAGEAAS--TQPAEAA 618

BLAST of Lsi05G001010 vs. NCBI nr
Match: gi|1009123239|ref|XP_015878436.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Ziziphus jujuba])

HSP 1 Score: 929.1 bits (2400), Expect = 4.1e-267
Identity = 480/621 (77.29%), Postives = 539/621 (86.80%), Query Frame = 1

Query: 1   MALSKPAFYTHLKTLTGSH-HLLRRQAPALPIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKP F  HLK+L   H HL R   P    ++LRFLSFA+PEEAAAERRRRKRRLRIE
Sbjct: 1   MALSKPTFLIHLKSLNAPHRHLRRLPPPPSSFISLRFLSFATPEEAAAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPPRPQTP--QNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEE 120
           PPLSS    + Q Q  + Q+P  QNPNAPKLPE ++ALSGNRLNLHNRIL LIR+NDLEE
Sbjct: 61  PPLSSLHRTQQQQQQAQTQSPKPQNPNAPKLPEPVTALSGNRLNLHNRILELIRKNDLEE 120

Query: 121 AALFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVAPNIITHNLIFQ 180
           AAL+TRHSIYSNCRPTIFTVNAVLNA LRQSKY+DLLSLHRFITQAGVAPNIITHNLIFQ
Sbjct: 121 AALYTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQ 180

Query: 181 TYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILFKGLVDNNKLERAMELKEEMTVKGFV 240
           TYLDCRKPD AMEHYKQLINDAPFNPSPTTY+IL  GLVDNNKLERA+ELKEEM VKG  
Sbjct: 181 TYLDCRKPDIAMEHYKQLINDAPFNPSPTTYQILIAGLVDNNKLERALELKEEMDVKGIP 240

Query: 241 PDPLIYHYLMVGCVRRSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAM 300
            +P++YH+LM+GCV+ SD DGVF+L+EELKEKLGG+VEDGVVYGSLMKGYF++ ME+EAM
Sbjct: 241 ANPVVYHHLMLGCVKNSDADGVFRLYEELKEKLGGSVEDGVVYGSLMKGYFLRGMEKEAM 300

Query: 301 RCFEETVGEHSVVKMSAIAYNSVLDALCRNGKFGEALMLFDRMTKEHSPPRRVAVNLGSF 360
            C+EE VGE+S VKMSA+AYNSVLDAL +NGKF EAL LFDRMTKEH+PP+R+AVNLGSF
Sbjct: 301 ECYEEAVGENSKVKMSAVAYNSVLDALSKNGKFDEALGLFDRMTKEHNPPKRLAVNLGSF 360

Query: 361 NVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEKLYGTMGD 420
           NVM DGYC +G FKDAIEVF KMGDYRCSPD LSFNNLIEQLCNNG+L EAE+LYG M  
Sbjct: 361 NVMADGYCAQGSFKDAIEVFRKMGDYRCSPDALSFNNLIEQLCNNGLLTEAEELYGEMDG 420

Query: 421 KGVNPDEFTYGLLMDSCFKENRADDAAGYFRKMVQSGLRPNIAVYNRLVDELVKLGKIND 480
           KGVNPDE+T+ LLMD+CFKENR DDAA YFRKM+ S LRPN+AVYN+LVD LVK+GKI++
Sbjct: 421 KGVNPDEYTFVLLMDACFKENRPDDAAEYFRKMIDSKLRPNLAVYNKLVDGLVKVGKIDE 480

Query: 481 AKSFFDLMVKKLKMDASGYQFIMKALSDSGKLDEILNVVDTLLDDDGIEFSEELQEFVRG 540
           AKSFFDLMVKKLKMD   Y+FIMKALS+SGK DE+LNVVDT+LDDDG+EF+EE+QEFV+G
Sbjct: 481 AKSFFDLMVKKLKMDVPSYEFIMKALSESGKFDEVLNVVDTMLDDDGVEFNEEVQEFVKG 540

Query: 541 ELRKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEK 600
           ELRKE RE+DL KL+EEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGNKE + 
Sbjct: 541 ELRKEGREDDLVKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGNKESDT 600

Query: 601 ESVVNETQSGEEESGKTELAE 619
            S   E      E+GKT +AE
Sbjct: 601 GSA--EANGNAIEAGKTGIAE 619

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP273_ARATH3.4e-22163.81Pentatricopeptide repeat-containing protein At3g49240 OS=Arabidopsis thaliana GN... [more]
PPR29_ARATH2.6e-8035.02Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana GN... [more]
PPR28_ARATH7.4e-4325.91Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN... [more]
PPR36_ARATH1.7e-3925.54Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop... [more]
PPR37_ARATH2.1e-3725.50Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0L7B2_CUCSA0.0e+0092.15Uncharacterized protein OS=Cucumis sativus GN=Csa_3G239860 PE=4 SV=1[more]
E5GB98_CUCME0.0e+0090.54Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=... [more]
M5X3R7_PRUPE7.4e-26876.91Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002582mg PE=4 SV=1[more]
W9RP26_9ROSA3.0e-26174.57Uncharacterized protein OS=Morus notabilis GN=L484_006813 PE=4 SV=1[more]
A0A061G122_THECC7.7e-24971.15Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_... [more]
Match NameE-valueIdentityDescription
AT3G49240.11.9e-22263.81 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G10270.11.5e-8135.02 glutamine-rich protein 23[more]
AT1G09900.14.2e-4425.91 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G12300.19.6e-4125.54 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G12620.11.2e-3825.50 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449456969|ref|XP_004146221.1|0.0e+0092.15PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis sativu... [more]
gi|659133624|ref|XP_008466825.1|0.0e+0090.54PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis melo][more]
gi|645258146|ref|XP_008234749.1|1.1e-26776.87PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Prunus mume][more]
gi|596020758|ref|XP_007218947.1|1.1e-26776.91hypothetical protein PRUPE_ppa002582mg [Prunus persica][more]
gi|1009123239|ref|XP_015878436.1|4.1e-26777.29PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Ziziphus jujub... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0010033 response to organic substance
biological_process GO:0048366 leaf development
biological_process GO:0048609 multicellular organismal reproductive process
biological_process GO:0071704 organic substance metabolic process
biological_process GO:0044238 primary metabolic process
biological_process GO:0050794 regulation of cellular process
biological_process GO:0009628 response to abiotic stimulus
biological_process GO:1901700 response to oxygen-containing compound
biological_process GO:0010182 sugar mediated signaling pathway
biological_process GO:0044763 single-organism cellular process
biological_process GO:0009960 endosperm development
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0006349 regulation of gene expression by genetic imprinting
biological_process GO:0009451 RNA modification
biological_process GO:0008150 biological_process
biological_process GO:0044237 cellular metabolic process
biological_process GO:0010228 vegetative to reproductive phase transition of meristem
biological_process GO:0009845 seed germination
biological_process GO:0009640 photomorphogenesis
biological_process GO:0051301 cell division
biological_process GO:0048825 cotyledon development
biological_process GO:0009560 embryo sac egg cell differentiation
biological_process GO:0010162 seed dormancy process
biological_process GO:0009933 meristem structural organization
biological_process GO:0019915 lipid storage
biological_process GO:0016567 protein ubiquitination
biological_process GO:0009220 pyrimidine ribonucleotide biosynthetic process
biological_process GO:0010564 regulation of cell cycle process
biological_process GO:0009909 regulation of flower development
biological_process GO:0009737 response to abscisic acid
biological_process GO:0050826 response to freezing
cellular_component GO:0044444 cytoplasmic part
cellular_component GO:0043231 intracellular membrane-bounded organelle
cellular_component GO:0009507 chloroplast
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi05G001010.1Lsi05G001010.1mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 461..488
score: 0.017coord: 316..343
score: 1.1E-7coord: 243..269
score: 0.16coord: 207..236
score: 0.0091coord: 278..302
score: 0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 353..380
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 387..435
score: 3.6
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 129..179
score: 0.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 425..459
score: 1.2E-6coord: 461..488
score: 0.002coord: 316..344
score: 5.6E-8coord: 356..389
score: 1.0E-8coord: 391..424
score: 2.7E-5coord: 207..239
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 492..527
score: 6.829coord: 423..457
score: 11.74coord: 276..310
score: 7.3coord: 388..422
score: 11.542coord: 133..167
score: 7.278coord: 458..488
score: 8.813coord: 204..238
score: 10.534coord: 168..198
score: 6.61coord: 313..347
score: 12.299coord: 353..387
score: 12.079coord: 239..269
score: 8
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 275..515
score: 2.0
NoneNo IPR availableunknownCoilCoilcoord: 541..580
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 27..42
score: 1.2E-267coord: 60..533
score: 1.2E
NoneNo IPR availablePANTHERPTHR24015:SF237SUBFAMILY NOT NAMEDcoord: 27..42
score: 1.2E-267coord: 60..533
score: 1.2E
NoneNo IPR availableunknownSSF81901HCP-likecoord: 179..269
score: 8.63E-8coord: 312..463
score: 8.6

The following gene(s) are paralogous to this gene:

None