Lsi04G010270 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G010270
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat-containing protein, putative
Location: chr04 : 11721528 .. 11723588 (+)
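
The location string above can be turned into a feature length with a few lines of Python. This is only a sketch: the helper names `parse_location` and `feature_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive at both ends, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Split a string like 'chr04 : 11721528 .. 11723588 (+)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr04 : 11721528 .. 11723588 (+)")
length = feature_length(start, end)
print(seqid, strand, length)
```

Under that assumption the span is 2061 bp, a multiple of three, which is what one would expect if the whole span is coding.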
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAGTGCACTTTCAAAAATAAGTTTCTGCTCCAAGCTCAACTTTAGAAGAAAAAGAGGATGTAGATATTTTGCAACAGCGAATTCTGCGTTGTCTTCTTTGAACTATGTGGACGATGGTTGCTTTACTTTTGAATGTCCCGTTGCTACAAACTATAATGATGACGCTGTTGAACAAAATTATTTTGGCAATGAAGTTCAAGTTTCTAAAGGTCAGAAAGCTGATGAGGATGCAATGAAAACAATAAAATTGATACTTGGGAACCATGGGTTTAATCTTGGTTCGCATCCGAAACAATTCGATAATGTAAGGATTTTGGACATTCTATTTGAGGATAGTTCAGATGCCAGACTTTGTCTTCACTACTTCAAATGGTCAGGATGTTTATCCAGATCTAATCAGTCGCTGGAGTCAATCTGTAAGATGATGCATATTTTGGTAACTGGGAATATGAATCATAGGGCAGTTGATTTAATGTCACACCTTGCTAAAACTTATGGTAGTGAAGAGGGATTTTCAACCATATTGCTGAAACTTTTGTATGAAACACATAAAGAAAGGAAGACTTTGGAAACCACATGCAGCATGCTGGTTTTCTGTTATATCAAGGAAAGAATGGTAACGCCTGCCCTTATGTTGATGGGTCAAATGAAGCACCTTAACATATTTCCTTCTGCATGGGTATACAAGTCGGTGATACAAGCTTTATTACAAACCAATCAGTCGGAGTTAGCCTGGGATCTTCTAGAAGAAATGTATCGGCAAGGTTTAAGTTTAAATTATTCAATTAATTTATTTGTTCATCACTATTGTGCAAAAGGTAATCTAGGCAGGGGGTGGAAAGTGCTTTTGGAGCTGAGGAATTTTGGATCTAAGCCTGATGCAGTTGATTACACAACTGTGATCAACTCACTTTGCAAAATTTCTCTTTTAAAAGAAGCTACCACCGTGTTGTTTAAAATGACTGCTTTTGGTGTTTCCCCTGATTCAGTTACGTTGAGTTCTGTTATTGATGGTTATTGTAAAGTAGGAATGTCGGATATAGCTTGTAAAATATTGAAGTATTTTAGGCTTCCCCTAAATATTTTCACATACAATAGCTTTATAACAAGGTTATGTGTGGAAGGAAACATGGAAAGGGCTTCTGAAGTTTTTCTTGAAATGTCTGAGGTGGGCTTAGTTCCAGACTGTGTTAGTTACACAACCATGATAGGAGGCTATTGTAAAGTGGAAAACATAAACAGAGCATTCTCTTACATATGCAAGATGTTAAAAAGTGGAATACAACCATCTCTTATCACGTATACTTTGTTCATTGATAAGTTTTGCAAGTGTGGAGATGTGGAAATGGCTGAAGTTATGTTCCAAAAGATGATTATCGAGGGTTTAAAACCTGATGTTGTCACGTATAATATTTTGATGGATGGATATGGAAAGAAGGGGTACTTGCATAAGGCTTTTGAACTCCTTGATATGATGAGATCTACCAATGTTACCCCTGACGTTGTGACATATAACACTCTTATTAATGGTCTTGTTATGCGAGGGTTTCTCAAAGAGGCAAAGGATATTCTAGATGAGCTCATCAGGAGGGGTTTCAGTATAGATGTTGTCACATACACTAATATCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGACTGACAATTGTGTAAAGCCTGATGTTGTTACTTGCAGTGCTCTTCTTAGTGGGTATTGCCGAGAACAGCGTATAGACGAAGCAAATGCTCTATTTTGTAAAATGCTGGACATTGGGTTAAATCCAGACCTGATATTGTACAATACTCTAATCCATGGATTTTGCAGTGTTGGTAATGTGGACGAAGGTTGCAATTTTGTAAAGAAGATGATTGAAAGCAGTATCATTCCAAACAATGTTACTCATCGTGCACTTGTCCTCGGATTTCAGAAAAAGAGAGTTATCAATCCAATAGCGAGTGCCACTTC
TAAGCTTCAAGAAATCTTGCTTGCATATAATCTTCAAATTGATGTCAATGGACATATCTAA

mRNA sequence

ATGAAGAGTGCACTTTCAAAAATAAGTTTCTGCTCCAAGCTCAACTTTAGAAGAAAAAGAGGATGTAGATATTTTGCAACAGCGAATTCTGCGTTGTCTTCTTTGAACTATGTGGACGATGGTTGCTTTACTTTTGAATGTCCCGTTGCTACAAACTATAATGATGACGCTGTTGAACAAAATTATTTTGGCAATGAAGTTCAAGTTTCTAAAGGTCAGAAAGCTGATGAGGATGCAATGAAAACAATAAAATTGATACTTGGGAACCATGGGTTTAATCTTGGTTCGCATCCGAAACAATTCGATAATGTAAGGATTTTGGACATTCTATTTGAGGATAGTTCAGATGCCAGACTTTGTCTTCACTACTTCAAATGGTCAGGATGTTTATCCAGATCTAATCAGTCGCTGGAGTCAATCTGTAAGATGATGCATATTTTGGTAACTGGGAATATGAATCATAGGGCAGTTGATTTAATGTCACACCTTGCTAAAACTTATGGTAGTGAAGAGGGATTTTCAACCATATTGCTGAAACTTTTGTATGAAACACATAAAGAAAGGAAGACTTTGGAAACCACATGCAGCATGCTGGTTTTCTGTTATATCAAGGAAAGAATGGTAACGCCTGCCCTTATGTTGATGGGTCAAATGAAGCACCTTAACATATTTCCTTCTGCATGGGTATACAAGTCGGTGATACAAGCTTTATTACAAACCAATCAGTCGGAGTTAGCCTGGGATCTTCTAGAAGAAATGTATCGGCAAGGTTTAAGTTTAAATTATTCAATTAATTTATTTGTTCATCACTATTGTGCAAAAGGTAATCTAGGCAGGGGGTGGAAAGTGCTTTTGGAGCTGAGGAATTTTGGATCTAAGCCTGATGCAGTTGATTACACAACTGTGATCAACTCACTTTGCAAAATTTCTCTTTTAAAAGAAGCTACCACCGTGTTGTTTAAAATGACTGCTTTTGGTGTTTCCCCTGATTCAGTTACGTTGAGTTCTGTTATTGATGGTTATTGTAAAGTAGGAATGTCGGATATAGCTTGTAAAATATTGAAGTATTTTAGGCTTCCCCTAAATATTTTCACATACAATAGCTTTATAACAAGGTTATGTGTGGAAGGAAACATGGAAAGGGCTTCTGAAGTTTTTCTTGAAATGTCTGAGGTGGGCTTAGTTCCAGACTGTGTTAGTTACACAACCATGATAGGAGGCTATTGTAAAGTGGAAAACATAAACAGAGCATTCTCTTACATATGCAAGATGTTAAAAAGTGGAATACAACCATCTCTTATCACGTATACTTTGTTCATTGATAAGTTTTGCAAGTGTGGAGATGTGGAAATGGCTGAAGTTATGTTCCAAAAGATGATTATCGAGGGTTTAAAACCTGATGTTGTCACGTATAATATTTTGATGGATGGATATGGAAAGAAGGGGTACTTGCATAAGGCTTTTGAACTCCTTGATATGATGAGATCTACCAATGTTACCCCTGACGTTGTGACATATAACACTCTTATTAATGGTCTTGTTATGCGAGGGTTTCTCAAAGAGGCAAAGGATATTCTAGATGAGCTCATCAGGAGGGGTTTCAGTATAGATGTTGTCACATACACTAATATCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGACTGACAATTGTGTAAAGCCTGATGTTGTTACTTGCAGTGCTCTTCTTAGTGGGTATTGCCGAGAACAGCGTATAGACGAAGCAAATGCTCTATTTTGTAAAATGCTGGACATTGGGTTAAATCCAGACCTGATATTGTACAATACTCTAATCCATGGATTTTGCAGTGTTGGTAATGTGGACGAAGGTTGCAATTTTGTAAAGAAGATGATTGAAAGCAGTATCATTCCAAACAATGTTACTCATCGTGCACTTGTCCTCGGATTTCAGAAAAAGAGAGTTATCAATCCAATAGCGAGTGCCACTTC
TAAGCTTCAAGAAATCTTGCTTGCATATAATCTTCAAATTGATGTCAATGGACATATCTAA

Coding sequence (CDS)

ATGAAGAGTGCACTTTCAAAAATAAGTTTCTGCTCCAAGCTCAACTTTAGAAGAAAAAGAGGATGTAGATATTTTGCAACAGCGAATTCTGCGTTGTCTTCTTTGAACTATGTGGACGATGGTTGCTTTACTTTTGAATGTCCCGTTGCTACAAACTATAATGATGACGCTGTTGAACAAAATTATTTTGGCAATGAAGTTCAAGTTTCTAAAGGTCAGAAAGCTGATGAGGATGCAATGAAAACAATAAAATTGATACTTGGGAACCATGGGTTTAATCTTGGTTCGCATCCGAAACAATTCGATAATGTAAGGATTTTGGACATTCTATTTGAGGATAGTTCAGATGCCAGACTTTGTCTTCACTACTTCAAATGGTCAGGATGTTTATCCAGATCTAATCAGTCGCTGGAGTCAATCTGTAAGATGATGCATATTTTGGTAACTGGGAATATGAATCATAGGGCAGTTGATTTAATGTCACACCTTGCTAAAACTTATGGTAGTGAAGAGGGATTTTCAACCATATTGCTGAAACTTTTGTATGAAACACATAAAGAAAGGAAGACTTTGGAAACCACATGCAGCATGCTGGTTTTCTGTTATATCAAGGAAAGAATGGTAACGCCTGCCCTTATGTTGATGGGTCAAATGAAGCACCTTAACATATTTCCTTCTGCATGGGTATACAAGTCGGTGATACAAGCTTTATTACAAACCAATCAGTCGGAGTTAGCCTGGGATCTTCTAGAAGAAATGTATCGGCAAGGTTTAAGTTTAAATTATTCAATTAATTTATTTGTTCATCACTATTGTGCAAAAGGTAATCTAGGCAGGGGGTGGAAAGTGCTTTTGGAGCTGAGGAATTTTGGATCTAAGCCTGATGCAGTTGATTACACAACTGTGATCAACTCACTTTGCAAAATTTCTCTTTTAAAAGAAGCTACCACCGTGTTGTTTAAAATGACTGCTTTTGGTGTTTCCCCTGATTCAGTTACGTTGAGTTCTGTTATTGATGGTTATTGTAAAGTAGGAATGTCGGATATAGCTTGTAAAATATTGAAGTATTTTAGGCTTCCCCTAAATATTTTCACATACAATAGCTTTATAACAAGGTTATGTGTGGAAGGAAACATGGAAAGGGCTTCTGAAGTTTTTCTTGAAATGTCTGAGGTGGGCTTAGTTCCAGACTGTGTTAGTTACACAACCATGATAGGAGGCTATTGTAAAGTGGAAAACATAAACAGAGCATTCTCTTACATATGCAAGATGTTAAAAAGTGGAATACAACCATCTCTTATCACGTATACTTTGTTCATTGATAAGTTTTGCAAGTGTGGAGATGTGGAAATGGCTGAAGTTATGTTCCAAAAGATGATTATCGAGGGTTTAAAACCTGATGTTGTCACGTATAATATTTTGATGGATGGATATGGAAAGAAGGGGTACTTGCATAAGGCTTTTGAACTCCTTGATATGATGAGATCTACCAATGTTACCCCTGACGTTGTGACATATAACACTCTTATTAATGGTCTTGTTATGCGAGGGTTTCTCAAAGAGGCAAAGGATATTCTAGATGAGCTCATCAGGAGGGGTTTCAGTATAGATGTTGTCACATACACTAATATCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGACTGACAATTGTGTAAAGCCTGATGTTGTTACTTGCAGTGCTCTTCTTAGTGGGTATTGCCGAGAACAGCGTATAGACGAAGCAAATGCTCTATTTTGTAAAATGCTGGACATTGGGTTAAATCCAGACCTGATATTGTACAATACTCTAATCCATGGATTTTGCAGTGTTGGTAATGTGGACGAAGGTTGCAATTTTGTAAAGAAGATGATTGAAAGCAGTATCATTCCAAACAATGTTACTCATCGTGCACTTGTCCTCGGATTTCAGAAAAAGAGAGTTATCAATCCAATAGCGAGTGCCACTTC
TAAGCTTCAAGAAATCTTGCTTGCATATAATCTTCAAATTGATGTCAATGGACATATCTAA

Protein sequence

MKSALSKISFCSKLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATNYNDDAVEQNYFGNEVQVSKGQKADEDAMKTIKLILGNHGFNLGSHPKQFDNVRILDILFEDSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLNYSINLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQEILLAYNLQIDVNGHI
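
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard nuclear genetic code and uses a hypothetical `translate` helper; it is illustrated on the first and last few codons of the CDS rather than on the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last four codons of the CDS listed above.
print(translate("ATGAAGAGTGCACTTTCAAAAATAAGT"))
print(translate("GGACATATCTAA"))
```

Running the same function over the complete CDS should reproduce the protein sequence, provided the standard code is the right table for this gene.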
BLAST of Lsi04G010270 vs. Swiss-Prot
Match: PP164_ARATH (Pentatricopeptide repeat-containing protein At2g19280 OS=Arabidopsis thaliana GN=At2g19280 PE=2 SV=2)

HSP 1 Score: 577.4 bits (1487), Expect = 2.1e-163
Identity = 295/673 (43.83%), Positives = 439/673 (65.23%), Query Frame = 1

Query: 10  FCSKLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATNYNDDAVEQNYFGNEVQV 69
           FC++    R   CR F+ A+ + ++  +  D       P + +    +  +++  + V +
Sbjct: 16  FCTRTKAFRYFWCRTFSLASLSENNSRFQTDSS---RLPYSGSRYYHSSSKHFGEDFVSI 75

Query: 70  SKGQKADEDAMKTIKLILGNHGF------NLGSHPKQFDNVRILDILFEDSSDARLCLHY 129
            K      D ++TI+ +L  H +         +   Q+  +RILD LFE++ DA + L++
Sbjct: 76  LKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVLYF 135

Query: 130 FKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYE 189
           F+WS        S  SI +M+HILV+GNMN+RAVD++  L K    EE    +++K L+E
Sbjct: 136 FRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDLFE 195

Query: 190 THKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQS 249
           T  +R+ LET  S+L+ C I+ER V  AL L  ++    IFPS  V  S+++ +L+ +  
Sbjct: 196 TRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVHGL 255

Query: 250 ELAWDLLEEMYRQGLSLNYSI-NLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTV 309
           ELA + +E M  +G  LN ++ +LF+  YC+ G   +GW++L+ ++++G +PD V +T  
Sbjct: 256 ELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFTVF 315

Query: 310 INSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLN 369
           I+ LCK   LKEAT+VLFK+  FG+S DSV++SSVIDG+CKVG  + A K++  FRL  N
Sbjct: 316 IDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGKPEEAIKLIHSFRLRPN 375

Query: 370 IFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYIC 429
           IF Y+SF++ +C  G+M RAS +F E+ E+GL+PDCV YTTMI GYC +   ++AF Y  
Sbjct: 376 IFVYSSFLSNICSTGDMLRASTIFQEIFELGLLPDCVCYTTMIDGYCNLGRTDKAFQYFG 435

Query: 430 KMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKG 489
            +LKSG  PSL T T+ I    + G +  AE +F+ M  EGLK DVVTYN LM GYGK  
Sbjct: 436 ALLKSGNPPSLTTSTILIGACSRFGSISDAESVFRNMKTEGLKLDVVTYNNLMHGYGKTH 495

Query: 490 YLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYT 549
            L+K FEL+D MRS  ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T
Sbjct: 496 QLNKVFELIDEMRSAGISPDVATYNILIHSMVVRGYIDEANEIISELIRRGFVPSTLAFT 555

Query: 550 NIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDI 609
           ++I G+SKRG+F+EAF+LW++M D  +KPDVVTCSALL GYC+ QR+++A  LF K+LD 
Sbjct: 556 DVIGGFSKRGDFQEAFILWFYMADLRMKPDVVTCSALLHGYCKAQRMEKAIVLFNKLLDA 615

Query: 610 GLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPI 669
           GL PD++LYNTLIHG+CSVG++++ C  +  M++  ++PN  TH ALVLG + KR +N  
Sbjct: 616 GLKPDVVLYNTLIHGYCSVGDIEKACELIGLMVQRGMLPNESTHHALVLGLEGKRFVNSE 675

Query: 670 ASATSKLQEILLA 676
             A+  L+EI++A
Sbjct: 676 THASMLLEEIIVA 685

BLAST of Lsi04G010270 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 2.5e-68
Identity = 134/387 (34.63%), Positives = 217/387 (56.07%), Query Frame = 1

Query: 273 AKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSV 332
           +K N+     V  E+      P+   Y  +I   C    +  A T+  KM   G  P+ V
Sbjct: 182 SKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVV 241

Query: 333 TLSSVIDGYCKVGMSDIACKILKYFRLP---LNIFTYNSFITRLCVEGNMERASEVFLEM 392
           T +++IDGYCK+   D   K+L+   L     N+ +YN  I  LC EG M+  S V  EM
Sbjct: 242 TYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEM 301

Query: 393 SEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDV 452
           +  G   D V+Y T+I GYCK  N ++A     +ML+ G+ PS+ITYT  I   CK G++
Sbjct: 302 NRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNM 361

Query: 453 EMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTL 512
             A     +M + GL P+  TY  L+DG+ +KGY+++A+ +L  M     +P VVTYN L
Sbjct: 362 NRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNAL 421

Query: 513 INGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCV 572
           ING  + G +++A  +L+++  +G S DVV+Y+ ++ G+ +  + +EA  +   M +  +
Sbjct: 422 INGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGI 481

Query: 573 KPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCN 632
           KPD +T S+L+ G+C ++R  EA  L+ +ML +GL PD   Y  LI+ +C  G++++   
Sbjct: 482 KPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQ 541

Query: 633 FVKKMIESSIIPNNVTHRALVLGFQKK 657
              +M+E  ++P+ VT+  L+ G  K+
Sbjct: 542 LHNEMVEKGVLPDVVTYSVLINGLNKQ 568

BLAST of Lsi04G010270 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 5.2e-66
Identity = 171/558 (30.65%), Positives = 269/558 (48.21%), Query Frame = 1

Query: 100 QFDNVRILDILFEDSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDL 159
           +F    ++ +L +   D RL L +F W+   SR + +LES+C ++H+ V       A  L
Sbjct: 84  KFKTDHLIWVLMKIKCDYRLVLDFFDWAR--SRRDSNLESLCIVIHLAVASKDLKVAQSL 143

Query: 160 MSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMK 219
           +S                      +  ER  L  T S + F  +          L+   K
Sbjct: 144 IS----------------------SFWERPKLNVTDSFVQFFDL----------LVYTYK 203

Query: 220 HLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLNY-SINLFVHHY---CAKG 279
                P   V+    Q L+       A  + E+M   GL L+  S N+++      C K 
Sbjct: 204 DWGSDPR--VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRLSKDCYK- 263

Query: 280 NLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLS 339
                  V  E    G   +   Y  VI+ +C++  +KEA  +L  M   G +PD ++ S
Sbjct: 264 -TATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 323

Query: 340 SVIDGYCKVGMSDIACKILKYFR---LPLNIFTYNSFITRLCVEGNMERASEVFLEMSEV 399
           +V++GYC+ G  D   K+++  +   L  N + Y S I  LC    +  A E F EM   
Sbjct: 324 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 383

Query: 400 GLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDVEMA 459
           G++PD V YTT+I G+CK  +I  A  +  +M    I P ++TYT  I  FC+ GD+  A
Sbjct: 384 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 443

Query: 460 EVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLING 519
             +F +M  +GL+PD VT+  L++GY K G++  AF + + M     +P+VVTY TLI+G
Sbjct: 444 GKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDG 503

Query: 520 LVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPD 579
           L   G L  A ++L E+ + G   ++ TY +I+ G  K GN EEA  L        +  D
Sbjct: 504 LCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNAD 563

Query: 580 VVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVK 639
            VT + L+  YC+   +D+A  +  +ML  GL P ++ +N L++GFC  G +++G   + 
Sbjct: 564 TVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLN 603

Query: 640 KMIESSIIPNNVTHRALV 651
            M+   I PN  T  +LV
Sbjct: 624 WMLAKGIAPNATTFNSLV 603

BLAST of Lsi04G010270 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 1.7e-61
Identity = 145/584 (24.83%), Positives = 291/584 (49.83%), Query Frame = 1

Query: 93  NLGSHPKQFDNVRILDILFEDSSDARLCLHYFKWSGC-LSRSNQSLESICKMMHILVTGN 152
           N+ +H  + + + ++++L+   +D  L   +    G        +  S+  M+HILV   
Sbjct: 68  NVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPNFKHTSLSLSAMIHILVRSG 127

Query: 153 MNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPA 212
              R  D  S L +           ++  L  T     + ++   +L+  Y++ R +  A
Sbjct: 128 ---RLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVFDLLIRTYVQARKLREA 187

Query: 213 LMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLN-YSINLFVHH 272
                 ++      S     ++I +L++    ELAW + +E+ R G+ +N Y++N+ V+ 
Sbjct: 188 HEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISRSGVGINVYTLNIMVNA 247

Query: 273 YCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPD 332
            C  G + +    L +++  G  PD V Y T+I++     L++EA  ++  M   G SP 
Sbjct: 248 LCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPGKGFSPG 307

Query: 333 SVTLSSVIDGYCKVGMSDIACKILKYF---RLPLNIFTYNSFITRLCVEGNMERASEVFL 392
             T ++VI+G CK G  + A ++        L  +  TY S +   C +G++    +VF 
Sbjct: 308 VYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLLMEACKKGDVVETEKVFS 367

Query: 393 EMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCG 452
           +M    +VPD V +++M+  + +  N+++A  Y   + ++G+ P  + YT+ I  +C+ G
Sbjct: 368 DMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKG 427

Query: 453 DVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYN 512
            + +A  +  +M+ +G   DVVTYN ++ G  K+  L +A +L + M    + PD  T  
Sbjct: 428 MISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLT 487

Query: 513 TLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDN 572
            LI+G    G L+ A ++  ++  +   +DVVTY  ++ G+ K G+ + A  +W  M   
Sbjct: 488 ILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSK 547

Query: 573 CVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEG 632
            + P  ++ S L++  C +  + EA  ++ +M+   + P +++ N++I G+C  GN  +G
Sbjct: 548 EILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDG 607

Query: 633 CNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQE 672
            +F++KMI    +P+ +++  L+ GF ++  ++       K++E
Sbjct: 608 ESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEE 648

BLAST of Lsi04G010270 vs. Swiss-Prot
Match: PP376_ARATH (Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidopsis thaliana GN=At5g12100 PE=2 SV=1)

HSP 1 Score: 238.4 bits (607), Expect = 2.3e-61
Identity = 169/553 (30.56%), Positives = 265/553 (47.92%), Query Frame = 1

Query: 146 ILVTGNMNHRAVDLMSHLAKT--YGSEEGFSTILLKLLYETHKERKTLETTCSMLV---- 205
           +L    M   A DL   L     Y S +   T+LL  L +T + R T+    ++L     
Sbjct: 118 LLNESKMISEAADLFFALRNEGIYPSSDSL-TLLLDHLVKTKQFRVTINVFLNILESDFR 177

Query: 206 ---FCY-------IKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDL 265
              F Y       +K   V   L L  +MKH  I+PS ++Y  +I  L +  +   A  L
Sbjct: 178 PSKFMYGKAIQAAVKLSDVGKGLELFNRMKHDRIYPSVFIYNVLIDGLCKGKRMNDAEQL 237

Query: 266 LEEMY-RQGLSLNYSINLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCK 325
            +EM  R+ L    + N  +  YC  GN  + +KV   ++    +P  + + T++  L K
Sbjct: 238 FDEMLARRLLPSLITYNTLIDGYCKAGNPEKSFKVRERMKADHIEPSLITFNTLLKGLFK 297

Query: 326 ISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYF---RLPLNIFT 385
             ++++A  VL +M   G  PD+ T S + DGY     ++ A  + +      + +N +T
Sbjct: 298 AGMVEDAENVLKEMKDLGFVPDAFTFSILFDGYSSNEKAEAALGVYETAVDSGVKMNAYT 357

Query: 386 YNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKML 445
            +  +  LC EG +E+A E+       GLVP+ V Y TMI GYC+  ++  A   I  M 
Sbjct: 358 CSILLNALCKEGKIEKAEEILGREMAKGLVPNEVIYNTMIDGYCRKGDLVGARMKIEAME 417

Query: 446 KSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLH 505
           K G++P  + Y   I +FC+ G++E AE    KM ++G+ P V TYNIL+ GYG+K    
Sbjct: 418 KQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVETYNILIGGYGRKYEFD 477

Query: 506 KAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNII 565
           K F++L  M      P+VV+Y TLIN L     L EA+ +  ++  RG S  V  Y  +I
Sbjct: 478 KCFDILKEMEDNGTMPNVVSYGTLINCLCKGSKLLEAQIVKRDMEDRGVSPKVRIYNMLI 537

Query: 566 YGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLN 625
            G   +G  E+AF     M    ++ ++VT + L+ G     ++ EA  L  ++   GL 
Sbjct: 538 DGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLSEAEDLLLEISRKGLK 597

Query: 626 PDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRV-INPIAS 678
           PD+  YN+LI G+   GNV       ++M  S I P   T+  L+    K+ + +     
Sbjct: 598 PDVFTYNSLISGYGFAGNVQRCIALYEEMKRSGIKPTLKTYHLLISLCTKEGIELTERLF 657

BLAST of Lsi04G010270 vs. TrEMBL
Match: A0A0A0KV38_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G581700 PE=4 SV=1)

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 558/686 (81.34%), Positives = 617/686 (89.94%), Query Frame = 1

Query: 1   MKSALSKISFCSKLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATNYNDDAVEQ 60
           M+SA S ISFCSKLNFRRK  CRY ATANS LSS N++D+ C        TNY+ ++ E+
Sbjct: 1   MRSAFSIISFCSKLNFRRKTPCRYSATANSELSSFNHMDEDC--------TNYDVNSDER 60

Query: 61  NYFGNEVQVSKGQKADEDAMKTIKLILGNHGFNLGSHPKQFDNVRILDILFEDSSDARLC 120
           +Y GNEV+VSKGQK DED M+TIKLILGN GFNLGS PKQ + +RILD+LFEDSSDA LC
Sbjct: 61  SYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIRILDVLFEDSSDAGLC 120

Query: 121 LHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKL 180
           L+YFKWSGCLS SNQSLESIC+M HILV GNMNHRAVDL+SHL K YG  EG S+ILLK+
Sbjct: 121 LYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKV 180

Query: 181 LYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQT 240
             ETH  RKTLETTCSM+V CYIKERMVT AL+L+ QMKHLNIFPS WVYKSVI+ALLQT
Sbjct: 181 FCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQT 240

Query: 241 NQSELAWDLLEEMYRQGLSLNYSINLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYT 300
           NQS +AWDLLEEM+RQG+SLNYSINLF+HHYC++GNLG+GWKVLLELRNFGSKPD VDYT
Sbjct: 241 NQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYT 300

Query: 301 TVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLP 360
           TVINSLCK+SLLKEAT +LFKM  FGVSPD VT+SS+IDG+CKVG SDIACKILKYFRLP
Sbjct: 301 TVINSLCKVSLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFRLP 360

Query: 361 LNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSY 420
           LNIF YNSFIT+L  EG+M +AS+VFLEM+EVGLVPDC+SYTTMIGGYCKV NIN AFSY
Sbjct: 361 LNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMIGGYCKVGNINIAFSY 420

Query: 421 ICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGK 480
           + KMLKSGIQPS+ITYTLF+D FC+C DVEMAEVMF+KMI+EGLKPDVV YNILMD YGK
Sbjct: 421 LSKMLKSGIQPSVITYTLFLDYFCECRDVEMAEVMFEKMIVEGLKPDVVVYNILMDAYGK 480

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGY+HKAF+LLDMMRSTNVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGFS+DVVT
Sbjct: 481 KGYMHKAFKLLDMMRSTNVTPDVVTYNTLINGLVMRGFLQEAKDILDELIRRGFSVDVVT 540

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKML 600
           YTNII+GYS RGNFEEAFLLWYHM +NCV PDVVTCSALLSGYCRE+R+DEANALFCKML
Sbjct: 541 YTNIIHGYSTRGNFEEAFLLWYHMAENCVTPDVVTCSALLSGYCREKRMDEANALFCKML 600

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVIN 660
           DIGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTHRALVLGFQKKRV +
Sbjct: 601 DIGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVTD 660

Query: 661 PIASATSKLQEILLAYNLQIDVNGHI 687
           PI SATSKLQEIL+AY+LQID  G+I
Sbjct: 661 PIQSATSKLQEILIAYDLQIDAIGYI 678

BLAST of Lsi04G010270 vs. TrEMBL
Match: M5XNQ0_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa021440mg PE=4 SV=1)

HSP 1 Score: 718.8 bits (1854), Expect = 6.4e-204
Identity = 367/671 (54.69%), Positives = 482/671 (71.83%), Query Frame = 1

Query: 13  KLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATN---------YNDD--AVEQN 72
           KL FRR+   RY+++ NSALSS+   +D   T E  VA +         Y  D   + + 
Sbjct: 16  KLIFRRRSTLRYYSSVNSALSSIILSEDETSTLEDTVAADNGIFLSAKSYPTDFRGINEL 75

Query: 73  YFGNE---------VQVSKGQKADEDAMKTIKLILGNHGFNLGS------HPKQFDNVRI 132
           Y G +            S  ++ DED MK + LIL   G+NLG       +  Q + + +
Sbjct: 76  YCGEDGVCEPVDTGFLFSINERPDEDEMKRLMLILAKRGWNLGCQNGYNIYLNQLNTIEL 135

Query: 133 LDILFEDSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKT 192
           L+ LFE+S DA+L L++FKWS C S S  +L++IC+M+HILV+GN+NHRAVDL+  L + 
Sbjct: 136 LNDLFEESFDAKLVLYFFKWSECCSGSKHTLQTICRMIHILVSGNLNHRAVDLILRLVRN 195

Query: 193 YGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPS 252
           +G EE  ++ LL++L ETH E + LETTCSMLV  YI+E MV  AL +  QMKHLNIFPS
Sbjct: 196 HGDEESCNS-LLEVLDETHSEIRVLETTCSMLVNGYIQEGMVNMALKIACQMKHLNIFPS 255

Query: 253 AWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLNYSI-NLFVHHYCAKGNLGRGWKVLL 312
                S          SELAWD LE M  +G+ LN ++ +LF++ YC++G+L  GWK+LL
Sbjct: 256 NGDQSS----------SELAWDFLEVMRTRGMGLNAAMMSLFINKYCSEGDLESGWKLLL 315

Query: 313 ELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVG 372
           E++N+G +PD V +T VINSLCK+S L EAT +LFKMT  G+SPD V LSS+IDG+CK+G
Sbjct: 316 EMKNYGIQPDVVSFTIVINSLCKMSYLNEATALLFKMTQLGISPDPVLLSSIIDGHCKLG 375

Query: 373 MSDIACKILKYFRLPLNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMI 432
            +++A  ILK F  PLNIF YNSFI++LC +GNM  AS +F EMS +GL+PDC  Y+T+I
Sbjct: 376 QTEVALSILKIFNTPLNIFIYNSFISKLCTDGNMAEASSLFHEMSMLGLLPDCFCYSTII 435

Query: 433 GGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLK 492
            GYCKV +I+RAF Y  KMLK+GI P + TYT  ID + K G++EMAE  F KMI EGL 
Sbjct: 436 DGYCKVRDIDRAFQYFGKMLKNGITPCVTTYTSLIDAYLKSGNMEMAEYSFHKMISEGLA 495

Query: 493 PDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDI 552
           PD+VT+N LMDG+G+KG+L K F LLDMM S+NV+PD+VTYNTLI+ LV RGF+ EAK+I
Sbjct: 496 PDIVTFNTLMDGFGRKGHLQKVFGLLDMMNSSNVSPDIVTYNTLIHSLVTRGFVIEAKEI 555

Query: 553 LDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCR 612
           L ELI+RGFS+DVVT+TN+I G+SK+GNFEEAF +W++M+++ VKPDVVTCSALL+GYCR
Sbjct: 556 LFELIKRGFSLDVVTFTNLIDGFSKKGNFEEAFFVWFYMSEHDVKPDVVTCSALLNGYCR 615

Query: 613 EQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVT 657
           E+RI+EAN LF KML+IGL PDLILYNTLIHG CS G++D+ C  +  MIE  I PNN+T
Sbjct: 616 ERRIEEANVLFHKMLNIGLRPDLILYNTLIHGHCSFGSMDDACTLISMMIEHGIFPNNIT 675

BLAST of Lsi04G010270 vs. TrEMBL
Match: W9R9T4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012212 PE=4 SV=1)

HSP 1 Score: 713.4 bits (1840), Expect = 2.7e-202
Identity = 359/692 (51.88%), Positives = 488/692 (70.52%), Query Frame = 1

Query: 19  KRGCRYFATANSALSSLNYVDDGCFTFE---------CPVATNYNDDAVEQNYF------ 78
           +R  RY+++ N AL+S + ++D C   E          P A   + +  E++        
Sbjct: 20  RRAFRYYSSRNFALTSTSQLEDSCLVSEDSDSAKDTKSPKANCNSCERRERDELSFDKKD 79

Query: 79  GNEVQVSK-----GQKADEDAMKTIKLILGNHGFNLGSHP------KQFDNVRILDILFE 138
           G+EV          QKA    +  I  +L N G++L S         + + +RI+D LFE
Sbjct: 80  GDEVAERDFLFLTNQKAKVREVGRITRVLKNRGWDLTSPNGYRVKLSEVNIIRIMDDLFE 139

Query: 139 DSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEG 198
           +SSDA L L++F WS     S  ++ S+C+M+HIL +GNM HRA+DL+ HL + Y  EE 
Sbjct: 140 ESSDAELALYFFTWSESRIGSKHTVRSVCRMIHILASGNMKHRAMDLILHLVRRYKEEES 199

Query: 199 FSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKS 258
           +S  LL++LYETH ER   E  CSMLV CYIKE+ +  AL L  Q+K  NIFPS  V  +
Sbjct: 200 YS-FLLEVLYETHTERMIFEIVCSMLVNCYIKEKCLNAALKLTCQLKQHNIFPSDRVSNA 259

Query: 259 VIQALLQTNQSELAWDLLEEMYRQGLSLNYS-INLFVHHYCAKGNLGRGWKVLLELRNFG 318
           +++ L+ + Q ELAWD LE +  +G+ LN S I+LF+H+YC +GN   GWK+L  +R++G
Sbjct: 260 MLRELIGSKQLELAWDWLEIIQSRGMGLNASTISLFIHYYCKEGNFESGWKLLCRMRDYG 319

Query: 319 SKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIAC 378
            KPD + YT +I++LCK+S   EAT+++FKMT  G+SPD+V +SS++DGY KVG  D A 
Sbjct: 320 VKPDVISYTIIIDALCKMSCPIEATSLVFKMTQLGISPDAVCVSSIVDGYSKVGRIDQAL 379

Query: 379 KILKYFRLPLNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKV 438
           KILK F  P NI+TYNSFI++LC++ NM +AS +F +M E+GL+PDC SYTT+IGGYCKV
Sbjct: 380 KILKIFNFPQNIYTYNSFISKLCLDCNMVKASSLFHQMIELGLLPDCFSYTTIIGGYCKV 439

Query: 439 ENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTY 498
            +  RAF Y+ +MLK G++PS+ TYT+ ID  CKCG++EMAE +F K+I + L PDVV Y
Sbjct: 440 GDSQRAFQYLGRMLKVGVKPSVATYTVLIDTCCKCGNMEMAECLFWKLIADDLMPDVVVY 499

Query: 499 NILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIR 558
           N LMDGYG+KG+L K FEL DMM+S+NV PDVVTYNTLI+ LVMRGF+ EA+D+ DEL  
Sbjct: 500 NSLMDGYGEKGHLQKVFELFDMMKSSNVCPDVVTYNTLIHSLVMRGFVNEAEDVFDELTE 559

Query: 559 RGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDE 618
           RGF  DVVT+T +I G+SK+GNFEEAFL+W++M+++ V+PDVVTCSA+L+GYCR  R++E
Sbjct: 560 RGFCPDVVTFTTLIDGFSKKGNFEEAFLVWFYMSEHRVEPDVVTCSAILNGYCRRHRMEE 619

Query: 619 ANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVL 678
           A ALF KML+IGL PDL LYN LI+GFCSVGN+D+ C+ + +M+E   +PN +THRALVL
Sbjct: 620 AKALFQKMLNIGLKPDLRLYNNLIYGFCSVGNMDDACDLISRMVEHDNLPNKLTHRALVL 679

Query: 679 GFQKKRVINPIASATSKLQEILLAYNLQIDVN 684
           GF+KKR  NP+ SA  KLQEILL Y + + VN
Sbjct: 680 GFEKKRAKNPVESADFKLQEILLRYGVHVVVN 710

BLAST of Lsi04G010270 vs. TrEMBL
Match: A0A061FRZ6_THECC (Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_044567 PE=4 SV=1)

HSP 1 Score: 702.2 bits (1811), Expect = 6.2e-199
Identity = 343/611 (56.14%), Positives = 453/611 (74.14%), Query Frame = 1

Query: 80  MKTIKLILGNHGFNLGSH---PKQFDNVRILDIL---FEDSSDARLCLHYFKWSGCLSRS 139
           +  IK IL   G+N+      P  F+   ++ IL   FE+S DA L L++FK S     S
Sbjct: 51  LSLIKSILWKRGWNINPDNLCPIDFNESSVIGILTHLFEESLDAELALYFFKLSERCVGS 110

Query: 140 NQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLET 199
             S++S+CKM+HILV+GNMNHRAVD +  L +   S++    +LLKL YETH +R  LET
Sbjct: 111 LHSVKSVCKMIHILVSGNMNHRAVDFILRLVRISCSKDVSEDLLLKLFYETHSDRMVLET 170

Query: 200 TCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEM 259
            CSMLV CYIKE  V  AL L  +MK  N+ PS  V  S+++ALL+ N+ +LAWD L++M
Sbjct: 171 VCSMLVDCYIKENEVGLALELACKMKSFNMIPSIGVCNSLLKALLELNELDLAWDFLDQM 230

Query: 260 YRQGLSLNYSI-NLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLL 319
            RQG  LN +I +LF+  YC KG L   W  L+E++N+G KPD V YT +I+SLCK+S L
Sbjct: 231 LRQGSGLNVAIVSLFIDKYCRKGQLLSAWTFLMEMKNYGIKPDVVAYTIIIDSLCKVSCL 290

Query: 320 KEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLNIFTYNSFITR 379
            EAT++LFK+T  G+SPDSV +SSV++G+CK G    A  ++ +F L  NIF YNSFI++
Sbjct: 291 GEATSLLFKITRLGISPDSVLVSSVVEGHCKAGKPKEAINVINFFNLKPNIFVYNSFISK 350

Query: 380 LCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPS 439
           LC +G+M  AS +F +M E+GL+PDCVSYTT+IGGYCK +++NRAF Y  KMLK GI+PS
Sbjct: 351 LCADGDMVEASLIFQDMFELGLLPDCVSYTTIIGGYCKDQDMNRAFQYFGKMLKCGIKPS 410

Query: 440 LITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLD 499
           + TYT+ ID  CK  D+EMAE +FQKMI+ GL PD+VT+N ++DGYGKKG+LHKAF LLD
Sbjct: 411 VTTYTVLIDACCKSEDLEMAECLFQKMIMAGLVPDIVTFNTVIDGYGKKGHLHKAFMLLD 470

Query: 500 MMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRG 559
           MMRS  ++PDV TYN +I+ L+ RGF  EAK ILDEL++RG S D+VT+TNII G SK+G
Sbjct: 471 MMRSAGISPDVTTYNIIIHSLIERGFTNEAKVILDELVQRGISPDMVTFTNIIDGLSKKG 530

Query: 560 NFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYN 619
           +FEEAFL+W++M++  VKPDVVTCSALL+GYCR +R++EAN LF +MLD+GLNPDL+LYN
Sbjct: 531 DFEEAFLIWFYMSERHVKPDVVTCSALLNGYCRARRMEEANTLFLRMLDVGLNPDLVLYN 590

Query: 620 TLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQEI 679
           TLIHGFC  GN+DE CN V  M+ + I+PNNVTH+A VLGF+KK V NP  SA  KLQ++
Sbjct: 591 TLIHGFCRTGNMDEACNLVTMMVRNGILPNNVTHQAFVLGFEKKWVKNPEESAALKLQQL 650

Query: 680 LLAYNLQIDVN 684
           LL +++ +DV+
Sbjct: 651 LLRHDIHVDVD 661

BLAST of Lsi04G010270 vs. TrEMBL
Match: A0A0B0MK88_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_18322 PE=4 SV=1)

HSP 1 Score: 699.9 bits (1805), Expect = 3.1e-198
Identity = 343/611 (56.14%), Positives = 451/611 (73.81%), Query Frame = 1

Query: 80  MKTIKLILGNHGFNLGSHP------KQFDNVRILDILFEDSSDARLCLHYFKWSGCLSRS 139
           M  IK IL   GFN+           + + +RIL+ LF++SS++ L LH+FK S     S
Sbjct: 53  MSMIKSILSKRGFNINPENLHAVDLNESNLIRILNDLFDESSNSELALHFFKLSEYCIGS 112

Query: 140 NQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLET 199
             S +S+CKM+HILV+GNMNH AVD + +L +    ++     LLKL YETH ++  L T
Sbjct: 113 LHSNKSVCKMIHILVSGNMNHIAVDFILYLVRVSVKKDVPEDELLKLFYETHTDKTVLRT 172

Query: 200 TCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEM 259
             SMLV CYI+E+    A  L  QMKH ++FPS  V  S+++ALL+ NQ +LAWD L++M
Sbjct: 173 VYSMLVDCYIREKKADLAFELTCQMKHFDMFPSVGVCNSLLKALLRLNQLDLAWDFLDQM 232

Query: 260 YRQGLSLNYSI-NLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLL 319
            RQG+ LN SI  LF++ YC KG+L   WK+L++++N+G KPD V YT +IN+LCK+S L
Sbjct: 233 MRQGIRLNVSIFTLFINMYCNKGHLLSAWKLLMDMKNYGIKPDVVAYTIIINTLCKMSCL 292

Query: 320 KEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLNIFTYNSFITR 379
            EAT++LFK+T FGV PDSV +SSV++GYCKVG    A  ++K+F L  NIF YNSFIT+
Sbjct: 293 GEATSMLFKITRFGVFPDSVLVSSVVEGYCKVGRPMEAMNVIKFFNLKPNIFVYNSFITK 352

Query: 380 LCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPS 439
            C EGNM +AS +F EM E+GL+PDCVSYTT+IGGYC+  ++ RAF Y  KMLK GI P+
Sbjct: 353 FCAEGNMVKASLIFQEMFELGLLPDCVSYTTIIGGYCRDRDMGRAFQYFGKMLKCGINPT 412

Query: 440 LITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLD 499
           + T+TL ID  CK  D+EMA+ +F KMI+EGL PDVVT+N ++DGYGK G LHKAF L+D
Sbjct: 413 VTTFTLLIDACCKSKDLEMADYLFHKMIMEGLVPDVVTFNTVIDGYGKMGLLHKAFMLVD 472

Query: 500 MMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRG 559
           MMRS  ++PDV TYN +I+ L+ RGF  EAKDILDEL+RRG S D VT+TNII G SK+G
Sbjct: 473 MMRSAGISPDVTTYNIIIDSLIKRGFTNEAKDILDELVRRGVSPDTVTFTNIIDGLSKKG 532

Query: 560 NFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYN 619
           +FEEAFL+W++M++  VKPDV+TCSALL+GYCRE+R+ EANALF +MLDIGL+PDL+LYN
Sbjct: 533 DFEEAFLVWFYMSECNVKPDVLTCSALLNGYCRERRMTEANALFVRMLDIGLSPDLVLYN 592

Query: 620 TLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQEI 679
           TLIHGFC +G++D+ CN V+ MI   I+PN  THRA +LGF KK V NP  +A  KLQ++
Sbjct: 593 TLIHGFCGIGDMDKACNLVEMMIRDGILPNKGTHRAFILGFGKKWVKNPEETAALKLQQL 652

Query: 680 LLAYNLQIDVN 684
           LL Y++ +DVN
Sbjct: 653 LLQYDIHVDVN 663
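
The percentages in each HSP summary are the match counts divided by the alignment length (for the hit above, 343/611 and 451/611). The following is a minimal Python sketch, assuming the two-line summary layout used on this page; the function name `parse_hsp_summary` and the dictionary keys are hypothetical and are not part of any BLAST tool. It parses one summary and recomputes the percentages so they can be checked against the printed values.

```python
import re

# Patterns for the two HSP summary lines as they appear on this page.
# "Posi?tives" accepts both "Positives" and the misspelling "Postives".
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
COUNTS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(score_line, counts_line):
    """Return bit score, E-value, and recomputed percentages for one HSP."""
    score = SCORE_RE.search(score_line)
    counts = COUNTS_RE.search(counts_line)
    if score is None or counts is None:
        raise ValueError("line does not match the expected HSP summary layout")
    identical, length = int(counts.group(1)), int(counts.group(2))
    positive = int(counts.group(3))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }


# Summary lines copied from the A0A0B0MK88_GOSAR hit.
hsp = parse_hsp_summary(
    "HSP 1 Score: 699.9 bits (1805), Expect = 3.1e-198",
    "Identity = 343/611 (56.14%), Positives = 451/611 (73.81%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

The recomputed values can be compared with the percentages printed in the summary line to catch copy or extraction errors.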

BLAST of Lsi04G010270 vs. TAIR10
Match: AT2G19280.1 (AT2G19280.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 577.4 bits (1487), Expect = 1.2e-164
Identity = 295/673 (43.83%), Positives = 439/673 (65.23%), Query Frame = 1

Query: 10  FCSKLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATNYNDDAVEQNYFGNEVQV 69
           FC++    R   CR F+ A+ + ++  +  D       P + +    +  +++  + V +
Sbjct: 16  FCTRTKAFRYFWCRTFSLASLSENNSRFQTDSS---RLPYSGSRYYHSSSKHFGEDFVSI 75

Query: 70  SKGQKADEDAMKTIKLILGNHGF------NLGSHPKQFDNVRILDILFEDSSDARLCLHY 129
            K      D ++TI+ +L  H +         +   Q+  +RILD LFE++ DA + L++
Sbjct: 76  LKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVLYF 135

Query: 130 FKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYE 189
           F+WS        S  SI +M+HILV+GNMN+RAVD++  L K    EE    +++K L+E
Sbjct: 136 FRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDLFE 195

Query: 190 THKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQS 249
           T  +R+ LET  S+L+ C I+ER V  AL L  ++    IFPS  V  S+++ +L+ +  
Sbjct: 196 TRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVHGL 255

Query: 250 ELAWDLLEEMYRQGLSLNYSI-NLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTV 309
           ELA + +E M  +G  LN ++ +LF+  YC+ G   +GW++L+ ++++G +PD V +T  
Sbjct: 256 ELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFTVF 315

Query: 310 INSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLN 369
           I+ LCK   LKEAT+VLFK+  FG+S DSV++SSVIDG+CKVG  + A K++  FRL  N
Sbjct: 316 IDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGKPEEAIKLIHSFRLRPN 375

Query: 370 IFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYIC 429
           IF Y+SF++ +C  G+M RAS +F E+ E+GL+PDCV YTTMI GYC +   ++AF Y  
Sbjct: 376 IFVYSSFLSNICSTGDMLRASTIFQEIFELGLLPDCVCYTTMIDGYCNLGRTDKAFQYFG 435

Query: 430 KMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKG 489
            +LKSG  PSL T T+ I    + G +  AE +F+ M  EGLK DVVTYN LM GYGK  
Sbjct: 436 ALLKSGNPPSLTTSTILIGACSRFGSISDAESVFRNMKTEGLKLDVVTYNNLMHGYGKTH 495

Query: 490 YLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYT 549
            L+K FEL+D MRS  ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T
Sbjct: 496 QLNKVFELIDEMRSAGISPDVATYNILIHSMVVRGYIDEANEIISELIRRGFVPSTLAFT 555

Query: 550 NIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDI 609
           ++I G+SKRG+F+EAF+LW++M D  +KPDVVTCSALL GYC+ QR+++A  LF K+LD 
Sbjct: 556 DVIGGFSKRGDFQEAFILWFYMADLRMKPDVVTCSALLHGYCKAQRMEKAIVLFNKLLDA 615

Query: 610 GLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPI 669
           GL PD++LYNTLIHG+CSVG++++ C  +  M++  ++PN  TH ALVLG + KR +N  
Sbjct: 616 GLKPDVVLYNTLIHGYCSVGDIEKACELIGLMVQRGMLPNESTHHALVLGLEGKRFVNSE 675

Query: 670 ASATSKLQEILLA 676
             A+  L+EI++A
Sbjct: 676 THASMLLEEIIVA 685

BLAST of Lsi04G010270 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 261.5 bits (667), Expect = 1.4e-69
Identity = 134/387 (34.63%), Positives = 217/387 (56.07%), Query Frame = 1

Query: 273 AKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSV 332
           +K N+     V  E+      P+   Y  +I   C    +  A T+  KM   G  P+ V
Sbjct: 182 SKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVV 241

Query: 333 TLSSVIDGYCKVGMSDIACKILKYFRLP---LNIFTYNSFITRLCVEGNMERASEVFLEM 392
           T +++IDGYCK+   D   K+L+   L     N+ +YN  I  LC EG M+  S V  EM
Sbjct: 242 TYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEM 301

Query: 393 SEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDV 452
           +  G   D V+Y T+I GYCK  N ++A     +ML+ G+ PS+ITYT  I   CK G++
Sbjct: 302 NRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNM 361

Query: 453 EMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTL 512
             A     +M + GL P+  TY  L+DG+ +KGY+++A+ +L  M     +P VVTYN L
Sbjct: 362 NRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNAL 421

Query: 513 INGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCV 572
           ING  + G +++A  +L+++  +G S DVV+Y+ ++ G+ +  + +EA  +   M +  +
Sbjct: 422 INGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGI 481

Query: 573 KPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCN 632
           KPD +T S+L+ G+C ++R  EA  L+ +ML +GL PD   Y  LI+ +C  G++++   
Sbjct: 482 KPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQ 541

Query: 633 FVKKMIESSIIPNNVTHRALVLGFQKK 657
              +M+E  ++P+ VT+  L+ G  K+
Sbjct: 542 LHNEMVEKGVLPDVVTYSVLINGLNKQ 568

BLAST of Lsi04G010270 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 253.8 bits (647), Expect = 3.0e-67
Identity = 171/558 (30.65%), Positives = 269/558 (48.21%), Query Frame = 1

Query: 100 QFDNVRILDILFEDSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDL 159
           +F    ++ +L +   D RL L +F W+   SR + +LES+C ++H+ V       A  L
Sbjct: 84  KFKTDHLIWVLMKIKCDYRLVLDFFDWAR--SRRDSNLESLCIVIHLAVASKDLKVAQSL 143

Query: 160 MSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMK 219
           +S                      +  ER  L  T S + F  +          L+   K
Sbjct: 144 IS----------------------SFWERPKLNVTDSFVQFFDL----------LVYTYK 203

Query: 220 HLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLNY-SINLFVHHY---CAKG 279
                P   V+    Q L+       A  + E+M   GL L+  S N+++      C K 
Sbjct: 204 DWGSDPR--VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRLSKDCYK- 263

Query: 280 NLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLS 339
                  V  E    G   +   Y  VI+ +C++  +KEA  +L  M   G +PD ++ S
Sbjct: 264 -TATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 323

Query: 340 SVIDGYCKVGMSDIACKILKYFR---LPLNIFTYNSFITRLCVEGNMERASEVFLEMSEV 399
           +V++GYC+ G  D   K+++  +   L  N + Y S I  LC    +  A E F EM   
Sbjct: 324 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 383

Query: 400 GLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDVEMA 459
           G++PD V YTT+I G+CK  +I  A  +  +M    I P ++TYT  I  FC+ GD+  A
Sbjct: 384 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 443

Query: 460 EVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLING 519
             +F +M  +GL+PD VT+  L++GY K G++  AF + + M     +P+VVTY TLI+G
Sbjct: 444 GKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDG 503

Query: 520 LVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPD 579
           L   G L  A ++L E+ + G   ++ TY +I+ G  K GN EEA  L        +  D
Sbjct: 504 LCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNAD 563

Query: 580 VVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVK 639
            VT + L+  YC+   +D+A  +  +ML  GL P ++ +N L++GFC  G +++G   + 
Sbjct: 564 TVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLN 603

Query: 640 KMIESSIIPNNVTHRALV 651
            M+   I PN  T  +LV
Sbjct: 624 WMLAKGIAPNATTFNSLV 603

BLAST of Lsi04G010270 vs. TAIR10
Match: AT5G01110.1 (AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 238.8 bits (608), Expect = 9.8e-63
Identity = 145/584 (24.83%), Positives = 291/584 (49.83%), Query Frame = 1

Query: 93  NLGSHPKQFDNVRILDILFEDSSDARLCLHYFKWSGC-LSRSNQSLESICKMMHILVTGN 152
           N+ +H  + + + ++++L+   +D  L   +    G        +  S+  M+HILV   
Sbjct: 68  NVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFPNFKHTSLSLSAMIHILVRSG 127

Query: 153 MNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPA 212
              R  D  S L +           ++  L  T     + ++   +L+  Y++ R +  A
Sbjct: 128 ---RLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVFDLLIRTYVQARKLREA 187

Query: 213 LMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLN-YSINLFVHH 272
                 ++      S     ++I +L++    ELAW + +E+ R G+ +N Y++N+ V+ 
Sbjct: 188 HEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISRSGVGINVYTLNIMVNA 247

Query: 273 YCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPD 332
            C  G + +    L +++  G  PD V Y T+I++     L++EA  ++  M   G SP 
Sbjct: 248 LCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAFELMNAMPGKGFSPG 307

Query: 333 SVTLSSVIDGYCKVGMSDIACKILKYF---RLPLNIFTYNSFITRLCVEGNMERASEVFL 392
             T ++VI+G CK G  + A ++        L  +  TY S +   C +G++    +VF 
Sbjct: 308 VYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLLMEACKKGDVVETEKVFS 367

Query: 393 EMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCG 452
           +M    +VPD V +++M+  + +  N+++A  Y   + ++G+ P  + YT+ I  +C+ G
Sbjct: 368 DMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDNVIYTILIQGYCRKG 427

Query: 453 DVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYN 512
            + +A  +  +M+ +G   DVVTYN ++ G  K+  L +A +L + M    + PD  T  
Sbjct: 428 MISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKLFNEMTERALFPDSYTLT 487

Query: 513 TLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDN 572
            LI+G    G L+ A ++  ++  +   +DVVTY  ++ G+ K G+ + A  +W  M   
Sbjct: 488 ILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSK 547

Query: 573 CVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEG 632
            + P  ++ S L++  C +  + EA  ++ +M+   + P +++ N++I G+C  GN  +G
Sbjct: 548 EILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDG 607

Query: 633 CNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQE 672
            +F++KMI    +P+ +++  L+ GF ++  ++       K++E
Sbjct: 608 ESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEE 648

BLAST of Lsi04G010270 vs. TAIR10
Match: AT5G12100.1 (AT5G12100.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 238.4 bits (607), Expect = 1.3e-62
Identity = 169/553 (30.56%), Positives = 265/553 (47.92%), Query Frame = 1

Query: 146 ILVTGNMNHRAVDLMSHLAKT--YGSEEGFSTILLKLLYETHKERKTLETTCSMLV---- 205
           +L    M   A DL   L     Y S +   T+LL  L +T + R T+    ++L     
Sbjct: 118 LLNESKMISEAADLFFALRNEGIYPSSDSL-TLLLDHLVKTKQFRVTINVFLNILESDFR 177

Query: 206 ---FCY-------IKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDL 265
              F Y       +K   V   L L  +MKH  I+PS ++Y  +I  L +  +   A  L
Sbjct: 178 PSKFMYGKAIQAAVKLSDVGKGLELFNRMKHDRIYPSVFIYNVLIDGLCKGKRMNDAEQL 237

Query: 266 LEEMY-RQGLSLNYSINLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCK 325
            +EM  R+ L    + N  +  YC  GN  + +KV   ++    +P  + + T++  L K
Sbjct: 238 FDEMLARRLLPSLITYNTLIDGYCKAGNPEKSFKVRERMKADHIEPSLITFNTLLKGLFK 297

Query: 326 ISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYF---RLPLNIFT 385
             ++++A  VL +M   G  PD+ T S + DGY     ++ A  + +      + +N +T
Sbjct: 298 AGMVEDAENVLKEMKDLGFVPDAFTFSILFDGYSSNEKAEAALGVYETAVDSGVKMNAYT 357

Query: 386 YNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKML 445
            +  +  LC EG +E+A E+       GLVP+ V Y TMI GYC+  ++  A   I  M 
Sbjct: 358 CSILLNALCKEGKIEKAEEILGREMAKGLVPNEVIYNTMIDGYCRKGDLVGARMKIEAME 417

Query: 446 KSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLH 505
           K G++P  + Y   I +FC+ G++E AE    KM ++G+ P V TYNIL+ GYG+K    
Sbjct: 418 KQGMKPDHLAYNCLIRRFCELGEMENAEKEVNKMKLKGVSPSVETYNILIGGYGRKYEFD 477

Query: 506 KAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNII 565
           K F++L  M      P+VV+Y TLIN L     L EA+ +  ++  RG S  V  Y  +I
Sbjct: 478 KCFDILKEMEDNGTMPNVVSYGTLINCLCKGSKLLEAQIVKRDMEDRGVSPKVRIYNMLI 537

Query: 566 YGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLN 625
            G   +G  E+AF     M    ++ ++VT + L+ G     ++ EA  L  ++   GL 
Sbjct: 538 DGCCSKGKIEDAFRFSKEMLKKGIELNLVTYNTLIDGLSMTGKLSEAEDLLLEISRKGLK 597

Query: 626 PDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRV-INPIAS 678
           PD+  YN+LI G+   GNV       ++M  S I P   T+  L+    K+ + +     
Sbjct: 598 PDVFTYNSLISGYGFAGNVQRCIALYEEMKRSGIKPTLKTYHLLISLCTKEGIELTERLF 657

BLAST of Lsi04G010270 vs. NCBI nr
Match: gi|778704309|ref|XP_011655513.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus])

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 558/686 (81.34%), Positives = 617/686 (89.94%), Query Frame = 1

Query: 1   MKSALSKISFCSKLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATNYNDDAVEQ 60
           M+SA S ISFCSKLNFRRK  CRY ATANS LSS N++D+ C        TNY+ ++ E+
Sbjct: 1   MRSAFSIISFCSKLNFRRKTPCRYSATANSELSSFNHMDEDC--------TNYDVNSDER 60

Query: 61  NYFGNEVQVSKGQKADEDAMKTIKLILGNHGFNLGSHPKQFDNVRILDILFEDSSDARLC 120
           +Y GNEV+VSKGQK DED M+TIKLILGN GFNLGS PKQ + +RILD+LFEDSSDA LC
Sbjct: 61  SYVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIRILDVLFEDSSDAGLC 120

Query: 121 LHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKL 180
           L+YFKWSGCLS SNQSLESIC+M HILV GNMNHRAVDL+SHL K YG  EG S+ILLK+
Sbjct: 121 LYYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKV 180

Query: 181 LYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQT 240
             ETH  RKTLETTCSM+V CYIKERMVT AL+L+ QMKHLNIFPS WVYKSVI+ALLQT
Sbjct: 181 FCETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQT 240

Query: 241 NQSELAWDLLEEMYRQGLSLNYSINLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYT 300
           NQS +AWDLLEEM+RQG+SLNYSINLF+HHYC++GNLG+GWKVLLELRNFGSKPD VDYT
Sbjct: 241 NQSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYT 300

Query: 301 TVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLP 360
           TVINSLCK+SLLKEAT +LFKM  FGVSPD VT+SS+IDG+CKVG SDIACKILKYFRLP
Sbjct: 301 TVINSLCKVSLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFRLP 360

Query: 361 LNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSY 420
           LNIF YNSFIT+L  EG+M +AS+VFLEM+EVGLVPDC+SYTTMIGGYCKV NIN AFSY
Sbjct: 361 LNIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMIGGYCKVGNINIAFSY 420

Query: 421 ICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGK 480
           + KMLKSGIQPS+ITYTLF+D FC+C DVEMAEVMF+KMI+EGLKPDVV YNILMD YGK
Sbjct: 421 LSKMLKSGIQPSVITYTLFLDYFCECRDVEMAEVMFEKMIVEGLKPDVVVYNILMDAYGK 480

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGY+HKAF+LLDMMRSTNVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGFS+DVVT
Sbjct: 481 KGYMHKAFKLLDMMRSTNVTPDVVTYNTLINGLVMRGFLQEAKDILDELIRRGFSVDVVT 540

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKML 600
           YTNII+GYS RGNFEEAFLLWYHM +NCV PDVVTCSALLSGYCRE+R+DEANALFCKML
Sbjct: 541 YTNIIHGYSTRGNFEEAFLLWYHMAENCVTPDVVTCSALLSGYCREKRMDEANALFCKML 600

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVIN 660
           DIGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTHRALVLGFQKKRV +
Sbjct: 601 DIGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVTD 660

Query: 661 PIASATSKLQEILLAYNLQIDVNGHI 687
           PI SATSKLQEIL+AY+LQID  G+I
Sbjct: 661 PIQSATSKLQEILIAYDLQIDAIGYI 678

BLAST of Lsi04G010270 vs. NCBI nr
Match: gi|659090263|ref|XP_008445921.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis melo])

HSP 1 Score: 1120.5 bits (2897), Expect = 0.0e+00
Identity = 544/686 (79.30%), Positives = 611/686 (89.07%), Query Frame = 1

Query: 1   MKSALSKISFCSKLNFRRKRGCRYFATANSALSSLNYVDDGCFTFECPVATNYNDDAVEQ 60
           MKSA S ISFCSKLNFRRK  CRYFATAN  LSS N++D+ C        TNY+ D+ E+
Sbjct: 1   MKSAFSIISFCSKLNFRRKTPCRYFATANYELSSFNHMDEDC--------TNYDVDSDER 60

Query: 61  NYFGNEVQVSKGQKADEDAMKTIKLILGNHGFNLGSHPKQFDNVRILDILFEDSSDARLC 120
           +YFGNEV+VSKG+K DED M+ IKLILGN GF LGS PKQ + VRILDILFEDSSD  LC
Sbjct: 61  SYFGNEVEVSKGKKTDEDKMEKIKLILGNRGFKLGSRPKQLETVRILDILFEDSSDPELC 120

Query: 121 LHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKL 180
           L+YFKWSGCLS SNQSLESIC+M HILV GN NH AVDL+SHL K YG +EG S+ILL++
Sbjct: 121 LYYFKWSGCLSGSNQSLESICRMAHILVAGNKNHGAVDLISHLVKNYGCKEGSSSILLEV 180

Query: 181 LYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQT 240
            Y+TH +RKTLETTC M++ CYIKE MVT A++L+ QM+ LN+FPS WVYKSVI+ALLQT
Sbjct: 181 FYDTHNKRKTLETTCGMMINCYIKEGMVTSAVILIDQMRRLNVFPSIWVYKSVIKALLQT 240

Query: 241 NQSELAWDLLEEMYRQGLSLNYSINLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYT 300
           N+ ++AWDLLEEM RQG+SL+YSINLF+HHYC++GNLG+GWKVLLELRNFGSKPD VDYT
Sbjct: 241 NRFDMAWDLLEEMQRQGISLHYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYT 300

Query: 301 TVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLP 360
           TVINSLCKISLLKEAT +LFKM  FGVSPD VT+SS+IDG+CKVG SDIACKILKYF++P
Sbjct: 301 TVINSLCKISLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFKIP 360

Query: 361 LNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSY 420
           LNIF YNSFIT L +EG+  +AS+VFLEMSEVGLVPDCVSYTTMIGGYCKV NIN AFSY
Sbjct: 361 LNIFIYNSFITELFMEGDTVKASKVFLEMSEVGLVPDCVSYTTMIGGYCKVGNINIAFSY 420

Query: 421 ICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGK 480
           + KMLKSGIQPS+ITYTLF+D FC+CGDVEMAEVMF+KMI+E LKPDVV YNILMD YGK
Sbjct: 421 LSKMLKSGIQPSVITYTLFVDYFCECGDVEMAEVMFEKMIVEDLKPDVVMYNILMDAYGK 480

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGY+HKAF+LLDMMRSTNVTPDVVTYN+LI+GLVMRGFL+EAKDILDELIRRGFSIDVVT
Sbjct: 481 KGYMHKAFQLLDMMRSTNVTPDVVTYNSLIHGLVMRGFLQEAKDILDELIRRGFSIDVVT 540

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKML 600
           YTNI++GYSKRGNFEEAFLLWYHM DNCV PDVVTCSALLSGYCR + +DEANALFC+ML
Sbjct: 541 YTNIMHGYSKRGNFEEAFLLWYHMADNCVTPDVVTCSALLSGYCRAKHMDEANALFCRML 600

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVIN 660
           DIGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTHRALVLGFQKKRV++
Sbjct: 601 DIGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVMD 660

Query: 661 PIASATSKLQEILLAYNLQIDVNGHI 687
           PI SATSKLQEIL+AY+LQID  GHI
Sbjct: 661 PIQSATSKLQEILIAYDLQIDAIGHI 678

BLAST of Lsi04G010270 vs. NCBI nr
Match: gi|645251570|ref|XP_008231739.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Prunus mume])

HSP 1 Score: 740.0 bits (1909), Expect = 3.9e-210
Identity = 359/609 (58.95%), Positives = 473/609 (77.67%), Query Frame = 1

Query: 85  LILGNHGFNLGS------HPKQFDNVRILDILFEDSSDARLCLHYFKWSGCLSRSNQSLE 144
           LIL   G+NLG       +  Q + + +L+ LFE+S DA+L L++FKWS C S S  +++
Sbjct: 2   LILAKRGWNLGCQNGYNIYLNQLNIIELLNDLFEESLDAKLVLYFFKWSECCSGSKHTIQ 61

Query: 145 SICKMMHILVTGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSML 204
           +IC+M+HILV+GN+NHRAVDL+ HL + +G EE  ++ LL++LYETH E + LETTCSML
Sbjct: 62  TICRMIHILVSGNLNHRAVDLILHLVRNHGDEESCNS-LLEVLYETHSEIRVLETTCSML 121

Query: 205 VFCYIKERMVTPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGL 264
           V  YI+E MV  AL +  QMKHLNIFPS  V  S++QALL + Q ELAWD LE M  +G+
Sbjct: 122 VNGYIQEGMVNMALKIACQMKHLNIFPSNGVCNSLLQALLGSKQLELAWDFLEVMRTRGM 181

Query: 265 SLNYSI-NLFVHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATT 324
            LN ++ +LF++ YC++G+L  GWK+LLE++N+G +PD V +T VINSLCK+S L EAT 
Sbjct: 182 GLNAAMMSLFINKYCSEGDLESGWKLLLEMKNYGIQPDVVSFTIVINSLCKMSYLNEATA 241

Query: 325 VLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLNIFTYNSFITRLCVEG 384
           +LFKMT  G+SPD V LSS+IDG+CK+G +++A  ILK F   LNIF YNSFI++LC +G
Sbjct: 242 LLFKMTQLGISPDPVLLSSIIDGHCKLGQTEVAISILKIFNTSLNIFIYNSFISKLCTDG 301

Query: 385 NMERASEVFLEMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYT 444
           NM  AS +F EMS +GL+PDC  Y+T+I GYCKV +I+RAF Y  KMLK+GI P + TYT
Sbjct: 302 NMVEASRLFHEMSLLGLLPDCFCYSTIIDGYCKVRDIDRAFQYFGKMLKNGITPCVTTYT 361

Query: 445 LFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRST 504
             ID +CK G++E AE  F KMI  GL PDVVT+N LMDG+G+KG+L K F LLDMM S+
Sbjct: 362 SLIDAYCKSGNMETAEYSFHKMISAGLAPDVVTFNTLMDGFGRKGHLQKVFGLLDMMNSS 421

Query: 505 NVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEA 564
           NV+PD+VTYNTLI+ LV RGF+ EAK+IL ELI+RGFS+DVVT+TN+I G+SK+GNFEEA
Sbjct: 422 NVSPDIVTYNTLIHSLVTRGFVIEAKEILFELIKRGFSLDVVTFTNLIDGFSKKGNFEEA 481

Query: 565 FLLWYHMTDNCVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHG 624
           F +W++M+++ VKPDVVTCSALL+GY RE+RI+EAN LF KML+IGL PDLILYNTLIHG
Sbjct: 482 FFVWFYMSEHDVKPDVVTCSALLNGYYRERRIEEANVLFHKMLNIGLRPDLILYNTLIHG 541

Query: 625 FCSVGNVDEGCNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQEILLAYN 684
            CS G++D+ C  +  MIE  I+PNN+TH+ALVLGF+KKRV+NP+ +A  KLQ+ILL Y 
Sbjct: 542 HCSFGSMDDACTLISMMIEHGILPNNITHQALVLGFRKKRVMNPVETANLKLQQILLKYG 601

Query: 685 LQIDVNGHI 687
           + +DV+ ++
Sbjct: 602 IHVDVDEYL 609

BLAST of Lsi04G010270 vs. NCBI nr
Match: gi|731415259|ref|XP_002272339.2| (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Vitis vinifera])

HSP 1 Score: 734.2 bits (1894), Expect = 2.1e-208
Identity = 377/691 (54.56%), Positives = 502/691 (72.65%), Query Frame = 1

Query: 18  RKRGCRYFATANSALSSLNYVDDGCFT--------------FEC-PVA-TNYNDDAVEQN 77
           R R  + F++AN ALSS   + D  F                +C PV    +N  +  +N
Sbjct: 21  RNRIPKKFSSANLALSSPTLMVDEVFNHNNSCCVDDDLLPNIKCIPVEYMEWNGLSSGEN 80

Query: 78  ----YFGNEVQVSKGQKADEDAMKTIKLILGNHGFNLGSHP------KQFDNVRILDILF 137
               Y   +  +S+ +KA +D M+ IK+IL N G+NLGS         QF+ ++IL+ LF
Sbjct: 81  DIFAYVDKDSLISENEKAVDDEMEIIKVILTNRGWNLGSQNGYRIDLSQFNVMKILNDLF 140

Query: 138 EDSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILVTGNMNHRAVDLMSHLAKTYGSEE 197
           E+S+DA L L++F+WS     S  ++ES+C M+HILV+GNMNH+A+DL+ HL      EE
Sbjct: 141 EESTDAALALYFFRWSEYCMGSKHTVESVCTMIHILVSGNMNHKAMDLLLHLISYNSGEE 200

Query: 198 GFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMVTPALMLMGQMKHLNIFPSAWVYK 257
           G+  I LK+ +ETH +R+ LET   MLV CY+KE M   AL L+ +M+HLNIFP   V  
Sbjct: 201 GWHNIFLKI-HETHTKRRVLETVYGMLVNCYVKENMTQVALKLICKMRHLNIFPLIGVCN 260

Query: 258 SVIQALLQTNQSELAWDLLEEMYRQGLSLNYSI-NLFVHHYCAKGNLGRGWKVLLELRNF 317
           S+++ALL++ Q  LAWD L+EM  QGL LN SI +LF+  YC++GN+  GWK+L+E++  
Sbjct: 261 SLLKALLESEQLNLAWDFLKEMKSQGLGLNASIISLFISGYCSQGNIDTGWKLLMEMKYL 320

Query: 318 GSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGVSPDSVTLSSVIDGYCKVGMSDIA 377
           G KPD V YT VI+SLCK+SLLKEAT++LFKMT  GV  DSV++SSV+DGYCKVG S+ A
Sbjct: 321 GIKPDVVAYTIVIDSLCKMSLLKEATSILFKMTQMGVFLDSVSVSSVVDGYCKVGKSEEA 380

Query: 378 CKILKYFRLPLNIFTYNSFITRLCVEGNMERASEVFLEMSEVGLVPDCVSYTTMIGGYCK 437
             +L+ F L  NIF +NSFI++LC +GNM +A++VF +M E+GL+PDC SYTTM+ GYCK
Sbjct: 381 MDVLEVFNLSPNIFVFNSFISKLCTDGNMLKAAKVFQDMCEMGLIPDCFSYTTMMAGYCK 440

Query: 438 VENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCGDVEMAEVMFQKMIIEGLKPDVVT 497
           V++I+ A  Y+ KMLK GI+PS+ TYTL ID  CK G++EMAE +FQ+MI EGL PDVV+
Sbjct: 441 VKDISNALKYLGKMLKRGIRPSVATYTLLIDSCCKPGNMEMAEYLFQRMITEGLVPDVVS 500

Query: 498 YNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELI 557
           YN LM+GYGKKG+L KAFELL MMRS  V+PD+VTYN LI+GL+ RG + EAKDILDEL 
Sbjct: 501 YNTLMNGYGKKGHLQKAFELLSMMRSAGVSPDLVTYNILIHGLIKRGLVNEAKDILDELT 560

Query: 558 RRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRID 617
           RRGFS DVVT+TNII G+S +GNFEEAFLL+++M+++ ++PDVVTCSALL+GYCR + + 
Sbjct: 561 RRGFSPDVVTFTNIIGGFSNKGNFEEAFLLFFYMSEHHLEPDVVTCSALLNGYCRTRCMA 620

Query: 618 EANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHRALV 677
           EAN LF KMLD GL  D+ILYN+LIHGFCS+GN+D+ C+ V  MIE  I+PNN+TH ALV
Sbjct: 621 EANVLFHKMLDAGLKADVILYNSLIHGFCSLGNIDDACHLVSMMIEHGIMPNNITHHALV 680

Query: 678 LGFQKKRVINPIASATSKLQEILLAYNLQID 682
           LG++KK V NP+  A  KLQ++LL Y +Q +
Sbjct: 681 LGYEKKCVENPVERAAFKLQQLLLKYGIQAE 710

BLAST of Lsi04G010270 vs. NCBI nr
Match: gi|1009118967|ref|XP_015876129.1| (PREDICTED: uncharacterized protein LOC107412823 [Ziziphus jujuba])

HSP 1 Score: 726.5 bits (1874), Expect = 4.4e-206
Identity = 371/719 (51.60%), Positives = 497/719 (69.12%), Query Frame = 1

Query: 1    MKSALSKISFCS-KLNFRRKRGCRYFATANSALSSLNYVD------------DGCFTFEC 60
            MK+    ++ CS +L    +R  RY++   SALSS++  D            DG      
Sbjct: 852  MKNLPLIVNICSTRLKLALRRTFRYYSFGKSALSSISQKDVCLISEDLVSDDDGMMLSAQ 911

Query: 61   PVATNYND------------DAVEQN--YFGNEVQVSKGQKADEDAMKTIKLILGNHGFN 120
                 YN+            + V++N  +F N        KA++  MK I  IL + G+N
Sbjct: 912  SSLARYNEYDEFSFSRKDVAEVVDKNSLFFNN-------LKAEDHDMKRIMTILTHRGWN 971

Query: 121  LGSHPKQFDN-----VRILDILFEDSSDARLCLHYFKWSGCLSRSNQSLESICKMMHILV 180
            + S   + D      +RI++ L+E+S DA L L++F W  C S S  ++ ++C+M+HILV
Sbjct: 972  ITSFMCRIDLNEIKIIRIINDLYEESLDATLALYFFNWLECCSGSKHAIRTVCRMIHILV 1031

Query: 181  TGNMNHRAVDLMSHLAKTYGSEEGFSTILLKLLYETHKERKTLETTCSMLVFCYIKERMV 240
            +GN+NHRA+D + HL + YG  E    +LLK+L ETH ER+ LET CSMLV CY+ E MV
Sbjct: 1032 SGNINHRAMDKILHLVRNYGEAESCD-LLLKILCETHTERRVLETACSMLVNCYVMENMV 1091

Query: 241  TPALMLMGQMKHLNIFPSAWVYKSVIQALLQTNQSELAWDLLEEMYRQGLSLN-YSINLF 300
              AL L   M++LNIFPS  V   ++  L+++ Q ELAW  LE M  +G+ LN +  +LF
Sbjct: 1092 NIALKLTCAMENLNIFPSVQVCNKLLNELVRSKQLELAWKFLEIMQSRGMGLNAFIFSLF 1151

Query: 301  VHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYTTVINSLCKISLLKEATTVLFKMTAFGV 360
            +H  C++ N+  GWK+LLE++N+G +PD V YT +I+SLCK+S L EAT++LFKM   G+
Sbjct: 1152 IHKCCSEYNIESGWKLLLEMKNYGIQPDVVSYTIIIDSLCKLSCLLEATSLLFKMMQLGI 1211

Query: 361  SPDSVTLSSVIDGYCKVGMSDIACKILKYFRLPLNIFTYNSFITRLCVEGNMERASEVFL 420
            SPD V +SSVIDG+CKVG  + A  ILK F LPLNIF YNSFI++ C +GNME+AS +F 
Sbjct: 1212 SPDPVLISSVIDGHCKVGEMEKAIDILKVFNLPLNIFVYNSFISKSCSDGNMEKASRLFH 1271

Query: 421  EMSEVGLVPDCVSYTTMIGGYCKVENINRAFSYICKMLKSGIQPSLITYTLFIDKFCKCG 480
            EMS +G +PDC SYTT++GGYCK  ++ +AF Y  KM+K G +PS+ TYTL ID  CK G
Sbjct: 1272 EMSVLGFLPDCFSYTTIVGGYCKAGDMKKAFQYFGKMIKGGTKPSVTTYTLLIDTCCKSG 1331

Query: 481  DVEMAEVMFQKMIIEGLKPDVVTYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYN 540
            ++EMAE +  KM+ EGL+PDVV YN LMDGYGKKG+L K FELL  M+S+NV  DVVTYN
Sbjct: 1332 NMEMAESLLHKMMTEGLQPDVVAYNTLMDGYGKKGHLQKVFELLGKMKSSNVCLDVVTYN 1391

Query: 541  TLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDN 600
            TLI+ L+ RGF+KEA +ILDELI RGFS DVVT+TN+I G+SK+GNFEEAFL+WY M+++
Sbjct: 1392 TLIHSLIKRGFIKEAGEILDELIERGFSPDVVTFTNVIDGFSKQGNFEEAFLVWYSMSEH 1451

Query: 601  CVKPDVVTCSALLSGYCREQRIDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEG 660
             VKPDVVTCSA+L+GYCRE+R++EA  LF  M+DIGLNPDL LYN LIHGFCSVGN+DE 
Sbjct: 1452 RVKPDVVTCSAILNGYCRERRMEEARRLFQDMIDIGLNPDLRLYNILIHGFCSVGNMDEA 1511

Query: 661  CNFVKKMIESSIIPNNVTHRALVLGFQKKRVINPIASATSKLQEILLAYNLQIDVNGHI 687
            CN V  M+E  I+PNN++H+AL+LGF+KK V NP+ +A  KLQEILL Y +  D + ++
Sbjct: 1512 CNLVSTMVEHGILPNNISHKALILGFEKKWVKNPVENAAFKLQEILLRYGIDFDTDEYL 1562

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
PP164_ARATH   2.1e-163  43.83     Pentatricopeptide repeat-containing protein At2g19280 OS=Arabidopsis thaliana GN...
PP407_ARATH   2.5e-68   34.63     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PPR12_ARATH   5.2e-66   30.65     Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
PP360_ARATH   1.7e-61   24.83     Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN...
PP376_ARATH   2.3e-61   30.56     Pentatricopeptide repeat-containing protein At5g12100, mitochondrial OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0KV38_CUCSA  0.0e+00   81.34     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G581700 PE=4 SV=1
M5XNQ0_PRUPE      6.4e-204  54.69     Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa021440mg PE=4 S...
W9R9T4_9ROSA      2.7e-202  51.88     Uncharacterized protein OS=Morus notabilis GN=L484_012212 PE=4 SV=1
A0A061FRZ6_THECC  6.2e-199  56.14     Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cac...
A0A0B0MK88_GOSAR  3.1e-198  56.14     Uncharacterized protein OS=Gossypium arboreum GN=F383_18322 PE=4 SV=1

Match Name   E-value   Identity  Description
AT2G19280.1  1.2e-164  43.83     Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  1.4e-69   34.63     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1  3.0e-67   30.65     Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G01110.1  9.8e-63   24.83     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G12100.1  1.3e-62   30.56     pentatricopeptide (PPR) repeat-containing protein

Match Name                         E-value   Identity  Description
gi|778704309|ref|XP_011655513.1|   0.0e+00   81.34     PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativu...
gi|659090263|ref|XP_008445921.1|   0.0e+00   79.30     PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis melo]
gi|645251570|ref|XP_008231739.1|   3.9e-210  58.95     PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Prunus mume]
gi|731415259|ref|XP_002272339.2|   2.1e-208  54.56     PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Vitis vinifera...
gi|1009118967|ref|XP_015876129.1|  4.4e-206  51.60     PREDICTED: uncharacterized protein LOC107412823 [Ziziphus jujuba]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi04G010270.1  Lsi04G010270.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 229..257  score: 0
IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 359..389  score: 1.6E-6
    coord: 603..634  score: 3.7
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 537..585  score: 3.0E-13
    coord: 396..445  score: 8.3E-14
    coord: 466..513  score: 1.2E-17
    coord: 294..343  score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 229..261  score: 6.1E-4
    coord: 299..331  score: 8.1E-5
    coord: 434..468  score: 1.3E-7
    coord: 469..503  score: 1.0E-9
    coord: 399..432  score: 5.4E-8
    coord: 539..573  score: 1.6E-7
    coord: 364..398  score: 1.0E-7
    coord: 610..642  score: 6.4E-8
    coord: 504..538  score: 5.2E-8
    coord: 574..607  score: 1.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 502..536  score: 11.74
    coord: 262..294  score: 6.248
    coord: 295..329  score: 10.041
    coord: 572..606  score: 12.54
    coord: 467..501  score: 13.0
    coord: 607..641  score: 11.849
    coord: 226..260  score: 9.109
    coord: 537..571  score: 11.356
    coord: 191..225  score: 7.136
    coord: 432..466  score: 11.597
    coord: 397..431  score: 11.619
    coord: 362..396  score: 11.915
    coord: 330..360  score: 7
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 470..612  score: 2.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 105..137  score: 4.2E-179
    coord: 196..649  score: 4.2E
None  No IPR available  unknown  SSF81901  HCP-like
    coord: 366..561  score: 2.7
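
The PS51375 (PROFILE) coordinates listed above can be put in order to see how the predicted PPR motifs are laid out along the protein. The Python sketch below is only an illustration: the coordinate pairs are copied from the table, and the helper name `summarize_hits` is hypothetical. It sorts the hits, computes each hit's length, and lists any residues left uncovered between consecutive hits.

```python
# PS51375 (PROFILE) hit coordinates copied from the InterPro table above,
# in the order they are listed there (1-based, inclusive).
PPR_PROFILE_HITS = [
    (502, 536), (262, 294), (295, 329), (572, 606), (467, 501),
    (607, 641), (226, 260), (537, 571), (191, 225), (432, 466),
    (397, 431), (362, 396), (330, 360),
]


def summarize_hits(hits):
    """Sort hits by start and report their lengths and the gaps between them."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    # A gap is any stretch of residues between one hit's end and the next hit's start.
    gaps = [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
        if next_start > prev_end + 1
    ]
    return ordered, lengths, gaps


ordered, lengths, gaps = summarize_hits(PPR_PROFILE_HITS)
print(len(ordered), "hits spanning", ordered[0][0], "to", ordered[-1][1])
print("lengths:", lengths)
print("uncovered residues between hits:", gaps)
```

The same helper could be applied to the TIGR00756 or PF13041 coordinates to compare how the different signature databases place the repeats.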

The following gene(s) are orthologous to this gene:
Gene          Orthologue      Organism                      Block
Lsi04G010270  Cla007287       Watermelon (97103) v1         lsiwmB306
Lsi04G010270  ClCG07G005030   Watermelon (Charleston Gray)  lsiwcgB297
Lsi04G010270  Cla97C07G133380 Watermelon (97103) v2         lsiwmbB298
Lsi04G010270  Bhi07G000424    Wax gourd                     lsiwgoB381
The following gene(s) are paralogous to this gene:

None