Lsi06G001870 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi06G001870
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat superfamily protein
Location: chr06 : 2034801 .. 2036852 (-)
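
The location field can be read programmatically. The following is a minimal sketch, assuming the `chromosome : start .. end (strand)` layout shown in this record and 1-based inclusive coordinates (the record does not state its coordinate convention, so that part is an assumption). The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr06 : 2034801 .. 2036852 (-)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumes 1-based inclusive coordinates: span = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("chr06 : 2034801 .. 2036852 (-)")
print(loc)
```

The `(-)` strand flag indicates that the feature lies on the reverse strand of chr06.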
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGCCTTTCCTTCATTTGCCTTGTTTCCCTCTCAAAAGGTTCATTTCACACACCTCAAAATCCTCCATACAAAATGCCCTGTTCAACCCACAACCCAATCTTCTCTCATTTCTCCAAGAATTTCCCCCTAACATTCTGTCTGTCAAATCCATACACGCCCAGATTATTATCACAAACGCCATATCTGGGGACCAATTTTTGGTTGCAAAGCTTGTTGCGGCATACTCAAGTTTGGGTTGTTTGGAAAATGCACGGAAAGTGTTTGATAAAATTCCTCAACCAAAAACTGTTCTTTACAATGCCATGGTTAATGGGTATCTCCAAAATGAGCATTATAATGACAGTATCCAGCTGCTTAAGATGATGAGTCGATGTGATTTGGAATTTGATAGTTATACTTGTAATTTTGCTCTTAAGGCATGCATGTTCTTATTGGATTATGAAATGGGGATGGAAGTGATTAGATTAGCTGTGTGTAAGGGGTTGGCTAGAGGTCGGTTTTTGGGAAGTTCTATTTTGAATTTTTTGGTGAAAACTGGTGATATTATGAGTGCACAAATTTTTTTTCATCAAATGGTTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTTGATGCAGGAAGGCTTGTTTAGTGAAGGATATAATGTGTTTCTTGATATGCTTTATAATAAAATTGAGCCTAGTGCTGTGACCATGACAAGCTTGATTCAATCCTGTGGGGAGATGAGGAATTTAAAGTTGGGAAAATGTATACATAGCTATGTTCTTGGATTTGGAATGAGTAGTGATACAAGAGTGCTTACCTCATTGATTGATATGTATTGTAAATCGGGTGACGTCAAAAGTGCTCGATGGATTTTCGATACCATGCCATCTAGGAATTTGGTCTCTTGGAATGTTATGATTTCTGGATATGTTCAAAATGGTTTGCTTGTTGAAACTTTACATCTCTTTCAGATGTTGGTTATGAATGATGGAGGTTTCGATTCAGGTACCGTTGTTAGCCTCATCCAGCTTTGTTCTCGCACGACTGATTTGGATGGCGGCAAGATTCTCCATGGTTGCATCTATCGAAGGGGACTTGATTTGAATTTGGTTTTGTCTACTGCAATTGTTGATCTATATGCTAAATGTAGATCCCTGGCCTATGCATCTTCTGTTTTTGAAAGAATGAAAAATAAGAATGTGATTTCATGGACTGCCATGCTTGTGGGATTGGCACAGAACGGGCATGCTAGAGATGCTTTAAAGTTATTTTATCAGATGCAAAATGAGAGGGTTACTTTCAATGCTCTCACCTTAGTTAGTTTACTCCATTGTTGTGCGCTCCTAGGCTTGTTACTTGAAGGGAGAAGTGTACATGCTATCTTAACTCGATTTCATTTTGCTTCCGAAGTTGTTGGTATGACTGCCCTCATTGATATGTATGCAAAATGCAGCAAAATAAACTCAGCTGAGAAGGTGTTCAAGTACGGTTTTATGCCCAAGGATGTGATACTATATAACGCAATGATTTCGGGCTATGGAATGCATGGTCTCGGGCATAAAGCACTGTGCGTCTACCATCAAATGAATCAAGAAGGACTTCAGCCAAATGAGAGCACCTTTGTTTCTCTGCTATCTGCTTGTAGTCATTCAGGCCTGGTGGAAGAGGGGATCTCTTTGTTTCGAAATATGGAGAAAGATCATAACGTAACACCTACCGATAAACTTTACGCTTGTTTTGTAGATCTTCTATGTCGAGCAGGTCGCCTCCAGCAAGCTGAGGAATTAATCAATCAAATGCCTTTCATACCAACCAGTGGCATACTTGAAACTCTGCTGAATGGATGTCTTTTGCACAATGACATTGAATTGGGTGTAAAATTTGCTGACAGATTACTCTCCTTGGAGTCTAGAAATCCGAGCATCTACGTTACCTTGTCGAATATATATGCCGAAGCAAGTCGATGGGATTCGGTAAAGTATGTTCGAGGTCTCATGAACGA
GCAAGAGCTTAAAAAGATTCCAGGATATAGCTCAATTGAAGTAAATATTTAG

mRNA sequence

ATGCCGCCTTTCCTTCATTTGCCTTGTTTCCCTCTCAAAAGGTTCATTTCACACACCTCAAAATCCTCCATACAAAATGCCCTGTTCAACCCACAACCCAATCTTCTCTCATTTCTCCAAGAATTTCCCCCTAACATTCTGTCTGTCAAATCCATACACGCCCAGATTATTATCACAAACGCCATATCTGGGGACCAATTTTTGGTTGCAAAGCTTGTTGCGGCATACTCAAGTTTGGGTTGTTTGGAAAATGCACGGAAAGTGTTTGATAAAATTCCTCAACCAAAAACTGTTCTTTACAATGCCATGGTTAATGGGTATCTCCAAAATGAGCATTATAATGACAGTATCCAGCTGCTTAAGATGATGAGTCGATGTGATTTGGAATTTGATAGTTATACTTGTAATTTTGCTCTTAAGGCATGCATGTTCTTATTGGATTATGAAATGGGGATGGAAGTGATTAGATTAGCTGTGTGTAAGGGGTTGGCTAGAGGTCGGTTTTTGGGAAGTTCTATTTTGAATTTTTTGGTGAAAACTGGTGATATTATGAGTGCACAAATTTTTTTTCATCAAATGGTTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTTGATGCAGGAAGGCTTGTTTAGTGAAGGATATAATGTGTTTCTTGATATGCTTTATAATAAAATTGAGCCTAGTGCTGTGACCATGACAAGCTTGATTCAATCCTGTGGGGAGATGAGGAATTTAAAGTTGGGAAAATGTATACATAGCTATGTTCTTGGATTTGGAATGAGTAGTGATACAAGAGTGCTTACCTCATTGATTGATATGTATTGTAAATCGGGTGACGTCAAAAGTGCTCGATGGATTTTCGATACCATGCCATCTAGGAATTTGGTCTCTTGGAATGTTATGATTTCTGGATATGTTCAAAATGGTTTGCTTGTTGAAACTTTACATCTCTTTCAGATGTTGGTTATGAATGATGGAGGTTTCGATTCAGGTACCGTTGTTAGCCTCATCCAGCTTTGTTCTCGCACGACTGATTTGGATGGCGGCAAGATTCTCCATGGTTGCATCTATCGAAGGGGACTTGATTTGAATTTGGTTTTGTCTACTGCAATTGTTGATCTATATGCTAAATGTAGATCCCTGGCCTATGCATCTTCTGTTTTTGAAAGAATGAAAAATAAGAATGTGATTTCATGGACTGCCATGCTTGTGGGATTGGCACAGAACGGGCATGCTAGAGATGCTTTAAAGTTATTTTATCAGATGCAAAATGAGAGGGTTACTTTCAATGCTCTCACCTTAGTTAGTTTACTCCATTGTTGTGCGCTCCTAGGCTTGTTACTTGAAGGGAGAAGTGTACATGCTATCTTAACTCGATTTCATTTTGCTTCCGAAGTTGTTGGTATGACTGCCCTCATTGATATGTATGCAAAATGCAGCAAAATAAACTCAGCTGAGAAGGTGTTCAAGTACGGTTTTATGCCCAAGGATGTGATACTATATAACGCAATGATTTCGGGCTATGGAATGCATGGTCTCGGGCATAAAGCACTGTGCGTCTACCATCAAATGAATCAAGAAGGACTTCAGCCAAATGAGAGCACCTTTGTTTCTCTGCTATCTGCTTGTAGTCATTCAGGCCTGGTGGAAGAGGGGATCTCTTTGTTTCGAAATATGGAGAAAGATCATAACGTAACACCTACCGATAAACTTTACGCTTGTTTTGTAGATCTTCTATGTCGAGCAGGTCGCCTCCAGCAAGCTGAGGAATTAATCAATCAAATGCCTTTCATACCAACCAGTGGCATACTTGAAACTCTGCTGAATGGATGTCTTTTGCACAATGACATTGAATTGGGTGTAAAATTTGCTGACAGATTACTCTCCTTGGAGTCTAGAAATCCGAGCATCTACGTTACCTTGTCGAATATATATGCCGAAGCAAGTCGATGGGATTCGGTAAAGTATGTTCGAGGTCTCATGAACGA
GCAAGAGCTTAAAAAGATTCCAGGATATAGCTCAATTGAAGTAAATATTTAG

Coding sequence (CDS)

ATGCCGCCTTTCCTTCATTTGCCTTGTTTCCCTCTCAAAAGGTTCATTTCACACACCTCAAAATCCTCCATACAAAATGCCCTGTTCAACCCACAACCCAATCTTCTCTCATTTCTCCAAGAATTTCCCCCTAACATTCTGTCTGTCAAATCCATACACGCCCAGATTATTATCACAAACGCCATATCTGGGGACCAATTTTTGGTTGCAAAGCTTGTTGCGGCATACTCAAGTTTGGGTTGTTTGGAAAATGCACGGAAAGTGTTTGATAAAATTCCTCAACCAAAAACTGTTCTTTACAATGCCATGGTTAATGGGTATCTCCAAAATGAGCATTATAATGACAGTATCCAGCTGCTTAAGATGATGAGTCGATGTGATTTGGAATTTGATAGTTATACTTGTAATTTTGCTCTTAAGGCATGCATGTTCTTATTGGATTATGAAATGGGGATGGAAGTGATTAGATTAGCTGTGTGTAAGGGGTTGGCTAGAGGTCGGTTTTTGGGAAGTTCTATTTTGAATTTTTTGGTGAAAACTGGTGATATTATGAGTGCACAAATTTTTTTTCATCAAATGGTTGAGAAAGATGTTGTTTGTTGGAATGTGATGATTGGTGGGTTGATGCAGGAAGGCTTGTTTAGTGAAGGATATAATGTGTTTCTTGATATGCTTTATAATAAAATTGAGCCTAGTGCTGTGACCATGACAAGCTTGATTCAATCCTGTGGGGAGATGAGGAATTTAAAGTTGGGAAAATGTATACATAGCTATGTTCTTGGATTTGGAATGAGTAGTGATACAAGAGTGCTTACCTCATTGATTGATATGTATTGTAAATCGGGTGACGTCAAAAGTGCTCGATGGATTTTCGATACCATGCCATCTAGGAATTTGGTCTCTTGGAATGTTATGATTTCTGGATATGTTCAAAATGGTTTGCTTGTTGAAACTTTACATCTCTTTCAGATGTTGGTTATGAATGATGGAGGTTTCGATTCAGGTACCGTTGTTAGCCTCATCCAGCTTTGTTCTCGCACGACTGATTTGGATGGCGGCAAGATTCTCCATGGTTGCATCTATCGAAGGGGACTTGATTTGAATTTGGTTTTGTCTACTGCAATTGTTGATCTATATGCTAAATGTAGATCCCTGGCCTATGCATCTTCTGTTTTTGAAAGAATGAAAAATAAGAATGTGATTTCATGGACTGCCATGCTTGTGGGATTGGCACAGAACGGGCATGCTAGAGATGCTTTAAAGTTATTTTATCAGATGCAAAATGAGAGGGTTACTTTCAATGCTCTCACCTTAGTTAGTTTACTCCATTGTTGTGCGCTCCTAGGCTTGTTACTTGAAGGGAGAAGTGTACATGCTATCTTAACTCGATTTCATTTTGCTTCCGAAGTTGTTGGTATGACTGCCCTCATTGATATGTATGCAAAATGCAGCAAAATAAACTCAGCTGAGAAGGTGTTCAAGTACGGTTTTATGCCCAAGGATGTGATACTATATAACGCAATGATTTCGGGCTATGGAATGCATGGTCTCGGGCATAAAGCACTGTGCGTCTACCATCAAATGAATCAAGAAGGACTTCAGCCAAATGAGAGCACCTTTGTTTCTCTGCTATCTGCTTGTAGTCATTCAGGCCTGGTGGAAGAGGGGATCTCTTTGTTTCGAAATATGGAGAAAGATCATAACGTAACACCTACCGATAAACTTTACGCTTGTTTTGTAGATCTTCTATGTCGAGCAGGTCGCCTCCAGCAAGCTGAGGAATTAATCAATCAAATGCCTTTCATACCAACCAGTGGCATACTTGAAACTCTGCTGAATGGATGTCTTTTGCACAATGACATTGAATTGGGTGTAAAATTTGCTGACAGATTACTCTCCTTGGAGTCTAGAAATCCGAGCATCTACGTTACCTTGTCGAATATATATGCCGAAGCAAGTCGATGGGATTCGGTAAAGTATGTTCGAGGTCTCATGAACGA
GCAAGAGCTTAAAAAGATTCCAGGATATAGCTCAATTGAAGTAAATATTTAG

Protein sequence

MPPFLHLPCFPLKRFISHTSKSSIQNALFNPQPNLLSFLQEFPPNILSVKSIHAQIIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIEVNI
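
The relationship between the coding sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below assumes the standard genetic code and only translates the first 18 and last 30 nucleotides of the CDS as printed in this record; `translate` is a hypothetical helper, not part of any database tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a nucleotide string codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

first = translate("ATGCCGCCTTTCCTTCAT")              # start of the CDS
last = translate("GGATATAGCTCAATTGAAGTAAATATTTAG")   # end of the CDS
print(first, last)
```

The two fragments agree with the start (`MPPFLH`) and the end (`GYSSIEVNI`) of the protein sequence, with the final `TAG` codon acting as the stop.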
BLAST of Lsi06G001870 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 2.0e-113
Identity = 224/630 (35.56%), Positives = 360/630 (57.14%), Query Frame = 1

Query: 58  ITNAISGDQFLV-----AKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEH 117
           + N I G+ F++     +KL   Y++ G L+ A +VFD++   K + +N ++N   ++  
Sbjct: 116 VDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGD 175

Query: 118 YNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSS 177
           ++ SI L K M    +E DSYT +   K+   L     G ++    +  G      +G+S
Sbjct: 176 FSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNS 235

Query: 178 ILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPS 237
           ++ F +K   + SA+  F +M E+DV+ WN +I G +  GL  +G +VF+ ML + IE  
Sbjct: 236 LVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEID 295

Query: 238 AVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFD 297
             T+ S+   C + R + LG+ +HS  +    S + R   +L+DMY K GD+ SA+ +F 
Sbjct: 296 LATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFR 355

Query: 298 TMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDG 357
            M  R++VS+  MI+GY + GL  E + LF+ +       D  TV +++  C+R   LD 
Sbjct: 356 EMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE 415

Query: 358 GKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQ 417
           GK +H  I    L  ++ +S A++D+YAKC S+  A  VF  M+ K++ISW  ++ G ++
Sbjct: 416 GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSK 475

Query: 418 NGHARDALKLF-YQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVV 477
           N +A +AL LF   ++ +R + +  T+  +L  CA L    +GR +H  + R  + S+  
Sbjct: 476 NCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH 535

Query: 478 GMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQE 537
              +L+DMYAKC  +  A  +F      KD++ +  MI+GYGMHG G +A+ +++QM Q 
Sbjct: 536 VANSLVDMYAKCGALLLAHMLFD-DIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQA 595

Query: 538 GLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQ 597
           G++ +E +FVSLL ACSHSGLV+EG   F  M  +  + PT + YAC VD+L R G L +
Sbjct: 596 GIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIK 655

Query: 598 AEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAE 657
           A   I  MP  P + I   LL GC +H+D++L  K A+++  LE  N   YV ++NIYAE
Sbjct: 656 AYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAE 715

Query: 658 ASRWDSVKYVRGLMNEQELKKIPGYSSIEV 682
           A +W+ VK +R  + ++ L+K PG S IE+
Sbjct: 716 AEKWEQVKRLRKRIGQRGLRKNPGCSWIEI 744
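
The `Match:` lines carry UniProt-style header fields: `OS` (organism), `GN` (gene name), `PE` (protein existence level) and `SV` (sequence version). A minimal sketch for splitting such a line into its parts follows; it assumes the exact `Match: ENTRY (description KEY=value ...)` layout used in this record, and `parse_match` is a hypothetical helper name.

```python
import re

def parse_match(line):
    """Split a 'Match: ENTRY (description OS=... GN=... PE=... SV=...)' line."""
    m = re.fullmatch(r"Match:\s+(\S+)\s+\((.*)\)\s*", line)
    if m is None:
        raise ValueError(f"unrecognized match line: {line!r}")
    entry, body = m.groups()
    # Break the body wherever whitespace is followed by a known KEY= tag.
    parts = re.split(r"\s+(?=(?:OS|GN|PE|SV)=)", body)
    fields = dict(p.split("=", 1) for p in parts[1:])
    return {"entry": entry, "description": parts[0], **fields}

hit = parse_match(
    "Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, "
    "chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)"
)
print(hit)
```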

BLAST of Lsi06G001870 vs. Swiss-Prot
Match: PP333_ARATH (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 1.1e-111
Identity = 212/619 (34.25%), Positives = 354/619 (57.19%), Query Frame = 1

Query: 65  DQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQLLKMMS 124
           ++F+ + L+ AY   G ++   K+FD++ Q   V++N M+NGY +    +  I+   +M 
Sbjct: 172 NEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGFSVMR 231

Query: 125 RCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVKTGDIM 184
              +  ++ T +  L  C   L  ++G+++  L V  G+     + +S+L+   K G   
Sbjct: 232 MDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFD 291

Query: 185 SAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSLIQSCG 244
            A   F  M   D V WN MI G +Q GL  E    F +M+ + + P A+T +SL+ S  
Sbjct: 292 DASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVS 351

Query: 245 EMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNLVSWNV 304
           +  NL+  K IH Y++   +S D  + ++LID Y K   V  A+ IF    S ++V +  
Sbjct: 352 KFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTA 411

Query: 305 MISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGCIYRRG 364
           MISGY+ NGL +++L +F+ LV      +  T+VS++ +      L  G+ LHG I ++G
Sbjct: 412 MISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKG 471

Query: 365 LDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFY 424
            D    +  A++D+YAKC  +  A  +FER+  ++++SW +M+   AQ+ +   A+ +F 
Sbjct: 472 FDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFR 531

Query: 425 QMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDMYAKCS 484
           QM    + ++ +++ + L  CA L     G+++H  + +   AS+V   + LIDMYAKC 
Sbjct: 532 QMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMYAKCG 591

Query: 485 KINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQM-NQEGLQPNESTFVSL 544
            + +A  VFK     K+++ +N++I+  G HG    +LC++H+M  + G++P++ TF+ +
Sbjct: 592 NLKAAMNVFK-TMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQITFLEI 651

Query: 545 LSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQMPFIP 604
           +S+C H G V+EG+  FR+M +D+ + P  + YAC VDL  RAGRL +A E +  MPF P
Sbjct: 652 ISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKSMPFPP 711

Query: 605 TSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVKYVRG 664
            +G+  TLL  C LH ++EL    + +L+ L+  N   YV +SN +A A  W+SV  VR 
Sbjct: 712 DAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESVTKVRS 771

Query: 665 LMNEQELKKIPGYSSIEVN 683
           LM E+E++KIPGYS IE+N
Sbjct: 772 LMKEREVQKIPGYSWIEIN 789
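
The percentages in an HSP summary line are simply the identical (or positive-scoring) columns divided by the alignment length. The sketch below recomputes them for the hit above; `parse_hsp_summary` is a hypothetical helper, and the regular expression is an assumption based on the line layout in this record.

```python
import re

def parse_hsp_summary(line):
    """Extract identity and positive counts and recompute the percentages."""
    # The second label is matched loosely (Pos\w+) so spelling variants parse.
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"unrecognized HSP summary: {line!r}")
    ident, length, pos, length2 = map(int, m.groups())
    if length != length2:
        raise ValueError("alignment lengths disagree")
    return {
        "alignment_length": length,
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
    }

summary = parse_hsp_summary(
    "Identity = 212/619 (34.25%), Positives = 354/619 (57.19%), Query Frame = 1"
)
print(summary)
```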

BLAST of Lsi06G001870 vs. Swiss-Prot
Match: PP405_ARATH (Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana GN=PCMP-E16 PE=2 SV=1)

HSP 1 Score: 405.2 bits (1040), Expect = 1.4e-111
Identity = 220/642 (34.27%), Positives = 360/642 (56.07%), Query Frame = 1

Query: 45  NILSVKSIHAQIIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMV 104
           +I   K++H  +I    +SG   +++ L   Y+  G +  ARK+F+++PQ   + YN ++
Sbjct: 30  SISKTKALHCHVITGGRVSGH--ILSTLSVTYALCGHITYARKLFEEMPQSSLLSYNIVI 89

Query: 105 NGYLQNEHYNDSIQL-LKMMSR-CDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKG 164
             Y++   Y+D+I + ++M+S       D YT  F  KA   L   ++G+ V    +   
Sbjct: 90  RMYVREGLYHDAISVFIRMVSEGVKCVPDGYTYPFVAKAAGELKSMKLGLVVHGRILRSW 149

Query: 165 LARGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFL 224
             R +++ +++L   +  G +  A+  F  M  +DV+ WN MI G  + G  ++   +F 
Sbjct: 150 FGRDKYVQNALLAMYMNFGKVEMARDVFDVMKNRDVISWNTMISGYYRNGYMNDALMMFD 209

Query: 225 DMLYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSG 284
            M+   ++    T+ S++  CG +++L++G+ +H  V    +     V  +L++MY K G
Sbjct: 210 WMVNESVDLDHATIVSMLPVCGHLKDLEMGRNVHKLVEEKRLGDKIEVKNALVNMYLKCG 269

Query: 285 DVKSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQ 344
            +  AR++FD M  R++++W  MI+GY ++G +   L L +++       ++ T+ SL+ 
Sbjct: 270 RMDEARFVFDRMERRDVITWTCMINGYTEDGDVENALELCRLMQFEGVRPNAVTIASLVS 329

Query: 345 LCSRTTDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVIS 404
           +C     ++ GK LHG   R+ +  ++++ T+++ +YAKC+ +     VF      +   
Sbjct: 330 VCGDALKVNDGKCLHGWAVRQQVYSDIIIETSLISMYAKCKRVDLCFRVFSGASKYHTGP 389

Query: 405 WTAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILT 464
           W+A++ G  QN    DAL LF +M+ E V  N  TL SLL   A L  L +  ++H  LT
Sbjct: 390 WSAIIAGCVQNELVSDALGLFKRMRREDVEPNIATLNSLLPAYAALADLRQAMNIHCYLT 449

Query: 465 RFHFASEVVGMTALIDMYAKCSKINSAEKVF---KYGFMPKDVILYNAMISGYGMHGLGH 524
           +  F S +   T L+ +Y+KC  + SA K+F   +     KDV+L+ A+ISGYGMHG GH
Sbjct: 450 KTGFMSSLDAATGLVHVYSKCGTLESAHKIFNGIQEKHKSKDVVLWGALISGYGMHGDGH 509

Query: 525 KALCVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACF 584
            AL V+ +M + G+ PNE TF S L+ACSHSGLVEEG++LFR M + +        Y C 
Sbjct: 510 NALQVFMEMVRSGVTPNEITFTSALNACSHSGLVEEGLTLFRFMLEHYKTLARSNHYTCI 569

Query: 585 VDLLCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNP 644
           VDLL RAGRL +A  LI  +PF PTS +   LL  C+ H +++LG   A++L  LE  N 
Sbjct: 570 VDLLGRAGRLDEAYNLITTIPFEPTSTVWGALLAACVTHENVQLGEMAANKLFELEPENT 629

Query: 645 SIYVTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIEV 682
             YV L+NIYA   RW  ++ VR +M    L+K PG+S+IE+
Sbjct: 630 GNYVLLANIYAALGRWKDMEKVRSMMENVGLRKKPGHSTIEI 669
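
In BLAST statistics the E-value and the bit score S' are related by E = m * n * 2^(-S'), where m * n is the effective search space. Because the Swiss-Prot hits share one query and one database, E * 2^S' should come out roughly constant across them. The sketch below checks this for the three Swiss-Prot HSPs listed so far; the size of the search space is inferred from the reported values, not stated in the record.

```python
import math

def log10_search_space(bit_score, evalue):
    """Return log10(m*n) implied by E = m*n * 2**(-S')."""
    return math.log10(evalue) + bit_score * math.log10(2)

# (bit score, E-value) pairs copied from the HSP lines above.
hsps = [(411.4, 2.0e-113), (405.6, 1.1e-111), (405.2, 1.4e-111)]
estimates = [log10_search_space(s, e) for s, e in hsps]
for (score, evalue), est in zip(hsps, estimates):
    print(f"{score} bits, E={evalue:g}: log10(search space) ~ {est:.2f}")
```

Small differences between the estimates are expected, since the E-values are printed with only two significant digits.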

BLAST of Lsi06G001870 vs. Swiss-Prot
Match: PP146_ARATH (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.9e-109
Identity = 230/639 (35.99%), Positives = 358/639 (56.03%), Query Frame = 1

Query: 45  NILSVKSIHAQIIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMV 104
           NI S++  H  ++  N + GD  +  KLV+ Y   G  ++AR VFD+IP+P   L+  M+
Sbjct: 56  NIDSLRQSHG-VLTGNGLMGDISIATKLVSLYGFFGYTKDARLVFDQIPEPDFYLWKVML 115

Query: 105 NGYLQNEHYNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLA 164
             Y  N+   + ++L  ++ +    +D    + ALKAC  L D + G + I   + K  +
Sbjct: 116 RCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNGKK-IHCQLVKVPS 175

Query: 165 RGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDM 224
               + + +L+   K G+I SA   F+ +  ++VVCW  MI G ++  L  EG  +F  M
Sbjct: 176 FDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKNDLCEEGLVLFNRM 235

Query: 225 LYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDV 284
             N +  +  T  +LI +C ++  L  GK  H  ++  G+   + ++TSL+DMY K GD+
Sbjct: 236 RENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIELSSCLVTSLLDMYVKCGDI 295

Query: 285 KSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLC 344
            +AR +F+     +LV W  MI GY  NG + E L LFQ +   +   +  T+ S++  C
Sbjct: 296 SNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPNCVTIASVLSGC 355

Query: 345 SRTTDLDGGKILHGCIYRRGL-DLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISW 404
               +L+ G+ +HG   + G+ D N+  + A+V +YAKC     A  VFE    K++++W
Sbjct: 356 GLIENLELGRSVHGLSIKVGIWDTNV--ANALVHMYAKCYQNRDAKYVFEMESEKDIVAW 415

Query: 405 TAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTR 464
            +++ G +QNG   +AL LF++M +E VT N +T+ SL   CA LG L  G S+HA   +
Sbjct: 416 NSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAVGSSLHAYSVK 475

Query: 465 FHF--ASEVVGMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKA 524
             F  +S V   TAL+D YAKC    SA  +F      K+ I ++AMI GYG  G    +
Sbjct: 476 LGFLASSSVHVGTALLDFYAKCGDPQSARLIFDT-IEEKNTITWSAMIGGYGKQGDTIGS 535

Query: 525 LCVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVD 584
           L ++ +M ++  +PNESTF S+LSAC H+G+V EG   F +M KD+N TP+ K Y C VD
Sbjct: 536 LELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFSSMYKDYNFTPSTKHYTCMVD 595

Query: 585 LLCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSI 644
           +L RAG L+QA ++I +MP  P        L+GC +H+  +LG     ++L L   + S 
Sbjct: 596 MLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCGMHSRFDLGEIVIKKMLDLHPDDASY 655

Query: 645 YVTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIE 681
           YV +SN+YA   RW+  K VR LM ++ L KI G+S++E
Sbjct: 656 YVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGHSTME 689

BLAST of Lsi06G001870 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 4.7e-107
Identity = 209/626 (33.39%), Positives = 338/626 (53.99%), Query Frame = 1

Query: 56  IIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYND 115
           ++  N +  + F   KLV+ +   G ++ A +VF+ I     VLY+ M+ G+ +    + 
Sbjct: 59  LVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDK 118

Query: 116 SIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILN 175
           ++Q    M   D+E   Y   + LK C    +  +G E+  L V  G +   F  + + N
Sbjct: 119 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 178

Query: 176 FLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVT 235
              K   +  A+  F +M E+D+V WN ++ G  Q G+      +   M    ++PS +T
Sbjct: 179 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 238

Query: 236 MTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMP 295
           + S++ +   +R + +GK IH Y +  G  S   + T+L+DMY K G +++AR +FD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 296 SRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKI 355
            RN+VSWN MI  YVQN    E + +FQ ++         +V+  +  C+   DL+ G+ 
Sbjct: 299 ERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRF 358

Query: 356 LHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGH 415
           +H      GLD N+ +  +++ +Y KC+ +  A+S+F +++++ ++SW AM++G AQNG 
Sbjct: 359 IHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGR 418

Query: 416 ARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTA 475
             DAL  F QM++  V  +  T VS++   A L +    + +H ++ R      V   TA
Sbjct: 419 PIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTA 478

Query: 476 LIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQP 535
           L+DMYAKC  I  A  +F      + V  +NAMI GYG HG G  AL ++ +M +  ++P
Sbjct: 479 LVDMYAKCGAIMIARLIFDM-MSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKP 538

Query: 536 NESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEEL 595
           N  TF+S++SACSHSGLVE G+  F  M++++++  +   Y   VDLL RAGRL +A + 
Sbjct: 539 NGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDF 598

Query: 596 INQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRW 655
           I QMP  P   +   +L  C +H ++    K A+RL  L   +   +V L+NIY  AS W
Sbjct: 599 IMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMW 658

Query: 656 DSVKYVRGLMNEQELKKIPGYSSIEV 682
           + V  VR  M  Q L+K PG S +E+
Sbjct: 659 EKVGQVRVSMLRQGLRKTPGCSMVEI 683

BLAST of Lsi06G001870 vs. TrEMBL
Match: A0A0A0LYM5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G573630 PE=4 SV=1)

HSP 1 Score: 1046.2 bits (2704), Expect = 1.7e-302
Identity = 520/578 (89.97%), Positives = 540/578 (93.43%), Query Frame = 1

Query: 103 MVNGYLQNEHYNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKG 162
           MVNGYLQNE YND I+LLKMMSRC LEFDSYTCNFALKACMFLLDYEMGMEVI LAVCKG
Sbjct: 1   MVNGYLQNERYNDCIELLKMMSRCHLEFDSYTCNFALKACMFLLDYEMGMEVIGLAVCKG 60

Query: 163 LARGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFL 222
           LA GRFLGSSILNFLVKTGDIM AQ FFHQMVEKDVVCWNVMIGG MQEGLF EGYN+FL
Sbjct: 61  LAGGRFLGSSILNFLVKTGDIMCAQFFFHQMVEKDVVCWNVMIGGFMQEGLFREGYNLFL 120

Query: 223 DMLYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSG 282
           DMLYNKIEPSAVTM SLIQSCGEMRNL  GKC+H +VLGFGMS DTRVLT+LIDMYCKSG
Sbjct: 121 DMLYNKIEPSAVTMISLIQSCGEMRNLTFGKCMHGFVLGFGMSRDTRVLTTLIDMYCKSG 180

Query: 283 DVKSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQ 342
           DV+SARWIF+ MPSRNLVSWNVMISGYVQNGLLVETL LFQ L+M+D GFDSGTVVSLIQ
Sbjct: 181 DVESARWIFENMPSRNLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQ 240

Query: 343 LCSRTTDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVIS 402
           LCSRT DLDGGKILHG IYRRGLDLNLVL TAIVDLYAKC SLAYASSVFERMKNKNVIS
Sbjct: 241 LCSRTADLDGGKILHGFIYRRGLDLNLVLPTAIVDLYAKCGSLAYASSVFERMKNKNVIS 300

Query: 403 WTAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILT 462
           WTAMLVGLAQNGHARDALKLF QMQNERVTFNALTLVSL++CC LLGLL EGRSVHA LT
Sbjct: 301 WTAMLVGLAQNGHARDALKLFDQMQNERVTFNALTLVSLVYCCTLLGLLREGRSVHATLT 360

Query: 463 RFHFASEVVGMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKAL 522
           RFHFASEVV MTALIDMYAKCSKINSAE VFKYG  PKDVILYN+MISGYGMHGLGHKAL
Sbjct: 361 RFHFASEVVVMTALIDMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKAL 420

Query: 523 CVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDL 582
           CVYH+MN+EGLQPNESTFVSLLSACSHSGLVEEGI+LF+NM KDHN TPTDKLYAC VDL
Sbjct: 421 CVYHRMNREGLQPNESTFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDL 480

Query: 583 LCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIY 642
           L RAGRL+QAEELINQMPF PTSGILETLLNGCLLH DIELGVK ADRLLSLESRNPSIY
Sbjct: 481 LSRAGRLRQAEELINQMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIY 540

Query: 643 VTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIE 681
           +TLSNIYA+ASRWDSVKYVRGLM EQE+KKIPGYSSIE
Sbjct: 541 ITLSNIYAKASRWDSVKYVRGLMMEQEIKKIPGYSSIE 578
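
The identity count of an HSP is the number of alignment columns in which the query and subject residues are the same. The sketch below counts them for the final block of the alignment above; `count_identities` is a hypothetical helper, and it assumes the two rows have already been extracted with equal length.

```python
def count_identities(query, sbjct):
    """Count columns where both rows carry the same residue (gaps excluded)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")

# Last Query/Sbjct block of the Cucumis sativus hit above.
query = "VTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIE"
sbjct = "ITLSNIYAKASRWDSVKYVRGLMMEQEIKKIPGYSSIE"
identical = count_identities(query, sbjct)
print(f"{identical}/{len(query)} identical columns")
```

Summing this count over every block of an HSP should reproduce the numerator of its `Identity` field.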

BLAST of Lsi06G001870 vs. TrEMBL
Match: A0A0D2PJ08_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G344400 PE=4 SV=1)

HSP 1 Score: 787.3 bits (2032), Expect = 1.5e-224
Identity = 396/682 (58.06%), Positives = 516/682 (75.66%), Query Frame = 1

Query: 4   FLHLPCFPLKRFISHTSKSSIQNALFNPQP----NLLSFLQEFPPNILSVKSIHAQIIIT 63
           FLH   F   +F+S  S SSI+NA FN  P      LS L EF   +LSVKSIHAQII T
Sbjct: 11  FLHKVPF---KFLSLHSFSSIKNANFNHLPLDIQRFLSLLHEFSNTLLSVKSIHAQII-T 70

Query: 64  NAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQL 123
           N++S  QFL + LV +YS LGCL  ARKVFDKIPQPK +L N+M+NGYL+N+ Y ++I+L
Sbjct: 71  NSVSKHQFLSSNLVRSYSELGCLGLARKVFDKIPQPKPILCNSMLNGYLRNQCYKETIEL 130

Query: 124 LKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVK 183
            ++M   D  FDSY+CNF LKACM L DYE G E+I+ AV   +   +FLGSS++NF +K
Sbjct: 131 FELMGASDWGFDSYSCNFVLKACMELEDYERGTEIIKRAVDHRVDGDKFLGSSMINFFMK 190

Query: 184 TGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSL 243
            GD  SA+  F+QM+ +DVVCWN MIGG ++   +   +++FL+M+   + PSA+TM SL
Sbjct: 191 FGDFNSARRVFNQMISRDVVCWNSMIGGYVKGCYYVAAFDLFLEMILCGVRPSAITMVSL 250

Query: 244 IQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNL 303
           +Q+CG MR+L+LGK +H  VL FG+ +D  V T+LIDMY K G+ + AR +FD MP++ L
Sbjct: 251 VQACGGMRDLELGKRVHGLVLVFGLGTDVLVHTALIDMYSKLGEHERARSVFDIMPAKTL 310

Query: 304 VSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGC 363
           VSWNV+ISGYVQN L+ E  +LFQ LV+  GGFDSGT++SL+Q C++  DL+ GK+LHG 
Sbjct: 311 VSWNVIISGYVQNCLVYEAFYLFQKLVLTGGGFDSGTIISLLQSCAQVADLESGKVLHGY 370

Query: 364 IYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDA 423
           I+R+GLD+N++L TA+VDLY+KC +L  A+ +F+RMKN+NVI+WTAMLVGLAQNGHA DA
Sbjct: 371 IFRKGLDINVILCTALVDLYSKCGALKEATFMFDRMKNRNVITWTAMLVGLAQNGHAEDA 430

Query: 424 LKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDM 483
           ++LF +MQ E VT N+ TLVSL+HCCA LG L +GRSVHA L R+ +A +VV  TALIDM
Sbjct: 431 IRLFGKMQEEGVTANSTTLVSLVHCCAHLGSLKKGRSVHARLLRYGYAFDVVNRTALIDM 490

Query: 484 YAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNEST 543
           YAKC  IN AE+VF+     KDVI +N+MI+GYGMHG GHKAL +Y +M +EGL+PN++T
Sbjct: 491 YAKCGNINYAERVFEDVSFFKDVISWNSMITGYGMHGQGHKALDLYRRMLEEGLKPNKTT 550

Query: 544 FVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQM 603
           FVSLLSACSHSGLV++G SLF +ME DHN+   +K YAC+VDLL RAG +++AE LI QM
Sbjct: 551 FVSLLSACSHSGLVDQGRSLFLSMESDHNIRANEKHYACYVDLLSRAGHIKEAEVLIKQM 610

Query: 604 PFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVK 663
           PF  +  + E LLNGC +H +I++G+K AD LLSL++ NP IY+ LSNIYAEA RWD+V 
Sbjct: 611 PFQSSREVFEALLNGCRMHKNIDIGIKAADYLLSLDATNPGIYIMLSNIYAEARRWDAVD 670

Query: 664 YVRGLMNEQELKKIPGYSSIEV 682
           ++RGLM  + LKK PGYS IEV
Sbjct: 671 HIRGLMRGRGLKKTPGYSLIEV 688

BLAST of Lsi06G001870 vs. TrEMBL
Match: A0A061F2D9_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_026441 PE=4 SV=1)

HSP 1 Score: 784.3 bits (2024), Expect = 1.2e-223
Identity = 391/686 (57.00%), Positives = 517/686 (75.36%), Query Frame = 1

Query: 3   PFLHLPCFPLK---RFISHTSKSSIQNALFNPQ----PNLLSFLQEFPPNILSVKSIHAQ 62
           P+LH   F  K   +F+S  S S+I+NA FN         L  LQEFP  +  +KSIHAQ
Sbjct: 4   PYLHTHHFFPKISFKFLSLHSFSTIKNANFNQTFPCFNKFLLLLQEFPNTLFCIKSIHAQ 63

Query: 63  IIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYND 122
           II TN+ S  QFL + LV  YS LGCL  ARKVFD+I QPK +L N+M+NGYL+N+ Y +
Sbjct: 64  II-TNSESRHQFLASNLVKGYSGLGCLAIARKVFDQISQPKPILCNSMLNGYLRNQCYKE 123

Query: 123 SIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILN 182
           +++L + M    LEFDSY+CN+ LKACM L D+E G EV++ AV + +   RFLGSS+++
Sbjct: 124 TVELFEFMGFLHLEFDSYSCNYVLKACMELEDFEKGKEVVQRAVDRRVDGDRFLGSSMIS 183

Query: 183 FLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVT 242
           F +K GD   A+  F++MV++DVVCWN MI G ++   + E   +F++M+   + PS +T
Sbjct: 184 FFMKFGDFDGARWVFNRMVDRDVVCWNSMISGYVKGCYYFEALGLFIEMILRGVRPSPIT 243

Query: 243 MTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMP 302
           M SL+Q+CG +R+L+LGKC+H +VLG GM SD  VLT+L+DMY K G+++SA  +FD++P
Sbjct: 244 MVSLVQACGGLRSLELGKCVHGFVLGLGMGSDILVLTALVDMYSKMGEIESAHLLFDSIP 303

Query: 303 SRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKI 362
           ++NLVSWNVMISGYVQN L+ ++  LF+ LV+  G FDSGT++SL+Q C++  DL+ GK+
Sbjct: 304 AKNLVSWNVMISGYVQNCLVSKSFDLFRELVITGGDFDSGTIISLLQCCAQIADLESGKV 363

Query: 363 LHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGH 422
           LHGCI+RRGLD+NL+LSTAIVDLY+KC ++  A+ VF+RMK++NVI+WTAMLVGLAQNG 
Sbjct: 364 LHGCIFRRGLDMNLILSTAIVDLYSKCGAVKEATFVFDRMKDRNVITWTAMLVGLAQNGK 423

Query: 423 ARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTA 482
           A DALKLF QMQ E V  N++TLV L+H CA LG L +GRSVHA L R  +  +VV  TA
Sbjct: 424 AEDALKLFNQMQEEGVAANSITLVGLVHSCAHLGSLKKGRSVHAQLFRHGYDFDVVNRTA 483

Query: 483 LIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQP 542
           LIDMYAKC KIN AE+V + G   KDVIL+N+MI+GYGMHG GHKAL ++ +M +EG++P
Sbjct: 484 LIDMYAKCGKINYAERVLRDGSFFKDVILWNSMITGYGMHGQGHKALDIFCRMLEEGVKP 543

Query: 543 NESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEEL 602
           +++TF+SLLSACSHSGLV +G SLF +ME DHN+ PT+K YAC+VDLL RAGRLQ+AE L
Sbjct: 544 SQTTFISLLSACSHSGLVNQGRSLFVSMESDHNIRPTEKHYACYVDLLSRAGRLQEAEAL 603

Query: 603 INQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRW 662
           I QMPF  +  + E LL+GC  H +I++G+K AD LLSL++ NP IYV LSNIYAEA RW
Sbjct: 604 IKQMPFQSSGAVFEALLSGCRTHKNIDIGIKAADHLLSLDATNPGIYVMLSNIYAEARRW 663

Query: 663 DSVKYVRGLMNEQELKKIPGYSSIEV 682
           D+V ++RGLM ++ LKK PGYS IEV
Sbjct: 664 DAVDHIRGLMKKRGLKKTPGYSLIEV 688

BLAST of Lsi06G001870 vs. TrEMBL
Match: I1ND66_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_20G013300 PE=4 SV=2)

HSP 1 Score: 736.9 bits (1901), Expect = 2.3e-209
Identity = 377/675 (55.85%), Positives = 493/675 (73.04%), Query Frame = 1

Query: 9   CFPLKRFISHTSKSSIQNA-LFNPQPNLLS-FLQEFPPNILSVKSIHAQIIITNAISGDQ 68
           C P+      ++  S+ +A  FN  P++ S  L +F   ++ VKSIHAQII  N +S + 
Sbjct: 21  CRPIYNAAPSSTFVSVHHAPFFNQAPSVFSSLLHQFSNTLIHVKSIHAQII-KNWVSTES 80

Query: 69  FLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQLLKMMSRC 128
           FL AKL+  YS LG L +AR VFD+   P+T + NAM+ G+L+N+ + +  +L +MM  C
Sbjct: 81  FLAAKLIRVYSDLGFLGHARNVFDQCSLPETAVCNAMIAGFLRNQQHMEVPRLFRMMGSC 140

Query: 129 DLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVKTGDIMSA 188
           D+E +SYTC FALKAC  LLD E+GME+IR AV +G     ++GSS++NFLVK G +  A
Sbjct: 141 DIEINSYTCMFALKACTDLLDDEVGMEIIRAAVRRGFHLHLYVGSSMVNFLVKRGYLADA 200

Query: 189 QIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSLIQSCGEM 248
           Q  F  M EKDVVCWN +IGG +Q+GLF E   +FL+M+   + PS VTM +L+++CG+ 
Sbjct: 201 QKVFDGMPEKDVVCWNSIIGGYVQKGLFWESIQMFLEMIGGGLRPSPVTMANLLKACGQS 260

Query: 249 RNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNLVSWNVMI 308
              K+G C HSYVL  GM +D  VLTSL+DMY   GD  SA  +FD+M SR+L+SWN MI
Sbjct: 261 GLKKVGMCAHSYVLALGMGNDVFVLTSLVDMYSNLGDTGSAALVFDSMCSRSLISWNAMI 320

Query: 309 SGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGCIYRRGLD 368
           SGYVQNG++ E+  LF+ LV +  GFDSGT+VSLI+ CS+T+DL+ G+ILH CI R+ L+
Sbjct: 321 SGYVQNGMIPESYALFRRLVQSGSGFDSGTLVSLIRGCSQTSDLENGRILHSCIIRKELE 380

Query: 369 LNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFYQM 428
            +LVLSTAIVD+Y+KC ++  A+ VF RM  KNVI+WTAMLVGL+QNG+A DALKLF QM
Sbjct: 381 SHLVLSTAIVDMYSKCGAIKQATIVFGRMGKKNVITWTAMLVGLSQNGYAEDALKLFCQM 440

Query: 429 QNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDMYAKCSKI 488
           Q E+V  N++TLVSL+HCCA LG L +GR+VHA   R  +A + V  +ALIDMYAKC KI
Sbjct: 441 QEEKVAANSVTLVSLVHCCAHLGSLTKGRTVHAHFIRHGYAFDAVITSALIDMYAKCGKI 500

Query: 489 NSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNESTFVSLLSA 548
           +SAEK+F   F  KDVIL N+MI GYGMHG G  AL VY +M +E L+PN++TFVSLL+A
Sbjct: 501 HSAEKLFNNEFHLKDVILCNSMIMGYGMHGHGRYALGVYSRMIEERLKPNQTTFVSLLTA 560

Query: 549 CSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQMPFIPTSG 608
           CSHSGLVEEG +LF +ME+DH+V P  K YAC VDL  RAGRL++A+EL+ QMPF P++ 
Sbjct: 561 CSHSGLVEEGKALFHSMERDHDVRPQHKHYACLVDLHSRAGRLEEADELVKQMPFQPSTD 620

Query: 609 ILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVKYVRGLMN 668
           +LE LL+GC  H +  +G++ ADRL+SL+  N  IYV LSNIYAEA +W+SV Y+RGLM 
Sbjct: 621 VLEALLSGCRTHKNTNMGIQIADRLISLDYLNSGIYVMLSNIYAEARKWESVNYIRGLMR 680

Query: 669 EQELKKIPGYSSIEV 682
            Q +KKIPGYS IEV
Sbjct: 681 MQGMKKIPGYSLIEV 694

BLAST of Lsi06G001870 vs. TrEMBL
Match: A0A072UNF3_MEDTR (Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_4g085110 PE=4 SV=1)

HSP 1 Score: 730.3 bits (1884), Expect = 2.1e-207
Identity = 370/666 (55.56%), Positives = 483/666 (72.52%), Query Frame = 1

Query: 18  HTSKSSIQNALFNPQPNLL--SFLQEFPPNILSVKSIHAQIIITNAISGDQFLVAKLVAA 77
           H   ++I+NA    QP+ +  S L+EF   ++ VKSIHAQII  N  S   FL  KL+  
Sbjct: 23  HAPFATIENASLFNQPSSIFSSLLREFSNTLIDVKSIHAQII-RNYASNQHFLATKLIKI 82

Query: 78  YSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQLLKMMSRCDLEFDSYTC 137
           YS+LG L  A KVFD+ P  +T+L NAM+ G+L+N  Y +  +L KMM   D+E +SYTC
Sbjct: 83  YSNLGFLNYAYKVFDQCPHRETILCNAMMGGFLKNMEYKEVPKLFKMMGLRDIELNSYTC 142

Query: 138 NFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVKTGDIMSAQIFFHQMVE 197
            F LKAC  LLD E+GME++R+AV KG      +GSS++NFLVK G++  A++ F  M E
Sbjct: 143 VFGLKACTVLLDDEVGMELVRMAVRKGFHLHPHVGSSMINFLVKCGNLNDARMVFDGMPE 202

Query: 198 KDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSLIQSCGEMRNLKLGKCI 257
           +DVVCWN +IGG +QEGL  E   +F++M+   I PS+VTM S++++CGE  + KLG C+
Sbjct: 203 RDVVCWNSIIGGYVQEGLLKEVIQLFVEMISCGIRPSSVTMASILKACGESGHKKLGTCV 262

Query: 258 HSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNLVSWNVMISGYVQNGLL 317
           H +VL  GM  D  VLTSL+DMYC  GD +SA  +F+ M SR+L+SWN MISG VQNG++
Sbjct: 263 HVFVLALGMGDDVFVLTSLVDMYCNVGDTESAFLVFNRMCSRSLISWNAMISGCVQNGMV 322

Query: 318 VETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGCIYRRGLDLNLVLSTAI 377
            E+  LF  LV +  GFDSGT+VSLI+ CS+T+DL+ GK+LH CI R+GL+ NLVLSTAI
Sbjct: 323 PESFSLFHKLVQSGDGFDSGTLVSLIRGCSQTSDLENGKVLHACIIRKGLESNLVLSTAI 382

Query: 378 VDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFYQMQNERVTFNA 437
           VD+Y+KC ++  AS VF  M+ +NVI+WTAMLVGL+QNG+A  ALKLF +MQ E V  N+
Sbjct: 383 VDMYSKCGAIKQASDVFRTMEKRNVITWTAMLVGLSQNGYAEGALKLFCRMQEENVAANS 442

Query: 438 LTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDMYAKCSKINSAEKVFKY 497
           +TLVSL+HCCA LG L +GRSVH  L R  +    V M+ALIDMYAKC KI+SAEK+F  
Sbjct: 443 VTLVSLVHCCAHLGSLKKGRSVHGHLIRHGYEFNAVNMSALIDMYAKCGKIHSAEKLFYN 502

Query: 498 GFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNESTFVSLLSACSHSGLVEE 557
           GF  KDVIL N+MI GYGMHG GH+AL VY +M  E L+PN++TFVS+L+ACSHSGLVEE
Sbjct: 503 GFHLKDVILCNSMIMGYGMHGQGHQALRVYDRMIDERLKPNQTTFVSMLTACSHSGLVEE 562

Query: 558 GISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQMPFIPTSGILETLLNGC 617
           G +LF  ME+ HN+ P+DK YACFVDLL RAG L++A  L+ Q+P  P+  +LE LL GC
Sbjct: 563 GRTLFHCMERVHNIKPSDKHYACFVDLLSRAGYLEEAYALVKQIPVEPSIDVLEALLGGC 622

Query: 618 LLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVKYVRGLMNEQELKKIPG 677
            +H +I +G++ ADRL+SL+  N  IYV LSNIY+EA RW+SV Y+RGLM ++ LKK P 
Sbjct: 623 RIHKNINMGIQIADRLISLDYLNTGIYVMLSNIYSEARRWESVNYIRGLMRKRGLKKTPA 682

Query: 678 YSSIEV 682
           +S  EV
Sbjct: 683 FSLTEV 687
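
The Identity count in an HSP header can be recomputed from the alignment rows themselves. The following is a minimal sketch and not part of the database output: the function name `count_identical` is hypothetical, and it assumes the query and subject rows have already been joined into two equal-length strings in which `-` marks a gap.

```python
def count_identical(query: str, sbjct: str) -> int:
    """Count aligned columns in which both rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    # A column counts as identical only if the residues match and it is not a gap.
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


# Last block of the Medicago truncatula alignment above.
identical = count_identical("YSSIEV", "FSLTEV")
```

Summing this count over all blocks of one HSP and dividing by the total number of aligned columns should give the percentage shown in the header.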

BLAST of Lsi06G001870 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 411.4 bits (1056), Expect = 1.1e-114
Identity = 224/630 (35.56%), Positives = 360/630 (57.14%), Query Frame = 1

Query: 58  ITNAISGDQFLV-----AKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEH 117
           + N I G+ F++     +KL   Y++ G L+ A +VFD++   K + +N ++N   ++  
Sbjct: 116 VDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGD 175

Query: 118 YNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSS 177
           ++ SI L K M    +E DSYT +   K+   L     G ++    +  G      +G+S
Sbjct: 176 FSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNS 235

Query: 178 ILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPS 237
           ++ F +K   + SA+  F +M E+DV+ WN +I G +  GL  +G +VF+ ML + IE  
Sbjct: 236 LVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEID 295

Query: 238 AVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFD 297
             T+ S+   C + R + LG+ +HS  +    S + R   +L+DMY K GD+ SA+ +F 
Sbjct: 296 LATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFR 355

Query: 298 TMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDG 357
            M  R++VS+  MI+GY + GL  E + LF+ +       D  TV +++  C+R   LD 
Sbjct: 356 EMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE 415

Query: 358 GKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQ 417
           GK +H  I    L  ++ +S A++D+YAKC S+  A  VF  M+ K++ISW  ++ G ++
Sbjct: 416 GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSK 475

Query: 418 NGHARDALKLF-YQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVV 477
           N +A +AL LF   ++ +R + +  T+  +L  CA L    +GR +H  + R  + S+  
Sbjct: 476 NCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRH 535

Query: 478 GMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQE 537
              +L+DMYAKC  +  A  +F      KD++ +  MI+GYGMHG G +A+ +++QM Q 
Sbjct: 536 VANSLVDMYAKCGALLLAHMLFD-DIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQA 595

Query: 538 GLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQ 597
           G++ +E +FVSLL ACSHSGLV+EG   F  M  +  + PT + YAC VD+L R G L +
Sbjct: 596 GIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIK 655

Query: 598 AEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAE 657
           A   I  MP  P + I   LL GC +H+D++L  K A+++  LE  N   YV ++NIYAE
Sbjct: 656 AYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAE 715

Query: 658 ASRWDSVKYVRGLMNEQELKKIPGYSSIEV 682
           A +W+ VK +R  + ++ L+K PG S IE+
Sbjct: 716 AEKWEQVKRLRKRIGQRGLRKNPGCSWIEI 744

BLAST of Lsi06G001870 vs. TAIR10
Match: AT4G21300.1 (AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 405.6 bits (1041), Expect = 6.1e-113
Identity = 212/619 (34.25%), Positives = 354/619 (57.19%), Query Frame = 1

Query: 65  DQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQLLKMMS 124
           ++F+ + L+ AY   G ++   K+FD++ Q   V++N M+NGY +    +  I+   +M 
Sbjct: 172 NEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGFSVMR 231

Query: 125 RCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVKTGDIM 184
              +  ++ T +  L  C   L  ++G+++  L V  G+     + +S+L+   K G   
Sbjct: 232 MDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFD 291

Query: 185 SAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSLIQSCG 244
            A   F  M   D V WN MI G +Q GL  E    F +M+ + + P A+T +SL+ S  
Sbjct: 292 DASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVS 351

Query: 245 EMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNLVSWNV 304
           +  NL+  K IH Y++   +S D  + ++LID Y K   V  A+ IF    S ++V +  
Sbjct: 352 KFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTA 411

Query: 305 MISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGCIYRRG 364
           MISGY+ NGL +++L +F+ LV      +  T+VS++ +      L  G+ LHG I ++G
Sbjct: 412 MISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKG 471

Query: 365 LDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDALKLFY 424
            D    +  A++D+YAKC  +  A  +FER+  ++++SW +M+   AQ+ +   A+ +F 
Sbjct: 472 FDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFR 531

Query: 425 QMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDMYAKCS 484
           QM    + ++ +++ + L  CA L     G+++H  + +   AS+V   + LIDMYAKC 
Sbjct: 532 QMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMYAKCG 591

Query: 485 KINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQM-NQEGLQPNESTFVSL 544
            + +A  VFK     K+++ +N++I+  G HG    +LC++H+M  + G++P++ TF+ +
Sbjct: 592 NLKAAMNVFK-TMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQITFLEI 651

Query: 545 LSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQMPFIP 604
           +S+C H G V+EG+  FR+M +D+ + P  + YAC VDL  RAGRL +A E +  MPF P
Sbjct: 652 ISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKSMPFPP 711

Query: 605 TSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVKYVRG 664
            +G+  TLL  C LH ++EL    + +L+ L+  N   YV +SN +A A  W+SV  VR 
Sbjct: 712 DAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESVTKVRS 771

Query: 665 LMNEQELKKIPGYSSIEVN 683
           LM E+E++KIPGYS IE+N
Sbjct: 772 LMKEREVQKIPGYSWIEIN 789

BLAST of Lsi06G001870 vs. TAIR10
Match: AT5G39350.1 (AT5G39350.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 405.2 bits (1040), Expect = 7.9e-113
Identity = 220/642 (34.27%), Positives = 360/642 (56.07%), Query Frame = 1

Query: 45  NILSVKSIHAQIIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMV 104
           +I   K++H  +I    +SG   +++ L   Y+  G +  ARK+F+++PQ   + YN ++
Sbjct: 30  SISKTKALHCHVITGGRVSGH--ILSTLSVTYALCGHITYARKLFEEMPQSSLLSYNIVI 89

Query: 105 NGYLQNEHYNDSIQL-LKMMSR-CDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKG 164
             Y++   Y+D+I + ++M+S       D YT  F  KA   L   ++G+ V    +   
Sbjct: 90  RMYVREGLYHDAISVFIRMVSEGVKCVPDGYTYPFVAKAAGELKSMKLGLVVHGRILRSW 149

Query: 165 LARGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFL 224
             R +++ +++L   +  G +  A+  F  M  +DV+ WN MI G  + G  ++   +F 
Sbjct: 150 FGRDKYVQNALLAMYMNFGKVEMARDVFDVMKNRDVISWNTMISGYYRNGYMNDALMMFD 209

Query: 225 DMLYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSG 284
            M+   ++    T+ S++  CG +++L++G+ +H  V    +     V  +L++MY K G
Sbjct: 210 WMVNESVDLDHATIVSMLPVCGHLKDLEMGRNVHKLVEEKRLGDKIEVKNALVNMYLKCG 269

Query: 285 DVKSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQ 344
            +  AR++FD M  R++++W  MI+GY ++G +   L L +++       ++ T+ SL+ 
Sbjct: 270 RMDEARFVFDRMERRDVITWTCMINGYTEDGDVENALELCRLMQFEGVRPNAVTIASLVS 329

Query: 345 LCSRTTDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVIS 404
           +C     ++ GK LHG   R+ +  ++++ T+++ +YAKC+ +     VF      +   
Sbjct: 330 VCGDALKVNDGKCLHGWAVRQQVYSDIIIETSLISMYAKCKRVDLCFRVFSGASKYHTGP 389

Query: 405 WTAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILT 464
           W+A++ G  QN    DAL LF +M+ E V  N  TL SLL   A L  L +  ++H  LT
Sbjct: 390 WSAIIAGCVQNELVSDALGLFKRMRREDVEPNIATLNSLLPAYAALADLRQAMNIHCYLT 449

Query: 465 RFHFASEVVGMTALIDMYAKCSKINSAEKVF---KYGFMPKDVILYNAMISGYGMHGLGH 524
           +  F S +   T L+ +Y+KC  + SA K+F   +     KDV+L+ A+ISGYGMHG GH
Sbjct: 450 KTGFMSSLDAATGLVHVYSKCGTLESAHKIFNGIQEKHKSKDVVLWGALISGYGMHGDGH 509

Query: 525 KALCVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACF 584
            AL V+ +M + G+ PNE TF S L+ACSHSGLVEEG++LFR M + +        Y C 
Sbjct: 510 NALQVFMEMVRSGVTPNEITFTSALNACSHSGLVEEGLTLFRFMLEHYKTLARSNHYTCI 569

Query: 585 VDLLCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNP 644
           VDLL RAGRL +A  LI  +PF PTS +   LL  C+ H +++LG   A++L  LE  N 
Sbjct: 570 VDLLGRAGRLDEAYNLITTIPFEPTSTVWGALLAACVTHENVQLGEMAANKLFELEPENT 629

Query: 645 SIYVTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIEV 682
             YV L+NIYA   RW  ++ VR +M    L+K PG+S+IE+
Sbjct: 630 GNYVLLANIYAALGRWKDMEKVRSMMENVGLRKKPGHSTIEI 669

BLAST of Lsi06G001870 vs. TAIR10
Match: AT2G03380.1 (AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 397.5 bits (1020), Expect = 1.6e-110
Identity = 230/639 (35.99%), Positives = 358/639 (56.03%), Query Frame = 1

Query: 45  NILSVKSIHAQIIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMV 104
           NI S++  H  ++  N + GD  +  KLV+ Y   G  ++AR VFD+IP+P   L+  M+
Sbjct: 56  NIDSLRQSHG-VLTGNGLMGDISIATKLVSLYGFFGYTKDARLVFDQIPEPDFYLWKVML 115

Query: 105 NGYLQNEHYNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLA 164
             Y  N+   + ++L  ++ +    +D    + ALKAC  L D + G + I   + K  +
Sbjct: 116 RCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQDLDNGKK-IHCQLVKVPS 175

Query: 165 RGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDM 224
               + + +L+   K G+I SA   F+ +  ++VVCW  MI G ++  L  EG  +F  M
Sbjct: 176 FDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGYVKNDLCEEGLVLFNRM 235

Query: 225 LYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDV 284
             N +  +  T  +LI +C ++  L  GK  H  ++  G+   + ++TSL+DMY K GD+
Sbjct: 236 RENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIELSSCLVTSLLDMYVKCGDI 295

Query: 285 KSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLC 344
            +AR +F+     +LV W  MI GY  NG + E L LFQ +   +   +  T+ S++  C
Sbjct: 296 SNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGVEIKPNCVTIASVLSGC 355

Query: 345 SRTTDLDGGKILHGCIYRRGL-DLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISW 404
               +L+ G+ +HG   + G+ D N+  + A+V +YAKC     A  VFE    K++++W
Sbjct: 356 GLIENLELGRSVHGLSIKVGIWDTNV--ANALVHMYAKCYQNRDAKYVFEMESEKDIVAW 415

Query: 405 TAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTR 464
            +++ G +QNG   +AL LF++M +E VT N +T+ SL   CA LG L  G S+HA   +
Sbjct: 416 NSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLGSLAVGSSLHAYSVK 475

Query: 465 FHF--ASEVVGMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKA 524
             F  +S V   TAL+D YAKC    SA  +F      K+ I ++AMI GYG  G    +
Sbjct: 476 LGFLASSSVHVGTALLDFYAKCGDPQSARLIFDT-IEEKNTITWSAMIGGYGKQGDTIGS 535

Query: 525 LCVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVD 584
           L ++ +M ++  +PNESTF S+LSAC H+G+V EG   F +M KD+N TP+ K Y C VD
Sbjct: 536 LELFEEMLKKQQKPNESTFTSILSACGHTGMVNEGKKYFSSMYKDYNFTPSTKHYTCMVD 595

Query: 585 LLCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSI 644
           +L RAG L+QA ++I +MP  P        L+GC +H+  +LG     ++L L   + S 
Sbjct: 596 MLARAGELEQALDIIEKMPIQPDVRCFGAFLHGCGMHSRFDLGEIVIKKMLDLHPDDASY 655

Query: 645 YVTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIE 681
           YV +SN+YA   RW+  K VR LM ++ L KI G+S++E
Sbjct: 656 YVLVSNLYASDGRWNQAKEVRNLMKQRGLSKIAGHSTME 689

BLAST of Lsi06G001870 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 390.2 bits (1001), Expect = 2.6e-108
Identity = 209/626 (33.39%), Positives = 338/626 (53.99%), Query Frame = 1

Query: 56  IIITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYND 115
           ++  N +  + F   KLV+ +   G ++ A +VF+ I     VLY+ M+ G+ +    + 
Sbjct: 59  LVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDK 118

Query: 116 SIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILN 175
           ++Q    M   D+E   Y   + LK C    +  +G E+  L V  G +   F  + + N
Sbjct: 119 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 178

Query: 176 FLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVT 235
              K   +  A+  F +M E+D+V WN ++ G  Q G+      +   M    ++PS +T
Sbjct: 179 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 238

Query: 236 MTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMP 295
           + S++ +   +R + +GK IH Y +  G  S   + T+L+DMY K G +++AR +FD M 
Sbjct: 239 IVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGML 298

Query: 296 SRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKI 355
            RN+VSWN MI  YVQN    E + +FQ ++         +V+  +  C+   DL+ G+ 
Sbjct: 299 ERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRF 358

Query: 356 LHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGH 415
           +H      GLD N+ +  +++ +Y KC+ +  A+S+F +++++ ++SW AM++G AQNG 
Sbjct: 359 IHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGR 418

Query: 416 ARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTA 475
             DAL  F QM++  V  +  T VS++   A L +    + +H ++ R      V   TA
Sbjct: 419 PIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTA 478

Query: 476 LIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQP 535
           L+DMYAKC  I  A  +F      + V  +NAMI GYG HG G  AL ++ +M +  ++P
Sbjct: 479 LVDMYAKCGAIMIARLIFDM-MSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKP 538

Query: 536 NESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEEL 595
           N  TF+S++SACSHSGLVE G+  F  M++++++  +   Y   VDLL RAGRL +A + 
Sbjct: 539 NGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDF 598

Query: 596 INQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRW 655
           I QMP  P   +   +L  C +H ++    K A+RL  L   +   +V L+NIY  AS W
Sbjct: 599 IMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMW 658

Query: 656 DSVKYVRGLMNEQELKKIPGYSSIEV 682
           + V  VR  M  Q L+K PG S +E+
Sbjct: 659 EKVGQVRVSMLRQGLRKTPGCSMVEI 683

BLAST of Lsi06G001870 vs. NCBI nr
Match: gi|659099713|ref|XP_008450740.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g18750, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1226.1 bits (3171), Expect = 0.0e+00
Identity = 614/686 (89.50%), Positives = 638/686 (93.00%), Query Frame = 1

Query: 1   MPPFLHLPCFPLKRFISHTSKSSIQNALFNPQPNL---LSFLQEFPPNILSVKSIHAQII 60
           MPPFLH PCFPLKRFISHTSKSS+QNALFNPQPNL   LSFLQE P NILSVKSIHAQII
Sbjct: 1   MPPFLHFPCFPLKRFISHTSKSSLQNALFNPQPNLQPFLSFLQECPHNILSVKSIHAQII 60

Query: 61  ITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSI 120
           ITN I GDQFLVAKLVAAYS LGCLE ARKVFD+IPQPKTVL NAMVNGYLQNEH+ND I
Sbjct: 61  ITNGIYGDQFLVAKLVAAYSGLGCLETARKVFDEIPQPKTVLCNAMVNGYLQNEHFNDCI 120

Query: 121 QLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFL 180
           +LL+MMSRC LEFDSYTCNFALKAC FLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFL
Sbjct: 121 ELLEMMSRCHLEFDSYTCNFALKACTFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFL 180

Query: 181 VKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMT 240
           VKTGDIM AQ FFHQM EKDVVCWNVMIGG MQEGLF EGYN+F DMLYNKIEPSAVTM 
Sbjct: 181 VKTGDIMCAQYFFHQMDEKDVVCWNVMIGGFMQEGLFREGYNLFFDMLYNKIEPSAVTMI 240

Query: 241 SLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSR 300
           SLIQSCGE RNLK GKC+HS+VLGFGMSSDTRVLT+LIDMYCKSGDV+SARWIFD MPSR
Sbjct: 241 SLIQSCGETRNLKFGKCMHSFVLGFGMSSDTRVLTTLIDMYCKSGDVESARWIFDNMPSR 300

Query: 301 NLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILH 360
           NLVSWNVMISGYVQNGLLVETL LFQ L+M+D GFDSGTVVSLIQLCSRT DLDGGKILH
Sbjct: 301 NLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKILH 360

Query: 361 GCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHAR 420
           GCIYRRGLDLNLVLSTAIVDLYAKC SLAYASSVFER+KNKNVISWTAMLVGLAQNGHAR
Sbjct: 361 GCIYRRGLDLNLVLSTAIVDLYAKCGSLAYASSVFERIKNKNVISWTAMLVGLAQNGHAR 420

Query: 421 DALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALI 480
           DALKLF QMQNERVTFN LTLVSL++CC LL LL EGRSVHA LTRFHFASEVV MTALI
Sbjct: 421 DALKLFDQMQNERVTFNVLTLVSLVYCCTLLRLLREGRSVHATLTRFHFASEVVVMTALI 480

Query: 481 DMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNE 540
           DMYAKCSKINSAE VFKYG  PKDVILYN+MISGYGMHGLGHKALCVYH+MN+EGLQPNE
Sbjct: 481 DMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPNE 540

Query: 541 STFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELIN 600
           STFVSLLSACSHSGLVEEGI+LF+NM KDHN TPTDKLYAC VDLL RAGRLQQAEELIN
Sbjct: 541 STFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLQQAEELIN 600

Query: 601 QMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDS 660
           QMPF PTSGILETLLNGCLLH DIELGVK ADRLLSLESRNPSIY+TLSNIYA+ASRWDS
Sbjct: 601 QMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWDS 660

Query: 661 VKYVRGLMNEQELKKIPGYSSIEVNI 684
           VK+VRGLM EQE+KKIPG SSIEVNI
Sbjct: 661 VKHVRGLMMEQEIKKIPGCSSIEVNI 686

BLAST of Lsi06G001870 vs. NCBI nr
Match: gi|778662656|ref|XP_011659934.1| (PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 1225.7 bits (3170), Expect = 0.0e+00
Identity = 615/687 (89.52%), Positives = 638/687 (92.87%), Query Frame = 1

Query: 1   MPPFLHLPCFPLKRFISHTSKSSIQNALFNPQPNLL----SFLQEFPPNILSVKSIHAQI 60
           MPPFLH PCFPLKRFISHTSKSS+QNALFN QPNLL    SFLQEF  N+LSVKSIHAQI
Sbjct: 1   MPPFLHFPCFPLKRFISHTSKSSLQNALFNAQPNLLQPFLSFLQEFSHNLLSVKSIHAQI 60

Query: 61  IITNAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDS 120
           IITN I GDQFLVAKLVAAYSSLGCLENARKVFD+IPQPKTVL NAMVNGYLQNE YND 
Sbjct: 61  IITNPIYGDQFLVAKLVAAYSSLGCLENARKVFDEIPQPKTVLCNAMVNGYLQNERYNDC 120

Query: 121 IQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNF 180
           I+LLKMMSRC LEFDSYTCNFALKACMFLLDYEMGMEVI LAVCKGLA GRFLGSSILNF
Sbjct: 121 IELLKMMSRCHLEFDSYTCNFALKACMFLLDYEMGMEVIGLAVCKGLAGGRFLGSSILNF 180

Query: 181 LVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTM 240
           LVKTGDIM AQ FFHQMVEKDVVCWNVMIGG MQEGLF EGYN+FLDMLYNKIEPSAVTM
Sbjct: 181 LVKTGDIMCAQFFFHQMVEKDVVCWNVMIGGFMQEGLFREGYNLFLDMLYNKIEPSAVTM 240

Query: 241 TSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPS 300
            SLIQSCGEMRNL  GKC+H +VLGFGMS DTRVLT+LIDMYCKSGDV+SARWIF+ MPS
Sbjct: 241 ISLIQSCGEMRNLTFGKCMHGFVLGFGMSRDTRVLTTLIDMYCKSGDVESARWIFENMPS 300

Query: 301 RNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKIL 360
           RNLVSWNVMISGYVQNGLLVETL LFQ L+M+D GFDSGTVVSLIQLCSRT DLDGGKIL
Sbjct: 301 RNLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQLCSRTADLDGGKIL 360

Query: 361 HGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHA 420
           HG IYRRGLDLNLVL TAIVDLYAKC SLAYASSVFERMKNKNVISWTAMLVGLAQNGHA
Sbjct: 361 HGFIYRRGLDLNLVLPTAIVDLYAKCGSLAYASSVFERMKNKNVISWTAMLVGLAQNGHA 420

Query: 421 RDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTAL 480
           RDALKLF QMQNERVTFNALTLVSL++CC LLGLL EGRSVHA LTRFHFASEVV MTAL
Sbjct: 421 RDALKLFDQMQNERVTFNALTLVSLVYCCTLLGLLREGRSVHATLTRFHFASEVVVMTAL 480

Query: 481 IDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPN 540
           IDMYAKCSKINSAE VFKYG  PKDVILYN+MISGYGMHGLGHKALCVYH+MN+EGLQPN
Sbjct: 481 IDMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKALCVYHRMNREGLQPN 540

Query: 541 ESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELI 600
           ESTFVSLLSACSHSGLVEEGI+LF+NM KDHN TPTDKLYAC VDLL RAGRL+QAEELI
Sbjct: 541 ESTFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDLLSRAGRLRQAEELI 600

Query: 601 NQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWD 660
           NQMPF PTSGILETLLNGCLLH DIELGVK ADRLLSLESRNPSIY+TLSNIYA+ASRWD
Sbjct: 601 NQMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIYITLSNIYAKASRWD 660

Query: 661 SVKYVRGLMNEQELKKIPGYSSIEVNI 684
           SVKYVRGLM EQE+KKIPGYSSIEVNI
Sbjct: 661 SVKYVRGLMMEQEIKKIPGYSSIEVNI 687

BLAST of Lsi06G001870 vs. NCBI nr
Match: gi|700211048|gb|KGN66144.1| (hypothetical protein Csa_1G573630 [Cucumis sativus])

HSP 1 Score: 1046.2 bits (2704), Expect = 2.5e-302
Identity = 520/578 (89.97%), Positives = 540/578 (93.43%), Query Frame = 1

Query: 103 MVNGYLQNEHYNDSIQLLKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKG 162
           MVNGYLQNE YND I+LLKMMSRC LEFDSYTCNFALKACMFLLDYEMGMEVI LAVCKG
Sbjct: 1   MVNGYLQNERYNDCIELLKMMSRCHLEFDSYTCNFALKACMFLLDYEMGMEVIGLAVCKG 60

Query: 163 LARGRFLGSSILNFLVKTGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFL 222
           LA GRFLGSSILNFLVKTGDIM AQ FFHQMVEKDVVCWNVMIGG MQEGLF EGYN+FL
Sbjct: 61  LAGGRFLGSSILNFLVKTGDIMCAQFFFHQMVEKDVVCWNVMIGGFMQEGLFREGYNLFL 120

Query: 223 DMLYNKIEPSAVTMTSLIQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSG 282
           DMLYNKIEPSAVTM SLIQSCGEMRNL  GKC+H +VLGFGMS DTRVLT+LIDMYCKSG
Sbjct: 121 DMLYNKIEPSAVTMISLIQSCGEMRNLTFGKCMHGFVLGFGMSRDTRVLTTLIDMYCKSG 180

Query: 283 DVKSARWIFDTMPSRNLVSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQ 342
           DV+SARWIF+ MPSRNLVSWNVMISGYVQNGLLVETL LFQ L+M+D GFDSGTVVSLIQ
Sbjct: 181 DVESARWIFENMPSRNLVSWNVMISGYVQNGLLVETLRLFQKLIMDDVGFDSGTVVSLIQ 240

Query: 343 LCSRTTDLDGGKILHGCIYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVIS 402
           LCSRT DLDGGKILHG IYRRGLDLNLVL TAIVDLYAKC SLAYASSVFERMKNKNVIS
Sbjct: 241 LCSRTADLDGGKILHGFIYRRGLDLNLVLPTAIVDLYAKCGSLAYASSVFERMKNKNVIS 300

Query: 403 WTAMLVGLAQNGHARDALKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILT 462
           WTAMLVGLAQNGHARDALKLF QMQNERVTFNALTLVSL++CC LLGLL EGRSVHA LT
Sbjct: 301 WTAMLVGLAQNGHARDALKLFDQMQNERVTFNALTLVSLVYCCTLLGLLREGRSVHATLT 360

Query: 463 RFHFASEVVGMTALIDMYAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKAL 522
           RFHFASEVV MTALIDMYAKCSKINSAE VFKYG  PKDVILYN+MISGYGMHGLGHKAL
Sbjct: 361 RFHFASEVVVMTALIDMYAKCSKINSAEMVFKYGLTPKDVILYNSMISGYGMHGLGHKAL 420

Query: 523 CVYHQMNQEGLQPNESTFVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDL 582
           CVYH+MN+EGLQPNESTFVSLLSACSHSGLVEEGI+LF+NM KDHN TPTDKLYAC VDL
Sbjct: 421 CVYHRMNREGLQPNESTFVSLLSACSHSGLVEEGIALFQNMVKDHNTTPTDKLYACIVDL 480

Query: 583 LCRAGRLQQAEELINQMPFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIY 642
           L RAGRL+QAEELINQMPF PTSGILETLLNGCLLH DIELGVK ADRLLSLESRNPSIY
Sbjct: 481 LSRAGRLRQAEELINQMPFTPTSGILETLLNGCLLHKDIELGVKLADRLLSLESRNPSIY 540

Query: 643 VTLSNIYAEASRWDSVKYVRGLMNEQELKKIPGYSSIE 681
           +TLSNIYA+ASRWDSVKYVRGLM EQE+KKIPGYSSIE
Sbjct: 541 ITLSNIYAKASRWDSVKYVRGLMMEQEIKKIPGYSSIE 578

BLAST of Lsi06G001870 vs. NCBI nr
Match: gi|763778904|gb|KJB46027.1| (hypothetical protein B456_007G344400 [Gossypium raimondii])

HSP 1 Score: 787.3 bits (2032), Expect = 2.1e-224
Identity = 396/682 (58.06%), Positives = 516/682 (75.66%), Query Frame = 1

Query: 4   FLHLPCFPLKRFISHTSKSSIQNALFNPQP----NLLSFLQEFPPNILSVKSIHAQIIIT 63
           FLH   F   +F+S  S SSI+NA FN  P      LS L EF   +LSVKSIHAQII T
Sbjct: 11  FLHKVPF---KFLSLHSFSSIKNANFNHLPLDIQRFLSLLHEFSNTLLSVKSIHAQII-T 70

Query: 64  NAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQL 123
           N++S  QFL + LV +YS LGCL  ARKVFDKIPQPK +L N+M+NGYL+N+ Y ++I+L
Sbjct: 71  NSVSKHQFLSSNLVRSYSELGCLGLARKVFDKIPQPKPILCNSMLNGYLRNQCYKETIEL 130

Query: 124 LKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVK 183
            ++M   D  FDSY+CNF LKACM L DYE G E+I+ AV   +   +FLGSS++NF +K
Sbjct: 131 FELMGASDWGFDSYSCNFVLKACMELEDYERGTEIIKRAVDHRVDGDKFLGSSMINFFMK 190

Query: 184 TGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSL 243
            GD  SA+  F+QM+ +DVVCWN MIGG ++   +   +++FL+M+   + PSA+TM SL
Sbjct: 191 FGDFNSARRVFNQMISRDVVCWNSMIGGYVKGCYYVAAFDLFLEMILCGVRPSAITMVSL 250

Query: 244 IQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNL 303
           +Q+CG MR+L+LGK +H  VL FG+ +D  V T+LIDMY K G+ + AR +FD MP++ L
Sbjct: 251 VQACGGMRDLELGKRVHGLVLVFGLGTDVLVHTALIDMYSKLGEHERARSVFDIMPAKTL 310

Query: 304 VSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGC 363
           VSWNV+ISGYVQN L+ E  +LFQ LV+  GGFDSGT++SL+Q C++  DL+ GK+LHG 
Sbjct: 311 VSWNVIISGYVQNCLVYEAFYLFQKLVLTGGGFDSGTIISLLQSCAQVADLESGKVLHGY 370

Query: 364 IYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDA 423
           I+R+GLD+N++L TA+VDLY+KC +L  A+ +F+RMKN+NVI+WTAMLVGLAQNGHA DA
Sbjct: 371 IFRKGLDINVILCTALVDLYSKCGALKEATFMFDRMKNRNVITWTAMLVGLAQNGHAEDA 430

Query: 424 LKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDM 483
           ++LF +MQ E VT N+ TLVSL+HCCA LG L +GRSVHA L R+ +A +VV  TALIDM
Sbjct: 431 IRLFGKMQEEGVTANSTTLVSLVHCCAHLGSLKKGRSVHARLLRYGYAFDVVNRTALIDM 490

Query: 484 YAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNEST 543
           YAKC  IN AE+VF+     KDVI +N+MI+GYGMHG GHKAL +Y +M +EGL+PN++T
Sbjct: 491 YAKCGNINYAERVFEDVSFFKDVISWNSMITGYGMHGQGHKALDLYRRMLEEGLKPNKTT 550

Query: 544 FVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQM 603
           FVSLLSACSHSGLV++G SLF +ME DHN+   +K YAC+VDLL RAG +++AE LI QM
Sbjct: 551 FVSLLSACSHSGLVDQGRSLFLSMESDHNIRANEKHYACYVDLLSRAGHIKEAEVLIKQM 610

Query: 604 PFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVK 663
           PF  +  + E LLNGC +H +I++G+K AD LLSL++ NP IY+ LSNIYAEA RWD+V 
Sbjct: 611 PFQSSREVFEALLNGCRMHKNIDIGIKAADYLLSLDATNPGIYIMLSNIYAEARRWDAVD 670

Query: 664 YVRGLMNEQELKKIPGYSSIEV 682
           ++RGLM  + LKK PGYS IEV
Sbjct: 671 HIRGLMRGRGLKKTPGYSLIEV 688

BLAST of Lsi06G001870 vs. NCBI nr
Match: gi|823198858|ref|XP_012434729.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g12770-like [Gossypium raimondii])

HSP 1 Score: 787.3 bits (2032), Expect = 2.1e-224
Identity = 396/682 (58.06%), Positives = 516/682 (75.66%), Query Frame = 1

Query: 4   FLHLPCFPLKRFISHTSKSSIQNALFNPQP----NLLSFLQEFPPNILSVKSIHAQIIIT 63
           FLH   F   +F+S  S SSI+NA FN  P      LS L EF   +LSVKSIHAQII T
Sbjct: 15  FLHKVPF---KFLSLHSFSSIKNANFNHLPLDIQRFLSLLHEFSNTLLSVKSIHAQII-T 74

Query: 64  NAISGDQFLVAKLVAAYSSLGCLENARKVFDKIPQPKTVLYNAMVNGYLQNEHYNDSIQL 123
           N++S  QFL + LV +YS LGCL  ARKVFDKIPQPK +L N+M+NGYL+N+ Y ++I+L
Sbjct: 75  NSVSKHQFLSSNLVRSYSELGCLGLARKVFDKIPQPKPILCNSMLNGYLRNQCYKETIEL 134

Query: 124 LKMMSRCDLEFDSYTCNFALKACMFLLDYEMGMEVIRLAVCKGLARGRFLGSSILNFLVK 183
            ++M   D  FDSY+CNF LKACM L DYE G E+I+ AV   +   +FLGSS++NF +K
Sbjct: 135 FELMGASDWGFDSYSCNFVLKACMELEDYERGTEIIKRAVDHRVDGDKFLGSSMINFFMK 194

Query: 184 TGDIMSAQIFFHQMVEKDVVCWNVMIGGLMQEGLFSEGYNVFLDMLYNKIEPSAVTMTSL 243
            GD  SA+  F+QM+ +DVVCWN MIGG ++   +   +++FL+M+   + PSA+TM SL
Sbjct: 195 FGDFNSARRVFNQMISRDVVCWNSMIGGYVKGCYYVAAFDLFLEMILCGVRPSAITMVSL 254

Query: 244 IQSCGEMRNLKLGKCIHSYVLGFGMSSDTRVLTSLIDMYCKSGDVKSARWIFDTMPSRNL 303
           +Q+CG MR+L+LGK +H  VL FG+ +D  V T+LIDMY K G+ + AR +FD MP++ L
Sbjct: 255 VQACGGMRDLELGKRVHGLVLVFGLGTDVLVHTALIDMYSKLGEHERARSVFDIMPAKTL 314

Query: 304 VSWNVMISGYVQNGLLVETLHLFQMLVMNDGGFDSGTVVSLIQLCSRTTDLDGGKILHGC 363
           VSWNV+ISGYVQN L+ E  +LFQ LV+  GGFDSGT++SL+Q C++  DL+ GK+LHG 
Sbjct: 315 VSWNVIISGYVQNCLVYEAFYLFQKLVLTGGGFDSGTIISLLQSCAQVADLESGKVLHGY 374

Query: 364 IYRRGLDLNLVLSTAIVDLYAKCRSLAYASSVFERMKNKNVISWTAMLVGLAQNGHARDA 423
           I+R+GLD+N++L TA+VDLY+KC +L  A+ +F+RMKN+NVI+WTAMLVGLAQNGHA DA
Sbjct: 375 IFRKGLDINVILCTALVDLYSKCGALKEATFMFDRMKNRNVITWTAMLVGLAQNGHAEDA 434

Query: 424 LKLFYQMQNERVTFNALTLVSLLHCCALLGLLLEGRSVHAILTRFHFASEVVGMTALIDM 483
           ++LF +MQ E VT N+ TLVSL+HCCA LG L +GRSVHA L R+ +A +VV  TALIDM
Sbjct: 435 IRLFGKMQEEGVTANSTTLVSLVHCCAHLGSLKKGRSVHARLLRYGYAFDVVNRTALIDM 494

Query: 484 YAKCSKINSAEKVFKYGFMPKDVILYNAMISGYGMHGLGHKALCVYHQMNQEGLQPNEST 543
           YAKC  IN AE+VF+     KDVI +N+MI+GYGMHG GHKAL +Y +M +EGL+PN++T
Sbjct: 495 YAKCGNINYAERVFEDVSFFKDVISWNSMITGYGMHGQGHKALDLYRRMLEEGLKPNKTT 554

Query: 544 FVSLLSACSHSGLVEEGISLFRNMEKDHNVTPTDKLYACFVDLLCRAGRLQQAEELINQM 603
           FVSLLSACSHSGLV++G SLF +ME DHN+   +K YAC+VDLL RAG +++AE LI QM
Sbjct: 555 FVSLLSACSHSGLVDQGRSLFLSMESDHNIRANEKHYACYVDLLSRAGHIKEAEVLIKQM 614

Query: 604 PFIPTSGILETLLNGCLLHNDIELGVKFADRLLSLESRNPSIYVTLSNIYAEASRWDSVK 663
           PF  +  + E LLNGC +H +I++G+K AD LLSL++ NP IY+ LSNIYAEA RWD+V 
Sbjct: 615 PFQSSREVFEALLNGCRMHKNIDIGIKAADYLLSLDATNPGIYIMLSNIYAEARRWDAVD 674

Query: 664 YVRGLMNEQELKKIPGYSSIEV 682
           ++RGLM  + LKK PGYS IEV
Sbjct: 675 HIRGLMRGRGLKKTPGYSLIEV 692

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
PP320_ARATH       2.0e-113  35.56     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PP333_ARATH       1.1e-111  34.25     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN...
PP405_ARATH       1.4e-111  34.27     Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana GN...
PP146_ARATH       2.9e-109  35.99     Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
PPR32_ARATH       4.7e-107  33.39     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0LYM5_CUCSA  1.7e-302  89.97     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G573630 PE=4 SV=1
A0A0D2PJ08_GOSRA  1.5e-224  58.06     Uncharacterized protein OS=Gossypium raimondii GN=B456_007G344400 PE=4 SV=1
A0A061F2D9_THECC  1.2e-223  57.00     Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_026441 PE...
I1ND66_SOYBN      2.3e-209  55.85     Uncharacterized protein OS=Glycine max GN=GLYMA_20G013300 PE=4 SV=2
A0A072UNF3_MEDTR  2.1e-207  55.56     Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_4g085110 PE...

Match Name        E-value   Identity  Description
AT4G18750.1       1.1e-114  35.56     Pentatricopeptide repeat (PPR) superfamily protein
AT4G21300.1       6.1e-113  34.25     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G39350.1       7.9e-113  34.27     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G03380.1       1.6e-110  35.99     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1       2.6e-108  33.39     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|659099713|ref|XP_008450740.1|  0.0e+00   89.50     PREDICTED: pentatricopeptide repeat-containing protein At4g18750, chloroplastic-...
gi|778662656|ref|XP_011659934.1|  0.0e+00   89.52     PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like ...
gi|700211048|gb|KGN66144.1|       2.5e-302  89.97     hypothetical protein Csa_1G573630 [Cucumis sativus]
gi|763778904|gb|KJB46027.1|       2.1e-224  58.06     hypothetical protein B456_007G344400 [Gossypium raimondii]
gi|823198858|ref|XP_012434729.1|  2.1e-224  58.06     PREDICTED: pentatricopeptide repeat-containing protein At3g12770-like [Gossypium...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi06G001870.1  Lsi06G001870.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 272..298  score: 4.1E-5
    coord: 300..326  score: 4.8E-5
    coord: 578..599  score: 0.19
    coord: 474..494  score:
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 398..445  score: 4.9E-8
    coord: 500..548  score: 1.9E-12
    coord: 95..142   score: 4.0E-8
    coord: 196..243  score: 1.1
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 539..572  score: 5.0E-4
    coord: 401..432  score: 7.2E-5
    coord: 300..326  score: 6.1E-4
    coord: 199..232  score: 2.8E-4
    coord: 504..537  score: 8.6E-8
    coord: 98..131   score: 1.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 434..468  score: 5.492
    coord: 536..566  score: 9.251
    coord: 572..602  score: 7.213
    coord: 399..433  score: 9.953
    coord: 96..130   score: 8.802
    coord: 232..266  score: 6.138
    coord: 131..165  score: 5.656
    coord: 638..672  score: 6.5
    coord: 368..398  score: 7.037
    coord: 267..301  score: 10.457
    coord: 302..328  score: 5.327
    coord: 501..535  score: 11.619
    coord: 197..231  score: 10.501
    coord: 166..196  score: 5.459
    coord: 65..95    score: 6.095
    coord: 333..367  score: 5
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 410..411  score: 2.3E-5
    coord: 265..294  score: 2.3E-5
    coord: 504..656  score: 2.3E-5
    coord: 73..127   score: 2.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 2..45     score: 1.9E-279
    coord: 67..327   score: 1.9E-279
    coord: 363..679  score: 1.9E
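
The PS51375 (PROSITE profile) alignments in the InterPro table above are listed in scan order, which makes it hard to see how the predicted PPR motifs are arranged along the protein. The following is a minimal sketch, not part of the database: the names `RAW`, `PAIR`, `parse_alignments` and `gaps_between` are hypothetical, and the coordinates are simply pasted from the table. It parses the `coord`/`score` pairs, sorts them by start position, and reports residue ranges between consecutive hits that no PS51375 match covers.

```python
import re

# PS51375 (PROSITE profile) alignments copied from the InterPro table.
# The last score is cut off in the source page and is kept exactly as shown.
RAW = """
coord: 434..468 score: 5.492
coord: 536..566 score: 9.251
coord: 572..602 score: 7.213
coord: 399..433 score: 9.953
coord: 96..130 score: 8.802
coord: 232..266 score: 6.138
coord: 131..165 score: 5.656
coord: 638..672 score: 6.5
coord: 368..398 score: 7.037
coord: 267..301 score: 10.457
coord: 302..328 score: 5.327
coord: 501..535 score: 11.619
coord: 197..231 score: 10.501
coord: 166..196 score: 5.459
coord: 65..95 score: 6.095
coord: 333..367 score: 5
"""

# One "coord: start..end score: value" pair. The score is kept as text
# because some values on the page are truncated.
PAIR = re.compile(r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([0-9.eE+-]*)")


def parse_alignments(text):
    """Return (start, end, score_text) tuples sorted by start coordinate."""
    hits = [(int(start), int(end), score) for start, end, score in PAIR.findall(text)]
    return sorted(hits)


def gaps_between(hits):
    """Return (gap_start, gap_end) ranges left uncovered between consecutive hits."""
    gaps = []
    for (_, prev_end, _), (next_start, _, _) in zip(hits, hits[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


hits = parse_alignments(RAW)
for start, end, score in hits:
    print(f"{start:>4}..{end:<4} score={score}")
print("gaps:", gaps_between(hits))
```

Sorted this way, the profile hits run from residue 65 to residue 672, and most of them begin immediately after the previous one ends, which is the tandem arrangement expected for a PPR protein. The same parser can be pointed at the PFAM or TIGRFAMs rows by replacing the pasted text.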