Lsi04G018170 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G018170
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat-containing protein
Location: chr04 : 25370242 .. 25371966 (+)
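
The Location field packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be split apart and the span length computed is shown here. It assumes the coordinates are 1-based and inclusive, which the page does not state; the names `Location` and `parse_location` are illustrative and are not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed genomic location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "chr04 : 25370242 .. 25371966 (+)".
_LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid : start .. end (strand)' string into its fields."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("chr04 : 25370242 .. 25371966 (+)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the span is 1725 bp, a multiple of three.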
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGCTTTTTGGGACGTCTCGCCAGTGGCAGGTATCAATATTCCTTACTTTACTCGCCGTCGTCCTCATCGGCTTTATCATACTTGTTTTCAACCCTAGACGAACCATCAAATCTATTTGATGATGGTGTTTTGGGTGATGGAACTCGGAATCAATGTAGCATAGACGAGCGCTTCGTTATCGGCGAACTTTCTAATCTCTTACTTGTTAATCCCTATGGTTCGGTTTATAACACTCTCAAAGAGATTCCTACCGAGAAACAGATGCCAATTAGGGCAGTTGATGGATTTTTACCACCAGAAGAAAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACCGCAATCGAACATGCTTTAGCTAATACTGATGTGAATTTGAGCCAAGATGTTGTCAACAAAGTACTGGACACGGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATACTTCCAGTTACAACATAATACTTAAAGCTTTAGGCAGAAGAAGGTTTTTTGACTCCATGATGGATGTTTTACACAACATGACACGGGAGGGAGTGAATGTGAACATGGAAACAGTATCCATTGTGGTAGACAGTTTGGTCAAGGCTCGCCAAGTTTCTAAGGCACTTCAGTTATTCAGAAACTTGAAAGAAATTGGGTTGAAATGTGATACTGAAACCTTGAATATTCTTCTACAATGCATGTGTCGACGATCCCATGTGGGTGCTGCAAACTCCTTCTTTAATTTAATCAAGGGGAATGTTCCTTTCAATGCTATGACATATAACATTATAATTGGTGGATGGTCAAGATACGGCAGGCATAGTGAAGTTGAGCGAATTTTGAAAGCAATGGAAGTTGATGGATTTTCTCCAGATTGTCTGACCTACACTTATCTTCTTGAGTGTCTTGGCAGAGCTAATCGCATTGATGATGCTGTCAAGGTCTTTGATAAAATGGATGAAAAAGGCTGTACGCCAGATGTTGATGCTTATAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATGAATGCCTGACCTATTACAAGCGTATGTTGAGTAATAGATGTGAACCTGACATCAACACCTATTCCAATTTGATCATTGGCTTTCTTAAAGCCAAGAAAGTGGCCGATGCACTAGAAATGTTTGATGAGATGGTGGCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATGAAACTTAGCTGTAGTTATGGTCCTCCGCATGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCTTACAAATTGTTGCTAATGCGGCTCTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGAAACTTATGAGCATGCCATCGACTGTCTTTGTAAAACAGGGCAGCTCGAAAATGCTGTACTCGTCATGGAGGAATGTTTACGTCAGGGTTTCTTCCCAAGTAGGCAAATACGTAGTAAGCTTAATAACAAACTATTGGCCTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAGTTTGCAAAGATGTTGGCGTGCCAAGGGATGGCATTATTGA

mRNA sequence

ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGCTTTTTGGGACGTCTCGCCAGTGGCAGGTATCAATATTCCTTACTTTACTCGCCGTCGTCCTCATCGGCTTTATCATACTTGTTTTCAACCCTAGACGAACCATCAAATCTATTTGATGATGGTGTTTTGGGTGATGGAACTCGGAATCAATGTAGCATAGACGAGCGCTTCGTTATCGGCGAACTTTCTAATCTCTTACTTGTTAATCCCTATGGTTCGGTTTATAACACTCTCAAAGAGATTCCTACCGAGAAACAGATGCCAATTAGGGCAGTTGATGGATTTTTACCACCAGAAGAAAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACCGCAATCGAACATGCTTTAGCTAATACTGATGTGAATTTGAGCCAAGATGTTGTCAACAAAGTACTGGACACGGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATACTTCCAGTTACAACATAATACTTAAAGCTTTAGGCAGAAGAAGGTTTTTTGACTCCATGATGGATGTTTTACACAACATGACACGGGAGGGAGTGAATGTGAACATGGAAACAGTATCCATTGTGGTAGACAGTTTGGTCAAGGCTCGCCAAGTTTCTAAGGCACTTCAGTTATTCAGAAACTTGAAAGAAATTGGGTTGAAATGTGATACTGAAACCTTGAATATTCTTCTACAATGCATGTGTCGACGATCCCATGTGGGTGCTGCAAACTCCTTCTTTAATTTAATCAAGGGGAATGTTCCTTTCAATGCTATGACATATAACATTATAATTGGTGGATGGTCAAGATACGGCAGGCATAGTGAAGTTGAGCGAATTTTGAAAGCAATGGAAGTTGATGGATTTTCTCCAGATTGTCTGACCTACACTTATCTTCTTGAGTGTCTTGGCAGAGCTAATCGCATTGATGATGCTGTCAAGGTCTTTGATAAAATGGATGAAAAAGGCTGTACGCCAGATGTTGATGCTTATAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATGAATGCCTGACCTATTACAAGCGTATGTTGAGTAATAGATGTGAACCTGACATCAACACCTATTCCAATTTGATCATTGGCTTTCTTAAAGCCAAGAAAGTGGCCGATGCACTAGAAATGTTTGATGAGATGGTGGCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATGAAACTTAGCTGTAGTTATGGTCCTCCGCATGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCTTACAAATTGTTGCTAATGCGGCTCTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGAAACTTATGAGCATGCCATCGACTGTCTTTGTAAAACAGGGCAGCTCGAAAATGCTGTACTCGTCATGGAGGAATGTTTACGTCAGGGTTTCTTCCCAAGTAGGCAAATACGTAGTAAGCTTAATAACAAACTATTGGCCTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAGTTTGCAAAGATGTTGGCGTGCCAAGGGATGGCATTATTGA

Coding sequence (CDS)

ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGCTTTTTGGGACGTCTCGCCAGTGGCAGGTATCAATATTCCTTACTTTACTCGCCGTCGTCCTCATCGGCTTTATCATACTTGTTTTCAACCCTAGACGAACCATCAAATCTATTTGATGATGGTGTTTTGGGTGATGGAACTCGGAATCAATGTAGCATAGACGAGCGCTTCGTTATCGGCGAACTTTCTAATCTCTTACTTGTTAATCCCTATGGTTCGGTTTATAACACTCTCAAAGAGATTCCTACCGAGAAACAGATGCCAATTAGGGCAGTTGATGGATTTTTACCACCAGAAGAAAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACCGCAATCGAACATGCTTTAGCTAATACTGATGTGAATTTGAGCCAAGATGTTGTCAACAAAGTACTGGACACGGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATACTTCCAGTTACAACATAATACTTAAAGCTTTAGGCAGAAGAAGGTTTTTTGACTCCATGATGGATGTTTTACACAACATGACACGGGAGGGAGTGAATGTGAACATGGAAACAGTATCCATTGTGGTAGACAGTTTGGTCAAGGCTCGCCAAGTTTCTAAGGCACTTCAGTTATTCAGAAACTTGAAAGAAATTGGGTTGAAATGTGATACTGAAACCTTGAATATTCTTCTACAATGCATGTGTCGACGATCCCATGTGGGTGCTGCAAACTCCTTCTTTAATTTAATCAAGGGGAATGTTCCTTTCAATGCTATGACATATAACATTATAATTGGTGGATGGTCAAGATACGGCAGGCATAGTGAAGTTGAGCGAATTTTGAAAGCAATGGAAGTTGATGGATTTTCTCCAGATTGTCTGACCTACACTTATCTTCTTGAGTGTCTTGGCAGAGCTAATCGCATTGATGATGCTGTCAAGGTCTTTGATAAAATGGATGAAAAAGGCTGTACGCCAGATGTTGATGCTTATAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATGAATGCCTGACCTATTACAAGCGTATGTTGAGTAATAGATGTGAACCTGACATCAACACCTATTCCAATTTGATCATTGGCTTTCTTAAAGCCAAGAAAGTGGCCGATGCACTAGAAATGTTTGATGAGATGGTGGCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATGAAACTTAGCTGTAGTTATGGTCCTCCGCATGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCTTACAAATTGTTGCTAATGCGGCTCTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGAAACTTATGAGCATGCCATCGACTGTCTTTGTAAAACAGGGCAGCTCGAAAATGCTGTACTCGTCATGGAGGAATGTTTACGTCAGGGTTTCTTCCCAAGTAGGCAAATACGTAGTAAGCTTAATAACAAACTATTGGCCTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAGTTTGCAAAGATGTTGGCGTGCCAAGGGATGGCATTATTGA

Protein sequence

MAFGASRRFLSHQFRGCFLGRLASGRYQYSLLYSPSSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLRGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAYKLWLKIKVARHQESLQRCWRAKGWHY
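
The protein sequence above can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and an input containing only A, C, G and T; the function name `translate` is illustrative. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGGCATTCGGCGCTTCCCGGCGT"))
print(translate("GGATGGCATTATTGA"))
```

The two fragments translate to the start (MAFGASRR) and end (GWHY) of the protein sequence listed above, with the final TGA read as the stop codon.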

BLAST of Lsi04G018170 vs. Swiss-Prot
Match: PP416_ARATH (Putative pentatricopeptide repeat-containing protein At5g43820 OS=Arabidopsis thaliana GN=At5g43820 PE=3 SV=1)

HSP 1 Score: 580.5 bits (1495), Expect = 2.0e-164
Identity = 277/510 (54.31%), Positives = 378/510 (74.12%), Query Frame = 1

Query: 66  NQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLRGVFLQ 125
           N   +DE +V+ ELS+LL ++   +  +  KE  + K     A+D FL  E+KLRGVFLQ
Sbjct: 41  NHGVVDESYVLAELSSLLPIS--SNKTSVSKEDSSSKNQV--AIDSFLSAEDKLRGVFLQ 100

Query: 126 KLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNI 185
           KL GK+AI+ +L++  + LS D+V  VL+ G+L  EAMVTFF WA+++P + KD  SY++
Sbjct: 101 KLKGKSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSV 160

Query: 186 ILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIG 245
           IL+ALGRR+ F  MMDVL  M  EGVN ++E ++I +DS V+   V +A++LF   +  G
Sbjct: 161 ILRALGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFG 220

Query: 246 LKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVER 305
           +KC TE+ N LL+C+C RSHV AA S FN  KGN+PF++ +YNI+I GWS+ G   E+E+
Sbjct: 221 VKCSTESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEK 280

Query: 306 ILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFI 365
           +LK M   GF PDCL+Y++L+E LGR  RI+D+V++FD +  KG  PD + YNAMI NFI
Sbjct: 281 VLKEMVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFI 340

Query: 366 CIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTG 425
              DFDE + YY+RML   CEP++ TYS L+ G +K +KV+DALE+F+EM++R ++PTTG
Sbjct: 341 SARDFDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTG 400

Query: 426 AITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEM 485
            +TSF+K  CSYGPPHAAM+IY+K+RK GCRIS++AYKLLL RLS FGK GMLLN+W+EM
Sbjct: 401 LVTSFLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEM 460

Query: 486 QESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRT 545
           QESGY  DVE YE+ +D LC  G LENAVLVMEE +R+GF P+R + S+L++KL+A N+T
Sbjct: 461 QESGYPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKT 520

Query: 546 EMAYKLWLKIKVARHQESLQRCWRAKGWHY 575
           E+AYKL+LKIK AR  E+ +  WR+ GWH+
Sbjct: 521 ELAYKLFLKIKKARATENARSFWRSNGWHF 546
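
The percentages in each HSP summary line are the matched-residue counts divided by the alignment length. A minimal sketch for recomputing them from the summary line follows; the function name `hsp_percentages` is illustrative, and the pattern accepts either spelling of the positives label.

```python
import re

# Captures "Identity = 277/510" and "Positives = 378/510"
# (the label is also matched when spelled "Postives").
_FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(summary_line: str) -> dict:
    """Return identity/positives percentages recomputed from the counts."""
    result = {}
    for label, matched, aligned in _FRACTION_RE.findall(summary_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result


line = "Identity = 277/510 (54.31%), Positives = 378/510 (74.12%), Query Frame = 1"
print(hsp_percentages(line))
```

For the hit above, 277 identical and 378 similar positions over 510 aligned columns reproduce the reported 54.31% and 74.12%.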

BLAST of Lsi04G018170 vs. Swiss-Prot
Match: PP293_ARATH (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 6.8e-43
Identity = 112/392 (28.57%), Positives = 186/392 (47.45%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGR 192
           +E  L    ++LS D++ +VL+      +    FF WA ++     D+ +YN ++  L +
Sbjct: 148 MEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSILAK 207

Query: 193 RRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTET 252
            R F++M+ VL  M  +G+ + MET +I + +   A++  KA+ +F  +K+   K   ET
Sbjct: 208 TRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVET 267

Query: 253 LNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEV 312
           +N LL  + R      A   F+ +K     N MTY +++ GW R     E  RI   M  
Sbjct: 268 INCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMID 327

Query: 313 DGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDE 372
            G  PD + +  +LE L R+ +  DA+K+F  M  KG  P+V +Y  MI +F      + 
Sbjct: 328 QGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMET 387

Query: 373 CLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTG-AITSFMK 432
            + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + +K
Sbjct: 388 AIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIK 447

Query: 433 LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDP 492
           L  +   P  A  IY K  +     S + + +++    +   + M   +W EM + G  P
Sbjct: 448 LMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFMARNYEMGRAVWEEMIKKGICP 507

Query: 493 DVETYEHAIDCLCKTGQLENAVLVMEECLRQG 524
           D  +Y   I  L   G+   A   +EE L +G
Sbjct: 508 DDNSYTVLIRGLIGEGKSREACRYLEEMLDKG 538

BLAST of Lsi04G018170 vs. Swiss-Prot
Match: PP382_ARATH (Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidopsis thaliana GN=At5g14820 PE=2 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 2.6e-42
Identity = 111/392 (28.32%), Positives = 186/392 (47.45%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGR 192
           +E  L    ++LS D++ +VL+      +    FF WA ++     D+ +YN ++  L +
Sbjct: 147 MEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSILAK 206

Query: 193 RRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTET 252
            R F++M+ VL  M  +G+ + MET +I + +   A++  KA+ +F  +K+   K   ET
Sbjct: 207 TRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVET 266

Query: 253 LNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEV 312
           +N LL  + R      A   F+ +K     N MTY +++ GW R     E  RI   M  
Sbjct: 267 INCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMID 326

Query: 313 DGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDE 372
            G  PD + +  +LE L R+ +  DA+K+F  M  KG  P+V +Y  MI +F      + 
Sbjct: 327 HGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMET 386

Query: 373 CLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTG-AITSFMK 432
            + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + +K
Sbjct: 387 AIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIK 446

Query: 433 LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDP 492
           L  +   P     IY K  +     S + + +++    +   + M   +W+EM + G  P
Sbjct: 447 LMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMIMKSYFVARNYEMGRAVWDEMIKKGICP 506

Query: 493 DVETYEHAIDCLCKTGQLENAVLVMEECLRQG 524
           D  +Y   I  L   G+   A   +EE L +G
Sbjct: 507 DDNSYTVLIRGLISEGKSREACRYLEEMLDKG 537

BLAST of Lsi04G018170 vs. Swiss-Prot
Match: PP294_ARATH (Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidopsis thaliana GN=At3g62540 PE=2 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 2.2e-41
Identity = 110/392 (28.06%), Positives = 185/392 (47.19%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGR 192
           +E  L    ++LS D++ +VL+      +    FF WA ++      + +YN ++  L +
Sbjct: 148 MEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHASRTYNSMMSILAK 207

Query: 193 RRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTET 252
            R F++M+ VL  M  +G+ + MET +I + +   A++  KA+ +F  +K+   K   ET
Sbjct: 208 TRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVET 267

Query: 253 LNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEV 312
           +N LL  + R      A   F+ +K     N MTY +++ GW R     E  RI   M  
Sbjct: 268 INCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMID 327

Query: 313 DGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDE 372
            G  PD + +  +LE L R+ +  DA+K+F  M  KG  P+V +Y  MI +F      + 
Sbjct: 328 HGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMET 387

Query: 373 CLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTG-AITSFMK 432
            + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + +K
Sbjct: 388 AIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIK 447

Query: 433 LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDP 492
           L  +   P     IY K  +     S + + +++    +   + M   +W+EM + G  P
Sbjct: 448 LMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMIMKSYFVARNYEMGRAVWDEMIKKGICP 507

Query: 493 DVETYEHAIDCLCKTGQLENAVLVMEECLRQG 524
           D  +Y   I  L   G+   A   +EE L +G
Sbjct: 508 DDNSYTVLIRGLISEGKSREACRYLEEMLDKG 538

BLAST of Lsi04G018170 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 7.8e-39
Identity = 109/427 (25.53%), Positives = 188/427 (44.03%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVL----DTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILK 192
           +E AL  + V L   ++ +VL    D G+LG      FF WA KQP        Y  ++K
Sbjct: 100 LELALNESGVELRPGLIERVLNRCGDAGNLGYR----FFVWAAKQPRYCHSIEVYKSMVK 159

Query: 193 ALGRRRFFDSMMDVLHNMTREGVN-VNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLK 252
            L + R F ++  ++  M +E    +  E   ++V     A  V KA+++   + + G +
Sbjct: 160 ILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFE 219

Query: 253 CDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERIL 312
            D      LL  +C+   V  A   F  ++   P N   +  ++ GW R G+  E + +L
Sbjct: 220 PDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVGKMMEAKYVL 279

Query: 313 KAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICI 372
             M   GF PD + YT LL     A ++ DA  +   M  +G  P+ + Y  +I     +
Sbjct: 280 VQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKV 339

Query: 373 GDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAI 432
              +E +  +  M    CE D+ TY+ L+ GF K  K+     + D+M+ + ++P+    
Sbjct: 340 DRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPSELTY 399

Query: 433 TSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQE 492
              M            + + +K R++        Y +++      G+    + +WNEM+E
Sbjct: 400 MHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEE 459

Query: 493 SGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRS--KLNNKLLACNRT 552
           +G  P V+T+   I+ L   G L  A    +E + +G F   Q  +   L N +L   + 
Sbjct: 460 NGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLKDKKL 519

BLAST of Lsi04G018170 vs. TrEMBL
Match: A0A0A0L3E0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G663730 PE=4 SV=1)

HSP 1 Score: 1043.9 bits (2698), Expect = 7.3e-302
Identity = 510/574 (88.85%), Positives = 539/574 (93.90%), Query Frame = 1

Query: 1   MAFGASRRFLSHQFRGCFLGRLASGRYQYSLLYSPSSSSALSYLFSTLDEPSNLFDDGVL 60
           MAFGASRR + +Q R CFLG +ASGRY Y L++SPS   ALSYLFSTLDEPSNLFDDG+ 
Sbjct: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGLS 60

Query: 61  GDGTRNQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLR 120
           G+G RNQ  IDERFVI ELS+LLLVNPYGSVYNTLKE   EKQMP+RAVDGFL PEEKLR
Sbjct: 61  GNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKLNGKTAIEHALANTDV LSQDVV+KVL+TGSLGSEAMVTFFYWAIKQPSIPKD 
Sbjct: 121 GVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDA 180

Query: 181 SSYNIILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRN 240
           SSYNIILKALGRR FFDSMMDVL+NMTREGV   +E VSIVVDSLVK  QVSKALQ FRN
Sbjct: 181 SSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRN 240

Query: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRH 300
           LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNL KGN+PFN MTYNI+IGGWSRYGRH
Sbjct: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRH 300

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAM 360
            EVE++LKAME+DGFSPDCLT+TYL+ECLGRAN+IDDAVK+FDKMDE GCTPDVDAYNAM
Sbjct: 301 GEVEQMLKAMELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDKMDENGCTPDVDAYNAM 360

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARII 420
           ISNFICIGDFD+CLTYY+RMLSNRCEPD+NTYSNLI GFLKAKKVADALEMFDEMVARII
Sbjct: 361 ISNFICIGDFDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARII 420

Query: 421 PTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSF++LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI
Sbjct: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480

Query: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLA 540
           WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSR+ RSKLNNKLLA
Sbjct: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQESLQRCWRAKGWHY 575
           CNRTEMAYKLWLKIKVARHQE+LQRCWRAKGWHY
Sbjct: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of Lsi04G018170 vs. TrEMBL
Match: A5C8V0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018999 PE=4 SV=1)

HSP 1 Score: 673.7 bits (1737), Expect = 2.0e-190
Identity = 339/566 (59.89%), Positives = 429/566 (75.80%), Query Frame = 1

Query: 10  LSHQFRGCFLGRLASGRYQYSLLYSPSSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCS 69
           ++ Q +G FL R +  R +Y   Y PSS S   + FSTL   SN   D    +  +   +
Sbjct: 1   MASQLQG-FLSRFS--RTRYHTRYLPSSVSL--FQFSTLQVTSNPLMDEPTDNQIKRPSN 60

Query: 70  IDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLRGVFLQKLNG 129
            +ER V+ +LS LL +    S+     E   ++Q+  RAVDGFL P EKLRGVF+Q+L G
Sbjct: 61  FNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRG 120

Query: 130 KTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKA 189
           K AIE AL N  ++L+ D+V++V + G+LG EAMV FF WA+KQP+IPKD  +YN+I+KA
Sbjct: 121 KAAIELALTNVGIDLTIDIVSEVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKA 180

Query: 190 LGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCD 249
           LGRR+F +  + VL +M  +G++ N ET+SIV+DS +KARQVSKA+++FRNL+E G KCD
Sbjct: 181 LGRRKFIEFXVXVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCD 240

Query: 250 TETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKA 309
           TE+LN+LLQC+C+RSHVGAAN FFN +KG +PFN MTYNIIIGGWS+YG+  E+ER LKA
Sbjct: 241 TESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKA 300

Query: 310 MEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGD 369
           M  DGFSP+CLT+++L+E LGRA RIDDAV+VF  M+E GC P+   YNA+ISNFI   D
Sbjct: 301 MVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRD 360

Query: 370 FDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAITS 429
           FDECL YY  M+S+ C+P+++TY+ LI+ FLKA+KVADALEM DEMV R +IPTTGAITS
Sbjct: 361 FDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITS 420

Query: 430 FMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESG 489
           F++  C YGPPHAAM+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLN+W+EMQESG
Sbjct: 421 FIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESG 480

Query: 490 YDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAY 549
           Y  D E YE+ I+ LC  GQL+ AVLVMEE L +GF PSR IRSKLNNKLLA N+ EMAY
Sbjct: 481 YSSDTEVYEYVINGLCNIGQLDTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAY 540

Query: 550 KLWLKIKVARHQESLQRCWRAKGWHY 575
           KL+LKIK AR  ++ +R WR  GWH+
Sbjct: 541 KLFLKIKXARQNDNARRFWRGNGWHF 561

BLAST of Lsi04G018170 vs. TrEMBL
Match: M5WDR4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023340mg PE=4 SV=1)

HSP 1 Score: 671.0 bits (1730), Expect = 1.3e-189
Identity = 344/552 (62.32%), Positives = 427/552 (77.36%), Query Frame = 1

Query: 26  RYQYS-LLYSPSSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCSIDERFVIGELSNLLL 85
           RY  S L++SP SSS    LFSTL   SN   D       ++Q ++DE FV+  LSNLL 
Sbjct: 18  RYPLSYLVHSPISSS----LFSTLYAQSNSLHDE---HRIKSQSTLDESFVLDRLSNLLP 77

Query: 86  VNPYGSVYNTLKEIP-TEKQMPIRAVDGFLPPEEKLRGVFLQKLNGKTAIEHALANTDVN 145
           ++   S   TL E   ++KQ+ IR VDGFL P+EKLRGVFLQKL G  AIEHAL N  V+
Sbjct: 78  ISRSNSSTATLFEPSNSDKQIEIRTVDGFLLPDEKLRGVFLQKLRGTAAIEHALDNGGVD 137

Query: 146 LSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMDVL 205
           LS DVV +V++ G LG+EAM+ FF WAI++P+I K   +Y+IILKALGRR+FF  MM +L
Sbjct: 138 LSVDVVAQVVNRGGLGAEAMLVFFNWAIRKPTIAKYIETYHIILKALGRRKFFTHMMQIL 197

Query: 206 HNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTETLNILLQCMCRR 265
           H+M  +G++ N+ET+SIV+DS V+A+ VSKA+Q+FRNL+EIGL+CDTE+LN+LLQC+C+R
Sbjct: 198 HHMRAQGISPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDTESLNLLLQCLCQR 257

Query: 266 SHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYT 325
           SHVGAANSF N +KG + FN  TYNIIIGGWSR+GR SE+ERIL+AM  DGFS D  T++
Sbjct: 258 SHVGAANSFLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAMVADGFSADSSTFS 317

Query: 326 YLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDECLTYYKRMLSN 385
           ++LE LGRA RIDDAV++FD M  KGC PD   YNAMISNFI + +FDEC+ YYK M SN
Sbjct: 318 FILEGLGRAGRIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNFDECVRYYKGMSSN 377

Query: 386 RCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAITSFMKLSCSYGPPHAA 445
            C+P+I+TY+ LI  FLKA+KVA ALEMFDEM+ R ++PTTG ITSF++  CSYGPP+AA
Sbjct: 378 SCDPNIDTYTKLIAAFLKARKVAGALEMFDEMLGRGLVPTTGTITSFIEPLCSYGPPYAA 437

Query: 446 MLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDC 505
           M+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLNIW +MQE GY  D E Y++ I+ 
Sbjct: 438 MMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGYASDKEVYDYVING 497

Query: 506 LCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAYKLWLKIKVARHQES 565
           LC  G LENAVLVMEE L++GF PSR + SKLNNKLLA N+ E AYKL+LKIK AR  ++
Sbjct: 498 LCNIGHLENAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYKLFLKIKHARRYDN 557

Query: 566 LQRCWRAKGWHY 575
            QR WR+KGWH+
Sbjct: 558 AQRFWRSKGWHF 562

BLAST of Lsi04G018170 vs. TrEMBL
Match: D7UDB7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g01120 PE=4 SV=1)

HSP 1 Score: 665.2 bits (1715), Expect = 7.0e-188
Identity = 323/528 (61.17%), Positives = 413/528 (78.22%), Query Frame = 1

Query: 48  LDEPSNLFDDGVLGDGTRNQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIR 107
           +DEP++        +  +   + +ER V+ +LS LL +    S+     E   ++Q+  R
Sbjct: 1   MDEPTD--------NQIKRPSNFNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTR 60

Query: 108 AVDGFLPPEEKLRGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFF 167
           AVDGFL P EKLRGVF+Q+L GK AIE AL N  ++L+ D+V++V++ G+LG EAMV FF
Sbjct: 61  AVDGFLSPGEKLRGVFIQRLRGKAAIELALTNVGIDLTIDIVSEVINRGNLGGEAMVIFF 120

Query: 168 YWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVK 227
            WA+KQP+IPKD  +YN+I+KALGRR+F + ++ VL +M  +G++ N ET+SIV+DS +K
Sbjct: 121 NWAVKQPTIPKDVDTYNVIIKALGRRKFIEFVVKVLKDMHIQGISPNYETLSIVMDSFIK 180

Query: 228 ARQVSKALQLFRNLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTY 287
           ARQVSKA+++FRNL+E G KCDTE+LN+LLQC+C+RSHVGAAN FFN +KG +PFN MTY
Sbjct: 181 ARQVSKAIEMFRNLEEFGGKCDTESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTY 240

Query: 288 NIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDE 347
           NIIIGGWS+YG+  E+ER LKAM  DGFSP+CLT+++L+E LGRA RIDDAV+VF  M+E
Sbjct: 241 NIIIGGWSKYGKIGEMERCLKAMVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEE 300

Query: 348 KGCTPDVDAYNAMISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVAD 407
            GC P+   YNA+ISNFI   DFDECL YY  M+S+ C+P+++TY+ LI+ FLKA+KVAD
Sbjct: 301 TGCVPNACVYNALISNFISTRDFDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVAD 360

Query: 408 ALEMFDEMVAR-IIPTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLM 467
           ALEM DEMV R +IPTTGAITSF++  C YGPPHAAM+IYKKARKVGCRIS +AYKLLLM
Sbjct: 361 ALEMLDEMVGRGMIPTTGAITSFIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLM 420

Query: 468 RLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFP 527
           RLS FGK GMLLN+W+EMQESGY  D E YE+ I+ LC  GQL+ AVLVMEE L +GF P
Sbjct: 421 RLSRFGKCGMLLNLWDEMQESGYSSDTEVYEYVINGLCNIGQLDTAVLVMEESLHKGFCP 480

Query: 528 SRQIRSKLNNKLLACNRTEMAYKLWLKIKVARHQESLQRCWRAKGWHY 575
           SR IRSKLNNKLLA N+ EMAYKL+LKIK+AR  ++ +R WR  GWH+
Sbjct: 481 SRLIRSKLNNKLLASNKVEMAYKLFLKIKIARQNDNARRFWRGNGWHF 520

BLAST of Lsi04G018170 vs. TrEMBL
Match: A0A061FE27_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_034412 PE=4 SV=1)

HSP 1 Score: 646.4 bits (1666), Expect = 3.4e-182
Identity = 326/541 (60.26%), Positives = 415/541 (76.71%), Query Frame = 1

Query: 36  SSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCSIDERFVIGELSNLL-LVNPYGSVYNT 95
           S SSA S  FSTL + S++ +     +   NQ ++DER V+GELS+L    +   +V   
Sbjct: 27  SFSSAFS--FSTLSD-SSIKEPSF--NQISNQSTVDERRVLGELSDLFQFSHSNATVPYP 86

Query: 96  LKEIPTEKQMPIRAVDGFLPPEEKLRGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLD 155
            +E    KQ+   AVD +L PEEKLRGVFLQKL GKTAIEHAL+N  V LS D++ KV++
Sbjct: 87  YRESYPPKQIESGAVDEYLLPEEKLRGVFLQKLRGKTAIEHALSNVPVELSIDIIAKVVN 146

Query: 156 TGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMDVLHNMTREGVNVN 215
            G+LG EAMV FF WA+KQP I +D  SY II+KALGRR+FF  M++ LH+M +EG+  +
Sbjct: 147 IGNLGGEAMVLFFNWAMKQPGIARDIHSYYIIIKALGRRKFFKFMIETLHDMVKEGIKPD 206

Query: 216 METVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFN 275
           +ET+SIV+DS ++A++V KA++ F NL+E+GLK DT++LN+LLQC+CRR+HVGAANS FN
Sbjct: 207 VETLSIVMDSFIRAQRVQKAIETFENLEELGLKRDTKSLNVLLQCLCRRAHVGAANSLFN 266

Query: 276 LIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLLECLGRANR 335
            + G V FN  TYNI+I GWS+ GR S++ERILKAM  D F+PDC T++YL+E LGRA R
Sbjct: 267 AVNGKVKFNCDTYNIMISGWSKLGRVSKIERILKAMIADEFTPDCSTFSYLIEGLGRAGR 326

Query: 336 IDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDECLTYYKRMLSNRCEPDINTYSN 395
           IDDAV++FD M EKGC PD   YNAMISNFI +G+FDEC+ YYK +L++  +PD++TY+ 
Sbjct: 327 IDDAVEIFDHMKEKGCIPDTRVYNAMISNFISVGNFDECMKYYKGLLNSNSDPDVDTYTK 386

Query: 396 LIIGFLKAKKVADALEMFDEMVAR-IIPTTGAITSFMKLSCSYGPPHAAMLIYKKARKVG 455
           LI  FLKA+ VADALE+FDEM+ + I+PTTG +TSF++  CSYGPP+AAM+ YKKARK G
Sbjct: 387 LISAFLKAQNVADALEIFDEMLVQGIVPTTGTLTSFVEPLCSYGPPYAAMMFYKKARKFG 446

Query: 456 CRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDCLCKTGQLENAV 515
           C+IS +AYKLLLMRLS FGK GMLLNIW+EMQESG+  D+E YEH I+ LC  G LENAV
Sbjct: 447 CKISLSAYKLLLMRLSRFGKCGMLLNIWDEMQESGHTSDMEVYEHVINGLCNIGHLENAV 506

Query: 516 LVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAYKLWLKIKVARHQESLQRCWRAKGWH 575
           LVMEE LR+GF PSR + SKLNNKLLA N  E AYKL+LKIK AR  E+ +R WRA GWH
Sbjct: 507 LVMEEALRKGFCPSRVLYSKLNNKLLASNEVEKAYKLFLKIKNARRDENARRYWRANGWH 562

BLAST of Lsi04G018170 vs. TAIR10
Match: AT5G43820.1 (AT5G43820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 580.5 bits (1495), Expect = 1.2e-165
Identity = 277/510 (54.31%), Positives = 378/510 (74.12%), Query Frame = 1

Query: 66  NQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLRGVFLQ 125
           N   +DE +V+ ELS+LL ++   +  +  KE  + K     A+D FL  E+KLRGVFLQ
Sbjct: 41  NHGVVDESYVLAELSSLLPIS--SNKTSVSKEDSSSKNQV--AIDSFLSAEDKLRGVFLQ 100

Query: 126 KLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNI 185
           KL GK+AI+ +L++  + LS D+V  VL+ G+L  EAMVTFF WA+++P + KD  SY++
Sbjct: 101 KLKGKSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSV 160

Query: 186 ILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIG 245
           IL+ALGRR+ F  MMDVL  M  EGVN ++E ++I +DS V+   V +A++LF   +  G
Sbjct: 161 ILRALGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFG 220

Query: 246 LKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVER 305
           +KC TE+ N LL+C+C RSHV AA S FN  KGN+PF++ +YNI+I GWS+ G   E+E+
Sbjct: 221 VKCSTESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEK 280

Query: 306 ILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFI 365
           +LK M   GF PDCL+Y++L+E LGR  RI+D+V++FD +  KG  PD + YNAMI NFI
Sbjct: 281 VLKEMVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFI 340

Query: 366 CIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTG 425
              DFDE + YY+RML   CEP++ TYS L+ G +K +KV+DALE+F+EM++R ++PTTG
Sbjct: 341 SARDFDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTG 400

Query: 426 AITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEM 485
            +TSF+K  CSYGPPHAAM+IY+K+RK GCRIS++AYKLLL RLS FGK GMLLN+W+EM
Sbjct: 401 LVTSFLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEM 460

Query: 486 QESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRT 545
           QESGY  DVE YE+ +D LC  G LENAVLVMEE +R+GF P+R + S+L++KL+A N+T
Sbjct: 461 QESGYPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKT 520

Query: 546 EMAYKLWLKIKVARHQESLQRCWRAKGWHY 575
           E+AYKL+LKIK AR  E+ +  WR+ GWH+
Sbjct: 521 ELAYKLFLKIKKARATENARSFWRSNGWHF 546

BLAST of Lsi04G018170 vs. TAIR10
Match: AT3G62470.1 (AT3G62470.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 176.8 bits (447), Expect = 3.8e-44
Identity = 112/392 (28.57%), Positives = 186/392 (47.45%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGR 192
           +E  L    ++LS D++ +VL+      +    FF WA ++     D+ +YN ++  L +
Sbjct: 148 MEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSILAK 207

Query: 193 RRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTET 252
            R F++M+ VL  M  +G+ + MET +I + +   A++  KA+ +F  +K+   K   ET
Sbjct: 208 TRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVET 267

Query: 253 LNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEV 312
           +N LL  + R      A   F+ +K     N MTY +++ GW R     E  RI   M  
Sbjct: 268 INCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMID 327

Query: 313 DGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDE 372
            G  PD + +  +LE L R+ +  DA+K+F  M  KG  P+V +Y  MI +F      + 
Sbjct: 328 QGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMET 387

Query: 373 CLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTG-AITSFMK 432
            + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + +K
Sbjct: 388 AIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIK 447

Query: 433 LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDP 492
           L  +   P  A  IY K  +     S + + +++    +   + M   +W EM + G  P
Sbjct: 448 LMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFMARNYEMGRAVWEEMIKKGICP 507

Query: 493 DVETYEHAIDCLCKTGQLENAVLVMEECLRQG 524
           D  +Y   I  L   G+   A   +EE L +G
Sbjct: 508 DDNSYTVLIRGLIGEGKSREACRYLEEMLDKG 538

BLAST of Lsi04G018170 vs. TAIR10
Match: AT5G14820.1 (AT5G14820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 174.9 bits (442), Expect = 1.5e-43
Identity = 111/392 (28.32%), Positives = 186/392 (47.45%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGR 192
           +E  L    ++LS D++ +VL+      +    FF WA ++     D+ +YN ++  L +
Sbjct: 147 MEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSILAK 206

Query: 193 RRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTET 252
            R F++M+ VL  M  +G+ + MET +I + +   A++  KA+ +F  +K+   K   ET
Sbjct: 207 TRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVET 266

Query: 253 LNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEV 312
           +N LL  + R      A   F+ +K     N MTY +++ GW R     E  RI   M  
Sbjct: 267 INCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMID 326

Query: 313 DGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDE 372
            G  PD + +  +LE L R+ +  DA+K+F  M  KG  P+V +Y  MI +F      + 
Sbjct: 327 HGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMET 386

Query: 373 CLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTG-AITSFMK 432
            + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + +K
Sbjct: 387 AIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIK 446

Query: 433 LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDP 492
           L  +   P     IY K  +     S + + +++    +   + M   +W+EM + G  P
Sbjct: 447 LMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMIMKSYFVARNYEMGRAVWDEMIKKGICP 506

Query: 493 DVETYEHAIDCLCKTGQLENAVLVMEECLRQG 524
           D  +Y   I  L   G+   A   +EE L +G
Sbjct: 507 DDNSYTVLIRGLISEGKSREACRYLEEMLDKG 537
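
The percentages in an HSP summary line are the match counts divided by the aligned length. As a minimal sketch (the helper names `parse_hsp_summary` and `recompute_percent` are hypothetical and not part of any BLAST tool), the summary line of the AT5G14820.1 hit can be parsed and the reported values recomputed like this:

```python
import re

# Matches fields such as "Identity = 111/392 (28.32%)". The spelling of the
# second label varies between exports, so the pattern is deliberately loose.
FIELD = re.compile(r"(Identity|Pos\w*tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_summary(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    stats = {}
    for label, hits, length, pct in FIELD.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(hits), int(length), float(pct))
    return stats


def recompute_percent(hits, length):
    """Percentage of aligned positions that match, rounded to two decimals."""
    return round(100.0 * hits / length, 2)


# Summary line copied from the AT5G14820.1 hit in this record.
summary = "Identity = 111/392 (28.32%), Positives = 186/392 (47.45%), Query Frame = 1"
stats = parse_hsp_summary(summary)
for key, (hits, length, reported) in stats.items():
    print(key, recompute_percent(hits, length), reported)
```

Comparing the recomputed and reported values is a quick consistency check when HSP summaries are copied between tools.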

BLAST of Lsi04G018170 vs. TAIR10
Match: AT3G62540.1 (AT3G62540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 171.8 bits (434), Expect = 1.2e-42
Identity = 110/392 (28.06%), Positives = 185/392 (47.19%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGR 192
           +E  L    ++LS D++ +VL+      +    FF WA ++      + +YN ++  L +
Sbjct: 148 MEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHASRTYNSMMSILAK 207

Query: 193 RRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTET 252
            R F++M+ VL  M  +G+ + MET +I + +   A++  KA+ +F  +K+   K   ET
Sbjct: 208 TRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVET 267

Query: 253 LNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEV 312
           +N LL  + R      A   F+ +K     N MTY +++ GW R     E  RI   M  
Sbjct: 268 INCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMID 327

Query: 313 DGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDE 372
            G  PD + +  +LE L R+ +  DA+K+F  M  KG  P+V +Y  MI +F      + 
Sbjct: 328 HGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMET 387

Query: 373 CLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARIIPTTG-AITSFMK 432
            + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + +K
Sbjct: 388 AIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIK 447

Query: 433 LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDP 492
           L  +   P     IY K  +     S + + +++    +   + M   +W+EM + G  P
Sbjct: 448 LMANQKMPEHGTRIYNKMIQNEIEPSIHTFNMIMKSYFVARNYEMGRAVWDEMIKKGICP 507

Query: 493 DVETYEHAIDCLCKTGQLENAVLVMEECLRQG 524
           D  +Y   I  L   G+   A   +EE L +G
Sbjct: 508 DDNSYTVLIRGLISEGKSREACRYLEEMLDKG 538

BLAST of Lsi04G018170 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 163.3 bits (412), Expect = 4.4e-40
Identity = 109/427 (25.53%), Positives = 188/427 (44.03%), Query Frame = 1

Query: 133 IEHALANTDVNLSQDVVNKVL----DTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILK 192
           +E AL  + V L   ++ +VL    D G+LG      FF WA KQP        Y  ++K
Sbjct: 100 LELALNESGVELRPGLIERVLNRCGDAGNLGYR----FFVWAAKQPRYCHSIEVYKSMVK 159

Query: 193 ALGRRRFFDSMMDVLHNMTREGVN-VNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLK 252
            L + R F ++  ++  M +E    +  E   ++V     A  V KA+++   + + G +
Sbjct: 160 ILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFE 219

Query: 253 CDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERIL 312
            D      LL  +C+   V  A   F  ++   P N   +  ++ GW R G+  E + +L
Sbjct: 220 PDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVGKMMEAKYVL 279

Query: 313 KAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICI 372
             M   GF PD + YT LL     A ++ DA  +   M  +G  P+ + Y  +I     +
Sbjct: 280 VQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQALCKV 339

Query: 373 GDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAI 432
              +E +  +  M    CE D+ TY+ L+ GF K  K+     + D+M+ + ++P+    
Sbjct: 340 DRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPSELTY 399

Query: 433 TSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQE 492
              M            + + +K R++        Y +++      G+    + +WNEM+E
Sbjct: 400 MHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEE 459

Query: 493 SGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRS--KLNNKLLACNRT 552
           +G  P V+T+   I+ L   G L  A    +E + +G F   Q  +   L N +L   + 
Sbjct: 460 NGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLKDKKL 519

BLAST of Lsi04G018170 vs. NCBI nr
Match: gi|778697198|ref|XP_011654277.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis sativus])

HSP 1 Score: 1043.9 bits (2698), Expect = 1.0e-301
Identity = 510/574 (88.85%), Positives = 539/574 (93.90%), Query Frame = 1

Query: 1   MAFGASRRFLSHQFRGCFLGRLASGRYQYSLLYSPSSSSALSYLFSTLDEPSNLFDDGVL 60
           MAFGASRR + +Q R CFLG +ASGRY Y L++SPS   ALSYLFSTLDEPSNLFDDG+ 
Sbjct: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGLS 60

Query: 61  GDGTRNQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLR 120
           G+G RNQ  IDERFVI ELS+LLLVNPYGSVYNTLKE   EKQMP+RAVDGFL PEEKLR
Sbjct: 61  GNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKLNGKTAIEHALANTDV LSQDVV+KVL+TGSLGSEAMVTFFYWAIKQPSIPKD 
Sbjct: 121 GVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDA 180

Query: 181 SSYNIILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRN 240
           SSYNIILKALGRR FFDSMMDVL+NMTREGV   +E VSIVVDSLVK  QVSKALQ FRN
Sbjct: 181 SSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRN 240

Query: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRH 300
           LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNL KGN+PFN MTYNI+IGGWSRYGRH
Sbjct: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRH 300

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAM 360
            EVE++LKAME+DGFSPDCLT+TYL+ECLGRAN+IDDAVK+FDKMDE GCTPDVDAYNAM
Sbjct: 301 GEVEQMLKAMELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDKMDENGCTPDVDAYNAM 360

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARII 420
           ISNFICIGDFD+CLTYY+RMLSNRCEPD+NTYSNLI GFLKAKKVADALEMFDEMVARII
Sbjct: 361 ISNFICIGDFDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARII 420

Query: 421 PTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSF++LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI
Sbjct: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480

Query: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLA 540
           WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSR+ RSKLNNKLLA
Sbjct: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQESLQRCWRAKGWHY 575
           CNRTEMAYKLWLKIKVARHQE+LQRCWRAKGWHY
Sbjct: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of Lsi04G018170 vs. NCBI nr
Match: gi|659104798|ref|XP_008452985.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis melo])

HSP 1 Score: 1037.3 bits (2681), Expect = 9.8e-300
Identity = 512/574 (89.20%), Positives = 541/574 (94.25%), Query Frame = 1

Query: 1   MAFGASRRFLSHQFRGCFLGRLASGRYQYSLLYSPSSSSALSYLFSTLDEPSNLFDDGVL 60
           MAFGASRR L +Q + CFLG +ASGRY Y L++SPS   ALSYLFSTLDEPSNLFDDGV 
Sbjct: 1   MAFGASRRLLPYQVKACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGVS 60

Query: 61  GDGTRNQCSIDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLR 120
           G+G RNQ  IDERFVI ELS+LLLVNP+GSV NT+KE  TEKQ+PIRAVDGFL PEEKLR
Sbjct: 61  GNGDRNQRCIDERFVISELSDLLLVNPHGSVSNTVKENLTEKQVPIRAVDGFLLPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKLNGKTAIEHALANTDVNLSQDVV+KVL+TGSLGSEAMVTFFYW+IKQPSIPKD 
Sbjct: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVSKVLNTGSLGSEAMVTFFYWSIKQPSIPKDA 180

Query: 181 SSYNIILKALGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRN 240
           SSYNIILKALGRR FFDSMMDVL++MTREGV+  +ETVSIVVDSLVKA QVSKALQ FRN
Sbjct: 181 SSYNIILKALGRRGFFDSMMDVLYSMTREGVDATLETVSIVVDSLVKAHQVSKALQFFRN 240

Query: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRH 300
           LKEIGLKCDTETLNILLQCMCRRSHVGAANSF NL KG++PFN MTYNIIIGGWSRYGRH
Sbjct: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFLNLTKGSIPFNVMTYNIIIGGWSRYGRH 300

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAM 360
           SEVE+ LKAMEVDGFSPD LT+TYL+ECLGRANRIDDAVK+FDKMDEKGCTPDV AYNAM
Sbjct: 301 SEVEQTLKAMEVDGFSPDYLTHTYLIECLGRANRIDDAVKIFDKMDEKGCTPDVAAYNAM 360

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVARII 420
           ISNFICIGDFD+CLTYYKRMLSNRCEPD+NTYSNLI GFLKAKKVADALEMFDEMVARII
Sbjct: 361 ISNFICIGDFDQCLTYYKRMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARII 420

Query: 421 PTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSF++LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLL+I
Sbjct: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLSI 480

Query: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLA 540
           WNEMQESGYDPDVETYEHAI CLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLA
Sbjct: 481 WNEMQESGYDPDVETYEHAIGCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQESLQRCWRAKGWHY 575
           CNRTEMAYKLWLKIKVARHQE+LQRCWRAKGWHY
Sbjct: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of Lsi04G018170 vs. NCBI nr
Match: gi|147865347|emb|CAN84084.1| (hypothetical protein VITISV_018999 [Vitis vinifera])

HSP 1 Score: 673.7 bits (1737), Expect = 2.8e-190
Identity = 339/566 (59.89%), Positives = 429/566 (75.80%), Query Frame = 1

Query: 10  LSHQFRGCFLGRLASGRYQYSLLYSPSSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCS 69
           ++ Q +G FL R +  R +Y   Y PSS S   + FSTL   SN   D    +  +   +
Sbjct: 1   MASQLQG-FLSRFS--RTRYHTRYLPSSVSL--FQFSTLQVTSNPLMDEPTDNQIKRPSN 60

Query: 70  IDERFVIGELSNLLLVNPYGSVYNTLKEIPTEKQMPIRAVDGFLPPEEKLRGVFLQKLNG 129
            +ER V+ +LS LL +    S+     E   ++Q+  RAVDGFL P EKLRGVF+Q+L G
Sbjct: 61  FNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRG 120

Query: 130 KTAIEHALANTDVNLSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKA 189
           K AIE AL N  ++L+ D+V++V + G+LG EAMV FF WA+KQP+IPKD  +YN+I+KA
Sbjct: 121 KAAIELALTNVGIDLTIDIVSEVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKA 180

Query: 190 LGRRRFFDSMMDVLHNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCD 249
           LGRR+F +  + VL +M  +G++ N ET+SIV+DS +KARQVSKA+++FRNL+E G KCD
Sbjct: 181 LGRRKFIEFXVXVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCD 240

Query: 250 TETLNILLQCMCRRSHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKA 309
           TE+LN+LLQC+C+RSHVGAAN FFN +KG +PFN MTYNIIIGGWS+YG+  E+ER LKA
Sbjct: 241 TESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKA 300

Query: 310 MEVDGFSPDCLTYTYLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGD 369
           M  DGFSP+CLT+++L+E LGRA RIDDAV+VF  M+E GC P+   YNA+ISNFI   D
Sbjct: 301 MVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRD 360

Query: 370 FDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAITS 429
           FDECL YY  M+S+ C+P+++TY+ LI+ FLKA+KVADALEM DEMV R +IPTTGAITS
Sbjct: 361 FDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITS 420

Query: 430 FMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESG 489
           F++  C YGPPHAAM+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLN+W+EMQESG
Sbjct: 421 FIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESG 480

Query: 490 YDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAY 549
           Y  D E YE+ I+ LC  GQL+ AVLVMEE L +GF PSR IRSKLNNKLLA N+ EMAY
Sbjct: 481 YSSDTEVYEYVINGLCNIGQLDTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAY 540

Query: 550 KLWLKIKVARHQESLQRCWRAKGWHY 575
           KL+LKIK AR  ++ +R WR  GWH+
Sbjct: 541 KLFLKIKXARQNDNARRFWRGNGWHF 561

BLAST of Lsi04G018170 vs. NCBI nr
Match: gi|595858117|ref|XP_007210680.1| (hypothetical protein PRUPE_ppa023340mg [Prunus persica])

HSP 1 Score: 671.0 bits (1730), Expect = 1.8e-189
Identity = 344/552 (62.32%), Positives = 427/552 (77.36%), Query Frame = 1

Query: 26  RYQYS-LLYSPSSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCSIDERFVIGELSNLLL 85
           RY  S L++SP SSS    LFSTL   SN   D       ++Q ++DE FV+  LSNLL 
Sbjct: 18  RYPLSYLVHSPISSS----LFSTLYAQSNSLHDE---HRIKSQSTLDESFVLDRLSNLLP 77

Query: 86  VNPYGSVYNTLKEIP-TEKQMPIRAVDGFLPPEEKLRGVFLQKLNGKTAIEHALANTDVN 145
           ++   S   TL E   ++KQ+ IR VDGFL P+EKLRGVFLQKL G  AIEHAL N  V+
Sbjct: 78  ISRSNSSTATLFEPSNSDKQIEIRTVDGFLLPDEKLRGVFLQKLRGTAAIEHALDNGGVD 137

Query: 146 LSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMDVL 205
           LS DVV +V++ G LG+EAM+ FF WAI++P+I K   +Y+IILKALGRR+FF  MM +L
Sbjct: 138 LSVDVVAQVVNRGGLGAEAMLVFFNWAIRKPTIAKYIETYHIILKALGRRKFFTHMMQIL 197

Query: 206 HNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTETLNILLQCMCRR 265
           H+M  +G++ N+ET+SIV+DS V+A+ VSKA+Q+FRNL+EIGL+CDTE+LN+LLQC+C+R
Sbjct: 198 HHMRAQGISPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDTESLNLLLQCLCQR 257

Query: 266 SHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYT 325
           SHVGAANSF N +KG + FN  TYNIIIGGWSR+GR SE+ERIL+AM  DGFS D  T++
Sbjct: 258 SHVGAANSFLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAMVADGFSADSSTFS 317

Query: 326 YLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDECLTYYKRMLSN 385
           ++LE LGRA RIDDAV++FD M  KGC PD   YNAMISNFI + +FDEC+ YYK M SN
Sbjct: 318 FILEGLGRAGRIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNFDECVRYYKGMSSN 377

Query: 386 RCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAITSFMKLSCSYGPPHAA 445
            C+P+I+TY+ LI  FLKA+KVA ALEMFDEM+ R ++PTTG ITSF++  CSYGPP+AA
Sbjct: 378 SCDPNIDTYTKLIAAFLKARKVAGALEMFDEMLGRGLVPTTGTITSFIEPLCSYGPPYAA 437

Query: 446 MLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDC 505
           M+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLNIW +MQE GY  D E Y++ I+ 
Sbjct: 438 MMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGYASDKEVYDYVING 497

Query: 506 LCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAYKLWLKIKVARHQES 565
           LC  G LENAVLVMEE L++GF PSR + SKLNNKLLA N+ E AYKL+LKIK AR  ++
Sbjct: 498 LCNIGHLENAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYKLFLKIKHARRYDN 557

Query: 566 LQRCWRAKGWHY 575
            QR WR+KGWH+
Sbjct: 558 AQRFWRSKGWHF 562

BLAST of Lsi04G018170 vs. NCBI nr
Match: gi|645265054|ref|XP_008237970.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Prunus mume])

HSP 1 Score: 669.8 bits (1727), Expect = 4.1e-189
Identity = 344/552 (62.32%), Positives = 425/552 (76.99%), Query Frame = 1

Query: 26  RYQYS-LLYSPSSSSALSYLFSTLDEPSNLFDDGVLGDGTRNQCSIDERFVIGELSNLLL 85
           RY  S L+ SP  SS    LFSTL   SN   D       +NQ ++DE FV+ +LSNLL 
Sbjct: 18  RYPLSYLVRSPIPSS----LFSTLYAQSNSLHDE---HRIKNQSTLDESFVLDQLSNLLP 77

Query: 86  VNPYGSVYNTLKEIP-TEKQMPIRAVDGFLPPEEKLRGVFLQKLNGKTAIEHALANTDVN 145
           +    S   TL E   ++KQ+ IRAVDGFL P+EKLRGVFLQKL G  AIEHAL N  V+
Sbjct: 78  ICRSNSSTATLFEPSNSDKQIEIRAVDGFLLPDEKLRGVFLQKLRGTAAIEHALDNGGVD 137

Query: 146 LSQDVVNKVLDTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMDVL 205
           LS DVV +V++ G LG+EAM+ FF WAI++P+I K+  +++IILKALGRR+FF  MM +L
Sbjct: 138 LSVDVVAQVVNRGGLGAEAMLVFFNWAIRKPTIAKNIETFHIILKALGRRKFFTHMMQIL 197

Query: 206 HNMTREGVNVNMETVSIVVDSLVKARQVSKALQLFRNLKEIGLKCDTETLNILLQCMCRR 265
           H+M  +G+  N+ET+SIV+DS V+A+ VSKA+Q+FRNL+EIGL+CDTE+LN+LLQC+C+R
Sbjct: 198 HHMRAQGIRPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDTESLNLLLQCLCQR 257

Query: 266 SHVGAANSFFNLIKGNVPFNAMTYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYT 325
           SHVGAANSF N +KG + FN  TYNIIIGGWSR+GR SE+ERIL+AM  DGFS D  T++
Sbjct: 258 SHVGAANSFLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAMVADGFSADSSTFS 317

Query: 326 YLLECLGRANRIDDAVKVFDKMDEKGCTPDVDAYNAMISNFICIGDFDECLTYYKRMLSN 385
           ++LE LGRA  IDDAV++FD M  KGC PD   YNAMISNFI + +FDEC+ YYK M SN
Sbjct: 318 FILEGLGRAGHIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNFDECVRYYKGMSSN 377

Query: 386 RCEPDINTYSNLIIGFLKAKKVADALEMFDEMVAR-IIPTTGAITSFMKLSCSYGPPHAA 445
            C P+I+TY+ LI  FLKA+KVADALEMFDEM+ R ++PTTG ITSF++  CSYGPP+AA
Sbjct: 378 SCNPNIDTYTKLIAAFLKARKVADALEMFDEMLGRGLVPTTGTITSFIEPLCSYGPPYAA 437

Query: 446 MLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDC 505
           M+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLNIW +MQE GY  D E Y++ I+ 
Sbjct: 438 MMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGYASDKEVYDYVING 497

Query: 506 LCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACNRTEMAYKLWLKIKVARHQES 565
           LC  G LENAVLVMEE L++GF PSR + SKLNNKLLA N+ E AYKL+LKIK AR  ++
Sbjct: 498 LCNIGHLENAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYKLFLKIKHARRYDN 557

Query: 566 LQRCWRAKGWHY 575
            QR WR+KGWH+
Sbjct: 558 AQRFWRSKGWHF 562

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription

Match Name     E-value     Identity   Description
PP416_ARATH    2.0e-164    54.31      Putative pentatricopeptide repeat-containing protein At5g43820 OS=Arabidopsis th...
PP293_ARATH    6.8e-43     28.57      Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop...
PP382_ARATH    2.6e-42     28.32      Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidop...
PP294_ARATH    2.2e-41     28.06      Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidop...
PP447_ARATH    7.8e-39     25.53      Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th...

Match Name          E-value     Identity   Description
A0A0A0L3E0_CUCSA    7.3e-302    88.85      Uncharacterized protein OS=Cucumis sativus GN=Csa_4G663730 PE=4 SV=1
A5C8V0_VITVI        2.0e-190    59.89      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018999 PE=4 SV=1
M5WDR4_PRUPE        1.3e-189    62.32      Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023340mg PE=4 SV=1
D7UDB7_VITVI        7.0e-188    61.17      Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g01120 PE=4 SV=...
A0A061FE27_THECC    3.4e-182    60.26      Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobr...

Match Name     E-value     Identity   Description
AT5G43820.1    1.2e-165    54.31      Pentatricopeptide repeat (PPR) superfamily protein
AT3G62470.1    3.8e-44     28.57      Pentatricopeptide repeat (PPR) superfamily protein
AT5G14820.1    1.5e-43     28.32      Pentatricopeptide repeat (PPR) superfamily protein
AT3G62540.1    1.2e-42     28.06      Pentatricopeptide repeat (PPR) superfamily protein
AT5G65820.1    4.4e-40     25.53      Pentatricopeptide repeat (PPR) superfamily protein

Match Name                          E-value     Identity   Description
gi|778697198|ref|XP_011654277.1|    1.0e-301    88.85      PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucum...
gi|659104798|ref|XP_008452985.1|    9.8e-300    89.20      PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucum...
gi|147865347|emb|CAN84084.1|        2.8e-190    59.89      hypothetical protein VITISV_018999 [Vitis vinifera]
gi|595858117|ref|XP_007210680.1|    1.8e-189    62.32      hypothetical protein PRUPE_ppa023340mg [Prunus persica]
gi|645265054|ref|XP_008237970.1|    4.1e-189    62.32      PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Prunu...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding

Vocabulary: INTERPRO
Term          Definition
IPR011990     TPR-like_helical_dom_sf
IPR002885     Pentatricopeptide_repeat

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
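
Each row of the GO table above has three whitespace-separated parts: category, accession, and a term name that may itself contain spaces. A minimal sketch of grouping the rows by category, assuming that layout (the function name `group_go_terms` is hypothetical):

```python
from collections import defaultdict

# Rows copied from the GO Assignments table of this record.
GO_TABLE = """\
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    groups = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split only twice so that multi-word term names stay intact.
        category, accession, name = line.split(maxsplit=2)
        groups[category].append((accession, name))
    return dict(groups)


groups = group_go_terms(GO_TABLE)
for category, terms in groups.items():
    print(category, [accession for accession, _ in terms])
```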

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
Lsi04G018170.1    Lsi04G018170.1    mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment

IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 495..524    score: 0.054
    coord: 217..246    score: 0.033
    coord: 182..211    score: 0.
IPR002885    Pentatricopeptide repeat    PFAM    PF12854    PPR_1
    coord: 384..416    score: 1.
IPR002885    Pentatricopeptide repeat    PFAM    PF13812    PPR_3
    coord: 305..363    score: 3.7
IPR002885    Pentatricopeptide repeat    TIGRFAMs    TIGR00756    TIGR00756
    coord: 391..418    score: 2.5E-5
    coord: 182..214    score: 2.8E-5
    coord: 286..319    score: 2.0E-6
    coord: 217..249    score: 0.0014
    coord: 356..389    score: 5.3E-8
    coord: 321..354    score: 3.6
IPR002885    Pentatricopeptide repeat    PROFILE    PS51375    PPR
    coord: 179..213    score: 9.909
    coord: 283..317    score: 11.137
    coord: 457..491    score: 7.892
    coord: 353..387    score: 10.567
    coord: 388..422    score: 9.712
    coord: 214..248    score: 8.955
    coord: 492..526    score: 10.468
    coord: 249..279    score: 6.796
    coord: 318..352    score: 12
IPR011990    Tetratricopeptide-like helical domain    GENE3D    G3DSA:1.25.40.10
    coord: 490..521    score: 1.6E-9
    coord: 230..433    score: 1.
IPR011990    Tetratricopeptide-like helical domain    unknown    SSF48452    TPR-like
    coord: 318..416    score: 5.0
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED
    coord: 93..568    score: 7.3E
None    No IPR available    PANTHER    PTHR24015:SF378    SUBFAMILY NOT NAMED
    coord: 93..568    score: 7.3E