Lsi05G020360 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G020360
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat-containing protein, putative
Location: chr05 : 27330235 .. 27332064 (+)
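
The span of the feature can be derived from the location string above. The following is a minimal sketch, not part of the database's own tooling: the function name `parse_location` and the regular expression are illustrative assumptions, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr05 : 27330235 .. 27332064 (+)'.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span
    }

loc = parse_location("chr05 : 27330235 .. 27332064 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span works out to 1,830 bp on the plus strand of chr05.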
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
TTATTAACATAATTGTAACATTCCGAGAAGCTGCAGAAATTGTTCGAATGCAATGCGAATCTCCTCTCTTATTCTCACTGTCTGAGATTCTGCTGGTTTTCCTTTCAGGAAGAATCCTCAATCTATTATCGACAGAATAAATATTTCGTATTTATCCTTCTCGTCGCTTGATATATGTTGAGCTTTTCGAGATGTGCACGCAACCTGTTTGTCACAAGTCCCAAGAGAAAAAATTATACGTTAGGTCCCATGATCGATGCTACTTCAACCAAAAAGCGAGCGTACTACGAAGACCCAGTTGAATTTTATGCTCAATGAGAGGATGTAATATCTTGGACGTCTAAAATTACCAATTTGGTAAGAACAGGTCAGCCAGAATCTGCTTTTGCCTTCTTCAAGATGATGTTCTCAAATGGGCAAAGGCCAAATCATGTGACAATGCTGAGCGTAATAAGAGCTATTGACTCATTGAGCTGGGATTCAATGATCGAGGTGATCCATGGAGGAGTGATCAAAATGGGTTTTGAGTCAGACGTGGCAGTTTCAACAGCCCTTCTTGGATTTTATTCAATGCGTGATATTGGGATCGTCTGGAAGTTGTTTTATCAGATACCTTACAAGGATGTTGTTTTGTGGAGTGCTATGATCTCAGCATGCGTGAAAAATGGTCAGTATAATGAAGCATTTGATCTTTTCAGAGAGATGCAAAGTCGGGGAGTTCAACCAAACCAAGTAAGCATTGTAAGCATTCTACCTGCTTGTGCTGATTTTGGTGTTCTATCTTTAGGTAAAGAGTTGCATGCATTTTCAATGAGAAGGGATTTTTATTCTATGGTTAACATTCAGAACTCACTCATGGATATGTATTCAAAATGTAGAAATTTTGAGGCATCAGTTAGAGTTTTGAAGTTGACGAGGAAAAAGGACATGGTATCATGGAGAATTATAACTCATGCGTGTATCCAAAACAATTGTCCTAGTAAAGTGTTTAAATTTTTCTCAAGGATGCGATCTTTTGGATTTGAGTTAGGCGAAACGATGATGCTGGATATTATAGCTGCAGTATTACTAGTTGATGAACTTTTACTTGGCTTGGCTGTTCATTGCTATGCATTGAAAGGTGGTTTTCTCTGTTTTATTTCAGTTGGAACTGAACTTCTCCAAATGTATGCTAAATTTGGTGATTTGGGGTTGGCCAAGCTTGTATTTGACGAGCTTGTTGACAAAGATATCATTGCCTGGAGCGCAATGATCTCAGCTTACTCTCATGGTGAAGATCCACTTAATGCCATCCAGACATTTAAAACGATGCAATCAACTAATGAAAAGCCTAATGAGATAACTTTTGTAAGTTTAATGAATGCTTGTTCTTCATTGGGTGCTCAGGAGTTGGGAGAAAGTATTCAAGCTCATATAACAAAATGGGGGTACTCATCTAATACACATTTGATGTCAGCTTTGGTTGATTTTTACTGCACACTTGGAAGGATAAAGCTAGGAAAACATGTTTTTGACGAGATTTCGACAAAGGACGTAATTTGTTGGAGTGCGATGATTAAAGGGTACGGAATGAATGCCTGCGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCTAATGGGGTGGTCTTCATCTCTCTTTTATCTGCTTGTGCTAATTGTGGATTGGAAAAGGAAGGCTGGATATGGTTTCATTCGATGATTGACAAGTATGGCATTACTCCGACAGTGGCACATTATGCTTGTATCGTGGACTTGCTCGTTCGGCAAGGAAAAATTAGAGAAGCTGTTGAATTTGTGAAGAAATGCCAG

mRNA sequence

TTATTAACATAATTGTAACATTCCGAGAAGCTGCAGAAATTGTTCGAATGCAATGCGAATCTCCTCTCTTATTCTCACTGTCTGAGATTCTGCTGATGTGCACGCAACCTGTTTGTCACAAGTCCCAAGAGAAAAAATTATACGTTAGGTCCCATGATCGATGCTACTTCAACCAAAAAGCGAGCGTACTACGAAGACCCAGTCAGCCAGAATCTGCTTTTGCCTTCTTCAAGATGATGTTCTCAAATGGGCAAAGGCCAAATCATGTGACAATGCTGAGCGTAATAAGAGCTATTGACTCATTGAGCTGGGATTCAATGATCGAGGTGATCCATGGAGGAGTGATCAAAATGGGTTTTGAGTCAGACGTGGCAGTTTCAACAGCCCTTCTTGGATTTTATTCAATGCGTGATATTGGGATCGTCTGGAAGTTGTTTTATCAGATACCTTACAAGGATGTTGTTTTGTGGAGTGCTATGATCTCAGCATGCGTGAAAAATGGTCAGTATAATGAAGCATTTGATCTTTTCAGAGAGATGCAAAGTCGGGGAGTTCAACCAAACCAAGTAAGCATTGTAAGCATTCTACCTGCTTGTGCTGATTTTGGTGTTCTATCTTTAGGTAAAGAGTTGCATGCATTTTCAATGAGAAGGGATTTTTATTCTATGGTTAACATTCAGAACTCACTCATGGATATGTATTCAAAATGTAGAAATTTTGAGGCATCAGTTAGAGTTTTGAAGTTGACGAGGAAAAAGGACATGGTATCATGGAGAATTATAACTCATGCGTGTATCCAAAACAATTGTCCTAGTAAAGTGTTTAAATTTTTCTCAAGGATGCGATCTTTTGGATTTGAGTTAGGCGAAACGATGATGCTGGATATTATAGCTGCAGTATTACTAGTTGATGAACTTTTACTTGGCTTGGCTGTTCATTGCTATGCATTGAAAGGTGGTTTTCTCTGTTTTATTTCAGTTGGAACTGAACTTCTCCAAATGTATGCTAAATTTGGTGATTTGGGGTTGGCCAAGCTTGTATTTGACGAGCTTGTTGACAAAGATATCATTGCCTGGAGCGCAATGATCTCAGCTTACTCTCATGGTGAAGATCCACTTAATGCCATCCAGACATTTAAAACGATGCAATCAACTAATGAAAAGCCTAATGAGATAACTTTTGTAAGTTTAATGAATGCTTGTTCTTCATTGGGTGCTCAGGAGTTGGGAGAAAGTATTCAAGCTCATATAACAAAATGGGGGTACTCATCTAATACACATTTGATGTCAGCTTTGGTTGATTTTTACTGCACACTTGGAAGGATAAAGCTAGGAAAACATGTTTTTGACGAGATTTCGACAAAGGACGTAATTTGTTGGAGTGCGATGATTAAAGGGTACGGAATGAATGCCTGCGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCTAATGGGGTGGTCTTCATCTCTCTTTTATCTGCTTGTGCTAATTGTGGATTGGAAAAGGAAGGCTGGATATGGTTTCATTCGATGATTGACAAGTATGGCATTACTCCGACAGTGGCACATTATGCTTGTATCGTGGACTTGCTCGTTCGGCAAGGAAAAATTAGAGAAGCTGTTGAATTTGTGAAGAAATGCCAG

Coding sequence (CDS)

ATGCAATGCGAATCTCCTCTCTTATTCTCACTGTCTGAGATTCTGCTGATGTGCACGCAACCTGTTTGTCACAAGTCCCAAGAGAAAAAATTATACGTTAGGTCCCATGATCGATGCTACTTCAACCAAAAAGCGAGCGTACTACGAAGACCCAGTCAGCCAGAATCTGCTTTTGCCTTCTTCAAGATGATGTTCTCAAATGGGCAAAGGCCAAATCATGTGACAATGCTGAGCGTAATAAGAGCTATTGACTCATTGAGCTGGGATTCAATGATCGAGGTGATCCATGGAGGAGTGATCAAAATGGGTTTTGAGTCAGACGTGGCAGTTTCAACAGCCCTTCTTGGATTTTATTCAATGCGTGATATTGGGATCGTCTGGAAGTTGTTTTATCAGATACCTTACAAGGATGTTGTTTTGTGGAGTGCTATGATCTCAGCATGCGTGAAAAATGGTCAGTATAATGAAGCATTTGATCTTTTCAGAGAGATGCAAAGTCGGGGAGTTCAACCAAACCAAGTAAGCATTGTAAGCATTCTACCTGCTTGTGCTGATTTTGGTGTTCTATCTTTAGGTAAAGAGTTGCATGCATTTTCAATGAGAAGGGATTTTTATTCTATGGTTAACATTCAGAACTCACTCATGGATATGTATTCAAAATGTAGAAATTTTGAGGCATCAGTTAGAGTTTTGAAGTTGACGAGGAAAAAGGACATGGTATCATGGAGAATTATAACTCATGCGTGTATCCAAAACAATTGTCCTAGTAAAGTGTTTAAATTTTTCTCAAGGATGCGATCTTTTGGATTTGAGTTAGGCGAAACGATGATGCTGGATATTATAGCTGCAGTATTACTAGTTGATGAACTTTTACTTGGCTTGGCTGTTCATTGCTATGCATTGAAAGGTGGTTTTCTCTGTTTTATTTCAGTTGGAACTGAACTTCTCCAAATGTATGCTAAATTTGGTGATTTGGGGTTGGCCAAGCTTGTATTTGACGAGCTTGTTGACAAAGATATCATTGCCTGGAGCGCAATGATCTCAGCTTACTCTCATGGTGAAGATCCACTTAATGCCATCCAGACATTTAAAACGATGCAATCAACTAATGAAAAGCCTAATGAGATAACTTTTGTAAGTTTAATGAATGCTTGTTCTTCATTGGGTGCTCAGGAGTTGGGAGAAAGTATTCAAGCTCATATAACAAAATGGGGGTACTCATCTAATACACATTTGATGTCAGCTTTGGTTGATTTTTACTGCACACTTGGAAGGATAAAGCTAGGAAAACATGTTTTTGACGAGATTTCGACAAAGGACGTAATTTGTTGGAGTGCGATGATTAAAGGGTACGGAATGAATGCCTGCGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCTAATGGGGTGGTCTTCATCTCTCTTTTATCTGCTTGTGCTAATTGTGGATTGGAAAAGGAAGGCTGGATATGGTTTCATTCGATGATTGACAAGTATGGCATTACTCCGACAGTGGCACATTATGCTTGTATCGTGGACTTGCTCGTTCGGCAAGGAAAAATTAGAGAAGCTGTTGAATTTGTGAAGAAATGCCAG
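
The length of the 5' UTR can be estimated by locating the start of the CDS inside the mRNA. The sketch below uses only the leading bases of the two records above, truncated for brevity; the helper name `five_prime_utr_length` is hypothetical, and the approach assumes the CDS occurs as an exact substring of the mRNA.

```python
# Leading bases copied from the mRNA and CDS records above (truncated).
mrna_head = "TTATTAACATAATTGTAACATTCCGAGAAGCTGCAGAAATTGTTCGAATGCAATGCGAATCTCCTCTC"
cds_head = "ATGCAATGCGAATCTCCTCTC"

def five_prime_utr_length(mrna, cds):
    """Return the number of mRNA bases that precede the first CDS match."""
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found in mRNA")
    return offset

utr_len = five_prime_utr_length(mrna_head, cds_head)
print(utr_len)
```

With the complete sequences, the same call applies unchanged; pass the full mRNA and CDS strings instead of the truncated prefixes.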

Protein sequence

MQCESPLLFSLSEILLMCTQPVCHKSQEKKLYVRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLVRQGKIREAVEFVKKCQ
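
The protein record corresponds to a translation of the CDS. As a hedged illustration, the sketch below translates the first seven codons of the CDS with the standard genetic code; the function name `translate` is an assumption, and the page itself does not state which translation table was used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons of the CDS record above.
peptide = translate("ATGCAATGCGAATCTCCTCTC")
print(peptide)
```

The result, MQCESPL, agrees with the first seven residues of the protein sequence listed above.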
BLAST of Lsi05G020360 vs. Swiss-Prot
Match: PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN=DYW9 PE=2 SV=1)

HSP 1 Score: 276.6 bits (706), Expect = 5.9e-73
Identity = 155/493 (31.44%), Positives = 261/493 (52.94%), Query Frame = 1

Query: 46  SVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFE 105
           SV   P    S FA  +   S   +PN  T    I A      D    VIHG  +  G +
Sbjct: 94  SVNESPHSSLSVFAHLRK--STDLKPNSSTYAFAISAASGFRDDRAGRVIHGQAVVDGCD 153

Query: 106 SDVAVSTALLGFY-SMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREM 165
           S++ + + ++  Y     +    K+F ++P KD +LW+ MIS   KN  Y E+  +FR++
Sbjct: 154 SELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNTMISGYRKNEMYVESIQVFRDL 213

Query: 166 QSRG-VQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRN 225
            +    + +  +++ ILPA A+   L LG ++H+ + +   YS   +    + +YSKC  
Sbjct: 214 INESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGK 273

Query: 226 FEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAA 285
            +    + +  RK D+V++  + H    N         F  +   G  L  + ++ ++  
Sbjct: 274 IKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVP- 333

Query: 286 VLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAW 345
             +   L+L  A+H Y LK  FL   SV T L  +Y+K  ++  A+ +FDE  +K + +W
Sbjct: 334 --VSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSW 393

Query: 346 SAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAHITK 405
           +AMIS Y+      +AI  F+ MQ +   PN +T   +++AC+ LGA  LG+ +   +  
Sbjct: 394 NAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRS 453

Query: 406 WGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEALNT 465
             + S+ ++ +AL+  Y   G I   + +FD ++ K+ + W+ MI GYG++  G EALN 
Sbjct: 454 TDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNI 513

Query: 466 FSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLV 525
           F +ML+ G+ P  V F+ +L AC++ GL KEG   F+SMI +YG  P+V HYAC+VD+L 
Sbjct: 514 FYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILG 573

Query: 526 RQGKIREAVEFVK 537
           R G ++ A++F++
Sbjct: 574 RAGHLQRALQFIE 581
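
The percentages in an HSP summary line are the matched positions divided by the alignment length. A minimal sketch of that arithmetic follows, using the figures reported for the first Swiss-Prot hit above; the function name `hsp_percentages` and the parsing pattern are illustrative assumptions rather than part of any BLAST tool.

```python
import re

def hsp_percentages(stats_line):
    """Recompute 'label = matched/length' ratios as percentages (2 decimals)."""
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result

stats = hsp_percentages(
    "Identity = 155/493 (31.44%), Positives = 261/493 (52.94%), Query Frame = 1"
)
print(stats)
```

The recomputed values (31.44 and 52.94) match the percentages printed in the summary line.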

BLAST of Lsi05G020360 vs. Swiss-Prot
Match: PP181_ARATH (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 276.2 bits (705), Expect = 7.8e-73
Identity = 157/491 (31.98%), Positives = 259/491 (52.75%), Query Frame = 1

Query: 52  SQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFESDVAVS 111
           S   +    F+ M +    PN  T+  + +A  SL   ++    H  V+KM    D+ V 
Sbjct: 97  SSSYTVMQLFREMRAQDILPNAYTLAGIFKAESSLQSSTVGRQAHALVVKMSSFGDIYVD 156

Query: 112 TALLGFYSMRDIGIV---WKLFYQIPYKDVVLWSAMISACVKNGQYNEA---FDLFREMQ 171
           T+L+G Y     G+V    K+F  +P ++   WS M+S     G+  EA   F+LF   +
Sbjct: 157 TSLVGMYCKA--GLVEDGLKVFAYMPERNTYTWSTMVSGYATRGRVEEAIKVFNLFLREK 216

Query: 172 SRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFE 231
             G   + V   ++L + A    + LG+++H  +++      V + N+L+ MYSKC +  
Sbjct: 217 EEGSDSDYV-FTAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALVTMYSKCESLN 276

Query: 232 ASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAAVL 291
            + ++   +  ++ ++W  +     QN    +  K FSRM S G +  E  ++ ++ A  
Sbjct: 277 EACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACS 336

Query: 292 LVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAWSA 351
            +  L  G  +H + LK GF   +   T L+ MYAK G L  A+  FD L ++D+  W++
Sbjct: 337 DICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTS 396

Query: 352 MISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAHITKWG 411
           +IS Y    D   A+  ++ M++    PN+ T  S++ ACSSL   ELG+ +  H  K G
Sbjct: 397 LISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHG 456

Query: 412 YSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEALNTFS 471
           +     + SAL   Y   G ++ G  VF     KDV+ W+AMI G   N  G+EAL  F 
Sbjct: 457 FGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWNAMISGLSHNGQGDEALELFE 516

Query: 472 DMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLVRQ 531
           +ML+ G++P+ V F++++SAC++ G  + GW +F+ M D+ G+ P V HYAC+VDLL R 
Sbjct: 517 EMLAEGMEPDDVTFVNIISACSHKGFVERGWFYFNMMSDQIGLDPKVDHYACMVDLLSRA 576

Query: 532 GKIREAVEFVK 537
           G+++EA EF++
Sbjct: 577 GQLKEAKEFIE 584

BLAST of Lsi05G020360 vs. Swiss-Prot
Match: PP398_ARATH (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 274.2 bits (700), Expect = 2.9e-72
Identity = 153/469 (32.62%), Positives = 256/469 (54.58%), Query Frame = 1

Query: 71  PNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDI-GIVWKL 130
           P+  T  +VI+A  +L  + +  +IH  V+K G+  DV V+++L+G Y+  ++     ++
Sbjct: 105 PDSFTFPNVIKAYGALGREFLGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQV 164

Query: 131 FYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVL 190
           F ++P +DV  W+ +IS   ++G+  +A +LF  M+S G +PN VS+   + AC+    L
Sbjct: 165 FDEMPERDVASWNTVISCFYQSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWL 224

Query: 191 SLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHAC 250
             GKE+H   +++ F     + ++L+DMY KC   E +  V +   +K +V+W  +    
Sbjct: 225 ERGKEIHRKCVKKGFELDEYVNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGY 284

Query: 251 IQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFI 310
           +         +  +RM   G    +T +  I+ A      LL G  +H Y ++      I
Sbjct: 285 VAKGDSKSCVEILNRMIIEGTRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADI 344

Query: 311 SVGTELLQMYAKFGDLGLAKLVFDELVDKDII-AWSAMISAYSHGEDPLNAIQTFKTMQS 370
            V   L+ +Y K G+  LA+ VF +   KD+  +W+ MIS+Y    +   A++ +  M S
Sbjct: 345 YVNCSLIDLYFKCGEANLAETVFSK-TQKDVAESWNVMISSYISVGNWFKAVEVYDQMVS 404

Query: 371 TNEKPNEITFVSLMNACSSLGAQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKL 430
              KP+ +TF S++ ACS L A E G+ I   I++    ++  L+SAL+D Y   G  K 
Sbjct: 405 VGVKPDVVTFTSVLPACSQLAALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKE 464

Query: 431 GKHVFDEISTKDVICWSAMIKGYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACAN 490
              +F+ I  KDV+ W+ MI  YG +    EAL  F +M  +GLKP+GV  +++LSAC +
Sbjct: 465 AFRIFNSIPKKDVVSWTVMISAYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGH 524

Query: 491 CGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLVRQGKIREAVEFVKK 538
            GL  EG  +F  M  KYGI P + HY+C++D+L R G++ EA E +++
Sbjct: 525 AGLIDEGLKFFSQMRSKYGIEPIIEHYSCMIDILGRAGRLLEAYEIIQQ 572

BLAST of Lsi05G020360 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 271.2 bits (692), Expect = 2.5e-71
Identity = 160/509 (31.43%), Positives = 266/509 (52.26%), Query Frame = 1

Query: 33  VRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMI 92
           V+     ++N   + L +      +   FK M S+G   +  T   V ++  SL      
Sbjct: 155 VKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG 214

Query: 93  EVIHGGVIKMGFESDVAVSTALLGFYSMRD-IGIVWKLFYQIPYKDVVLWSAMISACVKN 152
           E +HG ++K GF    +V  +L+ FY     +    K+F ++  +DV+ W+++I+  V N
Sbjct: 215 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSN 274

Query: 153 GQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQ 212
           G   +   +F +M   G++ +  +IVS+   CAD  ++SLG+ +H+  ++  F       
Sbjct: 275 GLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFC 334

Query: 213 NSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFE 272
           N+L+DMYSKC + +++  V +    + +VS+  +     +     +  K F  M   G  
Sbjct: 335 NTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 394

Query: 273 ---LGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLA 332
                 T +L+  A   L+DE   G  VH +  +      I V   L+ MYAK G +  A
Sbjct: 395 PDVYTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEA 454

Query: 333 KLVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTM-QSTNEKPNEITFVSLMNACSS 392
           +LVF E+  KDII+W+ +I  YS       A+  F  + +     P+E T   ++ AC+S
Sbjct: 455 ELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACAS 514

Query: 393 LGAQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAM 452
           L A + G  I  +I + GY S+ H+ ++LVD Y   G + L   +FD+I++KD++ W+ M
Sbjct: 515 LSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVM 574

Query: 453 IKGYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYG 512
           I GYGM+  G EA+  F+ M   G++ + + F+SLL AC++ GL  EGW +F+ M  +  
Sbjct: 575 IAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECK 634

Query: 513 ITPTVAHYACIVDLLVRQGKIREAVEFVK 537
           I PTV HYACIVD+L R G + +A  F++
Sbjct: 635 IEPTVEHYACIVDMLARTGDLIKAYRFIE 660

BLAST of Lsi05G020360 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 1.2e-70
Identity = 140/442 (31.67%), Positives = 249/442 (56.33%), Query Frame = 1

Query: 95  IHGGVIKMGFESDVAVSTALLGFYSM-RDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQ 154
           IHG ++K GF  D+   T L   Y+  R +    K+F ++P +D+V W+ +++   +NG 
Sbjct: 157 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGM 216

Query: 155 YNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNS 214
              A ++ + M    ++P+ ++IVS+LPA +   ++S+GKE+H ++MR  F S+VNI  +
Sbjct: 217 ARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTA 276

Query: 215 LMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELG 274
           L+DMY+KC + E + ++     ++++VSW  +  A +QN  P +    F +M   G +  
Sbjct: 277 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 336

Query: 275 ETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFD 334
           +  ++  + A   + +L  G  +H  +++ G    +SV   L+ MY K  ++  A  +F 
Sbjct: 337 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 396

Query: 335 ELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQEL 394
           +L  + +++W+AMI  ++    P++A+  F  M+S   KP+  T+VS++ A + L     
Sbjct: 397 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456

Query: 395 GESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGM 454
            + I   + +     N  + +ALVD Y   G I + + +FD +S + V  W+AMI GYG 
Sbjct: 457 AKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGT 516

Query: 455 NACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVA 514
           +  G  AL  F +M    +KPNGV F+S++SAC++ GL + G   F+ M + Y I  ++ 
Sbjct: 517 HGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMD 576

Query: 515 HYACIVDLLVRQGKIREAVEFV 536
           HY  +VDLL R G++ EA +F+
Sbjct: 577 HYGAMVDLLGRAGRLNEAWDFI 598

BLAST of Lsi05G020360 vs. TrEMBL
Match: A0A0A0L489_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G122420 PE=4 SV=1)

HSP 1 Score: 866.7 bits (2238), Expect = 1.5e-248
Identity = 431/509 (84.68%), Positives = 462/509 (90.77%), Query Frame = 1

Query: 30  KLYVRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWD 89
           + Y +  D   +  K + L R  QPESAF FFKMMFSNG RPN+VTMLSVIRAID+LSWD
Sbjct: 43  EFYAQREDVISWTSKITNLVRTGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWD 102

Query: 90  SMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACV 149
           SMIEV+HG VIKMGFES+VAVSTALLGFYS+RDI  VWKLF QIP KDVVLWSA+IS CV
Sbjct: 103 SMIEVMHGVVIKMGFESEVAVSTALLGFYSIRDIETVWKLFNQIPSKDVVLWSAIISVCV 162

Query: 150 KNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVN 209
           KNGQYNEAFDL REMQ +GVQPNQV+IVSILPACADFGVLSLGKELHAFSMRRDFYSMV+
Sbjct: 163 KNGQYNEAFDLLREMQDQGVQPNQVTIVSILPACADFGVLSLGKELHAFSMRRDFYSMVD 222

Query: 210 IQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG 269
           +QNSLMDMYSKCR FEAS+RVLKL RKKD VSW+IITHACIQNNCPSKVFK FSRMRSFG
Sbjct: 223 LQNSLMDMYSKCRKFEASIRVLKLMRKKDAVSWKIITHACIQNNCPSKVFKIFSRMRSFG 282

Query: 270 FELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAK 329
           FEL ETMMLD+I+AVLL+DELLLGLAVHCYALKGGFLCFI VGTELLQMYAKFGDL LAK
Sbjct: 283 FELSETMMLDMISAVLLLDELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAK 342

Query: 330 LVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLG 389
           LVFD LVDKDIIAWSAMISAYSHGEDPLNAIQTFK MQSTNEKPNE TFVSLM+ACSSLG
Sbjct: 343 LVFDGLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLG 402

Query: 390 AQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIK 449
           A+ELGE+IQAH  K GY+SNTHLMSALV FYC LGRIKLG+HVFDEIS KDVICW+A+IK
Sbjct: 403 AKELGETIQAHTIKCGYTSNTHLMSALVGFYCKLGRIKLGEHVFDEISRKDVICWNALIK 462

Query: 450 GYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGIT 509
           GYG+N CGN+ALNTFSDMLSYGLKPNGVVF SLLSACA CGLEKE  +WF SM D+YGIT
Sbjct: 463 GYGLNGCGNKALNTFSDMLSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMNDEYGIT 522

Query: 510 PTVAHYACIVDLLVRQGKIREAVEFVKKC 539
           PT+AHYACIVDLLVRQGKIREAVEFVKKC
Sbjct: 523 PTMAHYACIVDLLVRQGKIREAVEFVKKC 551

BLAST of Lsi05G020360 vs. TrEMBL
Match: A5C4V9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_040403 PE=4 SV=1)

HSP 1 Score: 609.8 bits (1571), Expect = 3.3e-171
Identity = 295/497 (59.36%), Positives = 375/497 (75.45%), Query Frame = 1

Query: 41  FNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVI 100
           +  K S L + +Q E A   FKMM    QRPNHVT+LSVIRAI  L  + M+ VI G VI
Sbjct: 57  WTSKISSLVKQNQSELAVGLFKMMLMTEQRPNHVTVLSVIRAISGLGLEDMMRVICGSVI 116

Query: 101 KMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDL 160
           K+GFES+V+V+TAL+GFYS  D+GIVWK+F Q P KD+VLWSAM+SACVK+GQY EAF++
Sbjct: 117 KLGFESEVSVATALIGFYSDYDMGIVWKIFNQTPIKDLVLWSAMVSACVKSGQYGEAFEI 176

Query: 161 FREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSK 220
           FR MQ  GV+PN VSIVSILPACA+ G L  GKE+H FS+++ F+ + N+ NSL+DMY+K
Sbjct: 177 FRAMQYDGVEPNHVSIVSILPACANVGALLFGKEIHGFSIKKMFHPLTNVHNSLVDMYAK 236

Query: 221 CRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDI 280
           CRNF+AS+ V     +KD++SW  I   CI+N+CP + FK FSRM+   F   ET++ D+
Sbjct: 237 CRNFKASMLVFDQILEKDLISWTTIIRGCIENDCPREAFKAFSRMQFSCFGADETIVQDL 296

Query: 281 IAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDI 340
           I A++  DE   G+A H + LK G L F+S+GT LLQMYAKFG+L  A +VFD+L  KD 
Sbjct: 297 IVAIIQADEHKFGIAFHGFLLKNGLLAFVSIGTALLQMYAKFGELESAIIVFDQLNKKDY 356

Query: 341 IAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAH 400
           I+WSAMIS ++H   P NA++TFK MQST+E+PNEITFVSL+ ACS +GAQELGESIQAH
Sbjct: 357 ISWSAMISVHAHSRHPYNALETFKQMQSTDERPNEITFVSLLQACSLIGAQELGESIQAH 416

Query: 401 ITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEA 460
            TK GY SN  L SAL+D YC  GRI  G+ +F+EI TKD++CWS+MI GYG+N CG+EA
Sbjct: 417 ATKAGYLSNAFLSSALIDLYCKFGRINQGRAIFNEIPTKDLVCWSSMINGYGLNGCGDEA 476

Query: 461 LNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVD 520
           L TFS+ML+ G+KPN VVFIS+LSAC++CGLE EGW  F SM  KYGI P + HYAC+VD
Sbjct: 477 LETFSNMLACGVKPNEVVFISVLSACSHCGLEHEGWSCFSSMEQKYGIIPKLPHYACMVD 536

Query: 521 LLVRQGKIREAVEFVKK 538
           L+ R+G I  A++FV K
Sbjct: 537 LISRRGNIEGALQFVNK 553

BLAST of Lsi05G020360 vs. TrEMBL
Match: A0A061DV61_THECC (Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao GN=TCM_005333 PE=4 SV=1)

HSP 1 Score: 543.1 bits (1398), Expect = 3.8e-151
Identity = 268/497 (53.92%), Positives = 354/497 (71.23%), Query Frame = 1

Query: 46   SVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFE 105
            S L R  QPE A   FK M  + QRPN+VT+LS+++A D+L W+++  ++HG VIKMGFE
Sbjct: 710  SKLVRQGQPEEAIGLFKTMLMSNQRPNYVTILSLVKAFDTLDWEALRMMVHGLVIKMGFE 769

Query: 106  SDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREMQ 165
            S+ +V TAL+G YS+  +G+ W LF QIP KDVVL SAM+SACVKNG Y EA +LFR MQ
Sbjct: 770  SEPSVLTALIGSYSVYGMGVCWSLFNQIPNKDVVLRSAMVSACVKNGDYVEALELFRRMQ 829

Query: 166  SRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFE 225
              G++ N VSIVSILPACA+ G L LG+E+H F +RR    +  +QNSL+DMY+KCR+ +
Sbjct: 830  VLGLKANHVSIVSILPACANLGALQLGREIHGFIIRRMICYVNTVQNSLVDMYAKCRSLQ 889

Query: 226  ASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG-FELGETMMLDIIAAV 285
             ++ V     KKD+VSWR +    ++N C  K    FS+M+    F L E ++ D+I AV
Sbjct: 890  TAICVFNGMLKKDLVSWRTLIRGYVENECGIKALDAFSKMQRLSFFALDEFVVRDMIMAV 949

Query: 286  LLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAWS 345
            L   E  +G A HCY LK GFL F+S+ T LLQMYAKF  +  A+ VFD + +KD+IAW+
Sbjct: 950  LQSGESKIGSAFHCYILKTGFLAFVSIATALLQMYAKFSMVASARNVFDHISNKDVIAWN 1009

Query: 346  AMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQE----LGESIQAH 405
            AMISAY+    P NAI TF+ M   NEKP+E + VSL+  CS + +QE    +GE+I A 
Sbjct: 1010 AMISAYAQTGLPFNAINTFRQMLLMNEKPSEFSLVSLLQICSLMASQEVSDKVGETIHAF 1069

Query: 406  ITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEA 465
            + K GYS N +L SAL+DFYC  GR+K GK +FDE+ TKD+ICWS+MI GY +N  G EA
Sbjct: 1070 VAKVGYSRNVYLSSALIDFYCRFGRVKQGKALFDEVPTKDLICWSSMINGYVLNGYGIEA 1129

Query: 466  LNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVD 525
            L TF++ML  G+KPN ++F+S+LSAC++CGL+ EGW WF+SM +KYGITP +AHYAC+VD
Sbjct: 1130 LETFANMLDCGIKPNDIIFLSVLSACSHCGLKNEGWNWFYSMKEKYGITPKLAHYACMVD 1189

Query: 526  LLVRQGKIREAVEFVKK 538
            LL RQG I +A+ FVKK
Sbjct: 1190 LLSRQGHIEQALHFVKK 1206

BLAST of Lsi05G020360 vs. TrEMBL
Match: A0A0D2RE34_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G121600 PE=4 SV=1)

HSP 1 Score: 536.6 bits (1381), Expect = 3.5e-149
Identity = 266/504 (52.78%), Positives = 356/504 (70.63%), Query Frame = 1

Query: 41  FNQKASVLRRPSQPESAFAFFKMMF--SNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGG 100
           +  K S L +  QPE A   FK M    + QRPN+VT+LS+I+A+D+L WD ++ ++HG 
Sbjct: 54  WTSKLSKLVKQGQPEEAICLFKRMLLLKSNQRPNYVTILSLIKALDALHWDVLVMMVHGL 113

Query: 101 VIKMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAF 160
           V+KMGF S+ +V TAL+G YS+  +G  W LF+QI  KDVVLWSA++ ACVKN  Y EA 
Sbjct: 114 VVKMGFISEPSVLTALIGSYSVYGMGTCWSLFHQIRDKDVVLWSAVVYACVKNKDYLEAL 173

Query: 161 DLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMY 220
           +LFR MQ  G++ N VSIVSILPACA+ G L LG+E+H F ++R F  ++++QNSL+DMY
Sbjct: 174 ELFRRMQFIGLKVNHVSIVSILPACANLGALRLGREIHGFIIKRMFSHVISVQNSLVDMY 233

Query: 221 SKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG-FELGETMM 280
           +KCRN E+ +RV     +KD+VSWR +    I+N    +    FS+M+    F   E ++
Sbjct: 234 AKCRNLESGIRVFDGMLEKDLVSWRTVIRGYIENEFGIEAINIFSKMQLLSFFAPDEFVV 293

Query: 281 LDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVD 340
            D+I AVL   E  LG A HCY +K GFL F+SV T LLQMYAKFG +  A+ VFD + +
Sbjct: 294 RDMIMAVLQSGENKLGSAFHCYIMKNGFLAFVSVATALLQMYAKFGMVCSARSVFDHIGN 353

Query: 341 KDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQ----EL 400
           KD+IAW+AMISAY+  + P NA+ TF  M   N KPNE + +SL+  CS + +Q    EL
Sbjct: 354 KDVIAWNAMISAYTQSKLPFNAVDTFTQMLHMNAKPNEFSLISLLQMCSLMASQEVSHEL 413

Query: 401 GESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGM 460
           G+SI A I K GYS N +L SAL+DFYC  GR+K GK +FDE+  KD+ICWS++I GYG+
Sbjct: 414 GDSIHAFIEKVGYSRNVYLSSALIDFYCRSGRVKQGKALFDEVPVKDLICWSSLINGYGL 473

Query: 461 NACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVA 520
           N  G EAL TFS+ML  G+KPN ++F+S+LSAC++CGLE EGW WF+SM +KY +TP +A
Sbjct: 474 NGYGIEALETFSNMLDCGIKPNEIIFLSVLSACSHCGLEYEGWNWFYSMKEKYNVTPKLA 533

Query: 521 HYACIVDLLVRQGKIREAVEFVKK 538
           HYAC+VDLL RQG I +A++FVKK
Sbjct: 534 HYACMVDLLSRQGNIEQALDFVKK 557

BLAST of Lsi05G020360 vs. TrEMBL
Match: M0ZG59_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400000039 PE=4 SV=1)

HSP 1 Score: 532.3 bits (1370), Expect = 6.7e-148
Identity = 261/506 (51.58%), Positives = 355/506 (70.16%), Query Frame = 1

Query: 30  KLYVRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWD 89
           K   + + R + +Q +S++R  +Q   A   FK M  +  +PNHVT+LSVIRA +   W 
Sbjct: 36  KFAEKQNVRAWTSQISSLVRE-NQSIEAINLFKTMLKDEHKPNHVTVLSVIRAAEK--WQ 95

Query: 90  SMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACV 149
            M+  IHG  IKMGFE ++ V TAL+G Y++ D+   W+LF     KDV+LWSAM SACV
Sbjct: 96  PMVRGIHGFTIKMGFEIELPVVTALVGVYAIWDMDTAWQLFNHTKEKDVILWSAMASACV 155

Query: 150 KNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVN 209
           K+G+Y EA +LFREMQ  GV+PN VSIV I+PACA+ G LS+GKE+HA+S++    S VN
Sbjct: 156 KSGEYVEAIELFREMQLCGVEPNYVSIVGIVPACANLGALSIGKEIHAYSIKVSSISHVN 215

Query: 210 IQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG 269
           IQNSL+DMY+KC + +AS+ V +   KKD+VSWR + H C++N C ++    FS MR   
Sbjct: 216 IQNSLVDMYAKCGSLKASITVFRGIEKKDLVSWRSMIHGCVENGCFNEALSLFSEMRYCC 275

Query: 270 FELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAK 329
           FE  E ++ ++I A+  +DE+ +G   H +ALK GFL  +SV T LL +Y  FGD+  A+
Sbjct: 276 FEPDEGVIREVIGALSQLDEIKIGQCFHSFALKQGFLGCVSVVTALLHIYGGFGDIESAR 335

Query: 330 LVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLG 389
            +FD L  KD+IAWS MI+AY+  E P NA+  ++ MQS NEKPNEI +VSL+ ACSS+ 
Sbjct: 336 SLFDPLKSKDLIAWSTMIAAYAQSECPSNALDIYRQMQSANEKPNEIIYVSLIQACSSIA 395

Query: 390 AQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIK 449
           A+ +GE + A + K G +SN  L+S+L+D YC  GRI  G+ +F E    D+ICWS+MI 
Sbjct: 396 AEVIGEGVHAQVIKLGNTSNAFLISSLIDMYCRFGRISQGQAIFSECPNVDLICWSSMIN 455

Query: 450 GYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGIT 509
           GYG+N  GNEAL  FSDML+ G+KPN VVF+S+LSAC++CGLE EGW WFH+M +++G+T
Sbjct: 456 GYGINGHGNEALQCFSDMLNSGIKPNDVVFVSVLSACSHCGLEYEGWNWFHAMEEQFGVT 515

Query: 510 PTVAHYACIVDLLVRQGKIREAVEFV 536
           P +AHYAC+VD+L RQG I EA EFV
Sbjct: 516 PKLAHYACMVDMLSRQGNIEEAFEFV 538

BLAST of Lsi05G020360 vs. TAIR10
Match: AT4G30700.1 (AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 276.6 bits (706), Expect = 3.3e-74
Identity = 155/493 (31.44%), Positives = 261/493 (52.94%), Query Frame = 1

Query: 46  SVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFE 105
           SV   P    S FA  +   S   +PN  T    I A      D    VIHG  +  G +
Sbjct: 94  SVNESPHSSLSVFAHLRK--STDLKPNSSTYAFAISAASGFRDDRAGRVIHGQAVVDGCD 153

Query: 106 SDVAVSTALLGFY-SMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREM 165
           S++ + + ++  Y     +    K+F ++P KD +LW+ MIS   KN  Y E+  +FR++
Sbjct: 154 SELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNTMISGYRKNEMYVESIQVFRDL 213

Query: 166 QSRG-VQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRN 225
            +    + +  +++ ILPA A+   L LG ++H+ + +   YS   +    + +YSKC  
Sbjct: 214 INESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGK 273

Query: 226 FEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAA 285
            +    + +  RK D+V++  + H    N         F  +   G  L  + ++ ++  
Sbjct: 274 IKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVP- 333

Query: 286 VLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAW 345
             +   L+L  A+H Y LK  FL   SV T L  +Y+K  ++  A+ +FDE  +K + +W
Sbjct: 334 --VSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSW 393

Query: 346 SAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAHITK 405
           +AMIS Y+      +AI  F+ MQ +   PN +T   +++AC+ LGA  LG+ +   +  
Sbjct: 394 NAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRS 453

Query: 406 WGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEALNT 465
             + S+ ++ +AL+  Y   G I   + +FD ++ K+ + W+ MI GYG++  G EALN 
Sbjct: 454 TDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNI 513

Query: 466 FSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLV 525
           F +ML+ G+ P  V F+ +L AC++ GL KEG   F+SMI +YG  P+V HYAC+VD+L 
Sbjct: 514 FYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILG 573

Query: 526 RQGKIREAVEFVK 537
           R G ++ A++F++
Sbjct: 574 RAGHLQRALQFIE 581

BLAST of Lsi05G020360 vs. TAIR10
Match: AT2G33680.1 (AT2G33680.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 276.2 bits (705), Expect = 4.4e-74
Identity = 157/491 (31.98%), Positives = 259/491 (52.75%), Query Frame = 1

Query: 52  SQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFESDVAVS 111
           S   +    F+ M +    PN  T+  + +A  SL   ++    H  V+KM    D+ V 
Sbjct: 97  SSSYTVMQLFREMRAQDILPNAYTLAGIFKAESSLQSSTVGRQAHALVVKMSSFGDIYVD 156

Query: 112 TALLGFYSMRDIGIV---WKLFYQIPYKDVVLWSAMISACVKNGQYNEA---FDLFREMQ 171
           T+L+G Y     G+V    K+F  +P ++   WS M+S     G+  EA   F+LF   +
Sbjct: 157 TSLVGMYCKA--GLVEDGLKVFAYMPERNTYTWSTMVSGYATRGRVEEAIKVFNLFLREK 216

Query: 172 SRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFE 231
             G   + V   ++L + A    + LG+++H  +++      V + N+L+ MYSKC +  
Sbjct: 217 EEGSDSDYV-FTAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALVTMYSKCESLN 276

Query: 232 ASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAAVL 291
            + ++   +  ++ ++W  +     QN    +  K FSRM S G +  E  ++ ++ A  
Sbjct: 277 EACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACS 336

Query: 292 LVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAWSA 351
            +  L  G  +H + LK GF   +   T L+ MYAK G L  A+  FD L ++D+  W++
Sbjct: 337 DICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTS 396

Query: 352 MISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAHITKWG 411
           +IS Y    D   A+  ++ M++    PN+ T  S++ ACSSL   ELG+ +  H  K G
Sbjct: 397 LISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHG 456

Query: 412 YSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEALNTFS 471
           +     + SAL   Y   G ++ G  VF     KDV+ W+AMI G   N  G+EAL  F 
Sbjct: 457 FGLEVPIGSALSTMYSKCGSLEDGNLVFRRTPNKDVVSWNAMISGLSHNGQGDEALELFE 516

Query: 472 DMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLVRQ 531
           +ML+ G++P+ V F++++SAC++ G  + GW +F+ M D+ G+ P V HYAC+VDLL R 
Sbjct: 517 EMLAEGMEPDDVTFVNIISACSHKGFVERGWFYFNMMSDQIGLDPKVDHYACMVDLLSRA 576

Query: 532 GKIREAVEFVK 537
           G+++EA EF++
Sbjct: 577 GQLKEAKEFIE 584

BLAST of Lsi05G020360 vs. TAIR10
Match: AT5G27110.1 (AT5G27110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 274.2 bits (700), Expect = 1.7e-73
Identity = 153/469 (32.62%), Positives = 256/469 (54.58%), Query Frame = 1

Query: 71  PNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDI-GIVWKL 130
           P+  T  +VI+A  +L  + +  +IH  V+K G+  DV V+++L+G Y+  ++     ++
Sbjct: 105 PDSFTFPNVIKAYGALGREFLGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQV 164

Query: 131 FYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVL 190
           F ++P +DV  W+ +IS   ++G+  +A +LF  M+S G +PN VS+   + AC+    L
Sbjct: 165 FDEMPERDVASWNTVISCFYQSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWL 224

Query: 191 SLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHAC 250
             GKE+H   +++ F     + ++L+DMY KC   E +  V +   +K +V+W  +    
Sbjct: 225 ERGKEIHRKCVKKGFELDEYVNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGY 284

Query: 251 IQNNCPSKVFKFFSRMRSFGFELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFI 310
           +         +  +RM   G    +T +  I+ A      LL G  +H Y ++      I
Sbjct: 285 VAKGDSKSCVEILNRMIIEGTRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADI 344

Query: 311 SVGTELLQMYAKFGDLGLAKLVFDELVDKDII-AWSAMISAYSHGEDPLNAIQTFKTMQS 370
            V   L+ +Y K G+  LA+ VF +   KD+  +W+ MIS+Y    +   A++ +  M S
Sbjct: 345 YVNCSLIDLYFKCGEANLAETVFSK-TQKDVAESWNVMISSYISVGNWFKAVEVYDQMVS 404

Query: 371 TNEKPNEITFVSLMNACSSLGAQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKL 430
              KP+ +TF S++ ACS L A E G+ I   I++    ++  L+SAL+D Y   G  K 
Sbjct: 405 VGVKPDVVTFTSVLPACSQLAALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKE 464

Query: 431 GKHVFDEISTKDVICWSAMIKGYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACAN 490
              +F+ I  KDV+ W+ MI  YG +    EAL  F +M  +GLKP+GV  +++LSAC +
Sbjct: 465 AFRIFNSIPKKDVVSWTVMISAYGSHGQPREALYQFDEMQKFGLKPDGVTLLAVLSACGH 524

Query: 491 CGLEKEGWIWFHSMIDKYGITPTVAHYACIVDLLVRQGKIREAVEFVKK 538
            GL  EG  +F  M  KYGI P + HY+C++D+L R G++ EA E +++
Sbjct: 525 AGLIDEGLKFFSQMRSKYGIEPIIEHYSCMIDILGRAGRLLEAYEIIQQ 572

BLAST of Lsi05G020360 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 271.2 bits (692), Expect = 1.4e-72
Identity = 160/509 (31.43%), Positives = 266/509 (52.26%), Query Frame = 1

Query: 33  VRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMI 92
           V+     ++N   + L +      +   FK M S+G   +  T   V ++  SL      
Sbjct: 155 VKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG 214

Query: 93  EVIHGGVIKMGFESDVAVSTALLGFYSMRD-IGIVWKLFYQIPYKDVVLWSAMISACVKN 152
           E +HG ++K GF    +V  +L+ FY     +    K+F ++  +DV+ W+++I+  V N
Sbjct: 215 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSN 274

Query: 153 GQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQ 212
           G   +   +F +M   G++ +  +IVS+   CAD  ++SLG+ +H+  ++  F       
Sbjct: 275 GLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFC 334

Query: 213 NSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFE 272
           N+L+DMYSKC + +++  V +    + +VS+  +     +     +  K F  M   G  
Sbjct: 335 NTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 394

Query: 273 ---LGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLA 332
                 T +L+  A   L+DE   G  VH +  +      I V   L+ MYAK G +  A
Sbjct: 395 PDVYTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEA 454

Query: 333 KLVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTM-QSTNEKPNEITFVSLMNACSS 392
           +LVF E+  KDII+W+ +I  YS       A+  F  + +     P+E T   ++ AC+S
Sbjct: 455 ELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACAS 514

Query: 393 LGAQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAM 452
           L A + G  I  +I + GY S+ H+ ++LVD Y   G + L   +FD+I++KD++ W+ M
Sbjct: 515 LSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVM 574

Query: 453 IKGYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYG 512
           I GYGM+  G EA+  F+ M   G++ + + F+SLL AC++ GL  EGW +F+ M  +  
Sbjct: 575 IAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECK 634

Query: 513 ITPTVAHYACIVDLLVRQGKIREAVEFVK 537
           I PTV HYACIVD+L R G + +A  F++
Sbjct: 635 IEPTVEHYACIVDMLARTGDLIKAYRFIE 660

BLAST of Lsi05G020360 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 268.9 bits (686), Expect = 7.0e-72
Identity = 140/442 (31.67%), Positives = 249/442 (56.33%), Query Frame = 1

Query: 95  IHGGVIKMGFESDVAVSTALLGFYSM-RDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQ 154
           IHG ++K GF  D+   T L   Y+  R +    K+F ++P +D+V W+ +++   +NG 
Sbjct: 157 IHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGM 216

Query: 155 YNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNS 214
              A ++ + M    ++P+ ++IVS+LPA +   ++S+GKE+H ++MR  F S+VNI  +
Sbjct: 217 ARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTA 276

Query: 215 LMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELG 274
           L+DMY+KC + E + ++     ++++VSW  +  A +QN  P +    F +M   G +  
Sbjct: 277 LVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPT 336

Query: 275 ETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFD 334
           +  ++  + A   + +L  G  +H  +++ G    +SV   L+ MY K  ++  A  +F 
Sbjct: 337 DVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFG 396

Query: 335 ELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQEL 394
           +L  + +++W+AMI  ++    P++A+  F  M+S   KP+  T+VS++ A + L     
Sbjct: 397 KLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH 456

Query: 395 GESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGM 454
            + I   + +     N  + +ALVD Y   G I + + +FD +S + V  W+AMI GYG 
Sbjct: 457 AKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGT 516

Query: 455 NACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVA 514
           +  G  AL  F +M    +KPNGV F+S++SAC++ GL + G   F+ M + Y I  ++ 
Sbjct: 517 HGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMD 576

Query: 515 HYACIVDLLVRQGKIREAVEFV 536
           HY  +VDLL R G++ EA +F+
Sbjct: 577 HYGAMVDLLGRAGRLNEAWDFI 598

BLAST of Lsi05G020360 vs. NCBI nr
Match: gi|700201385|gb|KGN56518.1| (hypothetical protein Csa_3G122420 [Cucumis sativus])

HSP 1 Score: 866.7 bits (2238), Expect = 2.1e-248
Identity = 431/509 (84.68%), Positives = 462/509 (90.77%), Query Frame = 1

Query: 30  KLYVRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWD 89
           + Y +  D   +  K + L R  QPESAF FFKMMFSNG RPN+VTMLSVIRAID+LSWD
Sbjct: 43  EFYAQREDVISWTSKITNLVRTGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWD 102

Query: 90  SMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACV 149
           SMIEV+HG VIKMGFES+VAVSTALLGFYS+RDI  VWKLF QIP KDVVLWSA+IS CV
Sbjct: 103 SMIEVMHGVVIKMGFESEVAVSTALLGFYSIRDIETVWKLFNQIPSKDVVLWSAIISVCV 162

Query: 150 KNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVN 209
           KNGQYNEAFDL REMQ +GVQPNQV+IVSILPACADFGVLSLGKELHAFSMRRDFYSMV+
Sbjct: 163 KNGQYNEAFDLLREMQDQGVQPNQVTIVSILPACADFGVLSLGKELHAFSMRRDFYSMVD 222

Query: 210 IQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG 269
           +QNSLMDMYSKCR FEAS+RVLKL RKKD VSW+IITHACIQNNCPSKVFK FSRMRSFG
Sbjct: 223 LQNSLMDMYSKCRKFEASIRVLKLMRKKDAVSWKIITHACIQNNCPSKVFKIFSRMRSFG 282

Query: 270 FELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAK 329
           FEL ETMMLD+I+AVLL+DELLLGLAVHCYALKGGFLCFI VGTELLQMYAKFGDL LAK
Sbjct: 283 FELSETMMLDMISAVLLLDELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAK 342

Query: 330 LVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLG 389
           LVFD LVDKDIIAWSAMISAYSHGEDPLNAIQTFK MQSTNEKPNE TFVSLM+ACSSLG
Sbjct: 343 LVFDGLVDKDIIAWSAMISAYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLG 402

Query: 390 AQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIK 449
           A+ELGE+IQAH  K GY+SNTHLMSALV FYC LGRIKLG+HVFDEIS KDVICW+A+IK
Sbjct: 403 AKELGETIQAHTIKCGYTSNTHLMSALVGFYCKLGRIKLGEHVFDEISRKDVICWNALIK 462

Query: 450 GYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGIT 509
           GYG+N CGN+ALNTFSDMLSYGLKPNGVVF SLLSACA CGLEKE  +WF SM D+YGIT
Sbjct: 463 GYGLNGCGNKALNTFSDMLSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMNDEYGIT 522

Query: 510 PTVAHYACIVDLLVRQGKIREAVEFVKKC 539
           PT+AHYACIVDLLVRQGKIREAVEFVKKC
Sbjct: 523 PTMAHYACIVDLLVRQGKIREAVEFVKKC 551

BLAST of Lsi05G020360 vs. NCBI nr
Match: gi|659077727|ref|XP_008439351.1| (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g01580 [Cucumis melo])

HSP 1 Score: 854.0 bits (2205), Expect = 1.4e-244
Identity = 427/507 (84.22%), Positives = 458/507 (90.34%), Query Frame = 1

Query: 30  KLYVRSHDRCYFNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWD 89
           + Y +  D   +  K + L R  QPESAF FFKMMFSNG RPN+VTMLSVIRAID+LSWD
Sbjct: 43  EFYAQREDVISWTSKITNLVRAGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWD 102

Query: 90  SMIEVIHGGVIKMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACV 149
           SMIEV+HG  IKMGFES+VAVSTALLGFYS+RDI  VWKLF QIP KDVV WSA+ISACV
Sbjct: 103 SMIEVMHGVTIKMGFESEVAVSTALLGFYSIRDIETVWKLFNQIPCKDVVFWSAIISACV 162

Query: 150 KNGQYNEAFDLFREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVN 209
           KNGQY+EAFDL REMQ +GVQPNQVSIVSILPACADFGVLSLGKELHAFSMR+DFYSMV+
Sbjct: 163 KNGQYSEAFDLLREMQDQGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRKDFYSMVD 222

Query: 210 IQNSLMDMYSKCRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG 269
           IQNSLMDMYSKCR FEAS++VLKL RKKD VSW+IITHACIQNN PS+VFK FSRMRS G
Sbjct: 223 IQNSLMDMYSKCRMFEASIKVLKLMRKKDAVSWKIITHACIQNNYPSEVFKIFSRMRSLG 282

Query: 270 FELGETMMLDIIAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAK 329
           FEL ETM+LD+I+AVLLVDELLLGLAVHCYALKGGFLCFI VGTELLQMYAKFGDL LAK
Sbjct: 283 FELSETMVLDMISAVLLVDELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAK 342

Query: 330 LVFDELVDKDIIAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLG 389
           LVFDELVDKDIIAWSAMIS YSHGEDPLNAIQTFK MQSTNEKPNE TFVSLM+ACSSLG
Sbjct: 343 LVFDELVDKDIIAWSAMISVYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLG 402

Query: 390 AQELGESIQAHITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIK 449
           A+ELGESIQAH  K GY+SNTHLMSALV FYCTLGRIKLG+HVFDEISTKD+ICW+AMIK
Sbjct: 403 AKELGESIQAHTIKCGYTSNTHLMSALVGFYCTLGRIKLGEHVFDEISTKDLICWNAMIK 462

Query: 450 GYGMNACGNEALNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGIT 509
           GYG+N CGN+ALNTFSDMLSYGLKPNGVVF SLLSACA CGLEKE  +WF SMIDKYGIT
Sbjct: 463 GYGLNGCGNKALNTFSDMLSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMIDKYGIT 522

Query: 510 PTVAHYACIVDLLVRQGKIREAVEFVK 537
           PT AHYACIVDLLVR+GKI EAVEFVK
Sbjct: 523 PTEAHYACIVDLLVRKGKIGEAVEFVK 549

BLAST of Lsi05G020360 vs. NCBI nr
Match: gi|147834193|emb|CAN75306.1| (hypothetical protein VITISV_040403 [Vitis vinifera])

HSP 1 Score: 609.8 bits (1571), Expect = 4.7e-171
Identity = 295/497 (59.36%), Positives = 375/497 (75.45%), Query Frame = 1

Query: 41  FNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVI 100
           +  K S L + +Q E A   FKMM    QRPNHVT+LSVIRAI  L  + M+ VI G VI
Sbjct: 57  WTSKISSLVKQNQSELAVGLFKMMLMTEQRPNHVTVLSVIRAISGLGLEDMMRVICGSVI 116

Query: 101 KMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDL 160
           K+GFES+V+V+TAL+GFYS  D+GIVWK+F Q P KD+VLWSAM+SACVK+GQY EAF++
Sbjct: 117 KLGFESEVSVATALIGFYSDYDMGIVWKIFNQTPIKDLVLWSAMVSACVKSGQYGEAFEI 176

Query: 161 FREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSK 220
           FR MQ  GV+PN VSIVSILPACA+ G L  GKE+H FS+++ F+ + N+ NSL+DMY+K
Sbjct: 177 FRAMQYDGVEPNHVSIVSILPACANVGALLFGKEIHGFSIKKMFHPLTNVHNSLVDMYAK 236

Query: 221 CRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDI 280
           CRNF+AS+ V     +KD++SW  I   CI+N+CP + FK FSRM+   F   ET++ D+
Sbjct: 237 CRNFKASMLVFDQILEKDLISWTTIIRGCIENDCPREAFKAFSRMQFSCFGADETIVQDL 296

Query: 281 IAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDI 340
           I A++  DE   G+A H + LK G L F+S+GT LLQMYAKFG+L  A +VFD+L  KD 
Sbjct: 297 IVAIIQADEHKFGIAFHGFLLKNGLLAFVSIGTALLQMYAKFGELESAIIVFDQLNKKDY 356

Query: 341 IAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAH 400
           I+WSAMIS ++H   P NA++TFK MQST+E+PNEITFVSL+ ACS +GAQELGESIQAH
Sbjct: 357 ISWSAMISVHAHSRHPYNALETFKQMQSTDERPNEITFVSLLQACSLIGAQELGESIQAH 416

Query: 401 ITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEA 460
            TK GY SN  L SAL+D YC  GRI  G+ +F+EI TKD++CWS+MI GYG+N CG+EA
Sbjct: 417 ATKAGYLSNAFLSSALIDLYCKFGRINQGRAIFNEIPTKDLVCWSSMINGYGLNGCGDEA 476

Query: 461 LNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVD 520
           L TFS+ML+ G+KPN VVFIS+LSAC++CGLE EGW  F SM  KYGI P + HYAC+VD
Sbjct: 477 LETFSNMLACGVKPNEVVFISVLSACSHCGLEHEGWSCFSSMEQKYGIIPKLPHYACMVD 536

Query: 521 LLVRQGKIREAVEFVKK 538
           L+ R+G I  A++FV K
Sbjct: 537 LISRRGNIEGALQFVNK 553

BLAST of Lsi05G020360 vs. NCBI nr
Match: gi|731421553|ref|XP_010661790.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Vitis vinifera])

HSP 1 Score: 609.8 bits (1571), Expect = 4.7e-171
Identity = 295/497 (59.36%), Positives = 375/497 (75.45%), Query Frame = 1

Query: 41  FNQKASVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVI 100
           +  K S L + +Q E A   FKMM    QRPNHVT+LSVIRAI  L  + M+ VI G VI
Sbjct: 57  WTSKISSLVKQNQSELAVGLFKMMLMTEQRPNHVTVLSVIRAISGLGLEDMMRVICGSVI 116

Query: 101 KMGFESDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDL 160
           K+GFES+V+V+TAL+GFYS  D+GIVWK+F Q P KD+VLWSAM+SACVK+GQY EAF++
Sbjct: 117 KLGFESEVSVATALIGFYSDYDMGIVWKIFNQTPIKDLVLWSAMVSACVKSGQYGEAFEI 176

Query: 161 FREMQSRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSK 220
           FR MQ  GV+PN VSIVSILPACA+ G L  GKE+H FS+++ F+ + N+ NSL+DMY+K
Sbjct: 177 FRAMQYDGVEPNHVSIVSILPACANVGALLFGKEIHGFSIKKMFHPLTNVHNSLVDMYAK 236

Query: 221 CRNFEASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFGFELGETMMLDI 280
           CRNF+AS+ V     +KD++SW  I   CI+N+CP + FK FSRM+   F   ET++ D+
Sbjct: 237 CRNFKASMLVFDQILEKDLISWTTIIRGCIENDCPREAFKAFSRMQFSCFGADETIVQDL 296

Query: 281 IAAVLLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDI 340
           I A++  DE   G+A H + LK G L F+S+GT LLQMYAKFG+L  A +VFD+L  KD 
Sbjct: 297 IVAIIQADEHKFGIAFHGFLLKNGLLAFVSIGTALLQMYAKFGELESAIIVFDQLNKKDY 356

Query: 341 IAWSAMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQELGESIQAH 400
           I+WSAMIS ++H   P NA++TFK MQST+E+PNEITFVSL+ ACS +GAQELGESIQAH
Sbjct: 357 ISWSAMISVHAHSRHPYNALETFKQMQSTDERPNEITFVSLLQACSLIGAQELGESIQAH 416

Query: 401 ITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEA 460
            TK GY SN  L SAL+D YC  GRI  G+ +F+EI TKD++CWS+MI GYG+N CG+EA
Sbjct: 417 ATKAGYLSNAFLSSALIDLYCKFGRINQGRAIFNEIPTKDLVCWSSMINGYGLNGCGDEA 476

Query: 461 LNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVD 520
           L TFS+ML+ G+KPN VVFIS+LSAC++CGLE EGW  F SM  KYGI P + HYAC+VD
Sbjct: 477 LETFSNMLACGVKPNEVVFISVLSACSHCGLEHEGWSCFSSMEQKYGIIPKLPHYACMVD 536

Query: 521 LLVRQGKIREAVEFVKK 538
           L+ R+G I  A++FV K
Sbjct: 537 LISRRGNIEGALQFVNK 553

BLAST of Lsi05G020360 vs. NCBI nr
Match: gi|590722123|ref|XP_007051809.1| (Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao])

HSP 1 Score: 543.1 bits (1398), Expect = 5.4e-151
Identity = 268/497 (53.92%), Positives = 354/497 (71.23%), Query Frame = 1

Query: 46   SVLRRPSQPESAFAFFKMMFSNGQRPNHVTMLSVIRAIDSLSWDSMIEVIHGGVIKMGFE 105
            S L R  QPE A   FK M  + QRPN+VT+LS+++A D+L W+++  ++HG VIKMGFE
Sbjct: 710  SKLVRQGQPEEAIGLFKTMLMSNQRPNYVTILSLVKAFDTLDWEALRMMVHGLVIKMGFE 769

Query: 106  SDVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAMISACVKNGQYNEAFDLFREMQ 165
            S+ +V TAL+G YS+  +G+ W LF QIP KDVVL SAM+SACVKNG Y EA +LFR MQ
Sbjct: 770  SEPSVLTALIGSYSVYGMGVCWSLFNQIPNKDVVLRSAMVSACVKNGDYVEALELFRRMQ 829

Query: 166  SRGVQPNQVSIVSILPACADFGVLSLGKELHAFSMRRDFYSMVNIQNSLMDMYSKCRNFE 225
              G++ N VSIVSILPACA+ G L LG+E+H F +RR    +  +QNSL+DMY+KCR+ +
Sbjct: 830  VLGLKANHVSIVSILPACANLGALQLGREIHGFIIRRMICYVNTVQNSLVDMYAKCRSLQ 889

Query: 226  ASVRVLKLTRKKDMVSWRIITHACIQNNCPSKVFKFFSRMRSFG-FELGETMMLDIIAAV 285
             ++ V     KKD+VSWR +    ++N C  K    FS+M+    F L E ++ D+I AV
Sbjct: 890  TAICVFNGMLKKDLVSWRTLIRGYVENECGIKALDAFSKMQRLSFFALDEFVVRDMIMAV 949

Query: 286  LLVDELLLGLAVHCYALKGGFLCFISVGTELLQMYAKFGDLGLAKLVFDELVDKDIIAWS 345
            L   E  +G A HCY LK GFL F+S+ T LLQMYAKF  +  A+ VFD + +KD+IAW+
Sbjct: 950  LQSGESKIGSAFHCYILKTGFLAFVSIATALLQMYAKFSMVASARNVFDHISNKDVIAWN 1009

Query: 346  AMISAYSHGEDPLNAIQTFKTMQSTNEKPNEITFVSLMNACSSLGAQE----LGESIQAH 405
            AMISAY+    P NAI TF+ M   NEKP+E + VSL+  CS + +QE    +GE+I A 
Sbjct: 1010 AMISAYAQTGLPFNAINTFRQMLLMNEKPSEFSLVSLLQICSLMASQEVSDKVGETIHAF 1069

Query: 406  ITKWGYSSNTHLMSALVDFYCTLGRIKLGKHVFDEISTKDVICWSAMIKGYGMNACGNEA 465
            + K GYS N +L SAL+DFYC  GR+K GK +FDE+ TKD+ICWS+MI GY +N  G EA
Sbjct: 1070 VAKVGYSRNVYLSSALIDFYCRFGRVKQGKALFDEVPTKDLICWSSMINGYVLNGYGIEA 1129

Query: 466  LNTFSDMLSYGLKPNGVVFISLLSACANCGLEKEGWIWFHSMIDKYGITPTVAHYACIVD 525
            L TF++ML  G+KPN ++F+S+LSAC++CGL+ EGW WF+SM +KYGITP +AHYAC+VD
Sbjct: 1130 LETFANMLDCGIKPNDIIFLSVLSACSHCGLKNEGWNWFYSMKEKYGITPKLAHYACMVD 1189

Query: 526  LLVRQGKIREAVEFVKK 538
            LL RQG I +A+ FVKK
Sbjct: 1190 LLSRQGHIEQALHFVKK 1206
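
The percentages in each HSP header follow from the reported counts (for example, 153 identical residues over an alignment length of 469 gives 32.62%). The following is a minimal Python sketch that recomputes them from the two header lines. It assumes the header layout shown on this page; `parse_hsp` and the regular expressions are hypothetical helpers for illustration and are not part of BLAST or of this database.

```python
import re

# Hypothetical patterns matching the HSP header lines as printed on this page.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>[\d.eE+-]+)"
)
# "Posi?tives" accepts both the correct spelling and the "Postives" variant.
STATS_RE = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Posi?tives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp(score_line, stats_line):
    """Parse one HSP header and recompute its percentages from the raw counts."""
    score = SCORE_RE.search(score_line)
    stats = STATS_RE.search(stats_line)
    if score is None or stats is None:
        raise ValueError("unrecognized HSP header")
    length = int(stats["length"])
    return {
        "bits": float(score["bits"]),
        "raw_score": int(score["raw"]),
        "evalue": float(score["evalue"]),
        "identity_pct": round(100 * int(stats["ident"]) / length, 2),
        "positives_pct": round(100 * int(stats["pos"]) / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 274.2 bits (700), Expect = 1.7e-73",
    "Identity = 153/469 (32.62%), Positives = 256/469 (54.58%), Query Frame = 1",
)
print(hsp)
```

Recomputing the percentages from the counts, rather than reading the printed values, gives a simple consistency check when the headers are collected from many records.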

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
PP341_ARATH   5.9e-73   31.44     Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN...
PP181_ARATH   7.8e-73   31.98     Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN...
PP398_ARATH   2.9e-72   32.62     Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana GN...
PP320_ARATH   2.5e-71   31.43     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PPR32_ARATH   1.2e-70   31.67     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name         E-value    Identity  Description
A0A0A0L489_CUCSA   1.5e-248   84.68     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G122420 PE=4 SV=1
A5C4V9_VITVI       3.3e-171   59.36     Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_040403 PE=4 SV=1
A0A061DV61_THECC   3.8e-151   53.92     Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao GN=TCM...
A0A0D2RE34_GOSRA   3.5e-149   52.78     Uncharacterized protein OS=Gossypium raimondii GN=B456_008G121600 PE=4 SV=1
M0ZG59_SOLTU       6.7e-148   51.58     Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400000039 PE=4 SV=1

Match Name    E-value   Identity  Description
AT4G30700.1   3.3e-74   31.44     Pentatricopeptide repeat (PPR) superfamily protein
AT2G33680.1   4.4e-74   31.98     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27110.1   1.7e-73   32.62     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1   1.4e-72   31.43     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1   7.0e-72   31.67     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                           E-value    Identity  Description
gi|700201385|gb|KGN56518.1|          2.1e-248   84.68     hypothetical protein Csa_3G122420 [Cucumis sativus]
gi|659077727|ref|XP_008439351.1|     1.4e-244   84.22     PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
gi|147834193|emb|CAN75306.1|         4.7e-171   59.36     hypothetical protein VITISV_040403 [Vitis vinifera]
gi|731421553|ref|XP_010661790.1|     4.7e-171   59.36     PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Vitis...
gi|590722123|ref|XP_007051809.1|     5.4e-151   53.92     Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function
molecular_function   GO:0005515       protein binding
molecular_function   GO:0005488       binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Lsi05G020360.1   Lsi05G020360.1   mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term    IPR Description            Source     Source Term   Source Description   Alignment
IPR002885   Pentatricopeptide repeat   PFAM       PF01535       PPR                  coord: 240..270, score: 0
IPR002885   Pentatricopeptide repeat   PFAM       PF13041       PPR_2                coord: 136..184, score: 3.0E-14
                                                                                     coord: 338..386, score: 2.2E-8
                                                                                     coord: 439..487, score: 2.
IPR002885   Pentatricopeptide repeat   TIGRFAMs   TIGR00756     TIGR00756            coord: 342..375, score: 0.0022
                                                                                     coord: 240..272, score: 0.0027
                                                                                     coord: 139..172, score: 2.1E-11
                                                                                     coord: 442..475, score: 1.
IPR002885   Pentatricopeptide repeat   PROFILE    PS51375       PPR                  coord: 511..539, score: 6.939
                                                                                     coord: 238..272, score: 8.966
                                                                                     coord: 409..439, score: 6.818
                                                                                     coord: 440..474, score: 10.282
                                                                                     coord: 37..71, score: 6.741
                                                                                     coord: 339..373, score: 9.405
                                                                                     coord: 207..237, score: 5.744
                                                                                     coord: 475..510, score: 7.509
                                                                                     coord: 308..338, score: 5.196
                                                                                     coord: 374..408, score: 6.763
                                                                                     coord: 137..171, score: 14
None        No IPR available           PANTHER    PTHR24015     FAMILY NOT NAMED     coord: 309..537, score: 3.2E-188
                                                                                     coord: 53..273, score: 3.2E
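
The `coord:` entries above give the start and end positions of each predicted repeat on the protein. As a rough illustration, the Python sketch below collects the PS51375 (PROSITE profile) ranges from the table, sorts them along the sequence, and totals the residues they span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `motif_ranges` is a hypothetical helper, not part of InterProScan.

```python
import re

# Matches entries such as "coord: 240..270" regardless of surrounding layout.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def motif_ranges(text):
    """Return the (start, end) ranges found in `text`, sorted by start position."""
    return sorted((int(start), int(end)) for start, end in COORD_RE.findall(text))


# PS51375 coordinates copied from the InterPro table above.
ps51375 = (
    "coord: 511..539 coord: 238..272 coord: 409..439 coord: 440..474 "
    "coord: 37..71 coord: 339..373 coord: 207..237 coord: 475..510 "
    "coord: 308..338 coord: 374..408 coord: 137..171"
)

ranges = motif_ranges(ps51375)
# Assumption: ranges are inclusive, so each spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in ranges)
print(len(ranges), "PPR profile matches spanning", covered, "residues")
for start, end in ranges:
    print(f"{start}..{end}")
```

Sorting the ranges makes the tandem arrangement of the repeats easier to see than the order in which the table lists them.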

The following gene(s) are paralogous to this gene:

None