Lsi04G004220 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G004220
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Pentatricopeptide repeat-containing protein
Location: chr04 : 4049941 .. 4052310 (-)
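The location field can be unpacked into chromosome, coordinates, and strand. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr04 : 4049941 .. 4052310 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("chr04 : 4049941 .. 4052310 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2370 bp on the minus strand of chr04.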
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACCATGCCCACCATCTGTTCGATCAAATTCCTGACAAGGATATTGTCCTCTTTAACATAATGACACGTGGGTATGCTCGCTCTAATTCTCCCTATCTTGCCTTTTCTCTTTTTGCTCAAGTCCTTTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCATGTGCGAGTTCTAAGGCCTTGAAAGAAGGTATGGAGTTGCATTGTTTTGCTATTAAACTTGGACTGAATCATAATATTTACATATGCCCAACTCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCAGCGCGAGGAGTGTTTGATGAAATGGAGCAGCCATGCATTGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCGAGCAATCTTGAGCCTACTGATGTTACTATGCTTAGTGTAATTATGTCATGTGCTCTGTTGGGAGCACTAGACCTAGGAAGGTGGATTCATGAATATGTTAAGAAGAAAGGGTTTGATAAATATGTGAAGGTGAACACTGCACTTATAGATATGTTTGCAAAATGTGGAAGTCTAGCTGATGCTATTTCTATCTTTGAGGGCATGCGTGTGAGAGATACACAAGCTTGGTCTGCAATGATGGTTGCATTTGCAACTCATGGGAATGGCTTGAAAGCTATCTCAATGTTTGAAGAGATGAAGAGGGCAGGAGTTAGACCTGATGAGATCACTTTTTTGGGGCTTTTGTATGCTTGTAGTCATGCTGGGCTAGTAGAGCAAGGTAGAGGGTATTTCTATAGTATGTCTAAAAACTATGGAATAACTCCAGGGATCAAGCATTATGGGTGTATGGTGGATTTGCTTGGTCGAACAGGTCATTTAGATGAGGCTTATAACTTCATAGATGAACTGGAAATTAAGCCCACGCCTATACTCTGGCGCACCCTGTTGTCTGCTTGCAGTACCCATGGTAATGTCGACATGGCAAAGCGGGTTATTGAAAGAATTTTTGAATTAGATGACTCCCATGGAGGGGACTATGTTATATTATCAAACTTGTGTGCTAGAGTAGGAAGATGGGAAGATGTGAATCATTTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGTTCTGTGGAGGTAAACAATGTAGTACATGAGTTCTTCTCTGGAGATGGAGTTCACTGCATTTCGGTAGAGTTGCGGCGAGCACTTGACGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTTCCTGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCTTGAGATATCACAGTGAAAAATTGGCCATGGCTTTTGGGCTCCTAAATACACCTCCTGGTACAACGATAAGGGTAGTTAAGAACCTCCGTATTTGTGGAGATTGTCATAATGCTGCAAAACTTATATCTTTAATATTTGGGAGGCAAATTGTCATTAGGGACGTTCAACGATTCCATCGATTTGAAGATGGGAAATGCTCCTGTGGTGATTTCTGGTAATGGACTATGTAGAAATCACAGGCTATTCTTACAGTTTGATATTTCTTTCCTGCAATGGCCATGATTTCCTATTCATTTAGTGTATCTTCATGGGGAACCATTATGTTTGTATAGCAACCAAGGAATTCAGCCATATATTTATCCGCGTAGCATTGATCATTGCATGTATTGGATCAAAATAAAGTATATAACCATCGGCTGAACATATTTCTACACCATCATTACACCGTATGTCAACACGAGCTTAGCTGAACGGTAATTGGTATGTACTCCTTTCTTTGAAGTCGGATAATAAAATCTTCATACCCCGCATTAACGGTAGGATGAACATATTTCTAGATCATCAACTGACTTACTTTTAAATAACATACCATCAACTGACTTGTTTTCTATACTTCCTTTGACAGTCAACCATGGATTTT
GGTTGCTATTTATGGCCAAAGTCATTATGCTCCTTCTAATTACAATCAAAGGACAGAAGTTGAATGATGAACTTTACCATCTATTACCCTTCCTTATTTACTGCAGTAGCCAATTCTAGATACAGTACAGAAGATCAAACATTGGAAACTCATAAAGTTTGTTCCAAATTTCCGATTATGACTGTTTTTAATTTAATGCTTTACCTATATTTAGCGTGGAAAGATTTCTTGAGCATTATCACTCTTTCTTTCTATGTACTAATTTTTTTAATTGTTTCTTTAATAAATTATTTATCATAAATTTTGTATCCTCAGTTACGTCCTCGTGAAGTCGTGGAAAATTTATGATTTATTTGCTGCACTTAATTAA

mRNA sequence

ATGGACCATGCCCACCATCTGTTCGATCAAATTCCTGACAAGGATATTGTCCTCTTTAACATAATGACACGTGGGTATGCTCGCTCTAATTCTCCCTATCTTGCCTTTTCTCTTTTTGCTCAAGTCCTTTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCATGTGCGAGTTCTAAGGCCTTGAAAGAAGGTATGGAGTTGCATTGTTTTGCTATTAAACTTGGACTGAATCATAATATTTACATATGCCCAACTCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCAGCGCGAGGAGTGTTTGATGAAATGGAGCAGCCATGCATTGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCGAGCAATCTTGAGCCTACTGATGTTACTATGCTTAGTGTAATTATGTCATGTGCTCTGTTGGGAGCACTAGACCTAGGAAGGTGGATTCATGAATATGTTAAGAAGAAAGGGTTTGATAAATATGTGAAGGTGAACACTGCACTTATAGATATGTTTGCAAAATGTGGAAGTCTAGCTGATGCTATTTCTATCTTTGAGGGCATGCGTGTGAGAGATACACAAGCTTGGTCTGCAATGATGGTTGCATTTGCAACTCATGGGAATGGCTTGAAAGCTATCTCAATGTTTGAAGAGATGAAGAGGGCAGGAGTTAGACCTGATGAGATCACTTTTTTGGGGCTTTTGTATGCTTGTAGTCATGCTGGGCTAGTAGAGCAAGGTAGAGGGTATTTCTATAGTATGTCTAAAAACTATGGAATAACTCCAGGGATCAAGCATTATGGGTGTATGGTGGATTTGCTTGGTCGAACAGGTCATTTAGATGAGGCTTATAACTTCATAGATGAACTGGAAATTAAGCCCACGCCTATACTCTGGCGCACCCTGTTGTCTGCTTGCAGTACCCATGGTAATGTCGACATGGCAAAGCGGGTTATTGAAAGAATTTTTGAATTAGATGACTCCCATGGAGGGGACTATGTTATATTATCAAACTTGTGTGCTAGAGTAGGAAGATGGGAAGATGTGAATCATTTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGTTCTGTGGAGGTAAACAATGTAGTACATGAGTTCTTCTCTGGAGATGGAGTTCACTGCATTTCGGTAGAGTTGCGGCGAGCACTTGACGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTTCCTGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCTTGAGATATCACAGTGAAAAATTGGCCATGGCTTTTGGGCTCCTAAATACACCTCCTGGTACAACGATAAGGGTAGTTAAGAACCTCCGTATTTGTGGAGATTGTCATAATGCTGCAAAACTTATATCTTTAATATTTGGGAGGCAAATTGTCATTAGGGACGTTCAACGATTCCATCGATTTGAAGATGGGAAATGCTCCTGTGGTGATTTCTGTTACGTCCTCGTGAAGTCGTGGAAAATTTATGATTTATTTGCTGCACTTAATTAA

Coding sequence (CDS)

ATGGACCATGCCCACCATCTGTTCGATCAAATTCCTGACAAGGATATTGTCCTCTTTAACATAATGACACGTGGGTATGCTCGCTCTAATTCTCCCTATCTTGCCTTTTCTCTTTTTGCTCAAGTCCTTTGCTCTGGCCTTCTTCCTGATGACTACACATTCTCGTCCCTTCTCAAGGCATGTGCGAGTTCTAAGGCCTTGAAAGAAGGTATGGAGTTGCATTGTTTTGCTATTAAACTTGGACTGAATCATAATATTTACATATGCCCAACTCTCATAAATATGTATGCGGAGTGCAATGACATGAATGCAGCGCGAGGAGTGTTTGATGAAATGGAGCAGCCATGCATTGTTAGCTATAATGCAATTATCACGGGTTATGCTCGAAGTAGTCAGCCCAATGAGGCTTTGTCGTTGTTTAGAGAATTGCAAGCGAGCAATCTTGAGCCTACTGATGTTACTATGCTTAGTGTAATTATGTCATGTGCTCTGTTGGGAGCACTAGACCTAGGAAGGTGGATTCATGAATATGTTAAGAAGAAAGGGTTTGATAAATATGTGAAGGTGAACACTGCACTTATAGATATGTTTGCAAAATGTGGAAGTCTAGCTGATGCTATTTCTATCTTTGAGGGCATGCGTGTGAGAGATACACAAGCTTGGTCTGCAATGATGGTTGCATTTGCAACTCATGGGAATGGCTTGAAAGCTATCTCAATGTTTGAAGAGATGAAGAGGGCAGGAGTTAGACCTGATGAGATCACTTTTTTGGGGCTTTTGTATGCTTGTAGTCATGCTGGGCTAGTAGAGCAAGGTAGAGGGTATTTCTATAGTATGTCTAAAAACTATGGAATAACTCCAGGGATCAAGCATTATGGGTGTATGGTGGATTTGCTTGGTCGAACAGGTCATTTAGATGAGGCTTATAACTTCATAGATGAACTGGAAATTAAGCCCACGCCTATACTCTGGCGCACCCTGTTGTCTGCTTGCAGTACCCATGGTAATGTCGACATGGCAAAGCGGGTTATTGAAAGAATTTTTGAATTAGATGACTCCCATGGAGGGGACTATGTTATATTATCAAACTTGTGTGCTAGAGTAGGAAGATGGGAAGATGTGAATCATTTGAGGAAATTGATGAAAGATAGAGGGGTGGTGAAGGTTCCTGGGTGTAGTTCTGTGGAGGTAAACAATGTAGTACATGAGTTCTTCTCTGGAGATGGAGTTCACTGCATTTCGGTAGAGTTGCGGCGAGCACTTGACGAGTTAATCAAAGAAATAAAGTTGGTGGGATATGTTCCTGATACTTCTTTAGTATATCATGCTGATATGGAAGAGGAAGGGAAAGAACTTGTCTTGAGATATCACAGTGAAAAATTGGCCATGGCTTTTGGGCTCCTAAATACACCTCCTGGTACAACGATAAGGGTAGTTAAGAACCTCCGTATTTGTGGAGATTGTCATAATGCTGCAAAACTTATATCTTTAATATTTGGGAGGCAAATTGTCATTAGGGACGTTCAACGATTCCATCGATTTGAAGATGGGAAATGCTCCTGTGGTGATTTCTGTTACGTCCTCGTGAAGTCGTGGAAAATTTATGATTTATTTGCTGCACTTAATTAA

Protein sequence

MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKACASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISMFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLGRTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDFCYVLVKSWKIYDLFAALN
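The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (the page does not say which table was used), the first and last codons of the CDS can be translated and compared with the two ends of the protein. The helper name `translate` is hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 12 nucleotides of the CDS shown above.
print(translate("ATGGACCATGCCCACCATCTGTTCGATCAA"))  # start of the protein
print(translate("GCACTTAATTAA"))                    # end of the protein plus stop
```

The first call yields the leading residues of the protein (`MDHAHHLFDQ`), and the second yields its final three residues followed by the stop codon (`ALN*`).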
BLAST of Lsi04G004220 vs. Swiss-Prot
Match: PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 777.7 bits (2007), Expect = 8.3e-224
Identity = 359/524 (68.51%), Positives = 436/524 (83.21%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           M +A HLF+ + + DIV+FN M RGY+R  +P   FSLF ++L  G+LPD+YTF SLLKA
Sbjct: 79  MSYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKA 138

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA +KAL+EG +LHC ++KLGL+ N+Y+CPTLINMY EC D+++AR VFD + +PC+V Y
Sbjct: 139 CAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCY 198

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+ITGYAR ++PNEALSLFRE+Q   L+P ++T+LSV+ SCALLG+LDLG+WIH+Y KK
Sbjct: 199 NAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKK 258

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
             F KYVKVNTALIDMFAKCGSL DA+SIFE MR +DTQAWSAM+VA+A HG   K++ M
Sbjct: 259 HSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLM 318

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FE M+   V+PDEITFLGLL ACSH G VE+GR YF  M   +GI P IKHYG MVDLL 
Sbjct: 319 FERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLS 378

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G+L++AY FID+L I PTP+LWR LL+ACS+H N+D+A++V ERIFELDDSHGGDYVI
Sbjct: 379 RAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVI 438

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNL AR  +WE V+ LRK+MKDR  VKVPGCSS+EVNNVVHEFFSGDGV   + +L RA
Sbjct: 439 LSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRA 498

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDE++KE+KL GYVPDTS+V HA+M ++ KE+ LRYHSEKLA+ FGLLNTPPGTTIRVVK
Sbjct: 499 LDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVK 558

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+C DCHNAAKLISLIFGR++V+RDVQRFH FEDGKCSCGDF
Sbjct: 559 NLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDF 602
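The percentages in an HSP header are the match counts divided by the alignment length. The sketch below recomputes them from the fractions in the header of the alignment above. The function name `hsp_percentages` is hypothetical, and rounding to two decimals is an assumption chosen to match how the page displays the values.

```python
import re

def hsp_percentages(header):
    """Recompute a percentage for every 'matches/length' fraction in a header line."""
    fractions = re.findall(r"(\d+)/(\d+)", header)
    return [round(100 * int(num) / int(den), 2) for num, den in fractions]

header = "Identity = 359/524 (68.51%), Positives = 436/524 (83.21%), Query Frame = 1"
identity, positives = hsp_percentages(header)
print(identity, positives)
```

Identity counts exact residue matches. Positives also counts conservative substitutions, shown as `+` in the alignment midline, so it is never smaller than identity.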

BLAST of Lsi04G004220 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 493.0 bits (1268), Expect = 4.1e-138
Identity = 227/526 (43.16%), Positives = 348/526 (66.16%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           +++A  LFD+IP KD+V +N M  GYA + +   A  LF  ++ + + PD+ T  +++ A
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA S +++ G ++H +    G   N+ I   LI++Y++C ++  A G+F+ +    ++S+
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           N +I GY   +   EAL LF+E+  S   P DVTMLS++ +CA LGA+D+GRWIH Y+ K
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 395

Query: 181 --KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAI 240
             KG      + T+LIDM+AKCG +  A  +F  +  +   +W+AM+  FA HG    + 
Sbjct: 396 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASF 455

Query: 241 SMFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDL 300
            +F  M++ G++PD+ITF+GLL ACSH+G+++ GR  F +M+++Y +TP ++HYGCM+DL
Sbjct: 456 DLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDL 515

Query: 301 LGRTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDY 360
           LG +G   EA   I+ +E++P  ++W +LL AC  HGNV++ +   E + +++  + G Y
Sbjct: 516 LGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSY 575

Query: 361 VILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELR 420
           V+LSN+ A  GRW +V   R L+ D+G+ KVPGCSS+E+++VVHEF  GD  H  + E+ 
Sbjct: 576 VLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIY 635

Query: 421 RALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRV 480
             L+E+   ++  G+VPDTS V   +MEEE KE  LR+HSEKLA+AFGL++T PGT + +
Sbjct: 636 GMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTI 695

Query: 481 VKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           VKNLR+C +CH A KLIS I+ R+I+ RD  RFH F DG CSC D+
Sbjct: 696 VKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of Lsi04G004220 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 5.7e-132
Identity = 226/524 (43.13%), Positives = 340/524 (64.89%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           ++ A  LFD + ++++V +N M   Y ++ +P  A  +F ++L  G+ P D +    L A
Sbjct: 287 LETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHA 346

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA    L+ G  +H  +++LGL+ N+ +  +LI+MY +C +++ A  +F +++   +VS+
Sbjct: 347 CADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSW 406

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+I G+A++ +P +AL+ F ++++  ++P   T +SVI + A L      +WIH  V +
Sbjct: 407 NAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMR 466

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
              DK V V TAL+DM+AKCG++  A  IF+ M  R    W+AM+  + THG G  A+ +
Sbjct: 467 SCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALEL 526

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEM++  ++P+ +TFL ++ ACSH+GLVE G   FY M +NY I   + HYG MVDLLG
Sbjct: 527 FEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLG 586

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EA++FI ++ +KP   ++  +L AC  H NV+ A++  ER+FEL+   GG +V+
Sbjct: 587 RAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVL 646

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           L+N+      WE V  +R  M  +G+ K PGCS VE+ N VH FFSG   H  S ++   
Sbjct: 647 LANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAF 706

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           L++LI  IK  GYVPDT+LV    +E + KE +L  HSEKLA++FGLLNT  GTTI V K
Sbjct: 707 LEKLICHIKEAGYVPDTNLV--LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRK 766

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+C DCHNA K ISL+ GR+IV+RD+QRFH F++G CSCGD+
Sbjct: 767 NLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDY 808

BLAST of Lsi04G004220 vs. Swiss-Prot
Match: PP252_ARATH (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 1.4e-130
Identity = 221/525 (42.10%), Positives = 335/525 (63.81%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           ++ A  +F+++P +D V +  +  GY++ + P  A   F Q+L  G  P+++T SS++KA
Sbjct: 111 LEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKA 170

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
            A+ +    G +LH F +K G + N+++   L+++Y     M+ A+ VFD +E    VS+
Sbjct: 171 AAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSW 230

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+I G+AR S   +AL LF+ +      P+  +  S+  +C+  G L+ G+W+H Y+ K
Sbjct: 231 NALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIK 290

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
            G          L+DM+AK GS+ DA  IF+ +  RD  +W++++ A+A HG G +A+  
Sbjct: 291 SGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWW 350

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEM+R G+RP+EI+FL +L ACSH+GL+++G  ++Y + K  GI P   HY  +VDLLG
Sbjct: 351 FEEMRRVGIRPNEISFLSVLTACSHSGLLDEG-WHYYELMKKDGIVPEAWHYVTVVDLLG 410

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+ A  FI+E+ I+PT  +W+ LL+AC  H N ++     E +FELD    G +VI
Sbjct: 411 RAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVI 470

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           L N+ A  GRW D   +RK MK+ GV K P CS VE+ N +H F + D  H    E+ R 
Sbjct: 471 LYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARK 530

Query: 421 LDELIKEIKLVGYVPDTS-LVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVV 480
            +E++ +IK +GYVPDTS ++ H D +E  +E+ L+YHSEK+A+AF LLNTPPG+TI + 
Sbjct: 531 WEEVLAKIKELGYVPDTSHVIVHVDQQE--REVNLQYHSEKIALAFALLNTPPGSTIHIK 590

Query: 481 KNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           KN+R+CGDCH A KL S + GR+I++RD  RFH F+DG CSC D+
Sbjct: 591 KNIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDGNCSCKDY 632

BLAST of Lsi04G004220 vs. Swiss-Prot
Match: PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 1.2e-129
Identity = 220/524 (41.98%), Positives = 345/524 (65.84%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           +D    +F+ +P KD+V +N +  GYA+S     A  +  ++  + L PD +T SS+L  
Sbjct: 192 IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPI 251

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
            +    + +G E+H + I+ G++ ++YI  +L++MYA+   +  +  VF  +     +S+
Sbjct: 252 FSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISW 311

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           N+++ GY ++ + NEAL LFR++  + ++P  V   SVI +CA L  L LG+ +H YV +
Sbjct: 312 NSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLR 371

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
            GF   + + +AL+DM++KCG++  A  IF+ M V D  +W+A+++  A HG+G +A+S+
Sbjct: 372 GGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSL 431

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMKR GV+P+++ F+ +L ACSH GLV++  GYF SM+K YG+   ++HY  + DLLG
Sbjct: 432 FEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLG 491

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EAYNFI ++ ++PT  +W TLLS+CS H N+++A++V E+IF +D  + G YV+
Sbjct: 492 RAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVL 551

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           + N+ A  GRW+++  LR  M+ +G+ K P CS +E+ N  H F SGD  H    ++   
Sbjct: 552 MCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEF 611

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           L  ++++++  GYV DTS V H D++EE K  +L  HSE+LA+AFG++NT PGTTIRV K
Sbjct: 612 LKAVMEQMEKEGYVADTSGVLH-DVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTK 671

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           N+RIC DCH A K IS I  R+I++RD  RFH F  G CSCGD+
Sbjct: 672 NIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDY 714

BLAST of Lsi04G004220 vs. TrEMBL
Match: A0A0A0KU15_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G605160 PE=4 SV=1)

HSP 1 Score: 1037.3 bits (2681), Expect = 6.4e-300
Identity = 496/524 (94.66%), Positives = 512/524 (97.71%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MDHAHHLFDQI DKDI+LFNIM RGYARSNSPYLAFSLF ++LCSGLLPDDYTFSSLLKA
Sbjct: 80  MDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKA 139

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CASSKAL+EGM LHCFA+KLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY
Sbjct: 140 CASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 199

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NAIITGYARSSQPNEALSLFRELQASN+EPTDVTMLSVIMSCALLGALDLG+WIHEYVKK
Sbjct: 200 NAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKK 259

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
           KGFDKYVKVNTALIDMFAKCGSL DAISIFEGMRVRDTQAWSAM+VAFATHG+GLKAISM
Sbjct: 260 KGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISM 319

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMKR GVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK YGITPGIKHYGCMVDLLG
Sbjct: 320 FEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLG 379

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R GHLDEAYNF+D+LEIK TPILWRTLLSACSTHGNV+MAKRVIERIFELDD+HGGDYVI
Sbjct: 380 RAGHLDEAYNFVDKLEIKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHGGDYVI 439

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNL ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHC+SVELRRA
Sbjct: 440 LSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRA 499

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDEL+KEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRV K
Sbjct: 500 LDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAK 559

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLRICGDCHNAAKLIS IFGR+IVIRDVQRFHRFEDGKCSCGDF
Sbjct: 560 NLRICGDCHNAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGDF 603

BLAST of Lsi04G004220 vs. TrEMBL
Match: D7TN78_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g02510 PE=4 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 8.0e-250
Identity = 398/524 (75.95%), Positives = 466/524 (88.93%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           M HAHHLFDQIP  DIVLFN M RGYAR+++P  AF+LF Q+L SGL PDDYTF SLLKA
Sbjct: 71  MQHAHHLFDQIPQPDIVLFNTMARGYARTDTPLRAFTLFTQILFSGLFPDDYTFPSLLKA 130

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CAS KAL+EG +LHC AIKLGL+ N+Y+CPTLINMY  CN+M+ AR VFD++ +PC+V+Y
Sbjct: 131 CASCKALEEGRQLHCLAIKLGLSENVYVCPTLINMYTACNEMDCARRVFDKIWEPCVVTY 190

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+ITGYAR S+PNEALSLFRELQA NL+PTDVTMLSV+ SCALLGALDLG+W+HEYVKK
Sbjct: 191 NAMITGYARGSRPNEALSLFRELQARNLKPTDVTMLSVLSSCALLGALDLGKWMHEYVKK 250

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
            GF+++VKV+TALIDM+AKCGSL DA+ +FE M VRDTQAWSAM++A+A HG+GLKA+S+
Sbjct: 251 NGFNRFVKVDTALIDMYAKCGSLDDAVCVFENMAVRDTQAWSAMIMAYAIHGHGLKAVSL 310

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           F+EM++AG  PDEITFLGLLYACSH GLVE+G  YFY M   YG+ PGIKHYGCMVDLLG
Sbjct: 311 FKEMRKAGTEPDEITFLGLLYACSHTGLVEEGFEYFYGMRDKYGVIPGIKHYGCMVDLLG 370

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EAY FI  L I+PTPILWRTLLSAC +HGNV++ KRVIE+IFELDDSHGGDY+I
Sbjct: 371 RAGRLEEAYEFIVGLPIRPTPILWRTLLSACGSHGNVELGKRVIEQIFELDDSHGGDYII 430

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNLCAR GRWEDVN++RKLM +RGVVK+PGCSSVEVNNVVHEFFSGDGVH +S +L +A
Sbjct: 431 LSNLCARAGRWEDVNYVRKLMNERGVVKIPGCSSVEVNNVVHEFFSGDGVHSVSTKLHQA 490

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDEL+KE+KLVGYVP+TSLV+HADME+E KE+ LRYHSEKLA+ FGLLNTPPGTTIRVVK
Sbjct: 491 LDELVKELKLVGYVPNTSLVFHADMEDEEKEVTLRYHSEKLAITFGLLNTPPGTTIRVVK 550

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+CGDCH+AAKLISLIF RQI++RDVQRFH F+DGKCSC D+
Sbjct: 551 NLRVCGDCHSAAKLISLIFDRQIILRDVQRFHHFKDGKCSCEDY 594

BLAST of Lsi04G004220 vs. TrEMBL
Match: M5XQC4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016366mg PE=4 SV=1)

HSP 1 Score: 865.1 bits (2234), Expect = 4.4e-248
Identity = 397/524 (75.76%), Positives = 466/524 (88.93%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MD+AHHLFDQIP  DIV+FN M RGYARS++P+ A SLFA +L S L PDDYTF+SLLKA
Sbjct: 69  MDYAHHLFDQIPHPDIVVFNTMARGYARSHAPFRAISLFAHILSSDLFPDDYTFASLLKA 128

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CASSKAL+EG +LHCFAIK GL+ NIY+CPTLINMY ECND++AAR VFD++  PC+V +
Sbjct: 129 CASSKALEEGRQLHCFAIKCGLHLNIYVCPTLINMYTECNDVDAARRVFDKIPDPCVVVH 188

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+I GYARSS+PNEAL+LFRELQASNL+PTDVTMLS + SCALLGALDLG+WIHEYVKK
Sbjct: 189 NAMIKGYARSSRPNEALALFRELQASNLKPTDVTMLSALSSCALLGALDLGKWIHEYVKK 248

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
             FD+YVKVNTALIDM+AKCGSL DA+S+FE M V+DTQAWSAM+VA+ATHGNG KA+SM
Sbjct: 249 NRFDRYVKVNTALIDMYAKCGSLEDAVSVFEDMSVKDTQAWSAMIVAYATHGNGSKALSM 308

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMK+A +RPDEITFLGLLYACSHAG VE+G  YFYSMS+ YGI PGIKHYGCMVDLLG
Sbjct: 309 FEEMKKARIRPDEITFLGLLYACSHAGFVEEGCKYFYSMSERYGIVPGIKHYGCMVDLLG 368

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R+G L EAY FIDEL I PTPI WRTLLSAC +HG+VDM  RV+E+IF LDDSHGGDYVI
Sbjct: 369 RSGRLGEAYKFIDELPITPTPIFWRTLLSACGSHGDVDMGMRVLEQIFALDDSHGGDYVI 428

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           +SNLCAR GRWEDV+ LRKLM+DRG+VK+PGCSS+EVNNVVHEFFSGDG   +S  L +A
Sbjct: 429 ISNLCARAGRWEDVDRLRKLMRDRGIVKIPGCSSIEVNNVVHEFFSGDGERSVSTVLHQA 488

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           +D+L++E+KL GYVPDTSLV+H++ME++ +E+ LRYHSEKLA+A+GLLNTPPG TIRVVK
Sbjct: 489 VDKLVEELKLAGYVPDTSLVFHSNMEDKDREVSLRYHSEKLAIAYGLLNTPPGATIRVVK 548

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+CGDCH+AAK ISLIF RQI++RDVQRFH F++GKCSCGD+
Sbjct: 549 NLRVCGDCHSAAKYISLIFNRQIILRDVQRFHHFKEGKCSCGDY 592

BLAST of Lsi04G004220 vs. TrEMBL
Match: V4TQ07_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031011mg PE=4 SV=1)

HSP 1 Score: 852.8 bits (2202), Expect = 2.2e-244
Identity = 398/526 (75.67%), Positives = 459/526 (87.26%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           M+HAH LFD+IP+ DIVLFN M RGY+RS +P  A  LF ++L SGLLPDDY+F SLLKA
Sbjct: 72  MEHAHLLFDRIPEPDIVLFNTMARGYSRSKTPIRAIFLFVELLNSGLLPDDYSFPSLLKA 131

Query: 61  CA--SSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIV 120
           CA   ++AL+EG +LHCFAIKLGLN N+Y+C TLIN+YAEC+D+ AAR +F+ + +PC+V
Sbjct: 132 CACVGAEALEEGKQLHCFAIKLGLNSNLYVCTTLINLYAECSDVEAARRIFENISEPCVV 191

Query: 121 SYNAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYV 180
           SYNAIIT YARSS+PNEALSLFRELQ  NL+PTDVTMLS + SCALLG+LDLG+WIHEY+
Sbjct: 192 SYNAIITAYARSSRPNEALSLFRELQERNLKPTDVTMLSALSSCALLGSLDLGKWIHEYI 251

Query: 181 KKKGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAI 240
           KK G DKYVKVNTALIDM AKCG L DA+S+F+ M  +DTQAWSAM+VA+ATHG G K+I
Sbjct: 252 KKYGLDKYVKVNTALIDMHAKCGRLDDAVSVFDNMSGKDTQAWSAMIVAYATHGQGHKSI 311

Query: 241 SMFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDL 300
            MFEEM +A V PDEITFLGLLYACSH GLV++G  YFYSM   YGI PGIKHYGCMVDL
Sbjct: 312 LMFEEMMKAQVSPDEITFLGLLYACSHTGLVDEGWNYFYSMRDKYGIVPGIKHYGCMVDL 371

Query: 301 LGRTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDY 360
           LGR G LDEAY FIDEL IK TPILWRTLLS+CS+H N+ +AK+VIERIFELDDSHGGDY
Sbjct: 372 LGRAGRLDEAYRFIDELPIKSTPILWRTLLSSCSSHNNLGLAKQVIERIFELDDSHGGDY 431

Query: 361 VILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELR 420
           VILSNLCAR GRWEDV++LRKLMKDRGV+KVPGCSS+EVNNVV EFFSGDGVH  S +L+
Sbjct: 432 VILSNLCARAGRWEDVDYLRKLMKDRGVLKVPGCSSIEVNNVVREFFSGDGVHSYSTDLQ 491

Query: 421 RALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRV 480
           +ALDEL+KE+K+VGYVPDTSLV+H DME+E KE+ LRYHSEKLA+ FGLLNTPPGTTIRV
Sbjct: 492 KALDELVKELKMVGYVPDTSLVHHGDMEDEEKEIALRYHSEKLAITFGLLNTPPGTTIRV 551

Query: 481 VKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           VKNLR+CGDCH+AAK+ISLIF RQIV+RDVQRFH F DGKCSCGDF
Sbjct: 552 VKNLRVCGDCHSAAKIISLIFNRQIVLRDVQRFHHFRDGKCSCGDF 597

BLAST of Lsi04G004220 vs. TrEMBL
Match: A0A0B2P166_GLYSO (Pentatricopeptide repeat-containing protein OS=Glycine soja GN=glysoja_008131 PE=4 SV=1)

HSP 1 Score: 852.8 bits (2202), Expect = 2.2e-244
Identity = 396/524 (75.57%), Positives = 458/524 (87.40%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MDHAH +FD+IP  DIVLFN M RGYAR + P  A  L +QVLCSGLLPDDYTFSSLLKA
Sbjct: 1   MDHAHRMFDKIPQPDIVLFNTMARGYARFDDPLRAILLCSQVLCSGLLPDDYTFSSLLKA 60

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA  KAL+EG +LHC A+KLG+  N+Y+CPTLINMY  CND++AAR VFD++ +PC+V+Y
Sbjct: 61  CARLKALEEGKQLHCLAVKLGVGDNMYVCPTLINMYTACNDVDAARRVFDKIGEPCVVAY 120

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NAIIT  AR+S+PNEAL+LFRELQ S L+PTDVTML  + SCALLGALDLGRWIHEYVKK
Sbjct: 121 NAIITSCARNSRPNEALALFRELQESGLKPTDVTMLVALSSCALLGALDLGRWIHEYVKK 180

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
            GFD+YVKVNTALIDM+AKCGSL DA+S+F+ M  RDTQAWSAM+VA+ATHG+G +AISM
Sbjct: 181 NGFDQYVKVNTALIDMYAKCGSLDDAVSVFKDMPRRDTQAWSAMIVAYATHGHGSQAISM 240

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
             EMK+A V+PDEITFLG+LYACSH GLVE+G  YF+SM+  YGI P IKHYGCM+DLLG
Sbjct: 241 LREMKKAKVQPDEITFLGILYACSHTGLVEEGYEYFHSMTHEYGIVPSIKHYGCMIDLLG 300

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EA  FIDEL IKPTPILWRTLLS+CS+HGNV+MAK VI+RIFELDDSHGGDYVI
Sbjct: 301 RAGRLEEACKFIDELPIKPTPILWRTLLSSCSSHGNVEMAKLVIQRIFELDDSHGGDYVI 360

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNLCAR GRW+DVNHLRK+M D+G +KVPGCSS+EVNNVVHEFFSGDGVH  S  L  A
Sbjct: 361 LSNLCARNGRWDDVNHLRKMMVDKGALKVPGCSSIEVNNVVHEFFSGDGVHSTSTILHHA 420

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDEL+KE+KL GYVPDTSLV++AD+E+E KE+VLRYHSEKLA+ +GLLNTPPGTTIRVVK
Sbjct: 421 LDELVKELKLAGYVPDTSLVFYADIEDEEKEIVLRYHSEKLAITYGLLNTPPGTTIRVVK 480

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+C DCHNAAK ISLIFGRQI++RDVQRFH F+DGKCSCGD+
Sbjct: 481 NLRVCVDCHNAAKFISLIFGRQIILRDVQRFHHFKDGKCSCGDY 524

BLAST of Lsi04G004220 vs. TAIR10
Match: AT2G02980.1 (AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 777.7 bits (2007), Expect = 4.6e-225
Identity = 359/524 (68.51%), Positives = 436/524 (83.21%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           M +A HLF+ + + DIV+FN M RGY+R  +P   FSLF ++L  G+LPD+YTF SLLKA
Sbjct: 79  MSYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKA 138

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA +KAL+EG +LHC ++KLGL+ N+Y+CPTLINMY EC D+++AR VFD + +PC+V Y
Sbjct: 139 CAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCY 198

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+ITGYAR ++PNEALSLFRE+Q   L+P ++T+LSV+ SCALLG+LDLG+WIH+Y KK
Sbjct: 199 NAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKK 258

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
             F KYVKVNTALIDMFAKCGSL DA+SIFE MR +DTQAWSAM+VA+A HG   K++ M
Sbjct: 259 HSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLM 318

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FE M+   V+PDEITFLGLL ACSH G VE+GR YF  M   +GI P IKHYG MVDLL 
Sbjct: 319 FERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLS 378

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G+L++AY FID+L I PTP+LWR LL+ACS+H N+D+A++V ERIFELDDSHGGDYVI
Sbjct: 379 RAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVI 438

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNL AR  +WE V+ LRK+MKDR  VKVPGCSS+EVNNVVHEFFSGDGV   + +L RA
Sbjct: 439 LSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRA 498

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDE++KE+KL GYVPDTS+V HA+M ++ KE+ LRYHSEKLA+ FGLLNTPPGTTIRVVK
Sbjct: 499 LDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVK 558

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+C DCHNAAKLISLIFGR++V+RDVQRFH FEDGKCSCGDF
Sbjct: 559 NLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDF 602

BLAST of Lsi04G004220 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 493.0 bits (1268), Expect = 2.3e-139
Identity = 227/526 (43.16%), Positives = 348/526 (66.16%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           +++A  LFD+IP KD+V +N M  GYA + +   A  LF  ++ + + PD+ T  +++ A
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA S +++ G ++H +    G   N+ I   LI++Y++C ++  A G+F+ +    ++S+
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           N +I GY   +   EAL LF+E+  S   P DVTMLS++ +CA LGA+D+GRWIH Y+ K
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 395

Query: 181 --KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAI 240
             KG      + T+LIDM+AKCG +  A  +F  +  +   +W+AM+  FA HG    + 
Sbjct: 396 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASF 455

Query: 241 SMFEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDL 300
            +F  M++ G++PD+ITF+GLL ACSH+G+++ GR  F +M+++Y +TP ++HYGCM+DL
Sbjct: 456 DLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDL 515

Query: 301 LGRTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDY 360
           LG +G   EA   I+ +E++P  ++W +LL AC  HGNV++ +   E + +++  + G Y
Sbjct: 516 LGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSY 575

Query: 361 VILSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELR 420
           V+LSN+ A  GRW +V   R L+ D+G+ KVPGCSS+E+++VVHEF  GD  H  + E+ 
Sbjct: 576 VLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIY 635

Query: 421 RALDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRV 480
             L+E+   ++  G+VPDTS V   +MEEE KE  LR+HSEKLA+AFGL++T PGT + +
Sbjct: 636 GMLEEMEVLLEKAGFVPDTSEVLQ-EMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTI 695

Query: 481 VKNLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           VKNLR+C +CH A KLIS I+ R+I+ RD  RFH F DG CSC D+
Sbjct: 696 VKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of Lsi04G004220 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 472.6 bits (1215), Expect = 3.2e-133
Identity = 226/524 (43.13%), Positives = 340/524 (64.89%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           ++ A  LFD + ++++V +N M   Y ++ +P  A  +F ++L  G+ P D +    L A
Sbjct: 287 LETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHA 346

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CA    L+ G  +H  +++LGL+ N+ +  +LI+MY +C +++ A  +F +++   +VS+
Sbjct: 347 CADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSW 406

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+I G+A++ +P +AL+ F ++++  ++P   T +SVI + A L      +WIH  V +
Sbjct: 407 NAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMR 466

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
              DK V V TAL+DM+AKCG++  A  IF+ M  R    W+AM+  + THG G  A+ +
Sbjct: 467 SCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALEL 526

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEM++  ++P+ +TFL ++ ACSH+GLVE G   FY M +NY I   + HYG MVDLLG
Sbjct: 527 FEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLG 586

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EA++FI ++ +KP   ++  +L AC  H NV+ A++  ER+FEL+   GG +V+
Sbjct: 587 RAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVL 646

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           L+N+      WE V  +R  M  +G+ K PGCS VE+ N VH FFSG   H  S ++   
Sbjct: 647 LANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAF 706

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           L++LI  IK  GYVPDT+LV    +E + KE +L  HSEKLA++FGLLNT  GTTI V K
Sbjct: 707 LEKLICHIKEAGYVPDTNLV--LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRK 766

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+C DCHNA K ISL+ GR+IV+RD+QRFH F++G CSCGD+
Sbjct: 767 NLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDY 808

BLAST of Lsi04G004220 vs. TAIR10
Match: AT3G23330.1 (AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 464.9 bits (1195), Expect = 6.7e-131
Identity = 220/524 (41.98%), Positives = 345/524 (65.84%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           +D    +F+ +P KD+V +N +  GYA+S     A  +  ++  + L PD +T SS+L  
Sbjct: 192 IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPI 251

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
            +    + +G E+H + I+ G++ ++YI  +L++MYA+   +  +  VF  +     +S+
Sbjct: 252 FSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISW 311

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           N+++ GY ++ + NEAL LFR++  + ++P  V   SVI +CA L  L LG+ +H YV +
Sbjct: 312 NSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLR 371

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
            GF   + + +AL+DM++KCG++  A  IF+ M V D  +W+A+++  A HG+G +A+S+
Sbjct: 372 GGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSL 431

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMKR GV+P+++ F+ +L ACSH GLV++  GYF SM+K YG+   ++HY  + DLLG
Sbjct: 432 FEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLG 491

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EAYNFI ++ ++PT  +W TLLS+CS H N+++A++V E+IF +D  + G YV+
Sbjct: 492 RAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVL 551

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           + N+ A  GRW+++  LR  M+ +G+ K P CS +E+ N  H F SGD  H    ++   
Sbjct: 552 MCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEF 611

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           L  ++++++  GYV DTS V H D++EE K  +L  HSE+LA+AFG++NT PGTTIRV K
Sbjct: 612 LKAVMEQMEKEGYVADTSGVLH-DVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTK 671

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           N+RIC DCH A K IS I  R+I++RD  RFH F  G CSCGD+
Sbjct: 672 NIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDY 714

BLAST of Lsi04G004220 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 464.5 bits (1194), Expect = 8.7e-131
Identity = 218/555 (39.28%), Positives = 341/555 (61.44%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           + +A  +FD     D  L+N+M RG++ S+ P  +  L+ ++LCS    + YTF SLLKA
Sbjct: 65  LPYAQIVFDGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKA 124

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           C++  A +E  ++H    KLG  +++Y   +LIN YA   +   A  +FD + +P  VS+
Sbjct: 125 CSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSW 184

Query: 121 NAIITGYARSSQPN-------------------------------EALSLFRELQASNLE 180
           N++I GY ++ + +                               EAL LF E+Q S++E
Sbjct: 185 NSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVE 244

Query: 181 PTDVTMLSVIMSCALLGALDLGRWIHEYVKKKGFDKYVKVNTALIDMFAKCGSLADAISI 240
           P +V++ + + +CA LGAL+ G+WIH Y+ K        +   LIDM+AKCG + +A+ +
Sbjct: 245 PDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEV 304

Query: 241 FEGMRVRDTQAWSAMMVAFATHGNGLKAISMFEEMKRAGVRPDEITFLGLLYACSHAGLV 300
           F+ ++ +  QAW+A++  +A HG+G +AIS F EM++ G++P+ ITF  +L ACS+ GLV
Sbjct: 305 FKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLV 364

Query: 301 EQGRGYFYSMSKNYGITPGIKHYGCMVDLLGRTGHLDEAYNFIDELEIKPTPILWRTLLS 360
           E+G+  FYSM ++Y + P I+HYGC+VDLLGR G LDEA  FI E+ +KP  ++W  LL 
Sbjct: 365 EEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLK 424

Query: 361 ACSTHGNVDMAKRVIERIFELDDSHGGDYVILSNLCARVGRWEDVNHLRKLMKDRGVVKV 420
           AC  H N+++ + + E +  +D  HGG YV  +N+ A   +W+     R+LMK++GV KV
Sbjct: 425 ACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKV 484

Query: 421 PGCSSVEVNNVVHEFFSGDGVHCISVELRRALDELIKEIKLVGYVPDTSLVYHADMEEEG 480
           PGCS++ +    HEF +GD  H    +++     + ++++  GYVP+   +    ++++ 
Sbjct: 485 PGCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDE 544

Query: 481 KELVLRYHSEKLAMAFGLLNTPPGTTIRVVKNLRICGDCHNAAKLISLIFGRQIVIRDVQ 525
           +E ++  HSEKLA+ +GL+ T PGT IR++KNLR+C DCH   KLIS I+ R IV+RD  
Sbjct: 545 REAIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRT 604

BLAST of Lsi04G004220 vs. NCBI nr
Match: gi|700196745|gb|KGN51922.1| (hypothetical protein Csa_5G605160 [Cucumis sativus])

HSP 1 Score: 1037.3 bits (2681), Expect = 9.2e-300
Identity = 496/524 (94.66%), Positives = 512/524 (97.71%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MDHAHHLFDQI DKDI+LFNIM RGYARSNSPYLAFSLF ++LCSGLLPDDYTFSSLLKA
Sbjct: 80  MDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFGELLCSGLLPDDYTFSSLLKA 139

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CASSKAL+EGM LHCFA+KLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY
Sbjct: 140 CASSKALREGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 199

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NAIITGYARSSQPNEALSLFRELQASN+EPTDVTMLSVIMSCALLGALDLG+WIHEYVKK
Sbjct: 200 NAIITGYARSSQPNEALSLFRELQASNIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKK 259

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
           KGFDKYVKVNTALIDMFAKCGSL DAISIFEGMRVRDTQAWSAM+VAFATHG+GLKAISM
Sbjct: 260 KGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKAISM 319

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMKR GVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSK YGITPGIKHYGCMVDLLG
Sbjct: 320 FEEMKREGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKTYGITPGIKHYGCMVDLLG 379

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R GHLDEAYNF+D+LEIK TPILWRTLLSACSTHGNV+MAKRVIERIFELDD+HGGDYVI
Sbjct: 380 RAGHLDEAYNFVDKLEIKATPILWRTLLSACSTHGNVEMAKRVIERIFELDDAHGGDYVI 439

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNL ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHC+SVELRRA
Sbjct: 440 LSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRA 499

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDEL+KEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRV K
Sbjct: 500 LDELMKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAK 559

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLRICGDCHNAAKLIS IFGR+IVIRDVQRFHRFEDGKCSCGDF
Sbjct: 560 NLRICGDCHNAAKLISFIFGRKIVIRDVQRFHRFEDGKCSCGDF 603

BLAST of Lsi04G004220 vs. NCBI nr
Match: gi|659091079|ref|XP_008446357.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g02980 [Cucumis melo])

HSP 1 Score: 1035.4 bits (2676), Expect = 3.5e-299
Identity = 494/524 (94.27%), Positives = 515/524 (98.28%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MDHAHHLFDQI DKDI+LFNIM RGYARSNSPYLAFSLFAQ+LCSGLLPDDYTFSSLLKA
Sbjct: 80  MDHAHHLFDQILDKDIILFNIMARGYARSNSPYLAFSLFAQLLCSGLLPDDYTFSSLLKA 139

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CASSKAL++GM LHCFA+KLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY
Sbjct: 140 CASSKALRQGMGLHCFAVKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 199

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NAIITGYARSSQPNEALSLFRELQAS++EPTDVTMLSVIMSCALLGALDLG+WIHEYVKK
Sbjct: 200 NAIITGYARSSQPNEALSLFRELQASDIEPTDVTMLSVIMSCALLGALDLGKWIHEYVKK 259

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
           KGFDKYVKVNTALIDMFAKCGSL DAISIFEGMRVRDTQAWSAM+VAFATHG+GLK+IS+
Sbjct: 260 KGFDKYVKVNTALIDMFAKCGSLTDAISIFEGMRVRDTQAWSAMIVAFATHGDGLKSISI 319

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMS+ YGITPGIKHYGCMVDLLG
Sbjct: 320 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSRTYGITPGIKHYGCMVDLLG 379

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           RTG LDEAYNF+DELEIKPTPILWRTLLSACSTHGNV+MAKRVIERIFELDDSHGGDYVI
Sbjct: 380 RTGCLDEAYNFVDELEIKPTPILWRTLLSACSTHGNVEMAKRVIERIFELDDSHGGDYVI 439

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNL ARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHC+SVELRRA
Sbjct: 440 LSNLYARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCVSVELRRA 499

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDEL+KEIKLVGY+PDTSLVYHADM+EEGKELVLRYHSEKLAMAFGLLNTPPGTTIRV K
Sbjct: 500 LDELMKEIKLVGYIPDTSLVYHADMDEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVAK 559

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLRICGDCHNAAKLIS IFGR+IVIRDVQRFH+FEDGKCSCGDF
Sbjct: 560 NLRICGDCHNAAKLISFIFGRKIVIRDVQRFHQFEDGKCSCGDF 603

BLAST of Lsi04G004220 vs. NCBI nr
Match: gi|225425668|ref|XP_002269694.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g02980 [Vitis vinifera])

HSP 1 Score: 870.9 bits (2249), Expect = 1.1e-249
Identity = 398/524 (75.95%), Positives = 466/524 (88.93%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           M HAHHLFDQIP  DIVLFN M RGYAR+++P  AF+LF Q+L SGL PDDYTF SLLKA
Sbjct: 71  MQHAHHLFDQIPQPDIVLFNTMARGYARTDTPLRAFTLFTQILFSGLFPDDYTFPSLLKA 130

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CAS KAL+EG +LHC AIKLGL+ N+Y+CPTLINMY  CN+M+ AR VFD++ +PC+V+Y
Sbjct: 131 CASCKALEEGRQLHCLAIKLGLSENVYVCPTLINMYTACNEMDCARRVFDKIWEPCVVTY 190

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+ITGYAR S+PNEALSLFRELQA NL+PTDVTMLSV+ SCALLGALDLG+W+HEYVKK
Sbjct: 191 NAMITGYARGSRPNEALSLFRELQARNLKPTDVTMLSVLSSCALLGALDLGKWMHEYVKK 250

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
            GF+++VKV+TALIDM+AKCGSL DA+ +FE M VRDTQAWSAM++A+A HG+GLKA+S+
Sbjct: 251 NGFNRFVKVDTALIDMYAKCGSLDDAVCVFENMAVRDTQAWSAMIMAYAIHGHGLKAVSL 310

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           F+EM++AG  PDEITFLGLLYACSH GLVE+G  YFY M   YG+ PGIKHYGCMVDLLG
Sbjct: 311 FKEMRKAGTEPDEITFLGLLYACSHTGLVEEGFEYFYGMRDKYGVIPGIKHYGCMVDLLG 370

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R G L+EAY FI  L I+PTPILWRTLLSAC +HGNV++ KRVIE+IFELDDSHGGDY+I
Sbjct: 371 RAGRLEEAYEFIVGLPIRPTPILWRTLLSACGSHGNVELGKRVIEQIFELDDSHGGDYII 430

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           LSNLCAR GRWEDVN++RKLM +RGVVK+PGCSSVEVNNVVHEFFSGDGVH +S +L +A
Sbjct: 431 LSNLCARAGRWEDVNYVRKLMNERGVVKIPGCSSVEVNNVVHEFFSGDGVHSVSTKLHQA 490

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           LDEL+KE+KLVGYVP+TSLV+HADME+E KE+ LRYHSEKLA+ FGLLNTPPGTTIRVVK
Sbjct: 491 LDELVKELKLVGYVPNTSLVFHADMEDEEKEVTLRYHSEKLAITFGLLNTPPGTTIRVVK 550

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+CGDCH+AAKLISLIF RQI++RDVQRFH F+DGKCSC D+
Sbjct: 551 NLRVCGDCHSAAKLISLIFDRQIILRDVQRFHHFKDGKCSCEDY 594

BLAST of Lsi04G004220 vs. NCBI nr
Match: gi|596294817|ref|XP_007226900.1| (hypothetical protein PRUPE_ppa016366mg [Prunus persica])

HSP 1 Score: 865.1 bits (2234), Expect = 6.3e-248
Identity = 397/524 (75.76%), Positives = 466/524 (88.93%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MD+AHHLFDQIP  DIV+FN M RGYARS++P+ A SLFA +L S L PDDYTF+SLLKA
Sbjct: 69  MDYAHHLFDQIPHPDIVVFNTMARGYARSHAPFRAISLFAHILSSDLFPDDYTFASLLKA 128

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CASSKAL+EG +LHCFAIK GL+ NIY+CPTLINMY ECND++AAR VFD++  PC+V +
Sbjct: 129 CASSKALEEGRQLHCFAIKCGLHLNIYVCPTLINMYTECNDVDAARRVFDKIPDPCVVVH 188

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+I GYARSS+PNEAL+LFRELQASNL+PTDVTMLS + SCALLGALDLG+WIHEYVKK
Sbjct: 189 NAMIKGYARSSRPNEALALFRELQASNLKPTDVTMLSALSSCALLGALDLGKWIHEYVKK 248

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
             FD+YVKVNTALIDM+AKCGSL DA+S+FE M V+DTQAWSAM+VA+ATHGNG KA+SM
Sbjct: 249 NRFDRYVKVNTALIDMYAKCGSLEDAVSVFEDMSVKDTQAWSAMIVAYATHGNGSKALSM 308

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMK+A +RPDEITFLGLLYACSHAG VE+G  YFYSMS+ YGI PGIKHYGCMVDLLG
Sbjct: 309 FEEMKKARIRPDEITFLGLLYACSHAGFVEEGCKYFYSMSERYGIVPGIKHYGCMVDLLG 368

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R+G L EAY FIDEL I PTPI WRTLLSAC +HG+VDM  RV+E+IF LDDSHGGDYVI
Sbjct: 369 RSGRLGEAYKFIDELPITPTPIFWRTLLSACGSHGDVDMGMRVLEQIFALDDSHGGDYVI 428

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           +SNLCAR GRWEDV+ LRKLM+DRG+VK+PGCSS+EVNNVVHEFFSGDG   +S  L +A
Sbjct: 429 ISNLCARAGRWEDVDRLRKLMRDRGIVKIPGCSSIEVNNVVHEFFSGDGERSVSTVLHQA 488

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           +D+L++E+KL GYVPDTSLV+H++ME++ +E+ LRYHSEKLA+A+GLLNTPPG TIRVVK
Sbjct: 489 VDKLVEELKLAGYVPDTSLVFHSNMEDKDREVSLRYHSEKLAIAYGLLNTPPGATIRVVK 548

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+CGDCH+AAK ISLIF RQI++RDVQRFH F++GKCSCGD+
Sbjct: 549 NLRVCGDCHSAAKYISLIFNRQIILRDVQRFHHFKEGKCSCGDY 592

BLAST of Lsi04G004220 vs. NCBI nr
Match: gi|645229766|ref|XP_008221613.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g02980 [Prunus mume])

HSP 1 Score: 861.3 bits (2224), Expect = 9.1e-247
Identity = 396/524 (75.57%), Positives = 464/524 (88.55%), Query Frame = 1

Query: 1   MDHAHHLFDQIPDKDIVLFNIMTRGYARSNSPYLAFSLFAQVLCSGLLPDDYTFSSLLKA 60
           MD+AHHLFDQIP  DIV+FN M RGYARS++P+ A SLFA +L S L PDDYTF SLLKA
Sbjct: 69  MDYAHHLFDQIPHPDIVVFNTMARGYARSHTPFRAISLFAHILSSDLFPDDYTFPSLLKA 128

Query: 61  CASSKALKEGMELHCFAIKLGLNHNIYICPTLINMYAECNDMNAARGVFDEMEQPCIVSY 120
           CASSKAL+EG +LHCFAIK GL+ NIY+CPTLINMY ECND++AAR VFD++  PC+V +
Sbjct: 129 CASSKALEEGRQLHCFAIKCGLHLNIYVCPTLINMYTECNDVDAARRVFDKIPDPCVVVH 188

Query: 121 NAIITGYARSSQPNEALSLFRELQASNLEPTDVTMLSVIMSCALLGALDLGRWIHEYVKK 180
           NA+I GY+RSS+PNEAL+LFRELQASNL+PTDVTMLS + SCALLGALDLG+W+HEYVKK
Sbjct: 189 NAMIKGYSRSSRPNEALALFRELQASNLKPTDVTMLSALSSCALLGALDLGKWMHEYVKK 248

Query: 181 KGFDKYVKVNTALIDMFAKCGSLADAISIFEGMRVRDTQAWSAMMVAFATHGNGLKAISM 240
             FD+YVKVNTALIDM+AKCGSL +A+S+FE M V+DTQAWSAM+VA+ATHGNG KA+SM
Sbjct: 249 NRFDRYVKVNTALIDMYAKCGSLEEAVSVFEDMSVKDTQAWSAMIVAYATHGNGSKALSM 308

Query: 241 FEEMKRAGVRPDEITFLGLLYACSHAGLVEQGRGYFYSMSKNYGITPGIKHYGCMVDLLG 300
           FEEMKRA +RPDEITFLGLLYACSHAG VE+G  YFYSMS+ Y I PGIKHYGCMVDLLG
Sbjct: 309 FEEMKRARIRPDEITFLGLLYACSHAGFVEEGCKYFYSMSETYRIVPGIKHYGCMVDLLG 368

Query: 301 RTGHLDEAYNFIDELEIKPTPILWRTLLSACSTHGNVDMAKRVIERIFELDDSHGGDYVI 360
           R+G L EAY FIDEL IKPTPI WRTLLSAC +HG+VDM  RV+ERIFELDDSHGGDYVI
Sbjct: 369 RSGRLGEAYKFIDELPIKPTPIFWRTLLSACGSHGDVDMGMRVLERIFELDDSHGGDYVI 428

Query: 361 LSNLCARVGRWEDVNHLRKLMKDRGVVKVPGCSSVEVNNVVHEFFSGDGVHCISVELRRA 420
           +SNLCAR GRWEDV+ LRKLM+DRG+VK+PGCSS+EVNNVVHEFFSGDG   +S  L +A
Sbjct: 429 ISNLCARAGRWEDVDRLRKLMRDRGIVKIPGCSSIEVNNVVHEFFSGDGERSVSTVLHQA 488

Query: 421 LDELIKEIKLVGYVPDTSLVYHADMEEEGKELVLRYHSEKLAMAFGLLNTPPGTTIRVVK 480
           +DEL++E+KL GYVPDTSLV+H++ME++ +E+ LRYHSEKLA+A+GLLNTPP  TIRVVK
Sbjct: 489 VDELVEELKLAGYVPDTSLVFHSNMEDKDREVSLRYHSEKLAIAYGLLNTPPRATIRVVK 548

Query: 481 NLRICGDCHNAAKLISLIFGRQIVIRDVQRFHRFEDGKCSCGDF 525
           NLR+CGDCH+AAK ISLIF RQI++RDVQRFH F++G CSCGD+
Sbjct: 549 NLRVCGDCHSAAKYISLIFNRQIILRDVQRFHHFKEGNCSCGDY 592
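
Each hit above is summarized by two lines: a score line (bit score, raw score, E-value) and an identity line (matches over alignment length). The following is a minimal sketch of how those two lines could be parsed and the reported identity percentage re-derived. The function name `parse_hsp` and the regular expressions are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Hypothetical patterns for the two HSP summary lines as they appear on this page.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\deE.+-]+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\((?P<pct>[\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Extract HSP statistics and recompute percent identity from the counts."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognized HSP summary lines")
    ident, length = int(i["ident"]), int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "identities": ident,
        "align_len": length,
        # Recomputed from the counts, rounded like the page (two decimals).
        "pct_identity": round(100.0 * ident / length, 2),
        "pct_reported": float(i["pct"]),
    }


if __name__ == "__main__":
    hsp = parse_hsp(
        "HSP 1 Score: 1037.3 bits (2681), Expect = 9.2e-300",
        "Identity = 496/524 (94.66%)",
    )
    print(hsp)
```

Comparing `pct_identity` with `pct_reported` is a quick consistency check when copying values from the alignments into the summary tables.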

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
PP145_ARATH	8.3e-224	68.51	Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
PPR21_ARATH	4.1e-138	43.16	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PPR32_ARATH	5.7e-132	43.13	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP252_ARATH	1.4e-130	42.10	Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
PP251_ARATH	1.2e-129	41.98	Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...

Match Name	E-value	Identity (%)	Description
A0A0A0KU15_CUCSA	6.4e-300	94.66	Uncharacterized protein OS=Cucumis sativus GN=Csa_5G605160 PE=4 SV=1
D7TN78_VITVI	8.0e-250	75.95	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0026g02510 PE=4 SV=...
M5XQC4_PRUPE	4.4e-248	75.76	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa016366mg PE=4 SV=1
V4TQ07_9ROSI	2.2e-244	75.67	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10031011mg PE=4 SV=1
A0A0B2P166_GLYSO	2.2e-244	75.57	Pentatricopeptide repeat-containing protein OS=Glycine soja GN=glysoja_008131 PE...

Match Name	E-value	Identity (%)	Description
AT2G02980.1	4.6e-225	68.51	Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1	2.3e-139	43.16	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1	3.2e-133	43.13	Pentatricopeptide repeat (PPR) superfamily protein
AT3G23330.1	6.7e-131	41.98	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G66520.1	8.7e-131	39.28	Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name	E-value	Identity (%)	Description
gi|700196745|gb|KGN51922.1|	9.2e-300	94.66	hypothetical protein Csa_5G605160 [Cucumis sativus]
gi|659091079|ref|XP_008446357.1|	3.5e-299	94.27	PREDICTED: pentatricopeptide repeat-containing protein At2g02980 [Cucumis melo]
gi|225425668|ref|XP_002269694.1|	1.1e-249	75.95	PREDICTED: pentatricopeptide repeat-containing protein At2g02980 [Vitis vinifera...
gi|596294817|ref|XP_007226900.1|	6.3e-248	75.76	hypothetical protein PRUPE_ppa016366mg [Prunus persica]
gi|645229766|ref|XP_008221613.1|	9.1e-247	75.57	PREDICTED: pentatricopeptide repeat-containing protein At2g02980 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO:0008270	zinc ion binding
Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031425 chloroplast RNA processing
biological_process GO:0016556 mRNA modification
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0008568 microtubule-severing ATPase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Lsi04G004220.1	Lsi04G004220.1	mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 91..113, score: 0.039; coord: 358..386, score: 0.71; coord: 292..316, score: 0.23; coord: 191..216, score: 0.022; coord: 323..347, score:
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 217..263, score: 3.6E-9; coord: 115..162, score: 2.2E-10; coord: 14..62, score: 4.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 220..253, score: 2.1E-6; coord: 118..151, score: 1.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 151..185, score: 6.127; coord: 252..287, score: 7.465; coord: 50..84, score: 7.772; coord: 186..216, score: 6.665; coord: 15..49, score: 9.361; coord: 85..115, score: 7.651; coord: 320..350, score: 7.366; coord: 354..388, score: 7.256; coord: 116..150, score: 11.619; coord: 217..251, score: 11.378; coord: 288..318, score: 6
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 119..260, score: 5.6E-6; coord: 328..373, score: 5.
IPR011990	Tetratricopeptide-like helical domain	unknown	SSF48452	TPR-like	coord: 71..216, score: 7.5
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 1..395, score: 1.0E
None	No IPR available	PANTHER	PTHR24015:SF44	SUBFAMILY NOT NAMED	coord: 1..395, score: 1.0E
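
The PROSITE PS51375 hits in the table above are listed out of order. A short sketch like the one below, assuming the coordinates are 1-based inclusive residue positions, sorts them and reports the length of each repeat and the gap between neighbours. The names `PPR_HITS` and `summarize_repeats` are hypothetical and only serve this illustration.

```python
# PS51375 (PPR profile) hits copied from the InterPro table,
# as (start, end) residue coordinates, assumed 1-based and inclusive.
PPR_HITS = [
    (151, 185), (252, 287), (50, 84), (186, 216), (15, 49), (85, 115),
    (320, 350), (354, 388), (116, 150), (217, 251), (288, 318),
]


def summarize_repeats(hits):
    """Sort hits by start and return (ordered hits, lengths, gaps between neighbours)."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    # Number of residues between the end of one hit and the start of the next.
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return ordered, lengths, gaps


if __name__ == "__main__":
    ordered, lengths, gaps = summarize_repeats(PPR_HITS)
    print(len(ordered), "repeats spanning", ordered[0][0], "to", ordered[-1][1])
    print("lengths:", lengths)
    print("gaps:", gaps)
```

Under that assumption the eleven hits tile residues 15 to 388 almost contiguously, which is consistent with the tandem arrangement expected of pentatricopeptide repeats.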

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Lsi04G004220	Cucumber (Gy14) v2	cgyblsiB048
Lsi04G004220	Cucumber (Gy14) v2	cgyblsiB341
Lsi04G004220	Melon (DHL92) v3.6.1	lsimedB292
Lsi04G004220	Melon (DHL92) v3.6.1	lsimedB297
Lsi04G004220	Melon (DHL92) v3.6.1	lsimedB302
Lsi04G004220	Melon (DHL92) v3.6.1	lsimedB308
Lsi04G004220	Silver-seed gourd	carlsiB048
Lsi04G004220	Silver-seed gourd	carlsiB356
Lsi04G004220	Cucumber (Chinese Long) v3	cuclsiB061
Lsi04G004220	Cucumber (Chinese Long) v3	cuclsiB059
Lsi04G004220	Cucumber (Chinese Long) v3	cuclsiB393
Lsi04G004220	Cucumber (Chinese Long) v3	cuclsiB397
Lsi04G004220	Watermelon (97103) v2	lsiwmbB281
Lsi04G004220	Watermelon (97103) v2	lsiwmbB311
Lsi04G004220	Wax gourd	lsiwgoB354
Lsi04G004220	Wax gourd	lsiwgoB375
Lsi04G004220	Wax gourd	lsiwgoB408
Lsi04G004220	Bottle gourd (USVL1VR-Ls)	lsilsiB099
Lsi04G004220	Bottle gourd (USVL1VR-Ls)	lsilsiB101
Lsi04G004220	Bottle gourd (USVL1VR-Ls)	lsilsiB131
Lsi04G004220	Bottle gourd (USVL1VR-Ls)	lsilsiB139
Lsi04G004220	Cucumber (Gy14) v1	cgylsiB154
Lsi04G004220	Cucumber (Gy14) v1	cgylsiB256
Lsi04G004220	Cucumber (Gy14) v1	cgylsiB339
Lsi04G004220	Cucurbita maxima (Rimu)	cmalsiB062
Lsi04G004220	Cucurbita maxima (Rimu)	cmalsiB215
Lsi04G004220	Cucurbita maxima (Rimu)	cmalsiB280
Lsi04G004220	Cucurbita maxima (Rimu)	cmalsiB435
Lsi04G004220	Cucurbita maxima (Rimu)	cmalsiB678
Lsi04G004220	Cucurbita moschata (Rifu)	cmolsiB049
Lsi04G004220	Cucurbita moschata (Rifu)	cmolsiB204
Lsi04G004220	Cucurbita moschata (Rifu)	cmolsiB269
Lsi04G004220	Cucurbita moschata (Rifu)	cmolsiB427
Lsi04G004220	Cucurbita moschata (Rifu)	cmolsiB429
Lsi04G004220	Cucurbita moschata (Rifu)	cmolsiB671
Lsi04G004220	Cucurbita pepo (Zucchini)	cpelsiB032
Lsi04G004220	Cucurbita pepo (Zucchini)	cpelsiB360
Lsi04G004220	Cucurbita pepo (Zucchini)	cpelsiB461
Lsi04G004220	Cucurbita pepo (Zucchini)	cpelsiB469
Lsi04G004220	Wild cucumber (PI 183967)	cpilsiB053
Lsi04G004220	Wild cucumber (PI 183967)	cpilsiB055
Lsi04G004220	Wild cucumber (PI 183967)	cpilsiB375
Lsi04G004220	Cucumber (Chinese Long) v2	culsiB053
Lsi04G004220	Cucumber (Chinese Long) v2	culsiB056
Lsi04G004220	Cucumber (Chinese Long) v2	culsiB371
Lsi04G004220	Melon (DHL92) v3.5.1	lsimeB281
Lsi04G004220	Melon (DHL92) v3.5.1	lsimeB287
Lsi04G004220	Melon (DHL92) v3.5.1	lsimeB294
Lsi04G004220	Watermelon (Charleston Gray)	lsiwcgB253
Lsi04G004220	Watermelon (Charleston Gray)	lsiwcgB281
Lsi04G004220	Watermelon (Charleston Gray)	lsiwcgB293
Lsi04G004220	Watermelon (97103) v1	lsiwmB284
Lsi04G004220	Watermelon (97103) v1	lsiwmB328
Lsi04G004220	Watermelon (97103) v1	lsiwmB341