Cla97C05G086320 (gene) Watermelon (97103) v2

NameCla97C05G086320
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat
LocationCla97Chr05 : 4737103 .. 4739320 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCGCTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCACGCCCAAATTCTCAAAACCCTAAGAACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCTAAACTCGATCATCTTGACTCGGCCGAACTCATCCTCAAACTCGCCCCTTGCCGTTCTGTCGTCACTTGGACCGCCCTCATCGCTGGTTCCGTCCAAAACGGCTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGATTTCACTTTCCCTTGCGCGTTCAAGGCCACCACTGGCCTTCGCATGGCCGTGACAGGCACACAGCTACACGGACTTGCGGTTAAGGAGGGATTAATAAACGATGTCTTCGTCGGGTGCAGTGTCTTCGACATGTACAGCAAATTGAGCTTTCTTAATGACGCTTACAAGATATTTGATGAAATGCCTCATCGAAACCTCGAGACGTGGAATGCGTATATATCCAATTCCGTGCACCATGGGCGACCTGAAGACTCTGCCAGTGCATTTATTGAGCTACTTCGGGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCAGACAAACTAGGCTTGGAGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGAAGTGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAAGTTGAATGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAGGAGAAGGCTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGTAGGTCAGTTCAAGCACTAGCCGTCAAGGCTTGTGTAGAGAAAAACATTTTCGTTGGAAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGTATTGATGAAGCAGAGCAAGCCTTCAACGAGATGCCAGAGAGAAACTTGGTGTCTTGGAATGCATTGTTGGGCGGATACGCGCACCAAGGACATGCAGACAAGGCCATGGCATTGCTCGAGGAGATGCTGGCGGCAGGCATGGCACCAAGCTATGTAAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGGTTTGAAGATGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGGTATAGAACCAGGGCCAGAGCATTATGCTTGCTTGGTGGATTTGCTTGGACGTGCTGGAATGGTGGAATGTGCGTATGATTTTATAAAGAGCATGCCATTTCCTCCTACAATCTCAATCTGGGGTGCTCTGTTAGGGGCTTGCCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAATCACGTTGTGCTGTCCAATATGTTTGCTGCAACTGGACGGTAACCCAATTTCAAATTTCACAACACTCTTATTCACATCTACGTTCTGAAGTTAGCTCATTACTCGATGATATCCATTATAATTATTAGTTAGAAAAGAATGCCTCTTACAATCTAAATTGTTAGATCATTGTTTATTTGATCTTTTTGATCGTATTCCACTACTATTCACTCAGTGAGTTAAGAATTCAATTTAAAATTCATTAGGTAGCTACTGTATTAATAATTTTCTAAGATCATTAGAAGTATAAAGACAAAAACTTCTTACTTTAATACTAAATTAAATCACCACTAAGTCTAAAAATTTGAACTAATGAATTGTGATATACTTAATCTTTAACATTCTTCAAGACTTTGATTTCGCACATTCTATGGTAAATTGATTGGGGGGCGAATTCAGGTGGGAAGAAGTGACTGTCGTACGAAATGAGATGAAAGAAGTAGGGATCAAAAAGGGAGCTGGGTTCAGTTGGATAACAGTGAACAGTAGAATCCATATATTCCAAGCGAAAGACCAAAGCCATGAGAAGGACTCTGAAATTCAGGACATGCTGGGGAAGTTGAGGAAGGAGATGCAGGAGGCTACTGGTTGCATTGCAGACACCAAGTATGCTCTTTTTGAAGTATCGAATTAA

mRNA sequence

ATGCCGCTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCACGCCCAAATTCTCAAAACCCTAAGAACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCTAAACTCGATCATCTTGACTCGGCCGAACTCATCCTCAAACTCGCCCCTTGCCGTTCTGTCGTCACTTGGACCGCCCTCATCGCTGGTTCCGTCCAAAACGGCTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGATTTCACTTTCCCTTGCGCGTTCAAGGCCACCACTGGCCTTCGCATGGCCGTGACAGGCACACAGCTACACGGACTTGCGGTTAAGGAGGGATTAATAAACGATGTCTTCGTCGGGTGCAGTGTCTTCGACATGTACAGCAAATTGAGCTTTCTTAATGACGCTTACAAGATATTTGATGAAATGCCTCATCGAAACCTCGAGACGTGGAATGCGTATATATCCAATTCCGTGCACCATGGGCGACCTGAAGACTCTGCCAGTGCATTTATTGAGCTACTTCGGGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCAGACAAACTAGGCTTGGAGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGAAGTGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAAGTTGAATGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAGGAGAAGGCTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGTAGGTCAGTTCAAGCACTAGCCGTCAAGGCTTGTGTAGAGAAAAACATTTTCGTTGGAAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGTATTGATGAAGCAGAGCAAGCCTTCAACGAGATGCCAGAGAGAAACTTGGTGTCTTGGAATGCATTGTTGGGCGGATACGCGCACCAAGGACATGCAGACAAGGCCATGGCATTGCTCGAGGAGATGCTGGCGGCAGGCATGGCACCAAGCTATGTAAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGGTTTGAAGATGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGGTATAGAACCAGGGCCAGAGCATTATGCTTGCTTGGTGGATTTGCTTGGACGTGCTGGAATGGTGGAATGTGCGTATGATTTTATAAAGAGCATGCCATTTCCTCCTACAATCTCAATCTGGGGTGCTCTGTTAGGGGCTTGCCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAATCACGTTGTGCTGTCCAATATGTTTGCTGCAACTGGACGGTGGGAAGAAGTGACTGTCGTACGAAATGAGATGAAAGAAGTAGGGATCAAAAAGGGAGCTGGGTTCAGTTGGATAACAGTGAACAGTAGAATCCATATATTCCAAGCGAAAGACCAAAGCCATGAGAAGGACTCTGAAATTCAGGACATGCTGGGGAAGTTGAGGAAGGAGATGCAGGAGGCTACTGGTTGCATTGCAGACACCAAGTATGCTCTTTTTGAAGTATCGAATTAA

Coding sequence (CDS)

ATGCCGCTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCACGCCCAAATTCTCAAAACCCTAAGAACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCTAAACTCGATCATCTTGACTCGGCCGAACTCATCCTCAAACTCGCCCCTTGCCGTTCTGTCGTCACTTGGACCGCCCTCATCGCTGGTTCCGTCCAAAACGGCTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGATTTCACTTTCCCTTGCGCGTTCAAGGCCACCACTGGCCTTCGCATGGCCGTGACAGGCACACAGCTACACGGACTTGCGGTTAAGGAGGGATTAATAAACGATGTCTTCGTCGGGTGCAGTGTCTTCGACATGTACAGCAAATTGAGCTTTCTTAATGACGCTTACAAGATATTTGATGAAATGCCTCATCGAAACCTCGAGACGTGGAATGCGTATATATCCAATTCCGTGCACCATGGGCGACCTGAAGACTCTGCCAGTGCATTTATTGAGCTACTTCGGGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCAGACAAACTAGGCTTGGAGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGAAGTGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAAGTTGAATGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAGGAGAAGGCTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGTAGGTCAGTTCAAGCACTAGCCGTCAAGGCTTGTGTAGAGAAAAACATTTTCGTTGGAAGTGCACTGGTTGACATGTATGGAAAATGTGGAAGTATTGATGAAGCAGAGCAAGCCTTCAACGAGATGCCAGAGAGAAACTTGGTGTCTTGGAATGCATTGTTGGGCGGATACGCGCACCAAGGACATGCAGACAAGGCCATGGCATTGCTCGAGGAGATGCTGGCGGCAGGCATGGCACCAAGCTATGTAAGTTTGGTCTGTGCATTATCAGCTTGCAGTAGAGCAGGAGGTTTGAAGATGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGGTATAGAACCAGGGCCAGAGCATTATGCTTGCTTGGTGGATTTGCTTGGACGTGCTGGAATGGTGGAATGTGCGTATGATTTTATAAAGAGCATGCCATTTCCTCCTACAATCTCAATCTGGGGTGCTCTGTTAGGGGCTTGCCGAATGCATGGGAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTGTTTGAACTTGATCCAAAAGACTCTGGAAATCACGTTGTGCTGTCCAATATGTTTGCTGCAACTGGACGGTGGGAAGAAGTGACTGTCGTACGAAATGAGATGAAAGAAGTAGGGATCAAAAAGGGAGCTGGGTTCAGTTGGATAACAGTGAACAGTAGAATCCATATATTCCAAGCGAAAGACCAAAGCCATGAGAAGGACTCTGAAATTCAGGACATGCTGGGGAAGTTGAGGAAGGAGATGCAGGAGGCTACTGGTTGCATTGCAGACACCAAGTATGCTCTTTTTGAAGTATCGAATTAA

Protein sequence

MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGSIDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCALSACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYALFEVSN
BLAST of Cla97C05G086320 vs. NCBI nr
Match: XP_004134445.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativus] >XP_011650980.1 PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativus] >KGN56980.1 hypothetical protein Csa_3G146650 [Cucumis sativus])

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 556/606 (91.75%), Postives = 576/606 (95.05%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+S
Sbjct: 1   MPFLSQNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPC  KA+TG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRM  TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMPHRNLETWNAY
Sbjct: 121 LRMDTTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKVFDEMPHRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSV HGRPEDS  AFIELLRVGGKPDSITFCAF NACSDKLGL PGCQLHGFIIRSGY
Sbjct: 181 ISNSVLHGRPEDSVIAFIELLRVGGKPDSITFCAFLNACSDKLGLGPGCQLHGFIIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVE+NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML-AAGMAPSYVSLVCALS 420
           ID AEQAFN MPERNLVSWNALLGGYAHQGHA+KA+ALLEEM  AAG+ PSYVSL+CALS
Sbjct: 361 IDNAEQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK MPFPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           KEVGIKKGAGFSWITV+SRIH+FQAKD+SHEKD EIQD+LGKLRKEMQ+A GCIAD  YA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHMFQAKDKSHEKDPEIQDILGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 606
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of Cla97C05G086320 vs. NCBI nr
Match: XP_008438671.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis melo])

HSP 1 Score: 1119.0 bits (2893), Expect = 0.0e+00
Identity = 552/606 (91.09%), Postives = 574/606 (94.72%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+S
Sbjct: 1   MPFLSKNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPC  KA+TG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRM +TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMP RNLETWNAY
Sbjct: 121 LRMDMTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKLFDEMPQRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           I+NSV HGRPEDSA AFIELLRVG KPDSITFCAF NACSDKLGL PGCQLHGF+IRSGY
Sbjct: 181 ITNSVLHGRPEDSAIAFIELLRVGEKPDSITFCAFLNACSDKLGLGPGCQLHGFVIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVE+NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML-AAGMAPSYVSLVCALS 420
           ID A QAFN MPERNLVSWNALLGGYAHQGHA+KA+ALLEEM  AAG+ PSYVSL+CALS
Sbjct: 361 IDNAVQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK MPFPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEE TVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEATVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           KEVGIKKGAGFSWITV+SRIHIFQAKD+SHEKD EIQ+MLGKLRKEMQ+A GCIAD  YA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHIFQAKDKSHEKDPEIQNMLGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 606
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of Cla97C05G086320 vs. NCBI nr
Match: XP_022956070.1 (pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata])

HSP 1 Score: 1095.9 bits (2833), Expect = 0.0e+00
Identity = 540/603 (89.55%), Postives = 568/603 (94.20%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LSPNSLASLVE A+S+RSSLLGR AHAQILKTL+TP PAFLYNHLVNMYAKLD L+S
Sbjct: 1   MPFLSPNSLASLVEFALSIRSSLLGRVAHAQILKTLKTPFPAFLYNHLVNMYAKLDQLNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           AELIL+LAPCRSVVTWT+LIAGSVQNG FASALLHFSDMLSDCVRPNDFTFPC FKA+TG
Sbjct: 61  AELILELAPCRSVVTWTSLIAGSVQNGRFASALLHFSDMLSDCVRPNDFTFPCVFKASTG 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRMA+TG Q+H LAVKEGLINDVFVGCS FDMYSKL  L+DAYK+F EMPHRNLETWNAY
Sbjct: 121 LRMAMTGKQVHALAVKEGLINDVFVGCSAFDMYSKLGLLDDAYKLFVEMPHRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSV HGRPEDSA AFIELLR GGKPDSITFCAF NACSDKLGLEPGCQLHGFIIRSG 
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRAGGKPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGC 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Sbjct: 241 GQNVSISNGLIDFYGKCGEVVCSEVIFDRMGERNSVSWSSLIAAYVQNNEEEKACCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKEDIKPTDFMVSSVLCA AGLSEIE GRSVQALAVKACV++NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEDIKPTDFMVSSVLCASAGLSEIELGRSVQALAVKACVDENIFVGSALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAA-GMAPSYVSLVCALS 420
           ID+AEQAFNEMPERNLVSWNALLGGYAHQG+ADKA+ALL++M +  G+APSYVSLVCALS
Sbjct: 361 IDKAEQAFNEMPERNLVSWNALLGGYAHQGYADKAVALLKDMASVEGIAPSYVSLVCALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GMQIFESMKARYGIEPGPEHYACLVDL GRAGMVECAYDFI+ MPFPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLFGRAGMVECAYDFIRRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT RWEEVTV+RNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMLAATSRWEEVTVIRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           KEVGIKKGAGFSWITVNSRIHIFQAKD+S+EKDSE+QDMLGKLRKEMQEA G IAD  YA
Sbjct: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDKSYEKDSELQDMLGKLRKEMQEAAGSIADANYA 600

Query: 601 LFE 603
           LFE
Sbjct: 601 LFE 603

BLAST of Cla97C05G086320 vs. NCBI nr
Match: XP_022979420.1 (pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima] >XP_022979421.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima] >XP_022979423.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima] >XP_022979424.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima] >XP_022979425.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima] >XP_022979426.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima])

HSP 1 Score: 1088.9 bits (2815), Expect = 0.0e+00
Identity = 539/603 (89.39%), Postives = 565/603 (93.70%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LSPNSLASLVELA+S+RSSLLGR AHAQILKTL+TP PAFLYNHLVNMYAKLD L+S
Sbjct: 1   MPFLSPNSLASLVELALSIRSSLLGRVAHAQILKTLKTPFPAFLYNHLVNMYAKLDQLNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           AELIL+LAPCRSVVTWT+LIAGSVQNG F+SALLHFSDMLSDCVRPNDFTFPC  KA+TG
Sbjct: 61  AELILELAPCRSVVTWTSLIAGSVQNGRFSSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRMA+TG QLH LAVKEGLINDVFVGCS FDMYSKL  L+DAYK+F EMPHRNLETWNAY
Sbjct: 121 LRMAMTGKQLHALAVKEGLINDVFVGCSAFDMYSKLGLLDDAYKLFVEMPHRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSV HGRPEDSA AFIELLR GGKPDSITFCAF NACSDKLGLEPGCQLHGFIIRSG 
Sbjct: 181 ISNSVLHGRPEDSAIAFIELLRAGGKPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGC 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Sbjct: 241 GQNVSISNGLIDFYGKCGEVICSEVIFDRMGERNSVSWSSLIAAYVQNNEEEKACCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKE IKPTDFMVSSVLCA AGLSEIE GRSVQALAVKACVE+NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEGIKPTDFMVSSVLCASAGLSEIELGRSVQALAVKACVEENIFVGSALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAA-GMAPSYVSLVCALS 420
           IDEAE+AFNEMPERNLVSWN+LLGGYAHQG ADKA+ALLEEM +A G+APSYVSLVCALS
Sbjct: 361 IDEAERAFNEMPERNLVSWNSLLGGYAHQGCADKAVALLEEMASADGIAPSYVSLVCALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GMQIFESMKARYGIEPGPEHYACLVDL GRAGMVECAYDFI+ MPFPPTI
Sbjct: 421 ACSRAGDLKTGMQIFESMKARYGIEPGPEHYACLVDLFGRAGMVECAYDFIRRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM AAT RWEEVTV+RNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMLAATSRWEEVTVIRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           KEVGIKKGAGFSWITVN RIHIFQAKD+S+EKDSE+QDMLG LRKEMQEA G IA+  YA
Sbjct: 541 KEVGIKKGAGFSWITVNRRIHIFQAKDKSYEKDSELQDMLGNLRKEMQEAAGSIAEANYA 600

Query: 601 LFE 603
           LFE
Sbjct: 601 LFE 603

BLAST of Cla97C05G086320 vs. NCBI nr
Match: XP_022137756.1 (pentatricopeptide repeat-containing protein At4g14850 [Momordica charantia])

HSP 1 Score: 1081.2 bits (2795), Expect = 0.0e+00
Identity = 535/604 (88.58%), Postives = 563/604 (93.21%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LSPNSLASLVELAVS RSSLLGRAAHAQILKTL+TPLP+FLYNHLVNMYAKLDH DS
Sbjct: 1   MPFLSPNSLASLVELAVSARSSLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPDS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           AEL+L LAPCRSVVTWTALIAGSVQNG F+SALL+FS MLSDCVRPNDFTFPCA KA+T 
Sbjct: 61  AELVLGLAPCRSVVTWTALIAGSVQNGHFSSALLYFSHMLSDCVRPNDFTFPCALKASTS 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRMA++G Q+H LAVKEGLINDVFVGCS FDMYSKL  L DA K+F EMPHRNLETWNAY
Sbjct: 121 LRMAMSGKQIHALAVKEGLINDVFVGCSTFDMYSKLGLLEDASKVFVEMPHRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSVHHGRPEDS  AF+ELLR GG PDSITFCAF NACSDKLGLEPGCQLHGFIIRSG+
Sbjct: 181 ISNSVHHGRPEDSVIAFLELLRAGGSPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
            QNVSVSNGLIDFYGKCGEVECS MVFDRMGERN+VSWSSLIAAY+QNNEEEKA CLFL+
Sbjct: 241 EQNVSVSNGLIDFYGKCGEVECSGMVFDRMGERNAVSWSSLIAAYIQNNEEEKACCLFLQ 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKEDIKP DFMVSSVLCACAGLS IE GRSVQALAVKACVE+NIFVGSALVDMYGKCGS
Sbjct: 301 ARKEDIKPIDFMVSSVLCACAGLSGIELGRSVQALAVKACVEENIFVGSALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML-AAGMAPSYVSLVCALS 420
           IDEAE+AF EMP++NLVSWN LLGGYAHQGHADKA+ALLEEM  AAGMAPSYVSLVCALS
Sbjct: 361 IDEAERAFKEMPDKNLVSWNTLLGGYAHQGHADKAVALLEEMTSAAGMAPSYVSLVCALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GMQIFESMKARY +EPGPEHYA LVDLLGRAGMVECAYDFIK+MPF PTI
Sbjct: 421 ACSRAGDLKRGMQIFESMKARYNVEPGPEHYASLVDLLGRAGMVECAYDFIKNMPFSPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVT VRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTGVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           +EVGIKKGAGFSWITV+SRIHIFQAKD+SHEKDSEIQD+LGKLRKEMQEA G IA T Y+
Sbjct: 541 REVGIKKGAGFSWITVDSRIHIFQAKDRSHEKDSEIQDLLGKLRKEMQEAAGYIAXTYYS 600

Query: 601 LFEV 604
           +FEV
Sbjct: 601 IFEV 604

BLAST of Cla97C05G086320 vs. TrEMBL
Match: tr|A0A0A0L4T8|A0A0A0L4T8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G146650 PE=4 SV=1)

HSP 1 Score: 1128.6 bits (2918), Expect = 0.0e+00
Identity = 556/606 (91.75%), Postives = 576/606 (95.05%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+S
Sbjct: 1   MPFLSQNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPC  KA+TG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRM  TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMPHRNLETWNAY
Sbjct: 121 LRMDTTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKVFDEMPHRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSV HGRPEDS  AFIELLRVGGKPDSITFCAF NACSDKLGL PGCQLHGFIIRSGY
Sbjct: 181 ISNSVLHGRPEDSVIAFIELLRVGGKPDSITFCAFLNACSDKLGLGPGCQLHGFIIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVE+NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML-AAGMAPSYVSLVCALS 420
           ID AEQAFN MPERNLVSWNALLGGYAHQGHA+KA+ALLEEM  AAG+ PSYVSL+CALS
Sbjct: 361 IDNAEQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK MPFPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           KEVGIKKGAGFSWITV+SRIH+FQAKD+SHEKD EIQD+LGKLRKEMQ+A GCIAD  YA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHMFQAKDKSHEKDPEIQDILGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 606
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of Cla97C05G086320 vs. TrEMBL
Match: tr|A0A1S3AXN0|A0A1S3AXN0_CUCME (pentatricopeptide repeat-containing protein At4g14850 OS=Cucumis melo OX=3656 GN=LOC103483708 PE=4 SV=1)

HSP 1 Score: 1119.0 bits (2893), Expect = 0.0e+00
Identity = 552/606 (91.09%), Postives = 574/606 (94.72%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+S
Sbjct: 1   MPFLSKNSLASVVELAVSVRSSLLGRAAHAQILKTLKTPFPAFLYNHLVNMYAKLDHLNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDMLSDCVRPNDFTFPC  KA+TG
Sbjct: 61  AKLILELAPCRSVVTWTALIAGSVQNGCFVSALLHFSDMLSDCVRPNDFTFPCVLKASTG 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRM +TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMP RNLETWNAY
Sbjct: 121 LRMDMTGKQLHALAVKEGLINDVFVGCSVFDMYSKLGFLNDAYKLFDEMPQRNLETWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           I+NSV HGRPEDSA AFIELLRVG KPDSITFCAF NACSDKLGL PGCQLHGF+IRSGY
Sbjct: 181 ITNSVLHGRPEDSAIAFIELLRVGEKPDSITFCAFLNACSDKLGLGPGCQLHGFVIRSGY 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Sbjct: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           ARKEDI+PTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVE+NIFV SALVDMYGKCGS
Sbjct: 301 ARKEDIEPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEQNIFVASALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML-AAGMAPSYVSLVCALS 420
           ID A QAFN MPERNLVSWNALLGGYAHQGHA+KA+ALLEEM  AAG+ PSYVSL+CALS
Sbjct: 361 IDNAVQAFNAMPERNLVSWNALLGGYAHQGHANKAVALLEEMTSAAGIVPSYVSLICALS 420

Query: 421 ACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTI 480
           ACSRAG LK GM+IFESMK RYG+EPGPEHYACLVDLLGRAGMVECAYDFIK MPFPPTI
Sbjct: 421 ACSRAGDLKTGMKIFESMKERYGVEPGPEHYACLVDLLGRAGMVECAYDFIKRMPFPPTI 480

Query: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEM 540
           SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEE TVVRNEM
Sbjct: 481 SIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEATVVRNEM 540

Query: 541 KEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYA 600
           KEVGIKKGAGFSWITV+SRIHIFQAKD+SHEKD EIQ+MLGKLRKEMQ+A GCIAD  YA
Sbjct: 541 KEVGIKKGAGFSWITVDSRIHIFQAKDKSHEKDPEIQNMLGKLRKEMQDAAGCIADPNYA 600

Query: 601 LFEVSN 606
           LFEVSN
Sbjct: 601 LFEVSN 606

BLAST of Cla97C05G086320 vs. TrEMBL
Match: tr|A0A2I4GEW5|A0A2I4GEW5_9ROSI (pentatricopeptide repeat-containing protein At4g14850 OS=Juglans regia OX=51240 GN=LOC109007283 PE=4 SV=1)

HSP 1 Score: 857.4 bits (2214), Expect = 1.9e-245
Identity = 419/605 (69.26%), Postives = 502/605 (82.98%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           M  L+PNSLA+LVE A+S RSS LGRA HAQI+KTL  PLP+FL NHLVNMY+KLD   S
Sbjct: 1   MTSLTPNSLAALVESALSTRSSFLGRAVHAQIIKTLNNPLPSFLSNHLVNMYSKLDLPIS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+L+L L P RSVVTWTALIAGSVQNG FASALL FS+ML + ++PNDFTFPCAFKA+  
Sbjct: 61  AQLVLSLTPSRSVVTWTALIAGSVQNGRFASALLQFSNMLRERIQPNDFTFPCAFKASAS 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LRM   G Q+H LAVK+G I DVFVGCS FDMY K    ++A  +FDEMP +N+ TWNAY
Sbjct: 121 LRMPAIGKQVHALAVKDGQIRDVFVGCSAFDMYCKTGLRDEARILFDEMPEKNIVTWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           +SN+V  G+P ++  AFIE LRV GKPDSITFCAF NACSD L LEPG QLHGFIIRSG+
Sbjct: 181 MSNAVLDGQPRNAVKAFIEFLRVDGKPDSITFCAFLNACSDALYLEPGRQLHGFIIRSGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
             +VS SNGLIDFYGKC +V  SEMVFDRM +RN VSW S++ A++QN EEEKA  +FL+
Sbjct: 241 LADVSASNGLIDFYGKCRDVGSSEMVFDRMCQRNDVSWCSMVTAHLQNYEEEKACMVFLQ 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           AR+E ++PTD+M+SSVL ACAGLS +E GRSV ALAVKACVE N+FVGSA+VDMYGKCGS
Sbjct: 301 AREEGVEPTDYMISSVLSACAGLSGLELGRSVHALAVKACVEGNVFVGSAIVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAG--MAPSYVSLVCAL 420
           I +AEQAF EMPERNL++WNA++GGYAHQGHAD A+A L++M +    + P+YV+LVC L
Sbjct: 361 IHDAEQAFYEMPERNLITWNAMIGGYAHQGHADMALAFLQDMTSGSDDVVPNYVTLVCVL 420

Query: 421 SACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPT 480
           SACSRAG ++MG++IFESM+AR+GIEPG EHYAC+VDLLGR+GMVE AY+F+  M  PPT
Sbjct: 421 SACSRAGAVEMGLEIFESMRARFGIEPGVEHYACIVDLLGRSGMVERAYEFLTKMRIPPT 480

Query: 481 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 540
           I+IWGALLGACRM+GKPELGK+AA+ LFELDPKDSGNHVVLSN+FAA G WEE T+VR E
Sbjct: 481 IAIWGALLGACRMYGKPELGKIAADNLFELDPKDSGNHVVLSNLFAAAGMWEEATLVRKE 540

Query: 541 MKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKY 600
           MK VGIKKG G SWI+V + +H+FQAKD SHE++SEIQ ML KLRKEM+EA G + DT +
Sbjct: 541 MKNVGIKKGVGCSWISVKNGVHVFQAKDTSHERNSEIQAMLYKLRKEMKEA-GYVPDTDF 600

Query: 601 ALFEV 604
           AL+++
Sbjct: 601 ALYDL 604

BLAST of Cla97C05G086320 vs. TrEMBL
Match: tr|A0A2N9H1X5|A0A2N9H1X5_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33757 PE=4 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 6.7e-243
Identity = 421/606 (69.47%), Postives = 502/606 (82.84%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP L+ NSLA+LV+ AVS RSSLLGRAAHAQ+LKTL TP P+FL NHLVNMY+KLD  +S
Sbjct: 1   MPFLATNSLAALVQSAVSTRSSLLGRAAHAQMLKTLETPFPSFLSNHLVNMYSKLDLPNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+L+L L P R VVTWTALIAGSVQNG FASALLHFS+ML + +RPNDFTFPCAFKA+  
Sbjct: 61  AQLVLSLTPSRCVVTWTALIAGSVQNGHFASALLHFSNMLRERIRPNDFTFPCAFKASAS 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           L M V G Q+H +AVK+G I DVFVGCS FDMY K    ++A K+FDEMP RN+ TWNAY
Sbjct: 121 LCMPVVGKQVHAIAVKDGQIRDVFVGCSAFDMYCKTGLRDEARKLFDEMPERNIVTWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRV-GGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSG 240
           ISN+V  G+  ++  AFIELLRV GG+PDSITFCAF NACSD   LE G QLHGF+IR G
Sbjct: 181 ISNAVLDGQSRNAVDAFIELLRVGGGQPDSITFCAFLNACSDASYLELGRQLHGFVIRIG 240

Query: 241 YGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFL 300
           +  +VSVSNGLIDFYGKC EV  S+MVF+ M  RN VSW SL+AA+VQN E+EKA  +FL
Sbjct: 241 FEADVSVSNGLIDFYGKCWEVGSSKMVFEGMSRRNDVSWCSLVAAHVQNYEDEKACVVFL 300

Query: 301 RARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCG 360
           +AR+E I+ TDFM+SSVL A AGLS +E GRSV ALAVKACV  +IFVGSA+VDMYGKCG
Sbjct: 301 QAREEGIEMTDFMLSSVLSASAGLSGLELGRSVHALAVKACVVGSIFVGSAIVDMYGKCG 360

Query: 361 SIDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAG--MAPSYVSLVCA 420
           +I++AE AF+EMPERNL++WNA++GGYAHQGHAD A+ALLEEM  +   + P+YV+LVC 
Sbjct: 361 NINDAELAFHEMPERNLITWNAMIGGYAHQGHADMALALLEEMTTSSNEVVPNYVTLVCV 420

Query: 421 LSACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPP 480
           LSACSRAG +KMGM +F+SM+ RYGIEPG EHYAC+VDLLGRAGMVE AY+FIK MP  P
Sbjct: 421 LSACSRAGAVKMGMGVFDSMRGRYGIEPGVEHYACVVDLLGRAGMVERAYEFIKEMPIRP 480

Query: 481 TISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRN 540
           T S+WGALLGACR++GKPELGK+AA+ LFELDPKDSGNHVVLSN+FAATGRWEE T+VR 
Sbjct: 481 TTSVWGALLGACRVYGKPELGKIAADNLFELDPKDSGNHVVLSNLFAATGRWEEATLVRK 540

Query: 541 EMKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTK 600
           EMK+VGIKKG G SW++V + +H+FQAKD SHE++ EIQ ML KLR+EM EA G + DT 
Sbjct: 541 EMKDVGIKKGVGCSWVSVKNTVHVFQAKDTSHERNLEIQAMLVKLRREMNEA-GYVPDTN 600

Query: 601 YALFEV 604
           +AL+++
Sbjct: 601 FALYDL 605

BLAST of Cla97C05G086320 vs. TrEMBL
Match: tr|A0A2P6RUQ0|A0A2P6RUQ0_ROSCH (Putative tetratricopeptide-like helical domain, DYW domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr2g0129921 PE=4 SV=1)

HSP 1 Score: 840.9 bits (2171), Expect = 1.8e-240
Identity = 419/605 (69.26%), Postives = 494/605 (81.65%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           MP L+PNSLASL++ AVS RSSLLGRAAHA+I++TL+ P P+FL NHLVNMYAKLD  +S
Sbjct: 1   MPSLTPNSLASLLQSAVSTRSSLLGRAAHARIIRTLQPPHPSFLSNHLVNMYAKLDLPNS 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A+L+L L P  SVVTWTALIAG V N  FASALLHF++M  D VRPNDFTFPCAFKA+  
Sbjct: 61  AQLVLHLTPSPSVVTWTALIAGLVHNRHFASALLHFANMRRDSVRPNDFTFPCAFKASGL 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LR+ V G Q+H LAVK G I DVFVGCS FDMY K    +DA K+FDEMP RNL TWNAY
Sbjct: 121 LRLPVVGKQVHALAVKAGQICDVFVGCSAFDMYCKTGLRDDAGKVFDEMPERNLATWNAY 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           +SN+V   RP  + + FIE +R GG+P+ ITFCAF NACSD   LE G QLHGF++R G+
Sbjct: 181 MSNAVLDRRPVSAVNKFIEFVRAGGEPNPITFCAFLNACSDTSALELGRQLHGFVMRCGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
           G++VSV NGL+DFYGKC +V  S+MVFDR+GE N VSW S++AAYVQN EEEKA  LFLR
Sbjct: 241 GKDVSVLNGLVDFYGKCRDVGSSKMVFDRIGEANHVSWCSMVAAYVQNYEEEKACELFLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           AR+E ++PTDFMVSSVL AC+GL+ +E GRSV ALAVKACV+ N+FVGSALVDMYGKCGS
Sbjct: 301 ARREGVEPTDFMVSSVLSACSGLAWLEQGRSVHALAVKACVDGNVFVGSALVDMYGKCGS 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAG--MAPSYVSLVCAL 420
           I++AE AF+ M  RNL+SWNA++GGY HQGHAD A+AL EEM A    + P+YV+LVC L
Sbjct: 361 IEDAECAFDAMRSRNLISWNAMVGGYTHQGHADMALALFEEMSAGSHEVKPNYVTLVCVL 420

Query: 421 SACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPT 480
           SACSRAG +  GMQIF+SMK RYG+EPG EHYAC+VDLLGRAGMVE AY+FI  MP  PT
Sbjct: 421 SACSRAGAVPKGMQIFDSMKQRYGVEPGAEHYACVVDLLGRAGMVERAYEFITKMPINPT 480

Query: 481 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 540
           ISIWGALLGAC+M+ KPELGK+AA KLFELDPKDSGNHVVLSN+ AATGRWEE T+VR E
Sbjct: 481 ISIWGALLGACKMYRKPELGKIAAHKLFELDPKDSGNHVVLSNLLAATGRWEEATLVRKE 540

Query: 541 MKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKY 600
           MK+VGIKKGAG+SWI V + +HIFQAKD SHE +SEIQ ML  LR++MQEA G + DT +
Sbjct: 541 MKDVGIKKGAGYSWIAVKNAVHIFQAKDTSHEMNSEIQAMLTYLRRKMQEA-GYVPDTNF 600

Query: 601 ALFEV 604
           ALF++
Sbjct: 601 ALFDL 604

BLAST of Cla97C05G086320 vs. Swiss-Prot
Match: sp|Q0WSH6|PP312_ARATH (Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX=3702 GN=LOI1 PE=1 SV=1)

HSP 1 Score: 774.6 bits (1999), Expect = 7.9e-223
Identity = 378/605 (62.48%), Postives = 473/605 (78.18%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           M LLS ++L  L++ A+S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDH +S
Sbjct: 1   MSLLSADALGLLLKNAISASSMRLGRVVHARIVKTLDSPPPPFLANYLINMYSKLDHPES 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A L+L+L P R+VV+WT+LI+G  QNG F++AL+ F +M  + V PNDFTFPCAFKA   
Sbjct: 61  ARLVLRLTPARNVVSWTSLISGLAQNGHFSTALVEFFEMRREGVVPNDFTFPCAFKAVAS 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LR+ VTG Q+H LAVK G I DVFVGCS FDMY K    +DA K+FDE+P RNLETWNA+
Sbjct: 121 LRLPVTGKQIHALAVKCGRILDVFVGCSAFDMYCKTRLRDDARKLFDEIPERNLETWNAF 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSV  GRP ++  AFIE  R+ G P+SITFCAF NACSD L L  G QLHG ++RSG+
Sbjct: 181 ISNSVTDGRPREAIEAFIEFRRIDGHPNSITFCAFLNACSDWLHLNLGMQLHGLVLRSGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
             +VSV NGLIDFYGKC ++  SE++F  MG +N+VSW SL+AAYVQN+E+EKAS L+LR
Sbjct: 241 DTDVSVCNGLIDFYGKCKQIRSSEIIFTEMGTKNAVSWCSLVAAYVQNHEDEKASVLYLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           +RK+ ++ +DFM+SSVL ACAG++ +E GRS+ A AVKACVE+ IFVGSALVDMYGKCG 
Sbjct: 301 SRKDIVETSDFMISSVLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGC 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML--AAGMAPSYVSLVCAL 420
           I+++EQAF+EMPE+NLV+ N+L+GGYAHQG  D A+AL EEM     G  P+Y++ V  L
Sbjct: 361 IEDSEQAFDEMPEKNLVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLL 420

Query: 421 SACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPT 480
           SACSRAG ++ GM+IF+SM++ YGIEPG EHY+C+VD+LGRAGMVE AY+FIK MP  PT
Sbjct: 421 SACSRAGAVENGMKIFDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPT 480

Query: 481 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 540
           IS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FAA GRW E   VR E
Sbjct: 481 ISVWGALQNACRMHGKPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREE 540

Query: 541 MKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKY 600
           +K VGIKKGAG+SWITV +++H FQAKD+SH  + EIQ  L KLR EM EA G   D K 
Sbjct: 541 LKGVGIKKGAGYSWITVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEM-EAAGYKPDLKL 600

Query: 601 ALFEV 604
           +L+++
Sbjct: 601 SLYDL 604

BLAST of Cla97C05G086320 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 3.5e-114
Identity = 230/610 (37.70%), Postives = 350/610 (57.38%), Query Frame = 0

Query: 4   LSPNS----LASLVELAVSVRSSL-LGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHL 63
           +SP S    L+S  E +++    L  GR  H  ++ T        + N LVNMYAK   +
Sbjct: 306 VSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSI 365

Query: 64  DSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKAT 123
             A  +      +  V+W ++I G  QNGCF  A+  +  M    + P  FT   +  + 
Sbjct: 366 ADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSC 425

Query: 124 TGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWN 183
             L+ A  G Q+HG ++K G+  +V V  ++  +Y++  +LN+  KIF  MP  +  +WN
Sbjct: 426 ASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWN 485

Query: 184 AYISNSVHHGRP-EDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIR 243
           + I       R   ++   F+   R G K + ITF +  +A S     E G Q+HG  ++
Sbjct: 486 SIIGALARSERSLPEAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALK 545

Query: 244 SGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKASC 303
           +      +  N LI  YGKCGE++  E +F RM E R++V+W+S+I+ Y+ N    KA  
Sbjct: 546 NNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALD 605

Query: 304 LFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYG 363
           L     +   +   FM ++VL A A ++ +E G  V A +V+AC+E ++ VGSALVDMY 
Sbjct: 606 LVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYS 665

Query: 364 KCGSIDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMA-PSYVSLV 423
           KCG +D A + FN MP RN  SWN+++ GYA  G  ++A+ L E M   G   P +V+ V
Sbjct: 666 KCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFV 725

Query: 424 CALSACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPF 483
             LSACS AG L+ G + FESM   YG+ P  EH++C+ D+LGRAG ++   DFI+ MP 
Sbjct: 726 GVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPM 785

Query: 484 PPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVT 543
            P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+AA GRWE++ 
Sbjct: 786 KPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLV 845

Query: 544 VVRNEMKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCI 603
             R +MK+  +KK AG+SW+T+   +H+F A D+SH     I   L +L ++M++A G +
Sbjct: 846 KARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDA-GYV 905

BLAST of Cla97C05G086320 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 1.1e-104
Identity = 203/577 (35.18%), Postives = 319/577 (55.29%), Query Frame = 0

Query: 24  LGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGS 83
           +G+  H  ++K+    L  F    L NMYAK   ++ A  +    P R +V+W  ++AG 
Sbjct: 153 VGKEIHGLLVKS-GFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGY 212

Query: 84  VQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDV 143
            QNG    AL     M  + ++P+  T      A + LR+   G ++HG A++ G  + V
Sbjct: 213 SQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLV 272

Query: 144 FVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRV 203
            +  ++ DMY+K   L  A ++FD M  RN+ +WN+ I   V +  P+++   F ++L  
Sbjct: 273 NISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDE 332

Query: 204 GGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECS 263
           G KP  ++     +AC+D   LE G  +H   +  G  +NVSV N LI  Y KC EV+ +
Sbjct: 333 GVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTA 392

Query: 264 EMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGL 323
             +F ++  R  VSW+++I  + QN     A   F + R   +KP  F   SV+ A A L
Sbjct: 393 ASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAEL 452

Query: 324 SEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGSIDEAEQAFNEMPERNLVSWNALL 383
           S     + +  + +++C++KN+FV +ALVDMY KCG+I  A   F+ M ER++ +WNA++
Sbjct: 453 SITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMI 512

Query: 384 GGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCALSACSRAGGLKMGMQIFESMKARYGI 443
            GY   G    A+ L EEM    + P+ V+ +  +SACS +G ++ G++ F  MK  Y I
Sbjct: 513 DGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSI 572

Query: 444 EPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTISIWGALLGACRMHGKPELGKLAAE 503
           E   +HY  +VDLLGRAG +  A+DFI  MP  P ++++GA+LGAC++H      + AAE
Sbjct: 573 ELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAE 632

Query: 504 KLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSRIHIFQ 563
           +LFEL+P D G HV+L+N++ A   WE+V  VR  M   G++K  G S + + + +H F 
Sbjct: 633 RLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFF 692

Query: 564 AKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYAL 601
           +   +H    +I   L KL   ++EA G + DT   L
Sbjct: 693 SGSTAHPDSKKIYAFLEKLICHIKEA-GYVPDTNLVL 727

BLAST of Cla97C05G086320 vs. Swiss-Prot
Match: sp|P0C898|PP232_ARATH (Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H86 PE=3 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 7.4e-104
Identity = 215/607 (35.42%), Postives = 330/607 (54.37%), Query Frame = 0

Query: 6   PNSLASLVE-LAVSVRSSL--LGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAE 65
           PN   +LV  L V  R  L   G   H  +LK+  + L     N+L++MY K      A 
Sbjct: 3   PNQRQNLVSILRVCTRKGLSDQGGQVHCYLLKS-GSGLNLITSNYLIDMYCKCREPLMAY 62

Query: 66  LILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLR 125
            +    P R+VV+W+AL++G V NG    +L  FS+M    + PN+FTF    KA   L 
Sbjct: 63  KVFDSMPERNVVSWSALMSGHVLNGDLKGSLSLFSEMGRQGIYPNEFTFSTNLKACGLLN 122

Query: 126 MAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYIS 185
               G Q+HG  +K G    V VG S+ DMYSK   +N+A K+F  +  R+L +WNA I+
Sbjct: 123 ALEKGLQIHGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIA 182

Query: 186 NSVHHGRPEDSASAF--IELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 245
             VH G    +   F  ++   +  +PD  T  +   ACS    +  G Q+HGF++RSG+
Sbjct: 183 GFVHAGYGSKALDTFGMMQEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGF 242

Query: 246 --GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLF 305
               + +++  L+D Y KCG +  +   FD++ E+  +SWSSLI  Y Q  E  +A  LF
Sbjct: 243 HCPSSATITGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLF 302

Query: 306 LRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKC 365
            R ++ + +   F +SS++   A  + +  G+ +QALAVK        V +++VDMY KC
Sbjct: 303 KRLQELNSQIDSFALSSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKC 362

Query: 366 GSIDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCAL 425
           G +DEAE+ F EM  ++++SW  ++ GY   G   K++ +  EML   + P  V  +  L
Sbjct: 363 GLVDEAEKCFAEMQLKDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVL 422

Query: 426 SACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPT 485
           SACS +G +K G ++F  +   +GI+P  EHYAC+VDLLGRAG ++ A   I +MP  P 
Sbjct: 423 SACSHSGMIKEGEELFSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPN 482

Query: 486 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 545
           + IW  LL  CR+HG  ELGK   + L  +D K+  N+V++SN++   G W E    R  
Sbjct: 483 VGIWQTLLSLCRVHGDIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNAREL 542

Query: 546 MKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKY 605
               G+KK AG SW+ +   +H F++ + SH     IQ+ L +  + ++E  G +   K+
Sbjct: 543 GNIKGLKKEAGMSWVEIEREVHFFRSGEDSHPLTPVIQETLKEAERRLREELGYVYGLKH 602

BLAST of Cla97C05G086320 vs. Swiss-Prot
Match: sp|Q9LFI1|PP280_ARATH (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 3.4e-101
Identity = 189/581 (32.53%), Postives = 329/581 (56.63%), Query Frame = 0

Query: 8   SLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKL 67
           +  S+++   S     LG+  HAQ++K L +       N L+ MY + + +  A  +   
Sbjct: 170 AFGSIIKACASSSDVGLGKQLHAQVIK-LESSSHLIAQNALIAMYVRFNQMSDASRVFYG 229

Query: 68  APCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCV-RPNDFTFPCAFKATTGLRMAVT 127
            P + +++W+++IAG  Q G    AL H  +MLS  V  PN++ F  + KA + L     
Sbjct: 230 IPMKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDY 289

Query: 128 GTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVH 187
           G+Q+HGL +K  L  +   GCS+ DMY++  FLN A ++FD++   +  +WN  I+   +
Sbjct: 290 GSQIHGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERPDTASWNVIIAGLAN 349

Query: 188 HGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSV 247
           +G  +++ S F ++   G  PD+I+  +   A +  + L  G Q+H +II+ G+  +++V
Sbjct: 350 NGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIHSYIIKWGFLADLTV 409

Query: 248 SNGLIDFYGKCGEVECSEMVF-DRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKED 307
            N L+  Y  C ++ C   +F D     +SVSW++++ A +Q+ +  +   LF      +
Sbjct: 410 CNSLLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQPVEMLRLFKLMLVSE 469

Query: 308 IKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGSIDEAE 367
            +P    + ++L  C  +S ++ G  V   ++K  +    F+ + L+DMY KCGS+ +A 
Sbjct: 470 CEPDHITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGLIDMYAKCGSLGQAR 529

Query: 368 QAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCALSACSRAG 427
           + F+ M  R++VSW+ L+ GYA  G  ++A+ L +EM +AG+ P++V+ V  L+ACS  G
Sbjct: 530 RIFDSMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNHVTFVGVLTACSHVG 589

Query: 428 GLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTISIWGAL 487
            ++ G++++ +M+  +GI P  EH +C+VDLL RAG +  A  FI  M   P + +W  L
Sbjct: 590 LVEEGLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFIDEMKLEPDVVVWKTL 649

Query: 488 LGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIK 547
           L AC+  G   L + AAE + ++DP +S  HV+L +M A++G WE   ++R+ MK+  +K
Sbjct: 650 LSACKTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWENAALLRSSMKKHDVK 709

Query: 548 KGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEM 587
           K  G SWI +  +IHIF A+D  H +  +I  +L  +  +M
Sbjct: 710 KIPGQSWIEIEDKIHIFFAEDIFHPERDDIYTVLHNIWSQM 749

BLAST of Cla97C05G086320 vs. TAIR10
Match: AT4G14850.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 774.6 bits (1999), Expect = 4.4e-224
Identity = 378/605 (62.48%), Postives = 473/605 (78.18%), Query Frame = 0

Query: 1   MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDS 60
           M LLS ++L  L++ A+S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDH +S
Sbjct: 1   MSLLSADALGLLLKNAISASSMRLGRVVHARIVKTLDSPPPPFLANYLINMYSKLDHPES 60

Query: 61  AELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTG 120
           A L+L+L P R+VV+WT+LI+G  QNG F++AL+ F +M  + V PNDFTFPCAFKA   
Sbjct: 61  ARLVLRLTPARNVVSWTSLISGLAQNGHFSTALVEFFEMRREGVVPNDFTFPCAFKAVAS 120

Query: 121 LRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAY 180
           LR+ VTG Q+H LAVK G I DVFVGCS FDMY K    +DA K+FDE+P RNLETWNA+
Sbjct: 121 LRLPVTGKQIHALAVKCGRILDVFVGCSAFDMYCKTRLRDDARKLFDEIPERNLETWNAF 180

Query: 181 ISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 240
           ISNSV  GRP ++  AFIE  R+ G P+SITFCAF NACSD L L  G QLHG ++RSG+
Sbjct: 181 ISNSVTDGRPREAIEAFIEFRRIDGHPNSITFCAFLNACSDWLHLNLGMQLHGLVLRSGF 240

Query: 241 GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR 300
             +VSV NGLIDFYGKC ++  SE++F  MG +N+VSW SL+AAYVQN+E+EKAS L+LR
Sbjct: 241 DTDVSVCNGLIDFYGKCKQIRSSEIIFTEMGTKNAVSWCSLVAAYVQNHEDEKASVLYLR 300

Query: 301 ARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGS 360
           +RK+ ++ +DFM+SSVL ACAG++ +E GRS+ A AVKACVE+ IFVGSALVDMYGKCG 
Sbjct: 301 SRKDIVETSDFMISSVLSACAGMAGLELGRSIHAHAVKACVERTIFVGSALVDMYGKCGC 360

Query: 361 IDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEML--AAGMAPSYVSLVCAL 420
           I+++EQAF+EMPE+NLV+ N+L+GGYAHQG  D A+AL EEM     G  P+Y++ V  L
Sbjct: 361 IEDSEQAFDEMPEKNLVTRNSLIGGYAHQGQVDMALALFEEMAPRGCGPTPNYMTFVSLL 420

Query: 421 SACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPT 480
           SACSRAG ++ GM+IF+SM++ YGIEPG EHY+C+VD+LGRAGMVE AY+FIK MP  PT
Sbjct: 421 SACSRAGAVENGMKIFDSMRSTYGIEPGAEHYSCIVDMLGRAGMVERAYEFIKKMPIQPT 480

Query: 481 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 540
           IS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FAA GRW E   VR E
Sbjct: 481 ISVWGALQNACRMHGKPQLGLLAAENLFKLDPKDSGNHVLLSNTFAAAGRWAEANTVREE 540

Query: 541 MKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKY 600
           +K VGIKKGAG+SWITV +++H FQAKD+SH  + EIQ  L KLR EM EA G   D K 
Sbjct: 541 LKGVGIKKGAGYSWITVKNQVHAFQAKDRSHILNKEIQTTLAKLRNEM-EAAGYKPDLKL 600

Query: 601 ALFEV 604
           +L+++
Sbjct: 601 SLYDL 604

BLAST of Cla97C05G086320 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 413.7 bits (1062), Expect = 2.0e-115
Identity = 230/610 (37.70%), Postives = 350/610 (57.38%), Query Frame = 0

Query: 4   LSPNS----LASLVELAVSVRSSL-LGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHL 63
           +SP S    L+S  E +++    L  GR  H  ++ T        + N LVNMYAK   +
Sbjct: 306 VSPESYVILLSSFPEYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSI 365

Query: 64  DSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKAT 123
             A  +      +  V+W ++I G  QNGCF  A+  +  M    + P  FT   +  + 
Sbjct: 366 ADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSFTLISSLSSC 425

Query: 124 TGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWN 183
             L+ A  G Q+HG ++K G+  +V V  ++  +Y++  +LN+  KIF  MP  +  +WN
Sbjct: 426 ASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWN 485

Query: 184 AYISNSVHHGRP-EDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIR 243
           + I       R   ++   F+   R G K + ITF +  +A S     E G Q+HG  ++
Sbjct: 486 SIIGALARSERSLPEAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALK 545

Query: 244 SGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKASC 303
           +      +  N LI  YGKCGE++  E +F RM E R++V+W+S+I+ Y+ N    KA  
Sbjct: 546 NNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALD 605

Query: 304 LFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYG 363
           L     +   +   FM ++VL A A ++ +E G  V A +V+AC+E ++ VGSALVDMY 
Sbjct: 606 LVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYS 665

Query: 364 KCGSIDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMA-PSYVSLV 423
           KCG +D A + FN MP RN  SWN+++ GYA  G  ++A+ L E M   G   P +V+ V
Sbjct: 666 KCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFV 725

Query: 424 CALSACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPF 483
             LSACS AG L+ G + FESM   YG+ P  EH++C+ D+LGRAG ++   DFI+ MP 
Sbjct: 726 GVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPM 785

Query: 484 PPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVT 543
            P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+AA GRWE++ 
Sbjct: 786 KPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLV 845

Query: 544 VVRNEMKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCI 603
             R +MK+  +KK AG+SW+T+   +H+F A D+SH     I   L +L ++M++A G +
Sbjct: 846 KARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDA-GYV 905

BLAST of Cla97C05G086320 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 382.1 bits (980), Expect = 6.3e-106
Identity = 203/577 (35.18%), Postives = 319/577 (55.29%), Query Frame = 0

Query: 24  LGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGS 83
           +G+  H  ++K+    L  F    L NMYAK   ++ A  +    P R +V+W  ++AG 
Sbjct: 153 VGKEIHGLLVKS-GFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGY 212

Query: 84  VQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDV 143
            QNG    AL     M  + ++P+  T      A + LR+   G ++HG A++ G  + V
Sbjct: 213 SQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLV 272

Query: 144 FVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRV 203
            +  ++ DMY+K   L  A ++FD M  RN+ +WN+ I   V +  P+++   F ++L  
Sbjct: 273 NISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDE 332

Query: 204 GGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECS 263
           G KP  ++     +AC+D   LE G  +H   +  G  +NVSV N LI  Y KC EV+ +
Sbjct: 333 GVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTA 392

Query: 264 EMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGL 323
             +F ++  R  VSW+++I  + QN     A   F + R   +KP  F   SV+ A A L
Sbjct: 393 ASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAEL 452

Query: 324 SEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGSIDEAEQAFNEMPERNLVSWNALL 383
           S     + +  + +++C++KN+FV +ALVDMY KCG+I  A   F+ M ER++ +WNA++
Sbjct: 453 SITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMI 512

Query: 384 GGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCALSACSRAGGLKMGMQIFESMKARYGI 443
            GY   G    A+ L EEM    + P+ V+ +  +SACS +G ++ G++ F  MK  Y I
Sbjct: 513 DGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSI 572

Query: 444 EPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTISIWGALLGACRMHGKPELGKLAAE 503
           E   +HY  +VDLLGRAG +  A+DFI  MP  P ++++GA+LGAC++H      + AAE
Sbjct: 573 ELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAE 632

Query: 504 KLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSRIHIFQ 563
           +LFEL+P D G HV+L+N++ A   WE+V  VR  M   G++K  G S + + + +H F 
Sbjct: 633 RLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFF 692

Query: 564 AKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKYAL 601
           +   +H    +I   L KL   ++EA G + DT   L
Sbjct: 693 SGSTAHPDSKKIYAFLEKLICHIKEA-GYVPDTNLVL 727

BLAST of Cla97C05G086320 vs. TAIR10
Match: AT3G15130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 379.4 bits (973), Expect = 4.1e-105
Identity = 215/607 (35.42%), Postives = 330/607 (54.37%), Query Frame = 0

Query: 6   PNSLASLVE-LAVSVRSSL--LGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAE 65
           PN   +LV  L V  R  L   G   H  +LK+  + L     N+L++MY K      A 
Sbjct: 3   PNQRQNLVSILRVCTRKGLSDQGGQVHCYLLKS-GSGLNLITSNYLIDMYCKCREPLMAY 62

Query: 66  LILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLR 125
            +    P R+VV+W+AL++G V NG    +L  FS+M    + PN+FTF    KA   L 
Sbjct: 63  KVFDSMPERNVVSWSALMSGHVLNGDLKGSLSLFSEMGRQGIYPNEFTFSTNLKACGLLN 122

Query: 126 MAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYIS 185
               G Q+HG  +K G    V VG S+ DMYSK   +N+A K+F  +  R+L +WNA I+
Sbjct: 123 ALEKGLQIHGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIA 182

Query: 186 NSVHHGRPEDSASAF--IELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGY 245
             VH G    +   F  ++   +  +PD  T  +   ACS    +  G Q+HGF++RSG+
Sbjct: 183 GFVHAGYGSKALDTFGMMQEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGF 242

Query: 246 --GQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLF 305
               + +++  L+D Y KCG +  +   FD++ E+  +SWSSLI  Y Q  E  +A  LF
Sbjct: 243 HCPSSATITGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLF 302

Query: 306 LRARKEDIKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKC 365
            R ++ + +   F +SS++   A  + +  G+ +QALAVK        V +++VDMY KC
Sbjct: 303 KRLQELNSQIDSFALSSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKC 362

Query: 366 GSIDEAEQAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCAL 425
           G +DEAE+ F EM  ++++SW  ++ GY   G   K++ +  EML   + P  V  +  L
Sbjct: 363 GLVDEAEKCFAEMQLKDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVL 422

Query: 426 SACSRAGGLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPT 485
           SACS +G +K G ++F  +   +GI+P  EHYAC+VDLLGRAG ++ A   I +MP  P 
Sbjct: 423 SACSHSGMIKEGEELFSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPN 482

Query: 486 ISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNE 545
           + IW  LL  CR+HG  ELGK   + L  +D K+  N+V++SN++   G W E    R  
Sbjct: 483 VGIWQTLLSLCRVHGDIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNAREL 542

Query: 546 MKEVGIKKGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEMQEATGCIADTKY 605
               G+KK AG SW+ +   +H F++ + SH     IQ+ L +  + ++E  G +   K+
Sbjct: 543 GNIKGLKKEAGMSWVEIEREVHFFRSGEDSHPLTPVIQETLKEAERRLREELGYVYGLKH 602

BLAST of Cla97C05G086320 vs. TAIR10
Match: AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 370.5 bits (950), Expect = 1.9e-102
Identity = 189/581 (32.53%), Postives = 329/581 (56.63%), Query Frame = 0

Query: 8   SLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKL 67
           +  S+++   S     LG+  HAQ++K L +       N L+ MY + + +  A  +   
Sbjct: 170 AFGSIIKACASSSDVGLGKQLHAQVIK-LESSSHLIAQNALIAMYVRFNQMSDASRVFYG 229

Query: 68  APCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCV-RPNDFTFPCAFKATTGLRMAVT 127
            P + +++W+++IAG  Q G    AL H  +MLS  V  PN++ F  + KA + L     
Sbjct: 230 IPMKDLISWSSIIAGFSQLGFEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDY 289

Query: 128 GTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVH 187
           G+Q+HGL +K  L  +   GCS+ DMY++  FLN A ++FD++   +  +WN  I+   +
Sbjct: 290 GSQIHGLCIKSELAGNAIAGCSLCDMYARCGFLNSARRVFDQIERPDTASWNVIIAGLAN 349

Query: 188 HGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSV 247
           +G  +++ S F ++   G  PD+I+  +   A +  + L  G Q+H +II+ G+  +++V
Sbjct: 350 NGYADEAVSVFSQMRSSGFIPDAISLRSLLCAQTKPMALSQGMQIHSYIIKWGFLADLTV 409

Query: 248 SNGLIDFYGKCGEVECSEMVF-DRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKED 307
            N L+  Y  C ++ C   +F D     +SVSW++++ A +Q+ +  +   LF      +
Sbjct: 410 CNSLLTMYTFCSDLYCCFNLFEDFRNNADSVSWNTILTACLQHEQPVEMLRLFKLMLVSE 469

Query: 308 IKPTDFMVSSVLCACAGLSEIEFGRSVQALAVKACVEKNIFVGSALVDMYGKCGSIDEAE 367
            +P    + ++L  C  +S ++ G  V   ++K  +    F+ + L+DMY KCGS+ +A 
Sbjct: 470 CEPDHITMGNLLRGCVEISSLKLGSQVHCYSLKTGLAPEQFIKNGLIDMYAKCGSLGQAR 529

Query: 368 QAFNEMPERNLVSWNALLGGYAHQGHADKAMALLEEMLAAGMAPSYVSLVCALSACSRAG 427
           + F+ M  R++VSW+ L+ GYA  G  ++A+ L +EM +AG+ P++V+ V  L+ACS  G
Sbjct: 530 RIFDSMDNRDVVSWSTLIVGYAQSGFGEEALILFKEMKSAGIEPNHVTFVGVLTACSHVG 589

Query: 428 GLKMGMQIFESMKARYGIEPGPEHYACLVDLLGRAGMVECAYDFIKSMPFPPTISIWGAL 487
            ++ G++++ +M+  +GI P  EH +C+VDLL RAG +  A  FI  M   P + +W  L
Sbjct: 590 LVEEGLKLYATMQTEHGISPTKEHCSCVVDLLARAGRLNEAERFIDEMKLEPDVVVWKTL 649

Query: 488 LGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFAATGRWEEVTVVRNEMKEVGIK 547
           L AC+  G   L + AAE + ++DP +S  HV+L +M A++G WE   ++R+ MK+  +K
Sbjct: 650 LSACKTQGNVHLAQKAAENILKIDPFNSTAHVLLCSMHASSGNWENAALLRSSMKKHDVK 709

Query: 548 KGAGFSWITVNSRIHIFQAKDQSHEKDSEIQDMLGKLRKEM 587
           K  G SWI +  +IHIF A+D  H +  +I  +L  +  +M
Sbjct: 710 KIPGQSWIEIEDKIHIFFAEDIFHPERDDIYTVLHNIWSQM 749

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004134445.10.0e+0091.75PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativu... [more]
XP_008438671.10.0e+0091.09PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis melo][more]
XP_022956070.10.0e+0089.55pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata][more]
XP_022979420.10.0e+0089.39pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima] >XP_022... [more]
XP_022137756.10.0e+0088.58pentatricopeptide repeat-containing protein At4g14850 [Momordica charantia][more]
Match NameE-valueIdentityDescription
tr|A0A0A0L4T8|A0A0A0L4T8_CUCSA0.0e+0091.75Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G146650 PE=4 SV=1[more]
tr|A0A1S3AXN0|A0A1S3AXN0_CUCME0.0e+0091.09pentatricopeptide repeat-containing protein At4g14850 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A2I4GEW5|A0A2I4GEW5_9ROSI1.9e-24569.26pentatricopeptide repeat-containing protein At4g14850 OS=Juglans regia OX=51240 ... [more]
tr|A0A2N9H1X5|A0A2N9H1X5_FAGSY6.7e-24369.47Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33757 PE=4 SV=1[more]
tr|A0A2P6RUQ0|A0A2P6RUQ0_ROSCH1.8e-24069.26Putative tetratricopeptide-like helical domain, DYW domain-containing protein OS... [more]
Match NameE-valueIdentityDescription
sp|Q0WSH6|PP312_ARATH7.9e-22362.48Pentatricopeptide repeat-containing protein At4g14850 OS=Arabidopsis thaliana OX... [more]
sp|Q9FIB2|PP373_ARATH3.5e-11437.70Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th... [more]
sp|Q3E6Q1|PPR32_ARATH1.1e-10435.18Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
sp|P0C898|PP232_ARATH7.4e-10435.42Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis th... [more]
sp|Q9LFI1|PP280_ARATH3.4e-10132.53Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT4G14850.14.4e-22462.48Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G09950.12.0e-11537.70Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G11290.16.3e-10635.18Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G15130.14.1e-10535.42Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G53360.11.9e-10232.53Tetratricopeptide repeat (TPR)-like superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0019288 isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway
biological_process GO:0019287 isopentenyl diphosphate biosynthetic process, mevalonate pathway
biological_process GO:0050790 regulation of catalytic activity
biological_process GO:0048364 root development
biological_process GO:0016125 sterol metabolic process
biological_process GO:0006629 lipid metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0034046 poly(G) binding
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G086320.1Cla97C05G086320.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 276..305
e-value: 0.0054
score: 16.8
coord: 148..173
e-value: 0.16
score: 12.2
coord: 248..274
e-value: 0.025
score: 14.7
coord: 74..102
e-value: 0.017
score: 15.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 377..410
e-value: 4.3E-7
score: 27.7
coord: 350..377
e-value: 1.4E-4
score: 19.8
coord: 276..309
e-value: 1.9E-4
score: 19.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 375..420
e-value: 1.6E-7
score: 31.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 274..308
score: 9.821
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 208..242
score: 5.24
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 142..172
score: 6.774
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 375..409
score: 12.496
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 72..106
score: 8.758
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 512..546
score: 7.52
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 344..374
score: 8.846
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 446..476
score: 5.744
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 243..273
score: 7.476
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 410..440
score: 6.643
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 41..71
score: 5.711
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 173..207
score: 8.429
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 428..600
e-value: 7.4E-15
score: 57.3
coord: 225..331
e-value: 3.2E-15
score: 58.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 332..427
e-value: 1.2E-22
score: 82.1
coord: 4..124
e-value: 3.9E-13
score: 51.1
coord: 127..224
e-value: 2.0E-13
score: 52.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 323..513
NoneNo IPR availablePANTHERPTHR24015:SF116SUBFAMILY NOT NAMEDcoord: 6..579
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 6..579

The following gene(s) are paralogous to this gene:

None