Cla97C03G057900 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C03G057900
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr03: 7064867 .. 7067014 (+)
RNA-Seq Expression: Cla97C03G057900
Synteny: Cla97C03G057900
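
The Location field above can be turned into a span length with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive (GFF-style), which the page does not state, and `parse_location` is a hypothetical helper name rather than part of any database API.

```python
import re

def parse_location(text):
    """Split a 'SeqID: start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive (GFF-style).
    """
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cla97Chr03: 7064867 .. 7067014 (+)")

# Inclusive coordinates: length is end - start + 1.
span_bp = end - start + 1

# The same interval in 0-based, half-open (BED-style) coordinates.
bed_start, bed_end = start - 1, end

print(seqid, strand, span_bp)
```

Under the stated assumption, the feature spans 2148 bp on the plus strand of Cla97Chr03.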
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCACCTTCAACGATCAAAACCCATTTTCCCCAATTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCTTCCTCTTCAATCGATGCAGCTCCCCTCAACACCTCCACCAAATTCACGCCAGGTTCCTCCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCCAACTTATTGATTCCTATGCCAATCTTGGACTCCTTAATCTCTCTCTCCAAGTTTTCTACTCTATAGCCCACCCCAATTCGACCCTTTACAACGCCATATTGAGAAATTTGACAAGATATGGAGAATGTGAGCGGACCCTGTTGGTGTACCAACAAATGGTCGCAAACTCTATGCACCCGGATGAAGAAACTTACCCTTCTGTTTTGCGATCATTTTGTTCTTTTTCTAATGTCCGATTTGGGAGGAAGATTCATGGGTATTTGGTTAAGCTGGGTTTCGATTCGTTTGATATAGTAGCTACTGCTCTGGCTGAGATGTACGAGGAGTGCATTGATTTTGAGAATGCTCATCAACTGTCTGATAAAAGGTCTGTGAAGAATATGGAATGCTGGAGTTCCTTCACTACGGAGACTCCTCAAAATGGGAATGGAGAGGGAATTTTTCGGCTCTTTAGGAGGATGAGAGCAGAGCAATTAGTACGAGACTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGATTTGAATTCAATTCGACTTGCCAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGTGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTACGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGTCAGATAAAGACCGTGTTGTATGGAATATAATGATAGCAGCTTACGCCCGAGATGGGAAAGCGACGGAATGTCTCGAGCTCTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCACTCTCTGTAATCTCTTCAATTTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCCCATATATTGAGGAATGGTGCCGACAGTCAAATTTCAGTTCATAACTCTCTCATCGACATGTACTGCGAATGCAACCTTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGACTGTAATTTCATGGAGCGCTATGATAAAGGGATATGCCAAACATGGTCAGTCCCTCATTGCGTTGTCTCTCTTCTCCAGGATGAAATCTGATGGGATTCAAGCTGATTTTATTACAGCGATCAATATCTTGCCTGCACTTGTTCACATAGGAGTACTTGAAAATGTCAAATATTTACATGGGTACGCAATGAAACTAGGCCTGACTTCCCTTTCATCACTTAACACAGCCCTTCTAATCACCTATGCAAAATGTGGGTGTTTAGAGATGGCACAGAGGATATTTGAAGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCTCATGCCAACCATGGAGACTGGTCCCAATGTTTTAAGCTATACAATCAAATGAAGTGCTCGAATTCAAAGCCGGACCAAGTAACATTTCTGGGACTACTAACAGCTTGTGTCAATTCCGGTCTCGTAGCAGAGGGAAAAGAGTTTTTCAAGGAGATGACTGAAAGATATGGCTGTCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTTAGGGAGAGCCGGGCTCATCAATGAAGCTGGAGAACTTGTAAGAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGCCCATTGTTGAGTGCTTGTAAGCTGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCGGAGAAGCTCATCGATATGGAGCCTAAAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGATGGAGTTGCAAAAATGAGAAGTTTCTTAAGGGATAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTG
GCTGGAGATAAATGGCCATGTAACGGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTACGAAACCTTGAACTTGAAATCAAAGAGGCTAGAGAAAAGAATATAGATAAATTAATTCCTCTATAA

mRNA sequence

ATGCTTCACCTTCAACGATCAAAACCCATTTTCCCCAATTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCTTCCTCTTCAATCGATGCAGCTCCCCTCAACACCTCCACCAAATTCACGCCAGGTTCCTCCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCCAACTTATTGATTCCTATGCCAATCTTGGACTCCTTAATCTCTCTCTCCAAGTTTTCTACTCTATAGCCCACCCCAATTCGACCCTTTACAACGCCATATTGAGAAATTTGACAAGATATGGAGAATGTGAGCGGACCCTGTTGGTGTACCAACAAATGGTCGCAAACTCTATGCACCCGGATGAAGAAACTTACCCTTCTGTTTTGCGATCATTTTGTTCTTTTTCTAATGTCCGATTTGGGAGGAAGATTCATGGGTATTTGGTTAAGCTGGGTTTCGATTCGTTTGATATAGTAGCTACTGCTCTGGCTGAGATGTACGAGGAGTGCATTGATTTTGAGAATGCTCATCAACTGTCTGATAAAAGGTCTGTGAAGAATATGGAATGCTGGAGTTCCTTCACTACGGAGACTCCTCAAAATGGGAATGGAGAGGGAATTTTTCGGCTCTTTAGGAGGATGAGAGCAGAGCAATTAGTACGAGACTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGATTTGAATTCAATTCGACTTGCCAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGTGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTACGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGTCAGATAAAGACCGTGTTGTATGGAATATAATGATAGCAGCTTACGCCCGAGATGGGAAAGCGACGGAATGTCTCGAGCTCTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCACTCTCTGTAATCTCTTCAATTTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCCCATATATTGAGGAATGGTGCCGACAGTCAAATTTCAGTTCATAACTCTCTCATCGACATGTACTGCGAATGCAACCTTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGACTGTAATTTCATGGAGCGCTATGATAAAGGGATATGCCAAACATGGTCAGTCCCTCATTGCGTTGTCTCTCTTCTCCAGGATGAAATCTGATGGGATTCAAGCTGATTTTATTACAGCGATCAATATCTTGCCTGCACTTGTTCACATAGGAGTACTTGAAAATGTCAAATATTTACATGGGTACGCAATGAAACTAGGCCTGACTTCCCTTTCATCACTTAACACAGCCCTTCTAATCACCTATGCAAAATGTGGGTGTTTAGAGATGGCACAGAGGATATTTGAAGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCTCATGCCAACCATGGAGACTGGTCCCAATGTTTTAAGCTATACAATCAAATGAAGTGCTCGAATTCAAAGCCGGACCAAGTAACATTTCTGGGACTACTAACAGCTTGTGTCAATTCCGGTCTCGTAGCAGAGGGAAAAGAGTTTTTCAAGGAGATGACTGAAAGATATGGCTGTCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTTAGGGAGAGCCGGGCTCATCAATGAAGCTGGAGAACTTGTAAGAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGCCCATTGTTGAGTGCTTGTAAGCTGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCGGAGAAGCTCATCGATATGGAGCCTAAAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGATGGAGTTGCAAAAATGAGAAGTTTCTTAAGGGATAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTG
GCTGGAGATAAATGGCCATGTAACGGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTACGAAACCTTGAACTTGAAATCAAAGAGGCTAGAGAAAAGAATATAGATAAATTAATTCCTCTATAA

Coding sequence (CDS)

ATGCTTCACCTTCAACGATCAAAACCCATTTTCCCCAATTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCTTCCTCTTCAATCGATGCAGCTCCCCTCAACACCTCCACCAAATTCACGCCAGGTTCCTCCTCCATGGCTTCCACCAAAACCCAACTCTCTCTTCCCAACTTATTGATTCCTATGCCAATCTTGGACTCCTTAATCTCTCTCTCCAAGTTTTCTACTCTATAGCCCACCCCAATTCGACCCTTTACAACGCCATATTGAGAAATTTGACAAGATATGGAGAATGTGAGCGGACCCTGTTGGTGTACCAACAAATGGTCGCAAACTCTATGCACCCGGATGAAGAAACTTACCCTTCTGTTTTGCGATCATTTTGTTCTTTTTCTAATGTCCGATTTGGGAGGAAGATTCATGGGTATTTGGTTAAGCTGGGTTTCGATTCGTTTGATATAGTAGCTACTGCTCTGGCTGAGATGTACGAGGAGTGCATTGATTTTGAGAATGCTCATCAACTGTCTGATAAAAGGTCTGTGAAGAATATGGAATGCTGGAGTTCCTTCACTACGGAGACTCCTCAAAATGGGAATGGAGAGGGAATTTTTCGGCTCTTTAGGAGGATGAGAGCAGAGCAATTAGTACGAGACTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGATTTGAATTCAATTCGACTTGCCAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGTGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTACGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGTCAGATAAAGACCGTGTTGTATGGAATATAATGATAGCAGCTTACGCCCGAGATGGGAAAGCGACGGAATGTCTCGAGCTCTTCAAGTCCATGGCAAGATCAGGGATTAGATCTGATATGTTTACTGCACTCTCTGTAATCTCTTCAATTTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCCCATATATTGAGGAATGGTGCCGACAGTCAAATTTCAGTTCATAACTCTCTCATCGACATGTACTGCGAATGCAACCTTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGACTGTAATTTCATGGAGCGCTATGATAAAGGGATATGCCAAACATGGTCAGTCCCTCATTGCGTTGTCTCTCTTCTCCAGGATGAAATCTGATGGGATTCAAGCTGATTTTATTACAGCGATCAATATCTTGCCTGCACTTGTTCACATAGGAGTACTTGAAAATGTCAAATATTTACATGGGTACGCAATGAAACTAGGCCTGACTTCCCTTTCATCACTTAACACAGCCCTTCTAATCACCTATGCAAAATGTGGGTGTTTAGAGATGGCACAGAGGATATTTGAAGAAGAGAGAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCTCATGCCAACCATGGAGACTGGTCCCAATGTTTTAAGCTATACAATCAAATGAAGTGCTCGAATTCAAAGCCGGACCAAGTAACATTTCTGGGACTACTAACAGCTTGTGTCAATTCCGGTCTCGTAGCAGAGGGAAAAGAGTTTTTCAAGGAGATGACTGAAAGATATGGCTGTCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAACCTCTTAGGGAGAGCCGGGCTCATCAATGAAGCTGGAGAACTTGTAAGAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGCCCATTGTTGAGTGCTTGTAAGCTGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCGGAGAAGCTCATCGATATGGAGCCTAAAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGATGGAGTTGCAAAAATGAGAAGTTTCTTAAGGGATAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTG
GCTGGAGATAAATGGCCATGTAACGGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTACGAAACCTTGAACTTGAAATCAAAGAGGCTAGAGAAAAGAATATAGATAAATTAATTCCTCTATAA

Protein sequence

MLHLQRSKPIFPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTLSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSMHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNSIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAAYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKLIPL
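
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and translates only the first 24 and last 21 bases of the CDS as a spot check. `translate` is a hypothetical helper, not the method used to build this annotation.

```python
# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last seven codons of the CDS listed above.
head = translate("ATGCTTCACCTTCAACGATCAAAA")
tail = translate("GATAAATTAATTCCTCTATAA")
print(head, tail)
```

Both fragments agree with the start (MLHLQRSK) and end (DKLIPL) of the protein sequence shown above; the final TAA codon is the stop.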
Homology
BLAST of Cla97C03G057900 vs. NCBI nr
Match: XP_038894029.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 1300.8 bits (3365), Expect = 0.0e+00
Identity = 646/717 (90.10%), Positives = 673/717 (93.86%), Query Frame = 0

Query: 1   MLHLQRSKP-----IFPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPT 60
           MLHLQRSKP     IFPNFPATQSRLLNTLSFLF+RCSS QHL QIHARF+LHGFHQNPT
Sbjct: 34  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPT 93

Query: 61  LSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANS 120
           LSS+LID YANLGLLNLSLQVFYSI  PNST+YNAILRNLTRYGECERTLLVY+QMVA S
Sbjct: 94  LSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKS 153

Query: 121 MHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAH 180
           MHPDEETYPSVLRS CSFSNV  GRKIHGYLVKLGFDSFD+VATAL EMYEECIDFE+AH
Sbjct: 154 MHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAH 213

Query: 181 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 240
           QL DKRSVK++ECWSSFTTE PQNGNGEGIF +F RMR EQLV DSLTFINLLR IA  N
Sbjct: 214 QLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFN 273

Query: 241 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 300
           SI+LAKIVHCIAIVSKLCGDLLVNTAVLSLYSKL SLVDARKLFDKM + DRVVWNIMIA
Sbjct: 274 SIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA 333

Query: 301 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 360
           AYAR+GK TECL LFKSMARSGIRSDMFTAL VISSISQLK  DWGKQTHA+ILRNG+DS
Sbjct: 334 AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDS 393

Query: 361 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 420
           Q+SV+NSLIDMYCECN+LDSACKIFNWM DKTVISWSAMIKGY KHG SLIALSLFS MK
Sbjct: 394 QVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMK 453

Query: 421 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 480
           SDGIQ+DFIT INILPA VHIGVLENVKYLHGY+MKLGLTSL SLNTALLITYAKCGC+E
Sbjct: 454 SDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE 513

Query: 481 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 540
           MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSN+KPDQVTFLGLLTA
Sbjct: 514 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTA 573

Query: 541 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 600
           CVNSGLV +GKEF KEMTE YGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR
Sbjct: 574 CVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 633

Query: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACKLHPGSKLAEFAAEKL+DMEPKNAGNYILLSNIYAAAGKWD VAKMRSFLR
Sbjct: 634 VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLR 693

Query: 661 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL 713
           DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL NLELEIKEAREK+++KL
Sbjct: 694 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKL 750
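
The identity figure reported for an HSP is the number of alignment columns in which query and subject carry the same residue. As a minimal sketch, the final block of the alignment above can be recounted as follows; `count_identities` is a hypothetical helper, and it counts identities only (the "positives" figure additionally depends on the scoring matrix, which is not reproduced here).

```python
def count_identities(query, sbjct):
    """Return (identical columns, total columns) for two aligned strings.

    Both strings must have the same length; '-' marks a gap and never
    counts as an identity.
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have equal length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)

# Last block of the alignment against XP_038894029.1, copied from above.
query = "DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL"
sbjct = "DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKL"

identical, columns = count_identities(query, sbjct)
print(f"{identical}/{columns} identical columns")
```

For this block the count is 53 of 57 columns; summing the same count over every block of an HSP should reproduce the numerator of its Identity line.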

BLAST of Cla97C03G057900 vs. NCBI nr
Match: XP_008444579.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucumis melo] >KAA0054005.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK20690.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1224.5 bits (3167), Expect = 0.0e+00
Identity = 616/718 (85.79%), Positives = 654/718 (91.09%), Query Frame = 0

Query: 1   MLHLQRSKPIFP-----NFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPT 60
           MLHLQRSKPI       NFPATQSRLLNTLS LFNRC+S QHL QIHARF+LHGFHQNPT
Sbjct: 1   MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANS 120
           LSS+LID YANLGLL  SLQVF SI  PN TL+NAILRNLTRYGE ER LLVYQQMVA S
Sbjct: 61  LSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKS 120

Query: 121 MHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAH 180
           MHPDEETYP + RS  SFSNV FGR IHGYLVKLGFDSFD+VATALAEMYE+ I FENAH
Sbjct: 121 MHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAH 180

Query: 181 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 240
           QL DKRSVK++   SS TTE  QNGNGEGIFR+F RMRAEQLV DSLTF+NLLR IA LN
Sbjct: 181 QLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLN 240

Query: 241 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 300
           SI+LAKIVHCIAIVSKL GDLLV TAVLSLYSKLRSLVDAR+LFDKM +KDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA 300

Query: 301 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 360
           AYAR+GK  ECLELFKSMARSGIRSD+FTAL VISSI+QLKCVDWGKQTHAHILRNG+DS
Sbjct: 301 AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 420
           Q+SVHNSLIDMYCEC +LDSAC IFNWMTDK+VISWSAMIKGY K+GQSL A SLFS+MK
Sbjct: 361 QVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK 420

Query: 421 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 480
           SDGIQADF+T INILPA VHIG LENVKYLHGY+MKLGLTSL SLNTALLITYAKCG +E
Sbjct: 421 SDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIE 480

Query: 481 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 540
           MAQR+FEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYN+MKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 600
           CVNSGL+ +GKEFFKEMTE YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Sbjct: 541 CVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR 660

Query: 661 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKLI 714
           +KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTIL NLELEIKE REK++D L+
Sbjct: 661 NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDTLV 718
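
The two summary lines that precede each alignment can be parsed and the percentages recomputed from the raw counts. The sketch below uses the values reported for the XP_008444579.1 hit; the regular expressions are an assumption about this page's text layout, not a general BLAST parser, and `parse_hsp_stats` is a hypothetical name. The pattern also accepts a "Postives" spelling in case the exported text carries it.

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def parse_hsp_stats(score_line, identity_line):
    """Extract bit score, raw score, E-value, identity and positives."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    ident, ident_len, pos, pos_len = (int(x) for x in i.groups())
    return {
        "bit_score": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }

stats = parse_hsp_stats(
    "HSP 1 Score: 1224.5 bits (3167), Expect = 0.0e+00",
    "Identity = 616/718 (85.79%), Positives = 654/718 (91.09%), Query Frame = 0",
)
print(stats)
```

The recomputed values (85.79% identity, 91.09% positives) match the percentages printed in the report.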

BLAST of Cla97C03G057900 vs. NCBI nr
Match: XP_022139869.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia])

HSP 1 Score: 1209.5 bits (3128), Expect = 0.0e+00
Identity = 600/716 (83.80%), Positives = 642/716 (89.66%), Query Frame = 0

Query: 1   MLHLQRSKPI----FPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTL 60
           MLHLQRSKPI    F NFPATQSR LNTLSFLF+RCSS Q L QIHARF+LHG HQNP L
Sbjct: 1   MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPAL 60

Query: 61  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 120
           S +LIDSYANLGLL LS QVF SI  P STLY+AILRNL+ +GE ERTLLVY++M A SM
Sbjct: 61  SCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSM 120

Query: 121 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQ 180
           HPDEETYPSVLRS C  SNV +GRKIHG+LVKLG D +D  ATALAEMY +CI FEN H 
Sbjct: 121 HPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHD 180

Query: 181 LSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNS 240
           L DK  +K+ ECW+S  +E  QNGNG+ IF+LF RMR EQLV DSLTFINLLRSI  LNS
Sbjct: 181 LFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNS 240

Query: 241 IRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAA 300
           I+LAKIVHC+AI S LCGDLLVNTAVLSLYSKL  LV+ARKLFDKM +KDRVVWNIMIAA
Sbjct: 241 IQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA 300

Query: 301 YARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQ 360
           Y R+G   ECLELFKSMARSGIR+D+FTAL VISSISQLKCVDWGKQTHAH LRNG+D+Q
Sbjct: 301 YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQ 360

Query: 361 ISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKS 420
           +SVHNSLIDMYCE N+LDSACKIF+WMT+KTVISWSAMIKG  KHGQSL ALSLFSRMKS
Sbjct: 361 VSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKS 420

Query: 421 DGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEM 480
           DGIQADFIT INILPA VHIG LENVKYLHGY+MKLGLTSL SLNTALLITYAKCGC+EM
Sbjct: 421 DGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEM 480

Query: 481 AQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTAC 540
           AQR+FEEER+DDKDLIMWNSMISAHANHGDWSQCFK+YNQMKCSNS+PDQVTFLGLLTAC
Sbjct: 481 AQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTAC 540

Query: 541 VNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600
           VNSGLV +GKE FKEM E YGCQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDARV
Sbjct: 541 VNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV 600

Query: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660
           WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD
Sbjct: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660

Query: 661 KGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL 713
           KGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTIL NLELEIKEAREK+ +KL
Sbjct: 661 KGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEAREKSPEKL 716

BLAST of Cla97C03G057900 vs. NCBI nr
Match: KAG6573373.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1191.0 bits (3080), Expect = 0.0e+00
Identity = 591/716 (82.54%), Positives = 640/716 (89.39%), Query Frame = 0

Query: 1   MLHLQRSKPI----FPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTL 60
           M HLQRSKPI    FPNFPATQSRLLNTLS LF+RC S Q L QIHARF+LHGFHQNPTL
Sbjct: 1   MFHLQRSKPIFRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFHQNPTL 60

Query: 61  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 120
           S +LID YAN GLLNLS  VF SI  PNS LYNAILRNLTR+GE ERTLLVY++MVA SM
Sbjct: 61  SCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREMVAKSM 120

Query: 121 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQ 180
           HPDE+TYP VLRS C  SNV+FG+ IHG L+KLG DS+D V T L EMYE+CIDFENAHQ
Sbjct: 121 HPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDFENAHQ 180

Query: 181 LSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNS 240
           L DK SVK+++CWSS  TE PQNGNG+ I RLF RM++E LV DSLTFINLLRS++ L+S
Sbjct: 181 LFDKMSVKDLDCWSSLITEAPQNGNGDDISRLFGRMKSEPLVTDSLTFINLLRSVSGLSS 240

Query: 241 IRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAA 300
           I+LAKIVHCIAIVS LCGDLLV+TAVLSLYSKL SLVDARKLF+K+ +KDRVVWNIMIAA
Sbjct: 241 IQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWNIMIAA 300

Query: 301 YARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQ 360
           YAR+G+  ECLELF+SMARSGIR+D+FTAL VISSISQLK  DWGKQTHA+ILRNG+DSQ
Sbjct: 301 YAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKRADWGKQTHANILRNGSDSQ 360

Query: 361 ISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKS 420
           +SVHNSLIDMYCECN LDSACKIFN +T+KTVISWSAMIKG  KHG  LIALSLF RMKS
Sbjct: 361 VSVHNSLIDMYCECNSLDSACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLFFRMKS 420

Query: 421 DGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEM 480
           DGIQADFIT INI+PA V IG LENVKYLHGY++KL LTSL SLNTALLITYAKCGC++M
Sbjct: 421 DGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKCGCIDM 480

Query: 481 AQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTAC 540
           AQR+FEEER+DDKDLIMWNSMISAHANHGDWSQCF LYNQMKCSNS PDQVTFLGLLTAC
Sbjct: 481 AQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFNLYNQMKCSNSNPDQVTFLGLLTAC 540

Query: 541 VNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600
           VNSGLV +GKEFFKEM E Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV
Sbjct: 541 VNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600

Query: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660
           WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD
Sbjct: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660

Query: 661 KGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL 713
           KGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL NLEL+IKEA+E + +KL
Sbjct: 661 KGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEKL 716

BLAST of Cla97C03G057900 vs. NCBI nr
Match: KAG7012542.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1187.6 bits (3071), Expect = 0.0e+00
Identity = 589/716 (82.26%), Positives = 640/716 (89.39%), Query Frame = 0

Query: 1   MLHLQRSKPI----FPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTL 60
           M HLQRSKPI    FPNFPATQSRLLNTLS LF+RC S Q L QIHARF+LHGFHQNPTL
Sbjct: 1   MFHLQRSKPIFRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFHQNPTL 60

Query: 61  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 120
           S +LID YAN GLLNLS  VF SI  PNS LYNAILRNLTR+GE ERTLLVY++MVA SM
Sbjct: 61  SCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREMVAKSM 120

Query: 121 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQ 180
           HPDE+TYP VLRS C  SNV+FG+ IHG L+KLG DS+D V T L EMYE+CIDFENAHQ
Sbjct: 121 HPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDFENAHQ 180

Query: 181 LSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNS 240
           L DK SVK+++CWSS  ++ PQNGNG+ I  LF RM++E LV DSLTFINLLRSI+ L+S
Sbjct: 181 LFDKMSVKDLDCWSSLMSDAPQNGNGDDISLLFGRMKSEPLVTDSLTFINLLRSISGLSS 240

Query: 241 IRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAA 300
           I+LAK+VHCIAIVS LCGDLLV+TAVLSLYSKL SLVDARKLF+K+ +KDRVVWNIMIAA
Sbjct: 241 IQLAKMVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWNIMIAA 300

Query: 301 YARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQ 360
           YAR+G+  ECLELF+SMARSGIR+D+FTAL VISSISQLK  DWGKQTHA+ILRNG+DSQ
Sbjct: 301 YAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKRADWGKQTHANILRNGSDSQ 360

Query: 361 ISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKS 420
           +SVHNSLIDMYCECN LDSACKIFN +T+KTVISWSAMIKG  KHG  LIALSLF RMKS
Sbjct: 361 VSVHNSLIDMYCECNSLDSACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLFFRMKS 420

Query: 421 DGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEM 480
           DGIQADFIT INI+PA V IG LENVKYLHGY++KL LTSL SLNTALLITYAKCGC++M
Sbjct: 421 DGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKCGCIDM 480

Query: 481 AQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTAC 540
           AQR+FEEER+DDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNS PDQVTFLGLLTAC
Sbjct: 481 AQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLGLLTAC 540

Query: 541 VNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600
           VNSGLV +GKEFFKEM ERY CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV
Sbjct: 541 VNSGLVEKGKEFFKEMIERYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600

Query: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660
           WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD
Sbjct: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660

Query: 661 KGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL 713
           KGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL NLEL+IKE +E + +KL
Sbjct: 661 KGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEVKEMSPEKL 716

BLAST of Cla97C03G057900 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 415.6 bits (1067), Expect = 1.1e-114
Identity = 226/709 (31.88%), Positives = 389/709 (54.87%), Query Frame = 0

Query: 9   PIFPNFPATQSRLLNTLSF------------LFNRCSSPQHLHQIHARFLLHGFHQNPTL 68
           P  PN P+     L+  ++            L  RCSS + L QI      +G +Q    
Sbjct: 12  PQIPNPPSRHRHFLSERNYIPANVYEHPAALLLERCSSLKELRQILPLVFKNGLYQEHFF 71

Query: 69  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 128
            ++L+  +   G ++ + +VF  I    + LY+ +L+   +  + ++ L  + +M  + +
Sbjct: 72  QTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDV 131

Query: 129 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVA-TALAEMYEECIDFENAH 188
            P    +  +L+     + +R G++IHG LVK GF S D+ A T L  MY +C     A 
Sbjct: 132 EPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGF-SLDLFAMTGLENMYAKCRQVNEAR 191

Query: 189 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 248
           ++ D+   +++  W++      QNG       + + M  E L    +T +++L +++ L 
Sbjct: 192 KVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALR 251

Query: 249 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 308
            I + K +H  A+ S     + ++TA++ +Y+K  SL  AR+LFD M +++ V WN MI 
Sbjct: 252 LISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMID 311

Query: 309 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 368
           AY ++    E + +F+ M   G++    + +  + + + L  ++ G+  H   +  G D 
Sbjct: 312 AYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDR 371

Query: 369 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 428
            +SV NSLI MYC+C  +D+A  +F  +  +T++SW+AMI G+A++G+ + AL+ FS+M+
Sbjct: 372 NVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMR 431

Query: 429 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 488
           S  ++ D  T ++++ A+  + +  + K++HG  M+  L     + TAL+  YAKCG + 
Sbjct: 432 SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIM 491

Query: 489 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 548
           +A+ IF  + + ++ +  WN+MI  +  HG      +L+ +M+    KP+ VTFL +++A
Sbjct: 492 IARLIF--DMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISA 551

Query: 549 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 608
           C +SGLV  G + F  M E Y  + S +HY  MV+LLGRAG +NEA + +  MP+KP   
Sbjct: 552 CSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVN 611

Query: 609 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 668
           V+G +L AC++H     AE AAE+L ++ P + G ++LL+NIY AA  W+ V ++R  + 
Sbjct: 612 VYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSML 671

Query: 669 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEA 705
            +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Sbjct: 672 RQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEA 717

BLAST of Cla97C03G057900 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 409.8 bits (1052), Expect = 6.2e-113
Identity = 226/660 (34.24%), Positives = 369/660 (55.91%), Query Frame = 0

Query: 48  HGFHQNPTLSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLV 107
           +GF  +  L S+L   Y N G L  + +VF  +    +  +N ++  L + G+   ++ +
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 108 YQQMVANSMHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEE 167
           +++M+++ +  D  T+  V +SF S  +V  G ++HG+++K GF   + V  +L   Y +
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 168 CIDFENAHQLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINL 227
               ++A ++ D+ + +++  W+S       NG  E    +F +M    +  D  T +++
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 228 LRSIADLNSIRLAKIVHCIAIVSKLC---GDLLVNTAVLSLYSKLRSLVDARKLFDKMSD 287
               AD   I L + VH I +  K C    D   NT +L +YSK   L  A+ +F +MSD
Sbjct: 303 FAGCADSRLISLGRAVHSIGV--KACFSREDRFCNT-LLDMYSKCGDLDSAKAVFREMSD 362

Query: 288 KDRVVWNIMIAAYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQT 347
           +  V +  MIA YAR+G A E ++LF+ M   GI  D++T  +V++  ++ + +D GK+ 
Sbjct: 363 RSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRV 422

Query: 348 HAHILRNGADSQISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQS 407
           H  I  N     I V N+L+DMY +C  +  A  +F+ M  K +ISW+ +I GY+K+  +
Sbjct: 423 HEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYA 482

Query: 408 LIALSLFS-RMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTA 467
             ALSLF+  ++      D  T   +LPA   +   +  + +HGY M+ G  S   +  +
Sbjct: 483 NEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANS 542

Query: 468 LLITYAKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSK 527
           L+  YAKCG L +A  +F++  I  KDL+ W  MI+ +  HG   +   L+NQM+ +  +
Sbjct: 543 LVDMYAKCGALLLAHMLFDD--IASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIE 602

Query: 528 PDQVTFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGE 587
            D+++F+ LL AC +SGLV EG  FF  M      +P+ EHYAC+V++L R G + +A  
Sbjct: 603 ADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYR 662

Query: 588 LVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGK 647
            + NMPI PDA +WG LL  C++H   KLAE  AEK+ ++EP+N G Y+L++NIYA A K
Sbjct: 663 FIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEK 722

Query: 648 WDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKE 704
           W+ V ++R  +  +GL+K PGCSW+EI G V  F   D ++P  E+I   LR +   + E
Sbjct: 723 WEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIE 777

BLAST of Cla97C03G057900 vs. ExPASy Swiss-Prot
Match: O81767 (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 399.4 bits (1025), Expect = 8.3e-110
Identity = 233/688 (33.87%), Positives = 385/688 (55.96%), Query Frame = 0

Query: 18  QSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTLSSQLIDSYANLGLLNLSLQVF 77
           +S+ ++ +  LF  C++ Q    +HAR ++    QN  +S++L++ Y  LG + L+   F
Sbjct: 50  ESKEIDDVHTLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTF 109

Query: 78  YSIAHPNSTLYNAILRNLTRYGECERTLLVYQQ-MVANSMHPDEETYPSVLRSFCSFSNV 137
             I + +   +N ++    R G     +  +   M+++ + PD  T+PSVL+   +   V
Sbjct: 110 DHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLK---ACRTV 169

Query: 138 RFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQLSDKRSVKNMECWSSFTTET 197
             G KIH   +K GF     VA +L  +Y       NA  L D+  V++M  W++  +  
Sbjct: 170 IDGNKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGY 229

Query: 198 PQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNSIRLAKIVHCIAIVSKLCGDL 257
            Q+GN +    L   +RA     DS+T ++LL +  +         +H  +I   L  +L
Sbjct: 230 CQSGNAKEALTLSNGLRA----MDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESEL 289

Query: 258 LVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAAYARDGKATECLELFKSMARS 317
            V+  ++ LY++   L D +K+FD+M  +D + WN +I AY  + +    + LF+ M  S
Sbjct: 290 FVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLS 349

Query: 318 GIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNG-ADSQISVHNSLIDMYCECNLLDS 377
            I+ D  T +S+ S +SQL  +   +      LR G     I++ N+++ MY +  L+DS
Sbjct: 350 RIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDS 409

Query: 378 ACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKSDG-IQADFITAINILPALV 437
           A  +FNW+ +  VISW+ +I GYA++G +  A+ +++ M+ +G I A+  T +++LPA  
Sbjct: 410 ARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACS 469

Query: 438 HIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEMAQRIFEEERIDDKDLIMW 497
             G L     LHG  +K GL     + T+L   Y KCG LE A  +F +  I   + + W
Sbjct: 470 QAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQ--IPRVNSVPW 529

Query: 498 NSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTACVNSGLVAEGKEFFKEMTE 557
           N++I+ H  HG   +   L+ +M     KPD +TF+ LL+AC +SGLV EG+  F+ M  
Sbjct: 530 NTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQT 589

Query: 558 RYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAE 617
            YG  PS +HY CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L +
Sbjct: 590 DYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGK 649

Query: 618 FAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHV 677
            A+E L ++EP++ G ++LLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V
Sbjct: 650 IASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKV 709

Query: 678 TEFRVADQTHPRAEDIYTILRNLELEIK 703
             F   +QTHP  E++Y  L  L+ ++K
Sbjct: 710 EVFYTGNQTHPMYEEMYRELTALQAKLK 728

BLAST of Cla97C03G057900 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 2.4e-109
Identity = 229/715 (32.03%), Positives = 392/715 (54.83%), Query Frame = 0

Query: 18  QSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTLSSQLIDSYANLGL---LNLSL 77
           QS+           C +   L   H      G   + +  ++L+     LG    L+ + 
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 78  QVF-YSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSMHPDEETYPSVLRSFCSF 137
           +VF  S ++    +YN+++R     G C   +L++ +M+ + + PD+ T+P  L S C+ 
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGL-SACAK 147

Query: 138 SNVR-FGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQLSDKRSVKNMECWSSF 197
           S  +  G +IHG +VK+G+     V  +L   Y EC + ++A ++ D+ S +N+  W+S 
Sbjct: 148 SRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSM 207

Query: 198 TTETPQNGNGEGIFRLFRRM-RAEQLVRDSLTFINLLRSIADLNSIRLAKIVHCIAIVSK 257
                +    +    LF RM R E++  +S+T + ++ + A L  +   + V+     S 
Sbjct: 208 ICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSG 267

Query: 258 LCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAAYARDGKATECLELFK 317
           +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R G   E L +F 
Sbjct: 268 IEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFN 327

Query: 318 SMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQISVHNSLIDMYCECN 377
            M  SG+R D  + LS ISS SQL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C+
Sbjct: 328 LMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCH 387

Query: 378 LLDSACKIFNWMTDKTVISWSAMIKGYAKHGQ------------------------SLIA 437
             D+A +IF+ M++KTV++W++++ GY ++G+                         L+ 
Sbjct: 388 RQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQ 447

Query: 438 LSLF--------SRMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSS 497
            SLF        S    +G+ AD +T ++I  A  H+G L+  K+++ Y  K G+     
Sbjct: 448 GSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVR 507

Query: 498 LNTALLITYAKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKC 557
           L T L+  +++CG  E A  IF    + ++D+  W + I A A  G+  +  +L++ M  
Sbjct: 508 LGTTLVDMFSRCGDPESAMSIFNS--LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIE 567

Query: 558 SNSKPDQVTFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLIN 617
              KPD V F+G LTAC + GLV +GKE F  M + +G  P   HY CMV+LLGRAGL+ 
Sbjct: 568 QGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLE 627

Query: 618 EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYA 677
           EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA
Sbjct: 628 EAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYA 687

Query: 678 AAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL 695
           +AG+W+ +AK+R  +++KGL+K PG S ++I G   EF   D++HP   +I  +L
Sbjct: 688 SAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAML 739
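
The percentages on each HSP summary line follow directly from the counts next to them (matches divided by alignment length). A minimal sketch of how such a line could be parsed and the percentages recomputed, assuming the summary text has the layout shown in this report; the helper name `parse_hsp_stats` is hypothetical and is not part of any BLAST toolkit:

```python
import re

# Hypothetical pattern for the "Identity = a/b (x%), Positives = c/d (y%)" line.
# The optional "i" tolerates a misspelled "Postives" label in some renderings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return match counts and recomputed percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        # Percent positives = conservative-or-identical positions / length.
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 229/715 (32.03%), Positives = 392/715 (54.83%), Query Frame = 0"
    )
    print(stats)
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, is a cheap consistency check when the report text has been copied or reflowed.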

BLAST of Cla97C03G057900 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 1.7e-107
Identity = 209/655 (31.91%), Positives = 367/655 (56.03%), Query Frame = 0

Query: 49  GFHQNPTLSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVY 108
           G   N  ++S LI +Y   G +++  ++F  +   +  ++N +L    + G  +  +  +
Sbjct: 168 GMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGF 227

Query: 109 QQMVANSMHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEEC 168
             M  + + P+  T+  VL    S   +  G ++HG +V  G D    +  +L  MY +C
Sbjct: 228 SVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKC 287

Query: 169 IDFENAHQLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLL 228
             F++A +L    S  +   W+   +   Q+G  E     F  M +  ++ D++TF +LL
Sbjct: 288 GRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLL 347

Query: 229 RSIADLNSIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRV 288
            S++   ++   K +HC  +   +  D+ + +A++  Y K R +  A+ +F + +  D V
Sbjct: 348 PSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVV 407

Query: 289 VWNIMIAAYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHI 348
           V+  MI+ Y  +G   + LE+F+ + +  I  +  T +S++  I  L  +  G++ H  I
Sbjct: 408 VFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFI 467

Query: 349 LRNGADSQISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIAL 408
           ++ G D++ ++  ++IDMY +C  ++ A +IF  ++ + ++SW++MI   A+      A+
Sbjct: 468 IKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAI 527

Query: 409 SLFSRMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITY 468
            +F +M   GI  D ++    L A  ++      K +HG+ +K  L S     + L+  Y
Sbjct: 528 DIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMY 587

Query: 469 AKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQM-KCSNSKPDQV 528
           AKCG L+ A  +F  + + +K+++ WNS+I+A  NHG       L+++M + S  +PDQ+
Sbjct: 588 AKCGNLKAAMNVF--KTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQI 647

Query: 529 TFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRN 588
           TFL ++++C + G V EG  FF+ MTE YG QP QEHYAC+V+L GRAG + EA E V++
Sbjct: 648 TFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKS 707

Query: 589 MPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGV 648
           MP  PDA VWG LL AC+LH   +LAE A+ KL+D++P N+G Y+L+SN +A A +W+ V
Sbjct: 708 MPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESV 767

Query: 649 AKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIK 703
            K+RS ++++ ++K PG SW+EIN     F   D  HP +  IY++L +L  E++
Sbjct: 768 TKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGELR 820

BLAST of Cla97C03G057900 vs. ExPASy TrEMBL
Match: A0A0A0M0Z6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534720 PE=4 SV=1)

HSP 1 Score: 1242.3 bits (3213), Expect = 0.0e+00
Identity = 622/718 (86.63%), Positives = 659/718 (91.78%), Query Frame = 0

Query: 1   MLHLQRSK-----PIFPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPT 60
           MLHL RSK     PIF NFPATQSRLLNTLS LF+RC+S QHL QIHARF+LHGFHQNPT
Sbjct: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANS 120
           LSS+LID YANLGLLN SLQVF S+  PN TL+NAILRNLTRYGE ERTLLVYQQMVA S
Sbjct: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120

Query: 121 MHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAH 180
           MHPDEETYP VLRS  SFSNV FGR IHGYLVKLGFD FD+VATALAEMYEECI+FENAH
Sbjct: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180

Query: 181 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 240
           QL DKRSVK++   SS TTE PQN NGEGIFR+F RM AEQLV DS TF NLLR IA LN
Sbjct: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240

Query: 241 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 300
           SI+LAKIVHCIAIVSKL GDLLVNTAVLSLYSKLRSLVDARKLFDKM +KDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300

Query: 301 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 360
           AYAR+GK TECLELFKSMARSGIRSD+FTAL VISSI+QLKCVDWGKQTHAHILRNG+DS
Sbjct: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 420
           Q+SVHNSLIDMYCEC +LDSACKIFNWMTDK+VISWSAMIKGY K+GQSL ALSLFS+MK
Sbjct: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420

Query: 421 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 480
           SDGIQADF+  INILPA VHIG LENVKYLHGY+MKLGLTSL SLNTALLITYAKCG +E
Sbjct: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480

Query: 481 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 540
           MAQR+FEEE+IDDKDLIMWNSMISAHANHGDWSQCFKLYN+MKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 600
           CVNSGLV +GKEFFKEMTE YGCQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Sbjct: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600

Query: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660

Query: 661 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKLI 714
           +KGLKK PGCSWLEINGHVTEFRVADQTHPRA DIYTIL NLELEIKE REK+ D L+
Sbjct: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLV 718

BLAST of Cla97C03G057900 vs. ExPASy TrEMBL
Match: A0A5D3DB69 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G00810 PE=4 SV=1)

HSP 1 Score: 1224.5 bits (3167), Expect = 0.0e+00
Identity = 616/718 (85.79%), Positives = 654/718 (91.09%), Query Frame = 0

Query: 1   MLHLQRSKPIFP-----NFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPT 60
           MLHLQRSKPI       NFPATQSRLLNTLS LFNRC+S QHL QIHARF+LHGFHQNPT
Sbjct: 1   MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANS 120
           LSS+LID YANLGLL  SLQVF SI  PN TL+NAILRNLTRYGE ER LLVYQQMVA S
Sbjct: 61  LSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKS 120

Query: 121 MHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAH 180
           MHPDEETYP + RS  SFSNV FGR IHGYLVKLGFDSFD+VATALAEMYE+ I FENAH
Sbjct: 121 MHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAH 180

Query: 181 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 240
           QL DKRSVK++   SS TTE  QNGNGEGIFR+F RMRAEQLV DSLTF+NLLR IA LN
Sbjct: 181 QLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLN 240

Query: 241 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 300
           SI+LAKIVHCIAIVSKL GDLLV TAVLSLYSKLRSLVDAR+LFDKM +KDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA 300

Query: 301 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 360
           AYAR+GK  ECLELFKSMARSGIRSD+FTAL VISSI+QLKCVDWGKQTHAHILRNG+DS
Sbjct: 301 AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 420
           Q+SVHNSLIDMYCEC +LDSAC IFNWMTDK+VISWSAMIKGY K+GQSL A SLFS+MK
Sbjct: 361 QVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK 420

Query: 421 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 480
           SDGIQADF+T INILPA VHIG LENVKYLHGY+MKLGLTSL SLNTALLITYAKCG +E
Sbjct: 421 SDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIE 480

Query: 481 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 540
           MAQR+FEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYN+MKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 600
           CVNSGL+ +GKEFFKEMTE YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Sbjct: 541 CVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR 660

Query: 661 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKLI 714
           +KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTIL NLELEIKE REK++D L+
Sbjct: 661 NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDTLV 718

BLAST of Cla97C03G057900 vs. ExPASy TrEMBL
Match: A0A1S3BBG7 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103487849 PE=4 SV=1)

HSP 1 Score: 1224.5 bits (3167), Expect = 0.0e+00
Identity = 616/718 (85.79%), Positives = 654/718 (91.09%), Query Frame = 0

Query: 1   MLHLQRSKPIFP-----NFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPT 60
           MLHLQRSKPI       NFPATQSRLLNTLS LFNRC+S QHL QIHARF+LHGFHQNPT
Sbjct: 1   MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANS 120
           LSS+LID YANLGLL  SLQVF SI  PN TL+NAILRNLTRYGE ER LLVYQQMVA S
Sbjct: 61  LSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKS 120

Query: 121 MHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAH 180
           MHPDEETYP + RS  SFSNV FGR IHGYLVKLGFDSFD+VATALAEMYE+ I FENAH
Sbjct: 121 MHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAH 180

Query: 181 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 240
           QL DKRSVK++   SS TTE  QNGNGEGIFR+F RMRAEQLV DSLTF+NLLR IA LN
Sbjct: 181 QLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLN 240

Query: 241 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 300
           SI+LAKIVHCIAIVSKL GDLLV TAVLSLYSKLRSLVDAR+LFDKM +KDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA 300

Query: 301 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 360
           AYAR+GK  ECLELFKSMARSGIRSD+FTAL VISSI+QLKCVDWGKQTHAHILRNG+DS
Sbjct: 301 AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 420
           Q+SVHNSLIDMYCEC +LDSAC IFNWMTDK+VISWSAMIKGY K+GQSL A SLFS+MK
Sbjct: 361 QVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK 420

Query: 421 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 480
           SDGIQADF+T INILPA VHIG LENVKYLHGY+MKLGLTSL SLNTALLITYAKCG +E
Sbjct: 421 SDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIE 480

Query: 481 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 540
           MAQR+FEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYN+MKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 600
           CVNSGL+ +GKEFFKEMTE YGC PSQEH+ACMVNLLGRAGLI+EAGELVRNMPIKPDAR
Sbjct: 541 CVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR 660

Query: 661 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKLI 714
           +KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTIL NLELEIKE REK++D L+
Sbjct: 661 NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDTLV 718

BLAST of Cla97C03G057900 vs. ExPASy TrEMBL
Match: A0A6J1CE61 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111010677 PE=4 SV=1)

HSP 1 Score: 1209.5 bits (3128), Expect = 0.0e+00
Identity = 600/716 (83.80%), Positives = 642/716 (89.66%), Query Frame = 0

Query: 1   MLHLQRSKPI----FPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTL 60
           MLHLQRSKPI    F NFPATQSR LNTLSFLF+RCSS Q L QIHARF+LHG HQNP L
Sbjct: 1   MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPAL 60

Query: 61  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 120
           S +LIDSYANLGLL LS QVF SI  P STLY+AILRNL+ +GE ERTLLVY++M A SM
Sbjct: 61  SCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSM 120

Query: 121 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQ 180
           HPDEETYPSVLRS C  SNV +GRKIHG+LVKLG D +D  ATALAEMY +CI FEN H 
Sbjct: 121 HPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHD 180

Query: 181 LSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNS 240
           L DK  +K+ ECW+S  +E  QNGNG+ IF+LF RMR EQLV DSLTFINLLRSI  LNS
Sbjct: 181 LFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNS 240

Query: 241 IRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAA 300
           I+LAKIVHC+AI S LCGDLLVNTAVLSLYSKL  LV+ARKLFDKM +KDRVVWNIMIAA
Sbjct: 241 IQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA 300

Query: 301 YARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQ 360
           Y R+G   ECLELFKSMARSGIR+D+FTAL VISSISQLKCVDWGKQTHAH LRNG+D+Q
Sbjct: 301 YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQ 360

Query: 361 ISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKS 420
           +SVHNSLIDMYCE N+LDSACKIF+WMT+KTVISWSAMIKG  KHGQSL ALSLFSRMKS
Sbjct: 361 VSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKS 420

Query: 421 DGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEM 480
           DGIQADFIT INILPA VHIG LENVKYLHGY+MKLGLTSL SLNTALLITYAKCGC+EM
Sbjct: 421 DGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEM 480

Query: 481 AQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTAC 540
           AQR+FEEER+DDKDLIMWNSMISAHANHGDWSQCFK+YNQMKCSNS+PDQVTFLGLLTAC
Sbjct: 481 AQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTAC 540

Query: 541 VNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600
           VNSGLV +GKE FKEM E YGCQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDARV
Sbjct: 541 VNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV 600

Query: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660
           WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD
Sbjct: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660

Query: 661 KGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL 713
           KGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTIL NLELEIKEAREK+ +KL
Sbjct: 661 KGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEAREKSPEKL 716

BLAST of Cla97C03G057900 vs. ExPASy TrEMBL
Match: A0A6J1GR57 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111456773 PE=4 SV=1)

HSP 1 Score: 1182.2 bits (3057), Expect = 0.0e+00
Identity = 587/716 (81.98%), Positives = 637/716 (88.97%), Query Frame = 0

Query: 1   MLHLQRSKPI----FPNFPATQSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTL 60
           M HLQRSKPI    FPNFPAT SRLLNTLS LF+RC S Q L QIHARF+LHGFHQNPTL
Sbjct: 1   MFHLQRSKPIFRFKFPNFPATHSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFHQNPTL 60

Query: 61  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 120
           S +LID YAN GLLNLS  VF SI  PNS LYNAILRNLTR+GE ERTLLVY++MVA SM
Sbjct: 61  SCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREMVAKSM 120

Query: 121 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQ 180
           HPDE+TYP VLRS C  SNV+FG+ IHG L+KLG DS+D V T L EMYE+CIDFENAHQ
Sbjct: 121 HPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDFENAHQ 180

Query: 181 LSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNS 240
           L DK SVK+++CWSS  TE PQNGNG+ I RLF RM++E LV DSLTFINLLRS++ L+S
Sbjct: 181 LFDKMSVKDLDCWSSLITEAPQNGNGDDISRLFGRMKSEPLVTDSLTFINLLRSVSGLSS 240

Query: 241 IRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAA 300
           I+LAKIVHCIAIVS LCGDLLV+TAVLSLYSKL SLVDARKLF+K+ +KDRVVWNIMIAA
Sbjct: 241 IQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWNIMIAA 300

Query: 301 YARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQ 360
           YAR+G+  ECLELF+SMARSGIR+D+FT L VISSISQLK  DWGKQTHA+ILRNG+DSQ
Sbjct: 301 YAREGRPMECLELFESMARSGIRADLFTVLPVISSISQLKRADWGKQTHANILRNGSDSQ 360

Query: 361 ISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKS 420
           +SVHNSLIDMYCECN LDSA KIFN +T+KTVISWSAMIKG  KHG  LIALSLF RMKS
Sbjct: 361 VSVHNSLIDMYCECNSLDSASKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLFFRMKS 420

Query: 421 DGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEM 480
           DGIQADFIT INI+PA V IG LENVKYLHGY++KL LTSL SLNTALLITYAKCGC++M
Sbjct: 421 DGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKCGCIDM 480

Query: 481 AQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTAC 540
           AQR+FEEER++DKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNS PDQVTFLGLLTAC
Sbjct: 481 AQRLFEEERVNDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLGLLTAC 540

Query: 541 VNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600
           VNSGLV +GKEFFKEM E Y CQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV
Sbjct: 541 VNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARV 600

Query: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660
           WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD
Sbjct: 601 WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD 660

Query: 661 KGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEAREKNIDKL 713
           KGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL NLEL+IKE +E + +KL
Sbjct: 661 KGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEVKEMSPEKL 716

BLAST of Cla97C03G057900 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 415.6 bits (1067), Expect = 8.0e-116
Identity = 226/709 (31.88%), Positives = 389/709 (54.87%), Query Frame = 0

Query: 9   PIFPNFPATQSRLLNTLSF------------LFNRCSSPQHLHQIHARFLLHGFHQNPTL 68
           P  PN P+     L+  ++            L  RCSS + L QI      +G +Q    
Sbjct: 12  PQIPNPPSRHRHFLSERNYIPANVYEHPAALLLERCSSLKELRQILPLVFKNGLYQEHFF 71

Query: 69  SSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSM 128
            ++L+  +   G ++ + +VF  I    + LY+ +L+   +  + ++ L  + +M  + +
Sbjct: 72  QTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDV 131

Query: 129 HPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVA-TALAEMYEECIDFENAH 188
            P    +  +L+     + +R G++IHG LVK GF S D+ A T L  MY +C     A 
Sbjct: 132 EPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGF-SLDLFAMTGLENMYAKCRQVNEAR 191

Query: 189 QLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLN 248
           ++ D+   +++  W++      QNG       + + M  E L    +T +++L +++ L 
Sbjct: 192 KVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALR 251

Query: 249 SIRLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIA 308
            I + K +H  A+ S     + ++TA++ +Y+K  SL  AR+LFD M +++ V WN MI 
Sbjct: 252 LISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMID 311

Query: 309 AYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADS 368
           AY ++    E + +F+ M   G++    + +  + + + L  ++ G+  H   +  G D 
Sbjct: 312 AYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDR 371

Query: 369 QISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMK 428
            +SV NSLI MYC+C  +D+A  +F  +  +T++SW+AMI G+A++G+ + AL+ FS+M+
Sbjct: 372 NVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMR 431

Query: 429 SDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLE 488
           S  ++ D  T ++++ A+  + +  + K++HG  M+  L     + TAL+  YAKCG + 
Sbjct: 432 SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIM 491

Query: 489 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTA 548
           +A+ IF  + + ++ +  WN+MI  +  HG      +L+ +M+    KP+ VTFL +++A
Sbjct: 492 IARLIF--DMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISA 551

Query: 549 CVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 608
           C +SGLV  G + F  M E Y  + S +HY  MV+LLGRAG +NEA + +  MP+KP   
Sbjct: 552 CSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVN 611

Query: 609 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 668
           V+G +L AC++H     AE AAE+L ++ P + G ++LL+NIY AA  W+ V ++R  + 
Sbjct: 612 VYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSML 671

Query: 669 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKEA 705
            +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Sbjct: 672 RQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEA 717

BLAST of Cla97C03G057900 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 409.8 bits (1052), Expect = 4.4e-114
Identity = 226/660 (34.24%), Positives = 369/660 (55.91%), Query Frame = 0

Query: 48  HGFHQNPTLSSQLIDSYANLGLLNLSLQVFYSIAHPNSTLYNAILRNLTRYGECERTLLV 107
           +GF  +  L S+L   Y N G L  + +VF  +    +  +N ++  L + G+   ++ +
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 108 YQQMVANSMHPDEETYPSVLRSFCSFSNVRFGRKIHGYLVKLGFDSFDIVATALAEMYEE 167
           +++M+++ +  D  T+  V +SF S  +V  G ++HG+++K GF   + V  +L   Y +
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 168 CIDFENAHQLSDKRSVKNMECWSSFTTETPQNGNGEGIFRLFRRMRAEQLVRDSLTFINL 227
               ++A ++ D+ + +++  W+S       NG  E    +F +M    +  D  T +++
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 228 LRSIADLNSIRLAKIVHCIAIVSKLC---GDLLVNTAVLSLYSKLRSLVDARKLFDKMSD 287
               AD   I L + VH I +  K C    D   NT +L +YSK   L  A+ +F +MSD
Sbjct: 303 FAGCADSRLISLGRAVHSIGV--KACFSREDRFCNT-LLDMYSKCGDLDSAKAVFREMSD 362

Query: 288 KDRVVWNIMIAAYARDGKATECLELFKSMARSGIRSDMFTALSVISSISQLKCVDWGKQT 347
           +  V +  MIA YAR+G A E ++LF+ M   GI  D++T  +V++  ++ + +D GK+ 
Sbjct: 363 RSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRV 422

Query: 348 HAHILRNGADSQISVHNSLIDMYCECNLLDSACKIFNWMTDKTVISWSAMIKGYAKHGQS 407
           H  I  N     I V N+L+DMY +C  +  A  +F+ M  K +ISW+ +I GY+K+  +
Sbjct: 423 HEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYA 482

Query: 408 LIALSLFS-RMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSSLNTA 467
             ALSLF+  ++      D  T   +LPA   +   +  + +HGY M+ G  S   +  +
Sbjct: 483 NEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANS 542

Query: 468 LLITYAKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSK 527
           L+  YAKCG L +A  +F++  I  KDL+ W  MI+ +  HG   +   L+NQM+ +  +
Sbjct: 543 LVDMYAKCGALLLAHMLFDD--IASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIE 602

Query: 528 PDQVTFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLINEAGE 587
            D+++F+ LL AC +SGLV EG  FF  M      +P+ EHYAC+V++L R G + +A  
Sbjct: 603 ADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYR 662

Query: 588 LVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGK 647
            + NMPI PDA +WG LL  C++H   KLAE  AEK+ ++EP+N G Y+L++NIYA A K
Sbjct: 663 FIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEK 722

Query: 648 WDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILRNLELEIKE 704
           W+ V ++R  +  +GL+K PGCSW+EI G V  F   D ++P  E+I   LR +   + E
Sbjct: 723 WEQVKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIE 777

BLAST of Cla97C03G057900 vs. TAIR 10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 399.4 bits (1025), Expect = 5.9e-111
Identity = 233/688 (33.87%), Positives = 385/688 (55.96%), Query Frame = 0

Query: 18  QSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTLSSQLIDSYANLGLLNLSLQVF 77
           +S+ ++ +  LF  C++ Q    +HAR ++    QN  +S++L++ Y  LG + L+   F
Sbjct: 50  ESKEIDDVHTLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTF 109

Query: 78  YSIAHPNSTLYNAILRNLTRYGECERTLLVYQQ-MVANSMHPDEETYPSVLRSFCSFSNV 137
             I + +   +N ++    R G     +  +   M+++ + PD  T+PSVL+   +   V
Sbjct: 110 DHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLK---ACRTV 169

Query: 138 RFGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQLSDKRSVKNMECWSSFTTET 197
             G KIH   +K GF     VA +L  +Y       NA  L D+  V++M  W++  +  
Sbjct: 170 IDGNKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGY 229

Query: 198 PQNGNGEGIFRLFRRMRAEQLVRDSLTFINLLRSIADLNSIRLAKIVHCIAIVSKLCGDL 257
            Q+GN +    L   +RA     DS+T ++LL +  +         +H  +I   L  +L
Sbjct: 230 CQSGNAKEALTLSNGLRA----MDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESEL 289

Query: 258 LVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAAYARDGKATECLELFKSMARS 317
            V+  ++ LY++   L D +K+FD+M  +D + WN +I AY  + +    + LF+ M  S
Sbjct: 290 FVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLS 349

Query: 318 GIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNG-ADSQISVHNSLIDMYCECNLLDS 377
            I+ D  T +S+ S +SQL  +   +      LR G     I++ N+++ MY +  L+DS
Sbjct: 350 RIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDS 409

Query: 378 ACKIFNWMTDKTVISWSAMIKGYAKHGQSLIALSLFSRMKSDG-IQADFITAINILPALV 437
           A  +FNW+ +  VISW+ +I GYA++G +  A+ +++ M+ +G I A+  T +++LPA  
Sbjct: 410 ARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACS 469

Query: 438 HIGVLENVKYLHGYAMKLGLTSLSSLNTALLITYAKCGCLEMAQRIFEEERIDDKDLIMW 497
             G L     LHG  +K GL     + T+L   Y KCG LE A  +F +  I   + + W
Sbjct: 470 QAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQ--IPRVNSVPW 529

Query: 498 NSMISAHANHGDWSQCFKLYNQMKCSNSKPDQVTFLGLLTACVNSGLVAEGKEFFKEMTE 557
           N++I+ H  HG   +   L+ +M     KPD +TF+ LL+AC +SGLV EG+  F+ M  
Sbjct: 530 NTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQT 589

Query: 558 RYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAE 617
            YG  PS +HY CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L +
Sbjct: 590 DYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGK 649

Query: 618 FAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHV 677
            A+E L ++EP++ G ++LLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V
Sbjct: 650 IASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKV 709

Query: 678 TEFRVADQTHPRAEDIYTILRNLELEIK 703
             F   +QTHP  E++Y  L  L+ ++K
Sbjct: 710 EVFYTGNQTHPMYEEMYRELTALQAKLK 728

BLAST of Cla97C03G057900 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 397.9 bits (1021), Expect = 1.7e-110
Identity = 229/715 (32.03%), Positives = 392/715 (54.83%), Query Frame = 0

Query: 18  QSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTLSSQLIDSYANLGL---LNLSL 77
           QS+           C +   L   H      G   + +  ++L+     LG    L+ + 
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 78  QVF-YSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSMHPDEETYPSVLRSFCSF 137
           +VF  S ++    +YN+++R     G C   +L++ +M+ + + PD+ T+P  L S C+ 
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGL-SACAK 147

Query: 138 SNVR-FGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQLSDKRSVKNMECWSSF 197
           S  +  G +IHG +VK+G+     V  +L   Y EC + ++A ++ D+ S +N+  W+S 
Sbjct: 148 SRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSM 207

Query: 198 TTETPQNGNGEGIFRLFRRM-RAEQLVRDSLTFINLLRSIADLNSIRLAKIVHCIAIVSK 257
                +    +    LF RM R E++  +S+T + ++ + A L  +   + V+     S 
Sbjct: 208 ICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSG 267

Query: 258 LCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAAYARDGKATECLELFK 317
           +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R G   E L +F 
Sbjct: 268 IEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFN 327

Query: 318 SMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQISVHNSLIDMYCECN 377
            M  SG+R D  + LS ISS SQL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C+
Sbjct: 328 LMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCH 387

Query: 378 LLDSACKIFNWMTDKTVISWSAMIKGYAKHGQ------------------------SLIA 437
             D+A +IF+ M++KTV++W++++ GY ++G+                         L+ 
Sbjct: 388 RQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQ 447

Query: 438 LSLF--------SRMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSS 497
            SLF        S    +G+ AD +T ++I  A  H+G L+  K+++ Y  K G+     
Sbjct: 448 GSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVR 507

Query: 498 LNTALLITYAKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKC 557
           L T L+  +++CG  E A  IF    + ++D+  W + I A A  G+  +  +L++ M  
Sbjct: 508 LGTTLVDMFSRCGDPESAMSIFNS--LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIE 567

Query: 558 SNSKPDQVTFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLIN 617
              KPD V F+G LTAC + GLV +GKE F  M + +G  P   HY CMV+LLGRAGL+ 
Sbjct: 568 QGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLE 627

Query: 618 EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYA 677
           EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA
Sbjct: 628 EAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYA 687

Query: 678 AAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL 695
           +AG+W+ +AK+R  +++KGL+K PG S ++I G   EF   D++HP   +I  +L
Sbjct: 688 SAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAML 739

BLAST of Cla97C03G057900 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 397.9 bits (1021), Expect = 1.7e-110
Identity = 229/715 (32.03%), Positives = 392/715 (54.83%), Query Frame = 0

Query: 18  QSRLLNTLSFLFNRCSSPQHLHQIHARFLLHGFHQNPTLSSQLIDSYANLGL---LNLSL 77
           QS+           C +   L   H      G   + +  ++L+     LG    L+ + 
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 78  QVF-YSIAHPNSTLYNAILRNLTRYGECERTLLVYQQMVANSMHPDEETYPSVLRSFCSF 137
           +VF  S ++    +YN+++R     G C   +L++ +M+ + + PD+ T+P  L S C+ 
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGL-SACAK 147

Query: 138 SNVR-FGRKIHGYLVKLGFDSFDIVATALAEMYEECIDFENAHQLSDKRSVKNMECWSSF 197
           S  +  G +IHG +VK+G+     V  +L   Y EC + ++A ++ D+ S +N+  W+S 
Sbjct: 148 SRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSM 207

Query: 198 TTETPQNGNGEGIFRLFRRM-RAEQLVRDSLTFINLLRSIADLNSIRLAKIVHCIAIVSK 257
                +    +    LF RM R E++  +S+T + ++ + A L  +   + V+     S 
Sbjct: 208 ICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSG 267

Query: 258 LCGDLLVNTAVLSLYSKLRSLVDARKLFDKMSDKDRVVWNIMIAAYARDGKATECLELFK 317
           +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R G   E L +F 
Sbjct: 268 IEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFN 327

Query: 318 SMARSGIRSDMFTALSVISSISQLKCVDWGKQTHAHILRNGADSQISVHNSLIDMYCECN 377
            M  SG+R D  + LS ISS SQL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C+
Sbjct: 328 LMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCH 387

Query: 378 LLDSACKIFNWMTDKTVISWSAMIKGYAKHGQ------------------------SLIA 437
             D+A +IF+ M++KTV++W++++ GY ++G+                         L+ 
Sbjct: 388 RQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQ 447

Query: 438 LSLF--------SRMKSDGIQADFITAINILPALVHIGVLENVKYLHGYAMKLGLTSLSS 497
            SLF        S    +G+ AD +T ++I  A  H+G L+  K+++ Y  K G+     
Sbjct: 448 GSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVR 507

Query: 498 LNTALLITYAKCGCLEMAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKC 557
           L T L+  +++CG  E A  IF    + ++D+  W + I A A  G+  +  +L++ M  
Sbjct: 508 LGTTLVDMFSRCGDPESAMSIFNS--LTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIE 567

Query: 558 SNSKPDQVTFLGLLTACVNSGLVAEGKEFFKEMTERYGCQPSQEHYACMVNLLGRAGLIN 617
              KPD V F+G LTAC + GLV +GKE F  M + +G  P   HY CMV+LLGRAGL+ 
Sbjct: 568 QGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLE 627

Query: 618 EAGELVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYA 677
           EA +L+ +MP++P+  +W  LL+AC++    ++A +AAEK+  + P+  G+Y+LLSN+YA
Sbjct: 628 EAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYA 687

Query: 678 AAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL 695
           +AG+W+ +AK+R  +++KGL+K PG S ++I G   EF   D++HP   +I  +L
Sbjct: 688 SAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAML 739
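
The identity and positives percentages in the HSP header above are simply the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic, using the counts reported for this hit (229 identical and 392 positive positions over 715 aligned columns). The helper name `percent` is an illustrative assumption and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the AT3G22690.2 hit.
identity = percent(229, 715)
positives = percent(392, 715)

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

Note that the denominator is the alignment length including gap columns, not the length of either protein, so the same pair of sequences can report different percentages under different alignments.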

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038894029.1  0.0e+00   90.10     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benin...
XP_008444579.1  0.0e+00   85.79     PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-...
XP_022139869.1  0.0e+00   83.80     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momor...
KAG6573373.1    0.0e+00   82.54     Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
KAG7012542.1    0.0e+00   82.26     Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name      E-value   Identity  Description
Q3E6Q1          1.1e-114  31.88     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9SN39          6.2e-113  34.24     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
O81767          8.3e-110  33.87     Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...
Q9LUJ2          2.4e-109  32.03     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9STE1          1.7e-107  31.91     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
A0A0A0M0Z6      0.0e+00   86.63     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534720 PE=4 SV=1
A0A5D3DB69      0.0e+00   85.79     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BBG7      0.0e+00   85.79     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cuc...
A0A6J1CE61      0.0e+00   83.80     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Mom...
A0A6J1GR57      0.0e+00   81.98     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cuc...

Match Name      E-value   Identity  Description
AT1G11290.1     8.0e-116  31.88     Pentatricopeptide repeat (PPR) superfamily protein
AT4G18750.1     4.4e-114  34.24     Pentatricopeptide repeat (PPR) superfamily protein
AT4G33990.1     5.9e-111  33.87     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1     1.7e-110  32.03     CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
AT3G22690.2     1.7e-110  32.03     INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description                                    Source   Source Term     Source Description                           Alignment
None       No IPR available                                   COILS    Coil            Coil                                         coord: 687..707
None       No IPR available                                   PANTHER  PTHR47928       REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED  coord: 1..712
None       No IPR available                                   PANTHER  PTHR47928:SF25  PPR CONTAINING PLANT-LIKE PROTEIN            coord: 1..712
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain              coord: 460..689, e-value: 5.0E-35, score: 123.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain              coord: 16..196, e-value: 1.2E-13, score: 53.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain              coord: 197..335, e-value: 1.7E-20, score: 75.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain              coord: 336..459, e-value: 2.4E-18, score: 68.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535         PPR                                          coord: 288..318, e-value: 5.4E-8, score: 32.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535         PPR                                          coord: 360..381, e-value: 0.034, score: 14.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535         PPR                                          coord: 389..419, e-value: 8.3E-8, score: 32.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535         PPR                                          coord: 260..285, e-value: 0.16, score: 12.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756                                    coord: 288..321, e-value: 7.1E-7, score: 27.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756                                    coord: 389..422, e-value: 2.9E-6, score: 25.1
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756                                    coord: 87..120, e-value: 5.4E-4, score: 17.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756                                    coord: 492..525, e-value: 9.1E-8, score: 29.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756       TIGR00756                                    coord: 527..561, e-value: 3.3E-4, score: 18.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041         PPR_2                                        coord: 490..537, e-value: 6.7E-12, score: 45.4
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR                                          coord: 490..524, score: 11.268274
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR                                          coord: 525..560, score: 8.769097
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR                                          coord: 387..421, score: 11.070971
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR                                          coord: 286..320, score: 11.925952
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375         PPR                                          coord: 84..118, score: 9.108898
IPR011528  Nuclease-related domain, NERD                      PROSITE  PS50965         NERD                                         coord: 671..715, score: 8.677929
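
The PROSITE PS51375 (PPR) hits in the InterPro table above are listed out of positional order. The sketch below sorts them along the protein and computes each repeat's length. It assumes the `start..end` coordinates are 1-based and inclusive (the usual convention for InterPro output, but not stated on this page). The names `PPR_COORDS` and `parse_coord` are illustrative.

```python
import re

# PROSITE PS51375 (PPR) coordinates copied from the InterPro table.
PPR_COORDS = ["490..524", "525..560", "387..421", "286..320", "84..118"]


def parse_coord(text: str) -> tuple[int, int]:
    """Parse a 'start..end' coordinate string into a (start, end) tuple."""
    match = re.fullmatch(r"(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a start..end coordinate: {text!r}")
    return int(match.group(1)), int(match.group(2))


# Sort repeats by start position along the protein.
spans = sorted(parse_coord(c) for c in PPR_COORDS)

# Length of each repeat, assuming inclusive 1-based coordinates.
lengths = [end - start + 1 for start, end in spans]

for (start, end), length in zip(spans, lengths):
    print(f"{start}..{end}\t{length} aa")
```

Under that assumption, four of the five repeats span 35 residues and one spans 36, which matches the roughly 35-residue motif that gives the pentatricopeptide repeat its name.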

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C03G057900.1  Cla97C03G057900.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding