Cla97C09G165200 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C09G165200
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930
Location: Cla97Chr09: 2471066 .. 2473300 (+)
RNA-Seq Expression: Cla97C09G165200
Synteny: Cla97C09G165200
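
The Location field can be turned into a feature length with a few lines of Python. This is only a sketch: the function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive (the usual GFF-style convention), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr09: 2471066 .. 2473300 (+)")
print(seqid, strand, span_length(start, end))  # Cla97Chr09 + 2235
```

Under that assumption the gene spans 2235 bp on the plus strand of Cla97Chr09.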
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCAACCCCGGTGCCTTTCAGGACTCTGTTGCATCATCGTCATGTTACAAATTCCAAGCAAATGGCCACCATAGCCACCATCTCCTCAGCTTCAAGCTCCTTCTCTCCACCGACCCACCCTCTAATCTCTCTTCTCGAGACCTGCAAATCCATGGACCAGCTTCAGCAGATCCACTGTCAAGCAATCAAAACAGCTTTCAATGCTAACCCAGTTCTGCAAAACAAAGTCATGTCCTTTTGTTGTACTCATGAATGCGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAACCGAATTTGTTCATCTGGAACACCATGATCAGAGGCTACTCCCGTTTGGATTCTCCTGAGCTCGGAGTTTCTTTGTATCTGGAAATGTTGAGGAGAGGTTTCAAGCCTGATCGTTACACCTTCCCTTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTGGAATATGGAAGAGAGCTTCATGGCCATGTTCTGAAGCTTGGACTTCAGTCTAATGTCTTTGTTCACACTGCTTTGGTGCAAATGTATCTTTTGTGTGGCCAACTTGATACGGCTCGTAGGGTTTTGGATGTTTGTTCTAAAGCTGATGTGATTGCTTGGAATATGATGATTTCTGCTTACAATAAAGTTGGTGAGTTTGAGGAATCAAGAAGAATTTTTCTTGGTATGGAGGAAAAACAAGTGCTGCCCACCACAGTGACCCTTGTTTTAATCCTGTCAGCTTGCTCCAAGTTGAAGGATTTAAAAACTGGGAAGCAGGTTCATAGTTATGTGAACAACTGCAAGGTTGAGAGCAATTTGGTTCTTGAAAATGCTCTGATTGATATGTATGCTACTTGTGGGGAAATGGATTCTGCCCTTGGGATATTCAGGAGTATGAATAACAAAGATATCATTTCTTGGACGACCATTGTATCTGGGTTTACCAACTTGGGAGAAATTGATGTTGCTCGGAACTACTTCGACAAGATGCCAGAGAAAGATTATGTTTCATGGACTGCCATGATTGATGGATACATCCGCTCAAATCGATTCAAAGAAGCATTGGAGTTATTCCGCAATATGCAAGCAACCAATGTAAAGCCTGATGAGTTCACTATGGTTAGTATTCTGACTGCTTGTGCACATCTAGGGGCCCTTGAGTTAGGAGAATGGATAAGAACTTACATTGATCGGAACAAGATCAACAATGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCAGAAAGAATATTCAGAGAGATGTGTCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTAAATGGCCATGGTGAGAAAGCTCTTGATATGTTTTCTCAAATGCTAAAAGCTTCAATTTTGCCAGATGAGATTACTTACATTGGTGTTCTTTCTGCTTGTACACACACTGGCATGGTAGAGAAAGGACGAGAGTATTTTCTTAGCATGACAACCCAACATGGTATTGAACCCAATATAGCACACTACGGTTGTCTGGTTGATCTTCTTGCTCGAGCTGGTTGTCTAAAAGAAGCCCATGAAGTCATCGAGAACATGCCAATGAAACCCAATTCCATTGTCTGGGGGGCTCTTCTAGCTGGTTGTAGAGTTTATAGAGAAGCCGATATGGCTGAAATGGTTGTTAAGCAGATTCTTGATTTGGAGCCTGAGAATGGTGCTGTCTATGTTCTCCTGTGTAATATTTATGCAGCTTGCAAGAGATGGAATGACCTGCGAGAGTTGAGGCAGATGATGATGGACAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGATAGAGATGAATGGCACAGTTCATGAATTTGTAGCTGGGGACCGATCACATCCTCAAACTGAAAAAATTGATGTTAAGCTAAACAAAATGACCCAAGACCTGAAATTTGCAGGGTATTCACCTGATGTCTCAGAAGTGTTCCTTGACATAGCAGAAGA
GGATAAAGAGAACTCAGTCTTTCGTCACAGTGAGAAGTTGGCCATTGCTTTTGGACTCATTAATTCCCCACCTGGGGTCACGATTAGAATCGTGAAGAACCTTCGAATGTGCATGGATTGTCACAATATGGCGAAGTTAGTCTCAAAGGTGTATAATAGAGAAGTAATTGTTAGGGACAGAACCAGATTCCACCATTTCAAACATGGTTTATGTTCGTGTAAAGACTACTGGTGA

mRNA sequence

ATGCCAACCCCGGTGCCTTTCAGGACTCTGTTGCATCATCGTCATGTTACAAATTCCAAGCAAATGGCCACCATAGCCACCATCTCCTCAGCTTCAAGCTCCTTCTCTCCACCGACCCACCCTCTAATCTCTCTTCTCGAGACCTGCAAATCCATGGACCAGCTTCAGCAGATCCACTGTCAAGCAATCAAAACAGCTTTCAATGCTAACCCAGTTCTGCAAAACAAAGTCATGTCCTTTTGTTGTACTCATGAATGCGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAACCGAATTTGTTCATCTGGAACACCATGATCAGAGGCTACTCCCGTTTGGATTCTCCTGAGCTCGGAGTTTCTTTGTATCTGGAAATGTTGAGGAGAGGTTTCAAGCCTGATCGTTACACCTTCCCTTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTGGAATATGGAAGAGAGCTTCATGGCCATGTTCTGAAGCTTGGACTTCAGTCTAATGTCTTTGTTCACACTGCTTTGGTGCAAATGTATCTTTTGTGTGGCCAACTTGATACGGCTCGTAGGGTTTTGGATGTTTGTTCTAAAGCTGATGTGATTGCTTGGAATATGATGATTTCTGCTTACAATAAAGTTGGTGAGTTTGAGGAATCAAGAAGAATTTTTCTTGGTATGGAGGAAAAACAAGTGCTGCCCACCACAGTGACCCTTGTTTTAATCCTGTCAGCTTGCTCCAAGTTGAAGGATTTAAAAACTGGGAAGCAGGTTCATAGTTATGTGAACAACTGCAAGGTTGAGAGCAATTTGGTTCTTGAAAATGCTCTGATTGATATGTATGCTACTTGTGGGGAAATGGATTCTGCCCTTGGGATATTCAGGAGTATGAATAACAAAGATATCATTTCTTGGACGACCATTGTATCTGGGTTTACCAACTTGGGAGAAATTGATGTTGCTCGGAACTACTTCGACAAGATGCCAGAGAAAGATTATGTTTCATGGACTGCCATGATTGATGGATACATCCGCTCAAATCGATTCAAAGAAGCATTGGAGTTATTCCGCAATATGCAAGCAACCAATGTAAAGCCTGATGAGTTCACTATGGTTAGTATTCTGACTGCTTGTGCACATCTAGGGGCCCTTGAGTTAGGAGAATGGATAAGAACTTACATTGATCGGAACAAGATCAACAATGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCAGAAAGAATATTCAGAGAGATGTGTCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTAAATGGCCATGGTGAGAAAGCTCTTGATATGTTTTCTCAAATGCTAAAAGCTTCAATTTTGCCAGATGAGATTACTTACATTGGTGTTCTTTCTGCTTGTACACACACTGGCATGGTAGAGAAAGGACGAGAGTATTTTCTTAGCATGACAACCCAACATGGTATTGAACCCAATATAGCACACTACGGTTGTCTGGTTGATCTTCTTGCTCGAGCTGGTTGTCTAAAAGAAGCCCATGAAGTCATCGAGAACATGCCAATGAAACCCAATTCCATTGTCTGGGGGGCTCTTCTAGCTGGTTGTAGAGTTTATAGAGAAGCCGATATGGCTGAAATGGTTGTTAAGCAGATTCTTGATTTGGAGCCTGAGAATGGTGCTGTCTATGTTCTCCTGTGTAATATTTATGCAGCTTGCAAGAGATGGAATGACCTGCGAGAGTTGAGGCAGATGATGATGGACAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGATAGAGATGAATGGCACAGTTCATGAATTTGTAGCTGGGGACCGATCACATCCTCAAACTGAAAAAATTGATGTTAAGCTAAACAAAATGACCCAAGACCTGAAATTTGCAGGGTATTCACCTGATGTCTCAGAAGTGTTCCTTGACATAGCAGAAGA
GGATAAAGAGAACTCAGTCTTTCGTCACAGTGAGAAGTTGGCCATTGCTTTTGGACTCATTAATTCCCCACCTGGGGTCACGATTAGAATCGTGAAGAACCTTCGAATGTGCATGGATTGTCACAATATGGCGAAGTTAGTCTCAAAGGTGTATAATAGAGAAGTAATTGTTAGGGACAGAACCAGATTCCACCATTTCAAACATGGTTTATGTTCGTGTAAAGACTACTGGTGA

Coding sequence (CDS)

ATGCCAACCCCGGTGCCTTTCAGGACTCTGTTGCATCATCGTCATGTTACAAATTCCAAGCAAATGGCCACCATAGCCACCATCTCCTCAGCTTCAAGCTCCTTCTCTCCACCGACCCACCCTCTAATCTCTCTTCTCGAGACCTGCAAATCCATGGACCAGCTTCAGCAGATCCACTGTCAAGCAATCAAAACAGCTTTCAATGCTAACCCAGTTCTGCAAAACAAAGTCATGTCCTTTTGTTGTACTCATGAATGCGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAACCGAATTTGTTCATCTGGAACACCATGATCAGAGGCTACTCCCGTTTGGATTCTCCTGAGCTCGGAGTTTCTTTGTATCTGGAAATGTTGAGGAGAGGTTTCAAGCCTGATCGTTACACCTTCCCTTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTGGAATATGGAAGAGAGCTTCATGGCCATGTTCTGAAGCTTGGACTTCAGTCTAATGTCTTTGTTCACACTGCTTTGGTGCAAATGTATCTTTTGTGTGGCCAACTTGATACGGCTCGTAGGGTTTTGGATGTTTGTTCTAAAGCTGATGTGATTGCTTGGAATATGATGATTTCTGCTTACAATAAAGTTGGTGAGTTTGAGGAATCAAGAAGAATTTTTCTTGGTATGGAGGAAAAACAAGTGCTGCCCACCACAGTGACCCTTGTTTTAATCCTGTCAGCTTGCTCCAAGTTGAAGGATTTAAAAACTGGGAAGCAGGTTCATAGTTATGTGAACAACTGCAAGGTTGAGAGCAATTTGGTTCTTGAAAATGCTCTGATTGATATGTATGCTACTTGTGGGGAAATGGATTCTGCCCTTGGGATATTCAGGAGTATGAATAACAAAGATATCATTTCTTGGACGACCATTGTATCTGGGTTTACCAACTTGGGAGAAATTGATGTTGCTCGGAACTACTTCGACAAGATGCCAGAGAAAGATTATGTTTCATGGACTGCCATGATTGATGGATACATCCGCTCAAATCGATTCAAAGAAGCATTGGAGTTATTCCGCAATATGCAAGCAACCAATGTAAAGCCTGATGAGTTCACTATGGTTAGTATTCTGACTGCTTGTGCACATCTAGGGGCCCTTGAGTTAGGAGAATGGATAAGAACTTACATTGATCGGAACAAGATCAACAATGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCAGAAAGAATATTCAGAGAGATGTGTCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTAAATGGCCATGGTGAGAAAGCTCTTGATATGTTTTCTCAAATGCTAAAAGCTTCAATTTTGCCAGATGAGATTACTTACATTGGTGTTCTTTCTGCTTGTACACACACTGGCATGGTAGAGAAAGGACGAGAGTATTTTCTTAGCATGACAACCCAACATGGTATTGAACCCAATATAGCACACTACGGTTGTCTGGTTGATCTTCTTGCTCGAGCTGGTTGTCTAAAAGAAGCCCATGAAGTCATCGAGAACATGCCAATGAAACCCAATTCCATTGTCTGGGGGGCTCTTCTAGCTGGTTGTAGAGTTTATAGAGAAGCCGATATGGCTGAAATGGTTGTTAAGCAGATTCTTGATTTGGAGCCTGAGAATGGTGCTGTCTATGTTCTCCTGTGTAATATTTATGCAGCTTGCAAGAGATGGAATGACCTGCGAGAGTTGAGGCAGATGATGATGGACAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGATAGAGATGAATGGCACAGTTCATGAATTTGTAGCTGGGGACCGATCACATCCTCAAACTGAAAAAATTGATGTTAAGCTAAACAAAATGACCCAAGACCTGAAATTTGCAGGGTATTCACCTGATGTCTCAGAAGTGTTCCTTGACATAGCAGAAGA
GGATAAAGAGAACTCAGTCTTTCGTCACAGTGAGAAGTTGGCCATTGCTTTTGGACTCATTAATTCCCCACCTGGGGTCACGATTAGAATCGTGAAGAACCTTCGAATGTGCATGGATTGTCACAATATGGCGAAGTTAGTCTCAAAGGTGTATAATAGAGAAGTAATTGTTAGGGACAGAACCAGATTCCACCATTTCAAACATGGTTTATGTTCGTGTAAAGACTACTGGTGA

Protein sequence

MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW
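
The relationship between the coding sequence and the protein sequence above can be spot-checked with a small translation routine. The sketch below uses the standard genetic code and a hypothetical helper named `translate`. It is applied only to the first eight and last three codons of the CDS as listed in this record, not to the full sequence.

```python
# Standard genetic code, with codons ordered by base in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGCCAACCCCGGTGCCTTTCAGG"))  # MPTPVPFR
print(translate("TACTGGTGA"))                 # YW*
```

The two fragments reproduce the start (`MPTPVPFR`) and the end (`YW` followed by a stop codon) of the protein sequence shown above.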
Homology
BLAST of Cla97C09G165200 vs. NCBI nr
Match: XP_031744195.1 (putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis sativus] >XP_031744196.1 putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis sativus] >XP_031744197.1 putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis sativus])

HSP 1 Score: 1398.6 bits (3619), Expect = 0.0e+00
Identity = 673/744 (90.46%), Positives = 703/744 (94.49%), Query Frame = 0

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLHHRHV   KQM TIA  SSA  SFSPPTHPLISLLETC+SMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHHRHVKKPKQMTTIAATSSALKSFSPPTHPLISLLETCESMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIK   NANPVLQN+VM+FCCTHE GD +YA  LFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKKGLNANPVLQNRVMTFCCTHEYGDFQYARRLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQYNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR V DVC KADVI WNM+ISAYNKVG+FEESRR+FL ME+KQVLPTT
Sbjct: 181 VQMYLLCGQLDTARGVFDVCPKADVITWNMIISAYNKVGKFEESRRLFLVMEDKQVLPTT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENA+IDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENAMIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYIDRNKI ND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYIDRNKIKNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAE IFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAESIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA+EVIENMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYEVIENMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYRE+DMAEMVVKQIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQM
Sbjct: 541 SIVWGALLAGCRVYRESDMAEMVVKQILELEPDNGAVYVLLCNIYAACKRWNDLRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 720

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 721 EVIVRDRTRFHHFKHGLCSCKDYW 744
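
The percentages in an HSP summary line are plain ratios of matching columns to alignment length. The following sketch recomputes them from the counts reported for the first hit. The function name `hsp_ratios` is hypothetical, and the regular expression assumes the `label = matches/length` layout used in this record.

```python
import re

def hsp_ratios(summary_line):
    """Return {label: (matches, length, percent)} for each 'label = a/b' pair."""
    ratios = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        matches, length = int(matches), int(length)
        ratios[label] = (matches, length, round(100 * matches / length, 2))
    return ratios

line = "Identity = 673/744 (90.46%), Positives = 703/744 (94.49%), Query Frame = 0"
for label, (matches, length, percent) in hsp_ratios(line).items():
    print(f"{label}: {matches}/{length} = {percent}%")
# Identity: 673/744 = 90.46%
# Positives: 703/744 = 94.49%
```

The `Query Frame = 0` field is skipped because it carries no `a/b` ratio.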

BLAST of Cla97C09G165200 vs. NCBI nr
Match: KAA0058740.1 (putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK10534.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1393.3 bits (3605), Expect = 0.0e+00
Identity = 672/744 (90.32%), Positives = 704/744 (94.62%), Query Frame = 0

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLH  HV  SKQM TIA  SSAS SFSPPT PLI LLETCKSMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHRFHVKESKQMPTIAATSSASKSFSPPTRPLIYLLETCKSMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIKT  NANPVLQN+VMSFCCT + GD +YA HLFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKTGLNANPVLQNRVMSFCCTDDYGDFQYARHLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ+NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQNNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR VLDVCSKADVI WNM+ISAYNKVG+FEESRR+FL ME KQVL TT
Sbjct: 181 VQMYLLCGQLDTARGVLDVCSKADVITWNMIISAYNKVGKFEESRRLFLVMENKQVLATT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENALIDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENALIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYI+RNKINND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYINRNKINNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAERIFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAERIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA++VI+NMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYDVIKNMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYREADMAEMVVK IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQM
Sbjct: 541 SIVWGALLAGCRVYREADMAEMVVKHILELEPDNGAVYVLLCNIYAACKRWNELRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLD+AEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 661 FLDVAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 720

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 721 EVIVRDRTRFHHFKHGLCSCKDYW 744

BLAST of Cla97C09G165200 vs. NCBI nr
Match: XP_008461137.2 (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis melo])

HSP 1 Score: 1390.9 bits (3599), Expect = 0.0e+00
Identity = 671/744 (90.19%), Positives = 703/744 (94.49%), Query Frame = 0

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLH  HV  SKQM TIA  SSAS SFSPPT PLI LLETCKSMDQLQQ+HC
Sbjct: 13  MPVPVRFRTLLHRFHVKESKQMPTIAATSSASKSFSPPTRPLIYLLETCKSMDQLQQVHC 72

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIKT  NANPVLQN+VMSFCCT + GD +YA HLFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 73  QAIKTGLNANPVLQNRVMSFCCTDDYGDFQYARHLFDEIPEPNLFIWNTMIRGYSRLDFP 132

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ+NVFVHTAL
Sbjct: 133 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQNNVFVHTAL 192

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR VLDVCSKADVI WNM+ISAYNKVG+FEESRR+FL ME KQVL TT
Sbjct: 193 VQMYLLCGQLDTARGVLDVCSKADVITWNMIISAYNKVGKFEESRRLFLVMENKQVLATT 252

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENALIDMYA CGEMDSALGIFRS
Sbjct: 253 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENALIDMYADCGEMDSALGIFRS 312

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 313 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 372

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYI+RNKINND FVRNALIDMYFKC
Sbjct: 373 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYINRNKINNDLFVRNALIDMYFKC 432

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAERIFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 433 GDVDKAERIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 492

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA++VI+NMP+K N
Sbjct: 493 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYDVIKNMPIKAN 552

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYREADMAEMVVK IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQM
Sbjct: 553 SIVWGALLAGCRVYREADMAEMVVKHILELEPDNGAVYVLLCNIYAACKRWNELRELRQM 612

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKK PGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 613 MMDKGIKKXPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 672

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLD+AEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 673 FLDVAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 732

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 733 EVIVRDRTRFHHFKHGLCSCKDYW 756

BLAST of Cla97C09G165200 vs. NCBI nr
Match: XP_038896377.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930 [Benincasa hispida])

HSP 1 Score: 1365.5 bits (3533), Expect = 0.0e+00
Identity = 658/719 (91.52%), Positives = 688/719 (95.69%), Query Frame = 0

Query: 26  ATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHE 85
           AT  S S+  SP THP+ISL++TCKSMDQLQQIHCQAIKT  NANPVLQN++MSFCCTHE
Sbjct: 62  ATTFSTSNPISPSTHPVISLVQTCKSMDQLQQIHCQAIKTGLNANPVLQNRLMSFCCTHE 121

Query: 86  CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFL 145
           CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP+LGVSLY+EMLRRGF+PDRYTFPFL
Sbjct: 122 CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPQLGVSLYVEMLRRGFEPDRYTFPFL 181

Query: 146 FKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADV 205
           FKGFTRDIALEYGRELHGHVLK GLQSNVFVHTALVQMYLLCGQLDTAR V DV SKADV
Sbjct: 182 FKGFTRDIALEYGRELHGHVLKHGLQSNVFVHTALVQMYLLCGQLDTARGVFDVFSKADV 241

Query: 206 IAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSY 265
           IAWNMMISAY K GEFEES R+FLGMEEK+VLPTTVTLVLILSACSKLKDLKTGKQV SY
Sbjct: 242 IAWNMMISAYKKAGEFEESIRLFLGMEEKKVLPTTVTLVLILSACSKLKDLKTGKQVDSY 301

Query: 266 VNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVA 325
           V NCKVESNLVLENALIDMYA CGEMD+AL IFRSMN++DIISWTTIVSGFTN+GEIDVA
Sbjct: 302 VKNCKVESNLVLENALIDMYAACGEMDAALEIFRSMNDRDIISWTTIVSGFTNMGEIDVA 361

Query: 326 RNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHL 385
           RNYFDKMPEKDYVSWTAMIDGYIR NRFKEALELFRNMQATNVKPDEFTMVSILTACAHL
Sbjct: 362 RNYFDKMPEKDYVSWTAMIDGYIRLNRFKEALELFRNMQATNVKPDEFTMVSILTACAHL 421

Query: 386 GALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMI 445
           GALELGEWIRTYIDRNKINND FVRNALIDMYFKCGNV+KAE IFREMCQRDKFTWTAMI
Sbjct: 422 GALELGEWIRTYIDRNKINNDTFVRNALIDMYFKCGNVEKAESIFREMCQRDKFTWTAMI 481

Query: 446 VGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGI 505
           VGLAVNG GEKALDMFS+MLKASI+PDEITYIGVLSACTHTGMV+KGREYFLSMTTQHGI
Sbjct: 482 VGLAVNGRGEKALDMFSEMLKASIMPDEITYIGVLSACTHTGMVDKGREYFLSMTTQHGI 541

Query: 506 EPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVK 565
           EPNIAHYGCLVDLLARAG LKEAHEVIENMPMKPNSIV G LL GCRVYREA+MAEMVVK
Sbjct: 542 EPNIAHYGCLVDLLARAGHLKEAHEVIENMPMKPNSIVLGGLLGGCRVYREANMAEMVVK 601

Query: 566 QILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFV 625
           QIL+LEPENGAVYVLLCNIYAACKRWN+LRELRQMMMDKGIKKTPGCSLIEMNGTVHEFV
Sbjct: 602 QILELEPENGAVYVLLCNIYAACKRWNELRELRQMMMDKGIKKTPGCSLIEMNGTVHEFV 661

Query: 626 AGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFG 685
           AGDRSHPQT++ID KL+KMTQ+LK AGYSPD+SEVFLDIAEEDKENSVFRHSEKLAIAFG
Sbjct: 662 AGDRSHPQTKQIDAKLDKMTQELKSAGYSPDISEVFLDIAEEDKENSVFRHSEKLAIAFG 721

Query: 686 LINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           LINSP GVTIR+VKNLRMC DCHNMAKLVSKV+NREVIVRDRTRFHHFKHGLCSCK+YW
Sbjct: 722 LINSPSGVTIRVVKNLRMCTDCHNMAKLVSKVHNREVIVRDRTRFHHFKHGLCSCKEYW 780

BLAST of Cla97C09G165200 vs. NCBI nr
Match: XP_022991386.1 (putative pentatricopeptide repeat-containing protein At3g15930 [Cucurbita maxima])

HSP 1 Score: 1336.6 bits (3458), Expect = 0.0e+00
Identity = 644/728 (88.46%), Positives = 689/728 (94.64%), Query Frame = 0

Query: 17  TNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNK 76
           T  KQMATIA   +AS   S  THPLISLLE C+SMDQLQQIHC+AIKT   ANPVLQN+
Sbjct: 3   TKLKQMATIA--CTASKPLSSTTHPLISLLEICESMDQLQQIHCRAIKTGLAANPVLQNR 62

Query: 77  VMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFK 136
           VM+FCCTHECGDLKYA HLFDE+PEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRG K
Sbjct: 63  VMAFCCTHECGDLKYARHLFDEMPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGVK 122

Query: 137 PDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRV 196
           PD Y+FPFLFKGFTRDIAL+ GRELHGHVLK GL SNVFVHTALVQMYLLCG LDTAR V
Sbjct: 123 PDNYSFPFLFKGFTRDIALQCGRELHGHVLKHGLLSNVFVHTALVQMYLLCGLLDTARGV 182

Query: 197 LDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDL 256
           LD  SKADVIAWNMMI+AYNKVG+FEESRR+FLGMEEKQVLPTTVTLVLILSACSKLKD 
Sbjct: 183 LDAGSKADVIAWNMMIAAYNKVGKFEESRRLFLGMEEKQVLPTTVTLVLILSACSKLKDF 242

Query: 257 KTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGF 316
           KTGK VHS VNNCKVESNLVLENALIDMYA CGEMDSALGIFR+MNNKDIISWTTIVSGF
Sbjct: 243 KTGKHVHSCVNNCKVESNLVLENALIDMYAACGEMDSALGIFRNMNNKDIISWTTIVSGF 302

Query: 317 TNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMV 376
           TNLGEIDVARNYFD+MPEKD VSWTAMIDGY+ +NRFKEA +LFR+MQAT+VKPDEFTMV
Sbjct: 303 TNLGEIDVARNYFDQMPEKDCVSWTAMIDGYLHTNRFKEAFDLFRHMQATSVKPDEFTMV 362

Query: 377 SILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQR 436
           SILTACA LGALELGEWI+TYID+NKINNDAFVRNALIDMYFKCGNVDKAER+FREM QR
Sbjct: 363 SILTACAQLGALELGEWIKTYIDKNKINNDAFVRNALIDMYFKCGNVDKAERVFREMHQR 422

Query: 437 DKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYF 496
           DKFTWT +IVGLAVNGHGEKALD+FS+ML+ASILPD++TYIGVLSACTHTGMV+KGRE+F
Sbjct: 423 DKFTWTTIIVGLAVNGHGEKALDIFSKMLEASILPDDVTYIGVLSACTHTGMVDKGREFF 482

Query: 497 LSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYRE 556
           LSMTTQHGIEPNI HYGCLVDLLARAG LKEAHEVI+NMP++PNSIVWGALLAGCRV+RE
Sbjct: 483 LSMTTQHGIEPNITHYGCLVDLLARAGRLKEAHEVIKNMPIEPNSIVWGALLAGCRVHRE 542

Query: 557 ADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIE 616
           A+MAEMV KQIL+LEPENGAVYVLLCNIYAACKRWNDLR+LRQMMMDKGIKK PGCSLIE
Sbjct: 543 ANMAEMVAKQILELEPENGAVYVLLCNIYAACKRWNDLRDLRQMMMDKGIKKIPGCSLIE 602

Query: 617 MNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRH 676
           MNGTVHEFVAGDRSHPQT++IDVKL KMTQDLKFAGYSPD+S+VFLDIAEEDKENSVFRH
Sbjct: 603 MNGTVHEFVAGDRSHPQTKEIDVKLEKMTQDLKFAGYSPDISKVFLDIAEEDKENSVFRH 662

Query: 677 SEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHG 736
           SEKLAIAFGLINSPPGVTIRIVKNLRMC+DCH++AKL+SKVY+REVIVRDRTRFHHFKHG
Sbjct: 663 SEKLAIAFGLINSPPGVTIRIVKNLRMCLDCHSVAKLISKVYDREVIVRDRTRFHHFKHG 722

Query: 737 LCSCKDYW 745
           LCSCKDYW
Sbjct: 723 LCSCKDYW 728

BLAST of Cla97C09G165200 vs. ExPASy Swiss-Prot
Match: Q9LSB8 (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 764.2 bits (1972), Expect = 1.3e-219
Identity = 357/640 (55.78%), Positives = 471/640 (73.59%), Query Frame = 0

Query: 28  ISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECG 87
           +S+ + S S      IS+L  CK+ DQ +Q+H Q+I      NP  Q K+  F C+   G
Sbjct: 23  MSTITESISNDYSRFISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGG 82

Query: 88  DLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFK 147
            + YA+ LF +IPEP++ +WN MI+G+S++D    GV LYL ML+ G  PD +TFPFL  
Sbjct: 83  HVSYAYKLFVKIPEPDVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLN 142

Query: 148 GFTRD-IALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVI 207
           G  RD  AL  G++LH HV+K GL SN++V  ALV+MY LCG +D AR V D   K DV 
Sbjct: 143 GLKRDGGALACGKKLHCHVVKFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVF 202

Query: 208 AWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYV 267
           +WN+MIS YN++ E+EES  + + ME   V PT+VTL+L+LSACSK+KD    K+VH YV
Sbjct: 203 SWNLMISGYNRMKEYEESIELLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYV 262

Query: 268 NNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVAR 327
           + CK E +L LENAL++ YA CGEMD A+ IFRSM  +D+ISWT+IV G+   G + +AR
Sbjct: 263 SECKTEPSLRLENALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLAR 322

Query: 328 NYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLG 387
            YFD+MP +D +SWT MIDGY+R+  F E+LE+FR MQ+  + PDEFTMVS+LTACAHLG
Sbjct: 323 TYFDQMPVRDRISWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLG 382

Query: 388 ALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIV 447
           +LE+GEWI+TYID+NKI ND  V NALIDMYFKCG  +KA+++F +M QRDKFTWTAM+V
Sbjct: 383 SLEIGEWIKTYIDKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVV 442

Query: 448 GLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIE 507
           GLA NG G++A+ +F QM   SI PD+ITY+GVLSAC H+GMV++ R++F  M + H IE
Sbjct: 443 GLANNGQGQEAIKVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIE 502

Query: 508 PNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQ 567
           P++ HYGC+VD+L RAG +KEA+E++  MPM PNSIVWGALL   R++ +  MAE+  K+
Sbjct: 503 PSLVHYGCMVDMLGRAGLVKEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKK 562

Query: 568 ILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVA 627
           IL+LEP+NGAVY LLCNIYA CKRW DLRE+R+ ++D  IKKTPG SLIE+NG  HEFVA
Sbjct: 563 ILELEPDNGAVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVA 622

Query: 628 GDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAE 667
           GD+SH Q+E+I +KL ++ Q+  FA Y PD SE+  +  +
Sbjct: 623 GDKSHLQSEEIYMKLEELAQESTFAAYLPDTSELLFEAGD 662

BLAST of Cla97C09G165200 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 650.6 bits (1677), Expect = 2.2e-185
Identity = 308/722 (42.66%), Positives = 474/722 (65.65%), Query Frame = 0

Query: 34  SFSPPTHPL--------ISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHE 93
           +FS P  P         ISL+E C S+ QL+Q H   I+T   ++P   +K+ +      
Sbjct: 17  NFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSS 76

Query: 94  CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRG-FKPDRYTFPF 153
              L+YA  +FDEIP+PN F WNT+IR Y+    P L +  +L+M+      P++YTFPF
Sbjct: 77  FASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPF 136

Query: 154 LFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKAD 213
           L K      +L  G+ LHG  +K  + S+VFV  +L+  Y  CG LD+A +V     + D
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKD 196

Query: 214 VIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHS 273
           V++WN MI+ + + G  +++  +F  ME + V  + VT+V +LSAC+K+++L+ G+QV S
Sbjct: 197 VVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCS 256

Query: 274 YVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDV 333
           Y+   +V  NL L NA++DMY  CG ++ A  +F +M  KD ++WTT++ G+    + + 
Sbjct: 257 YIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEA 316

Query: 334 ARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQ-ATNVKPDEFTMVSILTACA 393
           AR   + MP+KD V+W A+I  Y ++ +  EAL +F  +Q   N+K ++ T+VS L+ACA
Sbjct: 317 AREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACA 376

Query: 394 HLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTA 453
            +GALELG WI +YI ++ I  +  V +ALI MY KCG+++K+  +F  + +RD F W+A
Sbjct: 377 QVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSA 436

Query: 454 MIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQH 513
           MI GLA++G G +A+DMF +M +A++ P+ +T+  V  AC+HTG+V++    F  M + +
Sbjct: 437 MIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNY 496

Query: 514 GIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMV 573
           GI P   HY C+VD+L R+G L++A + IE MP+ P++ VWGALL  C+++   ++AEM 
Sbjct: 497 GIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMA 556

Query: 574 VKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHE 633
             ++L+LEP N   +VLL NIYA   +W ++ ELR+ M   G+KK PGCS IE++G +HE
Sbjct: 557 CTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHE 616

Query: 634 FVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEED-KENSVFRHSEKLAI 693
           F++GD +HP +EK+  KL+++ + LK  GY P++S+V   I EE+ KE S+  HSEKLAI
Sbjct: 617 FLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAI 676

Query: 694 AFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKD 745
            +GLI++     IR++KNLR+C DCH++AKL+S++Y+RE+IVRDR RFHHF++G CSC D
Sbjct: 677 CYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCND 736

BLAST of Cla97C09G165200 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 631.7 bits (1628), Expect = 1.0e-179
Identity = 303/751 (40.35%), Positives = 456/751 (60.72%), Query Frame = 0

Query: 33  SSFSPP-----THPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFC-CTHEC 92
           SS  PP      HP +SLL  CK++  L+ IH Q IK   +      +K++ FC  +   
Sbjct: 22  SSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHF 81

Query: 93  GDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLF 152
             L YA  +F  I EPNL IWNTM RG++    P   + LY+ M+  G  P+ YTFPF+ 
Sbjct: 82  EGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVL 141

Query: 153 KGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSK---- 212
           K   +  A + G+++HGHVLKLG   +++VHT+L+ MY+  G+L+ A +V D        
Sbjct: 142 KSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVV 201

Query: 213 ---------------------------ADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQ 272
                                       DV++WN MIS Y + G ++E+  +F  M +  
Sbjct: 202 SYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTN 261

Query: 273 VLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSAL 332
           V P   T+V ++SAC++   ++ G+QVH ++++    SNL + NALID+Y+ CGE+++A 
Sbjct: 262 VRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETAC 321

Query: 333 GIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKE 392
           G+F  +  KD+ISW T++ G+T++                               N +KE
Sbjct: 322 GLFERLPYKDVISWNTLIGGYTHM-------------------------------NLYKE 381

Query: 393 ALELFRNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDR--NKINNDAFVRNAL 452
           AL LF+ M  +   P++ TM+SIL ACAHLGA+++G WI  YID+    + N + +R +L
Sbjct: 382 ALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSL 441

Query: 453 IDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDE 512
           IDMY KCG+++ A ++F  +  +   +W AMI G A++G  + + D+FS+M K  I PD+
Sbjct: 442 IDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDD 501

Query: 513 ITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIE 572
           IT++G+LSAC+H+GM++ GR  F +MT  + + P + HYGC++DLL  +G  KEA E+I 
Sbjct: 502 ITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMIN 561

Query: 573 NMPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWND 632
            M M+P+ ++W +LL  C+++   ++ E   + ++ +EPEN   YVLL NIYA+  RWN+
Sbjct: 562 MMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNE 621

Query: 633 LRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGY 692
           + + R ++ DKG+KK PGCS IE++  VHEF+ GD+ HP+  +I   L +M   L+ AG+
Sbjct: 622 VAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGF 681

Query: 693 SPDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKL 745
            PD SEV  ++ EE KE ++  HSEKLAIAFGLI++ PG  + IVKNLR+C +CH   KL
Sbjct: 682 VPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKL 741

BLAST of Cla97C09G165200 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 594.7 bits (1532), Expect = 1.4e-168
Identity = 285/690 (41.30%), Positives = 432/690 (62.61%), Query Frame = 0

Query: 57  QIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSR 116
           QIH   +K  +  +  +QN ++ F    ECG+L  A  +FDE+ E N+  W +MI GY+R
Sbjct: 155 QIHGLIVKMGYAKDLFVQNSLVHFYA--ECGELDSARKVFDEMSERNVVSWTSMICGYAR 214

Query: 117 LDSPELGVSLYLEMLR-RGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVF 176
            D  +  V L+  M+R     P+  T   +     +   LE G +++  +   G++ N  
Sbjct: 215 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 274

Query: 177 VHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQ 236
           + +ALV MY+ C  +D A+R+ D    +++   N M S Y + G   E+  +F  M +  
Sbjct: 275 MVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSG 334

Query: 237 VLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSAL 296
           V P  ++++  +S+CS+L+++  GK  H YV     ES   + NALIDMY  C   D+A 
Sbjct: 335 VRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAF 394

Query: 297 GIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKE 356
            IF  M+NK +++W +IV+G+   GE+D A   F+ MPEK+ VSW  +I G ++ + F+E
Sbjct: 395 RIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEE 454

Query: 357 ALELFRNMQA-TNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALI 416
           A+E+F +MQ+   V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+
Sbjct: 455 AIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLV 514

Query: 417 DMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEI 476
           DM+ +CG+ + A  IF  +  RD   WTA I  +A+ G+ E+A+++F  M++  + PD +
Sbjct: 515 DMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGV 574

Query: 477 TYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIEN 536
            ++G L+AC+H G+V++G+E F SM   HG+ P   HYGC+VDLL RAG L+EA ++IE+
Sbjct: 575 AFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIED 634

Query: 537 MPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDL 596
           MPM+PN ++W +LLA CRV    +MA    ++I  L PE    YVLL N+YA+  RWND+
Sbjct: 635 MPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDM 694

Query: 597 RELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYS 656
            ++R  M +KG++K PG S I++ G  HEF +GD SHP+   I+  L++++Q     G+ 
Sbjct: 695 AKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 754

Query: 657 PDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLV 716
           PD+S V +D+ E++K   + RHSEKLA+A+GLI+S  G TIRIVKNLR+C DCH+ AK  
Sbjct: 755 PDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFA 814

Query: 717 SKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           SKVYNRE+I+RD  RFH+ + G CSC D+W
Sbjct: 815 SKVYNREIILRDNNRFHYIRQGKCSCGDFW 842

BLAST of Cla97C09G165200 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 4.2e-165
Identity = 280/717 (39.05%), Positives = 444/717 (61.92%), Query Frame = 0

Query: 36  SPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHL 95
           S   + ++  L  CKS++ ++Q+H   ++T  N    L + + +   +    +L YA ++
Sbjct: 9   STAANTILEKLSFCKSLNHIKQLHAHILRTVINHK--LNSFLFNLSVSSSSINLSYALNV 68

Query: 96  FDEIPE-PNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIA 155
           F  IP  P   ++N  +R  SR   P   +  Y  +   G + D+++F  + K  ++  A
Sbjct: 69  FSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSA 128

Query: 156 LEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISA 215
           L  G ELHG   K+    + FV T  + MY  CG+++ AR V D  S  DV+ WN MI  
Sbjct: 129 LFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIER 188

Query: 216 YNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESN 275
           Y + G  +E+ ++F  M++  V+P  + L  I+SAC +  +++  + ++ ++    V  +
Sbjct: 189 YCRFGLVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMD 248

Query: 276 LVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPE 335
             L  AL+ MYA  G MD A   FR M+ +++   T +VSG++  G +D A+  FD+  +
Sbjct: 249 THLLTALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEK 308

Query: 336 KDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLGALELGEWI 395
           KD V WT MI  Y+ S+  +EAL +F  M  + +KPD  +M S+++ACA+LG L+  +W+
Sbjct: 309 KDLVCWTTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWV 368

Query: 396 RTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHG 455
            + I  N + ++  + NALI+MY KCG +D    +F +M +R+  +W++MI  L+++G  
Sbjct: 369 HSCIHVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEA 428

Query: 456 EKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGC 515
             AL +F++M + ++ P+E+T++GVL  C+H+G+VE+G++ F SMT ++ I P + HYGC
Sbjct: 429 SDALSLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGC 488

Query: 516 LVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPEN 575
           +VDL  RA  L+EA EVIE+MP+  N ++WG+L++ CR++ E ++ +   K+IL+LEP++
Sbjct: 489 MVDLFGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDH 548

Query: 576 GAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQT 635
               VL+ NIYA  +RW D+R +R++M +K + K  G S I+ NG  HEF+ GD+ H Q+
Sbjct: 549 DGALVLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQS 608

Query: 636 EKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPP--- 695
            +I  KL+++   LK AGY PD   V +D+ EE+K++ V  HSEKLA+ FGL+N      
Sbjct: 609 NEIYAKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEE 668

Query: 696 ----GVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
               GV IRIVKNLR+C DCH   KLVSKVY RE+IVRDRTRFH +K+GLCSC+DYW
Sbjct: 669 KDSCGV-IRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Cla97C09G165200 vs. ExPASy TrEMBL
Match: A0A5A7UUL4 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G001870 PE=3 SV=1)

HSP 1 Score: 1393.3 bits (3605), Expect = 0.0e+00
Identity = 672/744 (90.32%), Positives = 704/744 (94.62%), Query Frame = 0

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLH  HV  SKQM TIA  SSAS SFSPPT PLI LLETCKSMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHRFHVKESKQMPTIAATSSASKSFSPPTRPLIYLLETCKSMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIKT  NANPVLQN+VMSFCCT + GD +YA HLFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKTGLNANPVLQNRVMSFCCTDDYGDFQYARHLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ+NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQNNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR VLDVCSKADVI WNM+ISAYNKVG+FEESRR+FL ME KQVL TT
Sbjct: 181 VQMYLLCGQLDTARGVLDVCSKADVITWNMIISAYNKVGKFEESRRLFLVMENKQVLATT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENALIDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENALIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYI+RNKINND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYINRNKINNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAERIFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAERIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA++VI+NMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYDVIKNMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYREADMAEMVVK IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQM
Sbjct: 541 SIVWGALLAGCRVYREADMAEMVVKHILELEPDNGAVYVLLCNIYAACKRWNELRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLD+AEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 661 FLDVAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 720

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 721 EVIVRDRTRFHHFKHGLCSCKDYW 744

BLAST of Cla97C09G165200 vs. ExPASy TrEMBL
Match: A0A1S3CDK0 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930 OS=Cucumis melo OX=3656 GN=LOC103499814 PE=3 SV=1)

HSP 1 Score: 1390.9 bits (3599), Expect = 0.0e+00
Identity = 671/744 (90.19%), Positives = 703/744 (94.49%), Query Frame = 0

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLH  HV  SKQM TIA  SSAS SFSPPT PLI LLETCKSMDQLQQ+HC
Sbjct: 13  MPVPVRFRTLLHRFHVKESKQMPTIAATSSASKSFSPPTRPLIYLLETCKSMDQLQQVHC 72

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIKT  NANPVLQN+VMSFCCT + GD +YA HLFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 73  QAIKTGLNANPVLQNRVMSFCCTDDYGDFQYARHLFDEIPEPNLFIWNTMIRGYSRLDFP 132

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ+NVFVHTAL
Sbjct: 133 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQNNVFVHTAL 192

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR VLDVCSKADVI WNM+ISAYNKVG+FEESRR+FL ME KQVL TT
Sbjct: 193 VQMYLLCGQLDTARGVLDVCSKADVITWNMIISAYNKVGKFEESRRLFLVMENKQVLATT 252

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENALIDMYA CGEMDSALGIFRS
Sbjct: 253 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENALIDMYADCGEMDSALGIFRS 312

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 313 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 372

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYI+RNKINND FVRNALIDMYFKC
Sbjct: 373 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYINRNKINNDLFVRNALIDMYFKC 432

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAERIFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 433 GDVDKAERIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 492

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA++VI+NMP+K N
Sbjct: 493 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYDVIKNMPIKAN 552

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYREADMAEMVVK IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQM
Sbjct: 553 SIVWGALLAGCRVYREADMAEMVVKHILELEPDNGAVYVLLCNIYAACKRWNELRELRQM 612

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKK PGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 613 MMDKGIKKXPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 672

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLD+AEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 673 FLDVAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 732

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 733 EVIVRDRTRFHHFKHGLCSCKDYW 756

BLAST of Cla97C09G165200 vs. ExPASy TrEMBL
Match: A0A6J1JQK8 (putative pentatricopeptide repeat-containing protein At3g15930 OS=Cucurbita maxima OX=3661 GN=LOC111488039 PE=3 SV=1)

HSP 1 Score: 1336.6 bits (3458), Expect = 0.0e+00
Identity = 644/728 (88.46%), Positives = 689/728 (94.64%), Query Frame = 0

Query: 17  TNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNK 76
           T  KQMATIA   +AS   S  THPLISLLE C+SMDQLQQIHC+AIKT   ANPVLQN+
Sbjct: 3   TKLKQMATIA--CTASKPLSSTTHPLISLLEICESMDQLQQIHCRAIKTGLAANPVLQNR 62

Query: 77  VMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFK 136
           VM+FCCTHECGDLKYA HLFDE+PEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRG K
Sbjct: 63  VMAFCCTHECGDLKYARHLFDEMPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGVK 122

Query: 137 PDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRV 196
           PD Y+FPFLFKGFTRDIAL+ GRELHGHVLK GL SNVFVHTALVQMYLLCG LDTAR V
Sbjct: 123 PDNYSFPFLFKGFTRDIALQCGRELHGHVLKHGLLSNVFVHTALVQMYLLCGLLDTARGV 182

Query: 197 LDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDL 256
           LD  SKADVIAWNMMI+AYNKVG+FEESRR+FLGMEEKQVLPTTVTLVLILSACSKLKD 
Sbjct: 183 LDAGSKADVIAWNMMIAAYNKVGKFEESRRLFLGMEEKQVLPTTVTLVLILSACSKLKDF 242

Query: 257 KTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGF 316
           KTGK VHS VNNCKVESNLVLENALIDMYA CGEMDSALGIFR+MNNKDIISWTTIVSGF
Sbjct: 243 KTGKHVHSCVNNCKVESNLVLENALIDMYAACGEMDSALGIFRNMNNKDIISWTTIVSGF 302

Query: 317 TNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMV 376
           TNLGEIDVARNYFD+MPEKD VSWTAMIDGY+ +NRFKEA +LFR+MQAT+VKPDEFTMV
Sbjct: 303 TNLGEIDVARNYFDQMPEKDCVSWTAMIDGYLHTNRFKEAFDLFRHMQATSVKPDEFTMV 362

Query: 377 SILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQR 436
           SILTACA LGALELGEWI+TYID+NKINNDAFVRNALIDMYFKCGNVDKAER+FREM QR
Sbjct: 363 SILTACAQLGALELGEWIKTYIDKNKINNDAFVRNALIDMYFKCGNVDKAERVFREMHQR 422

Query: 437 DKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYF 496
           DKFTWT +IVGLAVNGHGEKALD+FS+ML+ASILPD++TYIGVLSACTHTGMV+KGRE+F
Sbjct: 423 DKFTWTTIIVGLAVNGHGEKALDIFSKMLEASILPDDVTYIGVLSACTHTGMVDKGREFF 482

Query: 497 LSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYRE 556
           LSMTTQHGIEPNI HYGCLVDLLARAG LKEAHEVI+NMP++PNSIVWGALLAGCRV+RE
Sbjct: 483 LSMTTQHGIEPNITHYGCLVDLLARAGRLKEAHEVIKNMPIEPNSIVWGALLAGCRVHRE 542

Query: 557 ADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIE 616
           A+MAEMV KQIL+LEPENGAVYVLLCNIYAACKRWNDLR+LRQMMMDKGIKK PGCSLIE
Sbjct: 543 ANMAEMVAKQILELEPENGAVYVLLCNIYAACKRWNDLRDLRQMMMDKGIKKIPGCSLIE 602

Query: 617 MNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRH 676
           MNGTVHEFVAGDRSHPQT++IDVKL KMTQDLKFAGYSPD+S+VFLDIAEEDKENSVFRH
Sbjct: 603 MNGTVHEFVAGDRSHPQTKEIDVKLEKMTQDLKFAGYSPDISKVFLDIAEEDKENSVFRH 662

Query: 677 SEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHG 736
           SEKLAIAFGLINSPPGVTIRIVKNLRMC+DCH++AKL+SKVY+REVIVRDRTRFHHFKHG
Sbjct: 663 SEKLAIAFGLINSPPGVTIRIVKNLRMCLDCHSVAKLISKVYDREVIVRDRTRFHHFKHG 722

Query: 737 LCSCKDYW 745
           LCSCKDYW
Sbjct: 723 LCSCKDYW 728

BLAST of Cla97C09G165200 vs. ExPASy TrEMBL
Match: A0A0A0K6A7 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G432370 PE=3 SV=1)

HSP 1 Score: 1333.9 bits (3451), Expect = 0.0e+00
Identity = 645/716 (90.08%), Positives = 675/716 (94.27%), Query Frame = 0

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLHHRHV   KQM TIA  SSA  SFSPPTHPLISLLETC+SMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHHRHVKKPKQMTTIAATSSALKSFSPPTHPLISLLETCESMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIK   NANPVLQN+VM+FCCTHE GD +YA  LFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKKGLNANPVLQNRVMTFCCTHEYGDFQYARRLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQYNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR V DVC KADVI WNM+ISAYNKVG+FEESRR+FL ME+KQVLPTT
Sbjct: 181 VQMYLLCGQLDTARGVFDVCPKADVITWNMIISAYNKVGKFEESRRLFLVMEDKQVLPTT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENA+IDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENAMIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYIDRNKI ND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYIDRNKIKNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAE IFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAESIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA+EVIENMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYEVIENMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYRE+DMAEMVVKQIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQM
Sbjct: 541 SIVWGALLAGCRVYRESDMAEMVVKQILELEPDNGAVYVLLCNIYAACKRWNDLRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSK 717
           FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSK
Sbjct: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSK 716

BLAST of Cla97C09G165200 vs. ExPASy TrEMBL
Match: A0A6J1GRB6 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930 OS=Cucurbita moschata OX=3662 GN=LOC111456329 PE=3 SV=1)

HSP 1 Score: 1308.9 bits (3386), Expect = 0.0e+00
Identity = 633/723 (87.55%), Positives = 674/723 (93.22%), Query Frame = 0

Query: 22  MATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFC 81
           MATIA   +AS   S  THPLISLLE C+SMDQLQQI C+AIKT    NPVLQN+VM+ C
Sbjct: 1   MATIA--CTASKPLSSTTHPLISLLEICESMDQLQQIDCRAIKTGLTPNPVLQNRVMAIC 60

Query: 82  CTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYT 141
           CT+ECGDLKYA HLFDE+PEPNLFIWNTMIRGY RLDSPELGVSLYLEMLRRG KPD YT
Sbjct: 61  CTYECGDLKYAPHLFDEMPEPNLFIWNTMIRGYXRLDSPELGVSLYLEMLRRGVKPDNYT 120

Query: 142 FPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCS 201
           FPFLFKGFTRDI+L+ G ELHGHVLK GL SNVFVHTALVQMYLLCG LD AR VLD  S
Sbjct: 121 FPFLFKGFTRDISLQCGSELHGHVLKHGLLSNVFVHTALVQMYLLCGLLDMARGVLDAGS 180

Query: 202 KADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQ 261
           KADVI+WNMMI+AYNKVG+ EESRR+FLGMEE+QVLPTTVTLVLILSACSKLKD KTGK 
Sbjct: 181 KADVISWNMMIAAYNKVGKLEESRRLFLGMEERQVLPTTVTLVLILSACSKLKDFKTGKH 240

Query: 262 VHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGE 321
           V S VNNCKVESNLVLENALIDMYA CGEMDSAL IFR+MNNKDIISWTTIVSGFTNLGE
Sbjct: 241 VRSCVNNCKVESNLVLENALIDMYAACGEMDSALEIFRNMNNKDIISWTTIVSGFTNLGE 300

Query: 322 IDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTA 381
           IDVARNYFD+MPEKD VSWTAMIDGY+R NRFKEA ELFR+MQA +VKPDEFTMVSILTA
Sbjct: 301 IDVARNYFDQMPEKDCVSWTAMIDGYLRMNRFKEAFELFRHMQAISVKPDEFTMVSILTA 360

Query: 382 CAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTW 441
           CA LGALELGEWI+TYIDRNKINNDAF RNALIDMYFKCGNVDKAER+FREM QRDKFTW
Sbjct: 361 CAQLGALELGEWIKTYIDRNKINNDAFFRNALIDMYFKCGNVDKAERVFREMHQRDKFTW 420

Query: 442 TAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTT 501
           TAMIVGLAVNGHGEKALDMFS+ML+ASILPD++TYIGVL+ACTHTGMV+KGRE+FLSMTT
Sbjct: 421 TAMIVGLAVNGHGEKALDMFSKMLEASILPDDVTYIGVLAACTHTGMVDKGREFFLSMTT 480

Query: 502 QHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAE 561
           QHGIEPNI HYGCLVDLLARAG LKEAHEVI+NMP++PNSIVWGALLAGCR +READMAE
Sbjct: 481 QHGIEPNITHYGCLVDLLARAGHLKEAHEVIKNMPIEPNSIVWGALLAGCRAHREADMAE 540

Query: 562 MVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTV 621
           MV KQIL+LEPENGAVYVLLCNIYAACKRWNDLR+LRQMMMDKGIKK PGCSLIEMNGTV
Sbjct: 541 MVAKQILELEPENGAVYVLLCNIYAACKRWNDLRDLRQMMMDKGIKKIPGCSLIEMNGTV 600

Query: 622 HEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLA 681
           HEFVAGDRSHPQT++IDVKL KMTQDLKFAGYSPD SEVFLDIAEEDKENSVFRHSEKLA
Sbjct: 601 HEFVAGDRSHPQTKEIDVKLEKMTQDLKFAGYSPDTSEVFLDIAEEDKENSVFRHSEKLA 660

Query: 682 IAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCK 741
           IAFGLINSPPGVTIRIVKNLRMC+DCH++AKL+SKVY+REVIVRDRTRFHHFKHGLCSCK
Sbjct: 661 IAFGLINSPPGVTIRIVKNLRMCLDCHSLAKLISKVYHREVIVRDRTRFHHFKHGLCSCK 720

Query: 742 DYW 745
           DYW
Sbjct: 721 DYW 721

BLAST of Cla97C09G165200 vs. TAIR 10
Match: AT3G15930.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 764.2 bits (1972), Expect = 9.5e-221
Identity = 357/640 (55.78%), Positives = 471/640 (73.59%), Query Frame = 0

Query: 28  ISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECG 87
           +S+ + S S      IS+L  CK+ DQ +Q+H Q+I      NP  Q K+  F C+   G
Sbjct: 23  MSTITESISNDYSRFISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGG 82

Query: 88  DLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFK 147
            + YA+ LF +IPEP++ +WN MI+G+S++D    GV LYL ML+ G  PD +TFPFL  
Sbjct: 83  HVSYAYKLFVKIPEPDVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLN 142

Query: 148 GFTRD-IALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVI 207
           G  RD  AL  G++LH HV+K GL SN++V  ALV+MY LCG +D AR V D   K DV 
Sbjct: 143 GLKRDGGALACGKKLHCHVVKFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVF 202

Query: 208 AWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYV 267
           +WN+MIS YN++ E+EES  + + ME   V PT+VTL+L+LSACSK+KD    K+VH YV
Sbjct: 203 SWNLMISGYNRMKEYEESIELLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYV 262

Query: 268 NNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVAR 327
           + CK E +L LENAL++ YA CGEMD A+ IFRSM  +D+ISWT+IV G+   G + +AR
Sbjct: 263 SECKTEPSLRLENALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLAR 322

Query: 328 NYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLG 387
            YFD+MP +D +SWT MIDGY+R+  F E+LE+FR MQ+  + PDEFTMVS+LTACAHLG
Sbjct: 323 TYFDQMPVRDRISWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLG 382

Query: 388 ALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIV 447
           +LE+GEWI+TYID+NKI ND  V NALIDMYFKCG  +KA+++F +M QRDKFTWTAM+V
Sbjct: 383 SLEIGEWIKTYIDKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVV 442

Query: 448 GLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIE 507
           GLA NG G++A+ +F QM   SI PD+ITY+GVLSAC H+GMV++ R++F  M + H IE
Sbjct: 443 GLANNGQGQEAIKVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIE 502

Query: 508 PNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQ 567
           P++ HYGC+VD+L RAG +KEA+E++  MPM PNSIVWGALL   R++ +  MAE+  K+
Sbjct: 503 PSLVHYGCMVDMLGRAGLVKEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKK 562

Query: 568 ILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVA 627
           IL+LEP+NGAVY LLCNIYA CKRW DLRE+R+ ++D  IKKTPG SLIE+NG  HEFVA
Sbjct: 563 ILELEPDNGAVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVA 622

Query: 628 GDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAE 667
           GD+SH Q+E+I +KL ++ Q+  FA Y PD SE+  +  +
Sbjct: 623 GDKSHLQSEEIYMKLEELAQESTFAAYLPDTSELLFEAGD 662

BLAST of Cla97C09G165200 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 650.6 bits (1677), Expect = 1.5e-186
Identity = 308/722 (42.66%), Positives = 474/722 (65.65%), Query Frame = 0

Query: 34  SFSPPTHPL--------ISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHE 93
           +FS P  P         ISL+E C S+ QL+Q H   I+T   ++P   +K+ +      
Sbjct: 17  NFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSS 76

Query: 94  CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRG-FKPDRYTFPF 153
              L+YA  +FDEIP+PN F WNT+IR Y+    P L +  +L+M+      P++YTFPF
Sbjct: 77  FASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPF 136

Query: 154 LFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKAD 213
           L K      +L  G+ LHG  +K  + S+VFV  +L+  Y  CG LD+A +V     + D
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKD 196

Query: 214 VIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHS 273
           V++WN MI+ + + G  +++  +F  ME + V  + VT+V +LSAC+K+++L+ G+QV S
Sbjct: 197 VVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCS 256

Query: 274 YVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDV 333
           Y+   +V  NL L NA++DMY  CG ++ A  +F +M  KD ++WTT++ G+    + + 
Sbjct: 257 YIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEA 316

Query: 334 ARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQ-ATNVKPDEFTMVSILTACA 393
           AR   + MP+KD V+W A+I  Y ++ +  EAL +F  +Q   N+K ++ T+VS L+ACA
Sbjct: 317 AREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACA 376

Query: 394 HLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTA 453
            +GALELG WI +YI ++ I  +  V +ALI MY KCG+++K+  +F  + +RD F W+A
Sbjct: 377 QVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSA 436

Query: 454 MIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQH 513
           MI GLA++G G +A+DMF +M +A++ P+ +T+  V  AC+HTG+V++    F  M + +
Sbjct: 437 MIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNY 496

Query: 514 GIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMV 573
           GI P   HY C+VD+L R+G L++A + IE MP+ P++ VWGALL  C+++   ++AEM 
Sbjct: 497 GIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMA 556

Query: 574 VKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHE 633
             ++L+LEP N   +VLL NIYA   +W ++ ELR+ M   G+KK PGCS IE++G +HE
Sbjct: 557 CTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHE 616

Query: 634 FVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEED-KENSVFRHSEKLAI 693
           F++GD +HP +EK+  KL+++ + LK  GY P++S+V   I EE+ KE S+  HSEKLAI
Sbjct: 617 FLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAI 676

Query: 694 AFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKD 745
            +GLI++     IR++KNLR+C DCH++AKL+S++Y+RE+IVRDR RFHHF++G CSC D
Sbjct: 677 CYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCND 736

BLAST of Cla97C09G165200 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 631.7 bits (1628), Expect = 7.4e-181
Identity = 303/751 (40.35%), Positives = 456/751 (60.72%), Query Frame = 0

Query: 33  SSFSPP-----THPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFC-CTHEC 92
           SS  PP      HP +SLL  CK++  L+ IH Q IK   +      +K++ FC  +   
Sbjct: 22  SSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHF 81

Query: 93  GDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLF 152
             L YA  +F  I EPNL IWNTM RG++    P   + LY+ M+  G  P+ YTFPF+ 
Sbjct: 82  EGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVL 141

Query: 153 KGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSK---- 212
           K   +  A + G+++HGHVLKLG   +++VHT+L+ MY+  G+L+ A +V D        
Sbjct: 142 KSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVV 201

Query: 213 ---------------------------ADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQ 272
                                       DV++WN MIS Y + G ++E+  +F  M +  
Sbjct: 202 SYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTN 261

Query: 273 VLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSAL 332
           V P   T+V ++SAC++   ++ G+QVH ++++    SNL + NALID+Y+ CGE+++A 
Sbjct: 262 VRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETAC 321

Query: 333 GIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKE 392
           G+F  +  KD+ISW T++ G+T++                               N +KE
Sbjct: 322 GLFERLPYKDVISWNTLIGGYTHM-------------------------------NLYKE 381

Query: 393 ALELFRNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDR--NKINNDAFVRNAL 452
           AL LF+ M  +   P++ TM+SIL ACAHLGA+++G WI  YID+    + N + +R +L
Sbjct: 382 ALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSL 441

Query: 453 IDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDE 512
           IDMY KCG+++ A ++F  +  +   +W AMI G A++G  + + D+FS+M K  I PD+
Sbjct: 442 IDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDD 501

Query: 513 ITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIE 572
           IT++G+LSAC+H+GM++ GR  F +MT  + + P + HYGC++DLL  +G  KEA E+I 
Sbjct: 502 ITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMIN 561

Query: 573 NMPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWND 632
            M M+P+ ++W +LL  C+++   ++ E   + ++ +EPEN   YVLL NIYA+  RWN+
Sbjct: 562 MMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNE 621

Query: 633 LRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGY 692
           + + R ++ DKG+KK PGCS IE++  VHEF+ GD+ HP+  +I   L +M   L+ AG+
Sbjct: 622 VAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGF 681

Query: 693 SPDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKL 745
            PD SEV  ++ EE KE ++  HSEKLAIAFGLI++ PG  + IVKNLR+C +CH   KL
Sbjct: 682 VPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKL 741

BLAST of Cla97C09G165200 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 594.7 bits (1532), Expect = 1.0e-169
Identity = 285/690 (41.30%), Positives = 432/690 (62.61%), Query Frame = 0

Query: 57  QIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSR 116
           QIH   +K  +  +  +QN ++ F    ECG+L  A  +FDE+ E N+  W +MI GY+R
Sbjct: 155 QIHGLIVKMGYAKDLFVQNSLVHFYA--ECGELDSARKVFDEMSERNVVSWTSMICGYAR 214

Query: 117 LDSPELGVSLYLEMLR-RGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVF 176
            D  +  V L+  M+R     P+  T   +     +   LE G +++  +   G++ N  
Sbjct: 215 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 274

Query: 177 VHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQ 236
           + +ALV MY+ C  +D A+R+ D    +++   N M S Y + G   E+  +F  M +  
Sbjct: 275 MVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSG 334

Query: 237 VLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSAL 296
           V P  ++++  +S+CS+L+++  GK  H YV     ES   + NALIDMY  C   D+A 
Sbjct: 335 VRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAF 394

Query: 297 GIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKE 356
            IF  M+NK +++W +IV+G+   GE+D A   F+ MPEK+ VSW  +I G ++ + F+E
Sbjct: 395 RIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEE 454

Query: 357 ALELFRNMQA-TNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALI 416
           A+E+F +MQ+   V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+
Sbjct: 455 AIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLV 514

Query: 417 DMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEI 476
           DM+ +CG+ + A  IF  +  RD   WTA I  +A+ G+ E+A+++F  M++  + PD +
Sbjct: 515 DMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGV 574

Query: 477 TYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIEN 536
            ++G L+AC+H G+V++G+E F SM   HG+ P   HYGC+VDLL RAG L+EA ++IE+
Sbjct: 575 AFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIED 634

Query: 537 MPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDL 596
           MPM+PN ++W +LLA CRV    +MA    ++I  L PE    YVLL N+YA+  RWND+
Sbjct: 635 MPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDM 694

Query: 597 RELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYS 656
            ++R  M +KG++K PG S I++ G  HEF +GD SHP+   I+  L++++Q     G+ 
Sbjct: 695 AKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 754

Query: 657 PDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLV 716
           PD+S V +D+ E++K   + RHSEKLA+A+GLI+S  G TIRIVKNLR+C DCH+ AK  
Sbjct: 755 PDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFA 814

Query: 717 SKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           SKVYNRE+I+RD  RFH+ + G CSC D+W
Sbjct: 815 SKVYNREIILRDNNRFHYIRQGKCSCGDFW 842

BLAST of Cla97C09G165200 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 590.9 bits (1522), Expect = 1.4e-168
Identity = 284/689 (41.22%), Positives = 431/689 (62.55%), Query Frame = 0

Query: 57  QIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSR 116
           QIH   +K  +  +  +QN ++ F    ECG+L  A  +FDE+ E N+  W +MI GY+R
Sbjct: 155 QIHGLIVKMGYAKDLFVQNSLVHFYA--ECGELDSARKVFDEMSERNVVSWTSMICGYAR 214

Query: 117 LDSPELGVSLYLEMLR-RGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVF 176
            D  +  V L+  M+R     P+  T   +     +   LE G +++  +   G++ N  
Sbjct: 215 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 274

Query: 177 VHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQ 236
           + +ALV MY+ C  +D A+R+ D    +++   N M S Y + G   E+  +F  M +  
Sbjct: 275 MVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSG 334

Query: 237 VLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSAL 296
           V P  ++++  +S+CS+L+++  GK  H YV     ES   + NALIDMY  C   D+A 
Sbjct: 335 VRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAF 394

Query: 297 GIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKE 356
            IF  M+NK +++W +IV+G+   GE+D A   F+ MPEK+ VSW  +I G ++ + F+E
Sbjct: 395 RIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEE 454

Query: 357 ALELFRNMQA-TNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALI 416
           A+E+F +MQ+   V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+
Sbjct: 455 AIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLV 514

Query: 417 DMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEI 476
           DM+ +CG+ + A  IF  +  RD   WTA I  +A+ G+ E+A+++F  M++  + PD +
Sbjct: 515 DMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGV 574

Query: 477 TYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIEN 536
            ++G L+AC+H G+V++G+E F SM   HG+ P   HYGC+VDLL RAG L+EA ++IE+
Sbjct: 575 AFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIED 634

Query: 537 MPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDL 596
           MPM+PN ++W +LLA CRV    +MA    ++I  L PE    YVLL N+YA+  RWND+
Sbjct: 635 MPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDM 694

Query: 597 RELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYS 656
            ++R  M +KG++K PG S I++ G  HEF +GD SHP+   I+  L++++Q     G+ 
Sbjct: 695 AKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 754

Query: 657 PDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLV 716
           PD+S V +D+ E++K   + RHSEKLA+A+GLI+S  G TIRIVKNLR+C DCH+ AK  
Sbjct: 755 PDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFA 814

Query: 717 SKVYNREVIVRDRTRFHHFKHGLCSCKDY 744
           SKVYNRE+I+RD  RFH+ + G CSC D+
Sbjct: 815 SKVYNREIILRDNNRFHYIRQGKCSCGDF 841
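
The percentages on each HSP summary line follow directly from the residue counts printed beside them (matching positions divided by alignment length). A minimal Python sketch for recomputing them, assuming the summary keeps the `label = a/b (p%)` layout shown in the alignments above; the helper name `parse_hsp_stats` is hypothetical and not part of any BLAST tool:

```python
import re

# Matches "<label> = <matches>/<length>" pairs such as "Identity = 285/690".
# Fields without a slash (e.g. "Query Frame = 0") are ignored.
PAIR = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line):
    """Return {label: (matches, length, percent)} for each a/b pair on the line."""
    stats = {}
    for label, matches, length in PAIR.findall(line):
        m, n = int(matches), int(length)
        stats[label] = (m, n, round(100.0 * m / n, 2))
    return stats


if __name__ == "__main__":
    # Summary line of the AT3G22690.2 hit, copied from the report.
    line = "Identity = 285/690 (41.30%), Positives = 432/690 (62.61%), Query Frame = 0"
    for label, (m, n, pct) in parse_hsp_stats(line).items():
        print(f"{label}: {m}/{n} = {pct:.2f}%")
```

For the AT3G22690.2 hit this reproduces the reported 41.30% identity and 62.61% positives, which is a quick consistency check when summary lines are copied between reports.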

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_031744195.1	0.0e+00	90.46	putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis sativus]...
KAA0058740.1	0.0e+00	90.32	putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] ...
XP_008461137.2	0.0e+00	90.19	PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
XP_038896377.1	0.0e+00	91.52	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15...
XP_022991386.1	0.0e+00	88.46	putative pentatricopeptide repeat-containing protein At3g15930 [Cucurbita maxima...

Match Name	E-value	Identity	Description
Q9LSB8	1.3e-219	55.78	Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th...
O82380	2.2e-185	42.66	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LN01	1.0e-179	40.35	Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q9LUJ2	1.4e-168	41.30	Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
O23337	4.2e-165	39.05	Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
A0A5A7UUL4	0.0e+00	90.32	Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa...
A0A1S3CDK0	0.0e+00	90.19	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15...
A0A6J1JQK8	0.0e+00	88.46	putative pentatricopeptide repeat-containing protein At3g15930 OS=Cucurbita maxi...
A0A0A0K6A7	0.0e+00	90.08	DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G4323...
A0A6J1GRB6	0.0e+00	87.55	LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15...

Match Name	E-value	Identity	Description
AT3G15930.1	9.5e-221	55.78	Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1	1.5e-186	42.66	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1	7.4e-181	40.35	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.2	1.0e-169	41.30	INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT3G22690.1	1.4e-168	41.22	CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 388..491; e-value: 6.6E-26; score: 92.8
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 263..387; e-value: 1.3E-30; score: 108.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 200..262; e-value: 4.2E-7; score: 31.7
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 13..199; e-value: 5.7E-22; score: 80.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 500..654; e-value: 6.3E-16; score: 60.7
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 301..596
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	coord: 174..599
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 411..436; e-value: 3.0E-6; score: 25.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 474..509; e-value: 3.1E-4; score: 18.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 307..337; e-value: 5.3E-4; score: 18.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 106..138; e-value: 2.1E-8; score: 31.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 338..372; e-value: 3.4E-9; score: 34.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 207..239; e-value: 1.0E-5; score: 23.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 439..473; e-value: 2.7E-7; score: 28.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 437..483; e-value: 1.9E-9; score: 37.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 102..150; e-value: 5.5E-11; score: 42.5
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 335..383; e-value: 1.2E-12; score: 47.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 511..536; e-value: 0.085; score: 13.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 279..305; e-value: 0.0054; score: 16.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 207..235; e-value: 1.2E-4; score: 22.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 411..436; e-value: 6.6E-6; score: 26.0
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 103..137; score: 12.024604
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 305..335; score: 8.911594
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 406..436; score: 10.095415
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 336..370; score: 12.846701
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 204..238; score: 11.542307
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 437..471; score: 10.98328
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 472..507; score: 8.747175
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 610..734; e-value: 8.4E-40; score: 135.6
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 200..317
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 320..738
None	No IPR available	PANTHER	PTHR47924:SF1	SUBFAMILY NOT NAMED	coord: 33..234
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 200..317
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 320..738
None	No IPR available	PANTHER	PTHR47924	PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN	coord: 33..234
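
Each `coord: a..b` entry in the InterPro listing gives the stretch of the protein covered by one domain match. A minimal Python sketch for pulling those ranges out of a record and computing their lengths, assuming the coordinates are 1-based and inclusive (an assumption about the report, not stated on this page); the helper name `span_lengths` is hypothetical:

```python
import re

# Matches "coord: <start>..<end>" as written in the InterPro listing.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' range in the text.

    Length is end - start + 1, i.e. both endpoints are counted
    (assumes 1-based inclusive coordinates).
    """
    return [(int(a), int(b), int(b) - int(a) + 1) for a, b in COORD.findall(text)]


if __name__ == "__main__":
    # The DYW domain match, with field values copied from the listing.
    record = "IPR032867 DYW domain PFAM PF14432 DYW_deaminase coord: 610..734"
    for start, end, length in span_lengths(record):
        print(f"{start}..{end} covers {length} residues")
```

Under that assumption the DYW_deaminase match at 610..734 spans 125 residues. The same function accepts a whole block of listing text, since it collects every range it finds.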

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C09G165200.1	Cla97C09G165200.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding