Cla97C10G191040 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C10G191040
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat
Location: Cla97Chr10: 9907712 .. 9909763 (+)
RNA-Seq Expression: Cla97C10G191040
Synteny: Cla97C10G191040
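
The Location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this record does not state explicitly); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Chrom: start .. end (strand)' into a dict.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cla97Chr10: 9907712 .. 9909763 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Cla97Chr10 + 2052
```

Under the inclusive-coordinate assumption, the gene spans 2052 bases on the forward strand of Cla97Chr10.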
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCACTTTACCGCTTCCTCCTCCGCTCTCTCCGCCGTTCTTCAACCTCTCCGTCACACTCCCGAGCTCTTACCGTTGGTCCTCTGAACCACCATTTTCAGGAGCCGATTTCACCTTCCTCTCAAAGTTCTTCTCCCATCTCGCTCCTCCATGCCCGCTCATTTTCCTTTTCCTCTGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTCTTCGTATTGAACCACCTCTCCATGCACTTCGTCGCGACAACTACCCACCCCCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGATTCCACATCCGCTCTTGTGGGGCCGCGTCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCTGGTGATCTTGATGCGGCTTCTGCGGTCGCTCGCCACTCTGTGTTCTCGAACACGCGGCCGACGGTTTTCACTTGTAACGCTATTATTGCTGCTATGTATCGAGCTAAAAGGTATAGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTATCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGCCGTGTTGATGTGGGTCTTGAGATTTATCGTCATATTATTGCGAACGCTCCGTTTAGTCCTTCGGCAGTGACTTATCGGCATTTGACTAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGTGAAATGTTGAATAAAGGGCACGGGGCTGATTCGTTGGTTTTTAATAATTTGATTTCCGGTTTTCTAAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGTTCATGGATTGGTTTTTTAATAGAGGGAAAGAAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAAATGATTCCAGCGACTTGCAATGTGCTGTTGGAGGTATTGCTTAGGCATGGGAAGAAAACGGAGGCTTGGACCTTATTCGATCAGATGTTGGATAACCACACTCCACCGAATTTCCAAGCAGTCAATTCAGATACGTTCAACATAATGGTTAATGAGTGCTTTAAGCTCGGCAAGTTCTCAGAGGCGGTAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCTATGGACGTTGCAGGGTATAATAATATCATTGCAAGGTTTTGTGAGCAGGGAATGATGACAGATGCAGAGACTTTCTTTGCTGAACTTTGCTCTAAGTCTTTGTCCCCTGATGTCCCAACTCATAGAACACTGATAGAATCTTATTTAAAGATTGGGCAGATTGATGATGTATTGAGAGTTTTTAACAGAATGGTCGATGTTGGTTTGAGAGTTGTTGCTAGCTTTGGAAATACGGTATTTGGTGAATTGATTAAGAATGGCAAGGCAATTGACTGTGCTCAGATTTTAACAAAAATGGGAGAGCGGGATCCTAAACCAGATCCCACATGCTATGACGTTGTGATTAGAGGGCTATGTAATGAAGGTGCGCTGGATGCTAGTCGGGAGTTGCTTGACCAGATAATGAGGTACGGTATTGGCCTCACTCCCACACTTCAGGAATTTGTTAAAGAGGCATTTGTAAAGGCTGGTCGGCATGAAGAGATTGAAAGACTGCTAAATATGAATAGATGGGGACATGTTTCGTATCGCCCCCCCTCCGGACCCTCAAGAATTTCACATTCGCAGGTACCACCTCAAATGGAACCAGCTCAAGGACCCCCTAAAATGGCAGAACCAAATTGGCGGCCTTCCATAAACCCTCAAGCCAGAGGAAGTTATGCCCCTTCATCACCTCAGATGTCGGGTCCTAGTTATTTTCAATCAGGATCGGCTCAAATGACAGGTCCTAATTATTTTCAATCAGGATCGGCTCAGATGACAAGACCGCAACAGCCTTCATCCGATCCATCCCCAATGGAAGA
ACAGCATCACTCACAACAACCTCCTCAAATGGCTGGCCAGGGAGTAGCTTAA

mRNA sequence

ATGTCACTTTACCGCTTCCTCCTCCGCTCTCTCCGCCGTTCTTCAACCTCTCCGTCACACTCCCGAGCTCTTACCGTTGGTCCTCTGAACCACCATTTTCAGGAGCCGATTTCACCTTCCTCTCAAAGTTCTTCTCCCATCTCGCTCCTCCATGCCCGCTCATTTTCCTTTTCCTCTGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTCTTCGTATTGAACCACCTCTCCATGCACTTCGTCGCGACAACTACCCACCCCCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGATTCCACATCCGCTCTTGTGGGGCCGCGTCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCTGGTGATCTTGATGCGGCTTCTGCGGTCGCTCGCCACTCTGTGTTCTCGAACACGCGGCCGACGGTTTTCACTTGTAACGCTATTATTGCTGCTATGTATCGAGCTAAAAGGTATAGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTATCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGCCGTGTTGATGTGGGTCTTGAGATTTATCGTCATATTATTGCGAACGCTCCGTTTAGTCCTTCGGCAGTGACTTATCGGCATTTGACTAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGTGAAATGTTGAATAAAGGGCACGGGGCTGATTCGTTGGTTTTTAATAATTTGATTTCCGGTTTTCTAAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGTTCATGGATTGGTTTTTTAATAGAGGGAAAGAAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAAATGATTCCAGCGACTTGCAATGTGCTGTTGGAGGTATTGCTTAGGCATGGGAAGAAAACGGAGGCTTGGACCTTATTCGATCAGATGTTGGATAACCACACTCCACCGAATTTCCAAGCAGTCAATTCAGATACGTTCAACATAATGGTTAATGAGTGCTTTAAGCTCGGCAAGTTCTCAGAGGCGGTAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCTATGGACGTTGCAGGGTATAATAATATCATTGCAAGGTTTTGTGAGCAGGGAATGATGACAGATGCAGAGACTTTCTTTGCTGAACTTTGCTCTAAGTCTTTGTCCCCTGATGTCCCAACTCATAGAACACTGATAGAATCTTATTTAAAGATTGGGCAGATTGATGATGTATTGAGAGTTTTTAACAGAATGGTCGATGTTGGTTTGAGAGTTGTTGCTAGCTTTGGAAATACGGTATTTGGTGAATTGATTAAGAATGGCAAGGCAATTGACTGTGCTCAGATTTTAACAAAAATGGGAGAGCGGGATCCTAAACCAGATCCCACATGCTATGACGTTGTGATTAGAGGGCTATGTAATGAAGGTGCGCTGGATGCTAGTCGGGAGTTGCTTGACCAGATAATGAGGTACGGTATTGGCCTCACTCCCACACTTCAGGAATTTGTTAAAGAGGCATTTGTAAAGGCTGGTCGGCATGAAGAGATTGAAAGACTGCTAAATATGAATAGATGGGGACATGTTTCGTATCGCCCCCCCTCCGGACCCTCAAGAATTTCACATTCGCAGGTACCACCTCAAATGGAACCAGCTCAAGGACCCCCTAAAATGGCAGAACCAAATTGGCGGCCTTCCATAAACCCTCAAGCCAGAGGAAGTTATGCCCCTTCATCACCTCAGATGTCGGGTCCTAGTTATTTTCAATCAGGATCGGCTCAAATGACAGGTCCTAATTATTTTCAATCAGGATCGGCTCAGATGACAAGACCGCAACAGCCTTCATCCGATCCATCCCCAATGGAAGA
ACAGCATCACTCACAACAACCTCCTCAAATGGCTGGCCAGGGAGTAGCTTAA

Coding sequence (CDS)

ATGTCACTTTACCGCTTCCTCCTCCGCTCTCTCCGCCGTTCTTCAACCTCTCCGTCACACTCCCGAGCTCTTACCGTTGGTCCTCTGAACCACCATTTTCAGGAGCCGATTTCACCTTCCTCTCAAAGTTCTTCTCCCATCTCGCTCCTCCATGCCCGCTCATTTTCCTTTTCCTCTGCCGAAGAAGCTGCTGCCGAAAGACGCCGTAGAAAGCGCCGTCTTCGTATTGAACCACCTCTCCATGCACTTCGTCGCGACAACTACCCACCCCCACAGCGTGATCCCAATGCTCCTCGTCTTCCTGATTCCACATCCGCTCTTGTGGGGCCGCGTCTTAACCTTCACAATCGTGTTCAATCCCTGATTCGTGCTGGTGATCTTGATGCGGCTTCTGCGGTCGCTCGCCACTCTGTGTTCTCGAACACGCGGCCGACGGTTTTCACTTGTAACGCTATTATTGCTGCTATGTATCGAGCTAAAAGGTATAGTGATGCGATTGCACTGTTTCAGTTCTTCTTTAACCAGTCGAATATAGTTCCCAATGTTGTATCGTATAATAATTTGATTAATGCTCATTGCGATGAGGGCCGTGTTGATGTGGGTCTTGAGATTTATCGTCATATTATTGCGAACGCTCCGTTTAGTCCTTCGGCAGTGACTTATCGGCATTTGACTAAGGGATTGATTGATTCTGGGAGGATTGGGGAGGCTGTGGATCTTCTGCGTGAAATGTTGAATAAAGGGCACGGGGCTGATTCGTTGGTTTTTAATAATTTGATTTCCGGTTTTCTAAATTTGGAGAATTTGGAGAAGGCGAATGAACTGTTTGATGAGTTGAAGGAGAGGTGTTTGGTGTATGATGGAGTTGTGAATGCTACGTTCATGGATTGGTTTTTTAATAGAGGGAAAGAAAAGGAGGCCATGGAATCGTACAAGTCATTGCTTGATAGGCAATTCAAAATGATTCCAGCGACTTGCAATGTGCTGTTGGAGGTATTGCTTAGGCATGGGAAGAAAACGGAGGCTTGGACCTTATTCGATCAGATGTTGGATAACCACACTCCACCGAATTTCCAAGCAGTCAATTCAGATACGTTCAACATAATGGTTAATGAGTGCTTTAAGCTCGGCAAGTTCTCAGAGGCGGTAGAGACTTTCCGGAAGGTGGGAACTCAACCAAAGTCAAGGCCTTTTGCTATGGACGTTGCAGGGTATAATAATATCATTGCAAGGTTTTGTGAGCAGGGAATGATGACAGATGCAGAGACTTTCTTTGCTGAACTTTGCTCTAAGTCTTTGTCCCCTGATGTCCCAACTCATAGAACACTGATAGAATCTTATTTAAAGATTGGGCAGATTGATGATGTATTGAGAGTTTTTAACAGAATGGTCGATGTTGGTTTGAGAGTTGTTGCTAGCTTTGGAAATACGGTATTTGGTGAATTGATTAAGAATGGCAAGGCAATTGACTGTGCTCAGATTTTAACAAAAATGGGAGAGCGGGATCCTAAACCAGATCCCACATGCTATGACGTTGTGATTAGAGGGCTATGTAATGAAGGTGCGCTGGATGCTAGTCGGGAGTTGCTTGACCAGATAATGAGGTACGGTATTGGCCTCACTCCCACACTTCAGGAATTTGTTAAAGAGGCATTTGTAAAGGCTGGTCGGCATGAAGAGATTGAAAGACTGCTAAATATGAATAGATGGGGACATGTTTCGTATCGCCCCCCCTCCGGACCCTCAAGAATTTCACATTCGCAGGTACCACCTCAAATGGAACCAGCTCAAGGACCCCCTAAAATGGCAGAACCAAATTGGCGGCCTTCCATAAACCCTCAAGCCAGAGGAAGTTATGCCCCTTCATCACCTCAGATGTCGGGTCCTAGTTATTTTCAATCAGGATCGGCTCAAATGACAGGTCCTAATTATTTTCAATCAGGATCGGCTCAGATGACAAGACCGCAACAGCCTTCATCCGATCCATCCCCAATGGAAGA
ACAGCATCACTCACAACAACCTCCTCAAATGGCTGGCCAGGGAGTAGCTTAA

Protein sequence

MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSAEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFGELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGLTPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQMEPAQGPPKMAEPNWRPSINPQARGSYAPSSPQMSGPSYFQSGSAQMTGPNYFQSGSAQMTRPQQPSSDPSPMEEQHHSQQPPQMAGQGVA
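
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only, assuming the standard nuclear genetic code; the `translate` function and the codon table construction are illustrative and are not taken from the database itself.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (including line breaks inside the sequence) is ignored,
    and unrecognized codons are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS listed above.
print(translate("ATGTCACTTTACCGCTTCCTCCTCCGCTCTCTCCGC"))  # MSLYRFLLRSLR
```

The printed fragment matches the first twelve residues of the protein sequence; passing the full CDS (both lines) to `translate` would be the way to compare the complete translation against the listed protein.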
Homology
BLAST of Cla97C10G191040 vs. NCBI nr
Match: XP_022984748.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 1249.6 bits (3232), Expect = 0.0e+00
Identity = 637/707 (90.10%), Positives = 651/707 (92.08%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS++L++GPLNHH   PI PSSQSSSPISLLHARSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVI+GLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQM----EPA 600
           TP LQEFVKEAFVKAGR EEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM     P 
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGPPKMAEPNWRPSINPQARGSYAPSSPQMSGP--------------------SYFQSGS 660
           QG P MAEP+WRPSINPQARGSYAPSSPQM+GP                      +   S
Sbjct: 601 QGHPPMAEPHWRPSINPQARGSYAPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSS 660

Query: 661 AQMTGPNYFQSGSAQMTRPQQPSSDPSPMEEQHHSQQPPQMAGQGVA 684
            QMTGPNYFQSGSAQMTRPQQP  DPSPMEEQHHSQQPPQ+AGQ VA
Sbjct: 661 PQMTGPNYFQSGSAQMTRPQQPPFDPSPMEEQHHSQQPPQIAGQTVA 707
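
The percentages in an HSP statistics line are simply the matched residue count divided by the alignment length. A minimal sketch of recomputing them from the raw counts follows; `parse_hsp_counts` is a hypothetical helper, and the regular expression assumes the `label = matched/total` layout used in the lines of this record.

```python
import re

def parse_hsp_counts(line):
    """Extract 'label = matched/total' pairs from an HSP statistics line.

    Returns {label: (matched, total, percent)}, with percent rounded to
    two decimals. Fields without a 'matched/total' value are skipped.
    """
    stats = {}
    for label, matched, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matched, total = int(matched), int(total)
        stats[label] = (matched, total, round(100 * matched / total, 2))
    return stats

stats = parse_hsp_counts(
    "Identity = 637/707 (90.10%), Positives = 651/707 (92.08%), Query Frame = 0"
)
for label, (matched, total, percent) in stats.items():
    print(f"{label}: {matched}/{total} = {percent:.2f}%")
```

For the hit above, 637 identical residues over an alignment length of 707 gives 90.10%, which agrees with the reported value.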

BLAST of Cla97C10G191040 vs. NCBI nr
Match: XP_023552663.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1240.3 bits (3208), Expect = 0.0e+00
Identity = 639/740 (86.35%), Positives = 651/740 (87.97%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS+AL++GPLNHH   PI PSSQSSSPISLLH RSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQALSIGPLNHHLLSPIPPSSQSSSPISLLHVRSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQMEP----- 600
           TPTLQEFVKEAFVKAGR EEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM P     
Sbjct: 541 TPTLQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 --------------------------------AQGPPKMAEPNWRPSINPQARGSYAPSS 660
                                            QG P MAEP+WRPSINPQA GSYAPSS
Sbjct: 601 QGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYAPSS 660

Query: 661 PQMSGP--------------------SYFQSGSAQMTGPNYFQSGSAQMTRPQQPSSDPS 684
           PQM+GP                      +   S QMTGPNYFQSGSAQMTRPQQPS DPS
Sbjct: 661 PQMTGPQGHTPMAEPHWRPSINPQAGGSYGPSSPQMTGPNYFQSGSAQMTRPQQPSFDPS 720

BLAST of Cla97C10G191040 vs. NCBI nr
Match: XP_022984746.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita maxima] >XP_022984747.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1236.5 bits (3198), Expect = 0.0e+00
Identity = 637/740 (86.08%), Positives = 651/740 (87.97%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS++L++GPLNHH   PI PSSQSSSPISLLHARSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVI+GLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQMEP----- 600
           TP LQEFVKEAFVKAGR EEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM P     
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 --------------------------------AQGPPKMAEPNWRPSINPQARGSYAPSS 660
                                            QG P MAEP+WRPSINPQARGSYAPSS
Sbjct: 601 QGHPPMAEPHWQPSINPQAGGSCAPSSPQMTGPQGHPPMAEPHWRPSINPQARGSYAPSS 660

Query: 661 PQMSGP--------------------SYFQSGSAQMTGPNYFQSGSAQMTRPQQPSSDPS 684
           PQM+GP                      +   S QMTGPNYFQSGSAQMTRPQQP  DPS
Sbjct: 661 PQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPNYFQSGSAQMTRPQQPPFDPS 720

BLAST of Cla97C10G191040 vs. NCBI nr
Match: XP_038905008.1 (pentatricopeptide repeat-containing protein At1g10270 [Benincasa hispida])

HSP 1 Score: 1231.1 bits (3184), Expect = 0.0e+00
Identity = 648/814 (79.61%), Positives = 656/814 (80.59%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYRFLLRSL RSSTSPS+SR LT+GPLNHHFQEPI P     SPISLLHARSF+FSSA
Sbjct: 1   MSLYRFLLRSLHRSSTSPSNSRTLTIGPLNHHFQEPIPP-----SPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRY DAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYGDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRI EAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIEEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNKVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGKA+DCAQILTKMGERDPKPDPTCYDVVIRGLCNEG LDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKAVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGELDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQMEPAQGPP 600
           TPTLQEFVKEAF KAGRHEEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM P QG P
Sbjct: 541 TPTLQEFVKEAFGKAGRHEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPQGSP 600

Query: 601 KMAEPNWRPSINPQARGSYAPSSP------------------------------------ 660
           +M EPNWRPSINPQARGSYAPSSP                                    
Sbjct: 601 QMTEPNWRPSINPQARGSYAPSSPQMSGSNYFQSGSDQMMRPQQPSSEPTQLGAPQGPRQ 660

Query: 661 ------------------------------------------------------------ 684
                                                                       
Sbjct: 661 MVEPNWQPSINPQARGSFAPSSPQLSDPSYFQTGSTQMTGPNYFQSGLAQMTRSQQPSYE 720

BLAST of Cla97C10G191040 vs. NCBI nr
Match: XP_023552661.1 (pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023552662.1 pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1229.2 bits (3179), Expect = 0.0e+00
Identity = 642/773 (83.05%), Positives = 653/773 (84.48%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS+AL++GPLNHH   PI PSSQSSSPISLLH RSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQALSIGPLNHHLLSPIPPSSQSSSPISLLHVRSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQM----EPA 600
           TPTLQEFVKEAFVKAGR EEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM     P 
Sbjct: 541 TPTLQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGPPKMAEPNWRPSINPQARGSYAPSSPQMSGP--------------------------- 660
           QG P MAEP+WRPSINPQA GSYAPSSPQM+GP                           
Sbjct: 601 QGHPPMAEPHWRPSINPQAGGSYAPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSS 660

Query: 661 ---------------------------SYFQS---------------------------- 684
                                      SY  S                            
Sbjct: 661 PQMTGPQGHPPMAEPHWRPSINPQAGGSYAPSSPQMTGPQGHTPMAEPHWRPSINPQAGG 720

BLAST of Cla97C10G191040 vs. ExPASy Swiss-Prot
Match: Q9SY69 (Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana OX=3702 GN=GRP23 PE=1 SV=1)

HSP 1 Score: 738.8 bits (1906), Expect = 5.5e-212
Identity = 385/635 (60.63%), Positives = 478/635 (75.28%), Query Frame = 0

Query: 53  RSFSFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSALVGPR 112
           R+ +FSSAEEAAAERRRRKRRLRIEPPLHALRRD + PPP+RDPNAPRLPDSTSALVG R
Sbjct: 86  RTMAFSSAEEAAAERRRRKRRLRIEPPLHALRRDPSAPPPKRDPNAPRLPDSTSALVGQR 145

Query: 113 LNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQF 172
           LNLHNRVQSLIRA DLDAAS +AR SVFSNTRPTVFTCNAIIAAMYRAKRYS++I+LFQ+
Sbjct: 146 LNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIAAMYRAKRYSESISLFQY 205

Query: 173 FFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS 232
           FF QSNIVPNVVSYN +INAHCDEG VD  LE+YRHI+ANAPF+PS+VTYRHLTKGL+ +
Sbjct: 206 FFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPFAPSSVTYRHLTKGLVQA 265

Query: 233 GRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVN 292
           GRIG+A  LLREML+KG  ADS V+NNLI G+L+L + +KA E FDELK +C VYDG+VN
Sbjct: 266 GRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVEFFDELKSKCTVYDGIVN 325

Query: 293 ATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLD 352
           ATFM+++F +G +KEAMESY+SLLD++F+M P T NVLLEV L+ GKK EAW LF++MLD
Sbjct: 326 ATFMEYWFEKGNDKEAMESYRSLLDKKFRMHPPTGNVLLEVFLKFGKKDEAWALFNEMLD 385

Query: 353 NHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIAR 412
           NH PPN  +VNSDT  IMVNECFK+G+FSEA+ TF+KVG++  S+PF MD  GY NI+ R
Sbjct: 386 NHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVGSKVTSKPFVMDYLGYCNIVTR 445

Query: 413 FCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVV 472
           FCEQGM+T+AE FFAE  S+SL  D P+HR +I++YLK  +IDD +++ +RMVDV LRVV
Sbjct: 446 FCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKAERIDDAVKMLDRMVDVNLRVV 505

Query: 473 ASFGNTVFGELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLD 532
           A FG  VFGELIKNGK  + A++LTKMGER+PKPDP+ YDVV+RGLC+  ALD +++++ 
Sbjct: 506 ADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIYDVVVRGLCDGDALDQAKDIVG 565

Query: 533 QIMRYGIGLTPTLQEFVKEAFVKAGRHEEIERLLN-----MNRWGHVSYRPPSGPSRISH 592
           +++R+ +G+T  L+EF+ E F KAGR EEIE++LN     +   G     PP  P+    
Sbjct: 566 EMIRHNVGVTTVLREFIIEVFEKAGRREEIEKILNSVARPVRNAGQSGNTPPRVPAVFGT 625

Query: 593 SQVPPQMEPAQGPPKMAEPNWRPSINPQARGSYAPSSPQMSGPSY----FQSGSAQMTGP 652
           +   PQ       P+   P     +     G    ++ Q +G +Y     Q+ S   T  
Sbjct: 626 TPAAPQQ------PRDRAPWTSQGVVHSNSGWANGTAGQTAGGAYKANNGQNPSWSNTSD 685

Query: 653 NYFQSGSAQMTRPQQPSS----DPSPMEEQHHSQQ 674
           N  Q   +  T  QQP S     P   ++Q  SQQ
Sbjct: 686 NQQQQSWSNQTAGQQPPSWSRQAPGYQQQQSWSQQ 714

BLAST of Cla97C10G191040 vs. ExPASy Swiss-Prot
Match: Q9M3A8 (Pentatricopeptide repeat-containing protein At3g49240, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=EMB1796 PE=1 SV=1)

HSP 1 Score: 343.2 bits (879), Expect = 6.8e-93
Identity = 202/530 (38.11%), Positives = 306/530 (57.74%), Query Frame = 0

Query: 46  PISLLHARSFSFSSAEEAAAERRRRKRRLRIEPPLHALRR-----DNYPPPQRDPNAPRL 105
           P   L  R  SF++ EEAAAERRRRKRRLR+EPP+++  R        P P ++PN P+L
Sbjct: 25  PQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPPVNSFNRSQQQQSQIPRPIQNPNIPKL 84

Query: 106 PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAK 165
           P+S SALVG RL+LHN +  LIR  DL+ A+   RHSV+SN RPT+FT N ++AA  R  
Sbjct: 85  PESVSALVGKRLDLHNHILKLIRENDLEEAALYTRHSVYSNCRPTIFTVNTVLAAQLRQA 144

Query: 166 RYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVT 225
           +Y  A+     F NQ+ I PN+++YN +  A+ D  + ++ LE Y+  I NAP +PS  T
Sbjct: 145 KYG-ALLQLHGFINQAGIAPNIITYNLIFQAYLDVRKPEIALEHYKLFIDNAPLNPSIAT 204

Query: 226 YRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELK 285
           +R L KGL+ +  + +A+++  +M  KG   D +V++ L+ G +   + +   +L+ ELK
Sbjct: 205 FRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPVVYSYLMMGCVKNSDADGVLKLYQELK 264

Query: 286 ERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL--DRQFKMIPATCNVLLEVLLRH 345
           E+    V DGVV    M  +F +  EKEAME Y+  +  + + +M     N +LE L  +
Sbjct: 265 EKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYEEAVGENSKVRMSAMAYNYVLEALSEN 324

Query: 346 GKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSR 405
           GK  EA  LFD +   H PP   AVN  TFN+MVN     GKF EA+E FR++G   K  
Sbjct: 325 GKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMVNGYCAGGKFEEAMEVFRQMG-DFKCS 384

Query: 406 PFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDV 465
           P   D   +NN++ + C+  ++ +AE  + E+  K++ PD  T+  L+++  K G+ID+ 
Sbjct: 385 P---DTLSFNNLMNQLCDNELLAEAEKLYGEMEEKNVKPDEYTYGLLMDTCFKEGKIDEG 444

Query: 466 LRVFNRMVDVGLRVVASFGNTVFGELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRG 525
              +  MV+  LR   +  N +  +LIK GK +D A+    M     K D   Y  ++R 
Sbjct: 445 AAYYKTMVESNLRPNLAVYNRLQDQLIKAGK-LDDAKSFFDMMVSKLKMDDEAYKFIMRA 504

Query: 526 LCNEGALDASRELLDQIMRYG-IGLTPTLQEFVKEAFVKAGRHEEIERLL 566
           L   G LD   +++D+++    + ++  LQEFVKE   K GR  ++E+L+
Sbjct: 505 LSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEELRKGGREGDLEKLM 548

BLAST of Cla97C10G191040 vs. ExPASy Swiss-Prot
Match: Q9LEX6 (Pentatricopeptide repeat-containing protein At3g60960, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g60960 PE=2 SV=2)

HSP 1 Score: 229.2 bits (583), Expect = 1.4e-58
Identity = 147/402 (36.57%), Positives = 223/402 (55.47%), Query Frame = 0

Query: 90  PPQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSV---FSNTRP 149
           P  RDP++ P+L P S S +    ++L  RV+++I   +LD AS ++R +V   F   R 
Sbjct: 27  PLGRDPSSLPKLDPVSISYIDSRPISLRYRVRAMIEMSNLDEASKLSRLAVLNGFLVDRD 86

Query: 150 TVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEI 209
           TVF CN++I AM  AKRY DAI+LF +FFN+S  +PN +S + +I AHCD+G VD  LE+
Sbjct: 87  TVFICNSVIGAMCSAKRYDDAISLFNYFFNESQTLPNTLSCDLIIKAHCDQGHVDDALEL 146

Query: 210 YRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFL 269
           YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +V++ LI GFL
Sbjct: 147 YRHILLDGRVAPGIETYMILAKALVDAKRFDEACVLARSM----SCCSFMVYDILIRGFL 206

Query: 270 NLENLEKANELFDELKERCLVYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLD 329
           ++ N  KA+++F+ELK       G        + N +FM+++F +GK++EAME   +L D
Sbjct: 207 DIGNFVKASQIFEELKGLDSKLPGREYHKANAIFNVSFMNYWFKQGKDEEAMEILANLED 266

Query: 330 RQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKL 389
            Q  + P   N +L+VL++HGKKTEAW LF +M+        +  +S+T +IM       
Sbjct: 267 AQV-LNPIVGNRVLQVLVKHGKKTEAWELFGEMI--------EICDSETVDIMSE----- 326

Query: 390 GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK----- 449
             FSE    F +           +    Y  +I   CE G ++DAE  FAE+ +      
Sbjct: 327 -YFSEKTVPFER-----------LRKTCYRKMIVSLCEHGKVSDAEKLFAEMFTDVDGGD 386

Query: 450 -SLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVA 473
             + PD+   R +I  Y+ +G++DD ++  N+M    LR +A
Sbjct: 387 LLVGPDLLIFRAMINGYVSVGRVDDAIKTLNKMRISNLRKLA 398

BLAST of Cla97C10G191040 vs. ExPASy Swiss-Prot
Match: Q9LEX5 (Pentatricopeptide repeat-containing protein At3g60980, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g60980 PE=2 SV=1)

HSP 1 Score: 226.9 bits (577), Expect = 7.1e-58
Identity = 144/380 (37.89%), Positives = 219/380 (57.63%), Query Frame = 0

Query: 117 RVQSLIR-AGDLDAASAVARHSVFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQFFF 176
           RV  LIR  GDLD A+  AR +VF++  +  T   C +II  M R KR  DA  L++FFF
Sbjct: 38  RVSYLIRCVGDLDTAAKYARLAVFTSIKSESTTTICQSIIGGMLRDKRLKDAYDLYEFFF 97

Query: 177 NQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFS--PSAVTYRHLTKGLIDS 236
           NQ N+ PN   +N +I +   +G V+  L  +   I +      PS  ++R LTKGL+ S
Sbjct: 98  NQHNLRPNSHCWNYIIESGFQQGLVNDALHFHHRCINSGQVHDYPSDDSFRILTKGLVHS 157

Query: 237 GRIGEAVDLLR-EMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLV----- 296
           GR+ +A   LR   +N+    D + +NNLI GFL+L N +KAN +  E K   L+     
Sbjct: 158 GRLDQAEAFLRGRTVNRTTYPDHVAYNNLIRGFLDLGNFKKANLVLGEFKRLFLIALSET 217

Query: 297 --------YDGVV---NATFMDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVL 356
                   Y+  V    ATFM+++F +GK+ EAME Y + +L  +  +   T N LL+VL
Sbjct: 218 KDDLHHSNYENRVAFLMATFMEYWFKQGKQVEAMECYNRCVLSNRLLVCAETGNALLKVL 277

Query: 357 LRHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQP 416
           L++G+K  AW L+ ++LD +       ++SDT  IMV+ECF +G FSEA+ET++K   +P
Sbjct: 278 LKYGEKKNAWALYHELLDKNGTGK-GCLDSDTIKIMVDECFDMGWFSEAMETYKK--ARP 337

Query: 417 KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLS-PDVPTHRTLIESYLKIGQ 473
           K+     D      II RFCE  M+++AE+ F +  +      DV T++T+I++Y+K G+
Sbjct: 338 KN-----DYLSDKYIITRFCENRMLSEAESVFVDSLADDFGYIDVNTYKTMIDAYVKAGR 397

BLAST of Cla97C10G191040 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 1.1e-34
Identity = 99/413 (23.97%), Positives = 182/413 (44.07%), Query Frame = 0

Query: 125 GDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVS 184
           G +  A A+    V    RP + T + +I  +    R S+A+ L      +    P+ V+
Sbjct: 154 GRVSEAVALVDRMVEMKQRPDLVTVSTLINGLCLKGRVSEALVLIDRMV-EYGFQPDEVT 213

Query: 185 YNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREM 244
           Y  ++N  C  G   + L+++R  +       S V Y  +   L   G   +A+ L  EM
Sbjct: 214 YGPVLNRLCKSGNSALALDLFRK-MEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEM 273

Query: 245 LNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFNRGKE 304
             KG  AD + +++LI G  N    +   ++  E+  R ++ D V  +  +D F   GK 
Sbjct: 274 EMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKL 333

Query: 305 KEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQAVNSD 364
            EA E Y  ++ R       T N L++   +     EA  +FD M+     P+       
Sbjct: 334 LEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIV----- 393

Query: 365 TFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETF 424
           T++I++N   K  +  + +  FR++     S+    +   YN ++  FC+ G +  A+  
Sbjct: 394 TYSILINSYCKAKRVDDGMRLFREI----SSKGLIPNTITYNTLVLGFCQSGKLNAAKEL 453

Query: 425 FAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFGELIK 484
           F E+ S+ + P V T+  L++     G+++  L +F +M    + +     N +   +  
Sbjct: 454 FQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCN 513

Query: 485 NGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYG 538
             K  D   +   + ++  KPD   Y+V+I GLC +G+L  +  L  ++   G
Sbjct: 514 ASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLSEADMLFRKMKEDG 555
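
The percentages in each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length. The following is a minimal sketch, not part of the original report, that parses such a line and recomputes both percentages so they can be checked against the reported values. The function names `parse_hsp_summary` and `percent` are hypothetical, and the pattern assumes the line layout used in this report.

```python
import re

# Layout of an HSP summary line as it appears in this report (assumption:
# other BLAST front ends may format the line differently). The optional "i"
# also accepts the "Postives" spelling emitted by some report versions.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, total):
    """Return count/total as a percentage rounded to two decimals."""
    return round(100.0 * count / total, 2)


def parse_hsp_summary(line):
    """Extract counts and percentages from one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_total, ident_pct, pos, pos_total, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_total),
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Recomputed from the raw counts, for comparison with the report.
        "identity_pct_recomputed": percent(int(ident), int(ident_total)),
        "positives_pct_recomputed": percent(int(pos), int(pos_total)),
    }


if __name__ == "__main__":
    # Summary line of the Q6NQ83 hit shown above.
    line = "Identity = 99/413 (23.97%), Positives = 182/413 (44.07%), Query Frame = 0"
    for key, value in parse_hsp_summary(line).items():
        print(key, value)
```

For the Q6NQ83 hit, 99 identical positions over an aligned length of 413 reproduce the reported 23.97%.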

BLAST of Cla97C10G191040 vs. ExPASy TrEMBL
Match: A0A6J1JBF6 (pentatricopeptide repeat-containing protein At1g10270-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482933 PE=4 SV=1)

HSP 1 Score: 1249.6 bits (3232), Expect = 0.0e+00
Identity = 637/707 (90.10%), Positives = 651/707 (92.08%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS++L++GPLNHH   PI PSSQSSSPISLLHARSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVI+GLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQM----EPA 600
           TP LQEFVKEAFVKAGR EEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM     P 
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 QGPPKMAEPNWRPSINPQARGSYAPSSPQMSGP--------------------SYFQSGS 660
           QG P MAEP+WRPSINPQARGSYAPSSPQM+GP                      +   S
Sbjct: 601 QGHPPMAEPHWRPSINPQARGSYAPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSS 660

Query: 661 AQMTGPNYFQSGSAQMTRPQQPSSDPSPMEEQHHSQQPPQMAGQGVA 684
            QMTGPNYFQSGSAQMTRPQQP  DPSPMEEQHHSQQPPQ+AGQ VA
Sbjct: 661 PQMTGPNYFQSGSAQMTRPQQPPFDPSPMEEQHHSQQPPQIAGQTVA 707

BLAST of Cla97C10G191040 vs. ExPASy TrEMBL
Match: A0A6J1J312 (pentatricopeptide repeat-containing protein At1g10270-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482933 PE=4 SV=1)

HSP 1 Score: 1236.5 bits (3198), Expect = 0.0e+00
Identity = 637/740 (86.08%), Positives = 651/740 (87.97%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS++L++GPLNHH   PI PSSQSSSPISLLHARSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQSLSIGPLNHHLLSPIPPSSQSSSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDV LEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVSLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMAD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVI+GLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIQGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQMEP----- 600
           TP LQEFVKEAFVKAGR EEIERLLNMNRWGH  YRPPSGP RIS SQVPPQM P     
Sbjct: 541 TPALQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRPPSGPPRISQSQVPPQMGPPRPPP 600

Query: 601 --------------------------------AQGPPKMAEPNWRPSINPQARGSYAPSS 660
                                            QG P MAEP+WRPSINPQARGSYAPSS
Sbjct: 601 QGHPPMAEPHWQPSINPQAGGSCAPSSPQMTGPQGHPPMAEPHWRPSINPQARGSYAPSS 660

Query: 661 PQMSGP--------------------SYFQSGSAQMTGPNYFQSGSAQMTRPQQPSSDPS 684
           PQM+GP                      +   S QMTGPNYFQSGSAQMTRPQQP  DPS
Sbjct: 661 PQMTGPQGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPNYFQSGSAQMTRPQQPPFDPS 720

BLAST of Cla97C10G191040 vs. ExPASy TrEMBL
Match: A0A6J1EMZ6 (pentatricopeptide repeat-containing protein At1g10270-like OS=Cucurbita moschata OX=3662 GN=LOC111436060 PE=4 SV=1)

HSP 1 Score: 1224.5 bits (3167), Expect = 0.0e+00
Identity = 634/740 (85.68%), Positives = 646/740 (87.30%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRALTVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSSA 60
           MSLYR LLRS RRSSTSPSHS+AL++GPLNHH   P  PSSQ SSPISLLHARSF+FSSA
Sbjct: 1   MSLYRLLLRSFRRSSTSPSHSQALSIGPLNHHLLSPFPPSSQ-SSPISLLHARSFAFSSA 60

Query: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120
           EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS
Sbjct: 61  EEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQS 120

Query: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180
           LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP
Sbjct: 121 LIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVP 180

Query: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240
           NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL
Sbjct: 181 NVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDL 240

Query: 241 LREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300
           LREMLNKGHGADSLV+NNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN
Sbjct: 241 LREMLNKGHGADSLVYNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFFN 300

Query: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQA 360
           RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLL+HGKKTEAWTLFDQMLDNHTPPNFQA
Sbjct: 301 RGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLKHGKKTEAWTLFDQMLDNHTPPNFQA 360

Query: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTD 420
           VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNII RFCEQGMM D
Sbjct: 361 VNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIGRFCEQGMMGD 420

Query: 421 AETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVFG 480
           AETFFAELCSKSLSPDVPTHRTLIE+YLKI QIDD LRVFNRMVDVGLRVVASFGN VFG
Sbjct: 421 AETFFAELCSKSLSPDVPTHRTLIEAYLKIEQIDDALRVFNRMVDVGLRVVASFGNRVFG 480

Query: 481 ELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540
           ELIKNGK +DCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL
Sbjct: 481 ELIKNGKVVDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIGL 540

Query: 541 TPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQMEP----- 600
           TPTLQEFVKEAFVKAGR EEIERLLNMNRWGH  YR PSGP RIS SQVPPQM P     
Sbjct: 541 TPTLQEFVKEAFVKAGRSEEIERLLNMNRWGHAPYRSPSGPPRISQSQVPPQMGPPHPPP 600

Query: 601 --------------------------------AQGPPKMAEPNWRPSINPQARGSYAPSS 660
                                            QG P MAEP+WRPSINPQA GSYAPSS
Sbjct: 601 QGHPPMAEPHWRPSINPQAGGSYGPSSPQMTGPQGHPPMAEPHWRPSINPQAGGSYAPSS 660

Query: 661 PQMSGP--------------------SYFQSGSAQMTGPNYFQSGSAQMTRPQQPSSDPS 684
           PQM+GP                      +   S QMTGP YFQSGSAQMTRP QP  DPS
Sbjct: 661 PQMTGPQGHTPMAEPHWRPSINPQARGSYGPSSPQMTGPKYFQSGSAQMTRPHQPPFDPS 720

BLAST of Cla97C10G191040 vs. ExPASy TrEMBL
Match: A0A5A7TJN2 (ACT11D09.4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold70G00300 PE=4 SV=1)

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 625/680 (91.91%), Positives = 643/680 (94.56%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRAL-TVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSS 60
           MS YRFLLRSLRRSSTSPS++ AL T+ PLNHH    I PSSQ+SSPISLL ARSFSFSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSYAPALTTIAPLNHH----IPPSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLVFNNLISGFLNL NL KANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLVKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GKEKEAMESYKSLLDRQFKM+PATCNVLLEVLL+H KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMT 420
           AVNSDTFNIMVNECFKLGKF+EAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMM 
Sbjct: 361 AVNSDTFNIMVNECFKLGKFTEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVF 480
           DAETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIG 540
           GELIKNGKA DCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD SRELLDQIMRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIMRYGIG 540

Query: 541 LTPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQM-EPAQG 600
           LTPTL+EFVK+AFVKAGRHEEIERLLNMN+WGH +YRPPSGP RIS SQVPPQM  P QG
Sbjct: 541 LTPTLEEFVKDAFVKAGRHEEIERLLNMNKWGHAAYRPPSGPPRISQSQVPPQMGRPLQG 600

Query: 601 PPKMAEPNWRPSINPQARGSYAPSSPQMSGPSYFQSGSAQMTGPNYFQSGSAQMTRPQQP 660
           PP+MAEPNWRPSINPQARGSY  SSPQMS PS+FQSG  QMTG NYFQSGSAQMT+PQ  
Sbjct: 601 PPQMAEPNWRPSINPQARGSY--SSPQMSSPSHFQSGPPQMTGSNYFQSGSAQMTKPQHS 660

Query: 661 SSDPSPMEEQHHSQQPPQMA 679
           S DP PMEE HHSQQPPQMA
Sbjct: 661 SFDPPPMEE-HHSQQPPQMA 673

BLAST of Cla97C10G191040 vs. ExPASy TrEMBL
Match: Q6E438 (ACT11D09.4 OS=Cucumis melo OX=3656 GN=ACT11D09.4 PE=4 SV=1)

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 625/680 (91.91%), Positives = 643/680 (94.56%), Query Frame = 0

Query: 1   MSLYRFLLRSLRRSSTSPSHSRAL-TVGPLNHHFQEPISPSSQSSSPISLLHARSFSFSS 60
           MS YRFLLRSLRRSSTSPS++ AL T+ PLNHH    I PSSQ+SSPISLL ARSFSFSS
Sbjct: 1   MSPYRFLLRSLRRSSTSPSYAPALTTIAPLNHH----IPPSSQTSSPISLLLARSFSFSS 60

Query: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120
           AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ
Sbjct: 61  AEEAAAERRRRKRRLRIEPPLHALRRDNYPPPQRDPNAPRLPDSTSALVGPRLNLHNRVQ 120

Query: 121 SLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180
           SLIRAGDLDAAS+VARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV
Sbjct: 121 SLIRAGDLDAASSVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIV 180

Query: 181 PNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVD 240
           PNVVSYNNLINAHCDEGRVDVGLE+YRHIIANAPFSPSAVTYRHLTKGLID+GRI EAVD
Sbjct: 181 PNVVSYNNLINAHCDEGRVDVGLEVYRHIIANAPFSPSAVTYRHLTKGLIDAGRIEEAVD 240

Query: 241 LLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVNATFMDWFF 300
           LLREMLNKGHGADSLVFNNLISGFLNL NL KANELFDELKERCLVYDGVVNATFMDWFF
Sbjct: 241 LLREMLNKGHGADSLVFNNLISGFLNLGNLVKANELFDELKERCLVYDGVVNATFMDWFF 300

Query: 301 NRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQ 360
           N+GKEKEAMESYKSLLDRQFKM+PATCNVLLEVLL+H KKTEAWTLFDQMLDNHTPPNFQ
Sbjct: 301 NQGKEKEAMESYKSLLDRQFKMVPATCNVLLEVLLKHEKKTEAWTLFDQMLDNHTPPNFQ 360

Query: 361 AVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMT 420
           AVNSDTFNIMVNECFKLGKF+EAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMM 
Sbjct: 361 AVNSDTFNIMVNECFKLGKFTEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMA 420

Query: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVASFGNTVF 480
           DAETFFAELCSKSLSPDVPTHRTLIESYLKI QIDD LRVFNRMVDVGLRVVASFGN VF
Sbjct: 421 DAETFFAELCSKSLSPDVPTHRTLIESYLKIEQIDDALRVFNRMVDVGLRVVASFGNMVF 480

Query: 481 GELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLDQIMRYGIG 540
           GELIKNGKA DCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALD SRELLDQIMRYGIG
Sbjct: 481 GELIKNGKAADCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDTSRELLDQIMRYGIG 540

Query: 541 LTPTLQEFVKEAFVKAGRHEEIERLLNMNRWGHVSYRPPSGPSRISHSQVPPQM-EPAQG 600
           LTPTL+EFVK+AFVKAGRHEEIERLLNMN+WGH +YRPPSGP RIS SQVPPQM  P QG
Sbjct: 541 LTPTLEEFVKDAFVKAGRHEEIERLLNMNKWGHAAYRPPSGPPRISQSQVPPQMGRPLQG 600

Query: 601 PPKMAEPNWRPSINPQARGSYAPSSPQMSGPSYFQSGSAQMTGPNYFQSGSAQMTRPQQP 660
           PP+MAEPNWRPSINPQARGSY  SSPQMS PS+FQSG  QMTG NYFQSGSAQMT+PQ  
Sbjct: 601 PPQMAEPNWRPSINPQARGSY--SSPQMSSPSHFQSGPPQMTGSNYFQSGSAQMTKPQHS 660

Query: 661 SSDPSPMEEQHHSQQPPQMA 679
           S DP PMEE HHSQQPPQMA
Sbjct: 661 SFDPPPMEE-HHSQQPPQMA 673

BLAST of Cla97C10G191040 vs. TAIR 10
Match: AT1G10270.1 (glutamine-rich protein 23)

HSP 1 Score: 738.8 bits (1906), Expect = 3.9e-213
Identity = 385/635 (60.63%), Positives = 478/635 (75.28%), Query Frame = 0

Query: 53  RSFSFSSAEEAAAERRRRKRRLRIEPPLHALRRD-NYPPPQRDPNAPRLPDSTSALVGPR 112
           R+ +FSSAEEAAAERRRRKRRLRIEPPLHALRRD + PPP+RDPNAPRLPDSTSALVG R
Sbjct: 86  RTMAFSSAEEAAAERRRRKRRLRIEPPLHALRRDPSAPPPKRDPNAPRLPDSTSALVGQR 145

Query: 113 LNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAKRYSDAIALFQF 172
           LNLHNRVQSLIRA DLDAAS +AR SVFSNTRPTVFTCNAIIAAMYRAKRYS++I+LFQ+
Sbjct: 146 LNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIAAMYRAKRYSESISLFQY 205

Query: 173 FFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTYRHLTKGLIDS 232
           FF QSNIVPNVVSYN +INAHCDEG VD  LE+YRHI+ANAPF+PS+VTYRHLTKGL+ +
Sbjct: 206 FFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPFAPSSVTYRHLTKGLVQA 265

Query: 233 GRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLVYDGVVN 292
           GRIG+A  LLREML+KG  ADS V+NNLI G+L+L + +KA E FDELK +C VYDG+VN
Sbjct: 266 GRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVEFFDELKSKCTVYDGIVN 325

Query: 293 ATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLD 352
           ATFM+++F +G +KEAMESY+SLLD++F+M P T NVLLEV L+ GKK EAW LF++MLD
Sbjct: 326 ATFMEYWFEKGNDKEAMESYRSLLDKKFRMHPPTGNVLLEVFLKFGKKDEAWALFNEMLD 385

Query: 353 NHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIAR 412
           NH PPN  +VNSDT  IMVNECFK+G+FSEA+ TF+KVG++  S+PF MD  GY NI+ R
Sbjct: 386 NHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVGSKVTSKPFVMDYLGYCNIVTR 445

Query: 413 FCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVV 472
           FCEQGM+T+AE FFAE  S+SL  D P+HR +I++YLK  +IDD +++ +RMVDV LRVV
Sbjct: 446 FCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKAERIDDAVKMLDRMVDVNLRVV 505

Query: 473 ASFGNTVFGELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRGLCNEGALDASRELLD 532
           A FG  VFGELIKNGK  + A++LTKMGER+PKPDP+ YDVV+RGLC+  ALD +++++ 
Sbjct: 506 ADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIYDVVVRGLCDGDALDQAKDIVG 565

Query: 533 QIMRYGIGLTPTLQEFVKEAFVKAGRHEEIERLLN-----MNRWGHVSYRPPSGPSRISH 592
           +++R+ +G+T  L+EF+ E F KAGR EEIE++LN     +   G     PP  P+    
Sbjct: 566 EMIRHNVGVTTVLREFIIEVFEKAGRREEIEKILNSVARPVRNAGQSGNTPPRVPAVFGT 625

Query: 593 SQVPPQMEPAQGPPKMAEPNWRPSINPQARGSYAPSSPQMSGPSY----FQSGSAQMTGP 652
           +   PQ       P+   P     +     G    ++ Q +G +Y     Q+ S   T  
Sbjct: 626 TPAAPQQ------PRDRAPWTSQGVVHSNSGWANGTAGQTAGGAYKANNGQNPSWSNTSD 685

Query: 653 NYFQSGSAQMTRPQQPSS----DPSPMEEQHHSQQ 674
           N  Q   +  T  QQP S     P   ++Q  SQQ
Sbjct: 686 NQQQQSWSNQTAGQQPPSWSRQAPGYQQQQSWSQQ 714
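
Each HSP reports a bit score and an E-value (Expect). In standard BLAST statistics the two are related by E = m * n * 2^(-S), where S is the bit score and m * n is the effective search space. The report does not state the search space, so the sketch below only back-calculates the value implied by a reported score and E-value under that relation. It is an illustrative assumption-laden check, not something taken from this page, and `implied_search_space` is a hypothetical helper name.

```python
import math


def implied_search_space(bit_score, e_value):
    """Return the m*n implied by E = m*n * 2**(-S) for one HSP.

    Assumes the standard bit-score/E-value relation; E-values printed as
    0.0e+00 carry no information and are rejected.
    """
    if e_value <= 0.0:
        raise ValueError("E-value must be positive to back-calculate m*n")
    # Work in log10 so that large bit scores do not overflow a float.
    log10_space = math.log10(e_value) + bit_score * math.log10(2.0)
    return 10.0 ** log10_space


if __name__ == "__main__":
    # AT1G10270.1 hit above: Score 738.8 bits, Expect 3.9e-213.
    print("%.3g" % implied_search_space(738.8, 3.9e-213))
    # Q6NQ83 (Swiss-Prot) hit: Score 149.8 bits, Expect 1.1e-34.
    print("%.3g" % implied_search_space(149.8, 1.1e-34))
```

Hits against different databases imply different search spaces, which is one reason E-values for the same protein differ between the Swiss-Prot and TAIR 10 sections.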

BLAST of Cla97C10G191040 vs. TAIR 10
Match: AT3G49240.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 343.2 bits (879), Expect = 4.8e-94
Identity = 202/530 (38.11%), Positives = 306/530 (57.74%), Query Frame = 0

Query: 46  PISLLHARSFSFSSAEEAAAERRRRKRRLRIEPPLHALRR-----DNYPPPQRDPNAPRL 105
           P   L  R  SF++ EEAAAERRRRKRRLR+EPP+++  R        P P ++PN P+L
Sbjct: 25  PQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPPVNSFNRSQQQQSQIPRPIQNPNIPKL 84

Query: 106 PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSVFSNTRPTVFTCNAIIAAMYRAK 165
           P+S SALVG RL+LHN +  LIR  DL+ A+   RHSV+SN RPT+FT N ++AA  R  
Sbjct: 85  PESVSALVGKRLDLHNHILKLIRENDLEEAALYTRHSVYSNCRPTIFTVNTVLAAQLRQA 144

Query: 166 RYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVT 225
           +Y  A+     F NQ+ I PN+++YN +  A+ D  + ++ LE Y+  I NAP +PS  T
Sbjct: 145 KYG-ALLQLHGFINQAGIAPNIITYNLIFQAYLDVRKPEIALEHYKLFIDNAPLNPSIAT 204

Query: 226 YRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELK 285
           +R L KGL+ +  + +A+++  +M  KG   D +V++ L+ G +   + +   +L+ ELK
Sbjct: 205 FRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPVVYSYLMMGCVKNSDADGVLKLYQELK 264

Query: 286 ERC--LVYDGVVNATFMDWFFNRGKEKEAMESYKSLL--DRQFKMIPATCNVLLEVLLRH 345
           E+    V DGVV    M  +F +  EKEAME Y+  +  + + +M     N +LE L  +
Sbjct: 265 EKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYEEAVGENSKVRMSAMAYNYVLEALSEN 324

Query: 346 GKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQPKSR 405
           GK  EA  LFD +   H PP   AVN  TFN+MVN     GKF EA+E FR++G   K  
Sbjct: 325 GKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMVNGYCAGGKFEEAMEVFRQMG-DFKCS 384

Query: 406 PFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLSPDVPTHRTLIESYLKIGQIDDV 465
           P   D   +NN++ + C+  ++ +AE  + E+  K++ PD  T+  L+++  K G+ID+ 
Sbjct: 385 P---DTLSFNNLMNQLCDNELLAEAEKLYGEMEEKNVKPDEYTYGLLMDTCFKEGKIDEG 444

Query: 466 LRVFNRMVDVGLRVVASFGNTVFGELIKNGKAIDCAQILTKMGERDPKPDPTCYDVVIRG 525
              +  MV+  LR   +  N +  +LIK GK +D A+    M     K D   Y  ++R 
Sbjct: 445 AAYYKTMVESNLRPNLAVYNRLQDQLIKAGK-LDDAKSFFDMMVSKLKMDDEAYKFIMRA 504

Query: 526 LCNEGALDASRELLDQIMRYG-IGLTPTLQEFVKEAFVKAGRHEEIERLL 566
           L   G LD   +++D+++    + ++  LQEFVKE   K GR  ++E+L+
Sbjct: 505 LSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEELRKGGREGDLEKLM 548

BLAST of Cla97C10G191040 vs. TAIR 10
Match: AT3G60960.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 229.2 bits (583), Expect = 1.0e-59
Identity = 147/402 (36.57%), Positives = 223/402 (55.47%), Query Frame = 0

Query: 90  PPQRDPNA-PRL-PDSTSALVGPRLNLHNRVQSLIRAGDLDAASAVARHSV---FSNTRP 149
           P  RDP++ P+L P S S +    ++L  RV+++I   +LD AS ++R +V   F   R 
Sbjct: 27  PLGRDPSSLPKLDPVSISYIDSRPISLRYRVRAMIEMSNLDEASKLSRLAVLNGFLVDRD 86

Query: 150 TVFTCNAIIAAMYRAKRYSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEI 209
           TVF CN++I AM  AKRY DAI+LF +FFN+S  +PN +S + +I AHCD+G VD  LE+
Sbjct: 87  TVFICNSVIGAMCSAKRYDDAISLFNYFFNESQTLPNTLSCDLIIKAHCDQGHVDDALEL 146

Query: 210 YRHIIANAPFSPSAVTYRHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFL 269
           YRHI+ +   +P   TY  L K L+D+ R  EA  L R M         +V++ LI GFL
Sbjct: 147 YRHILLDGRVAPGIETYMILAKALVDAKRFDEACVLARSM----SCCSFMVYDILIRGFL 206

Query: 270 NLENLEKANELFDELKERCLVYDG--------VVNATFMDWFFNRGKEKEAMESYKSLLD 329
           ++ N  KA+++F+ELK       G        + N +FM+++F +GK++EAME   +L D
Sbjct: 207 DIGNFVKASQIFEELKGLDSKLPGREYHKANAIFNVSFMNYWFKQGKDEEAMEILANLED 266

Query: 330 RQFKMIPATCNVLLEVLLRHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKL 389
            Q  + P   N +L+VL++HGKKTEAW LF +M+        +  +S+T +IM       
Sbjct: 267 AQV-LNPIVGNRVLQVLVKHGKKTEAWELFGEMI--------EICDSETVDIMSE----- 326

Query: 390 GKFSEAVETFRKVGTQPKSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK----- 449
             FSE    F +           +    Y  +I   CE G ++DAE  FAE+ +      
Sbjct: 327 -YFSEKTVPFER-----------LRKTCYRKMIVSLCEHGKVSDAEKLFAEMFTDVDGGD 386

Query: 450 -SLSPDVPTHRTLIESYLKIGQIDDVLRVFNRMVDVGLRVVA 473
             + PD+   R +I  Y+ +G++DD ++  N+M    LR +A
Sbjct: 387 LLVGPDLLIFRAMINGYVSVGRVDDAIKTLNKMRISNLRKLA 398

BLAST of Cla97C10G191040 vs. TAIR 10
Match: AT3G60980.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 226.9 bits (577), Expect = 5.0e-59
Identity = 144/380 (37.89%), Positives = 219/380 (57.63%), Query Frame = 0

Query: 117 RVQSLIR-AGDLDAASAVARHSVFSN--TRPTVFTCNAIIAAMYRAKRYSDAIALFQFFF 176
           RV  LIR  GDLD A+  AR +VF++  +  T   C +II  M R KR  DA  L++FFF
Sbjct: 38  RVSYLIRCVGDLDTAAKYARLAVFTSIKSESTTTICQSIIGGMLRDKRLKDAYDLYEFFF 97

Query: 177 NQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFS--PSAVTYRHLTKGLIDS 236
           NQ N+ PN   +N +I +   +G V+  L  +   I +      PS  ++R LTKGL+ S
Sbjct: 98  NQHNLRPNSHCWNYIIESGFQQGLVNDALHFHHRCINSGQVHDYPSDDSFRILTKGLVHS 157

Query: 237 GRIGEAVDLLR-EMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELKERCLV----- 296
           GR+ +A   LR   +N+    D + +NNLI GFL+L N +KAN +  E K   L+     
Sbjct: 158 GRLDQAEAFLRGRTVNRTTYPDHVAYNNLIRGFLDLGNFKKANLVLGEFKRLFLIALSET 217

Query: 297 --------YDGVV---NATFMDWFFNRGKEKEAMESY-KSLLDRQFKMIPATCNVLLEVL 356
                   Y+  V    ATFM+++F +GK+ EAME Y + +L  +  +   T N LL+VL
Sbjct: 218 KDDLHHSNYENRVAFLMATFMEYWFKQGKQVEAMECYNRCVLSNRLLVCAETGNALLKVL 277

Query: 357 LRHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQP 416
           L++G+K  AW L+ ++LD +       ++SDT  IMV+ECF +G FSEA+ET++K   +P
Sbjct: 278 LKYGEKKNAWALYHELLDKNGTGK-GCLDSDTIKIMVDECFDMGWFSEAMETYKK--ARP 337

Query: 417 KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSKSLS-PDVPTHRTLIESYLKIGQ 473
           K+     D      II RFCE  M+++AE+ F +  +      DV T++T+I++Y+K G+
Sbjct: 338 KN-----DYLSDKYIITRFCENRMLSEAESVFVDSLADDFGYIDVNTYKTMIDAYVKAGR 397

BLAST of Cla97C10G191040 vs. TAIR 10
Match: AT5G28380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 190.3 bits (482), Expect = 5.2e-48
Identity = 113/322 (35.09%), Positives = 177/322 (54.97%), Query Frame = 0

Query: 162 YSDAIALFQFFFNQSNIVPNVVSYNNLINAHCDEGRVDVGLEIYRHIIANAPFSPSAVTY 221
           Y +AI+LF +FFN+S  +PN++S N +I AHCD+G VD  LE+YRHI+ +   +P   TY
Sbjct: 88  YDEAISLFDYFFNESQTLPNMLSCNLIIKAHCDQGSVDHALELYRHILLDGSLAPGIETY 147

Query: 222 RHLTKGLIDSGRIGEAVDLLREMLNKGHGADSLVFNNLISGFLNLENLEKANELFDELK- 281
           R LTK L+ + R+ EA D++R M       D  V++ LI GFL+     +A+++F+ELK 
Sbjct: 148 RILTKALVGAKRLDEACDVVRSMSR----CDFAVYDILIRGFLDKGKFVRASQIFEELKG 207

Query: 282 -------ERCLVYDGVVNATFMDWFFNRGKEKEAMESYKSLLDRQFKMIPATCNVLLEVL 341
                          + N +FMD++F +GK++EAME + +L   +  +   + N +L+ L
Sbjct: 208 PNSKLPWRNYHKAIAIFNVSFMDYWFKQGKDEEAMEIFATLEHAEL-LNTISGNGVLKCL 267

Query: 342 LRHGKKTEAWTLFDQMLDNHTPPNFQAVNSDTFNIMVNECFKLGKFSEAVETFRKVGTQP 401
           + HG+KTEAW LF  M+        +  +S+T  I+++   K G F E    F +V    
Sbjct: 268 VEHGRKTEAWELFLDMI--------EICDSETVGIIMS---KEGFFGEKTIPFERVRR-- 327

Query: 402 KSRPFAMDVAGYNNIIARFCEQGMMTDAETFFAELCSK------SLSPDVPTHRTLIESY 461
                      Y  +IA  C+QG M +AE  FA++ +          PDV T R +I  Y
Sbjct: 328 ---------TCYTRMIASLCQQGNMLEAEKLFADMFADVDGDDLLAGPDVSTFRAMINGY 382

Query: 462 LKIGQIDDVLRVFNRMVDVGLR 470
           +K+G++DD ++  N+M    LR
Sbjct: 388 VKVGRVDDAIKTLNKMKISNLR 382

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022984748.1 | 0.0e+00 | 90.10 | pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita...
XP_023552663.1 | 0.0e+00 | 86.35 | pentatricopeptide repeat-containing protein At1g10270-like isoform X2 [Cucurbita...
XP_022984746.1 | 0.0e+00 | 86.08 | pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita...
XP_038905008.1 | 0.0e+00 | 79.61 | pentatricopeptide repeat-containing protein At1g10270 [Benincasa hispida]
XP_023552661.1 | 0.0e+00 | 83.05 | pentatricopeptide repeat-containing protein At1g10270-like isoform X1 [Cucurbita...

Match Name | E-value | Identity (%) | Description
Q9SY69 | 5.5e-212 | 60.63 | Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana OX...
Q9M3A8 | 6.8e-93 | 38.11 | Pentatricopeptide repeat-containing protein At3g49240, mitochondrial OS=Arabidop...
Q9LEX6 | 1.4e-58 | 36.57 | Pentatricopeptide repeat-containing protein At3g60960, mitochondrial OS=Arabidop...
Q9LEX5 | 7.1e-58 | 37.89 | Pentatricopeptide repeat-containing protein At3g60980, mitochondrial OS=Arabidop...
Q6NQ83 | 1.1e-34 | 23.97 | Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...

Match Name | E-value | Identity (%) | Description
A0A6J1JBF6 | 0.0e+00 | 90.10 | pentatricopeptide repeat-containing protein At1g10270-like isoform X2 OS=Cucurbi...
A0A6J1J312 | 0.0e+00 | 86.08 | pentatricopeptide repeat-containing protein At1g10270-like isoform X1 OS=Cucurbi...
A0A6J1EMZ6 | 0.0e+00 | 85.68 | pentatricopeptide repeat-containing protein At1g10270-like OS=Cucurbita moschata...
A0A5A7TJN2 | 0.0e+00 | 91.91 | ACT11D09.4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold70G00300 PE=4...
Q6E438 | 0.0e+00 | 91.91 | ACT11D09.4 OS=Cucumis melo OX=3656 GN=ACT11D09.4 PE=4 SV=1

Match Name | E-value | Identity (%) | Description
AT1G10270.1 | 3.9e-213 | 60.63 | glutamine-rich protein 23
AT3G49240.1 | 4.8e-94 | 38.11 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G60960.1 | 1.0e-59 | 36.57 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G60980.1 | 5.0e-59 | 37.89 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G28380.1 | 5.2e-48 | 35.09 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 110..306; e-value: 5.6E-34; score: 119.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 475..577; e-value: 1.0E-7; score: 33.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 307..474; e-value: 1.8E-25; score: 92.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 219..248; e-value: 0.01; score: 16.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 365..388; e-value: 0.012; score: 15.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 509..538; e-value: 0.21; score: 11.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 325..352; e-value: 0.021; score: 15.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 255..282; e-value: 5.9E-4; score: 19.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 401..445; e-value: 4.6E-8; score: 33.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 325..357; e-value: 6.6E-5; score: 20.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 219..252; e-value: 0.0018; score: 16.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 255..287; e-value: 8.5E-4; score: 17.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 405..437; e-value: 1.2E-4; score: 20.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 183..210; e-value: 4.3E-4; score: 18.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 439..468; e-value: 8.2E-5; score: 20.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 177..200; e-value: 9.9E-7; score: 28.4
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 322..356; score: 9.470621
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 181..216; score: 9.350046
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 506..540; score: 9.788499
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 436..470; score: 10.490022
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 252..282; score: 8.977363
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 217..251; score: 9.711769
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 401..435; score: 9.711769
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 66..106
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 611..683
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 576..683
None | No IPR available | PANTHER | PTHR47937:SF2 | PENTATRICOPEPTIDE (PPR) REPEAT-CONTAINING PROTEIN, PF01535'-RELATED | coord: 1..664
None | No IPR available | PANTHER | PTHR47937 | PLASTID TRANSCRIPTIONALLY ACTIVE CHROMOSOME 2-LIKE PROTEIN | coord: 1..664
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 155..388
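
The PROSITE PS51375 coordinates above can be checked against the nominal PPR motif length of about 35 residues. The sketch below is an illustrative aid rather than part of the annotation pipeline. It computes the inclusive length of each hit and lists hits that follow one another without a gap. The coordinates are copied from the table, and the helper names `span_length`, `hits_with_lengths` and `tandem_pairs` are hypothetical.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table,
# as 1-based inclusive (start, end) pairs on the protein sequence.
PS51375_HITS = [
    (322, 356), (181, 216), (506, 540), (436, 470),
    (252, 282), (217, 251), (401, 435),
]


def span_length(start, end):
    """Length of a 1-based inclusive coordinate range."""
    return end - start + 1


def hits_with_lengths(hits):
    """Return (start, end, length) tuples ordered by start position."""
    return [(s, e, span_length(s, e)) for s, e in sorted(hits)]


def tandem_pairs(hits):
    """Return consecutive hit pairs where the second starts right after the first."""
    ordered = sorted(hits)
    return [
        (a, b)
        for a, b in zip(ordered, ordered[1:])
        if b[0] == a[1] + 1
    ]


if __name__ == "__main__":
    for start, end, length in hits_with_lengths(PS51375_HITS):
        print("%d..%d  length %d" % (start, end, length))
    for first, second in tandem_pairs(PS51375_HITS):
        print("tandem: %d..%d followed by %d..%d" % (first + second))
```

Most of the seven hits span 35 residues, and the hits at 181..216, 217..251 and 252..282 form an uninterrupted run, as expected for tandemly arranged repeats.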

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C10G191040.1 | Cla97C10G191040.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0009507 | chloroplast
molecular_function | GO:0005515 | protein binding