Cla97C01G011030 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C01G011030
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr01: 17749995 .. 17752171 (+)
RNA-Seq Expression: Cla97C01G011030
Synteny: Cla97C01G011030
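
The Location field above can be read programmatically. The following is a minimal sketch that splits such a string into sequence ID, start, end, and strand, and computes the span length. It assumes the coordinates are 1-based and inclusive (an assumption, not stated on this page); `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(loc: str):
    """Split a 'SeqID: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr01: 17749995 .. 17752171 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span works out to 2177 bp on the forward strand.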
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTCTACGTAGGTATTTTAATAGCTGTATTGGAAAATACGGGCTGTTACAGCTTTCTCATCCTAATCTCAAACTATTAAGATGTTTCCTCTCTGCTGCAAACTACGGCACAGGTTCGGCTCATTGTGCCTTCACAGGGTCAGATACGGAGGAAAGCCGAGACTGGAACGCCTTCGCCGCTGCCCCTTTCAACGGCGTCCTCCAAGATGAAGACCTTCTTCGCAAGACCCACATTTCTTCATCTGAAACTTCAACCAATTCTACTGGGCTCTATGTTCTGGACCTCATCAACCGTGGCTATTTAGAACCTGAACGAACCCTTTATAGTAAGATGCTAAAAGAATGCACCCGCTTGCGCAAACTCAAGCAGGGCAGAGCCATTCACGCCCACATACAGGGTTCTACGTTTGAGAATGATCCGGTACTTCTAAATTTTATCTTAAACATGTATGCGAAATGTGGCAGTCTTGAGGAGGCACAGAACCTGTTTGATAAAATGCCTACAAGAGACATGGTTAGTTGGACTGTGCTGATCAGTGGCTATTCTCAGAGTGGGCGAGCGTCTGAAGCTCTTGCTTTGTTCCCTAAGATGCTCCACCTGGGCTTCCAACCCAATGAGTTCACCTTGTCCAGTCTGTTGAAGGCTTCTGGAGCTGGCCCTAGTGATGACCATGGCAGGCAACTTCATGCATTGTCCCTCAAATATGGCTATGATATGAATGTTCACGTGGGAAGTTCATTGCTCGATATGTATGCTAGGTGGGGGCATATGCAAGAAGCCAAAGTGATTTTTAACAGCCTGGCTGCAAAAAATGTGGTGTCTTGGAATGCTCTGATTGCTGGTCATGCTCGGAAGGGTGAAGGGGAGCATGTGATGAGGCTGTTTTGGCAGATGCTGAGACAGGATTTCGAACCTACACATTTTACATACTCTAGTGTTTTTACTGCTTGTGCCAGCTCTGGATCTTTGGAGCAAGGCAAATGGGTTCATGCCCATGTAATAAAATCTGGCGGACAGCCCATTGCTTATATTGGAAACACTCTTATTGACATGTATGCTAAATCAGGCAGCATCAAAGATGCAAAGAAGGTGTTCCAGAGGTTGGTTAAACAGGATGTAGTTTCATGGAACTCGATTATATCTGGATATGCGCAACACGGAATGGGAGCTGAAGCTTTACAGCTATTTGAGGAGATGCTGAAGGCCAAAGTTCAACCTAATGAAATTACTTTTCTCTCTGTTCTTACTGCTTGTAGTCATTCCGGACTTCTGCATGAAGGACGATATTATCTTGAACTGATGAAGAAATACGAGATAGAACCACAGGTTGCACATCATGTGACCGTAGTTGATCTTTTAGGTCGAGCAGGGCGACTTAACGAAGCCTACAAGTTCATAGAGGAAATGCCTATCAAACCTACTGCAGCTGTCTGGGGAGCCTTGCTTGGTGCTTGCAGGATGCACAAGAATATGAATTTGGGTGTTTATGCGGCCGAACGGATTTTCGAGCTTGACCCTCATGACTCAGGTCCTCATGTACTTTTGTCTAATATTTATGCTTCTGCTGGTAGACTGAATGATGCTGCAAATGTAAGGAAGATGATGAAAGAGAGTGGGGTAAAGAAAGAACCTGCCTGTAGTTGGGTTGAAATTGAGAATGAAGTCCATATGTTTGTGGCGAATGATGATTCACATCCAATGAGAGCAGAAATCCAGAGGATGTGGGAGAAAATAAGTGGGAAAATTAAAGAGATTGGGTATGTGCCAGACACAAGCCATGTGCTTTTCTTCATGGATCAGCAGGACAGAGAAGCAAAGCTACAATACCATAGTGAGAAGTTAGCATTAGCATTTTCAGTCTTGAAAACTCCTCCTGGATTAACCATTAGGATTAAGAAGAACATTAGAATATGTGGTGACTGCCATTCTGCATTCAAGTTTGCTTCAAAAGTCTTGGAAAGAGAAATCATCGTAAGAGACACCAATAGATTTCACCA
TTTCCTTCATGGCTTGTGTTCTTGTAGGGACTATTGGTAGCCTATTACTTCTTCATTACTAAGATTTCGCCAGTAGATTTGAGATGTTCAGATATAGCTTTCAATGGCTTGAGTTCAAACTTACCAAGAAGAAGCCCAATACTTAAATGTTGGTTGAAGGAAGGTTCTCTTATCTAA

mRNA sequence

ATGTTTCTACGTAGGTATTTTAATAGCTGTATTGGAAAATACGGGCTGTTACAGCTTTCTCATCCTAATCTCAAACTATTAAGATGTTTCCTCTCTGCTGCAAACTACGGCACAGGTTCGGCTCATTGTGCCTTCACAGGGTCAGATACGGAGGAAAGCCGAGACTGGAACGCCTTCGCCGCTGCCCCTTTCAACGGCGTCCTCCAAGATGAAGACCTTCTTCGCAAGACCCACATTTCTTCATCTGAAACTTCAACCAATTCTACTGGGCTCTATGTTCTGGACCTCATCAACCGTGGCTATTTAGAACCTGAACGAACCCTTTATAGTAAGATGCTAAAAGAATGCACCCGCTTGCGCAAACTCAAGCAGGGCAGAGCCATTCACGCCCACATACAGGGTTCTACGTTTGAGAATGATCCGGTACTTCTAAATTTTATCTTAAACATGTATGCGAAATGTGGCAGTCTTGAGGAGGCACAGAACCTGTTTGATAAAATGCCTACAAGAGACATGGTTAGTTGGACTGTGCTGATCAGTGGCTATTCTCAGAGTGGGCGAGCGTCTGAAGCTCTTGCTTTGTTCCCTAAGATGCTCCACCTGGGCTTCCAACCCAATGAGTTCACCTTGTCCAGTCTGTTGAAGGCTTCTGGAGCTGGCCCTAGTGATGACCATGGCAGGCAACTTCATGCATTGTCCCTCAAATATGGCTATGATATGAATGTTCACGTGGGAAGTTCATTGCTCGATATGTATGCTAGGTGGGGGCATATGCAAGAAGCCAAAGTGATTTTTAACAGCCTGGCTGCAAAAAATGTGGTGTCTTGGAATGCTCTGATTGCTGGTCATGCTCGGAAGGGTGAAGGGGAGCATGTGATGAGGCTGTTTTGGCAGATGCTGAGACAGGATTTCGAACCTACACATTTTACATACTCTAGTGTTTTTACTGCTTGTGCCAGCTCTGGATCTTTGGAGCAAGGCAAATGGGTTCATGCCCATGTAATAAAATCTGGCGGACAGCCCATTGCTTATATTGGAAACACTCTTATTGACATGTATGCTAAATCAGGCAGCATCAAAGATGCAAAGAAGGTGTTCCAGAGGTTGGTTAAACAGGATGTAGTTTCATGGAACTCGATTATATCTGGATATGCGCAACACGGAATGGGAGCTGAAGCTTTACAGCTATTTGAGGAGATGCTGAAGGCCAAAGTTCAACCTAATGAAATTACTTTTCTCTCTGTTCTTACTGCTTGTAGTCATTCCGGACTTCTGCATGAAGGACGATATTATCTTGAACTGATGAAGAAATACGAGATAGAACCACAGGTTGCACATCATGTGACCGTAGTTGATCTTTTAGGTCGAGCAGGGCGACTTAACGAAGCCTACAAGTTCATAGAGGAAATGCCTATCAAACCTACTGCAGCTGTCTGGGGAGCCTTGCTTGGTGCTTGCAGGATGCACAAGAATATGAATTTGGGTGTTTATGCGGCCGAACGGATTTTCGAGCTTGACCCTCATGACTCAGGTCCTCATGTACTTTTGTCTAATATTTATGCTTCTGCTGGTAGACTGAATGATGCTGCAAATGTAAGGAAGATGATGAAAGAGAGTGGGGTAAAGAAAGAACCTGCCTGTAGTTGGGTTGAAATTGAGAATGAAGTCCATATGTTTGTGGCGAATGATGATTCACATCCAATGAGAGCAGAAATCCAGAGGATGTGGGAGAAAATAAGTGGGAAAATTAAAGAGATTGGGTATGTGCCAGACACAAGCCATGTGCTTTTCTTCATGGATCAGCAGGACAGAGAAGCAAAGCTACAATACCATAGTGAGAAGTTAGCATTAGCATTTTCAGTCTTGAAAACTCCTCCTGGATTAACCATTAGGATTAAGAAGAACATTAGAATATGTGGTGACTGCCATTCTGCATTCAAGTTTGCTTCAAAAGTCTTGGAAAGAGAAATCATCGGACTATTGGTAGCCTATTACTTCTT
CATTACTAAGATTTCGCCAGTAGATTTGAGATGTTCAGATATAGCTTTCAATGGCTTGAGTTCAAACTTACCAAGAAGAAGCCCAATACTTAAATGTTGGTTGAAGGAAGGTTCTCTTATCTAA

Coding sequence (CDS)

ATGTTTCTACGTAGGTATTTTAATAGCTGTATTGGAAAATACGGGCTGTTACAGCTTTCTCATCCTAATCTCAAACTATTAAGATGTTTCCTCTCTGCTGCAAACTACGGCACAGGTTCGGCTCATTGTGCCTTCACAGGGTCAGATACGGAGGAAAGCCGAGACTGGAACGCCTTCGCCGCTGCCCCTTTCAACGGCGTCCTCCAAGATGAAGACCTTCTTCGCAAGACCCACATTTCTTCATCTGAAACTTCAACCAATTCTACTGGGCTCTATGTTCTGGACCTCATCAACCGTGGCTATTTAGAACCTGAACGAACCCTTTATAGTAAGATGCTAAAAGAATGCACCCGCTTGCGCAAACTCAAGCAGGGCAGAGCCATTCACGCCCACATACAGGGTTCTACGTTTGAGAATGATCCGGTACTTCTAAATTTTATCTTAAACATGTATGCGAAATGTGGCAGTCTTGAGGAGGCACAGAACCTGTTTGATAAAATGCCTACAAGAGACATGGTTAGTTGGACTGTGCTGATCAGTGGCTATTCTCAGAGTGGGCGAGCGTCTGAAGCTCTTGCTTTGTTCCCTAAGATGCTCCACCTGGGCTTCCAACCCAATGAGTTCACCTTGTCCAGTCTGTTGAAGGCTTCTGGAGCTGGCCCTAGTGATGACCATGGCAGGCAACTTCATGCATTGTCCCTCAAATATGGCTATGATATGAATGTTCACGTGGGAAGTTCATTGCTCGATATGTATGCTAGGTGGGGGCATATGCAAGAAGCCAAAGTGATTTTTAACAGCCTGGCTGCAAAAAATGTGGTGTCTTGGAATGCTCTGATTGCTGGTCATGCTCGGAAGGGTGAAGGGGAGCATGTGATGAGGCTGTTTTGGCAGATGCTGAGACAGGATTTCGAACCTACACATTTTACATACTCTAGTGTTTTTACTGCTTGTGCCAGCTCTGGATCTTTGGAGCAAGGCAAATGGGTTCATGCCCATGTAATAAAATCTGGCGGACAGCCCATTGCTTATATTGGAAACACTCTTATTGACATGTATGCTAAATCAGGCAGCATCAAAGATGCAAAGAAGGTGTTCCAGAGGTTGGTTAAACAGGATGTAGTTTCATGGAACTCGATTATATCTGGATATGCGCAACACGGAATGGGAGCTGAAGCTTTACAGCTATTTGAGGAGATGCTGAAGGCCAAAGTTCAACCTAATGAAATTACTTTTCTCTCTGTTCTTACTGCTTGTAGTCATTCCGGACTTCTGCATGAAGGACGATATTATCTTGAACTGATGAAGAAATACGAGATAGAACCACAGGTTGCACATCATGTGACCGTAGTTGATCTTTTAGGTCGAGCAGGGCGACTTAACGAAGCCTACAAGTTCATAGAGGAAATGCCTATCAAACCTACTGCAGCTGTCTGGGGAGCCTTGCTTGGTGCTTGCAGGATGCACAAGAATATGAATTTGGGTGTTTATGCGGCCGAACGGATTTTCGAGCTTGACCCTCATGACTCAGGTCCTCATGTACTTTTGTCTAATATTTATGCTTCTGCTGGTAGACTGAATGATGCTGCAAATGTAAGGAAGATGATGAAAGAGAGTGGGGTAAAGAAAGAACCTGCCTGTAGTTGGGTTGAAATTGAGAATGAAGTCCATATGTTTGTGGCGAATGATGATTCACATCCAATGAGAGCAGAAATCCAGAGGATGTGGGAGAAAATAAGTGGGAAAATTAAAGAGATTGGGTATGTGCCAGACACAAGCCATGTGCTTTTCTTCATGGATCAGCAGGACAGAGAAGCAAAGCTACAATACCATAGTGAGAAGTTAGCATTAGCATTTTCAGTCTTGAAAACTCCTCCTGGATTAACCATTAGGATTAAGAAGAACATTAGAATATGTGGTGACTGCCATTCTGCATTCAAGTTTGCTTCAAAAGTCTTGGAAAGAGAAATCATCGGACTATTGGTAGCCTATTACTTCTT
CATTACTAAGATTTCGCCAGTAGATTTGAGATGTTCAGATATAGCTTTCAATGGCTTGAGTTCAAACTTACCAAGAAGAAGCCCAATACTTAAATGTTGGTTGAAGGAAGGTTCTCTTATCTAA

Protein sequence

MFLRRYFNSCIGKYGLLQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFAAAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLISGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREIIGLLVAYYFFITKISPVDLRCSDIAFNGLSSNLPRRSPILKCWLKEGSLI
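
As a consistency check between the coding sequence and the protein sequence above, the sketch below translates a stretch of CDS with the standard genetic code. It assumes the standard nuclear code applies and that the input starts in frame; `translate` is a hypothetical helper written for illustration, not part of any database tooling.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases and last 18 bases of the CDS listed above.
print(translate("ATGTTTCTACGTAGGTATTTTAATAGCTGTATTGGA"))
print(translate("GAAGGTTCTCTTATCTAA"))
```

The two fragments translate to MFLRRYFNSCIG and EGSLI, which agree with the start and end of the listed protein sequence.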
Homology
BLAST of Cla97C01G011030 vs. NCBI nr
Match: XP_038893938.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Benincasa hispida])

HSP 1 Score: 1210.3 bits (3130), Expect = 0.0e+00
Identity = 600/658 (91.19%), Positives = 615/658 (93.47%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGLLQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFA 60
           MF   YF SCIGKYG LQL HP LK LR F  AA YGTG   CAF  S T ES+DWN   
Sbjct: 3   MFPHCYFKSCIGKYGPLQLFHPKLKPLRFFPIAAKYGTGLTPCAFMESGTAESQDWNP-T 62

Query: 61  AAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLR 120
            APFNG+LQDEDLLRKTHISSS TSTNSTGLYVLDLINRG LEPERTLY KML +CT LR
Sbjct: 63  VAPFNGILQDEDLLRKTHISSSFTSTNSTGLYVLDLINRGSLEPERTLYCKMLNKCTYLR 122

Query: 121 KLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLIS 180
           KLKQGRAIHAHIQGSTFEND VLLN ILNMYAKCGSLEEAQNLFDKMP RDMVSWTVLIS
Sbjct: 123 KLKQGRAIHAHIQGSTFENDLVLLNCILNMYAKCGSLEEAQNLFDKMPIRDMVSWTVLIS 182

Query: 181 GYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDM 240
           GYSQSGRASEALA FPKMLHLGFQPNEFTLSSLLKASGAGPSDD+GRQLHA SLKYGYDM
Sbjct: 183 GYSQSGRASEALAWFPKMLHLGFQPNEFTLSSLLKASGAGPSDDNGRQLHAFSLKYGYDM 242

Query: 241 NVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQML 300
           NVHVGSSLLDMYARWGHM+EA VIFNSLAAKNVVSWNALIAG+ARKGEGEHVMRLFWQML
Sbjct: 243 NVHVGSSLLDMYARWGHMREATVIFNSLAAKNVVSWNALIAGYARKGEGEHVMRLFWQML 302

Query: 301 RQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360
           RQDFEPTHFTYSSVF ACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK
Sbjct: 303 RQDFEPTHFTYSSVFIACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 362

Query: 361 DAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACS 420
           DAKKVFQRLVKQD+VSWNSIISGYA HG+G EALQLFEEML+AKVQPNEITFLSVLTACS
Sbjct: 363 DAKKVFQRLVKQDIVSWNSIISGYAHHGLGVEALQLFEEMLRAKVQPNEITFLSVLTACS 422

Query: 421 HSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWG 480
           HSGLL +GRYY ELMKKYEIEPQVAHHVTVVDLLGRAGRL+EA KFIEEMPI+PTAAVWG
Sbjct: 423 HSGLLDDGRYYFELMKKYEIEPQVAHHVTVVDLLGRAGRLHEANKFIEEMPIEPTAAVWG 482

Query: 481 ALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG 540
           ALLGACRMHKNM+LGVYAAER+FELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG
Sbjct: 483 ALLGACRMHKNMDLGVYAAERVFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG 542

Query: 541 VKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600
           VKKEPACSWVEIENEVHMFVANDDSHPMR EI+RMWEKISGKIKEIGYVPDTSHVLFFMD
Sbjct: 543 VKKEPACSWVEIENEVHMFVANDDSHPMREEIRRMWEKISGKIKEIGYVPDTSHVLFFMD 602

Query: 601 QQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           QQDRE KLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 603 QQDREVKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREII 659
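
The percentages on the HSP summary line follow directly from the match counts. The sketch below recomputes them from a summary line of the form shown above; `hsp_fractions` and `percent` are hypothetical helper names, and the regular expression is an assumption about this page's text layout rather than a general BLAST parser.

```python
import re

def percent(matches: int, aligned: int) -> float:
    """Share of aligned columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / aligned, 2)

def hsp_fractions(summary: str) -> dict:
    """Extract every 'Label = n/m' pair from an HSP summary line."""
    return {
        label: (int(n), int(m))
        for label, n, m in re.findall(r"(\w+) = (\d+)/(\d+)", summary)
    }

line = "Identity = 600/658 (91.19%)"
for label, (n, m) in hsp_fractions(line).items():
    print(label, percent(n, m))
```

For the first hit, 600 identical residues over 658 aligned columns gives 91.19%, and 615 of 658 gives 93.47%, matching the reported values.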

BLAST of Cla97C01G011030 vs. NCBI nr
Match: XP_008464284.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Cucumis melo])

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 595/658 (90.43%), Positives = 616/658 (93.62%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGLLQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFA 60
           MF   YFN CIGKYGLL LSH  LK LRCFL AA YGTG A CAFT S+  ES+DWN  A
Sbjct: 3   MFPHCYFNRCIGKYGLLALSHSKLKTLRCFLFAAKYGTGLAPCAFTESNMAESQDWNP-A 62

Query: 61  AAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLR 120
            APF GVLQDEDLLR THISSS+ S++STGLYVLDLIN G LEPERTLYSKML +CT LR
Sbjct: 63  TAPFTGVLQDEDLLRTTHISSSDVSSSSTGLYVLDLINCGSLEPERTLYSKMLNKCTYLR 122

Query: 121 KLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLIS 180
           KLKQGRAIHAHIQ S FENDPVLLNFILNMYAKCGSLEEAQ+LFDKMPT+D VSWTVLIS
Sbjct: 123 KLKQGRAIHAHIQSSAFENDPVLLNFILNMYAKCGSLEEAQDLFDKMPTKDRVSWTVLIS 182

Query: 181 GYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDM 240
           GYSQS RASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHA SLKYGYDM
Sbjct: 183 GYSQSRRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHAFSLKYGYDM 242

Query: 241 NVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQML 300
           NVHVGSSLLDMYARWGHM+EAKVIF SLAAKNVVSWNALIAGHARKGEGEHVMRLF QML
Sbjct: 243 NVHVGSSLLDMYARWGHMREAKVIFKSLAAKNVVSWNALIAGHARKGEGEHVMRLFSQML 302

Query: 301 RQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360
           RQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK
Sbjct: 303 RQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 362

Query: 361 DAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACS 420
           DAKKVFQRLVK+D+VSWNSIISGYAQHG+GAEALQLFE++LKAKVQPNEITFLSVLTACS
Sbjct: 363 DAKKVFQRLVKRDIVSWNSIISGYAQHGLGAEALQLFEQVLKAKVQPNEITFLSVLTACS 422

Query: 421 HSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWG 480
           HSGLL EG+YY ELMKK+ IEPQVAHHVTVVDLLGRAGRLNEA KFIEEMP++PTAAVWG
Sbjct: 423 HSGLLDEGKYYFELMKKHGIEPQVAHHVTVVDLLGRAGRLNEANKFIEEMPMEPTAAVWG 482

Query: 481 ALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG 540
           ALLGACRMHKNM+LGVYAAE+IFELDPHDSGPHVLLSNIYASAGRL DA NVRKMMKESG
Sbjct: 483 ALLGACRMHKNMDLGVYAAEKIFELDPHDSGPHVLLSNIYASAGRLRDAGNVRKMMKESG 542

Query: 541 VKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600
           VKKEPACSWVEIENEVHMFVANDDSHPMR EIQRMWEKISGKIKEIGYVPDTSHVLFFMD
Sbjct: 543 VKKEPACSWVEIENEVHMFVANDDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 602

Query: 601 QQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           QQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 603 QQDREVKLQYHSEKLALAFAVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREII 659

BLAST of Cla97C01G011030 vs. NCBI nr
Match: TYK15477.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 595/658 (90.43%), Positives = 616/658 (93.62%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGLLQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFA 60
           MF   YFN CIGKYGLL LSH  LK LRCFL AA YGTG A CAFT S+  ES+DWN  A
Sbjct: 1   MFPHCYFNRCIGKYGLLALSHSKLKTLRCFLFAAKYGTGLAPCAFTESNMAESQDWNP-A 60

Query: 61  AAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLR 120
            APF GVLQDEDLLR THISSS+ S++STGLYVLDLIN G LEPERTLYSKML +CT LR
Sbjct: 61  TAPFTGVLQDEDLLRTTHISSSDVSSSSTGLYVLDLINCGSLEPERTLYSKMLNKCTYLR 120

Query: 121 KLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLIS 180
           KLKQGRAIHAHIQ S FENDPVLLNFILNMYAKCGSLEEAQ+LFDKMPT+D VSWTVLIS
Sbjct: 121 KLKQGRAIHAHIQSSAFENDPVLLNFILNMYAKCGSLEEAQDLFDKMPTKDRVSWTVLIS 180

Query: 181 GYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDM 240
           GYSQS RASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHA SLKYGYDM
Sbjct: 181 GYSQSRRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHAFSLKYGYDM 240

Query: 241 NVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQML 300
           NVHVGSSLLDMYARWGHM+EAKVIF SLAAKNVVSWNALIAGHARKGEGEHVMRLF QML
Sbjct: 241 NVHVGSSLLDMYARWGHMREAKVIFKSLAAKNVVSWNALIAGHARKGEGEHVMRLFSQML 300

Query: 301 RQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360
           RQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK
Sbjct: 301 RQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360

Query: 361 DAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACS 420
           DAKKVFQRLVK+D+VSWNSIISGYAQHG+GAEALQLFE++LKAKVQPNEITFLSVLTACS
Sbjct: 361 DAKKVFQRLVKRDIVSWNSIISGYAQHGLGAEALQLFEQVLKAKVQPNEITFLSVLTACS 420

Query: 421 HSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWG 480
           HSGLL EG+YY ELMKK+ IEPQVAHHVTVVDLLGRAGRLNEA KFIEEMP++PTAAVWG
Sbjct: 421 HSGLLDEGKYYFELMKKHGIEPQVAHHVTVVDLLGRAGRLNEANKFIEEMPMEPTAAVWG 480

Query: 481 ALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG 540
           ALLGACRMHKNM+LGVYAAE+IFELDPHDSGPHVLLSNIYASAGRL DA NVRKMMKESG
Sbjct: 481 ALLGACRMHKNMDLGVYAAEKIFELDPHDSGPHVLLSNIYASAGRLRDAGNVRKMMKESG 540

Query: 541 VKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600
           VKKEPACSWVEIENEVHMFVANDDSHPMR EIQRMWEKISGKIKEIGYVPDTSHVLFFMD
Sbjct: 541 VKKEPACSWVEIENEVHMFVANDDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600

Query: 601 QQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           QQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 601 QQDREVKLQYHSEKLALAFAVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREII 657

BLAST of Cla97C01G011030 vs. NCBI nr
Match: XP_004139511.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Cucumis sativus] >XP_011654725.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Cucumis sativus] >XP_031736146.1 pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 [Cucumis sativus] >KGN64985.1 hypothetical protein Csa_022816 [Cucumis sativus])

HSP 1 Score: 1172.5 bits (3032), Expect = 0.0e+00
Identity = 586/659 (88.92%), Positives = 611/659 (92.72%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGL-LQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAF 60
           MF   YFN CIGKYG  L LS   LK L CFL AA YGT    CAF  S+T ES+DW+  
Sbjct: 3   MFPHCYFNRCIGKYGRPLALSPSKLKTLSCFLFAAKYGT---PCAFVESNTAESQDWDP- 62

Query: 61  AAAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRL 120
             APF GVLQDEDLLR THISSS TS+NSTGLYVLDLIN G LEPERTLYSKML +CT L
Sbjct: 63  CTAPFTGVLQDEDLLRTTHISSSGTSSNSTGLYVLDLINCGSLEPERTLYSKMLNKCTYL 122

Query: 121 RKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLI 180
           RKLKQGRAIHAHIQ STFE+D VLLNFILNMYAKCGSLEEAQ+LFDKMPT+DMVSWTVLI
Sbjct: 123 RKLKQGRAIHAHIQSSTFEDDLVLLNFILNMYAKCGSLEEAQDLFDKMPTKDMVSWTVLI 182

Query: 181 SGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYD 240
           SGYSQSG+ASEALALFPKMLHLGFQPNEFTLSSLLKASG GPSD HGRQLHA SLKYGYD
Sbjct: 183 SGYSQSGQASEALALFPKMLHLGFQPNEFTLSSLLKASGTGPSDHHGRQLHAFSLKYGYD 242

Query: 241 MNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQM 300
           MNVHVGSSLLDMYARW HM+EAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLF QM
Sbjct: 243 MNVHVGSSLLDMYARWAHMREAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFLQM 302

Query: 301 LRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSI 360
           LRQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSI
Sbjct: 303 LRQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSI 362

Query: 361 KDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTAC 420
           KDAKKVF+RLVKQD+VSWNSIISGYAQHG+GAEALQLFE+MLKAKVQPNEITFLSVLTAC
Sbjct: 363 KDAKKVFRRLVKQDIVSWNSIISGYAQHGLGAEALQLFEQMLKAKVQPNEITFLSVLTAC 422

Query: 421 SHSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVW 480
           SHSGLL EG+YY ELMKK++IE QVAHHVTVVDLLGRAGRLNEA KFIEEMPIKPTAAVW
Sbjct: 423 SHSGLLDEGQYYFELMKKHKIEAQVAHHVTVVDLLGRAGRLNEANKFIEEMPIKPTAAVW 482

Query: 481 GALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKES 540
           GALLG+CRMHKNM+LGVYAAE+IFELDPHDSGPHVLLSNIYASAGRL+DAA VRKMMKES
Sbjct: 483 GALLGSCRMHKNMDLGVYAAEQIFELDPHDSGPHVLLSNIYASAGRLSDAAKVRKMMKES 542

Query: 541 GVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFM 600
           GVKKEPACSWVEIENEVH+FVANDDSHPMR EIQRMWEKISGKIKEIGYVPDTSHVLFFM
Sbjct: 543 GVKKEPACSWVEIENEVHVFVANDDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFM 602

Query: 601 DQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           +QQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKNIRICGDCHSAFKFAS+VL REII
Sbjct: 603 NQQDRELKLQYHSEKLALAFAVLKTPPGLTIRIKKNIRICGDCHSAFKFASRVLGREII 657

BLAST of Cla97C01G011030 vs. NCBI nr
Match: XP_022973115.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1164.4 bits (3011), Expect = 0.0e+00
Identity = 572/638 (89.66%), Positives = 596/638 (93.42%), Query Frame = 0

Query: 22  PNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFAAA-PFNGVLQDEDLLRKTHIS 81
           P LK  +CF SAANYGTGS  C+ T SD+ E RDWNA AAA PF GVLQDEDLLRKTHIS
Sbjct: 20  PKLKPFKCFFSAANYGTGSPPCSLTESDSAEGRDWNAAAAAVPFTGVLQDEDLLRKTHIS 79

Query: 82  SSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFEND 141
           SSETST+STGLYVLDLIN G LEPERTLYSKML +CT LRKLK GR IH+HIQGSTFEND
Sbjct: 80  SSETSTSSTGLYVLDLINHGKLEPERTLYSKMLNKCTHLRKLKLGRVIHSHIQGSTFEND 139

Query: 142 PVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLISGYSQSGRASEALALFPKMLH 201
            V+ N ILNMYAKCGSLEEA NLFDKMPTRDMVSWTVLISGYSQSGRA EAL LFP+M H
Sbjct: 140 LVIQNSILNMYAKCGSLEEAHNLFDKMPTRDMVSWTVLISGYSQSGRAFEALGLFPQMFH 199

Query: 202 LGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQE 261
            GFQPNEFTLSSLLKASGA PSD+HGRQLHA SLKYG++MNVHVGSSLLDMYARWGHMQE
Sbjct: 200 QGFQPNEFTLSSLLKASGASPSDEHGRQLHAFSLKYGFNMNVHVGSSLLDMYARWGHMQE 259

Query: 262 AKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTACAS 321
           A+ IFN LAAKNVVSWNALIAGHARKGEGEHVM+LF QMLRQ+FEPTHFTYSSVFTACAS
Sbjct: 260 AEAIFNGLAAKNVVSWNALIAGHARKGEGEHVMKLFRQMLRQNFEPTHFTYSSVFTACAS 319

Query: 322 SGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSI 381
           SGS EQGKWVHAHVIKSGGQP+AYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSI
Sbjct: 320 SGSFEQGKWVHAHVIKSGGQPVAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSI 379

Query: 382 ISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLHEGRYYLELMKKYEI 441
           ISGYAQHG+GAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLL EG+YY ELMKKYEI
Sbjct: 380 ISGYAQHGLGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKKYEI 439

Query: 442 EPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAE 501
           EPQV+HHVTVVDLLGRAGRL+EA KFI+EMPI+PTAAVWGALLGACRMHKNM+LG YAAE
Sbjct: 440 EPQVSHHVTVVDLLGRAGRLDEANKFIKEMPIEPTAAVWGALLGACRMHKNMDLGAYAAE 499

Query: 502 RIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFV 561
           RIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIEN VHMFV
Sbjct: 500 RIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENGVHMFV 559

Query: 562 ANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFS 621
           AND+SHPMR EIQ+MWEKISGKIKEIGYVPDTSHVLFFMDQQDRE KLQYHSEKLALAFS
Sbjct: 560 ANDESHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFS 619

Query: 622 VLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           VLKTPPG TIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 620 VLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLGREII 657

BLAST of Cla97C01G011030 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 761.9 bits (1966), Expect = 6.3e-219
Identity = 368/597 (61.64%), Positives = 454/597 (76.05%), Query Frame = 0

Query: 62  APFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLRK 121
           AP +   +DE L   ++     TS+N         +   Y+  +R  Y+ +LK+CT  + 
Sbjct: 24  APVSEDSEDESLKFPSNDLLLRTSSND--------LEGSYIPADRRFYNTLLKKCTVFKL 83

Query: 122 LKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLISG 181
           L QGR +HAHI  S F +D V+ N +LNMYAKCGSLEEA+ +F+KMP RD V+WT LISG
Sbjct: 84  LIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISG 143

Query: 182 YSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMN 241
           YSQ  R  +AL  F +ML  G+ PNEFTLSS++KA+ A      G QLH   +K G+D N
Sbjct: 144 YSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSN 203

Query: 242 VHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLR 301
           VHVGS+LLD+Y R+G M +A+++F++L ++N VSWNALIAGHAR+   E  + LF  MLR
Sbjct: 204 VHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLR 263

Query: 302 QDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 361
             F P+HF+Y+S+F AC+S+G LEQGKWVHA++IKSG + +A+ GNTL+DMYAKSGSI D
Sbjct: 264 DGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHD 323

Query: 362 AKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSH 421
           A+K+F RL K+DVVSWNS+++ YAQHG G EA+  FEEM +  ++PNEI+FLSVLTACSH
Sbjct: 324 ARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSH 383

Query: 422 SGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGA 481
           SGLL EG +Y ELMKK  I P+  H+VTVVDLLGRAG LN A +FIEEMPI+PTAA+W A
Sbjct: 384 SGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKA 443

Query: 482 LLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 541
           LL ACRMHKN  LG YAAE +FELDP D GPHV+L NIYAS GR NDAA VRK MKESGV
Sbjct: 444 LLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGV 503

Query: 542 KKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQ 601
           KKEPACSWVEIEN +HMFVAND+ HP R EI R WE++  KIKE+GYVPDTSHV+  +DQ
Sbjct: 504 KKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVDQ 563

Query: 602 QDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           Q+RE  LQYHSEK+ALAF++L TPPG TI IKKNIR+CGDCH+A K ASKV+ REII
Sbjct: 564 QEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREII 612

BLAST of Cla97C01G011030 vs. ExPASy Swiss-Prot
Match: Q9LIC3 (Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H85 PE=3 SV=1)

HSP 1 Score: 483.4 bits (1243), Expect = 4.3e-135
Identity = 252/609 (41.38%), Positives = 384/609 (63.05%), Query Frame = 0

Query: 58  AFAAAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTL--YSKMLKE 117
           +F+++P N VLQ       T +  S+  +N  G     L+    L PE     Y  +L  
Sbjct: 11  SFSSSPTNYVLQ-------TILPISQLCSN--GRLQEALLEMAMLGPEMGFHGYDALLNA 70

Query: 118 CTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSW 177
           C   R L+ G+ +HAH+  + +     L   +L  Y KC  LE+A+ + D+MP +++VSW
Sbjct: 71  CLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSW 130

Query: 178 TVLISGYSQSGRASEALALFPKMLHLGFQPNEFT----LSSLLKASGAGPSDDHGRQLHA 237
           T +IS YSQ+G +SEAL +F +M+    +PNEFT    L+S ++ASG G     G+Q+H 
Sbjct: 131 TAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLG----LGKQIHG 190

Query: 238 LSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEH 297
           L +K+ YD ++ VGSSLLDMYA+ G ++EA+ IF  L  ++VVS  A+IAG+A+ G  E 
Sbjct: 191 LIVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEE 250

Query: 298 VMRLFWQMLRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLID 357
            + +F ++  +   P + TY+S+ TA +    L+ GK  H HV++      A + N+LID
Sbjct: 251 ALEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLID 310

Query: 358 MYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAK-VQPNEI 417
           MY+K G++  A+++F  + ++  +SWN+++ GY++HG+G E L+LF  M   K V+P+ +
Sbjct: 311 MYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAV 370

Query: 418 TFLSVLTACSHSGLLHEGRYYLELM--KKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIE 477
           T L+VL+ CSH  +   G    + M   +Y  +P   H+  +VD+LGRAGR++EA++FI+
Sbjct: 371 TLLAVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIK 430

Query: 478 EMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLND 537
            MP KPTA V G+LLGACR+H ++++G     R+ E++P ++G +V+LSN+YASAGR  D
Sbjct: 431 RMPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWAD 490

Query: 538 AANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGY 597
             NVR MM +  V KEP  SW++ E  +H F AND +HP R E+    ++IS K+K+ GY
Sbjct: 491 VNNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGY 550

Query: 598 VPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKF 657
           VPD S VL+ +D++ +E  L  HSEKLAL F ++ T  G+ IR+ KN+RIC DCH+  K 
Sbjct: 551 VPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKI 606

BLAST of Cla97C01G011030 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 1.1e-130
Identity = 240/558 (43.01%), Positives = 340/558 (60.93%), Query Frame = 0

Query: 102 LEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQ 161
           L+P       +L   + LR +  G+ IH +   S F++   +   +++MYAKCGSLE A+
Sbjct: 232 LKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETAR 291

Query: 162 NLFDKMPTRDMVSWTVLISGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGP 221
            LFD M  R++VSW  +I  Y Q+    EA+ +F KML  G +P + ++   L A     
Sbjct: 292 QLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLG 351

Query: 222 SDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIA 281
             + GR +H LS++ G D NV V +SL+ MY +   +  A  +F  L ++ +VSWNA+I 
Sbjct: 352 DLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMIL 411

Query: 282 GHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQP 341
           G A+ G     +  F QM  +  +P  FTY SV TA A        KW+H  V++S    
Sbjct: 412 GFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDK 471

Query: 342 IAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEML 401
             ++   L+DMYAK G+I  A+ +F  + ++ V +WN++I GY  HG G  AL+LFEEM 
Sbjct: 472 NVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQ 531

Query: 402 KAKVQPNEITFLSVLTACSHSGLLHEGRYYLELMKK-YEIEPQVAHHVTVVDLLGRAGRL 461
           K  ++PN +TFLSV++ACSHSGL+  G     +MK+ Y IE  + H+  +VDLLGRAGRL
Sbjct: 532 KGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRL 591

Query: 462 NEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIY 521
           NEA+ FI +MP+KP   V+GA+LGAC++HKN+N    AAER+FEL+P D G HVLL+NIY
Sbjct: 592 NEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIY 651

Query: 522 ASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKIS 581
            +A        VR  M   G++K P CS VEI+NEVH F +   +HP   +I    EK+ 
Sbjct: 652 RAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLI 711

Query: 582 GKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICG 641
             IKE GYVPDT+ VL  ++   +E  L  HSEKLA++F +L T  G TI ++KN+R+C 
Sbjct: 712 CHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCA 771

Query: 642 DCHSAFKFASKVLEREII 659
           DCH+A K+ S V  REI+
Sbjct: 772 DCHNATKYISLVTGREIV 788

BLAST of Cla97C01G011030 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 4.2e-130
Identity = 234/592 (39.53%), Positives = 358/592 (60.47%), Query Frame = 0

Query: 104 PERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAK---CGSLEEA 163
           P+  ++  +LK CT +  L+ G ++H  I     + D    N ++NMYAK    GS    
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 164 QNLFDKMPTR---------------------------------DMVSWTVLISGYSQSGR 223
            N+FD+MP R                                 D+VS+  +I+GY+QSG 
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGM 222

Query: 224 ASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMNVHVGSS 283
             +AL +  +M     +P+ FTLSS+L           G+++H   ++ G D +V++GSS
Sbjct: 223 YEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSS 282

Query: 284 LLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLRQDFEPT 343
           L+DMYA+   +++++ +F+ L  ++ +SWN+L+AG+ + G     +RLF QM+    +P 
Sbjct: 283 LVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPG 342

Query: 344 HFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQ 403
              +SSV  ACA   +L  GK +H +V++ G     +I + L+DMY+K G+IK A+K+F 
Sbjct: 343 AVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFD 402

Query: 404 RLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLHE 463
           R+   D VSW +II G+A HG G EA+ LFEEM +  V+PN++ F++VLTACSH GL+ E
Sbjct: 403 RMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDE 462

Query: 464 G-RYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGALLGAC 523
              Y+  + K Y +  ++ H+  V DLLGRAG+L EAY FI +M ++PT +VW  LL +C
Sbjct: 463 AWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSC 522

Query: 524 RMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPA 583
            +HKN+ L    AE+IF +D  + G +VL+ N+YAS GR  + A +R  M++ G++K+PA
Sbjct: 523 SVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPA 582

Query: 584 CSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREA 643
           CSW+E++N+ H FV+ D SHP   +I    + +  ++++ GYV DTS VL  +D++ +  
Sbjct: 583 CSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRE 642

Query: 644 KLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
            L  HSE+LA+AF ++ T PG TIR+ KNIRIC DCH A KF SK+ EREII
Sbjct: 643 LLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREII 694

BLAST of Cla97C01G011030 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 1.7e-126
Identity = 225/562 (40.04%), Positives = 356/562 (63.35%), Query Frame = 0

Query: 101 YLEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEA 160
           Y+    + ++ ++K C  L++L+    +H  +    F  D  +   ++  Y+KC ++ +A
Sbjct: 290 YVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDA 349

Query: 161 QNLFDKMP-TRDMVSWTVLISGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGA 220
             LF ++    ++VSWT +ISG+ Q+    EA+ LF +M   G +PNEFT S +L A   
Sbjct: 350 LRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPV 409

Query: 221 -GPSDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNA 280
             PS     ++HA  +K  Y+ +  VG++LLD Y + G ++EA  +F+ +  K++V+W+A
Sbjct: 410 ISPS-----EVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSA 469

Query: 281 LIAGHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTAC-ASSGSLEQGKWVHAHVIKS 340
           ++AG+A+ GE E  +++F ++ +   +P  FT+SS+   C A++ S+ QGK  H   IKS
Sbjct: 470 MLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKS 529

Query: 341 GGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLF 400
                  + + L+ MYAK G+I+ A++VF+R  ++D+VSWNS+ISGYAQHG   +AL +F
Sbjct: 530 RLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVF 589

Query: 401 EEMLKAKVQPNEITFLSVLTACSHSGLLHEGRYYLELM-KKYEIEPQVAHHVTVVDLLGR 460
           +EM K KV+ + +TF+ V  AC+H+GL+ EG  Y ++M +  +I P   H+  +VDL  R
Sbjct: 590 KEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSR 649

Query: 461 AGRLNEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLL 520
           AG+L +A K IE MP    + +W  +L ACR+HK   LG  AAE+I  + P DS  +VLL
Sbjct: 650 AGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLL 709

Query: 521 SNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMW 580
           SN+YA +G   + A VRK+M E  VKKEP  SW+E++N+ + F+A D SHP++ +I    
Sbjct: 710 SNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKL 769

Query: 581 EKISGKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNI 640
           E +S ++K++GY PDTS+VL  +D + +EA L  HSE+LA+AF ++ TP G  + I KN+
Sbjct: 770 EDLSTRLKDLGYEPDTSYVLQDIDDEHKEAVLAQHSERLAIAFGLIATPKGSPLLIIKNL 829

Query: 641 RICGDCHSAFKFASKVLEREII 659
           R+CGDCH   K  +K+ EREI+
Sbjct: 830 RVCGDCHLVIKLIAKIEEREIV 846
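
The percentages in each HSP statistics line follow directly from the residue counts (for the Q9ZUW3 hit above, 225 identical residues over 562 aligned positions). The following is a minimal sketch of how such a line could be parsed and its percentages recomputed. The names `parse_hsp_stats` and `percent` are hypothetical helpers written for this illustration and are not part of any BLAST toolkit; the pattern accepts either spelling of the positives label.

```python
import re

# Matches the statistics line printed under each HSP on this page.
# "Posi?tives" also tolerates the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),    # value as reported on the page
        "positives_pct": float(pos_pct),     # value as reported on the page
    }


if __name__ == "__main__":
    line = ("Identity = 225/562 (40.04%), "
            "Positives = 356/562 (63.35%), Query Frame = 0")
    stats = parse_hsp_stats(line)
    # Recompute the percentages from the raw counts.
    print(percent(stats["identical"], stats["aligned_length"]))   # 40.04
    print(percent(stats["positives"], stats["aligned_length"]))   # 63.35
```

Recomputing the percentages from the counts is a cheap consistency check when copying values from the alignments into a summary table.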

BLAST of Cla97C01G011030 vs. ExPASy TrEMBL
Match: A0A5D3CU56 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold477G00540 PE=3 SV=1)

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 595/658 (90.43%), Positives = 616/658 (93.62%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGLLQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFA 60
           MF   YFN CIGKYGLL LSH  LK LRCFL AA YGTG A CAFT S+  ES+DWN  A
Sbjct: 1   MFPHCYFNRCIGKYGLLALSHSKLKTLRCFLFAAKYGTGLAPCAFTESNMAESQDWNP-A 60

Query: 61  AAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLR 120
            APF GVLQDEDLLR THISSS+ S++STGLYVLDLIN G LEPERTLYSKML +CT LR
Sbjct: 61  TAPFTGVLQDEDLLRTTHISSSDVSSSSTGLYVLDLINCGSLEPERTLYSKMLNKCTYLR 120

Query: 121 KLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLIS 180
           KLKQGRAIHAHIQ S FENDPVLLNFILNMYAKCGSLEEAQ+LFDKMPT+D VSWTVLIS
Sbjct: 121 KLKQGRAIHAHIQSSAFENDPVLLNFILNMYAKCGSLEEAQDLFDKMPTKDRVSWTVLIS 180

Query: 181 GYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDM 240
           GYSQS RASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHA SLKYGYDM
Sbjct: 181 GYSQSRRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHAFSLKYGYDM 240

Query: 241 NVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQML 300
           NVHVGSSLLDMYARWGHM+EAKVIF SLAAKNVVSWNALIAGHARKGEGEHVMRLF QML
Sbjct: 241 NVHVGSSLLDMYARWGHMREAKVIFKSLAAKNVVSWNALIAGHARKGEGEHVMRLFSQML 300

Query: 301 RQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360
           RQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK
Sbjct: 301 RQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360

Query: 361 DAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACS 420
           DAKKVFQRLVK+D+VSWNSIISGYAQHG+GAEALQLFE++LKAKVQPNEITFLSVLTACS
Sbjct: 361 DAKKVFQRLVKRDIVSWNSIISGYAQHGLGAEALQLFEQVLKAKVQPNEITFLSVLTACS 420

Query: 421 HSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWG 480
           HSGLL EG+YY ELMKK+ IEPQVAHHVTVVDLLGRAGRLNEA KFIEEMP++PTAAVWG
Sbjct: 421 HSGLLDEGKYYFELMKKHGIEPQVAHHVTVVDLLGRAGRLNEANKFIEEMPMEPTAAVWG 480

Query: 481 ALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG 540
           ALLGACRMHKNM+LGVYAAE+IFELDPHDSGPHVLLSNIYASAGRL DA NVRKMMKESG
Sbjct: 481 ALLGACRMHKNMDLGVYAAEKIFELDPHDSGPHVLLSNIYASAGRLRDAGNVRKMMKESG 540

Query: 541 VKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600
           VKKEPACSWVEIENEVHMFVANDDSHPMR EIQRMWEKISGKIKEIGYVPDTSHVLFFMD
Sbjct: 541 VKKEPACSWVEIENEVHMFVANDDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600

Query: 601 QQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           QQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 601 QQDREVKLQYHSEKLALAFAVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREII 657

BLAST of Cla97C01G011030 vs. ExPASy TrEMBL
Match: A0A1S3CMN0 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502212 PE=3 SV=1)

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 595/658 (90.43%), Positives = 616/658 (93.62%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGLLQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFA 60
           MF   YFN CIGKYGLL LSH  LK LRCFL AA YGTG A CAFT S+  ES+DWN  A
Sbjct: 3   MFPHCYFNRCIGKYGLLALSHSKLKTLRCFLFAAKYGTGLAPCAFTESNMAESQDWNP-A 62

Query: 61  AAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLR 120
            APF GVLQDEDLLR THISSS+ S++STGLYVLDLIN G LEPERTLYSKML +CT LR
Sbjct: 63  TAPFTGVLQDEDLLRTTHISSSDVSSSSTGLYVLDLINCGSLEPERTLYSKMLNKCTYLR 122

Query: 121 KLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLIS 180
           KLKQGRAIHAHIQ S FENDPVLLNFILNMYAKCGSLEEAQ+LFDKMPT+D VSWTVLIS
Sbjct: 123 KLKQGRAIHAHIQSSAFENDPVLLNFILNMYAKCGSLEEAQDLFDKMPTKDRVSWTVLIS 182

Query: 181 GYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDM 240
           GYSQS RASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHA SLKYGYDM
Sbjct: 183 GYSQSRRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHAFSLKYGYDM 242

Query: 241 NVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQML 300
           NVHVGSSLLDMYARWGHM+EAKVIF SLAAKNVVSWNALIAGHARKGEGEHVMRLF QML
Sbjct: 243 NVHVGSSLLDMYARWGHMREAKVIFKSLAAKNVVSWNALIAGHARKGEGEHVMRLFSQML 302

Query: 301 RQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 360
           RQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK
Sbjct: 303 RQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIK 362

Query: 361 DAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACS 420
           DAKKVFQRLVK+D+VSWNSIISGYAQHG+GAEALQLFE++LKAKVQPNEITFLSVLTACS
Sbjct: 363 DAKKVFQRLVKRDIVSWNSIISGYAQHGLGAEALQLFEQVLKAKVQPNEITFLSVLTACS 422

Query: 421 HSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWG 480
           HSGLL EG+YY ELMKK+ IEPQVAHHVTVVDLLGRAGRLNEA KFIEEMP++PTAAVWG
Sbjct: 423 HSGLLDEGKYYFELMKKHGIEPQVAHHVTVVDLLGRAGRLNEANKFIEEMPMEPTAAVWG 482

Query: 481 ALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESG 540
           ALLGACRMHKNM+LGVYAAE+IFELDPHDSGPHVLLSNIYASAGRL DA NVRKMMKESG
Sbjct: 483 ALLGACRMHKNMDLGVYAAEKIFELDPHDSGPHVLLSNIYASAGRLRDAGNVRKMMKESG 542

Query: 541 VKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 600
           VKKEPACSWVEIENEVHMFVANDDSHPMR EIQRMWEKISGKIKEIGYVPDTSHVLFFMD
Sbjct: 543 VKKEPACSWVEIENEVHMFVANDDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFMD 602

Query: 601 QQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           QQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 603 QQDREVKLQYHSEKLALAFAVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLGREII 659

BLAST of Cla97C01G011030 vs. ExPASy TrEMBL
Match: A0A0A0LSV8 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G171070 PE=3 SV=1)

HSP 1 Score: 1172.5 bits (3032), Expect = 0.0e+00
Identity = 586/659 (88.92%), Positives = 611/659 (92.72%), Query Frame = 0

Query: 1   MFLRRYFNSCIGKYGL-LQLSHPNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAF 60
           MF   YFN CIGKYG  L LS   LK L CFL AA YGT    CAF  S+T ES+DW+  
Sbjct: 3   MFPHCYFNRCIGKYGRPLALSPSKLKTLSCFLFAAKYGT---PCAFVESNTAESQDWDP- 62

Query: 61  AAAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRL 120
             APF GVLQDEDLLR THISSS TS+NSTGLYVLDLIN G LEPERTLYSKML +CT L
Sbjct: 63  CTAPFTGVLQDEDLLRTTHISSSGTSSNSTGLYVLDLINCGSLEPERTLYSKMLNKCTYL 122

Query: 121 RKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLI 180
           RKLKQGRAIHAHIQ STFE+D VLLNFILNMYAKCGSLEEAQ+LFDKMPT+DMVSWTVLI
Sbjct: 123 RKLKQGRAIHAHIQSSTFEDDLVLLNFILNMYAKCGSLEEAQDLFDKMPTKDMVSWTVLI 182

Query: 181 SGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYD 240
           SGYSQSG+ASEALALFPKMLHLGFQPNEFTLSSLLKASG GPSD HGRQLHA SLKYGYD
Sbjct: 183 SGYSQSGQASEALALFPKMLHLGFQPNEFTLSSLLKASGTGPSDHHGRQLHAFSLKYGYD 242

Query: 241 MNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQM 300
           MNVHVGSSLLDMYARW HM+EAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLF QM
Sbjct: 243 MNVHVGSSLLDMYARWAHMREAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFLQM 302

Query: 301 LRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSI 360
           LRQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSI
Sbjct: 303 LRQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSI 362

Query: 361 KDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTAC 420
           KDAKKVF+RLVKQD+VSWNSIISGYAQHG+GAEALQLFE+MLKAKVQPNEITFLSVLTAC
Sbjct: 363 KDAKKVFRRLVKQDIVSWNSIISGYAQHGLGAEALQLFEQMLKAKVQPNEITFLSVLTAC 422

Query: 421 SHSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVW 480
           SHSGLL EG+YY ELMKK++IE QVAHHVTVVDLLGRAGRLNEA KFIEEMPIKPTAAVW
Sbjct: 423 SHSGLLDEGQYYFELMKKHKIEAQVAHHVTVVDLLGRAGRLNEANKFIEEMPIKPTAAVW 482

Query: 481 GALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKES 540
           GALLG+CRMHKNM+LGVYAAE+IFELDPHDSGPHVLLSNIYASAGRL+DAA VRKMMKES
Sbjct: 483 GALLGSCRMHKNMDLGVYAAEQIFELDPHDSGPHVLLSNIYASAGRLSDAAKVRKMMKES 542

Query: 541 GVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFM 600
           GVKKEPACSWVEIENEVH+FVANDDSHPMR EIQRMWEKISGKIKEIGYVPDTSHVLFFM
Sbjct: 543 GVKKEPACSWVEIENEVHVFVANDDSHPMREEIQRMWEKISGKIKEIGYVPDTSHVLFFM 602

Query: 601 DQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           +QQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKNIRICGDCHSAFKFAS+VL REII
Sbjct: 603 NQQDRELKLQYHSEKLALAFAVLKTPPGLTIRIKKNIRICGDCHSAFKFASRVLGREII 657

BLAST of Cla97C01G011030 vs. ExPASy TrEMBL
Match: A0A6J1IAI9 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111471639 PE=3 SV=1)

HSP 1 Score: 1164.4 bits (3011), Expect = 0.0e+00
Identity = 572/638 (89.66%), Positives = 596/638 (93.42%), Query Frame = 0

Query: 22  PNLKLLRCFLSAANYGTGSAHCAFTGSDTEESRDWNAFAAA-PFNGVLQDEDLLRKTHIS 81
           P LK  +CF SAANYGTGS  C+ T SD+ E RDWNA AAA PF GVLQDEDLLRKTHIS
Sbjct: 20  PKLKPFKCFFSAANYGTGSPPCSLTESDSAEGRDWNAAAAAVPFTGVLQDEDLLRKTHIS 79

Query: 82  SSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFEND 141
           SSETST+STGLYVLDLIN G LEPERTLYSKML +CT LRKLK GR IH+HIQGSTFEND
Sbjct: 80  SSETSTSSTGLYVLDLINHGKLEPERTLYSKMLNKCTHLRKLKLGRVIHSHIQGSTFEND 139

Query: 142 PVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLISGYSQSGRASEALALFPKMLH 201
            V+ N ILNMYAKCGSLEEA NLFDKMPTRDMVSWTVLISGYSQSGRA EAL LFP+M H
Sbjct: 140 LVIQNSILNMYAKCGSLEEAHNLFDKMPTRDMVSWTVLISGYSQSGRAFEALGLFPQMFH 199

Query: 202 LGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQE 261
            GFQPNEFTLSSLLKASGA PSD+HGRQLHA SLKYG++MNVHVGSSLLDMYARWGHMQE
Sbjct: 200 QGFQPNEFTLSSLLKASGASPSDEHGRQLHAFSLKYGFNMNVHVGSSLLDMYARWGHMQE 259

Query: 262 AKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTACAS 321
           A+ IFN LAAKNVVSWNALIAGHARKGEGEHVM+LF QMLRQ+FEPTHFTYSSVFTACAS
Sbjct: 260 AEAIFNGLAAKNVVSWNALIAGHARKGEGEHVMKLFRQMLRQNFEPTHFTYSSVFTACAS 319

Query: 322 SGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSI 381
           SGS EQGKWVHAHVIKSGGQP+AYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSI
Sbjct: 320 SGSFEQGKWVHAHVIKSGGQPVAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSI 379

Query: 382 ISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLHEGRYYLELMKKYEI 441
           ISGYAQHG+GAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLL EG+YY ELMKKYEI
Sbjct: 380 ISGYAQHGLGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLDEGQYYFELMKKYEI 439

Query: 442 EPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAE 501
           EPQV+HHVTVVDLLGRAGRL+EA KFI+EMPI+PTAAVWGALLGACRMHKNM+LG YAAE
Sbjct: 440 EPQVSHHVTVVDLLGRAGRLDEANKFIKEMPIEPTAAVWGALLGACRMHKNMDLGAYAAE 499

Query: 502 RIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFV 561
           RIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIEN VHMFV
Sbjct: 500 RIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENGVHMFV 559

Query: 562 ANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFS 621
           AND+SHPMR EIQ+MWEKISGKIKEIGYVPDTSHVLFFMDQQDRE KLQYHSEKLALAFS
Sbjct: 560 ANDESHPMREEIQKMWEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFS 619

Query: 622 VLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           VLKTPPG TIRIKKNIRICGDCHSAFKFASKVL REII
Sbjct: 620 VLKTPPGFTIRIKKNIRICGDCHSAFKFASKVLGREII 657

BLAST of Cla97C01G011030 vs. ExPASy TrEMBL
Match: A0A1S3CL86 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X2 OS=Cucumis melo OX=3656 GN=LOC103502212 PE=3 SV=1)

HSP 1 Score: 1155.6 bits (2988), Expect = 0.0e+00
Identity = 570/623 (91.49%), Positives = 591/623 (94.86%), Query Frame = 0

Query: 36  YGTGSAHCAFTGSDTEESRDWNAFAAAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLD 95
           YGTG A CAFT S+  ES+DWN  A APF GVLQDEDLLR THISSS+ S++STGLYVLD
Sbjct: 2   YGTGLAPCAFTESNMAESQDWNP-ATAPFTGVLQDEDLLRTTHISSSDVSSSSTGLYVLD 61

Query: 96  LINRGYLEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCG 155
           LIN G LEPERTLYSKML +CT LRKLKQGRAIHAHIQ S FENDPVLLNFILNMYAKCG
Sbjct: 62  LINCGSLEPERTLYSKMLNKCTYLRKLKQGRAIHAHIQSSAFENDPVLLNFILNMYAKCG 121

Query: 156 SLEEAQNLFDKMPTRDMVSWTVLISGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLK 215
           SLEEAQ+LFDKMPT+D VSWTVLISGYSQS RASEALALFPKMLHLGFQPNEFTLSSLLK
Sbjct: 122 SLEEAQDLFDKMPTKDRVSWTVLISGYSQSRRASEALALFPKMLHLGFQPNEFTLSSLLK 181

Query: 216 ASGAGPSDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVS 275
           ASGAGPSDDHGRQLHA SLKYGYDMNVHVGSSLLDMYARWGHM+EAKVIF SLAAKNVVS
Sbjct: 182 ASGAGPSDDHGRQLHAFSLKYGYDMNVHVGSSLLDMYARWGHMREAKVIFKSLAAKNVVS 241

Query: 276 WNALIAGHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVI 335
           WNALIAGHARKGEGEHVMRLF QMLRQ FEPTHFTYSSVFTACASSGSLEQGKWVHAHVI
Sbjct: 242 WNALIAGHARKGEGEHVMRLFSQMLRQGFEPTHFTYSSVFTACASSGSLEQGKWVHAHVI 301

Query: 336 KSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQ 395
           KSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVK+D+VSWNSIISGYAQHG+GAEALQ
Sbjct: 302 KSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKRDIVSWNSIISGYAQHGLGAEALQ 361

Query: 396 LFEEMLKAKVQPNEITFLSVLTACSHSGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLG 455
           LFE++LKAKVQPNEITFLSVLTACSHSGLL EG+YY ELMKK+ IEPQVAHHVTVVDLLG
Sbjct: 362 LFEQVLKAKVQPNEITFLSVLTACSHSGLLDEGKYYFELMKKHGIEPQVAHHVTVVDLLG 421

Query: 456 RAGRLNEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVL 515
           RAGRLNEA KFIEEMP++PTAAVWGALLGACRMHKNM+LGVYAAE+IFELDPHDSGPHVL
Sbjct: 422 RAGRLNEANKFIEEMPMEPTAAVWGALLGACRMHKNMDLGVYAAEKIFELDPHDSGPHVL 481

Query: 516 LSNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRM 575
           LSNIYASAGRL DA NVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMR EIQRM
Sbjct: 482 LSNIYASAGRLRDAGNVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMREEIQRM 541

Query: 576 WEKISGKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKN 635
           WEKISGKIKEIGYVPDTSHVLFFMDQQDRE KLQYHSEKLALAF+VLKTPPGLTIRIKKN
Sbjct: 542 WEKISGKIKEIGYVPDTSHVLFFMDQQDREVKLQYHSEKLALAFAVLKTPPGLTIRIKKN 601

Query: 636 IRICGDCHSAFKFASKVLEREII 659
           IRICGDCHSAFKFASKVL REII
Sbjct: 602 IRICGDCHSAFKFASKVLGREII 623

BLAST of Cla97C01G011030 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 761.9 bits (1966), Expect = 4.5e-220
Identity = 368/597 (61.64%), Positives = 454/597 (76.05%), Query Frame = 0

Query: 62  APFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTLYSKMLKECTRLRK 121
           AP +   +DE L   ++     TS+N         +   Y+  +R  Y+ +LK+CT  + 
Sbjct: 24  APVSEDSEDESLKFPSNDLLLRTSSND--------LEGSYIPADRRFYNTLLKKCTVFKL 83

Query: 122 LKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSWTVLISG 181
           L QGR +HAHI  S F +D V+ N +LNMYAKCGSLEEA+ +F+KMP RD V+WT LISG
Sbjct: 84  LIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISG 143

Query: 182 YSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMN 241
           YSQ  R  +AL  F +ML  G+ PNEFTLSS++KA+ A      G QLH   +K G+D N
Sbjct: 144 YSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSN 203

Query: 242 VHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLR 301
           VHVGS+LLD+Y R+G M +A+++F++L ++N VSWNALIAGHAR+   E  + LF  MLR
Sbjct: 204 VHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLR 263

Query: 302 QDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKD 361
             F P+HF+Y+S+F AC+S+G LEQGKWVHA++IKSG + +A+ GNTL+DMYAKSGSI D
Sbjct: 264 DGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHD 323

Query: 362 AKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSH 421
           A+K+F RL K+DVVSWNS+++ YAQHG G EA+  FEEM +  ++PNEI+FLSVLTACSH
Sbjct: 324 ARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSH 383

Query: 422 SGLLHEGRYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGA 481
           SGLL EG +Y ELMKK  I P+  H+VTVVDLLGRAG LN A +FIEEMPI+PTAA+W A
Sbjct: 384 SGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKA 443

Query: 482 LLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGV 541
           LL ACRMHKN  LG YAAE +FELDP D GPHV+L NIYAS GR NDAA VRK MKESGV
Sbjct: 444 LLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGV 503

Query: 542 KKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQ 601
           KKEPACSWVEIEN +HMFVAND+ HP R EI R WE++  KIKE+GYVPDTSHV+  +DQ
Sbjct: 504 KKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYVPDTSHVIVHVDQ 563

Query: 602 QDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
           Q+RE  LQYHSEK+ALAF++L TPPG TI IKKNIR+CGDCH+A K ASKV+ REII
Sbjct: 564 QEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREII 612

BLAST of Cla97C01G011030 vs. TAIR 10
Match: AT3G13770.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 483.4 bits (1243), Expect = 3.1e-136
Identity = 252/609 (41.38%), Positives = 384/609 (63.05%), Query Frame = 0

Query: 58  AFAAAPFNGVLQDEDLLRKTHISSSETSTNSTGLYVLDLINRGYLEPERTL--YSKMLKE 117
           +F+++P N VLQ       T +  S+  +N  G     L+    L PE     Y  +L  
Sbjct: 11  SFSSSPTNYVLQ-------TILPISQLCSN--GRLQEALLEMAMLGPEMGFHGYDALLNA 70

Query: 118 CTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQNLFDKMPTRDMVSW 177
           C   R L+ G+ +HAH+  + +     L   +L  Y KC  LE+A+ + D+MP +++VSW
Sbjct: 71  CLDKRALRDGQRVHAHMIKTRYLPATYLRTRLLIFYGKCDCLEDARKVLDEMPEKNVVSW 130

Query: 178 TVLISGYSQSGRASEALALFPKMLHLGFQPNEFT----LSSLLKASGAGPSDDHGRQLHA 237
           T +IS YSQ+G +SEAL +F +M+    +PNEFT    L+S ++ASG G     G+Q+H 
Sbjct: 131 TAMISRYSQTGHSSEALTVFAEMMRSDGKPNEFTFATVLTSCIRASGLG----LGKQIHG 190

Query: 238 LSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEH 297
           L +K+ YD ++ VGSSLLDMYA+ G ++EA+ IF  L  ++VVS  A+IAG+A+ G  E 
Sbjct: 191 LIVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQLGLDEE 250

Query: 298 VMRLFWQMLRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLID 357
            + +F ++  +   P + TY+S+ TA +    L+ GK  H HV++      A + N+LID
Sbjct: 251 ALEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVLQNSLID 310

Query: 358 MYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAK-VQPNEI 417
           MY+K G++  A+++F  + ++  +SWN+++ GY++HG+G E L+LF  M   K V+P+ +
Sbjct: 311 MYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLELFRLMRDEKRVKPDAV 370

Query: 418 TFLSVLTACSHSGLLHEGRYYLELM--KKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIE 477
           T L+VL+ CSH  +   G    + M   +Y  +P   H+  +VD+LGRAGR++EA++FI+
Sbjct: 371 TLLAVLSGCSHGRMEDTGLNIFDGMVAGEYGTKPGTEHYGCIVDMLGRAGRIDEAFEFIK 430

Query: 478 EMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLND 537
            MP KPTA V G+LLGACR+H ++++G     R+ E++P ++G +V+LSN+YASAGR  D
Sbjct: 431 RMPSKPTAGVLGSLLGACRVHLSVDIGESVGRRLIEIEPENAGNYVILSNLYASAGRWAD 490

Query: 538 AANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGY 597
             NVR MM +  V KEP  SW++ E  +H F AND +HP R E+    ++IS K+K+ GY
Sbjct: 491 VNNVRAMMMQKAVTKEPGRSWIQHEQTLHYFHANDRTHPRREEVLAKMKEISIKMKQAGY 550

Query: 598 VPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKF 657
           VPD S VL+ +D++ +E  L  HSEKLAL F ++ T  G+ IR+ KN+RIC DCH+  K 
Sbjct: 551 VPDLSCVLYDVDEEQKEKMLLGHSEKLALTFGLIATGEGIPIRVFKNLRICVDCHNFAKI 606

BLAST of Cla97C01G011030 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 468.8 bits (1205), Expect = 7.9e-132
Identity = 240/558 (43.01%), Positives = 340/558 (60.93%), Query Frame = 0

Query: 102 LEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEAQ 161
           L+P       +L   + LR +  G+ IH +   S F++   +   +++MYAKCGSLE A+
Sbjct: 232 LKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETAR 291

Query: 162 NLFDKMPTRDMVSWTVLISGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGAGP 221
            LFD M  R++VSW  +I  Y Q+    EA+ +F KML  G +P + ++   L A     
Sbjct: 292 QLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLG 351

Query: 222 SDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIA 281
             + GR +H LS++ G D NV V +SL+ MY +   +  A  +F  L ++ +VSWNA+I 
Sbjct: 352 DLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMIL 411

Query: 282 GHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQP 341
           G A+ G     +  F QM  +  +P  FTY SV TA A        KW+H  V++S    
Sbjct: 412 GFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDK 471

Query: 342 IAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLFEEML 401
             ++   L+DMYAK G+I  A+ +F  + ++ V +WN++I GY  HG G  AL+LFEEM 
Sbjct: 472 NVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQ 531

Query: 402 KAKVQPNEITFLSVLTACSHSGLLHEGRYYLELMKK-YEIEPQVAHHVTVVDLLGRAGRL 461
           K  ++PN +TFLSV++ACSHSGL+  G     +MK+ Y IE  + H+  +VDLLGRAGRL
Sbjct: 532 KGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRL 591

Query: 462 NEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIY 521
           NEA+ FI +MP+KP   V+GA+LGAC++HKN+N    AAER+FEL+P D G HVLL+NIY
Sbjct: 592 NEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIY 651

Query: 522 ASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMWEKIS 581
            +A        VR  M   G++K P CS VEI+NEVH F +   +HP   +I    EK+ 
Sbjct: 652 RAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLI 711

Query: 582 GKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICG 641
             IKE GYVPDT+ VL  ++   +E  L  HSEKLA++F +L T  G TI ++KN+R+C 
Sbjct: 712 CHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCA 771

Query: 642 DCHSAFKFASKVLEREII 659
           DCH+A K+ S V  REI+
Sbjct: 772 DCHNATKYISLVTGREIV 788

BLAST of Cla97C01G011030 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 466.8 bits (1200), Expect = 3.0e-131
Identity = 234/592 (39.53%), Positives = 358/592 (60.47%), Query Frame = 0

Query: 104 PERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAK---CGSLEEA 163
           P+  ++  +LK CT +  L+ G ++H  I     + D    N ++NMYAK    GS    
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 164 QNLFDKMPTR---------------------------------DMVSWTVLISGYSQSGR 223
            N+FD+MP R                                 D+VS+  +I+GY+QSG 
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGM 222

Query: 224 ASEALALFPKMLHLGFQPNEFTLSSLLKASGAGPSDDHGRQLHALSLKYGYDMNVHVGSS 283
             +AL +  +M     +P+ FTLSS+L           G+++H   ++ G D +V++GSS
Sbjct: 223 YEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSS 282

Query: 284 LLDMYARWGHMQEAKVIFNSLAAKNVVSWNALIAGHARKGEGEHVMRLFWQMLRQDFEPT 343
           L+DMYA+   +++++ +F+ L  ++ +SWN+L+AG+ + G     +RLF QM+    +P 
Sbjct: 283 LVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQMVTAKVKPG 342

Query: 344 HFTYSSVFTACASSGSLEQGKWVHAHVIKSGGQPIAYIGNTLIDMYAKSGSIKDAKKVFQ 403
              +SSV  ACA   +L  GK +H +V++ G     +I + L+DMY+K G+IK A+K+F 
Sbjct: 343 AVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFD 402

Query: 404 RLVKQDVVSWNSIISGYAQHGMGAEALQLFEEMLKAKVQPNEITFLSVLTACSHSGLLHE 463
           R+   D VSW +II G+A HG G EA+ LFEEM +  V+PN++ F++VLTACSH GL+ E
Sbjct: 403 RMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDE 462

Query: 464 G-RYYLELMKKYEIEPQVAHHVTVVDLLGRAGRLNEAYKFIEEMPIKPTAAVWGALLGAC 523
              Y+  + K Y +  ++ H+  V DLLGRAG+L EAY FI +M ++PT +VW  LL +C
Sbjct: 463 AWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSC 522

Query: 524 RMHKNMNLGVYAAERIFELDPHDSGPHVLLSNIYASAGRLNDAANVRKMMKESGVKKEPA 583
            +HKN+ L    AE+IF +D  + G +VL+ N+YAS GR  + A +R  M++ G++K+PA
Sbjct: 523 SVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPA 582

Query: 584 CSWVEIENEVHMFVANDDSHPMRAEIQRMWEKISGKIKEIGYVPDTSHVLFFMDQQDREA 643
           CSW+E++N+ H FV+ D SHP   +I    + +  ++++ GYV DTS VL  +D++ +  
Sbjct: 583 CSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRE 642

Query: 644 KLQYHSEKLALAFSVLKTPPGLTIRIKKNIRICGDCHSAFKFASKVLEREII 659
            L  HSE+LA+AF ++ T PG TIR+ KNIRIC DCH A KF SK+ EREII
Sbjct: 643 LLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREII 694

BLAST of Cla97C01G011030 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 454.9 bits (1169), Expect = 1.2e-127
Identity = 225/562 (40.04%), Positives = 356/562 (63.35%), Query Frame = 0

Query: 101 YLEPERTLYSKMLKECTRLRKLKQGRAIHAHIQGSTFENDPVLLNFILNMYAKCGSLEEA 160
           Y+    + ++ ++K C  L++L+    +H  +    F  D  +   ++  Y+KC ++ +A
Sbjct: 290 YVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSKCTAMLDA 349

Query: 161 QNLFDKMP-TRDMVSWTVLISGYSQSGRASEALALFPKMLHLGFQPNEFTLSSLLKASGA 220
             LF ++    ++VSWT +ISG+ Q+    EA+ LF +M   G +PNEFT S +L A   
Sbjct: 350 LRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSVILTALPV 409

Query: 221 -GPSDDHGRQLHALSLKYGYDMNVHVGSSLLDMYARWGHMQEAKVIFNSLAAKNVVSWNA 280
             PS     ++HA  +K  Y+ +  VG++LLD Y + G ++EA  +F+ +  K++V+W+A
Sbjct: 410 ISPS-----EVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKDIVAWSA 469

Query: 281 LIAGHARKGEGEHVMRLFWQMLRQDFEPTHFTYSSVFTAC-ASSGSLEQGKWVHAHVIKS 340
           ++AG+A+ GE E  +++F ++ +   +P  FT+SS+   C A++ S+ QGK  H   IKS
Sbjct: 470 MLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFHGFAIKS 529

Query: 341 GGQPIAYIGNTLIDMYAKSGSIKDAKKVFQRLVKQDVVSWNSIISGYAQHGMGAEALQLF 400
                  + + L+ MYAK G+I+ A++VF+R  ++D+VSWNS+ISGYAQHG   +AL +F
Sbjct: 530 RLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAMKALDVF 589

Query: 401 EEMLKAKVQPNEITFLSVLTACSHSGLLHEGRYYLELM-KKYEIEPQVAHHVTVVDLLGR 460
           +EM K KV+ + +TF+ V  AC+H+GL+ EG  Y ++M +  +I P   H+  +VDL  R
Sbjct: 590 KEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCMVDLYSR 649

Query: 461 AGRLNEAYKFIEEMPIKPTAAVWGALLGACRMHKNMNLGVYAAERIFELDPHDSGPHVLL 520
           AG+L +A K IE MP    + +W  +L ACR+HK   LG  AAE+I  + P DS  +VLL
Sbjct: 650 AGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDSAAYVLL 709

Query: 521 SNIYASAGRLNDAANVRKMMKESGVKKEPACSWVEIENEVHMFVANDDSHPMRAEIQRMW 580
           SN+YA +G   + A VRK+M E  VKKEP  SW+E++N+ + F+A D SHP++ +I    
Sbjct: 710 SNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKDQIYMKL 769

Query: 581 EKISGKIKEIGYVPDTSHVLFFMDQQDREAKLQYHSEKLALAFSVLKTPPGLTIRIKKNI 640
           E +S ++K++GY PDTS+VL  +D + +EA L  HSE+LA+AF ++ TP G  + I KN+
Sbjct: 770 EDLSTRLKDLGYEPDTSYVLQDIDDEHKEAVLAQHSERLAIAFGLIATPKGSPLLIIKNL 829

Query: 641 RICGDCHSAFKFASKVLEREII 659
           R+CGDCH   K  +K+ EREI+
Sbjct: 830 RVCGDCHLVIKLIAKIEEREIV 846

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038893938.1 | 0.0e+00 | 91.19 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ...
XP_008464284.1 | 0.0e+00 | 90.43 | PREDICTED: pentatricopeptide repeat-containing protein At3g24000, mitochondrial ...
TYK15477.1 | 0.0e+00 | 90.43 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_004139511.1 | 0.0e+00 | 88.92 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ...
XP_022973115.1 | 0.0e+00 | 89.66 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial [Cucurbita ...

Match Name | E-value | Identity (%) | Description
Q9LIQ7 | 6.3e-219 | 61.64 | Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
Q9LIC3 | 4.3e-135 | 41.38 | Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS...
Q3E6Q1 | 1.1e-130 | 43.01 | Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9LW63 | 4.2e-130 | 39.53 | Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q9ZUW3 | 1.7e-126 | 40.04 | Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
A0A5D3CU56 | 0.0e+00 | 90.43 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CMN0 | 0.0e+00 | 90.43 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X1 ...
A0A0A0LSV8 | 0.0e+00 | 88.92 | DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G1710...
A0A6J1IAI9 | 0.0e+00 | 89.66 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Cucurbit...
A0A1S3CL86 | 0.0e+00 | 91.49 | pentatricopeptide repeat-containing protein At3g24000, mitochondrial isoform X2 ...

Match Name | E-value | Identity (%) | Description
AT3G24000.1 | 4.5e-220 | 61.64 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G13770.1 | 3.1e-136 | 41.38 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1 | 7.9e-132 | 43.01 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G23330.1 | 3.0e-131 | 39.53 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G27610.1 | 1.2e-127 | 40.04 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
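IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 373..420; e-value: 2.5E-14; score: 53.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 271..319; e-value: 2.3E-12; score: 46.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 171..216; e-value: 3.3E-10; score: 40.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 347..375; e-value: 1.3E-4; score: 19.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 274..307; e-value: 2.9E-5; score: 22.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 173..207; e-value: 6.6E-6; score: 24.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 375..409; e-value: 2.1E-9; score: 35.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 347..372; e-value: 0.0012; score: 18.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 449..470; e-value: 0.23; score: 11.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 145..170; e-value: 0.0011; score: 19.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 518..541; e-value: 0.69; score: 10.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 246..269; e-value: 0.3; score: 11.4
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 408..442; score: 8.944478
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 171..205; score: 12.474017
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 373..407; score: 13.405727
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 140..170; score: 9.371969
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 307..341; score: 8.560833
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 272..306; score: 11.465577
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 546..658; e-value: 1.3E-31; score: 109.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 347..582; e-value: 8.2E-43; score: 148.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 84..219; e-value: 7.9E-27; score: 95.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 224..331; e-value: 3.0E-22; score: 80.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 149..530
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 83..657
None | No IPR available | PANTHER | PTHR47926:SF184 | BNACNNG64210D PROTEIN | coord: 83..657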
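
The `coord` values above give the residue range of each match on the query protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (a common convention for domain annotations, although this page does not state it), the length of a match is the end minus the start plus one. The helper name `span_length` is hypothetical and used only for this illustration.

```python
import re

# Matches an alignment coordinate entry such as "coord: 546..658".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_length(entry):
    """Return the number of residues covered by one 'coord: start..end' entry.

    Assumes 1-based, inclusive coordinates (an assumption, see above).
    """
    match = COORD.search(entry)
    if match is None:
        raise ValueError("no coordinate range found in %r" % entry)
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end precedes start in %r" % entry)
    return end - start + 1


if __name__ == "__main__":
    # DYW_deaminase (PF14432) match listed above.
    print(span_length("coord: 546..658"))  # 113
    # One of the PROSITE PS51375 PPR matches listed above.
    print(span_length("coord: 171..205"))  # 35
```

Under that assumption, most of the PROSITE PPR matches listed here span 35 residues each.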

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C01G011030.2 | Cla97C01G011030.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding