Cla97C03G059270 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C03G059270
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: Cla97Chr03: 8700759 .. 8702186 (-)
RNA-Seq Expression: Cla97C03G059270
Synteny: Cla97C03G059270
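
The Location field above packs the chromosome, the start and end coordinates, and the strand into one string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page), the span can be unpacked as follows. The function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "Cla97Chr03: 8700759 .. 8702186 (-)".
_LOCATION = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand, length.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cla97Chr03: 8700759 .. 8702186 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1428 bp on the minus strand of Cla97Chr03.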
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCAGGTTCAGTCGGAAGTTAGAAAGTCTGATGCTTTAAGCAAGAGAGGAACAATGGGCAGTAAAGCTATGTTTAAATGGGCAAAAACAGTCACACCTGCTCATGTTGAACAACTAATCCGAGCGGAACAAGACATAAACAAGGCACTTCTCATATTCGACTCTGCAACAGCCGAGTATACAAATGGTTTTAAGCACAATCTCAATACTTTTAGGCTCATGATTAGCAAGTTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAGGAGAAAATTGATGTCACTGAGGATATACTTCTCTCCATTTGTAGGGCTTATGGTCGTATCCATAAGCCATTGGATTCCATAAGAGTTTTCCATAAAATGCAGGATTTCCATTGCAAGCCTACAGAGAAGTCTTACATTTCAGTGCTTGCCATTCTTGTGGAAGAGAATCAATTAAGATTAGCTTTTAGATTTTATAGGTATATGAGAAAAGTAGGTATTCCCCCTACTGTAGCTTCTCTTAATGTTCTAATCAAAGCCTTTTGTAAGAATAGTGTAACCATGGAAAAAGCAATACACATTTTTCGTGAAATGTCTAATCATGGGTGTGAACCTGATTCATATACTTATGGAACTTTAATCAATGGATTATGTAGATTCGGAAACATTGTTGAGGCAAAGGAATTATTGCAAGAGATGGAGAAAAAAGGTTGTTCACCTTCTGTCATCACCTATAGTTCACTAATACATGGTCTTTGTCAGCTGAACAATGTGGATGAAGCAATGGGATTACTTGAAGATATGGTGGGCAAGGGTATCGAACCTAATGTGTTCACTTATAGTTCTCTAATGAATGGATTTTGCAAGGTTGGTCATTCTTCACGAGCTAGAGACCTCTTTGAGTTGATGGTCCAAAAACGCTTGAGGCCCAACATGATCAGTTATAGTACATTGATTAATGGACTTTGTAATGAAGGAAAACTAAATGAAGCTTTAGAGATTCTTGACAGAATGAAACTCCAAGGTTTGAAGCCAGATGCCGGGTTGTATGGGAAAATAGTTAATCGCCTCTGTGATGTTTGCAGATTCCAAGAAGCTGCAAACTTCTTGGATGAGATGGTCCTTTGTGGGATCACACCTAATAGAGTAACATGGAGCCTTCATGTCAAGACTCATAATAGAGTAATTCACGGTCTCTGTACTATCAACGATTCAAATCGTGCATTTCAGTTGTATCTTAGTGTCCTGACACGTGGTATTAATATCACTGTTGATACTTTTGATTCTTTGTTAAAATGCTTCTGTAACAAAAGAGATCTTCTTAAAACTTCTAGAATTCTGGATGAGATGGTGATTAATGGATGCATTCCTGAGAGAGAAATGTGGAGTACCATAGTTAATTGTTTTTGTGATGAATGA

mRNA sequence

ATGCTTCAGGTTCAGTCGGAAGTTAGAAAGTCTGATGCTTTAAGCAAGAGAGGAACAATGGGCAGTAAAGCTATGTTTAAATGGGCAAAAACAGTCACACCTGCTCATGTTGAACAACTAATCCGAGCGGAACAAGACATAAACAAGGCACTTCTCATATTCGACTCTGCAACAGCCGAGTATACAAATGGTTTTAAGCACAATCTCAATACTTTTAGGCTCATGATTAGCAAGTTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAGGAGAAAATTGATGTCACTGAGGATATACTTCTCTCCATTTGTAGGGCTTATGGTCGTATCCATAAGCCATTGGATTCCATAAGAGTTTTCCATAAAATGCAGGATTTCCATTGCAAGCCTACAGAGAAGTCTTACATTTCAGTGCTTGCCATTCTTGTGGAAGAGAATCAATTAAGATTAGCTTTTAGATTTTATAGGTATATGAGAAAAGTAGGTATTCCCCCTACTGTAGCTTCTCTTAATGTTCTAATCAAAGCCTTTTGTAAGAATAGTGTAACCATGGAAAAAGCAATACACATTTTTCGTGAAATGTCTAATCATGGGTGTGAACCTGATTCATATACTTATGGAACTTTAATCAATGGATTATGTAGATTCGGAAACATTGTTGAGGCAAAGGAATTATTGCAAGAGATGGAGAAAAAAGGTTGTTCACCTTCTGTCATCACCTATAGTTCACTAATACATGGTCTTTGTCAGCTGAACAATGTGGATGAAGCAATGGGATTACTTGAAGATATGGTGGGCAAGGGTATCGAACCTAATGTGTTCACTTATAGTTCTCTAATGAATGGATTTTGCAAGGTTGGTCATTCTTCACGAGCTAGAGACCTCTTTGAGTTGATGGTCCAAAAACGCTTGAGGCCCAACATGATCAGTTATAGTACATTGATTAATGGACTTTGTAATGAAGGAAAACTAAATGAAGCTTTAGAGATTCTTGACAGAATGAAACTCCAAGGTTTGAAGCCAGATGCCGGGTTGTATGGGAAAATAGTTAATCGCCTCTGTGATGTTTGCAGATTCCAAGAAGCTGCAAACTTCTTGGATGAGATGGTCCTTTGTGGGATCACACCTAATAGAGTAACATGGAGCCTTCATGTCAAGACTCATAATAGAGTAATTCACGGTCTCTGTACTATCAACGATTCAAATCGTGCATTTCAGTTGTATCTTAGTGTCCTGACACGTGGTATTAATATCACTGTTGATACTTTTGATTCTTTGTTAAAATGCTTCTGTAACAAAAGAGATCTTCTTAAAACTTCTAGAATTCTGGATGAGATGGTGATTAATGGATGCATTCCTGAGAGAGAAATGTGGAGTACCATAGTTAATTGTTTTTGTGATGAATGA

Coding sequence (CDS)

ATGCTTCAGGTTCAGTCGGAAGTTAGAAAGTCTGATGCTTTAAGCAAGAGAGGAACAATGGGCAGTAAAGCTATGTTTAAATGGGCAAAAACAGTCACACCTGCTCATGTTGAACAACTAATCCGAGCGGAACAAGACATAAACAAGGCACTTCTCATATTCGACTCTGCAACAGCCGAGTATACAAATGGTTTTAAGCACAATCTCAATACTTTTAGGCTCATGATTAGCAAGTTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAGGAGAAAATTGATGTCACTGAGGATATACTTCTCTCCATTTGTAGGGCTTATGGTCGTATCCATAAGCCATTGGATTCCATAAGAGTTTTCCATAAAATGCAGGATTTCCATTGCAAGCCTACAGAGAAGTCTTACATTTCAGTGCTTGCCATTCTTGTGGAAGAGAATCAATTAAGATTAGCTTTTAGATTTTATAGGTATATGAGAAAAGTAGGTATTCCCCCTACTGTAGCTTCTCTTAATGTTCTAATCAAAGCCTTTTGTAAGAATAGTGTAACCATGGAAAAAGCAATACACATTTTTCGTGAAATGTCTAATCATGGGTGTGAACCTGATTCATATACTTATGGAACTTTAATCAATGGATTATGTAGATTCGGAAACATTGTTGAGGCAAAGGAATTATTGCAAGAGATGGAGAAAAAAGGTTGTTCACCTTCTGTCATCACCTATAGTTCACTAATACATGGTCTTTGTCAGCTGAACAATGTGGATGAAGCAATGGGATTACTTGAAGATATGGTGGGCAAGGGTATCGAACCTAATGTGTTCACTTATAGTTCTCTAATGAATGGATTTTGCAAGGTTGGTCATTCTTCACGAGCTAGAGACCTCTTTGAGTTGATGGTCCAAAAACGCTTGAGGCCCAACATGATCAGTTATAGTACATTGATTAATGGACTTTGTAATGAAGGAAAACTAAATGAAGCTTTAGAGATTCTTGACAGAATGAAACTCCAAGGTTTGAAGCCAGATGCCGGGTTGTATGGGAAAATAGTTAATCGCCTCTGTGATGTTTGCAGATTCCAAGAAGCTGCAAACTTCTTGGATGAGATGGTCCTTTGTGGGATCACACCTAATAGAGTAACATGGAGCCTTCATGTCAAGACTCATAATAGAGTAATTCACGGTCTCTGTACTATCAACGATTCAAATCGTGCATTTCAGTTGTATCTTAGTGTCCTGACACGTGGTATTAATATCACTGTTGATACTTTTGATTCTTTGTTAAAATGCTTCTGTAACAAAAGAGATCTTCTTAAAACTTCTAGAATTCTGGATGAGATGGTGATTAATGGATGCATTCCTGAGAGAGAAATGTGGAGTACCATAGTTAATTGTTTTTGTGATGAATGA

Protein sequence

MLQVQSEVRKSDALSKRGTMGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE
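
The protein sequence above is the conceptual translation of the coding sequence. The sketch below shows one way to reproduce that relationship, assuming the standard nuclear genetic code; the page does not say which translation table was used. The name `translate` is hypothetical, and the two inputs are short fragments copied from the start and end of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, enumerated in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown above.
print(translate("ATGCTTCAGGTTCAGTCGGAAGTT"))
# Last four codons of the CDS, including the TGA stop.
print(translate("TGTGATGAATGA"))
```

The first fragment yields MLQVQSEV and the second yields CDE, matching the start and end of the protein sequence listed above.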
Homology
BLAST of Cla97C03G059270 vs. NCBI nr
Match: XP_038893902.1 (pentatricopeptide repeat-containing protein At5g46100 [Benincasa hispida])

HSP 1 Score: 888.6 bits (2295), Expect = 2.3e-254
Identity = 427/456 (93.64%), Positives = 448/456 (98.25%), Query Frame = 0

Query: 20  MGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKL 79
           MGSKAMFKWAKT+TPAHVEQLIRAEQ+INKALLIFDSATAEYTNGFKH+LNTFRLMISKL
Sbjct: 1   MGSKAMFKWAKTITPAHVEQLIRAEQEINKALLIFDSATAEYTNGFKHDLNTFRLMISKL 60

Query: 80  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTE 139
           VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGR+HKPLDSIRVFHKMQDFHCKPTE
Sbjct: 61  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRVHKPLDSIRVFHKMQDFHCKPTE 120

Query: 140 KSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFR 199
           KSY+SVLAILVEENQL+LAFRFYRYMRKVGIPPTV SLNVLIKAFCKNS TM+KA+H+FR
Sbjct: 121 KSYVSVLAILVEENQLKLAFRFYRYMRKVGIPPTVTSLNVLIKAFCKNSGTMDKAMHVFR 180

Query: 200 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLN 259
           EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEME KGCSPSV+TY+SLIHGLCQLN
Sbjct: 181 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMETKGCSPSVVTYTSLIHGLCQLN 240

Query: 260 NVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYS 319
           NVDEAMGLLEDMVGKGI+PNVFTYSSLM+GFCK GHSSRARDL ELMVQKRLRPN+ISYS
Sbjct: 241 NVDEAMGLLEDMVGKGIKPNVFTYSSLMDGFCKAGHSSRARDLLELMVQKRLRPNVISYS 300

Query: 320 TLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 379
           TLINGLCNEGKL+EALEILDRMKLQGLKPDAGLYGKIV+RLCDVCRFQEAANFLDEMVLC
Sbjct: 301 TLINGLCNEGKLSEALEILDRMKLQGLKPDAGLYGKIVSRLCDVCRFQEAANFLDEMVLC 360

Query: 380 GITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCN 439
           GITPNRVTWSLHVKTHNRVIHGLC+INDSNRAFQLYLSVLTRGINIT DTF SLLKCFC 
Sbjct: 361 GITPNRVTWSLHVKTHNRVIHGLCSINDSNRAFQLYLSVLTRGINITFDTFYSLLKCFCK 420

Query: 440 KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           KRDLLK+SRILDEM+I+GCIPEREMWST+VNCFCDE
Sbjct: 421 KRDLLKSSRILDEMLISGCIPEREMWSTMVNCFCDE 456
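
Each hit reports its identity and positive counts as a fraction followed by a percentage. As a rough sketch, the fractions can be pulled out of the statistics line and the percentages recomputed from them. The function name `parse_hsp_fractions` is hypothetical, and the pattern also tolerates the "Postives" spelling that appears in some exports of this page.

```python
import re

# Matches "Identity = 427/456" and "Positives = 448/456".
# "Posi?tives" also accepts the misspelling "Postives".
_FRACTION = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_fractions(line):
    """Return {'identity': (n, total, pct), 'positives': (n, total, pct)}."""
    result = {}
    for label, num, den in _FRACTION.findall(line):
        key = "identity" if label == "Identity" else "positives"
        num, den = int(num), int(den)
        result[key] = (num, den, round(100 * num / den, 2))
    return result


stats = parse_hsp_fractions(
    "Identity = 427/456 (93.64%), Positives = 448/456 (98.25%), Query Frame = 0"
)
print(stats)
```

For the Benincasa hispida hit above, the recomputed values agree with the percentages printed in the report (93.64% identity, 98.25% positives).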

BLAST of Cla97C03G059270 vs. NCBI nr
Match: XP_023001715.1 (pentatricopeptide repeat-containing protein At5g46100 [Cucurbita maxima])

HSP 1 Score: 875.5 bits (2261), Expect = 2.0e-250
Identity = 420/456 (92.11%), Positives = 442/456 (96.93%), Query Frame = 0

Query: 20  MGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKL 79
           MGSKAMFKWAKTVTPAHVEQLI+AE+DINKALLIFDSATAEYTNGFKH+LNTFRLMI KL
Sbjct: 1   MGSKAMFKWAKTVTPAHVEQLIQAERDINKALLIFDSATAEYTNGFKHDLNTFRLMIRKL 60

Query: 80  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTE 139
           VSANQFRLAETLLDRMKEEK+DVTEDI LSICRAYGRIH+PLDSIRVFHKMQDFHCKPTE
Sbjct: 61  VSANQFRLAETLLDRMKEEKLDVTEDIFLSICRAYGRIHRPLDSIRVFHKMQDFHCKPTE 120

Query: 140 KSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFR 199
           KSYISV AILVEENQL+LAFRFYRYMRKVGIPPTVASLNVLIKA CKNS TM+KA+++FR
Sbjct: 121 KSYISVFAILVEENQLKLAFRFYRYMRKVGIPPTVASLNVLIKALCKNSGTMDKAMNMFR 180

Query: 200 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLN 259
           EMSN GCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSV+TY+S+IHGLCQLN
Sbjct: 181 EMSNQGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVVTYTSMIHGLCQLN 240

Query: 260 NVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYS 319
           NVDEAM LLEDM+ KGIEPNVFTYSSLM+GFCK GHS RARDL ELMVQKRLRPNMISYS
Sbjct: 241 NVDEAMDLLEDMMSKGIEPNVFTYSSLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS 300

Query: 320 TLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 379
           TLINGLC EGK+NEALEILDRMKLQGL PDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC
Sbjct: 301 TLINGLCKEGKVNEALEILDRMKLQGLTPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 360

Query: 380 GITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCN 439
           GITPNRVTWSLHV+THNRVIHGLCT+NDSNRAFQLYLSVLTRGI++TVDTFDSLLKCFCN
Sbjct: 361 GITPNRVTWSLHVRTHNRVIHGLCTVNDSNRAFQLYLSVLTRGISLTVDTFDSLLKCFCN 420

Query: 440 KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           KRDLLK SRILDEMVINGCIPEREMWST+VNCFCD+
Sbjct: 421 KRDLLKVSRILDEMVINGCIPEREMWSTVVNCFCDQ 456
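
In the alignment blocks, the unlabelled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a substitution with a positive score, and a blank marks any other mismatch. The sketch below counts both categories for the last row triple of the Cucurbita maxima hit above. The helper names are hypothetical.

```python
def count_midline(midline):
    """Return (identities, positives) from a BLAST midline string.

    Letters are identical residues; '+' marks positive-scoring substitutions.
    Positives include identities, as in the BLAST summary line.
    """
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative


def count_identical(query, sbjct):
    """Count aligned columns where both rows carry the same residue (gaps excluded)."""
    return sum(q == s and q != "-" for q, s in zip(query, sbjct))


query = "KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE"
midline = "KRDLLK SRILDEMVINGCIPEREMWST+VNCFCD+"
sbjct = "KRDLLKVSRILDEMVINGCIPEREMWSTVVNCFCDQ"

print(count_midline(midline))
print(count_identical(query, sbjct))
```

For this segment, 33 of 36 columns are identical and 35 of 36 are positives. Summing such counts over every row of a hit should reproduce the Identity and Positives fractions in its statistics line.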

BLAST of Cla97C03G059270 vs. NCBI nr
Match: KAG7019600.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 875.2 bits (2260), Expect = 2.6e-250
Identity = 420/456 (92.11%), Positives = 441/456 (96.71%), Query Frame = 0

Query: 20  MGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKL 79
           MGSKAMFKWAKTVTPAHVEQL++AE+DINKALLIFDSATAEYTNGFKH+LNTFRLMI KL
Sbjct: 1   MGSKAMFKWAKTVTPAHVEQLVQAERDINKALLIFDSATAEYTNGFKHDLNTFRLMIRKL 60

Query: 80  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTE 139
           VSANQFRLAETLLDRMKEEK DVTEDI LSICRAYGR+H+PLDSIRVFHKMQDFHCKPTE
Sbjct: 61  VSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRVHRPLDSIRVFHKMQDFHCKPTE 120

Query: 140 KSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFR 199
           KSYISV AILVEENQL+LAFRFYRYMRKVGIPPTVASLNVLIKA CKNS TM+KA+++FR
Sbjct: 121 KSYISVFAILVEENQLKLAFRFYRYMRKVGIPPTVASLNVLIKALCKNSGTMDKAMNMFR 180

Query: 200 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLN 259
           EMSN GCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITY+S+IHGLCQLN
Sbjct: 181 EMSNQGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYTSMIHGLCQLN 240

Query: 260 NVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYS 319
           NVDEAM LLEDM+ KGIEPNVFTYSSLM+GFCK GHS RARDL ELMVQKRLRPNMISYS
Sbjct: 241 NVDEAMDLLEDMMSKGIEPNVFTYSSLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS 300

Query: 320 TLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 379
           TLINGLC EGKLNEALEILDRMKLQGL PDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC
Sbjct: 301 TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 360

Query: 380 GITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCN 439
           GITPNRVTWSLHV+THNRVIHGLCT+NDSNRAFQLYLSVLTRGI++TVDTFDSLLKCFCN
Sbjct: 361 GITPNRVTWSLHVRTHNRVIHGLCTVNDSNRAFQLYLSVLTRGISLTVDTFDSLLKCFCN 420

Query: 440 KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           KRDLLK SRILDEMVINGCIPEREMWST+VNCFCD+
Sbjct: 421 KRDLLKISRILDEMVINGCIPEREMWSTVVNCFCDQ 456

BLAST of Cla97C03G059270 vs. NCBI nr
Match: KAA0057015.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26443.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 874.8 bits (2259), Expect = 3.4e-250
Identity = 424/475 (89.26%), Positives = 451/475 (94.95%), Query Frame = 0

Query: 1   MLQVQSEVRKSDALSKRGTMGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAE 60
           M QVQSEVR+SD+LSKRGTMGSKAMFKWAKTVTPAHV+QLI+AE+DI KAL+IFDSATAE
Sbjct: 1   MFQVQSEVRQSDSLSKRGTMGSKAMFKWAKTVTPAHVQQLIQAERDIKKALIIFDSATAE 60

Query: 61  YTNGFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKP 120
           Y NGFKH++NTF LMISKL+SANQFRLAE LLDRMKEEKIDVTEDILLSICRAYGRIHKP
Sbjct: 61  YANGFKHDINTFSLMISKLISANQFRLAEALLDRMKEEKIDVTEDILLSICRAYGRIHKP 120

Query: 121 LDSIRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVL 180
           LDSIRVFHKM DFHCKPTEKSYISVLAILVEENQL+LAFRFYR MRK+GIPPTV SLNVL
Sbjct: 121 LDSIRVFHKMPDFHCKPTEKSYISVLAILVEENQLKLAFRFYRDMRKMGIPPTVTSLNVL 180

Query: 181 IKAFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKG 240
           IKAFCKNS TM+KA+H+FR MSNHG EPDSYTYGTLINGLCRFGNIVEAKELLQEME KG
Sbjct: 181 IKAFCKNSGTMDKAMHLFRTMSNHGFEPDSYTYGTLINGLCRFGNIVEAKELLQEMETKG 240

Query: 241 CSPSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRAR 300
           CSPSVITY+S+IHGLCQLNNVDEA+ LLEDM  K IEPNVFTYSSLM+GFCK GHSSRAR
Sbjct: 241 CSPSVITYTSIIHGLCQLNNVDEAVRLLEDMKDKNIEPNVFTYSSLMDGFCKAGHSSRAR 300

Query: 301 DLFELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRL 360
           D+  LMVQKRLRPNMISYSTL+NGLCNEGK+NEALEI DRMKLQGLKPDAGLYGKIVNRL
Sbjct: 301 DILGLMVQKRLRPNMISYSTLLNGLCNEGKINEALEIFDRMKLQGLKPDAGLYGKIVNRL 360

Query: 361 CDVCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLT 420
           CDV RFQEAANFLDEMVLCGI PNR+TWSLHV+THNRVIHGLCTINDSNRAFQLYLSVLT
Sbjct: 361 CDVSRFQEAANFLDEMVLCGIKPNRLTWSLHVRTHNRVIHGLCTINDSNRAFQLYLSVLT 420

Query: 421 RGINITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           RGI+ITVDTF+SLLKCFCNKRDL KTSRILDEMVINGCIP+ EMWST+VNCFCDE
Sbjct: 421 RGISITVDTFNSLLKCFCNKRDLPKTSRILDEMVINGCIPQGEMWSTMVNCFCDE 475

BLAST of Cla97C03G059270 vs. NCBI nr
Match: XP_023519166.1 (pentatricopeptide repeat-containing protein At5g46100 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 872.1 bits (2252), Expect = 2.2e-249
Identity = 421/456 (92.32%), Positives = 439/456 (96.27%), Query Frame = 0

Query: 20  MGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKL 79
           MGSKAMFKWAKTVTPAHVEQLI+AE+DINKALLIFDSATAEYTNGFKH+LNTFRLMI KL
Sbjct: 1   MGSKAMFKWAKTVTPAHVEQLIQAERDINKALLIFDSATAEYTNGFKHDLNTFRLMIRKL 60

Query: 80  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTE 139
           VSANQFRLAETLLDRMKEEK DVTEDI LSICRAYGRIH+PLDSIRVFHKMQDFHCKPTE
Sbjct: 61  VSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHRPLDSIRVFHKMQDFHCKPTE 120

Query: 140 KSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFR 199
           KSYISV AILVEENQL LAFRFYRYMRKVGIPPTVASLNVLIKA CKNS TM+KA+++FR
Sbjct: 121 KSYISVFAILVEENQLNLAFRFYRYMRKVGIPPTVASLNVLIKALCKNSGTMDKAMNMFR 180

Query: 200 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLN 259
           EMSN GCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITY+S+IHGLCQLN
Sbjct: 181 EMSNQGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYTSMIHGLCQLN 240

Query: 260 NVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYS 319
           NVDEAM LLEDM+ KGIEPNVFTYSSLM+GFCK GHS RARDL ELMVQKRLRPNMISYS
Sbjct: 241 NVDEAMDLLEDMMSKGIEPNVFTYSSLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS 300

Query: 320 TLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 379
           TLINGLC EGKLNEALEILDRMKLQGL PDAGLYGKIVN LCDVCRFQEAANFLDEMVLC
Sbjct: 301 TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNCLCDVCRFQEAANFLDEMVLC 360

Query: 380 GITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCN 439
           GITPNRVTWSLHV+THNRVIHGLCT+NDSNRAFQLYLSVLTRGI++TVDTFDSLLKCFCN
Sbjct: 361 GITPNRVTWSLHVRTHNRVIHGLCTVNDSNRAFQLYLSVLTRGISLTVDTFDSLLKCFCN 420

Query: 440 KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           KRDLLK SRILDEMVINGCIPEREMWST+VNCFCD+
Sbjct: 421 KRDLLKISRILDEMVINGCIPEREMWSTVVNCFCDQ 456

BLAST of Cla97C03G059270 vs. ExPASy Swiss-Prot
Match: Q9FNL2 (Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX=3702 GN=At5g46100 PE=2 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 1.7e-167
Identity = 273/457 (59.74%), Positives = 362/457 (79.21%), Query Frame = 0

Query: 20  MGSKA-MFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISK 79
           MGSK  MFKW+K +TP+ V +L+RAE+D+ K++ +FDSATAEY NG+ H+ ++F  M+ +
Sbjct: 1   MGSKVMMFKWSKNITPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLR 60

Query: 80  LVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPT 139
           LVSAN+F+ AE L+ RMK E   V+EDILLSICR YGR+H+P DS+RVFHKM+DF C P+
Sbjct: 61  LVSANKFKAAEDLIVRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPS 120

Query: 140 EKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIF 199
           +K+Y++VLAILVEENQL LAF+FY+ MR++G+PPTVASLNVLIKA C+N  T++  + IF
Sbjct: 121 QKAYVTVLAILVEENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIF 180

Query: 200 REMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQL 259
            EM   GC+PDSYTYGTLI+GLCRFG I EAK+L  EM +K C+P+V+TY+SLI+GLC  
Sbjct: 181 LEMPKRGCDPDSYTYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGS 240

Query: 260 NNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISY 319
            NVDEAM  LE+M  KGIEPNVFTYSSLM+G CK G S +A +LFE+M+ +  RPNM++Y
Sbjct: 241 KNVDEAMRYLEEMKSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTY 300

Query: 320 STLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVL 379
           +TLI GLC E K+ EA+E+LDRM LQGLKPDAGLYGK+++  C + +F+EAANFLDEM+L
Sbjct: 301 TTLITGLCKEQKIQEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMIL 360

Query: 380 CGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFC 439
            GITPNR+TW++HVKT N V+ GLC  N  +RAF LYLS+ +RGI++ V+T +SL+KC C
Sbjct: 361 GGITPNRLTWNIHVKTSNEVVRGLCA-NYPSRAFTLYLSMRSRGISVEVETLESLVKCLC 420

Query: 440 NKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
            K +  K  +++DE+V +GCIP +  W  ++    D+
Sbjct: 421 KKGEFQKAVQLVDEIVTDGCIPSKGTWKLLIGHTLDK 456
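
The Swiss-Prot and TrEMBL match lines carry UniProt-style annotations after the protein name: `OS` (organism), `OX` (taxonomy identifier), `GN` (gene name), `PE` (protein existence level) and `SV` (sequence version). A minimal sketch for splitting such a description into fields follows. The function name `parse_uniprot_description` is hypothetical, and the code assumes the tags always appear as `KEY=value` separated by whitespace.

```python
import re


def parse_uniprot_description(text):
    """Split 'Name OS=... OX=... GN=... PE=... SV=...' into a dict.

    The text before the first tag is stored under 'name'.
    """
    # Split on whitespace that is immediately followed by a known tag.
    parts = re.split(r"\s+(?=(?:OS|OX|GN|PE|SV)=)", text.strip())
    fields = {"name": parts[0]}
    for part in parts[1:]:
        key, _, value = part.partition("=")
        fields[key] = value
    return fields


desc = (
    "Pentatricopeptide repeat-containing protein At5g46100 "
    "OS=Arabidopsis thaliana OX=3702 GN=At5g46100 PE=2 SV=1"
)
print(parse_uniprot_description(desc))
```

Applied to the Q9FNL2 description above, this separates the protein name from the organism (Arabidopsis thaliana), taxonomy identifier (3702) and gene name (At5g46100).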

BLAST of Cla97C03G059270 vs. ExPASy Swiss-Prot
Match: Q9CA58 (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana OX=3702 GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 1.1e-62
Identity = 138/441 (31.29%), Positives = 234/441 (53.06%), Query Frame = 0

Query: 34  PAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKLVSANQFRLAETLLD 93
           P HV  +I+ ++D  KAL +F+S   E   GFKH L+T+R +I KL    +F   E +L 
Sbjct: 7   PKHVTAVIKCQKDPMKALEMFNSMRKEV--GFKHTLSTYRSVIEKLGYYGKFEAMEEVLV 66

Query: 94  RMKEEKID-VTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTEKSYISVLAILVEE 153
            M+E   + + E + +   + YGR  K  +++ VF +M  + C+PT  SY +++++LV+ 
Sbjct: 67  DMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSVLVDS 126

Query: 154 NQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDSYT 213
                A + Y  MR  GI P V S  + +K+FCK S     A+ +   MS+ GCE +   
Sbjct: 127 GYFDQAHKVYMRMRDRGITPDVYSFTIRMKSFCKTS-RPHAALRLLNNMSSQGCEMNVVA 186

Query: 214 YGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLEDMV 273
           Y T++ G        E  EL  +M   G S  + T++ L+  LC+  +V E   LL+ ++
Sbjct: 187 YCTVVGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVI 246

Query: 274 GKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYSTLINGLCNEGKLN 333
            +G+ PN+FTY+  + G C+ G    A  +   ++++  +P++I+Y+ LI GLC   K  
Sbjct: 247 KRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQ 306

Query: 334 EALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLCGITPNRVTWSLHV 393
           EA   L +M  +GL+PD+  Y  ++   C     Q A   + + V  G  P++       
Sbjct: 307 EAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQF------ 366

Query: 394 KTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRILDE 453
            T+  +I GLC   ++NRA  L+   L +GI   V  +++L+K   N+  +L+ +++ +E
Sbjct: 367 -TYRSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQLANE 426

Query: 454 MVINGCIPEREMWSTIVNCFC 474
           M   G IPE + ++ +VN  C
Sbjct: 427 MSEKGLIPEVQTFNILVNGLC 437

BLAST of Cla97C03G059270 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 7.3e-62
Identity = 139/503 (27.63%), Positives = 257/503 (51.09%), Query Frame = 0

Query: 32  VTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKLVSANQFRLAETL 91
           +TP  + +L+    +++ ++ +F    ++  NG++H+ + ++++I KL +  +F+  + L
Sbjct: 76  ITPFQLYKLLELPLNVSTSMELFSWTGSQ--NGYRHSFDVYQVLIGKLGANGEFKTIDRL 135

Query: 92  LDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQD-FHCKPTEKSYISVLAILV 151
           L +MK+E I   E + +SI R Y +   P  + R+  +M++ + C+PT KSY  VL ILV
Sbjct: 136 LIQMKDEGIVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILV 195

Query: 152 EENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDS 211
             N  ++A   +  M    IPPT+ +  V++KAFC  +  ++ A+ + R+M+ HGC P+S
Sbjct: 196 SGNCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVN-EIDSALSLLRDMTKHGCVPNS 255

Query: 212 YTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLED 271
             Y TLI+ L +   + EA +LL+EM   GC P   T++ +I GLC+ + ++EA  ++  
Sbjct: 256 VIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNR 315

Query: 272 MVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLF--------------------------- 331
           M+ +G  P+  TY  LMNG CK+G    A+DLF                           
Sbjct: 316 MLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDA 375

Query: 332 -----ELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVN 391
                +++    + P++ +Y++LI G   EG +  ALE+L  M+ +G KP+   Y  +V+
Sbjct: 376 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 435

Query: 392 RLCDVCRFQEAANFLDEMVLCGITPNRVTWSL---------------------------- 451
             C + +  EA N L+EM   G+ PN V ++                             
Sbjct: 436 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKP 495

Query: 452 HVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRIL 474
            V T N +I GLC +++   A  L   +++ G+     T+++L+  F  + ++ +  +++
Sbjct: 496 DVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLV 555

BLAST of Cla97C03G059270 vs. ExPASy Swiss-Prot
Match: O49436 (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX=3702 GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 230.7 bits (587), Expect = 3.4e-59
Identity = 135/421 (32.07%), Positives = 217/421 (51.54%), Query Frame = 0

Query: 58  TAEYTNGFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRI 117
           +A     FK   +T   MI    ++  F   E LL R++ E   + E   + + RAYG+ 
Sbjct: 66  SAPKMGSFKLGDSTLSSMIESYANSGDFDSVEKLLSRIRLENRVIIERSFIVVFRAYGKA 125

Query: 118 HKPLDSIRVFHKMQD-FHCKPTEKSYISVLAILVEENQLRLAFRFYRYM----RKVGIPP 177
           H P  ++ +FH+M D F CK + KS+ SVL +++ E        FY Y+      + I P
Sbjct: 126 HLPDKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISP 185

Query: 178 TVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKEL 237
              S N++IKA CK    +++AI +FR M    C PD YTY TL++GLC+   I EA  L
Sbjct: 186 NGLSFNLVIKALCKLRF-VDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLL 245

Query: 238 LQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCK 297
           L EM+ +GCSPS + Y+ LI GLC+  ++     L+++M  KG  PN  TY++L++G C 
Sbjct: 246 LDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCL 305

Query: 298 VGHSSRARDLFELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGL 357
            G   +A  L E MV  +  PN ++Y TLINGL  + +  +A+ +L  M+ +G   +  +
Sbjct: 306 KGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERGYHLNQHI 365

Query: 358 YGKIVNRLCDVCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAF 417
           Y  +++ L    + +EA +   +M   G  PN V +S+       ++ GLC     N A 
Sbjct: 366 YSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSV-------LVDGLCREGKPNEAK 425

Query: 418 QLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCF 474
           ++   ++  G      T+ SL+K F       +  ++  EM   GC   +  +S +++  
Sbjct: 426 EILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLIDGL 478

BLAST of Cla97C03G059270 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 224.6 bits (571), Expect = 2.4e-57
Identity = 131/411 (31.87%), Positives = 215/411 (52.31%), Query Frame = 0

Query: 64  GFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS 123
           G K +++TF ++I  L  A+Q R A  +L+ M    +   E    ++ + Y        +
Sbjct: 184 GIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGA 243

Query: 124 IRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYM-RKVGIPPTVASLNVLIK 183
           +R+  +M +F C  +  S   ++    +E ++  A  F + M  + G  P   + N L+ 
Sbjct: 244 LRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVN 303

Query: 184 AFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCS 243
             CK +  ++ AI I   M   G +PD YTY ++I+GLC+ G + EA E+L +M  + CS
Sbjct: 304 GLCK-AGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCS 363

Query: 244 PSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDL 303
           P+ +TY++LI  LC+ N V+EA  L   +  KGI P+V T++SL+ G C   +   A +L
Sbjct: 364 PNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMEL 423

Query: 304 FELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCD 363
           FE M  K   P+  +Y+ LI+ LC++GKL+EAL +L +M+L G       Y  +++  C 
Sbjct: 424 FEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCK 483

Query: 364 VCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRG 423
             + +EA    DEM + G++ N V       T+N +I GLC       A QL   ++  G
Sbjct: 484 ANKTREAEEIFDEMEVHGVSRNSV-------TYNTLIDGLCKSRRVEDAAQLMDQMIMEG 543

Query: 424 INITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFC 474
                 T++SLL  FC   D+ K + I+  M  NGC P+   + T+++  C
Sbjct: 544 QKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLC 586

BLAST of Cla97C03G059270 vs. ExPASy TrEMBL
Match: A0A6J1KLZ3 (pentatricopeptide repeat-containing protein At5g46100 OS=Cucurbita maxima OX=3661 GN=LOC111495767 PE=4 SV=1)

HSP 1 Score: 875.5 bits (2261), Expect = 9.7e-251
Identity = 420/456 (92.11%), Positives = 442/456 (96.93%), Query Frame = 0

Query: 20  MGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKL 79
           MGSKAMFKWAKTVTPAHVEQLI+AE+DINKALLIFDSATAEYTNGFKH+LNTFRLMI KL
Sbjct: 1   MGSKAMFKWAKTVTPAHVEQLIQAERDINKALLIFDSATAEYTNGFKHDLNTFRLMIRKL 60

Query: 80  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTE 139
           VSANQFRLAETLLDRMKEEK+DVTEDI LSICRAYGRIH+PLDSIRVFHKMQDFHCKPTE
Sbjct: 61  VSANQFRLAETLLDRMKEEKLDVTEDIFLSICRAYGRIHRPLDSIRVFHKMQDFHCKPTE 120

Query: 140 KSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFR 199
           KSYISV AILVEENQL+LAFRFYRYMRKVGIPPTVASLNVLIKA CKNS TM+KA+++FR
Sbjct: 121 KSYISVFAILVEENQLKLAFRFYRYMRKVGIPPTVASLNVLIKALCKNSGTMDKAMNMFR 180

Query: 200 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLN 259
           EMSN GCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSV+TY+S+IHGLCQLN
Sbjct: 181 EMSNQGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVVTYTSMIHGLCQLN 240

Query: 260 NVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYS 319
           NVDEAM LLEDM+ KGIEPNVFTYSSLM+GFCK GHS RARDL ELMVQKRLRPNMISYS
Sbjct: 241 NVDEAMDLLEDMMSKGIEPNVFTYSSLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS 300

Query: 320 TLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 379
           TLINGLC EGK+NEALEILDRMKLQGL PDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC
Sbjct: 301 TLINGLCKEGKVNEALEILDRMKLQGLTPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 360

Query: 380 GITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCN 439
           GITPNRVTWSLHV+THNRVIHGLCT+NDSNRAFQLYLSVLTRGI++TVDTFDSLLKCFCN
Sbjct: 361 GITPNRVTWSLHVRTHNRVIHGLCTVNDSNRAFQLYLSVLTRGISLTVDTFDSLLKCFCN 420

Query: 440 KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           KRDLLK SRILDEMVINGCIPEREMWST+VNCFCD+
Sbjct: 421 KRDLLKVSRILDEMVINGCIPEREMWSTVVNCFCDQ 456

BLAST of Cla97C03G059270 vs. ExPASy TrEMBL
Match: A0A5D3DS89 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G001080 PE=4 SV=1)

HSP 1 Score: 874.8 bits (2259), Expect = 1.7e-250
Identity = 424/475 (89.26%), Positives = 451/475 (94.95%), Query Frame = 0

Query: 1   MLQVQSEVRKSDALSKRGTMGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAE 60
           M QVQSEVR+SD+LSKRGTMGSKAMFKWAKTVTPAHV+QLI+AE+DI KAL+IFDSATAE
Sbjct: 1   MFQVQSEVRQSDSLSKRGTMGSKAMFKWAKTVTPAHVQQLIQAERDIKKALIIFDSATAE 60

Query: 61  YTNGFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKP 120
           Y NGFKH++NTF LMISKL+SANQFRLAE LLDRMKEEKIDVTEDILLSICRAYGRIHKP
Sbjct: 61  YANGFKHDINTFSLMISKLISANQFRLAEALLDRMKEEKIDVTEDILLSICRAYGRIHKP 120

Query: 121 LDSIRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVL 180
           LDSIRVFHKM DFHCKPTEKSYISVLAILVEENQL+LAFRFYR MRK+GIPPTV SLNVL
Sbjct: 121 LDSIRVFHKMPDFHCKPTEKSYISVLAILVEENQLKLAFRFYRDMRKMGIPPTVTSLNVL 180

Query: 181 IKAFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKG 240
           IKAFCKNS TM+KA+H+FR MSNHG EPDSYTYGTLINGLCRFGNIVEAKELLQEME KG
Sbjct: 181 IKAFCKNSGTMDKAMHLFRTMSNHGFEPDSYTYGTLINGLCRFGNIVEAKELLQEMETKG 240

Query: 241 CSPSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRAR 300
           CSPSVITY+S+IHGLCQLNNVDEA+ LLEDM  K IEPNVFTYSSLM+GFCK GHSSRAR
Sbjct: 241 CSPSVITYTSIIHGLCQLNNVDEAVRLLEDMKDKNIEPNVFTYSSLMDGFCKAGHSSRAR 300

Query: 301 DLFELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRL 360
           D+  LMVQKRLRPNMISYSTL+NGLCNEGK+NEALEI DRMKLQGLKPDAGLYGKIVNRL
Sbjct: 301 DILGLMVQKRLRPNMISYSTLLNGLCNEGKINEALEIFDRMKLQGLKPDAGLYGKIVNRL 360

Query: 361 CDVCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLT 420
           CDV RFQEAANFLDEMVLCGI PNR+TWSLHV+THNRVIHGLCTINDSNRAFQLYLSVLT
Sbjct: 361 CDVSRFQEAANFLDEMVLCGIKPNRLTWSLHVRTHNRVIHGLCTINDSNRAFQLYLSVLT 420

Query: 421 RGINITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           RGI+ITVDTF+SLLKCFCNKRDL KTSRILDEMVINGCIP+ EMWST+VNCFCDE
Sbjct: 421 RGISITVDTFNSLLKCFCNKRDLPKTSRILDEMVINGCIPQGEMWSTMVNCFCDE 475

BLAST of Cla97C03G059270 vs. ExPASy TrEMBL
Match: A0A6J1EKD3 (pentatricopeptide repeat-containing protein At5g46100 OS=Cucurbita moschata OX=3662 GN=LOC111434127 PE=4 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 2.4e-249
Identity = 419/456 (91.89%), Positives = 440/456 (96.49%), Query Frame = 0

Query: 20  MGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKL 79
           MGSKAMFKWAKTVTPAHVEQL++AE+DINKALLIFDSATAEYTNGFKH+LNTFRLMI KL
Sbjct: 1   MGSKAMFKWAKTVTPAHVEQLVQAERDINKALLIFDSATAEYTNGFKHDLNTFRLMIRKL 60

Query: 80  VSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTE 139
           VSANQFRLAETLLDRMKEEK DVTEDI LSICRAYGR+H+PLDSIRVFHKMQDFHCKPTE
Sbjct: 61  VSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRVHRPLDSIRVFHKMQDFHCKPTE 120

Query: 140 KSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFR 199
           KSYISV AILVEENQL+LAFRFYRYMRKVGIPPTVASLNVLIKA CKNS TM+KA+++FR
Sbjct: 121 KSYISVFAILVEENQLKLAFRFYRYMRKVGIPPTVASLNVLIKALCKNSGTMDKAMNMFR 180

Query: 200 EMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLN 259
           EMSN GCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITY+S+IHGLCQLN
Sbjct: 181 EMSNQGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYTSMIHGLCQLN 240

Query: 260 NVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYS 319
           NVDEAM LLEDM+ KGIEPNVFTYSSLM+GFCK GHS RARDL ELMVQKRLRPNMISYS
Sbjct: 241 NVDEAMDLLEDMMSKGIEPNVFTYSSLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS 300

Query: 320 TLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 379
           TLINGLC EGKLNEALEILDRMKLQGL PDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC
Sbjct: 301 TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLC 360

Query: 380 GITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCN 439
           GITPNRVTWSLHV+THNRVIHGLCT+NDSNRAFQLYLSVLTRGI++TVDTFDSLLKCFCN
Sbjct: 361 GITPNRVTWSLHVRTHNRVIHGLCTVNDSNRAFQLYLSVLTRGISLTVDTFDSLLKCFCN 420

Query: 440 KRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           KRDLLK SRILDEMVINGCIPEREMWST+VN FCD+
Sbjct: 421 KRDLLKISRILDEMVINGCIPEREMWSTVVNFFCDQ 456

BLAST of Cla97C03G059270 vs. ExPASy TrEMBL
Match: A0A6J1CDW1 (pentatricopeptide repeat-containing protein At5g46100 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010648 PE=4 SV=1)

HSP 1 Score: 864.4 bits (2232), Expect = 2.2e-247
Identity = 418/475 (88.00%), Positives = 445/475 (93.68%), Query Frame = 0

Query: 1   MLQVQSEVRKSDALSKRGTMGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAE 60
           MLQVQSEV K DAL KRGTMGSKAMFKWAKTVTP+HVEQLI+AE+DINKALLIFDSAT+E
Sbjct: 1   MLQVQSEVGKFDALRKRGTMGSKAMFKWAKTVTPSHVEQLIQAERDINKALLIFDSATSE 60

Query: 61  YTNGFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKP 120
           Y NGFKH+LNTFRLMISKLVSANQFR AETLLDRM EEK DVTEDI L+ICRAYGR+HKP
Sbjct: 61  YANGFKHDLNTFRLMISKLVSANQFRSAETLLDRMNEEKFDVTEDIFLTICRAYGRVHKP 120

Query: 121 LDSIRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVL 180
           LDSIR+FHKM+DF CKPTEKSYI+V AILVEENQL+LA RFYRYMRK+G PPTVASLNVL
Sbjct: 121 LDSIRIFHKMEDFQCKPTEKSYITVFAILVEENQLKLALRFYRYMRKMGFPPTVASLNVL 180

Query: 181 IKAFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKG 240
           IKAFCKNS TM+KA+HI REMSNHGCEPDSYTYGTLINGLC+ G IVEAKELLQEME KG
Sbjct: 181 IKAFCKNSGTMDKAMHILREMSNHGCEPDSYTYGTLINGLCKLGKIVEAKELLQEMETKG 240

Query: 241 CSPSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRAR 300
           CSPSV+TY+SLIHGLCQLNNVDEA+GLLEDM+GKGIEPNVFTYSSLM+GFCK GHSSRAR
Sbjct: 241 CSPSVVTYTSLIHGLCQLNNVDEAVGLLEDMMGKGIEPNVFTYSSLMDGFCKAGHSSRAR 300

Query: 301 DLFELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRL 360
           DL ELMVQKRLRPNMISYSTLINGLC EGKLNEALEILDRMKLQGLKPDAGLYGKIVN L
Sbjct: 301 DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLKPDAGLYGKIVNGL 360

Query: 361 CDVCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLT 420
           CD  RFQEAANFLDEMVL GITPNRVTWSLHV+THNRVI GLCTINDS+RAFQLYLSV T
Sbjct: 361 CDTSRFQEAANFLDEMVLGGITPNRVTWSLHVRTHNRVIDGLCTINDSSRAFQLYLSVQT 420

Query: 421 RGINITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
           RGI+ITVDTFD LLKCFC KRDL KT RILDEMVINGCIP+RE+WST+VNCFCD+
Sbjct: 421 RGISITVDTFDGLLKCFCKKRDLQKTYRILDEMVINGCIPQRELWSTVVNCFCDQ 475
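The percentages in an HSP summary line follow directly from the residue counts it reports. A small Python sketch that re-derives them as a sanity check; the regular expression and function name are illustrative assumptions, not part of any BLAST tool, and labels are captured generically so the exact spelling in an export does not matter:

```python
import re

# Matches fragments such as "Identity = 418/475 (88.00%)".
HSP_FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def check_hsp_percentages(line: str, tol: float = 0.005):
    """Return (label, recomputed percentage, agrees-with-reported) per fraction."""
    results = []
    for label, num, den, pct in HSP_FRACTION.findall(line):
        computed = 100 * int(num) / int(den)
        agrees = abs(computed - float(pct)) <= tol + 1e-9
        results.append((label, computed, agrees))
    return results


# Summary line of the Momordica charantia hit above.
line = "Identity = 418/475 (88.00%), Positives = 445/475 (93.68%), Query Frame = 0"
results = check_hsp_percentages(line)
```

The trailing `Query Frame = 0` field is ignored because it carries no `n/m` fraction.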

BLAST of Cla97C03G059270 vs. ExPASy TrEMBL
Match: A0A0A0LRZ4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G075590 PE=4 SV=1)

HSP 1 Score: 846.3 bits (2185), Expect = 6.3e-242
Identity = 409/465 (87.96%), Positives = 440/465 (94.62%), Query Frame = 0

Query: 4   VQSEVRKSDALSKRGTMGSKAMFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTN 63
           VQSEVR+SD+L+KR TMGSKAMFKWAKTVTP HV+QLI+AE+DI KAL+IFDSATAEY N
Sbjct: 92  VQSEVRQSDSLNKRRTMGSKAMFKWAKTVTPTHVQQLIQAERDIKKALIIFDSATAEYAN 151

Query: 64  GFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS 123
           GFKH+LNTF LMISKL+SANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS
Sbjct: 152 GFKHDLNTFSLMISKLISANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS 211

Query: 124 IRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKA 183
           IRVFHKMQDFHCKPTEKSYISVLAILVEENQL+ AFRFYR MRK+GIPPTV SLNVLIKA
Sbjct: 212 IRVFHKMQDFHCKPTEKSYISVLAILVEENQLKSAFRFYRDMRKMGIPPTVTSLNVLIKA 271

Query: 184 FCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSP 243
           FCKNS TM+KA+H+FR MSNHGCEPDSYTYGTLINGLCRF +IVEAKELLQEME KGCSP
Sbjct: 272 FCKNSGTMDKAMHLFRTMSNHGCEPDSYTYGTLINGLCRFRSIVEAKELLQEMETKGCSP 331

Query: 244 SVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLF 303
           SV+TY+S+IHGLCQLNNVDEAM LLEDM  K IEPNVFTYSSLM+GFCK GHSSRARD+ 
Sbjct: 332 SVVTYTSIIHGLCQLNNVDEAMRLLEDMKDKNIEPNVFTYSSLMDGFCKTGHSSRARDIL 391

Query: 304 ELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDV 363
           ELM+QKRLRPNMISYSTL+NGLCNEGK+NEALEI DRMKLQG KPDAGLYGKIVN LCDV
Sbjct: 392 ELMIQKRLRPNMISYSTLLNGLCNEGKINEALEIFDRMKLQGFKPDAGLYGKIVNCLCDV 451

Query: 364 CRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGI 423
            RFQEAANFLDEMVLCGI PNR+TWSLHV+THNRVIHGLCTIN+SNRAFQLYLSVLTRGI
Sbjct: 452 SRFQEAANFLDEMVLCGIKPNRITWSLHVRTHNRVIHGLCTINNSNRAFQLYLSVLTRGI 511

Query: 424 NITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTI 469
           +ITVDTF+SLLKCFCNK+DL KTSRILDEMVINGCIP+ EMWST+
Sbjct: 512 SITVDTFNSLLKCFCNKKDLPKTSRILDEMVINGCIPQGEMWSTM 556

BLAST of Cla97C03G059270 vs. TAIR 10
Match: AT5G46100.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 590.5 bits (1521), Expect = 1.2e-168
Identity = 273/457 (59.74%), Positives = 362/457 (79.21%), Query Frame = 0

Query: 20  MGSKA-MFKWAKTVTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISK 79
           MGSK  MFKW+K +TP+ V +L+RAE+D+ K++ +FDSATAEY NG+ H+ ++F  M+ +
Sbjct: 1   MGSKVMMFKWSKNITPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLR 60

Query: 80  LVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPT 139
           LVSAN+F+ AE L+ RMK E   V+EDILLSICR YGR+H+P DS+RVFHKM+DF C P+
Sbjct: 61  LVSANKFKAAEDLIVRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPS 120

Query: 140 EKSYISVLAILVEENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIF 199
           +K+Y++VLAILVEENQL LAF+FY+ MR++G+PPTVASLNVLIKA C+N  T++  + IF
Sbjct: 121 QKAYVTVLAILVEENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIF 180

Query: 200 REMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQL 259
            EM   GC+PDSYTYGTLI+GLCRFG I EAK+L  EM +K C+P+V+TY+SLI+GLC  
Sbjct: 181 LEMPKRGCDPDSYTYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGS 240

Query: 260 NNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISY 319
            NVDEAM  LE+M  KGIEPNVFTYSSLM+G CK G S +A +LFE+M+ +  RPNM++Y
Sbjct: 241 KNVDEAMRYLEEMKSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTY 300

Query: 320 STLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVL 379
           +TLI GLC E K+ EA+E+LDRM LQGLKPDAGLYGK+++  C + +F+EAANFLDEM+L
Sbjct: 301 TTLITGLCKEQKIQEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMIL 360

Query: 380 CGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFC 439
            GITPNR+TW++HVKT N V+ GLC  N  +RAF LYLS+ +RGI++ V+T +SL+KC C
Sbjct: 361 GGITPNRLTWNIHVKTSNEVVRGLCA-NYPSRAFTLYLSMRSRGISVEVETLESLVKCLC 420

Query: 440 NKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFCDE 476
            K +  K  +++DE+V +GCIP +  W  ++    D+
Sbjct: 421 KKGEFQKAVQLVDEIVTDGCIPSKGTWKLLIGHTLDK 456

BLAST of Cla97C03G059270 vs. TAIR 10
Match: AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 242.3 bits (617), Expect = 8.0e-64
Identity = 138/441 (31.29%), Positives = 234/441 (53.06%), Query Frame = 0

Query: 34  PAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKLVSANQFRLAETLLD 93
           P HV  +I+ ++D  KAL +F+S   E   GFKH L+T+R +I KL    +F   E +L 
Sbjct: 7   PKHVTAVIKCQKDPMKALEMFNSMRKEV--GFKHTLSTYRSVIEKLGYYGKFEAMEEVLV 66

Query: 94  RMKEEKID-VTEDILLSICRAYGRIHKPLDSIRVFHKMQDFHCKPTEKSYISVLAILVEE 153
            M+E   + + E + +   + YGR  K  +++ VF +M  + C+PT  SY +++++LV+ 
Sbjct: 67  DMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSVLVDS 126

Query: 154 NQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDSYT 213
                A + Y  MR  GI P V S  + +K+FCK S     A+ +   MS+ GCE +   
Sbjct: 127 GYFDQAHKVYMRMRDRGITPDVYSFTIRMKSFCKTS-RPHAALRLLNNMSSQGCEMNVVA 186

Query: 214 YGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLEDMV 273
           Y T++ G        E  EL  +M   G S  + T++ L+  LC+  +V E   LL+ ++
Sbjct: 187 YCTVVGGFYEENFKAEGYELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVI 246

Query: 274 GKGIEPNVFTYSSLMNGFCKVGHSSRARDLFELMVQKRLRPNMISYSTLINGLCNEGKLN 333
            +G+ PN+FTY+  + G C+ G    A  +   ++++  +P++I+Y+ LI GLC   K  
Sbjct: 247 KRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQ 306

Query: 334 EALEILDRMKLQGLKPDAGLYGKIVNRLCDVCRFQEAANFLDEMVLCGITPNRVTWSLHV 393
           EA   L +M  +GL+PD+  Y  ++   C     Q A   + + V  G  P++       
Sbjct: 307 EAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQF------ 366

Query: 394 KTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRILDE 453
            T+  +I GLC   ++NRA  L+   L +GI   V  +++L+K   N+  +L+ +++ +E
Sbjct: 367 -TYRSLIDGLCHEGETNRALALFNEALGKGIKPNVILYNTLIKGLSNQGMILEAAQLANE 426

Query: 454 MVINGCIPEREMWSTIVNCFC 474
           M   G IPE + ++ +VN  C
Sbjct: 427 MSEKGLIPEVQTFNILVNGLC 437

BLAST of Cla97C03G059270 vs. TAIR 10
Match: AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 239.6 bits (610), Expect = 5.2e-63
Identity = 139/503 (27.63%), Positives = 257/503 (51.09%), Query Frame = 0

Query: 32  VTPAHVEQLIRAEQDINKALLIFDSATAEYTNGFKHNLNTFRLMISKLVSANQFRLAETL 91
           +TP  + +L+    +++ ++ +F    ++  NG++H+ + ++++I KL +  +F+  + L
Sbjct: 76  ITPFQLYKLLELPLNVSTSMELFSWTGSQ--NGYRHSFDVYQVLIGKLGANGEFKTIDRL 135

Query: 92  LDRMKEEKIDVTEDILLSICRAYGRIHKPLDSIRVFHKMQD-FHCKPTEKSYISVLAILV 151
           L +MK+E I   E + +SI R Y +   P  + R+  +M++ + C+PT KSY  VL ILV
Sbjct: 136 LIQMKDEGIVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILV 195

Query: 152 EENQLRLAFRFYRYMRKVGIPPTVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDS 211
             N  ++A   +  M    IPPT+ +  V++KAFC  +  ++ A+ + R+M+ HGC P+S
Sbjct: 196 SGNCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVN-EIDSALSLLRDMTKHGCVPNS 255

Query: 212 YTYGTLINGLCRFGNIVEAKELLQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLED 271
             Y TLI+ L +   + EA +LL+EM   GC P   T++ +I GLC+ + ++EA  ++  
Sbjct: 256 VIYQTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNR 315

Query: 272 MVGKGIEPNVFTYSSLMNGFCKVGHSSRARDLF--------------------------- 331
           M+ +G  P+  TY  LMNG CK+G    A+DLF                           
Sbjct: 316 MLIRGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDA 375

Query: 332 -----ELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVN 391
                +++    + P++ +Y++LI G   EG +  ALE+L  M+ +G KP+   Y  +V+
Sbjct: 376 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 435

Query: 392 RLCDVCRFQEAANFLDEMVLCGITPNRVTWSL---------------------------- 451
             C + +  EA N L+EM   G+ PN V ++                             
Sbjct: 436 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKP 495

Query: 452 HVKTHNRVIHGLCTINDSNRAFQLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRIL 474
            V T N +I GLC +++   A  L   +++ G+     T+++L+  F  + ++ +  +++
Sbjct: 496 DVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLV 555

BLAST of Cla97C03G059270 vs. TAIR 10
Match: AT4G20090.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 230.7 bits (587), Expect = 2.4e-60
Identity = 135/421 (32.07%), Positives = 217/421 (51.54%), Query Frame = 0

Query: 58  TAEYTNGFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRI 117
           +A     FK   +T   MI    ++  F   E LL R++ E   + E   + + RAYG+ 
Sbjct: 66  SAPKMGSFKLGDSTLSSMIESYANSGDFDSVEKLLSRIRLENRVIIERSFIVVFRAYGKA 125

Query: 118 HKPLDSIRVFHKMQD-FHCKPTEKSYISVLAILVEENQLRLAFRFYRYM----RKVGIPP 177
           H P  ++ +FH+M D F CK + KS+ SVL +++ E        FY Y+      + I P
Sbjct: 126 HLPDKAVDLFHRMVDEFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISP 185

Query: 178 TVASLNVLIKAFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKEL 237
              S N++IKA CK    +++AI +FR M    C PD YTY TL++GLC+   I EA  L
Sbjct: 186 NGLSFNLVIKALCKLRF-VDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLL 245

Query: 238 LQEMEKKGCSPSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCK 297
           L EM+ +GCSPS + Y+ LI GLC+  ++     L+++M  KG  PN  TY++L++G C 
Sbjct: 246 LDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCL 305

Query: 298 VGHSSRARDLFELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGL 357
            G   +A  L E MV  +  PN ++Y TLINGL  + +  +A+ +L  M+ +G   +  +
Sbjct: 306 KGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERGYHLNQHI 365

Query: 358 YGKIVNRLCDVCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAF 417
           Y  +++ L    + +EA +   +M   G  PN V +S+       ++ GLC     N A 
Sbjct: 366 YSVLISGLFKEGKAEEAMSLWRKMAEKGCKPNIVVYSV-------LVDGLCREGKPNEAK 425

Query: 418 QLYLSVLTRGINITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCF 474
           ++   ++  G      T+ SL+K F       +  ++  EM   GC   +  +S +++  
Sbjct: 426 EILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLIDGL 478

BLAST of Cla97C03G059270 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 224.6 bits (571), Expect = 1.7e-58
Identity = 131/411 (31.87%), Positives = 215/411 (52.31%), Query Frame = 0

Query: 64  GFKHNLNTFRLMISKLVSANQFRLAETLLDRMKEEKIDVTEDILLSICRAYGRIHKPLDS 123
           G K +++TF ++I  L  A+Q R A  +L+ M    +   E    ++ + Y        +
Sbjct: 184 GIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGA 243

Query: 124 IRVFHKMQDFHCKPTEKSYISVLAILVEENQLRLAFRFYRYM-RKVGIPPTVASLNVLIK 183
           +R+  +M +F C  +  S   ++    +E ++  A  F + M  + G  P   + N L+ 
Sbjct: 244 LRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVN 303

Query: 184 AFCKNSVTMEKAIHIFREMSNHGCEPDSYTYGTLINGLCRFGNIVEAKELLQEMEKKGCS 243
             CK +  ++ AI I   M   G +PD YTY ++I+GLC+ G + EA E+L +M  + CS
Sbjct: 304 GLCK-AGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCS 363

Query: 244 PSVITYSSLIHGLCQLNNVDEAMGLLEDMVGKGIEPNVFTYSSLMNGFCKVGHSSRARDL 303
           P+ +TY++LI  LC+ N V+EA  L   +  KGI P+V T++SL+ G C   +   A +L
Sbjct: 364 PNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMEL 423

Query: 304 FELMVQKRLRPNMISYSTLINGLCNEGKLNEALEILDRMKLQGLKPDAGLYGKIVNRLCD 363
           FE M  K   P+  +Y+ LI+ LC++GKL+EAL +L +M+L G       Y  +++  C 
Sbjct: 424 FEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCK 483

Query: 364 VCRFQEAANFLDEMVLCGITPNRVTWSLHVKTHNRVIHGLCTINDSNRAFQLYLSVLTRG 423
             + +EA    DEM + G++ N V       T+N +I GLC       A QL   ++  G
Sbjct: 484 ANKTREAEEIFDEMEVHGVSRNSV-------TYNTLIDGLCKSRRVEDAAQLMDQMIMEG 543

Query: 424 INITVDTFDSLLKCFCNKRDLLKTSRILDEMVINGCIPEREMWSTIVNCFC 474
                 T++SLL  FC   D+ K + I+  M  NGC P+   + T+++  C
Sbjct: 544 QKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLC 586

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038893902.1 | 2.3e-254 | 93.64 | pentatricopeptide repeat-containing protein At5g46100 [Benincasa hispida]
XP_023001715.1 | 2.0e-250 | 92.11 | pentatricopeptide repeat-containing protein At5g46100 [Cucurbita maxima]
KAG7019600.1 | 2.6e-250 | 92.11 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAA0057015.1 | 3.4e-250 | 89.26 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26443...
XP_023519166.1 | 2.2e-249 | 92.32 | pentatricopeptide repeat-containing protein At5g46100 [Cucurbita pepo subsp. pep...
Match Name | E-value | Identity (%) | Description
Q9FNL2 | 1.7e-167 | 59.74 | Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX...
Q9CA58 | 1.1e-62 | 31.29 | Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th...
Q9FMF6 | 7.3e-62 | 27.63 | Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...
O49436 | 3.4e-59 | 32.07 | Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX...
Q9LFF1 | 2.4e-57 | 31.87 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Match Name | E-value | Identity (%) | Description
A0A6J1KLZ3 | 9.7e-251 | 92.11 | pentatricopeptide repeat-containing protein At5g46100 OS=Cucurbita maxima OX=366...
A0A5D3DS89 | 1.7e-250 | 89.26 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1EKD3 | 2.4e-249 | 91.89 | pentatricopeptide repeat-containing protein At5g46100 OS=Cucurbita moschata OX=3...
A0A6J1CDW1 | 2.2e-247 | 88.00 | pentatricopeptide repeat-containing protein At5g46100 isoform X1 OS=Momordica ch...
A0A0A0LRZ4 | 6.3e-242 | 87.96 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G075590 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT5G46100.1 | 1.2e-168 | 59.74 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G74580.1 | 8.0e-64 | 31.29 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G64320.1 | 5.2e-63 | 27.63 | Pentatricopeptide repeat (PPR) superfamily protein
AT4G20090.1 | 2.4e-60 | 32.07 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G53700.1 | 1.7e-58 | 31.87 | Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
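IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 71..102; e-value: 0.0014; score: 16.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 246..280; e-value: 7.5E-10; score: 36.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 316..350; e-value: 2.5E-10; score: 37.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 429..460; e-value: 2.1E-4; score: 19.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 353..384; e-value: 0.0017; score: 16.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 211..245; e-value: 7.0E-11; score: 39.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 176..210; e-value: 1.8E-7; score: 28.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 281..314; e-value: 4.3E-8; score: 30.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 420..473; e-value: 7.6E-4; score: 19.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 142..186; e-value: 5.3E-9; score: 36.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 250..292; e-value: 3.9E-14; score: 52.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 71..99; e-value: 0.052; score: 13.8
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 204..237; e-value: 6.4E-14; score: 51.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1 | coord: 310..342; e-value: 1.4E-12; score: 47.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 173..208; score: 11.421732
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 279..313; score: 12.605553
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 426..460; score: 9.832344
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 244..278; score: 13.076888
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 138..172; score: 9.065053
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 314..348; score: 13.504379
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 349..383; score: 9.119859
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 209..243; score: 14.162057
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 389..475; e-value: 1.2E-13; score: 52.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 17..156; e-value: 2.6E-16; score: 61.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 163..236; e-value: 2.5E-23; score: 84.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 308..378; e-value: 1.2E-16; score: 62.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 237..307; e-value: 5.7E-25; score: 89.9
None | No IPR available | PANTHER | PTHR47933 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 1, MITOCHONDRIAL | coord: 22..474
None | No IPR available | PANTHER | PTHR47933:SF25 | EMP16 | coord: 22..474
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 149..342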
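
The PROSITE PS51375 hits above are listed out of positional order. A short Python sketch that sorts them and merges back-to-back hits, to show how the PPR motifs tile the protein; it assumes the coordinates are 1-based and inclusive, and the function name is hypothetical:

```python
def merge_adjacent(intervals):
    """Merge intervals that overlap or abut (end + 1 == next start)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PROSITE PS51375 (PPR) coordinates, copied from the InterPro table above.
prosite_ppr = [
    (173, 208), (279, 313), (426, 460), (244, 278),
    (138, 172), (314, 348), (349, 383), (209, 243),
]

blocks = merge_adjacent(prosite_ppr)
# Residues covered, assuming inclusive coordinates.
covered = sum(end - start + 1 for start, end in blocks)
```

Under that assumption the eight hits collapse into two blocks: seven consecutive repeats spanning 138..383 and one separate repeat at 426..460.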

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cla97C03G059270.2 | Cla97C03G059270.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding