Cla97C02G040850 (gene) Watermelon (97103) v2

NameCla97C02G040850
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr02 : 28723994 .. 28725448 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCAAGAACTGTCTCCAGATCGAGAGAAGAATCCTCCGCCTCCTTCATGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCATGGCCTCCACCAATCCAACCAAATCCTCTCCCATTTCATCTCCATTTGCGCCGCTTTCAACCACATTGACTATGCCAATCGCCTCTTTTCCCAATCCCATAACCCCAATATCTTCCTCTTCAATTCCATAATCAAAGCTCACTCCCTCTCCCCTCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCTATGAAGAATCACAGGATTGTTCCTGACGAATACACTTTTGCGCCGTTGCTTAAATCCTGCGCGAATCTCTGTGAGTATAGACTTGGTCAGTGTGTGATATCTGAAGTTTTGCGTCGTGGATTTTACTGTTTTGGGTCTATTCGTATTGGGGTGGTTGAGTTGTATGTCTGTTGTGAAAGGATGGAGGATGCGCGTAAGGTGTTTGATGAAATGCCTCACAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTTGCAAGATGGGCAATGTTGATTTTGGGTTGTGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATGATTTCCTGTTTAGCTCAAAATAGACGTGATGTTGAAGCTTTGGAACTCTTTCAACGGATGGAAGAATATGGTTTTAAACCAGATGAAGTTACAGTGGTCACAATGTTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGTTGGACAAAGGATCCATTCTTTTGCAAGTTCGAAGGGAGATTTGGTACATATTACGACGGTTGGGAATTCGCTAGTTGATTTTTACTGTAAATGTGGGAATACAGAAAGGGCTTACAACATTTTTCAGAAAATGACTTGCAAAAGTGTTGTCTCTTGGAATACAATGATTTTGGGCTTTGCTTTAAATGGGAAGGGGGAGTTTGCCATTGACCTTTTTATGGTGATGGGAAGAGAGGATGTGAAGCCTAATGATGCAACATTTGTAGCTGTCTTGACCGCTTGTGTCCATTCGGGATTGTTAGAGAAGGGTCGAGAGATATTTTCTTCAATGGCTGAGAAGTATGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGTGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGAACTCATGGTAACTTGAAACTTGCGGAAATGGCAGTGAAGGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAAGAAGGAAGATGGGAAGATGTTGAGAATGTCAGACGTTGGATGAGAGGAAAGAGCATCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA

mRNA sequence

ATGAGCAAGAACTGTCTCCAGATCGAGAGAAGAATCCTCCGCCTCCTTCATGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCATGGCCTCCACCAATCCAACCAAATCCTCTCCCATTTCATCTCCATTTGCGCCGCTTTCAACCACATTGACTATGCCAATCGCCTCTTTTCCCAATCCCATAACCCCAATATCTTCCTCTTCAATTCCATAATCAAAGCTCACTCCCTCTCCCCTCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCTATGAAGAATCACAGGATTGTTCCTGACGAATACACTTTTGCGCCGTTGCTTAAATCCTGCGCGAATCTCTGTGAGTATAGACTTGGTCAGTGTGTGATATCTGAAGTTTTGCGTCGTGGATTTTACTGTTTTGGGTCTATTCGTATTGGGGTGGTTGAGTTGTATGTCTGTTGTGAAAGGATGGAGGATGCGCGTAAGGTGTTTGATGAAATGCCTCACAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTTGCAAGATGGGCAATGTTGATTTTGGGTTGTGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATGATTTCCTGTTTAGCTCAAAATAGACGTGATGTTGAAGCTTTGGAACTCTTTCAACGGATGGAAGAATATGGTTTTAAACCAGATGAAGTTACAGTGGTCACAATGTTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGTTGGACAAAGGATCCATTCTTTTGCAAGTTCGAAGGGAGATTTGGTACATATTACGACGGTTGGGAATTCGCTAGTTGATTTTTACTGTAAATGTGGGAATACAGAAAGGGCTTACAACATTTTTCAGAAAATGACTTGCAAAAGTGTTGTCTCTTGGAATACAATGATTTTGGGCTTTGCTTTAAATGGGAAGGGGGAGTTTGCCATTGACCTTTTTATGGTGATGGGAAGAGAGGATGTGAAGCCTAATGATGCAACATTTGTAGCTGTCTTGACCGCTTGTGTCCATTCGGGATTGTTAGAGAAGGGTCGAGAGATATTTTCTTCAATGGCTGAGAAGTATGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGTGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGAACTCATGGTAACTTGAAACTTGCGGAAATGGCAGTGAAGGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAAGAAGGAAGATGGGAAGATGTTGAGAATGTCAGACGTTGGATGAGAGGAAAGAGCATCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA

Coding sequence (CDS)

ATGAGCAAGAACTGTCTCCAGATCGAGAGAAGAATCCTCCGCCTCCTTCATGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCATGGCCTCCACCAATCCAACCAAATCCTCTCCCATTTCATCTCCATTTGCGCCGCTTTCAACCACATTGACTATGCCAATCGCCTCTTTTCCCAATCCCATAACCCCAATATCTTCCTCTTCAATTCCATAATCAAAGCTCACTCCCTCTCCCCTCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCTATGAAGAATCACAGGATTGTTCCTGACGAATACACTTTTGCGCCGTTGCTTAAATCCTGCGCGAATCTCTGTGAGTATAGACTTGGTCAGTGTGTGATATCTGAAGTTTTGCGTCGTGGATTTTACTGTTTTGGGTCTATTCGTATTGGGGTGGTTGAGTTGTATGTCTGTTGTGAAAGGATGGAGGATGCGCGTAAGGTGTTTGATGAAATGCCTCACAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTTGCAAGATGGGCAATGTTGATTTTGGGTTGTGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATGATTTCCTGTTTAGCTCAAAATAGACGTGATGTTGAAGCTTTGGAACTCTTTCAACGGATGGAAGAATATGGTTTTAAACCAGATGAAGTTACAGTGGTCACAATGTTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGTTGGACAAAGGATCCATTCTTTTGCAAGTTCGAAGGGAGATTTGGTACATATTACGACGGTTGGGAATTCGCTAGTTGATTTTTACTGTAAATGTGGGAATACAGAAAGGGCTTACAACATTTTTCAGAAAATGACTTGCAAAAGTGTTGTCTCTTGGAATACAATGATTTTGGGCTTTGCTTTAAATGGGAAGGGGGAGTTTGCCATTGACCTTTTTATGGTGATGGGAAGAGAGGATGTGAAGCCTAATGATGCAACATTTGTAGCTGTCTTGACCGCTTGTGTCCATTCGGGATTGTTAGAGAAGGGTCGAGAGATATTTTCTTCAATGGCTGAGAAGTATGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGTGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGAACTCATGGTAACTTGAAACTTGCGGAAATGGCAGTGAAGGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAAGAAGGAAGATGGGAAGATGTTGAGAATGTCAGACGTTGGATGAGAGGAAAGAGCATCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA

Protein sequence

MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
BLAST of Cla97C02G040850 vs. NCBI nr
Match: XP_008453700.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis melo])

HSP 1 Score: 831.6 bits (2147), Expect = 1.3e-237
Identity = 445/484 (91.94%), Postives = 463/484 (95.66%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA
Sbjct: 1   MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIGYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           +RLFSQSHNPNIFLFNSIIKAHSLSPPF QSLLLFS MKNHRIVPD+YTFAPLLKSCANL
Sbjct: 61  DRLFSQSHNPNIFLFNSIIKAHSLSPPFHQSLLLFSLMKNHRIVPDQYTFAPLLKSCANL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           CEY LGQCVISEVL RGFYCFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMI
Sbjct: 121 CEYSLGQCVISEVLHRGFYCFGSIRIGVVELYVCCEKMEDAWKAFDEMSHRDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCKMGNVDFGLC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX KPDEV
Sbjct: 181 RGFCKMGNVDFGLCLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKPDEV 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVVTMLPVCSRLGAL+VGQRIHS+ SSKG+LV  T VGNSL+DFYCKCGN E AYNIFQK
Sbjct: 241 TVVTMLPVCSRLGALEVGQRIHSYTSSKGNLVGTTMVGNSLIDFYCKCGNIESAYNIFQK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +EDVKPNDATFVAVLTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEDVKPNDATFVAVLTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           RE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR
Sbjct: 361 RELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQ
Sbjct: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQ 480

Query: 481 SASG 485
           SASG
Sbjct: 481 SASG 484

BLAST of Cla97C02G040850 vs. NCBI nr
Match: XP_004144815.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis sativus] >KGN60987.1 hypothetical protein Csa_2G033910 [Cucumis sativus])

HSP 1 Score: 822.0 bits (2122), Expect = 1.1e-234
Identity = 442/484 (91.32%), Postives = 464/484 (95.87%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA
Sbjct: 1   MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           +RLFSQSHNPNIFLFNSIIKAHSLS PF QSLLLFSSMKNHRIVPD+YTFAPLLKSCANL
Sbjct: 61  DRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVPDQYTFAPLLKSCANL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           CEY LGQCVISEV RRGFYCFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMI
Sbjct: 121 CEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCK GNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   DEV
Sbjct: 181 RGFCKTGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDEV 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVVTMLPVCSRLGAL+VGQRIHS+ASSKG+LV ITTVGNSL+DFYCKCGN E+AYNIFQK
Sbjct: 241 TVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +E +KPNDATFVAVLTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           RE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAH LIKSMPMQPNATLWGA+LGACR
Sbjct: 361 RELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQ
Sbjct: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQ 480

Query: 481 SASG 485
           SASG
Sbjct: 481 SASG 484

BLAST of Cla97C02G040850 vs. NCBI nr
Match: XP_022156351.1 (pentatricopeptide repeat-containing protein At1g09190 [Momordica charantia])

HSP 1 Score: 763.1 bits (1969), Expect = 5.8e-217
Identity = 417/484 (86.16%), Postives = 444/484 (91.74%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           M+KN  +IER+ILRLLHGHK+R HLTQ HAHFLRHGLHQSNQIL+HFISIC A + + YA
Sbjct: 1   MNKNFREIERKILRLLHGHKTRMHLTQTHAHFLRHGLHQSNQILAHFISICGALDKMPYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
            R+FSQS NPNIFLFNS+IKAHSL  PF+QSLLLFSSMK  RIVPDEYTFAPLLKSC+NL
Sbjct: 61  IRVFSQSQNPNIFLFNSMIKAHSLCGPFEQSLLLFSSMKKQRIVPDEYTFAPLLKSCSNL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           C+Y+LGQCV  EVLRRGF  FGSIRIGVVELYVCCERMEDA+KVFD MPHRDV+VWNLMI
Sbjct: 121 CDYKLGQCVKGEVLRRGFEYFGSIRIGVVELYVCCERMEDAKKVFDAMPHRDVIVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCK GNVD G+ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXF PDEV
Sbjct: 181 RGFCKTGNVDLGIYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFAPDEV 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVVT+LPVCSRLGA  VGQRIH++ASSKG+LV IT VGNSL+DFYCKCGNTE AYNIF K
Sbjct: 241 TVVTILPVCSRLGAPVVGQRIHAYASSKGNLVDITFVGNSLLDFYCKCGNTEGAYNIFNK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNTMILGFALNGKGE A DLFM MGR+DVKPNDATFVA+LTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTMILGFALNGKGELATDLFMEMGRKDVKPNDATFVALLTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           REIFSSM  KY+I PKLEHFGCMVDLLGR G VEEAHNLIKSMPMQPNATLWGALLGACR
Sbjct: 361 REIFSSMMHKYKIVPKLEHFGCMVDLLGRSGLVEEAHNLIKSMPMQPNATLWGALLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAE+AVKELISLEPWNSGNYVLLSNMLA EGRWEDVENVR WMRGKS+ KAPGQ
Sbjct: 421 THGNLKLAELAVKELISLEPWNSGNYVLLSNMLAAEGRWEDVENVRGWMRGKSVTKAPGQ 480

Query: 481 SASG 485
           SA+G
Sbjct: 481 SANG 484

BLAST of Cla97C02G040850 vs. NCBI nr
Match: XP_023538021.1 (pentatricopeptide repeat-containing protein At1g09190 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 747.3 bits (1928), Expect = 3.3e-212
Identity = 415/484 (85.74%), Postives = 437/484 (90.29%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQIL+HFISIC AFN I YA
Sbjct: 1   MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGAFNEIAYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           NR+FSQS NPNIFLFNS+IKAHSLS PFQQSLLLFSSMKN RIVPDEYTFAPLLKSC+NL
Sbjct: 61  NRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNRRIVPDEYTFAPLLKSCSNL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
            +YRLG+CVI EVLRRGF  FGSIRIGVVELYVCCERM+DA+KVFDEMP RDVVVWNLMI
Sbjct: 121 YDYRLGKCVIGEVLRRGFEWFGSIRIGVVELYVCCERMDDAQKVFDEMPQRDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCKMGNVD    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX      
Sbjct: 181 RGFCKMGNVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
                LPVCSRLGALDVGQ IHS+A+SK DLV+ T VGNSL+DFYCK GNTE+AYNIFQK
Sbjct: 241 XXXXXLPVCSRLGALDVGQMIHSYATSKADLVNTTMVGNSLIDFYCKSGNTEKAYNIFQK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNTMILGFALNGKGE AIDLF  MGR D KPNDAT VA+LTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTMILGFALNGKGELAIDLFTEMGRGDAKPNDATLVAILTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           RE+FSSMAEKYEI+PKLEHFGCMVDLLGRGGCVEEAH+LI+SMPMQPNATLWGALLGACR
Sbjct: 361 REVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRWEDVENVRR MRGK++KKAPG+
Sbjct: 421 THGNLKLAEMAANELISLEPSNSGNYVLLSNILAEEGRWEDVENVRRSMRGKNVKKAPGR 480

Query: 481 SASG 485
           SASG
Sbjct: 481 SASG 484

BLAST of Cla97C02G040850 vs. NCBI nr
Match: XP_022965696.1 (pentatricopeptide repeat-containing protein At1g09190 [Cucurbita maxima])

HSP 1 Score: 733.8 bits (1893), Expect = 3.8e-208
Identity = 407/484 (84.09%), Postives = 432/484 (89.26%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQIL+HFISIC  FN I YA
Sbjct: 1   MSKNYRNIERRILRLLSGHKSPTHLTQIHAHFLRHDLHQSNQILAHFISICGGFNEIAYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           NR+FSQS NPNIFLFNS+IKAHSLS PF+QSLLLFSS+KN RIVPDEYTFAPLLKSC+NL
Sbjct: 61  NRVFSQSQNPNIFLFNSMIKAHSLSGPFEQSLLLFSSLKNRRIVPDEYTFAPLLKSCSNL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
            +YRLG+CVI EVLRRGF CFGSIRIGVVELYVCCERM+DA+KVFDEMPH DVVVWNLMI
Sbjct: 121 YDYRLGKCVIGEVLRRGFECFGSIRIGVVELYVCCERMDDAQKVFDEMPHTDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCKMGNVD    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX      
Sbjct: 181 RGFCKMGNVDLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
                    SRLGA+DVGQ IHS+A+SK DLV+ T VGNSL+DFYCK GNTERAYNIFQK
Sbjct: 241 XXXXXXXXXSRLGAIDVGQMIHSYATSKADLVNTTMVGNSLIDFYCKSGNTERAYNIFQK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNTMILGFALNGKGE AIDLFM MG+ D KPND T VA+LTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTMILGFALNGKGELAIDLFMEMGQGDAKPNDVTLVAILTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           +E+FSSMAEKYEI+PKLEHFGCMVDLLGRGGCVEEAH+LI+SMPMQPNATLWGALLGACR
Sbjct: 361 QEVFSSMAEKYEIEPKLEHFGCMVDLLGRGGCVEEAHSLIRSMPMQPNATLWGALLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAEMAV ELISLEP NSGNYVLLSN LAEE RWEDVENVRR MRGK++KKAPG+
Sbjct: 421 THGNLKLAEMAVNELISLEPSNSGNYVLLSNTLAEERRWEDVENVRRSMRGKNVKKAPGR 480

Query: 481 SASG 485
           SASG
Sbjct: 481 SASG 484

BLAST of Cla97C02G040850 vs. TrEMBL
Match: tr|A0A1S3BXN7|A0A1S3BXN7_CUCME (pentatricopeptide repeat-containing protein At1g09190 OS=Cucumis melo OX=3656 GN=LOC103494345 PE=4 SV=1)

HSP 1 Score: 831.6 bits (2147), Expect = 8.8e-238
Identity = 445/484 (91.94%), Postives = 463/484 (95.66%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA
Sbjct: 1   MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIGYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           +RLFSQSHNPNIFLFNSIIKAHSLSPPF QSLLLFS MKNHRIVPD+YTFAPLLKSCANL
Sbjct: 61  DRLFSQSHNPNIFLFNSIIKAHSLSPPFHQSLLLFSLMKNHRIVPDQYTFAPLLKSCANL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           CEY LGQCVISEVL RGFYCFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMI
Sbjct: 121 CEYSLGQCVISEVLHRGFYCFGSIRIGVVELYVCCEKMEDAWKAFDEMSHRDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCKMGNVDFGLC XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX KPDEV
Sbjct: 181 RGFCKMGNVDFGLCLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKPDEV 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVVTMLPVCSRLGAL+VGQRIHS+ SSKG+LV  T VGNSL+DFYCKCGN E AYNIFQK
Sbjct: 241 TVVTMLPVCSRLGALEVGQRIHSYTSSKGNLVGTTMVGNSLIDFYCKCGNIESAYNIFQK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +EDVKPNDATFVAVLTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEDVKPNDATFVAVLTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           RE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR
Sbjct: 361 RELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQ
Sbjct: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQ 480

Query: 481 SASG 485
           SASG
Sbjct: 481 SASG 484

BLAST of Cla97C02G040850 vs. TrEMBL
Match: tr|A0A0A0LJW6|A0A0A0LJW6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G033910 PE=4 SV=1)

HSP 1 Score: 822.0 bits (2122), Expect = 7.0e-235
Identity = 442/484 (91.32%), Postives = 464/484 (95.87%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA
Sbjct: 1   MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           +RLFSQSHNPNIFLFNSIIKAHSLS PF QSLLLFSSMKNHRIVPD+YTFAPLLKSCANL
Sbjct: 61  DRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVPDQYTFAPLLKSCANL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           CEY LGQCVISEV RRGFYCFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMI
Sbjct: 121 CEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCK GNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   DEV
Sbjct: 181 RGFCKTGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDEV 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVVTMLPVCSRLGAL+VGQRIHS+ASSKG+LV ITTVGNSL+DFYCKCGN E+AYNIFQK
Sbjct: 241 TVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +E +KPNDATFVAVLTACVHSGLLEKG
Sbjct: 301 MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           RE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAH LIKSMPMQPNATLWGA+LGACR
Sbjct: 361 RELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQ
Sbjct: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQ 480

Query: 481 SASG 485
           SASG
Sbjct: 481 SASG 484

BLAST of Cla97C02G040850 vs. TrEMBL
Match: tr|A0A2N9IWV2|A0A2N9IWV2_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56717 PE=4 SV=1)

HSP 1 Score: 611.7 bits (1576), Expect = 1.4e-171
Identity = 328/481 (68.19%), Postives = 403/481 (83.78%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSK  L++ERR+LR LHGHK+RT L +IHAHFLRHGL QSNQ+L+HF+S+C + + + YA
Sbjct: 1   MSKASLEVERRVLRFLHGHKTRTRLPEIHAHFLRHGLDQSNQVLAHFVSVCGSLDKMAYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           + +F Q+HNPNI LFNS+IK +SL  PF++SL LFS +++  + P+EYTFAPLLKS + L
Sbjct: 61  DSVFCQTHNPNILLFNSMIKGYSLCGPFEKSLHLFSQLRSRGVRPNEYTFAPLLKSSSGL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           C+ +LGQCV +E++R GF CFGSIRIG+VE YV CERMEDARK+FDEM +RDVVVWN+MI
Sbjct: 121 CDCKLGQCVHAEIIRIGFECFGSIRIGIVEFYVTCERMEDARKMFDEMSYRDVVVWNMMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFC+ G++D GLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX F+ DE 
Sbjct: 181 RGFCETGDIDMGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFELDEA 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVV +LPVC+RLGA+DVGQ IHS+  S+G L ++  VGNSLVDFYCKCGN E A  IF +
Sbjct: 241 TVVIVLPVCARLGAVDVGQWIHSYLGSRGPLRNVIYVGNSLVDFYCKCGNLEVAQRIFNE 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           M  K+VVSWN MI G A NG+GE  ++LF  M  + + PNDATFV +LT CVH+GL+EKG
Sbjct: 301 MPRKNVVSWNVMISGLAFNGEGEVGVELFEEMMNKAMSPNDATFVGILTCCVHAGLVEKG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           +E+F+SMA K++IQPKLEH+GCMVDLLGR GCV EAH LI++MPM+PNA LWGALL ACR
Sbjct: 361 QELFASMAAKHQIQPKLEHYGCMVDLLGRSGCVREAHGLIRNMPMKPNAALWGALLSACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THG+++LAE+AVKELI++EPWNSGNYVLLSN+ AEEGRW++VE VR  M+ K +KK  GQ
Sbjct: 421 THGDIELAELAVKELINIEPWNSGNYVLLSNIYAEEGRWDEVEKVRVLMKEKCVKKVRGQ 480

Query: 481 S 482
           S
Sbjct: 481 S 481

BLAST of Cla97C02G040850 vs. TrEMBL
Match: tr|A0A2I4EMS6|A0A2I4EMS6_9ROSI (pentatricopeptide repeat-containing protein At1g09190 OS=Juglans regia OX=51240 GN=LOC108991010 PE=4 SV=1)

HSP 1 Score: 608.2 bits (1567), Expect = 1.6e-170
Identity = 331/481 (68.81%), Postives = 396/481 (82.33%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           MSK   + ERR+LRLLHGHK+RT + QIHAHFLRHGL QSNQ+L+HF+S+C + + + YA
Sbjct: 1   MSKAYREAERRVLRLLHGHKTRTQIPQIHAHFLRHGLDQSNQVLAHFVSVCGSLDKMAYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           NR+F Q+HNPNIFLFNS+IK +SL  PF+QSL LFS +K+  I PDEYTFAPLLKSC+ L
Sbjct: 61  NRIFLQTHNPNIFLFNSMIKGYSLCGPFEQSLHLFSLLKSRSIRPDEYTFAPLLKSCSGL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           C +++G+CV ++++R GF CFGSIRIG++E YV CERM+DA KVF+ M +RDVVVWNLMI
Sbjct: 121 CGFKIGKCVHADIIRVGFECFGSIRIGIIEFYVTCERMDDATKVFNAMSYRDVVVWNLMI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCKMGNV     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  +PDE 
Sbjct: 181 RGFCKMGNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLEPDEA 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVV +LPVC+RLGA DVGQ IHS + S+G L  + +VGNSLVDFYCKCGN E A+NIF +
Sbjct: 241 TVVIVLPVCARLGAFDVGQWIHSHSGSRGLLHDVISVGNSLVDFYCKCGNLEIAWNIFNE 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           M  K+VVSWN MI G A NGKGE  ++LF  M  + + PN ATFV  L  C H+GL+E+G
Sbjct: 301 MPVKNVVSWNAMISGLAFNGKGETGVNLFEEMINKGMSPNGATFVGALACCAHTGLVERG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           RE+F+SM   ++I PKLEH+GCMVDLLGR GCV EAH LI+SM M+PNA LWGALLGACR
Sbjct: 361 RELFASMTANHQIHPKLEHYGCMVDLLGRSGCVGEAHGLIRSMAMKPNAALWGALLGACR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           THG+L+LAE+AVKELI+LEPWNSGNYVLLSN+ AEEGRW++VE VR  M+ K +KKAPGQ
Sbjct: 421 THGDLELAELAVKELINLEPWNSGNYVLLSNIYAEEGRWDEVEKVRVLMKEKCVKKAPGQ 480

Query: 481 S 482
           S
Sbjct: 481 S 481

BLAST of Cla97C02G040850 vs. TrEMBL
Match: tr|A0A2P5EVF3|A0A2P5EVF3_9ROSA (Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_146390 PE=4 SV=1)

HSP 1 Score: 594.7 bits (1532), Expect = 1.8e-166
Identity = 326/484 (67.36%), Postives = 397/484 (82.02%), Query Frame = 0

Query: 1   MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYA 60
           M K C   ERR+LRLLHGHK+R  L QIHAHFLRHGLHQSNQ+L+HF+S+C + N + YA
Sbjct: 1   MGKACRDAERRVLRLLHGHKTRKQLPQIHAHFLRHGLHQSNQVLAHFVSVCWSLNRMGYA 60

Query: 61  NRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANL 120
           NR+F Q+ NPN+ LFNS+IK +SL  PF+QSL LFSSMK+  I PDEYTFAPLLK+C+NL
Sbjct: 61  NRVFRQARNPNMILFNSMIKGYSLCGPFEQSLHLFSSMKSRAIRPDEYTFAPLLKACSNL 120

Query: 121 CEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMI 180
           CE R+GQCV S+VLR GF  FGSI+IG+VEL V C RM DA+K+FDEMPHRDV+VWNL+I
Sbjct: 121 CELRMGQCVHSQVLRSGFELFGSIQIGIVELCVTCLRMGDAKKMFDEMPHRDVIVWNLLI 180

Query: 181 RGFCKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEV 240
           RGFCK GNVD GL XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXF+PDE 
Sbjct: 181 RGFCKTGNVDMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFEPDEA 240

Query: 241 TVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK 300
           TVVT+LPVC+RLG +DVGQ IHS+  SKG L  + +VGN+LVDFYCK G+ E A +IF++
Sbjct: 241 TVVTVLPVCARLGVVDVGQWIHSYTDSKGLLQEVVSVGNALVDFYCKSGSLELASSIFKQ 300

Query: 301 MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKG 360
           M  K VVSWN MI G A NGKG+  ++LF  M      PN+ATF+ VL  C H+GL+E+G
Sbjct: 301 MPQKDVVSWNVMISGLAFNGKGQHGVELFEEMVDRGTNPNNATFIGVLACCAHAGLVERG 360

Query: 361 REIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR 420
           R +F+SM+  + I+PKLEH+GCMVD+LGR G ++EAH+LI+SMP++PNA LWG+LL +CR
Sbjct: 361 RGLFASMSLSHRIKPKLEHYGCMVDILGRSGHMKEAHDLIRSMPIKPNAALWGSLLSSCR 420

Query: 421 THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQ 480
           T+G+L+LAE+A+ ELI+LEPWNSGNYVLLSN+ AEEGRW+ V+ VR  MR K + K PG+
Sbjct: 421 TYGDLELAEIALHELINLEPWNSGNYVLLSNIYAEEGRWDKVDKVRVLMREKCVIKGPGR 480

Query: 481 SASG 485
           SA G
Sbjct: 481 SALG 484

BLAST of Cla97C02G040850 vs. Swiss-Prot
Match: sp|O80488|PPR23_ARATH (Pentatricopeptide repeat-containing protein At1g09190 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E70 PE=2 SV=1)

HSP 1 Score: 495.0 bits (1273), Expect = 9.7e-139
Identity = 286/477 (59.96%), Postives = 361/477 (75.68%), Query Frame = 0

Query: 6   LQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFS 65
           ++IER++LRLLHGH +RT L +IHAH LRH LH SN +L+HFISIC + ++ DYANR+FS
Sbjct: 1   MEIERKLLRLLHGHNTRTRLPEIHAHLLRHFLHGSNLLLAHFISICGSLSNSDYANRVFS 60

Query: 66  QSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRL 125
              NPN+ +FN++IK +SL  P  +SL  FSSMK+  I  DEYT+APLLKSC++L + R 
Sbjct: 61  HIQNPNVLVFNAMIKCYSLVGPPLESLSFFSSMKSRGIWADEYTYAPLLKSCSSLSDLRF 120

Query: 126 GQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCK 185
           G+CV  E++R GF+  G IRIGVVELY    RM DA+KVFDEM  R+VVVWNLMIRGFC 
Sbjct: 121 GKCVHGELIRTGFHRLGKIRIGVVELYTSGGRMGDAQKVFDEMSERNVVVWNLMIRGFCD 180

Query: 186 MGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTM 245
            G+V+    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX F PDE TVVT+
Sbjct: 181 SGDVEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFDPDEATVVTV 240

Query: 246 LPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS 305
           LP+ + LG LD G+ IHS A S G      TVGN+LVDFYCK G+ E A  IF+KM  ++
Sbjct: 241 LPISASLGVLDTGKWIHSTAESSGLFKDFITVGNALVDFYCKSGDLEAATAIFRKMQRRN 300

Query: 306 VVSWNTMILGFALNGKGEFAIDLFMVMGRE-DVKPNDATFVAVLTACVHSGLLEKGREIF 365
           VVSWNT+I G A+NGKGEF IDLF  M  E  V PN+ATF+ VL  C ++G +E+G E+F
Sbjct: 301 VVSWNTLISGSAVNGKGEFGIDLFDAMIEEGKVAPNEATFLGVLACCSYTGQVERGEELF 360

Query: 366 SSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGN 425
             M E+++++ + EH+G MVDL+ R G + EA   +K+MP+  NA +WG+LL ACR+HG+
Sbjct: 361 GLMMERFKLEARTEHYGAMVDLMSRSGRITEAFKFLKNMPVNANAAMWGSLLSACRSHGD 420

Query: 426 LKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS 482
           +KLAE+A  EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K+ GQS
Sbjct: 421 VKLAEVAAMELVKIEPGNSGNYVLLSNLYAEEGRWQDVEKVRTLMKKNRLRKSTGQS 477

BLAST of Cla97C02G040850 vs. Swiss-Prot
Match: sp|Q9SIL5|PP165_ARATH (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 4.0e-84
Identity = 198/476 (41.60%), Postives = 300/476 (63.03%), Query Frame = 0

Query: 7   QIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQ 66
           ++E   +  L   KSR    +I+A  + HGL QS+ +++  +  C     +DYA RLF+Q
Sbjct: 8   EVENYFIPFLQRVKSRNEWKKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQ 67

Query: 67  SHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRI-VPDEYTFAPLLKSCANLCEYRL 126
             NPN+FL+NSII+A++ +  +   + ++  +      +PD +TF  + KSCA+L    L
Sbjct: 68  VSNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYL 127

Query: 127 GQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCK 186
           G+ V   + + G          ++++Y+  + + DA KVFDEM  RD             
Sbjct: 128 GKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDXXXXXXXXXXXXX 187

Query: 187 MGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTM 246
                    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX       +PDE++++++
Sbjct: 188 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEMQLAGIEPDEISLISV 247

Query: 247 LPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS 306
           LP C++LG+L++G+ IH +A  +G L   T V N+L++ Y KCG   +A  +F +M  K 
Sbjct: 248 LPSCAQLGSLELGKWIHLYAERRGFLKQ-TGVCNALIEMYSKCGVISQAIQLFGQMEGKD 307

Query: 307 VVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFS 366
           V+SW+TMI G+A +G    AI+ F  M R  VKPN  TF+ +L+AC H G+ ++G   F 
Sbjct: 308 VISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYFD 367

Query: 367 SMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGNL 426
            M + Y+I+PK+EH+GC++D+L R G +E A  + K+MPM+P++ +WG+LL +CRT GNL
Sbjct: 368 MMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGNL 427

Query: 427 KLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS 482
            +A +A+  L+ LEP + GNYVLL+N+ A+ G+WEDV  +R+ +R +++KK PG S
Sbjct: 428 DVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGS 482

BLAST of Cla97C02G040850 vs. Swiss-Prot
Match: sp|Q9FMA1|PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 8.9e-84
Identity = 188/472 (39.83%), Postives = 290/472 (61.44%), Query Frame = 0

Query: 16  LHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLF 75
           +HG+  +T L Q H + +  GL++ N  ++ FI  C+   H+ YA  +F+    PN +L 
Sbjct: 23  IHGNNLKT-LKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRYAYSVFTHQPCPNTYLH 82

Query: 76  NSIIKAHS-LSPPFQQSLLLFSSMKNHRIV--PDEYTFAPLLKSCANLCEYRLGQCVISE 135
           N++I+A S L  P   S+ +    K   +   PD +TF  +LK    + +   G+ +  +
Sbjct: 83  NTMIRALSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQ 142

Query: 136 VLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVD-- 195
           V+  GF     +  G++++Y  C  + DARK+FDEM  +DV VWN ++ G+ K+G +D  
Sbjct: 143 VVVFGFDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEA 202

Query: 196 FGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTMLPVCS 255
             L            XXXXXXXXXXXXXXXXXXXXX         +PDEVT++ +L  C+
Sbjct: 203 RSLLEMMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMENVEPDEVTLLAVLSACA 262

Query: 256 RLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWN 315
            LG+L++G+RI S+   +G +    ++ N+++D Y K GN  +A ++F+ +  ++VV+W 
Sbjct: 263 DLGSLELGERICSYVDHRG-MNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWT 322

Query: 316 TMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEK 375
           T+I G A +G G  A+ +F  M +  V+PND TF+A+L+AC H G ++ G+ +F+SM  K
Sbjct: 323 TIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSK 382

Query: 376 YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGNLKLAEM 435
           Y I P +EH+GCM+DLLGR G + EA  +IKSMP + NA +WG+LL A   H +L+L E 
Sbjct: 383 YGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGER 442

Query: 436 AVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSA 483
           A+ ELI LEP NSGNY+LL+N+ +  GRW++   +R  M+G  +KK  G+S+
Sbjct: 443 ALSELIKLEPNNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESS 492

BLAST of Cla97C02G040850 vs. Swiss-Prot
Match: sp|Q9FI80|PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 3.6e-77
Identity = 192/478 (40.17%), Postives = 296/478 (61.92%), Query Frame = 0

Query: 25  LTQIHAHFLRHGLHQSNQILSHFISICAA----FNHIDYANRLFSQSHNPNIFLFNSIIK 84
           L+QIHA F++ G  +     +  +  CA        +DYA+++F+Q    N F +N+II+
Sbjct: 39  LSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIR 98

Query: 85  AHSLSPPFQQSL---LLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRG 144
             S S   +  +   L +  M +  + P+ +TF  +LK+CA   + + G+ +    L+ G
Sbjct: 99  GFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYG 158

Query: 145 FYCFGSIRIGVVELYVCCERMEDARKVFDE---------MPHR-----DVVVWNLMIRGF 204
           F     +   +V +YV C  M+DAR +F +         M  R     ++V+WN+MI G+
Sbjct: 159 FGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGY 218

Query: 205 CKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVV 264
            ++G+         XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +P+ VT+V
Sbjct: 219 MRLGDCKAARMLFDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDIRPNYVTLV 278

Query: 265 TMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTC 324
           ++LP  SRLG+L++G+ +H +A   G  +    +G++L+D Y KCG  E+A ++F+++  
Sbjct: 279 SVLPAISRLGSLELGEWLHLYAEDSGIRID-DVLGSALIDMYSKCGIIEKAIHVFERLPR 338

Query: 325 KSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREI 384
           ++V++W+ MI GFA++G+   AID F  M +  V+P+D  ++ +LTAC H GL+E+GR  
Sbjct: 339 ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRY 398

Query: 385 FSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHG 444
           FS M     ++P++EH+GCMVDLLGR G ++EA   I +MP++P+  +W ALLGACR  G
Sbjct: 399 FSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQG 458

Query: 445 NLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS 482
           N+++ +     L+ + P +SG YV LSNM A +G W +V  +R  M+ K I+K PG S
Sbjct: 459 NVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCS 515

BLAST of Cla97C02G040850 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 1.8e-76
Identity = 172/575 (29.91%), Postives = 266/575 (46.26%), Query Frame = 0

Query: 13  LRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHID---YANRLFSQSHN 72
           L LLH  K+   L  IHA  ++ GLH +N  LS  I  C    H +   YA  +F     
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 73  PNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRLGQCV 132
           PN+ ++N++ + H+LS     +L L+  M +  ++P+ YTF  +LKSCA    ++ GQ +
Sbjct: 97  PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 156

Query: 133 ISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPH------------------- 192
              VL+ G      +   ++ +YV   R+EDA KVFD+ PH                   
Sbjct: 157 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRXXXXXXXXXXXXXXXXXX 216

Query: 193 ------------------------------------------------------------ 252
                                                                       
Sbjct: 217 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTNVRPDESTMVTVVSAC 276

Query: 253 ---------RDVVVW-------------NLMIRGFCKMGNVDFGLCXXXXXXXXXXXXXX 312
                    R V +W             N +I  + K G ++                  
Sbjct: 277 AQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWN 336

Query: 313 XXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSK 372
                                       P++VT++++LP C+ LGA+D+G+ IH +   +
Sbjct: 337 TLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKR 396

Query: 373 -GDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNGKGEFAID 432
              + + +++  SL+D Y KCG+ E A+ +F  +  KS+ SWN MI GFA++G+ + + D
Sbjct: 397 LKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFD 456

Query: 433 LFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLL 483
           LF  M +  ++P+D TFV +L+AC HSG+L+ GR IF +M + Y++ PKLEH+GCM+DLL
Sbjct: 457 LFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLL 516

BLAST of Cla97C02G040850 vs. TAIR10
Match: AT1G09190.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 495.0 bits (1273), Expect = 5.4e-140
Identity = 286/477 (59.96%), Postives = 361/477 (75.68%), Query Frame = 0

Query: 6   LQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFS 65
           ++IER++LRLLHGH +RT L +IHAH LRH LH SN +L+HFISIC + ++ DYANR+FS
Sbjct: 1   MEIERKLLRLLHGHNTRTRLPEIHAHLLRHFLHGSNLLLAHFISICGSLSNSDYANRVFS 60

Query: 66  QSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRL 125
              NPN+ +FN++IK +SL  P  +SL  FSSMK+  I  DEYT+APLLKSC++L + R 
Sbjct: 61  HIQNPNVLVFNAMIKCYSLVGPPLESLSFFSSMKSRGIWADEYTYAPLLKSCSSLSDLRF 120

Query: 126 GQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCK 185
           G+CV  E++R GF+  G IRIGVVELY    RM DA+KVFDEM  R+VVVWNLMIRGFC 
Sbjct: 121 GKCVHGELIRTGFHRLGKIRIGVVELYTSGGRMGDAQKVFDEMSERNVVVWNLMIRGFCD 180

Query: 186 MGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTM 245
            G+V+    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX F PDE TVVT+
Sbjct: 181 SGDVEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFDPDEATVVTV 240

Query: 246 LPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS 305
           LP+ + LG LD G+ IHS A S G      TVGN+LVDFYCK G+ E A  IF+KM  ++
Sbjct: 241 LPISASLGVLDTGKWIHSTAESSGLFKDFITVGNALVDFYCKSGDLEAATAIFRKMQRRN 300

Query: 306 VVSWNTMILGFALNGKGEFAIDLFMVMGRE-DVKPNDATFVAVLTACVHSGLLEKGREIF 365
           VVSWNT+I G A+NGKGEF IDLF  M  E  V PN+ATF+ VL  C ++G +E+G E+F
Sbjct: 301 VVSWNTLISGSAVNGKGEFGIDLFDAMIEEGKVAPNEATFLGVLACCSYTGQVERGEELF 360

Query: 366 SSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGN 425
             M E+++++ + EH+G MVDL+ R G + EA   +K+MP+  NA +WG+LL ACR+HG+
Sbjct: 361 GLMMERFKLEARTEHYGAMVDLMSRSGRITEAFKFLKNMPVNANAAMWGSLLSACRSHGD 420

Query: 426 LKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS 482
           +KLAE+A  EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K+ GQS
Sbjct: 421 VKLAEVAAMELVKIEPGNSGNYVLLSNLYAEEGRWQDVEKVRTLMKKNRLRKSTGQS 477

BLAST of Cla97C02G040850 vs. TAIR10
Match: AT2G20540.1 (mitochondrial editing factor 21)

HSP 1 Score: 313.5 bits (802), Expect = 2.2e-85
Identity = 198/476 (41.60%), Postives = 300/476 (63.03%), Query Frame = 0

Query: 7   QIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQ 66
           ++E   +  L   KSR    +I+A  + HGL QS+ +++  +  C     +DYA RLF+Q
Sbjct: 8   EVENYFIPFLQRVKSRNEWKKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQ 67

Query: 67  SHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRI-VPDEYTFAPLLKSCANLCEYRL 126
             NPN+FL+NSII+A++ +  +   + ++  +      +PD +TF  + KSCA+L    L
Sbjct: 68  VSNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYL 127

Query: 127 GQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCK 186
           G+ V   + + G          ++++Y+  + + DA KVFDEM  RD             
Sbjct: 128 GKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDXXXXXXXXXXXXX 187

Query: 187 MGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTM 246
                    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX       +PDE++++++
Sbjct: 188 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEMQLAGIEPDEISLISV 247

Query: 247 LPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS 306
           LP C++LG+L++G+ IH +A  +G L   T V N+L++ Y KCG   +A  +F +M  K 
Sbjct: 248 LPSCAQLGSLELGKWIHLYAERRGFLKQ-TGVCNALIEMYSKCGVISQAIQLFGQMEGKD 307

Query: 307 VVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFS 366
           V+SW+TMI G+A +G    AI+ F  M R  VKPN  TF+ +L+AC H G+ ++G   F 
Sbjct: 308 VISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYFD 367

Query: 367 SMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGNL 426
            M + Y+I+PK+EH+GC++D+L R G +E A  + K+MPM+P++ +WG+LL +CRT GNL
Sbjct: 368 MMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGNL 427

Query: 427 KLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS 482
            +A +A+  L+ LEP + GNYVLL+N+ A+ G+WEDV  +R+ +R +++KK PG S
Sbjct: 428 DVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGS 482

BLAST of Cla97C02G040850 vs. TAIR10
Match: AT5G56310.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 312.4 bits (799), Expect = 4.9e-85
Identity = 188/472 (39.83%), Postives = 290/472 (61.44%), Query Frame = 0

Query: 16  LHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLF 75
           +HG+  +T L Q H + +  GL++ N  ++ FI  C+   H+ YA  +F+    PN +L 
Sbjct: 23  IHGNNLKT-LKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRYAYSVFTHQPCPNTYLH 82

Query: 76  NSIIKAHS-LSPPFQQSLLLFSSMKNHRIV--PDEYTFAPLLKSCANLCEYRLGQCVISE 135
           N++I+A S L  P   S+ +    K   +   PD +TF  +LK    + +   G+ +  +
Sbjct: 83  NTMIRALSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQ 142

Query: 136 VLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVD-- 195
           V+  GF     +  G++++Y  C  + DARK+FDEM  +DV VWN ++ G+ K+G +D  
Sbjct: 143 VVVFGFDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEA 202

Query: 196 FGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTMLPVCS 255
             L            XXXXXXXXXXXXXXXXXXXXX         +PDEVT++ +L  C+
Sbjct: 203 RSLLEMMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMENVEPDEVTLLAVLSACA 262

Query: 256 RLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWN 315
            LG+L++G+RI S+   +G +    ++ N+++D Y K GN  +A ++F+ +  ++VV+W 
Sbjct: 263 DLGSLELGERICSYVDHRG-MNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWT 322

Query: 316 TMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEK 375
           T+I G A +G G  A+ +F  M +  V+PND TF+A+L+AC H G ++ G+ +F+SM  K
Sbjct: 323 TIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSK 382

Query: 376 YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGNLKLAEM 435
           Y I P +EH+GCM+DLLGR G + EA  +IKSMP + NA +WG+LL A   H +L+L E 
Sbjct: 383 YGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGER 442

Query: 436 AVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSA 483
           A+ ELI LEP NSGNY+LL+N+ +  GRW++   +R  M+G  +KK  G+S+
Sbjct: 443 ALSELIKLEPNNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESS 492

BLAST of Cla97C02G040850 vs. TAIR10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 290.4 bits (742), Expect = 2.0e-78
Identity = 192/478 (40.17%), Postives = 296/478 (61.92%), Query Frame = 0

Query: 25  LTQIHAHFLRHGLHQSNQILSHFISICAA----FNHIDYANRLFSQSHNPNIFLFNSIIK 84
           L+QIHA F++ G  +     +  +  CA        +DYA+++F+Q    N F +N+II+
Sbjct: 39  LSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIR 98

Query: 85  AHSLSPPFQQSL---LLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRG 144
             S S   +  +   L +  M +  + P+ +TF  +LK+CA   + + G+ +    L+ G
Sbjct: 99  GFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYG 158

Query: 145 FYCFGSIRIGVVELYVCCERMEDARKVFDE---------MPHR-----DVVVWNLMIRGF 204
           F     +   +V +YV C  M+DAR +F +         M  R     ++V+WN+MI G+
Sbjct: 159 FGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGY 218

Query: 205 CKMGNVDFGLCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVV 264
            ++G+         XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +P+ VT+V
Sbjct: 219 MRLGDCKAARMLFDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDIRPNYVTLV 278

Query: 265 TMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTC 324
           ++LP  SRLG+L++G+ +H +A   G  +    +G++L+D Y KCG  E+A ++F+++  
Sbjct: 279 SVLPAISRLGSLELGEWLHLYAEDSGIRID-DVLGSALIDMYSKCGIIEKAIHVFERLPR 338

Query: 325 KSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREI 384
           ++V++W+ MI GFA++G+   AID F  M +  V+P+D  ++ +LTAC H GL+E+GR  
Sbjct: 339 ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRY 398

Query: 385 FSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHG 444
           FS M     ++P++EH+GCMVDLLGR G ++EA   I +MP++P+  +W ALLGACR  G
Sbjct: 399 FSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQG 458

Query: 445 NLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS 482
           N+++ +     L+ + P +SG YV LSNM A +G W +V  +R  M+ K I+K PG S
Sbjct: 459 NVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCS 515

BLAST of Cla97C02G040850 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 288.1 bits (736), Expect = 1.0e-77
Identity = 172/575 (29.91%), Postives = 266/575 (46.26%), Query Frame = 0

Query: 13  LRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHID---YANRLFSQSHN 72
           L LLH  K+   L  IHA  ++ GLH +N  LS  I  C    H +   YA  +F     
Sbjct: 37  LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQE 96

Query: 73  PNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCEYRLGQCV 132
           PN+ ++N++ + H+LS     +L L+  M +  ++P+ YTF  +LKSCA    ++ GQ +
Sbjct: 97  PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 156

Query: 133 ISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPH------------------- 192
              VL+ G      +   ++ +YV   R+EDA KVFD+ PH                   
Sbjct: 157 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRXXXXXXXXXXXXXXXXXX 216

Query: 193 ------------------------------------------------------------ 252
                                                                       
Sbjct: 217 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTNVRPDESTMVTVVSAC 276

Query: 253 ---------RDVVVW-------------NLMIRGFCKMGNVDFGLCXXXXXXXXXXXXXX 312
                    R V +W             N +I  + K G ++                  
Sbjct: 277 AQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWN 336

Query: 313 XXXXXXXXXXXXXXXXXXXXXXXXXXFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSK 372
                                       P++VT++++LP C+ LGA+D+G+ IH +   +
Sbjct: 337 TLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKR 396

Query: 373 -GDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNGKGEFAID 432
              + + +++  SL+D Y KCG+ E A+ +F  +  KS+ SWN MI GFA++G+ + + D
Sbjct: 397 LKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFD 456

Query: 433 LFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLL 483
           LF  M +  ++P+D TFV +L+AC HSG+L+ GR IF +M + Y++ PKLEH+GCM+DLL
Sbjct: 457 LFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLL 516

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008453700.11.3e-23791.94PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis melo][more]
XP_004144815.11.1e-23491.32PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis sativu... [more]
XP_022156351.15.8e-21786.16pentatricopeptide repeat-containing protein At1g09190 [Momordica charantia][more]
XP_023538021.13.3e-21285.74pentatricopeptide repeat-containing protein At1g09190 [Cucurbita pepo subsp. pep... [more]
XP_022965696.13.8e-20884.09pentatricopeptide repeat-containing protein At1g09190 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3BXN7|A0A1S3BXN7_CUCME8.8e-23891.94pentatricopeptide repeat-containing protein At1g09190 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A0A0LJW6|A0A0A0LJW6_CUCSA7.0e-23591.32Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G033910 PE=4 SV=1[more]
tr|A0A2N9IWV2|A0A2N9IWV2_FAGSY1.4e-17168.19Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56717 PE=4 SV=1[more]
tr|A0A2I4EMS6|A0A2I4EMS6_9ROSI1.6e-17068.81pentatricopeptide repeat-containing protein At1g09190 OS=Juglans regia OX=51240 ... [more]
tr|A0A2P5EVF3|A0A2P5EVF3_9ROSA1.8e-16667.36Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_146390 PE=4 ... [more]
Match NameE-valueIdentityDescription
sp|O80488|PPR23_ARATH9.7e-13959.96Pentatricopeptide repeat-containing protein At1g09190 OS=Arabidopsis thaliana OX... [more]
sp|Q9SIL5|PP165_ARATH4.0e-8441.60Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX... [more]
sp|Q9FMA1|PP433_ARATH8.9e-8439.83Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX... [more]
sp|Q9FI80|PP425_ARATH3.6e-7740.17Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
sp|Q9LN01|PPR21_ARATH1.8e-7629.91Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT1G09190.15.4e-14059.96Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G20540.12.2e-8541.60mitochondrial editing factor 21[more]
AT5G56310.14.9e-8539.83Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G48910.12.0e-7840.17Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G08070.11.0e-7729.91Tetratricopeptide repeat (TPR)-like superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0080156 mitochondrial mRNA modification
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G040850.1Cla97C02G040850.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 151..172
e-value: 0.2
score: 11.9
coord: 174..202
e-value: 1.2E-6
score: 28.2
coord: 380..403
e-value: 0.13
score: 12.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 304..352
e-value: 7.4E-11
score: 42.0
coord: 203..249
e-value: 2.9E-10
score: 40.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 205..239
e-value: 1.6E-10
score: 38.5
coord: 174..204
e-value: 2.0E-7
score: 28.8
coord: 307..340
e-value: 2.4E-4
score: 19.0
coord: 279..305
e-value: 5.6E-4
score: 17.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 274..304
score: 7.41
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 408..438
score: 5.788
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 203..237
score: 12.562
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 71..105
score: 7.991
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..370
score: 7.991
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 238..272
score: 5.645
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 172..202
score: 10.852
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 106..140
score: 6.566
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 305..339
score: 10.052
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 442..476
score: 7.552
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 376..406
score: 6.149
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 143..275
e-value: 2.8E-31
score: 111.2
coord: 7..142
e-value: 1.0E-12
score: 50.2
coord: 276..370
e-value: 3.3E-23
score: 84.7
coord: 371..482
e-value: 5.0E-13
score: 51.3
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..482
NoneNo IPR availablePANTHERPTHR24015:SF748SUBFAMILY NOT NAMEDcoord: 1..482

The following gene(s) are paralogous to this gene:

None