Cla97C02G026810 (gene) Watermelon (97103) v2

NameCla97C02G026810
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat
LocationCla97Chr02 : 558397 .. 559875 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCTCGTACTCATAGCACTTCTCGTTTCTCTTTCATTCTCAGAAGCTGTGCTCATCTTTATGCAATTGCCCAAGCAAAGCAAGCCCACGCTCAAATCCTCATCCATGGCTTCCTTCCGCATTTAACTCTTCTAACAGATCTTCTTTTGGTGTATTGCAAATGTGGGTTTCTTCATGACGCCCGCAAGGTGTTTGACAAAATGACTCACAGAAACATGCACTCCTGGAATATCTTGATTGCTTCTTATGTTCATAATTCTTTGTACTTTGATGCAATAAATGTGTTTAATGAGTTTACGGACCTTGGTTTTCTGCCTGACCACTATACTTTGCCTCAGATGCTTAAGGCTAGTGTTGGTATTGGGGATGTTTATTTGGGGAAGAGACTTCATTGTTGGACAATTAGGCTGGGATTTGAAGGTTACGTTGTTGTAGACAGTACGGTTTTAGACTTTTACGCTAAATATGGGATTGTGGGTGATGCTAAGAAGGTGTTTGATAATATGATCTTTAAAGATACAATTTCTTGGAACTCAATGATTTCTGGGTATGGGAGGGCTGGGCTTTATGAAGATGCATTGGATTGTTTCAAGCGCATGCTCTTGGAAGGAGTGAAAATGGATTTTATGACAATTCCTAGCGTTTTGAATGCTTGTGGAGGGGAAGGAGATTTGAGAAAAGGCAAAGAGATTCATTGCTTAGTTTTGAAGAGTATGGTATTTGCTGCTGATGTTGCAATTGGGAACTCATTGATTGATATGTATGCAAAGTGTGGAAGCCTGCTTGATTCTGAAAAGGTATTTTGGAATATGAGCAGCTTGAATATTGTTACATGGACCACAATGATATCTTGTTATGGGGCTCATGGGAAAGGAGAGAAATCCTTGGTCCTGTTTAACAAAATGAAGGATTGTGGAATCCAACCCAATTCTGTAACGCTGACGGCCATTTTGGCTAGTTGCAGTCATGCAGGTTACATCAATGAAGGTTGGAGAATTTTCCAGTCTATTGTTTCAGATTATGAGATTGAACTGACGATAGAACATTATGCTTGTGTTGTGGATCTTCTGAGTCGCCTCGGCTTTCTGCAGGAAGCGTTTCTGTTGATAAGAAATATGAAGCTCACCGCTGTTCCAAGCATTTGGGGTGCTCTGCTTTCTGGGTGTATGATCCACAGGAACCTTGAGATTGGAGAAATTGCAGCCAACCAGCTTTTCGAGTTGGAACCTACAAATCCAAGCAATTTTATAGCCTTGATTAGTATATATGAATCTCTTGGTATATCGCATGGTGCTTCGCTGATTAGAGAGAAGATGAGAGACCTTGGTTTGACGAAAATCCCTGGCTGCAGCTGCATAACCATTGATGGAGTTGTGCATAAATTCTACGGAGGGGACAATTCTCACCCTTTGGCTTTGAGGATATTTGAAACATTAAATGCAGTGAGACAAGCAACTGTAAACTGCGAAACAACTTAG

mRNA sequence

ATGTCTCGTACTCATAGCACTTCTCGTTTCTCTTTCATTCTCAGAAGCTGTGCTCATCTTTATGCAATTGCCCAAGCAAAGCAAGCCCACGCTCAAATCCTCATCCATGGCTTCCTTCCGCATTTAACTCTTCTAACAGATCTTCTTTTGGTGTATTGCAAATGTGGGTTTCTTCATGACGCCCGCAAGGTGTTTGACAAAATGACTCACAGAAACATGCACTCCTGGAATATCTTGATTGCTTCTTATGTTCATAATTCTTTGTACTTTGATGCAATAAATGTGTTTAATGAGTTTACGGACCTTGGTTTTCTGCCTGACCACTATACTTTGCCTCAGATGCTTAAGGCTAGTGTTGGTATTGGGGATGTTTATTTGGGGAAGAGACTTCATTGTTGGACAATTAGGCTGGGATTTGAAGGTTACGTTGTTGTAGACAGTACGGTTTTAGACTTTTACGCTAAATATGGGATTGTGGGTGATGCTAAGAAGGTGTTTGATAATATGATCTTTAAAGATACAATTTCTTGGAACTCAATGATTTCTGGGTATGGGAGGGCTGGGCTTTATGAAGATGCATTGGATTGTTTCAAGCGCATGCTCTTGGAAGGAGTGAAAATGGATTTTATGACAATTCCTAGCGTTTTGAATGCTTGTGGAGGGGAAGGAGATTTGAGAAAAGGCAAAGAGATTCATTGCTTAGTTTTGAAGAGTATGGTATTTGCTGCTGATGTTGCAATTGGGAACTCATTGATTGATATGTATGCAAAGTGTGGAAGCCTGCTTGATTCTGAAAAGGTATTTTGGAATATGAGCAGCTTGAATATTGTTACATGGACCACAATGATATCTTGTTATGGGGCTCATGGGAAAGGAGAGAAATCCTTGGTCCTGTTTAACAAAATGAAGGATTGTGGAATCCAACCCAATTCTGTAACGCTGACGGCCATTTTGGCTAGTTGCAGTCATGCAGGTTACATCAATGAAGGTTGGAGAATTTTCCAGTCTATTGTTTCAGATTATGAGATTGAACTGACGATAGAACATTATGCTTGTGTTGTGGATCTTCTGAGTCGCCTCGGCTTTCTGCAGGAAGCGTTTCTGTTGATAAGAAATATGAAGCTCACCGCTGTTCCAAGCATTTGGGGTGCTCTGCTTTCTGGGTGTATGATCCACAGGAACCTTGAGATTGGAGAAATTGCAGCCAACCAGCTTTTCGAGTTGGAACCTACAAATCCAAGCAATTTTATAGCCTTGATTAGTATATATGAATCTCTTGGTATATCGCATGGTGCTTCGCTGATTAGAGAGAAGATGAGAGACCTTGGTTTGACGAAAATCCCTGGCTGCAGCTGCATAACCATTGATGGAGTTGTGCATAAATTCTACGGAGGGGACAATTCTCACCCTTTGGCTTTGAGGATATTTGAAACATTAAATGCAGTGAGACAAGCAACTGTAAACTGCGAAACAACTTAG

Coding sequence (CDS)

ATGTCTCGTACTCATAGCACTTCTCGTTTCTCTTTCATTCTCAGAAGCTGTGCTCATCTTTATGCAATTGCCCAAGCAAAGCAAGCCCACGCTCAAATCCTCATCCATGGCTTCCTTCCGCATTTAACTCTTCTAACAGATCTTCTTTTGGTGTATTGCAAATGTGGGTTTCTTCATGACGCCCGCAAGGTGTTTGACAAAATGACTCACAGAAACATGCACTCCTGGAATATCTTGATTGCTTCTTATGTTCATAATTCTTTGTACTTTGATGCAATAAATGTGTTTAATGAGTTTACGGACCTTGGTTTTCTGCCTGACCACTATACTTTGCCTCAGATGCTTAAGGCTAGTGTTGGTATTGGGGATGTTTATTTGGGGAAGAGACTTCATTGTTGGACAATTAGGCTGGGATTTGAAGGTTACGTTGTTGTAGACAGTACGGTTTTAGACTTTTACGCTAAATATGGGATTGTGGGTGATGCTAAGAAGGTGTTTGATAATATGATCTTTAAAGATACAATTTCTTGGAACTCAATGATTTCTGGGTATGGGAGGGCTGGGCTTTATGAAGATGCATTGGATTGTTTCAAGCGCATGCTCTTGGAAGGAGTGAAAATGGATTTTATGACAATTCCTAGCGTTTTGAATGCTTGTGGAGGGGAAGGAGATTTGAGAAAAGGCAAAGAGATTCATTGCTTAGTTTTGAAGAGTATGGTATTTGCTGCTGATGTTGCAATTGGGAACTCATTGATTGATATGTATGCAAAGTGTGGAAGCCTGCTTGATTCTGAAAAGGTATTTTGGAATATGAGCAGCTTGAATATTGTTACATGGACCACAATGATATCTTGTTATGGGGCTCATGGGAAAGGAGAGAAATCCTTGGTCCTGTTTAACAAAATGAAGGATTGTGGAATCCAACCCAATTCTGTAACGCTGACGGCCATTTTGGCTAGTTGCAGTCATGCAGGTTACATCAATGAAGGTTGGAGAATTTTCCAGTCTATTGTTTCAGATTATGAGATTGAACTGACGATAGAACATTATGCTTGTGTTGTGGATCTTCTGAGTCGCCTCGGCTTTCTGCAGGAAGCGTTTCTGTTGATAAGAAATATGAAGCTCACCGCTGTTCCAAGCATTTGGGGTGCTCTGCTTTCTGGGTGTATGATCCACAGGAACCTTGAGATTGGAGAAATTGCAGCCAACCAGCTTTTCGAGTTGGAACCTACAAATCCAAGCAATTTTATAGCCTTGATTAGTATATATGAATCTCTTGGTATATCGCATGGTGCTTCGCTGATTAGAGAGAAGATGAGAGACCTTGGTTTGACGAAAATCCCTGGCTGCAGCTGCATAACCATTGATGGAGTTGTGCATAAATTCTACGGAGGGGACAATTCTCACCCTTTGGCTTTGAGGATATTTGAAACATTAAATGCAGTGAGACAAGCAACTGTAAACTGCGAAACAACTTAG

Protein sequence

MSRTHSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVRQATVNCETT
BLAST of Cla97C02G026810 vs. NCBI nr
Match: XP_008460899.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucumis melo])

HSP 1 Score: 922.9 bits (2384), Expect = 4.5e-265
Identity = 441/492 (89.63%), Postives = 464/492 (94.31%), Query Frame = 0

Query: 1   MSRTHSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60
           MS +HSTSRFSF+LRSCAH YAIAQAKQ HAQILIHGFLPHLTL+TDLLLVYCKCGFLHD
Sbjct: 1   MSLSHSTSRFSFLLRSCAHSYAIAQAKQTHAQILIHGFLPHLTLITDLLLVYCKCGFLHD 60

Query: 61  ARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVG 120
           AR VFDKMTHRNMHSWNILIASYVHNSLYFDA+NVFNEF D GFLPDHYTLPQM KASVG
Sbjct: 61  ARNVFDKMTHRNMHSWNILIASYVHNSLYFDALNVFNEFRDHGFLPDHYTLPQMFKASVG 120

Query: 121 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSM 180
            GDVYLGKRLHCWTI+LGF GYVVVDSTVLDFYAKYGIVGDA+KVFD+MIFKDT+SWNSM
Sbjct: 121 TGDVYLGKRLHCWTIKLGFVGYVVVDSTVLDFYAKYGIVGDARKVFDDMIFKDTVSWNSM 180

Query: 181 ISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMV 240
           ISGYGRAG+Y DALDCFKRMLLEG  MDFMTIPSVLNACGGEGDLRKGKEIHCLVLKS V
Sbjct: 181 ISGYGRAGVYRDALDCFKRMLLEGANMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSPV 240

Query: 241 FAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFN 300
            AADVA+GNSLIDMY+KCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFN
Sbjct: 241 LAADVAVGNSLIDMYSKCGSLLNSEKVFWNMSRLNIVTWTTMISCYGAHGKGEKSLVLFN 300

Query: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRL 360
           KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSD ++E T+EHYACVVDLLSR 
Sbjct: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDDKVEPTVEHYACVVDLLSRF 360

Query: 361 GFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALI 420
           GFL+EAFLLIRNMK+ A  SIWGALLSGCMIHRNLE GEIAANQLF+LEPTNPSNFIALI
Sbjct: 361 GFLKEAFLLIRNMKVKAAASIWGALLSGCMIHRNLEFGEIAANQLFKLEPTNPSNFIALI 420

Query: 421 SIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 480
           SIYESLG+ HG S+ REKMR LGLTK+PGCSCITIDGVVHKFYGG NSHPLALRIFETLN
Sbjct: 421 SIYESLGMLHGVSVTREKMRGLGLTKVPGCSCITIDGVVHKFYGGGNSHPLALRIFETLN 480

Query: 481 AVRQATVNCETT 493
           +VRQA V+CETT
Sbjct: 481 SVRQAAVSCETT 492

BLAST of Cla97C02G026810 vs. NCBI nr
Match: XP_011649399.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Cucumis sativus])

HSP 1 Score: 910.2 bits (2351), Expect = 3.0e-261
Identity = 435/492 (88.41%), Postives = 459/492 (93.29%), Query Frame = 0

Query: 1   MSRTHSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60
           MS +HSTSRFSF+LRSCA  YAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD
Sbjct: 1   MSLSHSTSRFSFVLRSCAQSYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60

Query: 61  ARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVG 120
           AR VFDKM HRNMHSWNILIASYVHNS YFDA+NVFNEF  LGFLPDHYTLPQM KASVG
Sbjct: 61  ARNVFDKMAHRNMHSWNILIASYVHNSFYFDALNVFNEFRHLGFLPDHYTLPQMFKASVG 120

Query: 121 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSM 180
           IGD YLGKRLHCWTI+LGF GYVVV STVLDFYAK GIVGDA+KVFD+MIFKDT+SWNSM
Sbjct: 121 IGDAYLGKRLHCWTIKLGFVGYVVVGSTVLDFYAKCGIVGDARKVFDDMIFKDTVSWNSM 180

Query: 181 ISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMV 240
           ISGYGRAG+Y DALDCFKRMLLEG  MDFMTIPSVLNACGGEGDLRKGKEIHCLVLKS V
Sbjct: 181 ISGYGRAGVYMDALDCFKRMLLEGANMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSPV 240

Query: 241 FAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFN 300
           FAADVA+GNSLIDMY+KCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFN
Sbjct: 241 FAADVAVGNSLIDMYSKCGSLLNSEKVFWNMSRLNIVTWTTMISCYGAHGKGEKSLVLFN 300

Query: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRL 360
           KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSI+SD ++E T+EHYAC VDLLSR 
Sbjct: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIISDNKVEPTVEHYACAVDLLSRF 360

Query: 361 GFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALI 420
           GFL+EAFLLIRNMK+ A  SIWGALLSGCMIHRNLE GEIAANQLF+LEPTNPSNFIALI
Sbjct: 361 GFLKEAFLLIRNMKVKAAASIWGALLSGCMIHRNLEFGEIAANQLFKLEPTNPSNFIALI 420

Query: 421 SIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 480
           SIYESLG++HG SL REKMRDLGLTK+PGCSCI IDG+VHKFYGG NSHPLALRI E LN
Sbjct: 421 SIYESLGMTHGVSLTREKMRDLGLTKVPGCSCIIIDGIVHKFYGGGNSHPLALRISEPLN 480

Query: 481 AVRQATVNCETT 493
           +VRQA V+CETT
Sbjct: 481 SVRQAAVSCETT 492

BLAST of Cla97C02G026810 vs. NCBI nr
Match: KGN62149.1 (hypothetical protein Csa_2G302160 [Cucumis sativus])

HSP 1 Score: 899.4 bits (2323), Expect = 5.3e-258
Identity = 430/485 (88.66%), Postives = 453/485 (93.40%), Query Frame = 0

Query: 1   MSRTHSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60
           MS +HSTSRFSF+LRSCA  YAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD
Sbjct: 1   MSLSHSTSRFSFVLRSCAQSYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60

Query: 61  ARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVG 120
           AR VFDKM HRNMHSWNILIASYVHNS YFDA+NVFNEF  LGFLPDHYTLPQM KASVG
Sbjct: 61  ARNVFDKMAHRNMHSWNILIASYVHNSFYFDALNVFNEFRHLGFLPDHYTLPQMFKASVG 120

Query: 121 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSM 180
           IGD YLGKRLHCWTI+LGF GYVVV STVLDFYAK GIVGDA+KVFD+MIFKDT+SWNSM
Sbjct: 121 IGDAYLGKRLHCWTIKLGFVGYVVVGSTVLDFYAKCGIVGDARKVFDDMIFKDTVSWNSM 180

Query: 181 ISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMV 240
           ISGYGRAG+Y DALDCFKRMLLEG  MDFMTIPSVLNACGGEGDLRKGKEIHCLVLKS V
Sbjct: 181 ISGYGRAGVYMDALDCFKRMLLEGANMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSPV 240

Query: 241 FAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFN 300
           FAADVA+GNSLIDMY+KCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFN
Sbjct: 241 FAADVAVGNSLIDMYSKCGSLLNSEKVFWNMSRLNIVTWTTMISCYGAHGKGEKSLVLFN 300

Query: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRL 360
           KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSI+SD ++E T+EHYAC VDLLSR 
Sbjct: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIISDNKVEPTVEHYACAVDLLSRF 360

Query: 361 GFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALI 420
           GFL+EAFLLIRNMK+ A  SIWGALLSGCMIHRNLE GEIAANQLF+LEPTNPSNFIALI
Sbjct: 361 GFLKEAFLLIRNMKVKAAASIWGALLSGCMIHRNLEFGEIAANQLFKLEPTNPSNFIALI 420

Query: 421 SIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 480
           SIYESLG++HG SL REKMRDLGLTK+PGCSCI IDG+VHKFYGG NSHPLALRI E LN
Sbjct: 421 SIYESLGMTHGVSLTREKMRDLGLTKVPGCSCIIIDGIVHKFYGGGNSHPLALRISEPLN 480

Query: 481 AVRQA 486
           +VRQA
Sbjct: 481 SVRQA 485

BLAST of Cla97C02G026810 vs. NCBI nr
Match: XP_023512346.1 (pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 888.3 bits (2294), Expect = 1.2e-254
Identity = 425/486 (87.45%), Postives = 452/486 (93.00%), Query Frame = 0

Query: 4   THSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARK 63
           + STSRFS ILR+CA L A+ QAKQAHAQILIHG +PHLTL+TDLLLVYCKCG LHDARK
Sbjct: 2   SQSTSRFSLILRNCARLNAVVQAKQAHAQILIHGLIPHLTLVTDLLLVYCKCGVLHDARK 61

Query: 64  VFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGD 123
           VFDKMTHRNMHSWNILIASYVH+SL+FDAINVFNEF  LGFLPDHYTLPQM K SVGIGD
Sbjct: 62  VFDKMTHRNMHSWNILIASYVHSSLHFDAINVFNEFRGLGFLPDHYTLPQMFKVSVGIGD 121

Query: 124 VYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISG 183
           VY+GKRLHCWTIRLGFEGYVVV STVLDFYAKYG+VGDAKKVFDNMI KDTISWNSMISG
Sbjct: 122 VYMGKRLHCWTIRLGFEGYVVVASTVLDFYAKYGVVGDAKKVFDNMILKDTISWNSMISG 181

Query: 184 YGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAA 243
           YGRAGLY DALDCFKRML EGV+MD MTIPSVLNACGGEGDLRKGKEIHCLVLKS +F A
Sbjct: 182 YGRAGLYGDALDCFKRMLFEGVRMDIMTIPSVLNACGGEGDLRKGKEIHCLVLKSQLFVA 241

Query: 244 DVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMK 303
           DVAIGNSLIDMYAKCGSL D+E+VF NMSS+NIVTWTTMISCYGAHGKGEKSL LFNKMK
Sbjct: 242 DVAIGNSLIDMYAKCGSLFDAEEVFRNMSSMNIVTWTTMISCYGAHGKGEKSLALFNKMK 301

Query: 304 DCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFL 363
            CGIQPNSVTLTAILASCSHAGYINEGWRIF+S VSDYE+ELT+EHYACVVDLLSR GFL
Sbjct: 302 ACGIQPNSVTLTAILASCSHAGYINEGWRIFKSFVSDYEVELTVEHYACVVDLLSRFGFL 361

Query: 364 QEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIY 423
           QEAF+LIRNMK TA  SIWGALLSGCM+H+NLEIGEIAANQLF+LEP NPSNFIALI IY
Sbjct: 362 QEAFMLIRNMKHTAAASIWGALLSGCMMHKNLEIGEIAANQLFKLEPMNPSNFIALIGIY 421

Query: 424 ESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVR 483
           ESLG+SHG SL REKMRDLGLTK+PGCS ITI+GVVHKFYGGD SHPL LRIFETL+AV+
Sbjct: 422 ESLGMSHGDSLTREKMRDLGLTKLPGCSWITIEGVVHKFYGGDKSHPLGLRIFETLSAVK 481

Query: 484 QATVNC 490
           QA+VNC
Sbjct: 482 QASVNC 487

BLAST of Cla97C02G026810 vs. NCBI nr
Match: XP_022971622.1 (pentatricopeptide repeat-containing protein At5g04780, mitochondrial-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 885.9 bits (2288), Expect = 6.1e-254
Identity = 424/489 (86.71%), Postives = 454/489 (92.84%), Query Frame = 0

Query: 4   THSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARK 63
           + STSRFSFILR+CA L A+AQAKQAHAQI+IHG +PHLTL+TD+LLVYCKCG +HDARK
Sbjct: 2   SQSTSRFSFILRNCARLNAVAQAKQAHAQIVIHGLIPHLTLVTDILLVYCKCGLVHDARK 61

Query: 64  VFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGD 123
           VFDKMTHRNMHSWNILIASYVH+SL+FDAINV NEF  LGFLPDHYTLPQM K SVGIGD
Sbjct: 62  VFDKMTHRNMHSWNILIASYVHSSLHFDAINVLNEFRGLGFLPDHYTLPQMFKVSVGIGD 121

Query: 124 VYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISG 183
           VY+GKRLHCWTI+LGFEGYVVV STVLDFYAKYG+VGDAKKVFDNMI KDTISWNSMISG
Sbjct: 122 VYMGKRLHCWTIKLGFEGYVVVASTVLDFYAKYGVVGDAKKVFDNMILKDTISWNSMISG 181

Query: 184 YGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAA 243
           YGRAGLY DALDCFKRML EGVKMD MTIPSVLNACGGEGDLRKGKEIHCLVLKS +F A
Sbjct: 182 YGRAGLYGDALDCFKRMLFEGVKMDIMTIPSVLNACGGEGDLRKGKEIHCLVLKSQLFVA 241

Query: 244 DVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMK 303
           DVAIGNSLIDMYAKCGSL D+E VF NMSS+NIVTWTTMISCYGAHGKGEKSL LFNKMK
Sbjct: 242 DVAIGNSLIDMYAKCGSLFDAENVFRNMSSVNIVTWTTMISCYGAHGKGEKSLALFNKMK 301

Query: 304 DCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFL 363
           DCGIQPNSVTLTAILASCSHAGYI+EGWRIF SIVSDYE+ELT+EHYACVVDLLSR GFL
Sbjct: 302 DCGIQPNSVTLTAILASCSHAGYISEGWRIFNSIVSDYEVELTVEHYACVVDLLSRFGFL 361

Query: 364 QEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIY 423
           QEAF+LIR+MK TA  SIWGALLSGCM+H+NLEIGEIAANQLF+LEPTNPSNFIALI IY
Sbjct: 362 QEAFMLIRSMKHTAAASIWGALLSGCMMHKNLEIGEIAANQLFKLEPTNPSNFIALIGIY 421

Query: 424 ESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVR 483
           ESLG+    SL REKMRDLGLTK+PGCS ITI+GVVHKFYGGD SHPL LRIFETL+AV+
Sbjct: 422 ESLGMLDCVSLTREKMRDLGLTKLPGCSWITIEGVVHKFYGGDKSHPLGLRIFETLSAVK 481

Query: 484 QATVNCETT 493
           QA+VNCETT
Sbjct: 482 QASVNCETT 490

BLAST of Cla97C02G026810 vs. TrEMBL
Match: tr|A0A1S3CDI3|A0A1S3CDI3_CUCME (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103499640 PE=4 SV=1)

HSP 1 Score: 922.9 bits (2384), Expect = 3.0e-265
Identity = 441/492 (89.63%), Postives = 464/492 (94.31%), Query Frame = 0

Query: 1   MSRTHSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60
           MS +HSTSRFSF+LRSCAH YAIAQAKQ HAQILIHGFLPHLTL+TDLLLVYCKCGFLHD
Sbjct: 1   MSLSHSTSRFSFLLRSCAHSYAIAQAKQTHAQILIHGFLPHLTLITDLLLVYCKCGFLHD 60

Query: 61  ARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVG 120
           AR VFDKMTHRNMHSWNILIASYVHNSLYFDA+NVFNEF D GFLPDHYTLPQM KASVG
Sbjct: 61  ARNVFDKMTHRNMHSWNILIASYVHNSLYFDALNVFNEFRDHGFLPDHYTLPQMFKASVG 120

Query: 121 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSM 180
            GDVYLGKRLHCWTI+LGF GYVVVDSTVLDFYAKYGIVGDA+KVFD+MIFKDT+SWNSM
Sbjct: 121 TGDVYLGKRLHCWTIKLGFVGYVVVDSTVLDFYAKYGIVGDARKVFDDMIFKDTVSWNSM 180

Query: 181 ISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMV 240
           ISGYGRAG+Y DALDCFKRMLLEG  MDFMTIPSVLNACGGEGDLRKGKEIHCLVLKS V
Sbjct: 181 ISGYGRAGVYRDALDCFKRMLLEGANMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSPV 240

Query: 241 FAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFN 300
            AADVA+GNSLIDMY+KCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFN
Sbjct: 241 LAADVAVGNSLIDMYSKCGSLLNSEKVFWNMSRLNIVTWTTMISCYGAHGKGEKSLVLFN 300

Query: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRL 360
           KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSD ++E T+EHYACVVDLLSR 
Sbjct: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDDKVEPTVEHYACVVDLLSRF 360

Query: 361 GFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALI 420
           GFL+EAFLLIRNMK+ A  SIWGALLSGCMIHRNLE GEIAANQLF+LEPTNPSNFIALI
Sbjct: 361 GFLKEAFLLIRNMKVKAAASIWGALLSGCMIHRNLEFGEIAANQLFKLEPTNPSNFIALI 420

Query: 421 SIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 480
           SIYESLG+ HG S+ REKMR LGLTK+PGCSCITIDGVVHKFYGG NSHPLALRIFETLN
Sbjct: 421 SIYESLGMLHGVSVTREKMRGLGLTKVPGCSCITIDGVVHKFYGGGNSHPLALRIFETLN 480

Query: 481 AVRQATVNCETT 493
           +VRQA V+CETT
Sbjct: 481 SVRQAAVSCETT 492

BLAST of Cla97C02G026810 vs. TrEMBL
Match: tr|A0A0A0LM84|A0A0A0LM84_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G302160 PE=4 SV=1)

HSP 1 Score: 899.4 bits (2323), Expect = 3.5e-258
Identity = 430/485 (88.66%), Postives = 453/485 (93.40%), Query Frame = 0

Query: 1   MSRTHSTSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60
           MS +HSTSRFSF+LRSCA  YAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD
Sbjct: 1   MSLSHSTSRFSFVLRSCAQSYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHD 60

Query: 61  ARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVG 120
           AR VFDKM HRNMHSWNILIASYVHNS YFDA+NVFNEF  LGFLPDHYTLPQM KASVG
Sbjct: 61  ARNVFDKMAHRNMHSWNILIASYVHNSFYFDALNVFNEFRHLGFLPDHYTLPQMFKASVG 120

Query: 121 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSM 180
           IGD YLGKRLHCWTI+LGF GYVVV STVLDFYAK GIVGDA+KVFD+MIFKDT+SWNSM
Sbjct: 121 IGDAYLGKRLHCWTIKLGFVGYVVVGSTVLDFYAKCGIVGDARKVFDDMIFKDTVSWNSM 180

Query: 181 ISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMV 240
           ISGYGRAG+Y DALDCFKRMLLEG  MDFMTIPSVLNACGGEGDLRKGKEIHCLVLKS V
Sbjct: 181 ISGYGRAGVYMDALDCFKRMLLEGANMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSPV 240

Query: 241 FAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFN 300
           FAADVA+GNSLIDMY+KCGSLL+SEKVFWNMS LNIVTWTTMISCYGAHGKGEKSLVLFN
Sbjct: 241 FAADVAVGNSLIDMYSKCGSLLNSEKVFWNMSRLNIVTWTTMISCYGAHGKGEKSLVLFN 300

Query: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRL 360
           KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSI+SD ++E T+EHYAC VDLLSR 
Sbjct: 301 KMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIISDNKVEPTVEHYACAVDLLSRF 360

Query: 361 GFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALI 420
           GFL+EAFLLIRNMK+ A  SIWGALLSGCMIHRNLE GEIAANQLF+LEPTNPSNFIALI
Sbjct: 361 GFLKEAFLLIRNMKVKAAASIWGALLSGCMIHRNLEFGEIAANQLFKLEPTNPSNFIALI 420

Query: 421 SIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 480
           SIYESLG++HG SL REKMRDLGLTK+PGCSCI IDG+VHKFYGG NSHPLALRI E LN
Sbjct: 421 SIYESLGMTHGVSLTREKMRDLGLTKVPGCSCIIIDGIVHKFYGGGNSHPLALRISEPLN 480

Query: 481 AVRQA 486
           +VRQA
Sbjct: 481 SVRQA 485

BLAST of Cla97C02G026810 vs. TrEMBL
Match: tr|A0A2I4EAF5|A0A2I4EAF5_9ROSI (pentatricopeptide repeat-containing protein At5g04780-like OS=Juglans regia OX=51240 GN=LOC108987811 PE=4 SV=1)

HSP 1 Score: 641.3 bits (1653), Expect = 1.7e-180
Identity = 312/477 (65.41%), Postives = 368/477 (77.15%), Query Frame = 0

Query: 11  SFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDKMTH 70
           S ++R  A L AI+QAKQ HAQIL HGFL ++TL TDLLL Y KCGFL DAR+VFD M+ 
Sbjct: 10  SSLVRKSALLPAISQAKQTHAQILFHGFLTNVTLQTDLLLAYSKCGFLQDARRVFDNMSE 69

Query: 71  RNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYLGKRL 130
           RNMHSWNIL+ASY +N LY DAI+VF EF  +   PDHYT P + KA  GIGD YLGK L
Sbjct: 70  RNMHSWNILLASYANNFLYSDAIHVFLEFLKMDIRPDHYTFPPVFKACAGIGDAYLGKML 129

Query: 131 HCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAGLY 190
           H W +RLGFE Y+VV S++LDF+ K G + DA+ VF NM  KD+  WNSMISG GRAG Y
Sbjct: 130 HGWVVRLGFEQYIVVGSSILDFHLKCGNLVDARCVFSNMSSKDSAVWNSMISGSGRAGFY 189

Query: 191 EDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVAIGNS 250
            DAL+ F+ ML EG KMD MTIPS+LNACGGEGDL KGKEIH  V+KS +F  DVAIGNS
Sbjct: 190 ADALNYFRGMLNEGKKMDSMTIPSILNACGGEGDLIKGKEIHGQVVKSFIFDGDVAIGNS 249

Query: 251 LIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPN 310
           LIDMYAKCG L DSEKVF N+  LN++TWTTMISC G HGKGE SL LF KM+DCG +PN
Sbjct: 250 LIDMYAKCGCLHDSEKVFRNVRELNLITWTTMISCCGVHGKGEDSLALFKKMRDCGFKPN 309

Query: 311 SVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEAFLLI 370
            VTLTA+L+SCSH+G I++G RIF SI  DY +E ++EHYAC+VDLL R G+L+EA  L+
Sbjct: 310 CVTLTAVLSSCSHSGLIDQGRRIFDSISFDYGLEPSVEHYACMVDLLGRFGYLEEALALV 369

Query: 371 RNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESLGISH 430
           +NMK     S+WGALL+GCM+H+N+EIGEIAA QLFELEPTN SN+IAL SIY+SL I  
Sbjct: 370 KNMKPAPTASVWGALLAGCMMHKNVEIGEIAALQLFELEPTNCSNYIALCSIYDSLSIRD 429

Query: 431 GASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVRQATV 488
           G SLIR KMR LGL K PGCS ITI G + KFY GD+SHP +L + E  N + QA+V
Sbjct: 430 GVSLIRAKMRKLGLVKTPGCSWITIQGTIRKFYQGDHSHPQSLELHEIFNKIIQASV 486

BLAST of Cla97C02G026810 vs. TrEMBL
Match: tr|A0A2P4H960|A0A2P4H960_QUESU (Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_60463 PE=4 SV=1)

HSP 1 Score: 637.5 bits (1643), Expect = 2.5e-179
Identity = 303/475 (63.79%), Postives = 374/475 (78.74%), Query Frame = 0

Query: 11  SFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDKMTH 70
           S +LR+C  L AI+QAKQ HAQIL+ GFL ++TL TDLLL Y KCG L DAR+VFDKM+ 
Sbjct: 10  SSLLRNCVPLSAISQAKQTHAQILVLGFLSNVTLQTDLLLFYSKCGVLQDARQVFDKMSE 69

Query: 71  RNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYLGKRL 130
           RN++SWNI++ASY  NSL+ DA+NVF++F  +G  PDHYT+P + K+ VGIGD +LGK L
Sbjct: 70  RNIYSWNIMLASYAQNSLFLDAMNVFDDFLKMGLRPDHYTMPPVFKSCVGIGDTFLGKML 129

Query: 131 HCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAGLY 190
           H W +RLGFE YVVV S+VLDFY K+G + DAK+VF NM+ +D+  WNSMISG+GRAG Y
Sbjct: 130 HGWVVRLGFEEYVVVGSSVLDFYVKFGNLVDAKRVFSNMLSRDSAFWNSMISGFGRAGFY 189

Query: 191 EDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVAIGNS 250
            DAL+CF+ ML EGV  D MTIPS+LNACG EGDL KGKEIH  V+KS++F  DVAIGNS
Sbjct: 190 VDALNCFRSMLGEGVNKDSMTIPSILNACGREGDLMKGKEIHGQVVKSLIFGGDVAIGNS 249

Query: 251 LIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPN 310
           LIDMYAKCG L DSEKVF NM  LN+VTWTT+ISCYG HGKGE SL LF KMKDCG +PN
Sbjct: 250 LIDMYAKCGCLHDSEKVFRNMRELNLVTWTTLISCYGVHGKGEDSLALFKKMKDCGFKPN 309

Query: 311 SVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEAFLLI 370
            VTLTA+LASCSH+G I++G +IF SI  +Y ++ ++EHYAC+VDLL R G L+EA  L+
Sbjct: 310 CVTLTAVLASCSHSGLIDQGRKIFDSISFEYGLKPSVEHYACMVDLLGRFGHLEEALGLV 369

Query: 371 RNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESLGISH 430
           +NMK  A+ S+WGALL+GC++H+N+EIGE+AA QLFELEP N SN+IAL SIY+SL +  
Sbjct: 370 KNMKSVAIASVWGALLAGCVMHKNVEIGEVAALQLFELEPRNSSNYIALCSIYDSLSVCD 429

Query: 431 GASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVRQA 486
           G S+IR KMR+LGL K PG S +TI G +HKFY GD S P    I E L+ + +A
Sbjct: 430 GVSIIRAKMRNLGLVKTPGYSWMTIAGKIHKFYKGDCSRPQTKMIHEMLDQIIKA 484

BLAST of Cla97C02G026810 vs. TrEMBL
Match: tr|D7U4G1|D7U4G1_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_04s0044g01080 PE=4 SV=1)

HSP 1 Score: 636.3 bits (1640), Expect = 5.5e-179
Identity = 306/477 (64.15%), Postives = 366/477 (76.73%), Query Frame = 0

Query: 6   STSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVF 65
           + S  S ILR C    AI+QAKQ HAQIL+HGF+P++TL TDLLLVY KCG L DARKVF
Sbjct: 5   TASSLSCILRKCVPHSAISQAKQTHAQILVHGFIPNITLQTDLLLVYSKCGVLQDARKVF 64

Query: 66  DKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVY 125
           DKM  RNMHSWNILIASY HN  ++DA+ VF+ F  +GF PDH+TLP + KA  GIGD Y
Sbjct: 65  DKMVERNMHSWNILIASYAHNCFFYDALGVFDSFLKMGFRPDHFTLPPVFKACAGIGDSY 124

Query: 126 LGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYG 185
           LGK LH W IR+GFE YVVV S+VLDFYAK G + DA + F NM ++D++ WN MI G G
Sbjct: 125 LGKMLHSWVIRIGFEEYVVVGSSVLDFYAKCGGLVDAWRCFVNMSWRDSVVWNLMIVGLG 184

Query: 186 RAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADV 245
           +A  + DAL+CF+ ML EGVKMD  T+PS+L+ CGGEGDL KGKEIH  V+K+ +F  +V
Sbjct: 185 KACFFRDALECFRDMLSEGVKMDSRTVPSILSVCGGEGDLMKGKEIHGQVVKNQIFGCEV 244

Query: 246 AIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDC 305
           AIGNSLIDMYAKCG L DSEKVF  MS LN+VTWT+MISCYG HGKG ++L LF KMK C
Sbjct: 245 AIGNSLIDMYAKCGCLHDSEKVFTTMSELNLVTWTSMISCYGVHGKGHEALALFKKMKYC 304

Query: 306 GIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQE 365
           G QPN VT+TAILASCSH+G I +G +IF SI  DY  E + EHYAC+VDLL R G+L+E
Sbjct: 305 GFQPNCVTITAILASCSHSGLIEQGRKIFYSINLDYGFEPSAEHYACMVDLLGRFGYLEE 364

Query: 366 AFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYES 425
           AF LI+NMK  A  S+WGALL+GC++H+N+EIGEIAA+ LFELEP N SN+IAL S+Y+S
Sbjct: 365 AFELIKNMKSAATASVWGALLAGCLMHKNIEIGEIAAHCLFELEPRNSSNYIALCSMYDS 424

Query: 426 LGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAV 483
           LGI  G S  R KMR+LGL K PGCS IT+ G VHKFY  D SHP    I+ETL+ +
Sbjct: 425 LGIWDGVSRTRAKMRELGLVKTPGCSWITVAGRVHKFYQEDLSHPETQMIYETLHGI 481

BLAST of Cla97C02G026810 vs. Swiss-Prot
Match: sp|Q9LW32|PP258_ARATH (Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H34 PE=2 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 4.1e-92
Identity = 186/482 (38.59%), Postives = 284/482 (58.92%), Query Frame = 0

Query: 8   SRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDK 67
           S F   +++C+ L+ I   KQ H Q  + G+   + + + L+++Y  CG L DARKVFD+
Sbjct: 77  SSFPCAIKACSSLFDIFSGKQTHQQAFVFGYQSDIFVSSALIVMYSTCGKLEDARKVFDE 136

Query: 68  MTHRNMHSWNILIASYVHNSLYFDAINVF-------NEFTDLGFLPDHYTLPQMLKASVG 127
           +  RN+ SW  +I  Y  N    DA+++F       N+  D  FL D   L  ++ A   
Sbjct: 137 IPKRNIVSWTSMIRGYDLNGNALDAVSLFKDLLVDENDDDDAMFL-DSMGLVSVISACSR 196

Query: 128 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAK--YGIVGDAKKVFDNMIFKDTISWN 187
           +    L + +H + I+ GF+  V V +T+LD YAK   G V  A+K+FD ++ KD +S+N
Sbjct: 197 VPAKGLTESIHSFVIKRGFDRGVSVGNTLLDAYAKGGEGGVAVARKIFDQIVDKDRVSYN 256

Query: 188 SMISGYGRAGLYEDALDCFKRMLL-EGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLK 247
           S++S Y ++G+  +A + F+R++  + V  + +T+ +VL A    G LR GK IH  V++
Sbjct: 257 SIMSVYAQSGMSNEAFEVFRRLVKNKVVTFNAITLSTVLLAVSHSGALRIGKCIHDQVIR 316

Query: 248 SMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLV 307
            M    DV +G S+IDMY KCG +  + K F  M + N+ +WT MI+ YG HG   K+L 
Sbjct: 317 -MGLEDDVIVGTSIIDMYCKCGRVETARKAFDRMKNKNVRSWTAMIAGYGMHGHAAKALE 376

Query: 308 LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLL 367
           LF  M D G++PN +T  ++LA+CSHAG   EGWR F ++   + +E  +EHY C+VDLL
Sbjct: 377 LFPAMIDSGVRPNYITFVSVLAACSHAGLHVEGWRWFNAMKGRFGVEPGLEHYGCMVDLL 436

Query: 368 SRLGFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFI 427
            R GFLQ+A+ LI+ MK+     IW +LL+ C IH+N+E+ EI+  +LFEL+ +N   ++
Sbjct: 437 GRAGFLQKAYDLIQRMKMKPDSIIWSSLLAACRIHKNVELAEISVARLFELDSSNCGYYM 496

Query: 428 ALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFE 480
            L  IY   G       +R  M++ GL K PG S + ++G VH F  GD  HP   +I+E
Sbjct: 497 LLSHIYADAGRWKDVERVRMIMKNRGLVKPPGFSLLELNGEVHVFLIGDEEHPQREKIYE 556

BLAST of Cla97C02G026810 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 320.5 bits (820), Expect = 3.3e-86
Identity = 176/474 (37.13%), Postives = 272/474 (57.38%), Query Frame = 0

Query: 11  SFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLL-VYCKCGFLHDARKVFDKMT 70
           S +L +C+HL  +   K+ HA  L +G L   + +   L+ +YC C  +   R+VFD M 
Sbjct: 306 SSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMF 365

Query: 71  HRNMHSWNILIASYVHNSLYFDAINVFNEFTD-LGFLPDHYTLPQMLKASVGIGDVYLGK 130
            R +  WN +IA Y  N    +A+ +F    +  G L +  T+  ++ A V  G     +
Sbjct: 366 DRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKE 425

Query: 131 RLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAG 190
            +H + ++ G +    V +T++D Y++ G +  A ++F  M  +D ++WN+MI+GY  + 
Sbjct: 426 AIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSE 485

Query: 191 LYEDALDCFKRML-LE----------GVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLK 250
            +EDAL    +M  LE           +K + +T+ ++L +C     L KGKEIH   +K
Sbjct: 486 HHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIK 545

Query: 251 SMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLV 310
           + + A DVA+G++L+DMYAKCG L  S KVF  +   N++TW  +I  YG HG G++++ 
Sbjct: 546 NNL-ATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAID 605

Query: 311 LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLL 370
           L   M   G++PN VT  ++ A+CSH+G ++EG RIF  +  DY +E + +HYACVVDLL
Sbjct: 606 LLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLL 665

Query: 371 SRLGFLQEAFLLIRNM-KLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNF 430
            R G ++EA+ L+  M +       W +LL    IH NLEIGEIAA  L +LEP   S++
Sbjct: 666 GRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHY 725

Query: 431 IALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHP 471
           + L +IY S G+   A+ +R  M++ G+ K PGCS I     VHKF  GD+SHP
Sbjct: 726 VLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHP 778

BLAST of Cla97C02G026810 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 318.2 bits (814), Expect = 1.7e-85
Identity = 169/473 (35.73%), Postives = 271/473 (57.29%), Query Frame = 0

Query: 13  ILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDKMTHRN 72
           +L +C       +    H+  + HG    L +   L+ +Y + G L D +KVFD+M  R+
Sbjct: 253 LLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRD 312

Query: 73  MHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYLGKRLHC 132
           + SWN +I +Y  N     AI++F E       PD  TL  +      +GD+   + +  
Sbjct: 313 LISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQG 372

Query: 133 WTIRLG-FEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAGLYE 192
           +T+R G F   + + + V+  YAK G+V  A+ VF+ +   D ISWN++ISGY + G   
Sbjct: 373 FTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFAS 432

Query: 193 DALDCFKRMLLEG-VKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVAIGNS 252
           +A++ +  M  EG +  +  T  SVL AC   G LR+G ++H  +LK+ ++  DV +  S
Sbjct: 433 EAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLY-LDVFVVTS 492

Query: 253 LIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPN 312
           L DMY KCG L D+  +F+ +  +N V W T+I+C+G HG GEK+++LF +M D G++P+
Sbjct: 493 LADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPD 552

Query: 313 SVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEAFLLI 372
            +T   +L++CSH+G ++EG   F+ + +DY I  +++HY C+VD+  R G L+ A   I
Sbjct: 553 HITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFI 612

Query: 373 RNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESLGISH 432
           ++M L    SIWGALLS C +H N+++G+IA+  LFE+EP +    + L ++Y S G   
Sbjct: 613 KSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWE 672

Query: 433 GASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVR 484
           G   IR      GL K PG S + +D  V  FY G+ +HP+   ++  L A++
Sbjct: 673 GVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHPMYEEMYRELTALQ 724

BLAST of Cla97C02G026810 vs. Swiss-Prot
Match: sp|Q9LW63|PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 314.7 bits (805), Expect = 1.8e-84
Identity = 154/427 (36.07%), Postives = 245/427 (57.38%), Query Frame = 0

Query: 58  LHDARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKA 117
           +   R+VF+ M  +++ S+N +IA Y  + +Y DA+ +  E       PD +TL  +L  
Sbjct: 192 IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPI 251

Query: 118 SVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISW 177
                DV  GK +H + IR G +  V + S+++D YAK   + D+++VF  +  +D ISW
Sbjct: 252 FSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISW 311

Query: 178 NSMISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLK 237
           NS+++GY + G Y +AL  F++M+   VK   +   SV+ AC     L  GK++H  VL+
Sbjct: 312 NSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLR 371

Query: 238 SMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLV 297
              F +++ I ++L+DMY+KCG++  + K+F  M+ L+ V+WT +I  +  HG G +++ 
Sbjct: 372 G-GFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVS 431

Query: 298 LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLL 357
           LF +MK  G++PN V   A+L +CSH G ++E W  F S+   Y +   +EHYA V DLL
Sbjct: 432 LFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLL 491

Query: 358 SRLGFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFI 417
            R G L+EA+  I  M +    S+W  LLS C +H+NLE+ E  A ++F ++  N   ++
Sbjct: 492 GRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYV 551

Query: 418 ALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFE 477
            + ++Y S G     + +R +MR  GL K P CS I +    H F  GD SHP   +I E
Sbjct: 552 LMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINE 611

Query: 478 TLNAVRQ 485
            L AV +
Sbjct: 612 FLKAVME 617

BLAST of Cla97C02G026810 vs. Swiss-Prot
Match: sp|Q9SI53|PP147_ARATH (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 3.1e-84
Identity = 164/474 (34.60%), Postives = 261/474 (55.06%), Query Frame = 0

Query: 7   TSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFD 66
           ++ +S +++ C    A+ +       +  +G  P + L+  L+ +Y K   L+DA ++FD
Sbjct: 61  SATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFD 120

Query: 67  KMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYL 126
           +M  RN+ SW  +I++Y    ++  A+ +          P+ YT   +L++  G+ DV  
Sbjct: 121 QMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGMSDV-- 180

Query: 127 GKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGR 186
            + LHC  I+ G E  V V S ++D +AK G   DA  VFD M+  D I WNS+I G+ +
Sbjct: 181 -RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQ 240

Query: 187 AGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVA 246
               + AL+ FKRM   G   +  T+ SVL AC G   L  G + H  ++K   +  D+ 
Sbjct: 241 NSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK---YDQDLI 300

Query: 247 IGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCG 306
           + N+L+DMY KCGSL D+ +VF  M   +++TW+TMIS    +G  +++L LF +MK  G
Sbjct: 301 LNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSG 360

Query: 307 IQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEA 366
            +PN +T+  +L +CSHAG + +GW  F+S+   Y I+   EHY C++DLL + G L +A
Sbjct: 361 TKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDA 420

Query: 367 FLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESL 426
             L+  M+       W  LL  C + RN+ + E AA ++  L+P +   +  L +IY + 
Sbjct: 421 VKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANS 480

Query: 427 GISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 481
                   IR +MRD G+ K PGCS I ++  +H F  GDNSHP  + + + LN
Sbjct: 481 QKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLN 528

BLAST of Cla97C02G026810 vs. TAIR10
Match: AT3G26782.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 340.1 bits (871), Expect = 2.2e-93
Identity = 186/482 (38.59%), Postives = 284/482 (58.92%), Query Frame = 0

Query: 8   SRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDK 67
           S F   +++C+ L+ I   KQ H Q  + G+   + + + L+++Y  CG L DARKVFD+
Sbjct: 77  SSFPCAIKACSSLFDIFSGKQTHQQAFVFGYQSDIFVSSALIVMYSTCGKLEDARKVFDE 136

Query: 68  MTHRNMHSWNILIASYVHNSLYFDAINVF-------NEFTDLGFLPDHYTLPQMLKASVG 127
           +  RN+ SW  +I  Y  N    DA+++F       N+  D  FL D   L  ++ A   
Sbjct: 137 IPKRNIVSWTSMIRGYDLNGNALDAVSLFKDLLVDENDDDDAMFL-DSMGLVSVISACSR 196

Query: 128 IGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAK--YGIVGDAKKVFDNMIFKDTISWN 187
           +    L + +H + I+ GF+  V V +T+LD YAK   G V  A+K+FD ++ KD +S+N
Sbjct: 197 VPAKGLTESIHSFVIKRGFDRGVSVGNTLLDAYAKGGEGGVAVARKIFDQIVDKDRVSYN 256

Query: 188 SMISGYGRAGLYEDALDCFKRMLL-EGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLK 247
           S++S Y ++G+  +A + F+R++  + V  + +T+ +VL A    G LR GK IH  V++
Sbjct: 257 SIMSVYAQSGMSNEAFEVFRRLVKNKVVTFNAITLSTVLLAVSHSGALRIGKCIHDQVIR 316

Query: 248 SMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLV 307
            M    DV +G S+IDMY KCG +  + K F  M + N+ +WT MI+ YG HG   K+L 
Sbjct: 317 -MGLEDDVIVGTSIIDMYCKCGRVETARKAFDRMKNKNVRSWTAMIAGYGMHGHAAKALE 376

Query: 308 LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLL 367
           LF  M D G++PN +T  ++LA+CSHAG   EGWR F ++   + +E  +EHY C+VDLL
Sbjct: 377 LFPAMIDSGVRPNYITFVSVLAACSHAGLHVEGWRWFNAMKGRFGVEPGLEHYGCMVDLL 436

Query: 368 SRLGFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFI 427
            R GFLQ+A+ LI+ MK+     IW +LL+ C IH+N+E+ EI+  +LFEL+ +N   ++
Sbjct: 437 GRAGFLQKAYDLIQRMKMKPDSIIWSSLLAACRIHKNVELAEISVARLFELDSSNCGYYM 496

Query: 428 ALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFE 480
            L  IY   G       +R  M++ GL K PG S + ++G VH F  GD  HP   +I+E
Sbjct: 497 LLSHIYADAGRWKDVERVRMIMKNRGLVKPPGFSLLELNGEVHVFLIGDEEHPQREKIYE 556

BLAST of Cla97C02G026810 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 320.5 bits (820), Expect = 1.8e-87
Identity = 176/474 (37.13%), Postives = 272/474 (57.38%), Query Frame = 0

Query: 11  SFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLL-VYCKCGFLHDARKVFDKMT 70
           S +L +C+HL  +   K+ HA  L +G L   + +   L+ +YC C  +   R+VFD M 
Sbjct: 306 SSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMF 365

Query: 71  HRNMHSWNILIASYVHNSLYFDAINVFNEFTD-LGFLPDHYTLPQMLKASVGIGDVYLGK 130
            R +  WN +IA Y  N    +A+ +F    +  G L +  T+  ++ A V  G     +
Sbjct: 366 DRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKE 425

Query: 131 RLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAG 190
            +H + ++ G +    V +T++D Y++ G +  A ++F  M  +D ++WN+MI+GY  + 
Sbjct: 426 AIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSE 485

Query: 191 LYEDALDCFKRML-LE----------GVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLK 250
            +EDAL    +M  LE           +K + +T+ ++L +C     L KGKEIH   +K
Sbjct: 486 HHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIK 545

Query: 251 SMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLV 310
           + + A DVA+G++L+DMYAKCG L  S KVF  +   N++TW  +I  YG HG G++++ 
Sbjct: 546 NNL-ATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAID 605

Query: 311 LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLL 370
           L   M   G++PN VT  ++ A+CSH+G ++EG RIF  +  DY +E + +HYACVVDLL
Sbjct: 606 LLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLL 665

Query: 371 SRLGFLQEAFLLIRNM-KLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNF 430
            R G ++EA+ L+  M +       W +LL    IH NLEIGEIAA  L +LEP   S++
Sbjct: 666 GRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHY 725

Query: 431 IALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHP 471
           + L +IY S G+   A+ +R  M++ G+ K PGCS I     VHKF  GD+SHP
Sbjct: 726 VLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHP 778

BLAST of Cla97C02G026810 vs. TAIR10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 318.2 bits (814), Expect = 9.2e-87
Identity = 169/473 (35.73%), Postives = 271/473 (57.29%), Query Frame = 0

Query: 13  ILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFDKMTHRN 72
           +L +C       +    H+  + HG    L +   L+ +Y + G L D +KVFD+M  R+
Sbjct: 253 LLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRD 312

Query: 73  MHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYLGKRLHC 132
           + SWN +I +Y  N     AI++F E       PD  TL  +      +GD+   + +  
Sbjct: 313 LISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQG 372

Query: 133 WTIRLG-FEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGRAGLYE 192
           +T+R G F   + + + V+  YAK G+V  A+ VF+ +   D ISWN++ISGY + G   
Sbjct: 373 FTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFAS 432

Query: 193 DALDCFKRMLLEG-VKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVAIGNS 252
           +A++ +  M  EG +  +  T  SVL AC   G LR+G ++H  +LK+ ++  DV +  S
Sbjct: 433 EAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLY-LDVFVVTS 492

Query: 253 LIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCGIQPN 312
           L DMY KCG L D+  +F+ +  +N V W T+I+C+G HG GEK+++LF +M D G++P+
Sbjct: 493 LADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPD 552

Query: 313 SVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEAFLLI 372
            +T   +L++CSH+G ++EG   F+ + +DY I  +++HY C+VD+  R G L+ A   I
Sbjct: 553 HITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFI 612

Query: 373 RNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESLGISH 432
           ++M L    SIWGALLS C +H N+++G+IA+  LFE+EP +    + L ++Y S G   
Sbjct: 613 KSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWE 672

Query: 433 GASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLNAVR 484
           G   IR      GL K PG S + +D  V  FY G+ +HP+   ++  L A++
Sbjct: 673 GVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHPMYEEMYRELTALQ 724

BLAST of Cla97C02G026810 vs. TAIR10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 314.7 bits (805), Expect = 1.0e-85
Identity = 154/427 (36.07%), Postives = 245/427 (57.38%), Query Frame = 0

Query: 58  LHDARKVFDKMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKA 117
           +   R+VF+ M  +++ S+N +IA Y  + +Y DA+ +  E       PD +TL  +L  
Sbjct: 192 IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPI 251

Query: 118 SVGIGDVYLGKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISW 177
                DV  GK +H + IR G +  V + S+++D YAK   + D+++VF  +  +D ISW
Sbjct: 252 FSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISW 311

Query: 178 NSMISGYGRAGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLK 237
           NS+++GY + G Y +AL  F++M+   VK   +   SV+ AC     L  GK++H  VL+
Sbjct: 312 NSLVAGYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLR 371

Query: 238 SMVFAADVAIGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLV 297
              F +++ I ++L+DMY+KCG++  + K+F  M+ L+ V+WT +I  +  HG G +++ 
Sbjct: 372 G-GFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVS 431

Query: 298 LFNKMKDCGIQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLL 357
           LF +MK  G++PN V   A+L +CSH G ++E W  F S+   Y +   +EHYA V DLL
Sbjct: 432 LFEEMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLL 491

Query: 358 SRLGFLQEAFLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFI 417
            R G L+EA+  I  M +    S+W  LLS C +H+NLE+ E  A ++F ++  N   ++
Sbjct: 492 GRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYV 551

Query: 418 ALISIYESLGISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFE 477
            + ++Y S G     + +R +MR  GL K P CS I +    H F  GD SHP   +I E
Sbjct: 552 LMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINE 611

Query: 478 TLNAVRQ 485
            L AV +
Sbjct: 612 FLKAVME 617

BLAST of Cla97C02G026810 vs. TAIR10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 313.9 bits (803), Expect = 1.7e-85
Identity = 164/474 (34.60%), Postives = 261/474 (55.06%), Query Frame = 0

Query: 7   TSRFSFILRSCAHLYAIAQAKQAHAQILIHGFLPHLTLLTDLLLVYCKCGFLHDARKVFD 66
           ++ +S +++ C    A+ +       +  +G  P + L+  L+ +Y K   L+DA ++FD
Sbjct: 61  SATYSELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFD 120

Query: 67  KMTHRNMHSWNILIASYVHNSLYFDAINVFNEFTDLGFLPDHYTLPQMLKASVGIGDVYL 126
           +M  RN+ SW  +I++Y    ++  A+ +          P+ YT   +L++  G+ DV  
Sbjct: 121 QMPQRNVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCNGMSDV-- 180

Query: 127 GKRLHCWTIRLGFEGYVVVDSTVLDFYAKYGIVGDAKKVFDNMIFKDTISWNSMISGYGR 186
            + LHC  I+ G E  V V S ++D +AK G   DA  VFD M+  D I WNS+I G+ +
Sbjct: 181 -RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQ 240

Query: 187 AGLYEDALDCFKRMLLEGVKMDFMTIPSVLNACGGEGDLRKGKEIHCLVLKSMVFAADVA 246
               + AL+ FKRM   G   +  T+ SVL AC G   L  G + H  ++K   +  D+ 
Sbjct: 241 NSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK---YDQDLI 300

Query: 247 IGNSLIDMYAKCGSLLDSEKVFWNMSSLNIVTWTTMISCYGAHGKGEKSLVLFNKMKDCG 306
           + N+L+DMY KCGSL D+ +VF  M   +++TW+TMIS    +G  +++L LF +MK  G
Sbjct: 301 LNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSG 360

Query: 307 IQPNSVTLTAILASCSHAGYINEGWRIFQSIVSDYEIELTIEHYACVVDLLSRLGFLQEA 366
            +PN +T+  +L +CSHAG + +GW  F+S+   Y I+   EHY C++DLL + G L +A
Sbjct: 361 TKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDA 420

Query: 367 FLLIRNMKLTAVPSIWGALLSGCMIHRNLEIGEIAANQLFELEPTNPSNFIALISIYESL 426
             L+  M+       W  LL  C + RN+ + E AA ++  L+P +   +  L +IY + 
Sbjct: 421 VKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANS 480

Query: 427 GISHGASLIREKMRDLGLTKIPGCSCITIDGVVHKFYGGDNSHPLALRIFETLN 481
                   IR +MRD G+ K PGCS I ++  +H F  GDNSHP  + + + LN
Sbjct: 481 QKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLN 528

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008460899.14.5e-26589.63PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-... [more]
XP_011649399.13.0e-26188.41PREDICTED: pentatricopeptide repeat-containing protein At5g04780-like [Cucumis s... [more]
KGN62149.15.3e-25888.66hypothetical protein Csa_2G302160 [Cucumis sativus][more]
XP_023512346.11.2e-25487.45pentatricopeptide repeat-containing protein At3g24000, mitochondrial-like isofor... [more]
XP_022971622.16.1e-25486.71pentatricopeptide repeat-containing protein At5g04780, mitochondrial-like isofor... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3CDI3|A0A1S3CDI3_CUCME3.0e-26589.63pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc... [more]
tr|A0A0A0LM84|A0A0A0LM84_CUCSA3.5e-25888.66Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G302160 PE=4 SV=1[more]
tr|A0A2I4EAF5|A0A2I4EAF5_9ROSI1.7e-18065.41pentatricopeptide repeat-containing protein At5g04780-like OS=Juglans regia OX=5... [more]
tr|A0A2P4H960|A0A2P4H960_QUESU2.5e-17963.79Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_6... [more]
tr|D7U4G1|D7U4G1_VITVI5.5e-17964.15Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_04s0044g01080 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
sp|Q9LW32|PP258_ARATH4.1e-9238.59Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidop... [more]
sp|Q7Y211|PP285_ARATH3.3e-8637.13Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop... [more]
sp|O81767|PP348_ARATH1.7e-8535.73Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX... [more]
sp|Q9LW63|PP251_ARATH1.8e-8436.07Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
sp|Q9SI53|PP147_ARATH3.1e-8434.60Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT3G26782.12.2e-9338.59Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G57430.11.8e-8737.13Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G33990.19.2e-8735.73Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G23330.11.0e-8536.07Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G03880.11.7e-8534.60Pentatricopeptide repeat (PPR) superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
biological_process GO:0008150 biological_process
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C02G026810.1Cla97C02G026810.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 274..453
e-value: 8.7E-25
score: 89.7
coord: 124..273
e-value: 3.8E-23
score: 84.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 1..123
e-value: 9.4E-17
score: 62.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 175..208
e-value: 5.6E-10
score: 36.8
coord: 277..310
e-value: 2.0E-9
score: 35.0
coord: 74..107
e-value: 7.7E-5
score: 20.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 75..104
e-value: 0.0012
score: 18.8
coord: 48..72
e-value: 0.0033
score: 17.5
coord: 175..205
e-value: 5.3E-10
score: 38.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 275..321
e-value: 1.7E-12
score: 47.2
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 208..242
score: 5.623
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 173..207
score: 12.66
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 41..71
score: 7.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 346..380
score: 7.07
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 6..40
score: 5.645
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 310..340
score: 7.366
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 275..309
score: 12.178
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 72..106
score: 9.471
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 244..274
score: 7.18
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 142..172
score: 5.59
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 412..446
score: 6.226
NoneNo IPR availablePANTHERPTHR24015:SF713SUBFAMILY NOT NAMEDcoord: 6..473
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 6..473

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C02G026810Melon (DHL92) v3.5.1mewmbB093
Cla97C02G026810Silver-seed gourdcarwmbB1117
Cla97C02G026810Cucurbita maxima (Rimu)cmawmbB632
Cla97C02G026810Cucurbita moschata (Rifu)cmowmbB605