ClCG01G011540 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG01G011540
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCG_Chr01: 19154515 .. 19155441 (+)
RNA-Seq ExpressionClCG01G011540
SyntenyClCG01G011540
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCCAAACCACAAGACATAATCCCTTTCTACGCTGCTCTCCTGGAAGCATGCTCTTCCAAAAACAACCTCCACACCCTCGAGCAAATCCACGCTCTAACCATAAGACTCGGAATCTCTCACCACAATTTCATTCGAACCAAGCTCGCCTCCACCTACGCCGCCTGCGCCCAACTCCCACAAGCCCTCACCATCTTCTCCTTCGCCACTCGACGCCCTACCTACCTCTTCAATGCCCTCATCAGAGCGCACTCCTCTCTCCGTCTCTTCTCTCAATCCCTCTCCATTTTCCGCCACATGCTTCTCTCTGGCAAATCCATTGACCGTCATACTCTCCCGCCGGTGCTCAAGTCTTGTACCGGCCTCTCGTCCTTACGCCTCGGCCGCCAGGTTCATGGGGCTCTTGTGATTAATGGGTTCTCTGCAGATTTGCCGAATTTGAATGCTTTGATTACGATGTATGGCAAGTGCGGGGACTTGGGTAATGCACGGAAGGTGTTCGATGAAATGCCTGTGAGGAATGTGGTGTCATGGTCGGCGTTGATGGCGGGTTACGGTGTTCATGGGATGTTTGGGGAGGTGTTTGTGTTGTTTGAGAGGATGGTGGAAGAGGGGCAAAAGCCGGATGCGCTCACTTTTACAGCTCTTCTCACGGCGTGTAGCCATGGAGGGTTGCTTGACAGAGGGAAGGAGTATTTTGGTATGATGAGAATGGAGTTTGATTTGAGGCCTGGGTTGGAACATTATACATGTATGGTGGATTTGCTTGGGAGGGTGGGGCAAGTGGAAGAAGCAGAGAAGTTGATAATGGAGATGGAGATTAAGCCTGATGGGGCTTTGTGGGGAGCTCTGTTGAGTGCTTGTAGGATTCATGGGAAGACCGAGGTGGCTGAGAGGGTGCTAAAACGGTTTATCAACCAACAATGA

mRNA sequence

ATGCCCAAACCACAAGACATAATCCCTTTCTACGCTGCTCTCCTGGAAGCATGCTCTTCCAAAAACAACCTCCACACCCTCGAGCAAATCCACGCTCTAACCATAAGACTCGGAATCTCTCACCACAATTTCATTCGAACCAAGCTCGCCTCCACCTACGCCGCCTGCGCCCAACTCCCACAAGCCCTCACCATCTTCTCCTTCGCCACTCGACGCCCTACCTACCTCTTCAATGCCCTCATCAGAGCGCACTCCTCTCTCCGTCTCTTCTCTCAATCCCTCTCCATTTTCCGCCACATGCTTCTCTCTGGCAAATCCATTGACCGTCATACTCTCCCGCCGGTGCTCAAGTCTTGTACCGGCCTCTCGTCCTTACGCCTCGGCCGCCAGGTTCATGGGGCTCTTGTGATTAATGGGTTCTCTGCAGATTTGCCGAATTTGAATGCTTTGATTACGATGTATGGCAAGTGCGGGGACTTGGGTAATGCACGGAAGGTGTTCGATGAAATGCCTGTGAGGAATGTGGTGTCATGGTCGGCGTTGATGGCGGGTTACGGTGTTCATGGGATGTTTGGGGAGGTGTTTGTGTTGTTTGAGAGGATGGTGGAAGAGGGGCAAAAGCCGGATGCGCTCACTTTTACAGCTCTTCTCACGGCGTGTAGCCATGGAGGGTTGCTTGACAGAGGGAAGGAGTATTTTGGTATGATGAGAATGGAGTTTGATTTGAGGCCTGGGTTGGAACATTATACATGTATGGTGGATTTGCTTGGGAGGGTGGGGCAAGTGGAAGAAGCAGAGAAGTTGATAATGGAGATGGAGATTAAGCCTGATGGGGCTTTGTGGGGAGCTCTGTTGAGTGCTTGTAGGATTCATGGGAAGACCGAGGTGGCTGAGAGGGTGCTAAAACGGTTTATCAACCAACAATGA

Coding sequence (CDS)

ATGCCCAAACCACAAGACATAATCCCTTTCTACGCTGCTCTCCTGGAAGCATGCTCTTCCAAAAACAACCTCCACACCCTCGAGCAAATCCACGCTCTAACCATAAGACTCGGAATCTCTCACCACAATTTCATTCGAACCAAGCTCGCCTCCACCTACGCCGCCTGCGCCCAACTCCCACAAGCCCTCACCATCTTCTCCTTCGCCACTCGACGCCCTACCTACCTCTTCAATGCCCTCATCAGAGCGCACTCCTCTCTCCGTCTCTTCTCTCAATCCCTCTCCATTTTCCGCCACATGCTTCTCTCTGGCAAATCCATTGACCGTCATACTCTCCCGCCGGTGCTCAAGTCTTGTACCGGCCTCTCGTCCTTACGCCTCGGCCGCCAGGTTCATGGGGCTCTTGTGATTAATGGGTTCTCTGCAGATTTGCCGAATTTGAATGCTTTGATTACGATGTATGGCAAGTGCGGGGACTTGGGTAATGCACGGAAGGTGTTCGATGAAATGCCTGTGAGGAATGTGGTGTCATGGTCGGCGTTGATGGCGGGTTACGGTGTTCATGGGATGTTTGGGGAGGTGTTTGTGTTGTTTGAGAGGATGGTGGAAGAGGGGCAAAAGCCGGATGCGCTCACTTTTACAGCTCTTCTCACGGCGTGTAGCCATGGAGGGTTGCTTGACAGAGGGAAGGAGTATTTTGGTATGATGAGAATGGAGTTTGATTTGAGGCCTGGGTTGGAACATTATACATGTATGGTGGATTTGCTTGGGAGGGTGGGGCAAGTGGAAGAAGCAGAGAAGTTGATAATGGAGATGGAGATTAAGCCTGATGGGGCTTTGTGGGGAGCTCTGTTGAGTGCTTGTAGGATTCATGGGAAGACCGAGGTGGCTGAGAGGGTGCTAAAACGGTTTATCAACCAACAATGA

Protein sequence

MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKRFINQQ
Homology
BLAST of ClCG01G011540 vs. NCBI nr
Match: XP_004144516.1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis sativus] >KGN43485.1 hypothetical protein Csa_020556 [Cucumis sativus])

HSP 1 Score: 559.7 bits (1441), Expect = 1.6e-155
Identity = 275/308 (89.29%), Postives = 290/308 (94.16%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKP +IIPFYAALL+ACSS NNLHTL+QIHALTI L ISHH+FIRTKLASTYAACAQLP
Sbjct: 1   MPKPHEIIPFYAALLDACSSTNNLHTLKQIHALTITLHISHHHFIRTKLASTYAACAQLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPTYLFN LIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT
Sbjct: 61  QATTIFSFATRRPTYLFNTLIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGAL+INGFSADLP+LNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGALLINGFSADLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFGEVF LFERMVEEGQKPD LTFT+LLTACSHGGL+++GKEYFGMMRMEF
Sbjct: 181 LMAGYGVHGMFGEVFRLFERMVEEGQKPDELTFTSLLTACSHGGLIEKGKEYFGMMRMEF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
            LRPGL+HYTCMVDLLGR GQVEEAEKLIMEMEI+PD ALWGA+LSACRIHGK +VA+RV
Sbjct: 241 HLRPGLQHYTCMVDLLGRSGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKVDVADRV 300

Query: 301 LKRFINQQ 309
            KRFI QQ
Sbjct: 301 QKRFIKQQ 308

BLAST of ClCG01G011540 vs. NCBI nr
Match: XP_023520705.1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 553.1 bits (1424), Expect = 1.5e-153
Identity = 273/307 (88.93%), Postives = 285/307 (92.83%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKPQDIIPFYAALL+ACSS  N  TL+QIHALTIRLGISHH+FIRTKLASTYAAC  LP
Sbjct: 1   MPKPQDIIPFYAALLQACSSTKNHRTLKQIHALTIRLGISHHDFIRTKLASTYAACHHLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHML+SGK IDRHTLPPVLKSCT
Sbjct: 61  QATTIFSFATRRPTFLFNALIRAHSSLRLFSQSLSIFRHMLVSGKPIDRHTLPPVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGA+VINGFS DLPNLNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGAVVINGFSTDLPNLNALITMYGKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFGEVF LFERMVEEGQKPD LTFTALLTACSHGGL++RGKEYFGMM+M F
Sbjct: 181 LMAGYGVHGMFGEVFGLFERMVEEGQKPDELTFTALLTACSHGGLIERGKEYFGMMKMGF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
           DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEI+PD ALWGALL ACRIHGK EVA+RV
Sbjct: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIEPDEALWGALLGACRIHGKAEVADRV 300

Query: 301 LKRFINQ 308
             RF+ Q
Sbjct: 301 QTRFMKQ 307

BLAST of ClCG01G011540 vs. NCBI nr
Match: KAG6583697.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 553.1 bits (1424), Expect = 1.5e-153
Identity = 274/307 (89.25%), Postives = 284/307 (92.51%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKPQDIIPFYAALL+ACSS  N  TL+QIHALTIRLGISHH+FIRTKLASTYAAC  LP
Sbjct: 1   MPKPQDIIPFYAALLQACSSTKNHRTLKQIHALTIRLGISHHDFIRTKLASTYAACHHLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHMLLSGK IDRHTLPPV+KSCT
Sbjct: 61  QATTIFSFATRRPTFLFNALIRAHSSLRLFSQSLSIFRHMLLSGKPIDRHTLPPVIKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGA+VINGFS DLPNLNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGAVVINGFSTDLPNLNALITMYGKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMF EVF LFERMVEEGQKPD LTFTALLTACSHGGL++RGKEYFGMM M F
Sbjct: 181 LMAGYGVHGMFSEVFGLFERMVEEGQKPDELTFTALLTACSHGGLIERGKEYFGMMEMGF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
           DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEI+PD ALWGALLSACRIHGK EVA+RV
Sbjct: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIEPDEALWGALLSACRIHGKAEVADRV 300

Query: 301 LKRFINQ 308
             RFI Q
Sbjct: 301 QTRFIKQ 307

BLAST of ClCG01G011540 vs. NCBI nr
Match: XP_038877225.1 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 552.0 bits (1421), Expect = 3.3e-153
Identity = 274/308 (88.96%), Postives = 287/308 (93.18%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKPQ+IIPFYAALLEACS+  NLHTL+QIHALTIR  ISHH+FIRTKLASTYAACAQL 
Sbjct: 1   MPKPQEIIPFYAALLEACSATKNLHTLKQIHALTIRHRISHHDFIRTKLASTYAACAQLR 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHMLLS KS D+HT PPVLKSCT
Sbjct: 61  QATTIFSFATRRPTFLFNALIRAHSSLRLFSQSLSIFRHMLLSRKSFDQHTFPPVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGAL+INGFSADLP+LNALITMY KCGDLG ARKVFDEMP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGALLINGFSADLPSLNALITMYCKCGDLGVARKVFDEMPERNTVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFG+VF LFER+VEEGQKPD LTFTALLTACSHGGL++R KEYFGMMRM F
Sbjct: 181 LMAGYGVHGMFGDVFGLFERLVEEGQKPDELTFTALLTACSHGGLIERAKEYFGMMRMRF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
           DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEI+PD ALWGALLSACRIH KTEVA RV
Sbjct: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIEPDEALWGALLSACRIHRKTEVAHRV 300

Query: 301 LKRFINQQ 309
            KRFINQQ
Sbjct: 301 QKRFINQQ 308

BLAST of ClCG01G011540 vs. NCBI nr
Match: KAG7019344.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 552.0 bits (1421), Expect = 3.3e-153
Identity = 273/307 (88.93%), Postives = 284/307 (92.51%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKPQDIIPFYAALL+ACSS  N  TL+QIHALTIRLGISHH+FIRTKLASTYAAC  LP
Sbjct: 1   MPKPQDIIPFYAALLQACSSTKNHRTLKQIHALTIRLGISHHDFIRTKLASTYAACHHLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHMLLSGK IDRHTLPPV+KSCT
Sbjct: 61  QATTIFSFATRRPTFLFNALIRAHSSLRLFSQSLSIFRHMLLSGKPIDRHTLPPVIKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGA+VINGFS DLPNLNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGAVVINGFSTDLPNLNALITMYGKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMF EVF LFERMVEEGQKPD LTFTALLTACSHGGL++RGKEYFGMM M F
Sbjct: 181 LMAGYGVHGMFSEVFGLFERMVEEGQKPDELTFTALLTACSHGGLIERGKEYFGMMEMGF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
           DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEI+PD ALWGALLSACRIHGK EVA+RV
Sbjct: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIEPDEALWGALLSACRIHGKAEVADRV 300

Query: 301 LKRFINQ 308
             RF+ Q
Sbjct: 301 QTRFMKQ 307

BLAST of ClCG01G011540 vs. ExPASy Swiss-Prot
Match: Q9STF3 (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 1.7e-56
Identity = 114/299 (38.13%), Postives = 177/299 (59.20%), Query Frame = 0

Query: 11  YAALLEACSSK----NNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIF 70
           Y  +L+AC +     N+L   ++IHA   R G S H +I T L   YA    +  A  +F
Sbjct: 181 YTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVF 240

Query: 71  SFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK--SIDRHTLPPVLKSCTGLSS 130
                R    ++A+I  ++      ++L  FR M+   K  S +  T+  VL++C  L++
Sbjct: 241 GGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAA 300

Query: 131 LRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAG 190
           L  G+ +HG ++  G  + LP ++AL+TMYG+CG L   ++VFD M  R+VVSW++L++ 
Sbjct: 301 LEQGKLIHGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISS 360

Query: 191 YGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRP 250
           YGVHG   +   +FE M+  G  P  +TF ++L ACSH GL++ GK  F  M  +  ++P
Sbjct: 361 YGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKP 420

Query: 251 GLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKR 304
            +EHY CMVDLLGR  +++EA K++ +M  +P   +WG+LL +CRIHG  E+AER  +R
Sbjct: 421 QIEHYACMVDLLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRR 479

BLAST of ClCG01G011540 vs. ExPASy Swiss-Prot
Match: Q9CAY1 (Putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H52 PE=1 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 4.3e-55
Identity = 111/293 (37.88%), Postives = 168/293 (57.34%), Query Frame = 0

Query: 14  LLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIFSFATRRP 73
           L+  C+    L     +H   ++ G+     +     + Y  C  +     +F     + 
Sbjct: 162 LVPLCTVPEYLWLGRSLHGQCVKGGLDSEVAVLNSFITMYMKCGSVEAGRRLFDEMPVKG 221

Query: 74  TYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQVHG 133
              +NA+I  +S   L    L ++  M  SG   D  TL  VL SC  L + ++G +V  
Sbjct: 222 LITWNAVISGYSQNGLAYDVLELYEQMKSSGVCPDPFTLVSVLSSCAHLGAKKIGHEVGK 281

Query: 134 ALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGMFGE 193
            +  NGF  ++   NA I+MY +CG+L  AR VFD MPV+++VSW+A++  YG+HGM GE
Sbjct: 282 LVESNGFVPNVFVSNASISMYARCGNLAKARAVFDIMPVKSLVSWTAMIGCYGMHGM-GE 341

Query: 194 V-FVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYTCM 253
           +  +LF+ M++ G +PD   F  +L+ACSH GL D+G E F  M+ E+ L PG EHY+C+
Sbjct: 342 IGLMLFDDMIKRGIRPDGAVFVMVLSACSHSGLTDKGLELFRAMKREYKLEPGPEHYSCL 401

Query: 254 VDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKRFI 306
           VDLLGR G+++EA + I  M ++PDGA+WGALL AC+IH   ++AE    + I
Sbjct: 402 VDLLGRAGRLDEAMEFIESMPVEPDGAVWGALLGACKIHKNVDMAELAFAKVI 453

BLAST of ClCG01G011540 vs. ExPASy Swiss-Prot
Match: Q9LND4 (Pentatricopeptide repeat-containing protein At1g06140, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E61 PE=2 SV=1)

HSP 1 Score: 213.0 bits (541), Expect = 4.8e-54
Identity = 104/296 (35.14%), Postives = 169/296 (57.09%), Query Frame = 0

Query: 14  LLEACSSKNNLHTLEQIHALTIRLG-ISHHNFIRTKLASTYAACAQLPQALTIFSFATRR 73
           L++AC +       + +H ++IR   I   ++++  +   Y  C  L  A  +F  +  R
Sbjct: 216 LVKACGNVFAGKVGKCVHGVSIRRSFIDQSDYLQASIIDMYVKCRLLDNARKLFETSVDR 275

Query: 74  PTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQVH 133
              ++  LI   +      ++  +FR ML      ++ TL  +L SC+ L SLR G+ VH
Sbjct: 276 NVVMWTTLISGFAKCERAVEAFDLFRQMLRESILPNQCTLAAILVSCSSLGSLRHGKSVH 335

Query: 134 GALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGMFG 193
           G ++ NG   D  N  + I MY +CG++  AR VFD MP RNV+SWS+++  +G++G+F 
Sbjct: 336 GYMIRNGIEMDAVNFTSFIDMYARCGNIQMARTVFDMMPERNVISWSSMINAFGINGLFE 395

Query: 194 EVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYTCM 253
           E    F +M  +   P+++TF +LL+ACSH G +  G + F  M  ++ + P  EHY CM
Sbjct: 396 EALDCFHKMKSQNVVPNSVTFVSLLSACSHSGNVKEGWKQFESMTRDYGVVPEEEHYACM 455

Query: 254 VDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKRFINQQ 309
           VDLLGR G++ EA+  I  M +KP  + WGALLSACRIH + ++A  + ++ ++ +
Sbjct: 456 VDLLGRAGEIGEAKSFIDNMPVKPMASAWGALLSACRIHKEVDLAGEIAEKLLSME 511

BLAST of ClCG01G011540 vs. ExPASy Swiss-Prot
Match: Q9CAA8 (Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H22 PE=3 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 1.1e-53
Identity = 101/288 (35.07%), Postives = 163/288 (56.60%), Query Frame = 0

Query: 11  YAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIFSFAT 70
           + ++L AC     ++  +QIHA  IR     H ++ + L   Y  C  L  A T+F    
Sbjct: 273 FGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVGSALIDMYCKCKCLHYAKTVFDRMK 332

Query: 71  RRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQ 130
           ++    + A++  +       +++ IF  M  SG   D +TL   + +C  +SSL  G Q
Sbjct: 333 QKNVVSWTAMVVGYGQTGRAEEAVKIFLDMQRSGIDPDHYTLGQAISACANVSSLEEGSQ 392

Query: 131 VHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGM 190
            HG  + +G    +   N+L+T+YGKCGD+ ++ ++F+EM VR+ VSW+A+++ Y   G 
Sbjct: 393 FHGKAITSGLIHYVTVSNSLVTLYGKCGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGR 452

Query: 191 FGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYT 250
             E   LF++MV+ G KPD +T T +++ACS  GL+++G+ YF +M  E+ + P + HY+
Sbjct: 453 AVETIQLFDKMVQHGLKPDGVTLTGVISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYS 512

Query: 251 CMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAE 299
           CM+DL  R G++EEA + I  M   PD   W  LLSACR  G  E+ +
Sbjct: 513 CMIDLFSRSGRLEEAMRFINGMPFPPDAIGWTTLLSACRNKGNLEIGK 560

BLAST of ClCG01G011540 vs. ExPASy Swiss-Prot
Match: P0C8Q2 (Pentatricopeptide repeat-containing protein At4g19191, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E1 PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 1.1e-53
Identity = 110/306 (35.95%), Postives = 169/306 (55.23%), Query Frame = 0

Query: 5   QDIIPFYAALLEACSSKNNLHTLEQ---IHALTIRLGISHHNFIRTKLASTYAACAQLPQ 64
           ++  P  +  +   +S  N  TL Q   IH+  I LG            S Y+       
Sbjct: 250 EEFKPDLSTFINLAASCQNPETLTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSEDTCS 309

Query: 65  ALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTG 124
           A  +F   T R    +  +I  ++      ++L++F  M+ SG+  D  TL  ++  C  
Sbjct: 310 ARLLFDIMTSRTCVSWTVMISGYAEKGDMDEALALFHAMIKSGEKPDLVTLLSLISGCGK 369

Query: 125 LSSLRLGRQVHGALVINGFSAD-LPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 184
             SL  G+ +     I G   D +   NALI MY KCG +  AR +FD  P + VV+W+ 
Sbjct: 370 FGSLETGKWIDARADIYGCKRDNVMICNALIDMYSKCGSIHEARDIFDNTPEKTVVTWTT 429

Query: 185 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 244
           ++AGY ++G+F E   LF +M++   KP+ +TF A+L AC+H G L++G EYF +M+  +
Sbjct: 430 MIAGYALNGIFLEALKLFSKMIDLDYKPNHITFLAVLQACAHSGSLEKGWEYFHIMKQVY 489

Query: 245 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 304
           ++ PGL+HY+CMVDLLGR G++EEA +LI  M  KPD  +WGALL+AC+IH   ++AE+ 
Sbjct: 490 NISPGLDHYSCMVDLLGRKGKLEEALELIRNMSAKPDAGIWGALLNACKIHRNVKIAEQA 549

Query: 305 LKRFIN 307
            +   N
Sbjct: 550 AESLFN 555

BLAST of ClCG01G011540 vs. ExPASy TrEMBL
Match: A0A0A0K1F7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G041310 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 7.7e-156
Identity = 275/308 (89.29%), Postives = 290/308 (94.16%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKP +IIPFYAALL+ACSS NNLHTL+QIHALTI L ISHH+FIRTKLASTYAACAQLP
Sbjct: 1   MPKPHEIIPFYAALLDACSSTNNLHTLKQIHALTITLHISHHHFIRTKLASTYAACAQLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPTYLFN LIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT
Sbjct: 61  QATTIFSFATRRPTYLFNTLIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGAL+INGFSADLP+LNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGALLINGFSADLPSLNALITMYGKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFGEVF LFERMVEEGQKPD LTFT+LLTACSHGGL+++GKEYFGMMRMEF
Sbjct: 181 LMAGYGVHGMFGEVFRLFERMVEEGQKPDELTFTSLLTACSHGGLIEKGKEYFGMMRMEF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
            LRPGL+HYTCMVDLLGR GQVEEAEKLIMEMEI+PD ALWGA+LSACRIHGK +VA+RV
Sbjct: 241 HLRPGLQHYTCMVDLLGRSGQVEEAEKLIMEMEIEPDEALWGAMLSACRIHGKVDVADRV 300

Query: 301 LKRFINQQ 309
            KRFI QQ
Sbjct: 301 QKRFIKQQ 308

BLAST of ClCG01G011540 vs. ExPASy TrEMBL
Match: A0A6J1I7X0 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111471666 PE=4 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 1.8e-152
Identity = 273/309 (88.35%), Postives = 286/309 (92.56%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKPQDIIPFYAALL+ACSS  N HTL+QIHALTIRLGISHH+FIRTKLASTYAAC  LP
Sbjct: 1   MPKPQDIIPFYAALLQACSSTKNHHTLKQIHALTIRLGISHHDFIRTKLASTYAACDHLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFRHMLLSGK IDRHTLPPVLKSCT
Sbjct: 61  QATTIFSFATRRPTFLFNALIRAHSSLRLFSQSLSIFRHMLLSGKPIDRHTLPPVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHG +VINGFS DLPNLNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGFVVINGFSTDLPNLNALITMYGKCGDLGIARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFGEVF LFERMVEEGQKPD LTFTALLTACSHGGL++RGKEYFGMM+M F
Sbjct: 181 LMAGYGVHGMFGEVFGLFERMVEEGQKPDELTFTALLTACSHGGLIERGKEYFGMMKMGF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLI--MEMEIKPDGALWGALLSACRIHGKTEVAE 300
           +L+PGLEHYTCMVDLLGRVGQVEEAEKLI  MEMEI+PD ALWGALLSACRIHGK EVA+
Sbjct: 241 NLKPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEMEIEPDEALWGALLSACRIHGKAEVAD 300

Query: 301 RVLKRFINQ 308
           RV  RF+ Q
Sbjct: 301 RVQTRFMKQ 309

BLAST of ClCG01G011540 vs. ExPASy TrEMBL
Match: A0A6J1EHW3 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111434233 PE=4 SV=1)

HSP 1 Score: 548.1 bits (1411), Expect = 2.3e-152
Identity = 272/307 (88.60%), Postives = 284/307 (92.51%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKPQDIIPFYAALL+ACSS  N  TL+QIHALTIRLGISHH+FIRTKLASTYAAC  LP
Sbjct: 1   MPKPQDIIPFYAALLQACSSTKNHRTLKQIHALTIRLGISHHDFIRTKLASTYAACHHLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPT+LFNALIRAHSSLRLFSQSLSIFR MLLSGK IDRHTLPPV+KSCT
Sbjct: 61  QATTIFSFATRRPTFLFNALIRAHSSLRLFSQSLSIFRRMLLSGKPIDRHTLPPVIKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGA+VINGFS DLPNLNALITMYGKCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGAVVINGFSTDLPNLNALITMYGKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMF EVF LFERMVEEGQKPD LTFTALLTACSHGGL++RGKEYFGMM+M F
Sbjct: 181 LMAGYGVHGMFDEVFGLFERMVEEGQKPDELTFTALLTACSHGGLIERGKEYFGMMQMGF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
           DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEI+PD ALWGALLSACRIHGK EVA+RV
Sbjct: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIEPDEALWGALLSACRIHGKGEVADRV 300

Query: 301 LKRFINQ 308
             RF+ Q
Sbjct: 301 QTRFMKQ 307

BLAST of ClCG01G011540 vs. ExPASy TrEMBL
Match: A0A5A7UT42 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold786G00080 PE=4 SV=1)

HSP 1 Score: 540.4 bits (1391), Expect = 4.8e-150
Identity = 266/308 (86.36%), Postives = 285/308 (92.53%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKP +IIPFYAALLEACSS  NLHTL+QIHALTI L ISHH+FIRTKLASTYAACAQLP
Sbjct: 1   MPKPHEIIPFYAALLEACSSTKNLHTLKQIHALTITLHISHHHFIRTKLASTYAACAQLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKS DRHT P VLKSCT
Sbjct: 61  QANTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSTDRHTFPLVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGAL+INGFSADLP+LNALITMY KCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGALLINGFSADLPSLNALITMYSKCGDLGVARKVFDGMPERNEVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFGEVF LFERMV+EGQ+PD LTFT+LLTACSHGGL+++GKEYF  MRMEF
Sbjct: 181 LMAGYGVHGMFGEVFRLFERMVKEGQRPDELTFTSLLTACSHGGLIEKGKEYFRTMRMEF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
            LRPGL+HYTCMVDLLGR+GQVEEAEKLIMEME++PD ALWGA+LSACRIHG+ +VA+RV
Sbjct: 241 HLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEMEPDEALWGAMLSACRIHGRVDVADRV 300

Query: 301 LKRFINQQ 309
            KRFI QQ
Sbjct: 301 QKRFIKQQ 308

BLAST of ClCG01G011540 vs. ExPASy TrEMBL
Match: A0A1S3C0K0 (pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103495634 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 6.3e-150
Identity = 266/308 (86.36%), Postives = 285/308 (92.53%), Query Frame = 0

Query: 1   MPKPQDIIPFYAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLP 60
           MPKP +IIPFYAALLEACSS  NLHTL+QIHALTI L ISHH+FIRTKLASTYAACAQLP
Sbjct: 1   MPKPHEIIPFYAALLEACSSTKNLHTLKQIHALTITLHISHHHFIRTKLASTYAACAQLP 60

Query: 61  QALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCT 120
           QA TIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKS DRHT P VLKSCT
Sbjct: 61  QANTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSTDRHTFPLVLKSCT 120

Query: 121 GLSSLRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 180
           GLSSLRLGRQVHGAL+INGFSADLP+LNALITMY KCGDLG ARKVFD MP RN VSWSA
Sbjct: 121 GLSSLRLGRQVHGALLINGFSADLPSLNALITMYSKCGDLGVARKVFDGMPERNGVSWSA 180

Query: 181 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 240
           LMAGYGVHGMFGEVF LFERMV+EGQ+PD LTFT+LLTACSHGGL+++GKEYF  MRMEF
Sbjct: 181 LMAGYGVHGMFGEVFRLFERMVKEGQRPDELTFTSLLTACSHGGLIEKGKEYFRTMRMEF 240

Query: 241 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 300
            LRPGL+HYTCMVDLLGR+GQVEEAEKLIMEME++PD ALWGA+LSACRIHG+ +VA+RV
Sbjct: 241 HLRPGLQHYTCMVDLLGRLGQVEEAEKLIMEMEMEPDEALWGAMLSACRIHGRVDVADRV 300

Query: 301 LKRFINQQ 309
            KRFI QQ
Sbjct: 301 QKRFIKQQ 308

BLAST of ClCG01G011540 vs. TAIR 10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 221.1 bits (562), Expect = 1.2e-57
Identity = 114/299 (38.13%), Postives = 177/299 (59.20%), Query Frame = 0

Query: 11  YAALLEACSSK----NNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIF 70
           Y  +L+AC +     N+L   ++IHA   R G S H +I T L   YA    +  A  +F
Sbjct: 181 YTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVF 240

Query: 71  SFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGK--SIDRHTLPPVLKSCTGLSS 130
                R    ++A+I  ++      ++L  FR M+   K  S +  T+  VL++C  L++
Sbjct: 241 GGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAA 300

Query: 131 LRLGRQVHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAG 190
           L  G+ +HG ++  G  + LP ++AL+TMYG+CG L   ++VFD M  R+VVSW++L++ 
Sbjct: 301 LEQGKLIHGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISS 360

Query: 191 YGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRP 250
           YGVHG   +   +FE M+  G  P  +TF ++L ACSH GL++ GK  F  M  +  ++P
Sbjct: 361 YGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKP 420

Query: 251 GLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKR 304
            +EHY CMVDLLGR  +++EA K++ +M  +P   +WG+LL +CRIHG  E+AER  +R
Sbjct: 421 QIEHYACMVDLLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRR 479

BLAST of ClCG01G011540 vs. TAIR 10
Match: AT3G11460.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 216.5 bits (550), Expect = 3.1e-56
Identity = 111/293 (37.88%), Postives = 168/293 (57.34%), Query Frame = 0

Query: 14  LLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIFSFATRRP 73
           L+  C+    L     +H   ++ G+     +     + Y  C  +     +F     + 
Sbjct: 162 LVPLCTVPEYLWLGRSLHGQCVKGGLDSEVAVLNSFITMYMKCGSVEAGRRLFDEMPVKG 221

Query: 74  TYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQVHG 133
              +NA+I  +S   L    L ++  M  SG   D  TL  VL SC  L + ++G +V  
Sbjct: 222 LITWNAVISGYSQNGLAYDVLELYEQMKSSGVCPDPFTLVSVLSSCAHLGAKKIGHEVGK 281

Query: 134 ALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGMFGE 193
            +  NGF  ++   NA I+MY +CG+L  AR VFD MPV+++VSW+A++  YG+HGM GE
Sbjct: 282 LVESNGFVPNVFVSNASISMYARCGNLAKARAVFDIMPVKSLVSWTAMIGCYGMHGM-GE 341

Query: 194 V-FVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYTCM 253
           +  +LF+ M++ G +PD   F  +L+ACSH GL D+G E F  M+ E+ L PG EHY+C+
Sbjct: 342 IGLMLFDDMIKRGIRPDGAVFVMVLSACSHSGLTDKGLELFRAMKREYKLEPGPEHYSCL 401

Query: 254 VDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKRFI 306
           VDLLGR G+++EA + I  M ++PDGA+WGALL AC+IH   ++AE    + I
Sbjct: 402 VDLLGRAGRLDEAMEFIESMPVEPDGAVWGALLGACKIHKNVDMAELAFAKVI 453

BLAST of ClCG01G011540 vs. TAIR 10
Match: AT1G06140.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 213.0 bits (541), Expect = 3.4e-55
Identity = 104/296 (35.14%), Postives = 169/296 (57.09%), Query Frame = 0

Query: 14  LLEACSSKNNLHTLEQIHALTIRLG-ISHHNFIRTKLASTYAACAQLPQALTIFSFATRR 73
           L++AC +       + +H ++IR   I   ++++  +   Y  C  L  A  +F  +  R
Sbjct: 216 LVKACGNVFAGKVGKCVHGVSIRRSFIDQSDYLQASIIDMYVKCRLLDNARKLFETSVDR 275

Query: 74  PTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQVH 133
              ++  LI   +      ++  +FR ML      ++ TL  +L SC+ L SLR G+ VH
Sbjct: 276 NVVMWTTLISGFAKCERAVEAFDLFRQMLRESILPNQCTLAAILVSCSSLGSLRHGKSVH 335

Query: 134 GALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGMFG 193
           G ++ NG   D  N  + I MY +CG++  AR VFD MP RNV+SWS+++  +G++G+F 
Sbjct: 336 GYMIRNGIEMDAVNFTSFIDMYARCGNIQMARTVFDMMPERNVISWSSMINAFGINGLFE 395

Query: 194 EVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYTCM 253
           E    F +M  +   P+++TF +LL+ACSH G +  G + F  M  ++ + P  EHY CM
Sbjct: 396 EALDCFHKMKSQNVVPNSVTFVSLLSACSHSGNVKEGWKQFESMTRDYGVVPEEEHYACM 455

Query: 254 VDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERVLKRFINQQ 309
           VDLLGR G++ EA+  I  M +KP  + WGALLSACRIH + ++A  + ++ ++ +
Sbjct: 456 VDLLGRAGEIGEAKSFIDNMPVKPMASAWGALLSACRIHKEVDLAGEIAEKLLSME 511

BLAST of ClCG01G011540 vs. TAIR 10
Match: AT1G68930.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 211.8 bits (538), Expect = 7.5e-55
Identity = 101/288 (35.07%), Postives = 163/288 (56.60%), Query Frame = 0

Query: 11  YAALLEACSSKNNLHTLEQIHALTIRLGISHHNFIRTKLASTYAACAQLPQALTIFSFAT 70
           + ++L AC     ++  +QIHA  IR     H ++ + L   Y  C  L  A T+F    
Sbjct: 273 FGSVLPACGGLGAINEGKQIHACIIRTNFQDHIYVGSALIDMYCKCKCLHYAKTVFDRMK 332

Query: 71  RRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTGLSSLRLGRQ 130
           ++    + A++  +       +++ IF  M  SG   D +TL   + +C  +SSL  G Q
Sbjct: 333 QKNVVSWTAMVVGYGQTGRAEEAVKIFLDMQRSGIDPDHYTLGQAISACANVSSLEEGSQ 392

Query: 131 VHGALVINGFSADLPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSALMAGYGVHGM 190
            HG  + +G    +   N+L+T+YGKCGD+ ++ ++F+EM VR+ VSW+A+++ Y   G 
Sbjct: 393 FHGKAITSGLIHYVTVSNSLVTLYGKCGDIDDSTRLFNEMNVRDAVSWTAMVSAYAQFGR 452

Query: 191 FGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEFDLRPGLEHYT 250
             E   LF++MV+ G KPD +T T +++ACS  GL+++G+ YF +M  E+ + P + HY+
Sbjct: 453 AVETIQLFDKMVQHGLKPDGVTLTGVISACSRAGLVEKGQRYFKLMTSEYGIVPSIGHYS 512

Query: 251 CMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAE 299
           CM+DL  R G++EEA + I  M   PD   W  LLSACR  G  E+ +
Sbjct: 513 CMIDLFSRSGRLEEAMRFINGMPFPPDAIGWTTLLSACRNKGNLEIGK 560

BLAST of ClCG01G011540 vs. TAIR 10
Match: AT4G19191.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 211.8 bits (538), Expect = 7.5e-55
Identity = 110/306 (35.95%), Postives = 169/306 (55.23%), Query Frame = 0

Query: 5   QDIIPFYAALLEACSSKNNLHTLEQ---IHALTIRLGISHHNFIRTKLASTYAACAQLPQ 64
           ++  P  +  +   +S  N  TL Q   IH+  I LG            S Y+       
Sbjct: 250 EEFKPDLSTFINLAASCQNPETLTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSEDTCS 309

Query: 65  ALTIFSFATRRPTYLFNALIRAHSSLRLFSQSLSIFRHMLLSGKSIDRHTLPPVLKSCTG 124
           A  +F   T R    +  +I  ++      ++L++F  M+ SG+  D  TL  ++  C  
Sbjct: 310 ARLLFDIMTSRTCVSWTVMISGYAEKGDMDEALALFHAMIKSGEKPDLVTLLSLISGCGK 369

Query: 125 LSSLRLGRQVHGALVINGFSAD-LPNLNALITMYGKCGDLGNARKVFDEMPVRNVVSWSA 184
             SL  G+ +     I G   D +   NALI MY KCG +  AR +FD  P + VV+W+ 
Sbjct: 370 FGSLETGKWIDARADIYGCKRDNVMICNALIDMYSKCGSIHEARDIFDNTPEKTVVTWTT 429

Query: 185 LMAGYGVHGMFGEVFVLFERMVEEGQKPDALTFTALLTACSHGGLLDRGKEYFGMMRMEF 244
           ++AGY ++G+F E   LF +M++   KP+ +TF A+L AC+H G L++G EYF +M+  +
Sbjct: 430 MIAGYALNGIFLEALKLFSKMIDLDYKPNHITFLAVLQACAHSGSLEKGWEYFHIMKQVY 489

Query: 245 DLRPGLEHYTCMVDLLGRVGQVEEAEKLIMEMEIKPDGALWGALLSACRIHGKTEVAERV 304
           ++ PGL+HY+CMVDLLGR G++EEA +LI  M  KPD  +WGALL+AC+IH   ++AE+ 
Sbjct: 490 NISPGLDHYSCMVDLLGRKGKLEEALELIRNMSAKPDAGIWGALLNACKIHRNVKIAEQA 549

Query: 305 LKRFIN 307
            +   N
Sbjct: 550 AESLFN 555

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004144516.11.6e-15589.29pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis sa... [more]
XP_023520705.11.5e-15388.93pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucur... [more]
KAG6583697.11.5e-15389.25Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_038877225.13.3e-15388.96pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Benin... [more]
KAG7019344.13.3e-15388.93Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
Q9STF31.7e-5638.13Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
Q9CAY14.3e-5537.88Putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS... [more]
Q9LND44.8e-5435.14Pentatricopeptide repeat-containing protein At1g06140, mitochondrial OS=Arabidop... [more]
Q9CAA81.1e-5335.07Putative pentatricopeptide repeat-containing protein At1g68930 OS=Arabidopsis th... [more]
P0C8Q21.1e-5335.95Pentatricopeptide repeat-containing protein At4g19191, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0K1F77.7e-15689.29Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G041310 PE=4 SV=1[more]
A0A6J1I7X01.8e-15288.35pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like OS=Cuc... [more]
A0A6J1EHW32.3e-15288.60pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like OS=Cuc... [more]
A0A5A7UT424.8e-15086.36Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3C0K06.3e-15086.36pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT3G46790.11.2e-5738.13Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G11460.13.1e-5637.88Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G06140.13.4e-5535.14Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G68930.17.5e-5535.07pentatricopeptide (PPR) repeat-containing protein [more]
AT4G19191.17.5e-5535.95Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 281..304
e-value: 1.2
score: 9.5
coord: 248..273
e-value: 1.0E-4
score: 22.3
coord: 76..104
e-value: 0.046
score: 14.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 173..221
e-value: 1.8E-11
score: 44.1
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 176..210
e-value: 3.9E-7
score: 27.8
coord: 249..273
e-value: 2.6E-4
score: 18.9
coord: 148..175
e-value: 3.7E-4
score: 18.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 143..173
score: 8.549871
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 174..208
score: 11.509422
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 7..123
e-value: 1.6E-8
score: 36.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 237..308
e-value: 4.6E-11
score: 44.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 124..228
e-value: 2.3E-24
score: 87.7
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 15..205
NoneNo IPR availablePANTHERPTHR47924:SF18BNAC05G27170D PROTEINcoord: 15..205
coord: 92..305
NoneNo IPR availablePANTHERPTHR47924PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 92..305

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG01G011540.1ClCG01G011540.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding