Clc08G02180 (gene) Watermelon (cordophanus) v2

Overview
NameClc08G02180
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
LocationClcChr08: 4088375 .. 4090341 (+)
RNA-Seq ExpressionClc08G02180
SyntenyClc08G02180
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGTCCGTAGTTCCTATCTCTCTTTGTACAGTTAGGGATTGAGTCGTTTCTGATCCAATAGCAACTCCTTGATTTCCATTTCCTTCTGCAATAGGTTGAAGACTTGATTGAAGCAAGTCCGTTCCCTACGCCCAAATTCCCACTTTTGCCTACATTCCTCGCAGCTGTAAGTAGCATTTTACTCTTAGAATTTCCAAAACCTAATTTGTGAAAATCTGCTCATCAGTCTTAATCTTATGAAAATTTTCACTTCTCAGAATTCTTGGTATGGTTGTGATTTTGAGTACTAGGCAAATTTCAGCCTGGACTTCCTTTGCTTCATTAATTCCTATTATTCTTCTAAACTATGCCATTGAATTCTGTCTTTCCATTGGGATAGTTAAATCTGGAGGTTGCAGGAAATAAAAAATATTCTTCTAAAGCATGCTTGATTCCTCAGTAGCAATTTCTTATTTGTGCCTTGTTTACTGTACTATCTTTCCCTGCTCATTGTGAAGTCATAACAAGTTTCTATAATTATGCGGCAACACCTTCTTCGTCCCTGTAACTATAGGACTATTGAAACTGTTGCTGCTCATCTTGCCCCCAAAACGCAATTGCTTCACAACTCAATCTCCTCATCATCCCCCCTTTACCAACCGGACTTAAATGTTCACAACGAATCCAAAACTCTGATAACCAACGTAAATCATAAACAGTGTGAACATCAACCAGATTTCTCAATTGGGTCTCCATGTAGGGTCCAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAAGAAATTTTTGATTATGCTTGTCGTCAACCCCATTTTCGCCCATCGTCTTCCTCTCTCCCCATTCTTATCCTCAAGCTAGGGCGCTCCAAATACTTCTCTCTGATTGATGATCTTCTTCTTAGCTTCAAGTCTAGAGGCTACCCTGTCACTCCAACTGTCTTCTCCTACATAATCAAAATCTATAGTGAAGCTGATTTACCAGATAAAGCTCTCAAAGCCTTTTATACTATGATAGAGTTTGGGTGTACACCTTCGTCCAAACACTTGAACCGTATACTAGAAATTTTGGTTTCTCACCGTAACTTCATTCGACCAGCTTATGATCTTTTTAAGAATGCCCGTCATCATGGAGTGCTGCCCAACACCAAGTCTTACAACATCCTTATGCGTGCATTCTGTTGGAATGGAAATCTTAGCATTGCCTACACCTTGTTCAACAAAATGTTCGAACGAGATGTCGTTCCAGATGTCGAGTCATACCGCACATTAATGCAGGGCCTGTGCAGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTAGAAGATATGTTAAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGCTAAATAGTTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATGTTGCCCATTACAATACAGTTATAATGGGATTTTGCAGAGAAGGACGTGCCCGTGATGCTTGTAAGATTCTCGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCTTACCAGAGTTTAACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGATTATGTTGAAGAGATGACATTGAAGGGTTTTTGCCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCTGTAACGTTGGCAGAATCGACGAATCGTGTAGTATTCTTGAAGATATGCTAAAGCATGGGAAAGCCCCTCATTCTGATACTTGGGAGATTATTATATGTGGAATTTGTGATGTCGAGGACACTGTTAAATTATGTGAAATTTTAGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCTCTGGTTTGGGTGAGTATTTAATTAGGAAGTTGCAATCTTCAAAATCACGAAGGGTTTAA

mRNA sequence

ATGAGGTCCGTTGAAGACTTGATTGAAGCAAGTCCGTTCCCTACGCCCAAATTCCCACTTTTGCCTACATTCCTCGCAGCTGTAATTTCTATAATTATGCGGCAACACCTTCTTCGTCCCTGTAACTATAGGACTATTGAAACTGTTGCTGCTCATCTTGCCCCCAAAACGCAATTGCTTCACAACTCAATCTCCTCATCATCCCCCCTTTACCAACCGGACTTAAATGTTCACAACGAATCCAAAACTCTGATAACCAACGTAAATCATAAACAGTGTGAACATCAACCAGATTTCTCAATTGGGTCTCCATGTAGGGTCCAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAAGAAATTTTTGATTATGCTTGTCGTCAACCCCATTTTCGCCCATCGTCTTCCTCTCTCCCCATTCTTATCCTCAAGCTAGGGCGCTCCAAATACTTCTCTCTGATTGATGATCTTCTTCTTAGCTTCAAGTCTAGAGGCTACCCTGTCACTCCAACTGTCTTCTCCTACATAATCAAAATCTATAGTGAAGCTGATTTACCAGATAAAGCTCTCAAAGCCTTTTATACTATGATAGAGTTTGGGTGTACACCTTCGTCCAAACACTTGAACCGTATACTAGAAATTTTGGTTTCTCACCGTAACTTCATTCGACCAGCTTATGATCTTTTTAAGAATGCCCGTCATCATGGAGTGCTGCCCAACACCAAGTCTTACAACATCCTTATGCGTGCATTCTGTTGGAATGGAAATCTTAGCATTGCCTACACCTTGTTCAACAAAATGTTCGAACGAGATGTCGTTCCAGATGTCGAGTCATACCGCACATTAATGCAGGGCCTGTGCAGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTAGAAGATATGTTAAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGCTAAATAGTTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATGTTGCCCATTACAATACAGTTATAATGGGATTTTGCAGAGAAGGACGTGCCCGTGATGCTTGTAAGATTCTCGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCTTACCAGAGTTTAACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGATTATGTTGAAGAGATGACATTGAAGGGTTTTTGCCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCTGTAACGTTGGCAGAATCGACGAATCGTGTAGTATTCTTGAAGATATGCTAAAGCATGGGAAAGCCCCTCATTCTGATACTTGGGAGATTATTATATGTGGAATTTGTGATGTCGAGGACACTGTTAAATTATGTGAAATTTTAGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCTCTGGTTTGGGTGAGTATTTAATTAGGAAGTTGCAATCTTCAAAATCACGAAGGGTTTAA

Coding sequence (CDS)

ATGAGGTCCGTTGAAGACTTGATTGAAGCAAGTCCGTTCCCTACGCCCAAATTCCCACTTTTGCCTACATTCCTCGCAGCTGTAATTTCTATAATTATGCGGCAACACCTTCTTCGTCCCTGTAACTATAGGACTATTGAAACTGTTGCTGCTCATCTTGCCCCCAAAACGCAATTGCTTCACAACTCAATCTCCTCATCATCCCCCCTTTACCAACCGGACTTAAATGTTCACAACGAATCCAAAACTCTGATAACCAACGTAAATCATAAACAGTGTGAACATCAACCAGATTTCTCAATTGGGTCTCCATGTAGGGTCCAGAAACTCATTGCATCCCAATCTGATCCTCTTCTTGCCAAAGAAATTTTTGATTATGCTTGTCGTCAACCCCATTTTCGCCCATCGTCTTCCTCTCTCCCCATTCTTATCCTCAAGCTAGGGCGCTCCAAATACTTCTCTCTGATTGATGATCTTCTTCTTAGCTTCAAGTCTAGAGGCTACCCTGTCACTCCAACTGTCTTCTCCTACATAATCAAAATCTATAGTGAAGCTGATTTACCAGATAAAGCTCTCAAAGCCTTTTATACTATGATAGAGTTTGGGTGTACACCTTCGTCCAAACACTTGAACCGTATACTAGAAATTTTGGTTTCTCACCGTAACTTCATTCGACCAGCTTATGATCTTTTTAAGAATGCCCGTCATCATGGAGTGCTGCCCAACACCAAGTCTTACAACATCCTTATGCGTGCATTCTGTTGGAATGGAAATCTTAGCATTGCCTACACCTTGTTCAACAAAATGTTCGAACGAGATGTCGTTCCAGATGTCGAGTCATACCGCACATTAATGCAGGGCCTGTGCAGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTAGAAGATATGTTAAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGCTAAATAGTTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATGTTGCCCATTACAATACAGTTATAATGGGATTTTGCAGAGAAGGACGTGCCCGTGATGCTTGTAAGATTCTCGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCTTACCAGAGTTTAACTAATGGATTATGTGATCAAGGAATGTTTGAATTGGCAAAGGATTATGTTGAAGAGATGACATTGAAGGGTTTTTGCCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCTGTAACGTTGGCAGAATCGACGAATCGTGTAGTATTCTTGAAGATATGCTAAAGCATGGGAAAGCCCCTCATTCTGATACTTGGGAGATTATTATATGTGGAATTTGTGATGTCGAGGACACTGTTAAATTATGTGAAATTTTAGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCTCTGGTTTGGGTGAGTATTTAATTAGGAAGTTGCAATCTTCAAAATCACGAAGGGTTTAA

Protein sequence

MRSVEDLIEASPFPTPKFPLLPTFLAAVISIIMRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTLITNVNHKQCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSRRV
Homology
BLAST of Clc08G02180 vs. NCBI nr
Match: XP_038886671.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida] >XP_038886672.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida] >XP_038886673.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida])

HSP 1 Score: 917.9 bits (2371), Expect = 3.8e-263
Identity = 448/482 (92.95%), Postives = 462/482 (95.85%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTLIT-NVNHK 92
           MR+HLLRPCNY TIET+AAH+ PKT LLH SISSSS LYQ DLNVH+ESKTLIT N+NHK
Sbjct: 1   MRRHLLRPCNYNTIETIAAHVVPKTPLLHKSISSSSSLYQRDLNVHDESKTLITININHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
           QC  QP FSIGSPCRVQKLIASQSDPLLAKEIF YACRQPHFRPSSSSLPILILKLGRSK
Sbjct: 61  QCGDQPHFSIGSPCRVQKLIASQSDPLLAKEIFYYACRQPHFRPSSSSLPILILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLIDDLLLSFKSRGYPVTPTVFSY+IKIY EADLPDKALKAFYTMIEFGCTPSSK LN
Sbjct: 121 YFSLIDDLLLSFKSRGYPVTPTVFSYMIKIYGEADLPDKALKAFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDV+PDVESYR LMQGLCRKNQVNGAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKLRE
Sbjct: 241 RDVIPDVESYRILMQGLCRKNQVNGAVDLLEDMLNKGYIPDSLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNT I+GFCREGRA DACKILEDMQSNGCLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTAIIGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEMTLKGFCPHFS+IHALVKGF NVGRIDESCSILEDML HGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSIIHALVKGFRNVGRIDESCSILEDMLNHGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HSDTWEIII GIC+VEDTVKLCEIL KILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSDTWEIIISGICEVEDTVKLCEILGKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

Query: 513 RV 514
           RV
Sbjct: 481 RV 482

BLAST of Clc08G02180 vs. NCBI nr
Match: KAG6592265.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 910.6 bits (2352), Expect = 6.1e-261
Identity = 440/481 (91.48%), Postives = 460/481 (95.63%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTL-ITNVNHK 92
           MRQHLLRPCNY+T+ETVA HLAPKT LLHNSISSSS LYQPDLNVHNE KTL  TN+NHK
Sbjct: 1   MRQHLLRPCNYKTLETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLSATNINHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
             E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEQQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLIDDLLLSFKSRGYP++PTVFSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 121 YFSLIDDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNILMRAFCWNG+LSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDV+PDVESYR LMQGLCRKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RDVIPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNTVI GFCREGRA DACKILEDMQSNGCLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNGCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEM+LKGFCPHFSVIH LVKGF NVGRID+SCS+LED+LKHGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMSLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDILKHGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HS+TWEI++ GIC+VEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSETWEIVLSGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

BLAST of Clc08G02180 vs. NCBI nr
Match: XP_022932826.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita moschata] >XP_022932827.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 909.1 bits (2348), Expect = 1.8e-260
Identity = 441/481 (91.68%), Postives = 458/481 (95.22%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTL-ITNVNHK 92
           MRQHLLRPCNY+T+ETVA HLAPKT LLHNSISSSS LYQPDLNVHNE KTL  TN+NHK
Sbjct: 1   MRQHLLRPCNYKTLETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLSATNINHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
             E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEQQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLIDDLLLSFKSRGYP++PTVFSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 121 YFSLIDDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNILMR FCWNG+LSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRVFCWNGDLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDVVPDVESYR LMQGLCRKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RDVVPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNTVI GFCREGRA DACKILEDMQSN CLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNRCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEMTLKGFCPHFSVIH LVKGF NVGRID+SCS+LEDMLKHGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HS+TWE+II G+C+VEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSETWEMIISGVCEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

BLAST of Clc08G02180 vs. NCBI nr
Match: XP_023515125.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 908.7 bits (2347), Expect = 2.3e-260
Identity = 443/481 (92.10%), Postives = 458/481 (95.22%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTL-ITNVNHK 92
           MRQHLLRPCNY+TIETVA HLAPKT LLHNSISSSS LYQPDLNVHNE KTL  TN+NHK
Sbjct: 1   MRQHLLRPCNYKTIETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLNATNINHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
           + E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  RLEEQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLIDDLLLSFKSRGYP +PTVFSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 121 YFSLIDDLLLSFKSRGYPFSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNILMRAFCWNG+LSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDVVPDVESYR LMQGLCRKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RDVVPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNTVI GFCREGRA DACKILEDMQSN CLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNRCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEMTLKGFCPHFSVIH LVKGF NVGRID+SCS+LEDMLK GKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKDGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HS+TWEI+I GIC+VEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSETWEIVISGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

BLAST of Clc08G02180 vs. NCBI nr
Match: XP_022974029.1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita maxima] >XP_022974030.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 907.1 bits (2343), Expect = 6.7e-260
Identity = 442/481 (91.89%), Postives = 458/481 (95.22%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTL-ITNVNHK 92
           MRQHLLRPCNY+TIETVA HLAPKT LLHNSISSSS LYQPDLNVHNE KTL  TN+NHK
Sbjct: 1   MRQHLLRPCNYKTIETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLNDTNINHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
             E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEEQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLI+DLLLSFKSRGYP++PTVFSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 121 YFSLINDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHR+FIRPA+DLFKNARHHGVLPNTKSYNILMRAFCWNG+LSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRDFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDV+PDVESYR LMQGLCRKNQV GAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RDVIPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNTVI GFCREGRA DACKILEDMQ NGCLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQLNGCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEMTL GFCPHFSVIH LVKGF NVGRID+SCS+LEDMLKHGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLNGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HS+TWEIII GIC+VEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSETWEIIISGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

BLAST of Clc08G02180 vs. ExPASy Swiss-Prot
Match: Q8LDU5 (Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01400 PE=2 SV=2)

HSP 1 Score: 566.6 bits (1459), Expect = 2.8e-160
Identity = 269/450 (59.78%), Postives = 348/450 (77.33%), Query Frame = 0

Query: 60  LHNSISSSSPLYQPDLNVHNESKTLITNVNHKQCEHQPDFSIGSPCRVQKLIASQSDPLL 119
           L + +S+SS       + H   K +++N         P   IGSP RVQKLIASQSDPLL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVSN---------PKSPIGSPTRVQKLIASQSDPLL 75

Query: 120 AKEIFDYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYII 179
           AKEIFDYA +QP+FR S SS  ILILKLGR +YF+LIDD+L   +S GYP+T  +F+Y+I
Sbjct: 76  AKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLI 135

Query: 180 KIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSHRNFIRPAYDLFKNARHHGV 239
           K+Y+EA LP+K L  FY M+EF  TP  KHLNRIL++LVSHR +++ A++LFK++R HGV
Sbjct: 136 KVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGV 195

Query: 240 LPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESYRTLMQGLCRKNQVNGAVD 299
           +PNT+SYN+LM+AFC N +LSIAY LF KM ERDVVPDV+SY+ L+QG CRK QVNGA++
Sbjct: 196 MPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAME 255

Query: 300 LLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTVIMGFC 359
           LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT+I+GFC
Sbjct: 256 LLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFC 315

Query: 360 REGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFS 419
           RE RA DA K+L+DM SNGC PN VSY++L  GLCDQGMF+  K Y+EEM  KGF PHFS
Sbjct: 316 REDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFS 375

Query: 420 VIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEIIICGICDVEDTVKLCEILEKI 479
           V + LVKGFC+ G+++E+C ++E ++K+G+  HSDTWE++I  IC+ +++ K+   LE  
Sbjct: 376 VSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDA 435

Query: 480 LKKDVRRDTRIVEAGSGLGEYLIRKLQSSK 510
           +K+++  DTRIV+ G GLG YL  KLQ  +
Sbjct: 436 VKEEITGDTRIVDVGIGLGSYLSSKLQMKR 456

BLAST of Clc08G02180 vs. ExPASy Swiss-Prot
Match: Q9FNL2 (Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX=3702 GN=At5g46100 PE=2 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 2.3e-53
Identity = 114/372 (30.65%), Postives = 196/372 (52.69%), Query Frame = 0

Query: 103 SPCRVQKLIASQSDPLLAKEIFDYACRQ--PHFRPSSSSLPILILKLGRSKYFSLIDDLL 162
           +P +V KL+ ++ D   +  +FD A  +    +    SS   ++L+L  +  F   +DL+
Sbjct: 15  TPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLRLVSANKFKAAEDLI 74

Query: 163 LSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSH 222
           +  K     V+  +   I + Y     P  +L+ F+ M +F C PS K    +L ILV  
Sbjct: 75  VRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPSQKAYVTVLAILV-E 134

Query: 223 RNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWN-GNLSIAYTLFNKMFERDVVPDVE 282
            N +  A+  +KN R  G+ P   S N+L++A C N G +     +F +M +R   PD  
Sbjct: 135 ENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIFLEMPKRGCDPDSY 194

Query: 283 SYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRM 342
           +Y TL+ GLCR  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  M
Sbjct: 195 TYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGSKNVDEAMRYLEEM 254

Query: 343 KVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMF 402
           K KG  P+V  Y++++ G C++GR+  A ++ E M + GC PN+V+Y +L  GLC +   
Sbjct: 255 KSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTYTTLITGLCKEQKI 314

Query: 403 ELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEI- 462
           + A + ++ M L+G  P   +   ++ GFC + +  E+ + L++M+  G  P+  TW I 
Sbjct: 315 QEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMILGGITPNRLTWNIH 374

Query: 463 ------IICGIC 465
                 ++ G+C
Sbjct: 375 VKTSNEVVRGLC 385

BLAST of Clc08G02180 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 201.8 bits (512), Expect = 1.8e-50
Identity = 105/318 (33.02%), Postives = 174/318 (54.72%), Query Frame = 0

Query: 171 TPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSHRNFIRPAYDL 230
           T +VF  ++K YS   L DKAL   +     G  P     N +L+  +  +  I  A ++
Sbjct: 133 TSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENV 192

Query: 231 FKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESYRTLMQGLCR 290
           FK      V PN  +YNIL+R FC+ GN+ +A TLF+KM  +  +P+V +Y TL+ G C+
Sbjct: 193 FKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCK 252

Query: 291 KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAH 350
             +++    LL  M  KG  P+ +SY  ++N LCR+ +++E   +L  M  +G + D   
Sbjct: 253 LRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVT 312

Query: 351 YNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMT 410
           YNT+I G+C+EG    A  +  +M  +G  P++++Y SL + +C  G    A +++++M 
Sbjct: 313 YNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMR 372

Query: 411 LKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEIIICGIC---DVE 470
           ++G CP+      LV GF   G ++E+  +L +M  +G +P   T+  +I G C    +E
Sbjct: 373 VRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKME 432

Query: 471 DTVKLCE-ILEKILKKDV 485
           D + + E + EK L  DV
Sbjct: 433 DAIAVLEDMKEKGLSPDV 450

BLAST of Clc08G02180 vs. ExPASy Swiss-Prot
Match: Q9M302 (Pentatricopeptide repeat-containing protein At3g48810 OS=Arabidopsis thaliana OX=3702 GN=At3g48810 PE=2 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 1.7e-48
Identity = 121/445 (27.19%), Postives = 196/445 (44.04%), Query Frame = 0

Query: 87  NVNHKQCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILK 146
           NVNH   E  P+ +      V K +  +S   LA   F        F+ +  +  ++I K
Sbjct: 27  NVNHLLTE-SPNHAEIKELDVVKRLRQESCVPLALHFFKSIANSNLFKHTPLTFEVMIRK 86

Query: 147 LGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPS 206
           L        +  LL   K +G+  +  +F  +I +Y +  L ++A++ FY + EFGC PS
Sbjct: 87  LAMDGQVDSVQYLLQQMKLQGFHCSEDLFISVISVYRQVGLAERAVEMFYRIKEFGCDPS 146

Query: 207 SKHLNRILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLF 266
            K  N +L+ L+   N I+  Y ++++ +  G  PN  +YN+L++A C N  +  A  L 
Sbjct: 147 VKIYNHVLDTLLG-ENRIQMIYMVYRDMKRDGFEPNVFTYNVLLKALCKNNKVDGAKKLL 206

Query: 267 NKMFERDVVPDVESYRT------------------------------LMQGLCRKNQVNG 326
            +M  +   PD  SY T                              L+ GLC+++   G
Sbjct: 207 VEMSNKGCCPDAVSYTTVISSMCEVGLVKEGRELAERFEPVVSVYNALINGLCKEHDYKG 266

Query: 327 AVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCN----------- 386
           A +L+ +M+ KG  P+ +SY+TL+N LC   ++  A+  L +M  +GC+           
Sbjct: 267 AFELMREMVEKGISPNVISYSTLINVLCNSGQIELAFSFLTQMLKRGCHPNIYTLSSLVK 326

Query: 387 -------------------------PDVAHYNTVIMGFCREGRARDACKILEDMQSNGCL 446
                                    P+V  YNT++ GFC  G    A  +   M+  GC 
Sbjct: 327 GCFLRGTTFDALDLWNQMIRGFGLQPNVVAYNTLVQGFCSHGNIVKAVSVFSHMEEIGCS 386

Query: 447 PNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSI 466
           PN+ +Y SL NG   +G  + A     +M   G CP+  V   +V+  C   +  E+ S+
Sbjct: 387 PNIRTYGSLINGFAKRGSLDGAVYIWNKMLTSGCCPNVVVYTNMVEALCRHSKFKEAESL 446

BLAST of Clc08G02180 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 186.8 bits (473), Expect = 6.1e-46
Identity = 119/418 (28.47%), Postives = 202/418 (48.33%), Query Frame = 0

Query: 103 SPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLS 162
           +P ++ KL+    +   + E+F +   Q  +R S     +LI KLG +  F  ID LL+ 
Sbjct: 77  TPFQLYKLLELPLNVSTSMELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQ 136

Query: 163 FKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIE-FGCTPSSKHLNRILEILVSHR 222
            K  G     ++F  I++ Y +A  P +  +    M   + C P+ K  N +LEILVS  
Sbjct: 137 MKDEGIVFKESLFISIMRDYDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVS-G 196

Query: 223 NFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESY 282
           N  + A ++F +     + P   ++ ++M+AFC    +  A +L   M +   VP+   Y
Sbjct: 197 NCHKVAANVFYDMLSRKIPPTLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIY 256

Query: 283 RTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKV 342
           +TL+  L + N+VN A+ LLE+M   G +PD  ++  ++  LC+  ++ EA K++ RM +
Sbjct: 257 QTLIHSLSKCNRVNEALQLLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLI 316

Query: 343 KGCNPD-------------------------------VAHYNTVIMGFCREGRARDACKI 402
           +G  PD                               +  +NT+I GF   GR  DA  +
Sbjct: 317 RGFAPDDITYGYLMNGLCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAV 376

Query: 403 LEDM-QSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFC 462
           L DM  S G +P++ +Y SL  G   +G+  LA + + +M  KG  P+      LV GFC
Sbjct: 377 LSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFC 436

Query: 463 NVGRIDESCSILEDMLKHGKAPHSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRD 488
            +G+IDE+ ++L +M   G  P++  +  +I   C      +  EI  ++ +K  + D
Sbjct: 437 KLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPD 493

BLAST of Clc08G02180 vs. ExPASy TrEMBL
Match: A0A6J1EXV3 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111439286 PE=4 SV=1)

HSP 1 Score: 909.1 bits (2348), Expect = 8.6e-261
Identity = 441/481 (91.68%), Postives = 458/481 (95.22%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTL-ITNVNHK 92
           MRQHLLRPCNY+T+ETVA HLAPKT LLHNSISSSS LYQPDLNVHNE KTL  TN+NHK
Sbjct: 1   MRQHLLRPCNYKTLETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLSATNINHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
             E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEQQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLIDDLLLSFKSRGYP++PTVFSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 121 YFSLIDDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNILMR FCWNG+LSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILMRVFCWNGDLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDVVPDVESYR LMQGLCRKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RDVVPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYVPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNTVI GFCREGRA DACKILEDMQSN CLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQSNRCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEMTLKGFCPHFSVIH LVKGF NVGRID+SCS+LEDMLKHGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HS+TWE+II G+C+VEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSETWEMIISGVCEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

BLAST of Clc08G02180 vs. ExPASy TrEMBL
Match: A0A6J1ICW5 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucurbita maxima OX=3661 GN=LOC111472648 PE=4 SV=1)

HSP 1 Score: 907.1 bits (2343), Expect = 3.3e-260
Identity = 442/481 (91.89%), Postives = 458/481 (95.22%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTL-ITNVNHK 92
           MRQHLLRPCNY+TIETVA HLAPKT LLHNSISSSS LYQPDLNVHNE KTL  TN+NHK
Sbjct: 1   MRQHLLRPCNYKTIETVAVHLAPKTPLLHNSISSSSSLYQPDLNVHNELKTLNDTNINHK 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
             E QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSS  +LILKLGRSK
Sbjct: 61  HLEEQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSFLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLI+DLLLSFKSRGYP++PTVFSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 121 YFSLINDLLLSFKSRGYPLSPTVFSYIIKIYGEADLPDKALKTFYTMIEFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHR+FIRPA+DLFKNARHHGVLPNTKSYNILMRAFCWNG+LSIAYTLFNKMF+
Sbjct: 181 RILEILVSHRDFIRPAFDLFKNARHHGVLPNTKSYNILMRAFCWNGDLSIAYTLFNKMFK 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           RDV+PDVESYR LMQGLCRKNQV GAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RDVIPDVESYRILMQGLCRKNQVIGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPDVAHYNTVI GFCREGRA DACKILEDMQ NGCLPNLVSYQSLTN
Sbjct: 301 AYKLLCRMKVKGCNPDVAHYNTVITGFCREGRALDACKILEDMQLNGCLPNLVSYQSLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAKDYVEEMTL GFCPHFSVIH LVKGF NVGRID+SCS+LEDMLKHGKAP
Sbjct: 361 GLCDQGMFELAKDYVEEMTLNGFCPHFSVIHTLVKGFINVGRIDDSCSVLEDMLKHGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HS+TWEIII GIC+VEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQ+SKSR
Sbjct: 421 HSETWEIIISGICEVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQASKSR 480

BLAST of Clc08G02180 vs. ExPASy TrEMBL
Match: A0A5A7SWW3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold155G00670 PE=4 SV=1)

HSP 1 Score: 905.2 bits (2338), Expect = 1.2e-259
Identity = 440/482 (91.29%), Postives = 457/482 (94.81%), Query Frame = 0

Query: 32  IMRQHLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTLITNVNHK 91
           IM  HLLRP NYRTIETVAAH+A    LLHN ISSSS LYQP LNVHNESKTLITN+NHK
Sbjct: 82  IMWLHLLRPGNYRTIETVAAHVARNAPLLHNLISSSSSLYQPHLNVHNESKTLITNINHK 141

Query: 92  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 151
           QCE QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSL +LILKLGRSK
Sbjct: 142 QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 201

Query: 152 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 211
           YFSLIDDLLLSFKSRGYPVTPT FSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LN
Sbjct: 202 YFSLIDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLN 261

Query: 212 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 271
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAY LFNKMFE
Sbjct: 262 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFE 321

Query: 272 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 331
            DV+PDVE+YRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+E
Sbjct: 322 GDVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKE 381

Query: 332 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 391
           AYKLLCRMKVKGCNPD+AHYNTVIMGFCREGRA DACKILEDMQSNGCLPNLVSY+SLTN
Sbjct: 382 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 441

Query: 392 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 451
           GLCDQGMFELAK YVEEMTLKGF PHFSVIHALVKGF NVGR+DESCS+LE MLKHGKAP
Sbjct: 442 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAP 501

Query: 452 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 511
           HSDTWEIII GIC+VEDTVK CEILEKILKKDVRRDTRIVEAG+GLGEYLIRKLQ+SKSR
Sbjct: 502 HSDTWEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSR 561

Query: 512 RV 514
           R+
Sbjct: 562 RI 563

BLAST of Clc08G02180 vs. ExPASy TrEMBL
Match: A0A1S3CIG1 (pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501324 PE=4 SV=1)

HSP 1 Score: 903.7 bits (2334), Expect = 3.6e-259
Identity = 438/478 (91.63%), Postives = 455/478 (95.19%), Query Frame = 0

Query: 36  HLLRPCNYRTIETVAAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTLITNVNHKQCEH 95
           HLLRP NYRTIETVAAH+A    LLHN ISSSS LYQP LNVHNESKTLITN+NHKQCE 
Sbjct: 4   HLLRPGNYRTIETVAAHVARNAPLLHNLISSSSSLYQPHLNVHNESKTLITNINHKQCED 63

Query: 96  QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSKYFSL 155
           QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSL +LILKLGRSKYFSL
Sbjct: 64  QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSKYFSL 123

Query: 156 IDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILE 215
           IDDLLLSFKSRGYPVTPT FSYIIKIY EADLPDKALK FYTMIEFGCTPSSK LNRILE
Sbjct: 124 IDDLLLSFKSRGYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIEFGCTPSSKQLNRILE 183

Query: 216 ILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVV 275
           ILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAY LFNKMFE DV+
Sbjct: 184 ILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYKLFNKMFEGDVI 243

Query: 276 PDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKL 335
           PDVE+YRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+EAYKL
Sbjct: 244 PDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLKEAYKL 303

Query: 336 LCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCD 395
           LCRMKVKGCNPD+AHYNTVIMGFCREGRA DACKILEDMQSNGCLPNLVSY+SLTNGLCD
Sbjct: 304 LCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCD 363

Query: 396 QGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDT 455
           QGMFELAK YVEEMTLKGF PHFSVIHALVKGF NVGR+DESCS+LE MLKHGKAPHSDT
Sbjct: 364 QGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHNVGRMDESCSVLEGMLKHGKAPHSDT 423

Query: 456 WEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSRRV 514
           WEIII GIC+VEDTVK CEILEKILKKDVRRDTRIVEAG+GLGEYLIRKLQ+SKSRR+
Sbjct: 424 WEIIISGICEVEDTVKFCEILEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASKSRRI 481

BLAST of Clc08G02180 vs. ExPASy TrEMBL
Match: A0A0A0K8U0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014780 PE=4 SV=1)

HSP 1 Score: 887.9 bits (2293), Expect = 2.0e-254
Identity = 433/482 (89.83%), Postives = 454/482 (94.19%), Query Frame = 0

Query: 33  MRQHLLRPCNYRTIETV-AAHLAPKTQLLHNSISSSSPLYQPDLNVHNESKTLITNVNHK 92
           M QHLLRPCNYRTIETV AAH+A K+ LL N ISSSS LYQP LNVHNESK LITNV H+
Sbjct: 1   MWQHLLRPCNYRTIETVAAAHVARKSPLLRNLISSSSSLYQPHLNVHNESKFLITNVKHE 60

Query: 93  QCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILKLGRSK 152
           QCE QPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSL +LILKLGRSK
Sbjct: 61  QCEDQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLLVLILKLGRSK 120

Query: 153 YFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLN 212
           YFSLIDDLLLSFKSR YPVTPT FSYIIKIY EADLPDKALK FYTMI+FGCTPSSK LN
Sbjct: 121 YFSLIDDLLLSFKSRRYPVTPTAFSYIIKIYGEADLPDKALKVFYTMIDFGCTPSSKQLN 180

Query: 213 RILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFE 272
           RILEILVSHRNFIRPA+DLFKNARHHGVLPNTKSYNIL+RAFCWNGN+SIAYTLFNKMFE
Sbjct: 181 RILEILVSHRNFIRPAFDLFKNARHHGVLPNTKSYNILIRAFCWNGNISIAYTLFNKMFE 240

Query: 273 RDVVPDVESYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 332
           R+V+PDVE+YRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE
Sbjct: 241 RNVIPDVETYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLRE 300

Query: 333 AYKLLCRMKVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTN 392
           AYKLLCRMKVKGCNPD+AHYNTVIMGFCREGRA DACKILEDMQSNGCLPNLVSY+SLTN
Sbjct: 301 AYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTN 360

Query: 393 GLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAP 452
           GLCDQGMFELAK YVEEMTLKGF PHFSVIHALVKGF ++GRI ESCS+LEDMLK GKAP
Sbjct: 361 GLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAP 420

Query: 453 HSDTWEIIICGICDVEDTVKLCEILEKILKKDVRRDTRIVEAGSGLGEYLIRKLQSSKSR 512
           HSDTWEIII GIC+VEDT K CE+ EKILKKDVRRDTRIVEAG+GLGEYLIRKLQ+S SR
Sbjct: 421 HSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKLQASISR 480

Query: 513 RV 514
           R+
Sbjct: 481 RI 482

BLAST of Clc08G02180 vs. TAIR 10
Match: AT4G01400.3 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT5G46100.1); Has 40053 Blast hits to 12380 proteins in 263 species: Archae - 4; Bacteria - 27; Metazoa - 366; Fungi - 374; Plants - 38347; Viruses - 0; Other Eukaryotes - 935 (source: NCBI BLink). )

HSP 1 Score: 566.6 bits (1459), Expect = 2.0e-161
Identity = 269/450 (59.78%), Postives = 348/450 (77.33%), Query Frame = 0

Query: 60  LHNSISSSSPLYQPDLNVHNESKTLITNVNHKQCEHQPDFSIGSPCRVQKLIASQSDPLL 119
           L + +S+SS       + H   K +++N         P   IGSP RVQKLIASQSDPLL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVSN---------PKSPIGSPTRVQKLIASQSDPLL 75

Query: 120 AKEIFDYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYII 179
           AKEIFDYA +QP+FR S SS  ILILKLGR +YF+LIDD+L   +S GYP+T  +F+Y+I
Sbjct: 76  AKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLI 135

Query: 180 KIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSHRNFIRPAYDLFKNARHHGV 239
           K+Y+EA LP+K L  FY M+EF  TP  KHLNRIL++LVSHR +++ A++LFK++R HGV
Sbjct: 136 KVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGV 195

Query: 240 LPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESYRTLMQGLCRKNQVNGAVD 299
           +PNT+SYN+LM+AFC N +LSIAY LF KM ERDVVPDV+SY+ L+QG CRK QVNGA++
Sbjct: 196 MPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAME 255

Query: 300 LLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTVIMGFC 359
           LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT+I+GFC
Sbjct: 256 LLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKLLCRMKLKGCNPDLVHYNTMILGFC 315

Query: 360 REGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFS 419
           RE RA DA K+L+DM SNGC PN VSY++L  GLCDQGMF+  K Y+EEM  KGF PHFS
Sbjct: 316 REDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFS 375

Query: 420 VIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEIIICGICDVEDTVKLCEILEKI 479
           V + LVKGFC+ G+++E+C ++E ++K+G+  HSDTWE++I  IC+ +++ K+   LE  
Sbjct: 376 VSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDA 435

Query: 480 LKKDVRRDTRIVEAGSGLGEYLIRKLQSSK 510
           +K+++  DTRIV+ G GLG YL  KLQ  +
Sbjct: 436 VKEEITGDTRIVDVGIGLGSYLSSKLQMKR 456

BLAST of Clc08G02180 vs. TAIR 10
Match: AT4G01400.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: COG4 transport (InterPro:IPR013167), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT5G46100.1); Has 26268 Blast hits to 8959 proteins in 289 species: Archae - 0; Bacteria - 3; Metazoa - 247; Fungi - 222; Plants - 25350; Viruses - 0; Other Eukaryotes - 446 (source: NCBI BLink). )

HSP 1 Score: 397.9 bits (1021), Expect = 1.2e-110
Identity = 204/435 (46.90%), Postives = 275/435 (63.22%), Query Frame = 0

Query: 60  LHNSISSSSPLYQPDLNVHNESKTLITNVNHKQCEHQPDFSIGSPCRVQKLIASQSDPLL 119
           L + +S+SS       + H   K +++N         P   IGSP RVQKLIASQSDPLL
Sbjct: 16  LTSPLSTSSRFLFYSSSEHEARKPIVSN---------PKSPIGSPTRVQKLIASQSDPLL 75

Query: 120 AKEIFDYACRQPHFRPSSSSLPILILKLGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYII 179
           AKEIFDYA +QP+FR S SS  ILILKLGR +YF+LIDD+L   +S GYP+T  +F+Y+I
Sbjct: 76  AKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYPLTGEIFTYLI 135

Query: 180 KIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSHRNFIRPAYDLFKNARHHGV 239
           K+Y+EA LP+K L  FY M+EF  TP  KHLNRIL++LVSHR +++ A++LFK++R HGV
Sbjct: 136 KVYAEAKLPEKVLSTFYKMLEFNFTPQPKHLNRILDVLVSHRGYLQKAFELFKSSRLHGV 195

Query: 240 LPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESYRTLMQGLCRKNQVNGAVD 299
           +PNT+SYN+LM+AFC N +LSIAY LF KM ERDVVPDV+SY+ L+QG CRK QVNGA++
Sbjct: 196 MPNTRSYNLLMQAFCLNDDLSIAYQLFGKMLERDVVPDVDSYKILIQGFCRKGQVNGAME 255

Query: 300 LLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAHYNTVIMGFC 359
           LL+DMLNKG++PD                                               
Sbjct: 256 LLDDMLNKGFVPD----------------------------------------------- 315

Query: 360 REGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFS 419
                                      ++L  GLCDQGMF+  K Y+EEM  KGF PHFS
Sbjct: 316 ---------------------------RTLIGGLCDQGMFDEGKKYLEEMISKGFSPHFS 367

Query: 420 VIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEIIICGICDVEDTVKLCEILEKI 479
           V + LVKGFC+ G+++E+C ++E ++K+G+  HSDTWE++I  IC+ +++ K+   LE  
Sbjct: 376 VSNCLVKGFCSFGKVEEACDVVEVVMKNGETLHSDTWEMVIPLICNEDESEKIKLFLEDA 367

Query: 480 LKKDVRRDTRIVEAG 495
           +K+++  DTRIV+ G
Sbjct: 436 VKEEITGDTRIVDVG 367

BLAST of Clc08G02180 vs. TAIR 10
Match: AT5G46100.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 211.5 bits (537), Expect = 1.6e-54
Identity = 114/372 (30.65%), Postives = 196/372 (52.69%), Query Frame = 0

Query: 103 SPCRVQKLIASQSDPLLAKEIFDYACRQ--PHFRPSSSSLPILILKLGRSKYFSLIDDLL 162
           +P +V KL+ ++ D   +  +FD A  +    +    SS   ++L+L  +  F   +DL+
Sbjct: 15  TPSQVIKLMRAEKDVEKSMAVFDSATAEYANGYVHDQSSFGYMVLRLVSANKFKAAEDLI 74

Query: 163 LSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSH 222
           +  K     V+  +   I + Y     P  +L+ F+ M +F C PS K    +L ILV  
Sbjct: 75  VRMKIENCVVSEDILLSICRGYGRVHRPFDSLRVFHKMKDFDCDPSQKAYVTVLAILV-E 134

Query: 223 RNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWN-GNLSIAYTLFNKMFERDVVPDVE 282
            N +  A+  +KN R  G+ P   S N+L++A C N G +     +F +M +R   PD  
Sbjct: 135 ENQLNLAFKFYKNMREIGLPPTVASLNVLIKALCRNDGTVDAGLKIFLEMPKRGCDPDSY 194

Query: 283 SYRTLMQGLCRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRM 342
           +Y TL+ GLCR  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  M
Sbjct: 195 TYGTLISGLCRFGRIDEAKKLFTEMVEKDCAPTVVTYTSLINGLCGSKNVDEAMRYLEEM 254

Query: 343 KVKGCNPDVAHYNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMF 402
           K KG  P+V  Y++++ G C++GR+  A ++ E M + GC PN+V+Y +L  GLC +   
Sbjct: 255 KSKGIEPNVFTYSSLMDGLCKDGRSLQAMELFEMMMARGCRPNMVTYTTLITGLCKEQKI 314

Query: 403 ELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEI- 462
           + A + ++ M L+G  P   +   ++ GFC + +  E+ + L++M+  G  P+  TW I 
Sbjct: 315 QEAVELLDRMNLQGLKPDAGLYGKVISGFCAISKFREAANFLDEMILGGITPNRLTWNIH 374

Query: 463 ------IICGIC 465
                 ++ G+C
Sbjct: 375 VKTSNEVVRGLC 385

BLAST of Clc08G02180 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 201.8 bits (512), Expect = 1.3e-51
Identity = 105/318 (33.02%), Postives = 174/318 (54.72%), Query Frame = 0

Query: 171 TPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPSSKHLNRILEILVSHRNFIRPAYDL 230
           T +VF  ++K YS   L DKAL   +     G  P     N +L+  +  +  I  A ++
Sbjct: 133 TSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENV 192

Query: 231 FKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLFNKMFERDVVPDVESYRTLMQGLCR 290
           FK      V PN  +YNIL+R FC+ GN+ +A TLF+KM  +  +P+V +Y TL+ G C+
Sbjct: 193 FKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCK 252

Query: 291 KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDVAH 350
             +++    LL  M  KG  P+ +SY  ++N LCR+ +++E   +L  M  +G + D   
Sbjct: 253 LRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVT 312

Query: 351 YNTVIMGFCREGRARDACKILEDMQSNGCLPNLVSYQSLTNGLCDQGMFELAKDYVEEMT 410
           YNT+I G+C+EG    A  +  +M  +G  P++++Y SL + +C  G    A +++++M 
Sbjct: 313 YNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMR 372

Query: 411 LKGFCPHFSVIHALVKGFCNVGRIDESCSILEDMLKHGKAPHSDTWEIIICGIC---DVE 470
           ++G CP+      LV GF   G ++E+  +L +M  +G +P   T+  +I G C    +E
Sbjct: 373 VRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKME 432

Query: 471 DTVKLCE-ILEKILKKDV 485
           D + + E + EK L  DV
Sbjct: 433 DAIAVLEDMKEKGLSPDV 450

BLAST of Clc08G02180 vs. TAIR 10
Match: AT3G48810.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 195.3 bits (495), Expect = 1.2e-49
Identity = 121/445 (27.19%), Postives = 196/445 (44.04%), Query Frame = 0

Query: 87  NVNHKQCEHQPDFSIGSPCRVQKLIASQSDPLLAKEIFDYACRQPHFRPSSSSLPILILK 146
           NVNH   E  P+ +      V K +  +S   LA   F        F+ +  +  ++I K
Sbjct: 27  NVNHLLTE-SPNHAEIKELDVVKRLRQESCVPLALHFFKSIANSNLFKHTPLTFEVMIRK 86

Query: 147 LGRSKYFSLIDDLLLSFKSRGYPVTPTVFSYIIKIYSEADLPDKALKAFYTMIEFGCTPS 206
           L        +  LL   K +G+  +  +F  +I +Y +  L ++A++ FY + EFGC PS
Sbjct: 87  LAMDGQVDSVQYLLQQMKLQGFHCSEDLFISVISVYRQVGLAERAVEMFYRIKEFGCDPS 146

Query: 207 SKHLNRILEILVSHRNFIRPAYDLFKNARHHGVLPNTKSYNILMRAFCWNGNLSIAYTLF 266
            K  N +L+ L+   N I+  Y ++++ +  G  PN  +YN+L++A C N  +  A  L 
Sbjct: 147 VKIYNHVLDTLLG-ENRIQMIYMVYRDMKRDGFEPNVFTYNVLLKALCKNNKVDGAKKLL 206

Query: 267 NKMFERDVVPDVESYRT------------------------------LMQGLCRKNQVNG 326
            +M  +   PD  SY T                              L+ GLC+++   G
Sbjct: 207 VEMSNKGCCPDAVSYTTVISSMCEVGLVKEGRELAERFEPVVSVYNALINGLCKEHDYKG 266

Query: 327 AVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCN----------- 386
           A +L+ +M+ KG  P+ +SY+TL+N LC   ++  A+  L +M  +GC+           
Sbjct: 267 AFELMREMVEKGISPNVISYSTLINVLCNSGQIELAFSFLTQMLKRGCHPNIYTLSSLVK 326

Query: 387 -------------------------PDVAHYNTVIMGFCREGRARDACKILEDMQSNGCL 446
                                    P+V  YNT++ GFC  G    A  +   M+  GC 
Sbjct: 327 GCFLRGTTFDALDLWNQMIRGFGLQPNVVAYNTLVQGFCSHGNIVKAVSVFSHMEEIGCS 386

Query: 447 PNLVSYQSLTNGLCDQGMFELAKDYVEEMTLKGFCPHFSVIHALVKGFCNVGRIDESCSI 466
           PN+ +Y SL NG   +G  + A     +M   G CP+  V   +V+  C   +  E+ S+
Sbjct: 387 PNIRTYGSLINGFAKRGSLDGAVYIWNKMLTSGCCPNVVVYTNMVEALCRHSKFKEAESL 446

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886671.13.8e-26392.95pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa ... [more]
KAG6592265.16.1e-26191.48Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022932826.11.8e-26091.68pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucur... [more]
XP_023515125.12.3e-26092.10pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucur... [more]
XP_022974029.16.7e-26091.89pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
Q8LDU52.8e-16059.78Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidop... [more]
Q9FNL22.3e-5330.65Pentatricopeptide repeat-containing protein At5g46100 OS=Arabidopsis thaliana OX... [more]
Q9FIX31.8e-5033.02Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
Q9M3021.7e-4827.19Pentatricopeptide repeat-containing protein At3g48810 OS=Arabidopsis thaliana OX... [more]
Q9FMF66.1e-4628.47Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1EXV38.6e-26191.68pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc... [more]
A0A6J1ICW53.3e-26091.89pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc... [more]
A0A5A7SWW31.2e-25991.29Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3CIG13.6e-25991.63pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like OS=Cuc... [more]
A0A0A0K8U02.0e-25489.83Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014780 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G01400.32.0e-16159.78FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
AT4G01400.11.2e-11046.90FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
AT5G46100.11.6e-5430.65Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G39710.11.3e-5133.02Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G48810.11.2e-4927.19Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 384..416
e-value: 2.0E-5
score: 22.5
coord: 245..278
e-value: 1.9E-6
score: 25.7
coord: 174..206
e-value: 3.7E-4
score: 18.5
coord: 280..313
e-value: 3.7E-7
score: 27.9
coord: 351..382
e-value: 1.1E-9
score: 35.8
coord: 315..348
e-value: 1.9E-8
score: 32.0
coord: 422..451
e-value: 4.3E-4
score: 18.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 174..203
e-value: 0.024
score: 14.9
coord: 423..448
e-value: 0.013
score: 15.7
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 307..340
e-value: 2.3E-8
score: 33.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 241..290
e-value: 4.8E-14
score: 52.3
coord: 346..395
e-value: 5.6E-15
score: 55.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 171..205
score: 9.240434
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 277..311
score: 11.958836
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 417..451
score: 9.602157
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 312..346
score: 12.232868
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 347..381
score: 12.868624
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 382..416
score: 10.742131
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 242..276
score: 11.553267
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 331..499
e-value: 9.0E-35
score: 122.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 76..221
e-value: 6.6E-12
score: 47.1
coord: 222..330
e-value: 1.2E-29
score: 104.9
NoneNo IPR availablePANTHERPTHR47942TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATEDcoord: 88..504
NoneNo IPR availablePANTHERPTHR47942:SF2OS09G0532800 PROTEINcoord: 88..504

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc08G02180.2Clc08G02180.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000373 Group II intron splicing
biological_process GO:0032981 mitochondrial respiratory chain complex I assembly
biological_process GO:0000963 mitochondrial RNA processing
biological_process GO:0015031 protein transport
biological_process GO:0060628 regulation of ER to Golgi vesicle-mediated transport
biological_process GO:0006890 retrograde vesicle-mediated transport, Golgi to endoplasmic reticulum
cellular_component GO:0070939 Dsl1/NZR complex
cellular_component GO:0016020 membrane
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding