ClCG01G008820 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG01G008820
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: CG_Chr01: 10807031 .. 10812788 (+)
RNA-Seq Expression: ClCG01G008820
Synteny: ClCG01G008820
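
The location field gives the chromosome, start and end coordinates, and strand. The sketch below parses that string and computes the genomic span it implies. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page.

```python
import re

def parse_location(text):
    """Split 'SeqID: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("CG_Chr01: 10807031 .. 10812788 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 5758 bases on the plus strand of CG_Chr01.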
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATTACTTCGCCAAAAACTGTCATCCTTTGTTTTCTCATCATCCAGAAATTCTCTTTGCTTATCCATACGAAAACTCATCTCAGTTGCAGTTGCAGATGAACTTATCAACGACGATGCAACTGTCAATTCCATATGCGATTCATTCACAAGACGCGAAACCTGGGACACTCTTTCTCGAAAATTCCAGTTGCTCCAACTTAACGATTACTTGGTTCAAAAAGTCCTGCTCAAATTCCAGCAACCCGTTGATGCCAAACGCGCTCTGGGGTTCTTCCACTGGTCCGCCAAGAGCAAGAATTTCAATCATGGGTCTCAATCGTACGGTATTATGATACATATTTTAGTGAAAGCGCGGCTGGTAATCGATGCTCGAGCTTTGCTCGAGTCAATTTTGAAGAAAAATGAGGGGAACTCTTTTGACTTCTCTGTTGTAGATTCGCTTTTGGATTCCTATGAGGTGACTGGTTCATCCCCATTTGTGTTCGATTTATTGATTCAGACTTGTGCTAAATTGAGATTGATTGATTTTGCTTTGTGTGTTTGTTCTCATTTGGAGGAACGTGGGTTCTCGTTGAGTCTGATAAGTTTCAACACTTTGATACATGTTGTGGAAAAATCTGATGAGAATCATAAGGTTTGGAAAATTTATGAGCAAATGATCCGGAAAAGAGTTTACCCAAATGCAATTACTGTTAGAATCATGATTAATTCGTTATGTAAGGAAGGTAAATTGCAAGAAACTTCTGATATGTTGAATAGAATCCATGGCAGTCGTTGCTCTGCTTCTTTGATTGTCAATACTTGTTTGATTTACAGGATTTTGGAGGAGGGAAGAGTTGAAGATGGTGTAATGTTGTTGAAGAGAATGTTGCAGAAAAATATGGTTCTTGATGACATTGCTTATTCACTGATTGTTTATGCTAAGGTGAAAACTGGGAGTATAACATCTGCATGGGAAGTGTTTGAGGAAATGTCTAAAAGAGGGTTTCGGGCAAATTCTTTCCTTTATACATTGTTCATTGGTGTCCATTGTAGAGGGGGAAAGATTGAGGAAGCTAATTGCTTAATGCAAGAGATGGAAAACATGGGTCTGAAGCCATATCCGGAGACCTTTAATCTTCTCATTGAAGGGTGTGCTACGTCAGGACATTCAGAAGAAAGCTTGAGAATGTGTGAGAAAATGTTAGATAGAGGGTTTCTTCCTAGTTGTTCAGTTTTCAACATGGTAATAGCTAAGATTTGTGAGGAAGGAGCTTTGAAAAATGCTAATGCATTGTTAACCATGTTATTAGATAAAGGGTTTTTACCTGATGAAACCACTTACACCAATCTGATCTTAGGATATAGGAAAAGTGGTGAAATTCAGGAGATTCTTAAGCTTTATTATGAAATGGAAGCTAGATTGCTGTCTCCTGGAGTGTCAGTTTTCTTTGTACTCATTGAGAGTCTTTGTCAATCTGGGAAATTGGAGGAAGCAGAGAAATATTGGAAGATTATGAAAGATAGTTCTATAACACCTAGTCTACCTATATATCAGACATTAACCTTGTTCTACTTGAAGAAGGGTAATAGAGCAAAAGCTCTAGAACTGTATAAAGAGATGATGTCCAATGGATAGAAGATTATCTGTTTCTAAGCATGTGCAGGTTTCAACATGAACTGAAACGTTCATATACACTCTCTTCTCTGGCAGGTTACGTTCAAAGAAGCAACTCAAATGAAAGGTGATGATTGGTGCCCTGCCTTTTATTTATTTATCTACTTAATTTCACTAAGAAAGAAAGCAGAGCTCTTCTAAATTCATAGTGAAAAATCAATTATCGGCTGTTTATTTAGCCTAGCCTTTATATGATAAGAAAGGGGTCAATTTAAGTGGAACTTTATCTTCAAGAAAACAGGGGCTATGGTTTCTGTTGGCCGTTTTTGCCAATTCTTGAGTTTACTTATCTACATTTTTAATCTGTATATGAATGGATCAAGAAGGCTTAGAAGTTT
GATATCTGATAACATATGAATACACTTTGTGACAAAGTAGCGTTGATCAAGGTAATTTCATGCGACTGCTCATTCTCAGGATCCACTTGCGGAGGTTGACCTTCAGTTCAAATGGTGCAATCTGGTGGGGTAATCGTTCCATCAATGTACTTGAACAACTTGTGAGCCTCAAGGATCGGAGAAATCTGGAAGTTCCGAAGAACATGTTAGACAAATCAAGATGAACAGCTACGAGATTGCAAATGTTCATGAGAAGAAAATTACATTGAGGAGTTTTGAGAATTCTGTTGATCCGCCATCACTAGCAAGCTTCAGCGCAAAAGACGAATTTGGGATGCTTGGCGAAAAGAAAAATGTGAGTCTTACTAACTCTGATAGCATAGAGATGCTCAGAAAATTGATAATTTCTCCTCTCATTTATTAGGAGCCATAAGGCAGTATTTTTACAGTTATAGAGGACCAAGGTTAAATAAACTAACTCTATCTTAGTCACTAAACTCAACTATGCGTAACAGAAAATAACAAACTAACAATGTACATCATTATTAGATAGGTGTTAGATTAAATGTCATGTTACTTCGATGAGGAACTTTTTGGTTGGACTCTCTGGTTAGTTTCAAGTTAGGTTTCTGCGAGTTTTGAGCTCAACTTCTAATTGGCTCTTGGAGTTTTATAAGATAGGGGCGTTTGGGGAGTGGCTGAGTGGGTTATATTAACCTAGCCCACGGAGTAAAATACGCTAGGTATAATTGTACCCCTCCCTTTATGTTATCTTTTTCCTATCTCTTTGTTATTTTACTCCCTTTTAATTTGGAAAATTCTCAGAAATAACACATTTAAGTTTTTACCTTTTAAAACTAGCTAAACTAGTTTTTCTTGGTTCATCTCGGGTGAGATCGACCACCTAGAGATTTAGTAAGAAAATAATTTTTTCTAACTTATCGATTGTTAATTTTTTCAAATATTACGTAATCTTCTAAACCTAAATTTTATATATTCTTAAATTTTCTCTTAAGTTATTTTTACCATATATTATATCAAATACTTAAATTTATGTTTCCATGAATTTCCAAACTATCCAAATAAATTTATCAATTATCCAAATAGATCTCAAGTGATTGCATTTTGATTTTAAATTTTGGGAAAGATTAAATTTTGCAAAAAAAAAAGAGTAAGATAGGTAAAATATATAGAATCTCAGTATTAATCTTGAGCATATTAACCGAGATTTTATATATTTGGGGTTTCTCTCTCTCTCTCTATATATATTTTTTTGCAAAATTTGATATTTCTCCATTTAGTCACAAGTCAATTACTTAAACTATTTGAATAATTTGGAAATTCACATAATTGTAGAAACATAAATTAGCTAAAGATTGAAAAGTTACGTAAATATTTTATATAACATTTGGGAAAGATAACGGAATTTGTAAAAAATTTAAAAATATATAAATTTAGTATAAAATTTGGAAAGATTTAGAAAAGTTGGCAGAATCTATATGTATCAGTCCATAAGTTAGAAAAAGATAATTTCTTAACTAGTTTAACAAATTTTCAAATAATGTGGCAGTTTTGAAAAGTGAAAATTGAAAAATTAAGGGTGCTAATTTTTTACCATTTCTCCTCTGACTTCTCGTTCTCTCTCTCCCACATGCACTCGATCTTAAATCGATTCTCTCGTCTTCCCTTACTCTCTCACCTTTCGTCTCCTCTTTCAACCATCCTTCTCTCTGTTTAACCTCTCGCACCATCCCTTTCTCTCTCATCTTTCGTATAGTCCATGGATCATCATAAATCTCTATTATCTGCTCAAGTTTCTCACGGGGTACCTCCCCACCCCAACCGCCATTCCAAAAATCGAATCTGAAATCAAGAAGATCAGATGGTAAAATTGGAGCAAAGGCCACGACCCTTTTCAGAATTAGATATCTAAGCCCATCCTCGGCGATCGAAGAATCGAAAAATGCATGAAATGCCGTCGTCTTCGAGTTACCCCCAAC
AATTGGGCTCTGGTTCGTTGAGTAGGCAAAATCTACCCCACCCTTAAATTGGGAGGTGGATCTTTGGAATCGCTCACCATTTGGGGGTTGGGAAGGGAAATAGGAGAGAATTGAAGAGAAAAGGGGAGTGTGTCGATGGGGTGGGTATTGTGATCATGGGTGGAGCCTAAGCCGCCATGGGAGGAGGAGTGGAAGAAGAAATGGAATAGAAGTAAGATTTGCAATGGGAAATTGGGAATGAATGAATGAATGGATTCGAGGACATAAACGAAGAGAATCGTGTAAGGAGAGAGTAGTGGTGGCAGGGAGAGGCGACTCGCGCCGTTGAGAGATTTGGATTCAAAACGGTTTATGCATAAAGGTAGAGTGTAAAAACTCGAATCGAACCGGACCAGTTCAGTTCGGATTGTTTTTATTAGAACAAGAAGTCGAAAGGAGAACCAGATCTTGGTGCTGATCCTTTTTACTCTCTCACAACCACGTTCTTCAAATTCAAGCGGCTGATCAATTTTGGAAGGAAAATCGAAACCGAGCCAATCCACTCCAAACCTGAAATGAACCGGATCAGTTCGGGTCGGTTTAAAAAAAATACCCAACCCCGACCCACTTACACTCCTACATAAACGTTTTAATTTGCTTTACTTTGCAATTTGCATGGATTTATTTATTTATTTTAAATAAAACAACAAAGTTTGAAGATCCAATTTTCTATTTCTAAAGAAAGGGCAGGCTTATATTTTTTCGGAAAAAAAAAAAACATGAACTGAAAAACTTGCACCCCAAATACAAATTATTTTAAAAGTTAACTTTTCAAAACTAATTTTTCTAAATATTGTGTAATGTTTATACCAAATCTTATTTCTTTTCAAATTCTTTTGTAATATCTTAGAATGTTGATGTTAATCCTAAGAAAATCTCACCTAACAATTCGCCTTACCTTCCAACATTTGGGTGTTAAGAAAATTTGGTTTTTTTAATTTTAATTTTAATTTTTTATTTTTTATATATATTTTTTTATTATGTTCCCACCACCAATAGGCCAATTGATTGTAATTTTGAGAATCAAGAAATGAAAGTTACATAACACCTCAAGGCCCATTGACTAAATTTGTAATCAAGTATGAATGCGGGATATGTTTATAAATTAAAAATATAATAAATGGTTAGATATATAATATGTAATAAAAGTTGAAAGGTTGAAGTTTGAAATGGAGCAAATGGTTGATAGATTAGGGATGAAATTGATATTTTATTTATTAGAAAATGACTAAATTAGGATTTTACAATGTTAGTGTCCTTAGTCTCTCATCTTCTTCTCGAGAAAAAACCATGGCCATTGTAATTGCATTGTAATTGGAGTTCTTAATGAGTCTGATAGAGAAACTATCTTCGAAGTGAGATGGGGGATTTTAGGATTACCACTTTGGCTACCCGTTATAACTGACGGATGATTGGTTGTTTGGTTGTTTTGATTTTGTGTATTTTTCAGATGGCCAATACTGATGGTATACCAAACATAGACTGATGTTAAGAGAGGTGAAATGTGGGTATGGATGATGTTATAGATTCCTTGTATATTTAGGAATGCGTATCAACAAGGCTAGGAGCGCTATGAACAACTCCCGACTGGTCTGAAGACTTGAGTCTCAAACAAATCAAGTATGCAACCGCTTCGATTTCCTACGAAATCGACGAGGATTTGTAATAAGTTTTCTTTAATTTTTGTACAAAGTTTGAGCGATTCCATCTCAATTGCAG

mRNA sequence

ATGGCATTACTTCGCCAAAAACTGTCATCCTTTGTTTTCTCATCATCCAGAAATTCTCTTTGCTTATCCATACGAAAACTCATCTCAGTTGCAGTTGCAGATGAACTTATCAACGACGATGCAACTGTCAATTCCATATGCGATTCATTCACAAGACGCGAAACCTGGGACACTCTTTCTCGAAAATTCCAGTTGCTCCAACTTAACGATTACTTGGTTCAAAAAGTCCTGCTCAAATTCCAGCAACCCGTTGATGCCAAACGCGCTCTGGGGTTCTTCCACTGGTCCGCCAAGAGCAAGAATTTCAATCATGGGTCTCAATCGTACGGTATTATGATACATATTTTAGTGAAAGCGCGGCTGGTAATCGATGCTCGAGCTTTGCTCGAGTCAATTTTGAAGAAAAATGAGGGGAACTCTTTTGACTTCTCTGTTGTAGATTCGCTTTTGGATTCCTATGAGGTGACTGGTTCATCCCCATTTGTGTTCGATTTATTGATTCAGACTTGTGCTAAATTGAGATTGATTGATTTTGCTTTGTGTGTTTGTTCTCATTTGGAGGAACGTGGGTTCTCGTTGAGTCTGATAAGTTTCAACACTTTGATACATGTTGTGGAAAAATCTGATGAGAATCATAAGGTTTGGAAAATTTATGAGCAAATGATCCGGAAAAGAGTTTACCCAAATGCAATTACTGTTAGAATCATGATTAATTCGTTATGTAAGGAAGGTAAATTGCAAGAAACTTCTGATATGTTGAATAGAATCCATGGCAGTCGTTGCTCTGCTTCTTTGATTGTCAATACTTGTTTGATTTACAGGATTTTGGAGGAGGGAAGAGTTGAAGATGGTGTAATGTTGTTGAAGAGAATGTTGCAGAAAAATATGGTTCTTGATGACATTGCTTATTCACTGATTGTTTATGCTAAGGTGAAAACTGGGAGTATAACATCTGCATGGGAAGTGTTTGAGGAAATGTCTAAAAGAGGGTTTCGGGCAAATTCTTTCCTTTATACATTGTTCATTGGTGTCCATTGTAGAGGGGGAAAGATTGAGGAAGCTAATTGCTTAATGCAAGAGATGGAAAACATGGGTCTGAAGCCATATCCGGAGACCTTTAATCTTCTCATTGAAGGGTGTGCTACGTCAGGACATTCAGAAGAAAGCTTGAGAATGTGTGAGAAAATGTTAGATAGAGGGTTTCTTCCTAGTTGTTCAGTTTTCAACATGGTAATAGCTAAGATTTGTGAGGAAGGAGCTTTGAAAAATGCTAATGCATTGTTAACCATGTTATTAGATAAAGGGTTTTTACCTGATGAAACCACTTACACCAATCTGATCTTAGGATATAGGAAAAGTGGTGAAATTCAGGAGATTCTTAAGCTTTATTATGAAATGGAAGCTAGATTGCTGTCTCCTGGAGTGTCAGTTTTCTTTGTACTCATTGAGAGTCTTTGTCAATCTGGGAAATTGGAGGAAGCAGAGAAATATTGGAAGATTATGAAAGATAGTTCTATAACACCTAGTCTACCTATATATCAGACATTAACCTTGTTCTACTTGAAGAAGGGTAATAGAGCAAAAGCTCTAGAACTGTTTCAACATGAACTGAAACGTTCATATACACTCTCTTCTCTGGCAGGTTACGTTCAAAGAAGCAACTCAAATGAAAGGATCCACTTGCGGAGGTTGACCTTCAGTTCAAATGGTGCAATCTGGTGGGGTAATCGTTCCATCAATGTACTTGAACAACTTGTGAGCCTCAAGGATCGGAGAAATCTGGAAGTTCCGAAGAACATGTTAGACAAATCAAGATGAACAGCTACGAGATTGCAAATGTTCATGAGAAGAAAATTACATTGAGGAGTTTTGAGAATTCTGTTGATCCGCCATCACTAGCAAGCTTCAGCGCAAAAGACGAATTTGGGATGCTTGGCGAAAAGAAAAATATGGCCAATACTGATGGTATACCAAACATAGACTGATGTTAAGAGAGGTGA
AATGTGGGTATGGATGATGTTATAGATTCCTTGTATATTTAGGAATGCGTATCAACAAGGCTAGGAGCGCTATGAACAACTCCCGACTGGTCTGAAGACTTGAGTCTCAAACAAATCAAGTATGCAACCGCTTCGATTTCCTACGAAATCGACGAGGATTTGTAATAAGTTTTCTTTAATTTTTGTACAAAGTTTGAGCGATTCCATCTCAATTGCAG

Coding sequence (CDS)

ATGGCATTACTTCGCCAAAAACTGTCATCCTTTGTTTTCTCATCATCCAGAAATTCTCTTTGCTTATCCATACGAAAACTCATCTCAGTTGCAGTTGCAGATGAACTTATCAACGACGATGCAACTGTCAATTCCATATGCGATTCATTCACAAGACGCGAAACCTGGGACACTCTTTCTCGAAAATTCCAGTTGCTCCAACTTAACGATTACTTGGTTCAAAAAGTCCTGCTCAAATTCCAGCAACCCGTTGATGCCAAACGCGCTCTGGGGTTCTTCCACTGGTCCGCCAAGAGCAAGAATTTCAATCATGGGTCTCAATCGTACGGTATTATGATACATATTTTAGTGAAAGCGCGGCTGGTAATCGATGCTCGAGCTTTGCTCGAGTCAATTTTGAAGAAAAATGAGGGGAACTCTTTTGACTTCTCTGTTGTAGATTCGCTTTTGGATTCCTATGAGGTGACTGGTTCATCCCCATTTGTGTTCGATTTATTGATTCAGACTTGTGCTAAATTGAGATTGATTGATTTTGCTTTGTGTGTTTGTTCTCATTTGGAGGAACGTGGGTTCTCGTTGAGTCTGATAAGTTTCAACACTTTGATACATGTTGTGGAAAAATCTGATGAGAATCATAAGGTTTGGAAAATTTATGAGCAAATGATCCGGAAAAGAGTTTACCCAAATGCAATTACTGTTAGAATCATGATTAATTCGTTATGTAAGGAAGGTAAATTGCAAGAAACTTCTGATATGTTGAATAGAATCCATGGCAGTCGTTGCTCTGCTTCTTTGATTGTCAATACTTGTTTGATTTACAGGATTTTGGAGGAGGGAAGAGTTGAAGATGGTGTAATGTTGTTGAAGAGAATGTTGCAGAAAAATATGGTTCTTGATGACATTGCTTATTCACTGATTGTTTATGCTAAGGTGAAAACTGGGAGTATAACATCTGCATGGGAAGTGTTTGAGGAAATGTCTAAAAGAGGGTTTCGGGCAAATTCTTTCCTTTATACATTGTTCATTGGTGTCCATTGTAGAGGGGGAAAGATTGAGGAAGCTAATTGCTTAATGCAAGAGATGGAAAACATGGGTCTGAAGCCATATCCGGAGACCTTTAATCTTCTCATTGAAGGGTGTGCTACGTCAGGACATTCAGAAGAAAGCTTGAGAATGTGTGAGAAAATGTTAGATAGAGGGTTTCTTCCTAGTTGTTCAGTTTTCAACATGGTAATAGCTAAGATTTGTGAGGAAGGAGCTTTGAAAAATGCTAATGCATTGTTAACCATGTTATTAGATAAAGGGTTTTTACCTGATGAAACCACTTACACCAATCTGATCTTAGGATATAGGAAAAGTGGTGAAATTCAGGAGATTCTTAAGCTTTATTATGAAATGGAAGCTAGATTGCTGTCTCCTGGAGTGTCAGTTTTCTTTGTACTCATTGAGAGTCTTTGTCAATCTGGGAAATTGGAGGAAGCAGAGAAATATTGGAAGATTATGAAAGATAGTTCTATAACACCTAGTCTACCTATATATCAGACATTAACCTTGTTCTACTTGAAGAAGGGTAATAGAGCAAAAGCTCTAGAACTGTTTCAACATGAACTGAAACGTTCATATACACTCTCTTCTCTGGCAGGTTACGTTCAAAGAAGCAACTCAAATGAAAGGATCCACTTGCGGAGGTTGACCTTCAGTTCAAATGGTGCAATCTGGTGGGGTAATCGTTCCATCAATGTACTTGAACAACTTGTGAGCCTCAAGGATCGGAGAAATCTGGAAGTTCCGAAGAACATGTTAGACAAATCAAGATGA

Protein sequence

MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLSRKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKARLVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSLCKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELFQHELKRSYTLSSLAGYVQRSNSNERIHLRRLTFSSNGAIWWGNRSINVLEQLVSLKDRRNLEVPKNMLDKSR
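
The protein sequence corresponds to the coding sequence read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code, the function below translates a DNA string until the first stop codon. The name `translate` and the compact codon-table construction are illustrative choices, not something provided by the database. Only the first eight and the last four codons of the CDS above are used, so the example stays short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCATTACTTCGCCAAAAACTG"))  # first eight codons of the CDS
print(translate("AAATCAAGATGA"))              # last four codons, ending in TGA
```

The two fragments translate to MALLRQKL and KSR, matching the start and end of the protein sequence listed above.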
Homology
BLAST of ClCG01G008820 vs. NCBI nr
Match: XP_038875410.1 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Benincasa hispida] >XP_038875411.1 pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Benincasa hispida])

HSP 1 Score: 957.6 bits (2474), Expect = 5.1e-275
Identity = 486/533 (91.18%), Positives = 507/533 (95.12%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MALLRQKLS F  SSS+NSLCLS RKLIS+A ADELINDDATVNSICDSFTRRE+WDTL+
Sbjct: 1   MALLRQKLSLFGLSSSKNSLCLSTRKLISIAAADELINDDATVNSICDSFTRRESWDTLT 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKFQLLQLND+LVQKVLLKFQQPVDAKRALGFFHWSAK KNFNHGSQS GIMIHILVKAR
Sbjct: 61  RKFQLLQLNDFLVQKVLLKFQQPVDAKRALGFFHWSAKRKNFNHGSQSCGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LVIDARALLESILKKNEG+SFDFSVVDSLLDSYEVT SSPFVFDLLIQTCAKLRLIDFAL
Sbjct: 121 LVIDARALLESILKKNEGSSFDFSVVDSLLDSYEVTSSSPFVFDLLIQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
           CVCS L E GFSLSLISFNTLIHVVEKSDEN KVWKIYE MIRKRVYPNAIT RIMINSL
Sbjct: 181 CVCSRLGEHGFSLSLISFNTLIHVVEKSDENRKVWKIYEHMIRKRVYPNAITTRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQETSDMLNR+HGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNMVLDD
Sbjct: 241 CKEGKLQETSDMLNRVHGSRCSASLIVNACLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAKVKTGSITSAWEVFEEMSKRGF+ANSF+YTLFIGVHCR GKIEEA+CLMQE
Sbjct: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFQANSFIYTLFIGVHCREGKIEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPYPETF+LLIEG ATSGHS+ESLRMCEKML+RGFLPSCSVFN+ IAKICEEG 
Sbjct: 361 MENMGLKPYPETFSLLIEGYATSGHSKESLRMCEKMLERGFLPSCSVFNLAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K ANALLT+LLDKGF+PDETTYTNLI+GY KSGEIQEILKLYYEMEARLLSPGVSVFF 
Sbjct: 421 VKKANALLTILLDKGFIPDETTYTNLIIGYWKSGEIQEILKLYYEMEARLLSPGVSVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
           L+ SLCQ GKLEEAEKYWKIMKDSS+ PSLPIYQTLTLFYLKKGNRAKA EL+
Sbjct: 481 LVGSLCQYGKLEEAEKYWKIMKDSSLKPSLPIYQTLTLFYLKKGNRAKAQELY 533
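
The percentages reported for each hit are the count of identical (or similar) aligned positions divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the first hit above; the function name `percent` is hypothetical, and rounding to two decimals is assumed from how the figures are displayed.

```python
def percent(count, alignment_length):
    """Share of aligned positions, as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the first NCBI nr hit (XP_038875410.1).
print(percent(486, 533))  # identical positions
print(percent(507, 533))  # identical plus similar positions
```

These reproduce the 91.18% and 95.12% shown for that hit.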

BLAST of ClCG01G008820 vs. NCBI nr
Match: XP_008460528.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucumis melo] >KAA0062307.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK26670.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 940.3 bits (2429), Expect = 8.5e-270
Identity = 475/533 (89.12%), Positives = 502/533 (94.18%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MALLRQKLS  V SSS+ SLCLSIRKLIS  VAD+LINDDATVNSIC+SFTRR++WD LS
Sbjct: 1   MALLRQKLSPIVLSSSKISLCLSIRKLISTPVADDLINDDATVNSICESFTRRQSWDALS 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKFQ L+LND LVQKVLLKFQQPVDAK ALGFFHWSAK KNFNHGSQSYGIMIHILVKAR
Sbjct: 61  RKFQFLELNDLLVQKVLLKFQQPVDAKLALGFFHWSAKRKNFNHGSQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALL+SILKKNEGNSFD+SVVDSLLDSY+VTGSSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLQSILKKNEGNSFDYSVVDSLLDSYKVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
           C CSHLEERGFSLSLISFNTLIHV+EKSDEN KVWKIYEQMI KRVYPNAITVRIMINSL
Sbjct: 181 CFCSHLEERGFSLSLISFNTLIHVLEKSDENRKVWKIYEQMIGKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQETSDMLNRIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNMVLDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAKVKTGSITS WEVFEEMSKRGF+ANSF+YTL IGVHCRGG++EEA+CLMQE
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSKRGFQANSFIYTLLIGVHCRGGEVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPY ETFNLLIEGCA SGHSEE LRMCEKML+RGFLPSCSVFN+ IAKICEEG 
Sbjct: 361 MENMGLKPYSETFNLLIEGCAISGHSEEILRMCEKMLERGFLPSCSVFNVAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K AN LLT+LLDKGFLPDETTYTNLI+GYRKSGEI EILKLYYEMEARLLSPG+SVFF 
Sbjct: 421 VKKANELLTILLDKGFLPDETTYTNLIIGYRKSGEILEILKLYYEMEARLLSPGISVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
           LI SLCQSG+LEEAEKY KI+KDSS+TPSL IYQ L LFYLKKGNRAKALEL+
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPSLSIYQALILFYLKKGNRAKALELY 533

BLAST of ClCG01G008820 vs. NCBI nr
Match: XP_031740872.1 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucumis sativus] >KGN51057.1 hypothetical protein Csa_008589 [Cucumis sativus])

HSP 1 Score: 934.9 bits (2415), Expect = 3.6e-268
Identity = 473/533 (88.74%), Positives = 502/533 (94.18%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MALLRQKLS  V  SS+ SLCLS+R LIS+ VAD+LINDDATVNSICDS TRR++WDTLS
Sbjct: 1   MALLRQKLSPIVL-SSKISLCLSMRNLISIPVADKLINDDATVNSICDSLTRRQSWDTLS 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKFQ L+LND+LVQKVLLKFQQPVDAKRALGFFHWSAK KNFNHG QS+GIMIHILVKAR
Sbjct: 61  RKFQFLELNDFLVQKVLLKFQQPVDAKRALGFFHWSAKRKNFNHGPQSFGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALLESILKKNEGNSFD+SVVDSL+DSYEVTGSSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLESILKKNEGNSFDYSVVDSLMDSYEVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
           CVCSHLEERGFSLSLISFNTLIHVVEKSDEN KVWKIYEQMIRKRVYPNAITVRIMINSL
Sbjct: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENLKVWKIYEQMIRKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQETSDMLNRIHGSRCSASLIVN CLIYRILEEGRVEDG+ LLKRMLQKNMVLDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGITLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAKVKTGSITS WEVFEEMS+RGF+ANSF+YTLFIGVHCRGGK+EEA+CLMQE
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSERGFQANSFIYTLFIGVHCRGGKVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPYPETFNLLIEGCA SGHSEE L MCEKML+RGFLPSCSVFN+ I KICE+G 
Sbjct: 361 MENMGLKPYPETFNLLIEGCAISGHSEEILSMCEKMLERGFLPSCSVFNVAIDKICEKGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K ANALLT+LLDKGFLPDETTYTNLI+GYRKSGEIQEILKLYYEM ARLLSPGVSVFF 
Sbjct: 421 VKKANALLTILLDKGFLPDETTYTNLIIGYRKSGEIQEILKLYYEMGARLLSPGVSVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
           LI SLCQSG+LEEAEKY KI+KDSS+TP L IYQ L L YLKKGNRAKALEL+
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPCLSIYQALILLYLKKGNRAKALELY 532

BLAST of ClCG01G008820 vs. NCBI nr
Match: XP_023550461.1 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 881.3 bits (2276), Expect = 4.7e-252
Identity = 449/531 (84.56%), Positives = 486/531 (91.53%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MA LRQKLS+F+ SSS+ S C SIRKL S+  ADELIN+DA VNSICDSFTRRE+WDTL+
Sbjct: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKF+ L+LND LVQKVLLKFQQ VDAKRALGFFHWSAK KNFNHG QSYGIMIHILVKAR
Sbjct: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LVIDARALLESILKKNEG+SF+FS+VDSLLD+YEVT SSPFVFDLLIQTCAKLRLIDFAL
Sbjct: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
            +C+HLEERGFSLSLISFNTL+HVVEKSDEN KVWKIYEQMIRKRVYPN ITVRIMINSL
Sbjct: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAK+K G+I SA EVF+EMSKRGF+ANSF+YTLFIG HCR G+IEEA+CLM+E
Sbjct: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPYPETFNLLIEGC     SEESLRMCEKML+RGF+PSCS FN+ IAKICEEG 
Sbjct: 361 MENMGLKPYPETFNLLIEGCR---DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K AN +LT+LLDKGFLPDETTYTNLI+GY K GE QEILKLYYEM+ARLLSPGVSVFF 
Sbjct: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALE 532
           LI S CQSG+LEEAEKY KIMKD SI PS+ IYQTL+LFYLKKGNRAKALE
Sbjct: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALE 528

BLAST of ClCG01G008820 vs. NCBI nr
Match: XP_022961108.1 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucurbita moschata])

HSP 1 Score: 879.4 bits (2271), Expect = 1.8e-251
Identity = 449/531 (84.56%), Positives = 485/531 (91.34%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MA LRQKLS+F+ SSS+ S C SIRKL S+A ADELIN+DA VNSICDSFTRRE+WDTL+
Sbjct: 1   MAFLRQKLSTFILSSSKISFCSSIRKLTSIAAADELINNDAIVNSICDSFTRRESWDTLT 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKF+ L+LND LVQKVLLKFQQ VDAKRALGFFHWSAK KNFNHG QSYGIMIHILVKAR
Sbjct: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LVIDARALLESILKKNEG+SF+FS+VDSLLD+YEVT SSPFVFDLLIQTCAKLRLIDFAL
Sbjct: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
            + +HLEERGFSLSLISFNTL+HVVEKSDEN KVWKIYEQMIRKRVYPN ITVRIMINSL
Sbjct: 181 NMSAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLI+YAK+K G+I SA EVF+EMSKRGF+ANSF+YTLFIG HCRGG+IEEA+CLM+E
Sbjct: 301 IAYSLILYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRGGRIEEAHCLMEE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPYPETFNLLIEGC     SEESLRMCEKML+RGF+PSCS FN+ IAKICEEG 
Sbjct: 361 MENMGLKPYPETFNLLIEGCR---DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K AN +LT LLDKGFLPDETTY NLI+GY K GE QEILKLYYEM+ARLLSPGVSVFF 
Sbjct: 421 VKKANEMLTTLLDKGFLPDETTYINLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALE 532
           LI S CQSGKLEEAEKY KIMKD SI PS+ IYQTL+LFYLKKGNRAKALE
Sbjct: 481 LIGSFCQSGKLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALE 528

BLAST of ClCG01G008820 vs. ExPASy Swiss-Prot
Match: Q3ECH5 (Pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g66345 PE=3 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.7e-135
Identity = 246/487 (50.51%), Positives = 334/487 (68.58%), Query Frame = 0

Query: 43  VNSICDSFTRRETWDTLSRKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNF 102
           ++ I  S    +TW+TLS KF  + L+D L++ +LL+F+ P  AK+AL FFHWS+ ++N 
Sbjct: 50  IDYISKSLQSNDTWETLSTKFSSIDLSDSLIETILLRFKNPETAKQALSFFHWSSHTRNL 109

Query: 103 NHGSQSYGIMIHILVKARLVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFV 162
            HG +SY + IHILVKARL+IDARAL+ES L  +  +S    +VDSLLD+YE++ S+P V
Sbjct: 110 RHGIKSYALTIHILVKARLLIDARALIESSLLNSPPDS---DLVDSLLDTYEISSSTPLV 169

Query: 163 FDLLIQTCAKLRLIDFALCVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMI 222
           FDLL+Q  AK+R ++    V   L + GF+LS+I+ NTLIH   KS  +  VW+IYE  I
Sbjct: 170 FDLLVQCYAKIRYLELGFDVFKRLCDCGFTLSVITLNTLIHYSSKSKIDDLVWRIYECAI 229

Query: 223 RKRVYPNAITVRIMINSLCKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVE 282
            KR+YPN IT+RIMI  LCKEG+L+E  D+L+RI G RC  S+IVNT L++R+LEE R+E
Sbjct: 230 DKRIYPNEITIRIMIQVLCKEGRLKEVVDLLDRICGKRCLPSVIVNTSLVFRVLEEMRIE 289

Query: 283 DGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFI 342
           + + LLKR+L KNMV+D I YS++VYAK K G + SA +VF+EM +RGF ANSF+YT+F+
Sbjct: 290 ESMSLLKRLLMKNMVVDTIGYSIVVYAKAKEGDLVSARKVFDEMLQRGFSANSFVYTVFV 349

Query: 343 GVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFL 402
            V C  G ++EA  L+ EME  G+ PY ETFN LI G A  G  E+ L  CE M+ RG +
Sbjct: 350 RVCCEKGDVKEAERLLSEMEESGVSPYDETFNCLIGGFARFGWEEKGLEYCEVMVTRGLM 409

Query: 403 PSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKL 462
           PSCS FN ++  + +   +  AN +LT  +DKGF+PDE TY++LI G+ +  +I + LKL
Sbjct: 410 PSCSAFNEMVKSVSKIENVNRANEILTKSIDKGFVPDEHTYSHLIRGFIEGNDIDQALKL 469

Query: 463 YYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLK 522
           +YEME R +SPG  VF  LI  LC  GK+E  EKY KIMK   I P+  IY  L   + K
Sbjct: 470 FYEMEYRKMSPGFEVFRSLIVGLCTCGKVEAGEKYLKIMKKRLIEPNADIYDALIKAFQK 529

Query: 523 KGNRAKA 530
            G++  A
Sbjct: 530 IGDKTNA 533

BLAST of ClCG01G008820 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 188.3 bits (477), Expect = 2.5e-46
Identity = 120/461 (26.03%), Positives = 227/461 (49.24%), Query Frame = 0

Query: 84  VDAKRALGFFHWSAKSKNF--NHGSQSYGIMIHILVKARLVIDARALLESILKKNEGNSF 143
           V  K AL F  W  K      +H  Q   I  HILV+AR+   AR +L+ +   +  +SF
Sbjct: 48  VHGKLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF 107

Query: 144 DFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSHLEERGFSLSLISFNTL 203
            F    +L+ +Y +  S+P V+D+LI+   +  +I  +L +   +   GF+ S+ + N +
Sbjct: 108 VFG---ALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAI 167

Query: 204 IHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSLCKEGKLQETSDMLNRIHGSRC 263
           +  V KS E+  VW   ++M+++++ P+  T  I+IN LC EG  +++S ++ ++  S  
Sbjct: 168 LGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGY 227

Query: 264 SASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWE 323
           + +++    +++   ++GR +  + LL  M  K +  D   Y+++++   ++  I   + 
Sbjct: 228 APTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYL 287

Query: 324 VFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCA 383
           +  +M KR    N   Y   I      GK+  A+ L+ EM + GL P   TFN LI+G  
Sbjct: 288 LLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHI 347

Query: 384 TSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDET 443
           + G+ +E+L+M   M  +G  PS   + +++  +C+      A      +   G      
Sbjct: 348 SEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRI 407

Query: 444 TYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIM 503
           TYT +I G  K+G + E + L  EM    + P +  +  LI   C+ G+ + A++    +
Sbjct: 408 TYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRI 467

Query: 504 KDSSITPSLPIYQTLTLFYLKKGNRAKALELFQHELKRSYT 543
               ++P+  IY TL     + G   +A+ +++  +   +T
Sbjct: 468 YRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHT 505

BLAST of ClCG01G008820 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 2.3e-44
Identity = 114/464 (24.57%), Positives = 226/464 (48.71%), Query Frame = 0

Query: 76  VLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKARLVIDARALLESILKK 135
           +LLK Q   D    L F +W+   + F    +   I +HIL K +L   A+ L E +  K
Sbjct: 54  LLLKSQN--DQALILKFLNWANPHQFFTLRCKC--ITLHILTKFKLYKTAQILAEDVAAK 113

Query: 136 NEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSHLEERGFSLSL 195
              + +   V  SL ++Y++  S+  VFDL++++ ++L LID AL +    +  GF   +
Sbjct: 114 TLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGV 173

Query: 196 ISFNTLIHVVEKSDENHK-VWKIYEQMIRKRVYPNAITVRIMINSLCKEGKLQETSDMLN 255
           +S+N ++    +S  N      ++++M+  +V PN  T  I+I   C  G +     + +
Sbjct: 174 LSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD 233

Query: 256 RIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTG 315
           ++    C  +++    LI    +  +++DG  LL+ M  K +  + I+Y++++    + G
Sbjct: 234 KMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 293

Query: 316 SITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFN 375
            +     V  EM++RG+  +   Y   I  +C+ G   +A  +  EM   GL P   T+ 
Sbjct: 294 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYT 353

Query: 376 LLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDK 435
            LI     +G+   ++   ++M  RG  P+   +  ++    ++G +  A  +L  + D 
Sbjct: 354 SLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDN 413

Query: 436 GFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEA 495
           GF P   TY  LI G+  +G++++ + +  +M+ + LSP V  +  ++   C+S  ++EA
Sbjct: 414 GFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEA 473

Query: 496 EKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELFQHELK 539
            +  + M +  I P    Y +L   + ++    +A +L++  L+
Sbjct: 474 LRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLR 513

BLAST of ClCG01G008820 vs. ExPASy Swiss-Prot
Match: Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 5.1e-44
Identity = 109/414 (26.33%), Positives = 208/414 (50.24%), Query Frame = 0

Query: 122 VIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTG--SSPFVFDLLIQTCAKLRLIDFA 181
           VID   L  +I K  +     + +V +L    E  G   S +   ++I    + R + +A
Sbjct: 88  VIDFNRLFSAIAKTKQ-----YELVLALCKQMESKGIAHSIYTLSIMINCFCRCRKLSYA 147

Query: 182 LCVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINS 241
                 + + G+    + FNTL++ +       +  ++ ++M+     P  IT+  ++N 
Sbjct: 148 FSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNG 207

Query: 242 LCKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLD 301
           LC  GK+ +   +++R+  +    + +    ++  + + G+    + LL++M ++N+ LD
Sbjct: 208 LCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLD 267

Query: 302 DIAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQ 361
            + YS+I+    K GS+ +A+ +F EM  +GF+A+   Y   IG  C  G+ ++   L++
Sbjct: 268 AVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLR 327

Query: 362 EMENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEG 421
           +M    + P   TF++LI+     G   E+ ++ ++M+ RG  P+   +N +I   C+E 
Sbjct: 328 DMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKEN 387

Query: 422 ALKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFF 481
            L+ A  ++ +++ KG  PD  T+  LI GY K+  I + L+L+ EM  R +      + 
Sbjct: 388 RLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYN 447

Query: 482 VLIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
            L++  CQSGKLE A+K ++ M    + P +  Y+ L       G   KALE+F
Sbjct: 448 TLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIF 496

BLAST of ClCG01G008820 vs. ExPASy Swiss-Prot
Match: Q9LN69 (Putative pentatricopeptide repeat-containing protein At1g19290 OS=Arabidopsis thaliana OX=3702 GN=At1g19290 PE=3 SV=2)

HSP 1 Score: 178.3 bits (451), Expect = 2.6e-43
Identity = 126/506 (24.90%), Positives = 240/506 (47.43%), Query Frame = 0

Query: 66  LQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKARLVIDA 125
           L  +D L+  +L + +  ++ +  L  F+ ++K + F    ++Y  M+HIL +AR     
Sbjct: 66  LDFSDELLNSILRRLR--LNPEACLEIFNLASKQQKFRPDYKAYCKMVHILSRARNYQQT 125

Query: 126 RALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSH 185
           ++ L  ++  N      F V   L+  ++    SP VFD++++  A+  L+  AL V  +
Sbjct: 126 KSYLCELVALNHSG---FVVWGELVRVFKEFSFSPTVFDMILKVYAEKGLVKNALHVFDN 185

Query: 186 LEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSLCKEGK 245
           +   G   SL+S N+L+  + +  EN     +Y+QMI   V P+  T  I++N+ C+ G 
Sbjct: 186 MGNYGRIPSLLSCNSLLSNLVRKGENFVALHVYDQMISFEVSPDVFTCSIVVNAYCRSGN 245

Query: 246 L-------QETSDMLN----------------------------RIHGSR-CSASLIVNT 305
           +       +ET   L                             R+   R  S +++  T
Sbjct: 246 VDKAMVFAKETESSLGLELNVVTYNSLINGYAMIGDVEGMTRVLRLMSERGVSRNVVTYT 305

Query: 306 CLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWEVFEEMSKR 365
            LI    ++G +E+   + + + +K +V D   Y +++    +TG I  A  V + M + 
Sbjct: 306 SLIKGYCKKGLMEEAEHVFELLKEKKLVADQHMYGVLMDGYCRTGQIRDAVRVHDNMIEI 365

Query: 366 GFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCATSGHSEES 425
           G R N+ +    I  +C+ G++ EA  +   M +  LKP   T+N L++G   +G+ +E+
Sbjct: 366 GVRTNTTICNSLINGYCKSGQLVEAEQIFSRMNDWSLKPDHHTYNTLVDGYCRAGYVDEA 425

Query: 426 LRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDETTYTNLILG 485
           L++C++M  +  +P+   +N+++      GA  +  +L  M+L +G   DE + + L+  
Sbjct: 426 LKLCDQMCQKEVVPTVMTYNILLKGYSRIGAFHDVLSLWKMMLKRGVNADEISCSTLLEA 485

Query: 486 YRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIMKDSSITPS 536
             K G+  E +KL+  + AR L        V+I  LC+  K+ EA++    +      P+
Sbjct: 486 LFKLGDFNEAMKLWENVLARGLLTDTITLNVMISGLCKMEKVNEAKEILDNVNIFRCKPA 545
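
The identity and positives percentages in the HSP header above follow directly from the residue counts. The sketch below recomputes them for this hit, assuming (as in standard BLAST output) that the denominator is the number of alignment columns, gaps included, and that positives count identical residues plus substitutions with a positive matrix score. The helper name `percent` is illustrative only.

```python
# Residue counts copied from the HSP header: (matching columns, alignment columns).
identical = (126, 506)
positives = (240, 506)


def percent(count: int, columns: int) -> float:
    """Return count/columns as a percentage rounded to two decimals."""
    return round(100.0 * count / columns, 2)


identity_pct = percent(*identical)
positives_pct = percent(*positives)
print(f"Identity  = {identity_pct:.2f}%")
print(f"Positives = {positives_pct:.2f}%")
```

The same arithmetic applies to every HSP header on this page.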

BLAST of ClCG01G008820 vs. ExPASy TrEMBL
Match: A0A5D3DTQ8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G003260 PE=4 SV=1)

HSP 1 Score: 940.3 bits (2429), Expect = 4.1e-270
Identity = 475/533 (89.12%), Positives = 502/533 (94.18%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MALLRQKLS  V SSS+ SLCLSIRKLIS  VAD+LINDDATVNSIC+SFTRR++WD LS
Sbjct: 1   MALLRQKLSPIVLSSSKISLCLSIRKLISTPVADDLINDDATVNSICESFTRRQSWDALS 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKFQ L+LND LVQKVLLKFQQPVDAK ALGFFHWSAK KNFNHGSQSYGIMIHILVKAR
Sbjct: 61  RKFQFLELNDLLVQKVLLKFQQPVDAKLALGFFHWSAKRKNFNHGSQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALL+SILKKNEGNSFD+SVVDSLLDSY+VTGSSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLQSILKKNEGNSFDYSVVDSLLDSYKVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
           C CSHLEERGFSLSLISFNTLIHV+EKSDEN KVWKIYEQMI KRVYPNAITVRIMINSL
Sbjct: 181 CFCSHLEERGFSLSLISFNTLIHVLEKSDENRKVWKIYEQMIGKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQETSDMLNRIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNMVLDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAKVKTGSITS WEVFEEMSKRGF+ANSF+YTL IGVHCRGG++EEA+CLMQE
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSKRGFQANSFIYTLLIGVHCRGGEVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPY ETFNLLIEGCA SGHSEE LRMCEKML+RGFLPSCSVFN+ IAKICEEG 
Sbjct: 361 MENMGLKPYSETFNLLIEGCAISGHSEEILRMCEKMLERGFLPSCSVFNVAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K AN LLT+LLDKGFLPDETTYTNLI+GYRKSGEI EILKLYYEMEARLLSPG+SVFF 
Sbjct: 421 VKKANELLTILLDKGFLPDETTYTNLIIGYRKSGEILEILKLYYEMEARLLSPGISVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
           LI SLCQSG+LEEAEKY KI+KDSS+TPSL IYQ L LFYLKKGNRAKALEL+
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPSLSIYQALILFYLKKGNRAKALELY 533

BLAST of ClCG01G008820 vs. ExPASy TrEMBL
Match: A0A1S3CCN8 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103499327 PE=4 SV=1)

HSP 1 Score: 940.3 bits (2429), Expect = 4.1e-270
Identity = 475/533 (89.12%), Positives = 502/533 (94.18%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MALLRQKLS  V SSS+ SLCLSIRKLIS  VAD+LINDDATVNSIC+SFTRR++WD LS
Sbjct: 1   MALLRQKLSPIVLSSSKISLCLSIRKLISTPVADDLINDDATVNSICESFTRRQSWDALS 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKFQ L+LND LVQKVLLKFQQPVDAK ALGFFHWSAK KNFNHGSQSYGIMIHILVKAR
Sbjct: 61  RKFQFLELNDLLVQKVLLKFQQPVDAKLALGFFHWSAKRKNFNHGSQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALL+SILKKNEGNSFD+SVVDSLLDSY+VTGSSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLQSILKKNEGNSFDYSVVDSLLDSYKVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
           C CSHLEERGFSLSLISFNTLIHV+EKSDEN KVWKIYEQMI KRVYPNAITVRIMINSL
Sbjct: 181 CFCSHLEERGFSLSLISFNTLIHVLEKSDENRKVWKIYEQMIGKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQETSDMLNRIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNMVLDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAKVKTGSITS WEVFEEMSKRGF+ANSF+YTL IGVHCRGG++EEA+CLMQE
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSKRGFQANSFIYTLLIGVHCRGGEVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPY ETFNLLIEGCA SGHSEE LRMCEKML+RGFLPSCSVFN+ IAKICEEG 
Sbjct: 361 MENMGLKPYSETFNLLIEGCAISGHSEEILRMCEKMLERGFLPSCSVFNVAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K AN LLT+LLDKGFLPDETTYTNLI+GYRKSGEI EILKLYYEMEARLLSPG+SVFF 
Sbjct: 421 VKKANELLTILLDKGFLPDETTYTNLIIGYRKSGEILEILKLYYEMEARLLSPGISVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
           LI SLCQSG+LEEAEKY KI+KDSS+TPSL IYQ L LFYLKKGNRAKALEL+
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPSLSIYQALILFYLKKGNRAKALELY 533

BLAST of ClCG01G008820 vs. ExPASy TrEMBL
Match: A0A0A0KRP7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G423870 PE=4 SV=1)

HSP 1 Score: 934.9 bits (2415), Expect = 1.7e-268
Identity = 473/533 (88.74%), Positives = 502/533 (94.18%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MALLRQKLS  V  SS+ SLCLS+R LIS+ VAD+LINDDATVNSICDS TRR++WDTLS
Sbjct: 1   MALLRQKLSPIVL-SSKISLCLSMRNLISIPVADKLINDDATVNSICDSLTRRQSWDTLS 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKFQ L+LND+LVQKVLLKFQQPVDAKRALGFFHWSAK KNFNHG QS+GIMIHILVKAR
Sbjct: 61  RKFQFLELNDFLVQKVLLKFQQPVDAKRALGFFHWSAKRKNFNHGPQSFGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALLESILKKNEGNSFD+SVVDSL+DSYEVTGSSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLESILKKNEGNSFDYSVVDSLMDSYEVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
           CVCSHLEERGFSLSLISFNTLIHVVEKSDEN KVWKIYEQMIRKRVYPNAITVRIMINSL
Sbjct: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENLKVWKIYEQMIRKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQETSDMLNRIHGSRCSASLIVN CLIYRILEEGRVEDG+ LLKRMLQKNMVLDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGITLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAKVKTGSITS WEVFEEMS+RGF+ANSF+YTLFIGVHCRGGK+EEA+CLMQE
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSERGFQANSFIYTLFIGVHCRGGKVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPYPETFNLLIEGCA SGHSEE L MCEKML+RGFLPSCSVFN+ I KICE+G 
Sbjct: 361 MENMGLKPYPETFNLLIEGCAISGHSEEILSMCEKMLERGFLPSCSVFNVAIDKICEKGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K ANALLT+LLDKGFLPDETTYTNLI+GYRKSGEIQEILKLYYEM ARLLSPGVSVFF 
Sbjct: 421 VKKANALLTILLDKGFLPDETTYTNLIIGYRKSGEIQEILKLYYEMGARLLSPGVSVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
           LI SLCQSG+LEEAEKY KI+KDSS+TP L IYQ L L YLKKGNRAKALEL+
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPCLSIYQALILLYLKKGNRAKALELY 532

BLAST of ClCG01G008820 vs. ExPASy TrEMBL
Match: A0A6J1HAX5 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111461713 PE=4 SV=1)

HSP 1 Score: 879.4 bits (2271), Expect = 8.6e-252
Identity = 449/531 (84.56%), Positives = 485/531 (91.34%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MA LRQKLS+F+ SSS+ S C SIRKL S+A ADELIN+DA VNSICDSFTRRE+WDTL+
Sbjct: 1   MAFLRQKLSTFILSSSKISFCSSIRKLTSIAAADELINNDAIVNSICDSFTRRESWDTLT 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKF+ L+LND LVQKVLLKFQQ VDAKRALGFFHWSAK KNFNHG QSYGIMIHILVKAR
Sbjct: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LVIDARALLESILKKNEG+SF+FS+VDSLLD+YEVT SSPFVFDLLIQTCAKLRLIDFAL
Sbjct: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
            + +HLEERGFSLSLISFNTL+HVVEKSDEN KVWKIYEQMIRKRVYPN ITVRIMINSL
Sbjct: 181 NMSAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLI+YAK+K G+I SA EVF+EMSKRGF+ANSF+YTLFIG HCRGG+IEEA+CLM+E
Sbjct: 301 IAYSLILYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRGGRIEEAHCLMEE 360

Query: 361 MENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGA 420
           MENMGLKPYPETFNLLIEGC     SEESLRMCEKML+RGF+PSCS FN+ IAKICEEG 
Sbjct: 361 MENMGLKPYPETFNLLIEGCR---DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420

Query: 421 LKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFV 480
           +K AN +LT LLDKGFLPDETTY NLI+GY K GE QEILKLYYEM+ARLLSPGVSVFF 
Sbjct: 421 VKKANEMLTTLLDKGFLPDETTYINLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480

Query: 481 LIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALE 532
           LI S CQSGKLEEAEKY KIMKD SI PS+ IYQTL+LFYLKKGNRAKALE
Sbjct: 481 LIGSFCQSGKLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALE 528

BLAST of ClCG01G008820 vs. ExPASy TrEMBL
Match: A0A6J1JRQ4 (pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111487740 PE=4 SV=1)

HSP 1 Score: 869.0 bits (2244), Expect = 1.2e-248
Identity = 444/532 (83.46%), Positives = 485/532 (91.17%), Query Frame = 0

Query: 1   MALLRQKLSSFVFSSSRNSLCLSIRKLISVAVADELINDDATVNSICDSFTRRETWDTLS 60
           MA LR+KLS+F+ SSS+ S C SIRKL S+  A ELIN+DA VNSICDSFTRRE+WDTL+
Sbjct: 1   MAFLRRKLSTFILSSSKISFCSSIRKLNSITAAGELINNDAVVNSICDSFTRRESWDTLT 60

Query: 61  RKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKAR 120
           RKF+ L+LND LVQKVLLKFQQ VD+KRALGFFHWSAK KNFNHG QSYGIMIHILVKAR
Sbjct: 61  RKFETLELNDSLVQKVLLKFQQLVDSKRALGFFHWSAKRKNFNHGVQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFAL 180
           LV DARALLESILKKNEG+SF+FS+VDSLLD+YEVT SSPFVFDLLIQTCAKLRLIDFAL
Sbjct: 121 LVNDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180

Query: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSL 240
            +C+HLEERGFSLSLISFNTL+HVVEKSDEN KVWKIYEQMIRKRVYP+ ITVRIMINSL
Sbjct: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPDVITVRIMINSL 240

Query: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300

Query: 301 IAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQE 360
           IAYSLIVYAK+K G+I  A EVF+EMSKRGF+ANSF+YTLFIG HCR G+IEEA+CLM+E
Sbjct: 301 IAYSLIVYAKLKIGNIKCAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360

Query: 361 MENMGLKPYPETFNLLI-EGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEG 420
           MENMGLKPYPETFNLLI EGC   G SEESLR+CEKML+RGF+PSCS FN+ IAKICEEG
Sbjct: 361 MENMGLKPYPETFNLLIEEGC---GDSEESLRLCEKMLERGFVPSCSSFNVAIAKICEEG 420

Query: 421 ALKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFF 480
            +K AN +LT+LLDKGFLPDETTYTNLI+GY K GE QEILKLYYEM+ARLLSPGVSVFF
Sbjct: 421 DVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFF 480

Query: 481 VLIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALE 532
            LI S CQSGKLEEAEKY+KIMKD SI PS+ IYQTL+LFYLKKGNRAKALE
Sbjct: 481 ALIGSFCQSGKLEEAEKYFKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALE 529

BLAST of ClCG01G008820 vs. TAIR 10
Match: AT1G66345.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 484.6 bits (1246), Expect = 1.2e-136
Identity = 246/487 (50.51%), Positives = 334/487 (68.58%), Query Frame = 0

Query: 43  VNSICDSFTRRETWDTLSRKFQLLQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNF 102
           ++ I  S    +TW+TLS KF  + L+D L++ +LL+F+ P  AK+AL FFHWS+ ++N 
Sbjct: 50  IDYISKSLQSNDTWETLSTKFSSIDLSDSLIETILLRFKNPETAKQALSFFHWSSHTRNL 109

Query: 103 NHGSQSYGIMIHILVKARLVIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFV 162
            HG +SY + IHILVKARL+IDARAL+ES L  +  +S    +VDSLLD+YE++ S+P V
Sbjct: 110 RHGIKSYALTIHILVKARLLIDARALIESSLLNSPPDS---DLVDSLLDTYEISSSTPLV 169

Query: 163 FDLLIQTCAKLRLIDFALCVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMI 222
           FDLL+Q  AK+R ++    V   L + GF+LS+I+ NTLIH   KS  +  VW+IYE  I
Sbjct: 170 FDLLVQCYAKIRYLELGFDVFKRLCDCGFTLSVITLNTLIHYSSKSKIDDLVWRIYECAI 229

Query: 223 RKRVYPNAITVRIMINSLCKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVE 282
            KR+YPN IT+RIMI  LCKEG+L+E  D+L+RI G RC  S+IVNT L++R+LEE R+E
Sbjct: 230 DKRIYPNEITIRIMIQVLCKEGRLKEVVDLLDRICGKRCLPSVIVNTSLVFRVLEEMRIE 289

Query: 283 DGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFI 342
           + + LLKR+L KNMV+D I YS++VYAK K G + SA +VF+EM +RGF ANSF+YT+F+
Sbjct: 290 ESMSLLKRLLMKNMVVDTIGYSIVVYAKAKEGDLVSARKVFDEMLQRGFSANSFVYTVFV 349

Query: 343 GVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFL 402
            V C  G ++EA  L+ EME  G+ PY ETFN LI G A  G  E+ L  CE M+ RG +
Sbjct: 350 RVCCEKGDVKEAERLLSEMEESGVSPYDETFNCLIGGFARFGWEEKGLEYCEVMVTRGLM 409

Query: 403 PSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKL 462
           PSCS FN ++  + +   +  AN +LT  +DKGF+PDE TY++LI G+ +  +I + LKL
Sbjct: 410 PSCSAFNEMVKSVSKIENVNRANEILTKSIDKGFVPDEHTYSHLIRGFIEGNDIDQALKL 469

Query: 463 YYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLK 522
           +YEME R +SPG  VF  LI  LC  GK+E  EKY KIMK   I P+  IY  L   + K
Sbjct: 470 FYEMEYRKMSPGFEVFRSLIVGLCTCGKVEAGEKYLKIMKKRLIEPNADIYDALIKAFQK 529

Query: 523 KGNRAKA 530
            G++  A
Sbjct: 530 IGDKTNA 533

BLAST of ClCG01G008820 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 188.3 bits (477), Expect = 1.8e-47
Identity = 120/461 (26.03%), Positives = 227/461 (49.24%), Query Frame = 0

Query: 84  VDAKRALGFFHWSAKSKNF--NHGSQSYGIMIHILVKARLVIDARALLESILKKNEGNSF 143
           V  K AL F  W  K      +H  Q   I  HILV+AR+   AR +L+ +   +  +SF
Sbjct: 88  VHGKLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF 147

Query: 144 DFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSHLEERGFSLSLISFNTL 203
            F    +L+ +Y +  S+P V+D+LI+   +  +I  +L +   +   GF+ S+ + N +
Sbjct: 148 VFG---ALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAI 207

Query: 204 IHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSLCKEGKLQETSDMLNRIHGSRC 263
           +  V KS E+  VW   ++M+++++ P+  T  I+IN LC EG  +++S ++ ++  S  
Sbjct: 208 LGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGY 267

Query: 264 SASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWE 323
           + +++    +++   ++GR +  + LL  M  K +  D   Y+++++   ++  I   + 
Sbjct: 268 APTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYL 327

Query: 324 VFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCA 383
           +  +M KR    N   Y   I      GK+  A+ L+ EM + GL P   TFN LI+G  
Sbjct: 328 LLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHI 387

Query: 384 TSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDET 443
           + G+ +E+L+M   M  +G  PS   + +++  +C+      A      +   G      
Sbjct: 388 SEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRI 447

Query: 444 TYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIM 503
           TYT +I G  K+G + E + L  EM    + P +  +  LI   C+ G+ + A++    +
Sbjct: 448 TYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRI 507

Query: 504 KDSSITPSLPIYQTLTLFYLKKGNRAKALELFQHELKRSYT 543
               ++P+  IY TL     + G   +A+ +++  +   +T
Sbjct: 508 YRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHT 545

BLAST of ClCG01G008820 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 181.8 bits (460), Expect = 1.6e-45
Identity = 114/464 (24.57%), Positives = 226/464 (48.71%), Query Frame = 0

Query: 76  VLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKARLVIDARALLESILKK 135
           +LLK Q   D    L F +W+   + F    +   I +HIL K +L   A+ L E +  K
Sbjct: 54  LLLKSQN--DQALILKFLNWANPHQFFTLRCKC--ITLHILTKFKLYKTAQILAEDVAAK 113

Query: 136 NEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSHLEERGFSLSL 195
              + +   V  SL ++Y++  S+  VFDL++++ ++L LID AL +    +  GF   +
Sbjct: 114 TLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGV 173

Query: 196 ISFNTLIHVVEKSDENHK-VWKIYEQMIRKRVYPNAITVRIMINSLCKEGKLQETSDMLN 255
           +S+N ++    +S  N      ++++M+  +V PN  T  I+I   C  G +     + +
Sbjct: 174 LSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD 233

Query: 256 RIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTG 315
           ++    C  +++    LI    +  +++DG  LL+ M  K +  + I+Y++++    + G
Sbjct: 234 KMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 293

Query: 316 SITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFN 375
            +     V  EM++RG+  +   Y   I  +C+ G   +A  +  EM   GL P   T+ 
Sbjct: 294 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYT 353

Query: 376 LLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDK 435
            LI     +G+   ++   ++M  RG  P+   +  ++    ++G +  A  +L  + D 
Sbjct: 354 SLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDN 413

Query: 436 GFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEA 495
           GF P   TY  LI G+  +G++++ + +  +M+ + LSP V  +  ++   C+S  ++EA
Sbjct: 414 GFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEA 473

Query: 496 EKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELFQHELK 539
            +  + M +  I P    Y +L   + ++    +A +L++  L+
Sbjct: 474 LRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLR 513

BLAST of ClCG01G008820 vs. TAIR 10
Match: AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 180.6 bits (457), Expect = 3.7e-45
Identity = 109/414 (26.33%), Positives = 208/414 (50.24%), Query Frame = 0

Query: 122 VIDARALLESILKKNEGNSFDFSVVDSLLDSYEVTG--SSPFVFDLLIQTCAKLRLIDFA 181
           VID   L  +I K  +     + +V +L    E  G   S +   ++I    + R + +A
Sbjct: 88  VIDFNRLFSAIAKTKQ-----YELVLALCKQMESKGIAHSIYTLSIMINCFCRCRKLSYA 147

Query: 182 LCVCSHLEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINS 241
                 + + G+    + FNTL++ +       +  ++ ++M+     P  IT+  ++N 
Sbjct: 148 FSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNG 207

Query: 242 LCKEGKLQETSDMLNRIHGSRCSASLIVNTCLIYRILEEGRVEDGVMLLKRMLQKNMVLD 301
           LC  GK+ +   +++R+  +    + +    ++  + + G+    + LL++M ++N+ LD
Sbjct: 208 LCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLD 267

Query: 302 DIAYSLIVYAKVKTGSITSAWEVFEEMSKRGFRANSFLYTLFIGVHCRGGKIEEANCLMQ 361
            + YS+I+    K GS+ +A+ +F EM  +GF+A+   Y   IG  C  G+ ++   L++
Sbjct: 268 AVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLR 327

Query: 362 EMENMGLKPYPETFNLLIEGCATSGHSEESLRMCEKMLDRGFLPSCSVFNMVIAKICEEG 421
           +M    + P   TF++LI+     G   E+ ++ ++M+ RG  P+   +N +I   C+E 
Sbjct: 328 DMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKEN 387

Query: 422 ALKNANALLTMLLDKGFLPDETTYTNLILGYRKSGEIQEILKLYYEMEARLLSPGVSVFF 481
            L+ A  ++ +++ KG  PD  T+  LI GY K+  I + L+L+ EM  R +      + 
Sbjct: 388 RLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYN 447

Query: 482 VLIESLCQSGKLEEAEKYWKIMKDSSITPSLPIYQTLTLFYLKKGNRAKALELF 534
            L++  CQSGKLE A+K ++ M    + P +  Y+ L       G   KALE+F
Sbjct: 448 TLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIF 496

BLAST of ClCG01G008820 vs. TAIR 10
Match: AT1G19290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 178.3 bits (451), Expect = 1.8e-44
Identity = 126/506 (24.90%), Positives = 240/506 (47.43%), Query Frame = 0

Query: 66  LQLNDYLVQKVLLKFQQPVDAKRALGFFHWSAKSKNFNHGSQSYGIMIHILVKARLVIDA 125
           L  +D L+  +L + +  ++ +  L  F+ ++K + F    ++Y  M+HIL +AR     
Sbjct: 66  LDFSDELLNSILRRLR--LNPEACLEIFNLASKQQKFRPDYKAYCKMVHILSRARNYQQT 125

Query: 126 RALLESILKKNEGNSFDFSVVDSLLDSYEVTGSSPFVFDLLIQTCAKLRLIDFALCVCSH 185
           ++ L  ++  N      F V   L+  ++    SP VFD++++  A+  L+  AL V  +
Sbjct: 126 KSYLCELVALNHSG---FVVWGELVRVFKEFSFSPTVFDMILKVYAEKGLVKNALHVFDN 185

Query: 186 LEERGFSLSLISFNTLIHVVEKSDENHKVWKIYEQMIRKRVYPNAITVRIMINSLCKEGK 245
           +   G   SL+S N+L+  + +  EN     +Y+QMI   V P+  T  I++N+ C+ G 
Sbjct: 186 MGNYGRIPSLLSCNSLLSNLVRKGENFVALHVYDQMISFEVSPDVFTCSIVVNAYCRSGN 245

Query: 246 L-------QETSDMLN----------------------------RIHGSR-CSASLIVNT 305
           +       +ET   L                             R+   R  S +++  T
Sbjct: 246 VDKAMVFAKETESSLGLELNVVTYNSLINGYAMIGDVEGMTRVLRLMSERGVSRNVVTYT 305

Query: 306 CLIYRILEEGRVEDGVMLLKRMLQKNMVLDDIAYSLIVYAKVKTGSITSAWEVFEEMSKR 365
            LI    ++G +E+   + + + +K +V D   Y +++    +TG I  A  V + M + 
Sbjct: 306 SLIKGYCKKGLMEEAEHVFELLKEKKLVADQHMYGVLMDGYCRTGQIRDAVRVHDNMIEI 365

Query: 366 GFRANSFLYTLFIGVHCRGGKIEEANCLMQEMENMGLKPYPETFNLLIEGCATSGHSEES 425
           G R N+ +    I  +C+ G++ EA  +   M +  LKP   T+N L++G   +G+ +E+
Sbjct: 366 GVRTNTTICNSLINGYCKSGQLVEAEQIFSRMNDWSLKPDHHTYNTLVDGYCRAGYVDEA 425

Query: 426 LRMCEKMLDRGFLPSCSVFNMVIAKICEEGALKNANALLTMLLDKGFLPDETTYTNLILG 485
           L++C++M  +  +P+   +N+++      GA  +  +L  M+L +G   DE + + L+  
Sbjct: 426 LKLCDQMCQKEVVPTVMTYNILLKGYSRIGAFHDVLSLWKMMLKRGVNADEISCSTLLEA 485

Query: 486 YRKSGEIQEILKLYYEMEARLLSPGVSVFFVLIESLCQSGKLEEAEKYWKIMKDSSITPS 536
             K G+  E +KL+  + AR L        V+I  LC+  K+ EA++    +      P+
Sbjct: 486 LFKLGDFNEAMKLWENVLARGLLTDTITLNVMISGLCKMEKVNEAKEILDNVNIFRCKPA 545

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038875410.1  5.1e-275  91.18         pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Benincasa ...
XP_008460528.1  8.5e-270  89.12         PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial ...
XP_031740872.1  3.6e-268  88.74         pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucumis sa...
XP_023550461.1  4.7e-252  84.56         pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucurbita ...
XP_022961108.1  1.8e-251  84.56         pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucurbita ...

Match Name  E-value   Identity (%)  Description
Q3ECH5      1.7e-135  50.51         Pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Arabidop...
Q9LVQ5      2.5e-46   26.03         Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
Q9FIX3      2.3e-44   24.57         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9LPX2      5.1e-44   26.33         Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop...
Q9LN69      2.6e-43   24.90         Putative pentatricopeptide repeat-containing protein At1g19290 OS=Arabidopsis th...

Match Name  E-value   Identity (%)  Description
A0A5D3DTQ8  4.1e-270  89.12         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CCN8  4.1e-270  89.12         pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Cucumis ...
A0A0A0KRP7  1.7e-268  88.74         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G423870 PE=4 SV=1
A0A6J1HAX5  8.6e-252  84.56         pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Cucurbit...
A0A6J1JRQ4  1.2e-248  83.46         pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Cucurbit...

Match Name   E-value   Identity (%)  Description
AT1G66345.1  1.2e-136  50.51         Pentatricopeptide repeat (PPR) superfamily protein
AT5G55840.1  1.8e-47   26.03         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  1.6e-45   24.57         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G12775.1  3.7e-45   26.33         Pentatricopeptide repeat (PPR) superfamily protein
AT1G19290.1  1.8e-44   24.90         Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term  IPR Description  Source  Source Term  Source Description
    Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 196..242  e-value: 1.5E-10  score: 41.1
    coord: 334..381  e-value: 1.5E-8   score: 34.7
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 442..468  e-value: 4.1E-4   score: 20.4
    coord: 108..136  e-value: 0.27     score: 11.6
    coord: 480..505  e-value: 4.4E-4   score: 20.3
    coord: 302..331  e-value: 0.059    score: 13.6
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 302..334  e-value: 0.001    score: 17.1
    coord: 372..405  e-value: 1.5E-5   score: 22.8
    coord: 481..509  e-value: 4.3E-5   score: 21.4
    coord: 232..263  e-value: 9.5E-4   score: 17.2
    coord: 337..368  e-value: 7.2E-5   score: 20.7
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 474..508  score: 10.468099
    coord: 439..473  score: 10.413293
    coord: 369..403  score: 10.731171
    coord: 194..228  score: 8.933517
    coord: 334..368  score: 11.202506
    coord: 299..333  score: 9.96388
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 41..156   e-value: 8.0E-6   score: 27.3
    coord: 157..255  e-value: 5.8E-16  score: 60.3
    coord: 309..421  e-value: 2.4E-26  score: 94.2
    coord: 422..593  e-value: 3.3E-23  score: 84.5
None  No IPR available  PANTHER  PTHR45613:SF248  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN MITOCHONDRIAL
    coord: 1..542
None  No IPR available  PANTHER  PTHR45613  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 1..542
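
The six PROSITE PS51375 (PPR) hits listed above can be checked for the expected 35-residue motif length and grouped into tandem arrays. The following is a minimal sketch that treats the coordinates as 1-based inclusive intervals, which is an assumption about the table's convention; `tandem_runs` is an illustrative helper, not part of any annotation pipeline.

```python
# PROSITE PS51375 (PPR) coordinates copied from the InterPro table.
ppr_hits = [(474, 508), (439, 473), (369, 403), (194, 228), (334, 368), (299, 333)]


def tandem_runs(hits):
    """Group intervals into runs where each one starts right after the previous ends."""
    runs = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))  # directly adjacent: extend the current run
        else:
            runs.append([(start, end)])    # gap before this hit: start a new run
    return runs


# Inclusive coordinates, so length is end - start + 1.
lengths = {end - start + 1 for start, end in ppr_hits}
runs = tandem_runs(ppr_hits)

print("motif lengths:", lengths)
for run in runs:
    print(f"{len(run)} repeat(s): {run[0][0]}..{run[-1][1]}")
```

On these coordinates, every hit spans 35 residues and the hits fall into one isolated repeat (194..228), a run of three (299..403) and a run of two (439..508).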

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G008820.1  ClCG01G008820.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006396      RNA processing
molecular_function  GO:0005515      protein binding