Cla97C04G076940 (gene) Watermelon (97103) v2.5

Overview
NameCla97C04G076940
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr04: 24481354 .. 24483195 (+)
RNA-Seq ExpressionCla97C04G076940
SyntenyCla97C04G076940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACGCTCTCTCAACGCCATGGAACACTCAACTCAGAGAATTAGCAAAACGATGTCAATTTCTTCAAGCTCTAAGTCTCTATCCCCAACTACTTCGCCATGGTGGGCGCCCCAATGCCTTCACTTTCCCGTTTGCTCTCAAATCCTGCGCGGCCCTTTCCCTCCCTATACTCGGCGAACAATTCCATGGTCAAATTATCAAAGTTGGGTGTGAATTCGAACCTTTTGTGCAAACTGGTTTGATTTCTATGTATTGCAGAGGCACTTTGGTCGAGAATGCCCGTAAAGTGTTCGATGAGAATTCCCAGTCCAGAAAGCTTAGTGTTTGCTACAATGCTTTGATTTCCGGCTATGTTTCGAATTCAAAATGTTCTGACGCGGTTCTTTTGTTTCGCCAAATGAATGAAGAGGGTGTCCCTGTTAATTCAGTTACGTTGCTGGGTTTGATCCCAGTTTGTGTTTCTCCGATTAATTTAGAGCTTGGATCGTCTCTACATGGCTCCACATTGAAATATGGATTGGATTCAGATGTCTCTGTTGTTAACTGCTTTATTACTATGTACATGAAATGTGGCTCGGTTGATTATGCACAGAAGCTGTTTGATGAAATGCCTGTGAAGGGTTTGATCTCTTGGAACGCTATGGTTTCTGGGTACGCGCAAAATGGGATGGCAACTAATGTTTTGGAGCTCTATCGTAACATGGATATGCATGGGGTTCGCCCGGATCCTGTAACTCATGTTGGGGTTTTATCATCTTGCGCTAACCTTGGGGCTCAGAGTGTTGGCCATGAGGTAGAATTAAAGATTCAAGCAAGTGGGTTTAACAATAATCCGTTTCTGAATAATGCTTTGATCAATATGTACGCAAGGTGTGGCAATTTAACAAAGGCACAAGCCGTGTTTGATGAAATGCCTGAGAGAACATTAGTTTCATGGACAGCAATTATAGGTGGCTATGGAATGCATGGACATGGAGAAATTGCAGTGCAGCTTTTCGAAGAGATGATAAGGAGTGGCATTGTACCTGATGGAACTGCATTTGTGAGTGTCTTGTCTGCCTGTAGCCATGCAGGGCTCACTGATCAGGGCTTGGAATACTTCAAAATGATGACGAGAAAATATCAATTGCAACCAGGTCCAGAGCATTATTCGTGTATCGTGGATCTTCTGGGGCGAGCAGGGCGGCTTAACGAAGCTCGAAATCTCATTGAATCCATGCCAATAAAGCCTGATGGTGCCGTCTGGGGAGCTCTTCTGGGTGCTTGTAAGATCCACAAGAATGTTGAGTTAGCAGAGTTGGCTTTTGAACATGTGATCGAGCTTGAACCTGCAAACATAGGATACTATGTCTTATTGTCAAACATTTACTATGATGCCAAGAACTCAAAAGGGGTTTTGAGGATCCGGTTTATGATGAAGCAGAGAAAGTTGAAGAAGGACCCTGGATGTAGCTATGTTGAATTGAAGGGCAGAGTTCATCCATTTATAGTTGGGGATAGAAACCATCCTCAGGCTGAAGAGATATATAGAGTTTTGGAAGAGTTAGAAGCGCTAGTACAGAAATTTGGAGAGCCTAAGAAGGATGATAGAGATGAAAGCAACAAAGATTTGTTAACTGGAGTTGGAGTTCATAGCGAAAAATTGGCTGTTGCTTTTGGACTCCTGAATACCATGGCTGGGACCGAAGTTGTGATCATAAAAAATCTTAGGATATGTGAAGATTGTCACTTGTTTTTCAAGATGGTTAGCAAAATTGTCCATCGTCAGTTAACTGTTAGAGATCCTACTCGCTTCCACCATTTTAAAAATGGGAGCTGTTCTTGTAAAGATTATTGGTAG

mRNA sequence

ATGAACGCTCTCTCAACGCCATGGAACACTCAACTCAGAGAATTAGCAAAACGATGTCAATTTCTTCAAGCTCTAAGTCTCTATCCCCAACTACTTCGCCATGGTGGGCGCCCCAATGCCTTCACTTTCCCGTTTGCTCTCAAATCCTGCGCGGCCCTTTCCCTCCCTATACTCGGCGAACAATTCCATGGTCAAATTATCAAAGTTGGGTGTGAATTCGAACCTTTTGTGCAAACTGGTTTGATTTCTATGTATTGCAGAGGCACTTTGGTCGAGAATGCCCGTAAAGTGTTCGATGAGAATTCCCAGTCCAGAAAGCTTAGTGTTTGCTACAATGCTTTGATTTCCGGCTATGTTTCGAATTCAAAATGTTCTGACGCGGTTCTTTTGTTTCGCCAAATGAATGAAGAGGGTGTCCCTGTTAATTCAGTTACGTTGCTGGGTTTGATCCCAGTTTGTGTTTCTCCGATTAATTTAGAGCTTGGATCGTCTCTACATGGCTCCACATTGAAATATGGATTGGATTCAGATGTCTCTGTTGTTAACTGCTTTATTACTATGTACATGAAATGTGGCTCGGTTGATTATGCACAGAAGCTGTTTGATGAAATGCCTGTGAAGGGTTTGATCTCTTGGAACGCTATGGTTTCTGGGTACGCGCAAAATGGGATGGCAACTAATGTTTTGGAGCTCTATCGTAACATGGATATGCATGGGGTTCGCCCGGATCCTGTAACTCATGTTGGGGTTTTATCATCTTGCGCTAACCTTGGGGCTCAGAGTGTTGGCCATGAGGTAGAATTAAAGATTCAAGCAAGTGGGTTTAACAATAATCCGTTTCTGAATAATGCTTTGATCAATATGTACGCAAGGTGTGGCAATTTAACAAAGGCACAAGCCGTGTTTGATGAAATGCCTGAGAGAACATTAGTTTCATGGACAGCAATTATAGGTGGCTATGGAATGCATGGACATGGAGAAATTGCAGTGCAGCTTTTCGAAGAGATGATAAGGAGTGGCATTGTACCTGATGGAACTGCATTTGTGAGTGTCTTGTCTGCCTGTAGCCATGCAGGGCTCACTGATCAGGGCTTGGAATACTTCAAAATGATGACGAGAAAATATCAATTGCAACCAGGTCCAGAGCATTATTCGTGTATCGTGGATCTTCTGGGGCGAGCAGGGCGGCTTAACGAAGCTCGAAATCTCATTGAATCCATGCCAATAAAGCCTGATGGTGCCGTCTGGGGAGCTCTTCTGGGTGCTTGTAAGATCCACAAGAATGTTGAGTTAGCAGAGTTGGCTTTTGAACATGTGATCGAGCTTGAACCTGCAAACATAGGATACTATGTCTTATTGTCAAACATTTACTATGATGCCAAGAACTCAAAAGGGGTTTTGAGGATCCGGTTTATGATGAAGCAGAGAAAGTTGAAGAAGGACCCTGGATGTAGCTATGTTGAATTGAAGGGCAGAGTTCATCCATTTATAGTTGGGGATAGAAACCATCCTCAGGCTGAAGAGATATATAGAGTTTTGGAAGAGTTAGAAGCGCTAGTACAGAAATTTGGAGAGCCTAAGAAGGATGATAGAGATGAAAGCAACAAAGATTTGTTAACTGGAGTTGGAGTTCATAGCGAAAAATTGGCTGTTGCTTTTGGACTCCTGAATACCATGGCTGGGACCGAAGTTGTGATCATAAAAAATCTTAGGATATGTGAAGATTGTCACTTGTTTTTCAAGATGGTTAGCAAAATTGTCCATCGTCAGTTAACTGTTAGAGATCCTACTCGCTTCCACCATTTTAAAAATGGGAGCTGTTCTTGTAAAGATTATTGGTAG

Coding sequence (CDS)

ATGAACGCTCTCTCAACGCCATGGAACACTCAACTCAGAGAATTAGCAAAACGATGTCAATTTCTTCAAGCTCTAAGTCTCTATCCCCAACTACTTCGCCATGGTGGGCGCCCCAATGCCTTCACTTTCCCGTTTGCTCTCAAATCCTGCGCGGCCCTTTCCCTCCCTATACTCGGCGAACAATTCCATGGTCAAATTATCAAAGTTGGGTGTGAATTCGAACCTTTTGTGCAAACTGGTTTGATTTCTATGTATTGCAGAGGCACTTTGGTCGAGAATGCCCGTAAAGTGTTCGATGAGAATTCCCAGTCCAGAAAGCTTAGTGTTTGCTACAATGCTTTGATTTCCGGCTATGTTTCGAATTCAAAATGTTCTGACGCGGTTCTTTTGTTTCGCCAAATGAATGAAGAGGGTGTCCCTGTTAATTCAGTTACGTTGCTGGGTTTGATCCCAGTTTGTGTTTCTCCGATTAATTTAGAGCTTGGATCGTCTCTACATGGCTCCACATTGAAATATGGATTGGATTCAGATGTCTCTGTTGTTAACTGCTTTATTACTATGTACATGAAATGTGGCTCGGTTGATTATGCACAGAAGCTGTTTGATGAAATGCCTGTGAAGGGTTTGATCTCTTGGAACGCTATGGTTTCTGGGTACGCGCAAAATGGGATGGCAACTAATGTTTTGGAGCTCTATCGTAACATGGATATGCATGGGGTTCGCCCGGATCCTGTAACTCATGTTGGGGTTTTATCATCTTGCGCTAACCTTGGGGCTCAGAGTGTTGGCCATGAGGTAGAATTAAAGATTCAAGCAAGTGGGTTTAACAATAATCCGTTTCTGAATAATGCTTTGATCAATATGTACGCAAGGTGTGGCAATTTAACAAAGGCACAAGCCGTGTTTGATGAAATGCCTGAGAGAACATTAGTTTCATGGACAGCAATTATAGGTGGCTATGGAATGCATGGACATGGAGAAATTGCAGTGCAGCTTTTCGAAGAGATGATAAGGAGTGGCATTGTACCTGATGGAACTGCATTTGTGAGTGTCTTGTCTGCCTGTAGCCATGCAGGGCTCACTGATCAGGGCTTGGAATACTTCAAAATGATGACGAGAAAATATCAATTGCAACCAGGTCCAGAGCATTATTCGTGTATCGTGGATCTTCTGGGGCGAGCAGGGCGGCTTAACGAAGCTCGAAATCTCATTGAATCCATGCCAATAAAGCCTGATGGTGCCGTCTGGGGAGCTCTTCTGGGTGCTTGTAAGATCCACAAGAATGTTGAGTTAGCAGAGTTGGCTTTTGAACATGTGATCGAGCTTGAACCTGCAAACATAGGATACTATGTCTTATTGTCAAACATTTACTATGATGCCAAGAACTCAAAAGGGGTTTTGAGGATCCGGTTTATGATGAAGCAGAGAAAGTTGAAGAAGGACCCTGGATGTAGCTATGTTGAATTGAAGGGCAGAGTTCATCCATTTATAGTTGGGGATAGAAACCATCCTCAGGCTGAAGAGATATATAGAGTTTTGGAAGAGTTAGAAGCGCTAGTACAGAAATTTGGAGAGCCTAAGAAGGATGATAGAGATGAAAGCAACAAAGATTTGTTAACTGGAGTTGGAGTTCATAGCGAAAAATTGGCTGTTGCTTTTGGACTCCTGAATACCATGGCTGGGACCGAAGTTGTGATCATAAAAAATCTTAGGATATGTGAAGATTGTCACTTGTTTTTCAAGATGGTTAGCAAAATTGTCCATCGTCAGTTAACTGTTAGAGATCCTACTCGCTTCCACCATTTTAAAAATGGGAGCTGTTCTTGTAAAGATTATTGGTAG

Protein sequence

MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLTGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFKNGSCSCKDYW
Homology
BLAST of Cla97C04G076940 vs. NCBI nr
Match: XP_038882846.1 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [Benincasa hispida])

HSP 1 Score: 1157.1 bits (2992), Expect = 0.0e+00
Identity = 561/613 (91.52%), Postives = 581/613 (94.78%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQ+RELAKRCQFLQ L LYPQ+LRHG RPNAFTFPFALKSCAALSLP LGE
Sbjct: 1   MNALSTPWNTQIRELAKRCQFLQVLILYPQMLRHGDRPNAFTFPFALKSCAALSLPRLGE 60

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGCEFEPFVQTGLISMYCRG+ VENARKVF+ENS S+ L+VCYNALISGYVS
Sbjct: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGSSVENARKVFNENSHSKMLTVCYNALISGYVS 120

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCSDA+LLFRQMNEE VPVNSVTLLGLIPVCVSPINLELG SLHG TLKYGLD DVSV
Sbjct: 121 NSKCSDAILLFRQMNEESVPVNSVTLLGLIPVCVSPINLELGLSLHGFTLKYGLDLDVSV 180

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELY NMD HGV
Sbjct: 181 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYHNMDKHGV 240

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
            PDPVT VGVLSSCANLGAQ VGHEVE KIQASGF NNPFLNNALINMYARCGNLTKAQA
Sbjct: 241 HPDPVTLVGVLSSCANLGAQGVGHEVEFKIQASGFTNNPFLNNALINMYARCGNLTKAQA 300

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL
Sbjct: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKM+ R YQL+PGPEHYSC+VDLLGRAGRLNEARNLIESMPIKPDGAVWGALL
Sbjct: 361 TDQGLEYFKMIKRNYQLEPGPEHYSCMVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIELEP NIGYYVLLSNIY +  NSKGVLRIR MMK+RKLKK
Sbjct: 421 GACKIHKNVELAELAFERVIELEPENIGYYVLLSNIYSNTMNSKGVLRIRIMMKERKLKK 480

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLT 540
           +PGCSYVELKGRVHPF+VGDRNHPQAEEIYRVLEELEALVQ+FGEPKKD  +ES  + +T
Sbjct: 481 NPGCSYVELKGRVHPFVVGDRNHPQAEEIYRVLEELEALVQEFGEPKKDYGEESKGEFIT 540

Query: 541 GVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFH 600
           GVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRD TRFH
Sbjct: 541 GVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDATRFH 600

Query: 601 HFKNGSCSCKDYW 614
           HF+NGSCSCKDYW
Sbjct: 601 HFRNGSCSCKDYW 613

BLAST of Cla97C04G076940 vs. NCBI nr
Match: XP_008462579.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Cucumis melo])

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 551/614 (89.74%), Postives = 578/614 (94.14%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQLRELAKRCQFLQALSLYPQ+LRHG RPNAFTFPFALKSCAALS PILG 
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSHPILGG 69

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGC FEPFVQTGLISMYC+G+LVENARKVFDEN  SRKL+VCYNALISGY S
Sbjct: 70  QFHGQIIKVGCIFEPFVQTGLISMYCKGSLVENARKVFDENFHSRKLTVCYNALISGYAS 129

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCSDAVLLFRQMNEEG+PVNSVTLLGLIP CVSPINLELGSSLH STLKYG DS+VSV
Sbjct: 130 NSKCSDAVLLFRQMNEEGIPVNSVTLLGLIPACVSPINLELGSSLHCSTLKYGFDSEVSV 189

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNG+ATNVLELYRNMDM+GV
Sbjct: 190 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGLATNVLELYRNMDMNGV 249

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
           RPDP+T VGVLSSCANLGAQSVGH VE KIQASGF NNPFLNNALINMYARCGNLTKAQ+
Sbjct: 250 RPDPITLVGVLSSCANLGAQSVGHAVEFKIQASGFTNNPFLNNALINMYARCGNLTKAQS 309

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFD MPERTLVSWTAIIGGYGMHGHGEIAVQLF+EMIRSGI PDGTAFV VLSACSHAGL
Sbjct: 310 VFDGMPERTLVSWTAIIGGYGMHGHGEIAVQLFKEMIRSGIEPDGTAFVCVLSACSHAGL 369

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKMM R Y+L+PG EHYSC+VDLLGRAGRL EA+NLIESMPIKPDGAVWGALL
Sbjct: 370 TDQGLEYFKMMKRNYRLEPGQEHYSCMVDLLGRAGRLKEAQNLIESMPIKPDGAVWGALL 429

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIE EP NIGYYVLLSNIY DA NSKGVLRIR MMK++KLKK
Sbjct: 430 GACKIHKNVELAELAFERVIEHEPENIGYYVLLSNIYSDANNSKGVLRIRIMMKEKKLKK 489

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALV-QKFGEPKKDDRDESNKDLL 540
           DPGCSYVELKGRVHPFIVGDRNHPQ++EIYRVLEELEA++ Q+FG+PKKD+R+ESNKD  
Sbjct: 490 DPGCSYVELKGRVHPFIVGDRNHPQSDEIYRVLEELEAIIMQEFGKPKKDNREESNKDFF 549

Query: 541 TGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           TGVGVHSEKLAVAFGLLNT  G EVVIIKNLRICEDCHLFFKMVSKI  RQLTVRD TRF
Sbjct: 550 TGVGVHSEKLAVAFGLLNTTTGAEVVIIKNLRICEDCHLFFKMVSKIADRQLTVRDATRF 609

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 610 HHFRNGSCSCKDYW 623

BLAST of Cla97C04G076940 vs. NCBI nr
Match: XP_023544808.1 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 550/613 (89.72%), Postives = 579/613 (94.45%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           M ALSTPWNTQLRELAKRCQFLQALSLY Q+LRHG  PNAFTFPFALKSCAALSLPILG 
Sbjct: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGCE EPFVQTGLISMYCRG+L+ NARKVFDE SQSRKL+VCYNALISGYVS
Sbjct: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSK SDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELG SLH STLKYGLDSDVSV
Sbjct: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGLSLHCSTLKYGLDSDVSV 180

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV++AQ LFD+MP KGLISWNAMVSGYAQNG+ATNVLELY NM++HG+
Sbjct: 181 VNCFITMYMKCGSVNHAQNLFDKMPEKGLISWNAMVSGYAQNGLATNVLELYHNMELHGI 240

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
            PDP T VGVLSSCANLGAQSVG EVELKIQASGF NN FLNNALINMYARCGNLTKAQA
Sbjct: 241 HPDPFTLVGVLSSCANLGAQSVGREVELKIQASGFTNNQFLNNALINMYARCGNLTKAQA 300

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           +FDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFE+MIRSGIVPDGTAFVSVLSACSHAGL
Sbjct: 301 LFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEDMIRSGIVPDGTAFVSVLSACSHAGL 360

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           T QG+EYFKMM R YQL+PGPEHYSC+VDLLGRAGRLNEARNLIESMPI+PDGAVWGALL
Sbjct: 361 TSQGMEYFKMMGRNYQLEPGPEHYSCMVDLLGRAGRLNEARNLIESMPIEPDGAVWGALL 420

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIH+NV+LAELAFE V+ELEPANIGYYVLLSNIY D KNSKGVLRIR MMK+RKLKK
Sbjct: 421 GACKIHQNVKLAELAFERVVELEPANIGYYVLLSNIYNDTKNSKGVLRIRIMMKERKLKK 480

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLT 540
           DPGCSYVELKGRVHPF+VGDR+HPQAEEIYRVLEELEALV +FGE K+ DR+ESNKDL T
Sbjct: 481 DPGCSYVELKGRVHPFVVGDRSHPQAEEIYRVLEELEALVHEFGEAKRADREESNKDLFT 540

Query: 541 GVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFH 600
           G GVHSEKLAVAFGLLNT AGTEVV+IKNLRICEDCHLFFK+VSKIVHRQLTVRD TRFH
Sbjct: 541 GAGVHSEKLAVAFGLLNTTAGTEVVVIKNLRICEDCHLFFKIVSKIVHRQLTVRDATRFH 600

Query: 601 HFKNGSCSCKDYW 614
           HF+NGSCSCKDYW
Sbjct: 601 HFRNGSCSCKDYW 613

BLAST of Cla97C04G076940 vs. NCBI nr
Match: XP_004143385.1 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [Cucumis sativus] >KGN48268.1 hypothetical protein Csa_003516 [Cucumis sativus])

HSP 1 Score: 1135.2 bits (2935), Expect = 0.0e+00
Identity = 551/614 (89.74%), Postives = 580/614 (94.46%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQLRELAKRCQFLQALSLYPQ+LRHG RPNAFTFPFALKSCAALSLPILG 
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSLPILGS 69

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQI KVGC FEPFVQTGLISMYC+G+LV+NARKVF+EN  SRKL+VCYNAL+SGYVS
Sbjct: 70  QFHGQITKVGCVFEPFVQTGLISMYCKGSLVDNARKVFEENFHSRKLTVCYNALVSGYVS 129

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCS+AVLLFRQMNEEGVPVNSVTLLGLIP CVSPINLELGSSLH STLKYG DSDVSV
Sbjct: 130 NSKCSEAVLLFRQMNEEGVPVNSVTLLGLIPACVSPINLELGSSLHCSTLKYGFDSDVSV 189

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNG+ATNVLELYRNMDM+GV
Sbjct: 190 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGLATNVLELYRNMDMNGV 249

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
            PDPVT VGVLSSCANLGAQSVGHEVE KIQASGF +NPFLNNALINMYARCGNLTKAQA
Sbjct: 250 HPDPVTLVGVLSSCANLGAQSVGHEVEFKIQASGFTSNPFLNNALINMYARCGNLTKAQA 309

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFD MPERTLVSWTAIIGGYGMHGHGEIAVQLF+EMIRSGI PDGTAFV VLSACSHAGL
Sbjct: 310 VFDGMPERTLVSWTAIIGGYGMHGHGEIAVQLFKEMIRSGIEPDGTAFVCVLSACSHAGL 369

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKMM R YQL+PGPEHYSC+VDLLGRAGRL EA+ LIESMPIKPDGAVWGALL
Sbjct: 370 TDQGLEYFKMMKRNYQLEPGPEHYSCMVDLLGRAGRLKEAQTLIESMPIKPDGAVWGALL 429

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIELEP NIGYYVLLSNIY +A NSKGVLRIR MMK++KLKK
Sbjct: 430 GACKIHKNVELAELAFERVIELEPENIGYYVLLSNIYSNANNSKGVLRIRIMMKEKKLKK 489

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALV-QKFGEPKKDDRDESNKDLL 540
           DPGCSYVELKGRVHPFIVGDRNH Q++EIYRVLEELEA++ Q+FG+P+KD+R+ESNKD  
Sbjct: 490 DPGCSYVELKGRVHPFIVGDRNHLQSDEIYRVLEELEAIIMQEFGKPEKDNREESNKDGF 549

Query: 541 TGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           T VGVHSEKLAVAFGLLNT  G EVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRD TRF
Sbjct: 550 TRVGVHSEKLAVAFGLLNTTTGAEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDATRF 609

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 610 HHFRNGSCSCKDYW 623

BLAST of Cla97C04G076940 vs. NCBI nr
Match: KAA0025251.1 (putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK07416.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1134.8 bits (2934), Expect = 0.0e+00
Identity = 551/614 (89.74%), Postives = 577/614 (93.97%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQLRELAKRCQFLQALSLYPQ+LRHG RPNAFTFPFALKSCAALS PILG 
Sbjct: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSHPILGG 60

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGC FEPFVQTGLISMYC+G+LVENARKVFDEN  SRKL+VCYNALISGY S
Sbjct: 61  QFHGQIIKVGCIFEPFVQTGLISMYCKGSLVENARKVFDENFHSRKLTVCYNALISGYAS 120

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCSDAVLLFRQMNEEG+PVNSVTLLGLIP CVSPINLELGSSLH STLKYG DSDVSV
Sbjct: 121 NSKCSDAVLLFRQMNEEGIPVNSVTLLGLIPACVSPINLELGSSLHCSTLKYGFDSDVSV 180

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNG+ATNVLELYRNMDM+GV
Sbjct: 181 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGLATNVLELYRNMDMNGV 240

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
           RPDP+T VGVLSSCANLGAQSVGH VE KIQASGF NNPFLNNALINMYARCGNLTKAQ+
Sbjct: 241 RPDPITLVGVLSSCANLGAQSVGHAVEFKIQASGFTNNPFLNNALINMYARCGNLTKAQS 300

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFD MPERTLVSWTAIIGGYGMHGHGEIAVQLF+EMIRSGI PDGTAFV VLSACSHAGL
Sbjct: 301 VFDGMPERTLVSWTAIIGGYGMHGHGEIAVQLFKEMIRSGIEPDGTAFVCVLSACSHAGL 360

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKMM R Y+L+PG EHYSC+VDLLGRAGRL EA+NLIESMPIKPDGAVWGALL
Sbjct: 361 TDQGLEYFKMMKRNYRLEPGQEHYSCMVDLLGRAGRLKEAQNLIESMPIKPDGAVWGALL 420

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIE EP NIGYYVLLSNIY DA NSKGVLRIR MMK++KLKK
Sbjct: 421 GACKIHKNVELAELAFERVIEHEPENIGYYVLLSNIYSDANNSKGVLRIRIMMKEKKLKK 480

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALV-QKFGEPKKDDRDESNKDLL 540
           DPGCSYVELKGRV PFIVGDRNHPQ++EIYRVLEELEA++ Q+FG+PKKD+R+ESNKD  
Sbjct: 481 DPGCSYVELKGRVQPFIVGDRNHPQSDEIYRVLEELEAIIMQEFGKPKKDNREESNKDFF 540

Query: 541 TGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           TGVGVHSEKLAVAFGLLNT  G EVVIIKNLRICEDCHLFFKMVSKI  RQLTVRD TRF
Sbjct: 541 TGVGVHSEKLAVAFGLLNTTTGAEVVIIKNLRICEDCHLFFKMVSKIADRQLTVRDATRF 600

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 601 HHFRNGSCSCKDYW 614

BLAST of Cla97C04G076940 vs. ExPASy Swiss-Prot
Match: Q9CAY1 (Putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H52 PE=1 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 1.1e-235
Identity = 388/609 (63.71%), Postives = 485/609 (79.64%), Query Frame = 0

Query: 5   STPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHG 64
           STPWN +LRELA +  F +++SLY  +LR G  P+AF+FPF LKSCA+LSLP+ G+Q H 
Sbjct: 18  STPWNVRLRELAYQSLFSESISLYRSMLRSGSSPDAFSFPFILKSCASLSLPVSGQQLHC 77

Query: 65  QIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKC 124
            + K GCE EPFV T LISMYC+  LV +ARKVF+EN QS +LSVCYNALISGY +NSK 
Sbjct: 78  HVTKGGCETEPFVLTALISMYCKCGLVADARKVFEENPQSSQLSVCYNALISGYTANSKV 137

Query: 125 SDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCF 184
           +DA  +FR+M E GV V+SVT+LGL+P+C  P  L LG SLHG  +K GLDS+V+V+N F
Sbjct: 138 TDAAYMFRRMKETGVSVDSVTMLGLVPLCTVPEYLWLGRSLHGQCVKGGLDSEVAVLNSF 197

Query: 185 ITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDP 244
           ITMYMKCGSV+  ++LFDEMPVKGLI+WNA++SGY+QNG+A +VLELY  M   GV PDP
Sbjct: 198 ITMYMKCGSVEAGRRLFDEMPVKGLITWNAVISGYSQNGLAYDVLELYEQMKSSGVCPDP 257

Query: 245 VTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDE 304
            T V VLSSCA+LGA+ +GHEV   ++++GF  N F++NA I+MYARCGNL KA+AVFD 
Sbjct: 258 FTLVSVLSSCAHLGAKKIGHEVGKLVESNGFVPNVFVSNASISMYARCGNLAKARAVFDI 317

Query: 305 MPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQG 364
           MP ++LVSWTA+IG YGMHG GEI + LF++MI+ GI PDG  FV VLSACSH+GLTD+G
Sbjct: 318 MPVKSLVSWTAMIGCYGMHGMGEIGLMLFDDMIKRGIRPDGAVFVMVLSACSHSGLTDKG 377

Query: 365 LEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACK 424
           LE F+ M R+Y+L+PGPEHYSC+VDLLGRAGRL+EA   IESMP++PDGAVWGALLGACK
Sbjct: 378 LELFRAMKREYKLEPGPEHYSCLVDLLGRAGRLDEAMEFIESMPVEPDGAVWGALLGACK 437

Query: 425 IHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGC 484
           IHKNV++AELAF  VIE EP NIGYYVL+SNIY D+KN +G+ RIR MM++R  +K PG 
Sbjct: 438 IHKNVDMAELAFAKVIEFEPNNIGYYVLMSNIYSDSKNQEGIWRIRVMMRERAFRKKPGY 497

Query: 485 SYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLTGVGV 544
           SYVE KGRVH F+ GDR+H Q EE++R+L+ELE  V +       DR E   ++ +    
Sbjct: 498 SYVEHKGRVHLFLAGDRSHEQTEEVHRMLDELETSVMELAGNMDCDRGE---EVSSTTRE 557

Query: 545 HSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFKN 604
           HSE+LA+AFG+LN++ GTE+++IKNLR+CEDCH+F K VSKIV RQ  VRD +RFH+FK+
Sbjct: 558 HSERLAIAFGILNSIPGTEILVIKNLRVCEDCHVFLKQVSKIVDRQFVVRDASRFHYFKD 617

Query: 605 GSCSCKDYW 614
           G CSCKDYW
Sbjct: 618 GVCSCKDYW 623

BLAST of Cla97C04G076940 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 503.1 bits (1294), Expect = 4.6e-141
Identity = 257/643 (39.97%), Postives = 377/643 (58.63%), Query Frame = 0

Query: 8   WNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQII 67
           WNT  R  A     + AL LY  ++  G  PN++TFPF LKSCA       G+Q HG ++
Sbjct: 102 WNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVL 161

Query: 68  KVGCEFEPFVQTGLISMYCRGTLVENARKVFD-----------------------ENSQS 127
           K+GC+ + +V T LISMY +   +E+A KVFD                       EN+Q 
Sbjct: 162 KLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQK 221

Query: 128 R------KLSVCYNALISGYVSNSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPIN 187
                  K  V +NA+ISGY       +A+ LF+ M +  V  +  T++ ++  C    +
Sbjct: 222 LFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGS 281

Query: 188 LELGSSLHGSTLKYGLDSDVSVVNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSG 247
           +ELG  +H     +G  S++ +VN  I +Y KCG ++ A  LF+ +P K +ISWN ++ G
Sbjct: 282 IELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGG 341

Query: 248 YAQNGMATNVLELYRNMDMHGVRPDPVTHVGVLSSCANLGAQSVGHEVELKI--QASGFN 307
           Y    +    L L++ M   G  P+ VT + +L +CA+LGA  +G  + + I  +  G  
Sbjct: 342 YTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVT 401

Query: 308 NNPFLNNALINMYARCGNLTKAQAVFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEM 367
           N   L  +LI+MYA+CG++  A  VF+ +  ++L SW A+I G+ MHG  + +  LF  M
Sbjct: 402 NASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRM 461

Query: 368 IRSGIVPDGTAFVSVLSACSHAGLTDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGR 427
            + GI PD   FV +LSACSH+G+ D G   F+ MT+ Y++ P  EHY C++DLLG +G 
Sbjct: 462 RKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGL 521

Query: 428 LNEARNLIESMPIKPDGAVWGALLGACKIHKNVELAELAFEHVIELEPANIGYYVLLSNI 487
             EA  +I  M ++PDG +W +LL ACK+H NVEL E   E++I++EP N G YVLLSNI
Sbjct: 522 FKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNI 581

Query: 488 YYDAKNSKGVLRIRFMMKQRKLKKDPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEEL 547
           Y  A     V + R ++  + +KK PGCS +E+   VH FI+GD+ HP+  EIY +LEE+
Sbjct: 582 YASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEM 641

Query: 548 EALVQKFG------EPKKDDRDESNKDLLTGVGVHSEKLAVAFGLLNTMAGTEVVIIKNL 607
           E L++K G      E  ++  +E  +  L     HSEKLA+AFGL++T  GT++ I+KNL
Sbjct: 642 EVLLEKAGFVPDTSEVLQEMEEEWKEGALRH---HSEKLAIAFGLISTKPGTKLTIVKNL 701

Query: 608 RICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFKNGSCSCKDYW 614
           R+C +CH   K++SKI  R++  RD TRFHHF++G CSC DYW
Sbjct: 702 RVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Cla97C04G076940 vs. ExPASy Swiss-Prot
Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 1.1e-137
Identity = 255/612 (41.67%), Postives = 370/612 (60.46%), Query Frame = 0

Query: 8   WNTQLRELAKRCQFLQALSLYPQLLRHG-GRPNAFTFPFALKSCAALSLPILGEQFHGQI 67
           WNT +    K   +++++ ++  L+     R +  T    L + A L    LG Q H   
Sbjct: 188 WNTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLA 247

Query: 68  IKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSD 127
            K GC    +V TG IS+Y +   ++    +F E  +     V YNA+I GY SN +   
Sbjct: 248 TKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPD--IVAYNAMIHGYTSNGETEL 307

Query: 128 AVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFIT 187
           ++ LF+++   G  + S TL+ L+PV     +L L  ++HG  LK    S  SV     T
Sbjct: 308 SLSLFKELMLSGARLRSSTLVSLVPVSG---HLMLIYAIHGYCLKSNFLSHASVSTALTT 367

Query: 188 MYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVT 247
           +Y K   ++ A+KLFDE P K L SWNAM+SGY QNG+  + + L+R M      P+PVT
Sbjct: 368 VYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVT 427

Query: 248 HVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMP 307
              +LS+CA LGA S+G  V   ++++ F ++ +++ ALI MYA+CG++ +A+ +FD M 
Sbjct: 428 ITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMT 487

Query: 308 ERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLE 367
           ++  V+W  +I GYG+HG G+ A+ +F EM+ SGI P    F+ VL ACSHAGL  +G E
Sbjct: 488 KKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDE 547

Query: 368 YFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIH 427
            F  M  +Y  +P  +HY+C+VD+LGRAG L  A   IE+M I+P  +VW  LLGAC+IH
Sbjct: 548 IFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIH 607

Query: 428 KNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSY 487
           K+  LA    E + EL+P N+GY+VLLSNI+   +N      +R   K+RKL K PG + 
Sbjct: 608 KDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTL 667

Query: 488 VELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFG-EPKKD----DRDESNKDLLTG 547
           +E+    H F  GD++HPQ +EIY  LE+LE  +++ G +P+ +    D +E  ++L+  
Sbjct: 668 IEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELM-- 727

Query: 548 VGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHH 607
           V VHSE+LA+AFGL+ T  GTE+ IIKNLR+C DCH   K++SKI  R + VRD  RFHH
Sbjct: 728 VKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHH 787

Query: 608 FKNGSCSCKDYW 614
           FK+G CSC DYW
Sbjct: 788 FKDGVCSCGDYW 792

BLAST of Cla97C04G076940 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 3.1e-137
Identity = 249/611 (40.75%), Postives = 370/611 (60.56%), Query Frame = 0

Query: 8   WNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQII 67
           WNT +   ++      AL +   +     +P+  T    L + +AL L  +G++ HG  +
Sbjct: 204 WNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAM 263

Query: 68  KVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSDA 127
           + G +    + T L+ MY +   +E AR++FD      +  V +N++I  YV N    +A
Sbjct: 264 RSGFDSLVNISTALVDMYAKCGSLETARQLFD--GMLERNVVSWNSMIDAYVQNENPKEA 323

Query: 128 VLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFITM 187
           +L+F++M +EGV    V+++G +  C    +LE G  +H  +++ GLD +VSVVN  I+M
Sbjct: 324 MLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISM 383

Query: 188 YMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVTH 247
           Y KC  VD A  +F ++  + L+SWNAM+ G+AQNG   + L  +  M    V+PD  T+
Sbjct: 384 YCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTY 443

Query: 248 VGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMPE 307
           V V+++ A L        +   +  S  + N F+  AL++MYA+CG +  A+ +FD M E
Sbjct: 444 VSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE 503

Query: 308 RTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLEY 367
           R + +W A+I GYG HG G+ A++LFEEM +  I P+G  F+SV+SACSH+GL + GL+ 
Sbjct: 504 RHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKC 563

Query: 368 FKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIHK 427
           F MM   Y ++   +HY  +VDLLGRAGRLNEA + I  MP+KP   V+GA+LGAC+IHK
Sbjct: 564 FYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHK 623

Query: 428 NVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSYV 487
           NV  AE A E + EL P + GY+VLL+NIY  A   + V ++R  M ++ L+K PGCS V
Sbjct: 624 NVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMV 683

Query: 488 ELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDD-----RDESNKDLLTGV 547
           E+K  VH F  G   HP +++IY  LE+L   +++ G     +      ++  + LL+  
Sbjct: 684 EIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVLGVENDVKEQLLS-- 743

Query: 548 GVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHF 607
             HSEKLA++FGLLNT AGT + + KNLR+C DCH   K +S +  R++ VRD  RFHHF
Sbjct: 744 -THSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHF 803

Query: 608 KNGSCSCKDYW 614
           KNG+CSC DYW
Sbjct: 804 KNGACSCGDYW 809

BLAST of Cla97C04G076940 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 486.5 bits (1251), Expect = 4.4e-136
Identity = 235/610 (38.52%), Postives = 379/610 (62.13%), Query Frame = 0

Query: 7   PWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQI 66
           PWN  +R  ++   F  AL +Y  +      P++FTFP  LK+C+ LS   +G   H Q+
Sbjct: 86  PWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQV 145

Query: 67  IKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSD 126
            ++G + + FVQ GLI++Y +   + +AR VF+      +  V + A++S Y  N +  +
Sbjct: 146 FRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPME 205

Query: 127 AVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFIT 186
           A+ +F QM +  V  + V L+ ++       +L+ G S+H S +K GL+ +  ++    T
Sbjct: 206 ALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNT 265

Query: 187 MYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVT 246
           MY KCG V  A+ LFD+M    LI WNAM+SGYA+NG A   ++++  M    VRPD ++
Sbjct: 266 MYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTIS 325

Query: 247 HVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMP 306
               +S+CA +G+      +   +  S + ++ F+++ALI+M+A+CG++  A+ VFD   
Sbjct: 326 ITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTL 385

Query: 307 ERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLE 366
           +R +V W+A+I GYG+HG    A+ L+  M R G+ P+   F+ +L AC+H+G+  +G  
Sbjct: 386 DRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWW 445

Query: 367 YFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIH 426
           +F  M   +++ P  +HY+C++DLLGRAG L++A  +I+ MP++P   VWGALL ACK H
Sbjct: 446 FFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKH 505

Query: 427 KNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSY 486
           ++VEL E A + +  ++P+N G+YV LSN+Y  A+    V  +R  MK++ L KD GCS+
Sbjct: 506 RHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSW 565

Query: 487 VELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDD---RDESNKDLLTGVG 546
           VE++GR+  F VGD++HP+ EEI R +E +E+ +++ G     D    D ++++    + 
Sbjct: 566 VEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLC 625

Query: 547 VHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFK 606
            HSE++A+A+GL++T  GT + I KNLR C +CH   K++SK+V R++ VRD  RFHHFK
Sbjct: 626 SHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFK 685

Query: 607 NGSCSCKDYW 614
           +G CSC DYW
Sbjct: 686 DGVCSCGDYW 694

BLAST of Cla97C04G076940 vs. ExPASy TrEMBL
Match: A0A1S3CH87 (putative pentatricopeptide repeat-containing protein At3g11460 OS=Cucumis melo OX=3656 GN=LOC103500905 PE=3 SV=1)

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 551/614 (89.74%), Postives = 578/614 (94.14%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQLRELAKRCQFLQALSLYPQ+LRHG RPNAFTFPFALKSCAALS PILG 
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSHPILGG 69

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGC FEPFVQTGLISMYC+G+LVENARKVFDEN  SRKL+VCYNALISGY S
Sbjct: 70  QFHGQIIKVGCIFEPFVQTGLISMYCKGSLVENARKVFDENFHSRKLTVCYNALISGYAS 129

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCSDAVLLFRQMNEEG+PVNSVTLLGLIP CVSPINLELGSSLH STLKYG DS+VSV
Sbjct: 130 NSKCSDAVLLFRQMNEEGIPVNSVTLLGLIPACVSPINLELGSSLHCSTLKYGFDSEVSV 189

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNG+ATNVLELYRNMDM+GV
Sbjct: 190 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGLATNVLELYRNMDMNGV 249

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
           RPDP+T VGVLSSCANLGAQSVGH VE KIQASGF NNPFLNNALINMYARCGNLTKAQ+
Sbjct: 250 RPDPITLVGVLSSCANLGAQSVGHAVEFKIQASGFTNNPFLNNALINMYARCGNLTKAQS 309

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFD MPERTLVSWTAIIGGYGMHGHGEIAVQLF+EMIRSGI PDGTAFV VLSACSHAGL
Sbjct: 310 VFDGMPERTLVSWTAIIGGYGMHGHGEIAVQLFKEMIRSGIEPDGTAFVCVLSACSHAGL 369

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKMM R Y+L+PG EHYSC+VDLLGRAGRL EA+NLIESMPIKPDGAVWGALL
Sbjct: 370 TDQGLEYFKMMKRNYRLEPGQEHYSCMVDLLGRAGRLKEAQNLIESMPIKPDGAVWGALL 429

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIE EP NIGYYVLLSNIY DA NSKGVLRIR MMK++KLKK
Sbjct: 430 GACKIHKNVELAELAFERVIEHEPENIGYYVLLSNIYSDANNSKGVLRIRIMMKEKKLKK 489

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALV-QKFGEPKKDDRDESNKDLL 540
           DPGCSYVELKGRVHPFIVGDRNHPQ++EIYRVLEELEA++ Q+FG+PKKD+R+ESNKD  
Sbjct: 490 DPGCSYVELKGRVHPFIVGDRNHPQSDEIYRVLEELEAIIMQEFGKPKKDNREESNKDFF 549

Query: 541 TGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           TGVGVHSEKLAVAFGLLNT  G EVVIIKNLRICEDCHLFFKMVSKI  RQLTVRD TRF
Sbjct: 550 TGVGVHSEKLAVAFGLLNTTTGAEVVIIKNLRICEDCHLFFKMVSKIADRQLTVRDATRF 609

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 610 HHFRNGSCSCKDYW 623

BLAST of Cla97C04G076940 vs. ExPASy TrEMBL
Match: A0A0A0KFC9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G452690 PE=3 SV=1)

HSP 1 Score: 1135.2 bits (2935), Expect = 0.0e+00
Identity = 551/614 (89.74%), Postives = 580/614 (94.46%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQLRELAKRCQFLQALSLYPQ+LRHG RPNAFTFPFALKSCAALSLPILG 
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSLPILGS 69

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQI KVGC FEPFVQTGLISMYC+G+LV+NARKVF+EN  SRKL+VCYNAL+SGYVS
Sbjct: 70  QFHGQITKVGCVFEPFVQTGLISMYCKGSLVDNARKVFEENFHSRKLTVCYNALVSGYVS 129

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCS+AVLLFRQMNEEGVPVNSVTLLGLIP CVSPINLELGSSLH STLKYG DSDVSV
Sbjct: 130 NSKCSEAVLLFRQMNEEGVPVNSVTLLGLIPACVSPINLELGSSLHCSTLKYGFDSDVSV 189

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNG+ATNVLELYRNMDM+GV
Sbjct: 190 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGLATNVLELYRNMDMNGV 249

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
            PDPVT VGVLSSCANLGAQSVGHEVE KIQASGF +NPFLNNALINMYARCGNLTKAQA
Sbjct: 250 HPDPVTLVGVLSSCANLGAQSVGHEVEFKIQASGFTSNPFLNNALINMYARCGNLTKAQA 309

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFD MPERTLVSWTAIIGGYGMHGHGEIAVQLF+EMIRSGI PDGTAFV VLSACSHAGL
Sbjct: 310 VFDGMPERTLVSWTAIIGGYGMHGHGEIAVQLFKEMIRSGIEPDGTAFVCVLSACSHAGL 369

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKMM R YQL+PGPEHYSC+VDLLGRAGRL EA+ LIESMPIKPDGAVWGALL
Sbjct: 370 TDQGLEYFKMMKRNYQLEPGPEHYSCMVDLLGRAGRLKEAQTLIESMPIKPDGAVWGALL 429

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIELEP NIGYYVLLSNIY +A NSKGVLRIR MMK++KLKK
Sbjct: 430 GACKIHKNVELAELAFERVIELEPENIGYYVLLSNIYSNANNSKGVLRIRIMMKEKKLKK 489

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALV-QKFGEPKKDDRDESNKDLL 540
           DPGCSYVELKGRVHPFIVGDRNH Q++EIYRVLEELEA++ Q+FG+P+KD+R+ESNKD  
Sbjct: 490 DPGCSYVELKGRVHPFIVGDRNHLQSDEIYRVLEELEAIIMQEFGKPEKDNREESNKDGF 549

Query: 541 TGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           T VGVHSEKLAVAFGLLNT  G EVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRD TRF
Sbjct: 550 TRVGVHSEKLAVAFGLLNTTTGAEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDATRF 609

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 610 HHFRNGSCSCKDYW 623

BLAST of Cla97C04G076940 vs. ExPASy TrEMBL
Match: A0A5D3CAP8 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G001430 PE=3 SV=1)

HSP 1 Score: 1134.8 bits (2934), Expect = 0.0e+00
Identity = 551/614 (89.74%), Postives = 577/614 (93.97%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           MNALSTPWNTQLRELAKRCQFLQALSLYPQ+LRHG RPNAFTFPFALKSCAALS PILG 
Sbjct: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSHPILGG 60

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGC FEPFVQTGLISMYC+G+LVENARKVFDEN  SRKL+VCYNALISGY S
Sbjct: 61  QFHGQIIKVGCIFEPFVQTGLISMYCKGSLVENARKVFDENFHSRKLTVCYNALISGYAS 120

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSKCSDAVLLFRQMNEEG+PVNSVTLLGLIP CVSPINLELGSSLH STLKYG DSDVSV
Sbjct: 121 NSKCSDAVLLFRQMNEEGIPVNSVTLLGLIPACVSPINLELGSSLHCSTLKYGFDSDVSV 180

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV+YAQKLFDEMPVKGLISWNAMVSGYAQNG+ATNVLELYRNMDM+GV
Sbjct: 181 VNCFITMYMKCGSVNYAQKLFDEMPVKGLISWNAMVSGYAQNGLATNVLELYRNMDMNGV 240

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
           RPDP+T VGVLSSCANLGAQSVGH VE KIQASGF NNPFLNNALINMYARCGNLTKAQ+
Sbjct: 241 RPDPITLVGVLSSCANLGAQSVGHAVEFKIQASGFTNNPFLNNALINMYARCGNLTKAQS 300

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           VFD MPERTLVSWTAIIGGYGMHGHGEIAVQLF+EMIRSGI PDGTAFV VLSACSHAGL
Sbjct: 301 VFDGMPERTLVSWTAIIGGYGMHGHGEIAVQLFKEMIRSGIEPDGTAFVCVLSACSHAGL 360

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           TDQGLEYFKMM R Y+L+PG EHYSC+VDLLGRAGRL EA+NLIESMPIKPDGAVWGALL
Sbjct: 361 TDQGLEYFKMMKRNYRLEPGQEHYSCMVDLLGRAGRLKEAQNLIESMPIKPDGAVWGALL 420

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIHKNVELAELAFE VIE EP NIGYYVLLSNIY DA NSKGVLRIR MMK++KLKK
Sbjct: 421 GACKIHKNVELAELAFERVIEHEPENIGYYVLLSNIYSDANNSKGVLRIRIMMKEKKLKK 480

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALV-QKFGEPKKDDRDESNKDLL 540
           DPGCSYVELKGRV PFIVGDRNHPQ++EIYRVLEELEA++ Q+FG+PKKD+R+ESNKD  
Sbjct: 481 DPGCSYVELKGRVQPFIVGDRNHPQSDEIYRVLEELEAIIMQEFGKPKKDNREESNKDFF 540

Query: 541 TGVGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           TGVGVHSEKLAVAFGLLNT  G EVVIIKNLRICEDCHLFFKMVSKI  RQLTVRD TRF
Sbjct: 541 TGVGVHSEKLAVAFGLLNTTTGAEVVIIKNLRICEDCHLFFKMVSKIADRQLTVRDATRF 600

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 601 HHFRNGSCSCKDYW 614

BLAST of Cla97C04G076940 vs. ExPASy TrEMBL
Match: A0A6J1EDM7 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS=Cucurbita moschata OX=3662 GN=LOC111433216 PE=3 SV=1)

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 547/614 (89.09%), Postives = 575/614 (93.65%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           M ALSTPWNTQLRELAKRCQFLQALSLY Q+LRHG  PNAFTFPFALKSCAALSLPILG 
Sbjct: 1   MTALSTPWNTQLRELAKRCQFLQALSLYTQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGCE EPFVQTGLISMYCRG+L+ NARKVFDE SQSRKL+VCYNALISGYVS
Sbjct: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSK SDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELG SLH STLKYGLDSDVSV
Sbjct: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGLSLHCSTLKYGLDSDVSV 180

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV++AQ LFDEMP KGLISWNAMVSGYAQNG+A NVLELY NM++HG+
Sbjct: 181 VNCFITMYMKCGSVNHAQNLFDEMPEKGLISWNAMVSGYAQNGLAANVLELYHNMELHGI 240

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
            PDP T VGVLSSCANLGAQSVG EVELKIQASGF NN FLNNALINMYARCGNLTKAQA
Sbjct: 241 HPDPFTLVGVLSSCANLGAQSVGREVELKIQASGFTNNQFLNNALINMYARCGNLTKAQA 300

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           +FDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFE+MIRSGIVPDGTAFVSVLSACSHAGL
Sbjct: 301 LFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEDMIRSGIVPDGTAFVSVLSACSHAGL 360

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           T QG+EYFKMM R YQL+PGPEHYSC+VDLLGRAGRLNEARNLIESM I+PDGAVWGALL
Sbjct: 361 TSQGMEYFKMMGRNYQLEPGPEHYSCMVDLLGRAGRLNEARNLIESMAIEPDGAVWGALL 420

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIH+NV+LAELAFE V+ELEPANIGYYVLLSNIY D KNSKGVLRIR MMK+RKLKK
Sbjct: 421 GACKIHQNVKLAELAFERVVELEPANIGYYVLLSNIYNDTKNSKGVLRIRIMMKERKLKK 480

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLT 540
           DPGCSYVELKGRVHPF+VGDR+HPQAEEIYR+LEEL ALV +FGE K+ DR+ESNKDL  
Sbjct: 481 DPGCSYVELKGRVHPFVVGDRSHPQAEEIYRLLEEL-ALVHEFGEAKRADREESNKDLFA 540

Query: 541 G-VGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           G  GVHSEKLAVAFGLLNT AGTEVVIIKNLRICEDCHLFFK+VSKIVHRQLTVRD TRF
Sbjct: 541 GAAGVHSEKLAVAFGLLNTTAGTEVVIIKNLRICEDCHLFFKIVSKIVHRQLTVRDATRF 600

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 601 HHFRNGSCSCKDYW 613

BLAST of Cla97C04G076940 vs. ExPASy TrEMBL
Match: A0A6J1IQ02 (putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS=Cucurbita maxima OX=3661 GN=LOC111477787 PE=3 SV=1)

HSP 1 Score: 1117.4 bits (2889), Expect = 0.0e+00
Identity = 544/614 (88.60%), Postives = 574/614 (93.49%), Query Frame = 0

Query: 1   MNALSTPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGE 60
           M ALS PWNT+LRELAKRCQFLQALSLY Q+LRHG  PNAFTFPFALKSCAALSLPILG 
Sbjct: 1   MTALSMPWNTRLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60

Query: 61  QFHGQIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVS 120
           QFHGQIIKVGCE EPFVQTGL+SMYCRG+L+ NARKVFDE SQSRKL+VCYNALISGYVS
Sbjct: 61  QFHGQIIKVGCESEPFVQTGLLSMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120

Query: 121 NSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSV 180
           NSK SDAVLLFRQMNEEGVPVNSVTLL LIPVCVSPINLELG SLH STLKYGLDSDVSV
Sbjct: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLLSLIPVCVSPINLELGLSLHCSTLKYGLDSDVSV 180

Query: 181 VNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGV 240
           VNCFITMYMKCGSV++AQ LFDEMP KGLISWNAMVSGYAQNG+ATNVLELY NM++HG+
Sbjct: 181 VNCFITMYMKCGSVNHAQNLFDEMPEKGLISWNAMVSGYAQNGLATNVLELYHNMELHGI 240

Query: 241 RPDPVTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQA 300
            PDP T VGVLSSCANLGAQSVG EVELKIQASGF NN FLNNALINMYARCGNLTKAQA
Sbjct: 241 HPDPFTLVGVLSSCANLGAQSVGREVELKIQASGFTNNQFLNNALINMYARCGNLTKAQA 300

Query: 301 VFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGL 360
           +FDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFE+MIRSGIVPDGTAFVSVLSACSHAGL
Sbjct: 301 LFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEDMIRSGIVPDGTAFVSVLSACSHAGL 360

Query: 361 TDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALL 420
           T QG+EYFKMM   YQL+PGPEHYSC+VDLLGRAGRLNEARNLIESM I+PDGAVWGALL
Sbjct: 361 TSQGMEYFKMMGGNYQLEPGPEHYSCMVDLLGRAGRLNEARNLIESMAIEPDGAVWGALL 420

Query: 421 GACKIHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKK 480
           GACKIH+NV+LAELAFE V+ELEPANIGYYVLLSNIY D KNSKGVLRIR MMK+RKLKK
Sbjct: 421 GACKIHQNVKLAELAFERVVELEPANIGYYVLLSNIYNDTKNSKGVLRIRIMMKERKLKK 480

Query: 481 DPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLT 540
           DPGCSYVELKGRVHPF+VGDR+HPQAEEIY VLEELEALVQ+FGE K+ DR+ESNKDL  
Sbjct: 481 DPGCSYVELKGRVHPFVVGDRSHPQAEEIYSVLEELEALVQEFGEAKRADREESNKDLFA 540

Query: 541 G-VGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRF 600
           G  GVHSEKLAVAFGLLNT AGTEVV+IKNLRICEDCHLFFK+VSKIVHRQLTVRD TRF
Sbjct: 541 GAAGVHSEKLAVAFGLLNTTAGTEVVVIKNLRICEDCHLFFKIVSKIVHRQLTVRDATRF 600

Query: 601 HHFKNGSCSCKDYW 614
           HHF+NGSCSCKDYW
Sbjct: 601 HHFRNGSCSCKDYW 614

BLAST of Cla97C04G076940 vs. TAIR 10
Match: AT3G11460.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 817.4 bits (2110), Expect = 7.8e-237
Identity = 388/609 (63.71%), Postives = 485/609 (79.64%), Query Frame = 0

Query: 5   STPWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHG 64
           STPWN +LRELA +  F +++SLY  +LR G  P+AF+FPF LKSCA+LSLP+ G+Q H 
Sbjct: 18  STPWNVRLRELAYQSLFSESISLYRSMLRSGSSPDAFSFPFILKSCASLSLPVSGQQLHC 77

Query: 65  QIIKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKC 124
            + K GCE EPFV T LISMYC+  LV +ARKVF+EN QS +LSVCYNALISGY +NSK 
Sbjct: 78  HVTKGGCETEPFVLTALISMYCKCGLVADARKVFEENPQSSQLSVCYNALISGYTANSKV 137

Query: 125 SDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCF 184
           +DA  +FR+M E GV V+SVT+LGL+P+C  P  L LG SLHG  +K GLDS+V+V+N F
Sbjct: 138 TDAAYMFRRMKETGVSVDSVTMLGLVPLCTVPEYLWLGRSLHGQCVKGGLDSEVAVLNSF 197

Query: 185 ITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDP 244
           ITMYMKCGSV+  ++LFDEMPVKGLI+WNA++SGY+QNG+A +VLELY  M   GV PDP
Sbjct: 198 ITMYMKCGSVEAGRRLFDEMPVKGLITWNAVISGYSQNGLAYDVLELYEQMKSSGVCPDP 257

Query: 245 VTHVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDE 304
            T V VLSSCA+LGA+ +GHEV   ++++GF  N F++NA I+MYARCGNL KA+AVFD 
Sbjct: 258 FTLVSVLSSCAHLGAKKIGHEVGKLVESNGFVPNVFVSNASISMYARCGNLAKARAVFDI 317

Query: 305 MPERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQG 364
           MP ++LVSWTA+IG YGMHG GEI + LF++MI+ GI PDG  FV VLSACSH+GLTD+G
Sbjct: 318 MPVKSLVSWTAMIGCYGMHGMGEIGLMLFDDMIKRGIRPDGAVFVMVLSACSHSGLTDKG 377

Query: 365 LEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACK 424
           LE F+ M R+Y+L+PGPEHYSC+VDLLGRAGRL+EA   IESMP++PDGAVWGALLGACK
Sbjct: 378 LELFRAMKREYKLEPGPEHYSCLVDLLGRAGRLDEAMEFIESMPVEPDGAVWGALLGACK 437

Query: 425 IHKNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGC 484
           IHKNV++AELAF  VIE EP NIGYYVL+SNIY D+KN +G+ RIR MM++R  +K PG 
Sbjct: 438 IHKNVDMAELAFAKVIEFEPNNIGYYVLMSNIYSDSKNQEGIWRIRVMMRERAFRKKPGY 497

Query: 485 SYVELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDDRDESNKDLLTGVGV 544
           SYVE KGRVH F+ GDR+H Q EE++R+L+ELE  V +       DR E   ++ +    
Sbjct: 498 SYVEHKGRVHLFLAGDRSHEQTEEVHRMLDELETSVMELAGNMDCDRGE---EVSSTTRE 557

Query: 545 HSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFKN 604
           HSE+LA+AFG+LN++ GTE+++IKNLR+CEDCH+F K VSKIV RQ  VRD +RFH+FK+
Sbjct: 558 HSERLAIAFGILNSIPGTEILVIKNLRVCEDCHVFLKQVSKIVDRQFVVRDASRFHYFKD 617

Query: 605 GSCSCKDYW 614
           G CSCKDYW
Sbjct: 618 GVCSCKDYW 623

BLAST of Cla97C04G076940 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 503.1 bits (1294), Expect = 3.3e-142
Identity = 257/643 (39.97%), Postives = 377/643 (58.63%), Query Frame = 0

Query: 8   WNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQII 67
           WNT  R  A     + AL LY  ++  G  PN++TFPF LKSCA       G+Q HG ++
Sbjct: 102 WNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVL 161

Query: 68  KVGCEFEPFVQTGLISMYCRGTLVENARKVFD-----------------------ENSQS 127
           K+GC+ + +V T LISMY +   +E+A KVFD                       EN+Q 
Sbjct: 162 KLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQK 221

Query: 128 R------KLSVCYNALISGYVSNSKCSDAVLLFRQMNEEGVPVNSVTLLGLIPVCVSPIN 187
                  K  V +NA+ISGY       +A+ LF+ M +  V  +  T++ ++  C    +
Sbjct: 222 LFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGS 281

Query: 188 LELGSSLHGSTLKYGLDSDVSVVNCFITMYMKCGSVDYAQKLFDEMPVKGLISWNAMVSG 247
           +ELG  +H     +G  S++ +VN  I +Y KCG ++ A  LF+ +P K +ISWN ++ G
Sbjct: 282 IELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGG 341

Query: 248 YAQNGMATNVLELYRNMDMHGVRPDPVTHVGVLSSCANLGAQSVGHEVELKI--QASGFN 307
           Y    +    L L++ M   G  P+ VT + +L +CA+LGA  +G  + + I  +  G  
Sbjct: 342 YTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVT 401

Query: 308 NNPFLNNALINMYARCGNLTKAQAVFDEMPERTLVSWTAIIGGYGMHGHGEIAVQLFEEM 367
           N   L  +LI+MYA+CG++  A  VF+ +  ++L SW A+I G+ MHG  + +  LF  M
Sbjct: 402 NASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRM 461

Query: 368 IRSGIVPDGTAFVSVLSACSHAGLTDQGLEYFKMMTRKYQLQPGPEHYSCIVDLLGRAGR 427
            + GI PD   FV +LSACSH+G+ D G   F+ MT+ Y++ P  EHY C++DLLG +G 
Sbjct: 462 RKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGL 521

Query: 428 LNEARNLIESMPIKPDGAVWGALLGACKIHKNVELAELAFEHVIELEPANIGYYVLLSNI 487
             EA  +I  M ++PDG +W +LL ACK+H NVEL E   E++I++EP N G YVLLSNI
Sbjct: 522 FKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNI 581

Query: 488 YYDAKNSKGVLRIRFMMKQRKLKKDPGCSYVELKGRVHPFIVGDRNHPQAEEIYRVLEEL 547
           Y  A     V + R ++  + +KK PGCS +E+   VH FI+GD+ HP+  EIY +LEE+
Sbjct: 582 YASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEM 641

Query: 548 EALVQKFG------EPKKDDRDESNKDLLTGVGVHSEKLAVAFGLLNTMAGTEVVIIKNL 607
           E L++K G      E  ++  +E  +  L     HSEKLA+AFGL++T  GT++ I+KNL
Sbjct: 642 EVLLEKAGFVPDTSEVLQEMEEEWKEGALRH---HSEKLAIAFGLISTKPGTKLTIVKNL 701

Query: 608 RICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFKNGSCSCKDYW 614
           R+C +CH   K++SKI  R++  RD TRFHHF++G CSC DYW
Sbjct: 702 RVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of Cla97C04G076940 vs. TAIR 10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 491.9 bits (1265), Expect = 7.5e-139
Identity = 255/612 (41.67%), Postives = 370/612 (60.46%), Query Frame = 0

Query: 8   WNTQLRELAKRCQFLQALSLYPQLLRHG-GRPNAFTFPFALKSCAALSLPILGEQFHGQI 67
           WNT +    K   +++++ ++  L+     R +  T    L + A L    LG Q H   
Sbjct: 188 WNTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLA 247

Query: 68  IKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSD 127
            K GC    +V TG IS+Y +   ++    +F E  +     V YNA+I GY SN +   
Sbjct: 248 TKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPD--IVAYNAMIHGYTSNGETEL 307

Query: 128 AVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFIT 187
           ++ LF+++   G  + S TL+ L+PV     +L L  ++HG  LK    S  SV     T
Sbjct: 308 SLSLFKELMLSGARLRSSTLVSLVPVSG---HLMLIYAIHGYCLKSNFLSHASVSTALTT 367

Query: 188 MYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVT 247
           +Y K   ++ A+KLFDE P K L SWNAM+SGY QNG+  + + L+R M      P+PVT
Sbjct: 368 VYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVT 427

Query: 248 HVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMP 307
              +LS+CA LGA S+G  V   ++++ F ++ +++ ALI MYA+CG++ +A+ +FD M 
Sbjct: 428 ITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMT 487

Query: 308 ERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLE 367
           ++  V+W  +I GYG+HG G+ A+ +F EM+ SGI P    F+ VL ACSHAGL  +G E
Sbjct: 488 KKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDE 547

Query: 368 YFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIH 427
            F  M  +Y  +P  +HY+C+VD+LGRAG L  A   IE+M I+P  +VW  LLGAC+IH
Sbjct: 548 IFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIH 607

Query: 428 KNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSY 487
           K+  LA    E + EL+P N+GY+VLLSNI+   +N      +R   K+RKL K PG + 
Sbjct: 608 KDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTL 667

Query: 488 VELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFG-EPKKD----DRDESNKDLLTG 547
           +E+    H F  GD++HPQ +EIY  LE+LE  +++ G +P+ +    D +E  ++L+  
Sbjct: 668 IEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELM-- 727

Query: 548 VGVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHH 607
           V VHSE+LA+AFGL+ T  GTE+ IIKNLR+C DCH   K++SKI  R + VRD  RFHH
Sbjct: 728 VKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHH 787

Query: 608 FKNGSCSCKDYW 614
           FK+G CSC DYW
Sbjct: 788 FKDGVCSCGDYW 792

BLAST of Cla97C04G076940 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 490.3 bits (1261), Expect = 2.2e-138
Identity = 249/611 (40.75%), Postives = 370/611 (60.56%), Query Frame = 0

Query: 8   WNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQII 67
           WNT +   ++      AL +   +     +P+  T    L + +AL L  +G++ HG  +
Sbjct: 204 WNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAM 263

Query: 68  KVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSDA 127
           + G +    + T L+ MY +   +E AR++FD      +  V +N++I  YV N    +A
Sbjct: 264 RSGFDSLVNISTALVDMYAKCGSLETARQLFD--GMLERNVVSWNSMIDAYVQNENPKEA 323

Query: 128 VLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFITM 187
           +L+F++M +EGV    V+++G +  C    +LE G  +H  +++ GLD +VSVVN  I+M
Sbjct: 324 MLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISM 383

Query: 188 YMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVTH 247
           Y KC  VD A  +F ++  + L+SWNAM+ G+AQNG   + L  +  M    V+PD  T+
Sbjct: 384 YCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTY 443

Query: 248 VGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMPE 307
           V V+++ A L        +   +  S  + N F+  AL++MYA+CG +  A+ +FD M E
Sbjct: 444 VSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSE 503

Query: 308 RTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLEY 367
           R + +W A+I GYG HG G+ A++LFEEM +  I P+G  F+SV+SACSH+GL + GL+ 
Sbjct: 504 RHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKC 563

Query: 368 FKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIHK 427
           F MM   Y ++   +HY  +VDLLGRAGRLNEA + I  MP+KP   V+GA+LGAC+IHK
Sbjct: 564 FYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHK 623

Query: 428 NVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSYV 487
           NV  AE A E + EL P + GY+VLL+NIY  A   + V ++R  M ++ L+K PGCS V
Sbjct: 624 NVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMV 683

Query: 488 ELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDD-----RDESNKDLLTGV 547
           E+K  VH F  G   HP +++IY  LE+L   +++ G     +      ++  + LL+  
Sbjct: 684 EIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVLGVENDVKEQLLS-- 743

Query: 548 GVHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHF 607
             HSEKLA++FGLLNT AGT + + KNLR+C DCH   K +S +  R++ VRD  RFHHF
Sbjct: 744 -THSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHF 803

Query: 608 KNGSCSCKDYW 614
           KNG+CSC DYW
Sbjct: 804 KNGACSCGDYW 809

BLAST of Cla97C04G076940 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 486.5 bits (1251), Expect = 3.2e-137
Identity = 235/610 (38.52%), Postives = 379/610 (62.13%), Query Frame = 0

Query: 7   PWNTQLRELAKRCQFLQALSLYPQLLRHGGRPNAFTFPFALKSCAALSLPILGEQFHGQI 66
           PWN  +R  ++   F  AL +Y  +      P++FTFP  LK+C+ LS   +G   H Q+
Sbjct: 86  PWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQV 145

Query: 67  IKVGCEFEPFVQTGLISMYCRGTLVENARKVFDENSQSRKLSVCYNALISGYVSNSKCSD 126
            ++G + + FVQ GLI++Y +   + +AR VF+      +  V + A++S Y  N +  +
Sbjct: 146 FRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPME 205

Query: 127 AVLLFRQMNEEGVPVNSVTLLGLIPVCVSPINLELGSSLHGSTLKYGLDSDVSVVNCFIT 186
           A+ +F QM +  V  + V L+ ++       +L+ G S+H S +K GL+ +  ++    T
Sbjct: 206 ALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNT 265

Query: 187 MYMKCGSVDYAQKLFDEMPVKGLISWNAMVSGYAQNGMATNVLELYRNMDMHGVRPDPVT 246
           MY KCG V  A+ LFD+M    LI WNAM+SGYA+NG A   ++++  M    VRPD ++
Sbjct: 266 MYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTIS 325

Query: 247 HVGVLSSCANLGAQSVGHEVELKIQASGFNNNPFLNNALINMYARCGNLTKAQAVFDEMP 306
               +S+CA +G+      +   +  S + ++ F+++ALI+M+A+CG++  A+ VFD   
Sbjct: 326 ITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTL 385

Query: 307 ERTLVSWTAIIGGYGMHGHGEIAVQLFEEMIRSGIVPDGTAFVSVLSACSHAGLTDQGLE 366
           +R +V W+A+I GYG+HG    A+ L+  M R G+ P+   F+ +L AC+H+G+  +G  
Sbjct: 386 DRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWW 445

Query: 367 YFKMMTRKYQLQPGPEHYSCIVDLLGRAGRLNEARNLIESMPIKPDGAVWGALLGACKIH 426
           +F  M   +++ P  +HY+C++DLLGRAG L++A  +I+ MP++P   VWGALL ACK H
Sbjct: 446 FFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKH 505

Query: 427 KNVELAELAFEHVIELEPANIGYYVLLSNIYYDAKNSKGVLRIRFMMKQRKLKKDPGCSY 486
           ++VEL E A + +  ++P+N G+YV LSN+Y  A+    V  +R  MK++ L KD GCS+
Sbjct: 506 RHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSW 565

Query: 487 VELKGRVHPFIVGDRNHPQAEEIYRVLEELEALVQKFGEPKKDD---RDESNKDLLTGVG 546
           VE++GR+  F VGD++HP+ EEI R +E +E+ +++ G     D    D ++++    + 
Sbjct: 566 VEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEEAEETLC 625

Query: 547 VHSEKLAVAFGLLNTMAGTEVVIIKNLRICEDCHLFFKMVSKIVHRQLTVRDPTRFHHFK 606
            HSE++A+A+GL++T  GT + I KNLR C +CH   K++SK+V R++ VRD  RFHHFK
Sbjct: 626 SHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFK 685

Query: 607 NGSCSCKDYW 614
           +G CSC DYW
Sbjct: 686 DGVCSCGDYW 694

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882846.10.0e+0091.52putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [B... [more]
XP_008462579.10.0e+0089.74PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Cucum... [more]
XP_023544808.10.0e+0089.72putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [C... [more]
XP_004143385.10.0e+0089.74putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial [C... [more]
KAA0025251.10.0e+0089.74putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] ... [more]
Match NameE-valueIdentityDescription
Q9CAY11.1e-23563.71Putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS... [more]
Q9LN014.6e-14139.97Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9SUH61.1e-13741.67Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX... [more]
Q3E6Q13.1e-13740.75Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Q9LTV84.4e-13638.52Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A1S3CH870.0e+0089.74putative pentatricopeptide repeat-containing protein At3g11460 OS=Cucumis melo O... [more]
A0A0A0KFC90.0e+0089.74DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G4526... [more]
A0A5D3CAP80.0e+0089.74Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa... [more]
A0A6J1EDM70.0e+0089.09putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS... [more]
A0A6J1IQ020.0e+0088.60putative pentatricopeptide repeat-containing protein At3g11460, mitochondrial OS... [more]
Match NameE-valueIdentityDescription
AT3G11460.17.8e-23763.71Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.13.3e-14239.97Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G30700.17.5e-13941.67Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G11290.12.2e-13840.75Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.13.2e-13738.52mitochondrial editing factor 22 [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 482..603
e-value: 1.3E-29
score: 102.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 283..308
e-value: 0.0024
score: 15.9
coord: 311..344
e-value: 9.3E-8
score: 29.8
coord: 210..243
e-value: 5.6E-6
score: 24.2
coord: 109..142
e-value: 2.2E-6
score: 25.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 108..150
e-value: 2.6E-8
score: 33.9
coord: 210..255
e-value: 1.7E-7
score: 31.3
coord: 310..356
e-value: 1.3E-8
score: 34.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 283..308
e-value: 5.3E-5
score: 23.2
coord: 181..209
e-value: 9.4E-4
score: 19.3
coord: 383..407
e-value: 0.12
score: 12.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 278..308
score: 9.54735
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 208..242
score: 11.081932
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 107..141
score: 9.821383
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 309..343
score: 11.640958
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 6..157
e-value: 1.1E-21
score: 79.5
coord: 329..508
e-value: 3.4E-28
score: 100.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 158..273
e-value: 1.7E-22
score: 82.3
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 276..463
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 4..607
NoneNo IPR availablePANTHERPTHR24015:SF505OS01G0819800 PROTEINcoord: 4..607

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C04G076940.1Cla97C04G076940.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding