Cla97C06G112990 (gene) Watermelon (97103) v2

Name: Cla97C06G112990
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr06 : 4040995 .. 4042857 (-)
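
As a quick sanity check on the location above, the feature length can be derived from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated on this page); the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
length_bp = feature_length(4040995, 4042857)

# If the whole feature were coding (an assumption), it would hold this many
# codons, the last of which is the stop codon.
codons = length_bp // 3

print(length_bp, codons)
```

Under these assumptions the span is a multiple of three, and the codon count minus the stop codon agrees with the 620-residue alignment length reported for the top BLAST hits.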
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGATTTGGCCGTCCATCCACTTTGGTCGTTCTCGTCTGGTCCATTCCTTTTCCGTCAACGCCCTCAAAGCCGCCGCCCCAGTGAATCCCATTCCCCGAGGTACCCAATCGCACAGCCTCATTATAAAGTTGGGATTGGCTAATGAACTTTCTGTTCAGAACAAGCTATTGAAGATTTATGTTAAGTGCAGGGATTTAGAAAGTGCAGGGAACGTGTTTGATGAAATGTCTAGGAGAAATGTTGTGTCGTGGAACACGGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGCTGATTTTAAGCTGAGGCAGCACTCAATTTTTTTATATTTTAAGAAGATGTTGATGGGTATGGTGCGCCCAGATGGTGTCACATTTAATGGGTTGTTTCGATCTTGTGTTGTGTTGAACGATGTTGAAAGTGGCAGGCAATTGCATGGTTTTGTAATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCAGTGGTTGATTTTTATGGGAAATGTGGGTTATATGAAGATGCGAGATTAGCTTTTAGCTGCATTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTGTTTAATTGTTTGGGGCGAGAAGCAATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGTTTTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAAGTATAAAGGATCAGGAGAATTGGGTAAGCAGCTCCATGGTCTTCTTATAAAACAGTCATTCGATTTAGATATTCTTGTGGCAAGTTCACTTGTCAATGTGTATGCTAAAAACGATAATTTATATGATGCTCGCAAGGTTTTTGATGAAATGCCATCTAGAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGCAAGAAGATGGAAAAGAGGCAGTGAAACTGTTCAGAAGAATGTTTGGGGAAGATTATTGCCCAGATGAATTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACGTGCGGGGCTTGTGAACTGATGCAAGTTCATTCCTGCTTGATAAAACTTGGTTTGGAAGCATTTCTGTCTATTAATAACGGGTTGATAAATGCATATTCGAAGTGTGGTATCATCTCCACAGCGTTACAATGCTTTAGATTAATTGCAGAACCAGATTTGGTAACATGGACATCAATTATATGTGGACTTGCACTTTGTGGCCTTGAGAAGGATGCTGTTGAGTTATTTGATAAGATGTTATCTTATGGCATTAAACCAGATAAAATTGCATTTCTTGGAGTTCTTTCTGCCTGTAGTCATGGGGGATTTGTAAGCATGGGGCTTCACTACTTCAACTTAATGACGATCCAATACCAAATTGTTCCTAATTCAGAGCATTTAACATGCTTGATCGACCTTCTCAGTAGAGCGGGTAGTCTAGACCAGGCTTTTGACCTTTTGAAATCAACGGCGAAGGAAGCTGGACCAGATGCTTTCAGGGCTTTCATTCGAGCATGTAGAACTCATGGGGACTTGAGATTAGCAGAATGGGCAATGGAATTTGCATCAGAGCCAAATGAACAAGTGAATTATTCTCTAGTGTCGAATATGTATGCTTCTGAAGGAAGATGGTCAGATGTGGCTAGAATGCGCAAACTGATGAAGGATAGTTGTGAAAGGAAAGCCCCAGGCCTTAGTTGGGTAGAAATTGCTGGTATAACCATTTGTTATAACCATTTGTTTGTATCGGGTGATAGATCCCATCCACAGTCTTCAGATCTCTATACAATGTTAGGATTATTACTAAACACGATGAAGAAGGATGACAACTCCGCAGCCCTCTGGGTAGATATTGTGCCCGATTGA

mRNA sequence

ATGCTGATTTGGCCGTCCATCCACTTTGGTCGTTCTCGTCTGGTCCATTCCTTTTCCGTCAACGCCCTCAAAGCCGCCGCCCCAGTGAATCCCATTCCCCGAGGTACCCAATCGCACAGCCTCATTATAAAGTTGGGATTGGCTAATGAACTTTCTGTTCAGAACAAGCTATTGAAGATTTATGTTAAGTGCAGGGATTTAGAAAGTGCAGGGAACGTGTTTGATGAAATGTCTAGGAGAAATGTTGTGTCGTGGAACACGGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGCTGATTTTAAGCTGAGGCAGCACTCAATTTTTTTATATTTTAAGAAGATGTTGATGGGTATGGTGCGCCCAGATGGTGTCACATTTAATGGGTTGTTTCGATCTTGTGTTGTGTTGAACGATGTTGAAAGTGGCAGGCAATTGCATGGTTTTGTAATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCAGTGGTTGATTTTTATGGGAAATGTGGGTTATATGAAGATGCGAGATTAGCTTTTAGCTGCATTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTGTTTAATTGTTTGGGGCGAGAAGCAATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGTTTTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAAGTATAAAGGATCAGGAGAATTGGGTAAGCAGCTCCATGGTCTTCTTATAAAACAGTCATTCGATTTAGATATTCTTGTGGCAAGTTCACTTGTCAATGTGTATGCTAAAAACGATAATTTATATGATGCTCGCAAGGTTTTTGATGAAATGCCATCTAGAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGCAAGAAGATGGAAAAGAGGCAGTGAAACTGTTCAGAAGAATGTTTGGGGAAGATTATTGCCCAGATGAATTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACGTGCGGGGCTTGTGAACTGATGCAAGTTCATTCCTGCTTGATAAAACTTGGTTTGGAAGCATTTCTGTCTATTAATAACGGGTTGATAAATGCATATTCGAAGTGTGGTATCATCTCCACAGCGTTACAATGCTTTAGATTAATTGCAGAACCAGATTTGGTAACATGGACATCAATTATATGTGGACTTGCACTTTGTGGCCTTGAGAAGGATGCTGTTGAGTTATTTGATAAGATGTTATCTTATGGCATTAAACCAGATAAAATTGCATTTCTTGGAGTTCTTTCTGCCTGTAGTCATGGGGGATTTGTAAGCATGGGGCTTCACTACTTCAACTTAATGACGATCCAATACCAAATTGTTCCTAATTCAGAGCATTTAACATGCTTGATCGACCTTCTCAGTAGAGCGGGTAGTCTAGACCAGGCTTTTGACCTTTTGAAATCAACGGCGAAGGAAGCTGGACCAGATGCTTTCAGGGCTTTCATTCGAGCATGTAGAACTCATGGGGACTTGAGATTAGCAGAATGGGCAATGGAATTTGCATCAGAGCCAAATGAACAAGTGAATTATTCTCTAGTGTCGAATATGTATGCTTCTGAAGGAAGATGGTCAGATGTGGCTAGAATGCGCAAACTGATGAAGGATAGTTGTGAAAGGAAAGCCCCAGGCCTTAGTTGGGTAGAAATTGCTGGTATAACCATTTGTTATAACCATTTGTTTGTATCGGGTGATAGATCCCATCCACAGTCTTCAGATCTCTATACAATGTTAGGATTATTACTAAACACGATGAAGAAGGATGACAACTCCGCAGCCCTCTGGGTAGATATTGTGCCCGATTGA

Coding sequence (CDS)

ATGCTGATTTGGCCGTCCATCCACTTTGGTCGTTCTCGTCTGGTCCATTCCTTTTCCGTCAACGCCCTCAAAGCCGCCGCCCCAGTGAATCCCATTCCCCGAGGTACCCAATCGCACAGCCTCATTATAAAGTTGGGATTGGCTAATGAACTTTCTGTTCAGAACAAGCTATTGAAGATTTATGTTAAGTGCAGGGATTTAGAAAGTGCAGGGAACGTGTTTGATGAAATGTCTAGGAGAAATGTTGTGTCGTGGAACACGGTGATATGTGGGCTTGTCGATTGCGGGTATGGAGCTGATTTTAAGCTGAGGCAGCACTCAATTTTTTTATATTTTAAGAAGATGTTGATGGGTATGGTGCGCCCAGATGGTGTCACATTTAATGGGTTGTTTCGATCTTGTGTTGTGTTGAACGATGTTGAAAGTGGCAGGCAATTGCATGGTTTTGTAATGAAAATTGGGTTTGATTTGGATTGTTTTGTGGGGAGTGCAGTGGTTGATTTTTATGGGAAATGTGGGTTATATGAAGATGCGAGATTAGCTTTTAGCTGCATTCTGTATAGGGATCTGGTTTTGTGGAATGTGATGTTGTACTGTTATGTGTTTAATTGTTTGGGGCGAGAAGCAATTGAAGTCTTTTGTTTGATGCAGTTGGAAGGTTTTAAAGGTGATGATTTTACATTCAGCAGCCTGCTAAGTTCGTGCAAGTATAAAGGATCAGGAGAATTGGGTAAGCAGCTCCATGGTCTTCTTATAAAACAGTCATTCGATTTAGATATTCTTGTGGCAAGTTCACTTGTCAATGTGTATGCTAAAAACGATAATTTATATGATGCTCGCAAGGTTTTTGATGAAATGCCATCTAGAAATTCTGTGTCTTGGACCACTATGATTGTGGGGTATGGGCAGCAAGAAGATGGAAAAGAGGCAGTGAAACTGTTCAGAAGAATGTTTGGGGAAGATTATTGCCCAGATGAATTAACTTTTGCTAGTGTGCTGAGTTCGTGTGGCTTTACGTGCGGGGCTTGTGAACTGATGCAAGTTCATTCCTGCTTGATAAAACTTGGTTTGGAAGCATTTCTGTCTATTAATAACGGGTTGATAAATGCATATTCGAAGTGTGGTATCATCTCCACAGCGTTACAATGCTTTAGATTAATTGCAGAACCAGATTTGGTAACATGGACATCAATTATATGTGGACTTGCACTTTGTGGCCTTGAGAAGGATGCTGTTGAGTTATTTGATAAGATGTTATCTTATGGCATTAAACCAGATAAAATTGCATTTCTTGGAGTTCTTTCTGCCTGTAGTCATGGGGGATTTGTAAGCATGGGGCTTCACTACTTCAACTTAATGACGATCCAATACCAAATTGTTCCTAATTCAGAGCATTTAACATGCTTGATCGACCTTCTCAGTAGAGCGGGTAGTCTAGACCAGGCTTTTGACCTTTTGAAATCAACGGCGAAGGAAGCTGGACCAGATGCTTTCAGGGCTTTCATTCGAGCATGTAGAACTCATGGGGACTTGAGATTAGCAGAATGGGCAATGGAATTTGCATCAGAGCCAAATGAACAAGTGAATTATTCTCTAGTGTCGAATATGTATGCTTCTGAAGGAAGATGGTCAGATGTGGCTAGAATGCGCAAACTGATGAAGGATAGTTGTGAAAGGAAAGCCCCAGGCCTTAGTTGGGTAGAAATTGCTGGTATAACCATTTGTTATAACCATTTGTTTGTATCGGGTGATAGATCCCATCCACAGTCTTCAGATCTCTATACAATGTTAGGATTATTACTAAACACGATGAAGAAGGATGACAACTCCGCAGCCCTCTGGGTAGATATTGTGCCCGATTGA

Protein sequence

MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKDDNSAALWVDIVPD
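
The relationship between the coding sequence and the protein sequence can be illustrated by translating the first and last few codons of the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` and the two short excerpts are illustrative and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 18 nucleotides of the CDS shown above.
cds_start = "ATGCTGATTTGGCCGTCC"
cds_end = "GATATTGTGCCCGATTGA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues (stop codon dropped)
```

The two excerpts translate to the same residues that open and close the protein sequence listed above.
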
BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_011656346.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucumis sativus] >KGN45673.1 hypothetical protein Csa_6G005130 [Cucumis sativus])

HSP 1 Score: 1098.6 bits (2840), Expect = 0.0e+00
Identity = 543/620 (87.58%), Positives = 568/620 (91.61%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IP  T  HSL++KLGL NELSVQNKLL++
Sbjct: 1   MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPHDTLLHSLVVKLGLVNELSVQNKLLRV 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA N+FDEM+RRNVVSWNTVICGLVD GYG +FK+RQHSIFLYFKKMLMG+V
Sbjct: 61  YVKCRDLDSARNLFDEMARRNVVSWNTVICGLVDGGYGGEFKMRQHSIFLYFKKMLMGLV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSCILYRDLVLWNVMLYC VFN L REAIEVF LMQLEGFKGDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSCILYRDLVLWNVMLYCCVFNSLSREAIEVFRLMQLEGFKGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH LLIKQSFDLDILVASSLVNVY KNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 241 GELGKQLHCLLIKQSFDLDILVASSLVNVYTKNDNLYDARKVFDEMPTRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQ E GKEAVKLFRRMF +DYCPDELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 301 YGQHEYGKEAVKLFRRMFRKDYCPDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGII+ ALQCFRLIAEPDLVTWTSIICGLALCGLEKDAV+LFDKMLS
Sbjct: 361 LSINNGLIYAYSKCGIIAAALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVKLFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+SEHLTCLIDLL RAGSLD
Sbjct: 421 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA RAFIRACRTHG+LRLA+ AMEFASEP+E VNYSLVSNMYASE
Sbjct: 481 QAFDLLKSMPKEAGPDALRAFIRACRTHGNLRLAKRAMEFASEPDEPVNYSLVSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+ D CE+K PGLSWVEIAG    YNHLF+SGDRSHPQS DLY MLGLL
Sbjct: 541 GRWSDVARMRKLINDRCEQKTPGLSWVEIAG----YNHLFISGDRSHPQSLDLYAMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD    A  VDIVP+
Sbjct: 601 LNTMKKDYKFTASQVDIVPE 616
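
The percentages in each HSP header are ratios of matching columns to alignment length. As a minimal sketch (the function name `hsp_percent` is hypothetical, and rounding to two decimals is assumed to mirror how the report displays values), the figures for the hit above can be reproduced as follows:

```python
def hsp_percent(matches: int, alignment_length: int) -> float:
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts taken from the HSP header of the Cucumis sativus hit above.
identity = hsp_percent(543, 620)
positives = hsp_percent(568, 620)

print(identity, positives)
```

Positives count identical residues plus conservative substitutions (the `+` marks in the alignment midline), so this value is never lower than the identity value.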

BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_008458191.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucumis melo])

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 528/612 (86.27%), Positives = 558/612 (91.18%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IPR T  HS+++KLGLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA ++FDEM RRN VSWNTVICGLVD GYG +FK RQ  IFLYFKKMLMG+V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSA+VDFY KCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEGFKGD+FTFSSLLSSCKYKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSL++VYAKNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKLFRRMFG+DYC DELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTWTSIICGLA CGLEKDAV+LFDKMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+ EHLTCLIDLL RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+WAMEF SEP+E VNYSLVSNMYASE
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARM KL+ D CE+K PGLSWVEIAG    YNHLF SGDRSHPQSSDLY MLGLL
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAG----YNHLFKSGDRSHPQSSDLYAMLGLL 610

Query: 601 LNTMKKDDNSAA 613
           LNTMK+D  S A
Sbjct: 611 LNTMKEDYKSTA 618

BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_022958961.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] >XP_022958962.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata] >XP_022958963.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 1040.0 bits (2688), Expect = 3.2e-300
Identity = 511/620 (82.42%), Positives = 552/620 (89.03%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG  RLVHSFS N LKAAA +N IPRGT+ HSL+IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCCRLVHSFSFNVLKAAADLNSIPRGTKLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+V+CGYG +FK+R+ SI   FK MLM MV
Sbjct: 61  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVNCGYGGEFKMRERSILSCFKNMLMDMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+KIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFS +LY+DLVLWNVMLYCYVFNCL +EAIE+F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAKEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH  LIK SFDLDILVASSLVN+YAKN++LYDARK FDEMP RNSVSWTTMIVG
Sbjct: 241 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKAFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRMF EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS AL+CFRLIAEPDLV+WTSIICG A CGLEK AVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGTISPALRCFRLIAEPDLVSWTSIICGFAFCGLEKAAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSACSHGGFV+MGLHYFNLMT +YQIVP+SEHLTCLIDL+ RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS ++EAGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SNMYASE
Sbjct: 481 EAFKLLKSVSEEAGPDAFRSFIRACRTHGRLRLAKWAMEFASDPCKPVNSSLMSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+KDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLLKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYEMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNT+KKD  S A  +DI P+
Sbjct: 601 LNTVKKDYKSTASNIDIEPE 616

BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_023548540.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo] >XP_023548541.1 pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1039.6 bits (2687), Expect = 4.1e-300
Identity = 511/620 (82.42%), Positives = 550/620 (88.71%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG  RLVHSFS N LKAAA V  I RGT+ HSL+IKLGLANELSVQNKLLK+
Sbjct: 34  MLIWPSTHFGCCRLVHSFSFNVLKAAADVKSISRGTKLHSLVIKLGLANELSVQNKLLKV 93

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+VDCGYG +F++R+ SI   FK MLM MV
Sbjct: 94  YVKCRDLGRARNLFDEMRRRNVVSWNTVICGVVDCGYGGEFRMRERSILSCFKNMLMDMV 153

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+KIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 154 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKIGFDLDCFVGSAVVDFYAKCGLYEDARL 213

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFS +LY+DLVLWNVMLYCYVFNCL +EAIE+F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 214 AFSSVLYKDLVLWNVMLYCYVFNCLAKEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 273

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH  LIK SFDLDILVASSLVN+YAKN++LYDARKVFDEMP RNSVSWTTMIVG
Sbjct: 274 GELGKQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 333

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRMF EDY PDELTFASVLSSCGFT GACEL+QVHSCLIKLG EAF
Sbjct: 334 YGQQEHGKEAVKLLRRMFEEDYYPDELTFASVLSSCGFTSGACELIQVHSCLIKLGFEAF 393

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS ALQCFRLIAEPDLV+WTSIICGLA CGLEKDAVELFDKMLS
Sbjct: 394 LSVNNGLINAYSKCGAISPALQCFRLIAEPDLVSWTSIICGLAFCGLEKDAVELFDKMLS 453

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
             I+PDKIAFLGVLSACSHGGFV+MGLHYFNLMT +YQIVP+SEHLTCLIDL+ RAGSLD
Sbjct: 454 QAIRPDKIAFLGVLSACSHGGFVNMGLHYFNLMTNEYQIVPDSEHLTCLIDLIGRAGSLD 513

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF+LLKS ++EAGPDAFR+FIRACRTHG L LA+WAMEFAS+P + VN SL+SNMYASE
Sbjct: 514 EAFNLLKSVSEEAGPDAFRSFIRACRTHGHLGLAKWAMEFASDPYKPVNCSLMSNMYASE 573

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVA MRKLMKD CE K PG SW+EIAG    YNH FVS DRSHPQSSDLY MLGLL
Sbjct: 574 GRWSDVAIMRKLMKDGCEPKVPGFSWIEIAG----YNHSFVSSDRSHPQSSDLYEMLGLL 633

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD  S A  +DI P+
Sbjct: 634 LNTMKKDYKSIASNIDIEPE 649

BLAST of Cla97C06G112990 vs. NCBI nr
Match: XP_023006538.1 (pentatricopeptide repeat-containing protein At2g46050, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1035.4 bits (2676), Expect = 7.8e-299
Identity = 511/620 (82.42%), Positives = 550/620 (88.71%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIWPS HFG SRLVHSFS N LKAAA VN IPRGTQ HSL+IKLGLANELSVQNKLLKI
Sbjct: 1   MLIWPSTHFGCSRLVHSFSFNVLKAAADVNSIPRGTQLHSLVIKLGLANELSVQNKLLKI 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL  A N+FDEM RRNVVSWNTVICG+VDCGYG +FK+R+ S    FK MLM MV
Sbjct: 61  YVKCRDLGRAWNLFDEMRRRNVVSWNTVICGVVDCGYGGEFKMRERSNLSCFKNMLMEMV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDGVTFNGLFRSC V+NDV SG+QLHGFV+K GFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGVTFNGLFRSCDVMNDVGSGKQLHGFVIKFGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFS +LY+DLVLWNVMLYCYVFNCL  EAIE+F LMQLEGF GDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSSVLYKDLVLWNVMLYCYVFNCLAEEAIEIFFLMQLEGFTGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELG QLH  LIK SFDLDILVASSLVN+YAKN++LYDARKVFDEMP RNSVSWTTMIVG
Sbjct: 241 GELGMQLHVHLIKHSFDLDILVASSLVNMYAKNNHLYDARKVFDEMPIRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKL RRM  EDY PDELTFASVLSSCGFT GA EL+QVHSCLIKLG EAF
Sbjct: 301 YGQQEHGKEAVKLLRRMLEEDYYPDELTFASVLSSCGFTSGASELIQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LS+NNGLINAYSKCG IS+AL+CFRLIAEPDLV+ TSIICGLA CG+EKDAVELFDKMLS
Sbjct: 361 LSVNNGLINAYSKCGAISSALRCFRLIAEPDLVSRTSIICGLAFCGVEKDAVELFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
            GI+PDKIAFLGVLSACSHGG+ +MGLHYFNLMT +YQIVP+SEHLTCLIDLL RAGSLD
Sbjct: 421 QGIRPDKIAFLGVLSACSHGGYANMGLHYFNLMTNEYQIVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           +AF LLKS +++AGPDAFR+FIRACRTHG LRLA+WAMEFAS+P + VN SL+SN+YASE
Sbjct: 481 EAFKLLKSVSEKAGPDAFRSFIRACRTHGHLRLAKWAMEFASDPYKPVNCSLMSNIYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKLMKDSCE K PG SW+EIAG    YNHLFVS DRSHPQSSDLY MLGLL
Sbjct: 541 GRWSDVARMRKLMKDSCEPKVPGFSWIEIAG----YNHLFVSSDRSHPQSSDLYAMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD  S A  +DI P+
Sbjct: 601 LNTMKKDYKSIASNIDIEPE 616

BLAST of Cla97C06G112990 vs. TrEMBL
Match: tr|A0A0A0K863|A0A0A0K863_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G005130 PE=4 SV=1)

HSP 1 Score: 1098.6 bits (2840), Expect = 0.0e+00
Identity = 543/620 (87.58%), Positives = 568/620 (91.61%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IP  T  HSL++KLGL NELSVQNKLL++
Sbjct: 1   MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPHDTLLHSLVVKLGLVNELSVQNKLLRV 60

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA N+FDEM+RRNVVSWNTVICGLVD GYG +FK+RQHSIFLYFKKMLMG+V
Sbjct: 61  YVKCRDLDSARNLFDEMARRNVVSWNTVICGLVDGGYGGEFKMRQHSIFLYFKKMLMGLV 120

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSAVVDFY KCGLYEDARL
Sbjct: 121 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSAVVDFYAKCGLYEDARL 180

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSCILYRDLVLWNVMLYC VFN L REAIEVF LMQLEGFKGDDFTFSSLLSSCKYKGS
Sbjct: 181 AFSCILYRDLVLWNVMLYCCVFNSLSREAIEVFRLMQLEGFKGDDFTFSSLLSSCKYKGS 240

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLH LLIKQSFDLDILVASSLVNVY KNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 241 GELGKQLHCLLIKQSFDLDILVASSLVNVYTKNDNLYDARKVFDEMPTRNSVSWTTMIVG 300

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQ E GKEAVKLFRRMF +DYCPDELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 301 YGQHEYGKEAVKLFRRMFRKDYCPDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 360

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGII+ ALQCFRLIAEPDLVTWTSIICGLALCGLEKDAV+LFDKMLS
Sbjct: 361 LSINNGLIYAYSKCGIIAAALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVKLFDKMLS 420

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+SEHLTCLIDLL RAGSLD
Sbjct: 421 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDSEHLTCLIDLLGRAGSLD 480

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA RAFIRACRTHG+LRLA+ AMEFASEP+E VNYSLVSNMYASE
Sbjct: 481 QAFDLLKSMPKEAGPDALRAFIRACRTHGNLRLAKRAMEFASEPDEPVNYSLVSNMYASE 540

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARMRKL+ D CE+K PGLSWVEIAG    YNHLF+SGDRSHPQS DLY MLGLL
Sbjct: 541 GRWSDVARMRKLINDRCEQKTPGLSWVEIAG----YNHLFISGDRSHPQSLDLYAMLGLL 600

Query: 601 LNTMKKDDNSAALWVDIVPD 621
           LNTMKKD    A  VDIVP+
Sbjct: 601 LNTMKKDYKFTASQVDIVPE 616

BLAST of Cla97C06G112990 vs. TrEMBL
Match: tr|A0A1S3C6T7|A0A1S3C6T7_CUCME (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497699 PE=4 SV=1)

HSP 1 Score: 1076.6 bits (2783), Expect = 0.0e+00
Identity = 528/612 (86.27%), Positives = 558/612 (91.18%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IPR T  HS+++KLGLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA ++FDEM RRN VSWNTVICGLVD GYG +FK RQ  IFLYFKKMLMG+V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSA+VDFY KCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEGFKGD+FTFSSLLSSCKYKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSL++VYAKNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKLFRRMFG+DYC DELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTWTSIICGLA CGLEKDAV+LFDKMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+ EHLTCLIDLL RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+WAMEF SEP+E VNYSLVSNMYASE
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLL 600
           GRWSDVARM KL+ D CE+K PGLSWVEIAG    YNHLF SGDRSHPQSSDLY MLGLL
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAG----YNHLFKSGDRSHPQSSDLYAMLGLL 610

Query: 601 LNTMKKDDNSAA 613
           LNTMK+D  S A
Sbjct: 611 LNTMKEDYKSTA 618

BLAST of Cla97C06G112990 vs. TrEMBL
Match: tr|A0A1S3C796|A0A1S3C796_CUCME (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497699 PE=4 SV=1)

HSP 1 Score: 1029.2 bits (2660), Expect = 3.7e-297
Identity = 504/583 (86.45%), Positives = 533/583 (91.42%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           MLIW S HFGRSRLVHSFS N LKAAAPVN IPR T  HS+++KLGLANELSVQNKLLK+
Sbjct: 11  MLIWTSTHFGRSRLVHSFSFNVLKAAAPVNSIPRDTLLHSVVVKLGLANELSVQNKLLKV 70

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           YVKCRDL+SA ++FDEM RRN VSWNTVICGLVD GYG +FK RQ  IFLYFKKMLMG+V
Sbjct: 71  YVKCRDLDSARSLFDEMPRRNAVSWNTVICGLVDGGYGGEFKTRQRLIFLYFKKMLMGLV 130

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
            PDG+TFNGLFRSCVVLNDVESGRQLH FVMKIGFDLDCFVGSA+VDFY KCGLYEDARL
Sbjct: 131 DPDGITFNGLFRSCVVLNDVESGRQLHSFVMKIGFDLDCFVGSALVDFYAKCGLYEDARL 190

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AFSC LY+DLVLWNVMLYCYVFN L REAIE F LMQLEGFKGD+FTFSSLLSSCKYKGS
Sbjct: 191 AFSCTLYKDLVLWNVMLYCYVFNSLSREAIEGFRLMQLEGFKGDEFTFSSLLSSCKYKGS 250

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
           GELGKQLHGLLIKQSFDLDILVASSL++VYAKNDNLYDARKVFDEMP+RNSVSWTTMIVG
Sbjct: 251 GELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDNLYDARKVFDEMPTRNSVSWTTMIVG 310

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YGQQE GKEAVKLFRRMFG+DYC DELTFASVLSSCGFT GA ELMQVHSCLIKLG EAF
Sbjct: 311 YGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSSCGFTSGASELMQVHSCLIKLGFEAF 370

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTWTSIICGLA CGLEKDAV+LFDKMLS
Sbjct: 371 LSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTWTSIICGLAFCGLEKDAVKLFDKMLS 430

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT QYQ+VP+ EHLTCLIDLL RAGSLD
Sbjct: 431 YGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTNQYQLVPDPEHLTCLIDLLGRAGSLD 490

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAMEFASEPNEQVNYSLVSNMYASE 540
           QAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+WAMEF SEP+E VNYSLVSNMYASE
Sbjct: 491 QAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAKWAMEFISEPDEPVNYSLVSNMYASE 550

Query: 541 GRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSG 584
           GRWSDVARM KL+ D CE+K PGLSWVEIAG    YNHLF SG
Sbjct: 551 GRWSDVARMHKLINDRCEQKTPGLSWVEIAG----YNHLFKSG 589

BLAST of Cla97C06G112990 vs. TrEMBL
Match: tr|A0A1S4E2S1|A0A1S4E2S1_CUCME (pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X3 OS=Cucumis melo OX=3656 GN=LOC103497699 PE=4 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 4.2e-200
Identity = 348/397 (87.66%), Positives = 367/397 (92.44%), Query Frame = 0

Query: 216 MQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDN 275
           M+LEGFKGD+FTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSL++VYAKNDN
Sbjct: 4   MELEGFKGDEFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVASSLIDVYAKNDN 63

Query: 276 LYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSS 335
           LYDARKVFDEMP+RNSVSWTTMIVGYGQQE GKEAVKLFRRMFG+DYC DELTFASVLSS
Sbjct: 64  LYDARKVFDEMPTRNSVSWTTMIVGYGQQEYGKEAVKLFRRMFGKDYCLDELTFASVLSS 123

Query: 336 CGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAEPDLVTW 395
           CGFT GA ELMQVHSCLIKLG EAFLSINNGLI AYSKCGI++ ALQCFRLIAEPDLVTW
Sbjct: 124 CGFTSGASELMQVHSCLIKLGFEAFLSINNGLIYAYSKCGIVAAALQCFRLIAEPDLVTW 183

Query: 396 TSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTI 455
           TSIICGLA CGLEKDAV+LFDKMLSYGI+PDKIAFLGVLSACSHGGFVSMGLHYFNLMT 
Sbjct: 184 TSIICGLAFCGLEKDAVKLFDKMLSYGIRPDKIAFLGVLSACSHGGFVSMGLHYFNLMTN 243

Query: 456 QYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAE 515
           QYQ+VP+ EHLTCLIDLL RAGSLDQAFDLLKS  KEAGPDA  AFIRACRTHG+L+LA+
Sbjct: 244 QYQLVPDPEHLTCLIDLLGRAGSLDQAFDLLKSMRKEAGPDALTAFIRACRTHGNLKLAK 303

Query: 516 WAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITIC 575
           WAMEF SEP+E VNYSLVSNMYASEGRWSDVARM KL+ D CE+K PGLSWVEIAG    
Sbjct: 304 WAMEFISEPDEPVNYSLVSNMYASEGRWSDVARMHKLINDRCEQKTPGLSWVEIAG---- 363

Query: 576 YNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKDDNSAA 613
           YNHLF SGDRSHPQSSDLY MLGLLLNTMK+D  S A
Sbjct: 364 YNHLFKSGDRSHPQSSDLYAMLGLLLNTMKEDYKSTA 396

BLAST of Cla97C06G112990 vs. TrEMBL
Match: tr|A0A2N9F0N1|A0A2N9F0N1_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8236 PE=4 SV=1)

HSP 1 Score: 706.4 bits (1822), Expect = 5.5e-200
Identity = 348/611 (56.96%), Positives = 438/611 (71.69%), Query Frame = 0

Query: 1   MLIWPSIHFGRSRLVHSFSVNALKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKI 60
           M I  + HF      HSF  NALK  A +  +P+G Q H+ +IK G  N LS+QN++L +
Sbjct: 42  MPISMATHFTDPHSTHSFYSNALKVLAKMGFLPQGKQLHAHMIKFGFYNVLSLQNQILNV 101

Query: 61  YVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMV 120
           Y++C++   A  +F +M  RNVVSWNTVICG+VDC   ++ +L  +  F YF++ML+ +V
Sbjct: 102 YIRCKEFSDAQRLFGDMRVRNVVSWNTVICGVVDC--SSNNRLNLYLGFSYFRRMLLEIV 161

Query: 121 RPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARL 180
           RPD +TFNGLFR+C+ L+D    RQLH F++K+GFD++ FVGSA+V  Y  CG  EDAR 
Sbjct: 162 RPDDITFNGLFRACIELDDFVISRQLHCFIVKVGFDMNSFVGSALVKLYATCGFVEDARR 221

Query: 181 AFSCILYRDLVLWNVMLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGS 240
           AF  IL RDLVLWNVM+ CYV NCL +EA EVF LMQL+G KGD+FTFSSL S C   GS
Sbjct: 222 AFDGILSRDLVLWNVMVSCYVSNCLAKEAFEVFNLMQLKGVKGDEFTFSSLTSLCGTLGS 281

Query: 241 GELGKQLHGLLIKQSFDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVG 300
            +LGKQ+HGL+++ SFDLD+ VAS+L+++Y+K++++ DARK FD M  RN VSW TM+VG
Sbjct: 282 CDLGKQVHGLILRNSFDLDVQVASALIHMYSKSESINDARKAFDGMAIRNVVSWNTMVVG 341

Query: 301 YGQQEDGKEAVKLFRRMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAF 360
           YG   DGKEA++L R +    + PDELT  S+LSSCG      E+MQ H+C IKLG E F
Sbjct: 342 YGWHGDGKEAMQLLRELLEAGFYPDELTLTSILSSCGNLSATSEVMQAHACTIKLGFEVF 401

Query: 361 LSINNGLINAYSKCGIISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLS 420
           LSI+N LINAYSKCG I  A QCF  + EPDLVTWTSI+C  A  GL K+A +LFDKMLS
Sbjct: 402 LSISNALINAYSKCGTILGAYQCFSSVLEPDLVTWTSIVCAYAFHGLTKEATKLFDKMLS 461

Query: 421 YGIKPDKIAFLGVLSACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLD 480
           YGI PD I FLGV SAC+HGG V  GLHYFNLMT  YQIVP+SEH  CLIDLL R G LD
Sbjct: 462 YGIWPDPIIFLGVFSACNHGGLVKKGLHYFNLMTSYYQIVPDSEHYACLIDLLGRFGLLD 521

Query: 481 QAFDLLKSTAKEAGPDAFRAFIRACRTHGDLRLAEWAME--FASEPNEQVNYSLVSNMYA 540
           +AF++L S   E G +   AFI AC+ H +L LA+WA E  FA EPN  VNY+L+SN+YA
Sbjct: 522 EAFNVLTSMPMEPGSNTLGAFIGACKVHKNLGLAKWAAEKLFALEPNNPVNYTLMSNLYA 581

Query: 541 SEGRWSDVARMRKLMKDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLG 600
           SE  W DVAR+RK+M+D C  K PG SW+EIAG      H FVS D+SHPQ+ ++Y +LG
Sbjct: 582 SERLWHDVARVRKMMRDRCNSKVPGYSWIEIAGNV----HTFVSSDKSHPQALEVYVILG 641

Query: 601 LLLNTMKKDDN 610
            LL  MK D++
Sbjct: 642 TLLRLMKVDNH 646
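
The percentages in the HSP summary line follow directly from the match counts and the alignment length. The sketch below recomputes them for the Fagus sylvatica hit above; the helper name `parse_hsp_counts` and the regular expression are illustrative assumptions and not part of any BLAST tool.

```python
import re

# Summary values of the Fagus sylvatica hit, written as a BLAST-style line.
hsp_line = "Identity = 348/611 (56.96%), Positives = 438/611 (71.69%), Query Frame = 0"

def parse_hsp_counts(line):
    """Return {label: (matches, alignment_length, reported_percent)}.

    Hypothetical helper: it only understands the 'Label = a/b (p%)' pattern
    used in the summary lines of this record.
    """
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")
    return {
        label: (int(hits), int(length), float(pct))
        for label, hits, length, pct in pattern.findall(line)
    }

counts = parse_hsp_counts(hsp_line)
for label, (hits, length, reported) in counts.items():
    # Percent = matching columns / total alignment columns (gaps included).
    recomputed = round(100 * hits / length, 2)
    print(f"{label}: {hits}/{length} = {recomputed}% (reported {reported}%)")
```

The denominator is the alignment length (611 columns here, gaps included), not the length of the query or the subject protein.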

BLAST of Cla97C06G112990 vs. Swiss-Prot
Match: sp|O82363|PP203_ARATH (Pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E39 PE=3 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 2.1e-109
Identity = 224/549 (40.80%), Positives = 314/549 (57.19%), Query Frame = 0

Query: 24  KAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVV 83
           K +A ++ +    Q H  ++K G+ N L +QNKLL+ Y K R+ + A  +FDEM  RN+V
Sbjct: 44  KLSASLDHLSDVKQEHGFMVKQGIYNSLFLQNKLLQAYTKIREFDDADKLFDEMPLRNIV 103

Query: 84  SWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESG 143
           +WN +I G++      D   R H  F Y  ++L   V  D V+F GL R C    ++++G
Sbjct: 104 TWNILIHGVIQ--RDGDTNHRAHLGFCYLSRILFTDVSLDHVSFMGLIRLCTDSTNMKAG 163

Query: 144 RQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFN 203
            QLH  ++K G +  CF  +++V FYGKCGL  +AR  F  +L RDLVLWN ++  YV N
Sbjct: 164 IQLHCLMVKQGLESSCFPSTSLVHFYGKCGLIVEARRVFEAVLDRDLVLWNALVSSYVLN 223

Query: 204 CLGREAIEVFCLM--QLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 263
            +  EA  +  LM      F+GD FTFSSLLS+C+     E GKQ+H +L K S+  DI 
Sbjct: 224 GMIDEAFGLLKLMGSDKNRFRGDYFTFSSLLSACRI----EQGKQIHAILFKVSYQFDIP 283

Query: 264 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 323
           VA++L+N+YAK+++L DAR+ F+ M  RN VSW  MIVG+ Q  +G+EA++LF +M  E+
Sbjct: 284 VATALLNMYAKSNHLSDARECFESMVVRNVVSWNAMIVGFAQNGEGREAMRLFGQMLLEN 343

Query: 324 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 383
             PDELTFASVLSSC       E+ QV + + K G   FLS+ N LI++YS+ G +S AL
Sbjct: 344 LQPDELTFASVLSSCAKFSAIWEIKQVQAMVTKKGSADFLSVANSLISSYSRNGNLSEAL 403

Query: 384 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 443
            CF  I EPDLV+WTS+I  LA  G  ++++++F+ ML   +                  
Sbjct: 404 LCFHSIREPDLVSWTSVIGALASHGFAEESLQMFESMLQ-KLXXXXXXXXXXXXXXXXXX 463

Query: 444 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 503
                          Y+I    EH TCLIDLL RAG +D+A D+L S   E    A  AF
Sbjct: 464 XXXXXXXXXXXXXEFYKIEAEDEHYTCLIDLLGRAGFIDEASDVLNSMPTEPSTHALAAF 523

Query: 504 IRACRTHGDLRLAEWAME--FASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC-E 563
              C  H      +W  +     EP + VNYS++SN Y SEG W+  A +RK  + +C  
Sbjct: 524 TGGCNIHEKRESMKWGAKKLLEIEPTKPVNYSILSNAYVSEGHWNQAALLRKRERRNCYN 583

Query: 564 RKAPGLSWV 568
            K PG SW+
Sbjct: 584 PKTPGCSWL 585

BLAST of Cla97C06G112990 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 2.3e-92
Identity = 196/590 (33.22%), Positives = 318/590 (53.90%), Query Frame = 0

Query: 23  LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNV 82
           L  A  V+ +  G Q H + +KLGL   L+V N L+ +Y K R    A  VFD MS R++
Sbjct: 322 LATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDL 381

Query: 83  VSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLND-VE 142
           +SWN+VI G+   G      L   ++ L+ + +  G+ +PD  T   + ++   L + + 
Sbjct: 382 ISWNSVIAGIAQNG------LEVEAVCLFMQLLRCGL-KPDQYTMTSVLKAASSLPEGLS 441

Query: 143 SGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYV 202
             +Q+H   +KI    D FV +A++D Y +    ++A + F    + DLV WN M+  Y 
Sbjct: 442 LSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYT 501

Query: 203 FNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 262
            +  G + +++F LM  +G + DDFT +++  +C +  +   GKQ+H   IK  +DLD+ 
Sbjct: 502 QSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLW 561

Query: 263 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 322
           V+S ++++Y K  ++  A+  FD +P  + V+WTTMI G  +  + + A  +F +M    
Sbjct: 562 VSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMG 621

Query: 323 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 382
             PDE T A++  +        +  Q+H+  +KL       +   L++ Y+KCG I  A 
Sbjct: 622 VLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAY 681

Query: 383 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 442
             F+ I   ++  W +++ GLA  G  K+ ++LF +M S GIKPDK+ F+GVLSACSH G
Sbjct: 682 CLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSG 741

Query: 443 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 502
            VS    +   M   Y I P  EH +CL D L RAG + QA +L++S + EA    +R  
Sbjct: 742 LVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTL 801

Query: 503 IRACRTHGDL----RLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC 562
           + ACR  GD     R+A   +E   EP +   Y L+SNMYA+  +W ++   R +MK   
Sbjct: 802 LAACRVQGDTETGKRVATKLLEL--EPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHK 861

Query: 563 ERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKD 608
            +K PG SW+E+        H+FV  DRS+ Q+  +Y  +  ++  +K++
Sbjct: 862 VKKDPGFSWIEVKNKI----HIFVVDDRSNRQTELIYRKVKDMIRDIKQE 897

BLAST of Cla97C06G112990 vs. Swiss-Prot
Match: sp|P0C898|PP232_ARATH (Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H86 PE=3 SV=1)

HSP 1 Score: 335.5 bits (859), Expect = 1.3e-90
Identity = 188/561 (33.51%), Positives = 298/561 (53.12%), Query Frame = 0

Query: 34  RGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLV 93
           +G Q H  ++K G    L   N L+ +Y KCR+   A  VFD M  RNVVSW+ ++ G V
Sbjct: 24  QGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPERNVVSWSALMSGHV 83

Query: 94  DCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKI 153
                 D K         F +M    + P+  TF+   ++C +LN +E G Q+HGF +KI
Sbjct: 84  ---LNGDLK----GSLSLFSEMGRQGIYPNEFTFSTNLKACGLLNALEKGLQIHGFCLKI 143

Query: 154 GFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFNCLGREAIEVF 213
           GF++   VG+++VD Y KCG   +A   F  I+ R L+ WN M+  +V    G +A++ F
Sbjct: 144 GFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGYGSKALDTF 203

Query: 214 CLMQLEGFK--GDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDL--DILVASSLVNV 273
            +MQ    K   D+FT +SLL +C   G    GKQ+HG L++  F       +  SLV++
Sbjct: 204 GMMQEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSATITGSLVDL 263

Query: 274 YAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTF 333
           Y K   L+ ARK FD++  +  +SW+++I+GY Q+ +  EA+ LF+R+   +   D    
Sbjct: 264 YVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQELNSQIDSFAL 323

Query: 334 ASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAE 393
           +S++          +  Q+ +  +KL      S+ N +++ Y KCG++  A +CF  +  
Sbjct: 324 SSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEAEKCFAEMQL 383

Query: 394 PDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHY 453
            D+++WT +I G    GL K +V +F +ML + I+PD++ +L VLSACSH G +  G   
Sbjct: 384 KDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHSGMIKEGEEL 443

Query: 454 FNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHG 513
           F+ +   + I P  EH  C++DLL RAG L +A  L+ +   +     ++  +  CR HG
Sbjct: 444 FSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQTLLSLCRVHG 503

Query: 514 DLRLAE--WAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWV 573
           D+ L +    +    +     NY ++SN+Y   G W++    R+L      +K  G+SWV
Sbjct: 504 DIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGLKKEAGMSWV 563

Query: 574 EIAGITICYNHLFVSGDRSHP 589
           EI        H F SG+ SHP
Sbjct: 564 EIEREV----HFFRSGEDSHP 573

BLAST of Cla97C06G112990 vs. Swiss-Prot
Match: sp|Q9LRV9|PP228_ARATH (Pentatricopeptide repeat-containing protein At3g13880 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E89 PE=2 SV=1)

HSP 1 Score: 326.6 bits (836), Expect = 5.8e-88
Identity = 195/582 (33.51%), Positives = 305/582 (52.41%), Query Frame = 0

Query: 35  GTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVD 94
           G   H L++  GL+ ++ + N L+ +Y KC  L+ A ++FD    R+ VSWN++I G V 
Sbjct: 167 GELLHGLVVVNGLSQQVFLINVLIDMYSKCGKLDQAMSLFDRCDERDQVSWNSLISGYVR 226

Query: 95  CGYGAD---FKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLND--VESGRQLHGF 154
            G   +      + H   L      +G V         L   C+ LN+  +E G  +H +
Sbjct: 227 VGAAEEPLNLLAKMHRDGLNLTTYALGSV---------LKACCINLNEGFIEKGMAIHCY 286

Query: 155 VMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCY-----VFNC 214
             K+G + D  V +A++D Y K G  ++A   FS +  +++V +N M+  +     + + 
Sbjct: 287 TAKLGMEFDIVVRTALLDMYAKNGSLKEAIKLFSLMPSKNVVTYNAMISGFLQMDEITDE 346

Query: 215 LGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVAS 274
              EA ++F  MQ  G +    TFS +L +C    + E G+Q+H L+ K +F  D  + S
Sbjct: 347 ASSEAFKLFMDMQRRGLEPSPSTFSVVLKACSAAKTLEYGRQIHALICKNNFQSDEFIGS 406

Query: 275 SLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCP 334
           +L+ +YA   +  D  + F     ++  SWT+MI  + Q E  + A  LFR++F     P
Sbjct: 407 ALIELYALMGSTEDGMQCFASTSKQDIASWTSMIDCHVQNEQLESAFDLFRQLFSSHIRP 466

Query: 335 DELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCF 394
           +E T + ++S+C          Q+    IK G++AF S+    I+ Y+K G +  A Q F
Sbjct: 467 EEYTVSLMMSACADFAALSSGEQIQGYAIKSGIDAFTSVKTSSISMYAKSGNMPLANQVF 526

Query: 395 RLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVS 454
             +  PD+ T++++I  LA  G   +A+ +F+ M ++GIKP++ AFLGVL AC HGG V+
Sbjct: 527 IEVQNPDVATYSAMISSLAQHGSANEALNIFESMKTHGIKPNQQAFLGVLIACCHGGLVT 586

Query: 455 MGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRA 514
            GL YF  M   Y+I PN +H TCL+DLL R G L  A +L+ S+  +  P  +RA + +
Sbjct: 587 QGLKYFQCMKNDYRINPNEKHFTCLVDLLGRTGRLSDAENLILSSGFQDHPVTWRALLSS 646

Query: 515 CRTHGD----LRLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERK 574
           CR + D     R+AE  ME   EP    +Y L+ N+Y   G  S    +R+LM+D   +K
Sbjct: 647 CRVYKDSVIGKRVAERLMEL--EPEASGSYVLLHNIYNDSGVNSSAEEVRELMRDRGVKK 706

Query: 575 APGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLN 603
            P LSW+ I   T    H F   D SHP S  +YTML  + N
Sbjct: 707 EPALSWIVIGNQT----HSFAVADLSHPSSQMIYTMLETMDN 733

BLAST of Cla97C06G112990 vs. Swiss-Prot
Match: sp|Q9SS60|PP210_ARATH (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 321.2 bits (822), Expect = 2.5e-86
Identity = 190/594 (31.99%), Positives = 317/594 (53.37%), Query Frame = 0

Query: 17  SFSVNA-LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFD 76
           SF+V++ L A   +  + +G   H   +K G+ + + V N L+ +Y+K R    A  VFD
Sbjct: 207 SFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFD 266

Query: 77  EMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCV 136
           EM  R+ VS+NT+ICG +        ++ + S+ ++ +   +   +PD +T + + R+C 
Sbjct: 267 EMDVRDSVSYNTMICGYL------KLEMVEESVRMFLEN--LDQFKPDLLTVSSVLRACG 326

Query: 137 VLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNV 196
            L D+   + ++ +++K GF L+  V + ++D Y KCG    AR  F+ +  +D V WN 
Sbjct: 327 HLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECKDTVSWNS 386

Query: 197 MLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQS 256
           ++  Y+ +    EA+++F +M +   + D  T+  L+S        + GK LH   IK  
Sbjct: 387 IISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHSNGIKSG 446

Query: 257 FDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFR 316
             +D+ V+++L+++YAK   + D+ K+F  M + ++V+W T+I    +  D    +++  
Sbjct: 447 ICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFATGLQVTT 506

Query: 317 RMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCG 376
           +M   +  PD  TF   L  C          ++H CL++ G E+ L I N LI  YSKCG
Sbjct: 507 QMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIEMYSKCG 566

Query: 377 IISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLS 436
            +  + + F  ++  D+VTWT +I    + G  + A+E F  M   GI PD + F+ ++ 
Sbjct: 567 CLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIY 626

Query: 437 ACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGP 496
           ACSH G V  GL  F  M   Y+I P  EH  C++DLLSR+  + +A + +++   +   
Sbjct: 627 ACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAMPIKPDA 686

Query: 497 DAFRAFIRACRTHGDLRLAEWAMEFASEPN-EQVNYS-LVSNMYASEGRWSDVARMRKLM 556
             + + +RACRT GD+  AE       E N +   YS L SN YA+  +W  V+ +RK +
Sbjct: 687 SIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVSLIRKSL 746

Query: 557 KDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKD 608
           KD    K PG SW+E+ G  +   H+F SGD S PQS  +Y  L +L + M K+
Sbjct: 747 KDKHITKNPGYSWIEV-GKNV---HVFSSGDDSAPQSEAIYKSLEILYSLMAKE 788

BLAST of Cla97C06G112990 vs. TAIR10
Match: AT2G46050.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 397.9 bits (1021), Expect = 1.1e-110
Identity = 224/549 (40.80%), Positives = 314/549 (57.19%), Query Frame = 0

Query: 24  KAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVV 83
           K +A ++ +    Q H  ++K G+ N L +QNKLL+ Y K R+ + A  +FDEM  RN+V
Sbjct: 44  KLSASLDHLSDVKQEHGFMVKQGIYNSLFLQNKLLQAYTKIREFDDADKLFDEMPLRNIV 103

Query: 84  SWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESG 143
           +WN +I G++      D   R H  F Y  ++L   V  D V+F GL R C    ++++G
Sbjct: 104 TWNILIHGVIQ--RDGDTNHRAHLGFCYLSRILFTDVSLDHVSFMGLIRLCTDSTNMKAG 163

Query: 144 RQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFN 203
            QLH  ++K G +  CF  +++V FYGKCGL  +AR  F  +L RDLVLWN ++  YV N
Sbjct: 164 IQLHCLMVKQGLESSCFPSTSLVHFYGKCGLIVEARRVFEAVLDRDLVLWNALVSSYVLN 223

Query: 204 CLGREAIEVFCLM--QLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 263
            +  EA  +  LM      F+GD FTFSSLLS+C+     E GKQ+H +L K S+  DI 
Sbjct: 224 GMIDEAFGLLKLMGSDKNRFRGDYFTFSSLLSACRI----EQGKQIHAILFKVSYQFDIP 283

Query: 264 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 323
           VA++L+N+YAK+++L DAR+ F+ M  RN VSW  MIVG+ Q  +G+EA++LF +M  E+
Sbjct: 284 VATALLNMYAKSNHLSDARECFESMVVRNVVSWNAMIVGFAQNGEGREAMRLFGQMLLEN 343

Query: 324 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 383
             PDELTFASVLSSC       E+ QV + + K G   FLS+ N LI++YS+ G +S AL
Sbjct: 344 LQPDELTFASVLSSCAKFSAIWEIKQVQAMVTKKGSADFLSVANSLISSYSRNGNLSEAL 403

Query: 384 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 443
            CF  I EPDLV+WTS+I  LA  G  ++++++F+ ML   +                  
Sbjct: 404 LCFHSIREPDLVSWTSVIGALASHGFAEESLQMFESMLQ-KLXXXXXXXXXXXXXXXXXX 463

Query: 444 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 503
                          Y+I    EH TCLIDLL RAG +D+A D+L S   E    A  AF
Sbjct: 464 XXXXXXXXXXXXXEFYKIEAEDEHYTCLIDLLGRAGFIDEASDVLNSMPTEPSTHALAAF 523

Query: 504 IRACRTHGDLRLAEWAME--FASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC-E 563
              C  H      +W  +     EP + VNYS++SN Y SEG W+  A +RK  + +C  
Sbjct: 524 TGGCNIHEKRESMKWGAKKLLEIEPTKPVNYSILSNAYVSEGHWNQAALLRKRERRNCYN 583

Query: 564 RKAPGLSWV 568
            K PG SW+
Sbjct: 584 PKTPGCSWL 585
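
The At2g46050 alignment is reported twice in this record with the same bit score (397.9) but different E-values: 2.1e-109 against Swiss-Prot and 1.1e-110 against TAIR10. Under the standard Karlin-Altschul relation E = m * n * 2^(-S'), where S' is the bit score and m * n the search space, that difference reflects database size alone. The sketch below back-calculates the implied search spaces. It is an approximation under that textbook formula, since BLAST also applies length corrections, and the function name is a hypothetical helper.

```python
# Values reported in this record for the same query/subject alignment.
bit_score = 397.9
evalue_swissprot = 2.1e-109
evalue_tair10 = 1.1e-110

def implied_search_space(evalue, bits):
    """Search space m*n implied by E = m * n * 2**(-bits).

    Hypothetical helper; ignores the effective-length corrections that
    BLAST applies, so the result is only an order-of-magnitude estimate.
    """
    return evalue * 2.0 ** bits

space_sp = implied_search_space(evalue_swissprot, bit_score)
space_tair = implied_search_space(evalue_tair10, bit_score)

# With an identical bit score, the E-value ratio equals the search-space ratio.
ratio = space_sp / space_tair
print(f"Implied search space, Swiss-Prot: {space_sp:.2e}")
print(f"Implied search space, TAIR10:     {space_tair:.2e}")
print(f"Swiss-Prot / TAIR10 ratio:        {ratio:.1f}")
```

Because the bit score is normalized, E-values from the two databases should be compared only after accounting for this difference in search space.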

BLAST of Cla97C06G112990 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 341.3 bits (874), Expect = 1.3e-93
Identity = 196/590 (33.22%), Positives = 318/590 (53.90%), Query Frame = 0

Query: 23  LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNV 82
           L  A  V+ +  G Q H + +KLGL   L+V N L+ +Y K R    A  VFD MS R++
Sbjct: 322 LATAVKVDSLALGQQVHCMALKLGLDLMLTVSNSLINMYCKLRKFGFARTVFDNMSERDL 381

Query: 83  VSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLND-VE 142
           +SWN+VI G+   G      L   ++ L+ + +  G+ +PD  T   + ++   L + + 
Sbjct: 382 ISWNSVIAGIAQNG------LEVEAVCLFMQLLRCGL-KPDQYTMTSVLKAASSLPEGLS 441

Query: 143 SGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYV 202
             +Q+H   +KI    D FV +A++D Y +    ++A + F    + DLV WN M+  Y 
Sbjct: 442 LSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEAEILFERHNF-DLVAWNAMMAGYT 501

Query: 203 FNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDIL 262
            +  G + +++F LM  +G + DDFT +++  +C +  +   GKQ+H   IK  +DLD+ 
Sbjct: 502 QSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLW 561

Query: 263 VASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGED 322
           V+S ++++Y K  ++  A+  FD +P  + V+WTTMI G  +  + + A  +F +M    
Sbjct: 562 VSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMG 621

Query: 323 YCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTAL 382
             PDE T A++  +        +  Q+H+  +KL       +   L++ Y+KCG I  A 
Sbjct: 622 VLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAY 681

Query: 383 QCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGG 442
             F+ I   ++  W +++ GLA  G  K+ ++LF +M S GIKPDK+ F+GVLSACSH G
Sbjct: 682 CLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSG 741

Query: 443 FVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAF 502
            VS    +   M   Y I P  EH +CL D L RAG + QA +L++S + EA    +R  
Sbjct: 742 LVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTL 801

Query: 503 IRACRTHGDL----RLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSC 562
           + ACR  GD     R+A   +E   EP +   Y L+SNMYA+  +W ++   R +MK   
Sbjct: 802 LAACRVQGDTETGKRVATKLLEL--EPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHK 861

Query: 563 ERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKD 608
            +K PG SW+E+        H+FV  DRS+ Q+  +Y  +  ++  +K++
Sbjct: 862 VKKDPGFSWIEVKNKI----HIFVVDDRSNRQTELIYRKVKDMIRDIKQE 897

BLAST of Cla97C06G112990 vs. TAIR10
Match: AT3G15130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 335.5 bits (859), Expect = 7.0e-92
Identity = 188/561 (33.51%), Positives = 298/561 (53.12%), Query Frame = 0

Query: 34  RGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLV 93
           +G Q H  ++K G    L   N L+ +Y KCR+   A  VFD M  RNVVSW+ ++ G V
Sbjct: 24  QGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPERNVVSWSALMSGHV 83

Query: 94  DCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLNDVESGRQLHGFVMKI 153
                 D K         F +M    + P+  TF+   ++C +LN +E G Q+HGF +KI
Sbjct: 84  ---LNGDLK----GSLSLFSEMGRQGIYPNEFTFSTNLKACGLLNALEKGLQIHGFCLKI 143

Query: 154 GFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCYVFNCLGREAIEVF 213
           GF++   VG+++VD Y KCG   +A   F  I+ R L+ WN M+  +V    G +A++ F
Sbjct: 144 GFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGYGSKALDTF 203

Query: 214 CLMQLEGFK--GDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDL--DILVASSLVNV 273
            +MQ    K   D+FT +SLL +C   G    GKQ+HG L++  F       +  SLV++
Sbjct: 204 GMMQEANIKERPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSATITGSLVDL 263

Query: 274 YAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCPDELTF 333
           Y K   L+ ARK FD++  +  +SW+++I+GY Q+ +  EA+ LF+R+   +   D    
Sbjct: 264 YVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGLFKRLQELNSQIDSFAL 323

Query: 334 ASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCFRLIAE 393
           +S++          +  Q+ +  +KL      S+ N +++ Y KCG++  A +CF  +  
Sbjct: 324 SSIIGVFADFALLRQGKQMQALAVKLPSGLETSVLNSVVDMYLKCGLVDEAEKCFAEMQL 383

Query: 394 PDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVSMGLHY 453
            D+++WT +I G    GL K +V +F +ML + I+PD++ +L VLSACSH G +  G   
Sbjct: 384 KDVISWTVVITGYGKHGLGKKSVRIFYEMLRHNIEPDEVCYLAVLSACSHSGMIKEGEEL 443

Query: 454 FNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRACRTHG 513
           F+ +   + I P  EH  C++DLL RAG L +A  L+ +   +     ++  +  CR HG
Sbjct: 444 FSKLLETHGIKPRVEHYACVVDLLGRAGRLKEAKHLIDTMPIKPNVGIWQTLLSLCRVHG 503

Query: 514 DLRLAE--WAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERKAPGLSWV 573
           D+ L +    +    +     NY ++SN+Y   G W++    R+L      +K  G+SWV
Sbjct: 504 DIELGKEVGKILLRIDAKNPANYVMMSNLYGQAGYWNEQGNARELGNIKGLKKEAGMSWV 563

Query: 574 EIAGITICYNHLFVSGDRSHP 589
           EI        H F SG+ SHP
Sbjct: 564 EIEREV----HFFRSGEDSHP 573

BLAST of Cla97C06G112990 vs. TAIR10
Match: AT3G13880.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 326.6 bits (836), Expect = 3.2e-89
Identity = 195/582 (33.51%), Positives = 305/582 (52.41%), Query Frame = 0

Query: 35  GTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFDEMSRRNVVSWNTVICGLVD 94
           G   H L++  GL+ ++ + N L+ +Y KC  L+ A ++FD    R+ VSWN++I G V 
Sbjct: 167 GELLHGLVVVNGLSQQVFLINVLIDMYSKCGKLDQAMSLFDRCDERDQVSWNSLISGYVR 226

Query: 95  CGYGAD---FKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCVVLND--VESGRQLHGF 154
            G   +      + H   L      +G V         L   C+ LN+  +E G  +H +
Sbjct: 227 VGAAEEPLNLLAKMHRDGLNLTTYALGSV---------LKACCINLNEGFIEKGMAIHCY 286

Query: 155 VMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNVMLYCY-----VFNC 214
             K+G + D  V +A++D Y K G  ++A   FS +  +++V +N M+  +     + + 
Sbjct: 287 TAKLGMEFDIVVRTALLDMYAKNGSLKEAIKLFSLMPSKNVVTYNAMISGFLQMDEITDE 346

Query: 215 LGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQSFDLDILVAS 274
              EA ++F  MQ  G +    TFS +L +C    + E G+Q+H L+ K +F  D  + S
Sbjct: 347 ASSEAFKLFMDMQRRGLEPSPSTFSVVLKACSAAKTLEYGRQIHALICKNNFQSDEFIGS 406

Query: 275 SLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFRRMFGEDYCP 334
           +L+ +YA   +  D  + F     ++  SWT+MI  + Q E  + A  LFR++F     P
Sbjct: 407 ALIELYALMGSTEDGMQCFASTSKQDIASWTSMIDCHVQNEQLESAFDLFRQLFSSHIRP 466

Query: 335 DELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCGIISTALQCF 394
           +E T + ++S+C          Q+    IK G++AF S+    I+ Y+K G +  A Q F
Sbjct: 467 EEYTVSLMMSACADFAALSSGEQIQGYAIKSGIDAFTSVKTSSISMYAKSGNMPLANQVF 526

Query: 395 RLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLSACSHGGFVS 454
             +  PD+ T++++I  LA  G   +A+ +F+ M ++GIKP++ AFLGVL AC HGG V+
Sbjct: 527 IEVQNPDVATYSAMISSLAQHGSANEALNIFESMKTHGIKPNQQAFLGVLIACCHGGLVT 586

Query: 455 MGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGPDAFRAFIRA 514
            GL YF  M   Y+I PN +H TCL+DLL R G L  A +L+ S+  +  P  +RA + +
Sbjct: 587 QGLKYFQCMKNDYRINPNEKHFTCLVDLLGRTGRLSDAENLILSSGFQDHPVTWRALLSS 646

Query: 515 CRTHGD----LRLAEWAMEFASEPNEQVNYSLVSNMYASEGRWSDVARMRKLMKDSCERK 574
           CR + D     R+AE  ME   EP    +Y L+ N+Y   G  S    +R+LM+D   +K
Sbjct: 647 CRVYKDSVIGKRVAERLMEL--EPEASGSYVLLHNIYNDSGVNSSAEEVRELMRDRGVKK 706

Query: 575 APGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLN 603
            P LSW+ I   T    H F   D SHP S  +YTML  + N
Sbjct: 707 EPALSWIVIGNQT----HSFAVADLSHPSSQMIYTMLETMDN 733

BLAST of Cla97C06G112990 vs. TAIR10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 321.2 bits (822), Expect = 1.4e-87
Identity = 190/594 (31.99%), Positives = 317/594 (53.37%), Query Frame = 0

Query: 17  SFSVNA-LKAAAPVNPIPRGTQSHSLIIKLGLANELSVQNKLLKIYVKCRDLESAGNVFD 76
           SF+V++ L A   +  + +G   H   +K G+ + + V N L+ +Y+K R    A  VFD
Sbjct: 207 SFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFD 266

Query: 77  EMSRRNVVSWNTVICGLVDCGYGADFKLRQHSIFLYFKKMLMGMVRPDGVTFNGLFRSCV 136
           EM  R+ VS+NT+ICG +        ++ + S+ ++ +   +   +PD +T + + R+C 
Sbjct: 267 EMDVRDSVSYNTMICGYL------KLEMVEESVRMFLEN--LDQFKPDLLTVSSVLRACG 326

Query: 137 VLNDVESGRQLHGFVMKIGFDLDCFVGSAVVDFYGKCGLYEDARLAFSCILYRDLVLWNV 196
            L D+   + ++ +++K GF L+  V + ++D Y KCG    AR  F+ +  +D V WN 
Sbjct: 327 HLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECKDTVSWNS 386

Query: 197 MLYCYVFNCLGREAIEVFCLMQLEGFKGDDFTFSSLLSSCKYKGSGELGKQLHGLLIKQS 256
           ++  Y+ +    EA+++F +M +   + D  T+  L+S        + GK LH   IK  
Sbjct: 387 IISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHSNGIKSG 446

Query: 257 FDLDILVASSLVNVYAKNDNLYDARKVFDEMPSRNSVSWTTMIVGYGQQEDGKEAVKLFR 316
             +D+ V+++L+++YAK   + D+ K+F  M + ++V+W T+I    +  D    +++  
Sbjct: 447 ICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFATGLQVTT 506

Query: 317 RMFGEDYCPDELTFASVLSSCGFTCGACELMQVHSCLIKLGLEAFLSINNGLINAYSKCG 376
           +M   +  PD  TF   L  C          ++H CL++ G E+ L I N LI  YSKCG
Sbjct: 507 QMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIEMYSKCG 566

Query: 377 IISTALQCFRLIAEPDLVTWTSIICGLALCGLEKDAVELFDKMLSYGIKPDKIAFLGVLS 436
            +  + + F  ++  D+VTWT +I    + G  + A+E F  M   GI PD + F+ ++ 
Sbjct: 567 CLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIY 626

Query: 437 ACSHGGFVSMGLHYFNLMTIQYQIVPNSEHLTCLIDLLSRAGSLDQAFDLLKSTAKEAGP 496
           ACSH G V  GL  F  M   Y+I P  EH  C++DLLSR+  + +A + +++   +   
Sbjct: 627 ACSHSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAMPIKPDA 686

Query: 497 DAFRAFIRACRTHGDLRLAEWAMEFASEPN-EQVNYS-LVSNMYASEGRWSDVARMRKLM 556
             + + +RACRT GD+  AE       E N +   YS L SN YA+  +W  V+ +RK +
Sbjct: 687 SIWASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVSLIRKSL 746

Query: 557 KDSCERKAPGLSWVEIAGITICYNHLFVSGDRSHPQSSDLYTMLGLLLNTMKKD 608
           KD    K PG SW+E+ G  +   H+F SGD S PQS  +Y  L +L + M K+
Sbjct: 747 KDKHITKNPGYSWIEV-GKNV---HVFSSGDDSAPQSEAIYKSLEILYSLMAKE 788

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_011656346.1  0.0e+00   87.58     PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial ...
XP_008458191.1  0.0e+00   86.27     PREDICTED: pentatricopeptide repeat-containing protein At2g46050, mitochondrial ...
XP_022958961.1  3.2e-300  82.42     pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ...
XP_023548540.1  4.1e-300  82.42     pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ...
XP_023006538.1  7.8e-299  82.42     pentatricopeptide repeat-containing protein At2g46050, mitochondrial [Cucurbita ...

Match Name                      E-value   Identity  Description
tr|A0A0A0K863|A0A0A0K863_CUCSA  0.0e+00   87.58     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G005130 PE=4 SV=1
tr|A0A1S3C6T7|A0A1S3C6T7_CUCME  0.0e+00   86.27     pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X1 ...
tr|A0A1S3C796|A0A1S3C796_CUCME  3.7e-297  86.45     pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X2 ...
tr|A0A1S4E2S1|A0A1S4E2S1_CUCME  4.2e-200  87.66     pentatricopeptide repeat-containing protein At2g46050, mitochondrial isoform X3 ...
tr|A0A2N9F0N1|A0A2N9F0N1_FAGSY  5.5e-200  56.96     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8236 PE=4 SV=1

Match Name             E-value   Identity  Description
sp|O82363|PP203_ARATH  2.1e-109  40.80     Pentatricopeptide repeat-containing protein At2g46050, mitochondrial OS=Arabidop...
sp|Q9SMZ2|PP347_ARATH  2.3e-92   33.22     Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...
sp|P0C898|PP232_ARATH  1.3e-90   33.51     Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis th...
sp|Q9LRV9|PP228_ARATH  5.8e-88   33.51     Pentatricopeptide repeat-containing protein At3g13880 OS=Arabidopsis thaliana OX...
sp|Q9SS60|PP210_ARATH  2.5e-86   31.99     Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...

Match Name   E-value   Identity  Description
AT2G46050.1  1.1e-110  40.80     Pentatricopeptide repeat (PPR-like) superfamily protein
AT4G33170.1  1.3e-93   33.22     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15130.1  7.0e-92   33.51     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G13880.1  3.2e-89   33.51     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1  1.4e-87   31.99     Tetratricopeptide repeat (TPR)-like superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C06G112990.1  Cla97C06G112990.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 467..487; e-value: 0.62; score: 10.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 56..82; e-value: 0.036; score: 14.2
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 290..336; e-value: 1.7E-8; score: 34.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 189..235; e-value: 2.9E-7; score: 30.5
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 390..438; e-value: 1.3E-7; score: 31.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 393..426; e-value: 1.7E-7; score: 29.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 292..326; e-value: 4.0E-6; score: 24.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 264..290; e-value: 0.0015; score: 16.5
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 191..224; e-value: 8.2E-4; score: 17.4
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 462..497; score: 7.026
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 50..84; score: 8.495
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 259..289; score: 8.144
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 123..157; score: 7.015
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 189..223; score: 8.155
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 526..556; score: 6.533
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 290..324; score: 10.512
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 224..258; score: 6.522
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 325..359; score: 5.207
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 360..390; score: 5.053
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 426..461; score: 5.064
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 391..425; score: 11.838
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 141..242; e-value: 1.8E-13; score: 52.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 243..317; e-value: 6.2E-7; score: 31.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 19..140; e-value: 5.6E-12; score: 47.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                       coord: 340..581; e-value: 2.5E-32; score: 114.5
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 15..596
None       No IPR available                                   PANTHER  PTHR24015:SF398   SUBFAMILY NOT NAMED  coord: 15..596
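
The PROSITE PS51375 hits above are listed out of positional order, which makes it hard to see how the predicted PPR motifs are arranged along the protein. The sketch below sorts the twelve coordinate ranges and reports each motif's length and the gap to the next one. The coordinates are copied from this record; the function name `summarize_repeats` is a hypothetical helper, not part of InterProScan.

```python
# (start, end) residue coordinates of the PROSITE PS51375 (PPR) hits,
# in the order they appear in the InterPro annotation table.
ppr_hits = [
    (462, 497), (50, 84), (259, 289), (123, 157), (189, 223), (526, 556),
    (290, 324), (224, 258), (325, 359), (360, 390), (426, 461), (391, 425),
]

def summarize_repeats(hits):
    """Sort hits by position; return (ordered hits, lengths, gaps between neighbours)."""
    ordered = sorted(hits)
    # Coordinates are inclusive, so a hit 50..84 spans 35 residues.
    lengths = [end - start + 1 for start, end in ordered]
    # Gap = residues between the end of one hit and the start of the next.
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return ordered, lengths, gaps

ordered, lengths, gaps = summarize_repeats(ppr_hits)
for (start, end), length in zip(ordered, lengths):
    print(f"{start:>3}..{end:<3}  length {length}")
print("gaps between consecutive hits:", gaps)
```

A gap of zero means two motifs are directly adjacent, so a run of zeros marks a block of tandem repeats.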

The following gene(s) are paralogous to this gene:

None