Cla97C03G066720 (gene) Watermelon (97103) v2

Name: Cla97C03G066720
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr03 : 30239110 .. 30240564 (+)
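
The location given above can be turned into a span length with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Chrom : start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cla97Chr03 : 30239110 .. 30240564 (+)")

# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(chrom, strand, span, span % 3)
```

Under that assumption the span is 1455 bp, a multiple of three, which is consistent with a single uninterrupted coding region.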
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGCTTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACATACGAATTTCTTCCCGTCAACTTAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATTCCCCTCCCCTTACGCCAAACTAATCGCCCATTCCTGCAAGAAATCTTCCCCAGAAGCCATCGCCTACGCCCAGTTGATATTCCGGCACCACCAATACCCTCCAAGTCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCAACATTCCATCTCCACTTTTGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGATTTAACTTTCATTTTCGTGCTCGGAGCCTGCGCGCGAGCCCCATCGCTGTCTACGTTAATGATCGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACGATGATACATTTTTATGCGATTAACAAAGATGTGGGTATCGCACGGAAGGTGTTTGATGAAATGTGTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGTACTGCTCACAAAGTGGAAAGGTTGCTCAGAGATATGCCCGAGATTCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACAATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCATGAAACCGGCTCTTGTGTACATGCATATATCGAGAAGACAATTGATTCTCCCGAAAATGATCTGTTTATTGGCACTGGTTTGGTTAATATGTACTCGAAATGTGGGTGTATTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGACAGGACTGGCCGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAACTCATGGTGTAAAGCCAAACGCAGTAACTTTCACAAGTCTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCCGGGCACTTGAGAGAGGCATATGACTTGATACTTGGAATGCCAGTGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTTTGGTCCATGGCGATGTTCAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGCGAGAGTTCTGATGATGAGTGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTGTATGCTTCTGCTGAAAGGTGGGGGGATGTGGAGGCTGTAAGGGAGGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGATGTAGTTCGGTTCAAACTACGGGTTCTCAAGGCTTGGAGGTTTTATAG

mRNA sequence

ATGCGCTTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACATACGAATTTCTTCCCGTCAACTTAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATTCCCCTCCCCTTACGCCAAACTAATCGCCCATTCCTGCAAGAAATCTTCCCCAGAAGCCATCGCCTACGCCCAGTTGATATTCCGGCACCACCAATACCCTCCAAGTCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCAACATTCCATCTCCACTTTTGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGATTTAACTTTCATTTTCGTGCTCGGAGCCTGCGCGCGAGCCCCATCGCTGTCTACGTTAATGATCGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACGATGATACATTTTTATGCGATTAACAAAGATGTGGGTATCGCACGGAAGGTGTTTGATGAAATGTGTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGTACTGCTCACAAAGTGGAAAGGTTGCTCAGAGATATGCCCGAGATTCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACAATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCATGAAACCGGCTCTTGTGTACATGCATATATCGAGAAGACAATTGATTCTCCCGAAAATGATCTGTTTATTGGCACTGGTTTGGTTAATATGTACTCGAAATGTGGGTGTATTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGACAGGACTGGCCGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAACTCATGGTGTAAAGCCAAACGCAGTAACTTTCACAAGTCTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCCGGGCACTTGAGAGAGGCATATGACTTGATACTTGGAATGCCAGTGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTTTGGTCCATGGCGATGTTCAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGCGAGAGTTCTGATGATGAGTGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTGTATGCTTCTGCTGAAAGGTGGGGGGATGTGGAGGCTGTAAGGGAGGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGATGTAGTTCGGTTCAAACTACGGGTTCTCAAGGCTTGGAGGTTTTATAG

Coding sequence (CDS)

ATGCGCTTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACATACGAATTTCTTCCCGTCAACTTAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATTCCCCTCCCCTTACGCCAAACTAATCGCCCATTCCTGCAAGAAATCTTCCCCAGAAGCCATCGCCTACGCCCAGTTGATATTCCGGCACCACCAATACCCTCCAAGTCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCAACATTCCATCTCCACTTTTGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGATTTAACTTTCATTTTCGTGCTCGGAGCCTGCGCGCGAGCCCCATCGCTGTCTACGTTAATGATCGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACGATGATACATTTTTATGCGATTAACAAAGATGTGGGTATCGCACGGAAGGTGTTTGATGAAATGTGTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGTACTGCTCACAAAGTGGAAAGGTTGCTCAGAGATATGCCCGAGATTCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACAATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCATGAAACCGGCTCTTGTGTACATGCATATATCGAGAAGACAATTGATTCTCCCGAAAATGATCTGTTTATTGGCACTGGTTTGGTTAATATGTACTCGAAATGTGGGTGTATTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGACAGGACTGGCCGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAACTCATGGTGTAAAGCCAAACGCAGTAACTTTCACAAGTCTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCCGGGCACTTGAGAGAGGCATATGACTTGATACTTGGAATGCCAGTGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTTTGGTCCATGGCGATGTTCAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGCGAGAGTTCTGATGATGAGTGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTGTATGCTTCTGCTGAAAGGTGGGGGGATGTGGAGGCTGTAAGGGAGGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGATGTAGTTCGGTTCAAACTACGGGTTCTCAAGGCTTGGAGGTTTTATAG

Protein sequence

MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQGLEVL
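
The relationship between the coding sequence and the protein sequence can be checked by translating codons. This is a minimal sketch using the standard genetic code (NCBI translation table 1); the `translate` function and the two short fragments are illustrative, copied from the first 30 and last 18 nucleotides of the CDS above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Only unambiguous A/C/G/T codons are handled; anything else raises KeyError.
    """
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCGCTTAGCCCCAAGACTGAGGTGCATC"  # first 30 nt of the CDS
cds_end = "GGCTTGGAGGTTTTATAG"                # last 18 nt, ending in the TAG stop codon
print(translate(cds_start))  # MRLAPRLRCI
print(translate(cds_end))    # GLEVL
```

Both fragments agree with the start (`MRLAPRLRCI...`) and end (`...GLEVL`) of the protein sequence listed above.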
BLAST of Cla97C03G066720 vs. NCBI nr
Match: XP_011659935.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativus] >KGN66147.1 hypothetical protein Csa_1G573660 [Cucumis sativus])

HSP 1 Score: 879.4 bits (2271), Expect = 5.6e-252
Identity = 426/481 (88.57%), Positives = 453/481 (94.18%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CIHHL+N+RISS QL QIHAQLITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQY P+LFLFNTLIRCAPP HSIS FATWVST HFEFDD TFIFVLGACARAPS
Sbjct: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIWVQTTMIHFY+INKDVG ARK+FDEM +RNSVTWNAM
Sbjct: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQ GKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAASQLG+ ETGSC
Sbjct: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KT+DSPE D+FIGTGLVNMYSKCG +NSASSVFKQMKQ+NVLTWT+MATGLAVH
Sbjct: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYAS ERW DVEA+R+EMKIKGIENKAGCSS+QTTGSQG
Sbjct: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480

Query: 481 L 482
           L
Sbjct: 481 L 481
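
The identity and positives percentages in each HSP summary are simple ratios of matched columns to alignment length. The sketch below recomputes them from a summary line; `check_stats` is a hypothetical helper, and the pattern assumes the exact line layout used in this report.

```python
import re

# Matches "Identity = 426/481 (88.57%)" and the positives field.
# The optional "i" also tolerates the "Postives" spelling seen in some exports.
STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+\.\d+)%\)")

def check_stats(line):
    """Return {label: (matches, length, recomputed_percent)} for one HSP line."""
    out = {}
    for label, num, den, _reported in STATS.findall(line):
        key = "Positives" if label.startswith("Pos") else label
        out[key] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return out

stats = check_stats(
    "Identity = 426/481 (88.57%), Positives = 453/481 (94.18%), Query Frame = 0"
)
print(stats)
```

For the hit above, 426/481 recomputes to 88.57% and 453/481 to 94.18%, matching the reported values.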

BLAST of Cla97C03G066720 vs. NCBI nr
Match: XP_008450741.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo])

HSP 1 Score: 870.9 bits (2249), Expect = 2.0e-249
Identity = 426/481 (88.57%), Positives = 451/481 (93.76%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CI+HL+NIRISS QL QIHAQ ITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 9   MHLAPRLSCINHLSNIRISSLQLLQIHAQFITNGFKSPSPYAKLITHLCKKSSSESIAHA 68

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQ+ P+LFLFNTLIRCAPPQ+SIS FA WVSTPHFEFDD TFIFVLGACARAPS
Sbjct: 69  HLIFRHHQHSPNLFLFNTLIRCAPPQYSISIFANWVSTPHFEFDDFTFIFVLGACARAPS 128

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIW QTTMIHFY+ NKDVG ARKVFDEM VRNSVTWNAM
Sbjct: 129 VSTLMIGRQIHTHILKRGIVSNIWAQTTMIHFYSTNKDVGSARKVFDEMSVRNSVTWNAM 188

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQSGKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAAS LG+ ETG C
Sbjct: 189 IAGYCSQSGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASHLGMLETGVC 248

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KTIDSPE D+FIGTGLVNMYSKCG ++SASSVFKQMKQRNVLTWT+MATGLAVH
Sbjct: 249 VHAYIKKTIDSPEKDVFIGTGLVNMYSKCGLLSSASSVFKQMKQRNVLTWTSMATGLAVH 308

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 309 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 368

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY+LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 369 YGCIVDLLGRSGHLREAYELILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 428

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYASAERW DVEA+REEMKIKGIENKAG SSVQTTGSQG
Sbjct: 429 GGESFDDEWCVGSEDFVALSNVYASAERWDDVEALREEMKIKGIENKAGFSSVQTTGSQG 488

Query: 481 L 482
           L
Sbjct: 489 L 489

BLAST of Cla97C03G066720 vs. NCBI nr
Match: XP_023515866.1 (pentatricopeptide repeat-containing protein At3g18970 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 802.7 bits (2072), Expect = 6.6e-229
Identity = 396/478 (82.85%), Positives = 426/478 (89.12%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISS-RQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAY 60
           MRL+PRL CIHHLNNI  SS  QLKQIHAQLITN FK P PYAKLIAH C   SPEA AY
Sbjct: 1   MRLSPRLSCIHHLNNILSSSLLQLKQIHAQLITNAFKSPIPYAKLIAHFCNNHSPEATAY 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
           A LI  HH++P +LFLFNTLIRCAPPQHSIS FA  VST HFEFDD TFIF+LGACARAP
Sbjct: 61  AHLISLHHRHPTNLFLFNTLIRCAPPQHSISIFANSVSTAHFEFDDFTFIFLLGACARAP 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           S  TL  G+QIHTHILKRG+VSNIWVQTTMIHFYAINKDVG ARKVFDEM VRN+VTWNA
Sbjct: 121 SPPTLTTGKQIHTHILKRGVVSNIWVQTTMIHFYAINKDVGAARKVFDEMSVRNNVTWNA 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           MIAGYCSQSG+VAQ+Y R++LELFRGMLVESTNS+V PTDTTMVCLLSAASQLGV ETG 
Sbjct: 181 MIAGYCSQSGRVAQKYGREALELFRGMLVESTNSDVNPTDTTMVCLLSAASQLGVLETGV 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           CVHAYIEKTIDSPEND+FIGTGLVNMYSKCGC+NSASSVFKQMKQRNVLTWTAMATGLA+
Sbjct: 241 CVHAYIEKTIDSPENDVFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HG+GKEALELL+AMG HGVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQ
Sbjct: 301 HGKGKEALELLEAMGGHGVKPNAVTFTSLLSGCCHGGLIEEGLHLFDVMERKFGVVPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVDLLGR GHL+EAY++ILGMP+ PDGVLWR L+SSC+VH DV+MGE+VGK LVE 
Sbjct: 361 HYGCIVDLLGRCGHLKEAYEMILGMPMAPDGVLWRGLMSSCMVHCDVEMGEKVGKFLVES 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTG 478
             G    DEWC GSEDFVALSNVYASA+RW +V+AVREEMKIKGI+NK G SSVQT G
Sbjct: 421 VVG----DEWCDGSEDFVALSNVYASAQRWENVKAVREEMKIKGIQNKRGYSSVQTRG 474

BLAST of Cla97C03G066720 vs. NCBI nr
Match: XP_022159202.1 (pentatricopeptide repeat-containing protein At3g18970 [Momordica charantia])

HSP 1 Score: 776.9 bits (2005), Expect = 3.9e-221
Identity = 385/485 (79.38%), Positives = 426/485 (87.84%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRLRCIHHLNN   SS QLKQIHAQLITNGFK PSP+AKLIA  C KSSP+A A+A
Sbjct: 1   MHLAPRLRCIHHLNNTPNSSLQLKQIHAQLITNGFKSPSPFAKLIAQFCDKSSPQATAHA 60

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIF+HH+  P+LFL NT IR +PPQHSI  FA W ST   EFDD TFIFVLGA ARAPS
Sbjct: 61  HLIFKHHR--PNLFLLNTFIRSSPPQHSIPVFAKWASTGDLEFDDFTFIFVLGAAARAPS 120

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           L+ LMIG+QIHT I KRG++SNIWVQTT IHFYA NKDV  ARKVFDEM VRNSVTWNAM
Sbjct: 121 LAALMIGKQIHTQIFKRGVISNIWVQTTAIHFYASNKDVSSARKVFDEMSVRNSVTWNAM 180

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQS +VA++YAR++LELF  MLV+S NSEVKPTDTTMVCLLSA SQLGV ETG+ 
Sbjct: 181 IAGYCSQSERVAEKYARNALELFLVMLVDS-NSEVKPTDTTMVCLLSAVSQLGVVETGAS 240

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYIEKTI SPE+D+F+GTGLV+MYSKCGC++SASSVFKQMKQRNVLTWTAMATGLAVH
Sbjct: 241 VHAYIEKTIHSPESDVFVGTGLVDMYSKCGCLHSASSVFKQMKQRNVLTWTAMATGLAVH 300

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           G+GKE+L LLD+M  HGVKPNAVTFTSLLSACCHGGL+EEGLHLF VMERKFG VP MQH
Sbjct: 301 GKGKESLLLLDSMEAHGVKPNAVTFTSLLSACCHGGLVEEGLHLFRVMERKFGAVPHMQH 360

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHL+EAY+LI+GMP+EPD VLWRSLLSSC+ HGDV+MGE+VGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLKEAYELIMGMPMEPDAVLWRSLLSSCMSHGDVEMGEKVGKLLVERQ 420

Query: 421 GGESSDDEWCVG-SEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQ 480
            G    +EWCVG SEDFVALSNVYASA++WGDVEAVREEM+IKGIENK G SS+QTTG +
Sbjct: 421 RGRHGFEEWCVGSSEDFVALSNVYASAQKWGDVEAVREEMRIKGIENKPGYSSLQTTGPR 480

Query: 481 GLEVL 485
           GLE L
Sbjct: 481 GLEGL 482

BLAST of Cla97C03G066720 vs. NCBI nr
Match: XP_022988094.1 (pentatricopeptide repeat-containing protein At3g18970 [Cucurbita maxima])

HSP 1 Score: 762.7 bits (1968), Expect = 7.6e-217
Identity = 380/478 (79.50%), Positives = 410/478 (85.77%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISS-RQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAY 60
           M L+PRL CIHHLNNI  SS  QLKQI AQLITN FK PSPYA                 
Sbjct: 1   MHLSPRLSCIHHLNNILNSSLLQLKQIQAQLITNAFKSPSPYAXXXXXXXXXXXXXXXXX 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
                  H++P +LFLFNTLIRCAPPQHSIS FA  +ST HFEFDD TFIF+LGACARAP
Sbjct: 61  XXXXXXXHRHPTNLFLFNTLIRCAPPQHSISIFANSLSTAHFEFDDFTFIFLLGACARAP 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           S  TL  G+QIHTH+LKRG+VSNIWVQTTMIHFYAINKDVG ARKVFDEM VRN+VTWNA
Sbjct: 121 SPPTLTTGKQIHTHVLKRGVVSNIWVQTTMIHFYAINKDVGAARKVFDEMSVRNNVTWNA 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           MIAGYCSQSG+VAQ+Y R++LELFRGMLVESTNS+V PTDTTMVCLLSAASQLGV ETG 
Sbjct: 181 MIAGYCSQSGRVAQKYGREALELFRGMLVESTNSDVNPTDTTMVCLLSAASQLGVLETGV 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           CVHAYIEKT DSPEND+FIGTGLVNMYSKCGC+NSASSVFKQMKQRNVLTWTAMATGLA+
Sbjct: 241 CVHAYIEKTTDSPENDVFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HG+GKEALELL+AMG HGVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQ
Sbjct: 301 HGKGKEALELLEAMGGHGVKPNAVTFTSLLSGCCHGGLIEEGLHLFDVMERKFGVVPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVDLLGR GHL+EAY+LILGMPV PDGVLWRSL+SSC+VH DV+MGE+VGK LVE 
Sbjct: 361 HYGCIVDLLGRCGHLKEAYELILGMPVAPDGVLWRSLMSSCMVHCDVEMGEKVGKFLVES 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTG 478
             G    DEWC GSEDFVALSNVYASA+RW +V+AVREEMKIKGI+NKAG SSVQT G
Sbjct: 421 VVG----DEWCDGSEDFVALSNVYASAQRWENVKAVREEMKIKGIQNKAGYSSVQTRG 474

BLAST of Cla97C03G066720 vs. TrEMBL
Match: tr|A0A0A0LZ63|A0A0A0LZ63_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G573660 PE=4 SV=1)

HSP 1 Score: 879.4 bits (2271), Expect = 3.7e-252
Identity = 426/481 (88.57%), Positives = 453/481 (94.18%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CIHHL+N+RISS QL QIHAQLITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQY P+LFLFNTLIRCAPP HSIS FATWVST HFEFDD TFIFVLGACARAPS
Sbjct: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIWVQTTMIHFY+INKDVG ARK+FDEM +RNSVTWNAM
Sbjct: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQ GKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAASQLG+ ETGSC
Sbjct: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KT+DSPE D+FIGTGLVNMYSKCG +NSASSVFKQMKQ+NVLTWT+MATGLAVH
Sbjct: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYAS ERW DVEA+R+EMKIKGIENKAGCSS+QTTGSQG
Sbjct: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480

Query: 481 L 482
           L
Sbjct: 481 L 481
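
The TrEMBL and Swiss-Prot match lines follow the UniProt header layout: database (`tr` or `sp`), accession, entry name, then a description carrying `OS` (organism), `OX` (taxonomy identifier), `GN` (gene name), `PE` (protein existence level) and `SV` (sequence version). A minimal parsing sketch, assuming the `Match: ... (...)` wrapping used in this report; `parse_match` is a hypothetical helper, not a UniProt or BLAST API.

```python
import re

HEADER = re.compile(r"^(?:Match:\s*)?(sp|tr)\|([^|]+)\|(\S+)\s+\((.*)\)\s*$")
FIELD = re.compile(r"\b(OS|OX|GN|PE|SV)=(.*?)(?=\s(?:OS|OX|GN|PE|SV)=|$)")

def parse_match(line):
    """Split one match line into database, accession, entry name and tagged fields."""
    m = HEADER.match(line)
    if m is None:
        raise ValueError(f"unrecognized match line: {line!r}")
    db, accession, entry_name, desc = m.groups()
    # The protein name is everything before the first KEY=value field.
    protein_name = re.split(r"\s(?:OS|OX|GN|PE|SV)=", desc, maxsplit=1)[0]
    record = {
        "db": db,
        "accession": accession,
        "entry_name": entry_name,
        "protein_name": protein_name,
    }
    record.update(dict(FIELD.findall(desc)))
    return record

rec = parse_match(
    "Match: tr|A0A0A0LZ63|A0A0A0LZ63_CUCSA (Uncharacterized protein "
    "OS=Cucumis sativus OX=3659 GN=Csa_1G573660 PE=4 SV=1)"
)
print(rec)
```

Keeping the values as strings avoids guessing at types; callers can convert `OX`, `PE` and `SV` to integers if they need to.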

BLAST of Cla97C03G066720 vs. TrEMBL
Match: tr|A0A1S3BPA5|A0A1S3BPA5_CUCME (pentatricopeptide repeat-containing protein At3g18970 OS=Cucumis melo OX=3656 GN=LOC103492230 PE=4 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 1.3e-249
Identity = 426/481 (88.57%), Positives = 451/481 (93.76%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CI+HL+NIRISS QL QIHAQ ITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 9   MHLAPRLSCINHLSNIRISSLQLLQIHAQFITNGFKSPSPYAKLITHLCKKSSSESIAHA 68

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQ+ P+LFLFNTLIRCAPPQ+SIS FA WVSTPHFEFDD TFIFVLGACARAPS
Sbjct: 69  HLIFRHHQHSPNLFLFNTLIRCAPPQYSISIFANWVSTPHFEFDDFTFIFVLGACARAPS 128

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIW QTTMIHFY+ NKDVG ARKVFDEM VRNSVTWNAM
Sbjct: 129 VSTLMIGRQIHTHILKRGIVSNIWAQTTMIHFYSTNKDVGSARKVFDEMSVRNSVTWNAM 188

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQSGKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAAS LG+ ETG C
Sbjct: 189 IAGYCSQSGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASHLGMLETGVC 248

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KTIDSPE D+FIGTGLVNMYSKCG ++SASSVFKQMKQRNVLTWT+MATGLAVH
Sbjct: 249 VHAYIKKTIDSPEKDVFIGTGLVNMYSKCGLLSSASSVFKQMKQRNVLTWTSMATGLAVH 308

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 309 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 368

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY+LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 369 YGCIVDLLGRSGHLREAYELILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 428

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYASAERW DVEA+REEMKIKGIENKAG SSVQTTGSQG
Sbjct: 429 GGESFDDEWCVGSEDFVALSNVYASAERWDDVEALREEMKIKGIENKAGFSSVQTTGSQG 488

Query: 481 L 482
           L
Sbjct: 489 L 489

BLAST of Cla97C03G066720 vs. TrEMBL
Match: tr|M5WFE9|M5WFE9_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G343200 PE=4 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 7.9e-170
Identity = 308/486 (63.37%), Positives = 370/486 (76.13%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNG-FKFPSPYAKLIAHSCKKSSPEAI-A 60
           M   PR+R +  LN    S+ QLK+ HAQLIT+G  K P+ YAKLI      S P++   
Sbjct: 1   MHHLPRVRALFLLNLKLKSTHQLKRTHAQLITSGLLKSPTLYAKLIQQYGALSDPQSTNL 60

Query: 61  YAQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARA 120
           YA  +F+H    P+LFL NTLIRC  P+ SI  FA WVS     FDD T+ FVLGACAR 
Sbjct: 61  YAHFVFKHFD-EPNLFLLNTLIRCTQPKDSILVFANWVSKATLIFDDFTYKFVLGACARL 120

Query: 121 PSLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWN 180
           PS+STL++G QIH  I+K  +VSNI VQTT++HFYA NKD   AR+VFDEM V+NSVTWN
Sbjct: 121 PSVSTLLVGSQIHARIIKHDVVSNILVQTTLVHFYASNKDFVSARRVFDEMAVKNSVTWN 180

Query: 181 AMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETG 240
           AMI GYCSQ     +  ARD+L LFR ML +     VKPTDTTMVC+LSAASQLGV ETG
Sbjct: 181 AMITGYCSQ-----RESARDALVLFRDMLDDVCG--VKPTDTTMVCVLSAASQLGVLETG 240

Query: 241 SCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLA 300
           +CVH YIEK I  P ND+FIGTGLV MYSKCGC++ A S+FK+MK++N+LTWTAMATGLA
Sbjct: 241 ACVHGYIEKAIWVPHNDVFIGTGLVGMYSKCGCVDGALSIFKRMKEKNILTWTAMATGLA 300

Query: 301 VHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQM 360
           +HG+G EAL LLD M  +G+KPNAVTFTSLLSACCH GL+EEGLHLFH+M+  F V+PQM
Sbjct: 301 IHGKGNEALVLLDVMEAYGIKPNAVTFTSLLSACCHSGLVEEGLHLFHMMKSNFDVMPQM 360

Query: 361 QHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVE 420
           QHYGCIVD+L R G+L+EAY+ ++GMPVEPD VLWRSLLS+C VHGDV MGE+VGK L+ 
Sbjct: 361 QHYGCIVDMLSRRGYLKEAYEFVVGMPVEPDAVLWRSLLSACKVHGDVAMGEKVGKKLLH 420

Query: 421 RQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 480
            Q  ++  D   + SED+VALSN+YASAERW DVE VR+EMK+KGIENKAGCSS+QT+ +
Sbjct: 421 IQSAQTCAD-LTLKSEDYVALSNIYASAERWEDVEMVRQEMKVKGIENKAGCSSIQTSSN 477

Query: 481 QGLEVL 485
               VL
Sbjct: 481 ISNHVL 477

BLAST of Cla97C03G066720 vs. TrEMBL
Match: tr|A0A2P5B3D3|A0A2P5B3D3_PARAD (Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_275160 PE=4 SV=1)

HSP 1 Score: 601.3 bits (1549), Expect = 1.9e-168
Identity = 308/477 (64.57%), Positives = 367/477 (76.94%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAY- 60
           M   PRLR I  L    +S  QLKQ HAQLI NG   PS  AKLI   C  S+ ++  + 
Sbjct: 1   MLFLPRLRAIGFLRLKLLSIYQLKQAHAQLIVNGLNSPSLVAKLIQQYCSLSNQKSKYHN 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
           AQL+F+H    P+LFL NTLIRC+ P+ SI  FA WVS   FEFDD T+IFVLGACAR+P
Sbjct: 61  AQLVFKHFD-QPNLFLLNTLIRCSQPKESILVFAQWVSRGDFEFDDFTYIFVLGACARSP 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           S+ TL  GRQIH  I+K G +SNI VQTT+IH YA NKD+G AR+VFDEM VRN+VTWNA
Sbjct: 121 SVPTLWAGRQIHARIMKCGTISNIMVQTTVIHSYASNKDMGSARRVFDEMVVRNNVTWNA 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           MI GYCSQ G      A D+L LFR ML +   +  KPTDTT+VC+LSAASQ GV ETG+
Sbjct: 181 MITGYCSQKGS-----ACDALVLFRDMLDDVDGA--KPTDTTIVCILSAASQFGVLETGA 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           CVH YIEKT   PEND+FIGTGLV+MYSKCGC+NSA S+F +MK++N+LTWTAMATGLA+
Sbjct: 241 CVHGYIEKTNWVPENDVFIGTGLVDMYSKCGCLNSALSIFIRMKEKNILTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HG+GKEAL+LLDAMG  G+KPNAVTFTSLL ACCH GL+EEGLHLF+ M  KF + PQMQ
Sbjct: 301 HGKGKEALQLLDAMGPSGLKPNAVTFTSLLLACCHSGLVEEGLHLFYNMS-KFNITPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVDLL R+G L+EAY+ I+ MP+EPD +LWRSLLS+  +HGDV MGE+VGKLL++ 
Sbjct: 361 HYGCIVDLLSRTGLLKEAYEFIMAMPIEPDTILWRSLLSASRIHGDVSMGEKVGKLLLQI 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTT 477
              +SS D   V SED+VALSN+YAS  +W DVE +RE MK+KGI+NKAGCSS+QTT
Sbjct: 421 HQEQSSLD---VTSEDYVALSNIYASVGKWADVEMLRENMKVKGIDNKAGCSSIQTT 465

BLAST of Cla97C03G066720 vs. TrEMBL
Match: tr|A0A2P6R5Z3|A0A2P6R5Z3_ROSCH (Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0451131 PE=4 SV=1)

HSP 1 Score: 600.1 bits (1546), Expect = 4.3e-168
Identity = 308/485 (63.51%), Positives = 364/485 (75.05%), Query Frame = 0

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIA-Y 60
           M   PRLR +  LN    S+ QLKQ HAQLITNG K  S Y KLI   C  S PE+ + Y
Sbjct: 1   MHHLPRLRSLSLLNLKLKSTHQLKQAHAQLITNGLKSTSIYGKLIQQCCALSDPESTSLY 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
           A L+F+H    P+LFL NTLIRC  P+ SI  FA WVS     FDD T+ FVLGACAR P
Sbjct: 61  AHLVFKHFD-EPNLFLLNTLIRCTQPKDSIFLFANWVSKASLCFDDFTYKFVLGACARLP 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           S+ TL++GR++H  I+K GI+SNI VQTT++H YA NKD+  ARKVFDEM  R SVTWNA
Sbjct: 121 SIPTLVVGREVHARIVKEGIISNILVQTTLVHCYASNKDLDSARKVFDEMTERTSVTWNA 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           MI GY S      +  ARD+L LFR ML    +S VKPTDTT VC+L+AASQLGV ETG+
Sbjct: 181 MITGYSSH-----RESARDALLLFRDML--DCDSGVKPTDTTTVCVLAAASQLGVLETGA 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           CVH Y+EK + +P+ D+F+GTGLV+MYSKCG ++SA ++FK+MKQRNVLTWTAMATGLA+
Sbjct: 241 CVHGYVEKAMPAPDGDVFMGTGLVDMYSKCGSVDSALTIFKRMKQRNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HG+G EALELLD M  HG  PN VTFTSLL+ACCH GL+EEGLHLFH+M+ KFGV P MQ
Sbjct: 301 HGKGSEALELLDVMKAHGTNPNEVTFTSLLAACCHVGLVEEGLHLFHMMKTKFGVTPHMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVDLL RSGHL EAYD I+ MPVEPD VLWRSLLS+C VHG+V MGE+VG+ L+  
Sbjct: 361 HYGCIVDLLSRSGHLNEAYDFIIAMPVEPDAVLWRSLLSACKVHGNVAMGEKVGRKLLHI 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQ 480
           Q  +SS D     SED+VALSN+YA AE+W  VE VREEMK+ GIENKAG SSVQTT + 
Sbjct: 421 QLTQSSAD-GTPKSEDYVALSNIYAYAEKWDAVEMVREEMKVMGIENKAGSSSVQTTSNH 476

Query: 481 GLEVL 485
            L+ L
Sbjct: 481 ALDGL 476

BLAST of Cla97C03G066720 vs. Swiss-Prot
Match: sp|Q9LJ69|PP243_ARATH (Pentatricopeptide repeat-containing protein At3g18970 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E93 PE=2 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 2.4e-121
Identity = 236/457 (51.64%), Positives = 308/457 (67.40%), Query Frame = 0

Query: 22  QLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIA-YAQLIFRHHQYPPSLFLFNTLI 81
           Q KQIHAQL+ NG    S + KLI H C K S E+ +  A L+       P  FLFNTL+
Sbjct: 23  QAKQIHAQLVINGCHDNSLFGKLIGHYCSKPSTESSSKLAHLLVFPRFGHPDKFLFNTLL 82

Query: 82  RCAPPQHSISTFATWVSTPHFEF-DDLTFIFVLGACARAPSLSTLMIGRQIHTHILKRG- 141
           +C+ P+ SI  FA + S     + ++ TF+FVLGACAR+ S S L +GR +H  + K G 
Sbjct: 83  KCSKPEDSIRIFANYASKSSLLYLNERTFVFVLGACARSASSSALRVGRIVHGMVKKLGF 142

Query: 142 IVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQRYARD 201
           +  +  + TT++HFYA N D+  ARKVFDEM  R SVTWNAMI GYCS   K     AR 
Sbjct: 143 LYESELIGTTLLHFYAKNGDLRYARKVFDEMPERTSVTWNAMIGGYCSHKDK-GNHNARK 202

Query: 202 SLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTIDSPENDLFI 261
           ++ LFR        S V+PTDTTMVC+LSA SQ G+ E GS VH YIEK   +PE D+FI
Sbjct: 203 AMVLFRRF--SCCGSGVRPTDTTMVCVLSAISQTGLLEIGSLVHGYIEKLGFTPEVDVFI 262

Query: 262 GTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDAMGTHGV 321
           GT LV+MYSKCGC+N+A SVF+ MK +NV TWT+MATGLA++GRG E   LL+ M   G+
Sbjct: 263 GTALVDMYSKCGCLNNAFSVFELMKVKNVFTWTSMATGLALNGRGNETPNLLNRMAESGI 322

Query: 322 KPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAY 381
           KPN +TFTSLLSA  H GL+EEG+ LF  M+ +FGV P ++HYGCIVDLLG++G ++EAY
Sbjct: 323 KPNEITFTSLLSAYRHIGLVEEGIELFKSMKTRFGVTPVIEHYGCIVDLLGKAGRIQEAY 382

Query: 382 DLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSDDEWCVGS--EDF 441
             IL MP++PD +L RSL ++C ++G+  MGE +GK L+E +     +DE   GS  ED+
Sbjct: 383 QFILAMPIKPDAILLRSLCNACSIYGETVMGEEIGKALLEIE----REDEKLSGSECEDY 442

Query: 442 VALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSV 474
           VALSNV A   +W +VE +R+EMK + I+ + G S V
Sbjct: 443 VALSNVLAHKGKWVEVEKLRKEMKERRIKTRPGYSFV 472

BLAST of Cla97C03G066720 vs. Swiss-Prot
Match: sp|Q0WQW5|PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H51 PE=1 SV=2)

HSP 1 Score: 270.4 bits (690), Expect = 3.9e-71
Identity = 169/468 (36.11%), Positives = 248/468 (52.99%), Query Frame = 0

Query: 22  QLKQIHAQLITNGFKFPSPYAKLIAHS---CKKSSPEAIAYAQLIFRHHQYPPSLFLFNT 81
           QLKQ+HA   T    +P   A L  +       SS   + YA  +F   +   S F++NT
Sbjct: 63  QLKQLHA--FTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSS-FMWNT 122

Query: 82  LIR-CA----PPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLSTLMIGRQIHTH 141
           LIR CA      + +   +   +       D  TF FVL ACA     S    G+Q+H  
Sbjct: 123 LIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE---GKQVHCQ 182

Query: 142 ILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQ 201
           I+K G   +++V   +IH Y     + +ARKVFDEM  R+ V+WN+MI         V  
Sbjct: 183 IVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMI------DALVRF 242

Query: 202 RYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTID-SP 261
                +L+LFR M         +P   TM  +LSA + LG    G+  HA++ +  D   
Sbjct: 243 GEYDSALQLFREM-----QRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDV 302

Query: 262 ENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDA 321
             D+ +   L+ MY KCG +  A  VF+ M++R++ +W AM  G A HGR +EA+   D 
Sbjct: 303 AMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDR 362

Query: 322 M--GTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGR 381
           M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ R
Sbjct: 363 MVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIAR 422

Query: 382 SGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHG-DVQMGERVGKLLVERQGGESSDDEW 441
           +G++ EA D+++ MP++PD V+WRSLL +C   G  V++ E + + ++  +    S +  
Sbjct: 423 AGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGN 482

Query: 442 CVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTG 478
           C G+  +V LS VYASA RW DV  VR+ M   GI  + GCSS++  G
Sbjct: 483 CSGA--YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEING 511

BLAST of Cla97C03G066720 vs. Swiss-Prot
Match: sp|Q9FG85|PP415_ARATH (Pentatricopeptide repeat-containing protein At5g43790 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E30 PE=2 SV=1)

HSP 1 Score: 263.5 bits (672), Expect = 4.7e-69
Identity = 167/477 (35.01%), Positives = 258/477 (54.09%), Query Frame = 0

Query: 8   RCIHHLNNIRISSRQLKQIHAQLITNGFKFPS-PYAKLIAHSCKKSSPEAIAYAQLIFRH 67
           RC++ ++  + S + LKQIHAQ+IT G    + P +KL+      SS   ++YA  I R 
Sbjct: 11  RCLNLISKCK-SLQNLKQIHAQIITIGLSHHTYPLSKLL----HLSSTVCLSYALSILR- 70

Query: 68  HQYP-PSLFLFNTLIRCAPPQH-SISTFATWVSTPHFEFDDLTFI----FVLGACARAPS 127
            Q P PS+FL+NTLI      H S  T   +            F+    F   +  +A  
Sbjct: 71  -QIPNPSVFLYNTLISSIVSNHNSTQTHLAFSLYDQILSSRSNFVRPNEFTYPSLFKASG 130

Query: 128 LSTL--MIGRQIHTHILK--RGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVT 187
                   GR +H H+LK    +  + +VQ  ++ FYA    +  AR +F+ +   +  T
Sbjct: 131 FDAQWHRHGRALHAHVLKFLEPVNHDRFVQAALVGFYANCGKLREARSLFERIREPDLAT 190

Query: 188 WNAMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHE 247
           WN ++A Y +           + ++    +L+     +V+P + ++V L+ + + LG   
Sbjct: 191 WNTLLAAYANS----------EEIDSDEEVLLLFMRMQVRPNELSLVALIKSCANLGEFV 250

Query: 248 TGSCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATG 307
            G   H Y+ K  ++   + F+GT L+++YSKCGC++ A  VF +M QR+V  + AM  G
Sbjct: 251 RGVWAHVYVLK--NNLTLNQFVGTSLIDLYSKCGCLSFARKVFDEMSQRDVSCYNAMIRG 310

Query: 308 LAVHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVP 367
           LAVHG G+E +EL  ++ + G+ P++ TF   +SAC H GL++EGL +F+ M+  +G+ P
Sbjct: 311 LAVHGFGQEGIELYKSLISQGLVPDSATFVVTISACSHSGLVDEGLQIFNSMKAVYGIEP 370

Query: 368 QMQHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLL 427
           +++HYGC+VDLLGRSG L EA + I  MPV+P+  LWRS L S   HGD + GE   K L
Sbjct: 371 KVEHYGCLVDLLGRSGRLEEAEECIKKMPVKPNATLWRSFLGSSQTHGDFERGEIALKHL 430

Query: 428 VERQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSV 474
           +   G E  +      S ++V LSN+YA   RW DVE  RE MK   +    G S++
Sbjct: 431 L---GLEFEN------SGNYVLLSNIYAGVNRWTDVEKTRELMKDHRVNKSPGISTL 459

BLAST of Cla97C03G066720 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 259.2 bits (661), Expect = 9.0e-68
Identity = 165/476 (34.66%), Positives = 253/476 (53.15%), Query Frame = 0

Query: 12  HLNNIRISSR---QLKQIHAQLI-TNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHH 71
           HL ++ +SS     L+QIHA L+ T+  +    +   ++       P  I Y+  +F   
Sbjct: 13  HLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVF-SQ 72

Query: 72  QYPPSLFLFNTLIRC----APPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLST 131
           +  P+L   NT+IR       P      F +         + L+  F L  C ++     
Sbjct: 73  RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKS---GD 132

Query: 132 LMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAG 191
           L+ G QIH  I   G +S+  + TT++  Y+  ++   A KVFDE+  R++V+WN + + 
Sbjct: 133 LLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSC 192

Query: 192 YCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHA 251
           Y      +  +  RD L LF  M     +  VKP   T +  L A + LG  + G  VH 
Sbjct: 193 Y------LRNKRTRDVLVLFDKM-KNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHD 252

Query: 252 YIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRG 311
           +I++  +     L +   LV+MYS+CG ++ A  VF  M++RNV++WTA+ +GLA++G G
Sbjct: 253 FIDE--NGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFG 312

Query: 312 KEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYG 371
           KEA+E  + M   G+ P   T T LLSAC H GL+ EG+  F  M   +F + P + HYG
Sbjct: 313 KEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYG 372

Query: 372 CIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGG 431
           C+VDLLGR+  L +AY LI  M ++PD  +WR+LL +C VHGDV++GERV   L+E +  
Sbjct: 373 CVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAE 432

Query: 432 ESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 479
           E+          D+V L N Y++  +W  V  +R  MK K I  K GCS+++  G+
Sbjct: 433 EAG---------DYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGT 466

BLAST of Cla97C03G066720 vs. Swiss-Prot
Match: sp|Q9FX24|PPR71_ARATH (Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H68 PE=2 SV=2)

HSP 1 Score: 257.3 bits (656), Expect = 3.4e-67
Identity = 167/472 (35.38%), Positives = 262/472 (55.51%), Query Frame = 0

Query: 18  ISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHHQYPPSLFLFN 77
           +S  Q+KQ+ +  +T G    S     +   C  S    +++A  IFR+   P +   +N
Sbjct: 14  VSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIPKPLTND-WN 73

Query: 78  TLIR----CAPPQHSISTFATWV-----STPHFEFDDLTFIFVLGACARAPSLSTLMIGR 137
            +IR     + P  + S + + +     S+     D LT  F L ACARA   S +    
Sbjct: 74  AIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALCSSAM---D 133

Query: 138 QIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQS 197
           Q+H  I +RG+ ++  + TT++  Y+ N D+  A K+FDEM VR+  +WNA+IAG     
Sbjct: 134 QLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGL---- 193

Query: 198 GKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLG-VHETGSCVHAYIEK 257
             V+   A +++EL++ M  E     ++ ++ T+V  L A S LG V E  +  H Y   
Sbjct: 194 --VSGNRASEAMELYKRMETEG----IRRSEVTVVAALGACSHLGDVKEGENIFHGY--- 253

Query: 258 TIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMK-QRNVLTWTAMATGLAVHGRGKEA 317
                 +++ +    ++MYSKCG ++ A  VF+Q   +++V+TW  M TG AVHG    A
Sbjct: 254 ----SNDNVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRA 313

Query: 318 LELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVD 377
           LE+ D +  +G+KP+ V++ + L+AC H GL+E GL +F+ M  K GV   M+HYGC+VD
Sbjct: 314 LEIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACK-GVERNMKHYGCVVD 373

Query: 378 LLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSD 437
           LL R+G LREA+D+I  M + PD VLW+SLL +  ++ DV+M E   + + E   G ++D
Sbjct: 374 LLSRAGRLREAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEM--GVNND 433

Query: 438 DEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 479
                   DFV LSNVYA+  RW DV  VR++M+ K ++   G S ++  G+
Sbjct: 434 G-------DFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGT 454

BLAST of Cla97C03G066720 vs. TAIR10
Match: AT3G18970.1 (mitochondrial editing factor 20)

HSP 1 Score: 437.2 bits (1123), Expect = 1.3e-122
Identity = 236/457 (51.64%), Positives = 308/457 (67.40%), Query Frame = 0

Query: 22  QLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIA-YAQLIFRHHQYPPSLFLFNTLI 81
           Q KQIHAQL+ NG    S + KLI H C K S E+ +  A L+       P  FLFNTL+
Sbjct: 23  QAKQIHAQLVINGCHDNSLFGKLIGHYCSKPSTESSSKLAHLLVFPRFGHPDKFLFNTLL 82

Query: 82  RCAPPQHSISTFATWVSTPHFEF-DDLTFIFVLGACARAPSLSTLMIGRQIHTHILKRG- 141
           +C+ P+ SI  FA + S     + ++ TF+FVLGACAR+ S S L +GR +H  + K G 
Sbjct: 83  KCSKPEDSIRIFANYASKSSLLYLNERTFVFVLGACARSASSSALRVGRIVHGMVKKLGF 142

Query: 142 IVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQRYARD 201
           +  +  + TT++HFYA N D+  ARKVFDEM  R SVTWNAMI GYCS   K     AR 
Sbjct: 143 LYESELIGTTLLHFYAKNGDLRYARKVFDEMPERTSVTWNAMIGGYCSHKDK-GNHNARK 202

Query: 202 SLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTIDSPENDLFI 261
           ++ LFR        S V+PTDTTMVC+LSA SQ G+ E GS VH YIEK   +PE D+FI
Sbjct: 203 AMVLFRRF--SCCGSGVRPTDTTMVCVLSAISQTGLLEIGSLVHGYIEKLGFTPEVDVFI 262

Query: 262 GTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDAMGTHGV 321
           GT LV+MYSKCGC+N+A SVF+ MK +NV TWT+MATGLA++GRG E   LL+ M   G+
Sbjct: 263 GTALVDMYSKCGCLNNAFSVFELMKVKNVFTWTSMATGLALNGRGNETPNLLNRMAESGI 322

Query: 322 KPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAY 381
           KPN +TFTSLLSA  H GL+EEG+ LF  M+ +FGV P ++HYGCIVDLLG++G ++EAY
Sbjct: 323 KPNEITFTSLLSAYRHIGLVEEGIELFKSMKTRFGVTPVIEHYGCIVDLLGKAGRIQEAY 382

Query: 382 DLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSDDEWCVGS--EDF 441
             IL MP++PD +L RSL ++C ++G+  MGE +GK L+E +     +DE   GS  ED+
Sbjct: 383 QFILAMPIKPDAILLRSLCNACSIYGETVMGEEIGKALLEIE----REDEKLSGSECEDY 442

Query: 442 VALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSV 474
           VALSNV A   +W +VE +R+EMK + I+ + G S V
Sbjct: 443 VALSNVLAHKGKWVEVEKLRKEMKERRIKTRPGYSFV 472

BLAST of Cla97C03G066720 vs. TAIR10
Match: AT1G59720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.4 bits (690), Expect = 2.2e-72
Identity = 169/468 (36.11%), Positives = 248/468 (52.99%), Query Frame = 0

Query: 22  QLKQIHAQLITNGFKFPSPYAKLIAHS---CKKSSPEAIAYAQLIFRHHQYPPSLFLFNT 81
           QLKQ+HA   T    +P   A L  +       SS   + YA  +F   +   S F++NT
Sbjct: 63  QLKQLHA--FTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIENHSS-FMWNT 122

Query: 82  LIR-CA----PPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLSTLMIGRQIHTH 141
           LIR CA      + +   +   +       D  TF FVL ACA     S    G+Q+H  
Sbjct: 123 LIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE---GKQVHCQ 182

Query: 142 ILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQ 201
           I+K G   +++V   +IH Y     + +ARKVFDEM  R+ V+WN+MI         V  
Sbjct: 183 IVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMI------DALVRF 242

Query: 202 RYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTID-SP 261
                +L+LFR M         +P   TM  +LSA + LG    G+  HA++ +  D   
Sbjct: 243 GEYDSALQLFREM-----QRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVDV 302

Query: 262 ENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDA 321
             D+ +   L+ MY KCG +  A  VF+ M++R++ +W AM  G A HGR +EA+   D 
Sbjct: 303 AMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDR 362

Query: 322 M--GTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGR 381
           M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ R
Sbjct: 363 MVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIAR 422

Query: 382 SGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHG-DVQMGERVGKLLVERQGGESSDDEW 441
           +G++ EA D+++ MP++PD V+WRSLL +C   G  V++ E + + ++  +    S +  
Sbjct: 423 AGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGN 482

Query: 442 CVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTG 478
           C G+  +V LS VYASA RW DV  VR+ M   GI  + GCSS++  G
Sbjct: 483 CSGA--YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEING 511

BLAST of Cla97C03G066720 vs. TAIR10
Match: AT5G43790.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 263.5 bits (672), Expect = 2.6e-70
Identity = 167/477 (35.01%), Positives = 258/477 (54.09%), Query Frame = 0

Query: 8   RCIHHLNNIRISSRQLKQIHAQLITNGFKFPS-PYAKLIAHSCKKSSPEAIAYAQLIFRH 67
           RC++ ++  + S + LKQIHAQ+IT G    + P +KL+      SS   ++YA  I R 
Sbjct: 11  RCLNLISKCK-SLQNLKQIHAQIITIGLSHHTYPLSKLL----HLSSTVCLSYALSILR- 70

Query: 68  HQYP-PSLFLFNTLIRCAPPQH-SISTFATWVSTPHFEFDDLTFI----FVLGACARAPS 127
            Q P PS+FL+NTLI      H S  T   +            F+    F   +  +A  
Sbjct: 71  -QIPNPSVFLYNTLISSIVSNHNSTQTHLAFSLYDQILSSRSNFVRPNEFTYPSLFKASG 130

Query: 128 LSTL--MIGRQIHTHILK--RGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVT 187
                   GR +H H+LK    +  + +VQ  ++ FYA    +  AR +F+ +   +  T
Sbjct: 131 FDAQWHRHGRALHAHVLKFLEPVNHDRFVQAALVGFYANCGKLREARSLFERIREPDLAT 190

Query: 188 WNAMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHE 247
           WN ++A Y +           + ++    +L+     +V+P + ++V L+ + + LG   
Sbjct: 191 WNTLLAAYANS----------EEIDSDEEVLLLFMRMQVRPNELSLVALIKSCANLGEFV 250

Query: 248 TGSCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATG 307
            G   H Y+ K  ++   + F+GT L+++YSKCGC++ A  VF +M QR+V  + AM  G
Sbjct: 251 RGVWAHVYVLK--NNLTLNQFVGTSLIDLYSKCGCLSFARKVFDEMSQRDVSCYNAMIRG 310

Query: 308 LAVHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVP 367
           LAVHG G+E +EL  ++ + G+ P++ TF   +SAC H GL++EGL +F+ M+  +G+ P
Sbjct: 311 LAVHGFGQEGIELYKSLISQGLVPDSATFVVTISACSHSGLVDEGLQIFNSMKAVYGIEP 370

Query: 368 QMQHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLL 427
           +++HYGC+VDLLGRSG L EA + I  MPV+P+  LWRS L S   HGD + GE   K L
Sbjct: 371 KVEHYGCLVDLLGRSGRLEEAEECIKKMPVKPNATLWRSFLGSSQTHGDFERGEIALKHL 430

Query: 428 VERQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSV 474
           +   G E  +      S ++V LSN+YA   RW DVE  RE MK   +    G S++
Sbjct: 431 L---GLEFEN------SGNYVLLSNIYAGVNRWTDVEKTRELMKDHRVNKSPGISTL 459

BLAST of Cla97C03G066720 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 259.2 bits (661), Expect = 5.0e-69
Identity = 165/476 (34.66%), Positives = 253/476 (53.15%), Query Frame = 0

Query: 12  HLNNIRISSR---QLKQIHAQLI-TNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHH 71
           HL ++ +SS     L+QIHA L+ T+  +    +   ++       P  I Y+  +F   
Sbjct: 13  HLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVF-SQ 72

Query: 72  QYPPSLFLFNTLIRC----APPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLST 131
           +  P+L   NT+IR       P      F +         + L+  F L  C ++     
Sbjct: 73  RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKS---GD 132

Query: 132 LMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAG 191
           L+ G QIH  I   G +S+  + TT++  Y+  ++   A KVFDE+  R++V+WN + + 
Sbjct: 133 LLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSC 192

Query: 192 YCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHA 251
           Y      +  +  RD L LF  M     +  VKP   T +  L A + LG  + G  VH 
Sbjct: 193 Y------LRNKRTRDVLVLFDKM-KNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHD 252

Query: 252 YIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRG 311
           +I++  +     L +   LV+MYS+CG ++ A  VF  M++RNV++WTA+ +GLA++G G
Sbjct: 253 FIDE--NGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFG 312

Query: 312 KEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYG 371
           KEA+E  + M   G+ P   T T LLSAC H GL+ EG+  F  M   +F + P + HYG
Sbjct: 313 KEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYG 372

Query: 372 CIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGG 431
           C+VDLLGR+  L +AY LI  M ++PD  +WR+LL +C VHGDV++GERV   L+E +  
Sbjct: 373 CVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAE 432

Query: 432 ESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 479
           E+          D+V L N Y++  +W  V  +R  MK K I  K GCS+++  G+
Sbjct: 433 EAG---------DYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGT 466

BLAST of Cla97C03G066720 vs. TAIR10
Match: AT1G34160.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 257.3 bits (656), Expect = 1.9e-68
Identity = 167/472 (35.38%), Positives = 262/472 (55.51%), Query Frame = 0

Query: 18  ISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHHQYPPSLFLFN 77
           +S  Q+KQ+ +  +T G    S     +   C  S    +++A  IFR+   P +   +N
Sbjct: 14  VSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIPKPLTND-WN 73

Query: 78  TLIR----CAPPQHSISTFATWV-----STPHFEFDDLTFIFVLGACARAPSLSTLMIGR 137
            +IR     + P  + S + + +     S+     D LT  F L ACARA   S +    
Sbjct: 74  AIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALCSSAM---D 133

Query: 138 QIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQS 197
           Q+H  I +RG+ ++  + TT++  Y+ N D+  A K+FDEM VR+  +WNA+IAG     
Sbjct: 134 QLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGL---- 193

Query: 198 GKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLG-VHETGSCVHAYIEK 257
             V+   A +++EL++ M  E     ++ ++ T+V  L A S LG V E  +  H Y   
Sbjct: 194 --VSGNRASEAMELYKRMETEG----IRRSEVTVVAALGACSHLGDVKEGENIFHGY--- 253

Query: 258 TIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMK-QRNVLTWTAMATGLAVHGRGKEA 317
                 +++ +    ++MYSKCG ++ A  VF+Q   +++V+TW  M TG AVHG    A
Sbjct: 254 ----SNDNVIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRA 313

Query: 318 LELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVD 377
           LE+ D +  +G+KP+ V++ + L+AC H GL+E GL +F+ M  K GV   M+HYGC+VD
Sbjct: 314 LEIFDKLEDNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACK-GVERNMKHYGCVVD 373

Query: 378 LLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSD 437
           LL R+G LREA+D+I  M + PD VLW+SLL +  ++ DV+M E   + + E   G ++D
Sbjct: 374 LLSRAGRLREAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEM--GVNND 433

Query: 438 DEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 479
                   DFV LSNVYA+  RW DV  VR++M+ K ++   G S ++  G+
Sbjct: 434 G-------DFVLLSNVYAAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGT 454
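
The percentages in each HSP header can be rechecked from the raw counts. The following is a minimal sketch, not part of the database output: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the header layout shown in the alignments above (it also tolerates the "Postives" spelling that some exports use).

```python
import re

# Pattern for HSP header lines such as:
#   Identity = 167/477 (35.01%), Positives = 258/477 (54.09%), Query Frame = 0
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP header line.

    Returns None when the line does not match the assumed layout.
    """
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the header, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 236/457 (51.64%), Positives = 308/457 (67.40%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported values is a quick consistency check when the alignments are copied between files.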

The following BLAST results are available for this feature:
Match Name                        E-value     Identity (%)  Description
XP_011659935.1                    5.6e-252    88.57         PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativu...
XP_008450741.1                    2.0e-249    88.57         PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo]
XP_023515866.1                    6.6e-229    82.85         pentatricopeptide repeat-containing protein At3g18970 [Cucurbita pepo subsp. pep...
XP_022159202.1                    3.9e-221    79.38         pentatricopeptide repeat-containing protein At3g18970 [Momordica charantia]
XP_022988094.1                    7.6e-217    79.50         pentatricopeptide repeat-containing protein At3g18970 [Cucurbita maxima]

Match Name                        E-value     Identity (%)  Description
tr|A0A0A0LZ63|A0A0A0LZ63_CUCSA    3.7e-252    88.57         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G573660 PE=4 SV=1
tr|A0A1S3BPA5|A0A1S3BPA5_CUCME    1.3e-249    88.57         pentatricopeptide repeat-containing protein At3g18970 OS=Cucumis melo OX=3656 GN...
tr|M5WFE9|M5WFE9_PRUPE            7.9e-170    63.37         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G343200 PE=4 SV=1
tr|A0A2P5B3D3|A0A2P5B3D3_PARAD    1.9e-168    64.57         Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_275160 P...
tr|A0A2P6R5Z3|A0A2P6R5Z3_ROSCH    4.3e-168    63.51         Putative pentatricopeptide OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0451131 P...

Match Name                        E-value     Identity (%)  Description
sp|Q9LJ69|PP243_ARATH             2.4e-121    51.64         Pentatricopeptide repeat-containing protein At3g18970 OS=Arabidopsis thaliana OX...
sp|Q0WQW5|PPR85_ARATH             3.9e-71     36.11         Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...
sp|Q9FG85|PP415_ARATH             4.7e-69     35.01         Pentatricopeptide repeat-containing protein At5g43790 OS=Arabidopsis thaliana OX...
sp|Q9SN85|PP267_ARATH             9.0e-68     34.66         Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...
sp|Q9FX24|PPR71_ARATH             3.4e-67     35.38         Pentatricopeptide repeat-containing protein At1g34160 OS=Arabidopsis thaliana OX...

Match Name                        E-value     Identity (%)  Description
AT3G18970.1                       1.3e-122    51.64         mitochondrial editing factor 20
AT1G59720.1                       2.2e-72     36.11         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G43790.1                       2.6e-70     35.01         Pentatricopeptide repeat (PPR) superfamily protein
AT3G47530.1                       5.0e-69     34.66         Pentatricopeptide repeat (PPR) superfamily protein
AT1G34160.1                       1.9e-68     35.38         Tetratricopeptide repeat (TPR)-like superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term          Definition
IPR011990     TPR-like_helical_dom_sf
IPR002885     Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
biological_process GO:0080156 mitochondrial mRNA modification
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cla97C03G066720.1    Cla97C03G066720.1    mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                                    Source   Source Term         Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756           TIGR00756            coord: 289..322; e-value: 2.5E-4; score: 19.0
                                                                                                                coord: 323..356; e-value: 1.6E-6; score: 25.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535             PPR                  coord: 175..186; e-value: 0.13; score: 12.5
                                                                                                                coord: 147..172; e-value: 0.096; score: 12.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041             PPR_2                coord: 286..334; e-value: 2.6E-11; score: 43.4
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 286..320; score: 10.841
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 432..466; score: 6.719
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 173..210; score: 7.52
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 389..419; score: 5.941
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 255..285; score: 7.552
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 357..387; score: 5.766
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 142..172; score: 6.204
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375             PPR                  coord: 321..356; score: 10.128
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10                         coord: 75..256; e-value: 2.6E-19; score: 71.7
                                                                                                                coord: 257..480; e-value: 2.4E-40; score: 140.8
None       No IPR available                                   PANTHER  PTHR24015           FAMILY NOT NAMED     coord: 18..476
None       No IPR available                                   PANTHER  PTHR24015:SF873     SUBFAMILY NOT NAMED  coord: 18..476

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene               Organism                     Block
Cla97C03G066720    Silver-seed gourd            carwmbB0957
Cla97C03G066720    Cucurbita maxima (Rimu)      cmawmbB119
Cla97C03G066720    Cucurbita moschata (Rifu)    cmowmbB106
Cla97C03G066720    Melon (DHL92) v3.6.1         medwmbB235
Cla97C03G066720    Melon (DHL92) v3.5.1         mewmbB249