Cla97C03G053350 (gene) Watermelon (97103) v2

NameCla97C03G053350
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat
LocationCla97Chr03 : 2538749 .. 2539951 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACCTCCTCCTTCGCCTTTGCATTCAGCATTCATGGCGGTTGAATACTCATCATGGTAACCTCAGCCATTTTCTCCCTCTATCTCAAATCCAAAGCTATCCAGCTCCCAACCTCTCTTTCACGAAATTTTGTTTGAATTTCTACTCCAATAGGGCACCTTCAAGATCCTTCAGGAGGAGAGCGAGTAAGAGGTTGAAATCCAGCCTCAAACCTACACTGGACGAAACCCAATTTCAGTTGGCAGTTTCTCAAATCCCTCCAAGGTTTACCTCCGAAGAACTTTGTAACGTCATTTCTCTCCAGGGAAATCCTTTGGTGTGTTTTGAGCTGTTCAACTGGGCTTCACAACAGCCTCGTTTCAGACATGATGTTTCCACTTATCAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTATGAAGAAATGGATCATGTTGTGAATCAGGTGCTTGCTGTTCCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATCTATTTTTTCACTGAGGCAAGGAAGTTGACTAGAGCTATCAATATATTTAAGCATATGCAGAACAACAGAAACTTGAACTGTAGGCCTTCAATTAGAACATATAATCTACTTTTTACTGCATTCTTGAGTCGTGGTCGTAATGCTTATATAAATCACATGTATATGGAGACTATTAGATGTCTCTTCAGACAGATGGTGAATGATGATGGGATTGAACCTGACATATTTACTTTGAACTGTATGATAAAAGGGTATGTGCTTTCCCTGCATATTAATGATGCTCTTAGGATCTTTCACCAAATGGGTGTTGTGTATAGCTGCTTACCAAATTCATTTTCCTATGACTATTTGATTCACGGGTTATGCGCCCAAGGGCGAACAGATAATGCAAGGGAGTTGTGCAAGGAAATGAAGGAAAAGGGGTTTATGCCTAGTAGTATATCATATAATTCAATTGTGAATGCTATGGCTCTCAATGGAGAGGTTGAAGAAGCAGTGAATTATTTGTGGGAGATGATTGATAGTAGAAGATCTCCTGATTTTATTACATATAGGACTGTATTGGATGAGCTGTGTAGACAAGGGAGGGTCGGAGAAGCGACGAGTTTGTTGAGGGAATTGCAAGAGAAGGATCTTGTGGATGGTCATACATATAGGAAACTTCTCTATGTGCTTGAAGATGATTATGGAAATCTAAATTGA

mRNA sequence

ATGAACCTCCTCCTTCGCCTTTGCATTCAGCATTCATGGCGGTTGAATACTCATCATGGTAACCTCAGCCATTTTCTCCCTCTATCTCAAATCCAAAGCTATCCAGCTCCCAACCTCTCTTTCACGAAATTTTGTTTGAATTTCTACTCCAATAGGGCACCTTCAAGATCCTTCAGGAGGAGAGCGAGTAAGAGGTTGAAATCCAGCCTCAAACCTACACTGGACGAAACCCAATTTCAGTTGGCAGTTTCTCAAATCCCTCCAAGGTTTACCTCCGAAGAACTTTGTAACGTCATTTCTCTCCAGGGAAATCCTTTGGTGTGTTTTGAGCTGTTCAACTGGGCTTCACAACAGCCTCGTTTCAGACATGATGTTTCCACTTATCAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTATGAAGAAATGGATCATGTTGTGAATCAGGTGCTTGCTGTTCCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATCTATTTTTTCACTGAGGCAAGGAAGTTGACTAGAGCTATCAATATATTTAAGCATATGCAGAACAACAGAAACTTGAACTGTAGGCCTTCAATTAGAACATATAATCTACTTTTTACTGCATTCTTGAGTCGTGGTCGTAATGCTTATATAAATCACATGTATATGGAGACTATTAGATGTCTCTTCAGACAGATGGTGAATGATGATGGGATTGAACCTGACATATTTACTTTGAACTGTATGATAAAAGGGTATGTGCTTTCCCTGCATATTAATGATGCTCTTAGGATCTTTCACCAAATGGGTGTTGTGTATAGCTGCTTACCAAATTCATTTTCCTATGACTATTTGATTCACGGGTTATGCGCCCAAGGGCGAACAGATAATGCAAGGGAGTTGTGCAAGGAAATGAAGGAAAAGGGGTTTATGCCTAGTAGTATATCATATAATTCAATTGTGAATGCTATGGCTCTCAATGGAGAGGTTGAAGAAGCAGTGAATTATTTGTGGGAGATGATTGATAGTAGAAGATCTCCTGATTTTATTACATATAGGACTGTATTGGATGAGCTGTGTAGACAAGGGAGGGTCGGAGAAGCGACGAGTTTGTTGAGGGAATTGCAAGAGAAGGATCTTGTGGATGGTCATACATATAGGAAACTTCTCTATGTGCTTGAAGATGATTATGGAAATCTAAATTGA

Coding sequence (CDS)

ATGAACCTCCTCCTTCGCCTTTGCATTCAGCATTCATGGCGGTTGAATACTCATCATGGTAACCTCAGCCATTTTCTCCCTCTATCTCAAATCCAAAGCTATCCAGCTCCCAACCTCTCTTTCACGAAATTTTGTTTGAATTTCTACTCCAATAGGGCACCTTCAAGATCCTTCAGGAGGAGAGCGAGTAAGAGGTTGAAATCCAGCCTCAAACCTACACTGGACGAAACCCAATTTCAGTTGGCAGTTTCTCAAATCCCTCCAAGGTTTACCTCCGAAGAACTTTGTAACGTCATTTCTCTCCAGGGAAATCCTTTGGTGTGTTTTGAGCTGTTCAACTGGGCTTCACAACAGCCTCGTTTCAGACATGATGTTTCCACTTATCAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTATGAAGAAATGGATCATGTTGTGAATCAGGTGCTTGCTGTTCCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATCTATTTTTTCACTGAGGCAAGGAAGTTGACTAGAGCTATCAATATATTTAAGCATATGCAGAACAACAGAAACTTGAACTGTAGGCCTTCAATTAGAACATATAATCTACTTTTTACTGCATTCTTGAGTCGTGGTCGTAATGCTTATATAAATCACATGTATATGGAGACTATTAGATGTCTCTTCAGACAGATGGTGAATGATGATGGGATTGAACCTGACATATTTACTTTGAACTGTATGATAAAAGGGTATGTGCTTTCCCTGCATATTAATGATGCTCTTAGGATCTTTCACCAAATGGGTGTTGTGTATAGCTGCTTACCAAATTCATTTTCCTATGACTATTTGATTCACGGGTTATGCGCCCAAGGGCGAACAGATAATGCAAGGGAGTTGTGCAAGGAAATGAAGGAAAAGGGGTTTATGCCTAGTAGTATATCATATAATTCAATTGTGAATGCTATGGCTCTCAATGGAGAGGTTGAAGAAGCAGTGAATTATTTGTGGGAGATGATTGATAGTAGAAGATCTCCTGATTTTATTACATATAGGACTGTATTGGATGAGCTGTGTAGACAAGGGAGGGTCGGAGAAGCGACGAGTTTGTTGAGGGAATTGCAAGAGAAGGATCTTGTGGATGGTCATACATATAGGAAACTTCTCTATGTGCTTGAAGATGATTATGGAAATCTAAATTGA

Protein sequence

MNLLLRLCIQHSWRLNTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGFMPSSISYNSIVNAMALNGEVEEAVNYLWEMIDSRRSPDFITYRTVLDELCRQGRVGEATSLLRELQEKDLVDGHTYRKLLYVLEDDYGNLN
BLAST of Cla97C03G053350 vs. NCBI nr
Match: XP_008463631.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo] >XP_008463637.1 PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo] >XP_008463644.1 PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo])

HSP 1 Score: 575.5 bits (1482), Expect = 1.4e-160
Identity = 359/391 (91.82%), Postives = 372/391 (95.14%), Query Frame = 0

Query: 11  HSWRL-NTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSS 70
           HS  L NTHH NL+HFL +SQIQ+YPAPNLSFTKFCLNFYS  APSRSFRRRA+KRLK+S
Sbjct: 38  HSLSLNNTHHCNLTHFLRVSQIQTYPAPNLSFTKFCLNFYSKTAPSRSFRRRANKRLKAS 97

Query: 71  LKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQ 130
           LKPTLDE QFQLAVS+IPPRFT EEL NVISLQ +PLVCFELFNWASQQPRF+HDVS+Y+
Sbjct: 98  LKPTLDEAQFQLAVSKIPPRFTPEELRNVISLQKDPLVCFELFNWASQQPRFKHDVSSYE 157

Query: 131 ITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN 190
           ITIKKLGEAKMYEEMDHVVNQ LAV SIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN
Sbjct: 158 ITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN 217

Query: 191 RNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMI 250
           RNLNCRPSIRTYNLLFTAFLSRGRN YINH+YMETIRCLFRQMVNDDGIEPDIF LNCMI
Sbjct: 218 RNLNCRPSIRTYNLLFTAFLSRGRNTYINHVYMETIRCLFRQMVNDDGIEPDIFALNCMI 277

Query: 251 KGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGX 310
           KGYVLSLH+NDALRIFHQMGVVYSCLPNS+SYDYLIHGL AQ RTDNARELC EMKEKG 
Sbjct: 278 KGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYLIHGLSAQARTDNARELCNEMKEKGF 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN
Sbjct: 398 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 428

BLAST of Cla97C03G053350 vs. NCBI nr
Match: XP_022970418.1 (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima] >XP_022970419.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima] >XP_022970420.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima] >XP_022970421.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 573.5 bits (1477), Expect = 5.4e-160
Identity = 358/390 (91.79%), Postives = 370/390 (94.87%), Query Frame = 0

Query: 11  HSWRLNTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSSL 70
           HS  LNT HGN SH L LSQIQ+YPAP LSFTKFCLNFYSNRAPSRSFRRRASKRLKS +
Sbjct: 38  HSLLLNTDHGNFSHILRLSQIQNYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRI 97

Query: 71  KPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQI 130
           KP L+E QFQ A+SQIPPRF SEEL NVIS+QG+PLVCFELFNWASQQ RFRHDVSTY+I
Sbjct: 98  KPKLNEAQFQQAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEI 157

Query: 131 TIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNR 190
           TIKKLGEAKMYEEMD+VVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNR
Sbjct: 158 TIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNR 217

Query: 191 NLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIK 250
           NLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVN DGIEPDIFTLNCMIK
Sbjct: 218 NLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVN-DGIEPDIFTLNCMIK 277

Query: 251 GYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGXX 310
           GYVLSLH+NDALR+FHQMGVVYS LPNSFSYDYLIHGLCAQ RTDNARELC EMKEKG X
Sbjct: 278 GYVLSLHVNDALRVFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFX 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXX QEK+LVDGHTYRKLLYVLEDDYGN+N
Sbjct: 398 XXXVQEKELVDGHTYRKLLYVLEDDYGNVN 426

BLAST of Cla97C03G053350 vs. NCBI nr
Match: XP_004147297.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis sativus] >KGN64750.1 hypothetical protein Csa_1G086930 [Cucumis sativus])

HSP 1 Score: 570.9 bits (1470), Expect = 3.5e-159
Identity = 355/391 (90.79%), Postives = 368/391 (94.12%), Query Frame = 0

Query: 11  HSWRL-NTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSS 70
           HSW L NTHH NL HFL +SQI  Y  PNLSFT F L FYS  APSRSFR+RA+KRLKSS
Sbjct: 38  HSWWLNNTHHYNLPHFLRVSQIHPYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSS 97

Query: 71  LKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQ 130
           LKP LDETQFQLAVS+IPPRFTSEELCNVISLQ +PLVCFELFNWASQQPRFRHD S+Y+
Sbjct: 98  LKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYE 157

Query: 131 ITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN 190
           ITIKKLGEAKMYEEMDHVVNQ LAV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNN
Sbjct: 158 ITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNN 217

Query: 191 RNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMI 250
           RNLNCRPSIRTYNLLFTAFLSRGRN YINHMYMETIRCLFRQMVNDDGIEPDIF+LNCMI
Sbjct: 218 RNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMI 277

Query: 251 KGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGX 310
           KGYVLSLH+NDALRIFHQMGVVYSCLPNS+S+DYLIHGLCAQ RTDNA+ELC EMKEKG 
Sbjct: 278 KGYVLSLHVNDALRIFHQMGVVYSCLPNSYSFDYLIHGLCAQARTDNAKELCNEMKEKGF 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN
Sbjct: 398 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 428

BLAST of Cla97C03G053350 vs. NCBI nr
Match: XP_022965006.1 (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 570.5 bits (1469), Expect = 4.6e-159
Identity = 357/390 (91.54%), Postives = 369/390 (94.62%), Query Frame = 0

Query: 11  HSWRLNTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSSL 70
           HSW LNT HGN SH L LSQIQ+YPAP LSFTKFCLNFYSN APSRSFRRRASKRLKS +
Sbjct: 38  HSWFLNTDHGNFSHILRLSQIQNYPAPVLSFTKFCLNFYSNSAPSRSFRRRASKRLKSRI 97

Query: 71  KPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQI 130
           KP L+E QFQ A+SQIPPRF SEEL NVIS+QG+PLVCFELFNWASQQ RFRHDVSTY+I
Sbjct: 98  KPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEI 157

Query: 131 TIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNR 190
           TIKKLGEAKMYEEMD+VVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN+
Sbjct: 158 TIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNK 217

Query: 191 NLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIK 250
           NLNCRPSIRTYNLLFTAFLSRGRNAYIN MYMETIRCLFRQMVN DGIEPDIFTLNCMIK
Sbjct: 218 NLNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIK 277

Query: 251 GYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGXX 310
           GYVLSLH+NDALRIFHQMGVVYS LPNSFSYDYLIHGLCAQ RTDNARELC EMKEKG X
Sbjct: 278 GYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFX 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXX QEK+LVDGHTYRKLLYVLEDDYGN+N
Sbjct: 398 XXXVQEKELVDGHTYRKLLYVLEDDYGNVN 426

BLAST of Cla97C03G053350 vs. NCBI nr
Match: XP_023519586.1 (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita pepo subsp. pepo] >XP_023519587.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita pepo subsp. pepo] >XP_023519588.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 568.5 bits (1464), Expect = 1.7e-158
Identity = 355/390 (91.03%), Postives = 369/390 (94.62%), Query Frame = 0

Query: 11  HSWRLNTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSSL 70
           HSW LNT HGN SH   LSQIQ+YPAP LSFTKFCLNFYSNRAPSRSFRRRASKRLKS +
Sbjct: 38  HSWLLNTDHGNFSHIFRLSQIQNYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRI 97

Query: 71  KPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQI 130
           KP L+E QFQ A+SQIPPRF SEEL NVIS+QG+PLVCFELFNWASQQ RFRHDVSTY+I
Sbjct: 98  KPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEI 157

Query: 131 TIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNR 190
           TIKKLGEAKMYEEMD+VVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN+
Sbjct: 158 TIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNK 217

Query: 191 NLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIK 250
           N+NCRPSIRTYNLLFTAFLSRGRNAYIN MYMETIRCLFRQMVN DGIEPDIFTLNCMIK
Sbjct: 218 NVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIK 277

Query: 251 GYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGXX 310
           GYVLSLH+NDALRIFHQMGVVYS LPNSFSYDYLIHGLCAQ RTDNARELC EMKEKG X
Sbjct: 278 GYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFX 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXX QEK+LVDGHTYRKLLYVLEDDYGN++
Sbjct: 398 XXXVQEKELVDGHTYRKLLYVLEDDYGNVH 426

BLAST of Cla97C03G053350 vs. TrEMBL
Match: tr|A0A1S3CJQ6|A0A1S3CJQ6_CUCME (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501729 PE=4 SV=1)

HSP 1 Score: 575.5 bits (1482), Expect = 9.4e-161
Identity = 359/391 (91.82%), Postives = 372/391 (95.14%), Query Frame = 0

Query: 11  HSWRL-NTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSS 70
           HS  L NTHH NL+HFL +SQIQ+YPAPNLSFTKFCLNFYS  APSRSFRRRA+KRLK+S
Sbjct: 38  HSLSLNNTHHCNLTHFLRVSQIQTYPAPNLSFTKFCLNFYSKTAPSRSFRRRANKRLKAS 97

Query: 71  LKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQ 130
           LKPTLDE QFQLAVS+IPPRFT EEL NVISLQ +PLVCFELFNWASQQPRF+HDVS+Y+
Sbjct: 98  LKPTLDEAQFQLAVSKIPPRFTPEELRNVISLQKDPLVCFELFNWASQQPRFKHDVSSYE 157

Query: 131 ITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN 190
           ITIKKLGEAKMYEEMDHVVNQ LAV SIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN
Sbjct: 158 ITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN 217

Query: 191 RNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMI 250
           RNLNCRPSIRTYNLLFTAFLSRGRN YINH+YMETIRCLFRQMVNDDGIEPDIF LNCMI
Sbjct: 218 RNLNCRPSIRTYNLLFTAFLSRGRNTYINHVYMETIRCLFRQMVNDDGIEPDIFALNCMI 277

Query: 251 KGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGX 310
           KGYVLSLH+NDALRIFHQMGVVYSCLPNS+SYDYLIHGL AQ RTDNARELC EMKEKG 
Sbjct: 278 KGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYLIHGLSAQARTDNARELCNEMKEKGF 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN
Sbjct: 398 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 428

BLAST of Cla97C03G053350 vs. TrEMBL
Match: tr|A0A0A0LS50|A0A0A0LS50_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G086930 PE=4 SV=1)

HSP 1 Score: 570.9 bits (1470), Expect = 2.3e-159
Identity = 355/391 (90.79%), Postives = 368/391 (94.12%), Query Frame = 0

Query: 11  HSWRL-NTHHGNLSHFLPLSQIQSYPAPNLSFTKFCLNFYSNRAPSRSFRRRASKRLKSS 70
           HSW L NTHH NL HFL +SQI  Y  PNLSFT F L FYS  APSRSFR+RA+KRLKSS
Sbjct: 38  HSWWLNNTHHYNLPHFLRVSQIHPYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSS 97

Query: 71  LKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQ 130
           LKP LDETQFQLAVS+IPPRFTSEELCNVISLQ +PLVCFELFNWASQQPRFRHD S+Y+
Sbjct: 98  LKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYE 157

Query: 131 ITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN 190
           ITIKKLGEAKMYEEMDHVVNQ LAV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNN
Sbjct: 158 ITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNN 217

Query: 191 RNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMI 250
           RNLNCRPSIRTYNLLFTAFLSRGRN YINHMYMETIRCLFRQMVNDDGIEPDIF+LNCMI
Sbjct: 218 RNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMI 277

Query: 251 KGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGX 310
           KGYVLSLH+NDALRIFHQMGVVYSCLPNS+S+DYLIHGLCAQ RTDNA+ELC EMKEKG 
Sbjct: 278 KGYVLSLHVNDALRIFHQMGVVYSCLPNSYSFDYLIHGLCAQARTDNAKELCNEMKEKGF 337

Query: 311 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 338 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 397

Query: 371 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 401
           XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN
Sbjct: 398 XXXXXQEKDLVDGHTYRKLLYVLEDDYGNLN 428

BLAST of Cla97C03G053350 vs. TrEMBL
Match: tr|A0A2N9FRL4|A0A2N9FRL4_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS17745 PE=4 SV=1)

HSP 1 Score: 456.8 bits (1174), Expect = 4.9e-125
Identity = 303/374 (81.02%), Postives = 332/374 (88.77%), Query Frame = 0

Query: 29  SQIQSYPAPN-LSFTKFCL---NFYSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVS 88
           +Q Q  P P  LS    CL   + YS +APSRSFRRR SKR K + KP L+E QFQ AVS
Sbjct: 5   NQTQVAPRPQLLSNEILCLSLNSLYSTKAPSRSFRRRESKRSKLNAKPRLNEAQFQRAVS 64

Query: 89  QIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEM 148
           Q+ PRFT+EELCNVI+LQ +P+VCFELFNWASQQ RFRHDV TY ITIKKLG AKMYEEM
Sbjct: 65  QLTPRFTAEELCNVITLQEDPIVCFELFNWASQQHRFRHDVCTYHITIKKLGAAKMYEEM 124

Query: 149 DHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLL 208
           D+VVNQVLAVP IGSE LYNT+IYF TEARKLTRA+N+FKHM+N+RNL+CRPSIRTYNLL
Sbjct: 125 DNVVNQVLAVPYIGSEALYNTVIYFSTEARKLTRAVNVFKHMRNSRNLDCRPSIRTYNLL 184

Query: 209 FTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRI 268
           F AFLSRG N+YINHMYMETIRCLFRQMVN DGI+PDIF+LN MIKGYVLSLH+NDALR+
Sbjct: 185 FAAFLSRGNNSYINHMYMETIRCLFRQMVN-DGIQPDIFSLNSMIKGYVLSLHVNDALRV 244

Query: 269 FHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGXXXXXXXXXXXXXXXX 328
           FHQM VVY CLPNSFSYDYLIHGLCAQGRT+NA+ELC +MK+KG XXXXXXXXXXXXXXX
Sbjct: 245 FHQMDVVYKCLPNSFSYDYLIHGLCAQGRTNNAKELCDQMKQKGFXXXXXXXXXXXXXXX 304

Query: 329 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHT 388
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   D+VDGHT
Sbjct: 305 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDIVDGHT 364

Query: 389 YRKLLYVLEDDYGN 399
           Y+KLLYVLEDD+GN
Sbjct: 365 YKKLLYVLEDDFGN 377

BLAST of Cla97C03G053350 vs. TrEMBL
Match: tr|A0A061GUJ4|A0A061GUJ4_THECC (Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao OX=3641 GN=TCM_041079 PE=4 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 6.4e-125
Identity = 302/379 (79.68%), Postives = 336/379 (88.65%), Query Frame = 0

Query: 26  LPLSQIQ--SYPAPNLSFTKFCLN----FYSNRAPSRSFRRRASKRLKSSLKPTLDETQF 85
           L L+QI   S  +P  +   FC N    FYS RAPSRSFRRR +KRLK+S KP LD+ +F
Sbjct: 52  LGLTQIDPLSVISPTANLHPFCYNSFTCFYSTRAPSRSFRRRINKRLKASSKPVLDQPKF 111

Query: 86  QLAVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAK 145
           + AVSQ+ PRFT+EELCNVI+L+ +PLVC+ELFNWA QQPRFRHDVSTY ITIKKLG AK
Sbjct: 112 EKAVSQLLPRFTAEELCNVITLEEDPLVCWELFNWAVQQPRFRHDVSTYHITIKKLGVAK 171

Query: 146 MYEEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIR 205
           MYEEMD VVNQVLA+ + GSE LYNT+IYFFTEARKLTRA+NIFKHM+NNR L+CRPSIR
Sbjct: 172 MYEEMDVVVNQVLALRTFGSEPLYNTIIYFFTEARKLTRAVNIFKHMRNNRKLDCRPSIR 231

Query: 206 TYNLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHIN 265
           TYN+LFTA LSRGR++YINHMYMETIRCLFRQMVN DGIEPD+F+LN MIKGYVLSLH+N
Sbjct: 232 TYNILFTAMLSRGRDSYINHMYMETIRCLFRQMVN-DGIEPDVFSLNSMIKGYVLSLHVN 291

Query: 266 DALRIFHQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGXXXXXXXXXXX 325
           DALR+FHQMGVVY CLPNS+SYD+LI+GLCAQGRT+NARELC EMK+ G XXXXXXXXXX
Sbjct: 292 DALRVFHQMGVVYKCLPNSYSYDFLIYGLCAQGRTNNARELCNEMKKNGFXXXXXXXXXX 351

Query: 326 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDL 385
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   DL
Sbjct: 352 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDL 411

Query: 386 VDGHTYRKLLYVLEDDYGN 399
           VDGHTYRKLLY +EDD+GN
Sbjct: 412 VDGHTYRKLLYAMEDDFGN 429

BLAST of Cla97C03G053350 vs. TrEMBL
Match: tr|A0A2P4KFP1|A0A2P4KFP1_QUESU (Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=58331 GN=CFP56_48489 PE=4 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 2.7e-123
Identity = 300/373 (80.43%), Postives = 328/373 (87.94%), Query Frame = 0

Query: 30  QIQSYPAPNL-SFTKFCL---NFYSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQ 89
           + Q  P P L S    CL   + YS RAPSRSFRRR SKR K + KPTLDE QFQ AVSQ
Sbjct: 86  ETQVSPCPQLPSKQILCLSLYSLYSTRAPSRSFRRRESKRSKLNSKPTLDEAQFQRAVSQ 145

Query: 90  IPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMD 149
           +PPRFT+EELCNVI+   +P+VC ELFNWASQQ RFRHDV TY +TIKKLG A+MYEEMD
Sbjct: 146 LPPRFTAEELCNVITFIEDPIVCLELFNWASQQHRFRHDVCTYHVTIKKLGAARMYEEMD 205

Query: 150 HVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLF 209
            VVNQVLAV  +GSE LYNT+IYFFTEAR+LTRA+NIF HM+N+RNL+CRPSIRTYNLLF
Sbjct: 206 DVVNQVLAVQCVGSEALYNTIIYFFTEARRLTRAVNIFNHMRNSRNLDCRPSIRTYNLLF 265

Query: 210 TAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIF 269
            AFLSRG N+YINHMYMETIRCLFR+MVN DGI+PDIFTLN MIKGYVLSLH+NDALRIF
Sbjct: 266 AAFLSRGNNSYINHMYMETIRCLFRRMVN-DGIQPDIFTLNSMIKGYVLSLHVNDALRIF 325

Query: 270 HQMGVVYSCLPNSFSYDYLIHGLCAQGRTDNARELCKEMKEKGXXXXXXXXXXXXXXXXX 329
           HQM VVY CLPNSFSYDYLIHGLCAQGRT+NA++LC EMK+KG XXXXXXXXXXXXXXXX
Sbjct: 326 HQMDVVYKCLPNSFSYDYLIHGLCAQGRTNNAKQLCDEMKQKGFXXXXXXXXXXXXXXXX 385

Query: 330 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTY 389
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   D+VDG+TY
Sbjct: 386 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDIVDGYTY 445

Query: 390 RKLLYVLEDDYGN 399
           RKLLYVLEDD+GN
Sbjct: 446 RKLLYVLEDDFGN 457

BLAST of Cla97C03G053350 vs. Swiss-Prot
Match: sp|Q9ZUY1|PP173_ARATH (Pentatricopeptide repeat-containing protein At2g27800, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g27800 PE=3 SV=2)

HSP 1 Score: 365.9 bits (938), Expect = 5.6e-100
Identity = 172/260 (66.15%), Postives = 211/260 (81.15%), Query Frame = 0

Query: 49  YSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVC 108
           YS   P+RS RRR S R KSS KP L+ ++F   +S++PPRFT EEL + I+L+ +P +C
Sbjct: 96  YSTSVPTRSLRRRISNRKKSSAKPILNVSKFHETISKLPPRFTPEELADAITLEEDPFLC 155

Query: 109 FELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIY 168
           F LFNWASQQPRF H+  +Y I I+KLG AKMY+EMD +VNQVL+V  IG+E LYN++I+
Sbjct: 156 FHLFNWASQQPRFTHENCSYHIAIRKLGAAKMYQEMDDIVNQVLSVRHIGNENLYNSIIF 215

Query: 169 FFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCL 228
           +FT+A KL RA+NIF+HM  ++NL CRP+IRTY++LF A L RG N+YINH+YMET+R L
Sbjct: 216 YFTKAGKLIRAVNIFRHMVTSKNLECRPTIRTYHILFKALLGRGNNSYINHVYMETVRSL 275

Query: 229 FRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGL 288
           FRQMV D GIEPD+F LNC++KGYVLSLH+NDALRIFHQM VVY C PNSF+YDYLIHGL
Sbjct: 276 FRQMV-DSGIEPDVFALNCLVKGYVLSLHVNDALRIFHQMSVVYDCEPNSFTYDYLIHGL 335

Query: 289 CAQGRTDNARELCKEMKEKG 309
           CAQGRT NAREL  EMK KG
Sbjct: 336 CAQGRTINARELLSEMKGKG 354

BLAST of Cla97C03G053350 vs. Swiss-Prot
Match: sp|Q9LSF5|PP254_ARATH (Pentatricopeptide repeat-containing protein At3g25210, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g25210 PE=2 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 3.4e-20
Identity = 64/228 (28.07%), Postives = 116/228 (50.88%), Query Frame = 0

Query: 38  NLSFTKFCLNFYSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQIPPRFTSEELCN 97
           +LSF+    +  S+ +PSR   R            T  ETQF+  +  + P FT+ ++  
Sbjct: 33  SLSFSSVSSSPESHTSPSRIRTR------------TPLETQFETWIQNLKPGFTNSDVVI 92

Query: 98  VISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSI 157
            +  Q +P +  ++F W +QQ  ++H+   Y   IK+    K    ++ ++ +V+A    
Sbjct: 93  ALRAQSDPDLALDIFRWTAQQRGYKHNHEAYHTMIKQAITGKRNNFVETLIEEVIAGACE 152

Query: 158 GSETLYNTMIYFFTEARKL-TRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAY 217
            S  LYN +I F    + L  RA +++  M   R+ + +P + TY LL ++ L R     
Sbjct: 153 MSVPLYNCIIRFCCGRKFLFNRAFDVYNKML--RSDDSKPDLETYTLLLSSLLKRFNKLN 212

Query: 218 INHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRI 265
           + ++Y+  +R L +QM   +G+ PD F LN +IK Y   L +++A+R+
Sbjct: 213 VCYVYLHAVRSLTKQM-KSNGVIPDTFVLNMIIKAYAKCLEVDEAIRV 245

BLAST of Cla97C03G053350 vs. Swiss-Prot
Match: sp|Q9LW84|PP236_ARATH (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX=3702 GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 73.9 bits (180), Expect = 4.4e-12
Identity = 41/163 (25.15%), Postives = 81/163 (49.69%), Query Frame = 0

Query: 107 VCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSIG-SETLYNT 166
           V  + F WA ++  F+HD STY   I+ L EA++Y EM   + +V+    +  S  + + 
Sbjct: 108 VKIQFFKWAGKRRNFQHDCSTYMTLIRCLEEARLYGEMYRTIQEVVRNTYVSVSPAVLSE 167

Query: 167 MIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETI 226
           ++     A+ +++A+++F   +  +   C+P+  TYN +    +  G++  ++ +Y E  
Sbjct: 168 LVKALGRAKMVSKALSVFYQAKGRK---CKPTSSTYNSVILMLMQEGQHEKVHEVYTE-- 227

Query: 227 RCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIFHQM 269
                 M N+    PD  T + +I  Y      + A+R+F +M
Sbjct: 228 ------MCNEGDCFPDTITYSALISSYEKLGRNDSAIRLFDEM 259

BLAST of Cla97C03G053350 vs. Swiss-Prot
Match: sp|Q9MAG8|PPR79_ARATH (Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis thaliana OX=3702 GN=At1g53330 PE=3 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 6.0e-09
Identity = 50/184 (27.17%), Postives = 91/184 (49.46%), Query Frame = 0

Query: 91  TSEELCNVISLQGNPLVCFELF---NWASQQPR--FRHDVSTYQITIKKLGEAKMYEEMD 150
           +S  L +++  + +P    +LF   +  S  P+  FR+ +  Y I I KLG +KM++E+D
Sbjct: 8   SSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGSKMFDELD 67

Query: 151 HVVNQVLA-VPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLL 210
            V+  +      + +E ++  +I FF   +  +RA+++F  M   R   C+ ++++ N L
Sbjct: 68  QVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYR---CQRTVKSLNSL 127

Query: 211 FTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRI 269
            +A L  G          E  +   R    D+  +PD  T N +I G   S   +DAL++
Sbjct: 128 LSALLKCG----------ELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKL 178

BLAST of Cla97C03G053350 vs. Swiss-Prot
Match: sp|Q9LK57|PP226_ARATH (Pentatricopeptide repeat-containing protein At3g13160, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g13160 PE=1 SV=1)

HSP 1 Score: 62.8 bits (151), Expect = 1.0e-08
Identity = 39/170 (22.94%), Postives = 78/170 (45.88%), Query Frame = 0

Query: 82  AVSQIPPRFTSEELCNVISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMY 141
           A +  PP+     L  +++ + +P    E F  A Q   FR +++ Y+ T+++L  AK +
Sbjct: 31  AATPSPPK---PSLITLVNDERDPKFITEKFKKACQAEWFRKNIAVYERTVRRLAAAKKF 90

Query: 142 EEMDHVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTY 201
           E ++ ++ +    P++  E     +I  +        A  +F  M      NC+ +  ++
Sbjct: 91  EWVEEILEEQNKYPNMSKEGFVARIINLYGRVGMFENAQKVFDEMPER---NCKRTALSF 150

Query: 202 NLLFTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKG 252
           N L         NA +N    + +  +F+++     IEPD+ + N +IKG
Sbjct: 151 NALL--------NACVNSKKFDLVEGIFKELPGKLSIEPDVASYNTLIKG 186

BLAST of Cla97C03G053350 vs. TAIR10
Match: AT2G27800.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 365.9 bits (938), Expect = 3.1e-101
Identity = 172/260 (66.15%), Postives = 211/260 (81.15%), Query Frame = 0

Query: 49  YSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVC 108
           YS   P+RS RRR S R KSS KP L+ ++F   +S++PPRFT EEL + I+L+ +P +C
Sbjct: 96  YSTSVPTRSLRRRISNRKKSSAKPILNVSKFHETISKLPPRFTPEELADAITLEEDPFLC 155

Query: 109 FELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIY 168
           F LFNWASQQPRF H+  +Y I I+KLG AKMY+EMD +VNQVL+V  IG+E LYN++I+
Sbjct: 156 FHLFNWASQQPRFTHENCSYHIAIRKLGAAKMYQEMDDIVNQVLSVRHIGNENLYNSIIF 215

Query: 169 FFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCL 228
           +FT+A KL RA+NIF+HM  ++NL CRP+IRTY++LF A L RG N+YINH+YMET+R L
Sbjct: 216 YFTKAGKLIRAVNIFRHMVTSKNLECRPTIRTYHILFKALLGRGNNSYINHVYMETVRSL 275

Query: 229 FRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGL 288
           FRQMV D GIEPD+F LNC++KGYVLSLH+NDALRIFHQM VVY C PNSF+YDYLIHGL
Sbjct: 276 FRQMV-DSGIEPDVFALNCLVKGYVLSLHVNDALRIFHQMSVVYDCEPNSFTYDYLIHGL 335

Query: 289 CAQGRTDNARELCKEMKEKG 309
           CAQGRT NAREL  EMK KG
Sbjct: 336 CAQGRTINARELLSEMKGKG 354

BLAST of Cla97C03G053350 vs. TAIR10
Match: AT5G27300.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 220.3 bits (560), Expect = 2.1e-57
Identity = 116/249 (46.59%), Postives = 154/249 (61.85%), Query Frame = 0

Query: 49  YSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQIPPRFTSEELCNVISLQGNPLVC 108
           YS   P+RS RRR S R KSS KP L+E++FQ  +S++PPRFT EEL + I+L+ +P +C
Sbjct: 89  YSTSVPTRSLRRRISSRKKSSTKPILNESKFQETISKLPPRFTPEELADAITLEEDPFLC 148

Query: 109 FELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSIGSETLYNTMIY 168
           F LFNWASQQPRF H+  +Y I I+KLG AK                             
Sbjct: 149 FHLFNWASQQPRFTHENCSYHIAIRKLGAAK----------------------------- 208

Query: 169 FFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETIRCL 228
               + KL RA+NIF+HM N+RNL CRP++RTY++LF A L RG N++INH+YMET+R L
Sbjct: 209 ----SGKLIRAVNIFRHMVNSRNLECRPTMRTYHILFKALLGRGNNSFINHLYMETVRSL 268

Query: 229 FRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIFHQMGVVYSCLPNSFSYDYLIHGL 288
           FRQMV D GIEPD+F LNC++KG   +++  + L      G V    PN  SY+ L++  
Sbjct: 269 FRQMV-DSGIEPDVFALNCLVKG--RTINTRELLSEMKGKGFV----PNGKSYNSLVNAF 297

Query: 289 CAQGRTDNA 298
              G  D+A
Sbjct: 329 ALSGEIDDA 297

BLAST of Cla97C03G053350 vs. TAIR10
Match: AT3G25210.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 100.9 bits (250), Expect = 1.9e-21
Identity = 64/228 (28.07%), Postives = 116/228 (50.88%), Query Frame = 0

Query: 38  NLSFTKFCLNFYSNRAPSRSFRRRASKRLKSSLKPTLDETQFQLAVSQIPPRFTSEELCN 97
           +LSF+    +  S+ +PSR   R            T  ETQF+  +  + P FT+ ++  
Sbjct: 33  SLSFSSVSSSPESHTSPSRIRTR------------TPLETQFETWIQNLKPGFTNSDVVI 92

Query: 98  VISLQGNPLVCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSI 157
            +  Q +P +  ++F W +QQ  ++H+   Y   IK+    K    ++ ++ +V+A    
Sbjct: 93  ALRAQSDPDLALDIFRWTAQQRGYKHNHEAYHTMIKQAITGKRNNFVETLIEEVIAGACE 152

Query: 158 GSETLYNTMIYFFTEARKL-TRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAY 217
            S  LYN +I F    + L  RA +++  M   R+ + +P + TY LL ++ L R     
Sbjct: 153 MSVPLYNCIIRFCCGRKFLFNRAFDVYNKML--RSDDSKPDLETYTLLLSSLLKRFNKLN 212

Query: 218 INHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRI 265
           + ++Y+  +R L +QM   +G+ PD F LN +IK Y   L +++A+R+
Sbjct: 213 VCYVYLHAVRSLTKQM-KSNGVIPDTFVLNMIIKAYAKCLEVDEAIRV 245

BLAST of Cla97C03G053350 vs. TAIR10
Match: AT3G16010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 73.9 bits (180), Expect = 2.4e-13
Identity = 41/163 (25.15%), Postives = 81/163 (49.69%), Query Frame = 0

Query: 107 VCFELFNWASQQPRFRHDVSTYQITIKKLGEAKMYEEMDHVVNQVLAVPSIG-SETLYNT 166
           V  + F WA ++  F+HD STY   I+ L EA++Y EM   + +V+    +  S  + + 
Sbjct: 108 VKIQFFKWAGKRRNFQHDCSTYMTLIRCLEEARLYGEMYRTIQEVVRNTYVSVSPAVLSE 167

Query: 167 MIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNAYINHMYMETI 226
           ++     A+ +++A+++F   +  +   C+P+  TYN +    +  G++  ++ +Y E  
Sbjct: 168 LVKALGRAKMVSKALSVFYQAKGRK---CKPTSSTYNSVILMLMQEGQHEKVHEVYTE-- 227

Query: 227 RCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRIFHQM 269
                 M N+    PD  T + +I  Y      + A+R+F +M
Sbjct: 228 ------MCNEGDCFPDTITYSALISSYEKLGRNDSAIRLFDEM 259

BLAST of Cla97C03G053350 vs. TAIR10
Match: AT1G53330.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 63.5 bits (153), Expect = 3.3e-10
Identity = 50/184 (27.17%), Postives = 91/184 (49.46%), Query Frame = 0

Query: 91  TSEELCNVISLQGNPLVCFELF---NWASQQPR--FRHDVSTYQITIKKLGEAKMYEEMD 150
           +S  L +++  + +P    +LF   +  S  P+  FR+ +  Y I I KLG +KM++E+D
Sbjct: 8   SSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGSKMFDELD 67

Query: 151 HVVNQVLA-VPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLL 210
            V+  +      + +E ++  +I FF   +  +RA+++F  M   R   C+ ++++ N L
Sbjct: 68  QVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYR---CQRTVKSLNSL 127

Query: 211 FTAFLSRGRNAYINHMYMETIRCLFRQMVNDDGIEPDIFTLNCMIKGYVLSLHINDALRI 269
            +A L  G          E  +   R    D+  +PD  T N +I G   S   +DAL++
Sbjct: 128 LSALLKCG----------ELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKL 178

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008463631.11.4e-16091.82PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
XP_022970418.15.4e-16091.79pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucur... [more]
XP_004147297.13.5e-15990.79PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
XP_022965006.14.6e-15991.54pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucur... [more]
XP_023519586.11.7e-15891.03pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucur... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3CJQ6|A0A1S3CJQ6_CUCME9.4e-16191.82pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like OS=Cuc... [more]
tr|A0A0A0LS50|A0A0A0LS50_CUCSA2.3e-15990.79Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G086930 PE=4 SV=1[more]
tr|A0A2N9FRL4|A0A2N9FRL4_FAGSY4.9e-12581.02Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS17745 PE=4 SV=1[more]
tr|A0A061GUJ4|A0A061GUJ4_THECC6.4e-12579.68Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao O... [more]
tr|A0A2P4KFP1|A0A2P4KFP1_QUESU2.7e-12380.43Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=5... [more]
Match NameE-valueIdentityDescription
sp|Q9ZUY1|PP173_ARATH5.6e-10066.15Pentatricopeptide repeat-containing protein At2g27800, mitochondrial OS=Arabidop... [more]
sp|Q9LSF5|PP254_ARATH3.4e-2028.07Pentatricopeptide repeat-containing protein At3g25210, mitochondrial OS=Arabidop... [more]
sp|Q9LW84|PP236_ARATH4.4e-1225.15Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX... [more]
sp|Q9MAG8|PPR79_ARATH6.0e-0927.17Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis th... [more]
sp|Q9LK57|PP226_ARATH1.0e-0822.94Pentatricopeptide repeat-containing protein At3g13160, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT2G27800.13.1e-10166.15Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G27300.12.1e-5746.59pentatricopeptide (PPR) repeat-containing protein[more]
AT3G25210.11.9e-2128.07Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G16010.12.4e-1325.15Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G53330.13.3e-1027.17Pentatricopeptide repeat (PPR) superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C03G053350.1Cla97C03G053350.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 163..208
e-value: 1.8E-4
score: 21.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 276..323
e-value: 9.1E-12
score: 44.9
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 345..373
e-value: 2.1E-7
score: 30.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 243..268
e-value: 0.019
score: 15.1
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 162..189
e-value: 0.0021
score: 16.1
coord: 279..312
e-value: 1.5E-7
score: 29.1
coord: 243..277
e-value: 1.9E-4
score: 19.4
coord: 314..347
e-value: 4.7E-4
score: 18.1
coord: 349..379
e-value: 8.6E-5
score: 20.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 277..311
score: 12.682
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 347..381
score: 10.084
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 312..346
score: 9.887
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 241..271
score: 7.322
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 197..231
score: 5.031
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 124..154
score: 5.546
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 159..189
score: 7.783
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 255..396
e-value: 1.8E-31
score: 111.9
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 47..393
NoneNo IPR availablePANTHERPTHR24015:SF1017PENTATRICOPEPTIDE PPR REPEAT-CONTAINING PROTEINcoord: 47..393

The following gene(s) are paralogous to this gene:

None