CsGy1G012900 (gene) Cucumber (Gy14) v2

Name: CsGy1G012900
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like
Location: Chr1: 8330643..8331929 (+)
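
As a quick sanity check on the location above, the sketch below computes the feature length implied by an inclusive, 1-based interval and the number of codons it would hold. The function name `feature_length` is hypothetical, and treating the whole interval as one uninterrupted coding exon is an assumption rather than something the record states.

```python
def feature_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based genomic interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_len = feature_length(8330643, 8331929)

# Assumption: the whole interval is coding (single exon, no UTRs), so it
# splits into whole codons, the last of which is the stop codon.
codons, remainder = divmod(gene_len, 3)
residues = codons - 1

print(gene_len, codons, remainder, residues)  # 1287 429 0 428
```

Under that assumption the interval holds 429 codons, that is, 428 residues plus a stop codon, which agrees with the 428-residue alignment lengths reported in the BLAST hits for this feature.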
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACCTCCTCCTTCGAATTCGCATTCACTTCTCAACCAATTCCATTGACCACCTCTTTAATTCCAACCCATCATATCCACATTTCGTTTGTTTCAGAAGATTTTCGATTCATTCCTGGTGGTTGAACAATACCCATCATTATAACCTCCCCCATTTTCTCCGTGTATCTCAAATCCATCCCTACTCAGGTCCCAACCTCTCTTTTACGAATTTTCTTTTGAAATTCTATTCAAGGGCGGCACCTTCAAGATCTTTCCGGAAGAGAGCGAATAAGAGGTTGAAATCCAGCCTCAAACCTAAACTGGACGAAACCCAATTTCAGTTAGCAGTTTCGAAAATCCCTCCGAGGTTTACCTCTGAAGAACTCTGTAATGTAATTTCTCTCCAAAGAGATCCTTTGGTTTGTTTTGAGTTGTTCAATTGGGCTTCACAGCAGCCTCGTTTCAGACATGATGATTCCTCTTATGAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTATGAAGAAATGGACCATGTTGTGAATCAGGCGCTTGCTGTTTCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATATATTTTTTCACTGAGGCAAGGAAGTTGACTAGAGCTGTCAATATATTTAAGCATATGCAGAACAATAGAAACTTGAACTGTAGGCCTTCAATTAGAACATATAATCTACTTTTTACTGCATTCTTGAGTCGAGGTCGTAATACTTATATAAATCATATGTATATGGAGACTATCAGATGTCTCTTCAGACAGATGGTGAATGATGATGGGATTGAACCTGATATATTTAGTTTGAATTGTATGATAAAGGGCTATGTGCTTTCCCTGCATGTTAATGATGCGCTTAGGATCTTTCACCAAATGGGGGTTGTGTATAGCTGCTTACCAAATTCATATTCCTATGACTATTTGATTCATGGGTTATGCGCCCAAGCTCGTACCGATAATGCAAAGGAATTGTGTAATGAAATGAAGGAAAAGGGGTTTGTGCCTAGTAGTATTTCGTATAATTCAATTGTGAATGCTCTGGCTCTCAATGGAGAGGTTGAAGATGCAGTGAATTATTTGTGGGAGATGATTGATAATAGAAGATCTCCTGATTTTATTACGTATAAGACAGTATTGGACGAACTGTGTAGACAAGGGAAGGTGGTTGAAGCCACGAGTTTGTTGAGGGAATTGCAAGAGAAGGATCTTGTGGATGGTCATACTTATAGGAAACTTCTCTATGTGCTTGAAGATGATTATGGAAATCTAAATTGA

mRNA sequence

ATGAACCTCCTCCTTCGAATTCGCATTCACTTCTCAACCAATTCCATTGACCACCTCTTTAATTCCAACCCATCATATCCACATTTCGTTTGTTTCAGAAGATTTTCGATTCATTCCTGGTGGTTGAACAATACCCATCATTATAACCTCCCCCATTTTCTCCGTGTATCTCAAATCCATCCCTACTCAGGTCCCAACCTCTCTTTTACGAATTTTCTTTTGAAATTCTATTCAAGGGCGGCACCTTCAAGATCTTTCCGGAAGAGAGCGAATAAGAGGTTGAAATCCAGCCTCAAACCTAAACTGGACGAAACCCAATTTCAGTTAGCAGTTTCGAAAATCCCTCCGAGGTTTACCTCTGAAGAACTCTGTAATGTAATTTCTCTCCAAAGAGATCCTTTGGTTTGTTTTGAGTTGTTCAATTGGGCTTCACAGCAGCCTCGTTTCAGACATGATGATTCCTCTTATGAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTATGAAGAAATGGACCATGTTGTGAATCAGGCGCTTGCTGTTTCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATATATTTTTTCACTGAGGCAAGGAAGTTGACTAGAGCTGTCAATATATTTAAGCATATGCAGAACAATAGAAACTTGAACTGTAGGCCTTCAATTAGAACATATAATCTACTTTTTACTGCATTCTTGAGTCGAGGTCGTAATACTTATATAAATCATATGTATATGGAGACTATCAGATGTCTCTTCAGACAGATGGTGAATGATGATGGGATTGAACCTGATATATTTAGTTTGAATTGTATGATAAAGGGCTATGTGCTTTCCCTGCATGTTAATGATGCGCTTAGGATCTTTCACCAAATGGGGGTTGTGTATAGCTGCTTACCAAATTCATATTCCTATGACTATTTGATTCATGGGTTATGCGCCCAAGCTCGTACCGATAATGCAAAGGAATTGTGTAATGAAATGAAGGAAAAGGGGTTTGTGCCTAGTAGTATTTCGTATAATTCAATTGTGAATGCTCTGGCTCTCAATGGAGAGGTTGAAGATGCAGTGAATTATTTGTGGGAGATGATTGATAATAGAAGATCTCCTGATTTTATTACGTATAAGACAGTATTGGACGAACTGTGTAGACAAGGGAAGGTGGTTGAAGCCACGAGTTTGTTGAGGGAATTGCAAGAGAAGGATCTTGTGGATGGTCATACTTATAGGAAACTTCTCTATGTGCTTGAAGATGATTATGGAAATCTAAATTGA

Coding sequence (CDS)

ATGAACCTCCTCCTTCGAATTCGCATTCACTTCTCAACCAATTCCATTGACCACCTCTTTAATTCCAACCCATCATATCCACATTTCGTTTGTTTCAGAAGATTTTCGATTCATTCCTGGTGGTTGAACAATACCCATCATTATAACCTCCCCCATTTTCTCCGTGTATCTCAAATCCATCCCTACTCAGGTCCCAACCTCTCTTTTACGAATTTTCTTTTGAAATTCTATTCAAGGGCGGCACCTTCAAGATCTTTCCGGAAGAGAGCGAATAAGAGGTTGAAATCCAGCCTCAAACCTAAACTGGACGAAACCCAATTTCAGTTAGCAGTTTCGAAAATCCCTCCGAGGTTTACCTCTGAAGAACTCTGTAATGTAATTTCTCTCCAAAGAGATCCTTTGGTTTGTTTTGAGTTGTTCAATTGGGCTTCACAGCAGCCTCGTTTCAGACATGATGATTCCTCTTATGAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTATGAAGAAATGGACCATGTTGTGAATCAGGCGCTTGCTGTTTCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATATATTTTTTCACTGAGGCAAGGAAGTTGACTAGAGCTGTCAATATATTTAAGCATATGCAGAACAATAGAAACTTGAACTGTAGGCCTTCAATTAGAACATATAATCTACTTTTTACTGCATTCTTGAGTCGAGGTCGTAATACTTATATAAATCATATGTATATGGAGACTATCAGATGTCTCTTCAGACAGATGGTGAATGATGATGGGATTGAACCTGATATATTTAGTTTGAATTGTATGATAAAGGGCTATGTGCTTTCCCTGCATGTTAATGATGCGCTTAGGATCTTTCACCAAATGGGGGTTGTGTATAGCTGCTTACCAAATTCATATTCCTATGACTATTTGATTCATGGGTTATGCGCCCAAGCTCGTACCGATAATGCAAAGGAATTGTGTAATGAAATGAAGGAAAAGGGGTTTGTGCCTAGTAGTATTTCGTATAATTCAATTGTGAATGCTCTGGCTCTCAATGGAGAGGTTGAAGATGCAGTGAATTATTTGTGGGAGATGATTGATAATAGAAGATCTCCTGATTTTATTACGTATAAGACAGTATTGGACGAACTGTGTAGACAAGGGAAGGTGGTTGAAGCCACGAGTTTGTTGAGGGAATTGCAAGAGAAGGATCTTGTGGATGGTCATACTTATAGGAAACTTCTCTATGTGCTTGAAGATGATTATGGAAATCTAAATTGA

Protein sequence

MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIHPYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFVPSSISYNSIVNALALNGEVEDAVNYLWEMIDNRRSPDFITYKTVLDELCRQGKVVEATSLLRELQEKDLVDGHTYRKLLYVLEDDYGNLN

BLAST of CsGy1G012900 vs. NCBI nr
Match: XP_004147297.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis sativus] >KGN64750.1 hypothetical protein Csa_1G086930 [Cucumis sativus])

HSP 1 Score: 721.1 bits (1860), Expect = 2.2e-204
Identity = 427/428 (99.77%), Positives = 428/428 (100.00%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH
Sbjct: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
           PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS
Sbjct: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL
Sbjct: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           SCLPNSYS+DYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SCLPNSYSFDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGNLN
Sbjct: 421 EDDYGNLN 428
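
The percentages on each HSP summary line follow directly from the matched/aligned counts. The following is a minimal sketch, not part of the database output, that re-derives them from such a line. The helper name `parse_hsp_stats` and the regular expression are assumptions made for illustration.

```python
import re

# Matches fields such as "Identity = 427/428". The "Posi?tives" part also
# accepts the "Postives" spelling that appears in some exports of these pages.
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line: str) -> dict:
    """Return identity/positives percentages recomputed from the raw counts."""
    stats = {}
    for label, matched, total in HSP_FIELD.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = round(100.0 * int(matched) / int(total), 2)
    return stats


line = "Identity = 427/428 (99.77%), Positives = 428/428 (100.00%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'identity': 99.77, 'positives': 100.0}
```

Recomputing from the counts rather than reading the printed percentage makes it easy to spot a summary line whose counts and percentage disagree.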

BLAST of CsGy1G012900 vs. NCBI nr
Match: XP_008463631.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo] >XP_008463637.1 PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo] >XP_008463644.1 PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo])

HSP 1 Score: 652.9 bits (1683), Expect = 7.5e-184
Identity = 397/428 (92.76%), Positives = 408/428 (95.33%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MN L RIRIHFSTNSI+HLFNSNPSYPHF+CFRRFSIHS  LNNTHH NL HFLRVSQI 
Sbjct: 1   MNPLFRIRIHFSTNSINHLFNSNPSYPHFICFRRFSIHSLSLNNTHHCNLTHFLRVSQIQ 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
            Y  PNLSFT F L FYS+ APSRSFR+RANKRLK+SLKP LDE QFQLAVSKIPPRFT 
Sbjct: 61  TYPAPNLSFTKFCLNFYSKTAPSRSFRRRANKRLKASLKPTLDEAQFQLAVSKIPPRFTP 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EEL NVISLQ+DPLVCFELFNWASQQPRF+HD SSYEITIKKLGEAKMYEEMDHVVNQAL
Sbjct: 121 EELRNVISLQKDPLVCFELFNWASQQPRFKHDVSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AVSSIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RNTYINH+YMETIRCLFRQMVNDDGIEPDIF+LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHVYMETIRCLFRQMVNDDGIEPDIFALNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           SCLPNSYSYDYLIHGL AQARTDNA+ELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SCLPNSYSYDYLIHGLSAQARTDNARELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGNLN
Sbjct: 421 EDDYGNLN 428

BLAST of CsGy1G012900 vs. NCBI nr
Match: XP_022965006.1 (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 583.2 bits (1502), Expect = 7.3e-163
Identity = 369/428 (86.21%), Positives = 389/428 (90.89%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MNLLLRIRI F  NSI+ L NS PS PHFVC RRFS HSW+L NT H N  H LR+SQI 
Sbjct: 1   MNLLLRIRIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWFL-NTDHGNFSHILRLSQIQ 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
            Y  P LSFT F L FYS +APSRSFR+RA+KRLKS +KPKL+E QFQ A+S+IPPRF S
Sbjct: 61  NYPAPVLSFTKFCLNFYSNSAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNS 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EEL NVIS+Q DPLVCFELFNWASQQ RFRHD S+YEITIKKLGEAKMYEEMD+VVNQ L
Sbjct: 121 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNN+NLNCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RN YIN MYMETIRCLFRQMVN DGIEPDIF+LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           S LPNS+SYDYLIHGLCAQARTDNA+ELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QEK+LVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEKELVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGN+N
Sbjct: 421 EDDYGNVN 426

BLAST of CsGy1G012900 vs. NCBI nr
Match: XP_022970418.1 (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima] >XP_022970419.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima] >XP_022970420.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima] >XP_022970421.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita maxima])

HSP 1 Score: 579.3 bits (1492), Expect = 1.1e-161
Identity = 368/428 (85.98%), Positives = 387/428 (90.42%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MNLLLRIRI F  NSI+ L NS PS P+FVC RRFS HS  L NT H N  H LR+SQI 
Sbjct: 1   MNLLLRIRIQFLRNSINQLCNSIPSNPYFVCLRRFSFHSLLL-NTDHGNFSHILRLSQIQ 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
            Y  P LSFT F L FYS  APSRSFR+RA+KRLKS +KPKL+E QFQ A+S+IPPRF S
Sbjct: 61  NYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQQAISQIPPRFNS 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EEL NVIS+Q DPLVCFELFNWASQQ RFRHD S+YEITIKKLGEAKMYEEMD+VVNQ L
Sbjct: 121 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RN YINHMYMETIRCLFRQMVN DGIEPDIF+LNCMIKGYVLSLHVNDALR+FHQMGVVY
Sbjct: 241 RNAYINHMYMETIRCLFRQMVN-DGIEPDIFTLNCMIKGYVLSLHVNDALRVFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           S LPNS+SYDYLIHGLCAQARTDNA+ELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QEK+LVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEKELVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGN+N
Sbjct: 421 EDDYGNVN 426

BLAST of CsGy1G012900 vs. NCBI nr
Match: XP_023519586.1 (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita pepo subsp. pepo] >XP_023519587.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita pepo subsp. pepo] >XP_023519588.1 pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 576.6 bits (1485), Expect = 6.8e-161
Identity = 366/428 (85.51%), Positives = 386/428 (90.19%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MNLLLRIRI F  NSI+ L NS PS PHFVC RRFS HSW L NT H N  H  R+SQI 
Sbjct: 1   MNLLLRIRIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWLL-NTDHGNFSHIFRLSQIQ 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
            Y  P LSFT F L FYS  APSRSFR+RA+KRLKS +KPKL+E QFQ A+S+IPPRF S
Sbjct: 61  NYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNS 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EEL NVIS+Q DPLVCFELFNWASQQ RFRHD S+YEITIKKLGEAKMYEEMD+VVNQ L
Sbjct: 121 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNN+N+NCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RN YIN MYMETIRCLFRQMVN DGIEPDIF+LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           S LPNS+SYDYLIHGLCAQARTDNA+ELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX QEK+LVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVQEKELVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGN++
Sbjct: 421 EDDYGNVH 426

BLAST of CsGy1G012900 vs. TAIR10
Match: AT2G27800.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 366.7 bits (940), Expect = 2.0e-101
Identity = 172/261 (65.90%), Positives = 214/261 (81.99%), Query Frame = 0

Query: 77  YSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDPLVC 136
           YS + P+RS R+R + R KSS KP L+ ++F   +SK+PPRFT EEL + I+L+ DP +C
Sbjct: 96  YSTSVPTRSLRRRISNRKKSSAKPILNVSKFHETISKLPPRFTPEELADAITLEEDPFLC 155

Query: 137 FELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIY 196
           F LFNWASQQPRF H++ SY I I+KLG AKMY+EMD +VNQ L+V  IG+E LYN++I+
Sbjct: 156 FHLFNWASQQPRFTHENCSYHIAIRKLGAAKMYQEMDDIVNQVLSVRHIGNENLYNSIIF 215

Query: 197 FFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMETIRCL 256
           +FT+A KL RAVNIF+HM  ++NL CRP+IRTY++LF A L RG N+YINH+YMET+R L
Sbjct: 216 YFTKAGKLIRAVNIFRHMVTSKNLECRPTIRTYHILFKALLGRGNNSYINHVYMETVRSL 275

Query: 257 FRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYLIHGL 316
           FRQMV D GIEPD+F+LNC++KGYVLSLHVNDALRIFHQM VVY C PNS++YDYLIHGL
Sbjct: 276 FRQMV-DSGIEPDVFALNCLVKGYVLSLHVNDALRIFHQMSVVYDCEPNSFTYDYLIHGL 335

Query: 317 CAQARTDNAKELCNEMKEKGF 338
           CAQ RT NA+EL +EMK KGF
Sbjct: 336 CAQGRTINARELLSEMKGKGF 355

BLAST of CsGy1G012900 vs. TAIR10
Match: AT5G27300.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 222.6 bits (566), Expect = 4.6e-58
Identity = 117/249 (46.99%), Positives = 156/249 (62.65%), Query Frame = 0

Query: 77  YSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDPLVC 136
           YS + P+RS R+R + R KSS KP L+E++FQ  +SK+PPRFT EEL + I+L+ DP +C
Sbjct: 89  YSTSVPTRSLRRRISSRKKSSTKPILNESKFQETISKLPPRFTPEELADAITLEEDPFLC 148

Query: 137 FELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIY 196
           F LFNWASQQPRF H++ SY I I+KLG AK                             
Sbjct: 149 FHLFNWASQQPRFTHENCSYHIAIRKLGAAK----------------------------- 208

Query: 197 FFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMETIRCL 256
               + KL RAVNIF+HM N+RNL CRP++RTY++LF A L RG N++INH+YMET+R L
Sbjct: 209 ----SGKLIRAVNIFRHMVNSRNLECRPTMRTYHILFKALLGRGNNSFINHLYMETVRSL 268

Query: 257 FRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYLIHGL 316
           FRQMV D GIEPD+F+LNC++KG   +++  + L      G V    PN  SY+ L++  
Sbjct: 269 FRQMV-DSGIEPDVFALNCLVKG--RTINTRELLSEMKGKGFV----PNGKSYNSLVNAF 297

Query: 317 CAQARTDNA 326
                 D+A
Sbjct: 329 ALSGEIDDA 297

BLAST of CsGy1G012900 vs. TAIR10
Match: AT3G25210.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 99.8 bits (247), Expect = 4.5e-21
Identity = 62/220 (28.18%), Positives = 114/220 (51.82%), Query Frame = 0

Query: 74  LKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDP 133
           L F S ++   S    +  R ++ L     ETQF+  +  + P FT+ ++   +  Q DP
Sbjct: 34  LSFSSVSSSPESHTSPSRIRTRTPL-----ETQFETWIQNLKPGFTNSDVVIALRAQSDP 93

Query: 134 LVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNT 193
            +  ++F W +QQ  ++H+  +Y   IK+    K    ++ ++ + +A +   S  LYN 
Sbjct: 94  DLALDIFRWTAQQRGYKHNHEAYHTMIKQAITGKRNNFVETLIEEVIAGACEMSVPLYNC 153

Query: 194 MIYFFTEARKL-TRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMET 253
           +I F    + L  RA +++  M   R+ + +P + TY LL ++ L R     + ++Y+  
Sbjct: 154 IIRFCCGRKFLFNRAFDVYNKML--RSDDSKPDLETYTLLLSSLLKRFNKLNVCYVYLHA 213

Query: 254 IRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRI 293
           +R L +QM   +G+ PD F LN +IK Y   L V++A+R+
Sbjct: 214 VRSLTKQM-KSNGVIPDTFVLNMIIKAYAKCLEVDEAIRV 245

BLAST of CsGy1G012900 vs. TAIR10
Match: AT3G16010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 68.2 bits (165), Expect = 1.4e-11
Identity = 38/173 (21.97%), Positives = 86/173 (49.71%), Query Frame = 0

Query: 125 NVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSS 184
           +++ +  +  V  + F WA ++  F+HD S+Y   I+ L EA++Y EM   + + +  + 
Sbjct: 98  SILEIDVEINVKIQFFKWAGKRRNFQHDCSTYMTLIRCLEEARLYGEMYRTIQEVVRNTY 157

Query: 185 IG-SETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNT 244
           +  S  + + ++     A+ +++A+++F   +  +   C+P+  TYN +    +  G++ 
Sbjct: 158 VSVSPAVLSELVKALGRAKMVSKALSVFYQAKGRK---CKPTSSTYNSVILMLMQEGQHE 217

Query: 245 YINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQM 297
            ++ +Y E        M N+    PD  + + +I  Y      + A+R+F +M
Sbjct: 218 KVHEVYTE--------MCNEGDCFPDTITYSALISSYEKLGRNDSAIRLFDEM 259

BLAST of CsGy1G012900 vs. TAIR10
Match: AT5G18475.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 65.1 bits (157), Expect = 1.2e-10
Identity = 47/215 (21.86%), Positives = 98/215 (45.58%), Query Frame = 0

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           E   +++  +RDP    ++FN ASQQ  F H++++Y + +  L   K +  +D +++Q  
Sbjct: 57  ESAVSLMKRERDPQGVLDIFNKASQQKGFNHNNATYSVLLDNLVRHKKFLAVDAILHQMK 116

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
             +    E+L+  ++  F+ +    + + +F  +Q    +  +PS+   +      +  G
Sbjct: 117 YETCRFQESLFLNLMRHFSRSDLHDKVMEMFNLIQVIARV--KPSLNAISTCLNLLIDSG 176

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
                    +   R L     ++ G++P+    N ++K +  +  +N A  +  +M    
Sbjct: 177 E--------VNLSRKLLLYAKHNLGLQPNTCIFNILVKHHCKNGDINFAFLVVEEMKRSG 236

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEK 336
              PNS +Y  L+  L A +R+  A EL  +M  K
Sbjct: 237 ISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISK 261

BLAST of CsGy1G012900 vs. Swiss-Prot
Match: sp|Q9ZUY1|PP173_ARATH (Pentatricopeptide repeat-containing protein At2g27800, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g27800 PE=3 SV=2)

HSP 1 Score: 366.7 bits (940), Expect = 3.5e-100
Identity = 172/261 (65.90%), Positives = 214/261 (81.99%), Query Frame = 0

Query: 77  YSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDPLVC 136
           YS + P+RS R+R + R KSS KP L+ ++F   +SK+PPRFT EEL + I+L+ DP +C
Sbjct: 96  YSTSVPTRSLRRRISNRKKSSAKPILNVSKFHETISKLPPRFTPEELADAITLEEDPFLC 155

Query: 137 FELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNTMIY 196
           F LFNWASQQPRF H++ SY I I+KLG AKMY+EMD +VNQ L+V  IG+E LYN++I+
Sbjct: 156 FHLFNWASQQPRFTHENCSYHIAIRKLGAAKMYQEMDDIVNQVLSVRHIGNENLYNSIIF 215

Query: 197 FFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMETIRCL 256
           +FT+A KL RAVNIF+HM  ++NL CRP+IRTY++LF A L RG N+YINH+YMET+R L
Sbjct: 216 YFTKAGKLIRAVNIFRHMVTSKNLECRPTIRTYHILFKALLGRGNNSYINHVYMETVRSL 275

Query: 257 FRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYLIHGL 316
           FRQMV D GIEPD+F+LNC++KGYVLSLHVNDALRIFHQM VVY C PNS++YDYLIHGL
Sbjct: 276 FRQMV-DSGIEPDVFALNCLVKGYVLSLHVNDALRIFHQMSVVYDCEPNSFTYDYLIHGL 335

Query: 317 CAQARTDNAKELCNEMKEKGF 338
           CAQ RT NA+EL +EMK KGF
Sbjct: 336 CAQGRTINARELLSEMKGKGF 355

BLAST of CsGy1G012900 vs. Swiss-Prot
Match: sp|Q9LSF5|PP254_ARATH (Pentatricopeptide repeat-containing protein At3g25210, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g25210 PE=2 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 8.0e-20
Identity = 62/220 (28.18%), Positives = 114/220 (51.82%), Query Frame = 0

Query: 74  LKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRDP 133
           L F S ++   S    +  R ++ L     ETQF+  +  + P FT+ ++   +  Q DP
Sbjct: 34  LSFSSVSSSPESHTSPSRIRTRTPL-----ETQFETWIQNLKPGFTNSDVVIALRAQSDP 93

Query: 134 LVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYNT 193
            +  ++F W +QQ  ++H+  +Y   IK+    K    ++ ++ + +A +   S  LYN 
Sbjct: 94  DLALDIFRWTAQQRGYKHNHEAYHTMIKQAITGKRNNFVETLIEEVIAGACEMSVPLYNC 153

Query: 194 MIYFFTEARKL-TRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMET 253
           +I F    + L  RA +++  M   R+ + +P + TY LL ++ L R     + ++Y+  
Sbjct: 154 IIRFCCGRKFLFNRAFDVYNKML--RSDDSKPDLETYTLLLSSLLKRFNKLNVCYVYLHA 213

Query: 254 IRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRI 293
           +R L +QM   +G+ PD F LN +IK Y   L V++A+R+
Sbjct: 214 VRSLTKQM-KSNGVIPDTFVLNMIIKAYAKCLEVDEAIRV 245

BLAST of CsGy1G012900 vs. Swiss-Prot
Match: sp|Q9LW84|PP236_ARATH (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX=3702 GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 68.2 bits (165), Expect = 2.6e-10
Identity = 38/173 (21.97%), Positives = 86/173 (49.71%), Query Frame = 0

Query: 125 NVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSS 184
           +++ +  +  V  + F WA ++  F+HD S+Y   I+ L EA++Y EM   + + +  + 
Sbjct: 98  SILEIDVEINVKIQFFKWAGKRRNFQHDCSTYMTLIRCLEEARLYGEMYRTIQEVVRNTY 157

Query: 185 IG-SETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNT 244
           +  S  + + ++     A+ +++A+++F   +  +   C+P+  TYN +    +  G++ 
Sbjct: 158 VSVSPAVLSELVKALGRAKMVSKALSVFYQAKGRK---CKPTSSTYNSVILMLMQEGQHE 217

Query: 245 YINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQM 297
            ++ +Y E        M N+    PD  + + +I  Y      + A+R+F +M
Sbjct: 218 KVHEVYTE--------MCNEGDCFPDTITYSALISSYEKLGRNDSAIRLFDEM 259

BLAST of CsGy1G012900 vs. Swiss-Prot
Match: sp|Q3E9F0|PP392_ARATH (Pentatricopeptide repeat-containing protein At5g18475 OS=Arabidopsis thaliana OX=3702 GN=At5g18475 PE=2 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 2.2e-09
Identity = 47/215 (21.86%), Positives = 98/215 (45.58%), Query Frame = 0

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           E   +++  +RDP    ++FN ASQQ  F H++++Y + +  L   K +  +D +++Q  
Sbjct: 57  ESAVSLMKRERDPQGVLDIFNKASQQKGFNHNNATYSVLLDNLVRHKKFLAVDAILHQMK 116

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
             +    E+L+  ++  F+ +    + + +F  +Q    +  +PS+   +      +  G
Sbjct: 117 YETCRFQESLFLNLMRHFSRSDLHDKVMEMFNLIQVIARV--KPSLNAISTCLNLLIDSG 176

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
                    +   R L     ++ G++P+    N ++K +  +  +N A  +  +M    
Sbjct: 177 E--------VNLSRKLLLYAKHNLGLQPNTCIFNILVKHHCKNGDINFAFLVVEEMKRSG 236

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEK 336
              PNS +Y  L+  L A +R+  A EL  +M  K
Sbjct: 237 ISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISK 261

BLAST of CsGy1G012900 vs. Swiss-Prot
Match: sp|Q9MAG8|PPR79_ARATH (Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis thaliana OX=3702 GN=At1g53330 PE=3 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 3.2e-08
Identity = 50/184 (27.17%), Positives = 91/184 (49.46%), Query Frame = 0

Query: 119 TSEELCNVISLQRDPLVCFELF---NWASQQPR--FRHDDSSYEITIKKLGEAKMYEEMD 178
           +S  L +++  + DP    +LF   +  S  P+  FR+    Y+I I KLG +KM++E+D
Sbjct: 8   SSFRLASLLRRENDPSAAMKLFRNPDPESTNPKRPFRYSLLCYDIIITKLGGSKMFDELD 67

Query: 179 HV-VNQALAVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLL 238
            V ++       + +E ++  +I FF   +  +RA+++F  M   R   C+ ++++ N L
Sbjct: 68  QVLLHLKTDTRIVPTEIIFCNVINFFGRGKLPSRALHMFDEMPQYR---CQRTVKSLNSL 127

Query: 239 FTAFLSRGRNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRI 297
            +A L  G          E  +   R    D+  +PD  + N +I G   S   +DAL++
Sbjct: 128 LSALLKCG----------ELEKMKERLSSIDEFGKPDACTYNILIHGCSQSGCFDDALKL 178

BLAST of CsGy1G012900 vs. TrEMBL
Match: tr|A0A0A0LS50|A0A0A0LS50_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G086930 PE=4 SV=1)

HSP 1 Score: 721.1 bits (1860), Expect = 1.5e-204
Identity = 427/428 (99.77%), Positives = 428/428 (100.00%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH
Sbjct: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
           PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS
Sbjct: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL
Sbjct: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           SCLPNSYS+DYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SCLPNSYSFDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGNLN
Sbjct: 421 EDDYGNLN 428

BLAST of CsGy1G012900 vs. TrEMBL
Match: tr|A0A1S3CJQ6|A0A1S3CJQ6_CUCME (pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501729 PE=4 SV=1)

HSP 1 Score: 652.9 bits (1683), Expect = 5.0e-184
Identity = 397/428 (92.76%), Positives = 408/428 (95.33%), Query Frame = 0

Query: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60
           MN L RIRIHFSTNSI+HLFNSNPSYPHF+CFRRFSIHS  LNNTHH NL HFLRVSQI 
Sbjct: 1   MNPLFRIRIHFSTNSINHLFNSNPSYPHFICFRRFSIHSLSLNNTHHCNLTHFLRVSQIQ 60

Query: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120
            Y  PNLSFT F L FYS+ APSRSFR+RANKRLK+SLKP LDE QFQLAVSKIPPRFT 
Sbjct: 61  TYPAPNLSFTKFCLNFYSKTAPSRSFRRRANKRLKASLKPTLDEAQFQLAVSKIPPRFTP 120

Query: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180
           EEL NVISLQ+DPLVCFELFNWASQQPRF+HD SSYEITIKKLGEAKMYEEMDHVVNQAL
Sbjct: 121 EELRNVISLQKDPLVCFELFNWASQQPRFKHDVSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240
           AVSSIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RNTYINH+YMETIRCLFRQMVNDDGIEPDIF+LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHVYMETIRCLFRQMVNDDGIEPDIFALNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360
           SCLPNSYSYDYLIHGL AQARTDNA+ELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SCLPNSYSYDYLIHGLSAQARTDNARELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNLN 429
           EDDYGNLN
Sbjct: 421 EDDYGNLN 428

BLAST of CsGy1G012900 vs. TrEMBL
Match: tr|A0A061GUJ4|A0A061GUJ4_THECC (Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao OX=3641 GN=TCM_041079 PE=4 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 9.2e-122
Identity = 309/430 (71.86%), Positives = 347/430 (80.70%), Query Frame = 0

Query: 2   NLLLRIRIHFSTNSIDHLFNSNPSYPHFVCF-----RRFSIHSWWLNNTHHYNLPHFLRV 61
           N   + +I  STN     F++ P Y  F  +     + F      L  T    L      
Sbjct: 11  NFYHKTKIFISTNP----FHNFPYYSVFSSYLNPFIKDFKPKESLLGLTQIDPLSVISPT 70

Query: 62  SQIHPYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPP 121
           + +HP+        N    FYS  APSRSFR+R NKRLK+S KP LD+ +F+ AVS++ P
Sbjct: 71  ANLHPFC------YNSFTCFYSTRAPSRSFRRRINKRLKASSKPVLDQPKFEKAVSQLLP 130

Query: 122 RFTSEELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVV 181
           RFT+EELCNVI+L+ DPLVC+ELFNWA QQPRFRHD S+Y ITIKKLG AKMYEEMD VV
Sbjct: 131 RFTAEELCNVITLEEDPLVCWELFNWAVQQPRFRHDVSTYHITIKKLGVAKMYEEMDVVV 190

Query: 182 NQALAVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAF 241
           NQ LA+ + GSE LYNT+IYFFTEARKLTRAVNIFKHM+NNR L+CRPSIRTYN+LFTA 
Sbjct: 191 NQVLALRTFGSEPLYNTIIYFFTEARKLTRAVNIFKHMRNNRKLDCRPSIRTYNILFTAM 250

Query: 242 LSRGRNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQM 301
           LSRGR++YINHMYMETIRCLFRQMVN DGIEPD+FSLN MIKGYVLSLHVNDALR+FHQM
Sbjct: 251 LSRGRDSYINHMYMETIRCLFRQMVN-DGIEPDVFSLNSMIKGYVLSLHVNDALRVFHQM 310

Query: 302 GVVYSCLPNSYSYDYLIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXX 361
           GVVY CLPNSYSYD+LI+GLCAQ RT+NA+ELCNEMK+ GFXXXXXXXXXXXXXXXXXXX
Sbjct: 311 GVVYKCLPNSYSYDFLIYGLCAQGRTNNARELCNEMKKNGFXXXXXXXXXXXXXXXXXXX 370

Query: 362 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKL 421
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   DLVDGHTYRKL
Sbjct: 371 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLVDGHTYRKL 429

Query: 422 LYVLEDDYGN 427
           LY +EDD+GN
Sbjct: 431 LYAMEDDFGN 429

BLAST of CsGy1G012900 vs. TrEMBL
Match: tr|A0A2N9FRL4|A0A2N9FRL4_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS17745 PE=4 SV=1)

HSP 1 Score: 445.3 bits (1144), Expect = 1.6e-121
Identity = 293/354 (82.77%), Positives = 321/354 (90.68%), Query Frame = 0

Query: 73  LLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQRD 132
           L   YS  APSRSFR+R +KR K + KP+L+E QFQ AVS++ PRFT+EELCNVI+LQ D
Sbjct: 25  LNSLYSTKAPSRSFRRRESKRSKLNAKPRLNEAQFQRAVSQLTPRFTAEELCNVITLQED 84

Query: 133 PLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLYN 192
           P+VCFELFNWASQQ RFRHD  +Y ITIKKLG AKMYEEMD+VVNQ LAV  IGSE LYN
Sbjct: 85  PIVCFELFNWASQQHRFRHDVCTYHITIKKLGAAKMYEEMDNVVNQVLAVPYIGSEALYN 144

Query: 193 TMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYMET 252
           T+IYF TEARKLTRAVN+FKHM+N+RNL+CRPSIRTYNLLF AFLSRG N+YINHMYMET
Sbjct: 145 TVIYFSTEARKLTRAVNVFKHMRNSRNLDCRPSIRTYNLLFAAFLSRGNNSYINHMYMET 204

Query: 253 IRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDYL 312
           IRCLFRQMVN DGI+PDIFSLN MIKGYVLSLHVNDALR+FHQM VVY CLPNS+SYDYL
Sbjct: 205 IRCLFRQMVN-DGIQPDIFSLNSMIKGYVLSLHVNDALRVFHQMDVVYKCLPNSFSYDYL 264

Query: 313 IHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 372
           IHGLCAQ RT+NAKELC++MK+KGFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 265 IHGLCAQGRTNNAKELCDQMKQKGFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 324

Query: 373 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVLEDDYGN 427
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   D+VDGHTY+KLLYVLEDD+GN
Sbjct: 325 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDIVDGHTYKKLLYVLEDDFGN 377

BLAST of CsGy1G012900 vs. TrEMBL
Match: tr|A0A0D2RRL4|A0A0D2RRL4_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_004G012900 PE=4 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 1.3e-120
Identity = 299/415 (72.05%), Positives = 343/415 (82.65%), Query Frame = 0

Query: 14  NSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIHPYS--GPNLSFTN 73
           N + +LF+  P + +F  +  FS +     N  H         +QI+P S   P      
Sbjct: 11  NCLINLFHRCP-FHYFPFYSVFSCNLGTFKNPQHSP----FTFTQINPSSITSPTSISHQ 70

Query: 74  FLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTSEELCNVISLQR 133
           F   FYS  APSRS+R+R NKRLK+S KP LD+ +FQ  +S++PPRFT++EL NVI+L+ 
Sbjct: 71  FYTYFYSTKAPSRSYRRRVNKRLKASQKPVLDQAKFQQVISQLPPRFTADELYNVITLED 130

Query: 134 DPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQALAVSSIGSETLY 193
           DPLVC+ELFNWA+QQPRF+H+ S+Y ITIKKLG AKMYEEMD VVNQ LA+ S GSE LY
Sbjct: 131 DPLVCWELFNWAAQQPRFKHNVSTYHITIKKLGVAKMYEEMDVVVNQVLALRSFGSEPLY 190

Query: 194 NTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRGRNTYINHMYME 253
           NTMIYFF EARKLTRAVNIFKHM+NNR  +CRPSIRTYN+LFTA LSRG+++YINHMYME
Sbjct: 191 NTMIYFFAEARKLTRAVNIFKHMRNNRKFDCRPSIRTYNILFTAMLSRGKDSYINHMYME 250

Query: 254 TIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVYSCLPNSYSYDY 313
           TIRCLFRQMV DDGIEPD+F+LN MIKGYVLSLHVNDALR+FHQMGVVY CLPN++SYDY
Sbjct: 251 TIRCLFRQMV-DDGIEPDVFTLNSMIKGYVLSLHVNDALRVFHQMGVVYKCLPNAFSYDY 310

Query: 314 LIHGLCAQARTDNAKELCNEMKEKGFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 373
           LI+GLCAQ RT+NA+ELC+EMK  GF XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 311 LIYGLCAQGRTNNARELCDEMKRNGFTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370

Query: 374 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQEKDLVDGHTYRKLLYVLEDDYGN 427
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ KDLVDGHTYRKLLY +ED YG+
Sbjct: 371 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQSKDLVDGHTYRKLLYAMEDSYGD 419

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004147297.1  2.2e-204  99.77     PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-...
XP_008463631.1  7.5e-184  92.76     PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-...
XP_022965006.1  7.3e-163  86.21     pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucur...
XP_022970418.1  1.1e-161  85.98     pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucur...
XP_023519586.1  6.8e-161  85.51     pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucur...
Match Name   E-value   Identity  Description
AT2G27800.1  2.0e-101  65.90     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27300.1  4.6e-58   46.99     pentatricopeptide (PPR) repeat-containing protein
AT3G25210.1  4.5e-21   28.18     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G16010.1  1.4e-11   21.97     Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G18475.1  1.2e-10   21.86     Pentatricopeptide repeat (PPR) superfamily protein
Match Name             E-value   Identity  Description
sp|Q9ZUY1|PP173_ARATH  3.5e-100  65.90     Pentatricopeptide repeat-containing protein At2g27800, mitochondrial OS=Arabidop...
sp|Q9LSF5|PP254_ARATH  8.0e-20   28.18     Pentatricopeptide repeat-containing protein At3g25210, mitochondrial OS=Arabidop...
sp|Q9LW84|PP236_ARATH  2.6e-10   21.97     Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX...
sp|Q3E9F0|PP392_ARATH  2.2e-09   21.86     Pentatricopeptide repeat-containing protein At5g18475 OS=Arabidopsis thaliana OX...
sp|Q9MAG8|PPR79_ARATH  3.2e-08   27.17     Putative pentatricopeptide repeat-containing protein At1g53330 OS=Arabidopsis th...
Match Name                      E-value   Identity  Description
tr|A0A0A0LS50|A0A0A0LS50_CUCSA  1.5e-204  99.77     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G086930 PE=4 SV=1
tr|A0A1S3CJQ6|A0A1S3CJQ6_CUCME  5.0e-184  92.76     pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like OS=Cuc...
tr|A0A061GUJ4|A0A061GUJ4_THECC  9.2e-122  71.86     Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao O...
tr|A0A2N9FRL4|A0A2N9FRL4_FAGSY  1.6e-121  82.77     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS17745 PE=4 SV=1
tr|A0A0D2RRL4|A0A0D2RRL4_GOSRA  1.3e-120  72.05     Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_004G012900 PE=4 ...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G012900.1  CsGy1G012900.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                    Source   Source Term       Source Description                               Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                                                coord: 285..424, e-value: 7.5E-29, score: 103.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                                                coord: 87..248, e-value: 2.0E-17, score: 65.2
IPR002885  Pentatricopeptide repeat                           PFAM     PF12854           PPR_1                                            coord: 373..401, e-value: 9.8E-7, score: 28.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                        coord: 377..407, e-value: 2.4E-4, score: 19.1
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                        coord: 342..375, e-value: 2.6E-4, score: 19.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                        coord: 190..217, e-value: 0.0025, score: 15.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                        coord: 272..305, e-value: 3.3E-4, score: 18.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                        coord: 307..340, e-value: 3.9E-7, score: 27.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13812           PPR_3                                            coord: 191..236, e-value: 1.4E-4, score: 21.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                                            coord: 304..352, e-value: 5.7E-11, score: 42.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                                              coord: 272..296, e-value: 0.025, score: 14.7
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                              coord: 269..299, score: 7.092
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                              coord: 187..217, score: 7.761
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                              coord: 375..409, score: 9.756
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                              coord: 305..339, score: 11.827
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                              coord: 340..374, score: 9.887
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED                                 coord: 73..421
None       No IPR available                                   PANTHER  PTHR24015:SF1017  PENTATRICOPEPTIDE PPR REPEAT-CONTAINING PROTEIN  coord: 73..421
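
The PROSITE PS51375 hits in the table above are listed out of positional order. A minimal sketch of how they might be sorted and summarized follows; the function name `summarize` is hypothetical, and the code assumes the coordinates are 1-based, inclusive residue positions on the predicted protein:

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table above.
# Assumption: 1-based, inclusive positions on the protein sequence.
PPR_HITS = [(269, 299), (187, 217), (375, 409), (305, 339), (340, 374)]


def summarize(hits):
    """Sort hits by start and return (ordered hits, hit lengths, gaps between neighbours)."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    # Gap = number of residues between the end of one hit and the start of the next.
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return ordered, lengths, gaps


if __name__ == "__main__":
    ordered, lengths, gaps = summarize(PPR_HITS)
    for (start, end), length in zip(ordered, lengths):
        print(f"PPR hit {start}..{end} ({length} residues)")
    print("gaps between consecutive hits:", gaps)
```

A gap of 0 means two hits abut each other directly, which is how back-to-back repeats would show up in this summary.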

The following gene(s) are paralogous to this gene:

None