CsGy7G004170 (gene) Cucumber (Gy14) v2

Name: CsGy7G004170
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr7 : 3111995 .. 3114379 (+)
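
The location field can be turned into a feature length with a few lines of Python. This is only a sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr7 : 3111995 .. 3114379 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: 1-based, inclusive coordinates, so both ends are counted.
        "length": end - start + 1,
    }

loc = parse_location("Chr7 : 3111995 .. 3114379 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 2385 bp on the plus strand of Chr7.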
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGAAGCTCCATTGCTCTCAGTCATGGCTTTTTTGCTCCAATTTCAAGTTACTTCGGGCTCTATTCTACTCAACAAAATCCCTACCTTCTCCAAGTACCGAGGATACCCTCTTCCGAAGGGTTTATCGGGCAGGTGATCCTCGAACCTCGATTGTTCGCGTGTTGGACCAATGGGTCGAAGAAGGCCGACAAGTCAAACAGTCCGATCTCCAAACGCTTATCAAGCAGCTCAGGAAGTTTGGTCGCTTCAACCAAGCTCTGCAGGTACCCTTGTTCCGAAATCGCATTCCTTCTTACCACTTGTTGCCTTGTACTATACTTGAAAATTTTCTCATTCTGGTCTGGCCCTTTCATATGATAATGAATGTTTAGTTCGTTGAAGAGGAAATTGGGAGGCGTCTAAAAATTCATTTGTTGGTATCAACTTTAGCTTACCTTGTATCTATTAAAGCATATACCTTTGGAGCTTAAAGGTTCCAATCCTCTCGCCTCACATGTTATTGAACAAAAAAAAAAACTGCCTATTGGTGTGAAATTTCCGACTGACCCAAAAGCTTAAACTAATGAGTTATGGTAAATATAATTATTATCAATACTCTAACTCTCCTCTTCACTTGTAAGTTTGAAAATTTGAAGAAAACCTACACAACAAAAATCAATATTTTAACATGAGTTTGAACATGAAACCTCATGTACTGATAACATTCATTAATCTAATAAGCTTGGACTGATGGGTTATTTGCTAGTGATATTGACACTTCAACAAACAACTTGTATCTCTTTTATCCCAACAATAGGAGCTCCTTTTCATTTTAAAAAAAATGGTATGCATCTTTATTGAGGAAGGAGACTGTTCTTTGTAACCCATCAGAACAGTAGAACGTTTATGTTCTGTCAGCTTTTAGTACAGACCATTTGGTAAGTTTAAACTTCCCCAGCTTGGGCTTTCTTTACCCACAATTAAGGGCAACGTTGTATTTTCAGTCACTATATTAATGATCTTACCTAGTAAGGTGGACTAGTTGTGGCAGTCCTCTTTCCCTTTTTCTTTGTCTTCTCTCAAACTGAAGCGTTGAACTTAGATCATCTATGAATGGTAAACCATAAATATGTCTTCGTTATCCACAATTTGGCATTATTGATTACTGTAATAGTGTAATGCAACAGTTGTGCGAATGGGTACGTAATGAAAGGAACCAATGCCTGTCGACTGGGGACATTGCTGTTGAGTTGCACTTAATTTCAAAAGCTCGTGGTTTGGAACAAGCTGAGAAGTATTTTAGCAGCATTGGGGAATCTTCAAGAGATCATAAGGTTTATGGAGCACTTCTACACTGTTATGTAGAGAATAAAAATTTGAAAAAGGCAGAGGCAATCATGCAGAAAATGAGGGAAGTAGGATTTATGAAAACACCACTTTCTTATAATGCTATGTTAAACCTTTATGCTCATCTCGGTAAACATGAGAAACTTGCTGAATTATTGAAAGAAATGGAAGAAATGGGAATTGGTCCGGATAGATTTACATATAATATTCGTATGAATGCTTATGCAGCTGCTTCCGATATAACAAACATGGAAAAGCTTTTGTCAAAAATGGAGGCAGATCCACTAGTTGCTACGGATTGGCATACTTATTTTGTCGTAGGAAATGGATATTTCAAAGCTGGTCTTTCTGAAAATAGTATATTGATGCTGAAGAAAGCAGAACAATTCATTGGTGACAAGCAAAAATGGCTTGCATACCAATATCTCATGACACTATATGCTGCTATTGGAAATAAGGATGAGGTGTATCGGGTTTGGAACTTGTACACGAATCTGCGAAAGAGATTCAATTCCGGATATCTTTGTATAATAAGTTCATTAATGAAACTGGACGATATCGATGGTGCTGAAAGAATCTTGAAGGAATGGGAATCAGGGGATACATCTTTTGATTTCAGAATCCCAAACATGATGATAAATAGTTATTGTATGAAGGGATTTGTGGATAA
GGCCGAAGCATATATAAACAGGCTTATAGAGACTGGCAAGGAACCAGAAGCAAATACTTGGGATCTACTGGCAAGTGGATATCATTCTAATGGTTTGACGAATAAAGTAGCAGAAACTCTGAAGAAAGCAATCTCAGTTAGTCCACCTCATTGGAAGCCTAAGTATCATATCTTGGCCGCATGTCTTGAATATTTGAAAACAAATGAAAATGTGGACTTGGCAGAGGAAATCATAGGGCTCCTTTGCAAACGTGATATTTTTCCCTTAAACATTTGCAAGAGATTAGAAGATTATATCCGCAGTGAAAACCAAAACTCAATCAAGTGCCTTGATCTACTTGGCCTGAAAGGTCAGAATGAGGAACCCGATCAAGTGTTAGATTGA

mRNA sequence

ATGGTGAAGCTCCATTGCTCTCAGTCATGGCTTTTTTGCTCCAATTTCAAGTTACTTCGGGCTCTATTCTACTCAACAAAATCCCTACCTTCTCCAAGTACCGAGGATACCCTCTTCCGAAGGGTTTATCGGGCAGGTGATCCTCGAACCTCGATTGTTCGCGTGTTGGACCAATGGGTCGAAGAAGGCCGACAAGTCAAACAGTCCGATCTCCAAACGCTTATCAAGCAGCTCAGGAAGTTTGGTCGCTTCAACCAAGCTCTGCAGTTGTGCGAATGGGTACGTAATGAAAGGAACCAATGCCTGTCGACTGGGGACATTGCTGTTGAGTTGCACTTAATTTCAAAAGCTCGTGGTTTGGAACAAGCTGAGAAGTATTTTAGCAGCATTGGGGAATCTTCAAGAGATCATAAGGTTTATGGAGCACTTCTACACTGTTATGTAGAGAATAAAAATTTGAAAAAGGCAGAGGCAATCATGCAGAAAATGAGGGAAGTAGGATTTATGAAAACACCACTTTCTTATAATGCTATGTTAAACCTTTATGCTCATCTCGGTAAACATGAGAAACTTGCTGAATTATTGAAAGAAATGGAAGAAATGGGAATTGGTCCGGATAGATTTACATATAATATTCGTATGAATGCTTATGCAGCTGCTTCCGATATAACAAACATGGAAAAGCTTTTGTCAAAAATGGAGGCAGATCCACTAGTTGCTACGGATTGGCATACTTATTTTGTCGTAGGAAATGGATATTTCAAAGCTGGTCTTTCTGAAAATAGTATATTGATGCTGAAGAAAGCAGAACAATTCATTGGTGACAAGCAAAAATGGCTTGCATACCAATATCTCATGACACTATATGCTGCTATTGGAAATAAGGATGAGGTGTATCGGGTTTGGAACTTGTACACGAATCTGCGAAAGAGATTCAATTCCGGATATCTTTGTATAATAAGTTCATTAATGAAACTGGACGATATCGATGGTGCTGAAAGAATCTTGAAGGAATGGGAATCAGGGGATACATCTTTTGATTTCAGAATCCCAAACATGATGATAAATAGTTATTGTATGAAGGGATTTGTGGATAAGGCCGAAGCATATATAAACAGGCTTATAGAGACTGGCAAGGAACCAGAAGCAAATACTTGGGATCTACTGGCAAGTGGATATCATTCTAATGGTTTGACGAATAAAGTAGCAGAAACTCTGAAGAAAGCAATCTCAGTTAGTCCACCTCATTGGAAGCCTAAGTATCATATCTTGGCCGCATGTCTTGAATATTTGAAAACAAATGAAAATGTGGACTTGGCAGAGGAAATCATAGGGCTCCTTTGCAAACGTGATATTTTTCCCTTAAACATTTGCAAGAGATTAGAAGATTATATCCGCAGTGAAAACCAAAACTCAATCAAGTGCCTTGATCTACTTGGCCTGAAAGGTCAGAATGAGGAACCCGATCAAGTGTTAGATTGA

Coding sequence (CDS)

ATGGTGAAGCTCCATTGCTCTCAGTCATGGCTTTTTTGCTCCAATTTCAAGTTACTTCGGGCTCTATTCTACTCAACAAAATCCCTACCTTCTCCAAGTACCGAGGATACCCTCTTCCGAAGGGTTTATCGGGCAGGTGATCCTCGAACCTCGATTGTTCGCGTGTTGGACCAATGGGTCGAAGAAGGCCGACAAGTCAAACAGTCCGATCTCCAAACGCTTATCAAGCAGCTCAGGAAGTTTGGTCGCTTCAACCAAGCTCTGCAGTTGTGCGAATGGGTACGTAATGAAAGGAACCAATGCCTGTCGACTGGGGACATTGCTGTTGAGTTGCACTTAATTTCAAAAGCTCGTGGTTTGGAACAAGCTGAGAAGTATTTTAGCAGCATTGGGGAATCTTCAAGAGATCATAAGGTTTATGGAGCACTTCTACACTGTTATGTAGAGAATAAAAATTTGAAAAAGGCAGAGGCAATCATGCAGAAAATGAGGGAAGTAGGATTTATGAAAACACCACTTTCTTATAATGCTATGTTAAACCTTTATGCTCATCTCGGTAAACATGAGAAACTTGCTGAATTATTGAAAGAAATGGAAGAAATGGGAATTGGTCCGGATAGATTTACATATAATATTCGTATGAATGCTTATGCAGCTGCTTCCGATATAACAAACATGGAAAAGCTTTTGTCAAAAATGGAGGCAGATCCACTAGTTGCTACGGATTGGCATACTTATTTTGTCGTAGGAAATGGATATTTCAAAGCTGGTCTTTCTGAAAATAGTATATTGATGCTGAAGAAAGCAGAACAATTCATTGGTGACAAGCAAAAATGGCTTGCATACCAATATCTCATGACACTATATGCTGCTATTGGAAATAAGGATGAGGTGTATCGGGTTTGGAACTTGTACACGAATCTGCGAAAGAGATTCAATTCCGGATATCTTTGTATAATAAGTTCATTAATGAAACTGGACGATATCGATGGTGCTGAAAGAATCTTGAAGGAATGGGAATCAGGGGATACATCTTTTGATTTCAGAATCCCAAACATGATGATAAATAGTTATTGTATGAAGGGATTTGTGGATAAGGCCGAAGCATATATAAACAGGCTTATAGAGACTGGCAAGGAACCAGAAGCAAATACTTGGGATCTACTGGCAAGTGGATATCATTCTAATGGTTTGACGAATAAAGTAGCAGAAACTCTGAAGAAAGCAATCTCAGTTAGTCCACCTCATTGGAAGCCTAAGTATCATATCTTGGCCGCATGTCTTGAATATTTGAAAACAAATGAAAATGTGGACTTGGCAGAGGAAATCATAGGGCTCCTTTGCAAACGTGATATTTTTCCCTTAAACATTTGCAAGAGATTAGAAGATTATATCCGCAGTGAAAACCAAAACTCAATCAAGTGCCTTGATCTACTTGGCCTGAAAGGTCAGAATGAGGAACCCGATCAAGTGTTAGATTGA

Protein sequence

MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLSYNAMLNLYAHLGKHEKLAELLKEMEEMGIGPDRFTYNIRMNAYAAASDITNMEKLLSKMEADPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPKYHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLGLKGQNEEPDQVLD
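
As a quick consistency check between the coding sequence and the protein sequence above, the CDS can be translated with the standard genetic code. The sketch below is illustrative only: the helper `translate` is hypothetical, the standard nuclear code is an assumption, and just the first six and last four codons of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGAAGCTCCATTGC"  # first 18 nt of the CDS above
cds_end = "GTGTTAGATTGA"          # last 12 nt of the CDS above, including the stop
print(translate(cds_start))  # MVKLHC
print(translate(cds_end))    # VLD
```

Both fragments agree with the start (MVKLHC...) and end (...VLD) of the listed protein sequence.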
BLAST of CsGy7G004170 vs. NCBI nr
Match: KGN43610.1 (hypothetical protein Csa_7G047450 [Cucumis sativus])

HSP 1 Score: 849.0 bits (2192), Expect = 8.2e-243
Identity = 487/493 (98.78%), Positives = 490/493 (99.39%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60
           MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV
Sbjct: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60

Query: 61  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120
           EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL
Sbjct: 61  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120

Query: 121 EQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180
           EQAE+YFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX
Sbjct: 121 EQAEEYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240

Query: 241 TDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300
           TDWHTYFVVGNGYFKAGLSENSI MLKKAEQ IGDKQKWLAYQYLMTLYAAIGNKDEVYR
Sbjct: 241 TDWHTYFVVGNGYFKAGLSENSISMLKKAEQLIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300

Query: 301 VWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCM 360
           VWNLYTNL+KRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDF+IPNMMINSYC 
Sbjct: 301 VWNLYTNLQKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFKIPNMMINSYCT 360

Query: 361 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420
           KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK
Sbjct: 361 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420

Query: 421 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480
           YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG
Sbjct: 421 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480

Query: 481 LKGQNEEPDQVLD 494
           LKGQNEEPDQVLD
Sbjct: 481 LKGQNEEPDQVLD 493
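
The percentages in an HSP summary line are the match counts divided by the alignment length. The following sketch re-derives them from the first hit's summary line. The function name `parse_hsp_stats` and the regular expression are assumptions made for illustration and are not part of any BLAST tooling.

```python
import re

# Matches fields of the form "Label = hits/length (pct%)".
FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return the reported and recomputed percentage for each count field."""
    stats = {}
    for label, hits, length, pct in FIELD.findall(line):
        hits, length = int(hits), int(length)
        stats[label] = {
            "hits": hits,
            "length": length,
            "reported": float(pct),
            "computed": round(100 * hits / length, 2),
        }
    return stats

line = "Identity = 487/493 (98.78%), Positives = 490/493 (99.39%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, values in stats.items():
    print(label, values["computed"], values["reported"])
```

For this hit the recomputed values (98.78 and 99.39) match the reported ones. The same check can be run on the other summary lines in this report.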

BLAST of CsGy7G004170 vs. NCBI nr
Match: XP_011659707.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At1g74580 [Cucumis sativus])

HSP 1 Score: 823.5 bits (2126), Expect = 3.7e-235
Identity = 473/493 (95.94%), Positives = 479/493 (97.16%), Query Frame = 0

Query: 1    MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60
            MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV
Sbjct: 530  MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 589

Query: 61   EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120
            EEGRQV QSDLQ LIKQLR FGRFN ALQLCEW RNERN+C S G IA++LHLISKARGL
Sbjct: 590  EEGRQVNQSDLQKLIKQLRTFGRFNHALQLCEWERNERNKCPSPGHIAIQLHLISKARGL 649

Query: 121  EQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180
            EQAE+YFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX
Sbjct: 650  EQAEEYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 709

Query: 181  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX DPLVA
Sbjct: 710  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXADPLVA 769

Query: 241  TDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300
            TDWH YF VGNGYFKAGLSENSI MLKKAEQ IGDKQKWLAYQYLMTLYAAIGNKDEVYR
Sbjct: 770  TDWHIYFTVGNGYFKAGLSENSISMLKKAEQLIGDKQKWLAYQYLMTLYAAIGNKDEVYR 829

Query: 301  VWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCM 360
            VWNLYTNL+KRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDF+IPNMMINSYC 
Sbjct: 830  VWNLYTNLQKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFKIPNMMINSYCT 889

Query: 361  KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420
            KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK
Sbjct: 890  KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 949

Query: 421  YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480
            YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG
Sbjct: 950  YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 1009

Query: 481  LKGQNEEPDQVLD 494
            LKGQNEEPDQVLD
Sbjct: 1010 LKGQNEEPDQVLD 1022

BLAST of CsGy7G004170 vs. NCBI nr
Match: KGN43609.1 (hypothetical protein Csa_7G047440 [Cucumis sativus])

HSP 1 Score: 777.7 bits (2007), Expect = 2.3e-221
Identity = 452/486 (93.00%), Positives = 460/486 (94.65%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60
           MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV
Sbjct: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60

Query: 61  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120
           EEGRQV QSDLQ LIKQLR FGRFN ALQLCEW RNERN+C S G IA++LHLISKARGL
Sbjct: 61  EEGRQVNQSDLQKLIKQLRTFGRFNHALQLCEWERNERNKCPSPGHIAIQLHLISKARGL 120

Query: 121 EQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180
           EQAE+YFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX
Sbjct: 121 EQAEEYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX DPLVA
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXADPLVA 240

Query: 241 TDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300
           TDWH YF VGNGYFKAGLSENSI MLKKAEQ IGDKQKWLAYQYLMTLYAAIGNKDEVYR
Sbjct: 241 TDWHIYFTVGNGYFKAGLSENSISMLKKAEQLIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300

Query: 301 VWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCM 360
           VWNLYTNL+KRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDF+IPNMMINSYC 
Sbjct: 301 VWNLYTNLQKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFKIPNMMINSYCT 360

Query: 361 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420
           KGFVDKAEAYI+RLIE GKEP A  WD LASGYHSNGLTNK AETLKKAISVSPP WKP 
Sbjct: 361 KGFVDKAEAYISRLIENGKEPRAYAWDRLASGYHSNGLTNKAAETLKKAISVSPPRWKPN 420

Query: 421 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480
           Y ILAACLEYLKTN NV+LAEEIIGLLCKRDIFPLNICKRLEDYI SENQNSIKCLDLLG
Sbjct: 421 YDILAACLEYLKTNGNVELAEEIIGLLCKRDIFPLNICKRLEDYIHSENQNSIKCLDLLG 480

Query: 481 LKGQNE 487
           LK QNE
Sbjct: 481 LKDQNE 486

BLAST of CsGy7G004170 vs. NCBI nr
Match: XP_016901673.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 718.4 bits (1853), Expect = 1.7e-203
Identity = 424/494 (85.83%), Positives = 447/494 (90.49%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSP-STEDTLFRRVYRAGDPRTSIVRVLDQW 60
           M+KLHCSQSWLF SNFK+L+ALFYSTKSLPS  STEDTLFRRV+RAGDPR SIVRVLDQW
Sbjct: 40  MMKLHCSQSWLFSSNFKVLQALFYSTKSLPSSRSTEDTLFRRVFRAGDPRISIVRVLDQW 99

Query: 61  VEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARG 120
           +EEGR+V QSD+Q LIKQLRKFGRFN ALQLCEW+ NERN+  S GDIAV+LHLISKARG
Sbjct: 100 IEEGRKVNQSDIQALIKQLRKFGRFNHALQLCEWIHNERNKNPSPGDIAVQLHLISKARG 159

Query: 121 LEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXX 180
           LEQAEKYFSSI ESSRDHKVYGALL+CYVENKNL+KAEAIMQKMREVGFMKTPL XXXXX
Sbjct: 160 LEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSXXXXX 219

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD LV
Sbjct: 220 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSLV 279

Query: 241 ATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVY 300
           A DWH YF VGNGY KAG SEN ILMLKKAEQ IGDKQKW AY+YL+TLY AIGNKDEVY
Sbjct: 280 AMDWHAYFTVGNGYLKAGFSENGILMLKKAEQLIGDKQKWSAYEYLITLYGAIGNKDEVY 339

Query: 301 RVWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYC 360
           RVWNLY+NL KRFNSGYLC+I+SLMKLDDIDGAERILKEWESGDT FDFRIPNMMINSYC
Sbjct: 340 RVWNLYSNLEKRFNSGYLCMINSLMKLDDIDGAERILKEWESGDTCFDFRIPNMMINSYC 399

Query: 361 MKGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKP 420
            KGF+DKAEAYI+RLIE GKEP A  WD L SGYHSNGLTNK AET+KKAISVSPP WKP
Sbjct: 400 TKGFMDKAEAYISRLIENGKEPRAFAWDRLVSGYHSNGLTNKAAETMKKAISVSPPRWKP 459

Query: 421 KYHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLL 480
             HI+AACLEYLKTN NV+LAEEIIGLLCK DIFP NIC RLEDYI SENQ SIKCLDLL
Sbjct: 460 NNHIVAACLEYLKTNGNVELAEEIIGLLCKGDIFPSNICNRLEDYIHSENQTSIKCLDLL 519

Query: 481 GLKGQNEEPDQVLD 494
            LKGQ+E  D  LD
Sbjct: 520 DLKGQSEGLDHELD 533

BLAST of CsGy7G004170 vs. NCBI nr
Match: XP_022147816.1 (pentatricopeptide repeat-containing protein At2g20710, mitochondrial [Momordica charantia])

HSP 1 Score: 604.0 bits (1556), Expect = 4.6e-169
Identity = 370/494 (74.90%), Positives = 412/494 (83.40%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSL-PSPSTEDTLFRRVYRAGDPRTSIVRVLDQW 60
           M+KLHCSQ W  C   K  RALFYSTK+L  SPS ED+L+RRV +AGDPR SI RVLDQW
Sbjct: 1   MMKLHCSQPWRGCCTSKAFRALFYSTKALTSSPSPEDSLYRRVSQAGDPRISIRRVLDQW 60

Query: 61  VEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARG 120
           VEEGR VK SDLQ LIKQLRKF RFN ALQLCEW+ NE N   S GDIA+ LHLISK  G
Sbjct: 61  VEEGRLVKISDLQKLIKQLRKFRRFNHALQLCEWISNEMNHDPSPGDIAIRLHLISKVYG 120

Query: 121 LEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXX 180
           LEQAEKYFSSI ESSRD++VYGALL+CYVE+++L+KAE IMQKMRE+GFMKTPL  XXXX
Sbjct: 121 LEQAEKYFSSINESSRDYRVYGALLNCYVEDRDLEKAEEIMQKMRELGFMKTPLSFXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX D L+
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXADRLI 240

Query: 241 ATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVY 300
             DWH Y+VV NGYFKAGLSE SI+MLK++EQ IGDKQKW AY+ L+TLYAAIGNK EVY
Sbjct: 241 TMDWHAYYVVANGYFKAGLSEKSIMMLKRSEQLIGDKQKWFAYECLITLYAAIGNKAEVY 300

Query: 301 RVWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYC 360
           RVWNLYTNL++R+N+ YLCIISSLMKLDDI+GAE+ILKEWESGDT FDF+IPNMMIN YC
Sbjct: 301 RVWNLYTNLKRRYNTAYLCIISSLMKLDDIEGAEKILKEWESGDTCFDFKIPNMMINIYC 360

Query: 361 MKGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKP 420
            KG VDKAEAYI+RL+E+GKEP+ANTWD LA+GYH+NG T K  ET+KKAIS S P WKP
Sbjct: 361 RKGLVDKAEAYISRLMESGKEPQANTWDRLATGYHANGQTMKAVETIKKAISASQPGWKP 420

Query: 421 KYHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLL 480
             H LAACLE+LKTNENV++AEEII LL K DI  + IC  L DY+ SE Q S   LD L
Sbjct: 421 NDHTLAACLEFLKTNENVEVAEEIIRLLRKHDIVSIRICDGLVDYVHSEIQTS-SALDQL 480

Query: 481 GLKGQNEEPDQVLD 494
           GL GQ E  +   D
Sbjct: 481 GLDGQIERHNHASD 493

BLAST of CsGy7G004170 vs. TAIR10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 328.9 bits (842), Expect = 5.2e-90
Identity = 169/438 (38.58%), Positives = 255/438 (58.22%), Query Frame = 0

Query: 15  NFKLLRA-LFYSTKSLPSP-STEDTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQ 74
           N+ L R+ LF+S K+ PSP    DTL RRV R+GDP  SI++VLD W+++G  VK S+L 
Sbjct: 15  NYILRRSFLFHSGKTTPSPLDPYDTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELH 74

Query: 75  TLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGLEQAEKYFSSIGE 134
           ++IK LRKF RF+ ALQ+ +W+   R   +S GD+A+ L LI+K  GL +AEK+F +I  
Sbjct: 75  SIIKMLRKFSRFSHALQISDWMSEHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPM 134

Query: 135 SSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXX 194
             R++ +YGALL+CY   K L KAE + Q+M+E+GF+K  L                   
Sbjct: 135 ERRNYHLYGALLNCYASKKVLHKAEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVE 194

Query: 195 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNG 254
                                                      D  +  DW TY    NG
Sbjct: 195 KLLREMEDETVKPDIFTVNTRLHAYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANG 254

Query: 255 YFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRF 314
           Y KAGL+E ++ ML+K+EQ +  +++  AY+ LM+ Y A G K+EVYR+W+LY  L   +
Sbjct: 255 YIKAGLTEKALEMLRKSEQMVNAQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFY 314

Query: 315 NSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYIN 374
           N+GY+ +IS+L+K+DDI+  E+I++EWE+G + FD RIP+++I  YC KG ++KAE  +N
Sbjct: 315 NTGYISVISALLKMDDIEEVEKIMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVN 374

Query: 375 RLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPKYHILAACLEYLK 434
            L++  +  + +TW+ LA GY   G   K  E  K+AI VS P W+P   +L +C++YL+
Sbjct: 375 ILVQKWRVEDTSTWERLALGYKMAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLE 434

Query: 435 TNENVDLAEEIIGLLCKR 451
              +++   +I+ LL +R
Sbjct: 435 GQRDMEGLRKILRLLSER 452

BLAST of CsGy7G004170 vs. TAIR10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 216.5 bits (550), Expect = 3.8e-56
Identity = 175/415 (42.17%), Positives = 270/415 (65.06%), Query Frame = 0

Query: 36  DTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVR 95
           + +++++     P      VL+QW + GR++ + +L  ++K+LRK+ R NQAL++ +W+ 
Sbjct: 67  NAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMN 126

Query: 96  NERNQC-LSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLK 155
           N   +  LS  D A++L LI K RG+  AE++F  + E+ +D +VYG+LL+ YV  K+ +
Sbjct: 127 NRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSRE 186

Query: 156 KAEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 215
           KAEA++  MR+ G+   PL    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 187 KAEALLNTMRDKGYALHPLPFNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 246

Query: 216 XXXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIG 275
           XXXXXXXXXXXXXXXXXXXX D  +  +W T+  +   Y K G +E +   L+K E  I 
Sbjct: 247 XXXXXXXXXXXXXXXXXXXXSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARIT 306

Query: 276 DKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRF-NSGYLCIISSLMKLDDIDGAE 335
            + + + Y YL++LY ++GNK E+YRVW++Y ++     N GY  ++SSL+++ DI+GAE
Sbjct: 307 GRNR-IPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAE 366

Query: 336 RILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLIETGKEPEANTWDLLASGY 395
           ++ +EW    +S+D RIPN+++N+Y     ++ AE   + ++E G +P ++TW++LA G+
Sbjct: 367 KVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGH 426

Query: 396 HSNGLTNKVAETLKKAISV-SPPHWKPKYHILAACLEYLKTNENVDLAEEIIGLL 448
                 ++    L+ A S     +W+PK  +L+   +  +   +V   E ++ LL
Sbjct: 427 TRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELL 480

BLAST of CsGy7G004170 vs. TAIR10
Match: AT5G27460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 203.0 bits (515), Expect = 4.3e-52
Identity = 116/407 (28.50%), Positives = 202/407 (49.63%), Query Frame = 0

Query: 40  RRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERN 99
           + + R   PR S+  +L + ++ G  V  S+L+ + K+L +  R++ ALQ+ EW+ N+++
Sbjct: 42  KEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWMENQKD 101

Query: 100 QCLSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKV----YGALLHCYVENKNLKK 159
              S  DIA+ L LI K  GL+Q E+YF  +  SS   +V    Y  LL  YV+NK +K+
Sbjct: 102 IEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRAYVKNKMVKE 161

Query: 160 AEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           AEA+M+K+  +GF+ TP                                           
Sbjct: 162 AEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYNLWMN 221

Query: 220 XXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGD 279
                               D  V   W +   + N Y K+G  E + L+L+ AE+ + +
Sbjct: 222 ACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDAEKML-N 281

Query: 280 KQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRFNS-GYLCIISSLMKLDDIDGAER 339
           +   L Y +L+TLYA++GNK+ V R+W +  ++  R +   Y+C++SSL+K  D++ AER
Sbjct: 282 RSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEEAER 341

Query: 340 ILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLIETGKEPEANTWDLLASGYH 399
           +  EWE+   ++D R+ N+++ +Y   G + KAE+    ++E G  P   TW++L  G+ 
Sbjct: 342 VFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYKTWEILMEGWV 401

Query: 400 SNGLTNKVAETLKKA-ISVSPPHWKPKYHILAACLEYLKTNENVDLA 441
                 K  + + +  + +   HW+P ++I+ A  EY +  E ++ A
Sbjct: 402 KCENMEKAIDAMHQVFVLMRRCHWRPSHNIVMAIAEYFEKEEKIEEA 447

BLAST of CsGy7G004170 vs. TAIR10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 200.7 bits (509), Expect = 2.1e-51
Identity = 120/434 (27.65%), Positives = 208/434 (47.93%), Query Frame = 0

Query: 18  LLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQ 77
           L+ + +Y T  +     + TL+ ++   GDP++S+   L  WV+ G++V  ++L  ++  
Sbjct: 11  LIASRYYYTNRV----KKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHD 70

Query: 78  LRKFGRFNQALQLCEWVRNERNQCL-STGDIAVELHLISKARGLEQAEKYFSSIGESSRD 137
           LR+  RF  AL++ +W+ NE   C+ S  + AV L LI +  G   AE+YF ++ E  ++
Sbjct: 71  LRRRKRFLHALEVSKWM-NETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKN 130

Query: 138 HKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXX 197
            K YGALL+CYV  +N++K+    +KM+E+GF+ + L                       
Sbjct: 131 DKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLE 190

Query: 198 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKA 257
                                                     +  DW+TY V    Y   
Sbjct: 191 EMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDG 250

Query: 258 GLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNL-RKRFNSG 317
           G  + ++ +LK +E  + +K+    Y +L+TLYA +G K EV R+W+L  ++ ++R N  
Sbjct: 251 GDCDRAVELLKMSENRL-EKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQD 310

Query: 318 YLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLI 377
           YL ++ SL+K+D +  AE +L EW+S    +DFR+PN +I  Y  K   +KAEA +  L 
Sbjct: 311 YLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLA 370

Query: 378 ETGKEPEANTWDLLASGYHSNGLTNKVAETLKKA--ISVSPPHWKPKYHILAACLEYLKT 437
             GK     +W+L+A+ Y   G      + +K A  + V    W+P   ++ + L ++  
Sbjct: 371 RRGKATTPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGD 430

Query: 438 NENVDLAEEIIGLL 448
             ++   E  +  L
Sbjct: 431 EGSLKEVESFVASL 438

BLAST of CsGy7G004170 vs. TAIR10
Match: AT1G28020.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 183.3 bits (464), Expect = 3.5e-46
Identity = 113/334 (33.83%), Positives = 161/334 (48.20%), Query Frame = 0

Query: 41  RVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQ 100
           R+  A      I+ VL+QW ++G QV  S ++ +IK+LR   +  QALQ+ EW+  E+  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 101 CLSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKVYGALLHCYV-ENKNLKKAEAI 160
            L   D A  LHLI    GLE+AEK+F SI +++R   VY +LL+ Y   +K L KAEA 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 161 MQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220
            QKMR++G +  P+                                              
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 221 XXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKW 280
                              +  +WHT   +   Y +A  S  ++ ML+  EQ +  K   
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 281 LAYQYLMTLYAAIGNKDEVYRVWNLY-TNLRKRFNSGYLCIISSLMKLDDIDGAERILKE 340
            AY +LM LY   GN++EV RVW LY + + +R N+GY  +I SL+K+DDI GAE I K 
Sbjct: 281 SAYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYKV 340

Query: 341 WESGDTSFDFRIPNMMINSYCMKGFVDKAEAYIN 373
           WES    FD RIP M+ + Y  +G  +KAE  +N
Sbjct: 341 WESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMN 374

BLAST of CsGy7G004170 vs. Swiss-Prot
Match: sp|Q9SKU6|PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 9.4e-89
Identity = 169/438 (38.58%), Positives = 255/438 (58.22%), Query Frame = 0

Query: 15  NFKLLRA-LFYSTKSLPSP-STEDTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQ 74
           N+ L R+ LF+S K+ PSP    DTL RRV R+GDP  SI++VLD W+++G  VK S+L 
Sbjct: 15  NYILRRSFLFHSGKTTPSPLDPYDTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELH 74

Query: 75  TLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGLEQAEKYFSSIGE 134
           ++IK LRKF RF+ ALQ+ +W+   R   +S GD+A+ L LI+K  GL +AEK+F +I  
Sbjct: 75  SIIKMLRKFSRFSHALQISDWMSEHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPM 134

Query: 135 SSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXX 194
             R++ +YGALL+CY   K L KAE + Q+M+E+GF+K  L                   
Sbjct: 135 ERRNYHLYGALLNCYASKKVLHKAEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVE 194

Query: 195 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNG 254
                                                      D  +  DW TY    NG
Sbjct: 195 KLLREMEDETVKPDIFTVNTRLHAYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANG 254

Query: 255 YFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRF 314
           Y KAGL+E ++ ML+K+EQ +  +++  AY+ LM+ Y A G K+EVYR+W+LY  L   +
Sbjct: 255 YIKAGLTEKALEMLRKSEQMVNAQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFY 314

Query: 315 NSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYIN 374
           N+GY+ +IS+L+K+DDI+  E+I++EWE+G + FD RIP+++I  YC KG ++KAE  +N
Sbjct: 315 NTGYISVISALLKMDDIEEVEKIMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVN 374

Query: 375 RLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPKYHILAACLEYLK 434
            L++  +  + +TW+ LA GY   G   K  E  K+AI VS P W+P   +L +C++YL+
Sbjct: 375 ILVQKWRVEDTSTWERLALGYKMAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLE 434

Query: 435 TNENVDLAEEIIGLLCKR 451
              +++   +I+ LL +R
Sbjct: 435 GQRDMEGLRKILRLLSER 452

BLAST of CsGy7G004170 vs. Swiss-Prot
Match: sp|Q8LPS6|PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 216.5 bits (550), Expect = 6.8e-55
Identity = 175/415 (42.17%), Positives = 270/415 (65.06%), Query Frame = 0

Query: 36  DTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVR 95
           + +++++     P      VL+QW + GR++ + +L  ++K+LRK+ R NQAL++ +W+ 
Sbjct: 67  NAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMN 126

Query: 96  NERNQC-LSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLK 155
           N   +  LS  D A++L LI K RG+  AE++F  + E+ +D +VYG+LL+ YV  K+ +
Sbjct: 127 NRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSRE 186

Query: 156 KAEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 215
           KAEA++  MR+ G+   PL    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 187 KAEALLNTMRDKGYALHPLPFNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 246

Query: 216 XXXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIG 275
           XXXXXXXXXXXXXXXXXXXX D  +  +W T+  +   Y K G +E +   L+K E  I 
Sbjct: 247 XXXXXXXXXXXXXXXXXXXXSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARIT 306

Query: 276 DKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRF-NSGYLCIISSLMKLDDIDGAE 335
            + + + Y YL++LY ++GNK E+YRVW++Y ++     N GY  ++SSL+++ DI+GAE
Sbjct: 307 GRNR-IPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAE 366

Query: 336 RILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLIETGKEPEANTWDLLASGY 395
           ++ +EW    +S+D RIPN+++N+Y     ++ AE   + ++E G +P ++TW++LA G+
Sbjct: 367 KVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGH 426

Query: 396 HSNGLTNKVAETLKKAISV-SPPHWKPKYHILAACLEYLKTNENVDLAEEIIGLL 448
                 ++    L+ A S     +W+PK  +L+   +  +   +V   E ++ LL
Sbjct: 427 TRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELL 480

BLAST of CsGy7G004170 vs. Swiss-Prot
Match: sp|Q3E911|PP400_ARATH (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 7.8e-51
Identity = 116/407 (28.50%), Positives = 202/407 (49.63%), Query Frame = 0

Query: 40  RRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERN 99
           + + R   PR S+  +L + ++ G  V  S+L+ + K+L +  R++ ALQ+ EW+ N+++
Sbjct: 42  KEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWMENQKD 101

Query: 100 QCLSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKV----YGALLHCYVENKNLKK 159
              S  DIA+ L LI K  GL+Q E+YF  +  SS   +V    Y  LL  YV+NK +K+
Sbjct: 102 IEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRAYVKNKMVKE 161

Query: 160 AEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 219
           AEA+M+K+  +GF+ TP                                           
Sbjct: 162 AEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYNLWMN 221

Query: 220 XXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGD 279
                               D  V   W +   + N Y K+G  E + L+L+ AE+ + +
Sbjct: 222 ACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDAEKML-N 281

Query: 280 KQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNLRKRFNS-GYLCIISSLMKLDDIDGAER 339
           +   L Y +L+TLYA++GNK+ V R+W +  ++  R +   Y+C++SSL+K  D++ AER
Sbjct: 282 RSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEEAER 341

Query: 340 ILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLIETGKEPEANTWDLLASGYH 399
           +  EWE+   ++D R+ N+++ +Y   G + KAE+    ++E G  P   TW++L  G+ 
Sbjct: 342 VFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYKTWEILMEGWV 401

Query: 400 SNGLTNKVAETLKKA-ISVSPPHWKPKYHILAACLEYLKTNENVDLA 441
                 K  + + +  + +   HW+P ++I+ A  EY +  E ++ A
Sbjct: 402 KCENMEKAIDAMHQVFVLMRRCHWRPSHNIVMAIAEYFEKEEKIEEA 447

BLAST of CsGy7G004170 vs. Swiss-Prot
Match: sp|Q84JR3|PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 3.8e-50
Identity = 120/434 (27.65%), Positives = 208/434 (47.93%), Query Frame = 0

Query: 18  LLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQ 77
           L+ + +Y T  +     + TL+ ++   GDP++S+   L  WV+ G++V  ++L  ++  
Sbjct: 11  LIASRYYYTNRV----KKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHD 70

Query: 78  LRKFGRFNQALQLCEWVRNERNQCL-STGDIAVELHLISKARGLEQAEKYFSSIGESSRD 137
           LR+  RF  AL++ +W+ NE   C+ S  + AV L LI +  G   AE+YF ++ E  ++
Sbjct: 71  LRRRKRFLHALEVSKWM-NETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKN 130

Query: 138 HKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXX 197
            K YGALL+CYV  +N++K+    +KM+E+GF+ + L                       
Sbjct: 131 DKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLE 190

Query: 198 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKA 257
                                                     +  DW+TY V    Y   
Sbjct: 191 EMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDG 250

Query: 258 GLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYRVWNLYTNL-RKRFNSG 317
           G  + ++ +LK +E  + +K+    Y +L+TLYA +G K EV R+W+L  ++ ++R N  
Sbjct: 251 GDCDRAVELLKMSENRL-EKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQD 310

Query: 318 YLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCMKGFVDKAEAYINRLI 377
           YL ++ SL+K+D +  AE +L EW+S    +DFR+PN +I  Y  K   +KAEA +  L 
Sbjct: 311 YLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLA 370

Query: 378 ETGKEPEANTWDLLASGYHSNGLTNKVAETLKKA--ISVSPPHWKPKYHILAACLEYLKT 437
             GK     +W+L+A+ Y   G      + +K A  + V    W+P   ++ + L ++  
Sbjct: 371 RRGKATTPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGD 430

Query: 438 NENVDLAEEIIGLL 448
             ++   E  +  L
Sbjct: 431 EGSLKEVESFVASL 438

BLAST of CsGy7G004170 vs. Swiss-Prot
Match: sp|Q9C7F1|PPR61_ARATH (Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis thaliana OX=3702 GN=At1g28020 PE=3 SV=2)

HSP 1 Score: 183.3 bits (464), Expect = 6.4e-45
Identity = 113/334 (33.83%), Positives = 161/334 (48.20%), Query Frame = 0

Query: 41  RVYRAGDPRTSIVRVLDQWVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQ 100
           R+  A      I+ VL+QW ++G QV  S ++ +IK+LR   +  QALQ+ EW+  E+  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 101 CLSTGDIAVELHLISKARGLEQAEKYFSSIGESSRDHKVYGALLHCYV-ENKNLKKAEAI 160
            L   D A  LHLI    GLE+AEK+F SI +++R   VY +LL+ Y   +K L KAEA 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 161 MQKMREVGFMKTPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 220
            QKMR++G +  P+                                              
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 221 XXXXXXXXXXXXXXXXDPLVATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKW 280
                              +  +WHT   +   Y +A  S  ++ ML+  EQ +  K   
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 281 LAYQYLMTLYAAIGNKDEVYRVWNLY-TNLRKRFNSGYLCIISSLMKLDDIDGAERILKE 340
            AY +LM LY   GN++EV RVW LY + + +R N+GY  +I SL+K+DDI GAE I K 
Sbjct: 281 SAYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYKV 340

Query: 341 WESGDTSFDFRIPNMMINSYCMKGFVDKAEAYIN 373
           WES    FD RIP M+ + Y  +G  +KAE  +N
Sbjct: 341 WESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMN 374

BLAST of CsGy7G004170 vs. TrEMBL
Match: tr|A0A0A0K1S9|A0A0A0K1S9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G047450 PE=4 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 5.4e-243
Identity = 487/493 (98.78%), Positives = 490/493 (99.39%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60
           MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV
Sbjct: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60

Query: 61  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120
           EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL
Sbjct: 61  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120

Query: 121 EQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180
           EQAE+YFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX
Sbjct: 121 EQAEEYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240

Query: 241 TDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300
           TDWHTYFVVGNGYFKAGLSENSI MLKKAEQ IGDKQKWLAYQYLMTLYAAIGNKDEVYR
Sbjct: 241 TDWHTYFVVGNGYFKAGLSENSISMLKKAEQLIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300

Query: 301 VWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCM 360
           VWNLYTNL+KRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDF+IPNMMINSYC 
Sbjct: 301 VWNLYTNLQKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFKIPNMMINSYCT 360

Query: 361 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420
           KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK
Sbjct: 361 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420

Query: 421 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480
           YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG
Sbjct: 421 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480

Query: 481 LKGQNEEPDQVLD 494
           LKGQNEEPDQVLD
Sbjct: 481 LKGQNEEPDQVLD 493

BLAST of CsGy7G004170 vs. TrEMBL
Match: tr|A0A0A0K3R7|A0A0A0K3R7_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G047440 PE=4 SV=1)

HSP 1 Score: 777.7 bits (2007), Expect = 1.5e-221
Identity = 452/486 (93.00%), Positives = 460/486 (94.65%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60
           MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV
Sbjct: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQWV 60

Query: 61  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 120
           EEGRQV QSDLQ LIKQLR FGRFN ALQLCEW RNERN+C S G IA++LHLISKARGL
Sbjct: 61  EEGRQVNQSDLQKLIKQLRTFGRFNHALQLCEWERNERNKCPSPGHIAIQLHLISKARGL 120

Query: 121 EQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180
           EQAE+YFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX
Sbjct: 121 EQAEEYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX DPLVA
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXADPLVA 240

Query: 241 TDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300
           TDWH YF VGNGYFKAGLSENSI MLKKAEQ IGDKQKWLAYQYLMTLYAAIGNKDEVYR
Sbjct: 241 TDWHIYFTVGNGYFKAGLSENSISMLKKAEQLIGDKQKWLAYQYLMTLYAAIGNKDEVYR 300

Query: 301 VWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCM 360
           VWNLYTNL+KRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDF+IPNMMINSYC 
Sbjct: 301 VWNLYTNLQKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFKIPNMMINSYCT 360

Query: 361 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 420
           KGFVDKAEAYI+RLIE GKEP A  WD LASGYHSNGLTNK AETLKKAISVSPP WKP 
Sbjct: 361 KGFVDKAEAYISRLIENGKEPRAYAWDRLASGYHSNGLTNKAAETLKKAISVSPPRWKPN 420

Query: 421 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLLG 480
           Y ILAACLEYLKTN NV+LAEEIIGLLCKRDIFPLNICKRLEDYI SENQNSIKCLDLLG
Sbjct: 421 YDILAACLEYLKTNGNVELAEEIIGLLCKRDIFPLNICKRLEDYIHSENQNSIKCLDLLG 480

Query: 481 LKGQNE 487
           LK QNE
Sbjct: 481 LKDQNE 486

BLAST of CsGy7G004170 vs. TrEMBL
Match: tr|A0A1S4E0B6|A0A1S4E0B6_CUCME (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103495690 PE=4 SV=1)

HSP 1 Score: 718.4 bits (1853), Expect = 1.1e-203
Identity = 424/494 (85.83%), Positives = 447/494 (90.49%), Query Frame = 0

Query: 1   MVKLHCSQSWLFCSNFKLLRALFYSTKSLPSP-STEDTLFRRVYRAGDPRTSIVRVLDQW 60
           M+KLHCSQSWLF SNFK+L+ALFYSTKSLPS  STEDTLFRRV+RAGDPR SIVRVLDQW
Sbjct: 40  MMKLHCSQSWLFSSNFKVLQALFYSTKSLPSSRSTEDTLFRRVFRAGDPRISIVRVLDQW 99

Query: 61  VEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARG 120
           +EEGR+V QSD+Q LIKQLRKFGRFN ALQLCEW+ NERN+  S GDIAV+LHLISKARG
Sbjct: 100 IEEGRKVNQSDIQALIKQLRKFGRFNHALQLCEWIHNERNKNPSPGDIAVQLHLISKARG 159

Query: 121 LEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXX 180
           LEQAEKYFSSI ESSRDHKVYGALL+CYVENKNL+KAEAIMQKMREVGFMKTPL XXXXX
Sbjct: 160 LEQAEKYFSSIRESSRDHKVYGALLNCYVENKNLEKAEAIMQKMREVGFMKTPLSXXXXX 219

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD LV
Sbjct: 220 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSLV 279

Query: 241 ATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVY 300
           A DWH YF VGNGY KAG SEN ILMLKKAEQ IGDKQKW AY+YL+TLY AIGNKDEVY
Sbjct: 280 AMDWHAYFTVGNGYLKAGFSENGILMLKKAEQLIGDKQKWSAYEYLITLYGAIGNKDEVY 339

Query: 301 RVWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYC 360
           RVWNLY+NL KRFNSGYLC+I+SLMKLDDIDGAERILKEWESGDT FDFRIPNMMINSYC
Sbjct: 340 RVWNLYSNLEKRFNSGYLCMINSLMKLDDIDGAERILKEWESGDTCFDFRIPNMMINSYC 399

Query: 361 MKGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKP 420
            KGF+DKAEAYI+RLIE GKEP A  WD L SGYHSNGLTNK AET+KKAISVSPP WKP
Sbjct: 400 TKGFMDKAEAYISRLIENGKEPRAFAWDRLVSGYHSNGLTNKAAETMKKAISVSPPRWKP 459

Query: 421 KYHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNSIKCLDLL 480
             HI+AACLEYLKTN NV+LAEEIIGLLCK DIFP NIC RLEDYI SENQ SIKCLDLL
Sbjct: 460 NNHIVAACLEYLKTNGNVELAEEIIGLLCKGDIFPSNICNRLEDYIHSENQTSIKCLDLL 519

Query: 481 GLKGQNEEPDQVLD 494
            LKGQ+E  D  LD
Sbjct: 520 DLKGQSEGLDHELD 533

BLAST of CsGy7G004170 vs. TrEMBL
Match: tr|A0A2P4JTJ4|A0A2P4JTJ4_QUESU (Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=58331 GN=CFP56_65317 PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 1.4e-121
Identity = 291/472 (61.65%), Positives = 364/472 (77.12%), Query Frame = 0

Query: 2   VKLHCSQSWLFCSNFKLLRALFYSTKSLPSPSTE-DTLFRRVYRAGDPRTSIVRVLDQWV 61
           ++L  S  W      ++L   FYST++L  PS   D+L+ RV RAGDP+ SI+RV+DQW+
Sbjct: 1   MRLLSSNPWRGYGISRVLGVFFYSTRTLARPSPPIDSLYSRVSRAGDPKVSIIRVIDQWL 60

Query: 62  EEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKARGL 121
           EEGRQV+QSD+  +IKQLRKF R++QALQ+ EWV ++R+  LS G+IA+ L LISK  GL
Sbjct: 61  EEGRQVQQSDILMMIKQLRKFRRYSQALQIFEWVSDQRHHDLSPGEIAIRLDLISKVHGL 120

Query: 122 EQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXXXX 181
           E+AEKYF SI  + R ++VYGALL+CY ENK+L KAEA MQKMRE+GF+KT LXXXXXXX
Sbjct: 121 EEAEKYFDSIPNTLRVYQVYGALLNCYAENKSLDKAEATMQKMRELGFLKTSLXXXXXXX 180

Query: 182 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPLVA 241
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX DPL+ 
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXADPLIT 240

Query: 242 TDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEVYR 301
            DW++Y V  NGY KAGL E ++ ML+KAEQ + +K +  AY+  +T Y AI NKD +Y 
Sbjct: 241 IDWNSYVVAANGYLKAGLHEKTLEMLQKAEQLVSNKTRKSAYEIFLTQYTAIQNKDGLYH 300

Query: 302 VWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSYCM 361
           +WNLY N+ + +NSGYLC+ISSL+KLDDIDGAE+IL+EWES +T FD RIP+++I++YC 
Sbjct: 301 IWNLYKNMGRFYNSGYLCMISSLLKLDDIDGAEKILEEWESENTFFDSRIPHLLISAYCR 360

Query: 362 KGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWKPK 421
           KG ++KAEAY+NRLIE GKE E+  W  LA+GYH +G   K  ET+KKAI  S P W P 
Sbjct: 361 KGLLEKAEAYMNRLIECGKETESTKWGRLATGYHVHGQMEKAVETMKKAILASQPGWMPS 420

Query: 422 YHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNS 473
              LAACL+YLK N   ++AEEI+ LL +   F   I +RL +YI SEN++S
Sbjct: 421 RFTLAACLDYLKGNGEAEVAEEILRLLREHVHFSTGIYERLLNYIHSENKDS 472

BLAST of CsGy7G004170 vs. TrEMBL
Match: tr|A0A2N9HF60|A0A2N9HF60_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS38096 PE=4 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 1.0e-116
Identity = 286/474 (60.34%), Positives = 354/474 (74.68%), Query Frame = 0

Query: 2   VKLHCSQSWLFCSNFKLLRALFY---STKSLPSPSTEDTLFRRVYRAGDPRTSIVRVLDQ 61
           +KL  S  W      ++L ALFY    T    SP   ++L+ R+ +AG+PR  I+RVLDQ
Sbjct: 1   MKLLGSNPWRGNGISRVLGALFYYSTGTGGRKSPPI-NSLYLRISQAGNPRVPIIRVLDQ 60

Query: 62  WVEEGRQVKQSDLQTLIKQLRKFGRFNQALQLCEWVRNERNQCLSTGDIAVELHLISKAR 121
           W+EEGR V+QS+L  +IKQLRK+ R++ ALQ+ EW+ ++RN  LS GDIA+ L LISK R
Sbjct: 61  WLEEGRHVQQSELLIIIKQLRKYRRYSHALQISEWISDQRNYDLSPGDIAIRLDLISKVR 120

Query: 122 GLEQAEKYFSSIGESSRDHKVYGALLHCYVENKNLKKAEAIMQKMREVGFMKTPLXXXXX 181
           GLE+AEKYF +I  +SR ++VYGALL+CY  NK+L KAEA +QKMRE+G +KT L XXXX
Sbjct: 121 GLEEAEKYFDTIPNTSRVYQVYGALLNCYAGNKSLDKAEATLQKMRELGLLKTSLSXXXX 180

Query: 182 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPL 241
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDP+
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPV 240

Query: 242 VATDWHTYFVVGNGYFKAGLSENSILMLKKAEQFIGDKQKWLAYQYLMTLYAAIGNKDEV 301
           +  DW+ Y    NGY KAGL E +  +LKKAEQ +    + +AY+  +TLY AI  KDEV
Sbjct: 241 ITIDWNAYVAAANGYLKAGLHEKTFEILKKAEQLVSSSARKVAYEIFLTLYTAIHKKDEV 300

Query: 302 YRVWNLYTNLRKRFNSGYLCIISSLMKLDDIDGAERILKEWESGDTSFDFRIPNMMINSY 361
           YR+WN Y N+ K +NSGYLC+ISSL+KLDDIDGAE+IL+EWESG+  FD R+PN+MI +Y
Sbjct: 301 YRIWNSYKNMEKFYNSGYLCMISSLLKLDDIDGAEKILEEWESGNKFFDIRVPNVMITAY 360

Query: 362 CMKGFVDKAEAYINRLIETGKEPEANTWDLLASGYHSNGLTNKVAETLKKAISVSPPHWK 421
           C KG  +KAEAYINRLIE GKE  ++TW  LA+GYH +G   K  ET+KK I  S P WK
Sbjct: 361 CKKGLFEKAEAYINRLIECGKETNSSTWGRLATGYHEHGQMEKAVETMKKTILASQPGWK 420

Query: 422 PKYHILAACLEYLKTNENVDLAEEIIGLLCKRDIFPLNICKRLEDYIRSENQNS 473
                LAACL+YLK    V++AEEI+ LL K  +F   I +RL +YI +EN +S
Sbjct: 421 LNRLTLAACLDYLKGKGEVEVAEEILRLLKKHSLFSTGIHERLLNYIHNENPDS 473

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
KGN43610.1      8.2e-243  98.78         hypothetical protein Csa_7G047450 [Cucumis sativus]
XP_011659707.1  3.7e-235  95.94         PREDICTED: putative pentatricopeptide repeat-containing protein At1g74580 [Cucum...
KGN43609.1      2.3e-221  93.00         hypothetical protein Csa_7G047440 [Cucumis sativus]
XP_016901673.1  1.7e-203  85.83         PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
XP_022147816.1  4.6e-169  74.90         pentatricopeptide repeat-containing protein At2g20710, mitochondrial [Momordica ...

Match Name      E-value   Identity (%)  Description
AT2G20710.1     5.2e-90   38.58         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1     3.8e-56   42.17         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27460.1     4.3e-52   28.50         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21705.1     2.1e-51   27.65         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G28020.1     3.5e-46   33.83         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name             E-value   Identity (%)  Description
sp|Q9SKU6|PP166_ARATH  9.4e-89   38.58         Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
sp|Q8LPS6|PPR3_ARATH   6.8e-55   42.17         Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX...
sp|Q3E911|PP400_ARATH  7.8e-51   28.50         Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX...
sp|Q84JR3|PP334_ARATH  3.8e-50   27.65         Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
sp|Q9C7F1|PPR61_ARATH  6.4e-45   33.83         Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis th...

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0K1S9|A0A0A0K1S9_CUCSA  5.4e-243  98.78         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G047450 PE=4 SV=1
tr|A0A0A0K3R7|A0A0A0K3R7_CUCSA  1.5e-221  93.00         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G047440 PE=4 SV=1
tr|A0A1S4E0B6|A0A1S4E0B6_CUCME  1.1e-203  85.83         pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cuc...
tr|A0A2P4JTJ4|A0A2P4JTJ4_QUESU  1.4e-121  61.65         Pentatricopeptide repeat-containing protein, mitochondrial OS=Quercus suber OX=5...
tr|A0A2N9HF60|A0A2N9HF60_FAGSY  1.0e-116  60.34         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS38096 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy7G004170.1  CsGy7G004170.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                    Source       Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                  coord: 352..376; e-value: 0.0022; score: 18.0
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                  coord: 139..168; e-value: 0.0012; score: 18.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            coord: 174..206; e-value: 3.9E-7; score: 27.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            coord: 139..168; e-value: 3.2E-5; score: 21.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            coord: 352..381; e-value: 2.0E-4; score: 19.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                coord: 174..218; e-value: 8.8E-10; score: 38.5
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 136..170; score: 9.668
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 171..205; score: 11.498
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 347..381; score: 9.164
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 312..346; score: 5.009
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 31..66; score: 5.196
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 206..240; score: 8.155
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10  -                    coord: 38..168; e-value: 5.7E-12; score: 47.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10  -                    coord: 169..279; e-value: 8.6E-17; score: 63.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10  -                    coord: 313..443; e-value: 2.3E-11; score: 45.8
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  SSF48452          TPR-like             coord: 243..417
None       No IPR available                                   PANTHER      PTHR24015         FAMILY NOT NAMED     coord: 18..464
None       No IPR available                                   PANTHER      PTHR24015:SF644   SUBFAMILY NOT NAMED  coord: 18..464
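
The `coord:` values are 1-based, inclusive residue ranges on the protein. As a rough sketch of how they can be read, the snippet below takes the six PROSITE PS51375 hits listed above and merges directly adjacent ranges to show which PPR motifs sit back to back. The function names `parse_coords` and `merge_adjacent` are hypothetical and not part of InterProScan or any other tool.

```python
import re

# PROSITE PS51375 (PPR) hits copied from the annotation table above.
PROSITE_HITS = """
coord: 136..170
coord: 171..205
coord: 347..381
coord: 312..346
coord: 31..66
coord: 206..240
"""

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract (start, end) residue pairs from 'coord: a..b' entries."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def merge_adjacent(intervals):
    """Merge ranges that overlap or touch (next start <= previous end + 1)."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


hits = parse_coords(PROSITE_HITS)
# Inclusive coordinates: a motif spanning a..b covers b - a + 1 residues.
lengths = [end - start + 1 for start, end in sorted(hits)]
arrays = merge_adjacent(hits)
print(lengths)
print(arrays)
```

Under this reading, the PROSITE hits form one isolated motif near the N-terminus and two tandem arrays, at residues 136..240 and 312..381.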

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                      Block
CsGy7G004170  Cucumber (Gy14) v2            cgybcgybB041
CsGy7G004170  Bottle gourd (USVL1VR-Ls)     cgyblsiB457
CsGy7G004170  Cucumber (Gy14) v2            cgybcgybB153
CsGy7G004170  Cucumber (Gy14) v2            cgybcgybB156
CsGy7G004170  Cucurbita maxima (Rimu)       cgybcmaB896
CsGy7G004170  Cucurbita maxima (Rimu)       cgybcmaB909
CsGy7G004170  Cucurbita maxima (Rimu)       cgybcmaB944
CsGy7G004170  Cucurbita maxima (Rimu)       cgybcmaB955
CsGy7G004170  Cucurbita maxima (Rimu)       cgybcmaB960
CsGy7G004170  Cucurbita moschata (Rifu)     cgybcmoB877
CsGy7G004170  Cucurbita moschata (Rifu)     cgybcmoB892
CsGy7G004170  Cucurbita moschata (Rifu)     cgybcmoB927
CsGy7G004170  Cucurbita moschata (Rifu)     cgybcmoB943
CsGy7G004170  Cucurbita pepo (Zucchini)     cgybcpeB910
CsGy7G004170  Cucurbita pepo (Zucchini)     cgybcpeB944
CsGy7G004170  Cucurbita pepo (Zucchini)     cgybcpeB964
CsGy7G004170  Cucurbita pepo (Zucchini)     cgybcpeB973
CsGy7G004170  Cucumber (Chinese Long) v2    cgybcuB324
CsGy7G004170  Bottle gourd (USVL1VR-Ls)     cgyblsiB462
CsGy7G004170  Melon (DHL92) v3.5.1          cgybmeB468
CsGy7G004170  Melon (DHL92) v3.6.1          cgybmedB454
CsGy7G004170  Melon (DHL92) v3.6.1          cgybmedB465
CsGy7G004170  Watermelon (Charleston Gray)  cgybwcgB497
CsGy7G004170  Watermelon (Charleston Gray)  cgybwcgB499
CsGy7G004170  Watermelon (Charleston Gray)  cgybwcgB527
CsGy7G004170  Watermelon (97103) v1         cgybwmB546
CsGy7G004170  Watermelon (97103) v1         cgybwmB561
CsGy7G004170  Wild cucumber (PI 183967)     cgybcpiB330
CsGy7G004170  Wild cucumber (PI 183967)     cgybcpiB332
CsGy7G004170  Silver-seed gourd             carcgybB0032
CsGy7G004170  Silver-seed gourd             carcgybB0717
CsGy7G004170  Silver-seed gourd             carcgybB0864
CsGy7G004170  Cucumber (Chinese Long) v3    cgybcucB346
CsGy7G004170  Cucumber (Chinese Long) v3    cgybcucB348
CsGy7G004170  Watermelon (97103) v2         cgybwmbB497
CsGy7G004170  Watermelon (97103) v2         cgybwmbB524
CsGy7G004170  Wax gourd                     cgybwgoB633
CsGy7G004170  Wax gourd                     cgybwgoB655