CsGy1G001040 (gene) Cucumber (Gy14) v2

Name: CsGy1G001040
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat
Location: Chr1 : 589827 .. 591914 (-)
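
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this) and that the trailing sign in parentheses is the strand. The names `Location`, `parse_location`, and `span_length` are illustrative and not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr1 : 589827 .. 591914 (-)'."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return loc.end - loc.start + 1


loc = parse_location("Chr1 : 589827 .. 591914 (-)")
print(loc.chrom, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the span covers 2088 positions on the minus strand of Chr1.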
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACA
AGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA

mRNA sequence

ATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA

Coding sequence (CDS)

ATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA

Protein sequence

MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
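
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code and a CDS that starts in frame. The function name `translate` is illustrative; a library such as Biopython could be used instead.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five and last four codons of the CDS shown above.
print(translate("ATGGCGTCGATAGTC"))
print(translate("GACTATTGGTGA"))
```

The two fragments translate to MASIV and DYW, matching the start and end of the listed protein.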
BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_004138266.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Cucumis sativus] >KGN63534.1 hypothetical protein Csa_1G003520 [Cucumis sativus])

HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 565/695 (81.29%), Positives = 565/695 (81.29%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60
           MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL
Sbjct: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60

Query: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120
           ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD
Sbjct: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120

Query: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180
           KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD
Sbjct: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180

Query: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240
           GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI
Sbjct: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240

Query: 241 GEL--------------------------------------------------------- 300
           GEL                                                         
Sbjct: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMXXXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360

Query: 361 -------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420
                        IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV
Sbjct: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420

Query: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480
           KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY
Sbjct: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480

Query: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540
           QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG
Sbjct: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540

Query: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 566
           RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Sbjct: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600
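
The percentages on each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch that recomputes them from such a line. The function name `parse_hsp_stats` and the returned keys are illustrative, and the pattern is an assumption about this page's layout rather than a general BLAST parser.

```python
import re

# Matches the counts on an HSP summary line; the optional "i" tolerates
# a common misspelling of the "Positives" label.
HSP_PATTERN = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Return identity and positives percentages for one HSP summary line."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return {
        "alignment_length": ident_len,
        "identity_pct": round(100 * identical / ident_len, 2),
        "positives_pct": round(100 * positive / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 565/695 (81.29%), Positives = 565/695 (81.29%), Query Frame = 0"
)
print(stats)
```

For the first hit, 565 identical positions over an alignment length of 695 give 81.29%.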

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_016903201.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1002.3 bits (2590), Expect = 6.7e-289
Identity = 528/698 (75.64%), Positives = 538/698 (77.08%), Query Frame = 0

Query: 1   MASIVGCLPNIS---LTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  S              KSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSXXXXXXXXXXXXXXKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGEL------------------------------------------------------ 300
           LE+GEL                                                      
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLITSLIDMYAKCGRIDTARKLFNEMDKRDVVAWXXXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 ----------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
                           IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 566
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_022921781.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 959.9 bits (2480), Expect = 3.8e-276
Identity = 502/699 (71.82%), Positives = 533/699 (76.25%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIV CLPNIS+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I E
Sbjct: 1   MASIVACLPNISVTSITHLSQFPENPKSLILQRCKTPKDLRQVHAHLLKTRRLQDPTIAE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLAFK+SP NA+LLFKKMHE SV
Sbjct: 61  AVLESAALLLPNSIDYALSIFNHMDKPESSAYNVMIRGLAFKQSPHNAVLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFK NEFVENTLI MYANCGQ+GVAR 
Sbjct: 121 QHDKFTFSSVLKACSRMRALREGEQVHALILKSGFKPNEFVENTLIHMYANCGQVGVARQ 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGM +R+ VAWNSMLSGYTKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+
Sbjct: 181 VFDGMSQRATVAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLAD 240

Query: 241 LEIGEL------------------------------------------------------ 300
           LE+GEL                                                      
Sbjct: 241 LELGELIGEYILSKGIRRNSTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXXXXAKVDANEVTMVSVLYSCAVLGAYETGKWVHSYIKRKKMKLT 360

Query: 361 ----------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
                           IDRSVEVF+ M F NVFTWTALIQGLANNGEGKMAL+FF+ M E
Sbjct: 361 VSLGTQLIDFYAKCGYIDRSVEVFRAMPFANVFTWTALIQGLANNGEGKMALDFFALMRE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           N+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LE
Sbjct: 421 NNVKPNDVTFIAVLSACSHACLVDQGRHLFNSMRRGFDIEPRIEHYGCMVDILGRAGLLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFI NMP PPNAVVWRTLLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYA
Sbjct: 481 EAYQFIANMPIPPNAVVWRTLLASCKAHKNVEMAEKSFDHITLLEPAHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 566
           LVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Sbjct: 541 LVGRVEDALRVRSLIKDKEIKKTPGCSLIELDGVVHEFFSEDGDHTHSKEIHDALDEMMK 600

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_023516242.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 952.2 bits (2460), Expect = 7.9e-274
Identity = 498/699 (71.24%), Positives = 533/699 (76.25%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIV CLPNIS+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I E
Sbjct: 1   MASIVACLPNISVTSITHLSQFPENPKSLILQKCKTPKDLRQVHAHLLKTRRLQDPTIAE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLAFK+S  NA+LLFKKMHE SV
Sbjct: 61  AVLESAALLLPNSIDYALSIFNHMDKPESSAYNVMIRGLAFKQSSHNAVLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ALREGEQVHALILK GFKSNEFVENTLI MYANCGQ+GVAR 
Sbjct: 121 QHDKFTFSSVLKACSRMRALREGEQVHALILKFGFKSNEFVENTLIHMYANCGQVGVARQ 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGM ER+ VAWNSMLSGYTKNGLWDEVVKLFRK+LELRIEFDDVTMISVLMACGRLA+
Sbjct: 181 VFDGMSERATVAWNSMLSGYTKNGLWDEVVKLFRKMLELRIEFDDVTMISVLMACGRLAD 240

Query: 241 LEIGEL------------------------------------------------------ 300
           LE+GEL                                                      
Sbjct: 241 LELGELIGEYIVSKGLRRNSTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSXXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXDANEVTMVSVLYSCAVLGAYETGKWVHSYITRKKMKLT 360

Query: 361 ----------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
                           IDRSVEVF+ M F NVFTWTALIQGLANNGEGKMAL+FF+ M E
Sbjct: 361 VSLGTQLIDFYAKCGYIDRSVEVFRAMPFANVFTWTALIQGLANNGEGKMALDFFALMRE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           N+VKPNDVTFI VLSACSHACLVDQGRHLFNSMR+ FDIEPRIEHYGCMVDILGRAG LE
Sbjct: 421 NNVKPNDVTFIAVLSACSHACLVDQGRHLFNSMRKGFDIEPRIEHYGCMVDILGRAGLLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFI NMP PPNAVVWRTLLASC+AHK+++MAEKS +HIT LEPAHSGDYILLSNTYA
Sbjct: 481 EAYQFIVNMPIPPNAVVWRTLLASCKAHKSVQMAEKSFDHITLLEPAHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 566
           LVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Sbjct: 541 LVGRVEDALRVRSLIKDKEIKKTPGCSLIELDGVVHEFFSEDGDHTHSKEIHDALDEMMK 600

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_022987229.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 948.7 bits (2451), Expect = 8.8e-273
Identity = 495/699 (70.82%), Positives = 533/699 (76.25%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSI---TQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIV CLPN S+TSI   +QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I E
Sbjct: 1   MASIVVCLPNTSVTSINHLSQFPENPKSLILQKCKTPKDLRQVHAHLLKTRRLQDPTIAE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLAFK+SP NA+LLFKKMHE SV
Sbjct: 61  AVLESAALLLPNSIDYALSIFNHMDKPESSAYNVMIRGLAFKQSPHNAVLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFKSNEFVENTLI MYANCGQ+GVAR 
Sbjct: 121 QHDKFTFSSVLKACSRMRALREGEQVHALILKSGFKSNEFVENTLIHMYANCGQVGVARQ 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGM ER+ VAWNSMLSGYTKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+
Sbjct: 181 VFDGMSERATVAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLAD 240

Query: 241 LEIGEL------------------------------------------------------ 300
           LE+GEL                                                      
Sbjct: 241 LELGELIGEYIVSKGLRRNSTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAXXXXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXDANEVTMVSVLYSCAVLGAYETGKWVHSYIKRKKMRLT 360

Query: 361 ----------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
                           IDRSVEVF+ M+F NVFTWTALIQGLANNGEG+MAL+FF+ M E
Sbjct: 361 VSLGTQLIDFYAKCGYIDRSVEVFRAMAFANVFTWTALIQGLANNGEGEMALDFFALMRE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           N+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG L+
Sbjct: 421 NNVKPNDVTFIAVLSACSHACLVDQGRHLFNSMRRGFDIEPRIEHYGCMVDILGRAGLLQ 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFI NMP PPNAVVWRTLLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYA
Sbjct: 481 EAYQFIANMPIPPNAVVWRTLLASCKAHKNVEMAEKSFDHITLLEPAHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 566
           LVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Sbjct: 541 LVGRVEDALRVRSLIKDKEIKKTPGCSLIELDGVVHEFFSEDGDHTHSKEIHDALDEMMK 600

BLAST of CsGy1G001040 vs. TAIR10
Match: AT1G31920.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 434.1 bits (1115), Expect = 1.3e-121
Identity = 218/571 (38.18%), Positives = 344/571 (60.25%), Query Frame = 0

Query: 27  ILQQCKTPKDLQQVHAHLLKTRRLLDPII--TEAVLESAALLLPDTIDYALSIFNHIDKP 86
           +L++C    + +QVHA  +K           +  + + A     ++++YA SIF  ID P
Sbjct: 36  LLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASIFRGIDDP 95

Query: 87  ESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVH 146
            +  +N MIRG     S + AL  + +M ++  + D FT+  +LKAC+R+K++REG+Q+H
Sbjct: 96  CTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSIREGKQIH 155

Query: 147 ALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWD 206
             + K G +++ FV+N+LI MY  CG++ ++  VF+ +  ++  +W+SM+S     G+W 
Sbjct: 156 GQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSARAGMGMWS 215

Query: 207 EVVKLFRKIL-ELRIEFDDVTMISVLMACGRLANLEIG------------EL-------- 266
           E + LFR +  E  ++ ++  M+S L+AC     L +G            EL        
Sbjct: 216 ECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISELNIIVQTSL 275

Query: 267 ---------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPND 326
                    +D+++ +F++M  +N  T++A+I GLA +GEG+ AL  FS M++  ++P+ 
Sbjct: 276 VDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESALRMFSKMIKEGLEPDH 335

Query: 327 VTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID 386
           V ++ VL+ACSH+ LV +GR +F  M ++  +EP  EHYGC+VD+LGRAG LEEA + I 
Sbjct: 336 VVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLLEEALETIQ 395

Query: 387 NMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVED 446
           ++P   N V+WRT L+ CR  +NIE+ + + + + +L   + GDY+L+SN Y+     +D
Sbjct: 396 SIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLYSQGQMWDD 455

Query: 447 AIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGY 506
             R R+ I  K +K+ PG S++EL G  H F S+D  H   KEI+  L +M  Q+K  GY
Sbjct: 456 VARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQMEWQLKFEGY 515

Query: 507 VPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKF 566
            P+     L  +EE K+  +  HS+K+AIA+GL+ T P + I+I++NLRMC DCH  TK 
Sbjct: 516 SPDLTQILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRMCSDCHTYTKK 575

BLAST of CsGy1G001040 vs. TAIR10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 426.0 bits (1094), Expect = 3.6e-119
Identity = 241/626 (38.50%), Positives = 351/626 (56.07%), Query Frame = 0

Query: 22  NPKSLI--LQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAAL--LLPDTIDYALSI 81
           +P SL   +  C+T +DL Q+HA  +K+ ++ D +    +L   A   L    +DYA  I
Sbjct: 22  HPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKI 81

Query: 82  FNHIDKPESSAYNVMIRGLAFKRSPDNAL----LLFKKMHEKSVQHDKFTFSSVLKACSR 141
           FN + +    ++N +IRG + +   D AL    L ++ M ++ V+ ++FTF SVLKAC++
Sbjct: 82  FNQMPQRNCFSWNTIIRGFS-ESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAK 141

Query: 142 MKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDG---------MPE 201
              ++EG+Q+H L LK GF  +EFV + L++MY  CG +  AR +F           M +
Sbjct: 142 TGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTD 201

Query: 202 R-----SIVAWNSMLSGYTKNGLWDEVVKLFRKILELR---------------------- 261
           R      IV WN M+ GY + G       LF                             
Sbjct: 202 RRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDXXXXXXXXXXXXXXXXXXXXXXXXXXXX 261

Query: 262 ---------IEFDDVTMISVLMACGRLANLEIGE-------------------------- 321
                    I  + VT++SVL A  RL +LE+GE                          
Sbjct: 262 XXXXXXXGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYS 321

Query: 322 ---LIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIG 381
              +I++++ VF+ +  +NV TW+A+I G A +G+   A++ F  M +  V+P+DV +I 
Sbjct: 322 KCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYIN 381

Query: 382 VLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFP 441
           +L+ACSH  LV++GR  F+ M     +EPRIEHYGCMVD+LGR+G L+EA +FI NMP  
Sbjct: 382 LLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIK 441

Query: 442 PNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVR 501
           P+ V+W+ LL +CR   N+EM ++    +  + P  SG Y+ LSN YA  G   +   +R
Sbjct: 442 PDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMR 501

Query: 502 SLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTD 561
             +KEK+I+K PGCSLI++DGV+HEF  ED  H  +KEI+  L ++  +++  GY P T 
Sbjct: 502 LRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITT 561

Query: 562 DARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVF 566
              L  EEE KE  + +HSEK+A A+GLI TSP   IRI KNLR+C DCH++ K IS+V+
Sbjct: 562 QVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVY 621

BLAST of CsGy1G001040 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 424.9 bits (1091), Expect = 8.0e-119
Identity = 196/481 (40.75%), Positives = 308/481 (64.03%), Query Frame = 0

Query: 116 SVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVA 175
           +V+ D+ T  +V+ AC++  ++  G QVH  I   GF SN  + N LI +Y+ CG++  A
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 176 RHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRL 235
             +F+ +P + +++WN+++ GYT   L+ E + LF+++L      +DVTM+S+L AC  L
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHL 380

Query: 236 ANLEIGEL-------------------------------IDRSVEVFKEMSFKNVFTWTA 295
             ++IG                                 I+ + +VF  +  K++ +W A
Sbjct: 381 GAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNA 440

Query: 296 LIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDF 355
           +I G A +G    + + FS M +  ++P+D+TF+G+LSACSH+ ++D GRH+F +M +D+
Sbjct: 441 MIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDY 500

Query: 356 DIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKS 415
            + P++EHYGCM+D+LG +G  +EA + I+ M   P+ V+W +LL +C+ H N+E+ E  
Sbjct: 501 KMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESF 560

Query: 416 LEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHE 475
            E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS IE+D VVHE
Sbjct: 561 AENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHE 620

Query: 476 FFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIA 535
           F   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ HHSEKLAIA
Sbjct: 621 FIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIA 680

Query: 536 YGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDY 566
           +GLI T P T + I KNLR+CR+CH ATK IS++++R II RDR RFHHF+DG+CSCNDY
Sbjct: 681 FGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of CsGy1G001040 vs. TAIR10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 423.3 bits (1087), Expect = 2.3e-118
Identity = 221/577 (38.30%), Positives = 339/577 (58.75%), Query Frame = 0

Query: 21  ENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNH 80
           +NP  L++ +C + ++L Q+ A+ +K+       + + +          ++ YA  +F  
Sbjct: 30  QNP-ILLISKCNSLRELMQIQAYAIKSHIEDVSFVAKLINFCTESPTESSMSYARHLFEA 89

Query: 81  IDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREG 140
           + +P+   +N M RG +   +P     LF ++ E  +  D +TF S+LKAC+  KAL EG
Sbjct: 90  MSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEG 149

Query: 141 EQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN 200
            Q+H L +K G   N +V  TLI MY  C  +  AR VFD + E  +V +N+M++GY + 
Sbjct: 150 RQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARR 209

Query: 201 GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIDRSVEVFKEMSFKNVF 260
              +E + LFR++    ++ +++T++SVL +C  L +L++G+ I +  +  K    K V 
Sbjct: 210 NRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAK--KHSFCKYVK 269

Query: 261 TWTALIQGLANNGEGKMALEFFSSMLENDV------------------------------ 320
             TALI   A  G    A+  F  M   D                               
Sbjct: 270 VNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSXXXXXXXXXXXXXXXXXXXXXXXXXX 329

Query: 321 -KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEA 380
            +P+++TF+G+L+ACSH   V++GR  F+ M   F I P I+HYG MVD+L RAG LE+A
Sbjct: 330 XQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDA 389

Query: 381 YQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALV 440
           Y+FID +P  P  ++WR LLA+C +H N+++AEK  E I  L+ +H GDY++LSN YA  
Sbjct: 390 YEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARN 449

Query: 441 GRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQI 500
            + E    +R ++K+++  K+PGCS IE++ VVHEFFS DG    + ++H ALD+M+K++
Sbjct: 450 KKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKEL 509

Query: 501 KRLGYVPNTD-DARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDC 560
           K  GYVP+T         ++ KE ++ +HSEKLAI +GL+ T P TTIR+ KNLR+CRDC
Sbjct: 510 KLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDC 569

Query: 561 HNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 566
           HNA K IS +F R +++RD  RFHHF+DG CSC D+W
Sbjct: 570 HNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CsGy1G001040 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 422.5 bits (1085), Expect = 4.0e-118
Identity = 226/565 (40.00%), Positives = 330/565 (58.41%), Query Frame = 0

Query: 37  LQQVHAHLLKTRRLLDPIITEAVLESAAL-LLPDTIDYALSIFNHIDKPESSAYNVMIRG 96
           L+Q+HA LL+T  + +  +    L   AL L+P  I+Y+  +F+    P  S  N MIR 
Sbjct: 27  LRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRLNPTLSHCNTMIRA 86

Query: 97  LAFKRSPDNALLLFKKM-HEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKS 156
            +  ++P     LF+ +    S+  +  + S  LK C +   L  G Q+H  I   GF S
Sbjct: 87  FSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGLQIHGKIFSDGFLS 146

Query: 157 NEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKI- 216
           +  +  TL+ +Y+ C     A  VFD +P+R  V+WN + S Y +N    +V+ LF K+ 
Sbjct: 147 DSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNKRTRDVLVLFDKMK 206

Query: 217 --LELRIEFDDVTMISVLMACGRLANLEIGELI--------------------------- 276
             ++  ++ D VT +  L AC  L  L+ G+ +                           
Sbjct: 207 NDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALNLSNTLVSMYSRCG 266

Query: 277 --DRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLS 336
             D++ +VF  M  +NV +WTALI GLA NG GK A+E F+ ML+  + P + T  G+LS
Sbjct: 267 SMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFGISPEEQTLTGLLS 326

Query: 337 ACSHACLVDQGRHLFNSMRR-DFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPN 396
           ACSH+ LV +G   F+ MR  +F I+P + HYGC+VD+LGRA  L++AY  I +M   P+
Sbjct: 327 ACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDKAYSLIKSMEMKPD 386

Query: 397 AVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSL 456
           + +WRTLL +CR H ++E+ E+ + H+  L+   +GDY+LL NTY+ VG+ E    +RSL
Sbjct: 387 STIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYSTVGKWEKVTELRSL 446

Query: 457 IKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPN-TDD 516
           +KEK I   PGCS IEL G VHEF  +D  H   +EI+  L ++ +Q+K  GYV   T +
Sbjct: 447 MKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQLKIAGYVAEITSE 506

Query: 517 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 566
                 EE K  ++ +HSEKLAIA+G++ T P TTIR++KNLR C DCHN  KF+S V++
Sbjct: 507 LHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVDCHNFAKFVSDVYD 566

BLAST of CsGy1G001040 vs. Swiss-Prot
Match: sp|Q9C6T2|PPR68_ARATH (Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H11 PE=2 SV=1)

HSP 1 Score: 434.1 bits (1115), Expect = 2.4e-120
Identity = 218/571 (38.18%), Positives = 344/571 (60.25%), Query Frame = 0

Query: 27  ILQQCKTPKDLQQVHAHLLKTRRLLDPII--TEAVLESAALLLPDTIDYALSIFNHIDKP 86
           +L++C    + +QVHA  +K           +  + + A     ++++YA SIF  ID P
Sbjct: 36  LLKRCHNIDEFKQVHARFIKLSLFYSSSFSASSVLAKCAHSGWENSMNYAASIFRGIDDP 95

Query: 87  ESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVH 146
            +  +N MIRG     S + AL  + +M ++  + D FT+  +LKAC+R+K++REG+Q+H
Sbjct: 96  CTFDFNTMIRGYVNVMSFEEALCFYNEMMQRGNEPDNFTYPCLLKACTRLKSIREGKQIH 155

Query: 147 ALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWD 206
             + K G +++ FV+N+LI MY  CG++ ++  VF+ +  ++  +W+SM+S     G+W 
Sbjct: 156 GQVFKLGLEADVFVQNSLINMYGRCGEMELSSAVFEKLESKTAASWSSMVSARAGMGMWS 215

Query: 207 EVVKLFRKIL-ELRIEFDDVTMISVLMACGRLANLEIG------------EL-------- 266
           E + LFR +  E  ++ ++  M+S L+AC     L +G            EL        
Sbjct: 216 ECLLLFRGMCSETNLKAEESGMVSALLACANTGALNLGMSIHGFLLRNISELNIIVQTSL 275

Query: 267 ---------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPND 326
                    +D+++ +F++M  +N  T++A+I GLA +GEG+ AL  FS M++  ++P+ 
Sbjct: 276 VDMYVKCGCLDKALHIFQKMEKRNNLTYSAMISGLALHGEGESALRMFSKMIKEGLEPDH 335

Query: 327 VTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFID 386
           V ++ VL+ACSH+ LV +GR +F  M ++  +EP  EHYGC+VD+LGRAG LEEA + I 
Sbjct: 336 VVYVSVLNACSHSGLVKEGRRVFAEMLKEGKVEPTAEHYGCLVDLLGRAGLLEEALETIQ 395

Query: 387 NMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVED 446
           ++P   N V+WRT L+ CR  +NIE+ + + + + +L   + GDY+L+SN Y+     +D
Sbjct: 396 SIPIEKNDVIWRTFLSQCRVRQNIELGQIAAQELLKLSSHNPGDYLLISNLYSQGQMWDD 455

Query: 447 AIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGY 506
             R R+ I  K +K+ PG S++EL G  H F S+D  H   KEI+  L +M  Q+K  GY
Sbjct: 456 VARTRTEIAIKGLKQTPGFSIVELKGKTHRFVSQDRSHPKCKEIYKMLHQMEWQLKFEGY 515

Query: 507 VPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKF 566
            P+     L  +EE K+  +  HS+K+AIA+GL+ T P + I+I++NLRMC DCH  TK 
Sbjct: 516 SPDLTQILLNVDEEEKKERLKGHSQKVAIAFGLLYTPPGSIIKIARNLRMCSDCHTYTKK 575

BLAST of CsGy1G001040 vs. Swiss-Prot
Match: sp|Q9FI80|PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 426.0 bits (1094), Expect = 6.5e-118
Identity = 241/626 (38.50%), Positives = 351/626 (56.07%), Query Frame = 0

Query: 22  NPKSLI--LQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAAL--LLPDTIDYALSI 81
           +P SL   +  C+T +DL Q+HA  +K+ ++ D +    +L   A   L    +DYA  I
Sbjct: 22  HPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKI 81

Query: 82  FNHIDKPESSAYNVMIRGLAFKRSPDNAL----LLFKKMHEKSVQHDKFTFSSVLKACSR 141
           FN + +    ++N +IRG + +   D AL    L ++ M ++ V+ ++FTF SVLKAC++
Sbjct: 82  FNQMPQRNCFSWNTIIRGFS-ESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAK 141

Query: 142 MKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDG---------MPE 201
              ++EG+Q+H L LK GF  +EFV + L++MY  CG +  AR +F           M +
Sbjct: 142 TGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTD 201

Query: 202 R-----SIVAWNSMLSGYTKNGLWDEVVKLFRKILELR---------------------- 261
           R      IV WN M+ GY + G       LF                             
Sbjct: 202 RRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDXXXXXXXXXXXXXXXXXXXXXXXXXXXX 261

Query: 262 ---------IEFDDVTMISVLMACGRLANLEIGE-------------------------- 321
                    I  + VT++SVL A  RL +LE+GE                          
Sbjct: 262 XXXXXXXGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYS 321

Query: 322 ---LIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIG 381
              +I++++ VF+ +  +NV TW+A+I G A +G+   A++ F  M +  V+P+DV +I 
Sbjct: 322 KCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYIN 381

Query: 382 VLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFP 441
           +L+ACSH  LV++GR  F+ M     +EPRIEHYGCMVD+LGR+G L+EA +FI NMP  
Sbjct: 382 LLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIK 441

Query: 442 PNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVR 501
           P+ V+W+ LL +CR   N+EM ++    +  + P  SG Y+ LSN YA  G   +   +R
Sbjct: 442 PDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMR 501

Query: 502 SLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTD 561
             +KEK+I+K PGCSLI++DGV+HEF  ED  H  +KEI+  L ++  +++  GY P T 
Sbjct: 502 LRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITT 561

Query: 562 DARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVF 566
              L  EEE KE  + +HSEK+A A+GLI TSP   IRI KNLR+C DCH++ K IS+V+
Sbjct: 562 QVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVY 621

BLAST of CsGy1G001040 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.4e-117
Identity = 196/481 (40.75%), Positives = 308/481 (64.03%), Query Frame = 0

Query: 116 SVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVA 175
           +V+ D+ T  +V+ AC++  ++  G QVH  I   GF SN  + N LI +Y+ CG++  A
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 176 RHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRL 235
             +F+ +P + +++WN+++ GYT   L+ E + LF+++L      +DVTM+S+L AC  L
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHL 380

Query: 236 ANLEIGEL-------------------------------IDRSVEVFKEMSFKNVFTWTA 295
             ++IG                                 I+ + +VF  +  K++ +W A
Sbjct: 381 GAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNA 440

Query: 296 LIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDF 355
           +I G A +G    + + FS M +  ++P+D+TF+G+LSACSH+ ++D GRH+F +M +D+
Sbjct: 441 MIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDY 500

Query: 356 DIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKS 415
            + P++EHYGCM+D+LG +G  +EA + I+ M   P+ V+W +LL +C+ H N+E+ E  
Sbjct: 501 KMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESF 560

Query: 416 LEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHE 475
            E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS IE+D VVHE
Sbjct: 561 AENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHE 620

Query: 476 FFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIA 535
           F   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ HHSEKLAIA
Sbjct: 621 FIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIA 680

Query: 536 YGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDY 566
           +GLI T P T + I KNLR+CR+CH ATK IS++++R II RDR RFHHF+DG+CSCNDY
Sbjct: 681 FGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of CsGy1G001040 vs. Swiss-Prot
Match: sp|Q8LK93|PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 423.3 bits (1087), Expect = 4.2e-117
Identity = 221/577 (38.30%), Positives = 339/577 (58.75%), Query Frame = 0

Query: 21  ENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNH 80
           +NP  L++ +C + ++L Q+ A+ +K+       + + +          ++ YA  +F  
Sbjct: 30  QNP-ILLISKCNSLRELMQIQAYAIKSHIEDVSFVAKLINFCTESPTESSMSYARHLFEA 89

Query: 81  IDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREG 140
           + +P+   +N M RG +   +P     LF ++ E  +  D +TF S+LKAC+  KAL EG
Sbjct: 90  MSEPDIVIFNSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEG 149

Query: 141 EQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN 200
            Q+H L +K G   N +V  TLI MY  C  +  AR VFD + E  +V +N+M++GY + 
Sbjct: 150 RQLHCLSMKLGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARR 209

Query: 201 GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIDRSVEVFKEMSFKNVF 260
              +E + LFR++    ++ +++T++SVL +C  L +L++G+ I +  +  K    K V 
Sbjct: 210 NRPNEALSLFREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAK--KHSFCKYVK 269

Query: 261 TWTALIQGLANNGEGKMALEFFSSMLENDV------------------------------ 320
             TALI   A  G    A+  F  M   D                               
Sbjct: 270 VNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWSXXXXXXXXXXXXXXXXXXXXXXXXXX 329

Query: 321 -KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEA 380
            +P+++TF+G+L+ACSH   V++GR  F+ M   F I P I+HYG MVD+L RAG LE+A
Sbjct: 330 XQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDA 389

Query: 381 YQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALV 440
           Y+FID +P  P  ++WR LLA+C +H N+++AEK  E I  L+ +H GDY++LSN YA  
Sbjct: 390 YEFIDKLPISPTPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARN 449

Query: 441 GRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQI 500
            + E    +R ++K+++  K+PGCS IE++ VVHEFFS DG    + ++H ALD+M+K++
Sbjct: 450 KKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKEL 509

Query: 501 KRLGYVPNTD-DARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDC 560
           K  GYVP+T         ++ KE ++ +HSEKLAI +GL+ T P TTIR+ KNLR+CRDC
Sbjct: 510 KLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDC 569

Query: 561 HNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 566
           HNA K IS +F R +++RD  RFHHF+DG CSC D+W
Sbjct: 570 HNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CsGy1G001040 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 422.5 bits (1085), Expect = 7.1e-117
Identity = 226/565 (40.00%), Positives = 330/565 (58.41%), Query Frame = 0

Query: 37  LQQVHAHLLKTRRLLDPIITEAVLESAAL-LLPDTIDYALSIFNHIDKPESSAYNVMIRG 96
           L+Q+HA LL+T  + +  +    L   AL L+P  I+Y+  +F+    P  S  N MIR 
Sbjct: 27  LRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRLNPTLSHCNTMIRA 86

Query: 97  LAFKRSPDNALLLFKKM-HEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKS 156
            +  ++P     LF+ +    S+  +  + S  LK C +   L  G Q+H  I   GF S
Sbjct: 87  FSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGLQIHGKIFSDGFLS 146

Query: 157 NEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKI- 216
           +  +  TL+ +Y+ C     A  VFD +P+R  V+WN + S Y +N    +V+ LF K+ 
Sbjct: 147 DSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNKRTRDVLVLFDKMK 206

Query: 217 --LELRIEFDDVTMISVLMACGRLANLEIGELI--------------------------- 276
             ++  ++ D VT +  L AC  L  L+ G+ +                           
Sbjct: 207 NDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALNLSNTLVSMYSRCG 266

Query: 277 --DRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLS 336
             D++ +VF  M  +NV +WTALI GLA NG GK A+E F+ ML+  + P + T  G+LS
Sbjct: 267 SMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFGISPEEQTLTGLLS 326

Query: 337 ACSHACLVDQGRHLFNSMRR-DFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPN 396
           ACSH+ LV +G   F+ MR  +F I+P + HYGC+VD+LGRA  L++AY  I +M   P+
Sbjct: 327 ACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDKAYSLIKSMEMKPD 386

Query: 397 AVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSL 456
           + +WRTLL +CR H ++E+ E+ + H+  L+   +GDY+LL NTY+ VG+ E    +RSL
Sbjct: 387 STIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYSTVGKWEKVTELRSL 446

Query: 457 IKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPN-TDD 516
           +KEK I   PGCS IEL G VHEF  +D  H   +EI+  L ++ +Q+K  GYV   T +
Sbjct: 447 MKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQLKIAGYVAEITSE 506

Query: 517 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 566
                 EE K  ++ +HSEKLAIA+G++ T P TTIR++KNLR C DCHN  KF+S V++
Sbjct: 507 LHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVDCHNFAKFVSDVYD 566

BLAST of CsGy1G001040 vs. TrEMBL
Match: tr|A0A0A0LRD6|A0A0A0LRD6_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G003520 PE=4 SV=1)

HSP 1 Score: 1083.9 bits (2802), Expect = 0.0e+00
Identity = 565/695 (81.29%), Positives = 565/695 (81.29%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60
           MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL
Sbjct: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60

Query: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120
           ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD
Sbjct: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120

Query: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180
           KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD
Sbjct: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180

Query: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240
           GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI
Sbjct: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240

Query: 241 GEL--------------------------------------------------------- 300
           GEL                                                         
Sbjct: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMXXXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360

Query: 361 -------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420
                        IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV
Sbjct: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420

Query: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480
           KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY
Sbjct: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480

Query: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540
           QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG
Sbjct: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540

Query: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 566
           RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Sbjct: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600

BLAST of CsGy1G001040 vs. TrEMBL
Match: tr|A0A1S4E4P2|A0A1S4E4P2_CUCME (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103502371 PE=4 SV=1)

HSP 1 Score: 1002.3 bits (2590), Expect = 4.4e-289
Identity = 528/698 (75.64%), Positives = 538/698 (77.08%), Query Frame = 0

Query: 1   MASIVGCLPNIS---LTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  S              KSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSXXXXXXXXXXXXXXKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGEL------------------------------------------------------ 300
           LE+GEL                                                      
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLITSLIDMYAKCGRIDTARKLFNEMDKRDVVAWXXXXXX 300

Query: 301 ------------------------------------------------------------ 360
                                                                       
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 ----------------IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
                           IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 566
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

BLAST of CsGy1G001040 vs. TrEMBL
Match: tr|A0A2N9IX34|A0A2N9IX34_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56483 PE=4 SV=1)

HSP 1 Score: 763.1 bits (1969), Expect = 4.5e-217
Identity = 391/687 (56.91%), Positives = 469/687 (68.27%), Query Frame = 0

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  S+T+I+QFPENPK+LILQ+CKT KDL Q+HAHL+KTR + +P +TE +LESAA+LLP
Sbjct: 12  PITSITTISQFPENPKTLILQKCKTSKDLNQIHAHLIKTRLIHNPAVTENLLESAAILLP 71

Query: 69  DTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVL 128
           +T+DYALSIF  I +P+SSAYNVMIRGL +K+SP  A+ LFKKM E SV+ DKFTF  VL
Sbjct: 72  NTMDYALSIFRDIGEPDSSAYNVMIRGLTYKQSPHEAIFLFKKMLEASVEPDKFTFPCVL 131

Query: 129 KACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIV 188
           KACSR++ L EGEQVH  I+K G+ S+ FVENTLI MYANCG++ VAR VFDGM ER +V
Sbjct: 132 KACSRLRTLSEGEQVHGHIVKCGYGSSGFVENTLIHMYANCGRVEVARRVFDGMSERGVV 191

Query: 189 AWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELI---- 248
           AWNSM SGYT++G W+EVVKLFR++ EL + FD+VTMISVL +CGRLA+LE+G+ I    
Sbjct: 192 AWNSMFSGYTRSGCWEEVVKLFRRMRELGVGFDEVTMISVLTSCGRLADLELGQWISEYI 251

Query: 249 ------------------------------------------------------------ 308
                                                                       
Sbjct: 252 EENGIKKSLALMTSLVDMYAKCGKVDTARRLFDQMDQRDVVAWSAMISGYCQANRCREAL 311

Query: 309 ------------------------------------------------------------ 368
                                                                       
Sbjct: 312 DLFNEMQKASVEPNEVTMVSVLYACAVLGALETGKWVHFYIKKKKLKLTVTLGTALIDFY 371

Query: 369 ------DRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFI 428
                 D S+E+F +M+ KNVFTWTALIQGLA+NG+GK ALEFF  M E +++PNDVTFI
Sbjct: 372 AKCGSVDSSIEIFTKMTSKNVFTWTALIQGLASNGQGKRALEFFYLMREKNIEPNDVTFI 431

Query: 429 GVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPF 488
           G L+ACSHA LVD+GR LF SM +DF IEPRIEHYGCMVDILGRAG +EEAYQFI NMP 
Sbjct: 432 GALAACSHAGLVDEGRSLFVSMSKDFSIEPRIEHYGCMVDILGRAGLIEEAYQFIKNMPI 491

Query: 489 PPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRV 548
            PNAVVWRTLLASC+AHKN+++ E+S  HITRLEP HSGD ILLSN YALVGR EDA++V
Sbjct: 492 QPNAVVWRTLLASCKAHKNVKIGEESFRHITRLEPPHSGDCILLSNIYALVGRCEDALKV 551

Query: 549 RSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNT 566
           R  ++E+ IKK PGCSLIEL+GVVHEF SED  H H KEI+  ++ M+K+IK  GYVPN 
Sbjct: 552 RCQMREERIKKTPGCSLIELEGVVHEFLSEDDGHAHLKEIYGGVEDMIKRIKSAGYVPNP 611

BLAST of CsGy1G001040 vs. TrEMBL
Match: tr|F6GTR8|F6GTR8_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_17s0000g09300 PE=4 SV=1)

HSP 1 Score: 748.4 bits (1931), Expect = 1.1e-212
Identity = 381/682 (55.87%), Positives = 464/682 (68.04%), Query Frame = 0

Query: 14  TSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDY 73
           TSI+ FPENPK+LIL+QCKT +DL ++HAHL+KTR LL P + E +LESAA+LLP ++DY
Sbjct: 17  TSISLFPENPKTLILEQCKTIRDLNEIHAHLIKTRLLLKPKVAENLLESAAILLPTSMDY 76

Query: 74  ALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSR 133
           A+SIF  ID+P+S AYN+MIRG   K+SP  A+LLFK+MHE SVQ D+FTF  +LK CSR
Sbjct: 77  AVSIFRQIDEPDSPAYNIMIRGFTLKQSPHEAILLFKEMHENSVQPDEFTFPCILKVCSR 136

Query: 134 MKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSM 193
           ++AL EGEQ+HALI+K GF S+ FV+NTLI MYANCG++ VAR VFD M ER++  WNSM
Sbjct: 137 LQALSEGEQIHALIMKCGFGSHGFVKNTLIHMYANCGEVEVARRVFDEMSERNVRTWNSM 196

Query: 194 LSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIDR------- 253
            +GYTK+G W+EVVKLF ++LEL I FD+VT++SVL ACGRLA+LE+GE I+R       
Sbjct: 197 FAGYTKSGNWEEVVKLFHEMLELDIRFDEVTLVSVLTACGRLADLELGEWINRYVEEKGL 256

Query: 254 ------------------------------------------------------------ 313
                                                                       
Sbjct: 257 KGNPTLITSLVDMYAKCGQVDTARRLFDQMDRRDVVAWSAMISGYSQASRCREALDLFHE 316

Query: 314 ------------------------------------------------------------ 373
                                                                       
Sbjct: 317 MQKANIDPNEITMVSILSSCAVLGALETGKWVHFFIKKKRMKLTVTLGTALMDFYAKCGS 376

Query: 374 ---SVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSA 433
              S+EVF +M  KNV +WT LIQGLA+NG+GK ALE+F  MLE +V+PNDVTFIGVLSA
Sbjct: 377 VESSIEVFGKMPVKNVLSWTVLIQGLASNGQGKKALEYFYLMLEKNVEPNDVTFIGVLSA 436

Query: 434 CSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV 493
           CSHA LVD+GR LF SM RDF IEPRIEHYGCMVDILGRAG +EEA+QFI NMP  PNAV
Sbjct: 437 CSHAGLVDEGRDLFVSMSRDFGIEPRIEHYGCMVDILGRAGLIEEAFQFIKNMPIQPNAV 496

Query: 494 VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIK 553
           +WRTLLASC+ HKN+E+ E+SL+ +  LEP HSGDYILLSN YA VGR EDA++VR  +K
Sbjct: 497 IWRTLLASCKVHKNVEIGEESLKQLIILEPTHSGDYILLSNIYASVGRWEDALKVRGEMK 556

Query: 554 EKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARL 566
           EK IKK PGCSLIELDGV+HEFF+ED  H  S+EI++A++ MMKQIK  GYVPNT +ARL
Sbjct: 557 EKGIKKTPGCSLIELDGVIHEFFAEDNVHSQSEEIYNAIEDMMKQIKSAGYVPNTAEARL 616

BLAST of CsGy1G001040 vs. TrEMBL
Match: tr|M5W9L5|M5W9L5_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G165600 PE=4 SV=1)

HSP 1 Score: 741.1 bits (1912), Expect = 1.8e-210
Identity = 385/687 (56.04%), Positives = 461/687 (67.10%), Query Frame = 0

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  ++T+I QFP NPK+LILQQCKT +DL QVHAHL+KTR LL+P ITE +LESAA+LLP
Sbjct: 13  PLTAITTIPQFPHNPKTLILQQCKTTRDLNQVHAHLIKTRLLLNPTITENLLESAAILLP 72

Query: 69  DTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVL 128
           + +DYALSIF+++D+P++  YN+MIR L +K SP  A LLFKKM E S + D+FT SS+L
Sbjct: 73  NAMDYALSIFHNLDEPDTLVYNIMIRSLTYKLSPLEAFLLFKKMQESSAEPDEFTLSSIL 132

Query: 129 KACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIV 188
           KACS+++ALREGEQ+HA I+K GFKSN FVENTLI MYA CG++ VAR VFDG+PER+ +
Sbjct: 133 KACSKLRALREGEQIHAHIVKCGFKSNGFVENTLIHMYATCGELEVARRVFDGLPERARM 192

Query: 189 AWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGE------ 248
           AWNSML+GY KN  WDEVVKLF ++L+L + FD+VT+ SVL ACGRLANLE+GE      
Sbjct: 193 AWNSMLAGYMKNKCWDEVVKLFHEMLKLGVGFDEVTLTSVLTACGRLANLELGEWIGDYI 252

Query: 249 ------------------------------------------------------------ 308
                                                                       
Sbjct: 253 EANRLKGNIALVTSLVDMYAKCGQVETARRFFDRMDRRDVVAWSAMISGYSQANRCREAL 312

Query: 309 ------------------------------------------------------------ 368
                                                                       
Sbjct: 313 DLFHDMQKANVDPNEVTMVSVLYSCAVLGALKTGKWVEFYIKKEKLKLTVNLGTALIDFY 372

Query: 369 ----LIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFI 428
                ID S+EVF  M   NVF+WTALIQGLA+NG+GK ALE+F  M E ++KPN+VTFI
Sbjct: 373 AKCGCIDSSIEVFNRMPSTNVFSWTALIQGLASNGQGKGALEYFQLMQEKNIKPNNVTFI 432

Query: 429 GVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPF 488
            VLSACSHA LV++GR+LF SM +DF IEPRIEHYG MVDILGRAG +EEAYQFI NMP 
Sbjct: 433 AVLSACSHAGLVNEGRNLFTSMIKDFGIEPRIEHYGSMVDILGRAGLIEEAYQFIKNMPI 492

Query: 489 PPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRV 548
            PNAVVWRTLLASCRAHKN+E+ E+SL+HI  LE  HSGDYILLSN YA V R EDAIRV
Sbjct: 493 QPNAVVWRTLLASCRAHKNVEIGEESLKHIISLETPHSGDYILLSNIYASVDRREDAIRV 552

Query: 549 RSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNT 566
           R  ++EK I+K PGCSLIELDGV++EFF+ED    H +E+++A   MMK+IK  GYVP T
Sbjct: 553 RDQMREKGIEKAPGCSLIELDGVIYEFFAEDKACPHLEEVYNATHDMMKRIKEAGYVPYT 612
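
The statistics line in each HSP header above reports identities and positives as a fraction of the alignment length, followed by the same value as a percentage. The following is a minimal Python sketch, not part of the database output, showing how such a line could be parsed and the percentage re-derived. The names `HSP_STATS`, `parse_hsp_stats` and `percent` are hypothetical and introduced only for illustration, and the pattern assumes the exact header layout shown on this page.

```python
import re

# Pattern for the statistics line of an HSP header as printed on this page.
# "Posi?tives" accepts both "Positives" and the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(matches, length):
    """Re-derive a percentage from a matches/length fraction (two decimals)."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 218/571 (38.18%), Positives = 344/571 (60.25%), Query Frame = 0"
    )
    print(stats)
    print(percent(stats["identities"], stats["alignment_length"]))
```

For the first Swiss-Prot hit, 218 identical positions over an alignment length of 571 gives 218 / 571 ≈ 0.3818, which is the 38.18% printed in the header and in the Identity column of the summary tables.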

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_004138266.1 | 0.0e+00 | 81.29 | PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Cucumis s...
XP_016903201.1 | 6.7e-289 | 75.64 | PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-...
XP_022921781.1 | 3.8e-276 | 71.82 | pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur...
XP_023516242.1 | 7.9e-274 | 71.24 | pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor...
XP_022987229.1 | 8.8e-273 | 70.82 | pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur...

Match Name | E-value | Identity (%) | Description
AT1G31920.1 | 1.3e-121 | 38.18 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48910.1 | 3.6e-119 | 38.50 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 8.0e-119 | 40.75 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1 | 2.3e-118 | 38.30 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G47530.1 | 4.0e-118 | 40.00 | Pentatricopeptide repeat (PPR) superfamily protein

Match Name | E-value | Identity (%) | Description
sp|Q9C6T2|PPR68_ARATH | 2.4e-120 | 38.18 | Pentatricopeptide repeat-containing protein At1g31920 OS=Arabidopsis thaliana OX...
sp|Q9FI80|PP425_ARATH | 6.5e-118 | 38.50 | Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...
sp|Q9LN01|PPR21_ARATH | 1.4e-117 | 40.75 | Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|Q8LK93|PP145_ARATH | 4.2e-117 | 38.30 | Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...
sp|Q9SN85|PP267_ARATH | 7.1e-117 | 40.00 | Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
tr|A0A0A0LRD6|A0A0A0LRD6_CUCSA | 0.0e+00 | 81.29 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G003520 PE=4 SV=1
tr|A0A1S4E4P2|A0A1S4E4P2_CUCME | 4.4e-289 | 75.64 | pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cuc...
tr|A0A2N9IX34|A0A2N9IX34_FAGSY | 4.5e-217 | 56.91 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56483 PE=4 SV=1
tr|F6GTR8|F6GTR8_VITVI | 1.1e-212 | 55.87 | Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_17s0000g09300 PE=4 SV=...
tr|M5W9L5|M5W9L5_PRUPE | 1.8e-210 | 56.04 | Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G165600 PE=4 SV=1
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0008270 | zinc ion binding
GO:0005515 | protein binding
Vocabulary: INTERPRO
Term | Definition
IPR032867 | DYW_dom
IPR002885 | Pentatricopeptide_repeat
IPR011990 | TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009451 RNA modification
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsGy1G001040.1 | CsGy1G001040.1 | mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 |  | coord: 154..241, e-value: 3.6E-19, score: 70.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 |  | coord: 242..503, e-value: 2.1E-33, score: 118.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 |  | coord: 26..153, e-value: 3.6E-14, score: 55.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | SSF48452 | TPR-like | coord: 340..418
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 188..217, e-value: 6.5E-8, score: 32.2; coord: 160..186, e-value: 0.15, score: 12.3; coord: 332..356, e-value: 0.071, score: 13.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 188..221, e-value: 1.6E-6, score: 25.9; coord: 260..293, e-value: 2.4E-7, score: 28.5; coord: 88..118, e-value: 0.0016, score: 16.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 257..305, e-value: 8.7E-12, score: 44.9; coord: 85..132, e-value: 4.0E-10, score: 39.6
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 329..359, score: 6.445
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 186..220, score: 11.027
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 361..391, score: 5.886
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 395..429, score: 6.106
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 155..185, score: 7.794
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 120..154, score: 9.153
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 293..323, score: 6.873
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 258..292, score: 11.696
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 85..119, score: 9.087
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 431..555, e-value: 1.9E-39, score: 134.4
None | No IPR available | PANTHER | PTHR24015 | FAMILY NOT NAMED | coord: 244..482; coord: 46..172; coord: 172..242
None | No IPR available | PANTHER | PTHR24015:SF1650 | SUBFAMILY NOT NAMED | coord: 244..482
None | No IPR available | PANTHER | PTHR24015:SF1650 | SUBFAMILY NOT NAMED | coord: 46..172; coord: 172..242

The following gene(s) are paralogous to this gene:

None